Received: by 10.213.65.68 with SMTP id h4csp3662577imn; Tue, 10 Apr 2018 02:37:35 -0700 (PDT) X-Google-Smtp-Source: AIpwx4+OuG1Okd1yWxp9DgjYIyrefbx1gTtKa0opstj3JQYBdbER6Nc+pO4clONOWUbA5H3WN9JX X-Received: by 2002:a17:902:8ec5:: with SMTP id x5-v6mr279134plo.391.1523353055802; Tue, 10 Apr 2018 02:37:35 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1523353055; cv=none; d=google.com; s=arc-20160816; b=umU3LIojGuNkizh3/R03Kb1oqnJThhj/IEQobgdjpdBxfzvmH/wEZE9bB6z4gYHQkM KlcqdkNw5ISVQSUJQtdxZ1hszRR9hIGq7IfkzRqtCC9nzX/r4HisAACl7mHe8gEx8A1T d9WIA7umCzLHU1GMUvOtWMd/L7jovUCLWwRI6s9ks/Mbzl+KrA7swyIXzM1uNHja0CIt X1kn5SWLCZKKuC4LK3ddkapc555umMGVn6kfXshA6yOqy6i/s1o5Rvt8t3CUy4oB9NZh Sse/9VFCq98XdIg0scEDsnPvIKsWB0WzykqhsVQ5Je4xsIDSML1dkex91ro3RlVxQU6O xr3g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=D8DU3dpCLlk6FfMMUbZSqLCVz8gOLQZLI2GM/ys68Oo=; b=X3P4u/vFmlF5BRD+cKWsXfaYepGC9J9ARGDH/YiQMxC7BsLfiWcrqIs/cbMzeSlxr4 0B1ToSazJCuREQYh7VdjNoPLtgxrQZYZij1n9WzusFhGmkLskcptosWog3EQb1m96/89 5iNl0ovg7yLRoCnO3dP0Fxppv73pr7Te8qvD3wDIbBDnJl3h8tymwgUwmBU2KWU6SB3x 0qxUTLEVqQ5mxQg19mp34RQaeIzARSvShk4QZUEtY8dAwzpFHycd/ETK9q+5xbwOwVRw oHmh5LsN8JQk/kn+w0HN76AKGkjGzDE9kMA9XN8OdEdHZjO0n8/SkWj3F3Wsqqgcc9e2 692w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id d90-v6si2276474pld.515.2018.04.10.02.36.58; Tue, 10 Apr 2018 02:37:35 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752555AbeDJJcp (ORCPT + 99 others); Tue, 10 Apr 2018 05:32:45 -0400 Received: from mx2.suse.de ([195.135.220.15]:50606 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751770AbeDJJco (ORCPT ); Tue, 10 Apr 2018 05:32:44 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (charybdis-ext.suse.de [195.135.220.254]) by mx2.suse.de (Postfix) with ESMTP id 7B5BBAF5E; Tue, 10 Apr 2018 09:32:42 +0000 (UTC) Date: Tue, 10 Apr 2018 11:32:41 +0200 From: Michal Hocko To: Jan Kara Cc: Minchan Kim , Andrew Morton , linux-mm , LKML , Johannes Weiner , Chris Fries Subject: Re: [PATCH] mm: workingset: fix NULL ptr dereference Message-ID: <20180410093241.GA21835@dhcp22.suse.cz> References: <20180409015815.235943-1-minchan@kernel.org> <20180410082243.GW21835@dhcp22.suse.cz> <20180410085531.m2xvzi7nenbrgbve@quack2.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180410085531.m2xvzi7nenbrgbve@quack2.suse.cz> User-Agent: Mutt/1.9.4 (2018-02-28) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue 10-04-18 10:55:31, Jan Kara wrote: > On Tue 10-04-18 10:22:43, Michal Hocko wrote: > > On Mon 09-04-18 10:58:15, Minchan Kim wrote: > > > Recently, I got a report like below. > > > > > > [ 7858.792946] [] __list_del_entry+0x30/0xd0 > > > [ 7858.792951] [] list_lru_del+0xac/0x1ac > > > [ 7858.792957] [] page_cache_tree_insert+0xd8/0x110 > > > [ 7858.792962] [] __add_to_page_cache_locked+0xf8/0x4e0 > > > [ 7858.792967] [] add_to_page_cache_lru+0x50/0x1ac > > > [ 7858.792972] [] pagecache_get_page+0x468/0x57c > > > [ 7858.792979] [] __get_node_page+0x84/0x764 > > > [ 7858.792986] [] f2fs_iget+0x264/0xdc8 > > > [ 7858.792991] [] f2fs_lookup+0x3b4/0x660 > > > [ 7858.792998] [] lookup_slow+0x1e4/0x348 > > > [ 7858.793003] [] walk_component+0x21c/0x320 > > > [ 7858.793008] [] path_lookupat+0x90/0x1bc > > > [ 7858.793013] [] filename_lookup+0x8c/0x1a0 > > > [ 7858.793018] [] vfs_fstatat+0x84/0x10c > > > [ 7858.793023] [] SyS_newfstatat+0x28/0x64 > > > > > > v4.9 kenrel already has the d3798ae8c6f3,("mm: filemap: don't > > > plant shadow entries without radix tree node") so I thought > > > it should be okay. When I was googling, I found others report > > > such problem and I think current kernel still has the problem. > > > > > > https://bugzilla.redhat.com/show_bug.cgi?id=1431567 > > > https://bugzilla.redhat.com/show_bug.cgi?id=1420335 > > > > > > It assumes shadow entry of radix tree relies on the init state > > > that node->private_list allocated should be list_empty state. > > > Currently, it's initailized in SLAB constructor which means > > > node of radix tree would be initialized only when *slub allocates > > > new page*, not *new object*. So, if some FS or subsystem pass > > > gfp_mask to __GFP_ZERO, slub allocator will do memset blindly. > > > That means allocated node can have !list_empty(node->private_list). > > > It ends up calling NULL deference at workingset_update_node by > > > failing list_empty check. > > > > > > This patch should fix it. > > > > > > Fixes: 449dd6984d0e ("mm: keep page cache radix tree nodes in check") > > > Reported-by: Chris Fries > > > Cc: Johannes Weiner > > > Cc: Jan Kara > > > Signed-off-by: Minchan Kim > > > > Regardless of whether it makes sense to use __GFP_ZERO from the upper > > layer or not, it is subtle as hell to rely on the pre-existing state > > for a newly allocated object. So yes this makes perfect sense. > > > > Do we want CC: stable? > > Acked-by: Michal Hocko > > Well, for hot allocations we do rely on previous state a lot. After all > that's what slab constructor was created for. Whether radix tree node > allocation is such a hot path is a question for debate, I agree. I really doubt that LIST_INIT is something to notice for the radix tree allocation. So I would rather have safe code than rely on the previous state which is really subtle. Btw. I am not a huge fan of ctor semantic as we have it. I am not really sure all users understand when it is called... -- Michal Hocko SUSE Labs