Received: by 2002:a05:6a10:1287:0:0:0:0 with SMTP id d7csp5397049pxv; Wed, 28 Jul 2021 09:45:35 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzd1VxQ8Nl6OxNiwBxnalXLXS6NZmvBj5YX79O6G/hRiTPeXfdWiEf1nO1VYYnS9VSDlAyd X-Received: by 2002:a50:8dcc:: with SMTP id s12mr890029edh.105.1627490735542; Wed, 28 Jul 2021 09:45:35 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1627490735; cv=none; d=google.com; s=arc-20160816; b=BtRgYmT7TV1CdzfcoNxHfvbSeYrzStLilTvwHLMavUFe0DR8f2a4G1uHXrHuNO2FdY OK+dgOeLUJjabr5NFcZFDVUXY8uy6Ou1W2YDzKk9INDud8ygHqyZZRKptxBuBB67fCh6 8u08FzVjp5k28/fRkAXgfk2AVmdMfy1gQ+wadRUql8OW4rtvHYyqqCI9Pa5ofL6euXS0 vGoZndGyivVnNK4pTOBDmB9PP3bMePDDiqb5vruWAdQcU7VmX488ODMQa5YIOWupLQil +zRYn1LUgE+rIHQbHnNE6YEPSnGgnbDouZPQJAcejt/zWiMr1yYAzCvfF1xWzFsO4Xlw rREQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=03doPF0Hm9VX9+U1BaRb0OdFxpJZ53fMHg22McZ99ig=; b=nFjd7QKS1AF23NNdjex8tdFUiyK0y/CoY/jOLl2atVh8YnoPNk8m1maywKMY6JAQlr GN7y1JhUSS0DBmMzjW0NkNSNfFleLDz3W7VDeL2YrF3TdnjBg07Q8eZBtIL6ORM0BfBY Ba+/VcxRsG83zeXDywKQhl5tDYRVGoyphKD+Jb8dmKg/N88nhCF8n83e1RhOw3h5IFcy 9m8R9XbUV38TH5t8jnmoOLmLnJGU7nV1G5PofakHWq2vAeD2iNSw2RPcdpRkavm2Ccyl Xnes8NGMWBEstRqT/ESi2ccske0gEZMiehGIGmmC6UDEwtOzzaiJSTK3bEVAkRrMo+Mc 0oNQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@suse.com header.s=susede1 header.b=eaol6RxX; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=NONE dis=NONE) header.from=suse.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id i18si141256edc.507.2021.07.28.09.45.11; Wed, 28 Jul 2021 09:45:35 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@suse.com header.s=susede1 header.b=eaol6RxX; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=NONE dis=NONE) header.from=suse.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230027AbhG1QnL (ORCPT + 99 others); Wed, 28 Jul 2021 12:43:11 -0400 Received: from smtp-out1.suse.de ([195.135.220.28]:58028 "EHLO smtp-out1.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229581AbhG1QnL (ORCPT ); Wed, 28 Jul 2021 12:43:11 -0400 Received: from relay2.suse.de (relay2.suse.de [149.44.160.134]) by smtp-out1.suse.de (Postfix) with ESMTP id 9AC4A22331; Wed, 28 Jul 2021 16:43:08 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.com; s=susede1; t=1627490588; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=03doPF0Hm9VX9+U1BaRb0OdFxpJZ53fMHg22McZ99ig=; b=eaol6RxXf7+m2YKwFbYesAbm4xgVs/1VQcxOfo++HD7LxE1zGg6ukvWgka8lVscyStIBep e6F0QMB3gKyfxylz7xo9HPQ0proQV82xTS1TyNw0T6g1c0hz0J+T4LZL2hYtLcrqG9dF85 db4KjKnW2o2SWDr5LuCOQWBurbOrTOs= Received: from suse.cz (unknown [10.100.201.86]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by relay2.suse.de (Postfix) with ESMTPS id 918D4A3B83; Wed, 28 Jul 2021 16:43:07 +0000 (UTC) Date: Wed, 28 Jul 2021 18:43:07 +0200 From: Michal Hocko To: Shakeel Butt Cc: Roman Gushchin , Wang Hai , Christoph Lameter , Pekka Enberg , David Rientjes , Joonsoo Kim , Andrew Morton , Vlastimil Babka , Johannes Weiner , Alexei Starovoitov , wangkefeng.wang@huawei.com, Linux MM , LKML Subject: Re: [PATCH] mm/memcg: fix NULL pointer dereference in memcg_slab_free_hook() Message-ID: References: <20210728091348.272714-1-wanghai38@huawei.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed 28-07-21 07:10:26, Shakeel Butt wrote: > +Roman > > On Wed, Jul 28, 2021 at 6:23 AM Michal Hocko wrote: > > > > On Wed 28-07-21 17:13:48, Wang Hai wrote: > > > When I use kfree_rcu() to free a large memory allocated by > > > kmalloc_node(), the following dump occurs. > > > > > > BUG: kernel NULL pointer dereference, address: 0000000000000020 > > > [...] > > > Oops: 0000 [#1] SMP > > > [...] > > > Workqueue: events kfree_rcu_work > > > RIP: 0010:__obj_to_index include/linux/slub_def.h:182 [inline] > > > RIP: 0010:obj_to_index include/linux/slub_def.h:191 [inline] > > > RIP: 0010:memcg_slab_free_hook+0x120/0x260 mm/slab.h:363 > > > [...] > > > Call Trace: > > > kmem_cache_free_bulk+0x58/0x630 mm/slub.c:3293 > > > kfree_bulk include/linux/slab.h:413 [inline] > > > kfree_rcu_work+0x1ab/0x200 kernel/rcu/tree.c:3300 > > > process_one_work+0x207/0x530 kernel/workqueue.c:2276 > > > worker_thread+0x320/0x610 kernel/workqueue.c:2422 > > > kthread+0x13d/0x160 kernel/kthread.c:313 > > > ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:294 > > > > > > When kmalloc_node() a large memory, page is allocated, not slab, > > > so when freeing memory via kfree_rcu(), this large memory should not > > > be used by memcg_slab_free_hook(), because memcg_slab_free_hook() is > > > is used for slab. > > > > > > So in this case, there is no need to do anything with this large > > > page in memcg_slab_free_hook(), just skip it. > > > > > > Fixes: 270c6a71460e ("mm: memcontrol/slab: Use helpers to access slab page's memcg_data") > > > > Are you sure that this commit is really breaking the code. Unless I have > > missed something there shouldn't be any real change wrt. large > > allocations here. page_has_obj_cgroups is just a different name for what > > what page_objcgs is giving us. > > Actually they are different. For MEMCG_DATA_KMEM page, > page_has_obj_cgroups() will return false while page_objcgs() on > non-VM_DEBUG kernels will return "struct obj_cgroup *" instead of > "struct obj_cgroup **". Right. Thanks for the clarification. I have missed that subtle difference. -- Michal Hocko SUSE Labs