Received: by 2002:a05:6a10:c7c6:0:0:0:0 with SMTP id h6csp1490799pxy; Mon, 2 Aug 2021 03:01:53 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwrjWnn95BpqAjn1JEy7VNmHRGE026e2HaDN9XP0QrLmIdKIALM76gtTPnz9YDsUdTAsL71 X-Received: by 2002:a05:6402:d5a:: with SMTP id ec26mr18472788edb.4.1627898513759; Mon, 02 Aug 2021 03:01:53 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1627898513; cv=none; d=google.com; s=arc-20160816; b=NJd+dEqTW4OuwpfER9LxuSnOef+uxwwuf+qHS38rzYUQohk+POjAwvuXGF7LpDQWtA eZpMEMxVO36Co57kEWkBHZSdUjRbsd+gOVo97l7e3w5j9J2HMKF5HJ8LDylTbyX0DrnD xvHUa2iwxOuV07OD1MDbx0LztFzoUcSoRNCVy2I5cQfBTKDMuILusi+9kckd5jBQUSu3 uMHEQYmJiqKSK0wG+tSTYo5YOixM0GHaLxKT0QsO+uMuOi09RREKDPn/cAZjJyxNkRyD P0MeTkuqqAZmhvj3ZczlehxFqN7Rj4S2uWfZHu0+N9zJq8iSmOvA810sdfpy60yNo2Ny bdPg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject; bh=KAB7ykqvbFssCp3RK6Kg6yqNfz6tLlSXVo0fxogaOoI=; b=vwJiZylZA9jEO0d9XqLVvG/CVxKIb32iDqb4IagS2YOpaEhoiKao3UB+M848Rw5ipx 4wUHT+SYIEyuB3npe/6PAE/ryrH24KjeFkQWihhdcq/wR4NzUBklsYyd90bVlGojr6qL FTrrWTJvImGT8WsHrKT26rBp6zGkyVcEl3iFgPQYJprEpjdNhSK/dQdv6ArUCvqg6PTX GakgYRsECq/4dHiBYngRaKiAqkPVjeREdIGMHNhIkDkoCx33CbZQZFKuFMAp6VIFPuOp X+ec/srdSM6XjjefUQxx38KLK0BG7dtQlmEP7kEMMHM/wcHfrqcV0hPtNDajVCx6gjaF 2TmQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=huawei.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id jw3si8907726ejc.625.2021.08.02.03.01.30; Mon, 02 Aug 2021 03:01:53 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=huawei.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233160AbhHBKAX (ORCPT + 99 others); Mon, 2 Aug 2021 06:00:23 -0400 Received: from szxga02-in.huawei.com ([45.249.212.188]:7913 "EHLO szxga02-in.huawei.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232973AbhHBKAW (ORCPT ); Mon, 2 Aug 2021 06:00:22 -0400 Received: from dggeme703-chm.china.huawei.com (unknown [172.30.72.53]) by szxga02-in.huawei.com (SkyGuard) with ESMTP id 4GdYLF5kG6z82wG; Mon, 2 Aug 2021 17:56:21 +0800 (CST) Received: from [10.174.179.25] (10.174.179.25) by dggeme703-chm.china.huawei.com (10.1.199.99) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_CBC_SHA256_P256) id 15.1.2176.2; Mon, 2 Aug 2021 18:00:10 +0800 Subject: Re: [PATCH 4/5] mm, memcg: avoid possible NULL pointer dereferencing in mem_cgroup_init() To: Michal Hocko CC: Roman Gushchin , , , , , , , , , , , References: <20210729125755.16871-1-linmiaohe@huawei.com> <20210729125755.16871-5-linmiaohe@huawei.com> From: Miaohe Lin Message-ID: <47daf062-f510-edb3-6ec7-f8e7615ad8a0@huawei.com> Date: Mon, 2 Aug 2021 18:00:10 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.6.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset="utf-8" Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.174.179.25] X-ClientProxiedBy: dggems701-chm.china.huawei.com (10.3.19.178) To dggeme703-chm.china.huawei.com (10.1.199.99) X-CFilter-Loop: Reflected Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2021/8/2 14:43, Michal Hocko wrote: > On Sat 31-07-21 10:05:51, Miaohe Lin wrote: >> On 2021/7/30 14:44, Michal Hocko wrote: >>> On Thu 29-07-21 20:12:43, Roman Gushchin wrote: >>>> On Thu, Jul 29, 2021 at 08:57:54PM +0800, Miaohe Lin wrote: >>>>> rtpn might be NULL in very rare case. We have better to check it before >>>>> dereferencing it. Since memcg can live with NULL rb_tree_per_node in >>>>> soft_limit_tree, warn this case and continue. >>>>> >>>>> Signed-off-by: Miaohe Lin >>>>> --- >>>>> mm/memcontrol.c | 2 ++ >>>>> 1 file changed, 2 insertions(+) >>>>> >>>>> diff --git a/mm/memcontrol.c b/mm/memcontrol.c >>>>> index 5b4592d1e0f2..70a32174e7c4 100644 >>>>> --- a/mm/memcontrol.c >>>>> +++ b/mm/memcontrol.c >>>>> @@ -7109,6 +7109,8 @@ static int __init mem_cgroup_init(void) >>>>> rtpn = kzalloc_node(sizeof(*rtpn), GFP_KERNEL, >>>>> node_online(node) ? node : NUMA_NO_NODE); >>>>> >>>>> + if (WARN_ON_ONCE(!rtpn)) >>>>> + continue; >>>> >>>> I also really doubt that it makes any sense to continue in this case. >>>> If this allocations fails (at the very beginning of the system's life, it's an __init function), >>>> something is terribly wrong and panic'ing on a NULL-pointer dereference sounds like >>>> a perfect choice. >>> >>> Moreover this is 24B allocation during early boot. Kernel will OOM and >>> panic when not being able to find any victim. I do not think we need to >> >> Agree with you. But IMO it may not be a good idea to leave the rtpn without NULL check. We should defend >> it though it could hardly happen. But I'm not insist on this check. I will drop this patch if you insist. > > It is not that I would insist. I just do not see any point in the code > churn. This check is not going to ever trigger and there is nothing you > can do to recover anyway so crashing the kernel is likely the only > choice left. > I hope I get the point now. What you mean is nothing we can do to recover and panic'ing on a NULL-pointer dereference is a perfect choice ? Should we declare that we leave the rtpn without NULL check on purpose like below ? Many thanks. @@ -7109,8 +7109,12 @@ static int __init mem_cgroup_init(void) rtpn = kzalloc_node(sizeof(*rtpn), GFP_KERNEL, node_online(node) ? node : NUMA_NO_NODE); - if (WARN_ON_ONCE(!rtpn)) - continue; + /* + * If this allocation fails (at the very beginning of the + * system's life, it's an __init function), something is + * terribly wrong and panic'ing on a NULL-pointer + * dereference sounds like a perfect choice. + */ rtpn->rb_root = RB_ROOT; rtpn->rb_rightmost = NULL; spin_lock_init(&rtpn->lock);