Received: by 2002:a05:6358:9144:b0:117:f937:c515 with SMTP id r4csp10981354rwr; Fri, 12 May 2023 16:41:17 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ6fCEq/MXk7hh8FjmxAPEdos4IBq3W2PUiCYgU+da5VzT4D4MG76HEA76cAa+gundgw/4eB X-Received: by 2002:a17:902:ea0b:b0:1ac:7d8a:365 with SMTP id s11-20020a170902ea0b00b001ac7d8a0365mr23906568plg.26.1683934876685; Fri, 12 May 2023 16:41:16 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1683934876; cv=none; d=google.com; s=arc-20160816; b=tlJ93RigyBN4u9tWV3eiWLUPBYDw1OLtBi+0hcIqfVAPu5QCIg1kuP1+/DNMhzQTbe iNwneHRUBXbAUBrjcGtCycYtZmkfrKd7XVOP8LRLJf6szj7AB/gvnsHxalqOd8Iayoxb ezhJIWKeZtNuEytihM60s5VZCAp0HpmfcnvB+cn9nBK2YYEHVQoAqPncl4oktmW+E+cv vdGO00DUWNL3DDAIvdJwJJjTafNbe9+rkPYvTa/O78mbrFfyw45hU54g6MyTEXxJVZnI 8Vr7Z6rLiTtyi9bEgqC8AUHHW79+2us8hiO99smOwfGnWxyqc3D4MNPej3BiO8OtUxlt 3x+g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id; bh=WwD37z9h6jdNDySCqsq+5Bn22j80TnDXuDcqLh+8ujs=; b=VMZhGV0yveW1BhCzm0Wo/KfSf8dfFp8r7JeaJTXet0pl0bgqqMYK6xRVpmzuFAHmDy GMHv5+wlUlQ5be0q3Wi21UElhHXdEqxeF4s5dhaAH0ZLthoWN8qpnoSaeuHJQL2QNJQ2 Gq+iPinsOiFpdzLPECvRgZ8r+RDD8F5+3A7WOssU6ZaQAxalAgbAh6nyloygFDxZUuMR NUDeF3MWHcMxSE97WHRY0zqMeDdYBJnJKxQcL6fzMHyBhy7kCA7bnfrdDzmyLP/4YJcn wn0oElyDHqBbp2MsJM4ggiVpnSsCvkerl0yWmG1SHqGa+wAPzGo85yw9Z4rkAUJTUYQ4 Nwbg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id 21-20020a170902c11500b001a66c437b2esi10034882pli.20.2023.05.12.16.41.02; Fri, 12 May 2023 16:41:16 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S240533AbjELXPU (ORCPT + 99 others); Fri, 12 May 2023 19:15:20 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:35340 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230202AbjELXPS (ORCPT ); Fri, 12 May 2023 19:15:18 -0400 Received: from www262.sakura.ne.jp (www262.sakura.ne.jp [202.181.97.72]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 4AE936EA1 for ; Fri, 12 May 2023 16:15:16 -0700 (PDT) Received: from fsav119.sakura.ne.jp (fsav119.sakura.ne.jp [27.133.134.246]) by www262.sakura.ne.jp (8.15.2/8.15.2) with ESMTP id 34CNEgSO017364; Sat, 13 May 2023 08:14:42 +0900 (JST) (envelope-from penguin-kernel@I-love.SAKURA.ne.jp) Received: from www262.sakura.ne.jp (202.181.97.72) by fsav119.sakura.ne.jp (F-Secure/fsigk_smtp/550/fsav119.sakura.ne.jp); Sat, 13 May 2023 08:14:42 +0900 (JST) X-Virus-Status: clean(F-Secure/fsigk_smtp/550/fsav119.sakura.ne.jp) Received: from [192.168.1.6] (M106072142033.v4.enabler.ne.jp [106.72.142.33]) (authenticated bits=0) by www262.sakura.ne.jp (8.15.2/8.15.2) with ESMTPSA id 34CNDROl017213 (version=TLSv1.2 cipher=AES256-GCM-SHA384 bits=256 verify=NO); Sat, 13 May 2023 08:13:27 +0900 (JST) (envelope-from penguin-kernel@I-love.SAKURA.ne.jp) Message-ID: <61d7ac52-3352-b7cd-8908-890a29a007d9@I-love.SAKURA.ne.jp> Date: Sat, 13 May 2023 08:13:27 +0900 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.10.1 Subject: Re: [PATCH] debugobject: don't wake up kswapd from fill_pool() Content-Language: en-US To: Thomas Gleixner , Andrew Morton Cc: syzbot , syzkaller-bugs@googlegroups.com, Ingo Molnar , linux-kernel@vger.kernel.org, linux-mm References: <000000000000008ddb05fb5e2576@google.com> <6577e1fa-b6ee-f2be-2414-a2b51b1c5e30@I-love.SAKURA.ne.jp> <20230511204458.819f9009d2ef8b46cc163191@linux-foundation.org> <87v8gxbthf.ffs@tglx> <39c79d27-73ea-06a8-62fe-2b64d0fd8db5@I-love.SAKURA.ne.jp> <87bkipbeyr.ffs@tglx> From: Tetsuo Handa In-Reply-To: <87bkipbeyr.ffs@tglx> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-3.8 required=5.0 tests=BAYES_00,NICE_REPLY_A, SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2023/05/13 3:07, Thomas Gleixner wrote: > On Fri, May 12 2023 at 22:09, Tetsuo Handa wrote: >> On 2023/05/12 21:54, Thomas Gleixner wrote: >>> On Fri, May 12 2023 at 19:57, Tetsuo Handa wrote: >>>> On 2023/05/12 12:44, Andrew Morton wrote: >>>>> On Thu, 11 May 2023 22:47:32 +0900 Tetsuo Handa wrote: >>>>> >>>>>> syzbot is reporting lockdep warning in fill_pool(), for GFP_ATOMIC is >>>>>> (__GFP_HIGH | __GFP_KSWAPD_RECLAIM) which wakes up kswapd. >>>>>> Since fill_pool() might be called with arbitrary locks held, >>>>>> fill_pool() should not assume that holding pgdat->kswapd_wait is safe. >>> >>> https://lore.kernel.org/lkml/871qjldbes.ffs@tglx/ >> >> .config says IS_ENABLED(CONFIG_PREEMPT_RT) == false, and lockdep says about >> base->lock => pgdat->kswapd_wait => p->pi_lock => rq->__lock => base->lock >> dependency but does not say about db->lock. >> >> How can your patch fix this problem? > > It's described in the changelog, no? I can't find a proof that lookup_object() never returns NULL when debug_object_activate() is called. > > The main change is to make the refill invocation conditional when the > lookup fails. That's how that code has been from day one. Making refill conditional helps reducing frequency of doing allocations. I want a proof that allocations never happens in the worst scenario. Are you saying that some debugobject function other than debug_object_activate() guarantees that memory for that object was already allocated before debug_object_activate() is called for the first time for that object, _and_ such debugobject function is called without locks held? > > The patch which closed the race recently wreckaged those refill > oportunities and the fix for that introduced this problem. > > Thanks, > > tglx