Received: by 2002:a05:6358:11c7:b0:104:8066:f915 with SMTP id i7csp1228244rwl; Wed, 12 Apr 2023 09:48:35 -0700 (PDT) X-Google-Smtp-Source: AKy350YA/rug3FrVXrUaRFrrDPU9EhOa3Evechna0yJQwXd494rTTF6nv6WqxIf96uXNsc2SLyZX X-Received: by 2002:a62:1717:0:b0:635:cfa6:ee5d with SMTP id 23-20020a621717000000b00635cfa6ee5dmr8561817pfx.7.1681318115149; Wed, 12 Apr 2023 09:48:35 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1681318115; cv=none; d=google.com; s=arc-20160816; b=hNsssCiMeRmc340omUUkZXnBO4O0zOc4JlQubAYU3VURPDTYvFqJDYFmmuEzyA/UeQ LKg270HgO0Qrj45GHiWV1bia1hUw8TH9OqdCYv81lA+7+usFEGMriebDfwLCa+zPPCs4 Od0m4YrO+hxTTvtndKgtwXAdrlcMgbCDi8pj/hssq1AaHVLku/MuUh6BQcWGaJ7vFwDT 1zGFY8rPKgpFN94ThTz0jx7UlfsgMdwvF6uVYRJUdBP4ClET4xnwDxRzahae+Ld1s+dD CawQNflIVlk97XzryJm+Om16YlgVPYo2JvDLYnP5jadmkfxyrGgrivpKiYjMOZhBIQr+ SivQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id:dkim-signature; bh=mblYIOQp15DPpNJLZonRj1Q4Mutz9sHweMpSOIOysuo=; b=a+AC9Gy4PKEsdJog7qdK7+dZriXGhvqukB2PTrrTtoTVGsKmN9ODvZgneRhc/SDssR NXfIm0vV7hmdnr+URneCbveNgMDQCZqPgkEhg+5VX2B217RHiiRVj6GhuVmm7puJs8BN CL+LPwQzEIDRGQASbuSjIwambdg7txLJ5xhFzXHntuSOXfOfw+kzmiG0YqUZCZfkoOU/ c4y10ItqbAVfC88blaeE2E6TbAYMINpglZbGYZcP2zzaTc5VMhDprsqWn/lSQQgcHt5i etMVnqRMUFjmuOQJnSOzIfZ9UbgUAREnRsSY2CFCjYF12vJlgkUI0CNQmkEvtSBvMCGW nBRQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@bytedance.com header.s=google header.b=erPiXlRd; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=bytedance.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id bs7-20020a632807000000b005138e3da7e7si17074559pgb.467.2023.04.12.09.48.22; Wed, 12 Apr 2023 09:48:35 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@bytedance.com header.s=google header.b=erPiXlRd; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=bytedance.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231492AbjDLQqr (ORCPT + 99 others); Wed, 12 Apr 2023 12:46:47 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:42562 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230136AbjDLQq0 (ORCPT ); Wed, 12 Apr 2023 12:46:26 -0400 Received: from mail-pl1-x629.google.com (mail-pl1-x629.google.com [IPv6:2607:f8b0:4864:20::629]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 7BA628A56 for ; Wed, 12 Apr 2023 09:45:02 -0700 (PDT) Received: by mail-pl1-x629.google.com with SMTP id d9443c01a7336-1a5266d204dso3776075ad.1 for ; Wed, 12 Apr 2023 09:45:02 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=bytedance.com; s=google; t=1681317892; x=1683909892; h=content-transfer-encoding:in-reply-to:from:references:cc:to :content-language:subject:user-agent:mime-version:date:message-id :from:to:cc:subject:date:message-id:reply-to; bh=mblYIOQp15DPpNJLZonRj1Q4Mutz9sHweMpSOIOysuo=; b=erPiXlRdUqCIakRu1oMia3Dog0aIQXlkglwS6vTrMxlss7fwcC+h+CTYbqa1cPR5H/ dKdLJa+uCjEG41vNTzeplvpf8rTd8HxLjoxXX/G3XvC3g9HeHbRcUYbJvnf2h0tpBRZN 4wrAirJiciiw/Jc6xh89K4G6K4i4bAUANPAFlj7Q4YDNWcTnZpR0t2t/U4/ZSZ/GcH1C pMGIuEsx+S5VTpO0f0GnX4bFUKdrDUBS7243d7a3TVGZ269Rj34Ojq1Xbfb37bBOA6th dUHli54N2TWld3+dGHW1ieY9Bkh4I9wkeirVd5Y23BH8rlRKJmvAp8jZajpJ1mUFgX19 WmBA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1681317892; x=1683909892; h=content-transfer-encoding:in-reply-to:from:references:cc:to :content-language:subject:user-agent:mime-version:date:message-id :x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=mblYIOQp15DPpNJLZonRj1Q4Mutz9sHweMpSOIOysuo=; b=IAo04hFgwtdADxQX/l+MOcOANK5ici7iTv8Mn15VXxak+2fX2fLnjn9IK9PPpzulgd SCEPQiqWlr2e30w42VJdELykInH3mTRkFw/8/OeYkvj8dPuaA5Gs6bOjTI7ZIu/X9+Wz L/rTSqeG1Q53djOFFAYcfiPa6XL7FYUJ0uFJofyVFsWePeoa+Sx4ij9cQkdJ6pReSVq/ osQYL3upYDHXchstsEF3fUPSepQzKxs7+InnvVYF6Bjc8126ExXUY8SUhhInnaSqyYok nLkHnmswOzrZMZEMSA9haxxmjmAydl0sJdVptZry7FYIzLmCMow1xPAi3XfZ+YdT6HDs 9TRA== X-Gm-Message-State: AAQBX9dX6LTAsE3na8IFVmA1033x3sG0+VndOZpe3r62B6brRCmFLl7u cnk0Ecq5FwwXFyUHtUfX2fjTriR3R4//ZKPWGhU= X-Received: by 2002:a17:902:8f82:b0:19d:1e21:7f59 with SMTP id z2-20020a1709028f8200b0019d1e217f59mr19185705plo.0.1681317892609; Wed, 12 Apr 2023 09:44:52 -0700 (PDT) Received: from [10.200.10.123] ([139.177.225.225]) by smtp.gmail.com with ESMTPSA id iz14-20020a170902ef8e00b001a2104d706fsm1873858plb.225.2023.04.12.09.44.46 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Wed, 12 Apr 2023 09:44:52 -0700 (PDT) Message-ID: <76e15f10-d3f1-2cab-63cb-25aa3b4f2cd4@bytedance.com> Date: Thu, 13 Apr 2023 00:44:42 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:102.0) Gecko/20100101 Thunderbird/102.9.1 Subject: Re: [PATCH] mm: slub: annotate kmem_cache_node->list_lock as raw_spinlock Content-Language: en-US To: Peter Zijlstra Cc: Vlastimil Babka , "Zhang, Qiang1" , Boqun Feng , Qi Zheng , "42.hyeyoo@gmail.com" <42.hyeyoo@gmail.com>, "akpm@linux-foundation.org" , "roman.gushchin@linux.dev" , "iamjoonsoo.kim@lge.com" , "rientjes@google.com" , "penberg@kernel.org" , "cl@linux.com" , "linux-mm@kvack.org" , "linux-kernel@vger.kernel.org" , Zhao Gongyi , Sebastian Andrzej Siewior , Thomas Gleixner , RCU , "Paul E . McKenney" References: <20230411130854.46795-1-zhengqi.arch@bytedance.com> <932bf921-a076-e166-4f95-1adb24d544cf@bytedance.com> <20230412124735.GE628377@hirez.programming.kicks-ass.net> From: Qi Zheng In-Reply-To: <20230412124735.GE628377@hirez.programming.kicks-ass.net> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-3.2 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,NICE_REPLY_A,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2023/4/12 20:47, Peter Zijlstra wrote: > On Wed, Apr 12, 2023 at 08:50:29AM +0200, Vlastimil Babka wrote: > >>> --- a/lib/debugobjects.c >>> +++ b/lib/debugobjects.c >>> @@ -562,10 +562,10 @@ __debug_object_init(void *addr, const struct debug_obj_descr *descr, int onstack >>> unsigned long flags; >>> >>> /* >>> - * On RT enabled kernels the pool refill must happen in preemptible >>> + * The pool refill must happen in preemptible >>> * context: >>> */ >>> - if (!IS_ENABLED(CONFIG_PREEMPT_RT) || preemptible()) >>> + if (preemptible()) >>> fill_pool(); >> >> +CC Peterz >> >> Aha so this is in fact another case where the code is written with >> actual differences between PREEMPT_RT and !PREEMPT_RT in mind, but >> CONFIG_PROVE_RAW_LOCK_NESTING always assumes PREEMPT_RT? > > Ooh, tricky, yes. PROVE_RAW_LOCK_NESTING always follows the PREEMP_RT > rules and does not expect trickery like the above. > > Something like the completely untested below might be of help.. > > --- > diff --git a/include/linux/lockdep_types.h b/include/linux/lockdep_types.h > index d22430840b53..f3120d6a7d9e 100644 > --- a/include/linux/lockdep_types.h > +++ b/include/linux/lockdep_types.h > @@ -33,6 +33,7 @@ enum lockdep_wait_type { > enum lockdep_lock_type { > LD_LOCK_NORMAL = 0, /* normal, catch all */ > LD_LOCK_PERCPU, /* percpu */ > + LD_LOCK_WAIT, /* annotation */ > LD_LOCK_MAX, > }; > > diff --git a/kernel/locking/lockdep.c b/kernel/locking/lockdep.c > index 50d4863974e7..a4077f5bb75b 100644 > --- a/kernel/locking/lockdep.c > +++ b/kernel/locking/lockdep.c > @@ -2279,8 +2279,9 @@ static inline bool usage_skip(struct lock_list *entry, void *mask) > * As a result, we will skip local_lock(), when we search for irq > * inversion bugs. > */ > - if (entry->class->lock_type == LD_LOCK_PERCPU) { > - if (DEBUG_LOCKS_WARN_ON(entry->class->wait_type_inner < LD_WAIT_CONFIG)) > + if (entry->class->lock_type != LD_LOCK_NORMAL) { > + if (entry->class->lock_type == LD_LOCK_PERCPU && > + DEBUG_LOCKS_WARN_ON(entry->class->wait_type_inner < LD_WAIT_CONFIG)) > return false; > > return true; > @@ -4752,7 +4753,8 @@ static int check_wait_context(struct task_struct *curr, struct held_lock *next) > > for (; depth < curr->lockdep_depth; depth++) { > struct held_lock *prev = curr->held_locks + depth; > - u8 prev_inner = hlock_class(prev)->wait_type_inner; > + struct lock_class *class = hlock_class(prev); > + u8 prev_inner = class->wait_type_inner; > > if (prev_inner) { > /* > @@ -4762,6 +4764,12 @@ static int check_wait_context(struct task_struct *curr, struct held_lock *next) > * Also due to trylocks. > */ > curr_inner = min(curr_inner, prev_inner); > + > + /* > + * Allow override for annotations. > + */ > + if (unlikely(class->lock_type == LD_LOCK_WAIT)) > + curr_inner = prev_inner; > } > } > > diff --git a/lib/debugobjects.c b/lib/debugobjects.c > index df86e649d8be..fae71ef72a16 100644 > --- a/lib/debugobjects.c > +++ b/lib/debugobjects.c > @@ -565,8 +565,16 @@ __debug_object_init(void *addr, const struct debug_obj_descr *descr, int onstack > * On RT enabled kernels the pool refill must happen in preemptible > * context: > */ > - if (!IS_ENABLED(CONFIG_PREEMPT_RT) || preemptible()) > + if (!IS_ENABLED(CONFIG_PREEMPT_RT) || preemptible()) { > + static lockdep_map dep_map = { static struct lockdep_map dep_map = { > + .name = "wait-type-override", > + .wait_type_inner = LD_WAIT_SLEEP, > + .lock_type = LD_LOCK_WAIT, > + }; > + lock_map_acquire(&dep_map); > fill_pool(); > + lock_map_release(&dep_map); > + } > > db = get_bucket((unsigned long) addr); > I just tested the above code, and then got the following warning: [ 0.001000][ T0] ============================= [ 0.001000][ T0] [ BUG: Invalid wait context ] [ 0.001000][ T0] 6.3.0-rc6-next-20230412+ #21 Not tainted [ 0.001000][ T0] ----------------------------- [ 0.001000][ T0] swapper/0/0 is trying to lock: [ 0.001000][ T0] ffffffff825bcb80 (wait-type-override){....}-{4:4}, at: __debug_object_init+0x0/0x590 [ 0.001000][ T0] other info that might help us debug this: [ 0.001000][ T0] context-{5:5} [ 0.001000][ T0] 2 locks held by swapper/0/0: [ 0.001000][ T0] #0: ffffffff824f5178 (timekeeper_lock){....}-{2:2}, at: timekeeping_init+0xf1/0x270 [ 0.001000][ T0] #1: ffffffff824f5008 (tk_core.seq.seqcount){....}-{0:0}, at: start_kernel+0x31a/0x800 [ 0.001000][ T0] stack backtrace: [ 0.001000][ T0] CPU: 0 PID: 0 Comm: swapper/0 Not tainted 6.3.0-rc6-next-20230412+ #21 [ 0.001000][ T0] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.14.0-2 04/01/2014 [ 0.001000][ T0] Call Trace: [ 0.001000][ T0] [ 0.001000][ T0] dump_stack_lvl+0x77/0xc0 [ 0.001000][ T0] __lock_acquire+0xa74/0x2960 [ 0.001000][ T0] ? save_trace+0x3f/0x320 [ 0.001000][ T0] ? add_lock_to_list+0x97/0x130 [ 0.001000][ T0] lock_acquire+0xe0/0x300 [ 0.001000][ T0] ? debug_object_active_state+0x180/0x180 [ 0.001000][ T0] __debug_object_init+0x47/0x590 [ 0.001000][ T0] ? debug_object_active_state+0x180/0x180 [ 0.001000][ T0] ? lock_acquire+0x100/0x300 [ 0.001000][ T0] hrtimer_init+0x23/0xc0 [ 0.001000][ T0] ntp_init+0x70/0x80 [ 0.001000][ T0] timekeeping_init+0x12c/0x270 [ 0.001000][ T0] ? start_kernel+0x31a/0x800 [ 0.001000][ T0] ? _printk+0x5c/0x80 [ 0.001000][ T0] start_kernel+0x31a/0x800 [ 0.001000][ T0] secondary_startup_64_no_verify+0xf4/0xfb [ 0.001000][ T0] It seems that the LD_WAIT_SLEEP we set is already greater than the LD_WAIT_SPIN of the current context. -- Thanks, Qi