Date: Thu, 3 Mar 2022 18:48:24 +0900
From: Byungchul Park <byungchul.park@lge.com>
To: Hyeonggon Yoo <42.hyeyoo@gmail.com>
Cc: torvalds@linux-foundation.org, damien.lemoal@opensource.wdc.com,
    linux-ide@vger.kernel.org, adilger.kernel@dilger.ca,
    linux-ext4@vger.kernel.org, mingo@redhat.com,
    linux-kernel@vger.kernel.org, peterz@infradead.org, will@kernel.org,
    tglx@linutronix.de, rostedt@goodmis.org, joel@joelfernandes.org,
    sashal@kernel.org, daniel.vetter@ffwll.ch, chris@chris-wilson.co.uk,
    duyuyang@gmail.com, johannes.berg@intel.com, tj@kernel.org,
    tytso@mit.edu, willy@infradead.org, david@fromorbit.com,
    amir73il@gmail.com, bfields@fieldses.org, gregkh@linuxfoundation.org,
    kernel-team@lge.com, linux-mm@kvack.org, akpm@linux-foundation.org,
    mhocko@kernel.org, minchan@kernel.org, hannes@cmpxchg.org,
    vdavydov.dev@gmail.com, sj@kernel.org, jglisse@redhat.com,
    dennis@kernel.org, cl@linux.com, penberg@kernel.org,
    rientjes@google.com, vbabka@suse.cz, ngupta@vflare.org,
    linux-block@vger.kernel.org, paolo.valente@linaro.org,
    josef@toxicpanda.com, linux-fsdevel@vger.kernel.org,
    viro@zeniv.linux.org.uk, jack@suse.cz, jack@suse.com,
    jlayton@kernel.org, dan.j.williams@intel.com, hch@infradead.org,
    djwong@kernel.org, dri-devel@lists.freedesktop.org, airlied@linux.ie,
    rodrigosiqueiramelo@gmail.com, melissa.srw@gmail.com,
    hamohammed.sa@gmail.com
Subject: Re: [PATCH v3 00/21] DEPT(Dependency Tracker)
Message-ID: <20220303094824.GA24977@X58A-UD3R>
References: <1646042220-28952-1-git-send-email-byungchul.park@lge.com>
 <20220303001812.GA20752@X58A-UD3R>

On Thu, Mar 03, 2022 at 08:03:21AM +0000, Hyeonggon Yoo wrote:
> On Thu, Mar 03, 2022 at 09:18:13AM +0900, Byungchul Park wrote:
> > Hi Hyeonggon,
> >
> > Dept also allows the following scenario when a user guarantees that
> > each lock instance is different from another at a different depth:
> >
> >    lock A0 with depth
> >    lock A1 with depth + 1
> >    lock A2 with depth + 2
> >    lock A3 with depth + 3
> >    (and so on)
> >    ..
> >    unlock A3
> >    unlock A2
> >    unlock A1
> >    unlock A0

Look at this: Dept allows object->lock -> other_object->lock (taken at
a different depth via *_lock_nested()), so it won't report it.
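To make that concrete, here is a minimal sketch of the allowed pattern.
struct obj and lock_object_pair() are made up for illustration; only
spin_lock_nested() and SINGLE_DEPTH_NESTING are the real kernel API:

    #include <linux/spinlock.h>	/* pulls in SINGLE_DEPTH_NESTING (== 1) */

    struct obj {
            spinlock_t lock;
    };

    /*
     * The caller guarantees a and b are *different* instances of the
     * same lock class.  The _nested() annotation tells the validator
     * that b->lock is taken one depth deeper than a->lock, so this is
     * not treated as an AA deadlock.
     */
    static void lock_object_pair(struct obj *a, struct obj *b)
    {
            spin_lock(&a->lock);                              /* depth n     */
            spin_lock_nested(&b->lock, SINGLE_DEPTH_NESTING); /* depth n + 1 */
            /* ... */
            spin_unlock(&b->lock);
            spin_unlock(&a->lock);
    }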
> > However, Dept does not allow the following scenario where another
> > lock class cuts in the dependency chain:
> >
> >    lock A0 with depth
> >    lock B
> >    lock A1 with depth + 1
> >    lock A2 with depth + 2
> >    lock A3 with depth + 3
> >    (and so on)
> >    ..
> >    unlock A3
> >    unlock A2
> >    unlock A1
> >    unlock B
> >    unlock A0
> >
> > This scenario is clearly problematic. What do you think is going to
> > happen with another context running the following?

> First of all, I want to say I'm not an expert at locking primitives.
> I may be wrong.

It's okay. Thanks anyway for your feedback.

> > > 45  * scan_mutex [-> object->lock] -> kmemleak_lock -> other_object->lock (SINGLE_DEPTH_NESTING)
> > > 46  *
> > > 47  * No kmemleak_lock and object->lock nesting is allowed outside scan_mutex
> > > 48  * regions.

> lock order in kmemleak is described above.
>
> and DEPT detects two cases as deadlocks:
>
> 1) object->lock -> other_object->lock

It's not a deadlock *IF* the two are taken at different depths via
*_lock_nested(). Dept allows this case as well, so it wouldn't report
it.

> 2) object->lock -> kmemleak_lock, kmemleak_lock -> other_object->lock

But this usage is risky. I already explained why in the mail you
replied to; I've copied it below.

context A

> >    lock A0 with depth
> >    lock B
> >    lock A1 with depth + 1
> >    lock A2 with depth + 2
> >    lock A3 with depth + 3
> >    (and so on)
> >    ..
> >    unlock A3
> >    unlock A2
> >    unlock A1
> >    unlock B
> >    unlock A0

...

context B

> >    lock A1 with depth
> >    lock B
> >    lock A2 with depth + 1
> >    lock A3 with depth + 2
> >    (and so on)
> >    ..
> >    unlock A3
> >    unlock A2
> >    unlock B
> >    unlock A1

where Ax : object->lock and B : kmemleak_lock.

A deadlock might occur if the two contexts run at the same time.
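Spelled out as code (purely illustrative, not kmemleak's actual code;
a0 and a1 reuse the made-up struct obj from above and stand for two
distinct object instances, b_lock stands for B):

    static struct obj *a0, *a1;         /* distinct instances, same class */
    static DEFINE_SPINLOCK(b_lock);     /* stands for B (kmemleak_lock)   */

    static void context_a(void)
    {
            spin_lock(&a0->lock);           /* A0 at depth 0               */
            spin_lock(&b_lock);             /* B cuts into the chain       */
            spin_lock_nested(&a1->lock, 1); /* A1 at depth 1: blocks if
                                             * context B already holds it  */
            /* ... unlock in reverse ... */
    }

    static void context_b(void)
    {
            spin_lock(&a1->lock);           /* A1 at depth 0 here          */
            spin_lock(&b_lock);             /* blocks: B is held by
                                             * context A                   */
            /* ... */
    }

Context A holds B and waits for A1, while context B holds A1 and waits
for B: an ABBA deadlock between B and the object class. Because A1 is
annotated as subclass 1 in one context but subclass 0 in the other,
per-(sub)class validation alone cannot connect the two chains.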
> And in the kmemleak case, 1) and 2) are not possible because it must
> hold scan_mutex first.

That's another issue. Let's focus on whether the lock order itself is
okay for now.

> I think the author of kmemleak intended lockdep to treat object->lock
> and other_object->lock as different classes, using
> raw_spin_lock_nested().

Yes. The author meant to assign a different class to each depth using a
lockdep API. Strictly speaking, they are the same class, but we assign
a different (sub)class per depth to avoid lockdep splats *IF* the user
guarantees the nesting usage is safe, IOW, guarantees that the lock
instance at each depth is a different instance.
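For reference, that per-depth (sub)class assignment is just the
subclass argument of the *_lock_nested() family. A rough sketch of the
general shape (again using the illustrative struct obj, not taken from
kmemleak):

    /*
     * Each depth i gets lockdep subclass i of the same class.  Lockdep
     * supports at most MAX_LOCKDEP_SUBCLASSES (8) subclasses, so this
     * only works for shallow, bounded nesting -- and only if the
     * caller's guarantee that every obj[i] is a distinct instance
     * actually holds.
     */
    static void lock_chain(struct obj *obj[], int n)
    {
            int i;

            for (i = 0; i < n; i++)
                    spin_lock_nested(&obj[i]->lock, i);
            /* ... */
            for (i = n - 1; i >= 0; i--)
                    spin_unlock(&obj[i]->lock);
    }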
What I was fundamentally asking is: is the nesting lock usage safe for
real? I hope we distinguish between the safe case and the risky case
whenever *_lock_nested() is involved. Thoughts?

Thanks,
Byungchul

> Am I missing something?
>
> Thanks.

> >    lock A1 with depth
> >    lock B
> >    lock A2 with depth + 1
> >    lock A3 with depth + 2
> >    (and so on)
> >    ..
> >    unlock A3
> >    unlock A2
> >    unlock B
> >    unlock A1
> >
> > It's a deadlock. That's why Dept reports this case as a problem. Or
> > am I missing something?
> >
> > Thanks,
> > Byungchul

> > > ---------------------------------------------------
> > > context A's detail
> > > ---------------------------------------------------
> > > context A
> > >    [S] __raw_spin_lock_irqsave(&object->lock:0)
> > >    [W] __raw_spin_lock_irqsave(kmemleak_lock:0)
> > >    [E] spin_unlock(&object->lock:0)
> > >
> > > [S] __raw_spin_lock_irqsave(&object->lock:0):
> > > [] scan_gray_list+0x84/0x13c
> > > stacktrace:
> > >    dept_ecxt_enter+0x88/0xf4
> > >    _raw_spin_lock_irqsave+0xf0/0x1c4
> > >    scan_gray_list+0x84/0x13c
> > >    kmemleak_scan+0x2d8/0x54c
> > >    kmemleak_scan_thread+0xac/0xd4
> > >    kthread+0xd4/0xe4
> > >    ret_from_fork+0x10/0x20
> > >
> > > [W] __raw_spin_lock_irqsave(kmemleak_lock:0):
> > > [] scan_block+0x3c/0x128
> > > stacktrace:
> > >    __dept_wait+0x8c/0xa4
> > >    dept_wait+0x6c/0x88
> > >    _raw_spin_lock_irqsave+0xb8/0x1c4
> > >    scan_block+0x3c/0x128
> > >    scan_gray_list+0xc4/0x13c
> > >    kmemleak_scan+0x2d8/0x54c
> > >    kmemleak_scan_thread+0xac/0xd4
> > >    kthread+0xd4/0xe4
> > >    ret_from_fork+0x10/0x20
> > >
> > > [E] spin_unlock(&object->lock:0):
> > > [] scan_block+0x60/0x128
> > >
> > > ---------------------------------------------------
> > > context B's detail
> > > ---------------------------------------------------
> > > context B
> > >    [S] __raw_spin_lock_irqsave(kmemleak_lock:0)
> > >    [W] _raw_spin_lock_nested(&object->lock:0)
> > >    [E] spin_unlock(kmemleak_lock:0)
> > >
> > > [S] __raw_spin_lock_irqsave(kmemleak_lock:0):
> > > [] scan_block+0x3c/0x128
> > > stacktrace:
> > >    dept_ecxt_enter+0x88/0xf4
> > >    _raw_spin_lock_irqsave+0xf0/0x1c4
> > >    scan_block+0x3c/0x128
> > >    kmemleak_scan+0x19c/0x54c
> > >    kmemleak_scan_thread+0xac/0xd4
> > >    kthread+0xd4/0xe4
> > >    ret_from_fork+0x10/0x20
> > >
> > > [W] _raw_spin_lock_nested(&object->lock:0):
> > > [] scan_block+0xb4/0x128
> > > stacktrace:
> > >    dept_wait+0x74/0x88
> > >    _raw_spin_lock_nested+0xa8/0x1b0
> > >    scan_block+0xb4/0x128
> > >    kmemleak_scan+0x19c/0x54c
> > >    kmemleak_scan_thread+0xac/0xd4
> > >    kthread+0xd4/0xe4
> > >    ret_from_fork+0x10/0x20
> > >
> > > [E] spin_unlock(kmemleak_lock:0):
> > > [] scan_block+0x60/0x128
> > > stacktrace:
> > >    dept_event+0x7c/0xfc
> > >    _raw_spin_unlock_irqrestore+0x8c/0x120
> > >    scan_block+0x60/0x128
> > >    kmemleak_scan+0x19c/0x54c
> > >    kmemleak_scan_thread+0xac/0xd4
> > >    kthread+0xd4/0xe4
> > >    ret_from_fork+0x10/0x20
> > >
> > > ---------------------------------------------------
> > > information that might be helpful
> > > ---------------------------------------------------
> > > CPU: 1 PID: 38 Comm: kmemleak Tainted: G W 5.17.0-rc1+ #1
> > > Hardware name: linux,dummy-virt (DT)
> > > Call trace:
> > >    dump_backtrace.part.0+0x9c/0xc4
> > >    show_stack+0x14/0x28
> > >    dump_stack_lvl+0x9c/0xcc
> > >    dump_stack+0x14/0x2c
> > >    print_circle+0x2d4/0x438
> > >    cb_check_dl+0x6c/0x70
> > >    bfs+0xc0/0x168
> > >    add_dep+0x88/0x11c
> > >    add_wait+0x2d0/0x2dc
> > >    __dept_wait+0x8c/0xa4
> > >    dept_wait+0x6c/0x88
> > >    _raw_spin_lock_irqsave+0xb8/0x1c4
> > >    scan_block+0x3c/0x128
> > >    scan_gray_list+0xc4/0x13c
> > >    kmemleak_scan+0x2d8/0x54c
> > >    kmemleak_scan_thread+0xac/0xd4
> > >    kthread+0xd4/0xe4
> > >    ret_from_fork+0x10/0x20

> > > > ===================================================
> > > > DEPT: Circular dependency has been detected.
> > > > 5.17.0-rc1+ #1 Tainted: G W
> > > > ---------------------------------------------------
> > > > summary
> > > > ---------------------------------------------------
> > > > *** AA DEADLOCK ***
> > > >
> > > > context A
> > > >    [S] __raw_spin_lock_irqsave(&object->lock:0)
> > > >    [W] _raw_spin_lock_nested(&object->lock:0)
> > > >    [E] spin_unlock(&object->lock:0)
> > > >
> > > > [S]: start of the event context
> > > > [W]: the wait blocked
> > > > [E]: the event not reachable
> > > > ---------------------------------------------------
> > > > context A's detail
> > > > ---------------------------------------------------
> > > > context A
> > > >    [S] __raw_spin_lock_irqsave(&object->lock:0)
> > > >    [W] _raw_spin_lock_nested(&object->lock:0)
> > > >    [E] spin_unlock(&object->lock:0)
> > > >
> > > > [S] __raw_spin_lock_irqsave(&object->lock:0):
> > > > [] scan_gray_list+0x84/0x13c
> > > > stacktrace:
> > > >    dept_ecxt_enter+0x88/0xf4
> > > >    _raw_spin_lock_irqsave+0xf0/0x1c4
> > > >    scan_gray_list+0x84/0x13c
> > > >    kmemleak_scan+0x2d8/0x54c
> > > >    kmemleak_scan_thread+0xac/0xd4
> > > >    kthread+0xd4/0xe4
> > > >    ret_from_fork+0x10/0x20
> > > >
> > > > [E] spin_unlock(&object->lock:0):
> > > > [] scan_block+0x60/0x128
> > > > ---------------------------------------------------
> > > > information that might be helpful
> > > > ---------------------------------------------------
> > > > CPU: 1 PID: 38 Comm: kmemleak Tainted: G W 5.17.0-rc1+ #1
> > > > Hardware name: linux,dummy-virt (DT)
> > > > Call trace:
> > > >    dump_backtrace.part.0+0x9c/0xc4
> > > >    show_stack+0x14/0x28
> > > >    dump_stack_lvl+0x9c/0xcc
> > > >    dump_stack+0x14/0x2c
> > > >    print_circle+0x2d4/0x438
> > > >    cb_check_dl+0x44/0x70
> > > >    bfs+0x60/0x168
> > > >    add_dep+0x88/0x11c
> > > >    add_wait+0x2d0/0x2dc
> > > >    __dept_wait+0x8c/0xa4
> > > >    dept_wait+0x6c/0x88
> > > >    _raw_spin_lock_nested+0xa8/0x1b0
> > > >    scan_block+0xb4/0x128
> > > >    scan_gray_list+0xc4/0x13c
> > > >    kmemleak_scan+0x2d8/0x54c
> > > >    kmemleak_scan_thread+0xac/0xd4
> > > >    kthread+0xd4/0xe4
> > > >    ret_from_fork+0x10/0x20

> > [...]
> >
> > --
> > Thank you, You are awesome!
> > Hyeonggon :-)
>
> --
> Thank you, You are awesome!
> Hyeonggon :-)