Received: by 2002:ab2:788f:0:b0:1ee:8f2e:70ae with SMTP id b15csp433389lqi; Thu, 7 Mar 2024 01:21:37 -0800 (PST) X-Forwarded-Encrypted: i=3; AJvYcCXu+1Jx1ImgJ/leqY+tkCrh5lgTcnR32CSzlsPrQ/WUD93YyOtzH3zyfxOd7x7izSDwKdFKuj49Zf0OkoQShs90OOzFyR9xrmIl/0GIFA== X-Google-Smtp-Source: AGHT+IGbVhnPni4PzckRKSeII4U15ewCJEHkxYTeN2pEadfjp7QFEVEKhkTJh0ZHPxtCqpax3zuv X-Received: by 2002:a05:6a20:12ca:b0:1a1:15ff:43b with SMTP id v10-20020a056a2012ca00b001a115ff043bmr7385408pzg.23.1709803297471; Thu, 07 Mar 2024 01:21:37 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1709803297; cv=pass; d=google.com; s=arc-20160816; b=nrc71rc2tO9Ww0BLoLZOpd4uVuizm86XGGiQSZUPW9Tpbox8KtdAebj7tkwQ/enAOZ 6pkIkEpZWr4aD5swZl0BnWkA915v0w3sp1BHsuN9AmehbDD76BmX7Ox0kQ9Ceb/7DLd0 D47lZ7i1NF0+c69u7W6RMlwI+vINMURp+8HKbmmMz/LcuRU1mYRgf7xXSIM8kfJ1CL5H SBtienJMxtKUNG0zJEB01WF7QHhmrTOgq6PrbhTLKh2j/BzFMko9JKdBwXCBM2NohakE oxZgvfxXb9RmqMEpq8+aMcpt9UBJDr41mDU8FCDNfiRoz3W/8YwmSdbTyvU9nE0wRAVE 3xDw== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:in-reply-to:from:references:cc:to:subject :user-agent:mime-version:list-unsubscribe:list-subscribe:list-id :precedence:date:message-id:dkim-signature; bh=4QBG2QheUKTY1OnbPSDP0o85jjGq1EgwtF8LIisWhmc=; fh=kVQ4P1r61KDnj250Ai303siGGVd4+TAGdV2oiRXVqKg=; b=pKbVb1xNRa9/hxk6kJok77BxUVQU+Lp2+p7QiigpJMiL6rS4g+blltcZRWPyRbyb2D mxdnNrZKCjS7kfFQPx8FnJTC0B/OHZAWUYQtsi9lf/l0W+dpl/9gazrS6FElaXdTUyfL Lm/nNvqAEi3MuD0sdd7GASKN2wBCzQgd2sMTLmmAHi0cmymJDpxffT9wFttOXZiEDJk5 UaA7UeLWgOPNiJpp14rtwUV4j5tY7l+3WfPwmmXs7cl1OQAllc/gF6zxXRcO8BRjmx70 zSYU7JYTrpe4hOGcx7nCsAhQeaxqzMIrh9G9EAWadBEasByJpynVJFD0a1J1hZpewVUb 0qIQ==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@linux.alibaba.com header.s=default header.b=jOWC3ehH; arc=pass (i=1 spf=pass spfdomain=linux.alibaba.com dkim=pass dkdomain=linux.alibaba.com dmarc=pass fromdomain=linux.alibaba.com); spf=pass (google.com: domain of linux-kernel+bounces-95206-linux.lists.archive=gmail.com@vger.kernel.org designates 139.178.88.99 as permitted sender) smtp.mailfrom="linux-kernel+bounces-95206-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linux.alibaba.com Return-Path: Received: from sv.mirrors.kernel.org (sv.mirrors.kernel.org. [139.178.88.99]) by mx.google.com with ESMTPS id bw33-20020a056a0204a100b005dc9404f489si1161579pgb.338.2024.03.07.01.21.37 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 07 Mar 2024 01:21:37 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel+bounces-95206-linux.lists.archive=gmail.com@vger.kernel.org designates 139.178.88.99 as permitted sender) client-ip=139.178.88.99; Authentication-Results: mx.google.com; dkim=pass header.i=@linux.alibaba.com header.s=default header.b=jOWC3ehH; arc=pass (i=1 spf=pass spfdomain=linux.alibaba.com dkim=pass dkdomain=linux.alibaba.com dmarc=pass fromdomain=linux.alibaba.com); spf=pass (google.com: domain of linux-kernel+bounces-95206-linux.lists.archive=gmail.com@vger.kernel.org designates 139.178.88.99 as permitted sender) smtp.mailfrom="linux-kernel+bounces-95206-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linux.alibaba.com Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by sv.mirrors.kernel.org (Postfix) with ESMTPS id 8D1C128412D for ; Thu, 7 Mar 2024 09:20:50 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id DAFCF83CAF; Thu, 7 Mar 2024 09:20:14 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (1024-bit key) header.d=linux.alibaba.com header.i=@linux.alibaba.com header.b="jOWC3ehH" Received: from out30-101.freemail.mail.aliyun.com (out30-101.freemail.mail.aliyun.com [115.124.30.101]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id DD350823C4 for ; Thu, 7 Mar 2024 09:20:10 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=115.124.30.101 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1709803213; cv=none; b=FJKEUYJmAz/EZrGEHROfYI5uVscEtLUDwYwEaAkZOcrvPWQYUwKJgi1FiYGpnItknaR2o1ut+y+UCT3Ymuvk7QcblkCylxGXq6dPEjE1vDuKF5XTuk76ufuDXl9KIvD71SHYWseo+ScGmqdnDpSaRrJW6iaXYar8pW7ypZU9dcU= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1709803213; c=relaxed/simple; bh=YT/FbZZyceAX2Mb3Zbme+x0madJmou1w+qr8J8YUkXs=; h=Message-ID:Date:MIME-Version:Subject:To:Cc:References:From: In-Reply-To:Content-Type; b=Kl9leYnXlq7j3wHZWGYUB31jd4PzBkdWtYsyA3NxvM8tNnlxN/VXz6hsoxY2qUuSaor3K67j8MC35Rz6o3WBX9/k6k+PUr6exCIyR5bI/NTFl7bWA+CcRnwWF4o88AiJGD+PQOYwgZV1U6v25viHdaEl2fBi9mW33MtlPddVS6M= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=linux.alibaba.com; spf=pass smtp.mailfrom=linux.alibaba.com; dkim=pass (1024-bit key) header.d=linux.alibaba.com header.i=@linux.alibaba.com header.b=jOWC3ehH; arc=none smtp.client-ip=115.124.30.101 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=linux.alibaba.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=linux.alibaba.com DKIM-Signature:v=1; a=rsa-sha256; c=relaxed/relaxed; d=linux.alibaba.com; s=default; t=1709803208; h=Message-ID:Date:MIME-Version:Subject:To:From:Content-Type; bh=4QBG2QheUKTY1OnbPSDP0o85jjGq1EgwtF8LIisWhmc=; b=jOWC3ehHqZBqbsl4W/bne15gfu/A7M+oN6I81ScxnJZygUKMNow+S1yDE12cuh+LKFEGyvO9S2a8yu/1gKv8rD1MSAPLNDec0RsZDGgdpCqoHhDIEBGPbPBWp3HENZ13vSWBSsm2edSddzBiDk+AHTN7jnPVwusdG+UhyxEF6W8= X-Alimail-AntiSpam:AC=PASS;BC=-1|-1;BR=01201311R411e4;CH=green;DM=||false|;DS=||;FP=0|-1|-1|-1|0|-1|-1|-1;HT=ay29a033018046049;MF=hsiangkao@linux.alibaba.com;NM=1;PH=DS;RN=13;SR=0;TI=SMTPD_---0W2-5tED_1709803206; Received: from 30.97.48.224(mailfrom:hsiangkao@linux.alibaba.com fp:SMTPD_---0W2-5tED_1709803206) by smtp.aliyun-inc.com; Thu, 07 Mar 2024 17:20:07 +0800 Message-ID: <50205009-07e6-4e7e-9ac5-e6d04e12e62d@linux.alibaba.com> Date: Thu, 7 Mar 2024 17:20:06 +0800 Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH] erofs: fix lockdep false positives on initializing erofs_pseudo_mnt To: Christian Brauner Cc: Jingbo Xu , Baokun Li , linux-erofs@lists.ozlabs.org, xiang@kernel.org, chao@kernel.org, huyue2@coolpad.com, linux-kernel@vger.kernel.org, yangerkun@huawei.com, houtao1@huawei.com, yukuai3@huawei.com, chengzhihao1@huawei.com, Al Viro References: <20240307024459.883044-1-libaokun1@huawei.com> <7e262242-d90d-4f61-a217-f156219eaa4d@linux.alibaba.com> <38934cc4-58da-47b4-a120-00a2f3a56836@linux.alibaba.com> <20240307-segmentieren-sitzkissen-5086f5e1f99f@brauner> From: Gao Xiang In-Reply-To: <20240307-segmentieren-sitzkissen-5086f5e1f99f@brauner> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit Hi Christian, On 2024/3/7 17:17, Christian Brauner wrote: > On Thu, Mar 07, 2024 at 12:18:52PM +0800, Gao Xiang wrote: >> Hi, >> >> (try to +Cc Christian and Al here...) >> >> On 2024/3/7 11:41, Jingbo Xu wrote: >>> Hi Baokun, >>> >>> Thanks for catching this! >>> >>> >>> On 3/7/24 10:52 AM, Gao Xiang wrote: >>>> Hi Baokun, >>>> >>>> On 2024/3/7 10:44, Baokun Li wrote: >>>>> Lockdep reported the following issue when mounting erofs with a >>>>> domain_id: >>>>> >>>>> ============================================ >>>>> WARNING: possible recursive locking detected >>>>> 6.8.0-rc7-xfstests #521 Not tainted >>>>> -------------------------------------------- >>>>> mount/396 is trying to acquire lock: >>>>> ffff907a8aaaa0e0 (&type->s_umount_key#50/1){+.+.}-{3:3}, >>>>>                         at: alloc_super+0xe3/0x3d0 >>>>> >>>>> but task is already holding lock: >>>>> ffff907a8aaa90e0 (&type->s_umount_key#50/1){+.+.}-{3:3}, >>>>>                         at: alloc_super+0xe3/0x3d0 >>>>> >>>>> other info that might help us debug this: >>>>>   Possible unsafe locking scenario: >>>>> >>>>>         CPU0 >>>>>         ---- >>>>>    lock(&type->s_umount_key#50/1); >>>>>    lock(&type->s_umount_key#50/1); >>>>> >>>>>   *** DEADLOCK *** >>>>> >>>>>   May be due to missing lock nesting notation >>>>> >>>>> 2 locks held by mount/396: >>>>>   #0: ffff907a8aaa90e0 (&type->s_umount_key#50/1){+.+.}-{3:3}, >>>>>             at: alloc_super+0xe3/0x3d0 >>>>>   #1: ffffffffc00e6f28 (erofs_domain_list_lock){+.+.}-{3:3}, >>>>>             at: erofs_fscache_register_fs+0x3d/0x270 [erofs] >>>>> >>>>> stack backtrace: >>>>> CPU: 1 PID: 396 Comm: mount Not tainted 6.8.0-rc7-xfstests #521 >>>>> Call Trace: >>>>>   >>>>>   dump_stack_lvl+0x64/0xb0 >>>>>   validate_chain+0x5c4/0xa00 >>>>>   __lock_acquire+0x6a9/0xd50 >>>>>   lock_acquire+0xcd/0x2b0 >>>>>   down_write_nested+0x45/0xd0 >>>>>   alloc_super+0xe3/0x3d0 >>>>>   sget_fc+0x62/0x2f0 >>>>>   vfs_get_super+0x21/0x90 >>>>>   vfs_get_tree+0x2c/0xf0 >>>>>   fc_mount+0x12/0x40 >>>>>   vfs_kern_mount.part.0+0x75/0x90 >>>>>   kern_mount+0x24/0x40 >>>>>   erofs_fscache_register_fs+0x1ef/0x270 [erofs] >>>>>   erofs_fc_fill_super+0x213/0x380 [erofs] >>>>> >>>>> This is because the file_system_type of both erofs and the pseudo-mount >>>>> point of domain_id is erofs_fs_type, so two successive calls to >>>>> alloc_super() are considered to be using the same lock and trigger the >>>>> warning above. >>>>> >>>>> Therefore add a nodev file_system_type named erofs_anon_fs_type to >>>>> silence this complaint. In addition, to reduce code coupling, refactor >>>>> out the erofs_anon_init_fs_context() and erofs_kill_pseudo_sb() functions >>>>> and move the erofs_pseudo_mnt related code to fscache.c. >>>>> >>>>> Signed-off-by: Baokun Li >>>> >>>> IMHO, in the beginning, I'd like to avoid introducing another fs type >>>> for erofs to share (meta)data between filesystems since it will cause >>>> churn, could we use some alternative way to resolve this? >>> >>> Yeah as Gao Xiang said, this is initially intended to avoid introducing >>> anothoer file_system_type, say erofs_anon_fs_type. >>> >>> What we need is actually a method of allocating anonymous inode as a >>> sentinel identifying each blob. There is indeed a global mount, i.e. >>> anon_inode_mnt, for allocating anonymous inode/file specifically. At >>> the time the share domain feature is introduced, there's only one >>> anonymous inode, i.e. anon_inode_inode, and all the allocated anonymous >>> files are bound to this single anon_inode_inode. Thus we decided to >>> implement a erofs internal pseudo mount for this usage. >>> >>> But I noticed that we can now allocate unique anonymous inodes from >>> anon_inode_mnt since commit e7e832c ("fs: add LSM-supporting anon-inode >>> interface"), though the new interface is initially for LSM usage. >> >> Yes, as summary, EROFS now maintains a bunch of anon inodes among >> all different filesystem instances, so that like >> >> blob sharing or >> page cache sharing across filesystems can be done. >> >> In brief, I think the following patch is a good idea but it >> hasn't been landed until now: >> https://lore.kernel.org/r/20210309155348.974875-3-hch@lst.de >> >> Other than that, is it a good idea to introduce another fs type >> (like erofs_anon_fs_type) for such usage? > > It depends. If you're allocating a lot of inodes then having a separate > filesystem type for erofs makes sense. If it's just a few then it > probably doesn't matter. If you need custom inode operations for these > anonymous inodes then it also makes sense to have a separate filesystem > type. Yeah, I think some time this year we will finish a formal page cache sharing design and implementation for both bdev and fscache mode. So a separate filesystem type seems more reasonable in the future, thanks for your confirmation! Thanks, Gao Xiang