Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S935036AbbEPBs0 (ORCPT ); Fri, 15 May 2015 21:48:26 -0400 Received: from zeniv.linux.org.uk ([195.92.253.2]:58727 "EHLO ZenIV.linux.org.uk" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S934523AbbEPBsY (ORCPT ); Fri, 15 May 2015 21:48:24 -0400 Date: Sat, 16 May 2015 02:47:18 +0100 From: Al Viro To: NeilBrown Cc: Linus Torvalds , Andreas Dilger , Dave Chinner , Linux Kernel Mailing List , linux-fsdevel , Christoph Hellwig Subject: Re: [RFC][PATCHSET v3] non-recursive pathname resolution & RCU symlinks Message-ID: <20150516014718.GO7232@ZenIV.linux.org.uk> References: <20150511180650.GA4147@ZenIV.linux.org.uk> <20150513222533.GA24192@ZenIV.linux.org.uk> <20150514033040.GF7232@ZenIV.linux.org.uk> <20150514112304.GT15721@dastard> <20150516093022.51e1464e@notabene.brown> <20150516112503.2f970573@notabene.brown> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20150516112503.2f970573@notabene.brown> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1667 Lines: 30 On Sat, May 16, 2015 at 11:25:03AM +1000, NeilBrown wrote: > But surely those things can be managed with a spinlock. > > I think a big part of the problem is that the VFS tries to control > filesystems rather than provide services to them. What with being the thing syscalls talk to for sending the requests to filesystems... Do you really want to push the pathname resolution into fs code? You've looked at it lately, right? > I'm not convinced that serialising 'lookup' calls is vital. If two threads > find a 'not-validated' dentry, and both try to look up the inode, they > will both ultimately get the same struct_inode from the icache, and will both > succeed in connecting it to the dentry. Obviously it would be better to > avoid two concurrent NFS "LOOKUP" requests, but that is a problem for NFS to > solve. I suspect that using d_fsdata to point to a pending LOOKUP request > would allow the "second" thread to wait for that request to finish. Other > filesystems would take a completely different approach. See upthread regarding multiple negative dentries with the same name and fun consequences thereof. There might be _NO_ inode. At all. dcache has a large negative component and without it you'd get really fucked on NFS as soon as you try to compile anything. Shitloads of headers, looked up in a lot of directories. Most of the lookups ending up negative. We really do need that stuff... -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/