Return-Path: Received: from mail-qk0-f178.google.com ([209.85.220.178]:36629 "EHLO mail-qk0-f178.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753807AbdEZQt3 (ORCPT ); Fri, 26 May 2017 12:49:29 -0400 Received: by mail-qk0-f178.google.com with SMTP id u75so11658808qka.3 for ; Fri, 26 May 2017 09:49:29 -0700 (PDT) Message-ID: <1495817366.4299.3.camel@redhat.com> Subject: Re: [PATC_H] locks: Set fl_nspid at file_lock allocation From: Jeff Layton To: Benjamin Coddington Cc: Alexander Viro , bfields@fieldses.org, linux-fsdevel@vger.kernel.org, linux-nfs@vger.kernel.org Date: Fri, 26 May 2017 12:49:26 -0400 In-Reply-To: References: <896e8ca302614f71f3030015ebf3befe2b40d3c4.1495122972.git.bcodding@redhat.com> <1495126525.3956.10.camel@poochiereds.net> <1069ECDA-D96D-495E-BB7B-128A926AD3DF@redhat.com> <1495140075.3956.13.camel@poochiereds.net> <6ED42F9C-F290-451C-89A9-4A7C3FFDEFA6@redhat.com> Content-Type: text/plain; charset="UTF-8" Mime-Version: 1.0 Sender: linux-nfs-owner@vger.kernel.org List-ID: On Fri, 2017-05-26 at 11:22 -0400, Benjamin Coddington wrote: > On 19 May 2017, at 8:35, Benjamin Coddington wrote: > > > On 18 May 2017, at 16:41, Jeff Layton wrote: > > > > > On Thu, 2017-05-18 at 13:36 -0400, Benjamin Coddington wrote: > > > > > On Thu, 2017-05-18 at 12:02 -0400, Benjamin Coddington wrote: > > > > > > diff --git a/fs/locks.c b/fs/locks.c > > > > > > index af2031a1fcff..959b3f93f4bd 100644 > > > > > > --- a/fs/locks.c > > > > > > +++ b/fs/locks.c > > > > > > @@ -249,7 +249,7 @@ locks_dump_ctx_list(struct list_head *list, char > > > > > > *list_type) > > > > > > struct file_lock *fl; > > > > > > > > > > > > list_for_each_entry(fl, list, fl_list) { > > > > > > - pr_warn("%s: fl_owner=%p fl_flags=0x%x fl_type=0x%x fl_pid=%u\n", > > > > > > list_type, fl->fl_owner, fl->fl_flags, fl->fl_type, fl->fl_pid); > > > > > > + pr_warn("%s: fl_owner=%p fl_flags=0x%x fl_type=0x%x fl_pid=%u\n", > > > > > > list_type, fl->fl_owner, fl->fl_flags, fl->fl_type, > > > > > > pid_vnr(fl->fl_nspid)); > > > > > > } > > > > > > } > > > > > > > > > > > > > > > > Probably should change the format to say fl_nspid=%u here, just to be > > > > > clear. Might also want to keep fl_pid in there since the lock manager > > > > > could set it. > > > > > > > > Yes, I thought about just spitting out both. Let's do that. > > > > > > > > > > @@ -2074,7 +2075,7 @@ static int posix_lock_to_flock(struct flock > > > > > > *flock, struct file_lock *fl) > > > > > > #if BITS_PER_LONG == 32 > > > > > > static void posix_lock_to_flock64(struct flock64 *flock, struct > > > > > > file_lock *fl) > > > > > > { > > > > > > - flock->l_pid = IS_OFDLCK(fl) ? -1 : fl->fl_pid; > > > > > > + flock->l_pid = IS_OFDLCK(fl) ? -1 : pid_vnr(fl->fl_nspid); > > > > > > > > > > What about the lock managers that _do_ set fl->fl_pid. With nfsv2/3, > > > > > this is always going to give you back the pid of lockd, AFAICT. > > > > > > > > But isn't this really what you want? If a local process wants to know > > > > who > > > > holds a conflicting lock, the fl_pid of a remote system is really pretty > > > > useless. Not only that, but there's no way for the local process to > > > > know > > > > when the pid is local or remote. Better to be consistent and always > > > > return > > > > something that's useful. > > > > > > > > > > The l_pid field in struct flock (and by extension fl_pid) is pretty > > > poorly defined in the spec(s), especially when there is a remote host > > > involved. Programs that rely on it are insane, of course...but Linux has > > > always behaved this way. > > > > But if it is completely useless today, then we can change it without > > worrying about breaking it because it is already broken. There's no > > documentation anywhere that informs users of F_GETLK or /proc/locks that > > l_pid is completely unreliable. > > > > Do you know why linux hasn't picked up l_sysid? I can't seem to find any > > previous discussion about it. > > > > > In the absence of a compelling reason to change it, I think we should > > > keep the behavior in this respect as close as possible to what we have > > > now. > > > > I think the reason would be l_pid should at least have some consistent > > meaning. I think now that this patch probably shouldn't change it, but it > > should be changed. > > > > > > > Do we want to present the pid value that the client sent here instead > > > > > in > > > > > that case? Maybe the lm could set a fl_flag to indicate that the pid > > > > > should be taken directly from fl_pid here? Then you could move the > > > > > above > > > > > logic to a static inline or something. > > > > > > > > > > Alternately, you could add a new lm_present_pid operation to lock > > > > > managers to format the pid as they see fit. > > > > > > > > Either works to solve the problem, but I still think that F_GETLK and > > > > /proc/locks should only return local pids. > > It turns out that the lm_present_pid approach is not sufficient and we > should instead use a flag, since the non-lock-manager fs/lockd/clntproc.c > wants to use fl->fl_pid = 0 in the case where the client is testing and > finds a conflicting lock. > > So, the client considers remote pids to be bogus, which makes a lot of sense > to me. > Yeah, not much it can do with a pid, really... > Additionally, after testing, today's kernel returns lockd's pid on a local > F_GETLCK for a remotely-held NFS lock. So, I think our understanding of the > situation needs to be reversed. Lock manager's locks are locally reporting > the local lock pid, but sometimes a remote lock needs to override the local > pid to set fl_pid. > Fair enough. Now that I look...v4 locks set by knfsd just pick up the pid of whatever the nfsd thread it happens to be running in. From nfsd4_lock: file_lock->fl_pid = current->tgid; So, it sounds like it really is totally meaningless then. In that case I'll reverse my earlier opinion, and say that if it's easier to just set it to whatever lockd's pid is, then that'd be fine with me. OTOH, pid_t is an int, and I don't think negative pids are valid (are they?) Maybe we should set it to -1 for a remote lock (like we do for OFD locks). Or, could consider declaring a new value (-2?) to represent a remote lock? -- Jeff Layton