2014-07-18 15:13:43

by Jeff Layton

Subject: [PATCH v4 00/10] nfsd: more delegation fixes to prepare for client_mutex removal

v4:
- close more potential races in setlease code, and fix some bugs in
error handling in that code.

- clean up delegation setting functions, eliminating unused arguments
and avoiding allocations when there has already been a delegation
break

- add separate spinlock for block_delegations/delegation_blocked code

v3:
- use alternate method for checking for delegation break races after
getting a lease (just check fi_had_conflict instead)

- drop file_has_lease patch -- no longer needed

- move cl_revoked handling patch into this set. It means altering a
few of the later patches, but it keeps the set more topically
coherent

v2:
- move remove_stid call out of nfs4_free_stid and into the callers

Yet another respin of the delegation rework to prepare for client_mutex
removal. This fixes all of the error handling bugs I could find and
should fix up the potential races between setting a delegation and
having it broken.

I also added some cleanup of the delegation setting code itself, moving
the allocation into the nfs4_set_delegation call, so that we can avoid
allocating one when we know there has already been a conflict.

Jeff Layton (7):
nfsd: Protect the nfs4_file delegation fields using the fi_lock
nfsd: Fix delegation revocation
nfsd: ensure that clp->cl_revoked list is protected by clp->cl_lock
nfsd: drop unused stp arg to alloc_init_deleg
nfsd: clean up arguments to nfs4_open_delegation
nfsd: clean up nfs4_set_delegation
nfsd: give block_delegation and delegation_blocked its own spinlock

Trond Myklebust (3):
nfsd: Move the delegation reference counter into the struct nfs4_stid
nfsd: simplify stateid allocation and file handling
nfsd: Convert delegation counter to an atomic_long_t type

fs/nfsd/nfs4state.c | 250 +++++++++++++++++++++++++++++++++-------------------
fs/nfsd/state.h | 2 +-
2 files changed, 161 insertions(+), 91 deletions(-)

--
1.9.3



2014-07-18 17:50:02

by J. Bruce Fields

Subject: Re: [PATCH v4 01/10] nfsd: Protect the nfs4_file delegation fields using the fi_lock

On Fri, Jul 18, 2014 at 01:31:40PM -0400, Jeff Layton wrote:
> On Fri, 18 Jul 2014 12:28:25 -0400
> "J. Bruce Fields" <[email protected]> wrote:
>
> > On Fri, Jul 18, 2014 at 11:13:27AM -0400, Jeff Layton wrote:
> > > Move more of the delegation fields to be protected by the fi_lock. It's
> > > more granular than the state_lock and in later patches we'll want to
> > > be able to rely on it in addition to the state_lock.
> > >
> > > Also, the current code in nfs4_setlease calls vfs_setlease and uses the
> > > client_mutex to ensure that it doesn't disappear before we can hash the
> > > delegation. With the client_mutex gone, we'll have a potential race
> > > condition.
> > >
> > > It's possible that the delegation could be recalled after we acquire the
> > > lease but before we ever get around to hashing it. If that happens, then
> > > we'd have a nfs4_file that *thinks* it has a delegation, when it
> > > actually has none.
> >
> > I understand now, thanks: so the lease break code walks the list of
> > delegations associated with the file, finds none, and issues no recall,
> > but the open code continues merrily on and returns a delegation, with
> > the result that we return the client a delegation that will never be
> > recalled.
> >
> > That could be worded more carefully, and would be worth a separate patch
> > (since the bug predates the new locking).
> >
>
> Yes, that's basically correct. I'd have to think about how to fix that
> with the current code. It's probably doable if you think it's
> worthwhile, but I'll need to rebase this set on top of it.

Well, I was wondering if this patch could just be split in two, no need
to backport further than that.

> > > Attempt to acquire a delegation. If that succeeds, take the spinlocks
> > > and then check to see if the file has had a conflict show up since then.
> > > If it has, then we assume that the lease is no longer valid and that
> > > we shouldn't hand out a delegation.
> > >
> > > There's also one more potential (but very unlikely) problem. If the
> > > lease is broken before the delegation is hashed, then it could leak.
> > > In the event that the fi_delegations list is empty, reset the
> > > fl_break_time to jiffies so that it's cleaned up ASAP by
> > > the normal lease handling code.
> >
> > Is there actually any guarantee time_out_leases() will get called on
> > this inode again?
> >
> > --b.
> >
>
> Yes. Lease breaks are handled in two phases. We walk the i_flock list
> and issue a ->lm_break on each lease, and then later we walk the list
> again after putting the task to sleep, and try to time out the leases.
> So by doing this, we should ensure that the task will wake up after
> sleeping and delete the lease.

In the case of an interrupt or a nonblocking break (which is what nfsd
will do), time_out_leases isn't called again from what I could tell.

--b.

>
> > >
> > > Signed-off-by: Trond Myklebust <[email protected]>
> > > Signed-off-by: Jeff Layton <[email protected]>
> > > ---
> > > fs/nfsd/nfs4state.c | 90 +++++++++++++++++++++++++++++++++++++++--------------
> > > 1 file changed, 66 insertions(+), 24 deletions(-)
> > >
> > > diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
> > > index fd4deb049ddf..9ab067e85b51 100644
> > > --- a/fs/nfsd/nfs4state.c
> > > +++ b/fs/nfsd/nfs4state.c
> > > @@ -624,6 +624,8 @@ nfs4_put_delegation(struct nfs4_delegation *dp)
> > >
> > > static void nfs4_put_deleg_lease(struct nfs4_file *fp)
> > > {
> > > + lockdep_assert_held(&state_lock);
> > > +
> > > if (!fp->fi_lease)
> > > return;
> > > if (atomic_dec_and_test(&fp->fi_delegees)) {
> > > @@ -643,11 +645,10 @@ static void
> > > hash_delegation_locked(struct nfs4_delegation *dp, struct nfs4_file *fp)
> > > {
> > > lockdep_assert_held(&state_lock);
> > > + lockdep_assert_held(&fp->fi_lock);
> > >
> > > dp->dl_stid.sc_type = NFS4_DELEG_STID;
> > > - spin_lock(&fp->fi_lock);
> > > list_add(&dp->dl_perfile, &fp->fi_delegations);
> > > - spin_unlock(&fp->fi_lock);
> > > list_add(&dp->dl_perclnt, &dp->dl_stid.sc_client->cl_delegations);
> > > }
> > >
> > > @@ -659,17 +660,18 @@ unhash_delegation(struct nfs4_delegation *dp)
> > >
> > > spin_lock(&state_lock);
> > > dp->dl_stid.sc_type = NFS4_CLOSED_DELEG_STID;
> > > + spin_lock(&fp->fi_lock);
> > > list_del_init(&dp->dl_perclnt);
> > > list_del_init(&dp->dl_recall_lru);
> > > - spin_lock(&fp->fi_lock);
> > > list_del_init(&dp->dl_perfile);
> > > spin_unlock(&fp->fi_lock);
> > > - spin_unlock(&state_lock);
> > > if (fp) {
> > > nfs4_put_deleg_lease(fp);
> > > - put_nfs4_file(fp);
> > > dp->dl_file = NULL;
> > > }
> > > + spin_unlock(&state_lock);
> > > + if (fp)
> > > + put_nfs4_file(fp);
> > > }
> > >
> > > static void destroy_revoked_delegation(struct nfs4_delegation *dp)
> > > @@ -3143,10 +3145,19 @@ static void nfsd_break_deleg_cb(struct file_lock *fl)
> > > */
> > > fl->fl_break_time = 0;
> > >
> > > - fp->fi_had_conflict = true;
> > > spin_lock(&fp->fi_lock);
> > > - list_for_each_entry(dp, &fp->fi_delegations, dl_perfile)
> > > - nfsd_break_one_deleg(dp);
> > > + fp->fi_had_conflict = true;
> > > + /*
> > > + * If there are no delegations on the list, then we can't count on this
> > > + * lease ever being cleaned up. Set the fl_break_time to jiffies so that
> > > + * time_out_leases will do it ASAP. The fact that fi_had_conflict is now
> > > + * true should keep any new delegations from being hashed.
> > > + */
> > > + if (list_empty(&fp->fi_delegations))
> > > + fl->fl_break_time = jiffies;
> > > + else
> > > + list_for_each_entry(dp, &fp->fi_delegations, dl_perfile)
> > > + nfsd_break_one_deleg(dp);
> > > spin_unlock(&fp->fi_lock);
> > > }
> > >
> > > @@ -3493,46 +3504,77 @@ static int nfs4_setlease(struct nfs4_delegation *dp)
> > > {
> > > struct nfs4_file *fp = dp->dl_file;
> > > struct file_lock *fl;
> > > - int status;
> > > + struct file *filp;
> > > + int status = 0;
> > >
> > > fl = nfs4_alloc_init_lease(fp, NFS4_OPEN_DELEGATE_READ);
> > > if (!fl)
> > > return -ENOMEM;
> > > - fl->fl_file = find_readable_file(fp);
> > > - status = vfs_setlease(fl->fl_file, fl->fl_type, &fl);
> > > - if (status)
> > > - goto out_free;
> > > + filp = find_readable_file(fp);
> > > + if (!filp) {
> > > + /* We should always have a readable file here */
> > > + WARN_ON_ONCE(1);
> > > + return -EBADF;
> > > + }
> > > + status = vfs_setlease(filp, fl->fl_type, &fl);
> > > + if (status) {
> > > + locks_free_lock(fl);
> > > + goto out_fput;
> > > + }
> > > + spin_lock(&state_lock);
> > > + spin_lock(&fp->fi_lock);
> > > + /* Did the lease get broken before we took the lock? */
> > > + status = -EAGAIN;
> > > + if (fp->fi_had_conflict)
> > > + goto out_unlock;
> > > + /* Race breaker */
> > > + if (fp->fi_lease) {
> > > + status = 0;
> > > + atomic_inc(&fp->fi_delegees);
> > > + hash_delegation_locked(dp, fp);
> > > + goto out_unlock;
> > > + }
> > > fp->fi_lease = fl;
> > > - fp->fi_deleg_file = fl->fl_file;
> > > + fp->fi_deleg_file = filp;
> > > atomic_set(&fp->fi_delegees, 1);
> > > - spin_lock(&state_lock);
> > > hash_delegation_locked(dp, fp);
> > > + spin_unlock(&fp->fi_lock);
> > > spin_unlock(&state_lock);
> > > return 0;
> > > -out_free:
> > > - if (fl->fl_file)
> > > - fput(fl->fl_file);
> > > - locks_free_lock(fl);
> > > +out_unlock:
> > > + spin_unlock(&fp->fi_lock);
> > > + spin_unlock(&state_lock);
> > > +out_fput:
> > > + if (filp)
> > > + fput(filp);
> > > return status;
> > > }
> > >
> > > static int nfs4_set_delegation(struct nfs4_delegation *dp, struct nfs4_file *fp)
> > > {
> > > + int status = 0;
> > > +
> > > if (fp->fi_had_conflict)
> > > return -EAGAIN;
> > > get_nfs4_file(fp);
> > > + spin_lock(&state_lock);
> > > + spin_lock(&fp->fi_lock);
> > > dp->dl_file = fp;
> > > - if (!fp->fi_lease)
> > > + if (!fp->fi_lease) {
> > > + spin_unlock(&fp->fi_lock);
> > > + spin_unlock(&state_lock);
> > > return nfs4_setlease(dp);
> > > - spin_lock(&state_lock);
> > > + }
> > > atomic_inc(&fp->fi_delegees);
> > > if (fp->fi_had_conflict) {
> > > - spin_unlock(&state_lock);
> > > - return -EAGAIN;
> > > + status = -EAGAIN;
> > > + goto out_unlock;
> > > }
> > > hash_delegation_locked(dp, fp);
> > > +out_unlock:
> > > + spin_unlock(&fp->fi_lock);
> > > spin_unlock(&state_lock);
> > > - return 0;
> > > + return status;
> > > }
> > >
> > > static void nfsd4_open_deleg_none_ext(struct nfsd4_open *open, int status)
> > > --
> > > 1.9.3
> > >
>
>
> --
> Jeff Layton <[email protected]>

2014-07-21 22:50:45

by NeilBrown

Subject: Re: [PATCH v4 10/10] nfsd: give block_delegation and delegation_blocked its own spinlock

On Mon, 21 Jul 2014 17:17:57 -0400 "J. Bruce Fields" <[email protected]>
wrote:

> On Tue, Jul 22, 2014 at 06:40:49AM +1000, NeilBrown wrote:
> > On Mon, 21 Jul 2014 07:44:12 -0400 Jeff Layton <[email protected]>
> > wrote:
> >
> > > On Mon, 21 Jul 2014 17:02:54 +1000
> > > NeilBrown <[email protected]> wrote:
> >
> > > > > hash = arch_fast_hash(&fh->fh_base, fh->fh_size, 0);
> > > > >
> > > > > __set_bit(hash&255, bd->set[bd->new]);
> > > > > __set_bit((hash>>8)&255, bd->set[bd->new]);
> > > > > __set_bit((hash>>16)&255, bd->set[bd->new]);
> > > > > + spin_lock(&blocked_delegations_lock);
> > > >
> > > > __set_bit isn't atomic. The spin_lock should be taken *before* these
> > > > __set_bit() calls.
> > > >
> > > > Otherwise, looks fine.
> > > >
> > > > Thanks,
> > > > NeilBrown
> > > >
> > > >
> > >
> > > Ok. I guess the worry is that we could end up setting bits in the
> > > middle of swapping the two fields? Makes sense -- fixed in my repo.
> >
> > It is more subtle than that.
> > __set_bit() will:
> > read a value from memory to a register
> > set a bit in the register
> > write the register back out to memory
> >
> > If two threads both run __set_bit on the same word of memory at the same
> > time, one of the updates can get lost.
> > set_bit() (no underscore) performs an atomic RMW to avoid this, but is more
> > expensive.
> > spin_lock() obviously ensures the required exclusion and as we are going to
> > take the lock anyway we may as well take it before setting bits so we can use
> > the non-atomic (cheaper) __set_bit function.
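
A concrete sketch of the fix being described here, with the lock taken
before the bit operations so the cheaper non-atomic __set_bit() stays
safe (illustrative, per Jeff's note below that this is fixed in his
repo; the counter update shares the same critical section):

	spin_lock(&blocked_delegations_lock);
	__set_bit(hash&255, bd->set[bd->new]);
	__set_bit((hash>>8)&255, bd->set[bd->new]);
	__set_bit((hash>>16)&255, bd->set[bd->new]);
	if (bd->entries == 0)
		bd->swap_time = seconds_since_boot();
	bd->entries += 1;
	spin_unlock(&blocked_delegations_lock);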
> >
> > > I'll send out the updated set later today (it also includes a few nits
> > > that HCH pointed out last week).
> > >
> > > As a side note...I wonder how much we'll get in the way of false
> > > positives with this scheme?
> >
> > If a future version of NFSv4 could allow delegations to be granted while a
> > file is open (oh, it seems you are the only client using this file at the
> > moment, you can treat this "open" as a delegation if you like) a few false
> > positives would be a complete non-issue.
>
> For what it's worth, I think 4.1 provides what you're asking for here;
> see
>
> http://tools.ietf.org/html/rfc5661#section-20.7
>
> and the discussion of the various WANT_ flags in
>
> http://tools.ietf.org/html/rfc5661#section-18.16.3
>
> As far as I know none of that is implemented yet.
>
> --b.

I guess I should really read the 4.1 (and 4.2) spec some day....
Though the 20.7 section seems to be about saying "resources in general are
available" rather than "this specific file that you wanted a delegation for
but didn't get one is now up for delegation"....
But I only had a quick read so I might have missed something.

Thanks,
NeilBrown



2014-07-18 15:13:56

by Jeff Layton

Subject: [PATCH v4 09/10] nfsd: clean up nfs4_set_delegation

Move the alloc_init_deleg call into nfs4_set_delegation and change the
function to return either a pointer to the delegation or an
ERR_PTR()-encoded error. This allows us to skip allocating a delegation
if the file has already experienced a lease conflict.
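
For readers less familiar with the idiom: ERR_PTR() encodes a negative
errno in the pointer value itself, so a single return value can carry
either a valid delegation or an error. A minimal caller-side sketch
(illustrative, mirroring the hunk below):

	dp = nfs4_set_delegation(clp, fh, stp->st_file);
	if (IS_ERR(dp)) {
		/* PTR_ERR(dp) is -EAGAIN or -ENOMEM; nothing was
		 * allocated or hashed, so just decline the delegation */
		goto out_no_deleg;
	}
	/* dp points to a fully hashed delegation here */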

Signed-off-by: Jeff Layton <[email protected]>
---
fs/nfsd/nfs4state.c | 34 ++++++++++++++++++++++------------
1 file changed, 22 insertions(+), 12 deletions(-)

diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
index 1834d9bd5935..a2c6c85adfc7 100644
--- a/fs/nfsd/nfs4state.c
+++ b/fs/nfsd/nfs4state.c
@@ -3553,12 +3553,20 @@ out_fput:
return status;
}

-static int nfs4_set_delegation(struct nfs4_delegation *dp, struct nfs4_file *fp)
+static struct nfs4_delegation *
+nfs4_set_delegation(struct nfs4_client *clp, struct svc_fh *fh,
+ struct nfs4_file *fp)
{
- int status = 0;
+ int status;
+ struct nfs4_delegation *dp;

if (fp->fi_had_conflict)
- return -EAGAIN;
+ return ERR_PTR(-EAGAIN);
+
+ dp = alloc_init_deleg(clp, fh);
+ if (!dp)
+ return ERR_PTR(-ENOMEM);
+
get_nfs4_file(fp);
spin_lock(&state_lock);
spin_lock(&fp->fi_lock);
@@ -3566,7 +3574,8 @@ static int nfs4_set_delegation(struct nfs4_delegation *dp, struct nfs4_file *fp)
if (!fp->fi_lease) {
spin_unlock(&fp->fi_lock);
spin_unlock(&state_lock);
- return nfs4_setlease(dp);
+ status = nfs4_setlease(dp);
+ goto out;
}
atomic_inc(&fp->fi_delegees);
if (fp->fi_had_conflict) {
@@ -3574,10 +3583,16 @@ static int nfs4_set_delegation(struct nfs4_delegation *dp, struct nfs4_file *fp)
goto out_unlock;
}
hash_delegation_locked(dp, fp);
+ status = 0;
out_unlock:
spin_unlock(&fp->fi_lock);
spin_unlock(&state_lock);
- return status;
+out:
+ if (status) {
+ nfs4_put_delegation(dp);
+ return ERR_PTR(status);
+ }
+ return dp;
}

static void nfsd4_open_deleg_none_ext(struct nfsd4_open *open, int status)
@@ -3651,12 +3666,9 @@ nfs4_open_delegation(struct svc_fh *fh, struct nfsd4_open *open,
default:
goto out_no_deleg;
}
- dp = alloc_init_deleg(clp, fh);
- if (dp == NULL)
+ dp = nfs4_set_delegation(clp, fh, stp->st_file);
+ if (IS_ERR(dp))
goto out_no_deleg;
- status = nfs4_set_delegation(dp, stp->st_file);
- if (status)
- goto out_free;

memcpy(&open->op_delegate_stateid, &dp->dl_stid.sc_stateid, sizeof(dp->dl_stid.sc_stateid));

@@ -3664,8 +3676,6 @@ nfs4_open_delegation(struct svc_fh *fh, struct nfsd4_open *open,
STATEID_VAL(&dp->dl_stid.sc_stateid));
open->op_delegate_type = NFS4_OPEN_DELEGATE_READ;
return;
-out_free:
- nfs4_put_delegation(dp);
out_no_deleg:
open->op_delegate_type = NFS4_OPEN_DELEGATE_NONE;
if (open->op_claim_type == NFS4_OPEN_CLAIM_PREVIOUS &&
--
1.9.3


2014-07-18 17:31:43

by Jeff Layton

Subject: Re: [PATCH v4 01/10] nfsd: Protect the nfs4_file delegation fields using the fi_lock

On Fri, 18 Jul 2014 12:28:25 -0400
"J. Bruce Fields" <[email protected]> wrote:

> On Fri, Jul 18, 2014 at 11:13:27AM -0400, Jeff Layton wrote:
> > Move more of the delegation fields to be protected by the fi_lock. It's
> > more granular than the state_lock and in later patches we'll want to
> > be able to rely on it in addition to the state_lock.
> >
> > Also, the current code in nfs4_setlease calls vfs_setlease and uses the
> > client_mutex to ensure that it doesn't disappear before we can hash the
> > delegation. With the client_mutex gone, we'll have a potential race
> > condition.
> >
> > It's possible that the delegation could be recalled after we acquire the
> > lease but before we ever get around to hashing it. If that happens, then
> > we'd have a nfs4_file that *thinks* it has a delegation, when it
> > actually has none.
>
> I understand now, thanks: so the lease break code walks the list of
> delegations associated with the file, finds none, and issues no recall,
> but the open code continues merrily on and returns a delegation, with
> the result that we return the client a delegation that will never be
> recalled.
>
> That could be worded more carefully, and would be worth a separate patch
> (since the bug predates the new locking).
>

Yes, that's basically correct. I'd have to think about how to fix that
with the current code. It's probably doable if you think it's
worthwhile, but I'll need to rebase this set on top of it.

> > Attempt to acquire a delegation. If that succeeds, take the spinlocks
> > and then check to see if the file has had a conflict show up since then.
> > If it has, then we assume that the lease is no longer valid and that
> > we shouldn't hand out a delegation.
> >
> > There's also one more potential (but very unlikely) problem. If the
> > lease is broken before the delegation is hashed, then it could leak.
> > In the event that the fi_delegations list is empty, reset the
> > fl_break_time to jiffies so that it's cleaned up ASAP by
> > the normal lease handling code.
>
> Is there actually any guarantee time_out_leases() will get called on
> this inode again?
>
> --b.
>

Yes. Lease breaks are handled in two phases. We walk the i_flock list
and issue a ->lm_break on each lease, and then later we walk the list
again after putting the task to sleep, and try to time out the leases.
So by doing this, we should ensure that the task will wake up after
sleeping and delete the lease.
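
Rough shape of the blocking __break_lease() path being described, for
reference (a simplified sketch, not the actual fs/locks.c source):

	spin_lock(&inode->i_lock);
	time_out_leases(inode);		/* reap already-expired leases */
	/* phase 1: notify each lease holder on the i_flock list */
	for (fl = inode->i_flock; fl; fl = fl->fl_next)
		if (leases_conflict(fl, new_fl))
			fl->fl_lmops->lm_break(fl);
	if (mode & O_NONBLOCK) {
		error = -EWOULDBLOCK;	/* nfsd's breaks return here */
		goto out;
	}
	spin_unlock(&inode->i_lock);
	error = wait_event_interruptible_timeout(new_fl->fl_wait,
						 !new_fl->fl_next, break_time);
	spin_lock(&inode->i_lock);
	/* phase 2: walk the list again and delete timed-out leases */
	if (error == 0)
		time_out_leases(inode);
out:
	spin_unlock(&inode->i_lock);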

> >
> > Signed-off-by: Trond Myklebust <[email protected]>
> > Signed-off-by: Jeff Layton <[email protected]>
> > ---
> > fs/nfsd/nfs4state.c | 90 +++++++++++++++++++++++++++++++++++++++--------------
> > 1 file changed, 66 insertions(+), 24 deletions(-)
> >
> > diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
> > index fd4deb049ddf..9ab067e85b51 100644
> > --- a/fs/nfsd/nfs4state.c
> > +++ b/fs/nfsd/nfs4state.c
> > @@ -624,6 +624,8 @@ nfs4_put_delegation(struct nfs4_delegation *dp)
> >
> > static void nfs4_put_deleg_lease(struct nfs4_file *fp)
> > {
> > + lockdep_assert_held(&state_lock);
> > +
> > if (!fp->fi_lease)
> > return;
> > if (atomic_dec_and_test(&fp->fi_delegees)) {
> > @@ -643,11 +645,10 @@ static void
> > hash_delegation_locked(struct nfs4_delegation *dp, struct nfs4_file *fp)
> > {
> > lockdep_assert_held(&state_lock);
> > + lockdep_assert_held(&fp->fi_lock);
> >
> > dp->dl_stid.sc_type = NFS4_DELEG_STID;
> > - spin_lock(&fp->fi_lock);
> > list_add(&dp->dl_perfile, &fp->fi_delegations);
> > - spin_unlock(&fp->fi_lock);
> > list_add(&dp->dl_perclnt, &dp->dl_stid.sc_client->cl_delegations);
> > }
> >
> > @@ -659,17 +660,18 @@ unhash_delegation(struct nfs4_delegation *dp)
> >
> > spin_lock(&state_lock);
> > dp->dl_stid.sc_type = NFS4_CLOSED_DELEG_STID;
> > + spin_lock(&fp->fi_lock);
> > list_del_init(&dp->dl_perclnt);
> > list_del_init(&dp->dl_recall_lru);
> > - spin_lock(&fp->fi_lock);
> > list_del_init(&dp->dl_perfile);
> > spin_unlock(&fp->fi_lock);
> > - spin_unlock(&state_lock);
> > if (fp) {
> > nfs4_put_deleg_lease(fp);
> > - put_nfs4_file(fp);
> > dp->dl_file = NULL;
> > }
> > + spin_unlock(&state_lock);
> > + if (fp)
> > + put_nfs4_file(fp);
> > }
> >
> > static void destroy_revoked_delegation(struct nfs4_delegation *dp)
> > @@ -3143,10 +3145,19 @@ static void nfsd_break_deleg_cb(struct file_lock *fl)
> > */
> > fl->fl_break_time = 0;
> >
> > - fp->fi_had_conflict = true;
> > spin_lock(&fp->fi_lock);
> > - list_for_each_entry(dp, &fp->fi_delegations, dl_perfile)
> > - nfsd_break_one_deleg(dp);
> > + fp->fi_had_conflict = true;
> > + /*
> > + * If there are no delegations on the list, then we can't count on this
> > + * lease ever being cleaned up. Set the fl_break_time to jiffies so that
> > + * time_out_leases will do it ASAP. The fact that fi_had_conflict is now
> > + * true should keep any new delegations from being hashed.
> > + */
> > + if (list_empty(&fp->fi_delegations))
> > + fl->fl_break_time = jiffies;
> > + else
> > + list_for_each_entry(dp, &fp->fi_delegations, dl_perfile)
> > + nfsd_break_one_deleg(dp);
> > spin_unlock(&fp->fi_lock);
> > }
> >
> > @@ -3493,46 +3504,77 @@ static int nfs4_setlease(struct nfs4_delegation *dp)
> > {
> > struct nfs4_file *fp = dp->dl_file;
> > struct file_lock *fl;
> > - int status;
> > + struct file *filp;
> > + int status = 0;
> >
> > fl = nfs4_alloc_init_lease(fp, NFS4_OPEN_DELEGATE_READ);
> > if (!fl)
> > return -ENOMEM;
> > - fl->fl_file = find_readable_file(fp);
> > - status = vfs_setlease(fl->fl_file, fl->fl_type, &fl);
> > - if (status)
> > - goto out_free;
> > + filp = find_readable_file(fp);
> > + if (!filp) {
> > + /* We should always have a readable file here */
> > + WARN_ON_ONCE(1);
> > + return -EBADF;
> > + }
> > + status = vfs_setlease(filp, fl->fl_type, &fl);
> > + if (status) {
> > + locks_free_lock(fl);
> > + goto out_fput;
> > + }
> > + spin_lock(&state_lock);
> > + spin_lock(&fp->fi_lock);
> > + /* Did the lease get broken before we took the lock? */
> > + status = -EAGAIN;
> > + if (fp->fi_had_conflict)
> > + goto out_unlock;
> > + /* Race breaker */
> > + if (fp->fi_lease) {
> > + status = 0;
> > + atomic_inc(&fp->fi_delegees);
> > + hash_delegation_locked(dp, fp);
> > + goto out_unlock;
> > + }
> > fp->fi_lease = fl;
> > - fp->fi_deleg_file = fl->fl_file;
> > + fp->fi_deleg_file = filp;
> > atomic_set(&fp->fi_delegees, 1);
> > - spin_lock(&state_lock);
> > hash_delegation_locked(dp, fp);
> > + spin_unlock(&fp->fi_lock);
> > spin_unlock(&state_lock);
> > return 0;
> > -out_free:
> > - if (fl->fl_file)
> > - fput(fl->fl_file);
> > - locks_free_lock(fl);
> > +out_unlock:
> > + spin_unlock(&fp->fi_lock);
> > + spin_unlock(&state_lock);
> > +out_fput:
> > + if (filp)
> > + fput(filp);
> > return status;
> > }
> >
> > static int nfs4_set_delegation(struct nfs4_delegation *dp, struct nfs4_file *fp)
> > {
> > + int status = 0;
> > +
> > if (fp->fi_had_conflict)
> > return -EAGAIN;
> > get_nfs4_file(fp);
> > + spin_lock(&state_lock);
> > + spin_lock(&fp->fi_lock);
> > dp->dl_file = fp;
> > - if (!fp->fi_lease)
> > + if (!fp->fi_lease) {
> > + spin_unlock(&fp->fi_lock);
> > + spin_unlock(&state_lock);
> > return nfs4_setlease(dp);
> > - spin_lock(&state_lock);
> > + }
> > atomic_inc(&fp->fi_delegees);
> > if (fp->fi_had_conflict) {
> > - spin_unlock(&state_lock);
> > - return -EAGAIN;
> > + status = -EAGAIN;
> > + goto out_unlock;
> > }
> > hash_delegation_locked(dp, fp);
> > +out_unlock:
> > + spin_unlock(&fp->fi_lock);
> > spin_unlock(&state_lock);
> > - return 0;
> > + return status;
> > }
> >
> > static void nfsd4_open_deleg_none_ext(struct nfsd4_open *open, int status)
> > --
> > 1.9.3
> >


--
Jeff Layton <[email protected]>

2014-07-18 15:13:55

by Jeff Layton

Subject: [PATCH v4 08/10] nfsd: clean up arguments to nfs4_open_delegation

No need to pass in a net pointer since we can derive it from the stateid's client.

Signed-off-by: Jeff Layton <[email protected]>
---
fs/nfsd/nfs4state.c | 13 +++++++------
1 file changed, 7 insertions(+), 6 deletions(-)

diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
index 3e761ea48a70..1834d9bd5935 100644
--- a/fs/nfsd/nfs4state.c
+++ b/fs/nfsd/nfs4state.c
@@ -3608,11 +3608,12 @@ static void nfsd4_open_deleg_none_ext(struct nfsd4_open *open, int status)
* proper support for them.
*/
static void
-nfs4_open_delegation(struct net *net, struct svc_fh *fh,
- struct nfsd4_open *open, struct nfs4_ol_stateid *stp)
+nfs4_open_delegation(struct svc_fh *fh, struct nfsd4_open *open,
+ struct nfs4_ol_stateid *stp)
{
struct nfs4_delegation *dp;
- struct nfs4_openowner *oo = container_of(stp->st_stateowner, struct nfs4_openowner, oo_owner);
+ struct nfs4_openowner *oo = openowner(stp->st_stateowner);
+ struct nfs4_client *clp = stp->st_stid.sc_client;
int cb_up;
int status = 0;

@@ -3631,7 +3632,7 @@ nfs4_open_delegation(struct net *net, struct svc_fh *fh,
* Let's not give out any delegations till everyone's
* had the chance to reclaim theirs....
*/
- if (locks_in_grace(net))
+ if (locks_in_grace(clp->net))
goto out_no_deleg;
if (!cb_up || !(oo->oo_flags & NFS4_OO_CONFIRMED))
goto out_no_deleg;
@@ -3650,7 +3651,7 @@ nfs4_open_delegation(struct net *net, struct svc_fh *fh,
default:
goto out_no_deleg;
}
- dp = alloc_init_deleg(oo->oo_owner.so_client, fh);
+ dp = alloc_init_deleg(clp, fh);
if (dp == NULL)
goto out_no_deleg;
status = nfs4_set_delegation(dp, stp->st_file);
@@ -3764,7 +3765,7 @@ nfsd4_process_open2(struct svc_rqst *rqstp, struct svc_fh *current_fh, struct nf
* Attempt to hand out a delegation. No error return, because the
* OPEN succeeds even if we fail.
*/
- nfs4_open_delegation(SVC_NET(rqstp), current_fh, open, stp);
+ nfs4_open_delegation(current_fh, open, stp);
nodeleg:
status = nfs_ok;

--
1.9.3


2014-07-18 15:14:03

by Jeff Layton

Subject: [PATCH v4 10/10] nfsd: give block_delegation and delegation_blocked its own spinlock

The state lock can be fairly heavily contended, and there's no reason
that nfs4_file lookups and delegation_blocked should be mutually
exclusive. Let's give the new block_delegation code its own spinlock.
It does mean that we'll need to take a different lock in the delegation
break code, but that's not generally as critical to performance.
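
Condensed, the locking rules for the filter after this change (drawn
from the diff below, illustrative): lookups stay lockless, and the new
lock only covers updates and the once-per-30-seconds swap of the two
bitmaps:

	if (seconds_since_boot() - bd->swap_time > 30) {
		spin_lock(&blocked_delegations_lock);
		/* re-check under the lock: another task may have
		 * already swapped the filters */
		if (seconds_since_boot() - bd->swap_time > 30) {
			/* age out the old filter and flip bd->new */
		}
		spin_unlock(&blocked_delegations_lock);
	}
	/* test_bit() on the filters needs no lock */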

Cc: Neil Brown <[email protected]>
Signed-off-by: Jeff Layton <[email protected]>
---
fs/nfsd/nfs4state.c | 25 +++++++++++++------------
1 file changed, 13 insertions(+), 12 deletions(-)

diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
index a2c6c85adfc7..952def00363b 100644
--- a/fs/nfsd/nfs4state.c
+++ b/fs/nfsd/nfs4state.c
@@ -506,10 +506,11 @@ static struct nfs4_ol_stateid * nfs4_alloc_stateid(struct nfs4_client *clp)
* Each filter is 256 bits. We hash the filehandle to 32bit and use the
* low 3 bytes as hash-table indices.
*
- * 'state_lock', which is always held when block_delegations() is called,
- * is used to manage concurrent access. Testing does not need the lock
- * except when swapping the two filters.
+ * 'blocked_delegations_lock', which is always held when block_delegations()
+ * is called, is used to manage concurrent access. Testing does not need the
+ * lock except when swapping the two filters.
*/
+static DEFINE_SPINLOCK(blocked_delegations_lock);
static struct bloom_pair {
int entries, old_entries;
time_t swap_time;
@@ -525,7 +526,7 @@ static int delegation_blocked(struct knfsd_fh *fh)
if (bd->entries == 0)
return 0;
if (seconds_since_boot() - bd->swap_time > 30) {
- spin_lock(&state_lock);
+ spin_lock(&blocked_delegations_lock);
if (seconds_since_boot() - bd->swap_time > 30) {
bd->entries -= bd->old_entries;
bd->old_entries = bd->entries;
@@ -534,7 +535,7 @@ static int delegation_blocked(struct knfsd_fh *fh)
bd->new = 1-bd->new;
bd->swap_time = seconds_since_boot();
}
- spin_unlock(&state_lock);
+ spin_unlock(&blocked_delegations_lock);
}
hash = arch_fast_hash(&fh->fh_base, fh->fh_size, 0);
if (test_bit(hash&255, bd->set[0]) &&
@@ -555,16 +556,16 @@ static void block_delegations(struct knfsd_fh *fh)
u32 hash;
struct bloom_pair *bd = &blocked_delegations;

- lockdep_assert_held(&state_lock);
-
hash = arch_fast_hash(&fh->fh_base, fh->fh_size, 0);

__set_bit(hash&255, bd->set[bd->new]);
__set_bit((hash>>8)&255, bd->set[bd->new]);
__set_bit((hash>>16)&255, bd->set[bd->new]);
+ spin_lock(&blocked_delegations_lock);
if (bd->entries == 0)
bd->swap_time = seconds_since_boot();
bd->entries += 1;
+ spin_unlock(&blocked_delegations_lock);
}

static struct nfs4_delegation *
@@ -3097,16 +3098,16 @@ void nfsd4_prepare_cb_recall(struct nfs4_delegation *dp)
struct nfs4_client *clp = dp->dl_stid.sc_client;
struct nfsd_net *nn = net_generic(clp->net, nfsd_net_id);

- /*
- * We can't do this in nfsd_break_deleg_cb because it is
- * already holding inode->i_lock
- */
- spin_lock(&state_lock);
block_delegations(&dp->dl_fh);
+
/*
+ * We can't do this in nfsd_break_deleg_cb because it is
+ * already holding inode->i_lock.
+ *
* If the dl_time != 0, then we know that it has already been
* queued for a lease break. Don't queue it again.
*/
+ spin_lock(&state_lock);
if (dp->dl_time == 0) {
dp->dl_time = get_seconds();
list_add_tail(&dp->dl_recall_lru, &nn->del_recall_lru);
--
1.9.3


2014-07-21 21:05:38

by J. Bruce Fields

Subject: Re: [PATCH v4 01/10] nfsd: Protect the nfs4_file delegation fields using the fi_lock

On Fri, Jul 18, 2014 at 03:21:49PM -0400, J. Bruce Fields wrote:
> On Fri, Jul 18, 2014 at 03:04:04PM -0400, Jeff Layton wrote:
> > On Fri, 18 Jul 2014 13:49:57 -0400
> > "J. Bruce Fields" <[email protected]> wrote:
> >
> > > On Fri, Jul 18, 2014 at 01:31:40PM -0400, Jeff Layton wrote:
> > > > On Fri, 18 Jul 2014 12:28:25 -0400
> > > > "J. Bruce Fields" <[email protected]> wrote:
> > > >
> > > > > On Fri, Jul 18, 2014 at 11:13:27AM -0400, Jeff Layton wrote:
> > > > > > Move more of the delegation fields to be protected by the fi_lock. It's
> > > > > > more granular than the state_lock and in later patches we'll want to
> > > > > > be able to rely on it in addition to the state_lock.
> > > > > >
> > > > > > Also, the current code in nfs4_setlease calls vfs_setlease and uses the
> > > > > > client_mutex to ensure that it doesn't disappear before we can hash the
> > > > > > delegation. With the client_mutex gone, we'll have a potential race
> > > > > > condition.
> > > > > >
> > > > > > It's possible that the delegation could be recalled after we acquire the
> > > > > > lease but before we ever get around to hashing it. If that happens, then
> > > > > > we'd have a nfs4_file that *thinks* it has a delegation, when it
> > > > > > actually has none.
> > > > >
> > > > > I understand now, thanks: so the lease break code walks the list of
> > > > > delegations associated with the file, finds none, and issues no recall,
> > > > > but the open code continues merrily on and returns a delegation, with
> > > > > the result that we return the client a delegation that will never be
> > > > > recalled.
> > > > >
> > > > > That could be worded more carefully, and would be worth a separate patch
> > > > > (since the bug predates the new locking).
> > > > >
> > > >
> > > > Yes, that's basically correct. I'd have to think about how to fix that
> > > > with the current code. It's probably doable if you think it's
> > > > worthwhile, but I'll need to rebase this set on top of it.
> > >
> > > Well, I was wondering if this patch could just be split in two, no need
> > > to backport further than that.
> > >
> >
> > Erm, now that I've looked, I don't think it'll be that easy. The key
> > here is to ensure that fi_had_conflict is set while holding the
> > fi_lock. The trick here is that we need to take it in nfs4_setlease as
> > well, and check the flag before hashing the delegation without dropping
> > the fi_lock.
>
> OK, I'll live. For the sake of anyone that actually runs across that
> bug I'll update the summary and changelog to emphasize the bugfix over
> the locking change.

So, intending to apply as follows.

--b.

commit 417c6629b2d81d5a18d29c4bbb6a9a4c64282a36
Author: Jeff Layton <[email protected]>
Date: Mon Jul 21 09:34:57 2014 -0400

nfsd: fix race that grants unrecallable delegation

If nfs4_setlease successfully acquires a new delegation, and another
task breaks the delegation before we reach hash_delegation_locked, then
the breaking task will see an empty fi_delegations list and do nothing.
The client will receive an open reply incorrectly granting a delegation
and will never receive a recall.

Move more of the delegation fields to be protected by the fi_lock. It's
more granular than the state_lock and in later patches we'll want to
be able to rely on it in addition to the state_lock.

Attempt to acquire a delegation. If that succeeds, take the spinlocks
and then check to see if the file has had a conflict show up since then.
If it has, then we assume that the lease is no longer valid and that
we shouldn't hand out a delegation.

There's also one more potential (but very unlikely) problem. If the
lease is broken before the delegation is hashed, then it could leak.
In the event that the fi_delegations list is empty, reset the
fl_break_time to jiffies so that it's cleaned up ASAP by
the normal lease handling code.
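
To make the race concrete, the interleaving this patch closes
(illustrative):

	opening task                       conflicting task
	------------                       ----------------
	vfs_setlease() succeeds
	                                   __break_lease()
	                                     nfsd_break_deleg_cb() finds
	                                     fi_delegations empty, issues
	                                     no recall
	spin_lock(&fp->fi_lock)
	sees fi_had_conflict == true,
	returns -EAGAIN rather than
	hashing an unrecallable
	delegation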

Signed-off-by: Trond Myklebust <[email protected]>
Signed-off-by: Jeff Layton <[email protected]>
Reviewed-by: Christoph Hellwig <[email protected]>
Signed-off-by: J. Bruce Fields <[email protected]>

diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
index 10cdb67..cc477dd 100644
--- a/fs/nfsd/nfs4state.c
+++ b/fs/nfsd/nfs4state.c
@@ -624,6 +624,8 @@ nfs4_put_delegation(struct nfs4_delegation *dp)

static void nfs4_put_deleg_lease(struct nfs4_file *fp)
{
+ lockdep_assert_held(&state_lock);
+
if (!fp->fi_lease)
return;
if (atomic_dec_and_test(&fp->fi_delegees)) {
@@ -643,11 +645,10 @@ static void
hash_delegation_locked(struct nfs4_delegation *dp, struct nfs4_file *fp)
{
lockdep_assert_held(&state_lock);
+ lockdep_assert_held(&fp->fi_lock);

dp->dl_stid.sc_type = NFS4_DELEG_STID;
- spin_lock(&fp->fi_lock);
list_add(&dp->dl_perfile, &fp->fi_delegations);
- spin_unlock(&fp->fi_lock);
list_add(&dp->dl_perclnt, &dp->dl_stid.sc_client->cl_delegations);
}

@@ -659,17 +660,18 @@ unhash_delegation(struct nfs4_delegation *dp)

spin_lock(&state_lock);
dp->dl_stid.sc_type = NFS4_CLOSED_DELEG_STID;
+ spin_lock(&fp->fi_lock);
list_del_init(&dp->dl_perclnt);
list_del_init(&dp->dl_recall_lru);
- spin_lock(&fp->fi_lock);
list_del_init(&dp->dl_perfile);
spin_unlock(&fp->fi_lock);
- spin_unlock(&state_lock);
if (fp) {
nfs4_put_deleg_lease(fp);
- put_nfs4_file(fp);
dp->dl_file = NULL;
}
+ spin_unlock(&state_lock);
+ if (fp)
+ put_nfs4_file(fp);
}

static void destroy_revoked_delegation(struct nfs4_delegation *dp)
@@ -3141,10 +3143,19 @@ static void nfsd_break_deleg_cb(struct file_lock *fl)
*/
fl->fl_break_time = 0;

- fp->fi_had_conflict = true;
spin_lock(&fp->fi_lock);
- list_for_each_entry(dp, &fp->fi_delegations, dl_perfile)
- nfsd_break_one_deleg(dp);
+ fp->fi_had_conflict = true;
+ /*
+ * If there are no delegations on the list, then we can't count on this
+ * lease ever being cleaned up. Set the fl_break_time to jiffies so that
+ * time_out_leases will do it ASAP. The fact that fi_had_conflict is now
+ * true should keep any new delegations from being hashed.
+ */
+ if (list_empty(&fp->fi_delegations))
+ fl->fl_break_time = jiffies;
+ else
+ list_for_each_entry(dp, &fp->fi_delegations, dl_perfile)
+ nfsd_break_one_deleg(dp);
spin_unlock(&fp->fi_lock);
}

@@ -3491,46 +3502,77 @@ static int nfs4_setlease(struct nfs4_delegation *dp)
{
struct nfs4_file *fp = dp->dl_file;
struct file_lock *fl;
- int status;
+ struct file *filp;
+ int status = 0;

fl = nfs4_alloc_init_lease(fp, NFS4_OPEN_DELEGATE_READ);
if (!fl)
return -ENOMEM;
- fl->fl_file = find_readable_file(fp);
- status = vfs_setlease(fl->fl_file, fl->fl_type, &fl);
- if (status)
- goto out_free;
+ filp = find_readable_file(fp);
+ if (!filp) {
+ /* We should always have a readable file here */
+ WARN_ON_ONCE(1);
+ return -EBADF;
+ }
+ fl->fl_file = filp;
+ status = vfs_setlease(filp, fl->fl_type, &fl);
+ if (status) {
+ locks_free_lock(fl);
+ goto out_fput;
+ }
+ spin_lock(&state_lock);
+ spin_lock(&fp->fi_lock);
+ /* Did the lease get broken before we took the lock? */
+ status = -EAGAIN;
+ if (fp->fi_had_conflict)
+ goto out_unlock;
+ /* Race breaker */
+ if (fp->fi_lease) {
+ status = 0;
+ atomic_inc(&fp->fi_delegees);
+ hash_delegation_locked(dp, fp);
+ goto out_unlock;
+ }
fp->fi_lease = fl;
- fp->fi_deleg_file = fl->fl_file;
+ fp->fi_deleg_file = filp;
atomic_set(&fp->fi_delegees, 1);
- spin_lock(&state_lock);
hash_delegation_locked(dp, fp);
+ spin_unlock(&fp->fi_lock);
spin_unlock(&state_lock);
return 0;
-out_free:
- if (fl->fl_file)
- fput(fl->fl_file);
- locks_free_lock(fl);
+out_unlock:
+ spin_unlock(&fp->fi_lock);
+ spin_unlock(&state_lock);
+out_fput:
+ fput(filp);
return status;
}

static int nfs4_set_delegation(struct nfs4_delegation *dp, struct nfs4_file *fp)
{
+ int status = 0;
+
if (fp->fi_had_conflict)
return -EAGAIN;
get_nfs4_file(fp);
+ spin_lock(&state_lock);
+ spin_lock(&fp->fi_lock);
dp->dl_file = fp;
- if (!fp->fi_lease)
+ if (!fp->fi_lease) {
+ spin_unlock(&fp->fi_lock);
+ spin_unlock(&state_lock);
return nfs4_setlease(dp);
- spin_lock(&state_lock);
+ }
atomic_inc(&fp->fi_delegees);
if (fp->fi_had_conflict) {
- spin_unlock(&state_lock);
- return -EAGAIN;
+ status = -EAGAIN;
+ goto out_unlock;
}
hash_delegation_locked(dp, fp);
+out_unlock:
+ spin_unlock(&fp->fi_lock);
spin_unlock(&state_lock);
- return 0;
+ return status;
}

static void nfsd4_open_deleg_none_ext(struct nfsd4_open *open, int status)

2014-07-18 17:24:58

by Jeff Layton

Subject: Re: [PATCH v4 04/10] nfsd: Fix delegation revocation

On Fri, 18 Jul 2014 12:44:17 -0400
"J. Bruce Fields" <[email protected]> wrote:

> On Fri, Jul 18, 2014 at 11:13:30AM -0400, Jeff Layton wrote:
> > Ensure that the delegations cannot be found by the laundromat etc once
> > we add them to the various 'revoke' lists.
>
> So if I understand right, the big mutex protects all of this right now,
> so this is all just moving the unhashing under a finer-grained lock to
> prevent that in-between state being exposed after the big lock's
> dropped. Looks reasonable.
>
> --b.
>

Yes, sorry -- I forgot the obligatory "this is not a problem until the
client_mutex goes away" comment...

> >
> > Signed-off-by: Trond Myklebust <[email protected]>
> > Signed-off-by: Jeff Layton <[email protected]>
> > Reviewed-by: Christoph Hellwig <[email protected]>
> > ---
> > fs/nfsd/nfs4state.c | 37 +++++++++++++++++++++----------------
> > 1 file changed, 21 insertions(+), 16 deletions(-)
> >
> > diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
> > index 60ae21abce00..7c5233427c9b 100644
> > --- a/fs/nfsd/nfs4state.c
> > +++ b/fs/nfsd/nfs4state.c
> > @@ -650,13 +650,13 @@ hash_delegation_locked(struct nfs4_delegation *dp, struct nfs4_file *fp)
> > list_add(&dp->dl_perclnt, &dp->dl_stid.sc_client->cl_delegations);
> > }
> >
> > -/* Called under the state lock. */
> > static void
> > -unhash_delegation(struct nfs4_delegation *dp)
> > +unhash_delegation_locked(struct nfs4_delegation *dp)
> > {
> > struct nfs4_file *fp = dp->dl_file;
> >
> > - spin_lock(&state_lock);
> > + lockdep_assert_held(&state_lock);
> > +
> > dp->dl_stid.sc_type = NFS4_CLOSED_DELEG_STID;
> > spin_lock(&fp->fi_lock);
> > list_del_init(&dp->dl_perclnt);
> > @@ -665,7 +665,6 @@ unhash_delegation(struct nfs4_delegation *dp)
> > spin_unlock(&fp->fi_lock);
> > if (fp)
> > nfs4_put_deleg_lease(fp);
> > - spin_unlock(&state_lock);
> > }
> >
> > static void destroy_revoked_delegation(struct nfs4_delegation *dp)
> > @@ -676,7 +675,9 @@ static void destroy_revoked_delegation(struct nfs4_delegation *dp)
> >
> > static void destroy_delegation(struct nfs4_delegation *dp)
> > {
> > - unhash_delegation(dp);
> > + spin_lock(&state_lock);
> > + unhash_delegation_locked(dp);
> > + spin_unlock(&state_lock);
> > nfs4_put_delegation(dp);
> > }
> >
> > @@ -685,11 +686,10 @@ static void revoke_delegation(struct nfs4_delegation *dp)
> > struct nfs4_client *clp = dp->dl_stid.sc_client;
> >
> > if (clp->cl_minorversion == 0)
> > - destroy_delegation(dp);
> > + destroy_revoked_delegation(dp);
> > else {
> > - unhash_delegation(dp);
> > dp->dl_stid.sc_type = NFS4_REVOKED_DELEG_STID;
> > - list_add(&dp->dl_recall_lru, &clp->cl_revoked);
> > + list_move(&dp->dl_recall_lru, &clp->cl_revoked);
> > }
> > }
> >
> > @@ -1447,15 +1447,16 @@ destroy_client(struct nfs4_client *clp)
> > spin_lock(&state_lock);
> > while (!list_empty(&clp->cl_delegations)) {
> > dp = list_entry(clp->cl_delegations.next, struct nfs4_delegation, dl_perclnt);
> > - list_del_init(&dp->dl_perclnt);
> > + unhash_delegation_locked(dp);
> > /* Ensure that deleg break won't try to requeue it */
> > ++dp->dl_time;
> > - list_move(&dp->dl_recall_lru, &reaplist);
> > + list_add(&dp->dl_recall_lru, &reaplist);
> > }
> > spin_unlock(&state_lock);
> > while (!list_empty(&reaplist)) {
> > dp = list_entry(reaplist.next, struct nfs4_delegation, dl_recall_lru);
> > - destroy_delegation(dp);
> > + list_del_init(&dp->dl_recall_lru);
> > + nfs4_put_delegation(dp);
> > }
> > list_splice_init(&clp->cl_revoked, &reaplist);
> > while (!list_empty(&reaplist)) {
> > @@ -3655,7 +3656,7 @@ nfs4_open_delegation(struct net *net, struct svc_fh *fh,
> > open->op_delegate_type = NFS4_OPEN_DELEGATE_READ;
> > return;
> > out_free:
> > - destroy_delegation(dp);
> > + nfs4_put_delegation(dp);
> > out_no_deleg:
> > open->op_delegate_type = NFS4_OPEN_DELEGATE_NONE;
> > if (open->op_claim_type == NFS4_OPEN_CLAIM_PREVIOUS &&
> > @@ -3894,7 +3895,8 @@ nfs4_laundromat(struct nfsd_net *nn)
> > new_timeo = min(new_timeo, t);
> > break;
> > }
> > - list_move(&dp->dl_recall_lru, &reaplist);
> > + unhash_delegation_locked(dp);
> > + list_add(&dp->dl_recall_lru, &reaplist);
> > }
> > spin_unlock(&state_lock);
> > list_for_each_safe(pos, next, &reaplist) {
> > @@ -5369,7 +5371,8 @@ static u64 nfsd_find_all_delegations(struct nfs4_client *clp, u64 max,
> > * don't monkey with it now that we are.
> > */
> > ++dp->dl_time;
> > - list_move(&dp->dl_recall_lru, victims);
> > + unhash_delegation_locked(dp);
> > + list_add(&dp->dl_recall_lru, victims);
> > }
> > if (++count == max)
> > break;
> > @@ -5624,12 +5627,14 @@ nfs4_state_shutdown_net(struct net *net)
> > spin_lock(&state_lock);
> > list_for_each_safe(pos, next, &nn->del_recall_lru) {
> > dp = list_entry (pos, struct nfs4_delegation, dl_recall_lru);
> > - list_move(&dp->dl_recall_lru, &reaplist);
> > + unhash_delegation_locked(dp);
> > + list_add(&dp->dl_recall_lru, &reaplist);
> > }
> > spin_unlock(&state_lock);
> > list_for_each_safe(pos, next, &reaplist) {
> > dp = list_entry (pos, struct nfs4_delegation, dl_recall_lru);
> > - destroy_delegation(dp);
> > + list_del_init(&dp->dl_recall_lru);
> > + nfs4_put_delegation(dp);
> > }
> >
> > nfsd4_client_tracking_exit(net);
> > --
> > 1.9.3
> >


--
Jeff Layton <[email protected]>

2014-07-18 15:13:46

by Jeff Layton

Subject: [PATCH v4 01/10] nfsd: Protect the nfs4_file delegation fields using the fi_lock

Move more of the delegation fields to be protected by the fi_lock. It's
more granular than the state_lock and in later patches we'll want to
be able to rely on it in addition to the state_lock.

Also, the current code in nfs4_setlease calls vfs_setlease and uses the
client_mutex to ensure that it doesn't disappear before we can hash the
delegation. With the client_mutex gone, we'll have a potential race
condition.

It's possible that the delegation could be recalled after we acquire the
lease but before we ever get around to hashing it. If that happens, then
we'd have a nfs4_file that *thinks* it has a delegation, when it
actually has none.

Attempt to acquire a delegation. If that succeeds, take the spinlocks
and then check to see if the file has had a conflict show up since then.
If it has, then we assume that the lease is no longer valid and that
we shouldn't hand out a delegation.

There's also one more potential (but very unlikely) problem. If the
lease is broken before the delegation is hashed, then it could leak.
In the event that the fi_delegations list is empty, reset the
fl_break_time to jiffies so that it's cleaned up ASAP by
the normal lease handling code.

Signed-off-by: Trond Myklebust <[email protected]>
Signed-off-by: Jeff Layton <[email protected]>
---
fs/nfsd/nfs4state.c | 90 +++++++++++++++++++++++++++++++++++++++--------------
1 file changed, 66 insertions(+), 24 deletions(-)

diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
index fd4deb049ddf..9ab067e85b51 100644
--- a/fs/nfsd/nfs4state.c
+++ b/fs/nfsd/nfs4state.c
@@ -624,6 +624,8 @@ nfs4_put_delegation(struct nfs4_delegation *dp)

static void nfs4_put_deleg_lease(struct nfs4_file *fp)
{
+ lockdep_assert_held(&state_lock);
+
if (!fp->fi_lease)
return;
if (atomic_dec_and_test(&fp->fi_delegees)) {
@@ -643,11 +645,10 @@ static void
hash_delegation_locked(struct nfs4_delegation *dp, struct nfs4_file *fp)
{
lockdep_assert_held(&state_lock);
+ lockdep_assert_held(&fp->fi_lock);

dp->dl_stid.sc_type = NFS4_DELEG_STID;
- spin_lock(&fp->fi_lock);
list_add(&dp->dl_perfile, &fp->fi_delegations);
- spin_unlock(&fp->fi_lock);
list_add(&dp->dl_perclnt, &dp->dl_stid.sc_client->cl_delegations);
}

@@ -659,17 +660,18 @@ unhash_delegation(struct nfs4_delegation *dp)

spin_lock(&state_lock);
dp->dl_stid.sc_type = NFS4_CLOSED_DELEG_STID;
+ spin_lock(&fp->fi_lock);
list_del_init(&dp->dl_perclnt);
list_del_init(&dp->dl_recall_lru);
- spin_lock(&fp->fi_lock);
list_del_init(&dp->dl_perfile);
spin_unlock(&fp->fi_lock);
- spin_unlock(&state_lock);
if (fp) {
nfs4_put_deleg_lease(fp);
- put_nfs4_file(fp);
dp->dl_file = NULL;
}
+ spin_unlock(&state_lock);
+ if (fp)
+ put_nfs4_file(fp);
}

static void destroy_revoked_delegation(struct nfs4_delegation *dp)
@@ -3143,10 +3145,19 @@ static void nfsd_break_deleg_cb(struct file_lock *fl)
*/
fl->fl_break_time = 0;

- fp->fi_had_conflict = true;
spin_lock(&fp->fi_lock);
- list_for_each_entry(dp, &fp->fi_delegations, dl_perfile)
- nfsd_break_one_deleg(dp);
+ fp->fi_had_conflict = true;
+ /*
+ * If there are no delegations on the list, then we can't count on this
+ * lease ever being cleaned up. Set the fl_break_time to jiffies so that
+ * time_out_leases will do it ASAP. The fact that fi_had_conflict is now
+ * true should keep any new delegations from being hashed.
+ */
+ if (list_empty(&fp->fi_delegations))
+ fl->fl_break_time = jiffies;
+ else
+ list_for_each_entry(dp, &fp->fi_delegations, dl_perfile)
+ nfsd_break_one_deleg(dp);
spin_unlock(&fp->fi_lock);
}

@@ -3493,46 +3504,77 @@ static int nfs4_setlease(struct nfs4_delegation *dp)
{
struct nfs4_file *fp = dp->dl_file;
struct file_lock *fl;
- int status;
+ struct file *filp;
+ int status = 0;

fl = nfs4_alloc_init_lease(fp, NFS4_OPEN_DELEGATE_READ);
if (!fl)
return -ENOMEM;
- fl->fl_file = find_readable_file(fp);
- status = vfs_setlease(fl->fl_file, fl->fl_type, &fl);
- if (status)
- goto out_free;
+ filp = find_readable_file(fp);
+ if (!filp) {
+ /* We should always have a readable file here */
+ WARN_ON_ONCE(1);
+ return -EBADF;
+ }
+ status = vfs_setlease(filp, fl->fl_type, &fl);
+ if (status) {
+ locks_free_lock(fl);
+ goto out_fput;
+ }
+ spin_lock(&state_lock);
+ spin_lock(&fp->fi_lock);
+ /* Did the lease get broken before we took the lock? */
+ status = -EAGAIN;
+ if (fp->fi_had_conflict)
+ goto out_unlock;
+ /* Race breaker */
+ if (fp->fi_lease) {
+ status = 0;
+ atomic_inc(&fp->fi_delegees);
+ hash_delegation_locked(dp, fp);
+ goto out_unlock;
+ }
fp->fi_lease = fl;
- fp->fi_deleg_file = fl->fl_file;
+ fp->fi_deleg_file = filp;
atomic_set(&fp->fi_delegees, 1);
- spin_lock(&state_lock);
hash_delegation_locked(dp, fp);
+ spin_unlock(&fp->fi_lock);
spin_unlock(&state_lock);
return 0;
-out_free:
- if (fl->fl_file)
- fput(fl->fl_file);
- locks_free_lock(fl);
+out_unlock:
+ spin_unlock(&fp->fi_lock);
+ spin_unlock(&state_lock);
+out_fput:
+ if (filp)
+ fput(filp);
return status;
}

static int nfs4_set_delegation(struct nfs4_delegation *dp, struct nfs4_file *fp)
{
+ int status = 0;
+
if (fp->fi_had_conflict)
return -EAGAIN;
get_nfs4_file(fp);
+ spin_lock(&state_lock);
+ spin_lock(&fp->fi_lock);
dp->dl_file = fp;
- if (!fp->fi_lease)
+ if (!fp->fi_lease) {
+ spin_unlock(&fp->fi_lock);
+ spin_unlock(&state_lock);
return nfs4_setlease(dp);
- spin_lock(&state_lock);
+ }
atomic_inc(&fp->fi_delegees);
if (fp->fi_had_conflict) {
- spin_unlock(&state_lock);
- return -EAGAIN;
+ status = -EAGAIN;
+ goto out_unlock;
}
hash_delegation_locked(dp, fp);
+out_unlock:
+ spin_unlock(&fp->fi_lock);
spin_unlock(&state_lock);
- return 0;
+ return status;
}

static void nfsd4_open_deleg_none_ext(struct nfsd4_open *open, int status)
--
1.9.3


2014-07-18 16:44:20

by J. Bruce Fields

Subject: Re: [PATCH v4 04/10] nfsd: Fix delegation revocation

On Fri, Jul 18, 2014 at 11:13:30AM -0400, Jeff Layton wrote:
> Ensure that the delegations cannot be found by the laundromat etc once
> we add them to the various 'revoke' lists.

So if I understand right, the big mutex protects all of this right now,
so this is all just moving the unhashing under a finer-grained lock to
prevent that in-between state being exposed after the big lock's
dropped. Looks reasonable.

--b.

>
> Signed-off-by: Trond Myklebust <[email protected]>
> Signed-off-by: Jeff Layton <[email protected]>
> Reviewed-by: Christoph Hellwig <[email protected]>
> ---
> fs/nfsd/nfs4state.c | 37 +++++++++++++++++++++----------------
> 1 file changed, 21 insertions(+), 16 deletions(-)
>
> diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
> index 60ae21abce00..7c5233427c9b 100644
> --- a/fs/nfsd/nfs4state.c
> +++ b/fs/nfsd/nfs4state.c
> @@ -650,13 +650,13 @@ hash_delegation_locked(struct nfs4_delegation *dp, struct nfs4_file *fp)
> list_add(&dp->dl_perclnt, &dp->dl_stid.sc_client->cl_delegations);
> }
>
> -/* Called under the state lock. */
> static void
> -unhash_delegation(struct nfs4_delegation *dp)
> +unhash_delegation_locked(struct nfs4_delegation *dp)
> {
> struct nfs4_file *fp = dp->dl_file;
>
> - spin_lock(&state_lock);
> + lockdep_assert_held(&state_lock);
> +
> dp->dl_stid.sc_type = NFS4_CLOSED_DELEG_STID;
> spin_lock(&fp->fi_lock);
> list_del_init(&dp->dl_perclnt);
> @@ -665,7 +665,6 @@ unhash_delegation(struct nfs4_delegation *dp)
> spin_unlock(&fp->fi_lock);
> if (fp)
> nfs4_put_deleg_lease(fp);
> - spin_unlock(&state_lock);
> }
>
> static void destroy_revoked_delegation(struct nfs4_delegation *dp)
> @@ -676,7 +675,9 @@ static void destroy_revoked_delegation(struct nfs4_delegation *dp)
>
> static void destroy_delegation(struct nfs4_delegation *dp)
> {
> - unhash_delegation(dp);
> + spin_lock(&state_lock);
> + unhash_delegation_locked(dp);
> + spin_unlock(&state_lock);
> nfs4_put_delegation(dp);
> }
>
> @@ -685,11 +686,10 @@ static void revoke_delegation(struct nfs4_delegation *dp)
> struct nfs4_client *clp = dp->dl_stid.sc_client;
>
> if (clp->cl_minorversion == 0)
> - destroy_delegation(dp);
> + destroy_revoked_delegation(dp);
> else {
> - unhash_delegation(dp);
> dp->dl_stid.sc_type = NFS4_REVOKED_DELEG_STID;
> - list_add(&dp->dl_recall_lru, &clp->cl_revoked);
> + list_move(&dp->dl_recall_lru, &clp->cl_revoked);
> }
> }
>
> @@ -1447,15 +1447,16 @@ destroy_client(struct nfs4_client *clp)
> spin_lock(&state_lock);
> while (!list_empty(&clp->cl_delegations)) {
> dp = list_entry(clp->cl_delegations.next, struct nfs4_delegation, dl_perclnt);
> - list_del_init(&dp->dl_perclnt);
> + unhash_delegation_locked(dp);
> /* Ensure that deleg break won't try to requeue it */
> ++dp->dl_time;
> - list_move(&dp->dl_recall_lru, &reaplist);
> + list_add(&dp->dl_recall_lru, &reaplist);
> }
> spin_unlock(&state_lock);
> while (!list_empty(&reaplist)) {
> dp = list_entry(reaplist.next, struct nfs4_delegation, dl_recall_lru);
> - destroy_delegation(dp);
> + list_del_init(&dp->dl_recall_lru);
> + nfs4_put_delegation(dp);
> }
> list_splice_init(&clp->cl_revoked, &reaplist);
> while (!list_empty(&reaplist)) {
> @@ -3655,7 +3656,7 @@ nfs4_open_delegation(struct net *net, struct svc_fh *fh,
> open->op_delegate_type = NFS4_OPEN_DELEGATE_READ;
> return;
> out_free:
> - destroy_delegation(dp);
> + nfs4_put_delegation(dp);
> out_no_deleg:
> open->op_delegate_type = NFS4_OPEN_DELEGATE_NONE;
> if (open->op_claim_type == NFS4_OPEN_CLAIM_PREVIOUS &&
> @@ -3894,7 +3895,8 @@ nfs4_laundromat(struct nfsd_net *nn)
> new_timeo = min(new_timeo, t);
> break;
> }
> - list_move(&dp->dl_recall_lru, &reaplist);
> + unhash_delegation_locked(dp);
> + list_add(&dp->dl_recall_lru, &reaplist);
> }
> spin_unlock(&state_lock);
> list_for_each_safe(pos, next, &reaplist) {
> @@ -5369,7 +5371,8 @@ static u64 nfsd_find_all_delegations(struct nfs4_client *clp, u64 max,
> * don't monkey with it now that we are.
> */
> ++dp->dl_time;
> - list_move(&dp->dl_recall_lru, victims);
> + unhash_delegation_locked(dp);
> + list_add(&dp->dl_recall_lru, victims);
> }
> if (++count == max)
> break;
> @@ -5624,12 +5627,14 @@ nfs4_state_shutdown_net(struct net *net)
> spin_lock(&state_lock);
> list_for_each_safe(pos, next, &nn->del_recall_lru) {
> dp = list_entry (pos, struct nfs4_delegation, dl_recall_lru);
> - list_move(&dp->dl_recall_lru, &reaplist);
> + unhash_delegation_locked(dp);
> + list_add(&dp->dl_recall_lru, &reaplist);
> }
> spin_unlock(&state_lock);
> list_for_each_safe(pos, next, &reaplist) {
> dp = list_entry (pos, struct nfs4_delegation, dl_recall_lru);
> - destroy_delegation(dp);
> + list_del_init(&dp->dl_recall_lru);
> + nfs4_put_delegation(dp);
> }
>
> nfsd4_client_tracking_exit(net);
> --
> 1.9.3
>

2014-07-18 19:36:04

by J. Bruce Fields

Subject: Re: [PATCH v4 01/10] nfsd: Protect the nfs4_file delegation fields using the fi_lock

On Fri, Jul 18, 2014 at 03:32:24PM -0400, Jeff Layton wrote:
> On Fri, 18 Jul 2014 15:21:49 -0400
> "J. Bruce Fields" <[email protected]> wrote:
>
> > On Fri, Jul 18, 2014 at 03:04:04PM -0400, Jeff Layton wrote:
> > > On Fri, 18 Jul 2014 13:49:57 -0400
> > > "J. Bruce Fields" <[email protected]> wrote:
> > >
> > > > On Fri, Jul 18, 2014 at 01:31:40PM -0400, Jeff Layton wrote:
> > > > > On Fri, 18 Jul 2014 12:28:25 -0400
> > > > > "J. Bruce Fields" <[email protected]> wrote:
> > > > >
> > > > > > On Fri, Jul 18, 2014 at 11:13:27AM -0400, Jeff Layton wrote:
> > > > > > > Move more of the delegation fields to be protected by the fi_lock. It's
> > > > > > > more granular than the state_lock and in later patches we'll want to
> > > > > > > be able to rely on it in addition to the state_lock.
> > > > > > >
> > > > > > > Also, the current code in nfs4_setlease calls vfs_setlease and uses the
> > > > > > > client_mutex to ensure that it doesn't disappear before we can hash the
> > > > > > > delegation. With the client_mutex gone, we'll have a potential race
> > > > > > > condition.
> > > > > > >
> > > > > > > It's possible that the delegation could be recalled after we acquire the
> > > > > > > lease but before we ever get around to hashing it. If that happens, then
> > > > > > > we'd have a nfs4_file that *thinks* it has a delegation, when it
> > > > > > > actually has none.
> > > > > >
> > > > > > I understand now, thanks: so the lease break code walks the list of
> > > > > > delegations associated with the file, finds none, and issues no recall,
> > > > > > but the open code continues merrily on and returns a delegation, with
> > > > > > the result that we return the client a delegation that will never be
> > > > > > recalled.
> > > > > >
> > > > > > That could be worded more carefully, and would be worth a separate patch
> > > > > > (since the bug predates the new locking).
> > > > > >
> > > > >
> > > > > Yes, that's basically correct. I'd have to think about how to fix that
> > > > > with the current code. It's probably doable if you think it's
> > > > > worthwhile, but I'll need to rebase this set on top of it.
> > > >
> > > > Well, I was wondering if this patch could just be split in two, no need
> > > > to backport further than that.
> > > >
> > >
> > > Erm, now that I've looked, I don't think it'll be that easy. The key
> > > here is to ensure that fi_had_conflict is set while holding the
> > > fi_lock. The trick here is that we need to take it in nfs4_setlease as
> > > well, and check the flag before hashing the delegation without dropping
> > > the fi_lock.
> >
> > OK, I'll live. For the sake of anyone that actually runs across that
> > bug I'll update the summary and changelog to emphasize the bugfix over
> > the locking change.
> >
>
> Ok, thanks.
>
> > > > > > > Attempt to acquire a delegation. If that succeeds, take the spinlocks
> > > > > > > and then check to see if the file has had a conflict show up since then.
> > > > > > > If it has, then we assume that the lease is no longer valid and that
> > > > > > > we shouldn't hand out a delegation.
> > > > > > >
> > > > > > > There's also one more potential (but very unlikely) problem. If the
> > > > > > > lease is broken before the delegation is hashed, then it could leak.
> > > > > > > In the event that the fi_delegations list is empty, reset the
> > > > > > > fl_break_time to jiffies so that it's cleaned up ASAP by
> > > > > > > the normal lease handling code.
> > > > > >
> > > > > > Is there actually any guarantee time_out_leases() will get called on
> > > > > > this inode again?
> > > > > >
> > > > > > --b.
> > > > > >
> > > > >
> > > > > Yes. Lease breaks are handled in two phases. We walk the i_flock list
> > > > > and issue a ->lm_break on each lease, and then later we walk the list
> > > > > again after putting the task to sleep, and try to time out the leases.
> > > > > So by doing this, we should ensure that the task will wake up after
> > > > > sleeping and delete it.
> > > >
> > > > In the case of an interrupt or a nonblocking break (which is what nfsd
> > > > will do), then time_out_leases isn't called again from what I could
> > > > tell.
> > > >
> > > > --b.
> > > >
> > >
> > > In both cases, time_out_leases is still called at the beginning of
> > > __break_lease. So it'll get cleaned up the next time that function is
> > > called, or when the filp is closed (in locks_remove_file).
> >
> > Right, but there's no guarantee another break_lease comes. E.g. the
> > process waiting on the lease break could get killed.
> >
> > --b.
>
> In that case, there's no harm in leaving the lease on the list until
> the filp closes.

Doh, of course.

> FWIW, I looked at trying to just remove the lease from the list, but
> that's not safe from the lm_break callback. So, I think this is the
> best we can reasonably do here.

Makes sense, thanks!

--b.

2014-07-18 15:13:52

by Jeff Layton

[permalink] [raw]
Subject: [PATCH v4 06/10] nfsd: Convert delegation counter to an atomic_long_t type

From: Trond Myklebust <[email protected]>

We want to convert to an atomic type so that we don't need to lock
across the call to alloc_init_deleg(). Then convert to a long type so
that we match the size of 'max_delegations'.

None of this is a problem today, but it will be once we remove
client_mutex protection.

Signed-off-by: Trond Myklebust <[email protected]>
Reviewed-by: Christoph Hellwig <[email protected]>
---
fs/nfsd/nfs4state.c | 18 +++++++++++-------
1 file changed, 11 insertions(+), 7 deletions(-)

diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
index d11b298e625e..40def186ee50 100644
--- a/fs/nfsd/nfs4state.c
+++ b/fs/nfsd/nfs4state.c
@@ -343,7 +343,7 @@ find_any_file(struct nfs4_file *f)
return ret;
}

-static int num_delegations;
+static atomic_long_t num_delegations;
unsigned long max_delegations;

/*
@@ -571,22 +571,23 @@ static struct nfs4_delegation *
alloc_init_deleg(struct nfs4_client *clp, struct nfs4_ol_stateid *stp, struct svc_fh *current_fh)
{
struct nfs4_delegation *dp;
+ long n;

dprintk("NFSD alloc_init_deleg\n");
- if (num_delegations > max_delegations)
- return NULL;
+ n = atomic_long_inc_return(&num_delegations);
+ if (n < 0 || n > max_delegations)
+ goto out_dec;
if (delegation_blocked(&current_fh->fh_handle))
- return NULL;
+ goto out_dec;
dp = delegstateid(nfs4_alloc_stid(clp, deleg_slab));
if (dp == NULL)
- return dp;
+ goto out_dec;
/*
* delegation seqid's are never incremented. The 4.1 special
* meaning of seqid 0 isn't meaningful, really, but let's avoid
* 0 anyway just for consistency and use 1:
*/
dp->dl_stid.sc_stateid.si_generation = 1;
- num_delegations++;
INIT_LIST_HEAD(&dp->dl_perfile);
INIT_LIST_HEAD(&dp->dl_perclnt);
INIT_LIST_HEAD(&dp->dl_recall_lru);
@@ -594,6 +595,9 @@ alloc_init_deleg(struct nfs4_client *clp, struct nfs4_ol_stateid *stp, struct sv
fh_copy_shallow(&dp->dl_fh, &current_fh->fh_handle);
INIT_WORK(&dp->dl_recall.cb_work, nfsd4_run_cb_recall);
return dp;
+out_dec:
+ atomic_long_dec(&num_delegations);
+ return NULL;
}

static void remove_stid(struct nfs4_stid *s)
@@ -616,7 +620,7 @@ nfs4_put_delegation(struct nfs4_delegation *dp)
put_nfs4_file(dp->dl_file);
remove_stid(&dp->dl_stid);
nfs4_free_stid(deleg_slab, &dp->dl_stid);
- num_delegations--;
+ atomic_long_dec(&num_delegations);
}
}

--
1.9.3


2014-07-18 15:54:07

by Christoph Hellwig

[permalink] [raw]
Subject: Re: [PATCH v4 01/10] nfsd: Protect the nfs4_file delegation fields using the fi_lock

> +out_unlock:
> + spin_unlock(&fp->fi_lock);
> + spin_unlock(&state_lock);
> +out_fput:
> + if (filp)
> + fput(filp);

I don't think filp can be NULL here.

> static int nfs4_set_delegation(struct nfs4_delegation *dp, struct nfs4_file *fp)
> {
> + int status = 0;
> +
> if (fp->fi_had_conflict)
> return -EAGAIN;
> get_nfs4_file(fp);
> + spin_lock(&state_lock);
> + spin_lock(&fp->fi_lock);
> dp->dl_file = fp;
> + if (!fp->fi_lease) {
> + spin_unlock(&fp->fi_lock);
> + spin_unlock(&state_lock);
> return nfs4_setlease(dp);
> + }
> atomic_inc(&fp->fi_delegees);
> if (fp->fi_had_conflict) {
> + status = -EAGAIN;
> + goto out_unlock;
> }
> hash_delegation_locked(dp, fp);
> +out_unlock:
> + spin_unlock(&fp->fi_lock);
> spin_unlock(&state_lock);
> + return status;

I have to admit that I didn't have time to go through all the
surrounding code yet, but is the error handling correct here,
i.e. no need to drop the file reference and clean up dp->dl_file for any
error or race case?


2014-07-18 15:57:26

by Christoph Hellwig

[permalink] [raw]
Subject: Re: [PATCH v4 07/10] nfsd: drop unused stp arg to alloc_init_deleg

On Fri, Jul 18, 2014 at 11:13:33AM -0400, Jeff Layton wrote:
> Signed-off-by: Jeff Layton <[email protected]>

Looks good,

Reviewed-by: Christoph Hellwig <[email protected]>

2014-07-21 11:44:17

by Jeff Layton

[permalink] [raw]
Subject: Re: [PATCH v4 10/10] nfsd: give block_delegation and delegation_blocked its own spinlock

On Mon, 21 Jul 2014 17:02:54 +1000
NeilBrown <[email protected]> wrote:

> On Fri, 18 Jul 2014 11:13:36 -0400 Jeff Layton <[email protected]>
> wrote:
>
> > The state lock can be fairly heavily contended, and there's no reason
> > that nfs4_file lookups and delegation_blocked should be mutually
> > exclusive. Let's give the new block_delegation code its own spinlock.
> > It does mean that we'll need to take a different lock in the delegation
> > break code, but that's not generally as critical to performance.
> >
> > Cc: Neil Brown <[email protected]>
> > Signed-off-by: Jeff Layton <[email protected]>
>
> Makes sense, thanks.
> However.....
>
>
> > ---
> > fs/nfsd/nfs4state.c | 25 +++++++++++++------------
> > 1 file changed, 13 insertions(+), 12 deletions(-)
> >
> > diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
> > index a2c6c85adfc7..952def00363b 100644
> > --- a/fs/nfsd/nfs4state.c
> > +++ b/fs/nfsd/nfs4state.c
> > @@ -506,10 +506,11 @@ static struct nfs4_ol_stateid * nfs4_alloc_stateid(struct nfs4_client *clp)
> > * Each filter is 256 bits. We hash the filehandle to 32bit and use the
> > * low 3 bytes as hash-table indices.
> > *
> > - * 'state_lock', which is always held when block_delegations() is called,
> > - * is used to manage concurrent access. Testing does not need the lock
> > - * except when swapping the two filters.
> > + * 'blocked_delegations_lock', which is always held when block_delegations()
> > + * is called, is used to manage concurrent access. Testing does not need the
> > + * lock except when swapping the two filters.
>
> ...this comment is wrong. blocked_delegations_lock is *not* held when
> block_delegations() is called; it is taken when needed (almost) by
> block_delegations.
>

Thanks, fixed.

> > */
> > +static DEFINE_SPINLOCK(blocked_delegations_lock);
> > static struct bloom_pair {
> > int entries, old_entries;
> > time_t swap_time;
> > @@ -525,7 +526,7 @@ static int delegation_blocked(struct knfsd_fh *fh)
> > if (bd->entries == 0)
> > return 0;
> > if (seconds_since_boot() - bd->swap_time > 30) {
> > - spin_lock(&state_lock);
> > + spin_lock(&blocked_delegations_lock);
> > if (seconds_since_boot() - bd->swap_time > 30) {
> > bd->entries -= bd->old_entries;
> > bd->old_entries = bd->entries;
> > @@ -534,7 +535,7 @@ static int delegation_blocked(struct knfsd_fh *fh)
> > bd->new = 1-bd->new;
> > bd->swap_time = seconds_since_boot();
> > }
> > - spin_unlock(&state_lock);
> > + spin_unlock(&blocked_delegations_lock);
> > }
> > hash = arch_fast_hash(&fh->fh_base, fh->fh_size, 0);
> > if (test_bit(hash&255, bd->set[0]) &&
> > @@ -555,16 +556,16 @@ static void block_delegations(struct knfsd_fh *fh)
> > u32 hash;
> > struct bloom_pair *bd = &blocked_delegations;
> >
> > - lockdep_assert_held(&state_lock);
> > -
> > hash = arch_fast_hash(&fh->fh_base, fh->fh_size, 0);
> >
> > __set_bit(hash&255, bd->set[bd->new]);
> > __set_bit((hash>>8)&255, bd->set[bd->new]);
> > __set_bit((hash>>16)&255, bd->set[bd->new]);
> > + spin_lock(&blocked_delegations_lock);
>
> __set_bit isn't atomic. The spin_lock should be taken *before* these
> __set_bit() calls.
>
> Otherwise, looks fine.
>
> Thanks,
> NeilBrown
>
>

Ok. I guess the worry is that we could end up setting bits in the
middle of swapping the two fields? Makes sense -- fixed in my repo.
I'll send out the updated set later today (it also includes a few nits
that HCH pointed out last week).
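
Roughly, I'd expect the fixed function to look something like this (a
sketch of the reordering Neil suggested, not necessarily the exact
patch that lands):

static void block_delegations(struct knfsd_fh *fh)
{
	u32 hash;
	struct bloom_pair *bd = &blocked_delegations;

	hash = arch_fast_hash(&fh->fh_base, fh->fh_size, 0);

	/* __set_bit isn't atomic, so the bits must be set under the
	 * lock to avoid racing with the filter swap in
	 * delegation_blocked() */
	spin_lock(&blocked_delegations_lock);
	__set_bit(hash&255, bd->set[bd->new]);
	__set_bit((hash>>8)&255, bd->set[bd->new]);
	__set_bit((hash>>16)&255, bd->set[bd->new]);
	if (bd->entries == 0)
		bd->swap_time = seconds_since_boot();
	bd->entries += 1;
	spin_unlock(&blocked_delegations_lock);
}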

As a side note...I wonder how much we'll get in the way of false
positives with this scheme?

Given that we'll always have (or will have had) a nfs4_file
corresponding to this inode, perhaps we'd be better off doing something
like storing (and maybe hashing on) the filehandle in the nfs4_file,
and just ensuring that we hold on to it for 30s or so after the last
put?

Not something I'm looking at doing today, but it might be worth
considering for a later delegations rework.
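
(For what it's worth, a rough back-of-envelope on the false-positive
question, assuming the standard Bloom filter estimate and independent
hash bytes: with 3 bits set per entry in a 256-bit filter, the
false-positive rate after n insertions is about (1 - e^(-3n/256))^3,
i.e. roughly 1% at 20 entries and around 9% at 50. Since both the old
and the new filter are tested, the effective rate is a bit higher
still.)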

> > if (bd->entries == 0)
> > bd->swap_time = seconds_since_boot();
> > bd->entries += 1;
> > + spin_unlock(&blocked_delegations_lock);
> > }
> >
> > static struct nfs4_delegation *
> > @@ -3097,16 +3098,16 @@ void nfsd4_prepare_cb_recall(struct nfs4_delegation *dp)
> > struct nfs4_client *clp = dp->dl_stid.sc_client;
> > struct nfsd_net *nn = net_generic(clp->net, nfsd_net_id);
> >
> > - /*
> > - * We can't do this in nfsd_break_deleg_cb because it is
> > - * already holding inode->i_lock
> > - */
> > - spin_lock(&state_lock);
> > block_delegations(&dp->dl_fh);
> > +
> > /*
> > + * We can't do this in nfsd_break_deleg_cb because it is
> > + * already holding inode->i_lock.
> > + *
> > * If the dl_time != 0, then we know that it has already been
> > * queued for a lease break. Don't queue it again.
> > */
> > + spin_lock(&state_lock);
> > if (dp->dl_time == 0) {
> > dp->dl_time = get_seconds();
> > list_add_tail(&dp->dl_recall_lru, &nn->del_recall_lru);
>


--
Jeff Layton <[email protected]>



2014-07-18 16:28:31

by J. Bruce Fields

[permalink] [raw]
Subject: Re: [PATCH v4 01/10] nfsd: Protect the nfs4_file delegation fields using the fi_lock

On Fri, Jul 18, 2014 at 11:13:27AM -0400, Jeff Layton wrote:
> Move more of the delegation fields to be protected by the fi_lock. It's
> more granular than the state_lock and in later patches we'll want to
> be able to rely on it in addition to the state_lock.
>
> Also, the current code in nfs4_setlease calls vfs_setlease and uses the
> client_mutex to ensure that it doesn't disappear before we can hash the
> delegation. With the client_mutex gone, we'll have a potential race
> condition.
>
> It's possible that the delegation could be recalled after we acquire the
> lease but before we ever get around to hashing it. If that happens, then
> we'd have a nfs4_file that *thinks* it has a delegation, when it
> actually has none.

I understand now, thanks: so the lease break code walks the list of
delegations associated with the file, finds none, and issues no recall,
but the open code continues merrily on and returns a delegation, with
the result that we return the client a delegation that will never be
recalled.

That could be worded more carefully, and would be worth a separate patch
(since the bug predates the new locking).

> Attempt to acquire a delegation. If that succeeds, take the spinlocks
> and then check to see if the file has had a conflict show up since then.
> If it has, then we assume that the lease is no longer valid and that
> we shouldn't hand out a delegation.
>
> There's also one more potential (but very unlikely) problem. If the
> lease is broken before the delegation is hashed, then it could leak.
> In the event that the fi_delegations list is empty, reset the
> fl_break_time to jiffies so that it's cleaned up ASAP by
> the normal lease handling code.

Is there actually any guarantee time_out_leases() will get called on
this inode again?

--b.

>
> Signed-off-by: Trond Myklebust <[email protected]>
> Signed-off-by: Jeff Layton <[email protected]>
> ---
> fs/nfsd/nfs4state.c | 90 +++++++++++++++++++++++++++++++++++++++--------------
> 1 file changed, 66 insertions(+), 24 deletions(-)
>
> diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
> index fd4deb049ddf..9ab067e85b51 100644
> --- a/fs/nfsd/nfs4state.c
> +++ b/fs/nfsd/nfs4state.c
> @@ -624,6 +624,8 @@ nfs4_put_delegation(struct nfs4_delegation *dp)
>
> static void nfs4_put_deleg_lease(struct nfs4_file *fp)
> {
> + lockdep_assert_held(&state_lock);
> +
> if (!fp->fi_lease)
> return;
> if (atomic_dec_and_test(&fp->fi_delegees)) {
> @@ -643,11 +645,10 @@ static void
> hash_delegation_locked(struct nfs4_delegation *dp, struct nfs4_file *fp)
> {
> lockdep_assert_held(&state_lock);
> + lockdep_assert_held(&fp->fi_lock);
>
> dp->dl_stid.sc_type = NFS4_DELEG_STID;
> - spin_lock(&fp->fi_lock);
> list_add(&dp->dl_perfile, &fp->fi_delegations);
> - spin_unlock(&fp->fi_lock);
> list_add(&dp->dl_perclnt, &dp->dl_stid.sc_client->cl_delegations);
> }
>
> @@ -659,17 +660,18 @@ unhash_delegation(struct nfs4_delegation *dp)
>
> spin_lock(&state_lock);
> dp->dl_stid.sc_type = NFS4_CLOSED_DELEG_STID;
> + spin_lock(&fp->fi_lock);
> list_del_init(&dp->dl_perclnt);
> list_del_init(&dp->dl_recall_lru);
> - spin_lock(&fp->fi_lock);
> list_del_init(&dp->dl_perfile);
> spin_unlock(&fp->fi_lock);
> - spin_unlock(&state_lock);
> if (fp) {
> nfs4_put_deleg_lease(fp);
> - put_nfs4_file(fp);
> dp->dl_file = NULL;
> }
> + spin_unlock(&state_lock);
> + if (fp)
> + put_nfs4_file(fp);
> }
>
> static void destroy_revoked_delegation(struct nfs4_delegation *dp)
> @@ -3143,10 +3145,19 @@ static void nfsd_break_deleg_cb(struct file_lock *fl)
> */
> fl->fl_break_time = 0;
>
> - fp->fi_had_conflict = true;
> spin_lock(&fp->fi_lock);
> - list_for_each_entry(dp, &fp->fi_delegations, dl_perfile)
> - nfsd_break_one_deleg(dp);
> + fp->fi_had_conflict = true;
> + /*
> + * If there are no delegations on the list, then we can't count on this
> + * lease ever being cleaned up. Set the fl_break_time to jiffies so that
> + * time_out_leases will do it ASAP. The fact that fi_had_conflict is now
> + * true should keep any new delegations from being hashed.
> + */
> + if (list_empty(&fp->fi_delegations))
> + fl->fl_break_time = jiffies;
> + else
> + list_for_each_entry(dp, &fp->fi_delegations, dl_perfile)
> + nfsd_break_one_deleg(dp);
> spin_unlock(&fp->fi_lock);
> }
>
> @@ -3493,46 +3504,77 @@ static int nfs4_setlease(struct nfs4_delegation *dp)
> {
> struct nfs4_file *fp = dp->dl_file;
> struct file_lock *fl;
> - int status;
> + struct file *filp;
> + int status = 0;
>
> fl = nfs4_alloc_init_lease(fp, NFS4_OPEN_DELEGATE_READ);
> if (!fl)
> return -ENOMEM;
> - fl->fl_file = find_readable_file(fp);
> - status = vfs_setlease(fl->fl_file, fl->fl_type, &fl);
> - if (status)
> - goto out_free;
> + filp = find_readable_file(fp);
> + if (!filp) {
> + /* We should always have a readable file here */
> + WARN_ON_ONCE(1);
> + return -EBADF;
> + }
> + status = vfs_setlease(filp, fl->fl_type, &fl);
> + if (status) {
> + locks_free_lock(fl);
> + goto out_fput;
> + }
> + spin_lock(&state_lock);
> + spin_lock(&fp->fi_lock);
> + /* Did the lease get broken before we took the lock? */
> + status = -EAGAIN;
> + if (fp->fi_had_conflict)
> + goto out_unlock;
> + /* Race breaker */
> + if (fp->fi_lease) {
> + status = 0;
> + atomic_inc(&fp->fi_delegees);
> + hash_delegation_locked(dp, fp);
> + goto out_unlock;
> + }
> fp->fi_lease = fl;
> - fp->fi_deleg_file = fl->fl_file;
> + fp->fi_deleg_file = filp;
> atomic_set(&fp->fi_delegees, 1);
> - spin_lock(&state_lock);
> hash_delegation_locked(dp, fp);
> + spin_unlock(&fp->fi_lock);
> spin_unlock(&state_lock);
> return 0;
> -out_free:
> - if (fl->fl_file)
> - fput(fl->fl_file);
> - locks_free_lock(fl);
> +out_unlock:
> + spin_unlock(&fp->fi_lock);
> + spin_unlock(&state_lock);
> +out_fput:
> + if (filp)
> + fput(filp);
> return status;
> }
>
> static int nfs4_set_delegation(struct nfs4_delegation *dp, struct nfs4_file *fp)
> {
> + int status = 0;
> +
> if (fp->fi_had_conflict)
> return -EAGAIN;
> get_nfs4_file(fp);
> + spin_lock(&state_lock);
> + spin_lock(&fp->fi_lock);
> dp->dl_file = fp;
> - if (!fp->fi_lease)
> + if (!fp->fi_lease) {
> + spin_unlock(&fp->fi_lock);
> + spin_unlock(&state_lock);
> return nfs4_setlease(dp);
> - spin_lock(&state_lock);
> + }
> atomic_inc(&fp->fi_delegees);
> if (fp->fi_had_conflict) {
> - spin_unlock(&state_lock);
> - return -EAGAIN;
> + status = -EAGAIN;
> + goto out_unlock;
> }
> hash_delegation_locked(dp, fp);
> +out_unlock:
> + spin_unlock(&fp->fi_lock);
> spin_unlock(&state_lock);
> - return 0;
> + return status;
> }
>
> static void nfsd4_open_deleg_none_ext(struct nfsd4_open *open, int status)
> --
> 1.9.3
>

2014-07-18 19:04:08

by Jeff Layton

[permalink] [raw]
Subject: Re: [PATCH v4 01/10] nfsd: Protect the nfs4_file delegation fields using the fi_lock

On Fri, 18 Jul 2014 13:49:57 -0400
"J. Bruce Fields" <[email protected]> wrote:

> On Fri, Jul 18, 2014 at 01:31:40PM -0400, Jeff Layton wrote:
> > On Fri, 18 Jul 2014 12:28:25 -0400
> > "J. Bruce Fields" <[email protected]> wrote:
> >
> > > On Fri, Jul 18, 2014 at 11:13:27AM -0400, Jeff Layton wrote:
> > > > Move more of the delegation fields to be protected by the fi_lock. It's
> > > > more granular than the state_lock and in later patches we'll want to
> > > > be able to rely on it in addition to the state_lock.
> > > >
> > > > Also, the current code in nfs4_setlease calls vfs_setlease and uses the
> > > > client_mutex to ensure that it doesn't disappear before we can hash the
> > > > delegation. With the client_mutex gone, we'll have a potential race
> > > > condition.
> > > >
> > > > It's possible that the delegation could be recalled after we acquire the
> > > > lease but before we ever get around to hashing it. If that happens, then
> > > > we'd have a nfs4_file that *thinks* it has a delegation, when it
> > > > actually has none.
> > >
> > > I understand now, thanks: so the lease break code walks the list of
> > > delegations associated with the file, finds none, and issues no recall,
> > > but the open code continues merrily on and returns a delegation, with
> > > the result that we return the client a delegation that will never be
> > > recalled.
> > >
> > > That could be worded more carefully, and would be worth a separate patch
> > > (since the bug predates the new locking).
> > >
> >
> > Yes, that's basically correct. I'd have to think about how to fix that
> > with the current code. It's probably doable if you think it's
> > worthwhile, but I'll need to rebase this set on top of it.
>
> Well, I was wondering if this patch could just be split in two, no need
> to backport further than that.
>

Erm, now that I've looked, I don't think it'll be that easy. The key
here is to ensure that fi_had_conflict is set while holding the
fi_lock. The trick here is that we need to take it in nfs4_setlease as
well, and check the flag before hashing the delegation without dropping
the fi_lock.

> > > > Attempt to acquire a delegation. If that succeeds, take the spinlocks
> > > > and then check to see if the file has had a conflict show up since then.
> > > > If it has, then we assume that the lease is no longer valid and that
> > > > we shouldn't hand out a delegation.
> > > >
> > > > There's also one more potential (but very unlikely) problem. If the
> > > > lease is broken before the delegation is hashed, then it could leak.
> > > > In the event that the fi_delegations list is empty, reset the
> > > > fl_break_time to jiffies so that it's cleaned up ASAP by
> > > > the normal lease handling code.
> > >
> > > Is there actually any guarantee time_out_leases() will get called on
> > > this inode again?
> > >
> > > --b.
> > >
> >
> > Yes. Lease breaks are handled in two phases. We walk the i_flock list
> > and issue a ->lm_break on each lease, and then later we walk the list
> > again after putting the task to sleep, and try to time out the leases.
> > So by doing this, we should ensure that the task will wake up after
> > sleeping and delete it.
>
> In the case of an interrupt or a nonblocking break (which is what nfsd
> will do), then time_out_leases isn't called again from what I could
> tell.
>
> --b.
>

In both cases, time_out_leases is still called at the beginning of
__break_lease. So it'll get cleaned up the next time that function is
called, or when the filp is closed (in locks_remove_file).
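
(To spell out the flow being described, as a simplified sketch using
the names from this thread rather than verbatim kernel code:

	__break_lease(inode, ...)
		time_out_leases(inode);	/* reaps any lease whose
					 * fl_break_time has passed */
		/* then issue ->lm_break on each lease and wait */

	locks_remove_file(filp)
		/* removes whatever leases remain when the filp
		 * is finally closed */

so a lease with fl_break_time set gets picked up by whichever of these
runs next.)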

> >
> > > >
> > > > Signed-off-by: Trond Myklebust <[email protected]>
> > > > Signed-off-by: Jeff Layton <[email protected]>
> > > > ---
> > > > fs/nfsd/nfs4state.c | 90
> > > > +++++++++++++++++++++++++++++++++++++++-------------- 1 file
> > > > changed, 66 insertions(+), 24 deletions(-)
> > > >
> > > > diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
> > > > index fd4deb049ddf..9ab067e85b51 100644
> > > > --- a/fs/nfsd/nfs4state.c
> > > > +++ b/fs/nfsd/nfs4state.c
> > > > @@ -624,6 +624,8 @@ nfs4_put_delegation(struct nfs4_delegation *dp)
> > > >
> > > > static void nfs4_put_deleg_lease(struct nfs4_file *fp)
> > > > {
> > > > + lockdep_assert_held(&state_lock);
> > > > +
> > > > if (!fp->fi_lease)
> > > > return;
> > > > if (atomic_dec_and_test(&fp->fi_delegees)) {
> > > > @@ -643,11 +645,10 @@ static void
> > > > hash_delegation_locked(struct nfs4_delegation *dp, struct
> > > > nfs4_file *fp) {
> > > > lockdep_assert_held(&state_lock);
> > > > + lockdep_assert_held(&fp->fi_lock);
> > > >
> > > > dp->dl_stid.sc_type = NFS4_DELEG_STID;
> > > > - spin_lock(&fp->fi_lock);
> > > > list_add(&dp->dl_perfile, &fp->fi_delegations);
> > > > - spin_unlock(&fp->fi_lock);
> > > > list_add(&dp->dl_perclnt,
> > > > &dp->dl_stid.sc_client->cl_delegations); }
> > > >
> > > > @@ -659,17 +660,18 @@ unhash_delegation(struct nfs4_delegation *dp)
> > > >
> > > > spin_lock(&state_lock);
> > > > dp->dl_stid.sc_type = NFS4_CLOSED_DELEG_STID;
> > > > + spin_lock(&fp->fi_lock);
> > > > list_del_init(&dp->dl_perclnt);
> > > > list_del_init(&dp->dl_recall_lru);
> > > > - spin_lock(&fp->fi_lock);
> > > > list_del_init(&dp->dl_perfile);
> > > > spin_unlock(&fp->fi_lock);
> > > > - spin_unlock(&state_lock);
> > > > if (fp) {
> > > > nfs4_put_deleg_lease(fp);
> > > > - put_nfs4_file(fp);
> > > > dp->dl_file = NULL;
> > > > }
> > > > + spin_unlock(&state_lock);
> > > > + if (fp)
> > > > + put_nfs4_file(fp);
> > > > }
> > > >
> > > > static void destroy_revoked_delegation(struct nfs4_delegation *dp)
> > > > @@ -3143,10 +3145,19 @@ static void nfsd_break_deleg_cb(struct
> > > > file_lock *fl) */
> > > > fl->fl_break_time = 0;
> > > >
> > > > - fp->fi_had_conflict = true;
> > > > spin_lock(&fp->fi_lock);
> > > > - list_for_each_entry(dp, &fp->fi_delegations, dl_perfile)
> > > > - nfsd_break_one_deleg(dp);
> > > > + fp->fi_had_conflict = true;
> > > > + /*
> > > > + * If there are no delegations on the list, then we can't
> > > > count on this
> > > > + * lease ever being cleaned up. Set the fl_break_time to
> > > > jiffies so that
> > > > + * time_out_leases will do it ASAP. The fact that
> > > > fi_had_conflict is now
> > > > + * true should keep any new delegations from being hashed.
> > > > + */
> > > > + if (list_empty(&fp->fi_delegations))
> > > > + fl->fl_break_time = jiffies;
> > > > + else
> > > > + list_for_each_entry(dp, &fp->fi_delegations,
> > > > dl_perfile)
> > > > + nfsd_break_one_deleg(dp);
> > > > spin_unlock(&fp->fi_lock);
> > > > }
> > > >
> > > > @@ -3493,46 +3504,77 @@ static int nfs4_setlease(struct
> > > > nfs4_delegation *dp) {
> > > > struct nfs4_file *fp = dp->dl_file;
> > > > struct file_lock *fl;
> > > > - int status;
> > > > + struct file *filp;
> > > > + int status = 0;
> > > >
> > > > fl = nfs4_alloc_init_lease(fp, NFS4_OPEN_DELEGATE_READ);
> > > > if (!fl)
> > > > return -ENOMEM;
> > > > - fl->fl_file = find_readable_file(fp);
> > > > - status = vfs_setlease(fl->fl_file, fl->fl_type, &fl);
> > > > - if (status)
> > > > - goto out_free;
> > > > + filp = find_readable_file(fp);
> > > > + if (!filp) {
> > > > + /* We should always have a readable file here */
> > > > + WARN_ON_ONCE(1);
> > > > + return -EBADF;
> > > > + }
> > > > + status = vfs_setlease(filp, fl->fl_type, &fl);
> > > > + if (status) {
> > > > + locks_free_lock(fl);
> > > > + goto out_fput;
> > > > + }
> > > > + spin_lock(&state_lock);
> > > > + spin_lock(&fp->fi_lock);
> > > > + /* Did the lease get broken before we took the lock? */
> > > > + status = -EAGAIN;
> > > > + if (fp->fi_had_conflict)
> > > > + goto out_unlock;
> > > > + /* Race breaker */
> > > > + if (fp->fi_lease) {
> > > > + status = 0;
> > > > + atomic_inc(&fp->fi_delegees);
> > > > + hash_delegation_locked(dp, fp);
> > > > + goto out_unlock;
> > > > + }
> > > > fp->fi_lease = fl;
> > > > - fp->fi_deleg_file = fl->fl_file;
> > > > + fp->fi_deleg_file = filp;
> > > > atomic_set(&fp->fi_delegees, 1);
> > > > - spin_lock(&state_lock);
> > > > hash_delegation_locked(dp, fp);
> > > > + spin_unlock(&fp->fi_lock);
> > > > spin_unlock(&state_lock);
> > > > return 0;
> > > > -out_free:
> > > > - if (fl->fl_file)
> > > > - fput(fl->fl_file);
> > > > - locks_free_lock(fl);
> > > > +out_unlock:
> > > > + spin_unlock(&fp->fi_lock);
> > > > + spin_unlock(&state_lock);
> > > > +out_fput:
> > > > + if (filp)
> > > > + fput(filp);
> > > > return status;
> > > > }
> > > >
> > > > static int nfs4_set_delegation(struct nfs4_delegation *dp, struct
> > > > nfs4_file *fp) {
> > > > + int status = 0;
> > > > +
> > > > if (fp->fi_had_conflict)
> > > > return -EAGAIN;
> > > > get_nfs4_file(fp);
> > > > + spin_lock(&state_lock);
> > > > + spin_lock(&fp->fi_lock);
> > > > dp->dl_file = fp;
> > > > - if (!fp->fi_lease)
> > > > + if (!fp->fi_lease) {
> > > > + spin_unlock(&fp->fi_lock);
> > > > + spin_unlock(&state_lock);
> > > > return nfs4_setlease(dp);
> > > > - spin_lock(&state_lock);
> > > > + }
> > > > atomic_inc(&fp->fi_delegees);
> > > > if (fp->fi_had_conflict) {
> > > > - spin_unlock(&state_lock);
> > > > - return -EAGAIN;
> > > > + status = -EAGAIN;
> > > > + goto out_unlock;
> > > > }
> > > > hash_delegation_locked(dp, fp);
> > > > +out_unlock:
> > > > + spin_unlock(&fp->fi_lock);
> > > > spin_unlock(&state_lock);
> > > > - return 0;
> > > > + return status;
> > > > }
> > > >
> > > > static void nfsd4_open_deleg_none_ext(struct nfsd4_open *open, int
> > > > status) --
> > > > 1.9.3
> > > >
> >
> >
> > --
> > Jeff Layton <[email protected]>


--
Jeff Layton <[email protected]>

2014-07-21 13:11:39

by J. Bruce Fields

[permalink] [raw]
Subject: Re: [PATCH v4 10/10] nfsd: give block_delegation and delegation_blocked its own spinlock

On Mon, Jul 21, 2014 at 07:44:12AM -0400, Jeff Layton wrote:
> On Mon, 21 Jul 2014 17:02:54 +1000
> NeilBrown <[email protected]> wrote:
>
> > On Fri, 18 Jul 2014 11:13:36 -0400 Jeff Layton <[email protected]>
> > wrote:
> >
> > > The state lock can be fairly heavily contended, and there's no reason
> > > that nfs4_file lookups and delegation_blocked should be mutually
> > > exclusive. Let's give the new block_delegation code its own spinlock.
> > > It does mean that we'll need to take a different lock in the delegation
> > > break code, but that's not generally as critical to performance.
> > >
> > > Cc: Neil Brown <[email protected]>
> > > Signed-off-by: Jeff Layton <[email protected]>
> >
> > Makes sense, thanks.
> > However.....
> >
> >
> > > ---
> > > fs/nfsd/nfs4state.c | 25 +++++++++++++------------
> > > 1 file changed, 13 insertions(+), 12 deletions(-)
> > >
> > > diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
> > > index a2c6c85adfc7..952def00363b 100644
> > > --- a/fs/nfsd/nfs4state.c
> > > +++ b/fs/nfsd/nfs4state.c
> > > @@ -506,10 +506,11 @@ static struct nfs4_ol_stateid * nfs4_alloc_stateid(struct nfs4_client *clp)
> > > * Each filter is 256 bits. We hash the filehandle to 32bit and use the
> > > * low 3 bytes as hash-table indices.
> > > *
> > > - * 'state_lock', which is always held when block_delegations() is called,
> > > - * is used to manage concurrent access. Testing does not need the lock
> > > - * except when swapping the two filters.
> > > + * 'blocked_delegations_lock', which is always held when block_delegations()
> > > + * is called, is used to manage concurrent access. Testing does not need the
> > > + * lock except when swapping the two filters.
> >
> > ...this comment is wrong. blocked_delegations_lock is *not* held when
> > block_delegations() is called; it is taken when needed (almost) by
> > block_delegations.
> >
>
> Thanks, fixed.
>
> > > */
> > > +static DEFINE_SPINLOCK(blocked_delegations_lock);
> > > static struct bloom_pair {
> > > int entries, old_entries;
> > > time_t swap_time;
> > > @@ -525,7 +526,7 @@ static int delegation_blocked(struct knfsd_fh *fh)
> > > if (bd->entries == 0)
> > > return 0;
> > > if (seconds_since_boot() - bd->swap_time > 30) {
> > > - spin_lock(&state_lock);
> > > + spin_lock(&blocked_delegations_lock);
> > > if (seconds_since_boot() - bd->swap_time > 30) {
> > > bd->entries -= bd->old_entries;
> > > bd->old_entries = bd->entries;
> > > @@ -534,7 +535,7 @@ static int delegation_blocked(struct knfsd_fh *fh)
> > > bd->new = 1-bd->new;
> > > bd->swap_time = seconds_since_boot();
> > > }
> > > - spin_unlock(&state_lock);
> > > + spin_unlock(&blocked_delegations_lock);
> > > }
> > > hash = arch_fast_hash(&fh->fh_base, fh->fh_size, 0);
> > > if (test_bit(hash&255, bd->set[0]) &&
> > > @@ -555,16 +556,16 @@ static void block_delegations(struct knfsd_fh *fh)
> > > u32 hash;
> > > struct bloom_pair *bd = &blocked_delegations;
> > >
> > > - lockdep_assert_held(&state_lock);
> > > -
> > > hash = arch_fast_hash(&fh->fh_base, fh->fh_size, 0);
> > >
> > > __set_bit(hash&255, bd->set[bd->new]);
> > > __set_bit((hash>>8)&255, bd->set[bd->new]);
> > > __set_bit((hash>>16)&255, bd->set[bd->new]);
> > > + spin_lock(&blocked_delegations_lock);
> >
> > __set_bit isn't atomic. The spin_lock should be taken *before* these
> > __set_bit() calls.
> >
> > Otherwise, looks fine.
> >
> > Thanks,
> > NeilBrown
> >
> >
>
> Ok. I guess the worry is that we could end up setting bits in the
> middle of swapping the two fields? Makes sense -- fixed in my repo.
> I'll send out the updated set later today (it also includes a few nits
> that HCH pointed out last week).
>
> As a side note...I wonder how much we'll get in the way of false
> positives with this scheme?
>
> Given that we'll always have (or will have had) a nfs4_file
> corresponding to this inode, perhaps we'd be better off doing something
> like storing (and maybe hashing on) the filehandle in the nfs4_file,
> and just ensuring that we hold on to it for 30s or so after the last
> put?

You don't want to hold a reference to the inode unnecessarily.
(Consider for example the case of a deleted-but-still-opened file, in
which case people can notice if a large file hangs around eating up
space for an extra 30 seconds.) So I suppose you'd put fi_inode on last
close and just make sure the rest of the code is prepared to deal with
nfs4_files without struct inodes. That might make sense to do.

Occasional false positives aren't necessarily a big deal, so the current
approach seems a reasonable compromise for now.

--b.

>
> Not something I'm looking at doing today, but it might be worth
> considering for a later delegations rework.
>
> > > if (bd->entries == 0)
> > > bd->swap_time = seconds_since_boot();
> > > bd->entries += 1;
> > > + spin_unlock(&blocked_delegations_lock);
> > > }
> > >
> > > static struct nfs4_delegation *
> > > @@ -3097,16 +3098,16 @@ void nfsd4_prepare_cb_recall(struct nfs4_delegation *dp)
> > > struct nfs4_client *clp = dp->dl_stid.sc_client;
> > > struct nfsd_net *nn = net_generic(clp->net, nfsd_net_id);
> > >
> > > - /*
> > > - * We can't do this in nfsd_break_deleg_cb because it is
> > > - * already holding inode->i_lock
> > > - */
> > > - spin_lock(&state_lock);
> > > block_delegations(&dp->dl_fh);
> > > +
> > > /*
> > > + * We can't do this in nfsd_break_deleg_cb because it is
> > > + * already holding inode->i_lock.
> > > + *
> > > * If the dl_time != 0, then we know that it has already been
> > > * queued for a lease break. Don't queue it again.
> > > */
> > > + spin_lock(&state_lock);
> > > if (dp->dl_time == 0) {
> > > dp->dl_time = get_seconds();
> > > list_add_tail(&dp->dl_recall_lru, &nn->del_recall_lru);
> >
>
>
> --
> Jeff Layton <[email protected]>



2014-07-18 18:46:23

by Jeff Layton

[permalink] [raw]
Subject: Re: [PATCH v4 01/10] nfsd: Protect the nfs4_file delegation fields using the fi_lock

On Fri, 18 Jul 2014 08:54:02 -0700
Christoph Hellwig <[email protected]> wrote:

> > +out_unlock:
> > + spin_unlock(&fp->fi_lock);
> > + spin_unlock(&state_lock);
> > +out_fput:
> > + if (filp)
> > + fput(filp);
>
> I don't think filp can be NULL here.
>

Correct. I'll fix that...

> > static int nfs4_set_delegation(struct nfs4_delegation *dp, struct nfs4_file *fp)
> > {
> > + int status = 0;
> > +
> > if (fp->fi_had_conflict)
> > return -EAGAIN;
> > get_nfs4_file(fp);
> > + spin_lock(&state_lock);
> > + spin_lock(&fp->fi_lock);
> > dp->dl_file = fp;
> > + if (!fp->fi_lease) {
> > + spin_unlock(&fp->fi_lock);
> > + spin_unlock(&state_lock);
> > return nfs4_setlease(dp);
> > + }
> > atomic_inc(&fp->fi_delegees);
> > if (fp->fi_had_conflict) {
> > + status = -EAGAIN;
> > + goto out_unlock;
> > }
> > hash_delegation_locked(dp, fp);
> > +out_unlock:
> > + spin_unlock(&fp->fi_lock);
> > spin_unlock(&state_lock);
> > + return status;
>
> I have to admit that I didn't have time to go through all the
> surrounding code yet, but is the error handling correct here,
> i.e. no need to drop the file reference and clean up dp->dl_file for any
> error or race case?
>

No, we take a reference to the nfs4_file and then immediately set
dl_file. At that point, once we put the delegation's final reference,
it will also put the file reference.

So, I *think* the error handling is now correct, but please do sanity
check me on this if you're able.
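
For anyone sanity checking, here are the relevant pieces condensed from
the hunks in this series (a sketch, not a literal quote of either
patch):

	/* nfs4_set_delegation(): the file reference is taken up
	 * front and handed to dl_file before anything can fail */
	get_nfs4_file(fp);
	dp->dl_file = fp;

	/* nfs4_put_delegation(): dropping the last delegation
	 * reference also drops the file reference */
	if (atomic_dec_and_test(&dp->dl_stid.sc_count)) {
		if (dp->dl_file)
			put_nfs4_file(dp->dl_file);
		remove_stid(&dp->dl_stid);
		nfs4_free_stid(deleg_slab, &dp->dl_stid);
	}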

--
Jeff Layton <[email protected]>

2014-07-21 07:03:03

by NeilBrown

[permalink] [raw]
Subject: Re: [PATCH v4 10/10] nfsd: give block_delegation and delegation_blocked its own spinlock

On Fri, 18 Jul 2014 11:13:36 -0400 Jeff Layton <[email protected]>
wrote:

> The state lock can be fairly heavily contended, and there's no reason
> that nfs4_file lookups and delegation_blocked should be mutually
> exclusive. Let's give the new block_delegation code its own spinlock.
> It does mean that we'll need to take a different lock in the delegation
> break code, but that's not generally as critical to performance.
>
> Cc: Neil Brown <[email protected]>
> Signed-off-by: Jeff Layton <[email protected]>

Makes sense, thanks.
However.....


> ---
> fs/nfsd/nfs4state.c | 25 +++++++++++++------------
> 1 file changed, 13 insertions(+), 12 deletions(-)
>
> diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
> index a2c6c85adfc7..952def00363b 100644
> --- a/fs/nfsd/nfs4state.c
> +++ b/fs/nfsd/nfs4state.c
> @@ -506,10 +506,11 @@ static struct nfs4_ol_stateid * nfs4_alloc_stateid(struct nfs4_client *clp)
> * Each filter is 256 bits. We hash the filehandle to 32bit and use the
> * low 3 bytes as hash-table indices.
> *
> - * 'state_lock', which is always held when block_delegations() is called,
> - * is used to manage concurrent access. Testing does not need the lock
> - * except when swapping the two filters.
> + * 'blocked_delegations_lock', which is always held when block_delegations()
> + * is called, is used to manage concurrent access. Testing does not need the
> + * lock except when swapping the two filters.

...this comment is wrong. blocked_delegations_lock is *not* held when
block_delegations() is called; it is taken when needed (almost) by
block_delegations.

> */
> +static DEFINE_SPINLOCK(blocked_delegations_lock);
> static struct bloom_pair {
> int entries, old_entries;
> time_t swap_time;
> @@ -525,7 +526,7 @@ static int delegation_blocked(struct knfsd_fh *fh)
> if (bd->entries == 0)
> return 0;
> if (seconds_since_boot() - bd->swap_time > 30) {
> - spin_lock(&state_lock);
> + spin_lock(&blocked_delegations_lock);
> if (seconds_since_boot() - bd->swap_time > 30) {
> bd->entries -= bd->old_entries;
> bd->old_entries = bd->entries;
> @@ -534,7 +535,7 @@ static int delegation_blocked(struct knfsd_fh *fh)
> bd->new = 1-bd->new;
> bd->swap_time = seconds_since_boot();
> }
> - spin_unlock(&state_lock);
> + spin_unlock(&blocked_delegations_lock);
> }
> hash = arch_fast_hash(&fh->fh_base, fh->fh_size, 0);
> if (test_bit(hash&255, bd->set[0]) &&
> @@ -555,16 +556,16 @@ static void block_delegations(struct knfsd_fh *fh)
> u32 hash;
> struct bloom_pair *bd = &blocked_delegations;
>
> - lockdep_assert_held(&state_lock);
> -
> hash = arch_fast_hash(&fh->fh_base, fh->fh_size, 0);
>
> __set_bit(hash&255, bd->set[bd->new]);
> __set_bit((hash>>8)&255, bd->set[bd->new]);
> __set_bit((hash>>16)&255, bd->set[bd->new]);
> + spin_lock(&blocked_delegations_lock);

__set_bit isn't atomic. The spin_lock should be taken *before* these
__set_bit() calls.

Otherwise, looks fine.

Thanks,
NeilBrown


> if (bd->entries == 0)
> bd->swap_time = seconds_since_boot();
> bd->entries += 1;
> + spin_unlock(&blocked_delegations_lock);
> }
>
> static struct nfs4_delegation *
> @@ -3097,16 +3098,16 @@ void nfsd4_prepare_cb_recall(struct nfs4_delegation *dp)
> struct nfs4_client *clp = dp->dl_stid.sc_client;
> struct nfsd_net *nn = net_generic(clp->net, nfsd_net_id);
>
> - /*
> - * We can't do this in nfsd_break_deleg_cb because it is
> - * already holding inode->i_lock
> - */
> - spin_lock(&state_lock);
> block_delegations(&dp->dl_fh);
> +
> /*
> + * We can't do this in nfsd_break_deleg_cb because it is
> + * already holding inode->i_lock.
> + *
> * If the dl_time != 0, then we know that it has already been
> * queued for a lease break. Don't queue it again.
> */
> + spin_lock(&state_lock);
> if (dp->dl_time == 0) {
> dp->dl_time = get_seconds();
> list_add_tail(&dp->dl_recall_lru, &nn->del_recall_lru);



2014-07-18 15:55:03

by Christoph Hellwig

[permalink] [raw]
Subject: Re: [PATCH v4 03/10] nfsd: simplify stateid allocation and file handling

Looks good,

Reviewed-by: Christoph Hellwig <[email protected]>

(although the description still feels a little vague...)

2014-07-21 21:12:32

by Jeff Layton

[permalink] [raw]
Subject: Re: [PATCH v4 01/10] nfsd: Protect the nfs4_file delegation fields using the fi_lock

On Mon, 21 Jul 2014 17:05:31 -0400
"J. Bruce Fields" <[email protected]> wrote:

> On Fri, Jul 18, 2014 at 03:21:49PM -0400, J. Bruce Fields wrote:
> > On Fri, Jul 18, 2014 at 03:04:04PM -0400, Jeff Layton wrote:
> > > On Fri, 18 Jul 2014 13:49:57 -0400
> > > "J. Bruce Fields" <[email protected]> wrote:
> > >
> > > > On Fri, Jul 18, 2014 at 01:31:40PM -0400, Jeff Layton wrote:
> > > > > On Fri, 18 Jul 2014 12:28:25 -0400
> > > > > "J. Bruce Fields" <[email protected]> wrote:
> > > > >
> > > > > > On Fri, Jul 18, 2014 at 11:13:27AM -0400, Jeff Layton wrote:
> > > > > > > Move more of the delegation fields to be protected by the fi_lock. It's
> > > > > > > more granular than the state_lock and in later patches we'll want to
> > > > > > > be able to rely on it in addition to the state_lock.
> > > > > > >
> > > > > > > Also, the current code in nfs4_setlease calls vfs_setlease and uses the
> > > > > > > client_mutex to ensure that it doesn't disappear before we can hash the
> > > > > > > delegation. With the client_mutex gone, we'll have a potential race
> > > > > > > condition.
> > > > > > >
> > > > > > > It's possible that the delegation could be recalled after we acquire the
> > > > > > > lease but before we ever get around to hashing it. If that happens, then
> > > > > > > we'd have a nfs4_file that *thinks* it has a delegation, when it
> > > > > > > actually has none.
> > > > > >
> > > > > > I understand now, thanks: so the lease break code walks the list of
> > > > > > delegations associated with the file, finds none, and issues no recall,
> > > > > > but the open code continues merrily on and returns a delegation, with
> > > > > > the result that we return the client a delegation that will never be
> > > > > > recalled.
> > > > > >
> > > > > > That could be worded more carefully, and would be worth a separate patch
> > > > > > (since the bug predates the new locking).
> > > > > >
> > > > >
> > > > > Yes, that's basically correct. I'd have to think about how to fix that
> > > > > with the current code. It's probably doable if you think it's
> > > > > worthwhile, but I'll need to rebase this set on top of it.
> > > >
> > > > Well, I was wondering if this patch could just be split in two, no need
> > > > to backport further than that.
> > > >
> > >
> > > Erm, now that I've looked, I don't think it'll be that easy. The key
> > > here is to ensure that fi_had_conflict is set while holding the
> > > fi_lock. The trick here is that we need to take it in nfs4_setlease as
> > > well, and check the flag before hashing the delegation without dropping
> > > the fi_lock.
> >
> > OK, I'll live. For the sake of anyone that actually runs across that
> > bug I'll update the summary and changelog to emphasize the bugfix over
> > the locking change.
>
> So, intending to apply as follows.
>
> --b.
>
> commit 417c6629b2d81d5a18d29c4bbb6a9a4c64282a36
> Author: Jeff Layton <[email protected]>
> Date: Mon Jul 21 09:34:57 2014 -0400
>
> nfsd: fix race that grants unrecallable delegation
>
> If nfs4_setlease successfully acquires a new delegation, and another
> task then breaks the delegation before we reach hash_delegation_locked,
> the breaking task will see an empty fi_delegations list and do nothing.
> The client will receive an open reply incorrectly granting a delegation
> and will never receive a recall.
>
> Move more of the delegation fields to be protected by the fi_lock. It's
> more granular than the state_lock and in later patches we'll want to
> be able to rely on it in addition to the state_lock.
>
> Attempt to acquire a delegation. If that succeeds, take the spinlocks
> and then check to see if the file has had a conflict show up since then.
> If it has, then we assume that the lease is no longer valid and that
> we shouldn't hand out a delegation.
>
> There's also one more potential (but very unlikely) problem. If the
> lease is broken before the delegation is hashed, then it could leak.
> In the event that the fi_delegations list is empty, reset the
> fl_break_time to jiffies so that it's cleaned up ASAP by
> the normal lease handling code.
>
> Signed-off-by: Trond Myklebust <[email protected]>
> Signed-off-by: Jeff Layton <[email protected]>
> Reviewed-by: Christoph Hellwig <[email protected]>
> Signed-off-by: J. Bruce Fields <[email protected]>
>
> diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
> index 10cdb67..cc477dd 100644
> --- a/fs/nfsd/nfs4state.c
> +++ b/fs/nfsd/nfs4state.c
> @@ -624,6 +624,8 @@ nfs4_put_delegation(struct nfs4_delegation *dp)
>
> static void nfs4_put_deleg_lease(struct nfs4_file *fp)
> {
> + lockdep_assert_held(&state_lock);
> +
> if (!fp->fi_lease)
> return;
> if (atomic_dec_and_test(&fp->fi_delegees)) {
> @@ -643,11 +645,10 @@ static void
> hash_delegation_locked(struct nfs4_delegation *dp, struct nfs4_file *fp)
> {
> lockdep_assert_held(&state_lock);
> + lockdep_assert_held(&fp->fi_lock);
>
> dp->dl_stid.sc_type = NFS4_DELEG_STID;
> - spin_lock(&fp->fi_lock);
> list_add(&dp->dl_perfile, &fp->fi_delegations);
> - spin_unlock(&fp->fi_lock);
> list_add(&dp->dl_perclnt, &dp->dl_stid.sc_client->cl_delegations);
> }
>
> @@ -659,17 +660,18 @@ unhash_delegation(struct nfs4_delegation *dp)
>
> spin_lock(&state_lock);
> dp->dl_stid.sc_type = NFS4_CLOSED_DELEG_STID;
> + spin_lock(&fp->fi_lock);
> list_del_init(&dp->dl_perclnt);
> list_del_init(&dp->dl_recall_lru);
> - spin_lock(&fp->fi_lock);
> list_del_init(&dp->dl_perfile);
> spin_unlock(&fp->fi_lock);
> - spin_unlock(&state_lock);
> if (fp) {
> nfs4_put_deleg_lease(fp);
> - put_nfs4_file(fp);
> dp->dl_file = NULL;
> }
> + spin_unlock(&state_lock);
> + if (fp)
> + put_nfs4_file(fp);
> }
>
> static void destroy_revoked_delegation(struct nfs4_delegation *dp)
> @@ -3141,10 +3143,19 @@ static void nfsd_break_deleg_cb(struct file_lock *fl)
> */
> fl->fl_break_time = 0;
>
> - fp->fi_had_conflict = true;
> spin_lock(&fp->fi_lock);
> - list_for_each_entry(dp, &fp->fi_delegations, dl_perfile)
> - nfsd_break_one_deleg(dp);
> + fp->fi_had_conflict = true;
> + /*
> + * If there are no delegations on the list, then we can't count on this
> + * lease ever being cleaned up. Set the fl_break_time to jiffies so that
> + * time_out_leases will do it ASAP. The fact that fi_had_conflict is now
> + * true should keep any new delegations from being hashed.
> + */
> + if (list_empty(&fp->fi_delegations))
> + fl->fl_break_time = jiffies;
> + else
> + list_for_each_entry(dp, &fp->fi_delegations, dl_perfile)
> + nfsd_break_one_deleg(dp);
> spin_unlock(&fp->fi_lock);
> }
>
> @@ -3491,46 +3502,77 @@ static int nfs4_setlease(struct nfs4_delegation *dp)
> {
> struct nfs4_file *fp = dp->dl_file;
> struct file_lock *fl;
> - int status;
> + struct file *filp;
> + int status = 0;
>
> fl = nfs4_alloc_init_lease(fp, NFS4_OPEN_DELEGATE_READ);
> if (!fl)
> return -ENOMEM;
> - fl->fl_file = find_readable_file(fp);
> - status = vfs_setlease(fl->fl_file, fl->fl_type, &fl);
> - if (status)
> - goto out_free;
> + filp = find_readable_file(fp);
> + if (!filp) {
> + /* We should always have a readable file here */
> + WARN_ON_ONCE(1);
> + return -EBADF;
> + }
> + fl->fl_file = filp;
> + status = vfs_setlease(filp, fl->fl_type, &fl);
> + if (status) {
> + locks_free_lock(fl);
> + goto out_fput;
> + }
> + spin_lock(&state_lock);
> + spin_lock(&fp->fi_lock);
> + /* Did the lease get broken before we took the lock? */
> + status = -EAGAIN;
> + if (fp->fi_had_conflict)
> + goto out_unlock;
> + /* Race breaker */
> + if (fp->fi_lease) {
> + status = 0;
> + atomic_inc(&fp->fi_delegees);
> + hash_delegation_locked(dp, fp);
> + goto out_unlock;
> + }
> fp->fi_lease = fl;
> - fp->fi_deleg_file = fl->fl_file;
> + fp->fi_deleg_file = filp;
> atomic_set(&fp->fi_delegees, 1);
> - spin_lock(&state_lock);
> hash_delegation_locked(dp, fp);
> + spin_unlock(&fp->fi_lock);
> spin_unlock(&state_lock);
> return 0;
> -out_free:
> - if (fl->fl_file)
> - fput(fl->fl_file);
> - locks_free_lock(fl);
> +out_unlock:
> + spin_unlock(&fp->fi_lock);
> + spin_unlock(&state_lock);
> +out_fput:
> + fput(filp);
> return status;
> }
>
> static int nfs4_set_delegation(struct nfs4_delegation *dp, struct nfs4_file *fp)
> {
> + int status = 0;
> +
> if (fp->fi_had_conflict)
> return -EAGAIN;
> get_nfs4_file(fp);
> + spin_lock(&state_lock);
> + spin_lock(&fp->fi_lock);
> dp->dl_file = fp;
> - if (!fp->fi_lease)
> + if (!fp->fi_lease) {
> + spin_unlock(&fp->fi_lock);
> + spin_unlock(&state_lock);
> return nfs4_setlease(dp);
> - spin_lock(&state_lock);
> + }
> atomic_inc(&fp->fi_delegees);
> if (fp->fi_had_conflict) {
> - spin_unlock(&state_lock);
> - return -EAGAIN;
> + status = -EAGAIN;
> + goto out_unlock;
> }
> hash_delegation_locked(dp, fp);
> +out_unlock:
> + spin_unlock(&fp->fi_lock);
> spin_unlock(&state_lock);
> - return 0;
> + return status;
> }
>
> static void nfsd4_open_deleg_none_ext(struct nfsd4_open *open, int status)

Looks good -- ACK. Thanks for fixing that up...
--
Jeff Layton <[email protected]>

2014-07-18 15:13:53

by Jeff Layton

[permalink] [raw]
Subject: [PATCH v4 07/10] nfsd: drop unused stp arg to alloc_init_deleg

Signed-off-by: Jeff Layton <[email protected]>
---
fs/nfsd/nfs4state.c | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
index 40def186ee50..3e761ea48a70 100644
--- a/fs/nfsd/nfs4state.c
+++ b/fs/nfsd/nfs4state.c
@@ -568,7 +568,7 @@ static void block_delegations(struct knfsd_fh *fh)
}

static struct nfs4_delegation *
-alloc_init_deleg(struct nfs4_client *clp, struct nfs4_ol_stateid *stp, struct svc_fh *current_fh)
+alloc_init_deleg(struct nfs4_client *clp, struct svc_fh *current_fh)
{
struct nfs4_delegation *dp;
long n;
@@ -3650,7 +3650,7 @@ nfs4_open_delegation(struct net *net, struct svc_fh *fh,
default:
goto out_no_deleg;
}
- dp = alloc_init_deleg(oo->oo_owner.so_client, stp, fh);
+ dp = alloc_init_deleg(oo->oo_owner.so_client, fh);
if (dp == NULL)
goto out_no_deleg;
status = nfs4_set_delegation(dp, stp->st_file);
--
1.9.3


2014-07-18 19:21:52

by J. Bruce Fields

[permalink] [raw]
Subject: Re: [PATCH v4 01/10] nfsd: Protect the nfs4_file delegation fields using the fi_lock

On Fri, Jul 18, 2014 at 03:04:04PM -0400, Jeff Layton wrote:
> On Fri, 18 Jul 2014 13:49:57 -0400
> "J. Bruce Fields" <[email protected]> wrote:
>
> > On Fri, Jul 18, 2014 at 01:31:40PM -0400, Jeff Layton wrote:
> > > On Fri, 18 Jul 2014 12:28:25 -0400
> > > "J. Bruce Fields" <[email protected]> wrote:
> > >
> > > > On Fri, Jul 18, 2014 at 11:13:27AM -0400, Jeff Layton wrote:
> > > > > Move more of the delegation fields to be protected by the fi_lock. It's
> > > > > more granular than the state_lock and in later patches we'll want to
> > > > > be able to rely on it in addition to the state_lock.
> > > > >
> > > > > Also, the current code in nfs4_setlease calls vfs_setlease and uses the
> > > > > client_mutex to ensure that it doesn't disappear before we can hash the
> > > > > delegation. With the client_mutex gone, we'll have a potential race
> > > > > condition.
> > > > >
> > > > > It's possible that the delegation could be recalled after we acquire the
> > > > > lease but before we ever get around to hashing it. If that happens, then
> > > > > we'd have a nfs4_file that *thinks* it has a delegation, when it
> > > > > actually has none.
> > > >
> > > > I understand now, thanks: so the lease break code walks the list of
> > > > delegations associated with the file, finds none, and issues no recall,
> > > > but the open code continues merrily on and returns a delegation, with
> > > > the result that we return the client a delegation that will never be
> > > > recalled.
> > > >
> > > > That could be worded more carefully, and would be worth a separate patch
> > > > (since the bug predates the new locking).
> > > >
> > >
> > > Yes, that's basically correct. I'd have to think about how to fix that
> > > with the current code. It's probably doable if you think it's
> > > worthwhile, but I'll need to rebase this set on top of it.
> >
> > Well, I was wondering if this patch could just be split in two, no need
> > to backport further than that.
> >
>
> Erm, now that I've looked, I don't think it'll be that easy. The key
> here is to ensure that fi_had_conflict is set while holding the
> fi_lock. The trick here is that we need to take it in nfs4_setlease as
> well, and check the flag before hashing the delegation without dropping
> the fi_lock.

OK, I'll live. For the sake of anyone that actually runs across that
bug I'll update the summary and changelog to emphasize the bugfix over
the locking change.

> > > > > Attempt to acquire a delegation. If that succeeds, take the spinlocks
> > > > > and then check to see if the file has had a conflict show up since then.
> > > > > If it has, then we assume that the lease is no longer valid and that
> > > > > we shouldn't hand out a delegation.
> > > > >
> > > > > There's also one more potential (but very unlikely) problem. If the
> > > > > lease is broken before the delegation is hashed, then it could leak.
> > > > > In the event that the fi_delegations list is empty, reset the
> > > > > fl_break_time to jiffies so that it's cleaned up ASAP by
> > > > > the normal lease handling code.
> > > >
> > > > Is there actually any guarantee time_out_leases() will get called on
> > > > this inode again?
> > > >
> > > > --b.
> > > >
> > >
> > > Yes. Lease breaks are handled in two phases. We walk the i_flock list
> > > and issue a ->lm_break on each lease, and then later we walk the list
> > > again after putting the task to sleep, and try to time out the leases.
> > > So by doing this, we should ensure that the task will wake up after
> > > sleeping and delete it.
> >
> > In the case of an interrupt or a nonblocking break (which is what nfsd
> > will do), then time_out_leases isn't called again from what I could
> > tell.
> >
> > --b.
> >
>
> In both cases, time_out_leases is still called at the beginning of
> __break_lease. So it'll get cleaned up the next time that function is
> called, or when the filp is closed (in locks_remove_file).

Right, but there's no guarantee another break_lease comes. E.g. the
process waiting on the lease break could get killed.

--b.

2014-07-18 15:13:49

by Jeff Layton

[permalink] [raw]
Subject: [PATCH v4 03/10] nfsd: simplify stateid allocation and file handling

From: Trond Myklebust <[email protected]>

Don't allow stateids to clear the open file pointer until they are
being destroyed. In a later patch we'll want to move the putting of
the nfs4_file into generic stateid handling code.

Also, move to kzalloc and get rid of explicit zeroing of fields.

Signed-off-by: Trond Myklebust <[email protected]>
---
fs/nfsd/nfs4state.c | 22 ++++++++++------------
1 file changed, 10 insertions(+), 12 deletions(-)

diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
index 49a734cd2f20..60ae21abce00 100644
--- a/fs/nfsd/nfs4state.c
+++ b/fs/nfsd/nfs4state.c
@@ -459,7 +459,7 @@ kmem_cache *slab)
struct nfs4_stid *stid;
int new_id;

- stid = kmem_cache_alloc(slab, GFP_KERNEL);
+ stid = kmem_cache_zalloc(slab, GFP_KERNEL);
if (!stid)
return NULL;

@@ -467,11 +467,9 @@ kmem_cache *slab)
if (new_id < 0)
goto out_free;
stid->sc_client = cl;
- stid->sc_type = 0;
stid->sc_stateid.si_opaque.so_id = new_id;
stid->sc_stateid.si_opaque.so_clid = cl->cl_clientid;
/* Will be incremented before return to client: */
- stid->sc_stateid.si_generation = 0;
atomic_set(&stid->sc_count, 1);

/*
@@ -592,10 +590,8 @@ alloc_init_deleg(struct nfs4_client *clp, struct nfs4_ol_stateid *stp, struct sv
INIT_LIST_HEAD(&dp->dl_perfile);
INIT_LIST_HEAD(&dp->dl_perclnt);
INIT_LIST_HEAD(&dp->dl_recall_lru);
- dp->dl_file = NULL;
dp->dl_type = NFS4_OPEN_DELEGATE_READ;
fh_copy_shallow(&dp->dl_fh, &current_fh->fh_handle);
- dp->dl_time = 0;
INIT_WORK(&dp->dl_recall.cb_work, nfsd4_run_cb_recall);
return dp;
}
@@ -616,6 +612,8 @@ void
nfs4_put_delegation(struct nfs4_delegation *dp)
{
if (atomic_dec_and_test(&dp->dl_stid.sc_count)) {
+ if (dp->dl_file)
+ put_nfs4_file(dp->dl_file);
remove_stid(&dp->dl_stid);
nfs4_free_stid(deleg_slab, &dp->dl_stid);
num_delegations--;
@@ -665,13 +663,9 @@ unhash_delegation(struct nfs4_delegation *dp)
list_del_init(&dp->dl_recall_lru);
list_del_init(&dp->dl_perfile);
spin_unlock(&fp->fi_lock);
- if (fp) {
+ if (fp)
nfs4_put_deleg_lease(fp);
- dp->dl_file = NULL;
- }
spin_unlock(&state_lock);
- if (fp)
- put_nfs4_file(fp);
}

static void destroy_revoked_delegation(struct nfs4_delegation *dp)
@@ -879,12 +873,12 @@ static void unhash_generic_stateid(struct nfs4_ol_stateid *stp)
static void close_generic_stateid(struct nfs4_ol_stateid *stp)
{
release_all_access(stp);
- put_nfs4_file(stp->st_file);
- stp->st_file = NULL;
}

static void free_generic_stateid(struct nfs4_ol_stateid *stp)
{
+ if (stp->st_file)
+ put_nfs4_file(stp->st_file);
remove_stid(&stp->st_stid);
nfs4_free_stid(stateid_slab, &stp->st_stid);
}
@@ -4463,6 +4457,10 @@ static void nfsd4_close_open_stateid(struct nfs4_ol_stateid *s)
if (list_empty(&oo->oo_owner.so_stateids))
release_openowner(oo);
} else {
+ if (s->st_file) {
+ put_nfs4_file(s->st_file);
+ s->st_file = NULL;
+ }
oo->oo_last_closed_stid = s;
/*
* In the 4.0 case we need to keep the owners around a
--
1.9.3
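
For reference, the idiom this patch moves to -- keep the file pointer
in place until the last reference goes away, and release it from the
destructor rather than from the close path -- can be sketched in
user-space C. This is an illustrative analogue only (plain ints stand
in for atomic_t, free() for the slab caches; names are made up):

#include <stdlib.h>

struct file_ref { int refcount; };      /* stands in for nfs4_file */

struct stateid {
	int refcount;
	struct file_ref *file;          /* stays set until destruction */
};

static void put_file(struct file_ref *f)
{
	if (--f->refcount == 0)
		free(f);
}

/* Only the final put releases the file; closing the stateid no longer
 * clears ->file, so concurrent holders never see a NULL pointer. */
static void put_stateid(struct stateid *s)
{
	if (--s->refcount)
		return;
	if (s->file)
		put_file(s->file);
	free(s);
}

int main(void)
{
	struct file_ref *f = calloc(1, sizeof(*f));
	struct stateid *s = calloc(1, sizeof(*s));

	f->refcount = 1;
	s->refcount = 1;
	s->file = f;
	put_stateid(s);                 /* final put releases the file too */
	return 0;
}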


2014-07-18 15:13:51

by Jeff Layton

[permalink] [raw]
Subject: [PATCH v4 05/10] nfsd: ensure that clp->cl_revoked list is protected by clp->cl_lock

Currently, both destroy_revoked_delegation and revoke_delegation
manipulate the cl_revoked list without any locking aside from the
client_mutex. Ensure that the clp->cl_lock is held when manipulating it,
except for the list walking in destroy_client. At that point, the client
should no longer be in use, and so it should be safe to walk the list
without any locking. That also means that we don't need to do the
list_splice_init there either.

Also, destroy_revoked_delegation and revoke_delegation currently
remove entries from the dl_recall_lru list without any locking, which
makes it difficult to know whether they're doing so safely in all
cases. Move the list_del_init
calls into the callers, and add WARN_ONs in the event that these calls
are passed a delegation that has a non-empty list_head.

Signed-off-by: Jeff Layton <[email protected]>
---
fs/nfsd/nfs4state.c | 21 +++++++++++++++------
1 file changed, 15 insertions(+), 6 deletions(-)

diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
index 7c5233427c9b..d11b298e625e 100644
--- a/fs/nfsd/nfs4state.c
+++ b/fs/nfsd/nfs4state.c
@@ -669,7 +669,7 @@ unhash_delegation_locked(struct nfs4_delegation *dp)

static void destroy_revoked_delegation(struct nfs4_delegation *dp)
{
- list_del_init(&dp->dl_recall_lru);
+ WARN_ON(!list_empty(&dp->dl_recall_lru));
nfs4_put_delegation(dp);
}

@@ -685,11 +685,15 @@ static void revoke_delegation(struct nfs4_delegation *dp)
{
struct nfs4_client *clp = dp->dl_stid.sc_client;

+ WARN_ON(!list_empty(&dp->dl_recall_lru));
+
if (clp->cl_minorversion == 0)
destroy_revoked_delegation(dp);
else {
dp->dl_stid.sc_type = NFS4_REVOKED_DELEG_STID;
- list_move(&dp->dl_recall_lru, &clp->cl_revoked);
+ spin_lock(&clp->cl_lock);
+ list_add(&dp->dl_recall_lru, &clp->cl_revoked);
+ spin_unlock(&clp->cl_lock);
}
}

@@ -1458,9 +1462,9 @@ destroy_client(struct nfs4_client *clp)
list_del_init(&dp->dl_recall_lru);
nfs4_put_delegation(dp);
}
- list_splice_init(&clp->cl_revoked, &reaplist);
- while (!list_empty(&reaplist)) {
+ while (!list_empty(&clp->cl_revoked)) {
- dp = list_entry(reaplist.next, struct nfs4_delegation, dl_recall_lru);
+ dp = list_first_entry(&clp->cl_revoked, struct nfs4_delegation, dl_recall_lru);
+ list_del_init(&dp->dl_recall_lru);
destroy_revoked_delegation(dp);
}
while (!list_empty(&clp->cl_openowners)) {
@@ -3899,8 +3903,10 @@ nfs4_laundromat(struct nfsd_net *nn)
list_add(&dp->dl_recall_lru, &reaplist);
}
spin_unlock(&state_lock);
- list_for_each_safe(pos, next, &reaplist) {
- dp = list_entry (pos, struct nfs4_delegation, dl_recall_lru);
+ while (!list_empty(&reaplist)) {
+ dp = list_first_entry(&reaplist, struct nfs4_delegation,
+ dl_recall_lru);
+ list_del_init(&dp->dl_recall_lru);
revoke_delegation(dp);
}
list_for_each_safe(pos, next, &nn->close_lru) {
@@ -4244,6 +4250,9 @@ nfsd4_free_stateid(struct svc_rqst *rqstp, struct nfsd4_compound_state *cstate,
break;
case NFS4_REVOKED_DELEG_STID:
dp = delegstateid(s);
+ spin_lock(&cl->cl_lock);
+ list_del_init(&dp->dl_recall_lru);
+ spin_unlock(&cl->cl_lock);
destroy_revoked_delegation(dp);
ret = nfs_ok;
break;
--
1.9.3
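
The locking rule the patch establishes -- unlink the entry under
clp->cl_lock in the caller, and have the teardown path merely assert
that the entry is already unlinked -- looks roughly like this in a
user-space sketch (pthread mutex standing in for cl_lock, assert() for
WARN_ON; names are illustrative):

#include <assert.h>
#include <pthread.h>

struct node { struct node *prev, *next; };      /* toy list_head */

static void list_del_init(struct node *n)
{
	n->prev->next = n->next;
	n->next->prev = n->prev;
	n->next = n->prev = n;                  /* empty again */
}

static int list_empty(const struct node *n) { return n->next == n; }

static pthread_mutex_t cl_lock = PTHREAD_MUTEX_INITIALIZER;

/* The destructor no longer touches the list itself; it only checks
 * that the caller already unlinked the entry (the WARN_ON above). */
static void destroy_revoked(struct node *dl_recall_lru)
{
	assert(list_empty(dl_recall_lru));
	/* ...drop the final reference here... */
}

static void free_stateid_path(struct node *dl_recall_lru)
{
	pthread_mutex_lock(&cl_lock);
	list_del_init(dl_recall_lru);
	pthread_mutex_unlock(&cl_lock);
	destroy_revoked(dl_recall_lru);
}

int main(void)
{
	struct node head = { &head, &head };
	struct node dp = { &head, &head };      /* linked after head */

	head.next = head.prev = &dp;
	free_stateid_path(&dp);
	return 0;
}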


2014-07-18 17:23:44

by Jeff Layton

[permalink] [raw]
Subject: Re: [PATCH v4 09/10] nfsd: clean up nfs4_set_delegation

On Fri, 18 Jul 2014 10:19:30 -0700
Christoph Hellwig <[email protected]> wrote:

> > +nfs4_set_delegation(struct nfs4_client *clp, struct svc_fh *fh,
> > + struct nfs4_file *fp)
> > {
> > - int status = 0;
> > + int status;
> > + struct nfs4_delegation *dp;
> >
> > if (fp->fi_had_conflict)
> > - return -EAGAIN;
> > + return ERR_PTR(-EAGAIN);
> > +
> > + dp = alloc_init_deleg(clp, fh);
> > + if (!dp)
> > + return ERR_PTR(-ENOMEM);
> > +
> > get_nfs4_file(fp);
>
> Seems like we should pass the file pointer to alloc_init_deleg as well
> so it can set that one up and grab the reference without any lock?
>
> Otherwise looks good,
>
> Reviewed-by: Christoph Hellwig <[email protected]>

I actually started to do a patch that did that, but the error handling
turned out to be a little twisted. Might be reasonable for a future
cleanup though.

--
Jeff Layton <[email protected]>
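
The hunk above leans on the kernel's ERR_PTR()/IS_ERR() convention so
nfs4_set_delegation can hand back either a delegation or an errno
without a separate status argument. A minimal user-space rendition of
that convention, for readers unfamiliar with it (values and names here
are illustrative):

#include <errno.h>
#include <stdio.h>

static inline void *ERR_PTR(long error) { return (void *)error; }
static inline long PTR_ERR(const void *ptr) { return (long)ptr; }
static inline int IS_ERR(const void *ptr)
{
	/* errnos live in the top 4095 values of the address space */
	return (unsigned long)ptr >= (unsigned long)-4095;
}

struct delegation { int dummy; };

static struct delegation *set_delegation(int had_conflict)
{
	static struct delegation dp;

	if (had_conflict)
		return ERR_PTR(-EAGAIN);        /* no allocation at all */
	return &dp;
}

int main(void)
{
	struct delegation *dp = set_delegation(1);

	if (IS_ERR(dp))
		printf("error: %ld\n", PTR_ERR(dp));
	return 0;
}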

2014-07-21 21:18:06

by J. Bruce Fields

[permalink] [raw]
Subject: Re: [PATCH v4 10/10] nfsd: give block_delegation and delegation_blocked its own spinlock

On Tue, Jul 22, 2014 at 06:40:49AM +1000, NeilBrown wrote:
> On Mon, 21 Jul 2014 07:44:12 -0400 Jeff Layton <[email protected]>
> wrote:
>
> > On Mon, 21 Jul 2014 17:02:54 +1000
> > NeilBrown <[email protected]> wrote:
>
> > > > hash = arch_fast_hash(&fh->fh_base, fh->fh_size, 0);
> > > >
> > > > __set_bit(hash&255, bd->set[bd->new]);
> > > > __set_bit((hash>>8)&255, bd->set[bd->new]);
> > > > __set_bit((hash>>16)&255, bd->set[bd->new]);
> > > > + spin_lock(&blocked_delegations_lock);
> > >
> > > __set_bit isn't atomic. The spin_lock should be taken *before* these
> > > __set_bit() calls.
> > >
> > > Otherwise, looks fine.
> > >
> > > Thanks,
> > > NeilBrown
> > >
> > >
> >
> > Ok. I guess the worry is that we could end up setting bits in the
> > middle of swapping the two fields? Makes sense -- fixed in my repo.
>
> It is more subtle than that.
> __set_bit() will:
> read a value from memory to a register
> set a bit in the register
> write the register back out to memory
>
> If two threads both run __set_bit on the same word of memory at the same
> time, one of the updates can get lost.
> set_bit() (no underscore) performs an atomic RMW to avoid this, but is more
> expensive.
> spin_lock() obviously ensures the required exclusion and as we are going to
> take the lock anyway we may as well take it before setting bits so we can use
> the non-atomic (cheaper) __set_bit function.
>
> > I'll send out the updated set later today (it also includes a few nits
> > that HCH pointed out last week).
> >
> > As a side note...I wonder how much we'll get in the way of false
> > positives with this scheme?
>
> If a future version of NFSv4 could allow delegations to be granted while a
> file is open (oh, it seems you are the only client using this file at the
> moment, you can treat this "open" as a delegation if you like) a few false
> positives would be a complete non-issue.

For what it's worth, I think 4.1 provides what you're asking for here;
see

http://tools.ietf.org/html/rfc5661#section-20.7

and the discussion of the various WANT_ flags in

http://tools.ietf.org/html/rfc5661#section-18.16.3

As far as I know none of that is implemented yet.

--b.

2014-07-22 15:00:07

by J. Bruce Fields

[permalink] [raw]
Subject: Re: [PATCH v4 10/10] nfsd: give block_delegation and delegation_blocked its own spinlock

On Tue, Jul 22, 2014 at 08:50:25AM +1000, NeilBrown wrote:
> On Mon, 21 Jul 2014 17:17:57 -0400 "J. Bruce Fields" <[email protected]>
> wrote:
>
> > On Tue, Jul 22, 2014 at 06:40:49AM +1000, NeilBrown wrote:
> > > On Mon, 21 Jul 2014 07:44:12 -0400 Jeff Layton <[email protected]>
> > > wrote:
> > >
> > > > On Mon, 21 Jul 2014 17:02:54 +1000
> > > > NeilBrown <[email protected]> wrote:
> > >
> > > > > > hash = arch_fast_hash(&fh->fh_base, fh->fh_size, 0);
> > > > > >
> > > > > > __set_bit(hash&255, bd->set[bd->new]);
> > > > > > __set_bit((hash>>8)&255, bd->set[bd->new]);
> > > > > > __set_bit((hash>>16)&255, bd->set[bd->new]);
> > > > > > + spin_lock(&blocked_delegations_lock);
> > > > >
> > > > > __set_bit isn't atomic. The spin_lock should be taken *before* these
> > > > > __set_bit() calls.
> > > > >
> > > > > Otherwise, looks fine.
> > > > >
> > > > > Thanks,
> > > > > NeilBrown
> > > > >
> > > > >
> > > >
> > > > Ok. I guess the worry is that we could end up setting bits in the
> > > > middle of swapping the two fields? Makes sense -- fixed in my repo.
> > >
> > > It is more subtle than that.
> > > __set_bit() will:
> > > read a value from memory to a register
> > > set a bit in the register
> > > write the register back out to memory
> > >
> > > If two threads both run __set_bit on the same word of memory at the same
> > > time, one of the updates can get lost.
> > > set_bit() (no underscore) performs an atomic RMW to avoid this, but is more
> > > expensive.
> > > spin_lock() obviously ensures the required exclusion and as we are going to
> > > take the lock anyway we may as well take it before setting bits so we can use
> > > the non-atomic (cheaper) __set_bit function.
> > >
> > > > I'll send out the updated set later today (it also includes a few nits
> > > > that HCH pointed out last week).
> > > >
> > > > As a side note...I wonder how much we'll get in the way of false
> > > > positives with this scheme?
> > >
> > > If a future version of NFSv4 could allow delegations to be granted while a
> > > file is open (oh, it seems you are the only client using this file at the
> > > moment, you can treat this "open" as a delegation if you like) a few false
> > > positives would be a complete non-issue.
> >
> > For what it's worth, I think 4.1 provides what you're asking for here;
> > see
> >
> > http://tools.ietf.org/html/rfc5661#section-20.7
> >
> > and the discussion of the various WANT_ flags in
> >
> > http://tools.ietf.org/html/rfc5661#section-18.16.3
> >
> > As far as I know none of that is implemented yet.
> >
> > --b.
>
> I guess I should really read the 4.1 (and 4.2) spec some day....
> Though the 20.7 section seems to be about saying "resources in general are
> available" rather than "this specific file that you wanted a delegation for
> but didn't get one is how up for delegation"....
> But I only had a quick read so I might have missed something.

It was me that missed something, looks like CB_PUSH_DELEG is what you
actually want, not CB_RECALL_OBJ_AVAIL:

http://tools.ietf.org/html/rfc5661#section-20.5

NFS4.1: it's the whole bell-and-whistle orchestra.

--b.

2014-07-18 15:13:48

by Jeff Layton

[permalink] [raw]
Subject: [PATCH v4 02/10] nfsd: Move the delegation reference counter into the struct nfs4_stid

From: Trond Myklebust <[email protected]>

We will want to add reference counting to the lock stateid and open
stateids too in later patches.

Signed-off-by: Trond Myklebust <[email protected]>
Reviewed-by: Christoph Hellwig <[email protected]>
---
fs/nfsd/nfs4state.c | 6 +++---
fs/nfsd/state.h | 2 +-
2 files changed, 4 insertions(+), 4 deletions(-)

diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
index 9ab067e85b51..49a734cd2f20 100644
--- a/fs/nfsd/nfs4state.c
+++ b/fs/nfsd/nfs4state.c
@@ -472,6 +472,7 @@ kmem_cache *slab)
stid->sc_stateid.si_opaque.so_clid = cl->cl_clientid;
/* Will be incremented before return to client: */
stid->sc_stateid.si_generation = 0;
+ atomic_set(&stid->sc_count, 1);

/*
* It shouldn't be a problem to reuse an opaque stateid value.
@@ -595,7 +596,6 @@ alloc_init_deleg(struct nfs4_client *clp, struct nfs4_ol_stateid *stp, struct sv
dp->dl_type = NFS4_OPEN_DELEGATE_READ;
fh_copy_shallow(&dp->dl_fh, &current_fh->fh_handle);
dp->dl_time = 0;
- atomic_set(&dp->dl_count, 1);
INIT_WORK(&dp->dl_recall.cb_work, nfsd4_run_cb_recall);
return dp;
}
@@ -615,7 +615,7 @@ static void nfs4_free_stid(struct kmem_cache *slab, struct nfs4_stid *s)
void
nfs4_put_delegation(struct nfs4_delegation *dp)
{
- if (atomic_dec_and_test(&dp->dl_count)) {
+ if (atomic_dec_and_test(&dp->dl_stid.sc_count)) {
remove_stid(&dp->dl_stid);
nfs4_free_stid(deleg_slab, &dp->dl_stid);
num_delegations--;
@@ -3120,7 +3120,7 @@ static void nfsd_break_one_deleg(struct nfs4_delegation *dp)
* lock) we know the server hasn't removed the lease yet, we know
* it's safe to take a reference.
*/
- atomic_inc(&dp->dl_count);
+ atomic_inc(&dp->dl_stid.sc_count);
nfsd4_cb_recall(dp);
}

diff --git a/fs/nfsd/state.h b/fs/nfsd/state.h
index 996d61eeb357..e68a9ae30fd7 100644
--- a/fs/nfsd/state.h
+++ b/fs/nfsd/state.h
@@ -73,6 +73,7 @@ struct nfsd4_callback {
};

struct nfs4_stid {
+ atomic_t sc_count;
#define NFS4_OPEN_STID 1
#define NFS4_LOCK_STID 2
#define NFS4_DELEG_STID 4
@@ -91,7 +92,6 @@ struct nfs4_delegation {
struct list_head dl_perfile;
struct list_head dl_perclnt;
struct list_head dl_recall_lru; /* delegation recalled */
- atomic_t dl_count; /* ref count */
struct nfs4_file *dl_file;
u32 dl_type;
time_t dl_time;
--
1.9.3
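
The payoff of hoisting the counter into the base structure is that one
get/put pair can eventually serve every stateid type. A hedged sketch
of the shape in plain user-space C (container_of open-coded, free()
standing in for the slab free; illustrative only):

#include <stddef.h>
#include <stdlib.h>

struct stid {                   /* common base, like nfs4_stid */
	int sc_count;           /* the kernel uses atomic_t */
};

struct delegation {             /* like nfs4_delegation */
	struct stid dl_stid;    /* refcount now lives in the base */
	/* ...delegation-only fields... */
};

#define container_of(ptr, type, member) \
	((type *)((char *)(ptr) - offsetof(type, member)))

static struct delegation *alloc_delegation(void)
{
	struct delegation *dp = calloc(1, sizeof(*dp));

	if (dp)
		dp->dl_stid.sc_count = 1;   /* kernel: atomic_set(..., 1) */
	return dp;
}

static void put_stid(struct stid *s)
{
	if (--s->sc_count == 0)
		free(container_of(s, struct delegation, dl_stid));
}

int main(void)
{
	struct delegation *dp = alloc_delegation();

	if (dp)
		put_stid(&dp->dl_stid);
	return 0;
}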


2014-07-18 15:57:49

by Christoph Hellwig

[permalink] [raw]
Subject: Re: [PATCH v4 08/10] nfsd: clean up arguments to nfs4_open_delegation

On Fri, Jul 18, 2014 at 11:13:34AM -0400, Jeff Layton wrote:
> No need to pass in a net pointer since we can derive that.
>
> Signed-off-by: Jeff Layton <[email protected]>

Looks good,

Reviewed-by: Christoph Hellwig <[email protected]>

2014-07-18 15:57:10

by Christoph Hellwig

[permalink] [raw]
Subject: Re: [PATCH v4 05/10] nfsd: ensure that clp->cl_revoked list is protected by clp->cl_lock

> static void destroy_revoked_delegation(struct nfs4_delegation *dp)
> {
> - list_del_init(&dp->dl_recall_lru);
> + WARN_ON(!list_empty(&dp->dl_recall_lru));
> nfs4_put_delegation(dp);
> }

Is there any point in keeping destroy_revoked_delegation and not just
calling nfs4_put_delegation directly?

2014-07-18 17:19:31

by Christoph Hellwig

[permalink] [raw]
Subject: Re: [PATCH v4 09/10] nfsd: clean up nfs4_set_delegation

> +nfs4_set_delegation(struct nfs4_client *clp, struct svc_fh *fh,
> + struct nfs4_file *fp)
> {
> - int status = 0;
> + int status;
> + struct nfs4_delegation *dp;
>
> if (fp->fi_had_conflict)
> - return -EAGAIN;
> + return ERR_PTR(-EAGAIN);
> +
> + dp = alloc_init_deleg(clp, fh);
> + if (!dp)
> + return ERR_PTR(-ENOMEM);
> +
> get_nfs4_file(fp);

Seems like we should pass the file pointer to alloc_init_deleg as well
so it can set that one up and grab the reference without any lock?

Otherwise looks good,

Reviewed-by: Christoph Hellwig <[email protected]>

2014-07-21 13:23:42

by Jeff Layton

[permalink] [raw]
Subject: Re: [PATCH v4 10/10] nfsd: give block_delegation and delegation_blocked its own spinlock

On Mon, 21 Jul 2014 09:11:27 -0400
"J. Bruce Fields" <[email protected]> wrote:

> On Mon, Jul 21, 2014 at 07:44:12AM -0400, Jeff Layton wrote:
> > On Mon, 21 Jul 2014 17:02:54 +1000
> > NeilBrown <[email protected]> wrote:
> >
> > > On Fri, 18 Jul 2014 11:13:36 -0400 Jeff Layton <[email protected]>
> > > wrote:
> > >
> > > > The state lock can be fairly heavily contended, and there's no reason
> > > > that nfs4_file lookups and delegation_blocked should be mutually
> > > > exclusive. Let's give the new block_delegation code its own spinlock.
> > > > It does mean that we'll need to take a different lock in the delegation
> > > > break code, but that's not generally as critical to performance.
> > > >
> > > > Cc: Neil Brown <[email protected]>
> > > > Signed-off-by: Jeff Layton <[email protected]>
> > >
> > > Makes sense, thanks.
> > > However.....
> > >
> > >
> > > > ---
> > > > fs/nfsd/nfs4state.c | 25 +++++++++++++------------
> > > > 1 file changed, 13 insertions(+), 12 deletions(-)
> > > >
> > > > diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
> > > > index a2c6c85adfc7..952def00363b 100644
> > > > --- a/fs/nfsd/nfs4state.c
> > > > +++ b/fs/nfsd/nfs4state.c
> > > > @@ -506,10 +506,11 @@ static struct nfs4_ol_stateid * nfs4_alloc_stateid(struct nfs4_client *clp)
> > > > * Each filter is 256 bits. We hash the filehandle to 32bit and use the
> > > > * low 3 bytes as hash-table indices.
> > > > *
> > > > - * 'state_lock', which is always held when block_delegations() is called,
> > > > - * is used to manage concurrent access. Testing does not need the lock
> > > > - * except when swapping the two filters.
> > > > + * 'blocked_delegations_lock', which is always held when block_delegations()
> > > > + * is called, is used to manage concurrent access. Testing does not need the
> > > > + * lock except when swapping the two filters.
> > >
> > > ...this comment is wrong. blocked_delegations_lock is *not* held when
> > > block_delegations() is called; it is taken when needed (almost) by
> > > block_delegations.
> > >
> >
> > Thanks, fixed.
> >
> > > > */
> > > > +static DEFINE_SPINLOCK(blocked_delegations_lock);
> > > > static struct bloom_pair {
> > > > int entries, old_entries;
> > > > time_t swap_time;
> > > > @@ -525,7 +526,7 @@ static int delegation_blocked(struct knfsd_fh *fh)
> > > > if (bd->entries == 0)
> > > > return 0;
> > > > if (seconds_since_boot() - bd->swap_time > 30) {
> > > > - spin_lock(&state_lock);
> > > > + spin_lock(&blocked_delegations_lock);
> > > > if (seconds_since_boot() - bd->swap_time > 30) {
> > > > bd->entries -= bd->old_entries;
> > > > bd->old_entries = bd->entries;
> > > > @@ -534,7 +535,7 @@ static int delegation_blocked(struct knfsd_fh *fh)
> > > > bd->new = 1-bd->new;
> > > > bd->swap_time = seconds_since_boot();
> > > > }
> > > > - spin_unlock(&state_lock);
> > > > + spin_unlock(&blocked_delegations_lock);
> > > > }
> > > > hash = arch_fast_hash(&fh->fh_base, fh->fh_size, 0);
> > > > if (test_bit(hash&255, bd->set[0]) &&
> > > > @@ -555,16 +556,16 @@ static void block_delegations(struct knfsd_fh *fh)
> > > > u32 hash;
> > > > struct bloom_pair *bd = &blocked_delegations;
> > > >
> > > > - lockdep_assert_held(&state_lock);
> > > > -
> > > > hash = arch_fast_hash(&fh->fh_base, fh->fh_size, 0);
> > > >
> > > > __set_bit(hash&255, bd->set[bd->new]);
> > > > __set_bit((hash>>8)&255, bd->set[bd->new]);
> > > > __set_bit((hash>>16)&255, bd->set[bd->new]);
> > > > + spin_lock(&blocked_delegations_lock);
> > >
> > > __set_bit isn't atomic. The spin_lock should be taken *before* these
> > > __set_bit() calls.
> > >
> > > Otherwise, looks fine.
> > >
> > > Thanks,
> > > NeilBrown
> > >
> > >
> >
> > Ok. I guess the worry is that we could end up setting bits in the
> > middle of swapping the two fields? Makes sense -- fixed in my repo.
> > I'll send out the updated set later today (it also includes a few nits
> > that HCH pointed out last week).
> >
> > As a side note...I wonder how much we'll get in the way of false
> > positives with this scheme?
> >
> > Given that we'll always have (or will have had) a nfs4_file
> > corresponding to this inode, perhaps we'd be better off doing something
> > like storing (and maybe hashing on) the filehandle in the nfs4_file,
> > and just ensuring that we hold on to it for 30s or so after the last
> > put?
>
> You don't want to hold a reference to the inode unnecessarily.
> (Consider for example the case of a deleted-but-still-opened file, in
> which case people can notice if a large file hangs around eating up
> space for an extra 30 seconds.) So I suppose you'd put fi_inode on last
> close and just make sure the rest of the code is prepared to deal with
> nfs4_file's with struct inodes. That might make sense to do.
>

Yeah, that's what I was thinking. Change the code to hash the nfs4_file
based on filehandle instead of inode (which may make sense anyway), and
then just keep it around for a little while to handle delegation checks
without pinning down any vfs objects. We could institute some sort of
LRU collection of unused nfs4_files too to ensure the cache doesn't
grow too large.

> Occasional false positives aren't necessarily a big deal, so the current
> approach seems a reasonable compromise for now.
>

Right, it may be no big deal at all, but the question is -- "how often
do we hit false positives here?" I imagine it depends on workload to
some degree.

Is there some way we could sanity check the hit/miss rate without
needing to do too much tracking?

Anyway...it's more food for thought for later work in this area...

> >
> > Not something I'm looking at doing today, but it might be worth
> > considering for a later delegations rework.
> >
> > > > if (bd->entries == 0)
> > > > bd->swap_time = seconds_since_boot();
> > > > bd->entries += 1;
> > > > + spin_unlock(&blocked_delegations_lock);
> > > > }
> > > >
> > > > static struct nfs4_delegation *
> > > > @@ -3097,16 +3098,16 @@ void nfsd4_prepare_cb_recall(struct nfs4_delegation *dp)
> > > > struct nfs4_client *clp = dp->dl_stid.sc_client;
> > > > struct nfsd_net *nn = net_generic(clp->net, nfsd_net_id);
> > > >
> > > > - /*
> > > > - * We can't do this in nfsd_break_deleg_cb because it is
> > > > - * already holding inode->i_lock
> > > > - */
> > > > - spin_lock(&state_lock);
> > > > block_delegations(&dp->dl_fh);
> > > > +
> > > > /*
> > > > + * We can't do this in nfsd_break_deleg_cb because it is
> > > > + * already holding inode->i_lock.
> > > > + *
> > > > * If the dl_time != 0, then we know that it has already been
> > > > * queued for a lease break. Don't queue it again.
> > > > */
> > > > + spin_lock(&state_lock);
> > > > if (dp->dl_time == 0) {
> > > > dp->dl_time = get_seconds();
> > > > list_add_tail(&dp->dl_recall_lru, &nn->del_recall_lru);
> > >
> >
> >
> > --
> > Jeff Layton <[email protected]>
>
>


--
Jeff Layton <[email protected]>
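
For readers following along, the scheme under discussion is roughly
this (a user-space sketch, with an assumed 32-bit hash in place of
arch_fast_hash and a pthread mutex in place of
blocked_delegations_lock; the kernel code differs in detail):

#include <pthread.h>
#include <stdint.h>
#include <string.h>
#include <time.h>

static pthread_mutex_t bd_lock = PTHREAD_MUTEX_INITIALIZER;
static struct {
	unsigned char set[2][32];       /* two 256-bit Bloom filters */
	int entries, old_entries, new;
	time_t swap_time;
} bd;

static void set8(unsigned char *f, uint32_t b) { f[b / 8] |= 1 << (b % 8); }
static int test8(const unsigned char *f, uint32_t b)
{
	return f[b / 8] & (1 << (b % 8));
}

static void swap_if_stale(void)         /* caller holds bd_lock */
{
	if (time(NULL) - bd.swap_time > 30) {
		bd.entries -= bd.old_entries;
		bd.old_entries = bd.entries;
		bd.new = 1 - bd.new;
		memset(bd.set[bd.new], 0, 32);  /* fresh accumulator */
		bd.swap_time = time(NULL);
	}
}

static void block_delegations(uint32_t hash)
{
	/* lock taken before the non-atomic bit sets, per the review */
	pthread_mutex_lock(&bd_lock);
	swap_if_stale();
	set8(bd.set[bd.new], hash & 255);
	set8(bd.set[bd.new], (hash >> 8) & 255);
	set8(bd.set[bd.new], (hash >> 16) & 255);
	if (bd.entries++ == 0)
		bd.swap_time = time(NULL);
	pthread_mutex_unlock(&bd_lock);
}

static int delegation_blocked(uint32_t hash)
{
	int i;

	if (bd.entries == 0)
		return 0;
	for (i = 0; i < 2; i++) {
		if (test8(bd.set[i], hash & 255) &&
		    test8(bd.set[i], (hash >> 8) & 255) &&
		    test8(bd.set[i], (hash >> 16) & 255))
			return 1;       /* "blocked" (maybe a false positive) */
	}
	return 0;
}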

2014-07-21 20:41:03

by NeilBrown

[permalink] [raw]
Subject: Re: [PATCH v4 10/10] nfsd: give block_delegation and delegation_blocked its own spinlock

On Mon, 21 Jul 2014 07:44:12 -0400 Jeff Layton <[email protected]>
wrote:

> On Mon, 21 Jul 2014 17:02:54 +1000
> NeilBrown <[email protected]> wrote:

> > > hash = arch_fast_hash(&fh->fh_base, fh->fh_size, 0);
> > >
> > > __set_bit(hash&255, bd->set[bd->new]);
> > > __set_bit((hash>>8)&255, bd->set[bd->new]);
> > > __set_bit((hash>>16)&255, bd->set[bd->new]);
> > > + spin_lock(&blocked_delegations_lock);
> >
> > __set_bit isn't atomic. The spin_lock should be taken *before* these
> > __set_bit() calls.
> >
> > Otherwise, looks fine.
> >
> > Thanks,
> > NeilBrown
> >
> >
>
> Ok. I guess the worry is that we could end up setting bits in the
> middle of swapping the two fields? Makes sense -- fixed in my repo.

It is more subtle than that.
__set_bit() will:
read a value from memory to a register
set a bit in the register
write the register back out to memory

If two threads both run __set_bit on the same word of memory at the same
time, one of the updates can get lost.
set_bit() (no underscore) performs an atomic RMW to avoid this, but is more
expensive.
spin_lock() obviously ensures the required exclusion and as we are going to
take the lock anyway we may as well take it before setting bits so we can use
the non-atomic (cheaper) __set_bit function.

> I'll send out the updated set later today (it also includes a few nits
> that HCH pointed out last week).
>
> As a side note...I wonder how much we'll get in the way of false
> positives with this scheme?

If a future version of NFSv4 could allow delegations to be granted while a
file is open (oh, it seems you are the only client using this file at the
moment, you can treat this "open" as a delegation if you like) a few false
positives would be a complete non-issue. As it is, I think we just have to
hope.

Thanks,
NeilBrown
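
Neil's lost-update scenario is straightforward to demonstrate in user
space. A small sketch (a plain |= standing in for __set_bit, a pthread
mutex for the spinlock; the lock is taken first, exactly as suggested):

#include <pthread.h>
#include <stdio.h>

static unsigned long word;
static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;

/* Non-atomic read-modify-write, like __set_bit(): without the lock,
 * two threads hitting the same word can lose one of the updates. */
static void set_bit_unlocked(int nr) { word |= 1UL << nr; }

static void *racer(void *arg)
{
	pthread_mutex_lock(&lock);      /* lock before the cheap RMW */
	set_bit_unlocked((int)(long)arg);
	pthread_mutex_unlock(&lock);
	return NULL;
}

int main(void)
{
	pthread_t a, b;

	pthread_create(&a, NULL, racer, (void *)1L);
	pthread_create(&b, NULL, racer, (void *)2L);
	pthread_join(a, NULL);
	pthread_join(b, NULL);
	printf("word = %#lx\n", word);  /* always 0x6 with the lock */
	return 0;
}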



2014-07-18 19:32:27

by Jeff Layton

[permalink] [raw]
Subject: Re: [PATCH v4 01/10] nfsd: Protect the nfs4_file delegation fields using the fi_lock

On Fri, 18 Jul 2014 15:21:49 -0400
"J. Bruce Fields" <[email protected]> wrote:

> On Fri, Jul 18, 2014 at 03:04:04PM -0400, Jeff Layton wrote:
> > On Fri, 18 Jul 2014 13:49:57 -0400
> > "J. Bruce Fields" <[email protected]> wrote:
> >
> > > On Fri, Jul 18, 2014 at 01:31:40PM -0400, Jeff Layton wrote:
> > > > On Fri, 18 Jul 2014 12:28:25 -0400
> > > > "J. Bruce Fields" <[email protected]> wrote:
> > > >
> > > > > On Fri, Jul 18, 2014 at 11:13:27AM -0400, Jeff Layton wrote:
> > > > > > Move more of the delegation fields to be protected by the fi_lock. It's
> > > > > > more granular than the state_lock and in later patches we'll want to
> > > > > > be able to rely on it in addition to the state_lock.
> > > > > >
> > > > > > Also, the current code in nfs4_setlease calls vfs_setlease and uses the
> > > > > > client_mutex to ensure that it doesn't disappear before we can hash the
> > > > > > delegation. With the client_mutex gone, we'll have a potential race
> > > > > > condition.
> > > > > >
> > > > > > It's possible that the delegation could be recalled after we acquire the
> > > > > > lease but before we ever get around to hashing it. If that happens, then
> > > > > > we'd have a nfs4_file that *thinks* it has a delegation, when it
> > > > > > actually has none.
> > > > >
> > > > > I understand now, thanks: so the lease break code walks the list of
> > > > > delegations associated with the file, finds none, and issues no recall,
> > > > > but the open code continues merrily on and returns a delegation, with
> > > > > the result that we return the client a delegation that will never be
> > > > > recalled.
> > > > >
> > > > > That could be worded more carefully, and would be worth a separate patch
> > > > > (since the bug predates the new locking).
> > > > >
> > > >
> > > > Yes, that's basically correct. I'd have to think about how to fix that
> > > > with the current code. It's probably doable if you think it's
> > > > worthwhile, but I'll need to rebase this set on top of it.
> > >
> > > Well, I was wondering if this patch could just be split in two, no need
> > > to backport further than that.
> > >
> >
> > Erm, now that I've looked, I don't think it'll be that easy. The key
> > here is to ensure that fi_had_conflict is set while holding the
> > fi_lock. The trick here is that we need to take it in nfs4_setlease as
> > well, and check the flag before hashing the delegation without dropping
> > the fi_lock.
>
> OK, I'll live. For the sake of anyone that actually runs across that
> bug I'll update the summary and changelog to emphasize the bugfix over
> the locking change.
>

Ok, thanks.

> > > > > > Attempt to acquire a delegation. If that succeeds, take the spinlocks
> > > > > > and then check to see if the file has had a conflict show up since then.
> > > > > > If it has, then we assume that the lease is no longer valid and that
> > > > > > we shouldn't hand out a delegation.
> > > > > >
> > > > > > There's also one more potential (but very unlikely) problem. If the
> > > > > > lease is broken before the delegation is hashed, then it could leak.
> > > > > > In the event that the fi_delegations list is empty, reset the
> > > > > > fl_break_time to jiffies so that it's cleaned up ASAP by
> > > > > > the normal lease handling code.
> > > > >
> > > > > Is there actually any guarantee time_out_leases() will get called on
> > > > > this inode again?
> > > > >
> > > > > --b.
> > > > >
> > > >
> > > > Yes. Lease breaks are handled in two phases. We walk the i_flock list
> > > > and issue a ->lm_break on each lease, and then later we walk the list
> > > > again after putting the task to sleep, and try to time out the leases.
> > > > So by doing this, we should ensure that the task will wake up after
> > > > sleeping and delete it.
> > >
> > > In the case of an interrupt or a nonblocking break (which is what nfsd
> > > will do), then time_out_leases isn't called again from what I could
> > > tell.
> > >
> > > --b.
> > >
> >
> > In both cases, time_out_leases is still called at the beginning of
> > __break_lease. So the next time that function is called it'll
> > get cleaned up, or when the filp is closed (in locks_remove_file).
>
> Right, but there's no guarantee another break_lease comes. E.g. the
> process waiting on the lease break could get killed.
>
> --b.

In that case, there's no harm in leaving the lease on the list until
the filp closes.

FWIW, I looked at trying to just remove the lease from the list, but
that's not safe from the lm_break callback. So, I think this is the
best we can reasonably do here.

--
Jeff Layton <[email protected]>
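
The race being closed in this sub-thread reduces to a check-after-lock
pattern: acquire the lease first, then re-test fi_had_conflict under
fi_lock before hashing the delegation. A hedged user-space sketch
(stub functions and a pthread mutex; not the kernel code):

#include <errno.h>
#include <pthread.h>

struct file_state {
	pthread_mutex_t fi_lock;
	int fi_had_conflict;    /* set by the lease-break path */
};

static int setlease(struct file_state *fp) { (void)fp; return 0; }
static void hash_delegation(struct file_state *fp) { (void)fp; }

static int set_delegation(struct file_state *fp)
{
	int status = setlease(fp);

	if (status)
		return status;

	pthread_mutex_lock(&fp->fi_lock);
	if (fp->fi_had_conflict) {
		/* A break raced in after the lease was set; hashing
		 * the delegation now would hand out one that is never
		 * recalled, so bail instead. */
		pthread_mutex_unlock(&fp->fi_lock);
		return -EAGAIN;
	}
	hash_delegation(fp);
	pthread_mutex_unlock(&fp->fi_lock);
	return 0;
}

int main(void)
{
	struct file_state fp = { PTHREAD_MUTEX_INITIALIZER, 0 };

	return set_delegation(&fp);
}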

2014-07-18 15:13:52

by Jeff Layton

[permalink] [raw]
Subject: [PATCH v4 04/10] nfsd: Fix delegation revocation

Ensure that delegations cannot be found by the laundromat etc. once
we add them to the various 'revoke' lists.

Signed-off-by: Trond Myklebust <[email protected]>
Signed-off-by: Jeff Layton <[email protected]>
Reviewed-by: Christoph Hellwig <[email protected]>
---
fs/nfsd/nfs4state.c | 37 +++++++++++++++++++++----------------
1 file changed, 21 insertions(+), 16 deletions(-)

diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
index 60ae21abce00..7c5233427c9b 100644
--- a/fs/nfsd/nfs4state.c
+++ b/fs/nfsd/nfs4state.c
@@ -650,13 +650,13 @@ hash_delegation_locked(struct nfs4_delegation *dp, struct nfs4_file *fp)
list_add(&dp->dl_perclnt, &dp->dl_stid.sc_client->cl_delegations);
}

-/* Called under the state lock. */
static void
-unhash_delegation(struct nfs4_delegation *dp)
+unhash_delegation_locked(struct nfs4_delegation *dp)
{
struct nfs4_file *fp = dp->dl_file;

- spin_lock(&state_lock);
+ lockdep_assert_held(&state_lock);
+
dp->dl_stid.sc_type = NFS4_CLOSED_DELEG_STID;
spin_lock(&fp->fi_lock);
list_del_init(&dp->dl_perclnt);
@@ -665,7 +665,6 @@ unhash_delegation(struct nfs4_delegation *dp)
spin_unlock(&fp->fi_lock);
if (fp)
nfs4_put_deleg_lease(fp);
- spin_unlock(&state_lock);
}

static void destroy_revoked_delegation(struct nfs4_delegation *dp)
@@ -676,7 +675,9 @@ static void destroy_revoked_delegation(struct nfs4_delegation *dp)

static void destroy_delegation(struct nfs4_delegation *dp)
{
- unhash_delegation(dp);
+ spin_lock(&state_lock);
+ unhash_delegation_locked(dp);
+ spin_unlock(&state_lock);
nfs4_put_delegation(dp);
}

@@ -685,11 +686,10 @@ static void revoke_delegation(struct nfs4_delegation *dp)
struct nfs4_client *clp = dp->dl_stid.sc_client;

if (clp->cl_minorversion == 0)
- destroy_delegation(dp);
+ destroy_revoked_delegation(dp);
else {
- unhash_delegation(dp);
dp->dl_stid.sc_type = NFS4_REVOKED_DELEG_STID;
- list_add(&dp->dl_recall_lru, &clp->cl_revoked);
+ list_move(&dp->dl_recall_lru, &clp->cl_revoked);
}
}

@@ -1447,15 +1447,16 @@ destroy_client(struct nfs4_client *clp)
spin_lock(&state_lock);
while (!list_empty(&clp->cl_delegations)) {
dp = list_entry(clp->cl_delegations.next, struct nfs4_delegation, dl_perclnt);
- list_del_init(&dp->dl_perclnt);
+ unhash_delegation_locked(dp);
/* Ensure that deleg break won't try to requeue it */
++dp->dl_time;
- list_move(&dp->dl_recall_lru, &reaplist);
+ list_add(&dp->dl_recall_lru, &reaplist);
}
spin_unlock(&state_lock);
while (!list_empty(&reaplist)) {
dp = list_entry(reaplist.next, struct nfs4_delegation, dl_recall_lru);
- destroy_delegation(dp);
+ list_del_init(&dp->dl_recall_lru);
+ nfs4_put_delegation(dp);
}
list_splice_init(&clp->cl_revoked, &reaplist);
while (!list_empty(&reaplist)) {
@@ -3655,7 +3656,7 @@ nfs4_open_delegation(struct net *net, struct svc_fh *fh,
open->op_delegate_type = NFS4_OPEN_DELEGATE_READ;
return;
out_free:
- destroy_delegation(dp);
+ nfs4_put_delegation(dp);
out_no_deleg:
open->op_delegate_type = NFS4_OPEN_DELEGATE_NONE;
if (open->op_claim_type == NFS4_OPEN_CLAIM_PREVIOUS &&
@@ -3894,7 +3895,8 @@ nfs4_laundromat(struct nfsd_net *nn)
new_timeo = min(new_timeo, t);
break;
}
- list_move(&dp->dl_recall_lru, &reaplist);
+ unhash_delegation_locked(dp);
+ list_add(&dp->dl_recall_lru, &reaplist);
}
spin_unlock(&state_lock);
list_for_each_safe(pos, next, &reaplist) {
@@ -5369,7 +5371,8 @@ static u64 nfsd_find_all_delegations(struct nfs4_client *clp, u64 max,
* don't monkey with it now that we are.
*/
++dp->dl_time;
- list_move(&dp->dl_recall_lru, victims);
+ unhash_delegation_locked(dp);
+ list_add(&dp->dl_recall_lru, victims);
}
if (++count == max)
break;
@@ -5624,12 +5627,14 @@ nfs4_state_shutdown_net(struct net *net)
spin_lock(&state_lock);
list_for_each_safe(pos, next, &nn->del_recall_lru) {
dp = list_entry (pos, struct nfs4_delegation, dl_recall_lru);
- list_move(&dp->dl_recall_lru, &reaplist);
+ unhash_delegation_locked(dp);
+ list_add(&dp->dl_recall_lru, &reaplist);
}
spin_unlock(&state_lock);
list_for_each_safe(pos, next, &reaplist) {
dp = list_entry (pos, struct nfs4_delegation, dl_recall_lru);
- destroy_delegation(dp);
+ list_del_init(&dp->dl_recall_lru);
+ nfs4_put_delegation(dp);
}

nfsd4_client_tracking_exit(net);
--
1.9.3
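
A recurring idiom in this patch is worth spelling out: unhash entries
and collect them on a private reaplist while holding the spinlock, then
drop the references only after the lock is released, since the final
put may block. A user-space sketch of the shape (singly-linked toy
list, pthread mutex for state_lock; names are illustrative):

#include <pthread.h>
#include <stdlib.h>

struct deleg {
	struct deleg *next;     /* toy stand-in for dl_recall_lru */
};

static pthread_mutex_t state_lock = PTHREAD_MUTEX_INITIALIZER;
static struct deleg *del_recall_lru;

static void unhash_locked(struct deleg *dp)
{
	(void)dp;               /* caller holds state_lock; unhash here */
}

static void put_delegation(struct deleg *dp)
{
	free(dp);               /* may block in real life: run unlocked */
}

static void reap_all(void)
{
	struct deleg *reaplist = NULL, *dp;

	pthread_mutex_lock(&state_lock);
	while ((dp = del_recall_lru) != NULL) {
		del_recall_lru = dp->next;
		unhash_locked(dp);
		dp->next = reaplist;    /* move onto the private list */
		reaplist = dp;
	}
	pthread_mutex_unlock(&state_lock);

	while ((dp = reaplist) != NULL) {       /* lock dropped */
		reaplist = dp->next;
		put_delegation(dp);
	}
}

int main(void)
{
	del_recall_lru = calloc(1, sizeof(*del_recall_lru));
	reap_all();
	return 0;
}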