2021-11-23 12:22:27

by Jeffrey Layton

[permalink] [raw]
Subject: [PATCH] nfsd: don't put blocked locks on LRU until after vfs_lock_file returns

Vasily reported a case where vfs_lock_file took a very long time to
return (longer than a lease period). The laundromat eventually ran and
reaped the thing and when the vfs_lock_file returned, it ended up
accessing freed memory.

Don't put entries onto the LRU until vfs_lock_file returns.

Reported-by: Vasily Averin <[email protected]>
Signed-off-by: Jeff Layton <[email protected]>
---
fs/nfsd/nfs4state.c | 6 ++++--
1 file changed, 4 insertions(+), 2 deletions(-)

diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
index bfad94c70b84..8cfef84b9355 100644
--- a/fs/nfsd/nfs4state.c
+++ b/fs/nfsd/nfs4state.c
@@ -6966,10 +6966,8 @@ nfsd4_lock(struct svc_rqst *rqstp, struct nfsd4_compound_state *cstate,
}

if (fl_flags & FL_SLEEP) {
- nbl->nbl_time = ktime_get_boottime_seconds();
spin_lock(&nn->blocked_locks_lock);
list_add_tail(&nbl->nbl_list, &lock_sop->lo_blocked);
- list_add_tail(&nbl->nbl_lru, &nn->blocked_locks_lru);
spin_unlock(&nn->blocked_locks_lock);
}

@@ -6982,6 +6980,10 @@ nfsd4_lock(struct svc_rqst *rqstp, struct nfsd4_compound_state *cstate,
nn->somebody_reclaimed = true;
break;
case FILE_LOCK_DEFERRED:
+ nbl->nbl_time = ktime_get_boottime_seconds();
+ spin_lock(&nn->blocked_locks_lock);
+ list_add_tail(&nbl->nbl_lru, &nn->blocked_locks_lru);
+ spin_unlock(&nn->blocked_locks_lock);
nbl = NULL;
fallthrough;
case -EAGAIN: /* conflock holds conflicting lock */
--
2.33.1



2021-11-23 15:20:15

by Vasily Averin

[permalink] [raw]
Subject: Re: [PATCH] nfsd: don't put blocked locks on LRU until after vfs_lock_file returns

On 23.11.2021 15:22, Jeff Layton wrote:
> Vasily reported a case where vfs_lock_file took a very long time to
> return (longer than a lease period). The laundromat eventually ran and
> reaped the thing and when the vfs_lock_file returned, it ended up
> accessing freed memory.
>
> Don't put entries onto the LRU until vfs_lock_file returns.

Cc: [email protected]
Fixes: 7919d0a27f1e "nfsd: add a LRU list for blocked locks"

> Reported-by: Vasily Averin <[email protected]>
> Signed-off-by: Jeff Layton <[email protected]>
Tested-by: Vasily Averin <[email protected]>

> ---
> fs/nfsd/nfs4state.c | 6 ++++--
> 1 file changed, 4 insertions(+), 2 deletions(-)
>
> diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
> index bfad94c70b84..8cfef84b9355 100644
> --- a/fs/nfsd/nfs4state.c
> +++ b/fs/nfsd/nfs4state.c
> @@ -6966,10 +6966,8 @@ nfsd4_lock(struct svc_rqst *rqstp, struct nfsd4_compound_state *cstate,
> }
>
> if (fl_flags & FL_SLEEP) {
> - nbl->nbl_time = ktime_get_boottime_seconds();
> spin_lock(&nn->blocked_locks_lock);
> list_add_tail(&nbl->nbl_list, &lock_sop->lo_blocked);
> - list_add_tail(&nbl->nbl_lru, &nn->blocked_locks_lru);
> spin_unlock(&nn->blocked_locks_lock);
> }
>
> @@ -6982,6 +6980,10 @@ nfsd4_lock(struct svc_rqst *rqstp, struct nfsd4_compound_state *cstate,
> nn->somebody_reclaimed = true;
> break;
> case FILE_LOCK_DEFERRED:
> + nbl->nbl_time = ktime_get_boottime_seconds();
> + spin_lock(&nn->blocked_locks_lock);
> + list_add_tail(&nbl->nbl_lru, &nn->blocked_locks_lru);
> + spin_unlock(&nn->blocked_locks_lock);
> nbl = NULL;
> fallthrough;
> case -EAGAIN: /* conflock holds conflicting lock */
>


2021-11-23 15:21:55

by Chuck Lever

[permalink] [raw]
Subject: Re: [PATCH] nfsd: don't put blocked locks on LRU until after vfs_lock_file returns



> On Nov 23, 2021, at 10:20 AM, Vasily Averin <[email protected]> wrote:
>
> On 23.11.2021 15:22, Jeff Layton wrote:
>> Vasily reported a case where vfs_lock_file took a very long time to
>> return (longer than a lease period). The laundromat eventually ran and
>> reaped the thing and when the vfs_lock_file returned, it ended up
>> accessing freed memory.
>>
>> Don't put entries onto the LRU until vfs_lock_file returns.
>
> Cc: [email protected]
> Fixes: 7919d0a27f1e "nfsd: add a LRU list for blocked locks"
>
>> Reported-by: Vasily Averin <[email protected]>
>> Signed-off-by: Jeff Layton <[email protected]>
> Tested-by: Vasily Averin <[email protected]>

I assume Bruce is grabbing this for v5.16-rc.


>> ---
>> fs/nfsd/nfs4state.c | 6 ++++--
>> 1 file changed, 4 insertions(+), 2 deletions(-)
>>
>> diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
>> index bfad94c70b84..8cfef84b9355 100644
>> --- a/fs/nfsd/nfs4state.c
>> +++ b/fs/nfsd/nfs4state.c
>> @@ -6966,10 +6966,8 @@ nfsd4_lock(struct svc_rqst *rqstp, struct nfsd4_compound_state *cstate,
>> }
>>
>> if (fl_flags & FL_SLEEP) {
>> - nbl->nbl_time = ktime_get_boottime_seconds();
>> spin_lock(&nn->blocked_locks_lock);
>> list_add_tail(&nbl->nbl_list, &lock_sop->lo_blocked);
>> - list_add_tail(&nbl->nbl_lru, &nn->blocked_locks_lru);
>> spin_unlock(&nn->blocked_locks_lock);
>> }
>>
>> @@ -6982,6 +6980,10 @@ nfsd4_lock(struct svc_rqst *rqstp, struct nfsd4_compound_state *cstate,
>> nn->somebody_reclaimed = true;
>> break;
>> case FILE_LOCK_DEFERRED:
>> + nbl->nbl_time = ktime_get_boottime_seconds();
>> + spin_lock(&nn->blocked_locks_lock);
>> + list_add_tail(&nbl->nbl_lru, &nn->blocked_locks_lru);
>> + spin_unlock(&nn->blocked_locks_lock);
>> nbl = NULL;
>> fallthrough;
>> case -EAGAIN: /* conflock holds conflicting lock */
>>
>

--
Chuck Lever




2021-11-23 15:59:20

by J. Bruce Fields

[permalink] [raw]
Subject: Re: [PATCH] nfsd: don't put blocked locks on LRU until after vfs_lock_file returns

On Tue, Nov 23, 2021 at 07:22:23AM -0500, Jeff Layton wrote:
> Vasily reported a case where vfs_lock_file took a very long time to
> return (longer than a lease period). The laundromat eventually ran and
> reaped the thing and when the vfs_lock_file returned, it ended up
> accessing freed memory.

By the way, once we've called vfs_lock_file(), is there anything
preventing nfsd4_cb_notify_lock_release() from freeing nbl before we get
here?

>
> Don't put entries onto the LRU until vfs_lock_file returns.
>
> Reported-by: Vasily Averin <[email protected]>
> Signed-off-by: Jeff Layton <[email protected]>
> ---
> fs/nfsd/nfs4state.c | 6 ++++--
> 1 file changed, 4 insertions(+), 2 deletions(-)
>
> diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
> index bfad94c70b84..8cfef84b9355 100644
> --- a/fs/nfsd/nfs4state.c
> +++ b/fs/nfsd/nfs4state.c
> @@ -6966,10 +6966,8 @@ nfsd4_lock(struct svc_rqst *rqstp, struct nfsd4_compound_state *cstate,
> }
>
> if (fl_flags & FL_SLEEP) {
> - nbl->nbl_time = ktime_get_boottime_seconds();
> spin_lock(&nn->blocked_locks_lock);
> list_add_tail(&nbl->nbl_list, &lock_sop->lo_blocked);
> - list_add_tail(&nbl->nbl_lru, &nn->blocked_locks_lru);
> spin_unlock(&nn->blocked_locks_lock);
> }
>
> @@ -6982,6 +6980,10 @@ nfsd4_lock(struct svc_rqst *rqstp, struct nfsd4_compound_state *cstate,
> nn->somebody_reclaimed = true;
> break;
> case FILE_LOCK_DEFERRED:
> + nbl->nbl_time = ktime_get_boottime_seconds();
> + spin_lock(&nn->blocked_locks_lock);
> + list_add_tail(&nbl->nbl_lru, &nn->blocked_locks_lru);
> + spin_unlock(&nn->blocked_locks_lock);
> nbl = NULL;
> fallthrough;
> case -EAGAIN: /* conflock holds conflicting lock */
> --
> 2.33.1

2021-11-23 16:31:05

by Jeffrey Layton

[permalink] [raw]
Subject: Re: [PATCH] nfsd: don't put blocked locks on LRU until after vfs_lock_file returns

On Tue, 2021-11-23 at 10:59 -0500, J. Bruce Fields wrote:
> On Tue, Nov 23, 2021 at 07:22:23AM -0500, Jeff Layton wrote:
> > Vasily reported a case where vfs_lock_file took a very long time to
> > return (longer than a lease period). The laundromat eventually ran and
> > reaped the thing and when the vfs_lock_file returned, it ended up
> > accessing freed memory.
>
> By the way, once we've called vfs_lock_file(), is there anything
> preventing nfsd4_cb_notify_lock_release() from freeing nbl before we get
> here?
>

No, I don't think there is. Good catch.

Hmm...the only way I can see to fix that would be to add a refcount to
these things, in which case we probably don't need this patch since it
would prevent the original issue as well...

> >
> > Don't put entries onto the LRU until vfs_lock_file returns.
> >
> > Reported-by: Vasily Averin <[email protected]>
> > Signed-off-by: Jeff Layton <[email protected]>
> > ---
> > fs/nfsd/nfs4state.c | 6 ++++--
> > 1 file changed, 4 insertions(+), 2 deletions(-)
> >
> > diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
> > index bfad94c70b84..8cfef84b9355 100644
> > --- a/fs/nfsd/nfs4state.c
> > +++ b/fs/nfsd/nfs4state.c
> > @@ -6966,10 +6966,8 @@ nfsd4_lock(struct svc_rqst *rqstp, struct nfsd4_compound_state *cstate,
> > }
> >
> > if (fl_flags & FL_SLEEP) {
> > - nbl->nbl_time = ktime_get_boottime_seconds();
> > spin_lock(&nn->blocked_locks_lock);
> > list_add_tail(&nbl->nbl_list, &lock_sop->lo_blocked);
> > - list_add_tail(&nbl->nbl_lru, &nn->blocked_locks_lru);
> > spin_unlock(&nn->blocked_locks_lock);
> > }
> >
> > @@ -6982,6 +6980,10 @@ nfsd4_lock(struct svc_rqst *rqstp, struct nfsd4_compound_state *cstate,
> > nn->somebody_reclaimed = true;
> > break;
> > case FILE_LOCK_DEFERRED:
> > + nbl->nbl_time = ktime_get_boottime_seconds();
> > + spin_lock(&nn->blocked_locks_lock);
> > + list_add_tail(&nbl->nbl_lru, &nn->blocked_locks_lru);
> > + spin_unlock(&nn->blocked_locks_lock);
> > nbl = NULL;
> > fallthrough;
> > case -EAGAIN: /* conflock holds conflicting lock */
> > --
> > 2.33.1

--
Jeff Layton <[email protected]>

2021-11-23 16:42:03

by J. Bruce Fields

[permalink] [raw]
Subject: Re: [PATCH] nfsd: don't put blocked locks on LRU until after vfs_lock_file returns

On Tue, Nov 23, 2021 at 11:31:01AM -0500, Jeff Layton wrote:
> On Tue, 2021-11-23 at 10:59 -0500, J. Bruce Fields wrote:
> > On Tue, Nov 23, 2021 at 07:22:23AM -0500, Jeff Layton wrote:
> > > Vasily reported a case where vfs_lock_file took a very long time to
> > > return (longer than a lease period). The laundromat eventually ran and
> > > reaped the thing and when the vfs_lock_file returned, it ended up
> > > accessing freed memory.
> >
> > By the way, once we've called vfs_lock_file(), is there anything
> > preventing nfsd4_cb_notify_lock_release() from freeing nbl before we get
> > here?
> >
>
> No, I don't think there is. Good catch.

It may be a rare race (an rpc's not normally going to reply in that
time), but I wouldn't be surprised if there's some error condition where
it's possible.

> Hmm...the only way I can see to fix that would be to add a refcount to
> these things, in which case we probably don't need this patch since it
> would prevent the original issue as well...

Depending on how long that might take, I'd be OK with applying this as a
stopgap.

--b.

>
> > >
> > > Don't put entries onto the LRU until vfs_lock_file returns.
> > >
> > > Reported-by: Vasily Averin <[email protected]>
> > > Signed-off-by: Jeff Layton <[email protected]>
> > > ---
> > > fs/nfsd/nfs4state.c | 6 ++++--
> > > 1 file changed, 4 insertions(+), 2 deletions(-)
> > >
> > > diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
> > > index bfad94c70b84..8cfef84b9355 100644
> > > --- a/fs/nfsd/nfs4state.c
> > > +++ b/fs/nfsd/nfs4state.c
> > > @@ -6966,10 +6966,8 @@ nfsd4_lock(struct svc_rqst *rqstp, struct nfsd4_compound_state *cstate,
> > > }
> > >
> > > if (fl_flags & FL_SLEEP) {
> > > - nbl->nbl_time = ktime_get_boottime_seconds();
> > > spin_lock(&nn->blocked_locks_lock);
> > > list_add_tail(&nbl->nbl_list, &lock_sop->lo_blocked);
> > > - list_add_tail(&nbl->nbl_lru, &nn->blocked_locks_lru);
> > > spin_unlock(&nn->blocked_locks_lock);
> > > }
> > >
> > > @@ -6982,6 +6980,10 @@ nfsd4_lock(struct svc_rqst *rqstp, struct nfsd4_compound_state *cstate,
> > > nn->somebody_reclaimed = true;
> > > break;
> > > case FILE_LOCK_DEFERRED:
> > > + nbl->nbl_time = ktime_get_boottime_seconds();
> > > + spin_lock(&nn->blocked_locks_lock);
> > > + list_add_tail(&nbl->nbl_lru, &nn->blocked_locks_lru);
> > > + spin_unlock(&nn->blocked_locks_lock);
> > > nbl = NULL;
> > > fallthrough;
> > > case -EAGAIN: /* conflock holds conflicting lock */
> > > --
> > > 2.33.1
>
> --
> Jeff Layton <[email protected]>

2021-11-23 17:02:10

by Jeffrey Layton

[permalink] [raw]
Subject: Re: [PATCH] nfsd: don't put blocked locks on LRU until after vfs_lock_file returns

On Tue, 2021-11-23 at 11:41 -0500, J. Bruce Fields wrote:
> On Tue, Nov 23, 2021 at 11:31:01AM -0500, Jeff Layton wrote:
> > On Tue, 2021-11-23 at 10:59 -0500, J. Bruce Fields wrote:
> > > On Tue, Nov 23, 2021 at 07:22:23AM -0500, Jeff Layton wrote:
> > > > Vasily reported a case where vfs_lock_file took a very long time to
> > > > return (longer than a lease period). The laundromat eventually ran and
> > > > reaped the thing and when the vfs_lock_file returned, it ended up
> > > > accessing freed memory.
> > >
> > > By the way, once we've called vfs_lock_file(), is there anything
> > > preventing nfsd4_cb_notify_lock_release() from freeing nbl before we get
> > > here?
> > >
> >
> > No, I don't think there is. Good catch.
>
> It may be a rare race (an rpc's not normally going to reply in that
> time), but I wouldn't be surprised if there's some error condition where
> it's possible.
>
> > Hmm...the only way I can see to fix that would be to add a refcount to
> > these things, in which case we probably don't need this patch since it
> > would prevent the original issue as well...
>
> Depending on how long that might take, I'd be OK with applying this as a
> stopgap.
>
>

I won't have the cycles to do that anytime soon, unfortunately, so you
may want to.

> >
> > > >
> > > > Don't put entries onto the LRU until vfs_lock_file returns.
> > > >
> > > > Reported-by: Vasily Averin <[email protected]>
> > > > Signed-off-by: Jeff Layton <[email protected]>
> > > > ---
> > > > fs/nfsd/nfs4state.c | 6 ++++--
> > > > 1 file changed, 4 insertions(+), 2 deletions(-)
> > > >
> > > > diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
> > > > index bfad94c70b84..8cfef84b9355 100644
> > > > --- a/fs/nfsd/nfs4state.c
> > > > +++ b/fs/nfsd/nfs4state.c
> > > > @@ -6966,10 +6966,8 @@ nfsd4_lock(struct svc_rqst *rqstp, struct nfsd4_compound_state *cstate,
> > > > }
> > > >
> > > > if (fl_flags & FL_SLEEP) {
> > > > - nbl->nbl_time = ktime_get_boottime_seconds();
> > > > spin_lock(&nn->blocked_locks_lock);
> > > > list_add_tail(&nbl->nbl_list, &lock_sop->lo_blocked);
> > > > - list_add_tail(&nbl->nbl_lru, &nn->blocked_locks_lru);
> > > > spin_unlock(&nn->blocked_locks_lock);
> > > > }
> > > >
> > > > @@ -6982,6 +6980,10 @@ nfsd4_lock(struct svc_rqst *rqstp, struct nfsd4_compound_state *cstate,
> > > > nn->somebody_reclaimed = true;
> > > > break;
> > > > case FILE_LOCK_DEFERRED:
> > > > + nbl->nbl_time = ktime_get_boottime_seconds();
> > > > + spin_lock(&nn->blocked_locks_lock);
> > > > + list_add_tail(&nbl->nbl_lru, &nn->blocked_locks_lru);
> > > > + spin_unlock(&nn->blocked_locks_lock);
> > > > nbl = NULL;
> > > > fallthrough;
> > > > case -EAGAIN: /* conflock holds conflicting lock */
> > > > --
> > > > 2.33.1
> >
> > --
> > Jeff Layton <[email protected]>

--
Jeff Layton <[email protected]>