On Nov 5, 2008, at 7:05 PM, Labiaga, Ricardo wrote:
>>>>>
>>>>> Signed-off-by: Benny Halevy <[email protected]>
>>>>> ---
>>>>>
>>>>> Trond, I'm not sure if this can happen without nfs41.
>>>>> However, please consider this patch for upstream since
>>>>> it is safe to do in any case.
>>>>>
>>>>> Benny
>>>>>
>>>>> net/sunrpc/clnt.c | 8 +++++---
>>>>> 1 files changed, 5 insertions(+), 3 deletions(-)
>>>>>
>>>>> diff --git a/net/sunrpc/clnt.c b/net/sunrpc/clnt.c
>>>>> index 78fc483..b555d9f 100644
>>>>> --- a/net/sunrpc/clnt.c
>>>>> +++ b/net/sunrpc/clnt.c
>>>>> @@ -1206,7 +1206,8 @@ call_status(struct rpc_task *task)
>>>>> break;
>>>>> case -ECONNREFUSED:
>>>>> case -ENOTCONN:
>>>>> - rpc_force_rebind(clnt);
>>>>> + if (clnt)
>>>>> + rpc_force_rebind(clnt);
>>>>> task->tk_action = call_bind;
>>>>> break;
>>>>> case -EAGAIN:
>>>>> @@ -1217,9 +1218,10 @@ call_status(struct rpc_task *task)
>>>>> rpc_exit(task, status);
>>>>> break;
>>>>> default:
>>>>> - if (clnt->cl_chatty)
>>>>> + if (!clnt || clnt->cl_chatty)
>>>>> printk("%s: RPC call returned error %d\n",
>>>>> - clnt->cl_protname, -status);
>>>>> + clnt ? clnt->cl_protname : "<unknown protocol>",
>>>>> + -status);
>>>>> rpc_exit(task, status);
>>>>> }
>>>>> }
>>>> BIG NACK!
>>>>
>>>> How does it even make sense for a task to get past call_transmit and
>>>> call_status without having task->tk_client set? This sounds like
>>>> serious borkenness in the nfsv4.1 patches...
>>
>> Ricardo,
>>
>> rpc_run_bc_task does not set task_setup_data.rpc_client when calling
>> rpc_new_task.
>> We might be able to use the forechannel rpc client,
>
> We could, though it would cause the reconnection to occur in the
> backchannel code path. Haven't thought it through, but it sounds
> cleaner to rebind the session (or reset it if the server went away) in
> the forechannel context. I wonder if it would be acceptable to simply
> drop the backchannel request, and have the forechannel reestablish the
> connection (on a retransmission)?
>
>> but
>> I'm still concerned that using this path for sending the callback
>> replies is wrong.
>
> The mainline RPC call_transmit() is already able to deal with RPC
> transmissions that don't expect a reply. This is pretty similar to
> sending a reply, so we're leveraging the existing rpc_xprt.
>
> One alternative could be to construct an svc_xprt for the backchannel
> and use svc_send() for the reply. I wonder if it's really a better
> approach since we already have the rpc_xprt available.
I failed to mention that with the existing approach we're able to
synchronize forechannel calls and backchannel replies on the rpc_xprt
(XPRT_LOCKED bit). It uses the synchronization mechanism already in
mainline to prevent mixing the payload of requests and replies.
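To illustrate the idea (just a sketch, not code from the patches): both
directions have to win the same bit before touching the socket, so a
forechannel call and a backchannel reply can never interleave on the wire:

        /* sketch only: whoever owns XPRT_LOCKED owns the socket */
        if (test_and_set_bit(XPRT_LOCKED, &xprt->state))
                return -EAGAIN; /* the other channel is transmitting */

        /* ... write the call (forechannel) or the callback reply
         * (backchannel) on the transport's socket ... */

        clear_bit(XPRT_LOCKED, &xprt->state);
        /* the real code must also wake up tasks waiting on the lock */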
- ricardo
>
>>>
>>> The "Callback slot table overflowed" message means we couldn't
> obtain a
>>> pre-allocated rpc_rqst to process the callback. When this occurs,
> the
>>> client sets the XPRT_CLOSE_WAIT bit to close the connection. (As an
>>> aside, should we be using xprt_force_disconnect() instead?).
>>
>> could be, but we might be doing the right thing on the client side
>> in this case already.
>
> We're correctly closing the connection, but it doesn't seem like we're
> acquiring the lock, as it is done in xprt_force_disconnect().
> xprt_force_disconnect() wasn't around when I first added the autoclose
> code in xs_tcp_read_callback().
>
>>
>>>
>>> This leads me to believe the server exceeded the number of outstanding
>>> requests allowed on the backchannel (slot count of one at the moment);
>>> the new request caused us to close the connection and pulled the rug
>>> from under the callback service. I'll investigate further.
>>
>> We simply got ENOTCONN back (see value of RBX)
>
> Right, I was trying to explain what may have caused the connection-closed
> error. I'm suggesting the unexpected request closed the connection, and
> was dropped on the floor by xs_tcp_read_callback(). After the first
> request was processed, it encountered a closed connection during the
> reply. This is why the original request dies with ENOTCONN.
>
> I'll take a look at the svc_send() path using svc_xprt. Though I think
> your proposed patch to not force a rebind in the backchannel is
> appropriate. I guess I'm pleading with Trond to reconsider the Big NACK
> :-)
>
> - ricardo
>
>>
>>>
>>> Is this reproducible?
>>
>> Not yet.
>>
>> Benny
>>
>>>
>>> - ricardo
>>>
>>>
>>>> Trond
>>>> ...
>>>>
On Nov. 06, 2008, 9:19 +0200, Ricardo Labiaga <[email protected]> wrote:
> On Nov 5, 2008, at 7:05 PM, Labiaga, Ricardo wrote:
>
>>>>>> Signed-off-by: Benny Halevy <[email protected]>
>>>>>> ---
>>>>>>
>>>>>> Trond, I'm not sure if this can happen without nfs41.
>>>>>> However, please consider this patch for upstream since
>>>>>> it is safe to do in any case.
>>>>>>
>>>>>> Benny
>>>>>>
>>>>>> net/sunrpc/clnt.c | 8 +++++---
>>>>>> 1 files changed, 5 insertions(+), 3 deletions(-)
>>>>>>
>>>>>> diff --git a/net/sunrpc/clnt.c b/net/sunrpc/clnt.c
>>>>>> index 78fc483..b555d9f 100644
>>>>>> --- a/net/sunrpc/clnt.c
>>>>>> +++ b/net/sunrpc/clnt.c
>>>>>> @@ -1206,7 +1206,8 @@ call_status(struct rpc_task *task)
>>>>>> break;
>>>>>> case -ECONNREFUSED:
>>>>>> case -ENOTCONN:
>>>>>> - rpc_force_rebind(clnt);
>>>>>> + if (clnt)
>>>>>> + rpc_force_rebind(clnt);
>>>>>> task->tk_action = call_bind;
>>>>>> break;
>>>>>> case -EAGAIN:
>>>>>> @@ -1217,9 +1218,10 @@ call_status(struct rpc_task *task)
>>>>>> rpc_exit(task, status);
>>>>>> break;
>>>>>> default:
>>>>>> - if (clnt->cl_chatty)
>>>>>> + if (!clnt || clnt->cl_chatty)
>>>>>> printk("%s: RPC call returned error %d\n",
>>>>>> - clnt->cl_protname, -status);
>>>>>> + clnt ? clnt->cl_protname : "<unknown protocol>",
>>>>>> + -status);
>>>>>> rpc_exit(task, status);
>>>>>> }
>>>>>> }
>>>>> BIG NACK!
>>>>>
>>>>> How does it even make sense for a task to get past call_transmit and
>>>>> call_status without having task->tk_client set? This sounds like
>>>>> serious borkenness in the nfsv4.1 patches...
>>> Ricardo,
>>>
>>> rpc_run_bc_task does not set task_setup_data.rpc_client when calling
>>> rpc_new_task.
>>> We might be able to use the forechannel rpc client,
>> We could, though it would cause the reconnection to occur in the
>> backchannel code path. Haven't thought it through, but it sounds
>> cleaner to rebind the session (or reset it if the server went away) in
>> the forechannel context. I wonder if it would be acceptable to simply
>> drop the backchannel request, and have the forechannel reestablish the
>> connection (on a retransmission)?
>>
>>> but
>>> I'm still concerned that using this path for sending the callback
>>> replies is wrong.
>> The mainline RPC call_transmit() is already able to deal with RPC
>> transmissions that don't expect a reply. This is pretty similar to
>> sending a reply, so we're leveraging the existing rpc_xprt.
>>
>> One alternative could be to construct an svc_xprt for the backchannel
>> and use svc_send() for the reply. I wonder if it's really a better
>> approach since we already have the rpc_xprt available.
>
> I failed to mention that with the existing approach we're able to
> synchronize forechannel calls and backchannel replies on the rpc_xprt
> (XPRT_LOCKED bit). It uses the synchronization mechanism already in
> mainline to prevent mixing the payload of requests and replies.
Right, but we need to do this at the xprt layer. I don't think that
this mandates starting the reply process from the rpc layer
as we do today in bc_send.
Note that bc_svc_process gets both an rpc_rqst and a svc_rqst.
Although we forge a svc_rqst, including setting up its rq_xprt
(which is a svc_xprt), I'm not sure exactly what it is used for.
Eventually, we end up sending the reply (via bc_send) using
the rpc_rqst and its associated rpc_xprt. This will allow
us to synchronize properly on the bi-directional channel.
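In other words, something along these lines at the xprt level (a rough
sketch only; xprt_bc_send_reply is a made-up name, and error handling and
waking up XPRT_LOCKED waiters are omitted):

        /* sketch: send the callback reply at the xprt level, serialized
         * on XPRT_LOCKED, without building an rpc_task or entering the
         * rpc state machine */
        static int xprt_bc_send_reply(struct rpc_rqst *req)
        {
                struct rpc_xprt *xprt = req->rq_xprt;

                if (test_and_set_bit(XPRT_LOCKED, &xprt->state))
                        return -EAGAIN; /* forechannel owns the socket */

                /* transmit req->rq_snd_buf directly on the transport,
                 * e.g. through its send_request-like path */

                clear_bit(XPRT_LOCKED, &xprt->state);
                return 0;
        }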
>
> - ricardo
>
[snip]
> From: Benny Halevy [mailto:[email protected]]
[snip]
> >>> Ricardo,
> >>>
> >>> rpc_run_bc_task does not set task_setup_data.rpc_client when calling
> >>> rpc_new_task.
> >>> We might be able to use the forechannel rpc client,
> >> We could, though it would cause the reconnection to occur in the
> >> backchannel code path. Haven't thought it through, but it sounds
> >> cleaner to rebind the session (or reset it if the server went away) in
> >> the forechannel context. I wonder if it would be acceptable to simply
> >> drop the backchannel request, and have the forechannel reestablish the
> >> connection (on a retransmission)?
> >>
> >>> but
> >>> I'm still concerned that using this path for sending the callback
> >>> replies is wrong.
> >> The mainline RPC call_transmit() is already able to deal with RPC
> >> transmissions that don't expect a reply. This is pretty similar to
> >> sending a reply, so we're leveraging the existing rpc_xprt.
> >>
> >> One alternative could be to construct an svc_xprt for the backchannel
> >> and use svc_send() for the reply. I wonder if it's really a better
> >> approach since we already have the rpc_xprt available.
> >
> > I failed to mention that with the existing approach we're able to
> > synchronize forechannel calls and backchannel replies on the rpc_xprt
> > (XPRT_LOCKED bit). It uses the synchronization mechanism already in
> > mainline to prevent mixing the payload of requests and replies.
>
> Right, but we need to do this at the xprt layer. I don't think that
> this mandates starting the reply process from the rpc layer
> as we do today in bc_send.
>
> Note that bc_svc_process gets both an rpc_rqst and a svc_rqst.
> Although we forge a svc_rqst including setting up its rq_xprt
> which is a svc_xprt, I'm not sure what exactly it is used for.
Correct, we build the svc_rqst so that we can use it in the processing
of the backchannel request. This way we reuse all of the existing v4
backchannel logic for decoding, authenticating, dispatching, and result
checking in svc_process_common().
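Roughly, what bc_svc_process does boils down to this (heavily simplified
from memory; the sketch uses a made-up name and approximate arguments):

        /* simplified sketch of the backchannel processing described
         * above: the forged svc_rqst drives the generic server-side
         * code, the rpc_rqst and its rpc_xprt carry the reply back on
         * the shared connection */
        static int bc_process_reply(struct rpc_rqst *req, struct svc_rqst *rqstp)
        {
                /* decode, authenticate, dispatch and check the result
                 * of the callback with the existing svc logic */
                if (!svc_process_common(rqstp, &rqstp->rq_arg.head[0],
                                        &rqstp->rq_res.head[0]))
                        return 0;       /* request dropped, nothing to send */

                /* send the reply via the rpc_rqst and its rpc_xprt */
                return bc_send(req);
        }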
> Eventually, we end up sending the reply (via bc_send) using
> the rpc_rqst and its associated rpc_xprt. This will allow
> us to synchronize properly on the bi-directional channel.
If we really want to avoid the RPC state machine, I guess we could first
serialize on the rpc_xprt by calling test_and_set_bit(XPRT_LOCKED,
&xprt->state) to ensure no outgoing RPC grabs the transport. We then
issue the reply using svc_send() with the svc_rqst. This avoids
having to copy the send buffer into the rpc_rqst, avoids building an
rpc_task, and avoids entering the RPC state machine, though it requires
extra synchronization on the svc_xprt.xpt_mutex.
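Something like this, perhaps (just a sketch; bc_send_reply is a made-up
name and error handling is omitted):

        /* sketch of the proposal above: fence off outgoing forechannel
         * RPCs with XPRT_LOCKED, then let svc_send() transmit the reply
         * from the forged svc_rqst */
        static int bc_send_reply(struct rpc_xprt *xprt, struct svc_rqst *rqstp)
        {
                int ret;

                if (test_and_set_bit(XPRT_LOCKED, &xprt->state))
                        return -EAGAIN; /* an outgoing RPC owns the transport */

                /* svc_send() serializes again on the svc_xprt's xpt_mutex */
                ret = svc_send(rqstp);

                clear_bit(XPRT_LOCKED, &xprt->state);
                /* would also need to wake up XPRT_LOCKED waiters here */
                return ret;
        }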
Is this along the lines of what you're thinking? I can give it a try...
- ricardo
>
> >
> > - ricardo
> >
>
> [snip]