2015-04-01 08:06:49

by Robert Baldyga

[permalink] [raw]
Subject: Re: usb: gadget: f_fs: O_NONBLOCK waits MAX_SCHEDULE_TIMEOUT

Hi,

On 03/31/2015 08:53 PM, Baxter, Jim wrote:
> Hi,
>
> I have been looking at an issue where a phone that is the Function FS
> host sometimes locks up and causes the function:
> static ssize_t ffs_epfile_io(struct file *file, struct ffs_io_data
> *io_data) in drivers/usb/gadget/function/f_fs.c to timeout after
> MAX_SCHEDULE_TIMEOUT jiffies.
>
> We are opening the ffs connection with the O_NONBLOCK option and this
> timeout seems to be far too long.
>
> The code causing the wait is here (line 834):
>
> if (unlikely(ret < 0)) {
> /* nop */
> } else if (unlikely(
> wait_for_completion_interruptible(&done))) {
> ret = -EINTR;
> usb_ep_dequeue(ep->ep, req);
> } else {
>
> The function call wait_for_completion_interruptible(&done) equates to:
> wait_for_completion_interruptible_timeout(&done, MAX_SCHEDULE_TIMEOUT);
>
>
> I was wondering what to change to fix this, is it reasonable to use a
> very small timeout if it is O_NONBLOCK or is any blocking not allowed?
>
> I would appreciate any thoughts on how this should work?

FunctionFS is very specific, because read/write operations are directly
translated into USB requests, which are asynchronous, so you cannot use
O_NONBLOCK.

If you need non-blocking API you can use Asynchronous I/O (AIO). You can
find some examples in kernel sources (tools/usb/ffs-aio-example/).

Br,
Robert Baldyga


2015-04-01 17:29:16

by Jim Baxter

[permalink] [raw]
Subject: Re: usb: gadget: f_fs: O_NONBLOCK waits MAX_SCHEDULE_TIMEOUT

>
> FunctionFS is very specific, because read/write operations are directly
> translated into USB requests, which are asynchronous, so you cannot use
> O_NONBLOCK.
>
> If you need non-blocking API you can use Asynchronous I/O (AIO). You can
> find some examples in kernel sources (tools/usb/ffs-aio-example/).
>
> Br,
> Robert Baldyga
>

Thank you, that sounds like the best approach.
In this case I think perhaps the long wait without any data is an
problem with the imx6 Chipidea USB controller.

I guess it should suspend and drop the connections if there is no
traffic for more than 10ms?

Thanks.
Jim Baxter

2015-04-02 00:25:37

by Peter Chen

[permalink] [raw]
Subject: Re: usb: gadget: f_fs: O_NONBLOCK waits MAX_SCHEDULE_TIMEOUT

On Wed, Apr 01, 2015 at 06:29:05PM +0100, Baxter, Jim wrote:
> >
> > FunctionFS is very specific, because read/write operations are directly
> > translated into USB requests, which are asynchronous, so you cannot use
> > O_NONBLOCK.
> >
> > If you need non-blocking API you can use Asynchronous I/O (AIO). You can
> > find some examples in kernel sources (tools/usb/ffs-aio-example/).
> >
> > Br,
> > Robert Baldyga
> >
>
> Thank you, that sounds like the best approach.
> In this case I think perhaps the long wait without any data is an
> problem with the imx6 Chipidea USB controller.

What's the possible problem?

>
> I guess it should suspend and drop the connections if there is no
> traffic for more than 10ms?
>

If the Device side NAK host's IN/OUT token continually, the pipe will
not be stopped, the host will send token continually until the application
cancel this request.

--

Best Regards,
Peter Chen

2015-05-22 11:12:47

by Jim Baxter

[permalink] [raw]
Subject: Re: usb: gadget: f_fs: O_NONBLOCK waits MAX_SCHEDULE_TIMEOUT


> On Wed, Apr 01, 2015 at 06:29:05PM +0100, Baxter, Jim wrote:
>>>
>>> FunctionFS is very specific, because read/write operations are directly
>>> translated into USB requests, which are asynchronous, so you cannot use
>>> O_NONBLOCK.
>>>
>>> If you need non-blocking API you can use Asynchronous I/O (AIO). You can
>>> find some examples in kernel sources (tools/usb/ffs-aio-example/).
>>>
>>> Br,
>>> Robert Baldyga
>>>
>>
>> Thank you, that sounds like the best approach.
>> In this case I think perhaps the long wait without any data is an
>> problem with the imx6 Chipidea USB controller.
>
> What's the possible problem?

Sorry for the delay in replying, I have been getting some more details
with a USB Analyser.

The scenario is that the NCM device is enumerating so we see the
messages to:

SetAddress (1)
GetDescriptor (Device)
GetDescriptor (StringN)
GetDescriptor (Configuration)
SetConfiguration (1)
GetDescriptor (String iInterface)
GetDescriptor (String iInterface)

At this point the NCM host sends Writes to the F_FS EP0 but for some
reason the host device does not respond and only issues SOF packets for
hours. This happens occasionally and is fixed by turning the device off
and on again.


Unless I am mistaken from a NCM gadget point of view the attached device
is working correctly and there is no way to know it has failed, is that
correct?

>
>>
>> I guess it should suspend and drop the connections if there is no
>> traffic for more than 10ms?
>>
>
> If the Device side NAK host's IN/OUT token continually, the pipe will
> not be stopped, the host will send token continually until the application
> cancel this request.
>

2015-05-25 01:55:58

by Peter Chen

[permalink] [raw]
Subject: RE: usb: gadget: f_fs: O_NONBLOCK waits MAX_SCHEDULE_TIMEOUT


> >>>
> >>> FunctionFS is very specific, because read/write operations are
> >>> directly translated into USB requests, which are asynchronous, so
> >>> you cannot use O_NONBLOCK.
> >>>
> >>> If you need non-blocking API you can use Asynchronous I/O (AIO). You
> >>> can find some examples in kernel sources (tools/usb/ffs-aio-example/).
> >>>
> >>> Br,
> >>> Robert Baldyga
> >>>
> >>
> >> Thank you, that sounds like the best approach.
> >> In this case I think perhaps the long wait without any data is an
> >> problem with the imx6 Chipidea USB controller.
> >
> > What's the possible problem?
>
> Sorry for the delay in replying, I have been getting some more details with a
> USB Analyser.
>
> The scenario is that the NCM device is enumerating so we see the messages to:
>
> SetAddress (1)
> GetDescriptor (Device)
> GetDescriptor (StringN)
> GetDescriptor (Configuration)
> SetConfiguration (1)
> GetDescriptor (String iInterface)
> GetDescriptor (String iInterface)
>
> At this point the NCM host sends Writes to the F_FS EP0 but for some reason
> the host device does not respond and only issues SOF packets for hours. This
> happens occasionally and is fixed by turning the device off and on again.
>
>

We may find this 'some reason', is it device error or host error?

Do you have below patch in your code:

commit 953c66469735aed8d2ada639a72b150f01dae605
Author: Abbas Raza <[email protected]>
Date: Thu Jul 17 19:34:31 2014 +0800

usb: chipidea: udc: Disable auto ZLP generation on ep0

There are 2 methods for ZLP (zero-length packet) generation:
1) In software
2) Automatic generation by device controller

1) is implemented in UDC driver and it attaches ZLP to IN packet if
descriptor->size < wLength
2) can be enabled/disabled by setting ZLT bit in the QH

Peter

> Unless I am mistaken from a NCM gadget point of view the attached device is
> working correctly and there is no way to know it has failed, is that correct?
>
> >
> >>
> >> I guess it should suspend and drop the connections if there is no
> >> traffic for more than 10ms?
> >>
> >
> > If the Device side NAK host's IN/OUT token continually, the pipe will
> > not be stopped, the host will send token continually until the
> > application cancel this request.
> >

2015-06-01 16:25:36

by Jim Baxter

[permalink] [raw]
Subject: Re: usb: gadget: f_fs: O_NONBLOCK waits MAX_SCHEDULE_TIMEOUT

>>>>>
>>>>> FunctionFS is very specific, because read/write operations are
>>>>> directly translated into USB requests, which are asynchronous, so
>>>>> you cannot use O_NONBLOCK.
>>>>>
>>>>> If you need non-blocking API you can use Asynchronous I/O (AIO). You
>>>>> can find some examples in kernel sources (tools/usb/ffs-aio-example/).
>>>>>
>>>>> Br,
>>>>> Robert Baldyga
>>>>>
>>>>
>>>> Thank you, that sounds like the best approach.
>>>> In this case I think perhaps the long wait without any data is an
>>>> problem with the imx6 Chipidea USB controller.
>>>
>>> What's the possible problem?
>>
>> Sorry for the delay in replying, I have been getting some more details with a
>> USB Analyser.
>>
>> The scenario is that the NCM device is enumerating so we see the messages to:
>>
>> SetAddress (1)
>> GetDescriptor (Device)
>> GetDescriptor (StringN)
>> GetDescriptor (Configuration)
>> SetConfiguration (1)
>> GetDescriptor (String iInterface)
>> GetDescriptor (String iInterface)
>>
>> At this point the NCM host sends Writes to the F_FS EP0 but for some reason
>> the host device does not respond and only issues SOF packets for hours. This
>> happens occasionally and is fixed by turning the device off and on again.
>>
>>
>
> We may find this 'some reason', is it device error or host error?
>
> Do you have below patch in your code:
>
> commit 953c66469735aed8d2ada639a72b150f01dae605
> Author: Abbas Raza <[email protected]>
> Date: Thu Jul 17 19:34:31 2014 +0800
>
> usb: chipidea: udc: Disable auto ZLP generation on ep0
>
> There are 2 methods for ZLP (zero-length packet) generation:
> 1) In software
> 2) Automatic generation by device controller
>
> 1) is implemented in UDC driver and it attaches ZLP to IN packet if
> descriptor->size < wLength
> 2) can be enabled/disabled by setting ZLT bit in the QH
>
> Peter

Yes I have that patch, could that be a problem?

Jim

>
>> Unless I am mistaken from a NCM gadget point of view the attached device is
>> working correctly and there is no way to know it has failed, is that correct?
>>
>>>
>>>>
>>>> I guess it should suspend and drop the connections if there is no
>>>> traffic for more than 10ms?
>>>>
>>>
>>> If the Device side NAK host's IN/OUT token continually, the pipe will
>>> not be stopped, the host will send token continually until the
>>> application cancel this request.
>>>

2015-06-02 03:01:23

by Peter Chen

[permalink] [raw]
Subject: RE: usb: gadget: f_fs: O_NONBLOCK waits MAX_SCHEDULE_TIMEOUT


> >>>>
> >>>> Thank you, that sounds like the best approach.
> >>>> In this case I think perhaps the long wait without any data is an
> >>>> problem with the imx6 Chipidea USB controller.
> >>>
> >>> What's the possible problem?
> >>
> >> Sorry for the delay in replying, I have been getting some more
> >> details with a USB Analyser.
> >>
> >> The scenario is that the NCM device is enumerating so we see the
> messages to:
> >>
> >> SetAddress (1)
> >> GetDescriptor (Device)
> >> GetDescriptor (StringN)
> >> GetDescriptor (Configuration)
> >> SetConfiguration (1)
> >> GetDescriptor (String iInterface)
> >> GetDescriptor (String iInterface)
> >>
> >> At this point the NCM host sends Writes to the F_FS EP0 but for some
> >> reason the host device does not respond and only issues SOF packets
> >> for hours. This happens occasionally and is fixed by turning the device off
> and on again.
> >>
> >>
> >
> > We may find this 'some reason', is it device error or host error?
> >
> > Do you have below patch in your code:
> >
> > commit 953c66469735aed8d2ada639a72b150f01dae605
> > Author: Abbas Raza <[email protected]>
> > Date: Thu Jul 17 19:34:31 2014 +0800
> >
> > usb: chipidea: udc: Disable auto ZLP generation on ep0
> >
> > There are 2 methods for ZLP (zero-length packet) generation:
> > 1) In software
> > 2) Automatic generation by device controller
> >
> > 1) is implemented in UDC driver and it attaches ZLP to IN packet if
> > descriptor->size < wLength
> > 2) can be enabled/disabled by setting ZLT bit in the QH
> >
> > Peter
>
> Yes I have that patch, could that be a problem?
>

No, you need that patch.
Get to know which side has problem first.

Peter

> Jim
>
> >
> >> Unless I am mistaken from a NCM gadget point of view the attached
> >> device is working correctly and there is no way to know it has failed, is that
> correct?
> >>
> >>>
> >>>>
> >>>> I guess it should suspend and drop the connections if there is no
> >>>> traffic for more than 10ms?
> >>>>
> >>>
> >>> If the Device side NAK host's IN/OUT token continually, the pipe
> >>> will not be stopped, the host will send token continually until the
> >>> application cancel this request.
> >>>