by Felipe Balbi

[permalink] [raw]

Subject: Re: [PATCH v7 05/10] usb: dwc3: make controller clear transfer resources after complete

Hi,

Anurag Kumar Vulisha <[email protected]> writes:
> @@ -2487,6 +2497,11 @@ static void dwc3_endpoint_interrupt(struct dwc3 *dwc,
> }
>
> switch (event->endpoint_event) {
> + case DWC3_DEPEVT_XFERCOMPLETE:
> + if (!dep->stream_capable)
> + break;
> + dep->flags &= ~DWC3_EP_TRANSFER_STARTED;
> + /* Fall Through */

instead, let's add a proper handler for this:

dwc3_gadget_endpoint_transfer_complete(dep, event);

That handler should be "self-sufficient". IOW, we shouldn't have this
fall through here. This means that the other patch where you modify
dwc3_gadget_transfer_in_progress() shouldn't be necessary, since that
event shouldn't run for stream capable endpoints.

While rewriting the patches, please rebase on my testing/next as I have
applied a few of the patches in this series.

--
balbi

Attachments:

signature.asc (847.00 B)

2018-12-05 09:09:33

Hi,

Anurag Kumar Vulisha <[email protected]> writes:
> HI Felipe,
>
>>-----Original Message-----
>>From: Felipe Balbi [mailto:[email protected]]
>>Sent: Friday, December 07, 2018 11:42 AM
>>To: Anurag Kumar Vulisha <[email protected]>; Greg Kroah-Hartman
>><[email protected]>; Shuah Khan <[email protected]>; Alan Stern
>><[email protected]>; Johan Hovold <[email protected]>; Jaejoong Kim
>><[email protected]>; Benjamin Herrenschmidt <[email protected]>;
>>Roger Quadros <[email protected]>; Manu Gautam <[email protected]>;
>>[email protected]; Bart Van Assche <[email protected]>; Mike
>>Christie <[email protected]>; Matthew Wilcox <[email protected]>; Colin Ian
>>King <[email protected]>
>>Cc: [email protected]; [email protected];
>>[email protected]; Thinh Nguyen <[email protected]>; Tejas Joglekar
>><[email protected]>; Ajay Yugalkishore Pandey <[email protected]>
>>Subject: RE: [PATCH v7 09/10] usb: dwc3: Check for IOC/LST bit in both event->status
>>and TRB->ctrl fields
>>
>>
>>Hi,
>>
>>Anurag Kumar Vulisha <[email protected]> writes:
>>>>> @@ -2286,7 +2286,12 @@ static int
>>>>dwc3_gadget_ep_reclaim_completed_trb(struct dwc3_ep *dep,
>>>>> if (event->status & DEPEVT_STATUS_SHORT && !chain)
>>>>> return 1;
>>>>>
>>>>> - if (event->status & (DEPEVT_STATUS_IOC | DEPEVT_STATUS_LST))
>>>>> + if ((event->status & DEPEVT_STATUS_IOC) &&
>>>>> + (trb->ctrl & DWC3_TRB_CTRL_IOC))
>>>>> + return 1;
>>>>
>>>>this shouldn't be necessary. According to databook, event->status
>>>>contains the bits from the completed TRB. Which means that
>>>>event->status & IOC will always be equal to trb->ctrl & IOC.
>>>>
>>> Thanks for reviewing this patch. Lets consider an example where a
>>> request has num_sgs > 0 and each sg is mapped to a TRB and the last
>>> TRB has the IOC bit set. Once the controller is done with the
>>> transfer, it generates XferInProgress for the last TRB (since IOC bit
>>> is set). As a part of trb reclaim process
>>> dwc3_gadget_ep_reclaim_trb_sg() calls
>>> dwc3_gadget_ep_reclaim_completed_trb() for req->num_sgs times. Since
>>> the event already has the IOC bit set, the loop is exited from the
>>> loop at the very first TRB and the remaining TRBs (mapped to the sglist) are left
>>unhandled.
>>> To avoid this we modified the code to exit only if both TRB & event
>>> has the IOC bit set.
>>
>>Seems like IOC case should just test for chain flag as well:
>>
>
> Okay. Along with this logic the code for updating chain bit should also be modified I guess.

not really

> Since the IOC bit is also set when there are not enough TRBs available, the code should be
> modified to not set DWC3_TRB_CTRL_CHN bit when the IOC bit is set. I will update below
> changes along with your suggestions and resend the patches.

no. Actually I don't think we're allowed to split a scatter/gather like
that. I did that quite a while ago, but I don't think we're allowed to
do so. What we should do, in that case, is not even queue that request
until we have enough for all members of the scatter/gather. But that's a
separate patch, anyway.

--
balbi

Attachments:

signature.asc (847.00 B)

2018-12-10 09:45:38

by Anurag Kumar Vulisha

[permalink] [raw]

Subject: RE: [PATCH v7 09/10] usb: dwc3: Check for IOC/LST bit in both event->status and TRB->ctrl fields

Hi Felipe,

>-----Original Message-----
>From: Felipe Balbi [mailto:[email protected]]
>Sent: Monday, December 10, 2018 12:24 PM
>To: Anurag Kumar Vulisha <[email protected]>; Greg Kroah-Hartman
><[email protected]>; Shuah Khan <[email protected]>; Alan Stern
><[email protected]>; Johan Hovold <[email protected]>; Jaejoong Kim
><[email protected]>; Benjamin Herrenschmidt <[email protected]>;
>Roger Quadros <[email protected]>; Manu Gautam <[email protected]>;
>[email protected]; Bart Van Assche <[email protected]>; Mike
>Christie <[email protected]>; Matthew Wilcox <[email protected]>; Colin Ian
>King <[email protected]>
>Cc: [email protected]; [email protected];
>[email protected]; Thinh Nguyen <[email protected]>; Tejas Joglekar
><[email protected]>; Ajay Yugalkishore Pandey <[email protected]>
>Subject: RE: [PATCH v7 09/10] usb: dwc3: Check for IOC/LST bit in both event->status
>and TRB->ctrl fields
>
>
>Hi,
>
>Anurag Kumar Vulisha <[email protected]> writes:
>> HI Felipe,
>>
>>>-----Original Message-----
>>>From: Felipe Balbi [mailto:[email protected]]
>>>Sent: Friday, December 07, 2018 11:42 AM
>>>To: Anurag Kumar Vulisha <[email protected]>; Greg Kroah-Hartman
>>><[email protected]>; Shuah Khan <[email protected]>; Alan Stern
>>><[email protected]>; Johan Hovold <[email protected]>; Jaejoong Kim
>>><[email protected]>; Benjamin Herrenschmidt <[email protected]>;
>>>Roger Quadros <[email protected]>; Manu Gautam <[email protected]>;
>>>[email protected]; Bart Van Assche <[email protected]>; Mike
>>>Christie <[email protected]>; Matthew Wilcox <[email protected]>; Colin
>Ian
>>>King <[email protected]>
>>>Cc: [email protected]; [email protected];
>>>[email protected]; Thinh Nguyen <[email protected]>; Tejas
>Joglekar
>>><[email protected]>; Ajay Yugalkishore Pandey
><[email protected]>
>>>Subject: RE: [PATCH v7 09/10] usb: dwc3: Check for IOC/LST bit in both event-
>>status
>>>and TRB->ctrl fields
>>>
>>>
>>>Hi,
>>>
>>>Anurag Kumar Vulisha <[email protected]> writes:
>>>>>> @@ -2286,7 +2286,12 @@ static int
>>>>>dwc3_gadget_ep_reclaim_completed_trb(struct dwc3_ep *dep,
>>>>>> if (event->status & DEPEVT_STATUS_SHORT && !chain)
>>>>>> return 1;
>>>>>>
>>>>>> - if (event->status & (DEPEVT_STATUS_IOC | DEPEVT_STATUS_LST))
>>>>>> + if ((event->status & DEPEVT_STATUS_IOC) &&
>>>>>> + (trb->ctrl & DWC3_TRB_CTRL_IOC))
>>>>>> + return 1;
>>>>>
>>>>>this shouldn't be necessary. According to databook, event->status
>>>>>contains the bits from the completed TRB. Which means that
>>>>>event->status & IOC will always be equal to trb->ctrl & IOC.
>>>>>
>>>> Thanks for reviewing this patch. Lets consider an example where a
>>>> request has num_sgs > 0 and each sg is mapped to a TRB and the last
>>>> TRB has the IOC bit set. Once the controller is done with the
>>>> transfer, it generates XferInProgress for the last TRB (since IOC bit
>>>> is set). As a part of trb reclaim process
>>>> dwc3_gadget_ep_reclaim_trb_sg() calls
>>>> dwc3_gadget_ep_reclaim_completed_trb() for req->num_sgs times. Since
>>>> the event already has the IOC bit set, the loop is exited from the
>>>> loop at the very first TRB and the remaining TRBs (mapped to the sglist) are left
>>>unhandled.
>>>> To avoid this we modified the code to exit only if both TRB & event
>>>> has the IOC bit set.
>>>
>>>Seems like IOC case should just test for chain flag as well:
>>>
>>
>> Okay. Along with this logic the code for updating chain bit should also be modified I
>guess.
>
>not really
>
>> Since the IOC bit is also set when there are not enough TRBs available, the code
>should be
>> modified to not set DWC3_TRB_CTRL_CHN bit when the IOC bit is set. I will update
>below
>> changes along with your suggestions and resend the patches.
>
>no. Actually I don't think we're allowed to split a scatter/gather like
>that. I did that quite a while ago, but I don't think we're allowed to
>do so. What we should do, in that case, is not even queue that request
>until we have enough for all members of the scatter/gather. But that's a
>separate patch, anyway.
>

Okay. I have a doubt here, not pushing the request until all sgs are mapped to enough TRBs
might remove the driver complexity but reduce the performance (since we are waiting
until enough TRBs are available). Are we okay with that?

Thanks,
Anurag Kumar Vulisha

2018-12-10 09:45:58

Hi Felipe,

Resending...

Since I am waiting on your suggestion, thought of giving remainder.

Thanks,
Anurag Kumar Vulisha

>-----Original Message-----
>From: Anurag Kumar Vulisha
>Sent: Wednesday, December 12, 2018 8:41 PM
>To: 'Alan Stern' <[email protected]>; Felipe Balbi <[email protected]>
>Cc: Greg Kroah-Hartman <[email protected]>; Shuah Khan
><[email protected]>; Johan Hovold <[email protected]>; Jaejoong Kim
><[email protected]>; Benjamin Herrenschmidt <[email protected]>;
>Roger Quadros <[email protected]>; Manu Gautam <[email protected]>;
>[email protected]; Bart Van Assche <[email protected]>; Mike
>Christie <[email protected]>; Matthew Wilcox <[email protected]>; Colin Ian
>King <[email protected]>; [email protected]; linux-
>[email protected]; [email protected]; Thinh Nguyen
><[email protected]>; Tejas Joglekar <[email protected]>; Ajay
>Yugalkishore Pandey <[email protected]>
>Subject: RE: [PATCH v7 01/10] usb: gadget: udc: Add timer support for usb requests
>
>
>Hi Felipe,
>
>>-----Original Message-----
>>From: Alan Stern [mailto:[email protected]]
>>Sent: Friday, December 07, 2018 10:40 PM
>>To: Felipe Balbi <[email protected]>
>>Cc: Anurag Kumar Vulisha <[email protected]>; Greg Kroah-Hartman
>><[email protected]>; Shuah Khan <[email protected]>; Johan Hovold
>><[email protected]>; Jaejoong Kim <[email protected]>; Benjamin
>>Herrenschmidt <[email protected]>; Roger Quadros <[email protected]>;
>Manu
>>Gautam <[email protected]>; [email protected]; Bart Van
>>Assche <[email protected]>; Mike Christie <[email protected]>; Matthew
>>Wilcox <[email protected]>; Colin Ian King <[email protected]>; linux-
>>[email protected]; [email protected]; [email protected];
>>Thinh Nguyen <[email protected]>; Tejas Joglekar
>><[email protected]>; Ajay Yugalkishore Pandey <[email protected]>
>>Subject: RE: [PATCH v7 01/10] usb: gadget: udc: Add timer support for usb requests
>>
>>On Fri, 7 Dec 2018, Felipe Balbi wrote:
>>
>>>
>>> hi,
>>>
>>> Anurag Kumar Vulisha <[email protected]> writes:
>>> >>Does the data book suggest a value for the timeout?
>>> >>
>>> >
>>> > No, the databook doesn't mention about the timeout value
>>> >
>>> >>> >At this point, it seems that the generic approach will be messier than having
>>every
>>> >>> >controller driver implement its own fix. At least, that's how it appears to me.
>>>
>>> Why, if the UDC implementation will, anyway, be a timer?
>>
>>It's mostly a question of what happens when the timer expires. (After
>>all, starting a timer is just as easy to do in a UDC driver as it is in
>>the core.) When the timer expires, what can the core do?
>>
>>Don't say it can cancel the offending request and resubmit it. That
>>leads to the sort of trouble we discussed earlier in this thread. In
>>particular, we don't want the class driver's completion routine to be
>>called when the cancel occurs. Furthermore, this leads to a race:
>>Suppose the class driver decides to cancel the request at the same time
>>as the core does a cancel and resubmit. Then the class driver's cancel
>>could get lost and the request would remain on the UDC's queue.
>>
>>What you really want to do is issue the appropriate stop and restart
>>commands to the hardware, while leaving the request logically active
>>the entire time. The UDC core can't do this, but a UDC driver can.
>>
>
>I agree with Alan's comment as it looks like there may be some corner cases
>where the issue may occur with dequeue approach. Are you okay if the timeout
>handler gets moved to the dwc3 driver (the timers still added into udc layer)?
>Please let us know your suggestion on this
>
>Thanks,
>Anurag Kumar Vulisha
>
>>> >>(Especially if dwc3 is the only driver affected.)
>>> >
>>> > As discussed above, the issue may happen with other gadgets too. As I got divide
>>opinions
>>> > on this implementation and both the implementations looks fine to me, I am
>little
>>confused
>>> > on which should be implemented.
>>> >
>>> > @Felipe: Do you agree with Alan's implementation? Please let us know your
>>suggestion
>>> > on this.
>>>
>>> I still think a generic timer is a better solution since it has other uses.
>>
>>Putting a struct timer into struct usb_request is okay with me, but I
>>wouldn't go any farther than that.
>>
>>> >>Since the purpose of the timeout is to detect a deadlock caused by a
>>> >>hardware bug, I suggest a fixed and relatively short timeout value such
>>> >>as one second. Cancelling and requeuing a few requests at 1-second
>>> >>intervals shouldn't add very much overhead.
>>>
>>> I wouldn't call this a HW bug though. This is just how the UDC
>>> behaves. There are N streams and host can move data in any stream at any
>>> time. This means that host & gadget _can_ disagree on what stream to
>>> start next.
>>
>>But the USB 3 spec says what should happen when the host and gadget
>>disagree in this way, doesn't it? And it doesn't say that they should
>>deadlock. :-) Or have I misread the spec?
>>
>>> One way to avoid this would be to never pre-start any streams and always
>>> rely on XferNotReady, but that would mean greatly reduced throughput for
>>> streams.
>>
>>It would be good if there was some way to actively detect the problem
>>instead of passively waiting for a timer to expire.
>>
>>Alan Stern