Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755618AbcJMNIZ (ORCPT ); Thu, 13 Oct 2016 09:08:25 -0400 Received: from mail-yw0-f173.google.com ([209.85.161.173]:34012 "EHLO mail-yw0-f173.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754936AbcJMNIS (ORCPT ); Thu, 13 Oct 2016 09:08:18 -0400 MIME-Version: 1.0 In-Reply-To: <87y41struu.fsf@linux.intel.com> References: <87r37kviic.fsf@linux.intel.com> <87inswvg80.fsf@linux.intel.com> <87a8e8vaif.fsf@linux.intel.com> <877f9cv7no.fsf@linux.intel.com> <87y41struu.fsf@linux.intel.com> From: Baolin Wang Date: Thu, 13 Oct 2016 20:41:17 +0800 Message-ID: Subject: Re: [PATCH v2] usb: dwc3: gadget: Wait for end transfer complete before free irq To: Felipe Balbi Cc: Greg KH , Mark Brown , USB , LKML Content-Type: text/plain; charset=UTF-8 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2514 Lines: 66 Hi Felipe, On 13 October 2016 at 19:23, Felipe Balbi wrote: > > Hi, > > Baolin Wang writes: >>>> Baolin Wang writes: >>>>>>>> I'm thinking this is a bug in configfs interface of Gadget API, not >>>>>>>> dwc3. The only reason for this to happen would be if we get to >>>>>>>> ->udc_stop() with endpoints still enabled. >>>>>>>> >>>>>>>> Can you check if that's the case? i.e. can you check if any endpoints >>>>>>>> are still enabled when we get here? >>>>>>> >>>>>>> No, it is not any endpoints are still enabled. Like I said in commit >>>>>>> message, we will start end transfer command when disable the endpoint, >>>>>>> if the end transfer command complete event comes after we have freed >>>>>>> the gadget irq, it will trigger the interrupt line all the time to >>>>>>> crash the system. >>>>>> >>>>>> I see what the problem is. Databook tells us we *must* set CMDIOC when >>>>>> issuing EndTransfer command and we should always wait for Command >>>>>> Complete IRQ. However, we took a shortcut and just delayed for 100us >>>>>> after issuing End Transfer. >>>>> >>>>> Yes, but 100us delay is not enough in some scenarios, like changing >>>>> function with configfs frequently, we will met problems. >>>> >>>> heh, 100us *is* enough. However we still get an IRQ because we requested >>>> for it. If you want to fix this, then you need to find a way to get rid >>>> of that 100us udelay() and add a proper wait_for_completion() to delay >>>> execution until command complete IRQ fires up. >>> >>> I haven't tested this yet, but it shows the idea (I think we might still >>> have a race with ep_queue if we're still waiting for End Transfer, but >> >> Yes, maybe we need check DWC3_EP_END_TRANSFER_PENDING flag when >> queuing one request. > > Yeah, I'll add that check later. I still need to make sure we would > still try to kick any transfers that might have been queued while > waiting for End Transfer Command Complete IRQ. OK. But I still worried if there are other races in some places which is not easy to find out by testing. No introducing race condition would be one better solution, anyway I would like to test the patch you send out firstly. > >>> that's easy to sort out by checking for DWC3_EP_END_TRANSFER_PENDING >>> before calling kick_transfer). Could you have a look and, perhaps, run a >>> test? >> >> Sure. I will test it and send out the result tomorrow. > > Thank you > > -- > balbi -- Baolin.wang Best Regards