Subject: Re: [PATCH v3] usb: dwc3: gadget: Prevent EP queuing while stopping transfers
To: Andy Shevchenko
Cc: Felipe Balbi, Greg Kroah-Hartman, Linux Kernel Mailing List, USB
References: <1615507142-23097-1-git-send-email-wcheng@codeaurora.org> <716dca12-2bfc-789f-ca74-5555852e4c8b@codeaurora.org> <39fdd3c8-9682-6109-f47d-7f7bffc4b85e@codeaurora.org> <5252d085-bbd6-0409-a8ca-2b73fe269259@codeaurora.org>
From: Wesley Cheng
Message-ID: <8e229264-03d6-fe79-ed1a-5f7ee7b722b4@codeaurora.org>
Date: Tue, 23 Mar 2021 12:06:04 -0700
X-Mailing-List: linux-kernel@vger.kernel.org

On 3/23/2021 10:27 AM, Andy Shevchenko wrote:
> On Tue, Mar 23, 2021 at 1:19 AM Wesley Cheng wrote:
>>
>> Hi Andy,
>>
>> On 3/22/2021 2:14 PM, Andy Shevchenko wrote:
>>> On Mon, Mar 22, 2021 at 10:06 PM Wesley Cheng wrote:
>>>>
>>>> Hi Andy,
>>>>
>>>> On 3/22/2021 12:34 PM, Andy Shevchenko wrote:
>>>>> On Mon, Mar 22, 2021 at 8:49 PM Wesley Cheng
>>>>> wrote:
>>>>>>
>>>>>> Hi Andy,
>>>>>>
>>>>>> On 3/22/2021 5:48 AM, Andy Shevchenko wrote:
>>>>>>> On Fri, Mar 12, 2021 at 2:01 AM Wesley Cheng wrote:
>>>>>>>>
>>>>>>>> In the situations where the DWC3 gadget stops active transfers, once
>>>>>>>> calling the dwc3_gadget_giveback(), there is a chance where a function
>>>>>>>> driver can queue a new USB request in between the time where the dwc3
>>>>>>>> lock has been released and re-acquired. This occurs after we've already
>>>>>>>> issued an ENDXFER command. When the stop active transfers continues
>>>>>>>> to remove USB requests from all dep lists, the newly added request will
>>>>>>>> also be removed, while controller still has an active TRB for it.
>>>>>>>> This can lead to the controller accessing an unmapped memory address.
>>>>>>>>
>>>>>>>> Fix this by ensuring parameters to prevent EP queuing are set before
>>>>>>>> calling the stop active transfers API.
>>>>>>>
>>>>>>>
>>>>>>> commit f09ddcfcb8c569675066337adac2ac205113471f
>>>>>>> Author: Wesley Cheng
>>>>>>> Date: Thu Mar 11 15:59:02 2021 -0800
>>>>>>>
>>>>>>> usb: dwc3: gadget: Prevent EP queuing while stopping transfers
>>>>>>>
>>>>>>> effectively broke my gadget setup.
>>>>>>>
>>>>>>> The output of the kernel (followed by non responsive state of USB controller):
>>>>>>>
>>>>>>> [ 195.228586] using random self ethernet address
>>>>>>> [ 195.233104] using random host ethernet address
>>>>>>> [ 195.245306] usb0: HOST MAC aa:bb:cc:dd:ee:f2
>>>>>>> [ 195.249732] usb0: MAC aa:bb:cc:dd:ee:f1
>>>>>>> # [ 195.773594] IPv6: ADDRCONF(NETDEV_CHANGE): usb0: link becomes ready
>>>>>>> [ 195.780585] ------------[ cut here ]------------
>>>>>>> [ 195.785217] dwc3 dwc3.0.auto: No resource for ep2in
>>>>>>> [ 195.790162] WARNING: CPU: 0 PID: 217 at drivers/usb/dwc3/gadget.c:360 dwc3_send_gadget_ep_cmd+0x4b9/0x670
>>>>>>> [ 195.799760] Modules linked in: usb_f_eem u_ether libcomposite brcmfmac brcmutil mmc_block pwm_lpss_pci pwm_lpss snd_sof_pci_intel_tng snd_sof_pci snd_sof_acpi_intel_byt snd_sof_intel_ipc snd_sof_acpi snd_sof snd_sof_nocodec spi_pxa2xx_platform snd_sof_xtensa_dsp spi_pxa2xx_pci extcon_intel_mrfld intel_mrfld_adc sdhci_pci cqhci sdhci mmc_core intel_mrfld_pwrbtn intel_soc_pmic_mrfld hci_uart btbcm btintel
>>>>>>> [ 195.835604] CPU: 0 PID: 217 Comm: irq/16-dwc3 Not tainted 5.12.0-rc4+ #60
>>>>>>> [ 195.842403] Hardware name: Intel Corporation Merrifield/BODEGA BAY, BIOS 542 2015.01.21:18.19.48
>>>>>>> [ 195.851191] RIP: 0010:dwc3_send_gadget_ep_cmd+0x4b9/0x670
>>>>>>> [ 195.856608] Code: cd 00 00 00 44 89 44 24 20 48 89 4c 24 18 e8 ee f7 e4 ff 48 8b 4c 24 18 4c 89 f2 48 c7 c7 b9 ed 4f a0 48 89 c6 e8 ef 24 43 00 <0f> 0b 41 be ea ff ff ff 44 8b 44 24 20 e9 80 fc ff ff 41 83 fe 92
>>>>>>> [ 195.875381] RSP: 0000:ffffa53c00373ba8 EFLAGS: 00010086
>>>>>>> [ 195.880617] RAX: 0000000000000000 RBX: 0000000000001387 RCX: 00000000ffffdfff
>>>>>>> [ 195.887755] RDX: 00000000ffffdfff RSI: 00000000ffffffea RDI: 0000000000000000
>>>>>>> [ 195.894893] RBP: ffff9ce8c8f2b028 R08: ffffffffa0732288 R09: 0000000000009ffb
>>>>>>> [ 195.902034] R10: 00000000ffffe000 R11: 3fffffffffffffff R12: 0000000000041006
>>>>>>> [ 195.909170] R13: ffffa53c00373c24 R14: ffff9ce8c11dadb0 R15: ffff9ce8c2861700
>>>>>>> [ 195.916310] FS: 0000000000000000(0000) GS:ffff9ce8fe200000(0000) knlGS:0000000000000000
>>>>>>> [ 195.924409] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
>>>>>>> [ 195.930161] CR2: 00000000f7f694a0 CR3: 0000000038e0c000 CR4: 00000000001006f0
>>>>>>> [ 195.937300] Call Trace:
>>>>>>> [ 195.939755] __dwc3_gadget_ep_enable+0x2d4/0x4e0
>>>>>>> [ 195.944393] ? dwc3_remove_requests.constprop.0+0x86/0x170
>>>>>>
>>>>>> Odd that this change would affect the USB enablement path, as it was
>>>>>> focused on the pullup disable path. Would you happen to have any
>>>>>> downstream changes on top of v5.12-rc4 we could review to see if they
>>>>>> are still required? (i.e., where is the dwc3_remove_requests() coming from
>>>>>> during ep enable)
>>>>>
>>>>> You may check my branch [1] on GH. Basically you may be interested in
>>>>> the commit:
>>>>> 0f86df1294ee7523060cc16eafaf4898c693eab0 REVERTME: usb: dwc3: gadget:
>>>>> skip endpoints ep[18]{in,out}
>>>>> Otherwise it's a clean v5.12-rc4 with a revert and another USB PHY
>>>>> suspend fix (which also shouldn't affect this).
>>>>
>>>> Can you link your GH reference?
>>>
>>> Oops, sorry.
>>> Here we are:
>>>
>>> [1]: https://github.com/andy-shev/linux/tree/eds-acpi
>>>
>> Thanks, I took a look and even tried it on my device running 5.12-rc4,
>> but wasn't able to see the same problem. Could you help collect the
>> ftrace after enabling the tracing KCONFIG and running the below sequence?
>>
>> 1. Mount debugfs
>> 2. Set up tracing instance
>>
>> mkdir /sys/kernel/debug/tracing/instances/usb
>> echo 1 > /sys/kernel/debug/tracing/instances/usb/events/dwc3/dwc3_complete_trb/enable
>> echo 1 > /sys/kernel/debug/tracing/instances/usb/events/dwc3/dwc3_ctrl_req/enable
>> echo 1 > /sys/kernel/debug/tracing/instances/usb/events/dwc3/dwc3_ep_dequeue/enable
>> echo 1 > /sys/kernel/debug/tracing/instances/usb/events/dwc3/dwc3_ep_queue/enable
>> echo 1 > /sys/kernel/debug/tracing/instances/usb/events/dwc3/dwc3_gadget_ep_cmd/enable
>> echo 1 > /sys/kernel/debug/tracing/instances/usb/events/dwc3/dwc3_gadget_ep_disable/enable
>> echo 1 > /sys/kernel/debug/tracing/instances/usb/events/dwc3/dwc3_gadget_ep_enable/enable
>> echo 1 > /sys/kernel/debug/tracing/instances/usb/events/dwc3/dwc3_gadget_giveback/enable
>> echo 1 > /sys/kernel/debug/tracing/instances/usb/events/dwc3/dwc3_prepare_trb/enable
>> echo 1 > /sys/kernel/debug/tracing/instances/usb/tracing_on
>>
>> 3. Run your test, which should include:
>> - echo "" > /sys/kernel/config/usb_gadget/g1/UDC
>> - echo > /sys/kernel/config/usb_gadget/g1/UDC
>>
>> 4. Collect the trace output:
>> cat /sys/kernel/debug/tracing/instances/usb/trace
>
> Here we are (I cherry-picked the reverted patch again, the rest stays the same) [2].
> On top I put a warning, so you may see timestamps.
>
> Dunno how long it will stay there, please confirm that you got it.
>
> [2]: https://paste.ubuntu.com/p/jNF565ypPp/
>

Hi Andy,

Great, thanks, got access to it. Will take a look and update you.

Thanks
Wesley Cheng

-- 
The Qualcomm Innovation Center, Inc. is a member of the Code Aurora Forum,
a Linux Foundation Collaborative Project
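
For convenience, step 2 of the tracing sequence quoted above could be condensed into a loop. This is only a minimal sketch, not something posted in the thread: the helper name enable_dwc3_events and its optional directory argument are hypothetical, while the event names and the default instance path are the ones listed in the quoted steps.

```shell
#!/bin/sh
# Sketch: enable the dwc3 trace events listed in the thread for one
# tracing instance. enable_dwc3_events is a hypothetical helper name;
# its argument is the instance directory and defaults to the path
# used in the thread.
enable_dwc3_events() {
    inst="${1:-/sys/kernel/debug/tracing/instances/usb}"

    # Creating a directory under instances/ creates the tracing instance.
    mkdir -p "$inst" || return 1

    for ev in dwc3_complete_trb dwc3_ctrl_req dwc3_ep_dequeue dwc3_ep_queue \
              dwc3_gadget_ep_cmd dwc3_gadget_ep_disable dwc3_gadget_ep_enable \
              dwc3_gadget_giveback dwc3_prepare_trb; do
        f="$inst/events/dwc3/$ev/enable"
        if [ -e "$f" ]; then
            echo 1 > "$f" || return 1
        else
            # Stop early if the kernel does not expose this event.
            echo "missing trace event: $ev" >&2
            return 1
        fi
    done

    # Start recording only after every event has been enabled.
    echo 1 > "$inst/tracing_on"
}
```

With debugfs mounted, calling enable_dwc3_events with no argument would be followed by the UDC unbind/bind test from step 3 and then by the cat of the instance's trace file from step 4. Failing on a missing event is a design choice of this sketch, so that an incomplete trace is noticed before the test is run.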