Received: by 2002:a05:6a10:f347:0:0:0:0 with SMTP id d7csp1905265pxu; Fri, 27 Nov 2020 19:34:54 -0800 (PST) X-Google-Smtp-Source: ABdhPJyTzt9ddczRMPFhpB46gIi+zI2EUsWgY8uQm4XYUWR9w+sg2Jt8lX0Ibqai53JhxuqcUBxj X-Received: by 2002:a17:906:c826:: with SMTP id dd6mr11831622ejb.191.1606534494327; Fri, 27 Nov 2020 19:34:54 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1606534494; cv=none; d=google.com; s=arc-20160816; b=Z84hdB8h1LKRU2iMXCWwBwk2i3pQrRTXJoQgGE+J2NA/oOvEw3XCXeRHmnflDYXRYW dI/MBi4p0UUbV9Ge2+4wHfg0B3XuG4aAH/7NbY1343DocXIRQscu9RXVSoynBIY027rA v07Z2na6Vm0BN0y74hg5fqQVx6QL6fYJvH07aDgpyqQFfcj8TRLl/UZ6VZEeC6+I//vk AolhH2C/vn9Btc2qye975mZppF1CGUlv+Lnx4P0E4pYVnzK5WWYZqs3d159lve1OJPpG T8P45uONbVs98JqTGMHbO3ZzDkw/xC883d86V4y+YBk4s7KWkTPTrlizd5lmR+TV/Eei EoLQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to :mime-version:user-agent:date:message-id:from:references:cc:to :subject; bh=vFNO6DMFh9iWnmIZyPf39lDcEvjxXCdXaa5m7X3mWrE=; b=RwntHiJhfy2x9m0HsAM1P2paPZvrltBqkqFSpH1/RDsYIBSrY4zVd7GT90IArb8O2e Glq9vtl0NU7BvIxEKAOEa2GHlgWbedmLCwYchCpHCbnxFOzKM/Eil0UtUQruMVFPz++D 95yvqwLP2ARZLdwDaA5ke9baRjF9HL2kNPlJZQw788H6ysoSdU3+2IpEwQIagDYOLx+l ZbkZ6ip1Ph9m+72fQuJVvlz62lVU0FiW5Cq9CLdwSR1nolJ83LwHHmWtLYRVy/JAEf9+ O2MW/U9O814GArpeokXA973AEiXsaC8sGn9iI2v6Irf94ef1+w6n89JWZX3XaAReCVMd 6z9w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id n12si7365529edb.173.2020.11.27.19.34.30; Fri, 27 Nov 2020 19:34:54 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730967AbgK1D0N (ORCPT + 99 others); Fri, 27 Nov 2020 22:26:13 -0500 Received: from szxga04-in.huawei.com ([45.249.212.190]:8192 "EHLO szxga04-in.huawei.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726740AbgK1DWz (ORCPT ); Fri, 27 Nov 2020 22:22:55 -0500 Received: from DGGEMS405-HUB.china.huawei.com (unknown [172.30.72.58]) by szxga04-in.huawei.com (SkyGuard) with ESMTP id 4CjcHd27CLzkbkR; Sat, 28 Nov 2020 11:22:21 +0800 (CST) Received: from [10.67.102.118] (10.67.102.118) by DGGEMS405-HUB.china.huawei.com (10.3.19.205) with Microsoft SMTP Server id 14.3.487.0; Sat, 28 Nov 2020 11:22:47 +0800 Subject: Re: [PATCH] USB:ehci:fix an interrupt calltrace error To: Alan Stern CC: , , References: <1606361673-573-1-git-send-email-liulongfang@huawei.com> <20201126160830.GA827745@rowland.harvard.edu> <96b4d366-c94c-9708-da12-5693bf16b716@huawei.com> <20201127154718.GA861473@rowland.harvard.edu> From: liulongfang Message-ID: <3c2366c8-4b3e-dac0-48ad-6b33b6eed10e@huawei.com> Date: Sat, 28 Nov 2020 11:22:47 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0 MIME-Version: 1.0 In-Reply-To: <20201127154718.GA861473@rowland.harvard.edu> Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 8bit X-Originating-IP: [10.67.102.118] X-CFilter-Loop: Reflected Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2020/11/27 23:47, Alan Stern Wrote: > On Fri, Nov 27, 2020 at 10:29:03AM +0800, liulongfang wrote: >> On 2020/11/27 0:08, Alan Stern Wrote: >>> On Thu, Nov 26, 2020 at 11:34:33AM +0800, Longfang Liu wrote: >>>> The system goes to suspend when using USB audio player. This causes >>>> the USB device continuous send interrupt signal to system, When the >>>> number of interrupts exceeds 100000, the system will forcibly close >>>> the interrupts and output a calltrace error. >>> >>> This description is very confusing. USB devices do not send interrupt >>> signals to the host. Do you mean that the device sends a wakeup >>> request? Or do you mean something else? >> The irq type is IRQ_NONE,It's counted in the note_interrupt function. >> From the analysis of the driver code, that are indeed interrupt signals. > > Above you wrote: "the USB device continuous send interrupt signal to > system". But that's not correct. The interrupt signals are sent by the > USB host controller, not by the USB audio device. > OK, I will modify the description in the next patch. > The patch description should mention that this happens only with some > Synopsys host controllers. > >>>> When the system goes to suspend, the last interrupt is reported to >>>> the driver. At this time, the system has set the state to suspend. >>>> This causes the last interrupt to not be processed by the system and >>>> not clear the interrupt state flag. This uncleared interrupt flag >>>> constantly triggers new interrupt event. This causing the driver to >>>> receive more than 100,000 interrupts, which causes the system to >>>> forcibly close the interrupt report and report the calltrace error. >>> >>> If the driver receives an interrupt, it is supposed to process the event >>> even if the host controller is suspended. And when ehci_irq() runs, it >>> clears the bits that are set in the USBSYS register. >> When the host controller is suspended, the ehci_suspend() will clear >> the HCD_FLAG_HW_ACCESSIBLE, and then usb_hcd_irq() will return IRQ_NONE >> directly without calling ehci_irq(). > > Yes. But ehci_bus_suspend() runs _before_ the host controller is > suspended. While ehci_bus_suspend() is running, usb_hcd_irq() _will_ > call ehci_irq(), and ehci_irq() _will_ clear the status bits. > > After the host controller is suspended it is not supposed to generate > any interrupt signals at all, because ehci_suspend() writes 0 to the > USBINTR register, and it does this _before_ clearing > HCD_FLAG_HW_ACCESSIBLE. > According to this process, there should be no interruption storm problem, but the current fact is that the problem has occurred, so the actual execution process did not follow the correct process above. >>> Why is your system getting interrupts? That is, which bits are set in >>> the USBSTS register? >> BIT(5) and BIT(3) are setted, STS_IAA and STS_FLR. > > STS_FLR is not set in the USBINTR register, but STS_IAA is. So that's > the one which matters. > >>>> so, when the driver goes to sleep and changes the system state to >>>> suspend, the interrupt flag needs to be cleared. >>>> >>>> Signed-off-by: Longfang Liu >>>> --- >>>> drivers/usb/host/ehci-hub.c | 5 +++++ >>>> 1 file changed, 5 insertions(+) >>>> >>>> diff --git a/drivers/usb/host/ehci-hub.c b/drivers/usb/host/ehci-hub.c >>>> index ce0eaf7..5b13825 100644 >>>> --- a/drivers/usb/host/ehci-hub.c >>>> +++ b/drivers/usb/host/ehci-hub.c >>>> @@ -348,6 +348,11 @@ static int ehci_bus_suspend (struct usb_hcd *hcd) >>>> >>>> /* Any IAA cycle that started before the suspend is now invalid */ >>>> end_iaa_cycle(ehci); >>>> + >>>> + /* clear interrupt status */ >>>> + if (ehci->has_synopsys_hc_bug) >>>> + ehci_writel(ehci, INTR_MASK | STS_FLR, &ehci->regs->status); >>> >>> This is a very strange place to add your new code -- right in the middle >>> of the IAA and unlink handling. Why not put it in a more reasonable >>> place?After the IAA is processed, clear the STS_IAA interrupt state flag. >>> >>> Also, the patch description does not mention has_synopsys_hc_bug. The >>> meaning of this flag has no connection with the interrupt status >>> register, so why do you use it here? >> Because of our USB IP comes from Synopsys, and the uncleared flage is also caused by >> special hardware design, in addition, we have not tested other manufacturers' USB >> controllers.We don’t know if other manufacturers’ designs have this problem, >> so this modification is only limited to this kind of design. > > Clearing the STS_IAA flag won't hurt, no matter who manufactured the > controller. So your patch should look more like this: > > + /* Some Synopsys controllers mistakenly leave IAA turned on */ > + ehci_writel(ehci, STS_IAA, &ehci->regs->status); > > And these lines should come before the "Any IAA cycle..." comment line. > Does that fix the problem? I will conduct a round of testing based on this modification and provide the test results. > > Alan Stern > . > Thanks. Longfang Liu