Received: by 2002:a05:6a10:22f:0:0:0:0 with SMTP id 15csp1406668pxk; Mon, 31 Aug 2020 19:33:18 -0700 (PDT) X-Google-Smtp-Source: ABdhPJw7ND7rvIrH2/erC1RRP+2NsxW9l+U+JNvn1yD5WiORFPHLDZZXX0KFip3A95mhqL3fLh7U X-Received: by 2002:aa7:d3d6:: with SMTP id o22mr2589200edr.293.1598927598216; Mon, 31 Aug 2020 19:33:18 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1598927598; cv=none; d=google.com; s=arc-20160816; b=U7nPpglpLrGrY0MToQ3bVyw3bM4kS1vh6A24jAW3hXBZf+b2DFyVWUIvBiSumNj1RJ k1WKq5cYVU0nzTDa4eb62DViHC2vN37KJw2eeoBu2M70uI10mcxwtsk/f4fyjy8IJsuJ nr305cA35sYiR0NEC4e5323xLrN/0cbUw2EMXIGbLdbrEkhJRMc8OMb53EH7a/huymLt jndhsApIyjOr9ML9Xjc8urIkKwl3jaUMIRO9U5YeE7n8eL7YDnoGR41tO5gd11/9vb48 my0c2WydJOjGLZKVK5AIZMwYQOLGR2sTWIMevxIyoY2aeTnNDv5jbtcPqAVonXECQMxO Za1w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=9Uw3G+l62hOTSBaFUKJQgR2X5SbhN6FxMGRfUCc8rS8=; b=N8QTGLx7ZV5HHhBK75fKnkReT/pD2u9LKm1r6zB75Zv3Sm/6VQmDSiNbFUEl+4u/kt VhODkigMnP1P6A2JvoI/CUpH0Smr893afHGJiHK9N2/hZql1zw6hi2v8dzVckoRy54LP lq3v8stoxO6CRpHiv4/u5rU+gR9bW63mblfm8P/oNPTk0YakSRFikv7Xskf5mojzOAnG vrLXDo/yreNYpJ/5411llIIWzJQxR9OWjW+9qz2pzRpnuShlgnRmDqWPLld+hNu9WhBb k4YHTmfmoQAH5fzaaoex1kUr9Wz1yibsPkH3kBtMakvw0U3qf+m88cmNgQRsHYaNBqtX tZ2w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id b7si1135580edr.361.2020.08.31.19.32.55; Mon, 31 Aug 2020 19:33:18 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726512AbgIACbS (ORCPT + 99 others); Mon, 31 Aug 2020 22:31:18 -0400 Received: from netrider.rowland.org ([192.131.102.5]:54251 "HELO netrider.rowland.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with SMTP id S1725993AbgIACbS (ORCPT ); Mon, 31 Aug 2020 22:31:18 -0400 Received: (qmail 572242 invoked by uid 1000); 31 Aug 2020 22:31:17 -0400 Date: Mon, 31 Aug 2020 22:31:17 -0400 From: Alan Stern To: Khalid Aziz Cc: gregkh@linuxfoundation.org, erkka.talvitie@vincit.fi, linux-usb@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: [RFC RESEND PATCH 0/1] USB EHCI: repeated resets on full and low speed devices Message-ID: <20200901023117.GD571008@rowland.harvard.edu> References: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Aug 31, 2020 at 10:23:30AM -0600, Khalid Aziz wrote: > [Resending since I screwed up linux-usb mailing list address in > cut-n-paste in original email] > > > I recently replaced the motherboard on my desktop with an MSI B450-A > Pro Max motherboard. Since then my keybaords, mouse and tablet have > become very unreliable. I see messages like this over and over in > dmesg: > > ug 23 00:01:49 rhapsody kernel: [198769.314732] usb 1-2.4: reset full-speed USB > device number 27 using ehci-pci > Aug 23 00:01:49 rhapsody kernel: [198769.562234] usb 1-2.1: reset full-speed USB > device number 28 using ehci-pci > Aug 23 00:01:52 rhapsody kernel: [198772.570704] usb 1-2.1: reset full-speed USB > device number 28 using ehci-pci > Aug 23 00:02:02 rhapsody kernel: [198782.526669] usb 1-2.4: reset full-speed USB > device number 27 using ehci-pci > Aug 23 00:02:03 rhapsody kernel: [198782.714660] usb 1-2.1: reset full-speed USB > device number 28 using ehci-pci > Aug 23 00:02:04 rhapsody kernel: [198784.210171] usb 1-2.3: reset low-speed USB device number 26 using ehci-pci > Aug 23 00:02:06 rhapsody kernel: [198786.110181] usb 1-2.4: reset full-speed USB device number 27 using ehci-pci > Aug 23 00:02:08 rhapsody kernel: [198787.726158] usb 1-2.4: reset full-speed USB device number 27 using ehci-pci > Aug 23 00:02:10 rhapsody kernel: [198790.126628] usb 1-2.1: reset full-speed USB device number 28 using ehci-pci > Aug 23 00:02:10 rhapsody kernel: [198790.314141] usb 1-2.4: reset full-speed USB device number 27 using ehci-pci > Aug 23 00:02:12 rhapsody kernel: [198792.518765] usb 1-2.4: reset full-speed USB device number 27 using ehci-pci > > The devices I am using are: > > - Logitech K360 wireless keyboard > - Wired Lenovo USB keyboard > - Wired Lenovo USB mouse > - Wired Wacom Intuos tablet > > After a reset, the wireless keyboard simply stops working. Rest of > the devices keep seeing intermittent failure. > > I tried various combinations of hubs and USB controllers to see what > works. MSI B450-A motherboard has USB 3.0 and USB 3.1 controllers. I > added a USB 2.0 PCI card as well for this test: > > 03:00.0 USB controller: Advanced Micro Devices, Inc. [AMD] 400 Series Chipset USB 3.1 XHCI Controller (rev 01) > 29:01.0 USB controller: NEC Corporation OHCI USB Controller (rev 43) > 29:01.1 USB controller: NEC Corporation OHCI USB Controller (rev 43) > 29:01.2 USB controller: NEC Corporation uPD72010x USB 2.0 Controller (rev 04) > 2c:00.3 USB controller: Advanced Micro Devices, Inc. [AMD] Matisse USB 3.0 Host Controller > > I have a bus powered USB 3.0 hub, a bus powered USB 2.0 hub and a > self powered USB 2.0 hub built into my monitor. > > I have connected my devices directly into the ports on motherboard > and PCI card as well as into external hub. Here are the results I > saw when devices wee plugged into various combination of ports: > > 1. USB 3.0/3.1 controller - does NOT work > 2. USB 2.0 controller - WORKS > 3. USB 3.0/3.1 controller -> Self powered USB 2.0 hub in monitor - does > NOT work > 4. USB 3.0/3.1 controller -> bus powered USB 3.0 hub - does NOT work > 5. USB 3.0/3.1 controller -> Bus powered USB 2.0 hub - WORKS > 7. USB 2.0 controller -> Bus powered USB 3.0 hub - does NOT work > 8. USB 2.0 controller -> Bus powered 2.0 hub - Does not work The error messages in your log extract all refer to ehci-pci, which is the driver for a USB-2 controller. They are completely unrelated to any problems you may be having with USB-3 controllers. > I narrowed the failure down to following lines (this code was added > in 5.5 with commit 64cc3f12d1c7 "USB: EHCI: Do not return -EPIPE > when hub is disconnected"): > > drivers/usb/host/ehci-q.c: > > 217 } else if ((token & QTD_STS_MMF) && > 218 (QTD_PID(token) == PID_CODE_IN)) { > 219 status = -EPROTO; > 220 /* CERR nonzero + halt --> stall */ > > At the time of failure, when we reach this conditional, token is > either 0x80408d46 or 0x408d46 which means following bits are set: > > QTD_STS_STS, QTD_STS_MMF, QTD_STS_HALT, QTD_IOC, QTD_TOGGLE > > and > > QTD_PID = 1 > QTD_CERR = 3 > QTD_LENGTH = 0x40 (64) > > This causes the branch "(token & QTD_STS_MMF) && (QTD_PID(token) == > PID_CODE_IN" to be taken and qtd_copy_status() returns EPROTO. This > return value in qh_completions() results in ehci_clear_tt_buffer() > being called: > > drivers/usb/host/ehci-q.c: > 472 /* As part of low/full-speed endpoint-halt processi ng > 473 * we must clear the TT buffer (11.17.5). > 474 */ > 475 if (unlikely(last_status != -EINPROGRESS && > 476 last_status != -EREMOTEIO)) { > 477 /* The TT's in some hubs malfunction when t hey > 478 * receive this request following a STALL ( they > 479 * stop sending isochronous packets). Sinc e a > 480 * STALL can't leave the TT buffer in a bus y > 481 * state (if you believe Figures 11-48 - 11 -51 > 482 * in the USB 2.0 spec), we won't clear the TT > 483 * buffer in this case. Strictly speaking this > 484 * is a violation of the spec. > 485 */ > 486 if (last_status != -EPIPE) > 487 ehci_clear_tt_buffer(ehci, qh, urb, > 488 token); > 489 } > > It seems like clearing TT buffers in this case is resulting in hub > hanging. A USB reset gets it going again until we repeat the cycle > over again. The comment in this code says "The TT's in some hubs > malfunction when they receive this request following a STALL (they > stop sending isochronous packets)". That may be what is happening. What makes you think that? Do you have any evidence that the hub is receiving a STALL? Indeed, the commit you referenced above specifically mentions that when MMF is set and the PID code is IN then it is not a STALL. > Removing the code that returns EPROTO for such case solves the > problem on my machine (as in the RFC patch) It certainly can't solve the problem for any USB-3 connections, because the patch doesn't touch any of the USB-3 driver code. > but that probably is not > the right solution. I do not understand USB protocol well enough to > propose a better solution. Does anyone have a better idea? Can you collect a usbmon trace showing an example of this problem? One possibility is to introduce a special quirk for the NEC uPD72010x EHCI controller. But we should hold off on that until we know exactly what is happening. Alan Stern