Received: by 2002:a05:6a10:1a4d:0:0:0:0 with SMTP id nk13csp3469065pxb; Thu, 10 Feb 2022 23:49:35 -0800 (PST) X-Google-Smtp-Source: ABdhPJxkcIgiSuNzUdkvSD5SQG91WgxtzUkoS7mt+tVbkLQ6yFQOJ9BFzw1CzduM5Hdr9jjruRFo X-Received: by 2002:a17:907:3ea9:: with SMTP id hs41mr318009ejc.727.1644565775675; Thu, 10 Feb 2022 23:49:35 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1644565775; cv=none; d=google.com; s=arc-20160816; b=HiK/VwHY8+2CTdrWySwtIL45avsLWdy8M7eG/HvAT6Us9heaGJqIb6vzXS3eFah9iQ anxuv2GlaRzOeekAXESlQNde3HlVUfmpiu79iQjI2cpcIXzMFSdp2Z+M6Zn8ePWbt6Zl IlSLEO9qVq4WwLv4SVp2EvZSfSuVRMb7IyJM9tQbln5UEvzIKfEneklgvPlmBl40gt3K 0V9saZX5jg5LwrXbm5ofU73R7TDdqLnXFxT5evlqRYXaWPC43jRy5NA/GJ/trS4kWXMr cPY4vwwtQOkuGMvpQLk7+HKp4iuwVdKxVyVR7SDLCujmgFe3b/F45OKtx04mBv7ig8zN nkyQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :dkim-signature; bh=DSnJteDARUlrsjbSpn5agFSV825dv8HSaJ68vz1hIpc=; b=h8Pjcj2h5S2PSWe6Xmht8xAWh0YTx4G3t+3adxe+JnhTi4Gz6JwCmAMGS9xeOETZiM d3ka2+6dnf/iZBuKS9hJ0FlnKfONrcX4PEP5/JftU0Y7SKTWrooFJw/6mQYlDXypnHHW glDKpbr6MYu8KbUeDC7tejAmjB72Qx/K+uiyiAvLsgQnCdqi9cbnsQqPbhln4gyV2FiR 2jmhJS2WG/xYtBtFPpQuwcoelpv4mv/t8rZawQ/8mrO9P7yMyLwUCnlJ5WsmF0QXSFP6 57pLX/h5Mj6qBT6x363qwvgN07MfSm22r1tFqv7AuyxfI2HmKxJsevQpUsQnQzbVRhvX nRQg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@quicinc.com header.s=qcdkim header.b="H/kHE4NQ"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=quicinc.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id e5si4756955ejs.479.2022.02.10.23.49.10; Thu, 10 Feb 2022 23:49:35 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@quicinc.com header.s=qcdkim header.b="H/kHE4NQ"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=quicinc.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1347784AbiBKHnl (ORCPT + 99 others); Fri, 11 Feb 2022 02:43:41 -0500 Received: from mxb-00190b01.gslb.pphosted.com ([23.128.96.19]:51484 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S243198AbiBKHnk (ORCPT ); Fri, 11 Feb 2022 02:43:40 -0500 Received: from alexa-out.qualcomm.com (alexa-out.qualcomm.com [129.46.98.28]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id E76AF250; Thu, 10 Feb 2022 23:43:39 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=quicinc.com; i=@quicinc.com; q=dns/txt; s=qcdkim; t=1644565420; x=1676101420; h=date:from:to:cc:subject:message-id:references: mime-version:in-reply-to; bh=DSnJteDARUlrsjbSpn5agFSV825dv8HSaJ68vz1hIpc=; b=H/kHE4NQ5ikn8hoUayiwE1jok+YDuiRe2eheyDvZcR4yN+/zmVjV/C1W aabWB1wGOnY7aSlbcpt7431TJ5ewYUMYO1BCayTi5N+qdzc2igKF2OV5z FRQMdN+30VsGAAwz+TF/I+JqaD7pbuoyhcxGcYE1BeSMxsrfl19w/Z1OB U=; Received: from ironmsg09-lv.qualcomm.com ([10.47.202.153]) by alexa-out.qualcomm.com with ESMTP; 10 Feb 2022 23:43:39 -0800 X-QCInternal: smtphost Received: from nalasex01a.na.qualcomm.com ([10.47.209.196]) by ironmsg09-lv.qualcomm.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 10 Feb 2022 23:43:39 -0800 Received: from hu-pkondeti-hyd.qualcomm.com (10.80.80.8) by nalasex01a.na.qualcomm.com (10.47.209.196) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.922.19; Thu, 10 Feb 2022 23:43:35 -0800 Date: Fri, 11 Feb 2022 13:13:31 +0530 From: Pavan Kondeti To: Pavan Kondeti CC: Mathias Nyman , Jung Daehwan , Greg Kroah-Hartman , , , Subject: Re: usb: host: Reduce xhci_handshake timeout in xhci_reset Message-ID: <20220211074331.GA12625@hu-pkondeti-hyd.qualcomm.com> References: <1624361096-41282-1-git-send-email-dh10.jung@samsung.com> <20210628022548.GA69289@ubuntu> <20210628065553.GA83203@ubuntu> <496c9d86-70d7-1050-5bbb-9f841e4b464a@intel.com> <20220211064630.GA20567@hu-pkondeti-hyd.qualcomm.com> MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Disposition: inline In-Reply-To: <20220211064630.GA20567@hu-pkondeti-hyd.qualcomm.com> User-Agent: Mutt/1.5.24 (2015-08-30) X-Originating-IP: [10.80.80.8] X-ClientProxiedBy: nasanex01a.na.qualcomm.com (10.52.223.231) To nalasex01a.na.qualcomm.com (10.47.209.196) X-Spam-Status: No, score=-4.4 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_MED,SPF_HELO_NONE, SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Sorry for the spam. I have added an incorrect email address in my previous email. On Fri, Feb 11, 2022 at 12:16:30PM +0530, Pavan Kondeti wrote: > On Mon, Jun 28, 2021 at 10:49:00AM +0300, Mathias Nyman wrote: > > On 28.6.2021 9.55, Jung Daehwan wrote: > > > On Mon, Jun 28, 2021 at 08:53:02AM +0200, Greg Kroah-Hartman wrote: > > >> On Mon, Jun 28, 2021 at 11:25:48AM +0900, Jung Daehwan wrote: > > >>> On Tue, Jun 22, 2021 at 09:56:20PM +0200, Greg Kroah-Hartman wrote: > > >>>> On Tue, Jun 22, 2021 at 08:24:56PM +0900, Daehwan Jung wrote: > > >>>>> It seems 10 secs timeout is too long in general case. A core would wait for > > >>>>> 10 secs without doing other task and it can be happended on every device. > > >>>> > > >>>> Only if the handshake does not come back sooner, right? > > >>> > > >>> Yes, right. > > >>> > > >>>> What is causing your device to timeout here? > > >>> > > >>> Host Controller doesn't respond handshake. I don't know why and I ask HW team > > >>> to debug it. > > >> > > >> Please work to fix your hardware, that feels like the root of the > > >> problem here. If you require the timeout for xhci_reset() to happen, > > >> then how do you know that the hardware really did reset properly in the > > >> reduced amount of time you just provided? > > >> > > > > > > I continue fixing this issue with hardware engineer, but currently just > > > host controller can crash whole system and that's why I want to fix it. > > > How about adding some error logs in this situation for recognizing this issue? > > > We can add error log in xhci_stop as xhci_reset can returns error like below. > > > > > > static void xhci_stop(struct usb_hcd *hcd) > > > { > > > u32 temp; > > > struct xhci_hcd *xhci = hcd_to_xhci(hcd); > > > + int ret; > > > > > > mutex_lock(&xhci->mutex); > > > > > > @@ -733,6 +734,9 @@ static void xhci_stop(struct usb_hcd *hcd) > > > xhci->cmd_ring_state = CMD_RING_STATE_STOPPED; > > > xhci_halt(xhci); > > > xhci_reset(xhci); > > > + if (ret) > > > + xhci_err(xhci, "%s: Error while reset xhci Host controller - ret = %d\n" > > > + , __func__, ret); > > > spin_unlock_irq(&xhci->lock); > > > > > > > We can check the xhci_reset() return value here and print a message, makes sense. > > > > The original reason for the 10 second timeout was because a host actually took 9 seconds: > > > > commit 22ceac191211cf6688b1bf6ecd93c8b6bf80ed9b > > > > xhci: Increase reset timeout for Renesas 720201 host. > > > > The NEC/Renesas 720201 xHCI host controller does not complete its reset > > within 250 milliseconds. In fact, it takes about 9 seconds to reset the > > host controller, and 1 second for the host to be ready for doorbell > > rings. Extend the reset and CNR polling timeout to 10 seconds each. > > > Agreed. > > We also run into the similar issue (very very rarely reproduced) on > our platforms like SM8450. The issue happens when host mode is de-activated > (type-c cable disconnected). Since xhci_reset() is called with interrupts > disabled, a timeout of 10 seconds is fatal to the system. > > Thanks, > Pavan