Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-5.6 required=3.0 tests=DKIMWL_WL_HIGH,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI, SIGNED_OFF_BY,SPF_PASS,URIBL_BLOCKED,USER_AGENT_MUTT autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 2CE5EC4360F for ; Tue, 2 Apr 2019 18:05:05 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id E1E862070B for ; Tue, 2 Apr 2019 18:05:04 +0000 (UTC) Authentication-Results: mail.kernel.org; dkim=pass (1024-bit key) header.d=chromium.org header.i=@chromium.org header.b="n43EItqK" Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729237AbfDBSFE (ORCPT ); Tue, 2 Apr 2019 14:05:04 -0400 Received: from mail-pl1-f195.google.com ([209.85.214.195]:35176 "EHLO mail-pl1-f195.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728678AbfDBSFE (ORCPT ); Tue, 2 Apr 2019 14:05:04 -0400 Received: by mail-pl1-f195.google.com with SMTP id w24so260951plp.2 for ; Tue, 02 Apr 2019 11:05:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=chromium.org; s=google; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=UZmm2yKfctlxCZPnZgX3MXlTS77WwX1GVQIRkmMVQGc=; b=n43EItqKZAToEG6snnFferYbR3Cv8rjRMhFR1LtqSGbDccKTn52/M3cECgnNGOyicm himUNmCe5pkPQCXc8QUCmWBW3TVK5Wcd7qQkR+RI0vvO/LdQLrMw95gjQvks7GVSSuAr NOAQN+2xu2ez1nWhim+5W9YBE+D9cU6PVMvrY= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to:user-agent; bh=UZmm2yKfctlxCZPnZgX3MXlTS77WwX1GVQIRkmMVQGc=; b=gPIme5cQU4cxXmLSZZ2L75Qts2nFnAqdRtZaCWjAPbVA9fuSF94hY9kYDf42E88f3h 9VgJS4gX4yDvk9vDZZnBkjbMyT5wIlLirjAoIDKJIT4TgT/okMIxtcQ9UlFtuXQThHyP UQ33/+emlUuKtMUHWIP4IrkZZvI9xPn3QroyYp4R24IurW/1OiVjFyik+YxsYexr749G PjKG8N7pKOnR+MWQ6125ci1rllDl/OniO5JNx/zhR+8vwpiUKGNrRvSy1XRz1L9nQCus GK76OvYLlOzOjcovK+Aa1WNvlissDbWWG2kwLtFADW9uWK7TO8oNjXSGmdjNOoLPZ5Sy Gr6w== X-Gm-Message-State: APjAAAU0+q3sKTJjsCqZFpmSeneQ6XejUQSU0Np4k9ZgF05zOQursAac c8+Cpw1js0L+/GDgBs9lHR6sLw== X-Google-Smtp-Source: APXvYqyOQkMGDDQ4+fsU3xhhhw7bymbRlWXK82D+i8mKOtSGjgjkMqQqvdEuJ6gu3XhPPUmiPKxGew== X-Received: by 2002:a17:902:2e83:: with SMTP id r3mr54239766plb.153.1554228303233; Tue, 02 Apr 2019 11:05:03 -0700 (PDT) Received: from localhost ([2620:15c:202:1:75a:3f6e:21d:9374]) by smtp.gmail.com with ESMTPSA id k75sm34776680pfj.46.2019.04.02.11.05.02 (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Tue, 02 Apr 2019 11:05:02 -0700 (PDT) Date: Tue, 2 Apr 2019 11:05:01 -0700 From: Matthias Kaehlcke To: Balakrishna Godavarthi Cc: Marcel Holtmann , Johan Hedberg , linux-bluetooth@vger.kernel.org, linux-kernel@vger.kernel.org, Hemantg Subject: Re: [PATCH 2/2] Bluetooth: hci_qca: wcn3990: Drop baudrate change vendor event Message-ID: <20190402180501.GK112750@google.com> References: <20190307004041.38059-1-mka@chromium.org> <20190307004041.38059-3-mka@chromium.org> <20190307182009.GB138592@google.com> <20190307233039.GA69116@google.com> <20190401171202.GH112750@google.com> <1bebaf0766988a9f27cbbcb9bb35e26f@codeaurora.org> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <1bebaf0766988a9f27cbbcb9bb35e26f@codeaurora.org> User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-bluetooth-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-bluetooth@vger.kernel.org On Tue, Apr 02, 2019 at 05:32:54PM +0530, Balakrishna Godavarthi wrote: > Hi Matthias, > > On 2019-04-01 22:42, Matthias Kaehlcke wrote: > > On Mon, Apr 01, 2019 at 01:48:23PM +0530, Balakrishna Godavarthi wrote: > > > Hi Matthias, > > > > > > On 2019-04-01 13:29, Balakrishna Godavarthi wrote: > > > > Hi Matthias, > > > > > > > > Sorry for the late reply i was on vacation. > > > > > > > > On 2019-03-08 05:00, Matthias Kaehlcke wrote: > > > > > On Thu, Mar 07, 2019 at 10:20:09AM -0800, Matthias Kaehlcke wrote: > > > > > > Hi Balakrishna, > > > > > > > > > > > > On Thu, Mar 07, 2019 at 10:35:08AM +0530, Balakrishna Godavarthi > > > > > > wrote: > > > > > > > hi Matthias, > > > > > > > > > > > > > > On 2019-03-07 06:10, Matthias Kaehlcke wrote: > > > > > > > > Firmware download to the WCN3990 often fails with a 'TLV response size > > > > > > > > mismatch' error: > > > > > > > > > > > > > > > > [ 133.064659] Bluetooth: hci0: setting up wcn3990 > > > > > > > > [ 133.489150] Bluetooth: hci0: QCA controller version 0x02140201 > > > > > > > > [ 133.495245] Bluetooth: hci0: QCA Downloading qca/crbtfw21.tlv > > > > > > > > [ 133.507214] Bluetooth: hci0: QCA TLV response size mismatch > > > > > > > > [ 133.513265] Bluetooth: hci0: QCA Failed to download patch (-84) > > > > > > > > > > > > > > > > This is caused by a vendor event that corresponds to an earlier command > > > > > > > > to change the baudrate. The event is not processed in the context of the > > > > > > > > baudrate change and later interpreted as response to the firmware > > > > > > > > download command (which is also a vendor command), but the driver > > > > > > > > detects > > > > > > > > that the event doesn't have the expected amount of associated data. > > > > > > > > > > > > > > > > More details: > > > > > > > > > > > > > > > > For the WCN3990 the vendor command for a baudrate change isn't sent as > > > > > > > > synchronous HCI command, because the controller sends the corresponding > > > > > > > > vendor event with the new baudrate. The event is received and decoded > > > > > > > > after the baudrate change of the host port. > > > > > > > > > > > > > > > > Identify the 'unused' event when it is received and don't add it to > > > > > > > > the queue of RX frames. > > > > > > > > > > > > > > > > Signed-off-by: Matthias Kaehlcke > > > > > > > > --- > > > > > > > > > > > > > > ... > > > > > > > > > > > > > > Can you test by reverting this change "94d6671473924". > > > > > > > > > > > > The issue is still reproducible. > > > > > > > > > > > > > We need at least 15ms minimum delay for the soc to change its baud rate and > > > > > > > respond to the with command complete event. > > > > > > > > > > > > The baudrate change has clearly been successful when the problem is > > > > > > observed, since the host receives the vendor event with the new > > > > > > baudrate. > > > > > > > > > > I forgot to mention this earlier: the controller doesn't send a > > > > > command complete event for the command, or at least not a correct > > > > > one. > > > > > > > > > > That's the data that is received: > > > > > > > > > > 04 0e 04 01 00 00 00 > > > > > ~~ ~~ > > > > > > > > > [Bala]: can you share me the command sent and event recevied. > > > > I see that we receive a command complete event for the baud rate > > > > change command. > > > > > > > > command sent: 01 48 fc 01 11 > > > > vendor specific event: 04 ff 02 92 01 > > > > command complete event: 04 0e 04 01 00 00 00. > > > > > > > > > > > > > > > > > This is *a* command complete event, but the opcode is 0x0000 instead > > > > > of the earlier command. The same happens for the firmware > > > > > download/read version command, which is the reason why the command > > > > > complete injection mess > > > > > (https://lore.kernel.org/patchwork/patch/1027955/) is needed in one > > > > > way or another. > > > > > > > > > [Bala]: fw download approach is different where we use > > > > __hci_cmd_sync() where as here we use hci_uart_tx_wakeup() > > > > which directly calls the hci_uart_write_work(). so even we > > > > send an valid opcode or not for baudrate change will bot matter. > > > > > > > [Bala]: i miss understood the comment. Yes your true. in the all > > > vendor > > > commands SoC responds with an 0x0000 opcode. > > > > And IIUC this is not compliant with the spec, or at least the BT core > > expects the actual opcode to consider the command to be completed. > > [Bala]: Did you try increasing the the baud rate change timeout to 50ms > instead of 10ms. It is even reproducible with the ROME timeout of 300ms. The timeout doesn't help here. With 78e8fa2972e5 ("Bluetooth: hci_qca: Deassert RTS while baudrate change command") RTS is deasserted during the baudrate change, hence the controller only sends the response when RTS is asserted again. Before the event resulted in a frame reassembly error and the data was discarded. > i suspect it is an timing issue. Timing is certainly also a factor here (the problem isn't seen always), but I don't think 'timing issue' is a proper description of this issue. The problem is an event that the Bluetooth core doesn't expect due to the hack of sending a raw command behind the core's back to work around the firmware 'feature' of sending the command response with the new baudrate. Maybe a delay after re-asserting RTS can address is reliably, I don't remember if I already experimented with that in the past. If a delay can 'fix' the issue reliably I'm open to consider it for the sake of simplicity, but only with a detailed comment that describes the problem.