Received: by 2002:a05:6a10:9afc:0:0:0:0 with SMTP id t28csp112468pxm; Wed, 2 Mar 2022 11:26:25 -0800 (PST) X-Google-Smtp-Source: ABdhPJxyiJ45ch4me4H4CmFDWcmCrNHQSFSJD7mKAwDTEvR3b+JyZeSPdccVhiheWpbexLfyuNtT X-Received: by 2002:a17:907:a40a:b0:6d8:888d:fa35 with SMTP id sg10-20020a170907a40a00b006d8888dfa35mr4642016ejc.139.1646249185597; Wed, 02 Mar 2022 11:26:25 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1646249185; cv=none; d=google.com; s=arc-20160816; b=mty8zGqGLjo5TJo4lRwMf6hsnh7jMjdrypTbBvaS2WWHfH0NalrlW6qHmUfJV1Q8mQ 3hRUaWYBULLoVC/8FW5DHuBxCXwrgJYT2GYIMUfzMjEhdcxOOOuarUVOOhp8nFA9rarm L8hWDoJC+dIQOCThnoo+U7Ehb6dUhrg38SNIwtV09gjKHXyyN5kAwpxNcrdM22zWQTPK 3wUTcflGfJwd0ZfyIBUKvgf0Rrtoo3vTJIy2kEd7H5IfwYavsVuxFxRF0wx1ya9dxuQA S6Q9kk0GH4QbRjWiFx7QVNadvNmtQA3ompt9oq2l41j+8hMn6j3Z6ffd2My09iKwICf2 bKIg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=aminLQRovqrL9FW/UkLt0yWcCb5nxeMFo3C1GcqG7lo=; b=0AACw4jtS5YmlQFGqo32b6/Nn7M8W068p2uV5cc6/c9iTGg7jGEZfZVEvzKz/jleiU zXz1zXA6cTZW/diMPJ9olKeQKmR/GWQ8k4Jk84Q5j2AlBJLt8O4C1BndmFtMq6dEJRAN Bn20vvyWAOz46U4Z8d5eEl504j0npF2i6NIb5vjTi6YNnqiRULxU8coBbxOnd3U2Hkpx bAWjqByb3bceM0w5BvRcelUBmR7nMybosKBfftIzttr3vbSKhzaFJ/O47t4SLo8KG98p WL6r72X/cFO+tl+aUAUlpX+COrro+Nb3fz6JjLhGszOt6UgKVIQRC7b7MlbCLBKug4UF xHFQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=VxCsmmxo; spf=pass (google.com: domain of linux-bluetooth-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-bluetooth-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id 5-20020a508e45000000b00415a1793caasi3246964edx.39.2022.03.02.11.25.41; Wed, 02 Mar 2022 11:26:25 -0800 (PST) Received-SPF: pass (google.com: domain of linux-bluetooth-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=VxCsmmxo; spf=pass (google.com: domain of linux-bluetooth-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-bluetooth-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233241AbiCAWlU (ORCPT + 99 others); Tue, 1 Mar 2022 17:41:20 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:33950 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232201AbiCAWlT (ORCPT ); Tue, 1 Mar 2022 17:41:19 -0500 Received: from mail-yw1-x112c.google.com (mail-yw1-x112c.google.com [IPv6:2607:f8b0:4864:20::112c]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id E8F506C906 for ; Tue, 1 Mar 2022 14:40:36 -0800 (PST) Received: by mail-yw1-x112c.google.com with SMTP id 00721157ae682-2db2add4516so128646597b3.1 for ; Tue, 01 Mar 2022 14:40:36 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=aminLQRovqrL9FW/UkLt0yWcCb5nxeMFo3C1GcqG7lo=; b=VxCsmmxoFOjHh82QOfg57t8tHW0HnxjDDtPfHn+sogQtv07FqUvInSI+mffWvOwiD5 TLxl/EWYdNzV5u6px2kEN2aH6bUeTuQ4fCXonGVAhbw/6Bt0OjPVULB/gFv/X3axSA7C pmewHgaUVoKtI3CKmAOSNcivlhHqyf5+KknN/+c7aLtZkMDVEttf1OHnAkecN74WERvA fejZPHFXaPy5laKCho/nAuEI21zwDWAnu9LkNvuoXZ53vGh14eJexUXDkqkyqmk0CXVm K25CTrzHFAeubRsUJT407OIr+7wA3hD7ytiP35IfJpHA0brkogRrfjz0XwO8jnBtbNrB 8ulw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=aminLQRovqrL9FW/UkLt0yWcCb5nxeMFo3C1GcqG7lo=; b=3fkIS0HbFPAj/948BvoqCWccsA/41ANmZXorAv7W0GP/fprotZit/l4Zkkl1NxaZzS LN1+WFYaU5uLqjMw3USzTA8TdBMVIIKC9pkysp/mORFljcW3iTiCepiOds3g4USNwt3Y YL4GRruN/1PQoxGXILg6BK+JXHRpvVQg0qNi8y27vT9boKsrSFBh6ATiwVaC+rq01YIO js3G8mTIvABEReEkUzIYGOjwt9DJPNSD7VN60owTHp7p6FnetAbubeG/KSmynrSjiD9Z 3AhX2mafsh/h1+9TeyXKAm0d0dczOwnQFg/dqqDMP2f4l9NCoc7MDSfbCbacl7RdC2ht J8/A== X-Gm-Message-State: AOAM533XomG/vGP3PgiICw6ii87+z1MdTqcTd8bvuWrhKfyYgY5Mzatj 45TU03bC2X2KEIU1F6P7pz4CPt7FwUJ5sHw8e80= X-Received: by 2002:a81:638b:0:b0:2d6:ac65:962 with SMTP id x133-20020a81638b000000b002d6ac650962mr27391607ywb.37.1646174435976; Tue, 01 Mar 2022 14:40:35 -0800 (PST) MIME-Version: 1.0 References: <82216882-463a-8976-e6bc-4a8919107a31@googlemail.com> <2ce6175c-74ec-8469-80a5-374bd1429542@googlemail.com> <17f2bf7e-1d6b-e090-8926-21a408f2b496@googlemail.com> <16cac2c8-c1ca-c8d7-e3d9-5f00be511614@googlemail.com> <6b17bdb5-dddb-e7c7-86f6-2240c6523a81@googlemail.com> <032917a9-0c14-16d3-2751-945b9cd766a3@googlemail.com> In-Reply-To: <032917a9-0c14-16d3-2751-945b9cd766a3@googlemail.com> From: Luiz Augusto von Dentz Date: Tue, 1 Mar 2022 14:40:24 -0800 Message-ID: Subject: Re: bug kernel 5.17, qualcom and intel adapters, unable to reliably connect to bluetooth devices To: Chris Clayton Cc: Chris Murphy , Bluetooth , regressions@lists.linux.dev Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,FREEMAIL_FROM, RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-bluetooth@vger.kernel.org Hi Chris, On Tue, Mar 1, 2022 at 11:40 AM Chris Clayton wrote: > > Hi > > On 01/03/2022 18:57, Luiz Augusto von Dentz wrote: > > Hi Chris, > > > > On Tue, Mar 1, 2022 at 10:34 AM Luiz Augusto von Dentz > > wrote: > >> > >> Hi Chris, > >> > >> On Tue, Mar 1, 2022 at 1:26 AM Chris Clayton wrote: > >>> > >>> Hi Luiz, > >>> > >>> I guess you are hoping for PEBKAC :-) > >>> > >>> On 28/02/2022 21:20, Luiz Augusto von Dentz wrote: > >>>> Hi Chris, > >>>> > >>>> On Mon, Feb 28, 2022 at 1:02 PM Chris Clayton wrote: > >>>>> > >>>>> Hi Luiz, > >>>>> > >>>>> On 28/02/2022 19:34, Luiz Augusto von Dentz wrote: > >>>>>> Hi Chris, > >>>>>> > >>>>>> On Sat, Feb 26, 2022 at 12:04 AM Chris Clayton wrote: > >>>>>>> > >>>>>>> Hi, > >>>>>>> > >>>>>>> On 24/02/2022 15:16, Luiz Augusto von Dentz wrote: > >>>>>>>>> I'll try another bisection today, but limit its range to changes made in the net/bluetooth directory. > >>>>>>> > >>>>>>> That bisection has proved very difficult because the bluetooth "service" in kernels at some steps of the bisection were > >>>>>>> completely borked to the extent that blueman's device-manager application wouldn't start and emitted the messages: > >>>>>>> > >>>>>>> blueman-manager 12.00.37 ERROR Manager:137 on_dbus_name_appeared: Default adapter not found, trying first available. > >>>>>>> blueman-manager 12.00.37 ERROR Manager:141 on_dbus_name_appeared: No adapter(s) found, exiting > >>>>>>> > >>>>>>> Obviously, I don't know whether the problem I am trying to pinpoint is hiding behind this more fundamental problem with > >>>>>>> the bluetooth "service", so being unable to say whether that kernel was good or bad, I had to skip. There seems to be a > >>>>>>> batch of commits that mean that, whilst the kernel builds okay, hunting down a bluetooth-related problem is not > >>>>>>> possible. Eventually and I cursed and gave up. Whatever was causing this breakage has obviously been fixed. > >>>>>>> > >>>>>>>> Please record the HCI with btmon, it must be producing something since > >>>>>>>> it records even the mgmt commands. > >>>>>>>> > >>>>>>> > >>>>>>> Refreshed by a good night's sleep, I started another bisection (between 5.16 and 5.17-rc1) yesterday morning but this > >>>>>>> time did not limit it to net/bluetooth. That was going okay until I ran into what I assume is the same batch of borked > >>>>>>> kernels. I've been more persistent this time but have just had a run of 16 steps in which the bluetooth support in the > >>>>>>> kernel is broken so badly that testing bluetooth is not possible. I will push on today, but I've suspended that activity > >>>>>>> to get the hci trace that Luiz has asked for. > >>>>>>> > >>>>>>> Using information from the bisect, I built a kernel that had tested as bad (but not borked). The commit is > >>>>>>> f2b551fad8d8f2ac5e1f810ad595298381e0b0c5. As I've mentioned before, the problem with devices not connecting is > >>>>>>> intermittent - for a given kernel, sometimes a connection works and other times it doesn't. On the first boot of this > >>>>>>> kernel, my bluetooth devices could connect, Attached are 4 files related to this - the output from btmon, and the > >>>>>>> related portions of daemon.log, kern.log and sys.log from /var/log/. Each of the these files is suffixed with ".good". > >>>>>>> > >>>>>>> I then powered down the laptop and booted into the same kernel. This time the bluetooth devices could not connect. Four > >>>>>>> more files are attached for this boot and are suffixed with ".bad". I said in an earlier email that when connection > >>>>>>> fails, there is no output from btmon, so that log is empty. That's still the case, but I guess that fact itself is a > >>>>>>> clue to what the problem might be. What I can add, however, is that if, in that same bad kernel, I unload and then > >>>>>>> reload the btusb module, connections start to work. Maybe that too is a clue. The same unload/load process revives > >>>>>>> bluetooth on a kernel built after a pull of Linus' latest and greatest this morning. > >>>>>>> > >>>>>>> Since I now have a workround, I'm going stop the current bisection that I was doing. I've done another couple of steps > >>>>>>> this morning and both produced kernels on which I could not test bluetooth and had to tell git bisect to skip. If > >>>>>>> however, I can provide any other diagnostics, please let me know. > >>>>>>> > >>>>>>> Chris > >>>>>> > >>>>>> Can you try with the following patch: > >>>>>> > >>>>>> https://patchwork.kernel.org/project/bluetooth/patch/20220228173918.524733-1-brian.gix@intel.com/ > >>>>>> > >>>>>> > >>>>> Sorry, that patch has made no difference. After the first boot my headphones connected okay, but after a power-down and > >>>>> reboot they would not connect without an unload and reload of the btusb module. > >>>> > >>>> Can you tell us exactly what steps you are using? Are you applying on > >>>> top of what, rc6? > >>>> > >>> > >>> Until I got your patch yesterday, I was using a clone of > >>> git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git which I update frequently and have been doing so for as > >>> long as I can remember. Just in case there was a hidden flaw in that tree, I took a new clone yesterday, (so yes, the > >>> patch was tested on top of rc6) copied over the .config file and applied the patch. Then I built and installed the > >>> kernel, updated grub, powered off the laptop and booted into the new kernel. Once the laptop had booted and logged in to > >>> my LXQt desktop, I powered on my headphones and a connection was establisehed almost straight away. I powered the > >>> headphones off and the disconnection worked fine. > >>> > >>> Knowing that the problem crops up intermittently, I then rebooted the laptop. When the boot was complete, I then powered > >>> on my bluetooth headphones an waited for them to connect to the laptop. After about 20 seconds, a connection had not > >>> been established. I powered off the headphones, used modprobe to unload and then reload the btusb module. When I powered > >>> on the headphones, a connectiin was established within 2 or 3 seconds. > > > > Ive attempted 5 restart with 5.17.0-0.rc6.109.fc37.x86_64, my headset > > was able to reconnect every single time without any problem. The only > > For your five tests, did it connect on the first boot? As I've said, sometimes it fails to connect on the first boot, > but if it succeeds, it has always failed after a power-off and restart. Looking back at the notes I took during the > bisect, I've didn't have a single bisection step where I had to boot more more than twice to ascertain that it was a bad > kernel. As I said, I didn't mark a kernel as good until I'd had five successful boots. It did connect every single time. > > normally, once from gdm and then another time when gnome is loading, > > but I assume it is normal nowadays since it appears when switching > > session pipewire unregisters its audio endpoints. > > > > I don't use pipewire. Prior to 5.17, bluetooth has worked more or less trouble free for at least 4 years. I've read > about pipewire in Linux Magazine but don't see what it would bring to my party except complication. > > ends on how bluetoothd is being > > started or something. > > > > Did you see the warnings that read "Bluetooth: hci0: unexpected event 0xff length: 5 > 0"? That seems to indicate that > something is sending events that are unexpected. What effect will that have? As I said, according to lshw, my system's > bluetooth hardware is Intel AX201. Is that what you are testing on? I have an AX200 on my system, AX201 is very similar so Id be surprised if that is the problem, btw Ive also got some unexpected events but that didn't stop the headset to reconnect. > >>> I've booted this laptop countless times over the last few days. Doing the bisect, I didn't mark a commit as good until I > >>> had done five boots and been able to connect my headphones on each boot. What I can say from that work is that two > >>> consecutive boots into a working kernel are very rare. I can't remeber an occasion when it took more than two boots to > >>> establish that a kernel was bad. > >> > >> Do commands such as bluetoothctl power on or scan on works? Try > >> running bluetoothd -dn from a shell (disable bluetooth.service), also > >> are there any settings changed in main.conf? > >> > > Sorry, I forgot to answer this question earlier. I haven't changed main.conf. Besides, my bluetooth devices connect > successfully every time with 5.16.11 and 5.15.25 kernels. As I've said before, that strongly suggests that there is a > code regression in 5.17. Not saying there isn't something wrong, we have sent a couple of fixes that doesn't seem to be merged yet, and we are working on another one for fixing the scan: https://lore.kernel.org/linux-bluetooth/f648f2e11bb3c2974c32e605a85ac3a9fac944f1.camel@redhat.com/T/ > >> > >> -- > >> Luiz Augusto von Dentz > > > > > > -- Luiz Augusto von Dentz