Received: by 2002:a05:6a10:16a7:0:0:0:0 with SMTP id gp39csp2641310pxb; Sun, 15 Nov 2020 12:13:00 -0800 (PST) X-Google-Smtp-Source: ABdhPJxUd6Iso0JDWW3fCnqT0FQAK53z0XhUuCqCIeAjiEldWe2KLV5Hzvpm7rsBo4jDZT+dGJne X-Received: by 2002:a17:906:2581:: with SMTP id m1mr11456632ejb.28.1605471180201; Sun, 15 Nov 2020 12:13:00 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1605471180; cv=none; d=google.com; s=arc-20160816; b=0dE+6HFqmd7Yrnj0t+hWpvUEZhxq13f1rbSlTMdORRyOQ0QRrDDkMdIw29FoJtPy0V +87w+Ar+Y4JkZZCkYO2oILIOhCZQwRBFkK0MnHJIuVPDWf15z8ZyQeyXTFy6ZTNZrUvN o20iX1hd3wYL2S3CA1SPYy8EykMsvUpMa84cZa4fT3SFn/acoiNJDX+tHOHL7kM4GLKT hNZLJ8MeAtCUs92JAWYHif54JUpTwMmaMWmZla/LmHmOhZQqArwtLxfGywwYvdIvTSJw YcaRxJwwZkJV8sqbqeq+KtEtfSbvm7TSsZ9w9WCmxcaYFJOETaysnO997S1YA7pTxMZ/ j8tA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=i5nM1Cp39iy5bVLNTzfiQjcRIsGdFyVgBh8LybDWFVc=; b=EYJntLGXA4qQJF1uFySoPETcWhXRTuH40/HN3FklImB3exPXRa6K5Ou1yROKlnZ8nM WeIVNuG0YE/FdrdCO7ZBojloqEW/9g6TMMSZUtRiUQghIZ+VSdUucUs3ZnCkm/ZJui+o C43HTe4QTB88dQ4z2FlaS5xrt3oM6+yglsKFCgaRWocc+fSsi8fpJVTuKM7Dacu9Y5tv r945IWkRwDhDuWrGXy7zHfsYbczHN2DYFV8kWpSUN/rvN/nyhuaiqR1hopbaT0RPv6X7 SlN5w4WWAzYlKyZNDgvhwPZU0bpYuLzsvvV9kssA85HbTricHV+FkkD1Ol98rOOH9r5b 8LBA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@technolu-st.20150623.gappssmtp.com header.s=20150623 header.b=un34enj3; spf=pass (google.com: domain of linux-wireless-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-wireless-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id c26si10257785eja.409.2020.11.15.12.12.18; Sun, 15 Nov 2020 12:13:00 -0800 (PST) Received-SPF: pass (google.com: domain of linux-wireless-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@technolu-st.20150623.gappssmtp.com header.s=20150623 header.b=un34enj3; spf=pass (google.com: domain of linux-wireless-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-wireless-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727476AbgKOTzd (ORCPT + 99 others); Sun, 15 Nov 2020 14:55:33 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:43094 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727230AbgKOTzd (ORCPT ); Sun, 15 Nov 2020 14:55:33 -0500 Received: from mail-ej1-x635.google.com (mail-ej1-x635.google.com [IPv6:2a00:1450:4864:20::635]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 7EF20C0613D1 for ; Sun, 15 Nov 2020 11:55:32 -0800 (PST) Received: by mail-ej1-x635.google.com with SMTP id s25so21450150ejy.6 for ; Sun, 15 Nov 2020 11:55:32 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=technolu-st.20150623.gappssmtp.com; s=20150623; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=i5nM1Cp39iy5bVLNTzfiQjcRIsGdFyVgBh8LybDWFVc=; b=un34enj3U770N2DIna9ITRJ6e0he+8AXTan21KvotS8gHhOu7ruN2cj1klpvykFYM/ kZmohXD2uX17vctguE6CBv1huia2vxL4qIVEijO/C308FILJ7cTjYa6DllL1Xu4YPf0Q SG0C7RcVrWQYiFe4pxOJenhXz8L+YY7f5vqGmYQgEkzLcXbUbbwI2etumm1nOka7cXNN u9JA+wGQWbGyEtKhPjhj1T0o9nHzWGh8U4lcrwyyBS6WHtYJeAYLs6V3sAb9ke5W3Cnr 5YnKYN32XkzlD7eq9B+X9EBU8nHrs4MmsvO9VKwZqV3q38vxQxDQZulWMIxCnAiwWv3Q gIzA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=i5nM1Cp39iy5bVLNTzfiQjcRIsGdFyVgBh8LybDWFVc=; b=aiuCyNzCLeMrYilRwbnbdHWsa42xvwRgQ0jQmYFToU0rhD8+mVL0SqBK/9ypoG3TBy cRxAf442dpTaN5IOwZNZWdtsA1OTxlvkk2q7dnDprJPNsyxjrLJ5AjuDosTEBNc5gL8T 2pfz5SLcXtNmVv5Nm17ASwO36UTLS2dw1DPelVqM8VUi43YPyXfdyZRspx9VgLFe1GOS 8lCmjehBa2gdcCD+zTDnpb5LaNpu92rcmh6/TjZ3khVEVkxjs2yKAGDC3jEbtXyJzu+m Nt6sBQKBDe57TwiPW1bhr399KU7NDED28OmGHXtcVhj4okp9uoanO8lRAKTOMAHUxwU8 roXg== X-Gm-Message-State: AOAM532r4ShkcLFDXxiguHJ980T+cpucFNGr5LzZ5YyOMdalGLcCX2fA JXaiatnpIrsgVXaD5p44OVf8SBX5veaQmcFK9/vYEA== X-Received: by 2002:a17:906:c41:: with SMTP id t1mr11622210ejf.19.1605470131112; Sun, 15 Nov 2020 11:55:31 -0800 (PST) MIME-Version: 1.0 References: <20201103160838.GA246433@bjorn-Precision-5520> <874km61732.fsf@nanos.tec.linutronix.de> <87mtzxkus5.fsf@nanos.tec.linutronix.de> <87wnz0hr9k.fsf@codeaurora.org> <87ft5hehlb.fsf@codeaurora.org> <6b60c8f1-ec37-d601-92c2-97a485b73431@posteo.de> <87v9ec9rk3.fsf@codeaurora.org> <87imab4slq.fsf@codeaurora.org> <0b58872b4f27dbf5aad2a39f5ec4a066e080d806.camel@seibold.net> <875z6b3v22.fsf@codeaurora.org> <87pn4j2bna.fsf@codeaurora.org> In-Reply-To: From: wi nk Date: Sun, 15 Nov 2020 20:55:20 +0100 Message-ID: Subject: Re: pci_alloc_irq_vectors fails ENOSPC for XPS 13 9310 To: Thomas Krause Cc: Kalle Valo , Govind Singh , linux-pci@vger.kernel.org, Stefani Seibold , linux-wireless@vger.kernel.org, Devin Bayer , Christoph Hellwig , Bjorn Helgaas , Thomas Gleixner , ath11k@lists.infradead.org, David Woodhouse Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-wireless@vger.kernel.org On Sun, Nov 15, 2020 at 2:30 PM Thomas Krause wrote: > > > Am 12.11.20 um 16:44 schrieb wi nk: > > On Thu, Nov 12, 2020 at 10:00 AM Kalle Valo wrote: > >> wi nk writes: > >> > >>> On Thu, Nov 12, 2020 at 8:15 AM Kalle Valo wrote: > >>>> Stefani Seibold writes: > >>>> > >>>>> Am Donnerstag, den 12.11.2020, 02:10 +0100 schrieb wi nk: > >>>>>> I've yet to see any instability after 45 minutes of exercising it, I > >>>>>> do see a couple of messages that came out of the driver: > >>>>>> > >>>>>> [ 8.963389] ath11k_pci 0000:55:00.0: Unknown eventid: 0x16005 > >>>>>> [ 11.342317] ath11k_pci 0000:55:00.0: Unknown eventid: 0x1d00a > >>>>>> > >>>>>> then when it associates: > >>>>>> > >>>>>> [ 16.718895] wlp85s0: send auth to ec:08:6b:27:01:ea (try 1/3) > >>>>>> [ 16.722636] wlp85s0: authenticated > >>>>>> [ 16.724150] wlp85s0: associate with ec:08:6b:27:01:ea (try 1/3) > >>>>>> [ 16.726486] wlp85s0: RX AssocResp from ec:08:6b:27:01:ea > >>>>>> (capab=0x411 status=0 aid=8) > >>>>>> [ 16.738443] wlp85s0: associated > >>>>>> [ 16.764966] IPv6: ADDRCONF(NETDEV_CHANGE): wlp85s0: link becomes > >>>>>> ready > >>>>>> > >>>>>> The adapter is achieving around 500 mbps on my gigabit connection, my > >>>>>> 2018 mbp sees around 650, so it's doing pretty well so far. > >>>>>> > >>>>>> Stefani - when you applied the patch that Kalle shared, which branch > >>>>>> did you apply it to? I applied it to ath11k-qca6390-bringup and when > >>>>>> I revert 7fef431be9c9 there is a small merge conflict I needed to > >>>>>> resolve. I wonder if either the starting branch, or your chosen > >>>>>> resolution are related to the instability you see (or I'm just lucky > >>>>>> so far! :)). > >>>>>> > >>>>> I used the vanilla kernel tree > >>>>> https://git.kernel.org/torvalds/t/linux-5.10-rc2.tar.gz. On top of this > >>>>> i applied the > >>>>> > >>>>> RFT-ath11k-pci-support-platforms-with-one-MSI-vector.patch > >>>>> > >>>>> and reverted the patch 7fef431be9c9 > >>>> I did also my testing on v5.10-rc2 and I recommend to use that as the > >>>> baseline when debuggin these ath11k problems. It helps to compare the > >>>> results if everyone have the same baseline. > >>>> > >>>> -- > >>>> https://patchwork.kernel.org/project/linux-wireless/list/ > >>>> > >>>> https://wireless.wiki.kernel.org/en/developers/documentation/submittingpatches > >>> Absolutely, I'll rebuild to 5.10 later today and apply the same series > >>> of patches and report back. > >> Great, thanks. > >> > >>> I'll also test out the patch on both versions from Carl to fix > >>> resuming. It stands to reason that we may be seeing another regression > >>> between Stefani (5.10) and myself (5.9 bringup branch) as I don't see > >>> any disconnections or instability once the interface is online. > >> Yeah, there is something strange happening between v5.9 and v5.10 we > >> have not yet figured out. Most likely it has something to do with memory > >> allocations and DMA transfers failing, but no clear understanding yet. > >> > >> But to keep things simple let's only discuss the MSI problem on this > >> thread, and discuss the timeouts in the another thread: > >> > >> http://lists.infradead.org/pipermail/ath11k/2020-November/000641.html > >> > >> I'll include you and other reporters to that thread. > >> > >> -- > >> https://patchwork.kernel.org/project/linux-wireless/list/ > >> > >> https://wireless.wiki.kernel.org/en/developers/documentation/submittingpatches > > Ok, I've tried a clean checkout of 5.10-rc2 with the one MSI patch > > applied and 7fef431be9c9 reverted. I can't get my machine to boot > > into anything usable with that configuration. I'm running ubuntu so > > its starting right into X and sometime between showing the available > > users and me clicking the icon to login the machine freezes. I can > > see in the system tray that the wifi adapter is being activated and > > appears to have associated with an AP, I just can't do much beyond > > that as the keyboard backlight wakes up, but the caps lock key doesn't > > work. I see similar behavior with the 5.9 configuration, but after a > > reboot or two I win whatever race is occuring. With 5.10, I tried > > maybe 10-15 times with 0 success. > > I can confirm this behavior on my configuration. I managed to login once > and select the Wifi and connect to it. It seemed curiously enough be > stable long enough to enter the Wifi passphrase. After the connection > was established, the system hang and on each attempt to reboot into the > graphical system it would freeze at some point (sometimes even before > showing the login screen). > > Kernel was both based on 5.10-rc2 and 5.10-rc3 (I did see the same > behavior) with the patch applied, 7fef431be9c9 reverted and firmware > downloaded and copied to /lib/firmware/ath11k/QCA6390/hw2.0/. > > I did a bit more digging to see if I could find any new information, I'm not sure I did but here's what I did / found. I spent the time to get a kdump kernel running and enabled, I was able to SysRq-C (both via keyboard and echo c > /proc/sysrq-trigger) and generate a crash dump. Actually viewing them at the moment will require reverting a couple of patches to printk to fix the file for the crash utility (https://github.com/crash-utility/crash/issues/67), but right now that's not super important since the mechanism isn't being triggered. As reported here and by Mitchell, the adapter will work occasionally, but more often it will hang the machine (I too tried 5.10-rc3 with no noticable differences). Whatever is causing the system to hang isn't triggering the kdump kernel to take over and dump the vmcore. I've set watchdog=1 , nmi_watchdog=1, hung_task_panic=1, softlockup_panic=1 trying to convince the kernel to dump it's state during this. I've not been able to make it write a crash, it just sits 'hung'. One interesting observation that may be related, is that if the lockup occurs during my login, I can actually see the system grind to a halt over the course of a number of frames (the rendering of the login animations starts to stutter/get really slow, then after a few frames everything is frozen). If something were spin locking/ed, I'd expect the soft lockup panic to find it, but I don't know these mechanisms well. The only consistent behavior that I managed to create is that if the wifi adapter / machine are in a 'working' state (ie: I can browse the internet, etc) and I issue sysrq-c to crash the kernel and then let the crash dump write and reboot the machine, once booted the adapter is no longer seen by the kernel, and there are zero messages in dmesg that match "ath11k". The driver shows up in lsmod , but it reports zero messages and it's like the adapter is completely invisible. A power off and back on of the machine will re-enter it into the freezing/wifi working lottery.