Date: Tue, 30 Aug 2022 22:58:32 +0200
From: Pali Rohár
To: Ben Greear
Cc: Greg Kroah-Hartman, bjorn@helgaas.com, LKML, stable@vger.kernel.org,
    Stefan Roese, Bjorn Helgaas, "Rafael J. Wysocki", Bharat Kumar Gogada,
    Michal Simek, Yao Hongbo, Naveen Naidu, Sasha Levin
Subject: Re: [PATCH 5.4 182/389] PCI/portdrv: Dont disable AER reporting in get_port_device_capability()
Message-ID: <20220830205832.g3lyysmgkarijkvj@pali>
References: <20220823080115.331990024@linuxfoundation.org> <20220823080123.228828362@linuxfoundation.org> <47b775c5-57fa-5edf-b59e-8a9041ffbee7@candelatech.com>
In-Reply-To: <47b775c5-57fa-5edf-b59e-8a9041ffbee7@candelatech.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On Tuesday 30 August 2022 13:47:48 Ben Greear wrote:
> On 8/23/22 11:41 PM, Greg Kroah-Hartman wrote:
> > On Tue, Aug 23, 2022 at 07:20:14AM -0500, Bjorn Helgaas wrote:
> > > On Tue, Aug 23, 2022, 6:35 AM Greg Kroah-Hartman wrote:
> > >
> > > > From: Stefan Roese
> > > >
> > > > [ Upstream commit 8795e182b02dc87e343c79e73af6b8b7f9c5e635 ]
> > > >
> > >
> > > There's an open regression related to this commit:
> > >
> > > https://bugzilla.kernel.org/show_bug.cgi?id=216373
> >
> > This is already in the following released stable kernels:
> > 5.10.137 5.15.61 5.18.18 5.19.2
> >
> > I'll go drop it from the 4.19 and 5.4 queues, but when this gets
> > resolved in Linus's tree, make sure there's a cc: stable on the fix so
> > that we know to backport it to the above branches as well. Or at the
> > least, a "Fixes:" tag.
>
> This is still in 5.19.5. We saw some funny iwlwifi crashes in 5.19.3+
> that we did not see in 5.19.0+. I just bisected the scary looking AER errors to this
> patch, though I do not know for certain if it causes the iwlwifi related crashes yet.
>
> In general, from reading the commit msg, this patch doesn't seem to be a great candidate
> for stable in general. Does it fix some important problem?
>
> In case it helps, here is example of what I see in dmesg. The kernel crashes in iwlwifi
> had to do with rx messages from the firmware, and some warnings lead me to believe that
> pci messages were slow coming back and/or maybe duplicated. So maybe this AER patch changes
> timing or otherwise screws up the PCI adapter boards we use...

From that log I have the feeling that the issue is in the Intel wifi
card and that it was already there before that commit. The card is
crashing (or something else is happening on the PCIe bus), and because
the kernel had disabled error reporting for this card, nobody spotted
any issue. That commit just opened the kernel's eyes to those errors.

I think this issue should be reported to the Intel wifi card
developers; maybe they can comment on why the card is reporting errors.
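
To make "disabled error reporting" a bit more concrete: at register level it comes down to four enable bits in the PCIe Device Control register (the kernel names them PCI_EXP_DEVCTL_CERE, NFERE, FERE and URRE). Below is a minimal sketch that decodes such a register value. The function name and the sample values are made up for illustration; they were not read from the hardware in this thread.

```python
# Error reporting enable bits in the PCIe Device Control register.
# Bit values follow the kernel's PCI_EXP_DEVCTL_* definitions.
DEVCTL_REPORTING_BITS = {
    0x0001: "Correctable Error Reporting Enable",
    0x0002: "Non-Fatal Error Reporting Enable",
    0x0004: "Fatal Error Reporting Enable",
    0x0008: "Unsupported Request Reporting Enable",
}


def enabled_reporting(devctl: int) -> list[str]:
    """Return the names of the reporting enable bits set in devctl."""
    return [name for bit, name in DEVCTL_REPORTING_BITS.items() if devctl & bit]


# Hypothetical register values, for illustration only.
print(enabled_reporting(0x2810))  # all four reporting bits clear
print(enabled_reporting(0x281F))  # all four reporting bits set
```

With all four bits clear, a port or device does not send error messages upstream, which would be consistent with nobody seeing these errors before the commit. On a live system the same bits should be visible in the DevCtl line of `lspci -vv` output.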
>
> [ 50.905809] iwlwifi 0000:04:00.0: AER: can't recover (no error_detected callback)
> [ 50.905830] pcieport 0000:03:01.0: AER: device recovery failed
> [ 50.905831] pcieport 0000:00:1c.0: AER: Uncorrected (Non-Fatal) error received: 0000:03:01.0
> [ 50.905845] pcieport 0000:03:01.0: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, (Requester ID)
> [ 50.915679] pcieport 0000:03:01.0: device [10b5:8619] error status/mask=00100000/00000000
> [ 50.922735] pcieport 0000:03:01.0: [20] UnsupReq (First)
> [ 50.928230] pcieport 0000:03:01.0: AER: TLP Header: 34000000 04001f10 00000000 88c888c8
> [ 50.935126] iwlwifi 0000:04:00.0: AER: can't recover (no error_detected callback)
> [ 50.935133] pcieport 0000:03:01.0: AER: device recovery failed
> [ 50.935134] pcieport 0000:00:1c.0: AER: Multiple Uncorrected (Non-Fatal) error received: 0000:03:01.0
> [ 50.935222] pcieport 0000:03:01.0: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, (Requester ID)
> [ 50.945059] pcieport 0000:03:01.0: device [10b5:8619] error status/mask=00100000/00000000
> [ 50.952120] pcieport 0000:03:01.0: [20] UnsupReq (First)
> [ 50.957614] pcieport 0000:03:01.0: AER: TLP Header: 34000000 04001f10 00000000 88c888c8
> [ 50.964492] pcieport 0000:03:01.0: AER: Error of this Agent is reported first
> [ 50.970519] pcieport 0000:03:02.0: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, (Requester ID)
> [ 50.980344] pcieport 0000:03:02.0: device [10b5:8619] error status/mask=00100000/00000000
> [ 50.987399] pcieport 0000:03:02.0: [20] UnsupReq (First)
> [ 50.992891] pcieport 0000:03:02.0: AER: TLP Header: 34000000 05001f10 00000000 88c888c8
> [ 50.999785] pcieport 0000:03:03.0: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, (Requester ID)
> [ 51.009611] pcieport 0000:03:03.0: device [10b5:8619] error status/mask=00100000/00000000
> [ 51.016665] pcieport 0000:03:03.0: [20] UnsupReq (First)
> [ 51.022161] pcieport 0000:03:03.0: AER: TLP Header: 34000000 06001f10 00000000 88c888c8
> [ 51.029052] pcieport 0000:03:05.0: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, (Requester ID)
> [ 51.038881] pcieport 0000:03:05.0: device [10b5:8619] error status/mask=00100000/00000000
> [ 51.045931] pcieport 0000:03:05.0: [20] UnsupReq (First)
> [ 51.051430] pcieport 0000:03:05.0: AER: TLP Header: 34000000 07001f10 00000000 88c888c8
> [ 51.058320] pcieport 0000:03:07.0: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, (Requester ID)
> [ 51.068147] pcieport 0000:03:07.0: device [10b5:8619] error status/mask=00100000/00000000
> [ 51.075200] pcieport 0000:03:07.0: [20] UnsupReq (First)
> [ 51.080696] pcieport 0000:03:07.0: AER: TLP Header: 34000000 08001f10 00000000 88c888c8
> [ 51.087589] iwlwifi 0000:04:00.0: AER: can't recover (no error_detected callback)
> [ 51.087598] pcieport 0000:03:01.0: AER: device recovery failed
> [ 51.087611] iwlwifi 0000:05:00.0: AER: can't recover (no error_detected callback)
> [ 51.087615] pcieport 0000:03:02.0: AER: device recovery failed
> [ 51.087628] iwlwifi 0000:06:00.0: AER: can't recover (no error_detected callback)
> [ 51.087631] pcieport 0000:03:03.0: AER: device recovery failed
> [ 51.087643] iwlwifi 0000:07:00.0: AER: can't recover (no error_detected callback)
> [ 51.087646] pcieport 0000:03:05.0: AER: device recovery failed
> [ 51.087659] iwlwifi 0000:08:00.0: AER: can't recover (no error_detected callback)
> [ 51.087662] pcieport 0000:03:07.0: AER: device recovery failed
> [ 51.103761] pcieport 0000:00:1c.0: AER: Uncorrected (Non-Fatal) error received: 0000:03:0f.0
> [ 51.103778] pcieport 0000:03:0f.0: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, (Requester ID)
> [ 51.113608] pcieport 0000:03:0f.0: device [10b5:8619] error status/mask=00100000/00000000
> [ 51.120658] pcieport 0000:03:0f.0: [20] UnsupReq (First)
> [ 51.126152] pcieport 0000:03:0f.0: AER: TLP Header: 34000000 0f001f10 00000000 88c888c8
> [ 51.133044] iwlwifi 0000:0f:00.0: AER: can't recover (no error_detected callback)
> [ 51.133068] pcieport 0000:03:0f.0: AER: device recovery failed
> [ 51.168925] pcieport 0000:00:1c.0: AER: Uncorrected (Non-Fatal) error received: 0000:03:0f.0
> [ 51.168940] pcieport 0000:03:0f.0: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, (Requester ID)
> [ 51.178773] pcieport 0000:03:0f.0: device [10b5:8619] error status/mask=00100000/00000000
> [ 51.185823] pcieport 0000:03:0f.0: [20] UnsupReq (First)
> [ 51.191318] pcieport 0000:03:0f.0: AER: TLP Header: 34000000 0f001f10 00000000 88c888c8
> [ 51.198211] iwlwifi 0000:0f:00.0: AER: can't recover (no error_detected callback)
> [ 51.198234] pcieport 0000:03:0f.0: AER: device recovery failed
> [ 51.260695] pcieport 0000:00:1c.0: AER: Uncorrected (Non-Fatal) error received: 0000:03:0f.0
> [ 51.260710] pcieport 0000:03:0f.0: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, (Requester ID)
> [ 51.270548] pcieport 0000:03:0f.0: device [10b5:8619] error status/mask=00100000/00000000
> [ 51.277605] pcieport 0000:03:0f.0: [20] UnsupReq (First)
> [ 51.283103] pcieport 0000:03:0f.0: AER: TLP Header: 34000000 0f001f10 00000000 88c888c8
> [ 51.290009] iwlwifi 0000:0f:00.0: AER: can't recover (no error_detected callback)
> [ 51.290033] pcieport 0000:03:0f.0: AER: device recovery failed
> [ 51.328514] pcieport 0000:00:1c.0: AER: Uncorrected (Non-Fatal) error received: 0000:03:0f.0
> [ 51.328530] pcieport 0000:03:0f.0: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, (Requester ID)
> [ 51.331638] ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
> [ 51.338363] pcieport 0000:03:0f.0: device [10b5:8619] error status/mask=00100000/00000000
> [ 51.338364] pcieport 0000:03:0f.0: [20] UnsupReq (First)
> [ 51.345413] ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
> [ 51.350900] pcieport 0000:03:0f.0: AER: TLP Header: 34000000 0f001f10 00000000 88c888c8
> [ 51.350927] iwlwifi 0000:0f:00.0: AER: can't recover (no error_detected callback)
>
>
> Thanks,
> Ben
>
> --
> Ben Greear
> Candela Technologies Inc http://www.candelatech.com
>
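
For anyone who wants to read the quoted log more closely, here is a rough sketch that decodes the two values that repeat in it: the uncorrectable error status word (00100000) and the first two DWORDs of the logged TLP header. The helper names are made up for this mail. The bit names follow the abbreviations the kernel prints, and the DW1 layout used here is the one for requests and messages (completions differ). The meaning of message code 0x10 is my assumption from the PCIe message code table and should be checked against the spec.

```python
# Abbreviations for AER Uncorrectable Error Status bits, in the style of
# the kernel's AER output (only UnsupReq, bit 20, appears in this log).
UNCOR_STATUS_BITS = {
    4: "DLP", 5: "SDES", 12: "TLP", 13: "FCP", 14: "CmpltTO",
    15: "CmpltAbrt", 16: "UnxCmplt", 17: "RxOF", 18: "MalfTLP",
    19: "ECRC", 20: "UnsupReq", 21: "ACSViol",
}

# Assumption: 0x10 is taken to be the LTR message code; verify in the spec.
MESSAGE_CODES = {0x10: "LTR (assumed)"}


def decode_uncor_status(status: int) -> list[str]:
    """Return the names of all known bits set in the status word."""
    return [name for bit, name in sorted(UNCOR_STATUS_BITS.items())
            if status & (1 << bit)]


def decode_tlp_header(header: str) -> dict:
    """Decode DW0 and DW1 of a logged TLP header given as hex words."""
    dw0, dw1 = (int(word, 16) for word in header.split()[:2])
    tlp_type = (dw0 >> 24) & 0x1F
    is_message = (tlp_type >> 3) == 0b10  # Type 10rrr encodes a Message
    requester_id = dw1 >> 16              # request/message layout of DW1
    result = {
        "fmt": (dw0 >> 29) & 0x7,
        "type": tlp_type,
        "is_message": is_message,
        "requester": "{:02x}:{:02x}.{}".format(
            requester_id >> 8, (requester_id >> 3) & 0x1F, requester_id & 0x7),
        "tag": (dw1 >> 8) & 0xFF,
    }
    if is_message:
        # For Message TLPs the low byte of DW1 carries the message code.
        code = dw1 & 0xFF
        result["message_code"] = code
        result["message_name"] = MESSAGE_CODES.get(code, "unknown")
    return result


print(decode_uncor_status(0x00100000))
print(decode_tlp_header("34000000 04001f10 00000000 88c888c8"))
```

Decoded this way, the requester ID in each logged header (04:00.0, 05:00.0, ... 0f:00.0) matches one of the iwlwifi devices in the log, so the request that each switch port flags as unsupported appears to originate from the wifi card behind that port.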