Received: by 2002:a05:7412:2a91:b0:fc:a2b0:25d7 with SMTP id u17csp494558rdh; Wed, 14 Feb 2024 03:16:11 -0800 (PST) X-Forwarded-Encrypted: i=3; AJvYcCWTSsg9zqbAffxQlKIGmgyg4KjFEplRBVAZb1PrNk+50p2jFD8u29uqTpHkvqRb+dJagGDpMwwNGY4ajBJMTfflE0lIENTAkXLRj16svw== X-Google-Smtp-Source: AGHT+IEBSGu9alSUUneDxt3A+6SrrPc8gr5Kdj4JbwMKelWATNmBO08HRgHxdS61dQzIVjTQTgx/ X-Received: by 2002:a05:6a00:14d5:b0:6db:afd4:f54c with SMTP id w21-20020a056a0014d500b006dbafd4f54cmr2313507pfu.0.1707909371446; Wed, 14 Feb 2024 03:16:11 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1707909371; cv=pass; d=google.com; s=arc-20160816; b=MXjc/sBmZIkNLZMsehpygGpJWRl1ZhySSWnz3PGQU7XBWtyOtH4Au19nwz/w21vrNH S2/FFmYPPhZDU/RM6STyPv8bS5CVTVzN0HDZ7RTOfGAglqQftC7xw5vc0Clk9f1AR4fw 6u2LjXIeEiziIMZzvNSVPR8A1NUwvOELN56oQ2IQKSQk2PHpQGCg8b7mn5SMUHtd1xD1 XySOwGc2hrdx7aDKfT5mg/FRaYC++U4xt6/B6cVUb9E5+gaUANGdCEebc8dVhPbJlrJr sMmBH+IFZGJQPQRvv0wgr3TksNh7iKMcpdbF9jrHNa1A18up381YFuW9AA4+k28sEDY7 cdpQ== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=in-reply-to:content-disposition:mime-version:list-unsubscribe :list-subscribe:list-id:precedence:references:message-id:subject:cc :to:from:date:dkim-signature; bh=gAh8zSlHY9nEU0SOAxu+J+5szZ031ffvGmvN1jvSx94=; fh=YhLlNHM1hdwgIWmakPUcMlseQqcLRm7DDrByiYEie9A=; b=HaoCA9q95ySv+zDBBQmRIOzsiKOpHsJdnBsvl6XR/qljImfyco1qm/GzqZ4SL0FuDx F1GOeJJtxOcg1LmF5b7aKA4DI/VtIMdXKunk4B5BQNPdR5J8cM6MrPdeD1BUDzu2E6eY XIHPyPF9D3BK4FQgGvV/5jFYr6ZIv53id60w8d7IdY6A/2NwJSEqIvxXl7Z3Ipavm1gq dhU3yLOADTrT0zOcgPZlXjEVYAnjhQq/aazgRFHVKkvRb0DTWXkn4oX5O49rMNywkdNA qF7YT2HVhhiGqUvcsjso6B3I7DoO8xWyCX7zpJCu8rIittZ7fwquI/MOWdwOTI7xC6zw Am9A==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=cD3qBx6S; arc=pass (i=1 dkim=pass dkdomain=kernel.org); spf=pass (google.com: domain of linux-kernel+bounces-65100-linux.lists.archive=gmail.com@vger.kernel.org designates 139.178.88.99 as permitted sender) smtp.mailfrom="linux-kernel+bounces-65100-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org X-Forwarded-Encrypted: i=2; AJvYcCUdv+mfFHV2cqfQXX3Wg6DquZORFpeT5NDIqOwl70hNZR+CQBloY3lZluaL8w0WfT8AIzesEgPLXxtsNcNK/Cl5ActeoiQWCYb3PvU+/w== Return-Path: Received: from sv.mirrors.kernel.org (sv.mirrors.kernel.org. [139.178.88.99]) by mx.google.com with ESMTPS id by32-20020a056a0205a000b005dc8834047esi2491655pgb.422.2024.02.14.03.16.11 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 14 Feb 2024 03:16:11 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel+bounces-65100-linux.lists.archive=gmail.com@vger.kernel.org designates 139.178.88.99 as permitted sender) client-ip=139.178.88.99; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=cD3qBx6S; arc=pass (i=1 dkim=pass dkdomain=kernel.org); spf=pass (google.com: domain of linux-kernel+bounces-65100-linux.lists.archive=gmail.com@vger.kernel.org designates 139.178.88.99 as permitted sender) smtp.mailfrom="linux-kernel+bounces-65100-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by sv.mirrors.kernel.org (Postfix) with ESMTPS id 81A7C2902E9 for ; Wed, 14 Feb 2024 11:09:09 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 0B92119472; Wed, 14 Feb 2024 11:08:59 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b="cD3qBx6S" Received: from smtp.kernel.org (aws-us-west-2-korg-mail-1.web.codeaurora.org [10.30.226.201]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id ED89118E06; Wed, 14 Feb 2024 11:08:57 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=10.30.226.201 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1707908938; cv=none; b=eSbvcGshYAIf3REXimuthESOfqEEFXgNAv6W1+RvQMXeOjsLMa3GSh89OUGlsKJZd59HvB8oew61FkdMNpQpmnIa1pMzo4uuLeaSFPasaq9/lN3g/MYBoh1au3H3H3i/RKeplngkDrsJksM3ovI7qor7o+CVyv3jl6nWmEBMydE= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1707908938; c=relaxed/simple; bh=hB8KBBOQw2o2auMkE+FoaFIV9znv2P+vBlojlfb7ka0=; h=Date:From:To:Cc:Subject:Message-ID:References:MIME-Version: Content-Type:Content-Disposition:In-Reply-To; b=JiKePTM1xVgG2FMBrajj42GZG2QL4j7ZxlTk1mbDMKMiRXRJ0PBJebnzPHYMBxbKvvXz3NOS5B1OcERQjizhCgWRO3b7JyCk0o1JGCaE23rJjUuSPDYvyeFLafvNguSeaKmIzeO93alHaH1PI0gINHeD0yMoxFV8/lCoqUbABD8= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b=cD3qBx6S; arc=none smtp.client-ip=10.30.226.201 Received: by smtp.kernel.org (Postfix) with ESMTPSA id 463D4C433C7; Wed, 14 Feb 2024 11:08:57 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1707908937; bh=hB8KBBOQw2o2auMkE+FoaFIV9znv2P+vBlojlfb7ka0=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=cD3qBx6SNQ3J5ZO2E+FFtDFxOJT5pjB6oBjzWzKWMfVVxSrUX/Uq2XBwboh3mMK2F nXSBXP/oqfOtbdYdj1P8j+WXSjRdlCVOwEMB0ScL6DJYNH2Fc6wPkd4C6r/CyJDfA4 qxX2TI3zzAhKp1tXOM3ORBP56YM6zMRJZ0/xafU68I0k/52pCKPCK5/AW92rq38aEj iSJLwIxfxorc27ubCSlMuMQf1hWH5XwTTnmgWZJwI2J4omQgtkbiXWauREIFy7Inp4 bMlaP/S2XOLiPLurHaIgLCMZfwYXuBTitG6new5o/mhBMlXwcrTchDsw7ZdCGNMa+5 iQ825CNQglrEA== Received: from johan by xi.lan with local (Exim 4.97.1) (envelope-from ) id 1raD8d-000000004Y8-3YdG; Wed, 14 Feb 2024 12:09:15 +0100 Date: Wed, 14 Feb 2024 12:09:15 +0100 From: Johan Hovold To: Manivannan Sadhasivam Cc: Johan Hovold , Bjorn Andersson , Bjorn Helgaas , Konrad Dybcio , Lorenzo Pieralisi , Krzysztof =?utf-8?Q?Wilczy=C5=84ski?= , Rob Herring , Krzysztof Kozlowski , Conor Dooley , linux-arm-msm@vger.kernel.org, linux-pci@vger.kernel.org, devicetree@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH 00/10] arm64: dts: qcom: sc8280xp: enable GICv3 ITS for PCIe Message-ID: References: <20240212165043.26961-1-johan+linaro@kernel.org> <20240214063554.GC4618@thinkpad> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20240214063554.GC4618@thinkpad> On Wed, Feb 14, 2024 at 12:05:54PM +0530, Manivannan Sadhasivam wrote: > On Mon, Feb 12, 2024 at 05:50:33PM +0100, Johan Hovold wrote: > > This series addresses a few problems with the sc8280xp PCIe > > implementation. > > > > The DWC PCIe controller can either use its internal MSI controller or an > > external one such as the GICv3 ITS. Enabling the latter allows for > > assigning affinity to individual interrupts, but results in a large > > amount of Correctable Errors being logged on both the Lenovo ThinkPad > > X13s and the sc8280xp-crd reference design. > > > > It turns out that these errors are always generated, > > How did you confirm this? You can see that error flags being set in the controller and endpoint, for example, using lspci -vv: CESta: RxErr- BadTLP+ BadDLLP- Rollover- Timeout- AdvNonFatalErr- > > but for some yet to > > be determined reason, the AER interrupts are never received when using > > the internal MSI controller, which makes the link errors harder to > > notice. > > If you manually inject the errors using "aer-inject", are you not seeing the AER > errors with internal MSI controller as well? I haven't tried that, I'm just reporting that that piece of functionality is currently broken and that that partly explains why the ASPM problems went unnoticed. > > On the X13s, there is a large number of errors generated when bringing > > up the link on boot. This is related to the fact that UEFI firmware has > > already enabled the Wi-Fi PCIe link at Gen2 speed and restarting the > > link at Gen3 generates a massive amount of errors until the Wi-Fi > > firmware is restarted. > > > > A recent commit enabling ASPM on certain Qualcomm platforms introduced > > further errors when using the Wi-Fi on the X13s as well as when > > accessing the NVMe on the CRD. The exact reason for this has not yet > > been identified, but disabling ASPM L0s makes the errors go away. This > > could suggest that either the current ASPM implementation is incomplete > > or that L0s is not supported with these devices. > > What are those "further errors" you are seeing with ASPM enabled? Are those > errors appear with GIC ITS or with internal MSI controller as well? Further errors as in further correctable errors that are not related to the errors seen when resetting the X13s Wi-Fi link at boot. These show up, for example, when accessing the NVMe on the CRD or when using the Wi-Fi on the X13s. These errors go away when L0s is disabled. And yes, you see them with both the external and internal MSI controller (in the latter case, by looking at the error flags mentioned above). > > Note that the X13s and CRD use the same Wi-Fi controller, but the errors > > are only generated on the X13s. The NVMe controller on my X13s does not > > support L0s so there are no issues there, unlike on the CRD which uses a > > different controller. The modem on the CRD does not generate any errors, > > but both the NVMe and modem keeps bouncing in and out of L0s/L1 also > > when not used, which could indicate that there are bigger problems with > > the ASPM implementation. I don't have a modem on my X13s so I have not > > been able to test whether L0s causes an trouble there. > > > > Enabling AER error reporting on sc8280xp could similarly also reveal > > existing problems with the related sa8295p and sa8540p platforms as they > > share the base dtsi. > > > > The last four patches, marked as RFC, adds support for disabling ASPM > > L0s in the devicetree and disables it selectively for the X13s Wi-Fi > > and CRD NVMe. If it turns out that the Qualcomm PCIe implementation is > > incomplete, we may need to disable ASPM (L0s) completely in the driver > > instead. > > If the device is not supporting L0s, then it as to be disabled in the device, > not in the PCIe controller, no? Well, we don't know yet where the problem lies, just that enabling L0s results in a large number of correctable errors. Until yesterday I had not seen any such errors for the Wi-Fi on the CRD, which uses essentially the same ath11k controller, so there was no clear indication that this was necessarily a problem with the devices either. > > Note that disabling ASPM L0s for the X13s Wi-Fi does not seem to have a > > significant impact on the power consumption > > > > The DT bindings and PCI patch are expected to go through the PCI tree, > > while Bjorn A takes the devicetree updates through the Qualcomm tree. > > Since I took a stab at enabling the GIC ITS previously, I noticed that the NVMe > performance got a slight dip. And that was one of the reasons (apart from AER > errors) that I never submitted the patch. > > Could you share the NVMe benchmark (fio) with this series? Did you have any particular benchmark in mind? I have run multiple fio benchmarks and while the results vary with the parameters, the impact of switching to ITS (so that not all PCIe interrupts are processed on CPU0) is generally favourable. A raw sequential read shows no change in throughput on either the X13s or the CRD even if for some reason this test performs really badly on the X13s (i.e. regardless of which MSI controller is used): crd-rseq-read: IOPS=11.1k, BW=2764MiB/s (2898MB/s)(81.0GiB/30003msec) X13s-rseq-read: IOPS=508, BW=127MiB/s (134MB/s)(3841MiB/30169msec) Another benchmark I've used against a mounted ext4 partition shows a 2x improvement in throughput with ITS for sequential and random reads and writes on the X13s: seq-read: IOPS=88.4k, BW=345MiB/s (362MB/s)(10.0GiB/29657msec) rand-read: IOPS=21.2k, BW=82.8MiB/s (86.8MB/s)(4967MiB/60001msec) seq-write: IOPS=162k, BW=632MiB/s (662MB/s)(10.0GiB/16213msec) rand-write: IOPS=142k, BW=555MiB/s (582MB/s)(10.0GiB/18439msec) while the results are essentially unchanged with a larger block size and queue depth (32/2m instead of 4/4k): seq-read: IOPS=1095, BW=2191MiB/s (2298MB/s)(10.0GiB/4673msec) rand-read: IOPS=1020, BW=2041MiB/s (2140MB/s)(10.0GiB/5017msec) seq-write: IOPS=918, BW=1837MiB/s (1926MB/s)(10.0GiB/5574msec) rand-write: IOPS=826, BW=1653MiB/s (1734MB/s)(10.0GiB/6194msec) > > Johan Hovold (10): > > dt-bindings: PCI: qcom: Allow 'required-opps' > > dt-bindings: PCI: qcom: Do not require 'msi-map-mask' > > arm64: dts: qcom: sc8280xp: add missing PCIe minimum OPP > > arm64: dts: qcom: sc8280xp-crd: limit pcie4 link speed > > arm64: dts: qcom: sc8280xp-x13s: limit pcie4 link speed > > arm64: dts: qcom: sc8280xp: enable GICv3 ITS for PCIe > > Is this patch based on the version I shared with you long back? If so, I'd > expect to have some credit. If you came up with your own version, then ignore > this comment. No, this patch has beeen created and evaluated from scratch based on the downstream direwolf dts, which has these five 'msi-map' properties. I debated whether I should base it on your version instead, but in the end it would have a new commit message and only these properties from the downstream dtsi would remain (you also removed existing properties IIRC). So while it's certainly inspired by your work, this has been done from scratch, including the testing. If you prefer I can make this clear in the commit message, but adding a Co-developed-by didn't seem quite right either as I did this work without your involvement. But perhaps that would be better? Johan