Received: by 10.223.185.116 with SMTP id b49csp405018wrg; Tue, 20 Feb 2018 01:08:52 -0800 (PST) X-Google-Smtp-Source: AH8x226ye/v8muO6mmimpY/nK7vR+yWdcFYZiw34KW142wUyw/fWKcVR1U63leC0HFuim27JYnNS X-Received: by 10.99.96.137 with SMTP id u131mr14882127pgb.103.1519117732141; Tue, 20 Feb 2018 01:08:52 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1519117732; cv=none; d=google.com; s=arc-20160816; b=tKi5p7JI3rm+PX+uWI3t9+id0E/GJbA0T5+xGFXx24Xb8lOXhBzBWg3RIl35dvgnXC RUJeln1ic+gDDYdk5FdhPanAEyvZjtQg9hDCoq6nYiF8xCYiVqz8mzgD5yHWEMu0RnnF JzdOIuVr9FLvk4KjfC9Xw31ZkV+QsacUnAULOtigtJ1aIzIzIBTz8b/65J0uNd+wVeW3 u2h2v0/cqT5SrAbX3linn0/77ILa1VwC4ADbgtUqQS4bKymyey9TkR+Wl3Q0gP92H7dv pETX7w6Gc2bbb1vBBFHX/1bFN+cLu4irLDg7R2byebhorsYD5Blp/FQUdf50wVqHmD+f Qu6Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:organization:user-agent :references:in-reply-to:subject:cc:to:from:message-id:date :arc-authentication-results; bh=+VDRSMohm7U8yy+TIxroJcXt6zcD0o5RwOsAmsr0sbM=; b=MafKHDa8u93GZcH+7G9huuqZ75wR4sDnZtOspNbhnti1lLi2Mr0zwS9N27nzxerJpL ATTFNcFHd9+MrQpVuKxQQX9OE+yaBx7gQY90JtOacDVc60ADYTqYN2neY8aTzL7sRIGR zz5uCmccGqRk3w9Sey0DPZ91ik9SbUjAhEaRZWXpOA+rrjFKMOKXCS9iF3gNFmap7ECN YqDoFfYf2Ew6eABwMPl7QmniuTKuevkcJrrOALOlmng1OocAgc5mo3/72XoJ8VoHQEfz 8IUeQpUzKoGchwUb7rqssAOcwIdJVMql0uXtqhQ7at5ghyt2Yz+ejXYhMjNN6MChmE6G 9RrA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r2-v6si1129849pli.625.2018.02.20.01.08.37; Tue, 20 Feb 2018 01:08:52 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751131AbeBTJHt (ORCPT + 99 others); Tue, 20 Feb 2018 04:07:49 -0500 Received: from foss.arm.com ([217.140.101.70]:39878 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750703AbeBTJHp (ORCPT ); Tue, 20 Feb 2018 04:07:45 -0500 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.72.51.249]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id A1FBD80D; Tue, 20 Feb 2018 01:07:44 -0800 (PST) Received: from big-swifty.misterjones.org (big-swifty.cambridge.arm.com [10.1.30.167]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 21B2C3F25C; Tue, 20 Feb 2018 01:07:40 -0800 (PST) Date: Tue, 20 Feb 2018 09:07:38 +0000 Message-ID: <86lgfoc9lx.wl-marc.zyngier@arm.com> From: Marc Zyngier To: Jayachandran C Cc: Ganapatrao Kulkarni , , , , , , , , , , , , Subject: Re: [PATCH v2] irqchip/gic-v3-its: Add workaround for ThunderX2 erratum #174 In-Reply-To: <20180219211151.GA158953@jc-sabre> References: <20180118052820.30286-1-ganapatrao.kulkarni@cavium.com> <20180121070038.GA4450@jc-sabre> <86po635trt.wl-marc.zyngier@arm.com> <20180219211151.GA158953@jc-sabre> User-Agent: Wanderlust/2.15.9 (Almost Unreal) SEMI-EPG/1.14.7 (Harue) FLIM/1.14.9 (=?ISO-8859-4?Q?Goj=F2?=) APEL/10.8 EasyPG/1.0.0 Emacs/25.1 (aarch64-unknown-linux-gnu) MULE/6.0 (HANACHIRUSATO) Organization: ARM Ltd MIME-Version: 1.0 (generated by SEMI-EPG 1.14.7 - "Harue") Content-Type: text/plain; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, 19 Feb 2018 21:12:10 +0000, Jayachandran C wrote: > > On Sun, Jan 21, 2018 at 11:35:34AM +0000, Marc Zyngier wrote: > > On Sun, 21 Jan 2018 07:00:48 +0000, > > Jayachandran C wrote: > > > > > > On Thu, Jan 18, 2018 at 10:58:20AM +0530, Ganapatrao Kulkarni wrote: > > > > This erratum is observed on the ThunderX2 GICv3 ITS. When a > > > > MOVI command is used to change affinity of a LPI to a collection/cpu > > > > on another node, the LPI is not delivered to the cpu. > > > > An additional INV command is required after the MOVI to deliver > > > > the LPI to the new destination. > > > > > > > > If we add INV after MOVI, there is a chance that we lose LPIs which > > > > are raised when the affinity is changed. So for now, adding workaround fix > > > > to disable inter node affinity change. > > > > > > > > Signed-off-by: Ganapatrao Kulkarni > > > > --- > > > > > > > > v2: Added workaround to avoid inter node affinity change. > > > > > > > > v1: Initial patch > > > > > > > > Documentation/arm64/silicon-errata.txt | 1 + > > > > arch/arm64/Kconfig | 10 ++++++++++ > > > > drivers/irqchip/irq-gic-v3-its.c | 21 ++++++++++++++++++++- > > > > 3 files changed, 31 insertions(+), 1 deletion(-) > > > > > > > > diff --git a/Documentation/arm64/silicon-errata.txt b/Documentation/arm64/silicon-errata.txt > > > > index fc1c884..fb27cb5 100644 > > > > --- a/Documentation/arm64/silicon-errata.txt > > > > +++ b/Documentation/arm64/silicon-errata.txt > > > > @@ -63,6 +63,7 @@ stable kernels. > > > > | Cavium | ThunderX Core | #27456 | CAVIUM_ERRATUM_27456 | > > > > | Cavium | ThunderX Core | #30115 | CAVIUM_ERRATUM_30115 | > > > > | Cavium | ThunderX SMMUv2 | #27704 | N/A | > > > > +| Cavium | ThunderX2 ITS | #174 | CAVIUM_ERRATUM_174 | > > > > | Cavium | ThunderX2 SMMUv3| #74 | N/A | > > > > | Cavium | ThunderX2 SMMUv3| #126 | N/A | > > > > | | | | | > > > > diff --git a/arch/arm64/Kconfig b/arch/arm64/Kconfig > > > > index c9a7e9e..0dbf3bd 100644 > > > > --- a/arch/arm64/Kconfig > > > > +++ b/arch/arm64/Kconfig > > > > @@ -461,6 +461,16 @@ config ARM64_ERRATUM_843419 > > > > > > > > If unsure, say Y. > > > > > > > > +config CAVIUM_ERRATUM_174 > > > > + bool "Cavium ThunderX2 erratum 174" > > > > + default y > > > > + help > > > > + Cavium ThunderX2 dual socket systems may loose interrupts > > > > + on affinity change to a cpu on other node. > > > > + This workaround fix avoids inter node affinity change. > > > > > > This has to be fixed up to match the commit message (and for spelling). > > > I have seen some questions offlist about how important this fix is, > > > and how it can affect users - so that would be useful to have in the > > > description as well. > > > > > > To clarify, this errata comes into play only when the irq affinity is > > > forced from the node given by the device (and ITS) affinity to another > > > node. This should not happen in normal, useful configurations. > > > > Define normal. That's all under control of userspace, and the kernel > > doesn't really have a say. irqbalance will happily move interrupts > > around. Disable all CPUs from node at runtime (again, from userspace), > > and you'll get the exact same thing. I can't see what's so "abnormal" > > about any of that. > > > > > Also, we will hold further posting of this errata until we do another > > > round of investigation with the hardware team for a better solution. > > > If we can handle the pending interrupts for the small window of MOVI/INV > > > in first workaround, we will not need this restriction at all. > > > > What do you mean by "If we can handle the pending interrupts for the > > small window of MOVI/INV"? Taking the interrupt on the source CPU? > > Sure, that would be fine. But that's assuming that the souce CPU is in > > a position to actually handle this, and is not simply going down. > > > > If there is only a slight possibility that you may loose an interrupt > > in the MOVI/INV window (which is not that small, since that's a 4 > > command sequence), your only other solution is to inject a spurious > > interrupt to replace the one you may have lost in that window. > > > > In the meantime, and until I see a patch fixing this (or a decent > > explanation of why this isn't a problem), I'll consider it broken. > > After reviewing the issue with our hardware team, we decided to > tweak the redistributor cache configuration from firmware rather > than go with this errata workaournd in Linux (and other OSes). > > So, with the new firmware MOVI will work across nodes as expected, > and this patch is no longer neeeded. That's good news! One question though: how do we detect the old (or new) firmware? We'd still need mitigation for the "old firmware" case, unless you have a way to guarantee that all existing boxes are magically updated to a fixed version? Thanks, M. -- Jazz is not dead, it just smell funny.