Date: Tue, 1 Sep 2020 16:35:39 +0200
From: Borislav Petkov
To: Shiju Jose
Cc: linux-edac@vger.kernel.org, linux-acpi@vger.kernel.org,
    linux-kernel@vger.kernel.org, tony.luck@intel.com, rjw@rjwysocki.net,
    james.morse@arm.com, lenb@kernel.org, linuxarm@huawei.com
Subject: Re: [PATCH 1/1] RAS: Add CPU Correctable Error Collector to isolate an erroneous CPU core
Message-ID: <20200901143539.GC8392@zn.tnic>
In-Reply-To: <20200901140140.1772-1-shiju.jose@huawei.com>
References: <20200901140140.1772-1-shiju.jose@huawei.com>

On Tue, Sep 01, 2020 at 03:01:40PM +0100, Shiju Jose wrote:
> When the CPU correctable errors reported on an ARM64 CPU core too often,
> it should be isolated. Add the CPU correctable error collector to
> store the CPU correctable error count.
>
> When the correctable error count for a CPU exceed the threshold
> value in a short time period, it will try to isolate the CPU core.
> The threshold value, time period etc are configurable.
>
> Implementation details is added in the file.
>
> Signed-off-by: Shiju Jose
> ---
>  Documentation/ABI/testing/debugfs-cpu-cec |  22 ++
>  arch/arm64/ras/Kconfig                    |   8 +
>  drivers/acpi/apei/ghes.c                  |  30 +-
>  drivers/ras/Kconfig                       |   1 +
>  drivers/ras/Makefile                      |   1 +
>  drivers/ras/cpu_cec.c                     | 393 ++++++++++++++++++++++

So instead of adding the ability to collect other error types to the
CEC, you're duplicating the CEC itself?!

Why?

-- 
Regards/Gruss,
    Boris.

https://people.kernel.org/tglx/notes-about-netiquette