Received: by 10.192.165.156 with SMTP id m28csp1519542imm; Wed, 18 Apr 2018 10:57:01 -0700 (PDT) X-Google-Smtp-Source: AIpwx49Wb6FVmFE9PjaZtM6r33OD+sYPEkrAK3fJkFFuVqi/u6qt6aHUbVsBnRTS6k4l69zAO4ea X-Received: by 2002:a17:902:6e8f:: with SMTP id v15-v6mr3008029plk.245.1524074221226; Wed, 18 Apr 2018 10:57:01 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1524074221; cv=none; d=google.com; s=arc-20160816; b=g9KNFYXDFbMQm/h1JR9ed81jTbH58t/hJ1jW0M/DrzN+LdafuztlW3xNMk1W6y7Vrh Luvg/+5qgLnjONmeZhcVRMkAjcSyLpcCG9h7SQF6IfaUaDFadzi0JStzJBmth2O+RTDc nclJHUnE5V7Mu/IxewK3Z1cMDKT1q2EmdHBvwko6sAysxUBSaNRVa28Qol/pJ+HjU+4D DLneibcvmBTLWT+UxlTKByb53ekGC76x41vPp5ux1OwkhWm1k53PGXqUzCf7s8DkK7uf 8WrYeOlF/+STt3C2bSRPx9VsyPscYPflLdqDRlUQklapel/xROOT8du3lnk5RvTETBbZ 1LgA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=IaXcvG43vNXcmoFQt2b8Wkm7gHsBd6E801mLCu4WinI=; b=EHp19EpPxsmCyAHWM4SyekIB4lA4Ib4ffNFO8mlSDKPgohhKychxiDtK+TgvI9W1em qWmMsOoexPQ6sXKYKozd7Ra5tY0CDqiCVZElMyZxl9aQhX8faJYqccGfnLDRvxoCgyMh vA7gT7ja+CwspFsb5cFoV1je7Wb9CLYRGXd0QgqhvhWv1OtTCL3UBoFohRaIlKIorgKV Oz9M8fLd9uERd9LvMTScXB6xI26PTtl8U5na0h7SzXgNFWLnLrN5eDG1KQxoirjJ0oY1 mmPDweO6MByXbd/fYwgKVDEPSlOn//HAtunHaJ36lOQpPHfYorXsItg7JlhFOlvOj695 hrYw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id a139si1664628pfd.308.2018.04.18.10.56.47; Wed, 18 Apr 2018 10:57:01 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753206AbeDRRzX (ORCPT + 99 others); Wed, 18 Apr 2018 13:55:23 -0400 Received: from mail.skyhub.de ([5.9.137.197]:51306 "EHLO mail.skyhub.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752760AbeDRRzS (ORCPT ); Wed, 18 Apr 2018 13:55:18 -0400 X-Virus-Scanned: Nedap ESD1 at mail.skyhub.de Received: from mail.skyhub.de ([127.0.0.1]) by localhost (blast.alien8.de [127.0.0.1]) (amavisd-new, port 10026) with ESMTP id A-scsfdKUeTX; Wed, 18 Apr 2018 19:55:01 +0200 (CEST) Received: from pd.tnic (p200300EC2BCA86003047E01E637F8411.dip0.t-ipconnect.de [IPv6:2003:ec:2bca:8600:3047:e01e:637f:8411]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.skyhub.de (SuperMail on ZX Spectrum 128k) with ESMTPSA id 8EC761EC00FF; Wed, 18 Apr 2018 19:54:53 +0200 (CEST) Date: Wed, 18 Apr 2018 19:54:52 +0200 From: Borislav Petkov To: Alexandru Gagniuc Cc: linux-acpi@vger.kernel.org, linux-edac@vger.kernel.org, rjw@rjwysocki.net, lenb@kernel.org, tony.luck@intel.com, tbaicar@codeaurora.org, will.deacon@arm.com, james.morse@arm.com, shiju.jose@huawei.com, zjzhang@codeaurora.org, gengdongjiu@huawei.com, linux-kernel@vger.kernel.org, alex_gagniuc@dellteam.com, austin_bolen@dell.com, shyam_iyer@dell.com, devel@acpica.org, mchehab@kernel.org, robert.moore@intel.com, erik.schmauss@intel.com Subject: Re: [RFC PATCH v2 4/4] acpi: apei: Warn when GHES marks correctable errors as "fatal" Message-ID: <20180418175452.GK4795@pd.tnic> References: <20180416215903.7318-1-mr.nuke.me@gmail.com> <20180416215903.7318-5-mr.nuke.me@gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <20180416215903.7318-5-mr.nuke.me@gmail.com> User-Agent: Mutt/1.9.3 (2018-01-21) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Apr 16, 2018 at 04:59:03PM -0500, Alexandru Gagniuc wrote: > There seems to be a culture amongst BIOS teams to want to crash the > OS when an error can't be handled in firmware. Marking GHES errors as > "fatal" is a very common way to do this. > > However, a number of errors reported by GHES may be fatal in the sense > a device or link is lost, but are not fatal to the system. When there > is a disagreement with firmware about the handleability of an error, > print a warning message. > > Signed-off-by: Alexandru Gagniuc > --- > drivers/acpi/apei/ghes.c | 10 +++++++++- > 1 file changed, 9 insertions(+), 1 deletion(-) > > diff --git a/drivers/acpi/apei/ghes.c b/drivers/acpi/apei/ghes.c > index e0528da4e8f8..6a117825611d 100644 > --- a/drivers/acpi/apei/ghes.c > +++ b/drivers/acpi/apei/ghes.c > @@ -535,13 +535,14 @@ static const struct ghes_handler *get_handler(const guid_t *type) > static void ghes_do_proc(struct ghes *ghes, > const struct acpi_hest_generic_status *estatus) > { > - int sev, sec_sev; > + int sev, sec_sev, corrected_sev; > struct acpi_hest_generic_data *gdata; > const struct ghes_handler *handler; > guid_t *sec_type; > guid_t *fru_id = &NULL_UUID_LE; > char *fru_text = ""; > > + corrected_sev = GHES_SEV_NO; > sev = ghes_severity(estatus->error_severity); > apei_estatus_for_each_section(estatus, gdata) { > sec_type = (guid_t *)gdata->section_type; > @@ -563,6 +564,13 @@ static void ghes_do_proc(struct ghes *ghes, > sec_sev, err, > gdata->error_data_length); > } > + > + corrected_sev = max(corrected_sev, sec_sev); > + } > + > + if ((sev >= GHES_SEV_PANIC) && (corrected_sev < sev)) { > + pr_warn("FIRMWARE BUG: Firmware sent fatal error that we were able to correct"); > + pr_warn("BROKEN FIRMWARE: Complain to your hardware vendor"); No, I don't want any of that crap issuing stuff in dmesg and then people opening bugs and running around and trying to replace hardware. We either can handle the error and log a normal record somewhere or we cannot and explode. The complaining about the FW doesn't bring shit. -- Regards/Gruss, Boris. Good mailing practices for 400: avoid top-posting and trim the reply.