Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933502AbdGSQKW (ORCPT ); Wed, 19 Jul 2017 12:10:22 -0400 Received: from g4t3425.houston.hpe.com ([15.241.140.78]:46288 "EHLO g4t3425.houston.hpe.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754369AbdGSQKS (ORCPT ); Wed, 19 Jul 2017 12:10:18 -0400 From: "Kani, Toshimitsu" To: "bp@alien8.de" CC: "linux-kernel@vger.kernel.org" , "tglx@linutronix.de" , "mchehab@kernel.org" , "rjw@rjwysocki.net" , "srinivas.pandruvada@linux.intel.com" , "tony.luck@intel.com" , "lenb@kernel.org" , "linux-acpi@vger.kernel.org" , "linux-edac@vger.kernel.org" Subject: Re: [PATCH 3/3] ghes_edac: add platform check to enable ghes_edac Thread-Topic: [PATCH 3/3] ghes_edac: add platform check to enable ghes_edac Thread-Index: AQHS/0lGOl4i5ss6QUSn88C0VF1rJqJZF6OAgAAj5gCAANq7AIAAkZmAgACp84A= Date: Wed, 19 Jul 2017 16:10:07 +0000 Message-ID: <1500480051.2042.27.camel@hpe.com> References: <20170717215912.26070-1-toshi.kani@hpe.com> <20170717215912.26070-4-toshi.kani@hpe.com> <20170718060007.GB8736@nazgul.tnic> <20170718080836.GB8372@nazgul.tnic> <1500412288.2042.25.camel@hpe.com> <20170719055235.GD26030@nazgul.tnic> In-Reply-To: <20170719055235.GD26030@nazgul.tnic> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: authentication-results: alien8.de; dkim=none (message not signed) header.d=none;alien8.de; dmarc=none action=none header.from=hpe.com; x-originating-ip: [15.203.227.8] x-ms-publictraffictype: Email x-microsoft-exchange-diagnostics: 1;DF4PR84MB0188;7:Ld2Rx9PWlibbesfpoOduPxj9uBRWxn2ggqZ9USlpSGZjE/L9qu0pnUnDfCQ4mn7OOiohk7mDDAxUb1JukhODzle2GSM9rj7WRiIgEh+CE62T/Euuf12MRWjdv/MLaAftMXZ68q518DJ1Vz2JGWGUyWwhIno+NWOF7dPbNVbWe7/w16cEF/BDksR24BrLsoV/AWweSFoq8nqYJD4wa9bPRpu5C4QOWFVJEqTPSj4p06vacc2g05iKUUgKPjwcBVmUiwoescLE7IDFyCxqTMQFC4aQFaiY3vmMmXE3JBor3dbystnNXgBnXoSDkIwpl6NotrqesoB86PYLzFqkKjikQoPZnoDPTFhS2A54v4rGyTmp6NE7keRU7OjHzu5Dv0pv9q1//PLrofkcgyPHYKAlLJff23Tx3zZvmlPlO62TUsKYP2q2Mxq76HDDd2IJ/F9X9Y3Fyrm3llPyJ7DBjZ7cD89vFVAD730qg1zBykGSuV3ok2fSqXafwpWM9NSNHnJ1UlzdClht/a9tIYctdFsZ8OCAUU/TxJytCoP73PAT87TQzbavUsYp08n1F1tciABJ0p2H1HBSUEggmoSjRROpnVV7+P63l7kV5Uikh07iMggayDYMT2ZlyIT+qPVRKaxnlux+zPJSwfQtv0AczTmpTyNZ0d3dzDBXPKTWHxpljx14p09mWD2XdSEhZzd6ET0TlPKet4588urHs3mYynQSh1rIt2FzGIEkO/MTc1ow+LTT1OtSNRXEMrICIA8CcooEGGJakw4yXW66qUrXd8MXfNOt/slhSarJLlX0jphmoZI= x-ms-office365-filtering-correlation-id: 3c51537c-546a-4474-415f-08d4cec0a016 x-ms-office365-filtering-ht: Tenant x-microsoft-antispam: UriScan:;BCL:0;PCL:0;RULEID:(300000500095)(300135000095)(300000501095)(300135300095)(22001)(300000502095)(300135100095)(2017030254075)(48565401081)(300000503095)(300135400095)(2017052603031)(201703131423075)(201703031133081)(300000504095)(300135200095)(300000505095)(300135600095)(300000506095)(300135500095);SRVR:DF4PR84MB0188; x-ms-traffictypediagnostic: DF4PR84MB0188: x-exchange-antispam-report-test: UriScan:(236129657087228); x-microsoft-antispam-prvs: x-exchange-antispam-report-cfa-test: BCL:0;PCL:0;RULEID:(100000700101)(100105000095)(100000701101)(100105300095)(100000702101)(100105100095)(6040450)(601004)(2401047)(2017060910075)(5005006)(8121501046)(3002001)(93006095)(93001095)(100000703101)(100105400095)(10201501046)(6055026)(6041248)(20161123564025)(201703131423075)(201702281528075)(201703061421075)(201703061406153)(20161123555025)(20161123560025)(20161123558100)(20161123562025)(6072148)(100000704101)(100105200095)(100000705101)(100105500095);SRVR:DF4PR84MB0188;BCL:0;PCL:0;RULEID:(100000800101)(100110000095)(100000801101)(100110300095)(100000802101)(100110100095)(100000803101)(100110400095)(100000804101)(100110200095)(100000805101)(100110500095);SRVR:DF4PR84MB0188; x-forefront-prvs: 0373D94D15 x-forefront-antispam-report: SFV:NSPM;SFS:(10019020)(6009001)(39400400002)(39850400002)(39840400002)(39410400002)(39860400002)(39450400003)(377424004)(24454002)(3846002)(14454004)(77096006)(66066001)(2900100001)(6486002)(93886004)(4326008)(2501003)(2351001)(5660300001)(6506006)(189998001)(25786009)(6116002)(102836003)(478600001)(6436002)(86362001)(6512007)(2950100002)(229853002)(38730400002)(6916009)(3660700001)(53936002)(54906002)(103116003)(81166006)(3280700002)(110136004)(6246003)(33646002)(54356999)(5640700003)(2906002)(8676002)(50986999)(305945005)(8936002)(76176999)(36756003)(7416002)(1730700003)(7736002);DIR:OUT;SFP:1102;SCL:1;SRVR:DF4PR84MB0188;H:DF4PR84MB0187.NAMPRD84.PROD.OUTLOOK.COM;FPR:;SPF:None;MLV:sfv;LANG:en; spamdiagnosticoutput: 1:99 spamdiagnosticmetadata: NSPM Content-Type: text/plain; charset="utf-8" Content-ID: <52895BDA465CFE48BB7EBD21C875F15B@NAMPRD84.PROD.OUTLOOK.COM> MIME-Version: 1.0 X-MS-Exchange-CrossTenant-originalarrivaltime: 19 Jul 2017 16:10:07.9375 (UTC) X-MS-Exchange-CrossTenant-fromentityheader: Hosted X-MS-Exchange-CrossTenant-id: 105b2061-b669-4b31-92ac-24d304d195dc X-MS-Exchange-Transport-CrossTenantHeadersStamped: DF4PR84MB0188 X-OriginatorOrg: hpe.com Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from base64 to 8bit by nfs id v6JGAPBo010369 Content-Length: 2164 Lines: 52 On Wed, 2017-07-19 at 07:52 +0200, Borislav Petkov wrote: > On Tue, Jul 18, 2017 at 09:20:44PM +0000, Kani, Toshimitsu wrote: > > I agree that 'osc_sb_apei_support_acked' should be checked when > > enabling ghes_edac.  I do not know the details of existing issues, > > but it sounds unlikely that this will address all of them since > > bugs can be everywhere. > > No, see below. > > > For instance, ghes_edac relies on DMI/SMBIOS info, unlike > > other EDAC drivers, which can be buggy regardless of this _OSC > > info. > > That's the problem with firmware. You can't really fix it and it is > buggy as hell. Right, and that's what I was told as an issue for ghes_edac. This is why this patch introduces a white-list to preclude all buggy firmwares that are unknown to us... > > I agree that making ghes_edac as a normal module is a good thing, > > but I do not think it's going to solve this issue. > > Of course it will - if the firmware says it wants to look at the > errors first, then it gets to do so. This is the whole handling of > hardware errors in the firmware deal. I admit, sometimes it makes > sense because the firmware has the most intimate knowledge of the > platform and, in a perfect world, we won't ever need to have > platform-specific EDAC drivers. > > But, we don't live in a perfect world. And the vendor execution of > the whole firmware-error-handling deal is an abomination at best. > > So, if we realize that the firmware is buggy, we can use a platform > list to blacklist it (^hint hint^) and have a parameter to disable > ghes_edac from loading. Setting blacklist needs us to enable ghes_edac and find all buggy firmwares to date. I think this is too disturbing for people who are happily using regular edac drivers today even though their platforms have GHES. > But we'll deal with that when we get to cross that bridge. Right now, > I'd like to do the loading spec-conform and not fiddle with white-, > black-, or any-other-color lists. I do prefer to avoid any white / black listing. But I do not see how it solves the buggy DMI/SMBIOS info as an example of firmware bugs we may have to deal with. Thanks, -Toshi