Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752874AbeAJXbM (ORCPT + 1 other); Wed, 10 Jan 2018 18:31:12 -0500 Received: from mail-wm0-f49.google.com ([74.125.82.49]:37024 "EHLO mail-wm0-f49.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752278AbeAJXbL (ORCPT ); Wed, 10 Jan 2018 18:31:11 -0500 X-Google-Smtp-Source: ACJfBosWVhqebPGq01vWg8hYBYukixLOCtO1eMmn4ZKvJOGy7jzNNo7i9riUqKZxr110WiyfSYZE2A== Subject: Re: EDAC-AMD64: what is the ecc_msg good for ? To: Borislav Petkov Cc: linux-edac@vger.kernel.org, linux-kernel@vger.kernel.org References: <5dea4184-8782-7b57-2624-2957c91fe4ac@gmail.com> <20180110231235.a3ts353lzajqywb5@pd.tnic> From: Gabriel C Message-ID: <5be13f07-aeae-ee07-1194-f882eb958ac8@gmail.com> Date: Thu, 11 Jan 2018 00:31:08 +0100 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.5.2 MIME-Version: 1.0 In-Reply-To: <20180110231235.a3ts353lzajqywb5@pd.tnic> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Return-Path: On 11.01.2018 00:12, Borislav Petkov wrote: > On Thu, Jan 11, 2018 at 12:06:49AM +0100, Gabriel C wrote: >> while doing some testings with a EPYC box I notice >> these strange messages when a Node is disabled. >> >> I really do think the message is confusing since >> we tell 'Node X: ... disabled' followed by a >> INFO on the edac driver which tells the driver will not load. > > And that is confusing because? Beacuse we see the following: [ 4.694948] EDAC amd64: Node 6: DRAM ECC disabled. [ 4.694949] EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load. Either enable ECC checking or force module loading by setting 'ecc_enable_override'. (Note that use of the override may cause unknown side effects.) The first one tells the Node is disabled the second is a KERN INFO message telling the *module* will not load. Telling then *module* will not load for 'this Node' should be clear for everone. Don't get me wrong for me is clear what this means , I don't need the second message at all but I have here folks didn't understand wth that means. > >> Also even worse , we suggest to use ecc_enable_override then, >> which can cause wrose things.. We really should not suggest >> something like this by default. > > That is an remnant from the old times. Family 0x17 and newer doesn't > allow that anymore. > So do we need an < fam17h check for that message then ?