Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1758164Ab3HNX4t (ORCPT ); Wed, 14 Aug 2013 19:56:49 -0400 Received: from mailout4.w2.samsung.com ([211.189.100.14]:19060 "EHLO usmailout4.samsung.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750929Ab3HNX4r (ORCPT ); Wed, 14 Aug 2013 19:56:47 -0400 X-AuditID: cbfec372-b7f046d000001821-3c-520c193eae07 Date: Wed, 14 Aug 2013 20:56:38 -0300 From: Mauro Carvalho Chehab To: "Naveen N. Rao" Cc: Borislav Petkov , tony.luck@intel.com, bhelgaas@google.com, rostedt@goodmis.org, rjw@sisk.pl, lance.ortiz@hp.com, linux-pci@vger.kernel.org, linux-acpi@vger.kernel.org, linux-kernel@vger.kernel.org, Aristeu Rozanski Filho Subject: Re: [PATCH 3/3] mce: acpi/apei: trace: Enable ghes memory error trace event Message-id: <20130814205638.1ebcfd3e@concha.lan> In-reply-to: <520A6A30.1030406@linux.vnet.ibm.com> References: <1375986471-27113-1-git-send-email-naveen.n.rao@linux.vnet.ibm.com> <1375986471-27113-4-git-send-email-naveen.n.rao@linux.vnet.ibm.com> <20130808163822.67e0828a@samsung.com> <20130810180322.GC4155@pd.tnic> <20130812083355.47c1bae8@samsung.com> <5208D80D.5030206@linux.vnet.ibm.com> <20130812114404.3bd64fa0@samsung.com> <520A1B5E.8040105@linux.vnet.ibm.com> <20130813094147.062317f8@concha.lan> <520A6A30.1030406@linux.vnet.ibm.com> X-Mailer: Claws Mail 3.9.2 (GTK+ 2.24.19; x86_64-redhat-linux-gnu) MIME-version: 1.0 Content-type: text/plain; charset=US-ASCII Content-transfer-encoding: 7bit X-Brightmail-Tracker: H4sIAAAAAAAAA+NgFnrKLMWRmVeSWpSXmKPExsVy+t/hEF07SZ4gg4075C3aTvxms1jSlGHx ecM/NosPfdeYLJbv62e0uLxrDpvF2XnH2Szutzxlt+hf2Mtksa/jAZPFmwv3WBy4Pb639rF4 tOy7xe6xYFOpx65tO5k8Fu95yeTx4NBmFo/3+66yeTxa3MLo8XmTXABnFJdNSmpOZllqkb5d AlfGxVff2QvaBCu2zLnN2sC4kbeLkZNDQsBE4taufhYIW0ziwr31bF2MXBxCAksYJS7dnMMI 4TQwSWy/O4sNpIpFQFVieds9sA42ASOJV40trCC2iICpxJEV15lAGpgFupkknv+7CdYgLBAm sWdPMxOIzStgILHs9wUwmxOo+c3ZuVAbjjFLfNnQCDSJA+gOJ4mtU30h6gUlfkyGWMYsoCWx eVsTK4QtL7F5zVvmCYwCs5CUzUJSNgtJ2QJG5lWMoqXFyQXFSem5hnrFibnFpXnpesn5uZsY IVFTtIPx2QarQ4wCHIxKPLwRbdxBQqyJZcWVuYcYJTiYlUR4z3QAhXhTEiurUovy44tKc1KL DzEycXBKNTB2zr4mV7I0d8eufMswlrutWWmK5tKHI45Ft/9Sv7o/6U7S4nt6ljf3aT6VMtvz t0z+87epN2J6oq+5iq40rX3zcuq21W/8ivjZ76596Ha6drpnbu6yifWfqpbe1vh6qCXl6Amv SG7WxxP+GS3M+MV794TSVZ51ol9+6PlHzDpZ6vKkYs96r2/hSizFGYmGWsxFxYkAT2c94HgC AAA= Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2372 Lines: 60 Em Tue, 13 Aug 2013 22:47:36 +0530 "Naveen N. Rao" escreveu: > On 08/13/2013 06:11 PM, Mauro Carvalho Chehab wrote: > > Em Tue, 13 Aug 2013 17:11:18 +0530 > > "Naveen N. Rao" escreveu: > > > >> On 08/12/2013 08:14 PM, Mauro Carvalho Chehab wrote: > >>>> But, this only seems to expose the APEI data as a string > >>>> and doesn't look to really make all the fields available to user-space > >>>> in a raw manner. Not sure how well this can be utilised by a user-space > >>>> tool. Do you have suggestions on how we can do this? > >>> > >>> There's already an userspace tool that handes it: > >>> https://git.fedorahosted.org/cgit/rasdaemon.git/ > >>> > >>> What is missing there on the current version is the bits that would allow > >>> to translate from APEI way to report an error (memory node, card, module, > >>> bank, device) into a DIMM label[1]. > >> > >> If I'm reading this right, all APEI data seems to be squashed into a > >> string in mc_event. > > > > Yes. We had lots of discussion about how to map memory errors over the > > last couple years. Basically, it was decided that the information that > > could be decoded into a DIMM to be mapped as integers, and all other > > driver-specific data to be added as strings. > > > > On the tests I did, different machines/vendors fill the APEI data on > > a different way, with makes harder to associate them to a DIMM. > > Ok, so it looks like ghes_edac isn't quite useful yet. > > In the meantime, like Boris suggests, I think we can have a different > trace event for raw APEI reports - userspace can use it as it pleases. "In the meantime" is something that worries me the most. Kernel APIs should be stable. We cannot randomly change it on each new kernel version. Better to spend a little more time discussing than implementing a new trace that will be removed on a near future. > > Once ghes_edac gets better, users can decide whether they want raw APEI > reports or the EDAC-processed version and choose one or the other trace > event. > > Regards, > Naveen > -- Cheers, Mauro -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/