Date: Mon, 21 May 2018 17:06:14 -0700
From: "Luck, Tony"
To: Jeffrin Thalakkottoor
Cc: Borislav Petkov, Thomas Gleixner, mingo@redhat.com, hpa@zytor.com,
    x86@kernel.org, linux-edac@vger.kernel.org, lkml
Subject: Re: PROBLEM: mce: [Hardware Error] from dmesg -l emerg
Message-ID: <20180522000614.GA21542@agluck-desk>
References: <20180514162752.GG23049@pd.tnic> <20180520204032.GA19845@pd.tnic>
    <20180521165803.GA15717@agluck-desk> <20180521205751.GA19282@agluck-desk>
X-Mailing-List: linux-kernel@vger.kernel.org

On Tue, May 22, 2018 at 02:43:37AM +0530, Jeffrin Thalakkottoor wrote:
> mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 5: ee0000000040110b
> mce: [Hardware Error]: TSC 0 ADDR 160000080 MISC 5040008086
> mce: [Hardware Error]: PROCESSOR 0:306d4 TIME 1526932210 SOCKET 0 APIC
> 0 microcode 2a

The problem is that "mcelog --ascii" is expecting the first line to
look like:

	CPU 0: Machine Check Exception: 0 Bank 5: ee0000000040110b

This seems to have been broken by commit:

	cd9c57cad3fe ("x86/MCE: Dump MCE to dmesg if no consumers")

The relevant part is this ... where we now conditionally include the
word "Exception".

-static void print_mce(struct mce *m)
+static void __print_mce(struct mce *m)
 {
-	int ret = 0;
-
-	pr_emerg(HW_ERR "CPU %d: Machine Check Exception: %Lx Bank %d: %016Lx\n",
-	       m->extcpu, m->mcgstatus, m->bank, m->status);
+	pr_emerg(HW_ERR "CPU %d: Machine Check%s: %Lx Bank %d: %016Lx\n",
+		 m->extcpu,
+		 (m->mcgstatus & MCG_STATUS_MCIP ? " Exception" : ""),
+		 m->mcgstatus, m->bank, m->status);

While this is a bit easier to read, no new information is included,
as we do print the value of m->mcgstatus.

Sadly, the change was made back in v4.10 ... so reverting it won't help
all the people running kernels built in the last fifteen months :-(

I'll see if I can get Andi to take a patch for mcelog to accept the line
with or without the " Exception".

Oh ... one more thing. Did your e-mail client line wrap that last line?

> mce: [Hardware Error]: PROCESSOR 0:306d4 TIME 1526932210 SOCKET 0 APIC
> 0 microcode 2a

That should all be on one line.

-Tony
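
For illustration, accepting the first line with or without " Exception" could look roughly like the sketch below. This is not mcelog's actual parser: struct mce_header and parse_mce_header() are hypothetical names invented for this example, and the sketch assumes the line layout produced by the pr_emerg() format string quoted above.

```c
#include <stdio.h>
#include <string.h>

/* Hypothetical container for the fields on the first MCE line. */
struct mce_header {
	int cpu;
	unsigned long long mcgstatus;
	int bank;
	unsigned long long status;
};

/*
 * Parse "CPU N: Machine Check[ Exception]: MCG Bank B: STATUS".
 * Anything before "CPU " (e.g. "mce: [Hardware Error]: ") is skipped.
 * Returns 1 on success, 0 if the line does not match.
 */
int parse_mce_header(const char *line, struct mce_header *h)
{
	const char *p = strstr(line, "CPU ");
	int n = 0;

	if (!p)
		return 0;

	/* %n is only stored if the literal "Machine Check" matched too. */
	if (sscanf(p, "CPU %d: Machine Check%n", &h->cpu, &n) != 1 || n == 0)
		return 0;
	p += n;

	/* Kernels since v4.10 print " Exception" only when MCIP is set. */
	if (strncmp(p, " Exception", 10) == 0)
		p += 10;

	return sscanf(p, ": %llx Bank %d: %llx",
		      &h->mcgstatus, &h->bank, &h->status) == 3;
}
```

Matching the optional word explicitly keeps a single code path for both the old and the new kernel output, so the remaining fields are parsed identically either way.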