Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752790AbcLZSYk (ORCPT ); Mon, 26 Dec 2016 13:24:40 -0500 Received: from Galois.linutronix.de ([146.0.238.70]:39739 "EHLO Galois.linutronix.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750721AbcLZSYj (ORCPT ); Mon, 26 Dec 2016 13:24:39 -0500 Date: Mon, 26 Dec 2016 19:21:44 +0100 (CET) From: Thomas Gleixner To: Boris Ostrovsky cc: Markus Trippelsdorf , Linus Torvalds , LKML , Ingo Molnar , "H. Peter Anvin" , Sebastian Andrzej Siewior , Borislav Petkov Subject: Re: [GIT pull] smp/hotplug: Removal of notifiers In-Reply-To: <53e3b52b-f353-63c8-f96f-649d754596bc@oracle.com> Message-ID: References: <20161226074530.GA297@x4> <20161226110600.GB297@x4> <20161226154502.GA287@x4> <53e3b52b-f353-63c8-f96f-649d754596bc@oracle.com> User-Agent: Alpine 2.20 (DEB 67 2015-01-07) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 822 Lines: 23 On Mon, 26 Dec 2016, Boris Ostrovsky wrote: > On 12/26/2016 10:45 AM, Markus Trippelsdorf wrote: > > The comment in arch/x86/kernel/cpu/mcheck/mce_amd.c says: > > > > 1384 * mcheck_init_device should be inited before threshold_init_device to > > 1385 * initialize mce_device, otherwise a NULL ptr dereference will cause panic. That's not changed by this commit. > My nightly test hit this as well. AMD only, Intel passed. I haven't > verified whether commit that Markus implicated is the one that caused > this but it's the same BUG signature (but possibly slightly different stack) > > [ 1.554351] smpboot: CPU0: AMD Engineering Sample (family: 0x10, > model: 0x4, stepping: 0x1) > ... Is there anything interesting error message before the BUG hits? I'll try to reproduce on a AMD box tomorrow. Thanks, tglx