Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754768AbdIHG0x (ORCPT ); Fri, 8 Sep 2017 02:26:53 -0400 Received: from Galois.linutronix.de ([146.0.238.70]:53657 "EHLO Galois.linutronix.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754560AbdIHG0u (ORCPT ); Fri, 8 Sep 2017 02:26:50 -0400 Date: Fri, 8 Sep 2017 08:26:44 +0200 (CEST) From: Thomas Gleixner To: Markus Trippelsdorf cc: Peter Zijlstra , LKML , Ingo Molnar , Andy Lutomirski , Borislav Petkov Subject: Re: Current mainline git (24e700e291d52bd2) hangs when building e.g. perf In-Reply-To: <20170908053534.GA276@x4> Message-ID: References: <20170905072738.GA277@x4> <20170905085350.cgi7shvnillbikow@hirez.programming.kicks-ass.net> <20170905095547.GA286@x4> <20170906131504.GA282@x4> <20170907062845.GA280@x4> <20170908053534.GA276@x4> User-Agent: Alpine 2.20 (DEB 67 2015-01-07) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1307 Lines: 33 On Fri, 8 Sep 2017, Markus Trippelsdorf wrote: CC+ Borislav. He might have access to such a beast > On 2017.09.07 at 08:28 +0200, Markus Trippelsdorf wrote: > > On 2017.09.06 at 15:15 +0200, Markus Trippelsdorf wrote: > > > On 2017.09.06 at 14:52 +0200, Thomas Gleixner wrote: > > > > On Tue, 5 Sep 2017, Markus Trippelsdorf wrote: > > > > > On 2017.09.05 at 10:53 +0200, Peter Zijlstra wrote: > > > > > > > Any ideas on how to debug this further? > > > > > > > > > > > > So you have a (real) serial line on that box? > > > > > > > > > > Sadly, no. But hopefully somebody else (with a proper kernel debugging > > > > > setup) will reproduce the issue soon. > > > > > > > > Does the machine respond to ping or is it entirely dead? > > > > > > It is entirely dead and doesn't respond to ping. > > > > The bug even kills the host (running 4.13) when running 24e700e2 in qemu > > (kvm) and compiling stuff in parallel in the guest. > > I see an RCU CPU stall in dmesg (on the host), but unfortunately cannot > > save it, because nothing gets written to disk after the stall. > > Connecting to qemu via gdb also doesn't work. > > My guess would be a bug in a low level function (asm) that only hits AMD > machines. I'm running an old Phenom II X4 processor. My config is > attached. > > -- > Markus >