Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1764859AbYBLVLU (ORCPT ); Tue, 12 Feb 2008 16:11:20 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1762033AbYBLVLG (ORCPT ); Tue, 12 Feb 2008 16:11:06 -0500 Received: from ra.tuxdriver.com ([70.61.120.52]:3972 "EHLO ra.tuxdriver.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753242AbYBLVLE (ORCPT ); Tue, 12 Feb 2008 16:11:04 -0500 Date: Tue, 12 Feb 2008 16:08:16 -0500 From: Neil Horman To: Vivek Goyal Cc: kexec@lists.infradead.org, linux-kernel@vger.kernel.org, mingo@redhat.com, "Eric W. Biederman" , "H. Peter Anvin" , Ingo Molnar , tglx@linutronix.de Subject: Re: [PATCH], issue EOI to APIC prior to calling crash_kexec in die_nmi path Message-ID: <20080212210816.GA1862@hmsreliant.think-freely.org> References: <20080206192555.GA24910@hmsendeavour.rdu.redhat.com> <20080206220001.GA15155@elte.hu> <20080206224805.GD11886@redhat.com> <47AA3B16.7000507@zytor.com> <20080206233657.GB12393@elte.hu> <20080207121719.GA29279@hmsreliant.think-freely.org> <20080207122404.GA8195@elte.hu> <20080208161422.GA32204@hmsreliant.think-freely.org> <20080208164544.GA23772@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20080208164544.GA23772@redhat.com> User-Agent: Mutt/1.5.17 (2007-11-01) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1635 Lines: 51 > > Neil, is it possible to do some serial console debugging to find out > where exactly we are hanging? Beats me, what's that operation which can > not be executed while being in NMI handler and makes system to hang. I am > also curious to know if it is nested NMI case. > > Thanks > Vivek > Hey- Some intermediate results: I've instrumented head.S in the kernel with the following code: #define SEROUT(z) \ mov $0x3F8,%dx;\ movb z,%al;\ outb %dx And peppered different ascii characters throughout the startup code from startup_32 to right before the jump to start_kernel. When I panic the system via an: echo c > /proc/sysrq_trigger I see an appropriate sequence of characters on the serial console When I panic the box by forcing an NMI watchdog timeout however, I see nothing. The machine will either hang, or reset into the bios. I think this is reasonably conclusive in its indication that we're not getting into the second kernel when this problem occurs. Next I'll instrument the purgatory code in a simmilar way. Regards Neil > _______________________________________________ > kexec mailing list > kexec@lists.infradead.org > http://lists.infradead.org/mailman/listinfo/kexec -- /**************************************************** * Neil Horman * Software Engineer, Red Hat ****************************************************/ -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/