Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1761455AbZLJSpH (ORCPT ); Thu, 10 Dec 2009 13:45:07 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1761298AbZLJSpD (ORCPT ); Thu, 10 Dec 2009 13:45:03 -0500 Received: from 0122700014.0.fullrate.dk ([95.166.99.235]:41003 "EHLO kernel.dk" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1761158AbZLJSpB (ORCPT ); Thu, 10 Dec 2009 13:45:01 -0500 Date: Thu, 10 Dec 2009 19:45:07 +0100 From: Jens Axboe To: Yinghai Lu Cc: Ingo Molnar , Thomas Gleixner , "H. Peter Anvin" , "linux-kernel@vger.kernel.org" Subject: Re: Bisected regression (Was Re: current -git fails to boot on nehalem-ex) Message-ID: <20091210184506.GP8742@kernel.dk> References: <20091210091040.GG8742@kernel.dk> <20091210103945.GI8742@kernel.dk> <20091210160640.GB27794@elte.hu> <67EA4830-231F-45C9-A1A6-CE1C6CD9493D@kernel.org> <20091210180727.GK8742@kernel.dk> <4B213F1F.2020305@kernel.org> <4B2140F8.3020201@kernel.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <4B2140F8.3020201@kernel.org> Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 8972 Lines: 156 On Thu, Dec 10 2009, Yinghai Lu wrote: > Yinghai Lu wrote: > > Jens Axboe wrote: > >> On Thu, Dec 10 2009, Yinghai wrote: > >>> > >>> > >>> On Dec 10, 2009, at 8:06 AM, Ingo Molnar wrote: > >>> > >>>> * Jens Axboe wrote: > >>>> > >>>>> On Thu, Dec 10 2009, Jens Axboe wrote: > >>>>>> Hi, > >>>>>> > >>>>>> As written in the subject, it just hard hangs before writing > >>>>>> anything on > >>>>>> the console. With earlyprintk, I captured the failure, see below. > >>>>>> > >>>>>> I'll try and bisect this, but it takes some time (since the bios and > >>>>>> post process takes forever). I just updated the firmware on the > >>>>>> box as > >>>>>> well, but it did boot 2.6.32 (and RHEL5 boots fine too). Of course > >>>>>> that > >>>>>> doesn't rule out a BIOS bug. > >>>>> Results are persistent, git bisect points to: > >>>>> > >>>>> commit b24c2a925a9837cccf54d50aeac22ba0cbc15455 > >>>>> Author: Yinghai Lu > >>>>> Date: Tue Nov 24 02:48:18 2009 -0800 > >>>>> > >>>>> x86: Move find_smp_config() earlier and avoid bootmem usage > >>>>> > >>>>> which appears consistent with the panic(). Reverting that does indeed > >>>>> make current -git boot properly. > >>>> Thanks. Yinghai, do you have any ideas, or should we revert it? > >>>> > >>> Let find the root cause > >>> > >>> Jen > >> Jens :-) > >> > >>> Can you boot with earlyconsole or earlyprintk and debug? > >> The original report already included oops output from earlyprintk, it's > >> below as well. > >> > >> kernel /vmlinuz-2.6 root=/dev/sda3 ro console=ttyS0,115200 console=tty0 earlyp > >> rintk=serial,ttyS0,115200 > >> [Linux-bzImage, setup=0x3200, size=0x298f90] > >> > >> > >> [ 0.000000] Linux version 2.6.32 (axboe@nehalem) (gcc version 4.3.2 (Debian 4.3.2-1.1) ) #35 SMP Thu Dec 10 09:47:51 CET 2009 * > >> [ 0.000000] Command line: root=/dev/sda3 ro console=ttyS0,115200 console=tty0 earlyprintk=serial,ttyS0,115200 * > >> [ 0.000000] BIOS-provided physical RAM map: * > >> [ 0.000000] BIOS-e820: 0000000000000000 - 000000000009a400 (usable) * > >> [ 0.000000] BIOS-e820: 000000000009a400 - 00000000000a0000 (reserved) * > >> [ 0.000000] BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved)*** > >> [ 0.000000] BIOS-e820: 0000000000100000 - 0000000078c75000 (usable) > >> [ 0.000000] BIOS-e820: 0000000078c75000 - 0000000078e89000 (ACPI NVS) > >> [ 0.000000] BIOS-e820: 0000000078e89000 - 000000007924e000 (ACPI data) > >> [ 0.000000] BIOS-e820: 000000007924e000 - 00000000792c2000 (reserved) > >> [ 0.000000] BIOS-e820: 00000000792c2000 - 00000000792d2000 (ACPI data) > >> [ 0.000000] BIOS-e820: 00000000792d2000 - 00000000792e7000 (reserved) > >> [ 0.000000] BIOS-e820: 00000000792e7000 - 0000000079301000 (ACPI data) > >> [ 0.000000] BIOS-e820: 0000000079301000 - 0000000079303000 (reserved) > >> [ 0.000000] BIOS-e820: 0000000079303000 - 0000000079305000 (ACPI data) > >> [ 0.000000] BIOS-e820: 0000000079305000 - 0000000079310000 (reserved) > >> [ 0.000000] BIOS-e820: 0000000079310000 - 0000000079314000 (ACPI data) > >> [ 0.000000] BIOS-e820: 0000000079314000 - 0000000079319000 (reserved) > >> [ 0.000000] BIOS-e820: 0000000079319000 - 0000000079336000 (ACPI data) > >> [ 0.000000] BIOS-e820: 0000000079336000 - 0000000079358000 (reserved) > >> [ 0.000000] BIOS-e820: 0000000079358000 - 0000000079388000 (ACPI data) > >> [ 0.000000] BIOS-e820: 0000000079388000 - 00000000793c9000 (reserved) > >> [ 0.000000] BIOS-e820: 00000000793c9000 - 000000007968f000 (ACPI data) > >> [ 0.000000] BIOS-e820: 000000007968f000 - 00000000796bb000 (reserved) > >> [ 0.000000] BIOS-e820: 00000000796bb000 - 00000000799d8000 (ACPI data) > >> [ 0.000000] BIOS-e820: 00000000799d8000 - 0000000079bd8000 (ACPI NVS) > >> [ 0.000000] BIOS-e820: 0000000079bd8000 - 0000000079dc7000 (ACPI data) > >> [ 0.000000] BIOS-e820: 0000000079dc7000 - 0000000079dcb000 (reserved) > >> [ 0.000000] BIOS-e820: 0000000079dcb000 - 0000000079e1c000 (ACPI data) > >> [ 0.000000] BIOS-e820: 0000000079e1c000 - 0000000079e87000 (reserved) > >> [ 0.000000] BIOS-e820: 0000000079e87000 - 000000007bd5f000 (ACPI data) > >> [ 0.000000] BIOS-e820: 000000007bd5f000 - 000000007be4f000 (reserved) > >> [ 0.000000] BIOS-e820: 000000007be4f000 - 000000007bf87000 (ACPI data) > >> [ 0.000000] BIOS-e820: 000000007bf87000 - 000000007bfcf000 (ACPI NVS) > >> [ 0.000000] BIOS-e820: 000000007bfcf000 - 000000007bfff000 (ACPI data) > >> [ 0.000000] BIOS-e820: 000000007bfff000 - 0000000090000000 (reserved) > >> [ 0.000000] BIOS-e820: 00000000fc000000 - 00000000fd000000 (reserved) > >> [ 0.000000] BIOS-e820: 00000000fed1c000 - 00000000fed20000 (reserved) > >> [ 0.000000] BIOS-e820: 00000000ff000000 - 0000000100000000 (reserved) > >> [ 0.000000] BIOS-e820: 0000000100000000 - 0000001080000000 (usable) > >> [ 0.000000] bootconsole [earlyser0] enabled > >> [ 0.000000] NX (Execute Disable) protection: active > >> [ 0.000000] DMI 2.5 present. > >> [ 0.000000] last_pfn = 0x1080000 max_arch_pfn = 0x400000000 > >> [ 0.000000] x86 PAT enabled: cpu 0, old 0x7040600070406, new 0x7010600070106 > >> [ 0.000000] last_pfn = 0x78c75 max_arch_pfn = 0x400000000 > >> [ 0.000000] init_memory_mapping: 0000000000000000-0000000078c75000 > >> [ 0.000000] init_memory_mapping: 0000000100000000-0000001080000000 > >> [ 0.000000] ACPI: RSDP 00000000000f0410 00024 (v02 QUANTA) > >> [ 0.000000] ACPI: XSDT 000000007bffe120 000BC (v01 QUANTA QSSC-S4R 00000000 01000013) > >> [ 0.000000] ACPI: FACP 000000007bffd000 000F4 (v04 QUANTA QSSC-S4R 00000000 MSFT 0100000D) > >> [ 0.000000] ACPI: DSDT 000000007bfe3000 19BAD (v02 QUANTA QSSC-S4R 00000003 MSFT 0100000D) > >> [ 0.000000] ACPI: FACS 000000007bf87000 00040 > >> [ 0.000000] ACPI: APIC 000000007bfe2000 003E4 (v02 QUANTA QSSC-S4R 00000000 MSFT 0100000D) > >> [ 0.000000] ACPI: MSCT 000000007bfe1000 00090 (v01 QUANTA QSSC-S4R 00000001 MSFT 0100000D) > >> [ 0.000000] ACPI: MCFG 000000007bfe0000 0003C (v01 QUANTA QSSC-S4R 00000001 MSFT 0100000D) > >> [ 0.000000] ACPI: HPET 000000007bfdf000 00038 (v01 QUANTA QSSC-S4R 00000001 MSFT 0100000D) > >> [ 0.000000] ACPI: SLIT 000000007bfde000 0003C (v01 QUANTA QSSC-S4R 00000001 MSFT 0100000D) > >> [ 0.000000] ACPI: SRAT 000000007bfdd000 00930 (v02 QUANTA QSSC-S4R 00000001 MSFT 0100000D) > >> [ 0.000000] ACPI: SPCR 000000007bfdc000 00050 (v01 QUANTA QSSC-S4R 00000000 MSFT 0100000D) > >> [ 0.000000] ACPI: WDDT 000000007bfdb000 00040 (v01 QUANTA QSSC-S4R 00000000 MSFT 0100000D) > >> [ 0.000000] ACPI: SSDT 000000007bf4a000 3CFA4 (v02 QUANTA QSSC-S4R 00004000 INTL 20061109) > >> [ 0.000000] ACPI: SSDT 000000007bfda000 00174 (v02 QUANTA QSSC-S4R 00004000 INTL 20061109) > >> [ 0.000000] ACPI: PMCT 000000007bfd9000 00064 (v01 QUANTA QSSC-S4R 00000000 MSFT 0100000D) > >> [ 0.000000] ACPI: MIGT 000000007bfd8000 00040 (v01 QUANTA QSSC-S4R 00000000 MSFT 0100000D) > >> [ 0.000000] ACPI: TCPA 000000007bfd5000 00032 (v00 QUANTA QSSC-S4R 00000000 00000000) > >> [ 0.000000] ACPI: HEST 000000007bfd4000 005D0 (v01 QUANTA QSSC-S4R 00000001 INTL 00000001) > >> [ 0.000000] ACPI: BERT 000000007bfd3000 00030 (v01 QUANTA QSSC-S4R 00000001 INTL 00000001) > >> [ 0.000000] ACPI: ERST 000000007bfd2000 00230 (v01 QUANTA QSSC-S4R 00000001 INTL 00000001) > >> [ 0.000000] ACPI: EINJ 000000007bfd1000 00130 (v01 QUANTA QSSC-S4R 00000001 INTL 00000001) > >> [ 0.000000] ACPI: DMAR 000000007bfd0000 00350 (v01 QUANTA QSSC-S4R 00000001 MSFT 0100000D) > >> [ 0.000000] found SMP MP-table at [ffff8800000fdd80] fdd80 > >> [ 0.000000] Kernel panic - not syncing: Overlapping early reservations 12-f011 MP-table mpc to 0-fff BIOS data page > > > > mptable mpc is [12 - f011] > > > > what a BIOS ! > > > > can you check if it works with 2.6.32 + "acpi=off"? I'll try that. > please check if this one could workaround it. > > diff --git a/arch/x86/kernel/e820.c b/arch/x86/kernel/e820.c > index d17d482..f60acec 100644 > --- a/arch/x86/kernel/e820.c > +++ b/arch/x86/kernel/e820.c > @@ -732,7 +732,7 @@ struct early_res { > char overlap_ok; > }; > static struct early_res early_res[MAX_EARLY_RES] __initdata = { > - { 0, PAGE_SIZE, "BIOS data page" }, /* BIOS data page */ > + { 0, PAGE_SIZE, "BIOS data page", 1 }, /* BIOS data page */ > {} > }; And then this. Results in 10-15min. -- Jens Axboe -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/