Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753556AbeAKDKL (ORCPT + 1 other); Wed, 10 Jan 2018 22:10:11 -0500 Received: from mail.cn.fujitsu.com ([183.91.158.132]:50911 "EHLO heian.cn.fujitsu.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1753131AbeAKDKK (ORCPT ); Wed, 10 Jan 2018 22:10:10 -0500 X-IronPort-AV: E=Sophos;i="5.43,368,1503331200"; d="scan'208";a="35156392" Subject: Re: lapic-related boot crash in 4.15-rc1 To: Thomas Gleixner , Meelis Roos CC: Ingo Molnar , Linux Kernel list , , Ville Syrjala References: From: Dou Liyang Message-ID: Date: Thu, 11 Jan 2018 11:10:03 +0800 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.5.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset="UTF-8"; format=flowed Content-Language: en-US Content-Transfer-Encoding: 8bit X-Originating-IP: [10.167.226.106] X-yoursite-MailScanner-ID: 6FB1F48AEA0A.AC646 X-yoursite-MailScanner: Found to be clean X-yoursite-MailScanner-From: douly.fnst@cn.fujitsu.com Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Return-Path: Hi Thomas, Meelis At 01/11/2018 02:34 AM, Thomas Gleixner wrote: > On Wed, 10 Jan 2018, Meelis Roos wrote: >> On 3 of my test computers, boot hangs with 4.15 git kernels. So far I >> have traced it down to 4.14.0 being good and 4.15-rc1 being bad (bisect >> is slow because the computers are somwehat remote). Also because of >> trying to find when it started, I have not tries newer than rc5 >> kernels. >> >> The hang happens after Performance Events feature table has been printed >> but before APIC initialization messages normally appear ("Enabling APIC >> mode: Flat. Using 0 I/O APICs" or something like that). >> >> It happens on at least 3 distinct computers: >> * AMD Athlon with AMD 750 chipset >> * P3 1 GHz with i815 chipset >> * P4 1.7 GHz with SiS chipset >> >> They all have have "lapic" parameter because they were telling me that >> lapic has been disabled by BIOS and I re-enabled it with the parameter >> to get apic timer interrupts. This worked up until 4.14. >> >> None of the computers have IO-APICs. > > Oh. Then it possibly falls into Dou's realm. > > Dou? > Ah, This problem looks like what Ville reported[1]. Revert the commit b371ae0d4a19 ("x86/apic: Remove init_bsp_APIC()") may make it work again. But, I don't know the true reason. In Linux v4.15-rc1+, apic->setup_apic_routing() has been called earlier, So, I guess kernel may hang in some early initcall fuctions. we can add "initcall_debug" option in the kernel command line for tracing initcalls as they are executed. [1] https://marc.info/?l=linux-kernel&m=151188084018443&w=2 Thanks, dou. --------------------------->8----------------------------------- From: Ville Syrjälä This reverts commit b371ae0d4a194b178817b0edfb6a7395c7aec37a. Causes my P3 UP machine to hang at boot with "lapic". Cc: Dou Liyang Cc: Thomas Gleixner Cc: yinghai@kernel.org Cc: bhe@redhat.com Cc: Ingo Molnar Cc: "H. Peter Anvin" Signed-off-by: Ville Syrjälä --- arch/x86/include/asm/apic.h | 1 + arch/x86/kernel/apic/apic.c | 49 +++++++++++++++++++++++++++++++++++++++++++++ arch/x86/kernel/irqinit.c | 3 +++ 3 files changed, 53 insertions(+) diff --git a/arch/x86/include/asm/apic.h b/arch/x86/include/asm/apic.h index a9e57f08bfa6..98722773391d 100644 --- a/arch/x86/include/asm/apic.h +++ b/arch/x86/include/asm/apic.h @@ -136,6 +136,7 @@ extern void disconnect_bsp_APIC(int virt_wire_setup); extern void disable_local_APIC(void); extern void lapic_shutdown(void); extern void sync_Arb_IDs(void); +extern void init_bsp_APIC(void); extern void apic_intr_mode_init(void); extern void setup_local_APIC(void); extern void init_apic_mappings(void); diff --git a/arch/x86/kernel/apic/apic.c b/arch/x86/kernel/apic/apic.c index 6e272f3ea984..cec9aaea7f9d 100644 --- a/arch/x86/kernel/apic/apic.c +++ b/arch/x86/kernel/apic/apic.c @@ -1286,6 +1286,55 @@ static int __init apic_intr_mode_select(void) return APIC_SYMMETRIC_IO; } +/* + * An initial setup of the virtual wire mode. + */ +void __init init_bsp_APIC(void) +{ + unsigned int value; + + /* + * Don't do the setup now if we have a SMP BIOS as the + * through-I/O-APIC virtual wire mode might be active. + */ + if (smp_found_config || !boot_cpu_has(X86_FEATURE_APIC)) + return; + + /* + * Do not trust the local APIC being empty at bootup. + */ + clear_local_APIC(); + + /* + * Enable APIC. + */ + value = apic_read(APIC_SPIV); + value &= ~APIC_VECTOR_MASK; + value |= APIC_SPIV_APIC_ENABLED; + +#ifdef CONFIG_X86_32 + /* This bit is reserved on P4/Xeon and should be cleared */ + if ((boot_cpu_data.x86_vendor == X86_VENDOR_INTEL) && + (boot_cpu_data.x86 == 15)) + value &= ~APIC_SPIV_FOCUS_DISABLED; + else +#endif + value |= APIC_SPIV_FOCUS_DISABLED; + value |= SPURIOUS_APIC_VECTOR; + apic_write(APIC_SPIV, value); + + /* + * Set up the virtual wire mode. + */ + apic_write(APIC_LVT0, APIC_DM_EXTINT); + value = APIC_DM_NMI; + if (!lapic_is_integrated()) /* 82489DX */ + value |= APIC_LVT_LEVEL_TRIGGER; + if (apic_extnmi == APIC_EXTNMI_NONE) + value |= APIC_LVT_MASKED; + apic_write(APIC_LVT1, value); +} + /* Init the interrupt delivery mode for the BSP */ void __init apic_intr_mode_init(void) { diff --git a/arch/x86/kernel/irqinit.c b/arch/x86/kernel/irqinit.c index 8da3e909e967..a539410c4ea9 100644 --- a/arch/x86/kernel/irqinit.c +++ b/arch/x86/kernel/irqinit.c @@ -61,6 +61,9 @@ void __init init_ISA_irqs(void) struct irq_chip *chip = legacy_pic->chip; int i; +#if defined(CONFIG_X86_64) || defined(CONFIG_X86_LOCAL_APIC) + init_bsp_APIC(); +#endif legacy_pic->init(0); for (i = 0; i < nr_legacy_irqs(); i++) -- > Thanks, > > tglx > > >