From: Randy Dunlap
Organization: Oracle Linux Engineering
Date: Tue, 31 Mar 2009 13:45:35 -0700
To: Linux Kernel Mailing List
CC: "Rafael J. Wysocki", Rusty Russell, Bjorn Helgaas, andreas.herrmann3@amd.com
Subject: 2.6.29 boot hang
Message-ID: <49D280EF.1080507@oracle.com>

(If this is old news, would someone please tell me the status of this
problem? or if there is a known CPU issue, tell me. Thanks.)

On a 4-proc x86_64 (HP BladeCenter, AMD CPUs) system, booting 2.6.29
(or earlier, back to 2.6.28-6921-g873392c) hangs during boot.

git bisect says:

873392ca514f87eae39f53b6944caf85b1a047cb is first bad commit
commit 873392ca514f87eae39f53b6944caf85b1a047cb
Author: Rusty Russell
Date:   Wed Dec 31 23:54:56 2008 +1030

    PCI: work_on_cpu: use in drivers/pci/pci-driver.c

~~~~~~~~~~~~~~~~~~~~~

The last messages are usually like this:

Initializing CPU#1
Calibrating delay using timer specific routine.. 5224.09 BogoMIPS (lpj=10448183)
CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line)
CPU: L2 Cache: 1024K (64 bytes/line)
CPU 1/0x2 -> Node 1
CPU: Physical Processor ID: 1
CPU: Processor Core ID: 0
CPU1: Dual-Core AMD Opteron(tm) Processor 8218 stepping 02
lockdep: fixing up alternatives.
Booting processor 2 APIC 0x1 ip 0x6000
Initializing CPU#2
Calibrating delay using timer specific routine.. 5224.12 BogoMIPS (lpj=10448242)
CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line)
CPU: L2 Cache: 1024K (64 bytes/line)
CPU 2/0x1 -> Node 0
CPU: Physical Processor ID: 0
CPU: Processor Core ID: 1
CPU2: Dual-Core AMD Opteron(tm) Processor 8218 stepping 02
lockdep: fixing up alternatives.
Booting processor 3 APIC 0x3 ip 0x6000
Initializing CPU#3
Calibrating delay using timer specific routine.. 5224.11 BogoMIPS (lpj=10448230)
CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line)

If I change CONFIG_MICROCODE_AMD=y to CONFIG_MICROCODE_AMD=n & rebuild,
the kernel boots successfully.

There are never any kernel oops/panic/error messages. I suppose that
the CPU microcode update (if there actually is one) causes some kind of
problem that prevents messages.

Suggestions, fixes, etc.??

Thanks,
--
~Randy