Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754911AbYLJOy1 (ORCPT ); Wed, 10 Dec 2008 09:54:27 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751448AbYLJOyS (ORCPT ); Wed, 10 Dec 2008 09:54:18 -0500 Received: from e4.ny.us.ibm.com ([32.97.182.144]:38705 "EHLO e4.ny.us.ibm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751278AbYLJOyR (ORCPT ); Wed, 10 Dec 2008 09:54:17 -0500 Date: Wed, 10 Dec 2008 06:54:14 -0800 From: "Paul E. McKenney" To: Kamalesh Babulal Cc: Stephen Rothwell , linux-next@vger.kernel.org, LKML , dm-devel@redhat.com, tglx@linutronix.de, mel@csn.ul.ie Subject: Re: [BUG] linux-next: 20081209 - kernel bug at __rcu_process_callbacks, while booting up Message-ID: <20081210145414.GA6945@linux.vnet.ibm.com> Reply-To: paulmck@linux.vnet.ibm.com References: <20081209185237.8bb715d7.sfr@canb.auug.org.au> <20081210115721.GB6107@linux.vnet.ibm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20081210115721.GB6107@linux.vnet.ibm.com> User-Agent: Mutt/1.5.15+20070412 (2007-04-11) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 10604 Lines: 212 On Wed, Dec 10, 2008 at 05:27:21PM +0530, Kamalesh Babulal wrote: > Hi, > > Kernel bug is hit while booting up the next-20081208/09 kernels over > the x86_64 box. The IP is pointing to 0x0 and its stuck at > __rcu_process_callbacks. Kernel config? Thanx, Paul > Activating logical volumes > 4 logical volume(s) in volume group "VolGroup00"BUG: unable to handle kernel NULL pointer dereference at 0000000000000000 > IP: [<0000000000000000>] 0x0 > PGD 3e8ad067 PUD 3e8ac067 PMD 0 > Oops: 0010 [#1] SMP > last sysfs file: /sys/block/dm-1/removable > CPU 3 > Modules linked in: > Pid: 0, comm: swapper Not tainted 2.6.28-rc7-next-20081209-autotest #1 > RIP: 0010:[<0000000000000000>] [<0000000000000000>] 0x0 > RSP: 0018:ffff88003fa73ef0 EFLAGS: 00010286 > RAX: ffff88003eae0500 RBX: ffff880001047040 RCX: ffffffff80268f66 > RDX: 0000000000000000 RSI: ffffe20000fab800 RDI: ffff88003eae0500 > RBP: 0000000000000000 R08: ffff880001047040 R09: ffffffff80853950 > R10: 0000000000000038 R11: ffffffff8032222d R12: 0000000000000005 > R13: 0000000000000038 R14: 0000000000000009 R15: 0000000000000018 > FS: 000000000066d870(0000) GS:ffff88003f803f00(0000) knlGS:0000000000000000 > CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b > CR2: 0000000000000000 CR3: 000000003e8a9000 CR4: 00000000000006e0 > DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 > DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 > Process swapper (pid: 0, threadinfo ffff88003fa64000, task ffff88003f9dde00) > Stack: > ffffffff80268f66 ffff88003fa73ef8 0000000000000100 0000000000000001 > ffffffff807400b8 0000000000000038 ffffffff80269001 0000000000000018 > ffffffff8023b891 ffffffff8020e79e 0000000000000046 ffff88003fa73f80 > Call Trace: > <0> [] __rcu_process_callbacks+0x14e/0x1c3 > [] rcu_process_callbacks+0x26/0x4a > [] __do_softirq+0x76/0x136 > [] profile_pc+0x21/0x5b > [] call_softirq+0x1c/0x28 > [] do_softirq+0x2c/0x6c > [] smp_apic_timer_interrupt+0x93/0xac > [] apic_timer_interrupt+0x13/0x20 > <0>Code: Bad RIP value. > RIP [<0000000000000000>] 0x0 > RSP > CR2: 0000000000000000 > ---[ end trace 69ecde41a682e571 ]--- > Kernel panic - not syncing: Fatal exception in interrupt > Pid: 0, comm: swapper Tainted: G D 2.6.28-rc7-next-20081209-autotest #1 > Call Trace: > Mounting root fi lesystem. > [] panic+0x86/0x144 > [] apic_timer_interrupt+0x13/0x20 > [] oops_end+0x61/0xad > [] oops_end+0xa0/0xad > [] do_page_fault+0x748/0x801 > [] page_fault+0x1f/0x30 > [] selinux_cred_free+0x0/0x14 > BUG: unable to handle kernel NULL pointer dereference at 0000000000000002 > IP: [] kmem_cache_alloc+0x6a/0x99 > PGD 3e8ad067 PUD 3e8ac067 PMD 0 > Oops: 0000 [#2] SMP > last sysfs file: /sys/block/ram7/removable > CPU 2 > Modules linked in: > Pid: 1, comm: init Tainted: G D 2.6.28-rc7-next-20081209-autotest #1 > RIP: 0010:[] [] kmem_cache_alloc+0x6a/0x99 > RSP: 0018:ffff88003f9d7db8 EFLAGS: 00010002 > RAX: 0000000000000000 RBX: 0000000000000246 RCX: 0000000000000001 > RDX: ffff880001037800 RSI: 0000000000000002 RDI: ffffffff80854f70 > RBP: 0000000000000020 R08: 0000000000000000 R09: ffffffff80859a80 > R10: 0000000000000001 R11: ffff88003f426018 R12: ffffffff80854f70 > R13: ffffffff80253ab1 R14: 0000000000000080 R15: ffffffff802b4141 > FS: 000000000066d870(0063) GS:ffff88003f803c80(0000) knlGS:0000000000000000 > CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b > CR2: 0000000000000002 CR3: 000000003e8a9000 CR4: 00000000000006e0 > DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 > DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 > Process init (pid: 1, threadinfo ffff88003f9d6000, task ffff88003f9d8000) > Stack: > 00000000ffffffff ffff88003f426000 ffffffff80859a80 ffffffff80859a80 > 0000000000000000 ffffffff80253ab1 00000000006f1db0 0000000000000000 > ffff88003eb6a2f8 ffff88003f9d7e48 0000000000000000 01ff88003e97c400 > Call Trace: > [] ? smp_call_function_many+0xdb/0x220 > [] ? do_writepages+0x23/0x30 > [] ? invalidate_bh_lru+0x0/0x42 > [] ? smp_call_function+0x20/0x24 > [] ? on_each_cpu+0x10/0x22 > [] ? kill_bdev+0x1b/0x30 > [] ? __blkdev_put+0x54/0x151 > [] ? __fput+0xe7/0x1b4 > [] ? filp_close+0x5e/0x66 > [] ? sys_close+0x8d/0xd1 > [] ? system_call_fastpath+0x16/0x1b > Code: 18 03 00 00 48 8b 32 44 8b 72 18 48 85 f6 75 18 49 89 d0 89 ee 4c 89 e9 83 ca ff 4c 89 e7 e8 57 f8 ff ff 48 89 c6 eb 0a 8b 42 14 <48> 8b 04 c6 48 89 02 53 9d 31 c0 c1 ed 0f 48 85 f6 0f 95 c0 85 > RIP [] kmem_cache_alloc+0x6a/0x99 > RSP > CR2: 0000000000000002 > ---[ end trace 69ecde41a682e571 ]--- > Kernel panic - not syncing: Attempted to kill init! > Pid: 1, comm: init Tainted: G D 2.6.28-rc7-next-20081209-autotest #1 > Call Trace: > [] panic+0x86/0x144 > [] mntput_no_expire+0x1e/0x139 > [] filp_close+0x5e/0x66 > [] exit_fs+0x35/0x46 > [] do_exit+0x75/0x766 > [] oops_end+0xa8/0xad > [] do_page_fault+0x748/0x801 > [] smp_call_function_many+0xdb/0x220 > [] invalidate_bh_lru+0x0/0x42 > [] page_fault+0x1f/0x30 > [] invalidate_bh_lru+0x0/0x42 > [] smp_call_function_many+0xdb/0x220 > [] kmem_cache_alloc+0x6a/0x99 > [] smp_call_function_many+0xdb/0x220 > [] do_writepages+0x23/0x30 > [] invalidate_bh_lru+0x0/0x42 > [] smp_call_function+0x20/0x24 > [] on_each_cpu+0x10/0x22 > [] kill_bdev+0x1b/0x30 > [] __blkdev_put+0x54/0x151 > [] __fput+0xe7/0x1b4 > [] filp_close+0x5e/0x66 > [] sys_close+0x8d/0xd1 > [] system_call_fastpath+0x16/0x1b > BUG: unable to handle kernel NULL pointer dereference at 0000000000000002 > IP: [] kmem_cache_alloc+0x6a/0x99 > PGD 0 > Oops: 0000 [#3] SMP > last sysfs file: /sys/block/ram7/removable > CPU 2 > Modules linked in: > Pid: 1, comm: init Tainted: G D 2.6.28-rc7-next-20081209-autotest #1 > RIP: 0010:[] [] kmem_cache_alloc+0x6a/0x99 > RSP: 0018:ffff88003f9d7a58 EFLAGS: 00010002 > RAX: 0000000000000000 RBX: 0000000000000246 RCX: 0000000000000001 > RDX: ffff880001037800 RSI: 0000000000000002 RDI: ffffffff80854f70 > RBP: 0000000000000020 R08: ffffffff8052a5c0 R09: ffffffff80859a80 > R10: 0000000000000000 R11: ffffffff8067c887 R12: ffffffff80854f70 > R13: ffffffff80253ab1 R14: 0000000000000080 R15: ffffffff80211dd3 > FS: 000000000066d870(0000) GS:ffff88003f803c80(0000) knlGS:0000000000000000 > CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b > CR2: 0000000000000002 CR3: 0000000000201000 CR4: 00000000000006e0 > DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 > DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 > Process init (pid: 1, threadinfo ffff88003f9d6000, task ffff88003f9d8000) > Stack: > 00000000ffffffff ffff88003f9d8000 ffffffff80859a80 ffffffff80859a80 > 0000000000000000 ffffffff80253ab1 ffffffff8052a5c0 ffff88003fa3bfc0 > 0000000000000000 ffffffff8020e525 0000000000000010 000000003e943800 > Call Trace: > [] ? smp_call_function_many+0xdb/0x220 > [] ? dump_trace+0x223/0x232 > [] ? smp_call_function+0x20/0x24 > [] ? native_smp_send_stop+0x1a/0x26 > [] ? panic+0x9a/0x144 > [] ? mntput_no_expire+0x1e/0x139 > [] ? filp_close+0x5e/0x66 > [] ? exit_fs+0x35/0x46 > [] ? do_exit+0x75/0x766 > [] ? oops_end+0xa8/0xad > [] ? do_page_fault+0x748/0x801 > [] ? smp_call_function_many+0xdb/0x220 > [] ? invalidate_bh_lru+0x0/0x42 > [] ? page_fault+0x1f/0x30 > [] ? invalidate_bh_lru+0x0/0x42 > [] ? smp_call_function_many+0xdb/0x220 > [] ? kmem_cache_alloc+0x6a/0x99 > [] ? smp_call_function_many+0xdb/0x220 > [] ? do_writepages+0x23/0x30 > [] ? invalidate_bh_lru+0x0/0x42 > [] ? smp_call_function+0x20/0x24 > [] ? on_each_cpu+0x10/0x22 > [] ? kill_bdev+0x1b/0x30 > [] ? __blkdev_put+0x54/0x151 > [] ? __fput+0xe7/0x1b4 > [] ? filp_close+0x5e/0x66 > [] ? sys_close+0x8d/0xd1 > [] ? system_call_fastpath+0x16/0x1b > Code: 18 03 00 00 48 8b 32 44 8b 72 18 48 85 f6 75 18 49 89 d0 89 ee 4c 89 e9 83 ca ff 4c 89 e7 e8 57 f8 ff ff 48 89 c6 eb 0a 8b 42 14 <48> 8b 04 c6 48 89 02 53 9d 31 c0 c1 ed 0f 48 85 f6 0f 95 c0 85 > RIP [] kmem_cache_alloc+0x6a/0x99 > RSP > CR2: 0000000000000002 > ---[ end trace 69ecde41a682e571 ]--- > Fixing recursive fault but reboot is needed! > [] __rcu_process_callbacks+0x14e/0x1c3 > [] __rcu_process_callbacks+0x14e/0x1c3 > [] rcu_process_callbacks+0x26/0x4a > [] __do_softirq+0x76/0x136 > [] profile_pc+0x21/0x5b > [] call_softirq+0x1c/0x28 > [] do_softirq+0x2c/0x6c > [] smp_apic_timer_interrupt+0x93/0xac > [] apic_timer_interrupt+0x13/0x20 > > > -- > Thanks & Regards, > Kamalesh Babulal, > Linux Technology Center, > IBM, ISTL. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/