Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S964859AbWHCRQc (ORCPT ); Thu, 3 Aug 2006 13:16:32 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S932584AbWHCRQc (ORCPT ); Thu, 3 Aug 2006 13:16:32 -0400 Received: from wx-out-0102.google.com ([66.249.82.193]:13803 "EHLO wx-out-0102.google.com") by vger.kernel.org with ESMTP id S932585AbWHCRQb (ORCPT ); Thu, 3 Aug 2006 13:16:31 -0400 DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=V10CCQ1zpEU1hO6CNQmAEMFykg7H40iZM+TxirMLyc+xO1G1n2rDIlDKGcg26mcLRPl1J37i82q3i2Of19NouFf4az3Q/JckvNQzMhjlnbesYHsuGhdPNhv1Hc9YtNDNdXBuYFnbRj22jC/rAdY9UhOsbRyQfhN9g0hRHfRVpgk= Message-ID: Date: Thu, 3 Aug 2006 10:16:29 -0700 From: "Robert Crocombe" To: "Steven Rostedt" Subject: Re: Problems with 2.6.17-rt8 Cc: linux-kernel@vger.kernel.org, "Ingo Molnar" , "Thomas Gleixner" , "Bill Huey" In-Reply-To: <1154621059.32264.29.camel@localhost.localdomain> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Content-Disposition: inline References: <1154541079.25723.8.camel@localhost.localdomain> <1154615261.32264.6.camel@localhost.localdomain> <1154618859.32264.19.camel@localhost.localdomain> <1154621059.32264.29.camel@localhost.localdomain> Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 25237 Lines: 601 On 8/3/06, Steven Rostedt wrote: > Are you also getting any warnings or BUG reports before this. In your > other dmesg, it should a bug being reported. This could cause problems > later on. Well, I'm getting a "nobody cared" on the tg3 (there's a separate email about that to the list since it seemed sorta unrelated). This is straight from the console: ***************************************************************************** Time: acpi_pm clocksource has been installed. * * * REMINDER, the following debugging options are turned on in your .config: * * * * CONFIG_DEBUG_RT_MUTEXES * * CONFIG_DEBUG_PREEMPT * * CONFIG_CRITICAL_PREEMPT_TIMING * * CONFIG_CRITICAL_IRQSOFF_TIMING * * CONFIG_LATENCY_TRACE * * * * they may increase runtime overhead and latencies. * * * ***************************************************************************** Freeing unused kernel memory: 240k freed Write protecting the kernel read-only data: 1036k input: ImPS/2 Generic Wheel Mouse as /class/input/input1 md: Autodetecting RAID arrays. md: autorun ... md: considering sdb1 ... md: adding sdb1 ... md: adding sda1 ... md: created md0 md: bind md: bind md: running: md: md0: raid array is not clean -- starting background reconstruction raid1: raid set md0 active with 2 out of 2 mirrors md: ... autorun DONE. md: syncing RAID array md0 md: minimum _guaranteed_ reconstruction speed: 1000 KB/sec/disc. md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec) for reconstruction. md: using 128k window, over a total of 287836480 blocks. kjournald starting. Commit interval 5 seconds EXT3-fs: mounted filesystem with ordered data mode. irq 106: nobody cared (try booting with the "irqpoll" option) Call Trace: {__report_bad_irq+61} {thread_return+230} {note_interrupt+487} {keventd_create_kthread+0} {thread_simple_irq+137} {do_irqd+0} {do_irqd+220} {do_irqd+0} {kthread+224} {schedule_tail+198} {child_rip+8} {keventd_create_kthread+0} {thread_return+230} {thread_return+230} {thread_return+230} {kthread+0} {child_rip+0} --------------------------- | preempt count: 00000001 ] | 1-level deep critical section nesting: ---------------------------------------- .. [] .... _raw_spin_lock_irq+0x21/0x2e .....[] .. ( <= thread_simple_irq+0x70/0x98) handlers: [] (tg3_interrupt_tagged+0x0/0xa7 [tg3]) turning off IO-APIC fast mode. irq 106: nobody cared (try booting with the "irqpoll" option) Call Trace: {__report_bad_irq+61} {note_interrupt+487} {keventd_create_kthread+0} {thread_simple_irq+137} {do_irqd+0} {do_irqd+220} {do_irqd+0} {kthread+224} {schedule_tail+198} {child_rip+8} {keventd_create_kthread+0} {thread_return+230} {thread_return+230} {thread_return+230} {kthread+0} {child_rip+0} --------------------------- | preempt count: 00000001 ] | 1-level deep critical section nesting: ---------------------------------------- .. [] .... _raw_spin_lock_irq+0x21/0x2e .....[] .. ( <= thread_simple_irq+0x70/0x98) handlers: [] (tg3_interrupt_tagged+0x0/0xa7 [tg3]) md0_raid1/1118[CPU#3]: BUG in debug_rt_mutex_unlock at kernel/rtmutex-debug.c:471 Call Trace: {_raw_spin_lock_irqsave+34} {__WARN_ON+105} {__WARN_ON+36} {debug_rt_mutex_unlock+204} {rt_lock_slowunlock+30} {__lock_text_start+14} {kmem_cache_alloc+207} {mempool_alloc_slab+22} {mempool_alloc+80} {constant_test_bit+9} {_raw_spin_unlock+51} {rt_lock_slowunlock+70} {get_request+375} {mcount+45} {get_request_wait+45} {mcount+45} {as_merge+0} {as_merge+22} {__make_request+750} {md_thread+0} {generic_make_request+380} {constant_test_bit+9} {_raw_spin_unlock+51} {md_thread+0} {raid1d+246} {mcount+45} {mcount+45} {rt_lock_slowunlock+70} {thread_return+230} {thread_return+230} {constant_test_bit+9} {_raw_spin_unlock+51} {rt_lock_slowunlock+70} {thread_return+230} {__lock_text_start+14} {md_thread+0} {keventd_create_kthread+0} {md_thread+280} {autoremove_wake_function+0} {kthread+224} {schedule_tail+198} {child_rip+8} {keventd_create_kthread+0} {thread_return+230} {thread_return+230} {thread_return+230} {kthread+0} {child_rip+0} --------------------------- | preempt count: 00000002 ] | 2-level deep critical section nesting: ---------------------------------------- .. [] .... _raw_spin_lock+0x1b/0x28 .....[] .. ( <= rt_lock_slowunlock+0x16/0x70) .. [] .... _raw_spin_lock_irqsave+0x22/0x33 .....[] .. ( <= __WARN_ON+0x24/0x8a) md0_raid1/1118[CPU#3]: BUG in debug_rt_mutex_unlock at kernel/rtmutex-debug.c:472 Call Trace: {_raw_spin_lock_irqsave+34} {__WARN_ON+105} {__WARN_ON+36} {debug_rt_mutex_unlock+366} {rt_lock_slowunlock+30} {__lock_text_start+14} {kmem_cache_alloc+207} {mempool_alloc_slab+22} {mempool_alloc+80} {constant_test_bit+9} {_raw_spin_unlock+51} {rt_lock_slowunlock+70} {get_request+375} {mcount+45} {get_request_wait+45} {mcount+45} {as_merge+0} {as_merge+22} {__make_request+750} {md_thread+0} {generic_make_request+380} {constant_test_bit+9} {_raw_spin_unlock+51} {md_thread+0} {raid1d+246} {mcount+45} {mcount+45} {rt_lock_slowunlock+70} {thread_return+230} {thread_return+230} {constant_test_bit+9} {_raw_spin_unlock+51} {rt_lock_slowunlock+70} {thread_return+230} {__lock_text_start+14} {md_thread+0} {keventd_create_kthread+0} {md_thread+280} {autoremove_wake_function+0} {kthread+224} {schedule_tail+198} {child_rip+8} {keventd_create_kthread+0} {thread_return+230} {thread_return+230} {thread_return+230} {kthread+0} {child_rip+0} --------------------------- | preempt count: 00000002 ] | 2-level deep critical section nesting: ---------------------------------------- .. [] .... _raw_spin_lock+0x1b/0x28 .....[] .. ( <= rt_lock_slowunlock+0x16/0x70) .. [] .... _raw_spin_lock_irqsave+0x22/0x33 .....[] .. ( <= __WARN_ON+0x24/0x8a) ----------- [cut here ] --------- [please bite here ] --------- Kernel BUG at kernel/rtmutex.c:639 invalid opcode: 0000 [1] PREEMPT SMP CPU 3 Modules linked in: nfsd exportfs lockd sunrpc ohci1394 ieee1394 tg3 Pid: 1118, comm: md0_raid1 Not tainted 2.6.17-rt8_local_00 #3 RIP: 0010:[] {rt_lock_slowlock+186} RSP: 0018:ffff8101ea5bd9e8 EFLAGS: 00010246 RAX: ffff8107eac1e340 RBX: 0000000000000010 RCX: 0000000000240180 RDX: ffff8107eac1e340 RSI: ffffffff8027927e RDI: ffff810600115ca0 RBP: ffff8101ea5bdaa8 R08: ffff8104000d7f38 R09: ffff8101ea5bd9e8 R10: ffff8104000d7f38 R11: 0000000000000023 R12: ffff810600115ca0 R13: ffff8103ead2a080 R14: ffffffff8027927e R15: ffff8103eadbe0e0 FS: 00002afcbce1c770(0000) GS:ffff810600211340(0000) knlGS:0000000000000000 CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b CR2: 000000334d703088 CR3: 00000007e8ffa000 CR4: 00000000000006e0 Process md0_raid1 (pid: 1118, threadinfo ffff8101ea5bc000, task ffff8107eac1e340) Stack: 111111110000008c ffff8101ea5bd9f0 ffff8101ea5bd9f0 ffff8101ea5bda00 ffff8101ea5bda00 0000000000000000 111111110000008c ffff8101ea5bda20 ffff8101ea5bda20 ffff8101ea5bda30 Call Trace: {rt_lock+18} {kmem_cache_alloc+84} {mempool_alloc_slab+22} {mempool_alloc+80} {mcount+45} {as_set_request+0} {as_set_request+37} {elv_set_request+27} {get_request+411} {mcount+45} {get_request_wait+45} {mcount+45} {as_merge+0} {as_merge+22} {__make_request+750} {md_thread+0} {generic_make_request+380} {constant_test_bit+9} {_raw_spin_unlock+51} {md_thread+0} {raid1d+246} {mcount+45} {mcount+45} {rt_lock_slowunlock+70} {thread_return+230} {thread_return+230} {constant_test_bit+9} {_raw_spin_unlock+51} {rt_lock_slowunlock+70} {thread_return+230} {__lock_text_start+14} {md_thread+0} {keventd_create_kthread+0} {md_thread+280} {autoremove_wake_function+0} {kthread+224} {schedule_tail+198} {child_rip+8} {keventd_create_kthread+0} {thread_return+230} {thread_return+230} {thread_return+230} {kthread+0} {child_rip+0} --------------------------- | preempt count: 00000002 ] | 2-level deep critical section nesting: ---------------------------------------- .. [] .... _raw_spin_lock+0x1b/0x28 .....[] .. ( <= rt_lock_slowlock+0x3b/0x213) .. [] .... _raw_spin_trylock+0x1b/0x5f .....[] .. ( <= oops_begin+0x28/0x77) Code: 0f 0b 68 70 44 4c 80 c2 7f 02 65 48 8b 04 25 00 00 00 00 41 RIP {rt_lock_slowlock+186} RSP etc. and so forth. Oh wait, here's one that happens straight off after boot: ***************************************************************************** Time: acpi_pm clocksource has been installed. * * * REMINDER, the following debugging options are turned on in your .config: * * * * CONFIG_DEBUG_RT_MUTEXES * * CONFIG_DEBUG_PREEMPT * * CONFIG_CRITICAL_PREEMPT_TIMING * * CONFIG_CRITICAL_IRQSOFF_TIMING * * CONFIG_LATENCY_TRACE * * * * they may increase runtime overhead and latencies. * * * ***************************************************************************** Freeing unused kernel memory: 240k freed Write protecting the kernel read-only data: 1036k input: ImPS/2 Generic Wheel Mouse as /class/input/input1 md: Autodetecting RAID arrays. md: autorun ... md: considering sdb1 ... md: adding sdb1 ... md: adding sda1 ... md: created md0 md: bind md: bind md: running: md: md0: raid array is not clean -- starting background reconstruction raid1: raid set md0 active with 2 out of 2 mirrors md: ... autorun DONE. md: syncing RAID array md0 md: minimum _guaranteed_ reconstruction speed: 1000 KB/sec/disc. md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec) for reconstruction. md: using 128k window, over a total of 287836480 blocks. EXT3-fs: INFO: recovery required on readonly filesystem. EXT3-fs: write access will be enabled during recovery. kjournald starting. Commit interval 5 seconds EXT3-fs: recovery complete. EXT3-fs: mounted filesystem with ordered data mode. md0_resync/1119[CPU#1]: BUG in debug_rt_mutex_unlock at kernel/rtmutex-debug.c:471 Call Trace: {_raw_spin_lock_irqsave+34} {__WARN_ON+105} {__WARN_ON+36} {debug_rt_mutex_unlock+204} {rt_lock_slowunlock+30} {__lock_text_start+14} {kmem_cache_alloc+207} {kobject_get+31} {scsi_get_command+77} {scsi_prep_fn+339} {elv_next_request+149} {kobject_get+31} {scsi_request_fn+128} {__generic_unplug_device+45} {generic_unplug_device+37} {unplug_slaves+132} {raid1_unplug+29} {md_do_sync+1303} {md_thread+0} {constant_test_bit+9} {mcount+45} {rt_lock_slowunlock+70} {md_thread+0} {keventd_create_kthread+0} {md_thread+280} {mcount+45} {md_thread+0} {kthread+224} {schedule_tail+198} {child_rip+8} {keventd_create_kthread+0} {thread_return+230} {thread_return+230} {thread_return+230} {kthread+0} {child_rip+0} --------------------------- | preempt count: 00000002 ] | 2-level deep critical section nesting: ---------------------------------------- .. [] .... _raw_spin_lock+0x1b/0x28 .....[] .. ( <= rt_lock_slowunlock+0x16/0x70) .. [] .... _raw_spin_lock_irqsave+0x22/0x33 .....[] .. ( <= __WARN_ON+0x24/0x8a) md0_resync/1119[CPU#1]: BUG in debug_rt_mutex_unlock at kernel/rtmutex-debug.c:472 Call Trace: {_raw_spin_lock_irqsave+34} {__WARN_ON+105} {__WARN_ON+36} {debug_rt_mutex_unlock+366} {rt_lock_slowunlock+30} {__lock_text_start+14} {kmem_cache_alloc+207} {kobject_get+31} {scsi_get_command+77} {scsi_prep_fn+339} {elv_next_request+149} {kobject_get+31} {scsi_request_fn+128} {__generic_unplug_device+45} {generic_unplug_device+37} {unplug_slaves+132} {raid1_unplug+29} {md_do_sync+1303} {md_thread+0} {constant_test_bit+9} {mcount+45} {rt_lock_slowunlock+70} {md_thread+0} {keventd_create_kthread+0} {md_thread+280} {mcount+45} {md_thread+0} {kthread+224} {schedule_tail+198} {child_rip+8} {keventd_create_kthread+0} {thread_return+230} {thread_return+230} {thread_return+230} {kthread+0} {child_rip+0} --------------------------- | preempt count: 00000002 ] | 2-level deep critical section nesting: ---------------------------------------- .. [] .... _raw_spin_lock+0x1b/0x28 .....[] .. ( <= rt_lock_slowunlock+0x16/0x70) .. [] .... _raw_spin_lock_irqsave+0x22/0x33 .....[] .. ( <= __WARN_ON+0x24/0x8a) ----------- [cut here ] --------- [please bite here ] --------- Kernel BUG at kernel/rtmutex.c:639 invalid opcode: 0000 [1] PREEMPT SMP CPU 1 Modules linked in: Pid: 1119, comm: md0_resync Not tainted 2.6.17-rt8_local_00 #3 RIP: 0010:[] {rt_lock_slowlock+186} RSP: 0018:ffff8105eac63a58 EFLAGS: 00010246 RAX: ffff8102002b2db0 RBX: 0000000000000020 RCX: 00000000000c0080 RDX: ffff8102002b2db0 RSI: ffffffff8027927e RDI: ffff810200115ca0 RBP: ffff8105eac63b18 R08: ffff8104000d7650 R09: ffff8105eac63a58 R10: ffff8104000d7650 R11: ffff8104000d7650 R12: ffff810200115ca0 R13: ffff8107eac3f600 R14: ffffffff8027927e R15: ffff81020033a0c0 FS: 00002adbe7c426d0(0000) GS:ffff8102001e3340(0000) knlGS:0000000000000000 CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b CR2: 00002aed3bd60770 CR3: 0000000000201000 CR4: 00000000000006e0 Process md0_resync (pid: 1119, threadinfo ffff8105eac62000, task ffff8102002b2db0) Stack: 111111110000008c ffff8105eac63a60 ffff8105eac63a60 ffff8105eac63a70 ffff8105eac63a70 0000000000000000 111111110000008c ffff8105eac63a90 ffff8105eac63a90 ffff8105eac63aa0 Call Trace: {rt_lock_slowunlock+70} {rt_lock+18} {kmem_cache_alloc+84} {mempool_alloc_slab+22} {mempool_alloc+80} {__lock_text_start+14} {scsi_get_command+253} {scsi_prep_fn+634} {elv_next_request+149} {kobject_get+31} {scsi_request_fn+128} {__generic_unplug_device+45} {generic_unplug_device+37} {unplug_slaves+132} {raid1_unplug+29} {md_do_sync+1303} {md_thread+0} {constant_test_bit+9} {mcount+45} {rt_lock_slowunlock+70} {md_thread+0} {keventd_create_kthread+0} {md_thread+280} {mcount+45} {md_thread+0} {kthread+224} {schedule_tail+198} {child_rip+8} {keventd_create_kthread+0} {thread_return+230} {thread_return+230} {thread_return+230} {kthread+0} {child_rip+0} --------------------------- | preempt count: 00000002 ] | 2-level deep critical section nesting: ---------------------------------------- .. [] .... _raw_spin_lock+0x1b/0x28 .....[] .. ( <= rt_lock_slowlock+0x3b/0x213) .. [] .... _raw_spin_trylock+0x1b/0x5f .....[] .. ( <= oops_begin+0x28/0x77) Code: 0f 0b 68 70 44 4c 80 c2 7f 02 65 48 8b 04 25 00 00 00 00 41 RIP {rt_lock_slowlock+186} RSP > ?? > > Are you sure that vmlinux is the one created with the given config file? > There's been times when I added some configs and either forgot to > compile, or the compile failed, and I didn't notice, so the old binary > was being used. I think so. I just rebuilt the thing and still no love: rcrocomb@spanky:compressed$ cd ~/kernel/2.6.17-rt8 rcrocomb@spanky:2.6.17-rt8$ make clean CLEAN arch/x86_64/boot/compressed/ CLEAN arch/x86_64/boot CLEAN /home/rcrocomb/kernel/2.6.17-rt8 CLEAN arch/x86_64/ia32 CLEAN arch/x86_64/kernel CLEAN drivers/char CLEAN drivers/ieee1394 CLEAN drivers/md CLEAN drivers/scsi/aic7xxx CLEAN init CLEAN kernel CLEAN lib CLEAN usr CLEAN .tmp_versions CLEAN vmlinux System.map .tmp_kallsyms1.o .tmp_kallsyms1.S .tmp_kallsyms2.o .tmp_kallsyms2.S .tmp_vmli nux1 .tmp_vmlinux2 .tmp_System.map rcrocomb@spanky:2.6.17-rt8$ make -j4 . . . buildy buildy buildy . . . LD [M] fs/lockd/lockd.ko LD [M] fs/nfs/nfs.ko LD [M] fs/nfsd/nfsd.ko LD [M] net/sunrpc/sunrpc.ko rcrocomb@spanky:2.6.17-rt8$ cd arch/x86_64/boot/compressed/ rcrocomb@spanky:compressed$ gdb vmlinux GNU gdb Red Hat Linux (6.3.0.0-1.122rh) Copyright 2004 Free Software Foundation, Inc. GDB is free software, covered by the GNU General Public License, and you are welcome to change it and/or distribute copies of it under certain conditions. Type "show copying" to see the conditions. There is absolutely no warranty for GDB. Type "show warranty" for details. This GDB was configured as "x86_64-redhat-linux-gnu"...(no debugging symbols found) Using host libthread_db library "/lib64/libthread_db.so.1". (gdb) li *0xffffffff802792f9 No symbol table is loaded. Use the "file" command. (gdb) I am at [even more of] a loss. -- Robert Crocombe rcrocomb@gmail.com I hope warnings like this (one for each and every file, it seems): scripts/mod/empty.c:1: warning: -ffunction-sections disabled; it makes profiling impossible and ld: warning: i386:x86-64 architecture of input file `arch/x86_64/boot/compressed/head.o' is incompatible with i386 output don't indicate anything particularly maleficent. - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/