Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
        id S1752199AbWAEUNG (ORCPT ); Thu, 5 Jan 2006 15:13:06 -0500
Received: (majordomo@vger.kernel.org) by vger.kernel.org
        id S1752203AbWAEUNF (ORCPT ); Thu, 5 Jan 2006 15:13:05 -0500
Received: from solarneutrino.net ([66.199.224.43]:12548 "EHLO tau.solarneutrino.net")
        by vger.kernel.org with ESMTP id S1752199AbWAEUNE (ORCPT );
        Thu, 5 Jan 2006 15:13:04 -0500
Date: Thu, 5 Jan 2006 15:12:49 -0500
To: Kai Makisara
Cc: James Bottomley, Linus Torvalds, Hugh Dickins, Andrew Morton,
        linux-kernel@vger.kernel.org, linux-scsi@vger.kernel.org,
        ryan@tau.solarneutrino.net
Subject: Re: Fw: crash on x86_64 - mm related?
Message-ID: <20060105201249.GB1795@tau.solarneutrino.net>
References: <1134409531.9994.13.camel@mulgrave>
        <1134411882.9994.18.camel@mulgrave>
        <20051215190930.GA20156@tau.solarneutrino.net>
        <1134705703.3906.1.camel@mulgrave>
        <20051226234238.GA28037@tau.solarneutrino.net>
        <20060104172727.GA320@tau.solarneutrino.net>
Mime-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Disposition: inline
In-Reply-To:
User-Agent: Mutt/1.5.9i
From: Ryan Richter
Sender: linux-kernel-owner@vger.kernel.org
X-Mailing-List: linux-kernel@vger.kernel.org
Content-Length: 11619
Lines: 200

On Wed, Jan 04, 2006 at 11:48:52PM +0200, Kai Makisara wrote:
> > Here's what I got:

Another one.  I can't keep running this kernel - nearly all of our
backup tapes are erased now.  If a drive were to fail today, we would
lose hundreds of GB of irreplaceable data.  I'm going back to 2.6.11.3
until we have a full set of backups again.

st: page attributes before page_release 8
 0: flags:0x060000000000006c mapping:ffff810102bbaaf0 mapcount:2 count:4 pfn:1553908
 1: flags:0x060000000000006c mapping:ffff810102bbaaf0 mapcount:2 count:4 pfn:1553846
 2: flags:0x060000000000006c mapping:ffff810102bbaaf0 mapcount:2 count:4 pfn:1553907
 3: flags:0x060000000000006c mapping:ffff810102bbaaf0 mapcount:2 count:4 pfn:1554431
 4: flags:0x060000000000006c mapping:ffff810102bbaaf0 mapcount:2 count:4 pfn:1553947
 5: flags:0x060000000000006c mapping:ffff810102bbaaf0 mapcount:2 count:4 pfn:1553919
 6: flags:0x060000000000006c mapping:ffff810102bbaaf0 mapcount:2 count:4 pfn:1553940
 7: flags:0x060000000000006c mapping:ffff810102bbaaf0 mapcount:2 count:4 pfn:1553918
Bad page state at free_hot_cold_page (in process 'taper', page ffff81000455f7b0)
flags:0x010000000000000c mapping:ffff810102bbaaf0 mapcount:2 count:0
Backtrace:

Call Trace:{bad_page+116} {free_hot_cold_page+105}
       {sgl_unmap_user_pages+124} {release_buffering+27}
       {st_write+1670} {vfs_write+173}
       {sys_write+69} {system_call+126}

Trying to fix it up, but a reboot is needed
----------- [cut here ] --------- [please bite here ] ---------
Kernel BUG at mm/swap.c:49
invalid operand: 0000 [1] SMP
CPU 0
Modules linked in: bonding
Pid: 2166, comm: taper Tainted: G B 2.6.15 #1
RIP: 0010:[] {put_page+96}
RSP: 0018:ffff81017b6bfe18 EFLAGS: 00010256
RAX: 0000000000000000 RBX: 00000000000000e0 RCX: ffff81000455f7b0
RDX: ffff81000455f7b0 RSI: 0000000000000001 RDI: ffff81000455f7b0
RBP: 0000000000000007 R08: ffff81017b6be000 R09: 0000000000000001
R10: ffff810004878aa0 R11: 0000000000000046 R12: 0000000000000008
R13: ffff8100f6f9e068 R14: 0000000000000000 R15: 0000000000008000
FS: 00002aaaab53d880(0000) GS:ffffffff804a9800(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 00002aaaaab5ffff CR3: 000000017b06e000 CR4: 00000000000006e0
Process taper (pid: 2166, threadinfo ffff81017b6be000, task ffff81017b6b4700)
Stack: 010000000000000c ffffffff8028c534 0000000000008000 0000000000008000
       ffff8100f6f9e000 ffff81000c06e200 0000000000008000 0000000000000040
       0000000000008000 ffffffff8028826d
Call Trace:{sgl_unmap_user_pages+124} {release_buffering+27}
       {st_write+1670} {vfs_write+173}
       {sys_write+69} {system_call+126}

Code: 0f 0b 68 ae b1 36 80 c2 31 00 f0 83 42 08 ff 0f 98 c0 84 c0
RIP {put_page+96} RSP
----------- [cut here ] --------- [please bite here ] ---------
Kernel BUG at mm/swap.c:215
invalid operand: 0000 [2] SMP
CPU 0
Modules linked in: bonding
Pid: 2166, comm: taper Tainted: G B 2.6.15 #1
RIP: 0010:[] {release_pages+79}
RSP: 0018:ffff81017b6bfb08 EFLAGS: 00010256
RAX: 0000000000000000 RBX: ffff81000455f7b0 RCX: ffff81000000f518
RDX: ffff81000455f7b0 RSI: 0000000000000006 RDI: ffff81000c006dd0
RBP: 0000000000000000 R08: ffffffff803cfa68 R09: 0000000000000286
R10: 0000000000000001 R11: 0000000000000001 R12: 0000000000000005
R13: 0000000000000006 R14: ffff81000c006dd0 R15: 0000000000008000
FS: 00002aaaab53d880(0000) GS:ffffffff804a9800(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 00002aaaaab5ffff CR3: 000000017b06e000 CR4: 00000000000006e0
Process taper (pid: 2166, threadinfo ffff81017b6be000, task ffff81017b6b4700)
Stack: 0000000000000000 0000000000000000 0000000000008000 ffff81017b6bfd68
       0000000000000000 0000000000000000 ffffffff80362e14 000000000000000f
       0000000000000000 0000000000000000
Call Trace:{printk+141} {__pagevec_lru_add+195}
       {lru_add_drain+32} {exit_mmap+33}
       {mmput+34} {do_exit+489}
       {die_nmi+0} {do_invalid_op+145}
       {put_page+96} {thread_return+0}
       {error_exit+0} {put_page+96}
       {sgl_unmap_user_pages+124} {release_buffering+27}
       {st_write+1670} {vfs_write+173}
       {sys_write+69} {system_call+126}

Code: 0f 0b 68 ae b1 36 80 c2 d7 00 f0 83 43 08 ff 0f 98 c0 84 c0
RIP {release_pages+79} RSP
<1>Fixing recursive fault but reboot is needed!
Bad page state at prep_new_page (in process 'nfsd', page ffff81000455f7b0)
flags:0x010000000000002c mapping:0000000000000000 mapcount:0 count:0
Backtrace:

Call Trace:{bad_page+116} {prep_new_page+70}
       {buffered_rmqueue+311} {get_page_from_freelist+130}
       {__alloc_pages+86} {kmem_getpages+88}
       {cache_grow+195} {cache_alloc_refill+408}
       {kmem_cache_alloc+51} {cache_make_upcall+85}
       {cache_check+183} {exp_find_key+118}
       {__wake_up+54} {cache_make_upcall+299}
       {fh_verify+401} {nfsd3_proc_getattr+129}
       {nfsd_dispatch+226} {svc_process+1003}
       {default_wake_function+0} {nfsd+0}
       {nfsd+457} {child_rip+8}
       {nfsd+0} {nfsd+0}
       {child_rip+0}
Trying to fix it up, but a reboot is needed
Bad page state at prep_new_page (in process 'nfsd', page ffff81000455b840)
flags:0x010000000000002c mapping:ffff810102bbaaf0 mapcount:2 count:3
Backtrace:

Call Trace:{bad_page+116} {prep_new_page+70}
       {buffered_rmqueue+311} {get_page_from_freelist+130}
       {nfsd+0} {__alloc_pages+86}
       {nfsd+0} {svc_recv+291}
       {default_wake_function+0} {default_wake_function+0}
       {nfsd+0} {nfsd+269}
       {child_rip+8} {nfsd+0}
       {nfsd+0} {child_rip+0}

Trying to fix it up, but a reboot is needed
Bad page state at prep_new_page (in process 'zsh', page ffff81000455f3c0)
flags:0x010000000000002c mapping:ffff810102bbaaf0 mapcount:2 count:3
Backtrace:

Call Trace:{bad_page+116} {prep_new_page+70}
       {buffered_rmqueue+311} {get_page_from_freelist+130}
       {__alloc_pages+86} {do_wp_page+400}
       {__handle_mm_fault+626} {do_page_fault+520}
       {__put_user_4+32} {error_exit+0}

Trying to fix it up, but a reboot is needed
----------- [cut here ] --------- [please bite here ] ---------
Kernel BUG at mm/swap.c:303
invalid operand: 0000 [3] SMP
CPU 0
Modules linked in: bonding
Pid: 171, comm: pdflush Tainted: G B 2.6.15 #1
RIP: 0010:[] {__pagevec_lru_add+95}
RSP: 0018:ffff8100f6fbdba8 EFLAGS: 00010086
RAX: 00000000ffffffff RBX: ffff81000000f300 RCX: 000000000000000f
RDX: ffffffff804ebe60 RSI: ffff81000000f000 RDI: ffff81000000f500
RBP: ffff81000c006dc0 R08: 0000000000000000 R09: ffffffff801a4f68
R10: 0000000000000001 R11: ffff81017fc63c00 R12: ffff810004474798
R13: 0000000000000000 R14: 0000000000000001 R15: 0000000000000001
FS: 00002aaaaae00640(0000) GS:ffffffff804a9800(0000) knlGS:0000000000000000
CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
CR2: 00007fffffd68f60 CR3: 000000017b223000 CR4: 00000000000006e0
Process pdflush (pid: 171, threadinfo ffff8100f6fbc000, task ffff8100f6fbaf80)
Stack: ffff81008bb1af60 ffff8100f6fbdc48 ffff8100f6fbde78 ffff81017d204350
       ffff8100f6f21978 ffffffff801577b9 ffff81017fc63c00 ffffffff80157a00
       ffff810101b3b930 ffffffff8018e5a1
Call Trace:{lru_add_drain+32} {__pagevec_release+9}
       {mpage_writepages+663} {ext3_ordered_writepage+0}
       {mapping_tagged+58} {__sync_single_inode+108}
       {__writeback_single_inode+353} {process_timeout+0}
       {dm_table_any_congested+18} {dm_table_any_congested+18}
       {sync_sb_inodes+468} {keventd_create_kthread+0}
       {writeback_inodes+133} {pdflush+0}
       {background_writeout+102} {__pdflush+289}
       {pdflush+58} {background_writeout+0}
       {kthread+129} {child_rip+8}
       {keventd_create_kthread+0} {kthread+0}
       {child_rip+0}

Code: 0f 0b 68 ae b1 36 80 c2 2f 01 48 8b 93 18 02 00 00 49 8d 44
RIP {__pagevec_lru_add+95} RSP
NMI Watchdog detected LOCKUP on CPU 0
CPU 0
Modules linked in: bonding
Pid: 1619, comm: irqbalance Tainted: G B 2.6.15 #1
RIP: 0010:[] {.text.lock.spinlock+32}
RSP: 0018:ffff8100f648de90 EFLAGS: 00000086
RAX: ffff81000000f300 RBX: ffff81000000f300 RCX: 00002aaaaaac0000
RDX: ffffffff804ebe60 RSI: ffff8100f09339e0 RDI: ffff81000000f500
RBP: ffff81000c006dc0 R08: 00002aaaaaac1000 R09: 00000000ffffffff
R10: ffff81000c399da8 R11: ffff81000c399088 R12: ffff810004474798
R13: 0000000000000000 R14: 00002aaaaaac1000 R15: 00002aaaaaac0000
FS: 00002aaaaae00640(0000) GS:ffffffff804a9800(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 00002aaaaaac0000 CR3: 00000000f6486000 CR4: 00000000000006e0
Process irqbalance (pid: 1619, threadinfo ffff8100f648c000, task ffff810004aac340)
Stack: ffffffff80157b03 ffff81008bb1af60 ffffffff804e7320 ffff8100f65c6580
       ffff81000c399d78 ffff81000c399088 ffffffff801577b9 ffff81000c399088
       ffffffff80160eb9 ffff8100f09339e0
Call Trace:{__pagevec_lru_add+82} {lru_add_drain+32}
       {unmap_region+65} {do_munmap+387}
       {sys_munmap+57} {system_call+126}

Code: f3 90 83 3f 00 7e f9 e9 66 fe ff ff f3 90 83 3f 00 7e f9 e9
console shuts up ...
<0>Kernel panic - not syncing: Aiee, killing interrupt handler!

-ryan