Date: Thu, 29 May 2003 00:59:13 +0200
From: Helge Hafting
To: William Lee Irwin III, Andrew Morton, linux-kernel@vger.kernel.org, linux-mm@kvack.org, neilb@cse.unsw.edu.au
Subject: Re: 2.5.70-mm1 bootcrash, possibly RAID-1
Message-ID: <20030528225913.GA1103@hh.idb.hist.no>
In-Reply-To: <20030528113544.GV8978@holomorphy.com>

On Wed, May 28, 2003 at 04:35:44AM -0700, William Lee Irwin III wrote:
>
> This is unusual; I'm having trouble very close to this area. There is
> a remote chance it could be the same problem.
>
> Could you log this to serial and get the rest of the oops/BUG? If it's
> where I think it is, I've been looking at end_page_writeback() and so
> might have an idea or two.

I tried 2.5.70-mm1 on the dual Celeron at home. This one has SCSI
instead of IDE, so I guess it is a RAID-1 problem. This machine has
root on RAID-1 too.

I believe there were several oopses in a row. I captured all of the
last one thanks to a framebuffer with a small font.
Here it is:

Unable to handle kernel paging request at virtual address 8a8a8ab6
*pde=0
OOPS 0000 [#1]
EIP at put_all_bios+0x47/0x80
(edx was the register containing 8a8a8a8a)
Process swapper pid=0 threadinfo c1352000 task=c13f52d0
Call trace:
  raid_end_bio_io
  raid1_end_request
  scsi_request_fn
  bio_endio
  _end_that_request_first
  scsi_end_request
  __wake_up
  scsi_io_completion
  scsi_delete_timer
  sd_rw_intr
  sym_wakeup_done
  scsi_finish_command
  scsi_softirq
  timer_interrupt
  do_softirq
  do_IRQ
  default_idle
  default_idle
  common_interrupt
  default_idle
  default_idle
  default_idle
  cpu_idle
  printk
<0> Kernel panic:fatal exception in interrupt
in interrupt - not syncing
reboot in 300 seconds

This looks very similar to the partial trace from the IDE machine. That
one had everything from _end_that_request_first down to the three
default_idles, but with IDE functions instead of SCSI ones.

Helge Hafting
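As a small aid to reading the oops, the relationship between the faulting address and the value noted in edx can be checked with a few lines of Python. This is only a sketch: the two constants are copied from the report, while the helper names `fault_offset` and `is_repeated_byte` are made up for illustration. The report does not say what object put_all_bios was dereferencing, so the offset is just arithmetic and not a claim about any particular structure field.

```python
# Values copied from the oops report; nothing else here comes from the kernel.
fault_addr = 0x8A8A8AB6   # "virtual address 8a8a8ab6"
edx_value = 0x8A8A8A8A    # register contents noted in the report


def fault_offset(addr: int, base: int) -> int:
    """Return the displacement of the faulting address from a base value."""
    return addr - base


def is_repeated_byte(value: int) -> bool:
    """Return True if all four bytes of a 32-bit value are identical."""
    low_byte = value & 0xFF
    return value == low_byte * 0x01010101


offset = fault_offset(fault_addr, edx_value)
print(f"offset from edx: {offset:#x} ({offset} bytes)")
print(f"edx is a repeated byte pattern: {is_repeated_byte(edx_value)}")
```

The faulting access lands 0x2c bytes past the value in edx, and that value is the byte 0x8a repeated four times. One possible reading is that the pointer was loaded from memory that had been overwritten or filled with a byte pattern, but that is an inference and would need the full register dump and disassembly to confirm.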