Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S261409AbUJZUNy (ORCPT ); Tue, 26 Oct 2004 16:13:54 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S261452AbUJZUNW (ORCPT ); Tue, 26 Oct 2004 16:13:22 -0400 Received: from fire.osdl.org ([65.172.181.4]:52612 "EHLO fire-1.osdl.org") by vger.kernel.org with ESMTP id S261440AbUJZULx (ORCPT ); Tue, 26 Oct 2004 16:11:53 -0400 Message-ID: <417EAD2F.7040608@osdl.org> Date: Tue, 26 Oct 2004 13:01:51 -0700 From: "Randy.Dunlap" Organization: OSDL User-Agent: Mozilla Thunderbird 0.8 (X11/20040913) X-Accept-Language: en-us, en MIME-Version: 1.0 To: Bartlomiej Zolnierkiewicz CC: Andrew Morton , linux-kernel@vger.kernel.org Subject: Re: 2.6.9-mm1 References: <20041022032039.730eb226.akpm@osdl.org> <417D7EB9.4090800@osdl.org> <20041025155626.11b9f3ab.akpm@osdl.org> <417D88BB.70907@osdl.org> <20041025164743.0af550ce.akpm@osdl.org> <417D8DFF.1060104@osdl.org> <417DBEC1.5000701@osdl.org> <417E71C1.1080400@osdl.org> <58cb370e041026094016ac67d0@mail.gmail.com> In-Reply-To: <58cb370e041026094016ac67d0@mail.gmail.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3918 Lines: 130 Bartlomiej Zolnierkiewicz wrote: > On Tue, 26 Oct 2004 08:48:17 -0700, Randy.Dunlap wrote: > >>>>>>>Yes, that gets further. :( >>>>>>>Maybe I'll just (try) apply the kexec patch to a vanilla kernel. >>>> >>>> >>>> >>>>IDE PIO changes are the part of a vanilla kernel. >>>> >>>>If vanilla kernel (+akpm's fix) works OK then >>>>this bug is not mine fault. :) >>>> >>>> >>>> >>>>>>I doubt if it'll help much. It looks like IDE PIO got badly broken. >>>> >>>> >>>> >>>>Weird, this code was in -mm for over a month. >>>> >>>> >>>> >>>>>>That's something we have to fix - could you work with Bart on it >>>>>>please? >>>>> >>>>> >>>>>Sure. Bart? >>>> >>>> >>>> >>>>I need more data, IDE PIO works fine here. >>>> >>>> >>>> >>>>>>How come your disks are running in PIO mode anyway? >>>> >>>> >>>> >>>>Maybe disks are runing in DMA mode but some application >>>>triggers PIO access (IDENTIFY command, S.M.A.R.T. etc.)... >>>> >>>> >>>> >>>>>No idea. >>> >>> >>>Andrew made me look. Duh. It's because I'm booting with >>>ide=nodma. >>> >>>So Bart, can you check the noautodma=1 code path? >>>And I'll test it again on Tuesday without using ide=nodma. >> >>Booting 2.6.9-mm1 without using "ide=nodma" works well for me. >>No other kernel changes. > > > I audited the code and only found the unrelated bug in > /proc/ide/hd?/smart_thresholds, fix below... > > --- ide-disk.c.orig 2004-10-26 15:50:51.000000000 +0200 > +++ ide-disk.c 2004-10-26 18:34:50.736448416 +0200 > @@ -977,6 +977,7 @@ > args.tfRegister[IDE_HCYL_OFFSET] = SMART_HCYL_PASS; > args.tfRegister[IDE_COMMAND_OFFSET] = WIN_SMART; > args.command_type = IDE_DRIVE_TASK_IN; > + args.data_phase = TASKFILE_IN; > args.handler = &task_in_intr; > (void) smart_enable(drive); > return ide_raw_taskfile(drive, &args, buf); > > I tried reproducing the OOPS but I couldn't. Little bird tells me that > this bug is SMP and/or highmem specific (I don't have such hardware). > > Randy, could you "ide=nodma" with 2.6.10-rc1 (+akpm's fix) and 2.6.9? Sure, did that. 2.6.9 works fine, 2.6.10-rc1 dies as with 2.6.9-mm1. Next? Unable to handle kernel paging request at virtual address fffea000 printing eip: c029f206 *pde = 00610067 *pte = 00000000 Oops: 0002 [#1] SMP DEBUG_PAGEALLOC Modules linked in: CPU: 0 EIP: 0060:[] Not tainted VLI EFLAGS: 00010006 (2.6.10-rc1) EIP is at ide_insw+0xd/0x13 eax: 000001f0 ebx: c05b322c ecx: 00000100 edx: 000001f0 esi: c05b322c edi: fffea000 ebp: c0539e78 esp: c0539e74 ds: 007b es: 007b ss: 0068 Process swapper (pid: 0, threadinfo=c0538000 task=c0452b80) Stack: c05b3180 c0539e98 c029f6bb 000001f0 fffea000 00000100 c05b322c 00000080 fffea000 c0539eb8 c02a302d c05b322c fffea000 00000080 00000000 00000000 c05b3180 c0539ee4 c02a3714 c05b322c fffea000 00000080 00000000 fffea000 Call Tra [] show_stack+0xaf/0xb7 [] show_registers+0x15d/0x1d2 [] die+0x106/0x18e [] do_page_fault+0x4da/0x669 [] error_code+0x2d/0x38 [] ata_input_data+0x98/0xa0 [] taskfile_input_data+0x26/0x49 [] ide_pio_sector+0xcb/0xe5 [] task_in_intr+0xe2/0xfe [] ide_intr+0xb6/0x14f [] handle_IRQ_event+0x38/0x69 [] __do_IRQ+0xe2/0x158 [] do_IRQ+0x36/0x60 [] common_interrupt+0x18/0x20 [] cpu_idle+0x31/0x40 [] start_kernel+0x179/0x195 [] 0xc0100211 Code: 90 90 90 90 90 55 89 e5 8b 55 08 ec 0f b6 c0 5d c3 55 89 e5 8b 55 08 66 e <0>Kernel panic - not syncing: Fatal exception in interruptce: -- ~Randy - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/