2002-01-24 13:12:07

by Anton Altaparmakov

[permalink] [raw]
Subject: 2.5.3-pre4 panics on boot )-:

oops is below...

Can't attach .config nor decode oops at the moment as the machine is now
dead (I am remote), I can reboot it remotely but LILO doesn't accept my
remote commands (even though I send a break signal it doesn't do anything,
just sits at prompt and after time out boots into -pre4 and dies - any
ideas?). )-:

Is there a patch I can apply to fix this panic? Assuming one of the
scheduler ones. But which one?

Best regards,

Anton

Using local APIC timer interrupts.
calibrating APIC timer ...
..... CPU clock speed is 1336.3655 MHz.
..... host bus clock speed is 267.2731 MHz.
cpu: 0, clocks: 2672731, slice: 1336365
CPU0<T0:2672720,T1:1336352,D:3,S:1336365,C:2672731>
kernel BUG at sched.c:588!
invalid operand: 0000
CPU: 0
EIP: 0010:[<c0116d12>] Not tainted
EFLAGS: 00010082
eax: 0000001b ebx: c0386000 ecx: 00000001 edx: 00000c38
esi: c03c7cc0 edi: c0386000 ebp: c0387f84 esp: c0387f58
ds: 0018 es: 0018 ss: 0018
Process swapper (pid: 0, stackpage=c0387000)
Stack: c02c9739 0000024c c0387f8c c0387fc0 c0105000 0008e000 c0107521 c0386000
c0386000 c0387fc0 c0105000 0008e000 c0108ad9 00010f00 c0105050 00000000
c0387fc0 c0105000 0008e000 00000001 00000018 00000018 00000078 c010711d
Call Trace: [<c0105000>] [<c0107521>] [<c0105000>] [<c0108ad9>] [<c0105050>]
[<c0105000>] [<c010711d>] [<c0105019>] [<c0105050>]

Code: 0f 0b 58 5a fa f0 fe 0e 0f 88 60 f5 19 00 8b 4d f0 8b 01 85
spurious 8259A interrupt: IRQ7.
Kernel panic: Attempted to kill the idle task!
In idle task - not syncing



--
"I've not lost my mind. It's backed up on tape somewhere." - Unknown
--
Anton Altaparmakov <aia21 at cam.ac.uk> (replace at with @)
Linux NTFS Maintainer / WWW: http://linux-ntfs.sf.net/
ICQ: 8561279 / WWW: http://www-stu.christs.cam.ac.uk/~aia21/


2002-01-24 13:29:11

by Alexander Viro

[permalink] [raw]
Subject: Re: 2.5.3-pre4 panics on boot )-:



On Thu, 24 Jan 2002, Anton Altaparmakov wrote:

> oops is below...
>
> Can't attach .config nor decode oops at the moment as the machine is now
> dead (I am remote), I can reboot it remotely but LILO doesn't accept my
> remote commands (even though I send a break signal it doesn't do anything,
> just sits at prompt and after time out boots into -pre4 and dies - any
> ideas?). )-:
>
> Is there a patch I can apply to fix this panic? Assuming one of the
> scheduler ones. But which one?

In arch/i386/kernel/smpboot.c find
global_irq_holder = 0;
and s/0/NO_PROC_ID/

2002-01-24 13:36:00

by Jeff Garzik

[permalink] [raw]
Subject: Re: 2.5.3-pre4 panics on boot )-:

Anton Altaparmakov wrote:
> Can't attach .config nor decode oops at the moment as the machine is now

Normally that would suck but this is a BUG, so no problem...

[...]
> Using local APIC timer interrupts.
> calibrating APIC timer ...
> ..... CPU clock speed is 1336.3655 MHz.
> ..... host bus clock speed is 267.2731 MHz.
> cpu: 0, clocks: 2672731, slice: 1336365
> CPU0<T0:2672720,T1:1336352,D:3,S:1336365,C:2672731>
> kernel BUG at sched.c:588!
[...]
> spurious 8259A interrupt: IRQ7.
> Kernel panic: Attempted to kill the idle task!
> In idle task - not syncing


Scheduling in an interrupt? Ouch.

I would assume Ingo's latest, which is J6 AFAICS:
http://people.redhat.com/mingo/O(1)-scheduler/sched-O1-2.5.3-pre4-J6.patch

Jeff



--
Jeff Garzik | "I went through my candy like hot oatmeal
Building 1024 | through an internally-buttered weasel."
MandrakeSoft | - goats.com

2002-01-24 17:05:41

by Anton Altaparmakov

[permalink] [raw]
Subject: Re: 2.5.3-pre4 panics on boot )-:

At 13:28 24/01/02, Alexander Viro wrote:
>On Thu, 24 Jan 2002, Anton Altaparmakov wrote:
> > oops is below...
> >
> > Can't attach .config nor decode oops at the moment as the machine is now
> > dead (I am remote), I can reboot it remotely but LILO doesn't accept my
> > remote commands (even though I send a break signal it doesn't do anything,
> > just sits at prompt and after time out boots into -pre4 and dies - any
> > ideas?). )-:
> >
> > Is there a patch I can apply to fix this panic? Assuming one of the
> > scheduler ones. But which one?
>
>In arch/i386/kernel/smpboot.c find
> global_irq_holder = 0;
>and s/0/NO_PROC_ID/

That worked a treat. Thanks!

Best regards,

Anton


--
"I've not lost my mind. It's backed up on tape somewhere." - Unknown
--
Anton Altaparmakov <aia21 at cam.ac.uk> (replace at with @)
Linux NTFS Maintainer / WWW: http://linux-ntfs.sf.net/
ICQ: 8561279 / WWW: http://www-stu.christs.cam.ac.uk/~aia21/