2011-06-01 19:07:15

by Christian Kujau

[permalink] [raw]
Subject: 2.6.32: Attempted to kill the idle task?

Hi,

we have a Debian/lenny system (Dell R410) with its "latest"
2.6.32+27~bpo50+1 installed. From time to time the system tends to oops.
Can someone have a quick look at the following, if this makes sense at
all? I suspect the "real" oops message has scrolled away, but this is
all we got so far:

------------------------------------
[4330423.071859] RSP <ffff88012fffdcb8>
[4330423.071926] ---[ end trace b471cad78827bf8a ]--—
[4330423.071994] Kernel panic - not syncing: Attempted to kill the idle
task? [4330423.072068] rid: 0, comm: swapper Tainted: G D 2.6.32-5-amd64
81 [4330423.072139] Call Trace:
[4330423.072206] [afffffff81216ee5>] ? panic+0x86/0x141
[4330423.072277] [afffffff812f6fee>1 ? printk+0x4e/0x58
[4330423.072349] [afffffff81041c07>] ? do_exit+0x72/0x6b5
[4330423.072419] [afffffff8104d011>] ? release_console_sem+0xl7e/0xlaf
[4330423.072493] [afffffff812f9c4d>1 ? oops_end+0xaf/0xb4
[4330423.072563] [afffffff81012734>] ? do_divide_error+0x85/0x81
[4330423.072635] [afffffff81043bdb>1 ? find_busiest_group+0x3cf/0x875
[4330423.072709] [afffffff81192d8a>1 ? vsnprintf+0x9e/0x449
[4330423.072782] [afffffff811b5815>] ? write_vga+0x17/0x4c
[4330423.072852] [afffffff810118db>1 ? divide_error+0x1b/0x20
[4330423.072925] [afffffff81039401>] ? get_avenrun+0x14/0x32
[4330423.072996] [afffffff81043bdb>1 ? find_busiest_group+0x3cf/0x875
[4330423.073069] [afffffff81043b80>] ? find_busiest_group+0x374/0x875
[4330423.073141] [afffffff8103ec61>] ? update_curr+0x131/0x147
[4330423.073212] [afffffff8101651f>1 ? sched_clock+0x5/0x8
[4330423.073283] [afffffff81016539>] ? read_tsc+0xa/0x20
[4330423.073353] [afffffff812f7487>] ? schedule+0x2cc/0x7da
[4330423.073424] [afffffff8106ea2b>1 ? tick_dev_program_event+0x2d/0x95
[4330423.073497] [afffffff8100fee7>] 1 cpu_idle+Oxd8/0xda
------------------------------------

The "original" is attached as a .png, the above is the product of an OCR
conversion.

Thanks,
Christian.
--
make bzImage, not war


Attachments:
p2.png (42.05 kB)
dmesg_i.txt (69.65 kB)
Download all attachments

2011-06-01 23:24:56

by Christian Kujau

[permalink] [raw]
Subject: Re: 2.6.32: Attempted to kill the idle task?

On Wed, 1 Jun 2011 at 15:06, Connor H wrote:
> > we have a Debian/lenny system (Dell R410) with its "latest"
> > 2.6.32+27~bpo50+1 installed. From time to time the system tends to oops.
> > Can someone have a quick look at the following, if this makes sense at
> > all? I suspect the "real" oops message has scrolled away, but this is
> > all we got so far:
> >
> > ------------------------------------
> > [4330423.071859] RSP <ffff88012fffdcb8>
> > [4330423.071926] ---[ end trace b471cad78827bf8a ]--—
> > [4330423.071994] Kernel panic - not syncing: Attempted to kill the idle
> > task? [4330423.072068] rid: 0, comm: swapper Tainted: G D 2.6.32-5-amd64
> > 81 [4330423.072139] Call Trace:
> > [4330423.072206] [afffffff81216ee5>] ? panic+0x86/0x141
> > [4330423.072277] [afffffff812f6fee>1 ? printk+0x4e/0x58
> > [4330423.072349] [afffffff81041c07>] ? do_exit+0x72/0x6b5
> > [4330423.072419] [afffffff8104d011>] ? release_console_sem+0xl7e/0xlaf
> > [4330423.072493] [afffffff812f9c4d>1 ? oops_end+0xaf/0xb4
> > [4330423.072563] [afffffff81012734>] ? do_divide_error+0x85/0x81
> Here is the division fail
> kernel/sched_fair.c #3164
>
> in find_busiest_group

Thanks for looking into it!

> sds.avg_load = (SCHED_POWER_SCALE * sds.total_load) / sds.total_pwr;
>
> kernel/sched_fair.c #2853
> sds set from this call
> update_sd_lb_stats(sd, this_cpu, idle, cpus, balance, &sds);
>
> which calculates total_pwr by looping all sched_group 's in
> sds->total_pwr += sg->cpu_power;
>
> so hopefully Ingo or Peter can shed some light,
>
> Connor

Christian.
--
BOFH excuse #365:

parallel processors running perpendicular today