Hi,
I want to ask about strange behavior of linux kernel
on some laptops (namely recent Asus laptops with
dualcore 64bit Athlons). There's a weird bug when
the kernel's under some kind of heavy load. It usually
freezes all processes which run from X11 (including
X11..) - happens usually when:
1] starting some kind of OpenGL (not very often..)
2] transmitting heavy traffic to ethernet (Almost
everytime - all you need to do is make it send
data faster then 1MB/s and wait 10 secs)
3] randomly (yeah it's rare, but happens..)
If running X, the system becomes absolutely
unusuable - all it does is that it catches acpi events
a few more seconds before a complete freeze.
I tried to debug - switch to console and reproduce
the bug with network load -- result is that it doesn't
freeze when in console, but it usually leaves network
interface unusuable (doesn't send/receive, but
statistics (/proc/dev/net) show some tx bytes (not more
than 300B) ). It needs a reboot to work again.
Worst thing is that I could't get any kind of debugging info.
Last thing I got were some watchdog messages (about
tx transmit timeouts), turning off watchdog doesn't help
(it just removes that "crash info"). I'm not getting anything
more in dmesg, no oopses, no panic. I have no idea how
to get more info.
I guess it could be about bad driver for eth - I'm using
r8169 for realtek8168 PCI-E gigabit. afaik Realtek's
r1000 driver makes this problem too.
about versions - I tried all kernels from 2.6.17 to
2.6.20-rc5, all affected. PREEMPT doesn't do any
difference. Turning off SMP also doesn't help.
If anyone wanted exact configurations which I
tested please ask me.
about hardware - this is mostly seen on Asus A6
and similar laptops. A6T, A6Tc, A6Km,....
so please - Is there some better way to debug this?
If I was able to do so, I'd happilly post fixes, but I have
no idea how to get some more info. honestly I'm not
even sure that it's the bug of r8169, as I read some
similar crash reports about disc activity and so...
If anyone encountered this kind of problems before,
please help me id and solve it.
thanks
M. Kratochvil
On Fri, 26 Jan 2007, mirek kratochvil wrote:
> I want to ask about strange behavior of linux kernel on some laptops
> (namely recent Asus laptops with dualcore 64bit Athlons). There's a
> weird bug when the kernel's under some kind of heavy load. It usually
> freezes all processes which run from X11 (including X11..) - happens
> usually when:
[ .. snip .. ]
> about hardware - this is mostly seen on Asus A6 and similar laptops.
> A6T, A6Tc, A6Km,....
What BIOS do the machines have? 04xx versions are known to be horribly
buggy (even the-other-OS(tm) users experience strange things).
In case you have BIOS 04.., try to upgrade to 06.. version. Also, does it
help when you boot with acpi=off kernel commandline parameter? (do you
compile kernel with both acpi and apic support?).
--
Jiri Kosina