2005-11-09 14:22:26

by Lee

[permalink] [raw]
Subject: kernel crash debugging

Hello,

I am running kernel version '2.6.13-gentoo-r3'

My hardware is as follows:
- motherboard: Tyan Tiger 230T
- processors: 2 Pentium II 1.13Ghz
- memory: 1.5GB

I have been having intermittent lockups for a while now.

At first, I thought it had something to do with vmware, but this is no occuring with a non-tainted kernel.

I recently read a thread on the mailing list about how having preempting enabled can hide bugs, so I have disabled all forms of preempting, but I still am unable to determine the cause.

I have also attached a second system to this system via a null modem cable and am capturing kernel output by redirecting its console to the serial port.

However, the kernel is not outputing any information when it locks up.

Can anyone advise on the following:
- determining why the kernel is locking up
- locating where in the code is causing the lock up
- alternative methods of debugging
- increasing the console log level


I have attached my kernel config file to this email.

Please advise,
Lee



--
Lee
[email protected]

08:06:58 up 2 days, 13:17, 0 users, load average: 0.24, 0.12, 0.06


Attachments:
(No filename) (1.11 kB)
bzImage-config (26.86 kB)
Download all attachments

2005-11-09 14:48:26

by Lee

[permalink] [raw]
Subject: Re: kernel crash debugging

> Can anyone advise on the following:
> - determining why the kernel is locking up
> - locating where in the code is causing the lock up
> - alternative methods of debugging
> - increasing the console log level

After booting my kernel with the options 'debug loglevel=9' I was able to get proper OOPS reports from the kernel on the serial console, which I have attached to this email.

Please advise on how I can help debug / fix this problem.

Thanks
Lee


--
Lee
[email protected]

08:45:44 up 2 days, 13:56, 1 user, load average: 0.00, 0.00, 0.00


Attachments:
(No filename) (565.00 B)
kernel-2.6.13-gentoo-r3-crash.txt (5.27 kB)
Download all attachments

2005-11-11 20:14:24

by Robert Hancock

[permalink] [raw]
Subject: Re: kernel crash debugging

Lee wrote:
> Hello,
>
> I am running kernel version '2.6.13-gentoo-r3'
>
> My hardware is as follows:
> - motherboard: Tyan Tiger 230T
> - processors: 2 Pentium II 1.13Ghz
> - memory: 1.5GB
>
> I have been having intermittent lockups for a while now.
>
> At first, I thought it had something to do with vmware, but this is no occuring with a non-tainted kernel.

First thing I would try with those kind of faults is Memtest86, you
could have some bad RAM or bad memory timing settings..

--
Robert Hancock Saskatoon, SK, Canada
To email, remove "nospam" from [email protected]
Home Page: http://www.roberthancock.com/

2005-11-11 20:23:23

by Lee

[permalink] [raw]
Subject: Re: kernel crash debugging

> > I am running kernel version '2.6.13-gentoo-r3'
> >
> > My hardware is as follows:
> > - motherboard: Tyan Tiger 230T
> > - processors: 2 Pentium II 1.13Ghz
> > - memory: 1.5GB
> >
> > I have been having intermittent lockups for a while now.
> >
> > At first, I thought it had something to do with vmware, but this is no occuring with a non-tainted kernel.
>
> First thing I would try with those kind of faults is Memtest86, you
> could have some bad RAM or bad memory timing settings..

I could understand that concept except for one thing:
- with 4k stacks turned on, i have the lockups
- with 4k stacks turned off, i have not had a single lock up at all.

Per an earlier email from '[email protected]', it appears that there is still an execution path in the kernel with is causing a stack overflow.




--
Lee
[email protected]

14:18:50 up 4 days, 19:29, 1 user, load average: 2.80, 2.72, 2.30