2011-05-28 12:01:41

by Justin Piszcz

[permalink] [raw]
Subject: 2.6.39 kernel panic w/screenshots & kernel cfg link provided

Hello,

Since I updated to 2.6.39, my system has been crashing on a random basis,
can someone identify what is going on here?

I've checked the hardware it is OK and this only started to occur after I
moved to 2.6.39 from what I can tell..

Kernel crash/panic, hang:
http://home.comcast.net/~jpiszcz/20110528/2639-ss1.jpg
http://home.comcast.net/~jpiszcz/20110528/2639-ss2.jpg

Kernel .config:
http://home.comcast.net/~jpiszcz/20110528/config-2.6.39.txt

Justin.


2011-05-28 19:52:06

by Daniel Thaler

[permalink] [raw]
Subject: Re: 2.6.39 kernel panic w/screenshots & kernel cfg link provided

On 28.05.2011 14:01, Justin Piszcz wrote:
> Hello,
>
> Since I updated to 2.6.39, my system has been crashing on a random basis,
> can someone identify what is going on here?
>
> I've checked the hardware it is OK and this only started to occur after I
> moved to 2.6.39 from what I can tell..
>
> Kernel crash/panic, hang:
> http://home.comcast.net/~jpiszcz/20110528/2639-ss1.jpg
> http://home.comcast.net/~jpiszcz/20110528/2639-ss2.jpg
>
> Kernel .config:
> http://home.comcast.net/~jpiszcz/20110528/config-2.6.39.txt
>
> Justin.

I've just had a very similar panic.

Here's a screenshot: http://dl.dropbox.com/u/3531548/panic.png

Daniel

2011-05-31 14:54:52

by Brad Campbell

[permalink] [raw]
Subject: Re: 2.6.39 kernel panic w/screenshots & kernel cfg link provided

On 28/05/11 20:01, Justin Piszcz wrote:
> Hello,
>
> Since I updated to 2.6.39, my system has been crashing on a random
> basis, can someone identify what is going on here?
>
> I've checked the hardware it is OK and this only started to occur after
> I moved to 2.6.39 from what I can tell..
>
> Kernel crash/panic, hang:
> http://home.comcast.net/~jpiszcz/20110528/2639-ss1.jpg
> http://home.comcast.net/~jpiszcz/20110528/2639-ss2.jpg
>
> Kernel .config:
> http://home.comcast.net/~jpiszcz/20110528/config-2.6.39.txt

Just a me too I'm afraid.
On a 2.6.39 kernel I get these from about 3 of 4 boots.
If I append "panic=30" to the command line and leave it go, eventually I
get into a working kernel.

I've seen them on a 2.6.38.7 kernel also, but far harder to hit and I've
not seen them at all on a 2.6.38.2 kernel.

Once the kernel is running I don't seem to have an issue.

brad@srv:~$ lspci
00:00.0 Host bridge: Advanced Micro Devices [AMD] RS880 Host Bridge
00:01.0 PCI bridge: ASRock Incorporation Device 9602
00:02.0 PCI bridge: Advanced Micro Devices [AMD] RS780 PCI to PCI bridge
(ext gfx port 0)
00:03.0 PCI bridge: Advanced Micro Devices [AMD] RS780 PCI to PCI bridge
(ext gfx port 1)
00:04.0 PCI bridge: Advanced Micro Devices [AMD] RS780 PCI to PCI bridge
(PCIE port 0)
00:09.0 PCI bridge: Advanced Micro Devices [AMD] RS780 PCI to PCI bridge
(PCIE port 4)
00:0a.0 PCI bridge: Advanced Micro Devices [AMD] RS780 PCI to PCI bridge
(PCIE port 5)
00:11.0 SATA controller: ATI Technologies Inc SB700/SB800 SATA
Controller [AHCI mode] (rev 40)
00:12.0 USB Controller: ATI Technologies Inc SB700/SB800 USB OHCI0
Controller
00:12.2 USB Controller: ATI Technologies Inc SB700/SB800 USB EHCI Controller
00:13.0 USB Controller: ATI Technologies Inc SB700/SB800 USB OHCI0
Controller
00:13.2 USB Controller: ATI Technologies Inc SB700/SB800 USB EHCI Controller
00:14.0 SMBus: ATI Technologies Inc SBx00 SMBus Controller (rev 41)
00:14.2 Audio device: ATI Technologies Inc SBx00 Azalia (Intel HDA) (rev 40)
00:14.3 ISA bridge: ATI Technologies Inc SB700/SB800 LPC host controller
(rev 40)
00:14.4 PCI bridge: ATI Technologies Inc SBx00 PCI to PCI Bridge (rev 40)
00:14.5 USB Controller: ATI Technologies Inc SB700/SB800 USB OHCI2
Controller
00:16.0 USB Controller: ATI Technologies Inc SB700/SB800 USB OHCI0
Controller
00:16.2 USB Controller: ATI Technologies Inc SB700/SB800 USB EHCI Controller
00:18.0 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor
HyperTransport Configuration
00:18.1 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor
Address Map
00:18.2 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor
DRAM Controller
00:18.3 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor
Miscellaneous Control
00:18.4 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor
Link Control
01:05.0 VGA compatible controller: ATI Technologies Inc RS880 [Radeon HD
4250]
02:00.0 USB Controller: NEC Corporation uPD720200 USB 3.0 Host
Controller (rev 03)
03:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd.
RTL8111/8168B PCI Express Gigabit Ethernet controller (rev 06)
04:00.0 RAID bus controller: LSI Logic / Symbios Logic MegaRAID SAS 9240
(rev 02)
05:00.0 RAID bus controller: LSI Logic / Symbios Logic MegaRAID SAS 9240
(rev 02)
07:00.0 SCSI storage controller: Marvell Technology Group Ltd. 88SX7042
PCI-e 4-port SATA-II (rev 02)

2011-06-05 20:04:55

by Justin Piszcz

[permalink] [raw]
Subject: Re: 2.6.39 kernel panic w/screenshots & kernel cfg link provided



On Tue, 31 May 2011, Brad Campbell wrote:

> On 28/05/11 20:01, Justin Piszcz wrote:
>> Hello,

Hi,

This was happening on a regular basis, since then I have performed the
following: (sdc,sdd,sde were recently added)

# hal-disable-polling --device /dev/sr0
Polling is already disabled on the given drive.
# hal-disable-polling --device /dev/sdd
Polling for drive /dev/sdd have been disabled. The fdi file written was
# hal-disable-polling --device /dev/sde
Polling for drive /dev/sde have been disabled. The fdi file written was
# hal-disable-polling --device /dev/sdc
Polling for drive /dev/sdc have been disabled. The fdi file written was

# ps auxww|grep udisks
root 4125 0.0 0.0 127464 5844 ? Sl 07:54 0:00 /usr/lib/udisks/udisks-daemon
root 4126 0.0 0.0 45088 1100 ? S 07:54 0:00 udisks-daemon: polling /dev/sr0 /dev/sdd /dev/sde /dev/sdc
root 13335 0.0 0.0 8648 844 pts/8 S+ 08:23 0:00 grep udisks
# dpkg -S /usr/lib/udisks/udisks-daemon
udisks: /usr/lib/udisks/udisks-daemon
# apt-get remove udisks

# uname -a
Linux box 2.6.39 #11 SMP Tue May 31 15:51:52 EDT 2011 x86_64 GNU/Linux

So far, my system has not crashed in 4 days after doing this, I am also
using the 'threadirqs' boot option, if it crashes again I'll provide an update
but so far it has been stable..

Justin.

2011-06-06 22:02:49

by Justin Piszcz

[permalink] [raw]
Subject: Re: 2.6.39 kernel panic w/screenshots & kernel cfg link provided



On Sun, 5 Jun 2011, Justin Piszcz wrote:

>
>
> On Tue, 31 May 2011, Brad Campbell wrote:
>
>> On 28/05/11 20:01, Justin Piszcz wrote:
>>> Hello,
>
> Hi,
>
> This was happening on a regular basis, since then I have performed the
> following: (sdc,sdd,sde were recently added)
>
> # hal-disable-polling --device /dev/sr0
> Polling is already disabled on the given drive.
> # hal-disable-polling --device /dev/sdd
> Polling for drive /dev/sdd have been disabled. The fdi file written was
> # hal-disable-polling --device /dev/sde
> Polling for drive /dev/sde have been disabled. The fdi file written was
> # hal-disable-polling --device /dev/sdc
> Polling for drive /dev/sdc have been disabled. The fdi file written was
>
> # ps auxww|grep udisks
> root 4125 0.0 0.0 127464 5844 ? Sl 07:54 0:00
> /usr/lib/udisks/udisks-daemon
> root 4126 0.0 0.0 45088 1100 ? S 07:54 0:00
> udisks-daemon: polling /dev/sr0 /dev/sdd /dev/sde /dev/sdc
> root 13335 0.0 0.0 8648 844 pts/8 S+ 08:23 0:00 grep udisks
> # dpkg -S /usr/lib/udisks/udisks-daemon
> udisks: /usr/lib/udisks/udisks-daemon
> # apt-get remove udisks
>
> # uname -a
> Linux box 2.6.39 #11 SMP Tue May 31 15:51:52 EDT 2011 x86_64 GNU/Linux
>
> So far, my system has not crashed in 4 days after doing this, I am also using
> the 'threadirqs' boot option, if it crashes again I'll provide an update
> but so far it has been stable..
>
> Justin.
>

For others who also have crashes, do your crashes persist when you disable
these options (don't use ACPI) for ATA/SATA.

CONFIG_ATA_ACPI=y
CONFIG_PATA_ACPI=m

Recently I added a lot of options to my .config (compiled them as
modules), I noticed two items, one, when all of the ATA devices (SATA),
not the deprecated ones and/or the ACPI option for libata, that seemed to
be the final culprit that may have been causing my issues, no crashes..
yet.

Justin.