2007-05-21 22:29:29

by Uwe Bugla

[permalink] [raw]
Subject: Kernel Oops with 2.6.21.1 on an AMD K7 machine

Hello everybody,

kernel 2.6.21.1 makes my machine Oops about 20 seconds after KDE 3,5 is brought up showing its graphical login (kdm):

lspci -v, based on a "sane" kernel 2.6.22-rc2 or 2.6.20.11 (with both of them the Oops does not happen at all) looks like this:

00:00.0 Host bridge: Silicon Integrated Systems [SiS] 740 Host (rev 01)
Flags: bus master, medium devsel, latency 32
Memory at d0000000 (32-bit, non-prefetchable) [size=64M]
Capabilities: [c0] AGP version 2.0

00:01.0 PCI bridge: Silicon Integrated Systems [SiS] Virtual PCI-to-PCI bridge (AGP) (prog-if 00 [Normal decode])
Flags: bus master, fast devsel, latency 64
Bus: primary=00, secondary=01, subordinate=02, sec-latency=64
I/O behind bridge: 0000a000-0000afff
Memory behind bridge: cfd00000-cfefffff
Prefetchable memory behind bridge: bfa00000-cfbfffff

00:02.0 ISA bridge: Silicon Integrated Systems [SiS] SiS962 [MuTIOL Media IO] (rev 04)
Flags: bus master, medium devsel, latency 0

00:02.1 SMBus: Silicon Integrated Systems [SiS] SiS961/2 SMBus Controller
Flags: medium devsel
I/O ports at 0c00 [size=32]

00:02.5 IDE interface: Silicon Integrated Systems [SiS] 5513 [IDE] (prog-if 80 [Master])
Flags: bus master, medium devsel, latency 128
[virtual] Memory at 000001f0 (32-bit, non-prefetchable) [disabled] [size=8]
[virtual] Memory at 000003f0 (type 3, non-prefetchable) [disabled] [size=1]
[virtual] Memory at 00000170 (32-bit, non-prefetchable) [disabled] [size=8]
[virtual] Memory at 00000370 (type 3, non-prefetchable) [disabled] [size=1]
I/O ports at ff00 [size=16]

00:02.7 Multimedia audio controller: Silicon Integrated Systems [SiS] AC'97 Sound Controller (rev a0)
Subsystem: C-Media Electronics Inc Unknown device 0300
Flags: bus master, medium devsel, latency 64, IRQ 11
I/O ports at d800 [size=256]
I/O ports at d400 [size=128]
Capabilities: [48] Power Management version 2

00:03.0 USB Controller: Silicon Integrated Systems [SiS] USB 1.0 Controller (rev 0f) (prog-if 10 [OHCI])
Subsystem: Micro-Star International Co., Ltd. K7SOM+ 5.2C Motherboard
Flags: bus master, medium devsel, latency 64, IRQ 5
Memory at cfff7000 (32-bit, non-prefetchable) [size=4K]

00:03.1 USB Controller: Silicon Integrated Systems [SiS] USB 1.0 Controller (rev 0f) (prog-if 10 [OHCI])
Subsystem: Micro-Star International Co., Ltd. K7SOM+ 5.2C Motherboard
Flags: bus master, medium devsel, latency 64, IRQ 11
Memory at cfff8000 (32-bit, non-prefetchable) [size=4K]

00:03.2 USB Controller: Silicon Integrated Systems [SiS] USB 1.0 Controller (rev 0f) (prog-if 10 [OHCI])
Subsystem: Micro-Star International Co., Ltd. K7SOM+ 5.2C Motherboard
Flags: bus master, medium devsel, latency 64, IRQ 11
Memory at cfff9000 (32-bit, non-prefetchable) [size=4K]

00:03.3 USB Controller: Silicon Integrated Systems [SiS] USB 2.0 Controller (prog-if 20 [EHCI])
Subsystem: Micro-Star International Co., Ltd. Unknown device 5470
Flags: bus master, medium devsel, latency 64, IRQ 11
Memory at cfffa000 (32-bit, non-prefetchable) [size=4K]
Capabilities: [50] Power Management version 2

00:04.0 Ethernet controller: Silicon Integrated Systems [SiS] SiS900 PCI Fast Ethernet (rev 90)
Subsystem: Silicon Integrated Systems [SiS] SiS900 10/100 Ethernet Adapter
Flags: bus master, medium devsel, latency 64, IRQ 11
I/O ports at d000 [size=256]
Memory at cfff6000 (32-bit, non-prefetchable) [size=4K]
Expansion ROM at cffc0000 [disabled] [size=128K]
Capabilities: [40] Power Management version 2

00:09.0 SCSI storage controller: Adaptec AIC-7861 (rev 01)
Flags: bus master, medium devsel, latency 64, IRQ 5
I/O ports at dc00 [disabled] [size=256]
Memory at cfffb000 (32-bit, non-prefetchable) [size=4K]
Expansion ROM at cffe0000 [disabled] [size=64K]

00:0b.0 Ethernet controller: 3Com Corporation 3c905B 100BaseTX [Cyclone] (rev 24)
Subsystem: 3Com Corporation 3C905B Fast Etherlink XL 10/100
Flags: bus master, medium devsel, latency 64, IRQ 11
I/O ports at cc00 [size=128]
Memory at cfff5f80 (32-bit, non-prefetchable) [size=128]
Expansion ROM at cffa0000 [disabled] [size=128K]
Capabilities: [dc] Power Management version 1

01:00.0 VGA compatible controller: Silicon Integrated Systems [SiS] 65x/M650/740 PCI/AGP VGA Display Adapter (prog-if 00 [VGA])
Subsystem: Silicon Integrated Systems [SiS] 65x/M650/740 PCI/AGP VGA Display Adapter
Flags: 66MHz, medium devsel
BIST result: 00
Memory at c0000000 (32-bit, prefetchable) [size=128M]
Memory at cfee0000 (32-bit, non-prefetchable) [size=128K]
I/O ports at ac00 [size=128]
Capabilities: [40] Power Management version 2
Capabilities: [50] AGP version 2.0

Please notice:

On an Intel P4 machine with but Intel chipset and Broadcom b44 NIC that Oops does not happen in connection with 2.6.21.1!


Questions:

1. Does anyone have a similar machine (AMD Athlon 1800 XP processor) with similar hardware configuration?

2. If yes, can anyone confirm that Oops problem in connection with that similar hardware architecture and kernel 2.6.21.1.

Regards

Uwe

--
Psssst! Schon vom neuen GMX MultiMessenger geh?rt?
Der kanns mit allen: http://www.gmx.net/de/go/multimessenger


2007-05-21 22:55:17

by Ray Lee

[permalink] [raw]
Subject: Re: Kernel Oops with 2.6.21.1 on an AMD K7 machine

On 5/21/07, Uwe Bugla <[email protected]> wrote:
> kernel 2.6.21.1 makes my machine Oops about 20 seconds after KDE 3,5 is brought up showing its graphical login (kdm):

Please include the output out of the actual oops message. It has all
sorts of information that will help track this down.

The "REPORTING-BUGS" file in the root of the kernel source directory
has an example format that covers the sorts of stuff that they want to
see.

Thanks,

Ray

2007-05-21 22:55:50

by Andrew Morton

[permalink] [raw]
Subject: Re: Kernel Oops with 2.6.21.1 on an AMD K7 machine

On Tue, 22 May 2007 00:28:57 +0200
"Uwe Bugla" <[email protected]> wrote:

> kernel 2.6.21.1 makes my machine Oops about 20 seconds after KDE 3,5 is brought up showing its graphical login (kdm):

We're completely blind unless we can see that oops info. If it reached
/var/log/messages then please send it.

If it didn't please set up netconsole (Documentation/networking/netconsole.txt)

If for some reason that isn't possible then you couldry switching to
the vga concole (alt-ctl-F1) before the ooops and see if the output
is visible on the VGA screen. If it is there, a digital photo will suit
(you can email it to me and I can stick it on a server somewhere if
you like). The last option is to write the oops info down with a
pencil and type it in again, but that's painful (all the hex addresses
in the stack backtrace can be omitted)

Thanks.

2007-05-21 23:21:15

by Wolfgang Walter

[permalink] [raw]
Subject: Re: Kernel Oops with 2.6.21.1 on an AMD K7 machine

Uwe Bugla schrieb:
> Hello everybody,
>
> kernel 2.6.21.1 makes my machine Oops about 20 seconds after KDE 3,5 is brought up showing its graphical login (kdm):
>
> lspci -v, based on a "sane" kernel 2.6.22-rc2 or 2.6.20.11 (with both of them the Oops does not happen at all) looks like this:
>
> 00:00.0 Host bridge: Silicon Integrated Systems [SiS] 740 Host (rev 01)
> Flags: bus master, medium devsel, latency 32
> Memory at d0000000 (32-bit, non-prefetchable) [size=64M]
> Capabilities: [c0] AGP version 2.0
>
> 00:01.0 PCI bridge: Silicon Integrated Systems [SiS] Virtual PCI-to-PCI bridge (AGP) (prog-if 00 [Normal decode])
> Flags: bus master, fast devsel, latency 64
> Bus: primary=00, secondary=01, subordinate=02, sec-latency=64
> I/O behind bridge: 0000a000-0000afff
> Memory behind bridge: cfd00000-cfefffff
> Prefetchable memory behind bridge: bfa00000-cfbfffff
>
> 00:02.0 ISA bridge: Silicon Integrated Systems [SiS] SiS962 [MuTIOL Media IO] (rev 04)
> Flags: bus master, medium devsel, latency 0
>
> 00:02.1 SMBus: Silicon Integrated Systems [SiS] SiS961/2 SMBus Controller
> Flags: medium devsel
> I/O ports at 0c00 [size=32]
>
> 00:02.5 IDE interface: Silicon Integrated Systems [SiS] 5513 [IDE] (prog-if 80 [Master])
> Flags: bus master, medium devsel, latency 128
> [virtual] Memory at 000001f0 (32-bit, non-prefetchable) [disabled] [size=8]
> [virtual] Memory at 000003f0 (type 3, non-prefetchable) [disabled] [size=1]
> [virtual] Memory at 00000170 (32-bit, non-prefetchable) [disabled] [size=8]
> [virtual] Memory at 00000370 (type 3, non-prefetchable) [disabled] [size=1]
> I/O ports at ff00 [size=16]
>
> 00:02.7 Multimedia audio controller: Silicon Integrated Systems [SiS] AC'97 Sound Controller (rev a0)
> Subsystem: C-Media Electronics Inc Unknown device 0300
> Flags: bus master, medium devsel, latency 64, IRQ 11
> I/O ports at d800 [size=256]
> I/O ports at d400 [size=128]
> Capabilities: [48] Power Management version 2
>
> 00:03.0 USB Controller: Silicon Integrated Systems [SiS] USB 1.0 Controller (rev 0f) (prog-if 10 [OHCI])
> Subsystem: Micro-Star International Co., Ltd. K7SOM+ 5.2C Motherboard
> Flags: bus master, medium devsel, latency 64, IRQ 5
> Memory at cfff7000 (32-bit, non-prefetchable) [size=4K]
>
> 00:03.1 USB Controller: Silicon Integrated Systems [SiS] USB 1.0 Controller (rev 0f) (prog-if 10 [OHCI])
> Subsystem: Micro-Star International Co., Ltd. K7SOM+ 5.2C Motherboard
> Flags: bus master, medium devsel, latency 64, IRQ 11
> Memory at cfff8000 (32-bit, non-prefetchable) [size=4K]
>
> 00:03.2 USB Controller: Silicon Integrated Systems [SiS] USB 1.0 Controller (rev 0f) (prog-if 10 [OHCI])
> Subsystem: Micro-Star International Co., Ltd. K7SOM+ 5.2C Motherboard
> Flags: bus master, medium devsel, latency 64, IRQ 11
> Memory at cfff9000 (32-bit, non-prefetchable) [size=4K]
>
> 00:03.3 USB Controller: Silicon Integrated Systems [SiS] USB 2.0 Controller (prog-if 20 [EHCI])
> Subsystem: Micro-Star International Co., Ltd. Unknown device 5470
> Flags: bus master, medium devsel, latency 64, IRQ 11
> Memory at cfffa000 (32-bit, non-prefetchable) [size=4K]
> Capabilities: [50] Power Management version 2
>
> 00:04.0 Ethernet controller: Silicon Integrated Systems [SiS] SiS900 PCI Fast Ethernet (rev 90)
> Subsystem: Silicon Integrated Systems [SiS] SiS900 10/100 Ethernet Adapter
> Flags: bus master, medium devsel, latency 64, IRQ 11
> I/O ports at d000 [size=256]
> Memory at cfff6000 (32-bit, non-prefetchable) [size=4K]
> Expansion ROM at cffc0000 [disabled] [size=128K]
> Capabilities: [40] Power Management version 2
>
> 00:09.0 SCSI storage controller: Adaptec AIC-7861 (rev 01)
> Flags: bus master, medium devsel, latency 64, IRQ 5
> I/O ports at dc00 [disabled] [size=256]
> Memory at cfffb000 (32-bit, non-prefetchable) [size=4K]
> Expansion ROM at cffe0000 [disabled] [size=64K]
>
> 00:0b.0 Ethernet controller: 3Com Corporation 3c905B 100BaseTX [Cyclone] (rev 24)
> Subsystem: 3Com Corporation 3C905B Fast Etherlink XL 10/100
> Flags: bus master, medium devsel, latency 64, IRQ 11
> I/O ports at cc00 [size=128]
> Memory at cfff5f80 (32-bit, non-prefetchable) [size=128]
> Expansion ROM at cffa0000 [disabled] [size=128K]
> Capabilities: [dc] Power Management version 1
>
> 01:00.0 VGA compatible controller: Silicon Integrated Systems [SiS] 65x/M650/740 PCI/AGP VGA Display Adapter (prog-if 00 [VGA])
> Subsystem: Silicon Integrated Systems [SiS] 65x/M650/740 PCI/AGP VGA Display Adapter
> Flags: 66MHz, medium devsel
> BIST result: 00
> Memory at c0000000 (32-bit, prefetchable) [size=128M]
> Memory at cfee0000 (32-bit, non-prefetchable) [size=128K]
> I/O ports at ac00 [size=128]
> Capabilities: [40] Power Management version 2
> Capabilities: [50] AGP version 2.0
>
> Please notice:
>
> On an Intel P4 machine with but Intel chipset and Broadcom b44 NIC that Oops does not happen in connection with 2.6.21.1!
>
>
> Questions:
>
> 1. Does anyone have a similar machine (AMD Athlon 1800 XP processor) with similar hardware configuration?
>
> 2. If yes, can anyone confirm that Oops problem in connection with that similar hardware architecture and kernel 2.6.21.1.
>
> Regards
>
> Uwe
>

Maybe it's the sis900-network-driver? See patch

http://git.kernel.org/?p=linux/kernel/git/stable/stable-queue.git;a=blob;f=queue-2.6.21/sis900-allocate-rx-replacement-buffer-before-rx-operation.patch;h=ee9db79405db0144953eab964d34326fcc1b09c8;hb=05ab505f2909acf3a614d3e6a32271c4c1f8a69d

Regards,
--
Wolfgang Walter
Studentenwerk M?nchen
Anstalt des ?ffentlichen Rechts
Leiter EDV
Leopoldstra?e 15
80802 M?nchen
http://www.studentenwerk.mhn.de/

2007-05-21 23:26:37

by Dave Jones

[permalink] [raw]
Subject: Re: Kernel Oops with 2.6.21.1 on an AMD K7 machine

On Tue, May 22, 2007 at 12:28:57AM +0200, Uwe Bugla wrote:
> Hello everybody,
>
> kernel 2.6.21.1 makes my machine Oops about 20 seconds after KDE 3,5 is brought up showing its graphical login (kdm):
>
> lspci -v, based on a "sane" kernel 2.6.22-rc2 or 2.6.20.11 (with both of them the Oops does not happen at all) looks like this:
> 00:04.0 Ethernet controller: Silicon Integrated Systems [SiS] SiS900 PCI Fast Ethernet (rev 90)

This is probably the sis900 oops that is fixed in 2.6.21.2rc1.

See http://lkml.org/lkml/2007/5/21/406

Dave

--
http://www.codemonkey.org.uk

2007-05-22 11:32:10

by Uwe Bugla

[permalink] [raw]
Subject: Re: Kernel Oops with 2.6.21.1 on an AMD K7 machine

Am Dienstag, 22. Mai 2007 13:18 schrieben Sie:
> Am Dienstag, 22. Mai 2007 00:54 schrieben Sie:
> > On Tue, 22 May 2007 00:28:57 +0200
> >
> > "Uwe Bugla" <[email protected]> wrote:
> > > kernel 2.6.21.1 makes my machine Oops about 20 seconds after KDE 3,5 is
> > > brought up showing its graphical login (kdm):
> >
> > We're completely blind unless we can see that oops info.
>
> Yes.
>
> And I am completely blind if I even once rely on some v4l "maintainer" who:
>
> a. has absolutely ZERO knowledge about DVB issues
> b. is not ready to share power and competence with a second or a third
> maintainer
> c. smashes down democratic votes (there have been 4 traditional highly
> experienced DVB maintainers voting for one person to resolve that issue -
> one of them was the "old rock" Ralph Metzler)
> d. reviews code that he absolutely has no idea about and signs it off
> e. gives stupid replies on specific questions due to his proven
> incompetence
>
> Would you call this man helpful and honest? My answer you can easily guess!
>
> > If it reached
> > /var/log/messages then please send it.
> >
> > If it didn't please set up netconsole
> > (Documentation/networking/netconsole.txt)
> >
> > If for some reason that isn't possible then you couldry switching to
> > the vga concole (alt-ctl-F1) before the ooops and see if the output
> > is visible on the VGA screen. If it is there, a digital photo will suit
> > (you can email it to me and I can stick it on a server somewhere if
> > you like). The last option is to write the oops info down with a
> > pencil and type it in again, but that's painful (all the hex addresses
> > in the stack backtrace can be omitted)
> >
> > Thanks.
>
> OK, back to the AMD K7 oops issue now:
>
> I in fact never could believe that a simple buggy NIC driver could ever
> hang up a whole machine!
>
> So, if someone could be kind enough to explain to me in how far that can
> happen I'd be very thankful.
>
> The solution for the 2.6.21.1 oops issue is the following patch, who needs
> to flow into vanilla mainline of 2.6.21.x as quick as possible:
>
> --- a/drivers/net/sis900.c
> +++ b/drivers/net/sis900.c
> @@ -1754,6 +1754,7 @@ static int sis900_rx(struct net_device *
> sis_priv->rx_ring[entry].cmdsts = RX_BUF_SIZE;
> } else {
> struct sk_buff * skb;
> + struct sk_buff * rx_skb;
>
> pci_unmap_single(sis_priv->pci_dev,
> sis_priv->rx_ring[entry].bufptr, RX_BUF_SIZE,
> @@ -1787,10 +1788,10 @@ static int sis900_rx(struct net_device *
> }
>
> /* give the socket buffer to upper layers */
> - skb = sis_priv->rx_skbuff[entry];
> - skb_put(skb, rx_size);
> - skb->protocol = eth_type_trans(skb, net_dev);
> - netif_rx(skb);
> + rx_skb = sis_priv->rx_skbuff[entry];
> + skb_put(rx_skb, rx_size);
> + rx_skb->protocol = eth_type_trans(rx_skb, net_dev);
> + netif_rx(rx_skb);
>
> /* some network statistics */
> if ((rx_status & BCAST) == MCAST)
>
> Lots of thanks for Andrew Morton's helpful explanations how to store oops
> infos. Lots of thanks to all the other people trying to help.
> I love to learn! :)
>
> Best Regards
>
> Uwe

Sorry, I forgot to Cc vger.kernel.org - mea culpa!