2001-04-01 01:09:55

by Ketil Froyn

[permalink] [raw]
Subject: oops in uhci.c running 2.4.2-ac28

Hi.

While running kernel 2.4.2-ac28, I switched on spinlock debugging and
verbose BUG() reporting (I always use sysrq). Anyway, while running this I
got an oops after about 2 or 3 minutes running, several times, exact same
place each time, which I traced back to rh_int_timer_do(). This was in
uhci.c (I used CONFIG_USB_UHCI_ALT).

The oops was at [<c01e4d24>], and rh_int_timer_do starts at [<c01e4cac>]
(I'd calculate the offset in the function, but it's too late at night). I
recompiled with usb-uhci.c instead (CONFIG_USB_UHCI), and now I don't get
the oops any more. Using nm vmlinux, I got this stack:

rh_int_timer_do
timer_bh
bh_action
tasklet_hi_action
gcc_compiled. (?)
do_IRQ
default_idle

If more info is needed, I'll be glad to reproduce the bug and copy the
oops here.

Also, the reason I started all this debugging was that I have a freecom
usb CD-RW that stops when I try to access it (the process is in
uninterruptible sleep and never comes out) when I access some files on
some CDs. If anyone knows about this, I'd appreciate any tips. Also, if
anyone has the technical specs on it, I'd even be willing to (try to) look
at it to see if I can spot something with the freecom code... haven't
found anything on http://www.freecom.com yet.

Linux XXXXXX 2.4.2-ac28 #3 Sun Apr 1 01:16:44 CEST 2001 i686 unknown
Gnu C egcs-2.91.66
Gnu make 3.78.1
binutils 2.9.5.0.22
util-linux 2.10f
mount 2.10f
modutils 2.4.2
e2fsprogs 1.19
pcmcia-cs 3.1.8
PPP 2.4.0b1
Linux C Library 2.1.3
Dynamic linker (ldd) 2.1.3
Procps 2.0.6
Net-tools 1.54
Console-tools 0.3.3
Sh-utils 2.0
Modules Loaded ne 8390 vfat fat ufs

Kind Regards,
Ketil Froyn


2001-04-02 22:56:41

by Pete Zaitcev

[permalink] [raw]
Subject: Re: oops in uhci.c running 2.4.2-ac28

> Date: Sun, 1 Apr 2001 03:35:03 +0200 (CEST)
> From: Ketil Froyn <[email protected]>
> To: <[email protected]>

> While running kernel 2.4.2-ac28, I switched on spinlock debugging and
> verbose BUG() reporting (I always use sysrq). Anyway, while running this I
> got an oops after about 2 or 3 minutes running, several times, exact same
> place each time, which I traced back to rh_int_timer_do().
> This was in uhci.c (I used CONFIG_USB_UHCI_ALT). [...] I
> recompiled with usb-uhci.c instead (CONFIG_USB_UHCI), and now I don't get
> the oops any more.

I am behind usb-uhci for a reason. Alan bounced your report
to me but I do not see a case for action...

-- Pete

2001-04-02 23:06:51

by Johannes Erdfelt

[permalink] [raw]
Subject: Re: oops in uhci.c running 2.4.2-ac28

On Mon, Apr 02, 2001, Pete Zaitcev <[email protected]> wrote:
> > Date: Sun, 1 Apr 2001 03:35:03 +0200 (CEST)
> > From: Ketil Froyn <[email protected]>
> > To: <[email protected]>
>
> > While running kernel 2.4.2-ac28, I switched on spinlock debugging and
> > verbose BUG() reporting (I always use sysrq). Anyway, while running this I
> > got an oops after about 2 or 3 minutes running, several times, exact same
> > place each time, which I traced back to rh_int_timer_do().
> > This was in uhci.c (I used CONFIG_USB_UHCI_ALT). [...] I
> > recompiled with usb-uhci.c instead (CONFIG_USB_UHCI), and now I don't get
> > the oops any more.
>
> I am behind usb-uhci for a reason. Alan bounced your report
> to me but I do not see a case for action...

What do you mean by "behind".

Have you tried the latest patches for uhci.c?

JE

2001-04-02 23:29:01

by Jeff Golds

[permalink] [raw]
Subject: Re: oops in uhci.c running 2.4.2-ac28

Pete Zaitcev wrote:
>
> > Date: Sun, 1 Apr 2001 03:35:03 +0200 (CEST)
> > From: Ketil Froyn <[email protected]>
> > To: <[email protected]>
>
> > While running kernel 2.4.2-ac28, I switched on spinlock debugging and
> > verbose BUG() reporting (I always use sysrq). Anyway, while running this I
> > got an oops after about 2 or 3 minutes running, several times, exact same
> > place each time, which I traced back to rh_int_timer_do().
> > This was in uhci.c (I used CONFIG_USB_UHCI_ALT). [...] I
> > recompiled with usb-uhci.c instead (CONFIG_USB_UHCI), and now I don't get
> > the oops any more.
>
> I am behind usb-uhci for a reason. Alan bounced your report
> to me but I do not see a case for action...
>
> -- Pete
> -

Let me show what I got with the 2.4.2 kernel with USB support enabled.

Mar 19 14:10:00 Eng99 kernel: uhci: host controller halted. very bad
Mar 19 14:10:31 Eng99 last message repeated 108 times
Mar 19 14:11:37 Eng99 last message repeated 93 times
Mar 19 14:12:39 Eng99 last message repeated 87 times
Mar 19 14:13:40 Eng99 last message repeated 20 times
Mar 19 14:14:45 Eng99 last message repeated 42 times
Mar 19 14:15:46 Eng99 last message repeated 47 times
Mar 19 14:16:47 Eng99 last message repeated 127 times
Mar 19 14:17:50 Eng99 last message repeated 7074 times
Mar 19 14:18:51 Eng99 last message repeated 3342 times
Mar 19 14:19:52 Eng99 last message repeated 10948 times
Mar 19 14:20:00 Eng99 last message repeated 15
times

This happens after simply fiddling around with ethernet settings (it's a
PCI ethernet card). In fact, my syslog is FULL of these messages... my
syslog was 3x larger than usual. The console is just about unusable
because of all the spam.

Something seems terribly wrong with the uhci driver... I've disabled it
on my system and it's fine now (I don't need USB).

My system:
Slot 1 P3-850
VIA chipset MB (not sure of exact chipset, can find out if needed)

-Jeff

--
Jeff Golds
[email protected]

2001-04-02 23:52:01

by Johannes Erdfelt

[permalink] [raw]
Subject: Re: oops in uhci.c running 2.4.2-ac28

On Mon, Apr 02, 2001, Jeff Golds <[email protected]> wrote:
> Let me show what I got with the 2.4.2 kernel with USB support enabled.
>
> Mar 19 14:10:00 Eng99 kernel: uhci: host controller halted. very bad
> Mar 19 14:10:31 Eng99 last message repeated 108 times
> Mar 19 14:11:37 Eng99 last message repeated 93 times
> Mar 19 14:12:39 Eng99 last message repeated 87 times
> Mar 19 14:13:40 Eng99 last message repeated 20 times
> Mar 19 14:14:45 Eng99 last message repeated 42 times
> Mar 19 14:15:46 Eng99 last message repeated 47 times
> Mar 19 14:16:47 Eng99 last message repeated 127 times
> Mar 19 14:17:50 Eng99 last message repeated 7074 times
> Mar 19 14:18:51 Eng99 last message repeated 3342 times
> Mar 19 14:19:52 Eng99 last message repeated 10948 times
> Mar 19 14:20:00 Eng99 last message repeated 15
> times
>
> This happens after simply fiddling around with ethernet settings (it's a
> PCI ethernet card). In fact, my syslog is FULL of these messages... my
> syslog was 3x larger than usual. The console is just about unusable
> because of all the spam.
>
> Something seems terribly wrong with the uhci driver... I've disabled it
> on my system and it's fine now (I don't need USB).

Do you get the same messages with the usb-uhci driver?

> My system:
> Slot 1 P3-850
> VIA chipset MB (not sure of exact chipset, can find out if needed)

Some of the VIA chipsets have port aliasing problems supposedely. This
may cause your controller to go insane like you've described.

JE

2001-04-03 00:14:11

by Jeff Golds

[permalink] [raw]
Subject: Re: oops in uhci.c running 2.4.2-ac28

Johannes Erdfelt wrote:
>
> On Mon, Apr 02, 2001, Jeff Golds <[email protected]> wrote:
> > Let me show what I got with the 2.4.2 kernel with USB support enabled.
> >
> > Mar 19 14:10:00 Eng99 kernel: uhci: host controller halted. very bad
> > Mar 19 14:10:31 Eng99 last message repeated 108 times
> > Mar 19 14:11:37 Eng99 last message repeated 93 times
> > Mar 19 14:12:39 Eng99 last message repeated 87 times
> > Mar 19 14:13:40 Eng99 last message repeated 20 times
> > Mar 19 14:14:45 Eng99 last message repeated 42 times
> > Mar 19 14:15:46 Eng99 last message repeated 47 times
> > Mar 19 14:16:47 Eng99 last message repeated 127 times
> > Mar 19 14:17:50 Eng99 last message repeated 7074 times
> > Mar 19 14:18:51 Eng99 last message repeated 3342 times
> > Mar 19 14:19:52 Eng99 last message repeated 10948 times
> > Mar 19 14:20:00 Eng99 last message repeated 15
> > times
> >
> > This happens after simply fiddling around with ethernet settings (it's a
> > PCI ethernet card). In fact, my syslog is FULL of these messages... my
> > syslog was 3x larger than usual. The console is just about unusable
> > because of all the spam.
> >
> > Something seems terribly wrong with the uhci driver... I've disabled it
> > on my system and it's fine now (I don't need USB).
>
> Do you get the same messages with the usb-uhci driver?
>

Don't think I tried that one.

> > My system:
> > Slot 1 P3-850
> > VIA chipset MB (not sure of exact chipset, can find out if needed)
>
> Some of the VIA chipsets have port aliasing problems supposedely. This
> may cause your controller to go insane like you've described.
>
> JE
>

That could explain the issue. Fortunately, I don't need USB so I can
avoid the spam, I just thought someone might like to hear about it.

-Jeff

P.S. Sorry for responding directly to you, Johannes.

--
Jeff Golds
[email protected]

2001-04-03 07:59:49

by Erik Oomen

[permalink] [raw]
Subject: Re: oops in uhci.c running 2.4.2-ac28

On Mon, 2 Apr 2001, Johannes Erdfelt wrote:

> > Let me show what I got with the 2.4.2 kernel with USB support enabled.
> >
> > Mar 19 14:10:00 Eng99 kernel: uhci: host controller halted. very bad

I am having the same problem with kernel 2.4.3 on a Intel 430TX system:

** Last lines of DMESG:

uhci.c: suspend_hc
uhci: host controller halted. very bad
uhci.c: wakeup_hc

** lspci
root@immtng:/home/oomen > lspci
00:00.0 Host bridge: Intel Corporation 430TX - 82439TX MTXC (rev 01)
00:01.0 VGA compatible controller: Chips and Technologies F69000 HiQVideo
(rev 64)
00:02.0 Ethernet controller: Digital Equipment Corporation DECchip
21142/43 (rev 41)
00:04.0 Bridge: Tundra Semiconductor Corp. CA91C042 [Universe] (rev 02)
00:05.0 SCSI storage controller: Symbios Logic Inc. (formerly NCR) 53c860
(rev 02)
00:07.0 ISA bridge: Intel Corporation 82371AB PIIX4 ISA (rev 02)
00:07.1 IDE interface: Intel Corporation 82371AB PIIX4 IDE (rev 01)
00:07.2 USB Controller: Intel Corporation 82371AB PIIX4 USB (rev 01)
00:07.3 Bridge: Intel Corporation 82371AB PIIX4 ACPI (rev 02)

** lsmod
root@immtng:/home/oomen > lsmod
Module Size Used by
uhci 18736 0 (unused)
universe 83728 0
sym53c8xx 55904 0 (unused)
usbcore 51088 0 [uhci]
tulip 34880 1
serial 43568 0 (autoclean)

** cat /proc/cpuinfo
root@immtng:/home/oomen > cat /proc/cpuinfo
processor : 0
vendor_id : GenuineIntel
cpu family : 5
model : 8
model name : Mobile Pentium MMX
stepping : 1
cpu MHz : 267.278
fdiv_bug : no
hlt_bug : no
f00f_bug : yes
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 1
wp : yes
flags : fpu vme de pse tsc msr mce cx8 mmx
bogomips : 532.48

Erik.