2001-10-10 14:03:51

by Alexander Feigl

Subject: PROBLEM: aic7xxx SCSI system hangs

Summary : aic7xxx SCSI system hangs

Full description : After running cdrecord -toc with an audio CD (CDDA) in my
Plextor drive (PX-32TS), the [scsi_eh_0] process remains in uninterruptible
state. All processes that then use the SCSI subsystem (e.g. cdrecord,
mkisofs) end up in uninterruptible state too, which renders the SCSI system
unusable until the next reboot. The SCSI system crashes only with this CD-ROM
drive. My Plextor PX-W1210S does not hang. Reading the TOC with cdda2wav works
here. Somebody told me something similar happens with a Plextor PX40 and
cdda2wav. Others reported hangs but didn't tell me anything about their SCSI
system. (I'm writing a CD recording UI frontend and received some bug
reports.) The problems remain if I don't use my software but call the
commands from the shell. I think it is a kernel problem. Userspace bugs
(cdrecord, mkisofs?) should not hang the SCSI subsystem anyway.
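
One way to confirm which processes are stuck is to scan /proc for tasks in
state D (uninterruptible sleep). The following is only a minimal sketch,
assuming a Linux /proc filesystem; the function names parse_stat and
uninterruptible are made up for illustration and are not part of any tool
mentioned in this report.

```python
import os


def parse_stat(line):
    """Return (pid, comm, state) from the text of one /proc/<pid>/stat file."""
    # comm is wrapped in parentheses and may itself contain spaces or ')',
    # so split on the first '(' and the last ')'.
    left = line.index("(")
    right = line.rindex(")")
    pid = int(line[:left])
    comm = line[left + 1:right]
    state = line[right + 1:].split()[0]
    return pid, comm, state


def uninterruptible(proc_root="/proc"):
    """List (pid, comm) for every process currently in state D."""
    if not os.path.isdir(proc_root):
        return []
    found = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # not a process directory
        try:
            with open(os.path.join(proc_root, entry, "stat")) as fh:
                pid, comm, state = parse_stat(fh.read())
        except (OSError, ValueError):
            continue  # process exited or entry unreadable while scanning
        if state == "D":
            found.append((pid, comm))
    return found


if __name__ == "__main__":
    for pid, comm in uninterruptible():
        print(pid, comm)
```

If the hang described above is present, scsi_eh_0 and any blocked cdrecord
or mkisofs processes would be expected to show up in this list.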

Keywords : SCSI , cdrecord, mkisofs, aic7xxx

Kernel version : Linux version 2.4.11-pre5-xfs ([email protected])
(gcc version egcs-2.91.66 19990314/Linux (egcs-1.1.2 release / Mandrake Linux
8.1)) #1 Mon Okt 8 13:38:15 CEST 2001

I also tried stable and non-XFS kernels, and the problem remains.

Shell command : cdrecord -toc dev=x,y,z (with a CDDA in the drive)

My environment : AMD Athlon 900 MHz
512 MB RAM
2 IDE hard drives
Plextor PX-32TS (the one that causes the problems)
Plextor PX-W1210S
Adaptec 2940U2W SCSI controller (both CD drives attached)
nVidia Geforce 2 GFX (using XFree86 drivers)
ALSA sound system (0.5.10) with a SB Live!
------------------------------------- ver_linux ---------------------------
Gnu C 2.96
Gnu make 3.79.1
binutils 2.11.90.0.8
util-linux 2.11h
mount 2.11h
modutils 2.4.10
e2fsprogs 1.24a
PPP 2.4.1
Linux C Library 2.2.4
Dynamic linker (ldd) 2.2.4
Procps 2.0.7
Net-tools 1.60
Console-tools 0.2.3
Sh-utils 2.0.11
----------------------------------------------------------------------------

------------------------------------- cpuinfo -------------------------------
processor : 0
vendor_id : AuthenticAMD
cpu family : 6
model : 4
model name : AMD Athlon(tm) Processor
stepping : 2
cpu MHz : 908.977
cache size : 256 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 1
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 sep mtrr pge mca cmov pat pse36
mmx fxsr syscall mmxext 3dnowext 3dnow
bogomips : 1808.79
------------------------------------------------------------------------------
----------------------------------- modules ----------------------------------
ppp_deflate 44576 1 (autoclean)
bsd_comp 4928 0 (autoclean)
ppp_async 8032 1 (autoclean)
ppp_generic 22504 3 (autoclean) [ppp_deflate bsd_comp ppp_async]
slhc 6436 1 (autoclean) [ppp_generic]
ipt_LOG 4288 3 (autoclean)
ip6table_filter 2816 0 (autoclean) (unused)
ip6_tables 13920 1 [ip6table_filter]
iptable_filter 2720 0 (autoclean) (unused)
ipt_state 1536 2
ipt_MASQUERADE 2304 1
iptable_nat 19380 0 [ipt_MASQUERADE]
ip_conntrack 22860 2 [ipt_state ipt_MASQUERADE iptable_nat]
ip_tables 13312 7 [ipt_LOG iptable_filter ipt_state
ipt_MASQUERADE iptable_nat]
nfs 87420 1 (autoclean)
snd-synth-emu10k1 5568 0 (autoclean) (unused)
snd-synth-emux 31680 0 (autoclean) [snd-synth-emu10k1]
snd-seq-midi-emul 6384 0 (autoclean) [snd-synth-emux]
snd-seq-virmidi 10400 0 (autoclean) [snd-synth-emux]
snd-seq-midi 4640 0 (autoclean) (unused)
snd-seq-oss 33152 0 (unused)
snd-seq-midi-event 5552 0 [snd-seq-virmidi snd-seq-midi snd-seq-oss]
snd-seq 53040 0 [snd-synth-emux snd-seq-virmidi snd-seq-midi
snd-seq-oss snd-seq-midi-event]
snd-pcm-oss 21312 0 (unused)
snd-pcm-plugin 21552 0 [snd-pcm-oss]
snd-mixer-oss 6432 0 [snd-pcm-oss]
snd-card-emu10k1 3552 0
snd-emu10k1 28608 0 [snd-synth-emu10k1 snd-card-emu10k1]
snd-ac97-codec 29472 0 [snd-emu10k1]
snd-mixer 35368 0 [snd-mixer-oss snd-emu10k1 snd-ac97-codec]
snd-pcm 42880 0 [snd-pcm-oss snd-pcm-plugin snd-emu10k1]
snd-timer 11904 0 [snd-seq snd-pcm]
snd-rawmidi 14624 0 [snd-seq-midi snd-emu10k1]
snd-emux-mem 3744 0 [snd-synth-emux snd-emu10k1]
snd-seq-device 5808 0 [snd-synth-emu10k1 snd-synth-emux
snd-seq-midi snd-seq-oss snd-seq snd-card-emu10k1 snd-rawmidi]
snd 49152 1 [snd-synth-emux snd-seq-virmidi snd-seq-midi
snd-seq-oss snd-seq-midi-event snd-seq snd-pcm-oss snd-pcm-plugin
snd-mixer-oss snd-card-emu10k1 snd-emu10k1 snd-ac97-codec snd-mixer snd-pcm
snd-timer snd-rawmidi snd-emux-mem snd-seq-device]
soundcore 6692 9 [snd]
nfsd 79168 8 (autoclean)
lockd 58528 1 (autoclean) [nfs nfsd]
sunrpc 77140 1 (autoclean) [nfs nfsd lockd]
lp 6400 0
parport_pc 23812 1
parport 35680 1 [lp parport_pc]
ipv6 150240 -1 (autoclean)
usb-ohci 20992 0 (unused)
usbcore 62496 1 [usb-ohci]
tulip 42080 1 (autoclean)
sr_mod 15768 0 (autoclean) (unused)
md 50624 0 (unused)
tuner 6020 1 (autoclean)
tvaudio 10464 0 (autoclean) (unused)
msp3400 16176 1 (autoclean)
bttv 80960 0 (unused)
videodev 8128 3 [bttv]
i2c-algo-bit 8492 1 [bttv]
i2c-core 18400 0 [tuner tvaudio msp3400 bttv i2c-algo-bit]
v4l2-common 8672 0 [bttv]
xfs 530904 4
aic7xxx 112172 0 (unused)
sd_mod 10456 0 (unused)
scsi_mod 83916 3 [sr_mod aic7xxx sd_mod]
xfs_support 9508 0 [xfs]
pagebuf 24896 4 [xfs xfs_support]
-----------------------------------------------------------------------------
------------------------------------ ioports --------------------------------
0000-001f : dma1
0020-003f : pic1
0040-005f : timer
0060-006f : keyboard
0070-007f : rtc
0080-008f : dma page reg
00a0-00bf : pic2
00c0-00df : dma2
00f0-00ff : fpu
0170-0177 : ide1
01f0-01f7 : ide0
02f8-02ff : serial(auto)
0376-0376 : ide1
0378-037a : parport0
037b-037f : parport0
03c0-03df : vga+
03f6-03f6 : ide0
03f8-03ff : serial(auto)
0778-077a : parport0
0cf8-0cff : PCI conf1
b000-bfff : PCI Bus #01
d200-d203 : Advanced Micro Devices [AMD] AMD-751 [Irongate] System Controller
d400-d41f : Creative Labs SB Live! EMU10k1
d400-d41f : EMU10K1
d600-d6ff : Lite-On Communications Inc LNE100TX
d600-d6ff : tulip
d800-d803 : Elsa AG QuickStep 1000
da00-da7f : Elsa AG QuickStep 1000
dc00-dcff : Adaptec AHA-2940U2/W
de00-de07 : Creative Labs SB Live!
f000-f00f : Advanced Micro Devices [AMD] AMD-756 [Viper] IDE
f000-f007 : ide0
f008-f00f : ide1
------------------------------------------------------------------------------

--------------------------------------- iomem --------------------------------
00000000-0009fbff : System RAM
0009fc00-0009ffff : reserved
000a0000-000bffff : Video RAM area
000c0000-000c7fff : Video ROM
000cb800-000d7fff : Extension ROM
000f0000-000fffff : System ROM
00100000-1ffeffff : System RAM
00100000-001ffeeb : Kernel code
001ffeec-002cb227 : Kernel data
1fff0000-1fff7fff : ACPI Tables
1fff8000-1fffffff : ACPI Non-volatile Storage
d5c00000-e5cfffff : PCI Bus #01
d8000000-dfffffff : nVidia Corporation NV11 (GeForce2 MX)
e8000000-ebffffff : Advanced Micro Devices [AMD] AMD-751 [Irongate] System
Controller
eddfe000-eddfefff : Advanced Micro Devices [AMD] AMD-751 [Irongate] System
Controller
eddff000-eddfffff : Brooktree Corporation Bt848 TV with DMA push
eddff000-eddfffff : bttv0
ede00000-efefffff : PCI Bus #01
ee000000-eeffffff : nVidia Corporation NV11 (GeForce2 MX)
efffd000-efffdfff : Advanced Micro Devices [AMD] AMD-756 [Viper] USB
efffd000-efffdfff : usb-ohci
efffee00-efffeeff : Lite-On Communications Inc LNE100TX
efffee00-efffeeff : tulip
efffef80-efffefff : Elsa AG QuickStep 1000
effff000-efffffff : Adaptec AHA-2940U2/W
effff000-efffffff : aic7xxx
ffff0000-ffffffff : reserved
-------------------------------------------------------------------------------

-------------------------------------- lspci --------------------------------
00:00.0 Host bridge: Advanced Micro Devices [AMD] AMD-751 [Irongate] System
Controller (rev 25)
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR+ FastB2B-
Status: Cap+ 66Mhz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort-
<MAbort+ >SERR- <PERR-
Latency: 120
Region 0: Memory at e8000000 (32-bit, prefetchable) [size=64M]
Region 1: Memory at eddfe000 (32-bit, prefetchable) [size=4K]
Region 2: I/O ports at d200 [disabled] [size=4]
Capabilities: [a0] AGP version 1.0
Status: RQ=15 SBA+ 64bit- FW- Rate=x1,x2
Command: RQ=0 SBA- AGP+ 64bit- FW- Rate=x1

00:01.0 PCI bridge: Advanced Micro Devices [AMD] AMD-751 [Irongate] AGP
Bridge (rev 01) (prog-if 00 [Normal decode])
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR+ FastB2B-
Status: Cap- 66Mhz+ UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort-
<MAbort- >SERR- <PERR-
Latency: 120
Bus: primary=00, secondary=01, subordinate=01, sec-latency=32
I/O behind bridge: 0000b000-0000bfff
Memory behind bridge: ede00000-efefffff
Prefetchable memory behind bridge: d5c00000-e5cfffff
BridgeCtl: Parity- SERR+ NoISA- VGA+ MAbort- >Reset- FastB2B-

00:07.0 ISA bridge: Advanced Micro Devices [AMD] AMD-756 [Viper] ISA (rev 01)
Control: I/O+ Mem+ BusMaster+ SpecCycle+ MemWINV- VGASnoop- ParErr-
Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort-
<MAbort- >SERR- <PERR-
Latency: 0

00:07.1 IDE interface: Advanced Micro Devices [AMD] AMD-756 [Viper] IDE (rev
07) (prog-if 8a [Master SecP PriP])
Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort-
<MAbort- >SERR- <PERR-
Latency: 32
Region 4: I/O ports at f000 [size=16]

00:07.3 Bridge: Advanced Micro Devices [AMD] AMD-756 [Viper] ACPI (rev 03)
Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort-
<MAbort- >SERR- <PERR-

00:07.4 USB Controller: Advanced Micro Devices [AMD] AMD-756 [Viper] USB (rev
06) (prog-if 10 [OHCI])
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr-
Stepping- SERR+ FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort-
<MAbort- >SERR- <PERR-
Latency: 16 (20000ns max), cache line size 08
Interrupt: pin D routed to IRQ 10
Region 0: Memory at efffd000 (32-bit, non-prefetchable) [size=4K]

00:08.0 Multimedia audio controller: Creative Labs SB Live! EMU10000 (rev 04)
Subsystem: Creative Labs CT4620 SBLive!
Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR+ FastB2B-
Status: Cap+ 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort-
<MAbort- >SERR- <PERR-
Latency: 64 (500ns min, 5000ns max)
Interrupt: pin A routed to IRQ 9
Region 0: I/O ports at d400 [size=32]
Capabilities: [dc] Power Management version 1
Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
Status: D0 PME-Enable- DSel=0 DScale=0 PME-

00:08.1 Input device controller: Creative Labs SB Live! (rev 01)
Subsystem: Creative Labs Gameport Joystick
Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR+ FastB2B-
Status: Cap+ 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort-
<MAbort- >SERR- <PERR-
Latency: 64
Region 0: I/O ports at de00 [size=8]
Capabilities: [dc] Power Management version 1
Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
Status: D0 PME-Enable- DSel=0 DScale=0 PME-

00:09.0 Multimedia video controller: Brooktree Corporation Bt848 TV with DMA
push (rev 12)
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR+ FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort-
<MAbort- >SERR- <PERR-
Latency: 64 (4000ns min, 10000ns max)
Interrupt: pin A routed to IRQ 11
Region 0: Memory at eddff000 (32-bit, prefetchable) [size=4K]

00:0a.0 Ethernet controller: Lite-On Communications Inc LNE100TX (rev 20)
Subsystem: Netgear FA310TX
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR+ FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort-
<MAbort- >SERR- <PERR-
Latency: 64
Interrupt: pin A routed to IRQ 5
Region 0: I/O ports at d600 [size=256]
Region 1: Memory at efffee00 (32-bit, non-prefetchable) [size=256]
Expansion ROM at eff80000 [disabled] [size=256K]

00:0b.0 SCSI storage controller: Adaptec AHA-2940U2/W
Subsystem: Adaptec: Unknown device a180
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr-
Stepping- SERR+ FastB2B-
Status: Cap+ 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort-
<MAbort- >SERR- <PERR-
Latency: 64 (9750ns min, 6250ns max)
Interrupt: pin A routed to IRQ 10
BIST result: 00
Region 0: I/O ports at dc00 [disabled] [size=256]
Region 1: Memory at effff000 (64-bit, non-prefetchable) [size=4K]
Expansion ROM at effc0000 [disabled] [size=128K]
Capabilities: [dc] Power Management version 1
Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
Status: D0 PME-Enable- DSel=0 DScale=0 PME-

00:0c.0 Network controller: Elsa AG QuickStep 1000 (rev 01)
Subsystem: Elsa AG QuickStep 1000
Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR+ FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort-
<MAbort- >SERR- <PERR-
Interrupt: pin A routed to IRQ 9
Region 0: Memory at efffef80 (32-bit, non-prefetchable) [size=128]
Region 1: I/O ports at da00 [size=128]
Region 3: I/O ports at d800 [size=4]

01:05.0 VGA compatible controller: nVidia Corporation NV11 (rev a1) (prog-if
00 [VGA])
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR- FastB2B-
Status: Cap+ 66Mhz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort-
<MAbort- >SERR- <PERR-
Latency: 64 (1250ns min, 250ns max)
Interrupt: pin A routed to IRQ 11
Region 0: Memory at ee000000 (32-bit, non-prefetchable) [size=16M]
Region 1: Memory at d8000000 (32-bit, prefetchable) [size=128M]
Expansion ROM at efef0000 [disabled] [size=64K]
Capabilities: [60] Power Management version 2
Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
Status: D0 PME-Enable- DSel=0 DScale=0 PME-
Capabilities: [44] AGP version 2.0
Status: RQ=31 SBA- 64bit- FW+ Rate=x1,x2
Command: RQ=0 SBA- AGP- 64bit- FW- Rate=<none>

--------------------------------------------------------------------------------
----------------------------------- scsi -------------------------------------
Attached devices:
Host: scsi0 Channel: 00 Id: 02 Lun: 00
Vendor: PLEXTOR Model: CD-ROM PX-32TS Rev: 1.02
Type: CD-ROM ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 03 Lun: 00
Vendor: PLEXTOR Model: CD-R PX-W1210S Rev: 1.03
Type: CD-ROM ANSI SCSI revision: 02
--------------------------------------------------------------------------------


Alexander Feigl


2001-10-10 16:22:41

by Justin T. Gibbs

Subject: Re: PROBLEM: aic7xxx SCSI system hangs

>Summary : aic7xxx SCSI system hangs

You need to send a complete console log from boot through the hang
(serial console preferred). It would also be useful for you to
add KDB to your system and get as much information as possible about
the hung exception handling thread (e.g. what routines it is looping
through).

--
Justin

2001-10-10 17:47:57

by Cliff Albert

Subject: Re: PROBLEM: aic7xxx SCSI system hangs

On Wed, Oct 10, 2001 at 04:03:12PM +0200, Alexander Feigl wrote:

> Full description : After running cdrecord -toc with an audio CD (CDDA) in my
> Plextor drive (PX-32TS), the [scsi_eh_0] process remains in uninterruptible
> state. All processes that then use the SCSI subsystem (e.g. cdrecord,
> mkisofs) end up in uninterruptible state too, which renders the SCSI system
> unusable until the next reboot. The SCSI system crashes only with this CD-ROM
> drive. My Plextor PX-W1210S does not hang. Reading the TOC with cdda2wav works
> here. Somebody told me something similar happens with a Plextor PX40 and
> cdda2wav. Others reported hangs but didn't tell me anything about their SCSI
> system. (I'm writing a CD recording UI frontend and received some bug
> reports.) The problems remain if I don't use my software but call the
> commands from the shell. I think it is a kernel problem. Userspace bugs
> (cdrecord, mkisofs?) should not hang the SCSI subsystem anyway.

This problem also seems to appear on 2.4.10-ac10 using the old AIC7xxx driver. cdrecord spits out a bunch of errors, and after that the SCSI bus is just plain dead. The audio CD itself plays correctly on the PX-32TS using the front buttons.

> Kernel version : Linux version 2.4.11-pre5-xfs ([email protected])
> (gcc version egcs-2.91.66 19990314/Linux (egcs-1.1.2 release / Mandrake Linux
> 8.1)) #1 Mon Okt 8 13:38:15 CEST 2001

Kernel version : Linux version 2.4.10-ac10 (root@neve) (gcc version 2.95.4 20011006 (Debian prerelease)) #6 Tue Oct 9 11:59:30 CEST 2001

> I also tried stable and non-XFS kernels, and the problem remains.
>
> Shell command : cdrecord -toc dev=x,y,z (with a CDDA in the drive)
>
> My environment : AMD Athlon 900 MHz
> 512 MB RAM
> 2 IDE hard drives
> Plextor PX-32TS (the one that causes the problems)
> Plextor PX-W1210S
> Adaptec 2940U2W SCSI controller (both CD drives attached)
> nVidia Geforce 2 GFX (using XFree86 drivers)
> ALSA sound system (0.5.10) with a SB Live!

Environment here : Intel Celeron 266 (overclocked to 400)
512 MB RAM
Adaptec 7890 (Onboard P2B-S)

--> /proc/scsi/scsi
Attached devices:
Host: scsi0 Channel: 00 Id: 00 Lun: 00
Vendor: QUANTUM Model: FIREBALL ST6.4S Rev: 0F0C
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 01 Lun: 00
Vendor: QUANTUM Model: VIKING II 9.1WLS Rev: 4110
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 02 Lun: 00
Vendor: YAMAHA Model: CRW8424S Rev: 1.0g
Type: CD-ROM ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 04 Lun: 00
Vendor: IOMEGA Model: ZIP 100 Rev: J.03
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 05 Lun: 00
Vendor: YAMAHA Model: CRW2100S Rev: 1.0H
Type: CD-ROM ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 06 Lun: 00
Vendor: PLEXTOR Model: CD-ROM PX-32TS Rev: 1.01
Type: CD-ROM ANSI SCSI revision: 02


--> The error messages (kernel)
Oct 10 19:38:48 neve kernel: scsi : aborting command due to timeout : pid 655363, scsi0, channel 0, id 1, lun 0 Write (10) 00 00 00 cd 1f 00 00 18 00
Oct 10 19:38:48 neve kernel: (scsi0:0:1:0) Aborting scb 5, flags 0x4, SEQADDR 0xa3, LASTPHASE 0x80
Oct 10 19:38:48 neve kernel: (scsi0:0:1:0) SG_CACHEPTR 0x6, SG_COUNT 1, SCSISIGI 0x44
Oct 10 19:38:48 neve kernel: (scsi0:0:1:0) SSTAT0 0x0, SSTAT1 0x3, SSTAT2 0x50
Oct 10 19:38:48 neve kernel: (scsi0:0:1:0) SCB disconnected. Queueing Abort SCB.
Oct 10 19:38:48 neve kernel: SCSI host 0 abort (pid 655363) timed out - resetting
Oct 10 19:38:48 neve kernel: SCSI bus is being reset for host 0 channel 0.
Oct 10 19:38:48 neve kernel: (scsi0:0:1:0) Reset called, scb 5, flags 0x1094
Oct 10 19:38:48 neve kernel: (scsi0:0:1:0) Have already attempted to reach device with queued
Oct 10 19:38:48 neve kernel: (scsi0:0:1:0) message, will escalate to bus reset.
Oct 10 19:38:48 neve kernel: (scsi0:0:-1:-1) Reset channel called, will initiate reset.
Oct 10 19:38:48 neve kernel: (scsi0:0:-1:-1) Resetting currently active channel.
Oct 10 19:38:48 neve kernel: (scsi0:0:-1:-1) Channel reset
Oct 10 19:38:48 neve kernel: (scsi0:0:-1:-1) Reset device, active_scb 14
Oct 10 19:38:48 neve kernel: (scsi0:0:-1:-1) Current scb_tag 16, SEQADDR 0xa3, LASTPHASE 0x80
Oct 10 19:38:48 neve kernel: (scsi0:0:-1:-1) SG_CACHEPTR 0x6, SG_COUNT 1, SCSISIGI 0x0
Oct 10 19:38:48 neve kernel: (scsi0:0:-1:-1) SSTAT0 0x0, SSTAT1 0x1, SSTAT2 0x40
Oct 10 19:38:48 neve kernel: (scsi0:0:0:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:38:48 neve kernel: (scsi0:0:1:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:38:48 neve kernel: (scsi0:0:2:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:38:48 neve kernel: (scsi0:0:3:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:38:48 neve kernel: (scsi0:0:4:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:38:48 neve kernel: (scsi0:0:5:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:38:48 neve kernel: (scsi0:0:6:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:38:48 neve kernel: (scsi0:0:8:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:38:48 neve kernel: (scsi0:0:9:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:38:48 neve kernel: (scsi0:0:10:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:38:48 neve kernel: (scsi0:0:11:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:38:48 neve kernel: (scsi0:0:12:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:38:48 neve kernel: (scsi0:0:13:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:38:48 neve kernel: (scsi0:0:14:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:38:48 neve kernel: (scsi0:0:15:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:38:48 neve kernel: (scsi0:0:-1:-1) Cleaning QINFIFO.
Oct 10 19:38:48 neve kernel: (scsi0:0:-1:-1) Cleaning waiting_scbs.
Oct 10 19:38:48 neve kernel: (scsi0:0:-1:-1) Cleaning waiting for selection list.
Oct 10 19:38:48 neve kernel: (scsi0:0:-1:-1) Cleaning disconnected scbs list.
Oct 10 19:38:48 neve kernel: (scsi0:0:1:0) Aborting scb 5
Oct 10 19:38:48 neve kernel: (scsi0:0:1:0) Aborting scb 8
Oct 10 19:38:48 neve kernel: (scsi0:0:0:0) Aborting scb 12
Oct 10 19:38:48 neve kernel: (scsi0:0:0:0) Aborting scb 15
Oct 10 19:38:48 neve kernel: (scsi0:0:6:0) Aborting scb 16
Oct 10 19:38:48 neve kernel: (scsi0:-1:-1:-1) 5 commands found and queued for completion.
Oct 10 19:38:48 neve kernel: (scsi0:0:0:0) Sending SDTR 12/127 message.
Oct 10 19:38:48 neve kernel: (scsi0:0:0:0) Synchronous at 20.0 Mbyte/sec, offset 15.
Oct 10 19:38:48 neve kernel: (scsi0:0:1:0) Sending WDTR message.
Oct 10 19:38:48 neve kernel: (scsi0:0:1:0) Using Wide(16bit) transfers
Oct 10 19:38:48 neve kernel: (scsi0:0:6:0) Sending SDTR 12/127 message.
Oct 10 19:38:48 neve kernel: (scsi0:0:6:0) Synchronous at 20.0 Mbyte/sec, offset 15.
Oct 10 19:38:48 neve kernel: (scsi0:0:1:0) Sending SDTR 10/127 message.
Oct 10 19:38:48 neve kernel: (scsi0:0:1:0) Synchronous at 80.0 Mbyte/sec, offset 31.

--> The error messages (kernel), 2nd attempt
Oct 10 19:39:39 neve kernel: scsi : aborting command due to timeout : pid 655571, scsi0, channel 0, id 0, lun 0 Write (10) 00 00 1a 60 b2 00 00 18 00
Oct 10 19:39:39 neve kernel: (scsi0:0:0:0) Aborting scb 11, flags 0x4, SEQADDR 0xa4, LASTPHASE 0x80
Oct 10 19:39:39 neve kernel: (scsi0:0:0:0) SG_CACHEPTR 0x6, SG_COUNT 1, SCSISIGI 0x44
Oct 10 19:39:39 neve kernel: (scsi0:0:0:0) SSTAT0 0x0, SSTAT1 0x3, SSTAT2 0x50
Oct 10 19:39:39 neve kernel: (scsi0:0:0:0) SCB found in QINFIFO and aborted.
Oct 10 19:39:39 neve kernel: (scsi0:0:0:0) Aborting scb 11
Oct 10 19:39:39 neve kernel: (scsi0:-1:-1:-1) 1 commands found and queued for completion.
Oct 10 19:39:49 neve kernel: scsi : aborting command due to timeout : pid 655570, scsi0, channel 0, id 6, lun 0 VENDOR SPECIFIC(0xe5) 00 00 00 00 01 00 00 0e 00
Oct 10 19:39:49 neve kernel: (scsi0:0:6:0) Aborting scb 4, flags 0x4, SEQADDR 0xa4, LASTPHASE 0x80
Oct 10 19:39:49 neve kernel: (scsi0:0:6:0) SG_CACHEPTR 0x6, SG_COUNT 1, SCSISIGI 0x44
Oct 10 19:39:49 neve kernel: (scsi0:0:6:0) SSTAT0 0x0, SSTAT1 0x3, SSTAT2 0x50
Oct 10 19:39:49 neve kernel: (scsi0:0:6:0) SCB is currently active. Waiting on completion.
Oct 10 19:39:49 neve kernel: (scsi0:0:6:0) SCSISIGI 0x44, SEQADDR 0xa4, SSTAT0 0x0, SSTAT1 0x3
Oct 10 19:39:49 neve kernel: (scsi0:0:6:0) SG_CACHEPTR 0x6, SSTAT2 0x50, STCNT 0x8
Oct 10 19:39:54 neve kernel: SCSI host 0 abort (pid 655570) timed out - resetting
Oct 10 19:39:54 neve kernel: SCSI bus is being reset for host 0 channel 0.
Oct 10 19:39:54 neve kernel: (scsi0:0:6:0) Reset called, scb 4, flags 0x84
Oct 10 19:39:54 neve kernel: (scsi0:0:6:0) Bus device reset stupid when other action has failed.
Oct 10 19:39:54 neve kernel: (scsi0:0:-1:-1) Reset channel called, will initiate reset.
Oct 10 19:39:54 neve kernel: (scsi0:0:-1:-1) Resetting currently active channel.
Oct 10 19:39:54 neve kernel: (scsi0:0:-1:-1) Channel reset
Oct 10 19:39:54 neve kernel: (scsi0:0:-1:-1) Reset device, active_scb 4
Oct 10 19:39:54 neve kernel: (scsi0:0:-1:-1) Current scb_tag 4, SEQADDR 0xa4, LASTPHASE 0x80
Oct 10 19:39:54 neve kernel: (scsi0:0:-1:-1) SG_CACHEPTR 0x6, SG_COUNT 1, SCSISIGI 0x0
Oct 10 19:39:54 neve kernel: (scsi0:0:-1:-1) SSTAT0 0x0, SSTAT1 0x1, SSTAT2 0x40
Oct 10 19:39:54 neve kernel: (scsi0:0:0:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:39:54 neve kernel: (scsi0:0:1:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:39:54 neve kernel: (scsi0:0:2:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:39:54 neve kernel: (scsi0:0:3:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:39:54 neve kernel: (scsi0:0:4:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:39:54 neve kernel: (scsi0:0:5:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:39:54 neve kernel: (scsi0:0:6:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:39:54 neve kernel: (scsi0:0:8:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:39:54 neve kernel: (scsi0:0:9:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:39:54 neve kernel: (scsi0:0:10:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:39:54 neve kernel: (scsi0:0:11:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:39:54 neve kernel: (scsi0:0:12:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:39:54 neve kernel: (scsi0:0:13:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:39:54 neve kernel: (scsi0:0:14:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:39:54 neve kernel: (scsi0:0:15:-1) Cleaning up status information and delayed_scbs.
Oct 10 19:39:54 neve kernel: (scsi0:0:-1:-1) Cleaning QINFIFO.
Oct 10 19:39:54 neve kernel: (scsi0:0:-1:-1) Cleaning waiting_scbs.
Oct 10 19:39:54 neve kernel: (scsi0:0:-1:-1) Cleaning waiting for selection list.
Oct 10 19:39:54 neve kernel: (scsi0:0:-1:-1) Cleaning disconnected scbs list.
Oct 10 19:39:54 neve kernel: (scsi0:0:6:0) Aborting scb 4
Oct 10 19:39:54 neve kernel: (scsi0:0:0:0) Aborting scb 11
Oct 10 19:39:54 neve kernel: (scsi0:-1:-1:-1) 2 commands found and queued for completion.
Oct 10 19:39:54 neve kernel: (scsi0:0:0:0) Sending SDTR 12/127 message.
Oct 10 19:39:54 neve kernel: (scsi0:0:0:0) Synchronous at 20.0 Mbyte/sec, offset 15.
Oct 10 19:39:54 neve kernel: (scsi0:0:6:0) Sending SDTR 12/127 message.
Oct 10 19:39:54 neve kernel: (scsi0:0:6:0) Synchronous at 20.0 Mbyte/sec, offset 15.
Oct 10 19:39:54 neve kernel: (scsi0:0:1:0) Sending WDTR message.
Oct 10 19:39:54 neve kernel: (scsi0:0:1:0) Using Wide(16bit) transfers
Oct 10 19:39:54 neve kernel: (scsi0:0:1:0) Sending SDTR 10/127 message.
Oct 10 19:39:54 neve kernel: (scsi0:0:1:0) Synchronous at 80.0 Mbyte/sec, offset 31.
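
To see which targets and commands timed out, the "aborting command due to
timeout" lines can be picked apart mechanically. This is a rough sketch, not
a tool from the kernel tree: parse_timeout and TIMEOUT_RE are hypothetical
names, and it assumes the hex bytes printed after the command name are the
CDB bytes that follow the opcode, so that for Write (10) the logical block
address sits in CDB bytes 2-5 and the transfer length in bytes 7-8.

```python
import re

# Matches the mid-layer timeout message; group 6 is the command name,
# group 7 the run of hex bytes printed after it.
TIMEOUT_RE = re.compile(
    r"aborting command due to timeout : pid (\d+), scsi(\d+), "
    r"channel (\d+), id (\d+), lun (\d+) (.+?)((?: [0-9a-fA-F]{2})+)\s*$"
)


def parse_timeout(line):
    """Return a dict describing one timeout line, or None if it does not match."""
    m = TIMEOUT_RE.search(line)
    if m is None:
        return None
    pid, host, channel, target, lun = (int(g) for g in m.groups()[:5])
    tail = bytes.fromhex(m.group(7).replace(" ", ""))
    info = {
        "pid": pid, "host": host, "channel": channel,
        "target": target, "lun": lun,
        "command": m.group(6), "tail": tail,
    }
    # Assumption: tail holds CDB bytes 1..9, so tail[1:5] is the LBA and
    # tail[6:8] is the block count of a 10-byte WRITE.
    if info["command"].startswith("Write (10)") and len(tail) == 9:
        info["lba"] = int.from_bytes(tail[1:5], "big")
        info["blocks"] = int.from_bytes(tail[6:8], "big")
    return info


if __name__ == "__main__":
    sample = ("Oct 10 19:38:48 neve kernel: scsi : aborting command due to "
              "timeout : pid 655363, scsi0, channel 0, id 1, lun 0 "
              "Write (10) 00 00 00 cd 1f 00 00 18 00")
    print(parse_timeout(sample))
```

Under that assumption, the first timeout above is a 24-block write to the
disk at id 1, and the one that leads to the second bus reset is the vendor
specific 0xe5 command sent to the PX-32TS at id 6.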


--> The error messages (cdrecord)
scsidev: '0,6,0'
scsibus: 0 target: 6 lun: 0
Linux sg driver version: 3.1.20
cdrecord: Input/output error. read toc: scsi sendcmd: no error
CDB: 43 00 00 00 00 00 02 00 0C 00
status: 0x2 (CHECK CONDITION)
Sense Bytes: 70 00 02 00 00 00 00 0A 00 00 00 00 04 01 00 00
Sense Key: 0x2 Not Ready, Segment 0
Sense Code: 0x04 Qual 0x01 (logical unit is in process of becoming ready) Fru 0x0
Sense flags: Blk 0 (not valid)
cmd finished after 0.000s timeout 40s
cdrecord: Cannot read TOC
cdrecord: Input/output error. read toc: scsi sendcmd: no error
CDB: 43 00 00 00 00 00 03 00 0C 00
status: 0x2 (CHECK CONDITION)
Sense Bytes: 70 00 02 00 00 00 00 0A 00 00 00 00 04 01 00 00
Sense Key: 0x2 Not Ready, Segment 0
Sense Code: 0x04 Qual 0x01 (logical unit is in process of becoming ready) Fru 0x0
Sense flags: Blk 0 (not valid)
cmd finished after 0.000s timeout 40s
cdrecord: Cannot read TOC
cdrecord: Input/output error. read toc: scsi sendcmd: no error
CDB: 43 00 00 00 00 00 04 00 0C 00
status: 0x2 (CHECK CONDITION)
Sense Bytes: 70 00 02 00 00 00 00 0A 00 00 00 00 04 01 00 00
Sense Key: 0x2 Not Ready, Segment 0
Sense Code: 0x04 Qual 0x01 (logical unit is in process of becoming ready) Fru 0x0
Sense flags: Blk 0 (not valid)
cmd finished after 0.000s timeout 40s
cdrecord: Cannot read TOC
cdrecord: Input/output error. read toc: scsi sendcmd: no error
CDB: 43 00 00 00 00 00 05 00 0C 00
status: 0x2 (CHECK CONDITION)
Sense Bytes: 70 00 02 00 00 00 00 0A 00 00 00 00 04 01 00 00
Sense Key: 0x2 Not Ready, Segment 0
Sense Code: 0x04 Qual 0x01 (logical unit is in process of becoming ready) Fru 0x0
Sense flags: Blk 0 (not valid)
cmd finished after 0.000s timeout 40s
cdrecord: Cannot read TOC
cdrecord: Input/output error. read toc: scsi sendcmd: no error
CDB: 43 00 00 00 00 00 06 00 0C 00
status: 0x2 (CHECK CONDITION)
Sense Bytes: 70 00 02 00 00 00 00 0A 00 00 00 00 04 01 00 00
Sense Key: 0x2 Not Ready, Segment 0
Sense Code: 0x04 Qual 0x01 (logical unit is in process of becoming ready) Fru 0x0
Sense flags: Blk 0 (not valid)
cmd finished after 0.000s timeout 40s
cdrecord: Cannot read TOC
cdrecord: Input/output error. read toc: scsi sendcmd: no error
CDB: 43 00 00 00 00 00 07 00 0C 00
status: 0x2 (CHECK CONDITION)
Sense Bytes: 70 00 02 00 00 00 00 0A 00 00 00 00 04 01 00 00
Sense Key: 0x2 Not Ready, Segment 0
Sense Code: 0x04 Qual 0x01 (logical unit is in process of becoming ready) Fru 0x0
Sense flags: Blk 0 (not valid)
cmd finished after 0.000s timeout 40s
cdrecord: Cannot read TOC
cdrecord: Input/output error. read toc: scsi sendcmd: no error
CDB: 43 00 00 00 00 00 08 00 0C 00
status: 0x2 (CHECK CONDITION)
Sense Bytes: 70 00 02 00 00 00 00 0A 00 00 00 00 04 01 00 00
Sense Key: 0x2 Not Ready, Segment 0
Sense Code: 0x04 Qual 0x01 (logical unit is in process of becoming ready) Fru 0x0
Sense flags: Blk 0 (not valid)
cmd finished after 0.000s timeout 40s
cdrecord: Cannot read TOC
cdrecord: Input/output error. read toc: scsi sendcmd: no error
CDB: 43 00 00 00 00 00 09 00 0C 00
status: 0x2 (CHECK CONDITION)
Sense Bytes: 70 00 02 00 00 00 00 0A 00 00 00 00 04 01 00 00
Sense Key: 0x2 Not Ready, Segment 0
Sense Code: 0x04 Qual 0x01 (logical unit is in process of becoming ready) Fru 0x0
Sense flags: Blk 0 (not valid)
cmd finished after 0.000s timeout 40s
cdrecord: Cannot read TOC
cdrecord: Input/output error. read toc: scsi sendcmd: no error
CDB: 43 00 00 00 00 00 0A 00 0C 00
status: 0x2 (CHECK CONDITION)
Sense Bytes: 70 00 02 00 00 00 00 0A 00 00 00 00 04 01 00 00
Sense Key: 0x2 Not Ready, Segment 0
Sense Code: 0x04 Qual 0x01 (logical unit is in process of becoming ready) Fru 0x0
Sense flags: Blk 0 (not valid)
cmd finished after 0.000s timeout 40s
cdrecord: Cannot read TOC
cdrecord: Input/output error. read toc: scsi sendcmd: no error
CDB: 43 00 00 00 00 00 0B 00 0C 00
status: 0x2 (CHECK CONDITION)
Sense Bytes: 70 00 02 00 00 00 00 0A 00 00 00 00 04 01 00 00
Sense Key: 0x2 Not Ready, Segment 0
Sense Code: 0x04 Qual 0x01 (logical unit is in process of becoming ready) Fru 0x0
Sense flags: Blk 0 (not valid)
cmd finished after 0.000s timeout 40s
cdrecord: Cannot read TOC
cdrecord: Input/output error. read toc: scsi sendcmd: no error
CDB: 43 00 00 00 00 00 0C 00 0C 00
status: 0x2 (CHECK CONDITION)
Sense Bytes: 70 00 02 00 00 00 00 0A 00 00 00 00 04 01 00 00
Sense Key: 0x2 Not Ready, Segment 0
Sense Code: 0x04 Qual 0x01 (logical unit is in process of becoming ready) Fru 0x0
Sense flags: Blk 0 (not valid)
cmd finished after 0.000s timeout 40s
cdrecord: Cannot read TOC
cdrecord: Input/output error. read toc: scsi sendcmd: no error
CDB: 43 00 00 00 00 00 0D 00 0C 00
status: 0x2 (CHECK CONDITION)
Sense Bytes: 70 00 02 00 00 00 00 0A 00 00 00 00 04 01 00 00
Sense Key: 0x2 Not Ready, Segment 0
Sense Code: 0x04 Qual 0x01 (logical unit is in process of becoming ready) Fru 0x0
Sense flags: Blk 0 (not valid)
cmd finished after 0.000s timeout 40s
cdrecord: Cannot read TOC
cdrecord: Input/output error. read toc: scsi sendcmd: no error
CDB: 43 00 00 00 00 00 0E 00 0C 00
status: 0x2 (CHECK CONDITION)
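
For anyone reading the cdrecord output above by hand, here is a minimal sketch of how the CDB and sense bytes break down. It assumes the standard fixed-format sense layout (response code 0x70, sense key in the low nibble of byte 2, ASC and ASCQ in bytes 12 and 13) and the 10-byte READ TOC command (opcode 0x43, track number in byte 6, allocation length in bytes 7-8). The function names and the small lookup tables are made up for illustration and are far from complete.

```python
# Values copied from the cdrecord output above.
CDB_HEX = "43 00 00 00 00 00 09 00 0C 00"
SENSE_HEX = "70 00 02 00 00 00 00 0A 00 00 00 00 04 01 00 00"

# Deliberately incomplete tables, enough for this log only.
SENSE_KEYS = {
    0x0: "No Sense",
    0x1: "Recovered Error",
    0x2: "Not Ready",
    0x3: "Medium Error",
    0x4: "Hardware Error",
    0x5: "Illegal Request",
    0x6: "Unit Attention",
}
ASC_ASCQ = {
    (0x04, 0x01): "logical unit is in process of becoming ready",
}


def decode_read_toc_cdb(hex_string):
    """Split a 10-byte READ TOC CDB into the fields that vary in the log."""
    cdb = bytes.fromhex(hex_string)
    if len(cdb) != 10 or cdb[0] != 0x43:
        raise ValueError("not a 10-byte READ TOC CDB")
    return {
        "opcode": cdb[0],
        "track": cdb[6],
        "allocation_length": (cdb[7] << 8) | cdb[8],
    }


def decode_fixed_sense(hex_string):
    """Pull sense key, ASC, ASCQ and FRU out of fixed-format sense data."""
    data = bytes.fromhex(hex_string)
    response_code = data[0] & 0x7F
    if response_code not in (0x70, 0x71):
        raise ValueError("not fixed-format sense data")
    return {
        "response_code": response_code,
        "sense_key": data[2] & 0x0F,
        "additional_length": data[7],
        "asc": data[12],
        "ascq": data[13],
        "fru": data[14],
    }


if __name__ == "__main__":
    cdb = decode_read_toc_cdb(CDB_HEX)
    sense = decode_fixed_sense(SENSE_HEX)
    print("READ TOC for track", cdb["track"],
          "allocation length", cdb["allocation_length"])
    print("Sense key:", SENSE_KEYS.get(sense["sense_key"], "unknown"))
    print("ASC/ASCQ:", ASC_ASCQ.get((sense["asc"], sense["ascq"]), "unknown"))
```

The only field that changes between the repeated failures is byte 6 of the CDB, the track number, which counts up from 0x09. The drive answers each request with the same "Not Ready" sense data.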

--
Cliff Albert | RIPE: CA3348-RIPE | http://www.oisec.net
[email protected] | 6BONE: CA2-6BONE | icq 18461740

2001-10-10 19:36:58

by Alexander Feigl

Subject: Re: PROBLEM: aic7xxx SCSI system hangs

On Wed, 2001-10-10 at 18:22, 1002730973 wrote:
> >Summary : aic7xxx SCSI system hangs
>
> You need to send a complete console log from boot through the hang
> (serial console preferred). It would also be useful for you to
> add KDB to your system and get as much information about the hung
> exception handling thread (e.g. what routines it is looping through).
>

I don't have any real kdb experience and not much time at the
moment, but I will try to post the required information. I attached a
second computer to the serial port via a null modem cable and logged the
output. Tell me if some essential output is missing.

Where can I find information on how to trace the problem with kdb or
anything else?

----------------------------- console log ---------------------------

Linux version 2.4.11-xfs ([email protected]) (gcc version egcs-2.91.66 19990314/Linux (egcs-1.1.2 release / Mandrake Linux 8.1)) #1 Mit Okt 10 20:02:42 CEST 2001
BIOS-provided physical RAM map:
BIOS-e820: 0000000000000000 - 000000000009fc00 (usable)
BIOS-e820: 000000000009fc00 - 00000000000a0000 (reserved)
BIOS-e820: 00000000000f0000 - 0000000000100000 (reserved)
BIOS-e820: 0000000000100000 - 000000001fff0000 (usable)
BIOS-e820: 000000001fff0000 - 000000001fff8000 (ACPI data)
BIOS-e820: 000000001fff8000 - 0000000020000000 (ACPI NVS)
BIOS-e820: 00000000ffff0000 - 0000000100000000 (reserved)
On node 0 totalpages: 131056
zone(0): 4096 pages.
zone(1): 126960 pages.
zone(2): 0 pages.
Kernel command line: BOOT_IMAGE=2411 ro root=306 ramdisk_size=8192 devfs=mount video=riva:off console=ttyS1
Initializing CPU#0
Detected 908.964 MHz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 1808.79 BogoMIPS
Memory: 511756k/524224k available (1025k kernel code, 12080k reserved, 812k data, 228k init, 0k highmem)
kdb version 1.9 by Scott Lurndal, Keith Owens. Copyright SGI, All Rights Reserved
Dentry-cache hash table entries: 65536 (order: 7, 524288 bytes)
Inode-cache hash table entries: 32768 (order: 6, 262144 bytes)
Mount-cache hash table entries: 8192 (order: 4, 65536 bytes)
Buffer-cache hash table entries: 32768 (order: 5, 131072 bytes)
Page-cache hash table entries: 131072 (order: 7, 524288 bytes)
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#0.
CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line)
CPU: L2 Cache: 256K (64 bytes/line)
CPU: AMD Athlon(tm) Processor stepping 02
Enabling fast FPU save and restore... done.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
mtrr: v1.40 (20010327) Richard Gooch ([email protected])
mtrr: detected mtrr type: Intel
PCI: PCI BIOS revision 2.10 entry at 0xfdb71, last bus=1
PCI: Using configuration type 1
PCI: Probing PCI hardware
isapnp: Scanning for PnP cards...
isapnp: No Plug & Play device found
Linux NET4.0 for Linux 2.4
Based upon Swansea University Computer Society NET3.039
Initializing RT netlink socket
IA-32 Microcode Update Driver: v1.08 <[email protected]>
apm: BIOS version 1.2 Flags 0x03 (Driver version 1.14)
Starting kswapd
VFS: Diskquotas version dquot_6.4.0 initialized
devfs: v0.117 (20010927) Richard Gooch ([email protected])
devfs: boot_options: 0x1
pty: 256 Unix98 ptys configured
Serial driver version 5.05c (2001-07-08) with MANY_PORTS SHARE_IRQ SERIAL_PCI ISAPNP enabled
ttyS00 at 0x03f8 (irq = 4) is a 16550A
ttyS01 at 0x02f8 (irq = 3) is a 16550A
Real Time Clock Driver v1.10e
Non-volatile memory driver v1.1
block: 128 slots per queue, batch=16
RAMDISK driver initialized: 16 RAM disks of 8192K size 1024 blocksize
Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
AMD7409: IDE controller on PCI bus 00 dev 39
AMD7409: chipset revision 7
AMD7409: not 100% native mode: will probe irqs later
ide0: BM-DMA at 0xf000-0xf007, BIOS settings: hda:DMA, hdb:pio
ide1: BM-DMA at 0xf008-0xf00f, BIOS settings: hdc:DMA, hdd:pio
hda: QUANTUM FIREBALLlct15 30, ATA DISK drive
hdc: ST340823A, ATA DISK drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
ide1 at 0x170-0x177,0x376 on irq 15
hda: 58633344 sectors (30020 MB) w/418KiB Cache, CHS=3649/255/63, UDMA(33)
hdc: 78165360 sectors (40021 MB) w/1024KiB Cache, CHS=77545/16/63, UDMA(33)
Partition check:
/dev/ide/host0/bus0/target0/lun0: p1 p2 < p5 p6 p7 p8 p9 p10 p11 >
/dev/ide/host0/bus1/target0/lun0: [PTBL] [4865/255/63] p1 p2 < p5 p6 >
Floppy drive(s): fd0 is 1.44M
FDC 0 is a post-1991 82077
NET4: Linux TCP/IP 1.0 for NET4.0
IP Protocols: ICMP, UDP, TCP, IGMP
IP: routing cache hash table of 4096 buckets, 32Kbytes
TCP: Hash tables configured (established 32768 bind 32768)
Linux IP multicast router 0.06 plus PIM-SM
NET4: Unix domain sockets 1.0/SMP for Linux NET4.0.
RAMDISK: Compressed image found at block 0
Freeing initrd memory: 1554k freed
VFS: Mounted root (ext2 filesystem).
Mounted devfs on /dev
Red Hat nash version 3.1.6-mdk starting
Loading pagebuf module
Loading xfs_support module
Loading scsi_mod module
Loading sd_mod module
Loading aic7xxx module
SCSI subsystem driver Revision: 1.00
scsi0 : Adaptec AIC7XXX EISA/VLB/PCI SCSI HBA DRIVER, Rev 6.2.1
<Adaptec 2940 Ultra2 SCSI adapter>
aic7890/91: Ultra2 Wide Channel A, SCSI Id=7, 32/255 SCBs
Vendor: PLEXTOR Model: CD-ROM PX-32TS Rev: 1.02
Type: CD-ROM ANSI SCSI revision: 02
(scsi0:A:2): 20.000MB/s transfers (20.000MHz, offset 15)
Vendor: PLEXTOR Model: CD-R PX-W1210S Rev: 1.03
Type: CD-ROM ANSI SCSI revision: 02
(scsi0:A:3): 20.000MB/s transfers (20.000MHz, offset 16)
Loading xfs module
SGI XFS with ACLs, EAs, DMAPI, realtime, quota, no debug enabled
Mounting /proc filesystem
Creating root device
XFS mounting filesystem ide0(3,6)
Mounting root filesystem
Remounting devfs at correct place if necessary
Freeing unused kernel memory: 228k freed
INIT: version 2.78 booting
Couldnt get a file descriptor referring to the console
get_console_fd: Das Argument ist ungültig
Standardzeichensatz setzen: [ OK ]
Willkommen zu Mandrake Linux
Drücken Sie "I", um im interaktiven Modus zu starten.
Einhängen des Dateisystems "/proc": [ OK ]
Starten des DevFS Dämons: [ OK ]
Unmounting initrd: [ OK ]
Kern-Parameter konfigurieren: [ OK ]
Setting clock (utc): Mit Okt 10 20:31:56 CEST 2001 [ OK ]
Swap-Partitionen einbinden: [ OK ]
Setting hostname PowerBox.MysticWorld.homenet: [ OK ]
Checking root filesystem
[ OK ]
Verzeichnisbaumwurzel im Schreib-/Lese-Modus einhängen: [ OK ]
Finden der Modulabhängigkeiten: [ OK ]
Checking filesystems
[ OK ]
Einhängen lokaler Dateisysteme: [ OK ]
Kontrollieren des LoopBack-Dateisystemes[ OK ]
Einhängen der Loopback-Dateisysteme: [ OK ]
Running devfsd actions: [ OK ]
Loading compose keys: compose.latin.inc [ OK ]
The BackSpace key sends: ^?[ OK ]
Enabling local filesystem quotas: [ OK ]
Lokale Benutzer- und Gruppen-Quotas aktivieren: [ OK ]
Swap-Bereich aktivieren: [ OK ]
Building Window Manager Sessions [ OK ]
INIT: Entering runlevel: 1
Entering non-interactive startup
Allen Prozessen das Signal "TERM" senden ...
INIT: no more processes left in this runlevel
Sende allen Prozessen das Signal "KILL" ...
INIT anweisen, in den Einbenutzer-Betrieb zu wechseln.
INIT: Going single user
sh-2.05# cdrecord -toc dev=0,2,0
Cdrecord 1.10 (i586-mandrake-linux-gnu) Copyright (C) 1995-2001 Jörg Schilling
scsidev: '0,2,0'
scsibus: 0 target: 2 lun: 0
Linux sg driver version: 3.1.20
Using libscg version 'schily-0.5'
Device type : Removable CD-ROM
Version : 2
Response Format: 2
Capabilities : SYNC LINKED
Vendor_info : 'PLEXTOR '
Identifikation : 'CD-ROM PX-32TS '
Revision : '1.02'
Device seems to be: Generic CD-ROM.
Using generic SCSI-2 CD driver (scsi2_cd).
Driver flags :
first: 1 last 18
Entering kdb (current=0xc183e000, pid 3) due to Keyboard Entry
kdb> b btp 16
EBP EIP Function(args)
0xdf945f6c 0xc0112d34 schedule+0x2d4 (0xc18627c0, 0xdf944000)
kernel .text 0xc0100000 0xc0112a60 0xc0112e80
0xdf945f9c 0xc0105df0 __down_interruptible+0x80 (0xdf945fd8, 0xe082051f, 0xdf945fec, 0xe08189d7, 0x100)
kernel .text 0xc0100000 0xc0105d70 0xc0105e50
0xdf945fac 0xc0105eae __down_failed_interruptible+0xa
kernel .text 0xc0100000 0xc0105ea4 0xc0105eb4
0xc0105763 kernel_thread+0x23
kernel .text 0xc0100000 0xc0105740 0xc0105780
kdb>

2001-10-10 20:27:08

by Justin T. Gibbs

Subject: Re: PROBLEM: aic7xxx SCSI system hangs

>I don't have any real kdb experience and not much time at the
>moment, but I will try to post the required information. I attached a
>second computer to the serial port via a null modem cable and logged the
>output. Tell me if some essential output is missing.

You should run with "aic7xxx=verbose". Since you are using modules,
you will need to add a line similar to the following to /etc/modules.conf
and, if you are using an initrd, rerun mkinitrd to have the change come
into effect.

#you should already have this line
alias scsi_hostadapter aic7xxx
#but need this line
options aic7xxx aic7xxx='"verbose"'

>Where can I find information on how to trace the problem with kdb or
>anything else?

The kdb package comes with several man pages.

>Entering kdb (current=0xc183e000, pid 3) due to Keyboard Entry
>kdb> b btp 16
> EBP EIP Function(args)
>0xdf945f6c 0xc0112d34 schedule+0x2d4 (0xc18627c0, 0xdf944000)
> kernel .text 0xc0100000 0xc0112a60 0xc0112e80
>0xdf945f9c 0xc0105df0 __down_interruptible+0x80 (0xdf945fd8, 0xe082051f, 0xdf945fec, 0xe08189d7, 0x100)
> kernel .text 0xc0100000 0xc0105d70 0xc0105e50
>0xdf945fac 0xc0105eae __down_failed_interruptible+0xa
> kernel .text 0xc0100000 0xc0105ea4 0xc0105eb4
> 0xc0105763 kernel_thread+0x23
> kernel .text 0xc0100000 0xc0105740 0xc0105780
>kdb>

This looks pretty normal for an exception handling thread. It is asleep
waiting for some error condition to wake it up.
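
As a rough user-space analogy (not the kernel code itself), the pattern behind that backtrace can be sketched with a thread that blocks on a semaphore until another thread reports an error. The class and method names below are hypothetical and chosen only for this illustration. The `acquire()` call plays the role of the `__down_interruptible` frame seen in the trace.

```python
import threading


class ErrorHandlerThread(threading.Thread):
    """Toy stand-in for an error-handling thread such as scsi_eh_0."""

    def __init__(self):
        super().__init__(daemon=True)
        self.wakeup = threading.Semaphore(0)  # count 0: acquire() blocks
        self.guard = threading.Lock()
        self.pending = []
        self.stopping = False
        self.handled = []

    def run(self):
        while True:
            # Idle state: the thread sleeps here until release() is called.
            self.wakeup.acquire()
            with self.guard:
                work, self.pending = self.pending, []
                stopping = self.stopping
            for description in work:
                self.handled.append("recovered: " + description)
            if stopping:
                return

    def report_error(self, description):
        """Queue an error and wake the handler thread."""
        with self.guard:
            self.pending.append(description)
        self.wakeup.release()

    def shutdown(self):
        """Ask the handler to exit after processing what is queued."""
        with self.guard:
            self.stopping = True
        self.wakeup.release()
        self.join(timeout=5)


if __name__ == "__main__":
    handler = ErrorHandlerThread()
    handler.start()
    handler.report_error("command timeout on target 2")
    handler.shutdown()
    print(handler.handled)
```

A backtrace taken while no error is outstanding will always show the thread parked in the semaphore wait, so it says nothing about a hang. The interesting trace is the one taken after the timeout has fired.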

--
Justin

2001-10-11 13:05:27

by Alexander Feigl

Subject: Re: PROBLEM: aic7xxx SCSI system hangs

Hi

Sorry for my incomplete console output yesterday. I have since learned why
the [scsi_eh_0] process looked normal when I did a btp last time. I have
also added aic7xxx=verbose and increased the loglevel of my system.

After invoking cdrecord, there is no output from cdrecord for some time
(several minutes). Then there is a huge amount of syslog messages, and
then it is silent again for a while. Then cdrecord prints some error
messages. On the serial console it hangs immediately; on the local text
console it ends with a segmentation fault. Any further invocation of
cdrecord or any other SCSI (generic?) application hangs immediately when
started (even cdrecord -scanbus).
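
A note for reading the verbose output in the log below: the driver prints lines such as "Sending SDTR period c, offset f" in hex, and reports the result as "20.000MHz, offset 15". Here is a minimal sketch of that conversion. It assumes the usual SCSI parallel interface encoding of the SDTR period factor (0x0a = 25 ns, 0x0b = 30.3 ns, 0x0c = 50 ns, 4 ns per unit from 0x0d upward) and an 8-bit bus. The function names are made up for illustration.

```python
def sdtr_period_ns(factor):
    """Convert an SDTR transfer period factor to nanoseconds."""
    special = {0x0A: 25.0, 0x0B: 30.3, 0x0C: 50.0}
    if factor in special:
        return special[factor]
    if factor < 0x0A:
        # Faster rates are negotiated differently; not covered here.
        raise ValueError("period factor below 0x0a is not handled")
    return factor * 4.0


def describe_sdtr(period_hex, offset_hex, bus_width_bytes=1):
    """Turn the hex strings from the log into MHz, offset and MB/s."""
    factor = int(period_hex, 16)
    offset = int(offset_hex, 16)
    if offset == 0:
        # An offset of zero means asynchronous transfers.
        return {"mhz": None, "offset": 0, "mb_per_s": None}
    mhz = 1000.0 / sdtr_period_ns(factor)
    return {"mhz": mhz, "offset": offset, "mb_per_s": mhz * bus_width_bytes}


if __name__ == "__main__":
    # "period c, offset f" as negotiated with target 2
    print(describe_sdtr("c", "f"))
    # "period c, offset 10" as negotiated with target 3
    print(describe_sdtr("c", "10"))
```

The values themselves are unremarkable. What stands out in the log is how often the same negotiation with target 2 is repeated while cdrecord runs.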

Here is the syslog and the kdb backtrace of the [scsi_eh_0] process:

---------------------------------------------------------------------------------------
Linux version 2.4.11-xfs ([email protected]) (gcc version egcs-2.91.66 19990314/Linux (egcs-1.1.2 release / Mandrake Linux 8.1)) #1 Mit Okt 10 20:02:42 CEST 2001
BIOS-provided physical RAM map:
BIOS-e820: 0000000000000000 - 000000000009fc00 (usable)
BIOS-e820: 000000000009fc00 - 00000000000a0000 (reserved)
BIOS-e820: 00000000000f0000 - 0000000000100000 (reserved)
BIOS-e820: 0000000000100000 - 000000001fff0000 (usable)
BIOS-e820: 000000001fff0000 - 000000001fff8000 (ACPI data)
BIOS-e820: 000000001fff8000 - 0000000020000000 (ACPI NVS)
BIOS-e820: 00000000ffff0000 - 0000000100000000 (reserved)
On node 0 totalpages: 131056
zone(0): 4096 pages.
zone(1): 126960 pages.
zone(2): 0 pages.
Kernel command line: BOOT_IMAGE=2411 ro root=306 ramdisk_size=8192 devfs=mount video=riva:off console=ttyS1
Initializing CPU#0
Detected 908.966 MHz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 1808.79 BogoMIPS
Memory: 511756k/524224k available (1025k kernel code, 12080k reserved, 812k data, 228k init, 0k highmem)
kdb version 1.9 by Scott Lurndal, Keith Owens. Copyright SGI, All Rights Reserved
Dentry-cache hash table entries: 65536 (order: 7, 524288 bytes)
Inode-cache hash table entries: 32768 (order: 6, 262144 bytes)
Mount-cache hash table entries: 8192 (order: 4, 65536 bytes)
Buffer-cache hash table entries: 32768 (order: 5, 131072 bytes)
Page-cache hash table entries: 131072 (order: 7, 524288 bytes)
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#0.
CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line)
CPU: L2 Cache: 256K (64 bytes/line)
CPU: AMD Athlon(tm) Processor stepping 02
Enabling fast FPU save and restore... done.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
mtrr: v1.40 (20010327) Richard Gooch ([email protected])
mtrr: detected mtrr type: Intel
PCI: PCI BIOS revision 2.10 entry at 0xfdb71, last bus=1
PCI: Using configuration type 1
PCI: Probing PCI hardware
isapnp: Scanning for PnP cards...
isapnp: No Plug & Play device found
Linux NET4.0 for Linux 2.4
Based upon Swansea University Computer Society NET3.039
Initializing RT netlink socket
IA-32 Microcode Update Driver: v1.08 <[email protected]>
apm: BIOS version 1.2 Flags 0x03 (Driver version 1.14)
Starting kswapd
VFS: Diskquotas version dquot_6.4.0 initialized
devfs: v0.117 (20010927) Richard Gooch ([email protected])
devfs: boot_options: 0x1
pty: 256 Unix98 ptys configured
Serial driver version 5.05c (2001-07-08) with MANY_PORTS SHARE_IRQ SERIAL_PCI ISAPNP enabled
ttyS00 at 0x03f8 (irq = 4) is a 16550A
ttyS01 at 0x02f8 (irq = 3) is a 16550A
Real Time Clock Driver v1.10e
Non-volatile memory driver v1.1
block: 128 slots per queue, batch=16
RAMDISK driver initialized: 16 RAM disks of 8192K size 1024 blocksize
Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
AMD7409: IDE controller on PCI bus 00 dev 39
AMD7409: chipset revision 7
AMD7409: not 100% native mode: will probe irqs later
ide0: BM-DMA at 0xf000-0xf007, BIOS settings: hda:DMA, hdb:pio
ide1: BM-DMA at 0xf008-0xf00f, BIOS settings: hdc:DMA, hdd:pio
hda: QUANTUM FIREBALLlct15 30, ATA DISK drive
hdc: ST340823A, ATA DISK drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
ide1 at 0x170-0x177,0x376 on irq 15
hda: 58633344 sectors (30020 MB) w/418KiB Cache, CHS=3649/255/63, UDMA(33)
hdc: 78165360 sectors (40021 MB) w/1024KiB Cache, CHS=77545/16/63, UDMA(33)
Partition check:
/dev/ide/host0/bus0/target0/lun0: p1 p2 < p5 p6 p7 p8 p9 p10 p11 >
/dev/ide/host0/bus1/target0/lun0: [PTBL] [4865/255/63] p1 p2 < p5 p6 >
Floppy drive(s): fd0 is 1.44M
FDC 0 is a post-1991 82077
NET4: Linux TCP/IP 1.0 for NET4.0
IP Protocols: ICMP, UDP, TCP, IGMP
IP: routing cache hash table of 4096 buckets, 32Kbytes
TCP: Hash tables configured (established 32768 bind 32768)
Linux IP multicast router 0.06 plus PIM-SM
NET4: Unix domain sockets 1.0/SMP for Linux NET4.0.
RAMDISK: Compressed image found at block 0
Freeing initrd memory: 1552k freed
VFS: Mounted root (ext2 filesystem).
Mounted devfs on /dev
SCSI subsystem driver Revision: 1.00
ahc_pci:0:11:0: Reading SEEPROM...done.
ahc_pci:0:11:0: BIOS eeprom is present
ahc_pci:0:11:0: Secondary High byte termination Enabled
ahc_pci:0:11:0: Secondary Low byte termination Enabled
ahc_pci:0:11:0: Primary Low Byte termination Enabled
ahc_pci:0:11:0: Primary High Byte termination Enabled
ahc_pci:0:11:0: Downloading Sequencer Program... 422 instructions downloaded
scsi0 : Adaptec AIC7XXX EISA/VLB/PCI SCSI HBA DRIVER, Rev 6.2.1
<Adaptec 2940 Ultra2 SCSI adapter>
aic7890/91: Ultra2 Wide Channel A, SCSI Id=7, 32/255 SCBs
Vendor: PLEXTOR Model: CD-ROM PX-32TS Rev: 1.02
Type: CD-ROM ANSI SCSI revision: 02
(scsi0:A:2:1): Sending SDTR period c, offset 7f
(scsi0:A:2:1): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2): 20.000MB/s transfers (20.000MHz, offset 15)
scsi0: target 2 synchronous at 20.0MHz, offset = 0xf
Vendor: PLEXTOR Model: CD-R PX-W1210S Rev: 1.03
Type: CD-ROM ANSI SCSI revision: 02
(scsi0:A:3:1): Sending SDTR period c, offset 7f
(scsi0:A:3:1): Received SDTR period c, offset 10
Filtered to period c, offset 10
(scsi0:A:3): 20.000MB/s transfers (20.000MHz, offset 16)
scsi0: target 3 synchronous at 20.0MHz, offset = 0x10
Loading xfs module
SGI XFS with ACLs, EAs, DMAPI, realtime, quota, no debug enabled
Mounting /proc filesystem
Creating root device
XFS mounting filesystem ide0(3,6)
Mounting root filesystem
Remounting devfs at correct place if necessary
Freeing unused kernel memory: 228k freed
INIT: version 2.78 booting
Couldnt get a file descriptor referring to the console
get_console_fd: Das Argument ist ungültig
Standardzeichensatz setzen: [ OK ]
Willkommen zu Mandrake Linux
Drücken Sie "I", um im interaktiven Modus zu starten.
Einhängen des Dateisystems "/proc": [ OK ]
Starten des DevFS Dämons: [ OK ]
klogctl: Das Argument ist ungültig
Unmounting initrd: [ OK ]
Kern-Parameter konfigurieren: [ OK ]
Setting clock (utc): Don Okt 11 14:18:32 CEST 2001 [ OK ]
Swap-Partitionen einbinden: Adding Swap: 1028120k swap-space (priority -1)
Adding Swap: 530104k swap-space (priority -2)
[ OK ]
Setting hostname PowerBox.MysticWorld.de: [ OK ]
Checking root filesystem
[ OK ]
Verzeichnisbaumwurzel im Schreib-/Lese-Modus einhängen: [ OK ]
Finden der Modulabhängigkeiten: [ OK ]
i2c-core.o: i2c core module
i2c-algo-bit.o: i2c bit algorithm module
Linux video capture interface: v1.00
bttv: driver version 0.8.28 loaded [v4l/v4l2]
bttv: using 10 buffers with 2080k (520 pages) each for capture
bttv: Host bridge is Advanced Micro Devices [AMD] AMD-751 [Irongate] System Controller
bttv: Bt8xx card found (0).
bttv0: Bt848 (rev 18) at 00:09.0, irq: 11, latency: 64, memory: 0xeddff000
bttv0: using: BT848A(MIRO PCTV pro) [card=11,insmod option]
i2c-core.o: adapter bt848 #0 registered as adapter 0.
bttv0: i2c: checking for MSP34xx @ 0x80... found
bttv0: miro: id=1 tuner=0 radio=matchbox stereo=yes
bttv0: i2c: checking for MSP34xx @ 0x80... found
i2c-core.o: driver i2c msp3400 driver registered.
msp34xx: init: chip=MSP3410D-B4, has NICAM support
msp3410: daemon started
bttv0: i2c attach [MSP3410D-B4]
i2c-core.o: client [MSP3410D-B4] registered to adapter [bt848 #0](pos. 0).
bttv0: i2c: checking for TDA9875 @ 0xb0... not found
bttv0: i2c: checking for TDA7432 @ 0x8a... not found
tvaudio: TV audio decoder + audio/video mux driver
tvaudio: known chips: tda9840,tda9873h,tda9850,tda9855,tea6300,tea6420,tda8425,pic16c54 (PV951)
i2c-core.o: driver generic i2c audio driver registered.
i2c-core.o: driver i2c TV tuner driver registered.
tuner: chip found @ 0xc0
bttv0: i2c attach [Temic PAL (4002 FH5)]
i2c-core.o: client [Temic PAL (4002 FH5)] registered to adapter [bt848 #0](pos. 1).
devfs: devfs_register(): device already registered: "v4l/video0"
devfs: devfs_register(): device already registered: "v4l/vbi0"
devfs: devfs_register(): device already registered: "v4l/radio0"
md: md driver 0.90.0 MAX_MD_DEVS=256, MD_SB_DISKS=27
Checking filesystems
[ OK ]
Einhängen lokaler Dateisysteme: XFS mounting filesystem ide0(3,5)
XFS mounting filesystem ide1(22,6)
XFS mounting filesystem ide0(3,11)
[ OK ]
Kontrollieren des LoopBack-Dateisystemes[ OK ]
Einhängen der Loopback-Dateisysteme: [ OK ]
Running devfsd actions: [ OK ]
Loading compose keys: compose.latin.inc [ OK ]
The BackSpace key sends: ^?[ OK ]
Attached scsi CD-ROM sr0 at scsi0, channel 0, id 2, lun 0
Attached scsi CD-ROM sr1 at scsi0, channel 0, id 3, lun 0
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
sr0: scsi-1 drive
devfs: devfs_register(): device already registered: "cd"
Uniform CD-ROM driver Revision: 3.12
devfs: devfs_register(): device already registered: "cdrom0"
(scsi0:A:3:0): Sending SDTR period c, offset 10
(scsi0:A:3:0): Received SDTR period c, offset 10
Filtered to period c, offset 10
sr1: scsi3-mmc drive: 32x/32x writer cd/rw xa/form2 cdda tray
devfs: devfs_register(): device already registered: "cd"
devfs: devfs_register(): device already registered: "cdrom1"
Swap-Bereich aktivieren: [ OK ]
Building Window Manager Sessions [ OK ]
Linux Tulip driver version 0.9.15-pre7 (Oct 2, 2001)
tulip0: MII transceiver #1 config 3000 status 7829 advertising 01e1.
eth0: Lite-On 82c168 PNIC rev 32 at 0xd600, 00:A0:CC:59:AC:06, IRQ 5.
INIT: Entering runlevel: 1
Entering non-interactive startup
Allen Prozessen das Signal "TERM" senden ...
INIT: no more processes left in this runlevel
Sende allen Prozessen das Signal "KILL" ...
md: recovery thread got woken up ...
md: recovery thread finished ...
INIT anweisen, in den Einbenutzer-Betrieb zu wechseln.
INIT: Going single user
sh-2.05# modprobe sg
sh-2.05# cdrecord -toc dev=0,2,0
Cdrecord 1.10 (i586-mandrake-linux-gnu) Copyright (C) 1995-2001 Jörg Schilling
(scsi0:A:2:0): Sending SDTR period c, offset 7f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
scsidev: '0,2,0'
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
scsibus: 0 target: 2 lun: 0
(scsi0:A:2:0): Sending SDTR period c, offset f
Linux sg driver version: 3.1.20
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
Using libscg version 'schily-0.5'
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
Device type : Removable CD-ROM
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
Version : 2
(scsi0:A:2:0): Sending SDTR period c, offset f
Response Format: 2
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
Capabilities : SYNC LINKED
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
Vendor_info : 'PLEXTOR '
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
Identifikation : 'CD-ROM PX-32TS '
Revision : '1.02'
Device seems to be: Generic CD-ROM.
Using generic SCSI-2 CD driver (scsi2_cd).
(scsi0:A:2:0): Sending SDTR period c, offset f
Driver flags :
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
first: 1 last 18
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
scsi0:0:2:0: Attempting to queue an ABORT message
scsi0: Dumping Card State in Command phase, at SEQADDR 0xbf
ACCUM = 0x80, SINDEX = 0xa0, DINDEX = 0xe4, ARG_2 = 0x0
HCNT = 0xa
SCSISEQ = 0x12, SBLKCTL = 0xa
DFCNTRL = 0x24, DFSTATUS = 0x80
LASTPHASE = 0x80, SCSISIGI = 0x44, SXFRCTL0 = 0x80
SSTAT0 = 0x0, SSTAT1 = 0x3
STACK == 0x35, 0x17b, 0x165, 0x0
SCB count = 4
Kernel NEXTQSCB = 3
Card NEXTQSCB = 3
QINFIFO entries:
Waiting Queue entries:
Disconnected Queue entries:
QOUTFIFO entries:
Sequencer Free SCB List: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31
Pending list: 2
Kernel Free SCB list: 1 0
Untagged Q(2): 2
DevQ(0:2:0): 0 waiting
DevQ(0:3:0): 0 waiting
scsi0:0:2:0: Device is active, asserting ATN
Recovery code sleeping
Recovery code awake
Timer Expired
aic7xxx_abort returns 8195
scsi0:0:2:0: Attempting to queue a TARGET RESET message
aic7xxx_dev_reset returns 8195
Recovery SCB completes
(scsi0:A:2): 3.300MB/s transfers
scsi0: target 2 using asynchronous transfers
(scsi0:A:3): 3.300MB/s transfers
scsi0: target 3 using asynchronous transfers
scsi0: SCSI bus reset delivered. 1 SCBs aborted.
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2): 20.000MB/s transfers (20.000MHz, offset 15)
scsi0: target 2 synchronous at 20.0MHz, offset = 0xf
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
(scsi0:A:2:0): Sending SDTR period c, offset f
(scsi0:A:2:0): Received SDTR period c, offset f
Filtered to period c, offset f
scsi0:0:2:0: Attempting to queue an ABORT message
scsi0: Dumping Card State in Command phase, at SEQADDR 0xc0
ACCUM = 0x80, SINDEX = 0xa0, DINDEX = 0xe4, ARG_2 = 0x0
HCNT = 0xa
SCSISEQ = 0x12, SBLKCTL = 0xa
DFCNTRL = 0x24, DFSTATUS = 0x80
LASTPHASE = 0x80, SCSISIGI = 0x44, SXFRCTL0 = 0x80
SSTAT0 = 0x0, SSTAT1 = 0x3
STACK == 0x35, 0x17b, 0x165, 0x0
SCB count = 4
Kernel NEXTQSCB = 2
Card NEXTQSCB = 2
QINFIFO entries:
Waiting Queue entries:
Disconnected Queue entries:
QOUTFIFO entries:
Sequencer Free SCB List: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31
Pending list: 3
Kernel Free SCB list: 1 0
Untagged Q(2): 3
DevQ(0:2:0): 0 waiting
DevQ(0:3:0): 0 waiting
scsi0:0:2:0: Device is active, asserting ATN
Recovery code sleeping
Recovery code awake
aic7xxx_abort returns 8194
scsi: device set offline - not ready or command retry failed after bus reset: host 0 channel 0 id 2 lun 0
track: 1 lba: 0 ( 0) 00:02:00 adr: 1 control: 0 mode: -1
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl
cdrecord: No such device or address. Cannot send SCSI cmd via ioctl

Entering kdb (current=0xc02cc000, pid 0) due to Keyboard Entry
kdb> btp 16
EBP EIP Function(args)
0xdf945e7c 0xc0112d34 schedule+0x2d4 (0x3e8, 0xdf945ef0, 0xdf8de080, 0x0, 0xdf944000)
kernel .text 0xc0100000 0xc0112a60 0xc0112e80
0xdf945eb0 0xc011311c wait_for_completion+0x7c
kernel .text 0xc0100000 0xc01130a0 0xc0113140
0xc0105763 kernel_thread+0x23
kernel .text 0xc0100000 0xc0105740 0xc0105780
kdb> ps
Task Addr Pid Parent [*] cpu State Thread Command
0xc181c000 00000001 00000000 0 000 stop 0xc181c270 init
0xc1810000 00000002 00000001 0 000 stop 0xc1810270 keventd
0xc183e000 00000003 00000001 0 000 stop 0xc183e270 kapm-idled
0xc183a000 00000004 00000000 0 000 stop 0xc183a270 ksoftirqd_CPU0
0xc1838000 00000005 00000000 0 000 stop 0xc1838270 kswapd
0xc1836000 00000006 00000000 0 000 stop 0xc1836270 bdflush
0xc1834000 00000007 00000000 0 000 stop 0xc1834270 kupdated
0xdfa16000 00000010 00000001 0 000 stop 0xdfa16270 pagebuf_daemon
0xdf944000 00000016 00000001 0 000 stop 0xdf944270 scsi_eh_0
0xdf7ce000 00000219 00000001 0 000 stop 0xdf7ce270 msp3410 [auto]
0xdf72c000 00000235 00000001 0 000 stop 0xdf72c270 mdrecoveryd
0xdee40000 00000649 00000001 0 000 stop 0xdee40270 init
0xdf780000 00000650 00000649 0 000 stop 0xdf780270 sh
0xdf5b4000 00000653 00000650 0 000 stop 0xdf5b4270 cdrecord

2001-10-11 19:15:07

by Justin T. Gibbs

Subject: Re: PROBLEM: aic7xxx SCSI system hangs

>Hi
>
>Sorry for my incomplete console output yesterday.

Not a problem.

Here's the interesting part...

>scsi0:0:2:0: Attempting to queue an ABORT message
>scsi0: Dumping Card State in Command phase, at SEQADDR 0xbf

The sequencer believes that the last REQ-qualified phase was
the command phase.

>ACCUM = 0x80, SINDEX = 0xa0, DINDEX = 0xe4, ARG_2 = 0x0
>HCNT = 0xa

We're set up to send a 10-byte CDB (command) to the target. No bytes of
the CDB have been transferred yet.

>SCSISEQ = 0x12, SBLKCTL = 0xa
> DFCNTRL = 0x24, DFSTATUS = 0x80

The data fifo is all set to send data...

>LASTPHASE = 0x80, SCSISIGI = 0x44, SXFRCTL0 = 0x80
                              ^^^^
But what's this? The target has us in data-in phase.

>SSTAT0 = 0x0, SSTAT1 = 0x3

But this phase, according to the hardware, was never qualified by
a REQ, so we never see this change and fall out of the loop that is
trying to process the command phase.
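
As an illustration of the register reading above (not part of the driver), the
phase bits of LASTPHASE and SCSISIGI can be decoded with a short Python sketch.
The bit masks are assumed to follow the aic7xxx register layout (C/D = 0x80,
I/O = 0x40, MSG = 0x20), and decode_phase is a hypothetical helper name.

```python
# Phase encodings on the C/D, I/O and MSG lines. The mask values are an
# assumption based on the aic7xxx register layout; the remaining bits of
# SCSISIGI (ATN, SEL, BSY, REQ, ACK) are ignored here.
CDI, IOI, MSGI = 0x80, 0x40, 0x20
PHASE_MASK = CDI | IOI | MSGI

PHASES = {
    0x00: "data-out",
    IOI: "data-in",
    CDI: "command",
    CDI | MSGI: "message-out",
    CDI | IOI: "status",
    CDI | IOI | MSGI: "message-in",
}


def decode_phase(reg):
    """Return the SCSI bus phase encoded in the top three bits of reg."""
    return PHASES.get(reg & PHASE_MASK, "reserved")


# Values taken from the card state dump quoted above.
lastphase = 0x80   # phase the sequencer last saw qualified by REQ
scsisigi = 0x44    # what the bus signals show right now
hcnt = 0xA         # bytes still to be sent

print("LASTPHASE:", decode_phase(lastphase))   # command
print("SCSISIGI: ", decode_phase(scsisigi))    # data-in
print("HCNT:     ", hcnt, "bytes outstanding")
```

The mismatch between the two decoded phases is the disagreement described
above: the sequencer is still working on the command phase while the bus
already shows data-in.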

To sum up, from time to time, the controller sees the first REQ for
the data-in phase that follows the command phase, prior to seeing the
phase lines change to data-in. This is caused either by the Plextor
not allowing the proper bus-settle time for the phase change to be
seen prior to asserting REQ *OR* your cabling is poor (too long,
marginal/bent pin, incorrect termination, etc.) giving a similar
result.
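
A bus trace would show this kind of violation as a first REQ that arrives too
soon after the phase lines change. The sketch below checks a trace for that
condition. It assumes the SCSI-2 bus settle delay of 400 ns as the minimum gap;
the event format, the function name and the sample trace are all made up for
illustration and are not measurements from this system.

```python
BUS_SETTLE_DELAY_NS = 400  # assumed minimum gap (SCSI-2 bus settle delay)


def first_req_violations(events, settle_ns=BUS_SETTLE_DELAY_NS):
    """Find phases whose first REQ came before the bus had settled.

    events is a time-ordered list of (time_ns, kind, value) tuples, where
    kind is "phase" (value = phase name) or "req" (value unused).
    Returns a list of (phase_name, gap_ns) for each early first REQ.
    """
    violations = []
    phase_time = None
    phase_name = None
    for time_ns, kind, value in events:
        if kind == "phase":
            phase_time, phase_name = time_ns, value
        elif kind == "req" and phase_time is not None:
            gap = time_ns - phase_time
            if gap < settle_ns:
                violations.append((phase_name, gap))
            phase_time = None  # only the first REQ of each phase is checked
    return violations


# Hypothetical trace: the data-in phase gets its first REQ after only 100 ns.
trace = [
    (0, "phase", "command"),
    (500, "req", None),
    (9000, "phase", "data-in"),
    (9100, "req", None),
    (9600, "req", None),
]
print(first_req_violations(trace))  # [('data-in', 100)]
```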

As for why you cannot talk to the device after a while, the device
has been set offline. The controller was unable to talk to it
successfully, so the SCSI layer decided to ignore it.

--
Justin

2001-10-11 19:47:37

by Alexander Feigl

Subject: Re: PROBLEM: aic7xxx SCSI system hangs

On Thu, 2001-10-11 at 21:15, 1002827721 wrote:
>
> To sum up, from time to time, the controller sees the first REQ for
> the data-in phase that follows the command phase, prior to seeing the
> phase lines change to data-in. This is caused either by the Plextor
> not allowing the proper bus-settle time for the phase change to be
> seen prior to asserting REQ *OR* your cabling is poor (too long,
> marginal/bent pin, incorrect termination, etc.) giving a similar
> result.
>

I am rather sure that the cabling is OK. Besides, I am not a SCSI newbie
(although I have no idea how it works internally) - everything
else works without problems: high-data-rate reading of a CD-ROM,
ripping, reading the TOC of a data CD-ROM and reading the TOC of a CDDA
with cdda2wav. On my system three things have to come together:

1) I use cdrecord -toc to read the TOC
2) A CDDA disc is inserted in the drive
3) I use my Plextor PX 32TS drive

The problem is 100% reproducible here.

A co-developer of my project has similar problems with reading the TOC.
But he owns a Plextor PX40 drive and has problems with cdda2wav. I
cannot remember if it hangs when reading CDDA or CD-ROM TOCs on his
machine. cdda2wav is working fine on my machine and I am unable to hang
the SCSI subsystem with it - he gets reproducible hangs on his machine.
cdrecord works fine on his machine and causes problems on mine.

> As for why you cannot talk to the device after a while, the device
> has been set offline. The controller was unable to talk to it
> successfully, so the SCSI layer decided to ignore it.
>

If the device were only set offline, it would be a minor problem. I could
probably do rmmod/insmod or scsi-remove-single-device and re-add it. But as
I wrote in my first posts, even a cdrecord -scanbus or accesses to other
drives on the same controller hang and stay in uninterruptible state
after this call. The processes cannot be killed, the module cannot be
unloaded and I have to reboot to do any SCSI accesses again.
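
For reference, the remove/re-add step mentioned above is normally done on 2.4
kernels by writing a command string to /proc/scsi/scsi. The sketch below builds
those strings for the drive in question (host 0, channel 0, id 2, lun 0, as
shown in the "device set offline" message). The helper names are hypothetical,
and with the hang described here the write itself may well block.

```python
PROC_SCSI = "/proc/scsi/scsi"


def single_device_cmd(action, host, channel, target, lun):
    """Build an add/remove-single-device command for /proc/scsi/scsi."""
    if action not in ("add", "remove"):
        raise ValueError("action must be 'add' or 'remove'")
    return "scsi %s-single-device %d %d %d %d" % (
        action, host, channel, target, lun)


def send(cmd, path=PROC_SCSI):
    """Write the command to the proc file (needs root; not called here)."""
    with open(path, "w") as proc_file:
        proc_file.write(cmd + "\n")


# Commands for the offlined drive: host 0, channel 0, id 2, lun 0.
print(single_device_cmd("remove", 0, 0, 2, 0))
print(single_device_cmd("add", 0, 0, 2, 0))
```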

Alexander Feigl

2001-10-11 19:59:31

by Justin T. Gibbs

Subject: Re: PROBLEM: aic7xxx SCSI system hangs

>I am rather sure that the cabling is OK. Besides, I am not a SCSI newbie
>(although I have no idea how it works internally) - everything
>else works without problems: high-data-rate reading of a CD-ROM,
>ripping, reading the TOC of a data CD-ROM and reading the TOC of a CDDA
>with cdda2wav. On my system three things have to come together:
>
>1) I use cdrecord -toc to read the TOC
>2) A CDDA disc is inserted in the drive
>3) I use my Plextor PX 32TS drive
>
>The problem is 100% reproducible here.

That's great and all, but it doesn't change the fact that the
problem, from the Adaptec chip's perspective, is a protocol
violation *on the bus*. I'm willing to believe that there is
a chip bug in the aic7896/97 if you can provide a SCSI bus trace
indicating that everything is normal on the SCSI bus.

>A co-developer of my project has similar problems with reading the TOC.

On the same or different controller?

>> As for why you cannot talk to the device after a while, the device
>> has been set offline. The controller was unable to talk to it
>> successfully, so the SCSI layer decided to ignore it.
>
>If the device were only set offline, it would be a minor problem. I could
>probably do rmmod/insmod or scsi-remove-single-device and re-add it. But as
>I wrote in my first posts, even a cdrecord -scanbus or accesses to other
>drives on the same controller hang and stay in uninterruptible state
>after this call. The processes cannot be killed, the module cannot be
>unloaded and I have to reboot to do any SCSI accesses again.

To diagnose why things are hung will require you to grovel around
in the SCSI data structures to determine why I/O is hung. Perhaps
use kdb to change the SCSI logging level after the hangs start, then
try accessing another device and see where the traces lead you? I
cannot reproduce the problem here, so it is difficult for me to debug
it.

--
Justin