2003-01-13 02:59:09

by Jason Thomas

[permalink] [raw]
Subject: ide-scsi bug hard locks machine

Hi all, heres the kernel logs from my machine which hard locks.

If you need more info email me directly I'm not subscribed.

Thanks.

ptelnet log 1: 13 Jan 2003 1:51 pm
ide-scsi: abort called for 2
bad: scheduling while atomic!
Call Trace: [<c0116941>] [<c0109b76>] [<c01169b0>] [<c011a5db>] [<c0109d8c>] [<c0263100>] [<c0262730>] [<c026848c>] [<c0268390>] [<c0261e63>] [<c0261cc0>] [<c0261c80>] [<c026225d>] [<c02622d7>] [<c0262b65>] [<c0109d97>] [<c0262c89>] [<c0262bb0>] [<c0108be9>]
hde: lost interrupt
ide-scsi: CoD != 0 in idescsi_pc_intr
hde: ATAPI reset complete
hde: irq timeout: status=0xc0 { Busy }
hde: ATAPI reset complete
hde: irq timeout: status=0xc0 { Busy }
hde: ATAPI reset complete
hde: irq timeout: status=0xc0 { Busy }
ide-scsi: reset called for 2
------------[ cut here ]------------
kernel BUG at drivers/scsi/ide-scsi.c:483!
invalid operand: 0000
CPU: 0
EIP: 0060:[<c0267845>] Not tainted
EFLAGS: 00010286
eax: c024ab50 ebx: c0448d84 ecx: edae15ed edx: 0000d002
esi: d7c96800 edi: d7c93bc0 ebp: 00000000 esp: d7c4be5c
ds: 007b es: 007b ss: 0068
Process scsi_eh_0 (pid: 11, threadinfo=d7c4a000 task=d7c498c0)
Stack: 0000d002 c0448d84 00000008 00000080 0000001e c0448d84 c0448d84 00000000
d7d023c0 c0246bd9 c0448d84 d7c93bc0 00000000 00000088 0000001e d7c4a000
d7d023c0 c0448d84 d7d7eb80 c0246dd9 c0448d84 d7d023c0 c0448e2c d7c4a000
Call Trace: [<c0246bd9>] [<c0246dd9>] [<c02474ff>] [<c0247443>] [<c026825b>] [<c0261dbc>] [<c0261cc0>] [<c0261c80>] [<c026225d>] [<c0262417>] [<c02622d7>] [<c0262b79>] [<c0109d97>] [<c0262c89>] [<c0262bb0>] [<c0108be9>]
Code: 0f 0b e3 01 2b 07 38 c0 8b 57 38 a1 c8 d0 42 c0 89 d1 29 c1
hde: ATAPI reset complete
hde: status error: status=0x48 { DriveReady DataRequest }
hde: drive not ready for command
hde: lost interrupt
ide-scsi: CoD != 0 in idescsi_pc_intr
hde: ATAPI reset complete
hde: irq timeout: status=0xc0 { Busy }
hde: ATAPI reset complete
hde: irq timeout: status=0xc0 { Busy }
hde: ATAPI reset complete
hde: irq timeout: status=0xc0 { Busy }

--
Jason Thomas Phone: +61 2 6257 7111
Unix System Administrator Fax: +61 2 6257 7311
tSA Consulting Group Pty. Ltd. Mobile: 0418 29 66 81
1 Hall Street Lyneham ACT 2602 http://www.topic.com.au/


2003-01-13 05:04:48

by Jason Thomas

[permalink] [raw]
Subject: Re: ide-scsi bug hard locks machine

so after some coaching, below should be some useful information.

I have a liteon 48x cd burner connected to a promise controller using
ide-scsi, this currently works fine on 2.5.52. I think this also
happened with 2.5.54 and I gave up on that one. The output below is from
2.5.56.

Thanks


ksymoops 2.4.8 on i686 2.5.52. Options used
-V (default)
-k /proc/ksyms (default)
-l /proc/modules (default)
-o /lib/modules/2.5.52/ (default)
-m /boot/System.map-2.5.56 (specified)

Error (regular_file): read_ksyms stat /proc/ksyms failed
No modules in ksyms, skipping objects
No ksyms, skipping lsmod
Call Trace: [<c0116941>] [<c0109b76>] [<c01169b0>] [<c011a5db>] [<c0109d8c>] [<c0263100>] [<c0262730>] [<c026848c>] [<c0268390>] [<c0261e63>] [<c0261cc0>] [<c0261c80>] [<c026225d>] [<c02622d7>] [<c0262b65>] [<c0109d97>] [<c0262c89>] [<c0262bb0>] [<c0108be9>]
Warning (Oops_read): Code line not seen, dumping what data is available


Trace; c0116941 <schedule+2f1/300>
Trace; c0109b76 <__down+96/100>
Trace; c01169b0 <default_wake_function+0/40>
Trace; c011a5db <call_console_drivers+5b/120>
Trace; c0109d8c <__down_failed+8/c>
Trace; c0263100 <.text.lock.scsi_error+2d/6d>
Trace; c0262730 <scsi_sleep_done+0/20>
Trace; c026848c <idescsi_abort+fc/110>
Trace; c0268390 <idescsi_abort+0/110>
Trace; c0261e63 <scsi_send_eh_cmnd+153/1a0>
Trace; c0261cc0 <scsi_eh_done+0/50>
Trace; c0261c80 <scsi_eh_times_out+0/40>
Trace; c026225d <scsi_eh_tur+9d/d0>
Trace; c02622d7 <scsi_eh_abort_cmd+47/70>
Trace; c0262b65 <scsi_unjam_host+a5/f0>
Trace; c0109d97 <__down_failed_interruptible+7/c>
Trace; c0262c89 <scsi_error_handler+d9/110>
Trace; c0262bb0 <scsi_error_handler+0/110>
Trace; c0108be9 <kernel_thread_helper+5/c>

kernel BUG at drivers/scsi/ide-scsi.c:483!
invalid operand: 0000
CPU: 0
EIP: 0060:[<c0267845>] Not tainted
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010286
eax: c024ab50 ebx: c0448d84 ecx: edae15ed edx: 0000d002
esi: d7c96800 edi: d7c93bc0 ebp: 00000000 esp: d7c4be5c
ds: 007b es: 007b ss: 0068
Stack: 0000d002 c0448d84 00000008 00000080 0000001e c0448d84 c0448d84 00000000
d7d023c0 c0246bd9 c0448d84 d7c93bc0 00000000 00000088 0000001e d7c4a000
d7d023c0 c0448d84 d7d7eb80 c0246dd9 c0448d84 d7d023c0 c0448e2c d7c4a000
Call Trace: [<c0246bd9>] [<c0246dd9>] [<c02474ff>] [<c0247443>] [<c026825b>] [<c0261dbc>] [<c0261cc0>] [<c0261c80>] [<c026225d>] [<c0262417>] [<c02622d7>] [<c0262b79>] [<c0109d97>] [<c0262c89>] [<c0262bb0>] [<c0108be9>]
Code: 0f 0b e3 01 2b 07 38 c0 8b 57 38 a1 c8 d0 42 c0 89 d1 29 c1


>>EIP; c0267845 <idescsi_transfer_pc+a5/110> <=====

>>eax; c024ab50 <atapi_reset_pollfunc+0/120>
>>ebx; c0448d84 <ide_hwifs+f64/4998>

Trace; c0246bd9 <start_request+f9/1b0>
Trace; c0246dd9 <ide_do_request+f9/210>
Trace; c02474ff <ide_do_drive_cmd+9f/100>
Trace; c0247443 <ide_init_drive_cmd+23/40>
Trace; c026825b <idescsi_queue+1eb/320>
Trace; c0261dbc <scsi_send_eh_cmnd+ac/1a0>
Trace; c0261cc0 <scsi_eh_done+0/50>
Trace; c0261c80 <scsi_eh_times_out+0/40>
Trace; c026225d <scsi_eh_tur+9d/d0>
Trace; c0262417 <scsi_eh_bus_device_reset+77/c0>
Trace; c02622d7 <scsi_eh_abort_cmd+47/70>
Trace; c0262b79 <scsi_unjam_host+b9/f0>
Trace; c0109d97 <__down_failed_interruptible+7/c>
Trace; c0262c89 <scsi_error_handler+d9/110>
Trace; c0262bb0 <scsi_error_handler+0/110>
Trace; c0108be9 <kernel_thread_helper+5/c>

Code; c0267845 <idescsi_transfer_pc+a5/110>
00000000 <_EIP>:
Code; c0267845 <idescsi_transfer_pc+a5/110> <=====
0: 0f 0b ud2a <=====
Code; c0267847 <idescsi_transfer_pc+a7/110>
2: e3 01 jecxz 5 <_EIP+0x5>
Code; c0267849 <idescsi_transfer_pc+a9/110>
4: 2b 07 sub (%edi),%eax
Code; c026784b <idescsi_transfer_pc+ab/110>
6: 38 c0 cmp %al,%al
Code; c026784d <idescsi_transfer_pc+ad/110>
8: 8b 57 38 mov 0x38(%edi),%edx
Code; c0267850 <idescsi_transfer_pc+b0/110>
b: a1 c8 d0 42 c0 mov 0xc042d0c8,%eax
Code; c0267855 <idescsi_transfer_pc+b5/110>
10: 89 d1 mov %edx,%ecx
Code; c0267857 <idescsi_transfer_pc+b7/110>
12: 29 c1 sub %eax,%ecx


1 warning and 1 error issued. Results may not be reliable.



output from lspci:

00:00.0 Host bridge: VIA Technologies, Inc. VT8363/8365 [KT133/KM133] (rev 03)
00:01.0 PCI bridge: VIA Technologies, Inc. VT8363/8365 [KT133/KM133 AGP]
00:07.0 ISA bridge: VIA Technologies, Inc. VT82C686 [Apollo Super South] (rev 40)
00:07.2 USB Controller: VIA Technologies, Inc. USB (rev 1a)
00:07.3 USB Controller: VIA Technologies, Inc. USB (rev 1a)
00:07.4 Bridge: VIA Technologies, Inc. VT82C686 [Apollo Super ACPI] (rev 40)
00:09.0 Ethernet controller: Digital Equipment Corporation DECchip 21140 [FasterNet] (rev 22)
00:0a.0 Unknown mass storage controller: Promise Technology, Inc. 20267 (rev 02)
00:0b.0 Multimedia video controller: Brooktree Corporation Bt878 Video Capture (rev 02)
00:0b.1 Multimedia controller: Brooktree Corporation Bt878 Audio Capture (rev 02)
00:0c.0 Multimedia audio controller: Creative Labs SB Live! EMU10k1 (rev 08)
00:0c.1 Input device controller: Creative Labs SB Live! MIDI/Game Port (rev 08)
01:00.0 VGA compatible controller: nVidia Corporation NV6 [Vanta] (rev 15)


--
Jason Thomas Phone: +61 2 6257 7111
Unix System Administrator Fax: +61 2 6257 7311
tSA Consulting Group Pty. Ltd. Mobile: 0418 29 66 81
1 Hall Street Lyneham ACT 2602 http://www.topic.com.au/

2003-01-13 14:06:48

by Alan

[permalink] [raw]
Subject: Re: ide-scsi bug hard locks machine

On Mon, 2003-01-13 at 03:07, Jason Thomas wrote:
> ide-scsi: abort called for 2
> bad: scheduling while atomic!
> Call Trace: [<c0116941>] [<c0109b76>] [<c01169b0>] [<c011a5db>] [<c0109d8c>] [<c0263100>] [<c0262730>] [<c026848c>] [<c0268390>] [<c0261e63>] [<c0261cc0>] [<c0261c80>] [<c026225d>] [<c02622d7>] [<c0262b65>] [<c0109d97>] [<c0262c89>] [<c0262bb0>] [<c0108be9>]
> hde: lost interrupt
> ide-scsi: CoD != 0 in idescsi_pc_intr
> hde: ATAPI reset complete
> hde: irq timeout: status=0xc0 { Busy }
> hde: ATAPI reset complete
> hde: irq timeout: status=0xc0 { Busy }
> hde: ATAPI reset complete
> hde: irq timeout: status=0xc0 { Busy }
> ide-scsi: reset called for 2

Someone broke ide-scsi in 2.5. I've not had time to investigate why yet. If you
need ide-scsi use 2.4.

2003-01-14 02:11:23

by Jason Thomas

[permalink] [raw]
Subject: Re: ide-scsi bug hard locks machine

Some more info for those interested, it seems to have started in 2.5.53
with the implementation of the mid level api.

heres a trace from 2.5.53 maybe this will make it easier to find.

Unable to handle kernel NULL pointer dereference at virtual address 00000030
printing eip:
c0252ba3
*pde = 00000000
Oops: 0000
CPU: 0
EIP: 0060:[<c0252ba3>] Not tainted
EFLAGS: 00010086
EIP is at idescsi_abort+0x73/0x110
eax: 00000002 ebx: 00000086 ecx: d7cc5200 edx: 00000000
esi: d7c3e000 edi: d7c3e000 ebp: 00000007 esp: d7c3ff60
ds: 0068 es: 0068 ss: 0068
Process scsi_eh_0 (pid: 9, threadinfo=d7c3e000 task=d7d1a680)
Stack: c030dee0 00000002 d7c3e000 d7c8c980 c040e1ec d7c3e000 00000202 d7c3e000
00000000 c024c7a2 d7cc5200 d7cc5200 d7cdf180 d7c3e000 c024c8c8 d7cc5200
d7cdf180 d7cdf180 d7c3ffd4 c024d165 d7cc5200 d7cdf180 c0109d97 d7cc5200
Call Trace:
[<c024c7a2>] scsi_try_to_abort_cmd+0x62/0x80
[<c024c8c8>] scsi_eh_abort_cmd+0x38/0x70
[<c024d165>] scsi_unjam_host+0xa5/0xf0
[<c0109d97>] __down_failed_interruptible+0x7/0xc
[<c024d289>] scsi_error_handler+0xd9/0x110
[<c024d1b0>] scsi_error_handler+0x0/0x110
[<c0108bfd>] kernel_thread_helper+0x5/0x18

Code: 39 42 30 74 67 8b 44 24 10 83 c0 10 89 04 24 e8 69 55 fc ff
<6>note: scsi_eh_0[9] exited with preempt_count 2


On Mon, Jan 13, 2003 at 03:02:53PM +0000, Alan Cox wrote:
> Someone broke ide-scsi in 2.5. I've not had time to investigate why yet. If you
> need ide-scsi use 2.4.