2003-06-01 22:15:43

by Robert Murray

[permalink] [raw]
Subject: ide problem - is this a known problem, or is the disk dead?


Hi

I'm running 2.4.21-rc6, and one of my drives has failed with the following messages. Does this indicate a hardware failure?

All but one of the partitions are raid1, so they still work. The one raid0 partition, which holds less important stuff like squid cache files, obviously doesn't. I tried to remount it read-only, but mount hangs for ever. It would be nice if linux could deal with hardware failures like this gracefully, and allow the fs to be unmounted.

here is the log:

Jun 1 06:28:00 r2d2 kernel: hdc: dma_timer_expiry: dma status == 0x21
Jun 1 06:28:10 r2d2 kernel: hdc: timeout waiting for DMA
Jun 1 06:28:10 r2d2 kernel: hdc: timeout waiting for DMA
Jun 1 06:28:10 r2d2 kernel: hdc: (__ide_dma_test_irq) called while not waiting
Jun 1 06:28:10 r2d2 kernel: hdc: status timeout: status=0xd0 { Busy }
Jun 1 06:28:10 r2d2 kernel:
Jun 1 06:28:10 r2d2 kernel: hdc: drive not ready for command
Jun 1 06:28:40 r2d2 kernel: ide1: reset timed-out, status=0xd0
Jun 1 06:28:40 r2d2 kernel: hdc: status timeout: status=0xd0 { Busy }
Jun 1 06:29:11 r2d2 kernel:
Jun 1 06:29:11 r2d2 kernel: hdc: drive not ready for command
Jun 1 06:29:11 r2d2 kernel: ide1: reset timed-out, status=0xd0
Jun 1 06:29:11 r2d2 kernel: blk: queue c03886b4, I/O limit 4095Mb (mask 0xfffff
fff)
Jun 1 06:29:11 r2d2 kernel: end_request: I/O error, dev 16:06 (hdc), sector 905
464
Jun 1 06:29:11 r2d2 kernel: raid1: Disk failure on ide/host0/bus1/target0/lun0/
part6, disabling device.
Jun 1 06:29:11 r2d2 kernel: ^IOperation continuing on 1 devices
Jun 1 06:29:11 r2d2 kernel: md: updating md3 RAID superblock on device
Jun 1 06:29:11 r2d2 kernel: md: (skipping new-faulty ide/host0/bus1/target0/lun
0/part6 )
Jun 1 06:29:11 r2d2 kernel: md: ide/host0/bus0/target0/lun0/part6 [events: 0000
0077]<6>(write) ide/host0/bus0/target0/lun0/part6's sb offset: 94095168
Jun 1 06:29:11 r2d2 kernel: raid1: ide/host0/bus1/target0/lun0/part6: reschedul
ing block 905464
Jun 1 06:29:11 r2d2 kernel: end_request: I/O error, dev 16:06 (hdc), sector 192
0
Jun 1 06:29:11 r2d2 kernel: end_request: I/O error, dev 16:06 (hdc), sector 192
8
Jun 1 06:29:11 r2d2 kernel: end_request: I/O error, dev 16:06 (hdc), sector 193
6
Jun 1 06:29:11 r2d2 kernel: raid1: ide/host0/bus0/target0/lun0/part6: redirecti
ng sector 905464 to another mirror
Jun 1 06:29:11 r2d2 kernel: end_request: I/O error, dev 16:06 (hdc), sector 194
4
Jun 1 06:29:11 r2d2 kernel: end_request: I/O error, dev 16:06 (hdc), sector 195
2
Jun 1 06:29:11 r2d2 kernel: end_request: I/O error, dev 16:06 (hdc), sector 196
0
Jun 1 06:29:11 r2d2 kernel: end_request: I/O error, dev 16:06 (hdc), sector 196
8
Jun 1 06:29:11 r2d2 kernel: end_request: I/O error, dev 16:06 (hdc), sector 197
6
Jun 1 06:29:11 r2d2 kernel: end_request: I/O error, dev 16:06 (hdc), sector 198
4
Jun 1 06:29:11 r2d2 kernel: end_request: I/O error, dev 16:06 (hdc), sector 199
2
[...]
more of the same.

Get back to me if you need more info. I'll try the drive in another machine soon to check it.


Cheers

Rob



2003-06-02 16:10:53

by Alan

[permalink] [raw]
Subject: Re: ide problem - is this a known problem, or is the disk dead?

On Sul, 2003-06-01 at 23:28, Robert Murray wrote:
> Jun 1 06:28:00 r2d2 kernel: hdc: dma_timer_expiry: dma status == 0x21
> Jun 1 06:28:10 r2d2 kernel: hdc: timeout waiting for DMA
> Jun 1 06:28:10 r2d2 kernel: hdc: timeout waiting for DMA
> Jun 1 06:28:10 r2d2 kernel: hdc: (__ide_dma_test_irq) called while not waiting
> Jun 1 06:28:10 r2d2 kernel: hdc: status timeout: status=0xd0 { Busy }
> Jun 1 06:28:10 r2d2 kernel:
> Jun 1 06:28:10 r2d2 kernel: hdc: drive not ready for command
> Jun 1 06:28:40 r2d2 kernel: ide1: reset timed-out, status=0xd0
> Jun 1 06:28:40 r2d2 kernel: hdc: status timeout: status=0xd0 { Busy }
> Jun 1 06:29:11 r2d2 kernel:

Its hard to tell if thats a dead disk or just terminally confused. The
drive stopped taking to us, set itself Busy and never came back even
after we tried to reset it.


2003-06-02 17:52:23

by Protasevich, Natalie

[permalink] [raw]
Subject: Re: ide problem - is this a known problem, or is the disk dead?

On Sul, 2003-06-01 at 23:28, Robert Murray wrote:
> Jun 1 06:28:00 r2d2 kernel: hdc: dma_timer_expiry: dma status == 0x21
> Jun 1 06:28:10 r2d2 kernel: hdc: timeout waiting for DMA
> Jun 1 06:28:10 r2d2 kernel: hdc: timeout waiting for DMA
> Jun 1 06:28:10 r2d2 kernel: hdc: (__ide_dma_test_irq) called while not
waiting
> Jun 1 06:28:10 r2d2 kernel: hdc: status timeout: status=0xd0 { Busy }
> Jun 1 06:28:10 r2d2 kernel:
> Jun 1 06:28:10 r2d2 kernel: hdc: drive not ready for command
> Jun 1 06:28:40 r2d2 kernel: ide1: reset timed-out, status=0xd0
> Jun 1 06:28:40 r2d2 kernel: hdc: status timeout: status=0xd0 { Busy }
> Jun 1 06:29:11 r2d2 kernel:

I had exact same problem when I had both SCSI and IDE drives on my system
(and boot drive was SCSI). If you check out file drivers/ide/ide-geometry.c,
the potential hazard is spelled out in the comment:
/*
* I did this, but it doesnt work - there is no reasonable way to find the
* correspondence between the BIOS numbering of the disks and the Linux
* numbering. -aeb
*
* The code below is bad. One of the problems is that drives 1 and 2
* may be SCSI disks (even when IDE disks are present), so that
* the geometry we read here from BIOS is attributed to the wrong disks.
* Consequently, also the former "drive->present = 1" below was a mistake.
*
* Eventually the entire routine below should be removed.
*
*/
void probe_cmos_for_drives (ide_hwif_t *hwif)
{
...
... and it was removed in 2.5. I avoided this by just "return"-ing from this
routine.

--Natalie

2003-06-02 20:32:17

by Gabor Z. Papp

[permalink] [raw]
Subject: Re: ide problem - is this a known problem, or is the disk dead?

* Alan Cox <[email protected]>:

|> Jun 1 06:28:00 r2d2 kernel: hdc: dma_timer_expiry: dma status == 0x21
|> Jun 1 06:28:10 r2d2 kernel: hdc: timeout waiting for DMA
|> Jun 1 06:28:10 r2d2 kernel: hdc: timeout waiting for DMA
|> Jun 1 06:28:10 r2d2 kernel: hdc: (__ide_dma_test_irq) called while not waiting
|> Jun 1 06:28:10 r2d2 kernel: hdc: status timeout: status=0xd0 { Busy }
|> Jun 1 06:28:10 r2d2 kernel:
|> Jun 1 06:28:10 r2d2 kernel: hdc: drive not ready for command
|> Jun 1 06:28:40 r2d2 kernel: ide1: reset timed-out, status=0xd0
|> Jun 1 06:28:40 r2d2 kernel: hdc: status timeout: status=0xd0 { Busy }
|> Jun 1 06:29:11 r2d2 kernel:
|
| Its hard to tell if thats a dead disk or just terminally confused. The
| drive stopped taking to us, set itself Busy and never came back even
| after we tried to reset it.

I'm running 2.4.21-rc3 on the host where i got this today:

hdc: dma_timer_expiry: dma status == 0x20
hdc: timeout waiting for DMA
hdc: timeout waiting for DMA
hdc: (__ide_dma_test_irq) called while not waiting
hdc: status timeout: status=0xd0 { Busy }

hdc: drive not ready for command
ide1: reset timed-out, status=0xd0
hdc: status timeout: status=0xd0 { Busy }

hdc: drive not ready for command
ide1: reset timed-out, status=0xd0
end_request: I/O error, dev 16:01 (hdc), sector 78496
end_request: I/O error, dev 16:01 (hdc), sector 78496
end_request: I/O error, dev 16:01 (hdc), sector 20328
end_request: I/O error, dev 16:01 (hdc), sector 20336
end_request: I/O error, dev 16:01 (hdc), sector 20344
end_request: I/O error, dev 16:01 (hdc), sector 20352
end_request: I/O error, dev 16:01 (hdc), sector 20360
end_request: I/O error, dev 16:01 (hdc), sector 20368
end_request: I/O error, dev 16:01 (hdc), sector 25427984
end_request: I/O error, dev 16:01 (hdc), sector 20376
end_request: I/O error, dev 16:01 (hdc), sector 20384
end_request: I/O error, dev 16:01 (hdc), sector 20392
end_request: I/O error, dev 16:01 (hdc), sector 20400
end_request: I/O error, dev 16:01 (hdc), sector 20408
end_request: I/O error, dev 16:01 (hdc), sector 20416
end_request: I/O error, dev 16:01 (hdc), sector 20424
end_request: I/O error, dev 16:01 (hdc), sector 16
end_request: I/O error, dev 16:01 (hdc), sector 37486592
end_request: I/O error, dev 16:01 (hdc), sector 37486608
end_request: I/O error, dev 16:01 (hdc), sector 54788120
end_request: I/O error, dev 16:01 (hdc), sector 0
end_request: I/O error, dev 16:01 (hdc), sector 37486608
EXT3-fs error (device ide1(22,1)): ext3_get_inode_loc: unable to read inode block - inode=2338338, block=4685826
Aborting journal on device ide1(22,1).
end_request: I/O error, dev 16:01 (hdc), sector 4176
Remounting filesystem read-only
end_request: I/O error, dev 16:01 (hdc), sector 0
EXT3-fs error (device ide1(22,1)) in ext3_reserve_inode_write: IO failure
EXT3-fs error (device ide1(22,1)) in ext3_orphan_add: IO failure
EXT3-fs error (device ide1(22,1)) in ext3_truncate: IO failure
EXT3-fs error (device ide1(22,1)) in start_transaction: Journal has aborted

My setup looks like:

Memory: 516616k/524204k available (1051k kernel code, 7200k reserved, 228k data, 76k init, 0k highmem)
CPU: Intel(R) Celeron(TM) CPU 1400MHz stepping 01
Uniform Multi-Platform E-IDE driver Revision: 7.00beta3-.2.4
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
ICH2: IDE controller at PCI slot 00:1f.1
ICH2: chipset revision 17
ICH2: not 100% native mode: will probe irqs later
ide0: BM-DMA at 0xb800-0xb807, BIOS settings: hda:DMA, hdb:pio
ide1: BM-DMA at 0xb808-0xb80f, BIOS settings: hdc:DMA, hdd:pio
hda: IC25N040ATCS04-0, ATA DISK drive
blk: queue c0278660, I/O limit 4095Mb (mask 0xffffffff)
hdc: IC25N040ATCS04-0, ATA DISK drive
blk: queue c0278ab4, I/O limit 4095Mb (mask 0xffffffff)
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
ide1 at 0x170-0x177,0x376 on irq 15
hda: attached ide-disk driver.
hda: host protected area => 1
hda: 78140160 sectors (40008 MB) w/1768KiB Cache, CHS=4864/255/63, UDMA(100)
hdc: attached ide-disk driver.
hdc: host protected area => 1
hdc: 78140160 sectors (40008 MB) w/1768KiB Cache, CHS=77520/16/63, UDMA(100)
Partition check:
/dev/ide/host0/bus0/target0/lun0: p1 p2
/dev/ide/host0/bus1/target0/lun0: p1

Now I'm trying heavy reads/writes on the drive which failed,
but I'm unable to generate any error (yet). I had more than
2 weeks of uptime with this kernel.

/dev/hda: IC25N040ATCS04-0: 40 C
/dev/hdc: IC25N040ATCS04-0: 42 C

2003-06-05 11:17:22

by Piotr Krukowiecki

[permalink] [raw]
Subject: Re: ide problem - is this a known problem, or is the disk dead?

[I'm not subscribed here, please CC me on replays]

Hi, i have similar problems with my cdrom.
Everything was fine in 2.4.20. After upgrading to 2.4.21-rc6 i
can't mount my cdrom, it prints this message to syslog:

hdb: timeout waiting for DMA
hdb: timeout waiting for DMA
hdb: (__ide_dma_test_irq) called while not waiting

and then hangs and i have to reboot.

Disabling DMA (hdparm -d0 /dev/hdb) helps, no errors whatsover - but
in 2.4.20 i didn't have to disable dma.

The config for both 2.4.20 and 2.4.21-rc6 are the same, with one difference
- in .20 CONFIG_BLK_DEV_ADMA was set, it's not in .21-rc6 (and it's named
ADMA100 afaik)
I can send .config if needed.


And here's boot messages from 2.4.20 and 2.4.21-rc6:


klogd 1.4.1#10, log source = /proc/kmsg started.
Inspecting /boot/System.map-2.4.20
Loaded 15447 symbols from /boot/System.map-2.4.20.
Symbols match kernel version 2.4.20.
Loaded 70 symbols from 8 modules.
Linux version 2.4.20 (root@localhost) (gcc version 2.95.4 20011002 (Debian prerelease)) #1 wto gru 3 22:18:40 CET 2002
BIOS-provided physical RAM map:
BIOS-e820: 0000000000000000 - 00000000000a0000 (usable)
BIOS-e820: 00000000000f0000 - 0000000000100000 (reserved)
BIOS-e820: 0000000000100000 - 0000000008000000 (usable)
BIOS-e820: 00000000ffff0000 - 0000000100000000 (reserved)
user-defined physical RAM map:
user: 0000000000000000 - 00000000000a0000 (usable)
user: 00000000000f0000 - 0000000000100000 (reserved)
user: 0000000000100000 - 0000000008000000 (usable)
128MB LOWMEM available.
On node 0 totalpages: 32768
zone(0): 4096 pages.
zone(1): 28672 pages.
zone(2): 0 pages.
Kernel command line: root=/dev/hdg3 mem=131072K
Local APIC disabled by BIOS -- reenabling.
Found and enabled local APIC!
Initializing CPU#0
Detected 851.955 MHz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 1697.38 BogoMIPS
Memory: 127504k/131072k available (1010k kernel code, 3184k reserved, 276k data, 248k init, 0k highmem)
Dentry cache hash table entries: 16384 (order: 5, 131072 bytes)
Inode cache hash table entries: 8192 (order: 4, 65536 bytes)
Mount-cache hash table entries: 2048 (order: 2, 16384 bytes)
Buffer-cache hash table entries: 8192 (order: 3, 32768 bytes)
Page-cache hash table entries: 32768 (order: 5, 131072 bytes)
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 128K
CPU: After generic, caps: 0383fbff 00000000 00000000 00000000
CPU: Common caps: 0383fbff 00000000 00000000 00000000
CPU: Intel Celeron (Coppermine) stepping 0a
Enabling fast FPU save and restore... done.
Enabling unmasked SIMD FPU exception support... done.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
enabled ExtINT on CPU#0
ESR value before enabling vector: 00000000
ESR value after enabling vector: 00000000
Using local APIC timer interrupts.
calibrating APIC timer ...
..... CPU clock speed is 851.9290 MHz.
..... host bus clock speed is 100.2268 MHz.
cpu: 0, clocks: 1002268, slice: 501134
CPU0<T0:1002256,T1:501120,D:2,S:501134,C:1002268>
mtrr: v1.40 (20010327) Richard Gooch ([email protected])
mtrr: detected mtrr type: Intel
PCI: PCI BIOS revision 2.10 entry at 0xfb380, last bus=1
PCI: Using configuration type 1
PCI: Probing PCI hardware
PCI: Using IRQ router PIIX [8086/7110] at 00:07.0
Limiting direct PCI/PCI transfers.
isapnp: Scanning for PnP cards...
isapnp: No Plug & Play device found
Linux NET4.0 for Linux 2.4
Based upon Swansea University Computer Society NET3.039
Initializing RT netlink socket
apm: BIOS version 1.2 Flags 0x07 (Driver version 1.16)
Starting kswapd
VFS: Diskquotas version dquot_6.4.0 initialized
Detected PS/2 Mouse Port.
pty: 256 Unix98 ptys configured
Serial driver version 5.05c (2001-07-08) with MANY_PORTS SHARE_IRQ SERIAL_PCI ISAPNP enabled
ttyS00 at 0x03f8 (irq = 4) is a 16550A
ttyS01 at 0x02f8 (irq = 3) is a 16550A
Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
PIIX4: IDE controller on PCI bus 00 dev 39
PIIX4: chipset revision 1
PIIX4: not 100%% native mode: will probe irqs later
ide0: BM-DMA at 0xf000-0xf007, BIOS settings: hda:pio, hdb:pio
ide1: BM-DMA at 0xf008-0xf00f, BIOS settings: hdc:pio, hdd:pio
HPT366: onboard version of chipset, pin1=1 pin2=2
HPT366: IDE controller on PCI bus 00 dev 98
PCI: Found IRQ 11 for device 00:13.0
PCI: Sharing IRQ 11 with 00:13.1
HPT366: chipset revision 1
HPT366: not 100%% native mode: will probe irqs later
ide2: BM-DMA at 0xd400-0xd407, BIOS settings: hde:pio, hdf:pio
HPT366: IDE controller on PCI bus 00 dev 99
PCI: Found IRQ 11 for device 00:13.1
PCI: Sharing IRQ 11 with 00:13.0
HPT366: chipset revision 1
HPT366: not 100%% native mode: will probe irqs later
ide3: BM-DMA at 0xe000-0xe007, BIOS settings: hdg:DMA, hdh:pio
hda: ST360021A, ATA DISK drive
hdb: CRD-8400C, ATAPI CD/DVD-ROM drive
hdg: ST340823A, ATA DISK drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
ide3 at 0xd800-0xd807,0xdc02 on irq 11
blk: queue c02ae8e4, I/O limit 4095Mb (mask 0xffffffff)
hda: 117231408 sectors (60022 MB) w/2048KiB Cache, CHS=116301/16/63, (U)DMA
blk: queue c02af310, I/O limit 4095Mb (mask 0xffffffff)
hdg: 78165360 sectors (40021 MB) w/512KiB Cache, CHS=77545/16/63, UDMA(66)
Partition check:
hda: hda1 hda2
hdg: [PTBL] [4865/255/63] hdg1 hdg2 hdg3 hdg4 < hdg5 hdg6 hdg7 hdg8 hdg9 hdg10 hdg11 >
Floppy drive(s): fd0 is 1.44M
FDC 0 is a post-1991 82077
NET4: Linux TCP/IP 1.0 for NET4.0
IP Protocols: ICMP, UDP, TCP
IP: routing cache hash table of 1024 buckets, 8Kbytes
TCP: Hash tables configured (established 8192 bind 8192)
NET4: Unix domain sockets 1.0/SMP for Linux NET4.0.


klogd 1.4.1#10, log source = /proc/kmsg started.
Inspecting /boot/System.map-2.4.21-rc6
Loaded 15924 symbols from /boot/System.map-2.4.21-rc6.
Symbols match kernel version 2.4.21.
Loaded 70 symbols from 8 modules.
Linux version 2.4.21-rc6 (root@localhost) (gcc version 2.95.4 20011002 (Debian prerelease)) #1 ?ro cze 4 11:45:43 CEST 2003
BIOS-provided physical RAM map:
BIOS-e820: 0000000000000000 - 00000000000a0000 (usable)
BIOS-e820: 00000000000f0000 - 0000000000100000 (reserved)
BIOS-e820: 0000000000100000 - 0000000008000000 (usable)
BIOS-e820: 00000000ffff0000 - 0000000100000000 (reserved)
user-defined physical RAM map:
user: 0000000000000000 - 00000000000a0000 (usable)
user: 00000000000f0000 - 0000000000100000 (reserved)
user: 0000000000100000 - 0000000008000000 (usable)
128MB LOWMEM available.
On node 0 totalpages: 32768
zone(0): 4096 pages.
zone(1): 28672 pages.
zone(2): 0 pages.
Kernel command line: root=/dev/hdg3 mem=131072K
Local APIC disabled by BIOS -- reenabling.
Found and enabled local APIC!
Initializing CPU#0
Detected 851.944 MHz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 1697.38 BogoMIPS
Memory: 127464k/131072k available (1032k kernel code, 3224k reserved, 290k data, 248k init, 0k highmem)
Dentry cache hash table entries: 16384 (order: 5, 131072 bytes)
Inode cache hash table entries: 8192 (order: 4, 65536 bytes)
Mount cache hash table entries: 512 (order: 0, 4096 bytes)
Buffer-cache hash table entries: 8192 (order: 3, 32768 bytes)
Page-cache hash table entries: 32768 (order: 5, 131072 bytes)
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 128K
CPU: After generic, caps: 0383fbff 00000000 00000000 00000000
CPU: Common caps: 0383fbff 00000000 00000000 00000000
CPU: Intel Celeron (Coppermine) stepping 0a
Enabling fast FPU save and restore... done.
Enabling unmasked SIMD FPU exception support... done.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
enabled ExtINT on CPU#0
ESR value before enabling vector: 00000000
ESR value after enabling vector: 00000000
Using local APIC timer interrupts.
calibrating APIC timer ...
..... CPU clock speed is 851.9239 MHz.
..... host bus clock speed is 100.2262 MHz.
cpu: 0, clocks: 1002262, slice: 501131
CPU0<T0:1002256,T1:501120,D:5,S:501131,C:1002262>
mtrr: v1.40 (20010327) Richard Gooch ([email protected])
mtrr: detected mtrr type: Intel
PCI: PCI BIOS revision 2.10 entry at 0xfb380, last bus=1
PCI: Using configuration type 1
PCI: Probing PCI hardware
PCI: Using IRQ router PIIX [8086/7110] at 00:07.0
Limiting direct PCI/PCI transfers.
isapnp: Scanning for PnP cards...
isapnp: No Plug & Play device found
Linux NET4.0 for Linux 2.4
Based upon Swansea University Computer Society NET3.039
Initializing RT netlink socket
apm: BIOS version 1.2 Flags 0x07 (Driver version 1.16)
Starting kswapd
VFS: Diskquotas version dquot_6.4.0 initialized
Detected PS/2 Mouse Port.
pty: 256 Unix98 ptys configured
Serial driver version 5.05c (2001-07-08) with MANY_PORTS SHARE_IRQ SERIAL_PCI ISAPNP enabled
ttyS00 at 0x03f8 (irq = 4) is a 16550A
ttyS01 at 0x02f8 (irq = 3) is a 16550A
Floppy drive(s): fd0 is 1.44M
FDC 0 is a post-1991 82077
Uniform Multi-Platform E-IDE driver Revision: 7.00beta4-2.4
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
PIIX4: IDE controller at PCI slot 00:07.1
PIIX4: chipset revision 1
PIIX4: not 100%% native mode: will probe irqs later
ide0: BM-DMA at 0xf000-0xf007, BIOS settings: hda:pio, hdb:pio
ide1: BM-DMA at 0xf008-0xf00f, BIOS settings: hdc:pio, hdd:pio
HPT366: onboard version of chipset, pin1=1 pin2=2
HPT366: IDE controller at PCI slot 00:13.0
PCI: Found IRQ 11 for device 00:13.0
PCI: Sharing IRQ 11 with 00:13.1
HPT366: chipset revision 1
HPT366: not 100%% native mode: will probe irqs later
ide2: BM-DMA at 0xd400-0xd407, BIOS settings: hde:pio, hdf:pio
PCI: Found IRQ 11 for device 00:13.1
PCI: Sharing IRQ 11 with 00:13.0
ide3: BM-DMA at 0xe000-0xe007, BIOS settings: hdg:DMA, hdh:pio
hda: ST360021A, ATA DISK drive
hdb: CRD-8400C, ATAPI CD/DVD-ROM drive
blk: queue c02b7fe0, I/O limit 4095Mb (mask 0xffffffff)
hdb: set_drive_speed_status: status=0x51 { DriveReady SeekComplete Error }
hdb: set_drive_speed_status: error=0xb4
hdg: ST340823A, ATA DISK drive
blk: queue c02b8d3c, I/O limit 4095Mb (mask 0xffffffff)
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
ide3 at 0xd800-0xd807,0xdc02 on irq 11
hda: attached ide-disk driver.
hda: host protected area => 1
hda: 117231408 sectors (60022 MB) w/2048KiB Cache, CHS=116301/16/63, UDMA(33)
hdg: attached ide-disk driver.
hdg: host protected area => 1
hdg: 78165360 sectors (40021 MB) w/512KiB Cache, CHS=77545/16/63, UDMA(66)
Partition check:
hda: hda1 hda2
hdg: [PTBL] [4865/255/63] hdg1 hdg2 hdg3 hdg4 < hdg5 hdg6 hdg7 hdg8 hdg9 hdg10 hdg11 >
NET4: Linux TCP/IP 1.0 for NET4.0
IP Protocols: ICMP, UDP, TCP
IP: routing cache hash table of 1024 buckets, 8Kbytes
TCP: Hash tables configured (established 8192 bind 8192)
NET4: Unix domain sockets 1.0/SMP for Linux NET4.0.

--
Piotrek
irc: #debian.pl
Mors Drosophilis melanogastribus!

2003-07-26 23:17:19

by paul

[permalink] [raw]
Subject: Re: ide problem - is this a known problem, or is the disk dead?

I am having similar problems with a disk. I have ext3, reiser and ntfs
partitions on this disk and all are failing with messages such as those in the
rest of this thread and as below.

I am running Mandrake 9.1 (2.4.21-0.13) on an Asus A7N8X deluxe

Is this due to hardware failure or should I try a different kernel?

Please CC to me as I only have webmail because of this problem.


Jul 27 00:15:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:51 monkey kernel: vs-13070: reiserfs_read_inode2: i/o failure
occurred trying to find stat data of [2 14773 0x0 SD]
Jul 27 00:15:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:51 monkey kernel: vs-13070: reiserfs_read_inode2: i/o failure
occurred trying to find stat data of [2 14773 0x0 SD]
Jul 27 00:15:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:51 monkey kernel: vs-13070: reiserfs_read_inode2: i/o failure
occurred trying to find stat data of [2 14773 0x0 SD]
Jul 27 00:15:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:51 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:51 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:51 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:51 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:51 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:51 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:51 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:51 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:51 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:51 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:51 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:51 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:51 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:51 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:52 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:52 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:52 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:52 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:53 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:53 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:53 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:53 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:53 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:53 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:53 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:53 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:53 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:53 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:54 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:54 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:54 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:54 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:54 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:54 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:54 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:54 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:54 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:54 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:54 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:54 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:54 monkey kernel: zam-7001: io error in reiserfs_find_entry
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456

Message from syslogd@monkey at Sun Jul 27 00:15:54 2003 ...
monkey kernel: journal-601, buffer write failed
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector 1288
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector 1296
Jul 27 00:15:54 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector 1304
Jul 27 00:15:54 monkey kernel: journal-601, buffer write failed
Jul 27 00:15:54 monkey kernel: kernel BUG at prints.c:334!
Jul 27 00:15:54 monkey kernel: invalid operand: 0000
Jul 27 00:15:54 monkey kernel: ppp_deflate zlib_deflate bsd_comp ppp_async
ppp_generic slhc sd_mod isofs zlib_inflate udf ide-cd cdrom i810_audio soundcore
ac97_codec af_packet floppy 3c90x ohci1394 ieee1394 nls_cp850 vfat fat
nls_iso8859-15 ntfs reiserfs supermount usb-storage scsi_mod usbmouse keybdev
mousedev hid input ehci-hcd usb-ohci usbcore rtc ext3 jbd
Jul 27 00:15:54 monkey kernel: CPU: 0
Jul 27 00:15:54 monkey kernel: EIP:
0010:[af_packet:__insmod_af_packet_O/lib/modules/2.4.21-0.13mdk/kernel/net/+-1325544/96]
Not tainted
Jul 27 00:15:54 monkey kernel: EIP: 0010:[<e49f9618>] Not tainted
Jul 27 00:15:54 monkey kernel: EFLAGS: 00210286
Jul 27 00:15:54 monkey kernel: EIP is at reiserfs_panic+0x28/0x60 [reiserfs]
Jul 27 00:15:54 monkey kernel: eax: 00000024 ebx: df424c00 ecx: 00000001
edx: dd940000
Jul 27 00:15:54 monkey kernel: esi: 00000000 edi: 00000012 ebp: c1591eb4
esp: c1591ea0
Jul 27 00:15:54 monkey kernel: ds: 0018 es: 0018 ss: 0018
Jul 27 00:15:54 monkey kernel: Process kupdated (pid: 7, stackpage=c1591000)
Jul 27 00:15:54 monkey kernel: Stack: e4a107f1 e4a14f60 e4a0dfe0 c1591ec4
e4a18ba0 c1591ee0 e4a03c7a df424c00
Jul 27 00:15:54 monkey kernel: e4a0dfe0 00000006 00000004 00000000
cf85c1e0 e4a1808c 00000029 00000013
Jul 27 00:15:54 monkey kernel: c1591f44 e4a076be df424c00 e4a18ba0
00000001 e4a18000 e4a62414 00000820
Jul 27 00:15:54 monkey kernel: Call Trace:
Jul 27 00:15:54 monkey kernel:
[af_packet:__insmod_af_packet_O/lib/modules/2.4.21-0.13mdk/kernel/net/+-1230863/96]
.rodata.str1.1+0x4af/0xa1a [reiserfs]
Jul 27 00:15:54 monkey kernel: [<e4a107f1>] .rodata.str1.1+0x4af/0xa1a [reiserfs]
Jul 27 00:15:54 monkey kernel:
[af_packet:__insmod_af_packet_O/lib/modules/2.4.21-0.13mdk/kernel/net/+-1212576/96]
error_buf+0x0/0x400 [reiserfs]
Jul 27 00:15:54 monkey kernel: [<e4a14f60>] error_buf+0x0/0x400 [reiserfs]
Jul 27 00:15:54 monkey kernel:
[af_packet:__insmod_af_packet_O/lib/modules/2.4.21-0.13mdk/kernel/net/+-1241120/96]
.rodata.str1.32+0x3440/0x57a2 [reiserfs]
Jul 27 00:15:54 monkey kernel: [<e4a0dfe0>] .rodata.str1.32+0x3440/0x57a2
[reiserfs]
Jul 27 00:15:54 monkey kernel:
[af_packet:__insmod_af_packet_O/lib/modules/2.4.21-0.13mdk/kernel/net/+-1282950/96]
flush_commit_list+0x29a/0x410 [reiserfs]
Jul 27 00:15:54 monkey kernel: [<e4a03c7a>] flush_commit_list+0x29a/0x410
[reiserfs]
Jul 27 00:15:54 monkey kernel:
[af_packet:__insmod_af_packet_O/lib/modules/2.4.21-0.13mdk/kernel/net/+-1241120/96]
.rodata.str1.32+0x3440/0x57a2 [reiserfs]
Jul 27 00:15:54 monkey kernel: [<e4a0dfe0>] .rodata.str1.32+0x3440/0x57a2
[reiserfs]
Jul 27 00:15:54 monkey kernel:
[af_packet:__insmod_af_packet_O/lib/modules/2.4.21-0.13mdk/kernel/net/+-1268034/96]
do_journal_end+0x60e/0xb10 [reiserfs]
Jul 27 00:15:54 monkey kernel: [<e4a076be>] do_journal_end+0x60e/0xb10 [reiserfs]
Jul 27 00:15:54 monkey kernel:
[af_packet:__insmod_af_packet_O/lib/modules/2.4.21-0.13mdk/kernel/net/+-1271342/96]
flush_old_commits+0xf2/0x170 [reiserfs]
Jul 27 00:15:54 monkey kernel: [<e4a069d2>] flush_old_commits+0xf2/0x170 [reiserfs]
Jul 27 00:15:54 monkey kernel:
[af_packet:__insmod_af_packet_O/lib/modules/2.4.21-0.13mdk/kernel/net/+-1230131/96]
.rodata.str1.1+0x78b/0xa1a [reiserfs]
Jul 27 00:15:54 monkey kernel: [<e4a10acd>] .rodata.str1.1+0x78b/0xa1a [reiserfs]
Jul 27 00:15:54 monkey kernel:
[af_packet:__insmod_af_packet_O/lib/modules/2.4.21-0.13mdk/kernel/net/+-1337384/96]
reiserfs_write_super+0x28/0x30 [reiserfs]
Jul 27 00:15:54 monkey kernel: [<e49f67d8>] reiserfs_write_super+0x28/0x30
[reiserfs]
Jul 27 00:15:54 monkey kernel: [sync_supers+175/304] sync_supers+0xaf/0x130
[kernel]
Jul 27 00:15:54 monkey kernel: [<c01448df>] sync_supers+0xaf/0x130 [kernel]
Jul 27 00:15:54 monkey kernel: [sync_old_buffers+17/64]
sync_old_buffers+0x11/0x40 [kernel]
Jul 27 00:15:54 monkey kernel: [<c0143d51>] sync_old_buffers+0x11/0x40 [kernel]
Jul 27 00:15:54 monkey kernel: [kupdate+272/336] kupdate+0x110/0x150 [kernel]
Jul 27 00:15:54 monkey kernel: [<c0144050>] kupdate+0x110/0x150 [kernel]
Jul 27 00:15:54 monkey kernel: [rest_init+0/48] stext+0x0/0x30 [kernel]
Jul 27 00:15:54 monkey kernel: [<c0105000>] stext+0x0/0x30 [kernel]
Jul 27 00:15:54 monkey kernel: [arch_kernel_thread+38/64]
arch_kernel_thread+0x26/0x40 [kernel]
Jul 27 00:15:54 monkey kernel: [<c0107526>] arch_kernel_thread+0x26/0x40 [kernel]
Jul 27 00:15:54 monkey kernel: [kupdate+0/336] kupdate+0x0/0x150 [kernel]
Jul 27 00:15:54 monkey kernel: [<c0143f40>] kupdate+0x0/0x150 [kernel]
Jul 27 00:15:54 monkey kernel:
Jul 27 00:15:54 monkey kernel: Code: 0f 0b 4e 01 f7 07 a1 e4 68 60 4f a1 e4 85
db 74 17 66 8b 43
Jul 27 00:15:57 monkey kernel: <6>end_request: I/O error, dev 16:06 (hdc),
sector 15079456
Jul 27 00:15:57 monkey kernel: vs-13070: reiserfs_read_inode2: i/o failure
occurred trying to find stat data of [2 14773 0x0 SD]
Jul 27 00:16:03 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:16:03 monkey kernel: vs-13070: reiserfs_read_inode2: i/o failure
occurred trying to find stat data of [2 14773 0x0 SD]
Jul 27 00:16:09 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:16:09 monkey kernel: vs-13070: reiserfs_read_inode2: i/o failure
occurred trying to find stat data of [2 14773 0x0 SD]
Jul 27 00:16:15 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:16:15 monkey kernel: vs-13070: reiserfs_read_inode2: i/o failure
occurred trying to find stat data of [2 14773 0x0 SD]
Jul 27 00:16:21 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:16:21 monkey kernel: vs-13070: reiserfs_read_inode2: i/o failure
occurred trying to find stat data of [2 14773 0x0 SD]
Jul 27 00:16:27 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:16:27 monkey kernel: vs-13070: reiserfs_read_inode2: i/o failure
occurred trying to find stat data of [2 14773 0x0 SD]
Jul 27 00:16:33 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:16:33 monkey kernel: vs-13070: reiserfs_read_inode2: i/o failure
occurred trying to find stat data of [2 14773 0x0 SD]
Jul 27 00:16:39 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:16:39 monkey kernel: vs-13070: reiserfs_read_inode2: i/o failure
occurred trying to find stat data of [2 14773 0x0 SD]
Jul 27 00:16:45 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:16:45 monkey kernel: vs-13070: reiserfs_read_inode2: i/o failure
occurred trying to find stat data of [2 14773 0x0 SD]
Jul 27 00:16:51 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:16:51 monkey kernel: vs-13070: reiserfs_read_inode2: i/o failure
occurred trying to find stat data of [2 14773 0x0 SD]
Jul 27 00:16:57 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:16:57 monkey kernel: vs-13070: reiserfs_read_inode2: i/o failure
occurred trying to find stat data of [2 14773 0x0 SD]
Jul 27 00:17:03 monkey kernel: end_request: I/O error, dev 16:06 (hdc), sector
15079456
Jul 27 00:17:03 monkey kernel: vs-13070: reiserfs_read_inode2: i/o failure
occurred trying to find stat data of [2 14773 0x0 SD]