2002-02-26 17:39:42

by Simon Turvey

[permalink] [raw]
Subject: IDE error on 2.4.17

After a large file (4gigs) transfer using Samba attempts to access the file
(also across Samba) resulted in lots of the following type of message.

hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=250746,
sector=250680
end_request: I/O error, dev 03:01 (hda), sector 250680

Can anyone point me in the direction of a reason/solution?

Thanks,
Simon




2002-02-26 17:56:41

by André Dahlqvist

[permalink] [raw]
Subject: Re: IDE error on 2.4.17

On Tue, Feb 26, 2002 at 05:38:35PM -0000, Simon Turvey wrote:

> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=250746,
^^^^^^^^^^^^^^^^^^
If I'm not misstaking that's hardware error.
--

Andr? Dahlqvist <[email protected]>

2002-02-26 18:13:01

by Alan

[permalink] [raw]
Subject: Re: IDE error on 2.4.17

> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=250746,
> sector=250680
> end_request: I/O error, dev 03:01 (hda), sector 250680

Uncorrectable error is a message from your disk, along the lines of "Hey
pal I wonder if the warranty has expired yet"

2002-02-26 18:23:32

by Mohammad A. Haque

[permalink] [raw]
Subject: Re: IDE error on 2.4.17

On Tuesday, February 26, 2002, at 12:38 , Simon Turvey wrote:

> After a large file (4gigs) transfer using Samba attempts to access the
> file
> (also across Samba) resulted in lots of the following type of message.
>
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=250746,
> sector=250680
> end_request: I/O error, dev 03:01 (hda), sector 250680
>
> Can anyone point me in the direction of a reason/solution?

I got a similar error yesterday and this morning and asked about it on
the ext3 list because I wasn't sure if it was fs related or hw.

http://marc.theaimsgroup.com/?l=ext3-users&m=101473048203232&w=2

--

=====================================================================
Mohammad A. Haque http://www.haque.net/
[email protected]

"Alcohol and calculus don't mix. Developer/Project Lead
Don't drink and derive." --Unknown http://www.themes.org/
[email protected]
=====================================================================

2002-02-26 18:50:33

by Simon Turvey

[permalink] [raw]
Subject: Re: IDE error on 2.4.17

The drive's less than a year old :-(

Should I try disabling some of the UDMA stuff?

----- Original Message -----
From: "Alan Cox" <[email protected]>
To: "Simon Turvey" <[email protected]>
Cc: "Linux Kernel Mailing List" <[email protected]>
Sent: Tuesday, February 26, 2002 6:27 PM
Subject: Re: IDE error on 2.4.17


> > hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> > hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=250746,
> > sector=250680
> > end_request: I/O error, dev 03:01 (hda), sector 250680
>
> Uncorrectable error is a message from your disk, along the lines of "Hey
> pal I wonder if the warranty has expired yet"
>
>


2002-02-26 18:57:02

by Alan

[permalink] [raw]
Subject: Re: IDE error on 2.4.17

> The drive's less than a year old :-(
> Should I try disabling some of the UDMA stuff?

If it amuses you. You might as well dye your hair pink and dance naked
around it for all the difference it will make

Alan

2002-02-26 19:01:21

by Hans-Christian Armingeon

[permalink] [raw]
Subject: Re: IDE error on 2.4.17

Am Dienstag, 26. Februar 2002 19:50 schrieb Simon Turvey:
> The drive's less than a year old :-(
>
> Should I try disabling some of the UDMA stuff?
you should return it to the place you've bought it

Johnny

2002-02-26 19:02:11

by David Rees

[permalink] [raw]
Subject: Re: IDE error on 2.4.17

On Tue, Feb 26, 2002 at 06:50:15PM -0000, Simon Turvey wrote:
> The drive's less than a year old :-(
>
> Should I try disabling some of the UDMA stuff?

Age of the disk doesn't matter, they'll die at any age. They seem to die
most frequently either within one year, or after 3 years.

Turning off UDMA probably won't help, it looks like it's time to restore
from backups. If you try to recover data from the disk, make sure you mount
it in read-only mode if you can get the drive that far up if you reboot.

-Dave

2002-02-26 19:10:11

by Simon Turvey

[permalink] [raw]
Subject: Re: IDE error on 2.4.17

LOL, I get the message, thanks Alan.

----- Original Message -----
From: "Alan Cox" <[email protected]>
To: "Simon Turvey" <[email protected]>
Cc: "Alan Cox" <[email protected]>; "Linux Kernel Mailing List"
<[email protected]>
Sent: Tuesday, February 26, 2002 7:10 PM
Subject: Re: IDE error on 2.4.17


> > The drive's less than a year old :-(
> > Should I try disabling some of the UDMA stuff?
>
> If it amuses you. You might as well dye your hair pink and dance naked
> around it for all the difference it will make
>
> Alan
> -
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to [email protected]
> More majordomo info at http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at http://www.tux.org/lkml/
>


2002-02-26 19:12:51

by Simon Turvey

[permalink] [raw]
Subject: Re: IDE error on 2.4.17

It's a dev box anyway - just for tinkering. Even so, it's annoying.

Someone asked what the drive model was: IBM-DTLA-305030.

Will send the bugger back in the morning.

----- Original Message -----
From: "David Rees" <[email protected]>
To: "Linux Kernel Mailing List" <[email protected]>
Sent: Tuesday, February 26, 2002 7:01 PM
Subject: Re: IDE error on 2.4.17


> On Tue, Feb 26, 2002 at 06:50:15PM -0000, Simon Turvey wrote:
> > The drive's less than a year old :-(
> >
> > Should I try disabling some of the UDMA stuff?
>
> Age of the disk doesn't matter, they'll die at any age. They seem to die
> most frequently either within one year, or after 3 years.
>
> Turning off UDMA probably won't help, it looks like it's time to restore
> from backups. If you try to recover data from the disk, make sure you
mount
> it in read-only mode if you can get the drive that far up if you reboot.
>
> -Dave
> -
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to [email protected]
> More majordomo info at http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at http://www.tux.org/lkml/
>


2002-02-26 19:21:01

by Costa, Juliano

[permalink] [raw]
Subject: RE: IDE error on 2.4.17

I got a similar error, in my case disk error.

-----Original Message-----
From: David Rees [mailto:[email protected]]
Sent: Quarta-feira, 27 de Fevereiro de 2002 07:02
To: Linux Kernel Mailing List
Subject: Re: IDE error on 2.4.17


On Tue, Feb 26, 2002 at 06:50:15PM -0000, Simon Turvey wrote:
> The drive's less than a year old :-(
>
> Should I try disabling some of the UDMA stuff?

Age of the disk doesn't matter, they'll die at any age. They seem to die
most frequently either within one year, or after 3 years.

Turning off UDMA probably won't help, it looks like it's time to restore
from backups. If you try to recover data from the disk, make sure you mount
it in read-only mode if you can get the drive that far up if you reboot.

-Dave

2002-02-26 20:07:25

by Skip Ford

[permalink] [raw]
Subject: Re: IDE error on 2.4.17

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Simon Turvey wrote:
> The drive's less than a year old :-(
>
> Should I try disabling some of the UDMA stuff?
>
> ----- Original Message -----
> From: "Alan Cox" <[email protected]>
> To: "Simon Turvey" <[email protected]>
> Cc: "Linux Kernel Mailing List" <[email protected]>
> Sent: Tuesday, February 26, 2002 6:27 PM
> Subject: Re: IDE error on 2.4.17
>
>
> > > hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> > > hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=250746,
> > > sector=250680
> > > end_request: I/O error, dev 03:01 (hda), sector 250680
> >
> > Uncorrectable error is a message from your disk, along the lines of "Hey
> > pal I wonder if the warranty has expired yet"

I've seen 2 drives with UncorrectableErrors that were correctable.

One happened after booting Linux with APM enabled, and the other after
running Windows with vendor-specific DMA Block mode enabled.

In each case a low level format took care of it. No bad sectors on
either drive in over a year.

- --
Skip ID: 0x7EDDDB0A
-----BEGIN PGP SIGNATURE-----

iEYEARECAAYFAjx76nUACgkQBMKxVH7d2wpW4wCg+MQQBIOZfHRqR0QKUhNbuFSB
kkIAoK/ThQ/pIeoj3/Qu8ZAd6S26E8M/
=/bRM
-----END PGP SIGNATURE-----

2002-02-26 20:40:55

by Chris Friesen

[permalink] [raw]
Subject: Re: IDE error on 2.4.17

Simon Turvey wrote:
>
> It's a dev box anyway - just for tinkering. Even so, it's annoying.
>
> Someone asked what the drive model was: IBM-DTLA-305030.

That's your problem right there. I've had two of the DTLA drives die, a 15GB
and a 45GB. They're notorious. They sent me back a new 120GXP-series 60GB one
to replace the 45 gigger. Hopefully it'll last a bit longer...


--
Chris Friesen | MailStop: 043/33/F10
Nortel Networks | work: (613) 765-0557
3500 Carling Avenue | fax: (613) 765-2986
Nepean, ON K2H 8E9 Canada | email: [email protected]

2002-02-27 09:47:50

by Vojtech Pavlik

[permalink] [raw]
Subject: Re: IDE error on 2.4.17

On Tue, Feb 26, 2002 at 06:50:15PM -0000, Simon Turvey wrote:
> The drive's less than a year old :-(
>
> Should I try disabling some of the UDMA stuff?

That won't help, this indeed is a media error. The drive is heading to
hell. You have about another six months of life before it goes belly up
completely.

Any chance it's one of those fast IBM 30 or 45 gig drives? They seem to
be dying pretty fast ...

>
> ----- Original Message -----
> From: "Alan Cox" <[email protected]>
> To: "Simon Turvey" <[email protected]>
> Cc: "Linux Kernel Mailing List" <[email protected]>
> Sent: Tuesday, February 26, 2002 6:27 PM
> Subject: Re: IDE error on 2.4.17
>
>
> > > hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> > > hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=250746,
> > > sector=250680
> > > end_request: I/O error, dev 03:01 (hda), sector 250680
> >
> > Uncorrectable error is a message from your disk, along the lines of "Hey
> > pal I wonder if the warranty has expired yet"
> >
> >
>
>
> -
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to [email protected]
> More majordomo info at http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at http://www.tux.org/lkml/

--
Vojtech Pavlik
SuSE Labs

2002-02-27 10:26:07

by Erik Andersen

[permalink] [raw]
Subject: [PATCH] Re: IDE error on 2.4.17

On Wed Feb 27, 2002 at 10:47:35AM +0100, Vojtech Pavlik wrote:
> That won't help, this indeed is a media error. The drive is heading to
> hell. You have about another six months of life before it goes belly up
> completely.
>
> Any chance it's one of those fast IBM 30 or 45 gig drives? They seem to
> be dying pretty fast ...

I expect a patch like this would help avoid these sort of
questions...

diff -urN linux/drivers/ide.orig/hd.c linux/drivers/ide/hd.c
--- linux/drivers/ide.orig/hd.c Mon Oct 15 14:27:42 2001
+++ linux/drivers/ide/hd.c Wed Feb 27 03:16:16 2002
@@ -201,6 +201,12 @@
printk(", sector=%ld", CURRENT->sector);
}
printk("\n");
+ /* Make sure people realize that very bad things are
+ * happening, so they can do something about it before
+ * it is too late... */
+ if (hd_error & ECC_ERR) {
+ printk("hd%c: You should make a backup, this drive may fail soon!\n", devc);
+ }
}
#else
printk("hd%c: %s: status=0x%02x.\n", devc, msg, stat & 0xff);
@@ -209,6 +215,9 @@
} else {
hd_error = inb(HD_ERROR);
printk("hd%c: %s: error=0x%02x.\n", devc, msg, hd_error & 0xff);
+ if (hd_error & ECC_ERR) {
+ printk("hd%c: You should make a backup, this drive may fail soon!\n", devc);
+ }
}
#endif /* verbose errors */
restore_flags (flags);
diff -urN linux/drivers/ide.orig/ide.c linux/drivers/ide/ide.c
--- linux/drivers/ide.orig/ide.c Fri Feb 22 16:20:13 2002
+++ linux/drivers/ide/ide.c Wed Feb 27 03:20:12 2002
@@ -832,6 +832,13 @@
if (HWGROUP(drive) && HWGROUP(drive)->rq)
printk(", sector=%ld", HWGROUP(drive)->rq->sector);
}
+ if (hd_error & ECC_ERR) {
+ printk("\nhd%c: You should make a backup, this drive may fail soon!", devc);
+ }
+ }
+#else
+ if (drive->media == ide_disk && hd_error & ECC_ERR) {
+ printk("\nhd%c: You should make a backup, this drive may fail soon!", devc);
}
#endif /* FANCY_STATUS_DUMPS */
printk("\n");
-Erik

--
Erik B. Andersen http://codepoet-consulting.com/
--This message was written using 73% post-consumer electrons--

2002-02-27 14:45:17

by Alan

[permalink] [raw]
Subject: Re: [PATCH] Re: IDE error on 2.4.17

> > Any chance it's one of those fast IBM 30 or 45 gig drives? They seem to
> > be dying pretty fast ...
>
> I expect a patch like this would help avoid these sort of
> questions...

This is the wrong approach. That information is available properly if and
when the vendors install the smart utilities

2002-02-27 18:48:48

by Jan Niehusmann

[permalink] [raw]
Subject: Re: [PATCH] Re: IDE error on 2.4.17

On Wed, Feb 27, 2002 at 02:59:28PM +0000, Alan Cox wrote:
> This is the wrong approach. That information is available properly if and
> when the vendors install the smart utilities

Doesn't necessarily help. I recently saw a hard drive which made funny
noises and got really slow reading some parts of the drive (~30MB/s on
some parts, ~300kB/s on others), but ide-smart didn't report failed
tests. Two days later the drive was dead...

It was an IBM 60GB drive, but I don't remember the exact type. It
called itself "IC35L060AVER07-0".

Jan

2002-02-27 19:07:46

by Andre Hedrick

[permalink] [raw]
Subject: Re: [PATCH] Re: IDE error on 2.4.17


Did you enable smart?
Did you run the captive smart tests?

If this is on you / drive DO NOT run captive mode, the kernel go into
starvation mode.

On Wed, 27 Feb 2002, Jan Niehusmann wrote:

> On Wed, Feb 27, 2002 at 02:59:28PM +0000, Alan Cox wrote:
> > This is the wrong approach. That information is available properly if and
> > when the vendors install the smart utilities
>
> Doesn't necessarily help. I recently saw a hard drive which made funny
> noises and got really slow reading some parts of the drive (~30MB/s on
> some parts, ~300kB/s on others), but ide-smart didn't report failed
> tests. Two days later the drive was dead...
>
> It was an IBM 60GB drive, but I don't remember the exact type. It
> called itself "IC35L060AVER07-0".
>
> Jan
>
> -
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to [email protected]
> More majordomo info at http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at http://www.tux.org/lkml/
>

Andre Hedrick
Linux Disk Certification Project Linux ATA Development

2002-02-27 19:27:33

by Jan Niehusmann

[permalink] [raw]
Subject: Re: [PATCH] Re: IDE error on 2.4.17

On Wed, Feb 27, 2002 at 10:53:41AM -0800, Andre Hedrick wrote:
> Did you enable smart?
> Did you run the captive smart tests?

I only ran ide-smart /dev/hdx on the drive, and got a list of 'Passed'
tests. Do I need to enable something else for ide-smart to work?
What are the captive smart tests?

Jan

2002-02-27 19:47:16

by Chris Friesen

[permalink] [raw]
Subject: Re: [PATCH] Re: IDE error on 2.4.17

Jan Niehusmann wrote:
>
> On Wed, Feb 27, 2002 at 02:59:28PM +0000, Alan Cox wrote:
> > This is the wrong approach. That information is available properly if and
> > when the vendors install the smart utilities
>
> Doesn't necessarily help. I recently saw a hard drive which made funny
> noises and got really slow reading some parts of the drive (~30MB/s on
> some parts, ~300kB/s on others), but ide-smart didn't report failed
> tests. Two days later the drive was dead...

I had two DTLA-series IBM drives. Both of them started having problems, nothing
showed up on IBM's own bootable floppy based SMART tests, and both of them died
with no SMART warnings.

I'm not at all confident that SMART is all that useful.


--
Chris Friesen | MailStop: 043/33/F10
Nortel Networks | work: (613) 765-0557
3500 Carling Avenue | fax: (613) 765-2986
Nepean, ON K2H 8E9 Canada | email: [email protected]