Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755555AbYKULhV (ORCPT ); Fri, 21 Nov 2008 06:37:21 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752865AbYKULhG (ORCPT ); Fri, 21 Nov 2008 06:37:06 -0500 Received: from lucidpixels.com ([75.144.35.66]:33528 "EHLO lucidpixels.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751909AbYKULhF (ORCPT ); Fri, 21 Nov 2008 06:37:05 -0500 Date: Fri, 21 Nov 2008 06:37:04 -0500 (EST) From: Justin Piszcz To: Peter Rabbitson cc: linux-raid , linux-kernel@vger.kernel.org, alan@lxorguk.ukuu.org.uk, smartmontools-support@lists.sourceforge.net, Bruce Allen Subject: Re: Ninth(?) Velociraptor replacement or md(RAID)/smartmontools(?) bug? In-Reply-To: Message-ID: References: <49269BCF.8060300@rabbit.us> User-Agent: Alpine 1.10 (DEB 962 2008-03-14) MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3487 Lines: 89 On Fri, 21 Nov 2008, Justin Piszcz wrote: > > > On Fri, 21 Nov 2008, Peter Rabbitson wrote: > >> It might very well be a WD bug. I had three (3) identical WDC >> WD2500AAJS-08B4A0 drives fail on me with the same _identical_ error >> (same sector number to the last digit): >> >> Oct 27 11:33:41 Arzamas kernel: ata6.00: exception Emask 0x10 SAct 0x0 >> SErr 0x80000 action 0xe frozen >> Oct 27 11:33:41 Arzamas kernel: ata6.00: irq_stat 0x01100010, PHY RDY >> changed >> Oct 27 11:33:41 Arzamas kernel: ata6: SError: { 10B8B } >> Oct 27 11:33:41 Arzamas kernel: ata6.00: cmd >> ea/00:00:00:00:00/00:00:00:00:00/a0 tag 0 >> Oct 27 11:33:41 Arzamas kernel: res 06/37:00:00:00:00/00:00:00:00:06/00 >> Emask 0x12 (ATA bus error) >> Oct 27 11:33:41 Arzamas kernel: ata6.00: error: { IDNF ABRT } >> Oct 27 11:33:41 Arzamas kernel: ata6: hard resetting link >> Oct 27 11:33:46 Arzamas kernel: ata6: SATA link up 3.0 Gbps (SStatus 123 >> SControl 0) >> Oct 27 11:33:46 Arzamas kernel: ata6.00: configured for UDMA/100 >> Oct 27 11:33:46 Arzamas kernel: ata6: EH complete >> Oct 27 11:33:46 Arzamas kernel: sd 6:0:0:0: [sde] 488397168 512-byte >> hardware sectors (250059 MB) >> Oct 27 11:33:46 Arzamas kernel: sd 6:0:0:0: [sde] Write Protect is off >> Oct 27 11:33:46 Arzamas kernel: sd 6:0:0:0: [sde] Mode Sense: 00 3a 00 00 >> Oct 27 11:33:46 Arzamas kernel: sd 6:0:0:0: [sde] Write cache: enabled, >> read cache: enabled, doesn't support DPO or FUA >> Oct 27 11:33:46 Arzamas kernel: end_request: I/O error, dev sde, sector >> 488166955 >> Oct 27 11:33:46 Arzamas kernel: md: super_written gets error=-5, uptodate=0 >> >> >> All 3 drives endured the same multiple rewriting of the sector in >> question, as they did multiple smart self-tests. I am currently in the >> process of replacing these two drives with Seagates, (the other 2 in the >> 4 member array are Maxtors). Will see what happens. >> >> Peter >> >> P.S. See threads http://marc.info/?l=linux-raid&m=122523835815697 and >> http://marc.info/?l=linux-raid&m=122669103213041 for more info on my > > Pete, > > Are these -new- 250GiB drives, recently purchased? > > # hdparm -iv /dev/sda > Drive conforms to: Unspecified: ATA/ATAPI-1,2,3,4,5,6,7 > > What does yours conform to, just curious? Update, the extended offline test completed without any errors: SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed without error 00% 823 - # 2 Short offline Completed without error 00% 822 - Running offline test now. p34:~# smartctl -t offline /dev/sda smartctl version 5.38 [x86_64-unknown-linux-gnu] Copyright (C) 2002-8 Bruce Allen Home page is http://smartmontools.sourceforge.net/ === START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION === Sending command: "Execute SMART off-line routine immediately in off-line mode". Drive command "Execute SMART off-line routine immediately in off-line mode" successful. Testing has begun. Please wait 4800 seconds for test to complete. Test will complete after Fri Nov 21 07:56:02 2008 Use smartctl -X to abort test. p34:~# We'll see what happens next.. Justin. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/