Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id ; Tue, 30 Jul 2002 23:32:17 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id ; Tue, 30 Jul 2002 23:32:17 -0400 Received: from tone.orchestra.cse.unsw.EDU.AU ([129.94.242.28]:64655 "HELO tone.orchestra.cse.unsw.EDU.AU") by vger.kernel.org with SMTP id ; Tue, 30 Jul 2002 23:32:16 -0400 From: Neil Brown To: "jeff millar" Date: Wed, 31 Jul 2002 13:32:26 +1000 (EST) MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-ID: <15687.23114.805995.595589@notabene.cse.unsw.edu.au> Cc: "Bill Davidsen" , "Jakob Oestergaard" , "Kernel mailing list" Subject: Re: RAID problems In-Reply-To: message from jeff millar on Tuesday July 30 References: <004501c23841$03265a30$6a01a8c0@wa1hco> X-Mailer: VM 6.72 under Emacs 20.7.2 X-face: [Gw_3E*Gng}4rRrKRYotwlE?.2|**#s9D > Raid needs an automatic way to maintain device synchronization. Why should > I have to... > manually examine the device data (lsraid) > find two devices that match > mark the others failed in /etc/raidtab > reinitialize the raid devices...putting all data at risk > hot add the "failed" device > wait for it to recover (hours) > change /etc/raidtab again > retest everything > > This is 10 times worse that e2fsck and much more error prone. The file > system guru's worked hard on journalling to minimize this kind of risk. > Part of the answer is to use mdadm http://www.cse.unsw.edu.au/~neilb/source/mdadm/ mdadm --assemble --force .... will do a lot of that for you. Another part of the answer is that raid5 should never mark two drives as failed. There really isn't any point. If they are both really failed, you've lost your data anyway. If it is really a cable failure, then it should be easier to get back to where you started from. I hope to have raid5 working better in this respect in 2.6. A finally part of the answer is that even perfect raid software cannot make up for buggy drivers, shoddy hard drives, or flacky cabling. NeilBrown - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/