Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id ; Fri, 13 Jul 2001 23:23:18 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id ; Fri, 13 Jul 2001 23:23:10 -0400 Received: from horus.its.uow.edu.au ([130.130.68.25]:6121 "EHLO horus.its.uow.edu.au") by vger.kernel.org with ESMTP id ; Fri, 13 Jul 2001 23:22:58 -0400 Message-ID: <3B4FBB36.1F692780@uow.edu.au> Date: Sat, 14 Jul 2001 13:23:34 +1000 From: Andrew Morton X-Mailer: Mozilla 4.76 [en] (X11; U; Linux 2.4.6 i686) X-Accept-Language: en MIME-Version: 1.0 To: Alan Cox CC: Andreas Dilger , "Albert D. Cahalan" , Ben LaHaise , Ragnar Kjxrstad , linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, mike@bigstorage.com, kevin@bigstorage.com, linux-lvm@sistina.com Subject: Re: [PATCH] 64 bit scsi read/write In-Reply-To: <200107132041.f6DKfqM8013404@webber.adilger.int> from "Andreas Dilger" at Jul 13, 2001 02:41:52 PM Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Alan Cox wrote: > > > RAID 5 throws a wrench into this by not guaranteeing that all of the > > blocks in a stripe are consistent (you don't know which blocks and/or > > parity were written and which not). Ideally, you want a multi-stage > > commit for RAID as well, so that you write the data first, and the > > parity afterwards (so on reboot you trust the data first, and not the > > parity). You have a problem if there is a bad disk and you crash. > > Well to be honest so does most disk firmware. IDE especially. For one thing > the logical sector size the drives writes need not match the illusions > provided upstream, and the write flush commands are frequently not implemented > because they damage benchmarketing numbers from folks like Zdnet.. If, after a power outage, the IDE disk can keep going for long enough to write its write cache out to the reserved vendor area (which will only take 20-30 milliseconds) then the data may be considered *safe* as soon as it hits writecache. In which case it is perfectly legitimate and sensible for the drive to ignore flush commands, and to ack data as soon as it hits cache. Yes? If I'm right then the only open question is: which disks do and do not do the right thing when the lights go out. - - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/