From: Michael Tokarev Subject: Re: raid is dangerous but that's secret (was Re: [patch] ext2/3: document conditions when reliable operation is possible) Date: Sun, 30 Aug 2009 18:44:04 +0400 Message-ID: <4A9A9034.8000703@msgid.tls.msk.ru> References: <20090824212518.GF29763@elf.ucw.cz> <20090825232601.GF4300@elf.ucw.cz> <4A947682.2010204@redhat.com> <200908262253.17886.rob@landley.net> <4A967175.5070700@redhat.com> <20090827221319.GA1601@ucw.cz> <4A9733C1.2070904@redhat.com> <20090828064449.GA27528@elf.ucw.cz> <20090828120854.GA8153@mit.edu> <20090830075135.GA1874@ucw.cz> <4A9A88B6.9050902@redhat.com> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: david@lang.hm, Pavel Machek , Theodore Tso , NeilBrown , Rob Landley , Florian Weimer , Goswin von Brederlow , kernel list , Andrew Morton , mtk.manpages@gmail.com, rdunlap@xenotime.net, linux-doc@vger.kernel.org, linux-ext4@vger.kernel.org, corbet@lwn.net To: Ric Wheeler Return-path: In-Reply-To: <4A9A88B6.9050902@redhat.com> Sender: linux-doc-owner@vger.kernel.org List-Id: linux-ext4.vger.kernel.org Ric Wheeler wrote: [] > The easiest way to lose your data in Linux - with RAID, without RAID, > S-ATA or SAS - is to run with the write cache enabled. > > If you compare the size of even a large RAID stripe it will be measured > in KB and as this thread has mentioned already, you stand to have damage > to just one stripe (or even just a disk sector or two). > > If you lose power with the write caches enabled on that same 5 drive > RAID set, you could lose as much as 5 * 32MB of freshly written data on > a power loss (16-32MB write caches are common on s-ata disks these days). This is fundamentally wrong. Many filesystems today use either barriers or flushes (if barriers are not supported), and the times when disk drives were lying to the OS that the cache got flushed are long gone. > For MD5 (and MD6), you really must run with the write cache disabled > until we get barriers to work for those configurations. I highly doubt barriers will ever be supported on anything but simple raid1, because it's impossible to guarantee ordering across multiple drives. Well, it *is* possible to have write barriers with journalled (and/or with battery-backed-cache) raid[456]. Note that even if raid[456] does not support barriers, write cache flushes still works. /mjt