From: david@lang.hm Subject: Re: raid is dangerous but that's secret (was Re: [patch] ext2/3: document conditions when reliable operation is possible) Date: Mon, 31 Aug 2009 08:50:53 -0700 (PDT) Message-ID: References: <20090827221319.GA1601@ucw.cz> <4A9733C1.2070904@redhat.com> <20090828064449.GA27528@elf.ucw.cz> <20090828120854.GA8153@mit.edu> <20090830075135.GA1874@ucw.cz> <4A9A88B6.9050902@redhat.com> <4A9A9034.8000703@msgid.tls.msk.ru> <20090830163513.GA25899@infradead.org> <4A9BCCEF.7010402@redhat.com> <20090831131626.GA17325@infradead.org> <4A9BCEA8.9080206@redhat.com> Mime-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed Cc: Christoph Hellwig , Michael Tokarev , Pavel Machek , Theodore Tso , NeilBrown , Rob Landley , Florian Weimer , Goswin von Brederlow , kernel list , Andrew Morton , mtk.manpages@gmail.com, rdunlap@xenotime.net, linux-doc@vger.kernel.org, linux-ext4@vger.kernel.org, corbet@lwn.net To: Ric Wheeler Return-path: Received: from mail.lang.hm ([64.81.33.126]:43644 "EHLO bifrost.lang.hm" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751644AbZHaPwP (ORCPT ); Mon, 31 Aug 2009 11:52:15 -0400 In-Reply-To: <4A9BCEA8.9080206@redhat.com> Sender: linux-ext4-owner@vger.kernel.org List-ID: On Mon, 31 Aug 2009, Ric Wheeler wrote: > On 08/31/2009 09:16 AM, Christoph Hellwig wrote: >> On Mon, Aug 31, 2009 at 09:15:27AM -0400, Ric Wheeler wrote: >>>> While most common filesystem do have barrier support it is: >>>> >>>> - not actually enabled for the two most common filesystems >>>> - the support for write barriers an cache flushing tends to be buggy >>>> all over our software stack, >>>> >>> >>> Or just missing - I think that MD5/6 simply drop the requests at present. >>> >>> I wonder if it would be worth having MD probe for write cache enabled& >>> warn if barriers are not supported? >> >> In my opinion even that is too weak. We know how to control the cache >> settings on all common disks (that is scsi and ata), so we should always >> disable the write cache unless we know that the whole stack (filesystem, >> raid, volume managers) supports barriers. And even then we should make >> sure the filesystems does actually use barriers everywhere that's needed >> which failed at for years. >> > > I was thinking about that as well. Having us disable the write cache when we > know it is not supported (like in the MD5 case) would certainly be *much* > safer for almost everyone. > > We would need to have a way to override the write cache disabling for people > who either know that they have a non-volatile write cache (unlikely as it > would probably be to put MD5 on top of a hardware RAID/external array, but > some of the new SSD's claim to have non-volatile write cache). I've done this when the hardware raid only suppored raid 5 but I wanted raid 6. I've also done it when I had enough disks to need more than one hardware raid card to talk to them all, but wanted one logical drive for the system. > It would also be very useful to have all of our top tier file systems enable > barriers by default, provide consistent barrier on/off mount options and log > a nice warning when not enabled.... most people are not willing to live with unbuffered write performance. they care about their data, but they also care about performance, and since performance is what they see on an ongong basis, they tend to care more about performance. given that we don't even have barriers enabled by default on ext3 due to the performance hit, what makes you think that disabling buffers entirely is going to be acceptable to people? David Lang