From: Ric Wheeler Subject: Re: raid is dangerous but that's secret (was Re: [patch] ext2/3: document conditions when reliable operation is possible) Date: Mon, 31 Aug 2009 09:22:48 -0400 Message-ID: <4A9BCEA8.9080206@redhat.com> References: <20090827221319.GA1601@ucw.cz> <4A9733C1.2070904@redhat.com> <20090828064449.GA27528@elf.ucw.cz> <20090828120854.GA8153@mit.edu> <20090830075135.GA1874@ucw.cz> <4A9A88B6.9050902@redhat.com> <4A9A9034.8000703@msgid.tls.msk.ru> <20090830163513.GA25899@infradead.org> <4A9BCCEF.7010402@redhat.com> <20090831131626.GA17325@infradead.org> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: Michael Tokarev , david@lang.hm, Pavel Machek , Theodore Tso , NeilBrown , Rob Landley , Florian Weimer , Goswin von Brederlow , kernel list , Andrew Morton , mtk.manpages@gmail.com, rdunlap@xenotime.net, linux-doc@vger.kernel.org, linux-ext4@vger.kernel.org, corbet@lwn.net To: Christoph Hellwig Return-path: In-Reply-To: <20090831131626.GA17325@infradead.org> Sender: linux-doc-owner@vger.kernel.org List-Id: linux-ext4.vger.kernel.org On 08/31/2009 09:16 AM, Christoph Hellwig wrote: > On Mon, Aug 31, 2009 at 09:15:27AM -0400, Ric Wheeler wrote: >>> While most common filesystem do have barrier support it is: >>> >>> - not actually enabled for the two most common filesystems >>> - the support for write barriers an cache flushing tends to be buggy >>> all over our software stack, >>> >> >> Or just missing - I think that MD5/6 simply drop the requests at present. >> >> I wonder if it would be worth having MD probe for write cache enabled& >> warn if barriers are not supported? > > In my opinion even that is too weak. We know how to control the cache > settings on all common disks (that is scsi and ata), so we should always > disable the write cache unless we know that the whole stack (filesystem, > raid, volume managers) supports barriers. And even then we should make > sure the filesystems does actually use barriers everywhere that's needed > which failed at for years. > I was thinking about that as well. Having us disable the write cache when we know it is not supported (like in the MD5 case) would certainly be *much* safer for almost everyone. We would need to have a way to override the write cache disabling for people who either know that they have a non-volatile write cache (unlikely as it would probably be to put MD5 on top of a hardware RAID/external array, but some of the new SSD's claim to have non-volatile write cache). It would also be very useful to have all of our top tier file systems enable barriers by default, provide consistent barrier on/off mount options and log a nice warning when not enabled.... ric