Return-Path: linux-nfs-owner@vger.kernel.org Received: from acsinet15.oracle.com ([141.146.126.227]:55161 "EHLO acsinet15.oracle.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752853Ab2BAQpd (ORCPT ); Wed, 1 Feb 2012 11:45:33 -0500 Date: Wed, 1 Feb 2012 11:45:21 -0500 From: Chris Mason To: Gregory Farnum Cc: Bernd Schubert , Linux NFS Mailing List , linux-scsi@vger.kernel.org, "Martin K. Petersen" , Sven Breuner , Chuck Lever , linux-fsdevel , lsf-pc@lists.linux-foundation.org Subject: Re: [Lsf-pc] [LSF/MM TOPIC] end-to-end data and metadata corruption detection Message-ID: <20120201164521.GY16796@shiny> References: <38C050B3-2AAD-4767-9A25-02C33627E427@oracle.com> <4F2147BA.6030607@itwm.fraunhofer.de> <4F217F0C.6030105@itwm.fraunhofer.de> <4F283F7A.4020905@itwm.fraunhofer.de> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii In-Reply-To: Sender: linux-nfs-owner@vger.kernel.org List-ID: On Tue, Jan 31, 2012 at 11:28:26AM -0800, Gregory Farnum wrote: > On Tue, Jan 31, 2012 at 11:22 AM, Bernd Schubert > wrote: > > I guess we should talk to developers of other parallel file systems and see > > what they think about it. I think cephfs already uses data integrity > > provided by btrfs, although I'm not entirely sure and need to check the > > code. As I said before, Lustre does network checksums already and *might* be > > interested. > > Actually, right now Ceph doesn't check btrfs' data integrity > information, but since Ceph doesn't have any data-at-rest integrity > verification it relies on btrfs if you want that. Integrating > integrity verification throughout the system is on our long-term to-do > list. > We too will be said if using a kernel-level integrity system requires > using DIO, although we could probably work out a way to do > "translation" between our own integrity checksums and the > btrfs-generated ones if we have to (thanks to replication). DIO isn't really required, but doing this without synchronous writes will get painful in a hurry. There's nothing wrong with letting the data sit in the page cache after the IO is done though. -chris