Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932132Ab0FAOcR (ORCPT ); Tue, 1 Jun 2010 10:32:17 -0400 Received: from cantor2.suse.de ([195.135.220.15]:52470 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932116Ab0FAOcO (ORCPT ); Tue, 1 Jun 2010 10:32:14 -0400 Subject: Re: Wrong DIF guard tag on ext2 write From: James Bottomley To: "Martin K. Petersen" Cc: Chris Mason , Christof Schmitt , Boaz Harrosh , linux-scsi@vger.kernel.org, linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org In-Reply-To: References: <20100531112817.GA16260@schmichrtp.mainz.de.ibm.com> <1275318102.2823.47.camel@mulgrave.site> <4C03D5FD.3000202@panasas.com> <20100601103041.GA15922@schmichrtp.mainz.de.ibm.com> <1275398876.21962.6.camel@mulgrave.site> <20100601133341.GK8980@think> <1275399637.21962.11.camel@mulgrave.site> Content-Type: text/plain; charset="UTF-8" Date: Tue, 01 Jun 2010 14:32:08 +0000 Message-ID: <1275402728.21962.35.camel@mulgrave.site> Mime-Version: 1.0 X-Mailer: Evolution 2.28.2 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1541 Lines: 34 On Tue, 2010-06-01 at 09:50 -0400, Martin K. Petersen wrote: > >>>>> "James" == James Bottomley writes: > > James> Would it be too much work in the fs to mark the page dirty before > James> you begin altering it (and again after you finish, just in case > James> some cleaner noticed and initiated a write)? Or some other flag > James> that indicates page under modification? All the process > James> controlling the writeout (which is pretty high up in the stack) > James> needs to know is if we triggered the check error by altering the > James> page while it was in flight. > > James> I agree that a block based retry would close all the holes ... it > James> just doesn't look elegant to me that the fs will already be > James> repeating the I/O if it changed the page and so will block. > > I experimented with this approach a while back. However, I quickly got > into a situation where frequently updated blocks never made it to disk > because the page was constantly being updated. And all writes failed > with a guard tag error. But that's unfixable with a retry based system as well if the page is changing so fast that the guard is always wrong by the time we get to the array. The only way to fix this is either to copy or freeze the page. James -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/