Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755339Ab0KIVo5 (ORCPT ); Tue, 9 Nov 2010 16:44:57 -0500 Received: from cantor.suse.de ([195.135.220.2]:53280 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752278Ab0KIVoz (ORCPT ); Tue, 9 Nov 2010 16:44:55 -0500 Date: Tue, 9 Nov 2010 22:44:52 +0100 From: Jan Kara To: Rik van Riel Cc: "Ted Ts'o" , Jeff Layton , linux-kernel@vger.kernel.org, akpm@linux-foundation.org, esandeen@redhat.com, jmoyer@redhat.com, linux-fsdevel@vger.kernel.org Subject: Re: [PATCH] clear PageError bit in msync & fsync Message-ID: <20101109214452.GH4936@quack.suse.cz> References: <20101109114422.3918e7f6@annuminas.surriel.com> <20101109142109.224267d0@corrin.poochiereds.net> <4CD9A209.6070807@redhat.com> <20101109210715.GJ3099@thunk.org> <4CD9BA08.2000002@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <4CD9BA08.2000002@redhat.com> User-Agent: Mutt/1.5.20 (2009-06-14) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2154 Lines: 55 On Tue 09-11-10 16:15:52, Rik van Riel wrote: > On 11/09/2010 04:07 PM, Ted Ts'o wrote: > >On Tue, Nov 09, 2010 at 02:33:29PM -0500, Rik van Riel wrote: > >> > >>There are essentially two possibilities: > >>1) the VM can potentially be filled up with uncleanable dirty pages, or > >>2) pages that hit an IO error are left in a clean state, so they can > >> be reclaimed under memory pressure > >> > >>Alternative 1 could cause the entire system to deadlock, while > >>option 2 puts the onus on userland apps to rewrite the data > >>from a failed msync/fsync. > >> > >>Currently the VM has behaviour #2 which is preserved with my > >>patch. > >> > >>The only difference with my patch is, we won't keep returning > >>-EIO on subsequent, error free, msync or fsync calls to files > >>that had an IO error at some previous point in the past. > > > >Do we guarantee that the application will get EIO at least once? I > >thought there were issues where the error bit could get lost if the > >page writeback was triggered by sync() run by a third-party > >application. > > There is no such guarantee in the current kernel, either > with or without my patch. > > A third application calling fsync or msync can get the > EIO cleared, so the application that did the write does > not see it. > > The VM could also reclaim the PageError page due to > memory pressure, so the application calling fsync or > msync does not see it. What should be done in this case is that we set AS_EIO in the page->mapping->flags like we currently do in fs/buffer.c. > I see no good way in which we could guarantee that > every process calling msync or fsync on a file that > had an IO error in the past gets EIO once - at least, > not without every one of them always getting EIO on > the file even after the IO path is good again... Yes, this is rather hard to do... Honza -- Jan Kara SUSE Labs, CR -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/