Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932238AbWCQWXE (ORCPT ); Fri, 17 Mar 2006 17:23:04 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S932230AbWCQWXD (ORCPT ); Fri, 17 Mar 2006 17:23:03 -0500 Received: from mx1.redhat.com ([66.187.233.31]:3789 "EHLO mx1.redhat.com") by vger.kernel.org with ESMTP id S932210AbWCQWXA (ORCPT ); Fri, 17 Mar 2006 17:23:00 -0500 Subject: Re: ext3_ordered_writepage() questions From: "Stephen C. Tweedie" To: Badari Pulavarty Cc: Andreas Dilger , Jan Kara , "Theodore Ts'o" , Andrew Morton , lkml , linux-fsdevel , Stephen Tweedie In-Reply-To: <1142631141.15257.40.camel@dyn9047017100.beaverton.ibm.com> References: <20060308124726.GC4128@lst.de> <4410551D.5000303@us.ibm.com> <20060309153550.379516e1.akpm@osdl.org> <4410CA25.2090400@us.ibm.com> <20060316180904.GA29275@thunk.org> <1142533360.21442.153.camel@dyn9047017100.beaverton.ibm.com> <20060316210424.GD29275@thunk.org> <1142546275.21442.172.camel@dyn9047017100.beaverton.ibm.com> <20060316220545.GB18753@atrey.karlin.mff.cuni.cz> <1142552722.21442.180.camel@dyn9047017100.beaverton.ibm.com> <20060317005418.GY30801@schatzie.adilger.int> <1142615110.3641.8.camel@orbit.scot.redhat.com> <1142631141.15257.40.camel@dyn9047017100.beaverton.ibm.com> Content-Type: text/plain Date: Fri, 17 Mar 2006 17:22:13 -0500 Message-Id: <1142634134.3641.56.camel@orbit.scot.redhat.com> Mime-Version: 1.0 X-Mailer: Evolution 2.6.0 (2.6.0-1) Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2101 Lines: 51 Hi, On Fri, 2006-03-17 at 13:32 -0800, Badari Pulavarty wrote: > I have a patch which eliminates adding buffers to the journal, if > we are doing just re-write of the disk block. ... > 2.6.16-rc6 2.6.16-rc6+patch > real 0m6.606s 0m3.705s OK, that's a really significant win! What exactly was the test case for this, and does that performance edge persist for a longer-running test? > In real world, does this ordering guarantee matter ? Not that I am aware of. Even with the ordering guarantee, there is still no guarantee of the order in which the writes hit disk within that transaction, which makes it hard to depend on it. I recall that some versions of fsync depended on ordered mode flushing dirty data on transaction commit, but I don't think the current ext3_sync_file() will have any problems there. Other than that, the only thing I can think of that had definite dependencies in this are was InterMezzo, and that's no longer in the tree. Even then, I'm not 100% certain that InterMezzo had a dependency for overwrites (it was certainly strongly dependent on the ordering semantics for allocates.) It is theoretically possible to write applications that depend on that ordering, but they would be necessarily non-portable anyway. I think relaxing it is fine, especially for a 100% (wow) performance gain. There is one other perspective to be aware of, though: the current behaviour means that by default ext3 generally starts flushing pending writeback data within 5 seconds of a write. Without that, we may end up accumulating a lot more dirty data in memory, shifting the task of write throttling from the filesystem to the VM. That's not a problem per se, just a change of behaviour to keep in mind, as it could expose different corner cases in the performance of write-intensive workloads. --Stephen - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/