Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932431AbWHAGtB (ORCPT ); Tue, 1 Aug 2006 02:49:01 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S932432AbWHAGtB (ORCPT ); Tue, 1 Aug 2006 02:49:01 -0400 Received: from thunk.org ([69.25.196.29]:28610 "EHLO thunker.thunk.org") by vger.kernel.org with ESMTP id S932431AbWHAGs7 (ORCPT ); Tue, 1 Aug 2006 02:48:59 -0400 Date: Tue, 1 Aug 2006 02:48:38 -0400 From: Theodore Tso To: David Lang Cc: David Masover , tdwebste2@yahoo.com, Nate Diller , Adrian Ulrich , "Horst H. von Brand" , ipso@snappymail.ca, reiser@namesys.com, lkml@lpbproductions.com, jeff@garzik.org, linux-kernel@vger.kernel.org, reiserfs-list@namesys.com Subject: Re: Solaris ZFS on Linux [Was: Re: the " 'official' point of view"expressed by kernelnewbies.org regarding reiser4 inclusion] Message-ID: <20060801064837.GB1987@thunk.org> Mail-Followup-To: Theodore Tso , David Lang , David Masover , tdwebste2@yahoo.com, Nate Diller , Adrian Ulrich , "Horst H. von Brand" , ipso@snappymail.ca, reiser@namesys.com, lkml@lpbproductions.com, jeff@garzik.org, linux-kernel@vger.kernel.org, reiserfs-list@namesys.com References: <20060801034726.58097.qmail@web51311.mail.yahoo.com> <44CED777.5080308@slaphack.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.11 X-SA-Exim-Connect-IP: X-SA-Exim-Mail-From: tytso@thunk.org X-SA-Exim-Scanned: No (on thunker.thunk.org); SAEximRunCond expanded to false Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2366 Lines: 42 On Mon, Jul 31, 2006 at 09:41:02PM -0700, David Lang wrote: > just becouse you have redundancy doesn't mean that your data is idle enough > for you to run a repacker with your spare cycles. to run a repacker you > need a time when the chunk of the filesystem that you are repacking is not > being accessed or written to. it doesn't matter if that data lives on one > disk or 9 disks all mirroring the same data, you can't just break off 1 of > the copies and repack that becouse by the time you finish it won't match > the live drives anymore. > > database servers have a repacker (vaccum), and they are under tremendous > preasure from their users to avoid having to use it becouse of the > performance hit that it generates. (the theory in the past is exactly what > was presented in this thread, make things run faster most of the time and > accept the performance hit when you repack). the trend seems to be for a > repacker thread that runs continuously, causing a small impact all the time > (that can be calculated into the capacity planning) instead of a large > impact once in a while. Ah, but as soon as the repacker thread runs continuously, then you lose all or most of the claimed advantage of "wandering logs". Specifically, the claim of the "wandering log" is that you don't have to write your data twice --- once to the log, and once to the final location on disk (whereas with ext3 you end up having to do double writes). But if the repacker is running continuously, you end up doing double writes anyway, as the repacker moves things from a location that is convenient for the log, to a location which is efficient for reading. Worse yet, if the repacker is moving disk blocks or objects which are no longer in cache, it may end up having to read objects in before writing them to a final location on disk. So instead of a write-write overhead, you end up with a write-read-write overhead. But of course, people tend to disable the repacker when doing benchmarks because they're trying to play the "my filesystem/database has bigger performance numbers than yours" game.... - Ted - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/