From: Dave Chinner Subject: Re: Ext4 and xfs problems in dm-thin on allocation and discard Date: Wed, 20 Jun 2012 09:12:52 +1000 Message-ID: <20120619231252.GQ25389@dastard> References: <20120619031241.GA3884@redhat.com> <20120619131649.GA6811@redhat.com> <20120619133041.GB6811@redhat.com> <4FE0840F.2050704@shiftmail.org> <20120619144413.GA7225@redhat.com> <20120619184858.GA8841@redhat.com> <20120619200631.GL25389@dastard> <4FE0F132.2080207@shiftmail.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Cc: Mike Snitzer , =?utf-8?B?THVrw6HFoQ==?= Czerner , device-mapper development , linux-ext4@vger.kernel.org, xfs@oss.sgi.com To: Spelic Return-path: Received: from ipmail07.adl2.internode.on.net ([150.101.137.131]:57844 "EHLO ipmail07.adl2.internode.on.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751623Ab2FSXM6 (ORCPT ); Tue, 19 Jun 2012 19:12:58 -0400 Content-Disposition: inline In-Reply-To: <4FE0F132.2080207@shiftmail.org> Sender: linux-ext4-owner@vger.kernel.org List-ID: On Tue, Jun 19, 2012 at 11:37:54PM +0200, Spelic wrote: > On 06/19/12 22:06, Dave Chinner wrote: > >On Tue, Jun 19, 2012 at 02:48:59PM -0400, Mike Snitzer wrote: > >>On Tue, Jun 19 2012 at 10:44am -0400, > >>Mike Snitzer wrote: > >> > >>>On Tue, Jun 19 2012 at 9:52am -0400, > >>>Spelic wrote: > >>> > >>>>I do not know what is the mechanism for which xfs cannot unmap > >>>>blocks from dm-thin, but it really can't. > >>>>If anyone has dm-thin installed he can try. This is 100% > >>>>reproducible for me. > >>>I was initially surprised by this considering the thinp-test-suite does > >>>test a compilebench workload against xfs and ext4 using online discard > >>>(-o discard). > >>> > >>>But I just modified that test to use a thin-pool with 'ignore_discard' > >>>and the test still passed on both ext4 and xfs. > >>> > >>>So there is more work needed in the thinp-test-suite to use blktrace > >>>hooks to verify that discards are occuring when the compilebench > >>>generated files are removed. > >>> > >>>I'll work through that and report back. > >>blktrace shows discards for both xfs and ext4. > >> > >>But in general xfs is issuing discards with much smaller extents than > >>ext4 does, e.g.: > >THat's normal when you use -o discard - XFS sends extremely > >fine-grained discards as the have to be issued during the checkpoint > >commit that frees the extent. Hence they can't be aggregated like is > >done in ext4. > > > >As it is, no-one really should be using -o discard - it is extremely > >inefficient compared to a background fstrim run given that discards > >are unqueued, blocking IOs. It's just a bad idea until the lower > >layers get fixed to allow asynchronous, vectored discards and SATA > >supports queued discards... > > > > Could it be that the thin blocksize is larger than the discard > granularity by xfs so nothing ever gets unmapped? for -o discard, possibly. for fstrim, unlikely. > I have tried thin pools with the default blocksize (64k afair with > lvm2) and 1MB. > HOWEVER I also have tried fstrim on xfs, and that is also not > capable to unmap things from the dm-thin. > What is the granularity with fstrim in xfs? Whatever granularity you passed fstrim. You need to run an event trace on XFS to find out if it is issuing discards before going any further.. Cheers, Dave. -- Dave Chinner david@fromorbit.com