Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1757966AbcCUU7e (ORCPT ); Mon, 21 Mar 2016 16:59:34 -0400 Received: from mx1.redhat.com ([209.132.183.28]:38918 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1757937AbcCUU7c (ORCPT ); Mon, 21 Mar 2016 16:59:32 -0400 Date: Mon, 21 Mar 2016 16:59:29 -0400 From: Brian Foster To: Mike Snitzer Cc: "Darrick J. Wong" , Jens Axboe , Linus Torvalds , Bruce Fields , "Theodore Ts'o" , "Martin K. Petersen" , linux-api@vger.kernel.org, david@fromorbit.com, "linux-kernel@vger.kernel.org" , shane.seymour@hpe.com, Christoph Hellwig , linux-fsdevel , Jeff Layton , Andrew Morton , device-mapper development Subject: Re: [PATCH 3/3] block: implement (some of) fallocate for block devices Message-ID: <20160321205928.GA48412@bfoster.bfoster> References: <20160315194221.30093.70506.stgit@birch.djwong.org> <20160315194244.30093.6483.stgit@birch.djwong.org> <20160321191116.GB12309@birch.djwong.org> <20160321192229.GA17220@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20160321192229.GA17220@redhat.com> User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3501 Lines: 75 On Mon, Mar 21, 2016 at 03:22:29PM -0400, Mike Snitzer wrote: > On Mon, Mar 21 2016 at 3:11pm -0400, > Darrick J. Wong wrote: > > > On Mon, Mar 21, 2016 at 02:52:00PM -0400, Mike Snitzer wrote: > > > On Tue, Mar 15, 2016 at 3:42 PM, Darrick J. Wong > > > wrote: > > > > After much discussion, it seems that the fallocate feature flag > > > > FALLOC_FL_ZERO_RANGE maps nicely to SCSI WRITE SAME; and the feature > > > > FALLOC_FL_PUNCH_HOLE maps nicely to the devices that have been > > > > whitelisted for zeroing SCSI UNMAP. Punch still requires that > > > > FALLOC_FL_KEEP_SIZE is set. A length that goes past the end of the > > > > device will be clamped to the device size if KEEP_SIZE is set; or will > > > > return -EINVAL if not. Both start and length must be aligned to the > > > > device's logical block size. > > > > > > > > Since the semantics of fallocate are fairly well established already, > > > > wire up the two pieces. The other fallocate variants (collapse range, > > > > insert range, and allocate blocks) are not supported. > > > > > > I'd like to see fallocate (block allocation) extend down to DM thinp. > > > This more traditional use of fallocate would be useful for ensuring > > > ENOSPC won't occur -- especially important if the FS has committed > > > space in response to fallocate. As of now fallocate doesn't inform DM > > > thinp at all. Curious why you decided not to wire it up? > > > > I don't know what to wire it up to. :) > > Fair enough. Yes something needs to be invented. > > > I didn't find any blkdev_* function that looked encouraging, though I > > haven't dug too deeply into bfoster's "prototype a block reservation > > allocation model" patchset yet. At a high level I'd guess that would > > be a reasonable piece to connect to? It looks like the piece I want > > is blk_provision_space(). > > Yes, something like that. > Just a note that the caveat/hack with the provision call in there is that it returns an allocated block count. That was necessary to help maintain the local reservation accounting. I'd love to find a way to handle that more cleanly or take advantage of generic fallocate, but I don't have a clear idea on how to do that at the moment. (I do wonder whether an internal-only set of falloc "reserve" flags would fly...). Anyways, that's a separate topic. Feel free to steal any of that dm-thin provision code if it is useful for generic fallocate(). :) Brian > > > But I'm not sure what "it" (the "allocate blocks" variant) even is > > > given falloc.h doesn't show anything like "_ALLOCATE_BLOCKS"... > > > > The default behavior of fallocate is to allocate blocks, which means > > that one invokes it by not passing any mode flags (except possibly > > KEEP_SIZE). > > OK. > > > > It would require a new block interface to pass the fallocate extent > > > down. But it seems bizarre to implement "some of" fallocate but not > > > the most widely used case for fallocate. > > > > Agreed. I'd like to get the existing functionality wired up sooner than > > later, and plumbing "allocate blocks" down to thinp can be done as a > > followup. > > > > (Or stall long enough that it becomes one patchset.) > > Sure, sounds good. Glad we're in agreement. > -- > To unsubscribe from this list: send the line "unsubscribe linux-fsdevel" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html