Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1757910AbcCUTWf (ORCPT ); Mon, 21 Mar 2016 15:22:35 -0400 Received: from mx1.redhat.com ([209.132.183.28]:48782 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1757625AbcCUTWb (ORCPT ); Mon, 21 Mar 2016 15:22:31 -0400 Date: Mon, 21 Mar 2016 15:22:29 -0400 From: Mike Snitzer To: "Darrick J. Wong" Cc: Jens Axboe , Linus Torvalds , Bruce Fields , "Theodore Ts'o" , "Martin K. Petersen" , linux-api@vger.kernel.org, david@fromorbit.com, "linux-kernel@vger.kernel.org" , shane.seymour@hpe.com, Christoph Hellwig , linux-fsdevel , Jeff Layton , Andrew Morton , device-mapper development Subject: Re: [PATCH 3/3] block: implement (some of) fallocate for block devices Message-ID: <20160321192229.GA17220@redhat.com> References: <20160315194221.30093.70506.stgit@birch.djwong.org> <20160315194244.30093.6483.stgit@birch.djwong.org> <20160321191116.GB12309@birch.djwong.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20160321191116.GB12309@birch.djwong.org> User-Agent: Mutt/1.5.21 (2010-09-15) X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.38]); Mon, 21 Mar 2016 19:22:31 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2560 Lines: 57 On Mon, Mar 21 2016 at 3:11pm -0400, Darrick J. Wong wrote: > On Mon, Mar 21, 2016 at 02:52:00PM -0400, Mike Snitzer wrote: > > On Tue, Mar 15, 2016 at 3:42 PM, Darrick J. Wong > > wrote: > > > After much discussion, it seems that the fallocate feature flag > > > FALLOC_FL_ZERO_RANGE maps nicely to SCSI WRITE SAME; and the feature > > > FALLOC_FL_PUNCH_HOLE maps nicely to the devices that have been > > > whitelisted for zeroing SCSI UNMAP. Punch still requires that > > > FALLOC_FL_KEEP_SIZE is set. A length that goes past the end of the > > > device will be clamped to the device size if KEEP_SIZE is set; or will > > > return -EINVAL if not. Both start and length must be aligned to the > > > device's logical block size. > > > > > > Since the semantics of fallocate are fairly well established already, > > > wire up the two pieces. The other fallocate variants (collapse range, > > > insert range, and allocate blocks) are not supported. > > > > I'd like to see fallocate (block allocation) extend down to DM thinp. > > This more traditional use of fallocate would be useful for ensuring > > ENOSPC won't occur -- especially important if the FS has committed > > space in response to fallocate. As of now fallocate doesn't inform DM > > thinp at all. Curious why you decided not to wire it up? > > I don't know what to wire it up to. :) Fair enough. Yes something needs to be invented. > I didn't find any blkdev_* function that looked encouraging, though I > haven't dug too deeply into bfoster's "prototype a block reservation > allocation model" patchset yet. At a high level I'd guess that would > be a reasonable piece to connect to? It looks like the piece I want > is blk_provision_space(). Yes, something like that. > > But I'm not sure what "it" (the "allocate blocks" variant) even is > > given falloc.h doesn't show anything like "_ALLOCATE_BLOCKS"... > > The default behavior of fallocate is to allocate blocks, which means > that one invokes it by not passing any mode flags (except possibly > KEEP_SIZE). OK. > > It would require a new block interface to pass the fallocate extent > > down. But it seems bizarre to implement "some of" fallocate but not > > the most widely used case for fallocate. > > Agreed. I'd like to get the existing functionality wired up sooner than > later, and plumbing "allocate blocks" down to thinp can be done as a > followup. > > (Or stall long enough that it becomes one patchset.) Sure, sounds good. Glad we're in agreement.