Return-Path: Received: from mail1.vodafone.ie ([213.233.128.43]:33197 "EHLO mail1.vodafone.ie" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751862AbbIHUDN (ORCPT ); Tue, 8 Sep 2015 16:03:13 -0400 Message-ID: <55EF3EFD.3080302@draigBrady.com> Date: Tue, 08 Sep 2015 21:03:09 +0100 From: =?UTF-8?B?UMOhZHJhaWcgQnJhZHk=?= MIME-Version: 1.0 To: Andy Lutomirski , Anna Schumaker CC: linux-nfs@vger.kernel.org, Linux btrfs Developers List , Linux FS Devel , Linux API , Zach Brown , Al Viro , Chris Mason , "Darrick J. Wong" , Michael Kerrisk-manpages , andros@netapp.com, Christoph Hellwig , Coreutils Subject: Re: [PATCH v1 0/8] VFS: In-kernel copy system call References: <1441397823-1203-1-git-send-email-Anna.Schumaker@Netapp.com> <55EEFCEE.5090000@draigBrady.com> <55EF279B.3020101@Netapp.com> In-Reply-To: Content-Type: text/plain; charset=utf-8 Sender: linux-nfs-owner@vger.kernel.org List-ID: On 08/09/15 20:10, Andy Lutomirski wrote: > On Tue, Sep 8, 2015 at 11:23 AM, Anna Schumaker > wrote: >> On 09/08/2015 11:21 AM, Pádraig Brady wrote: >>> I see copy_file_range() is a reflink() on BTRFS? >>> That's a bit surprising, as it avoids the copy completely. >>> cp(1) for example considered doing a BTRFS clone by default, >>> but didn't due to expectations that users actually wanted >>> the data duplicated on disk for resilience reasons, >>> and for performance reasons so that write latencies were >>> restricted to the copy operation, rather than being >>> introduced at usage time as the dest file is CoW'd. >>> >>> If reflink() is a possibility for copy_file_range() >>> then could it be done optionally with a flag? >> >> The idea is that filesystems get to choose how to handle copies in the default case. BTRFS could do a reflink, but NFS could do a server side copy instead. I can change the default behavior to only do a data copy (unless the reflink flag is specified) instead, if that is desirable. >> >> What does everybody think? > > I think the best you could do is to have a hint asking politely for > the data to be deep-copied. After all, some filesystems reserve the > right to transparently deduplicate. > > Also, on a true COW filesystem (e.g. btrfs sometimes), there may be no > advantage to deep copying unless you actually want two copies for > locality reasons. Agreed. The relink and server side copy are separate things. There's no advantage to not doing a server side copy, but as mentioned there may be advantages to doing deep copies on BTRFS (another reason not previous mentioned in this thread, would be to avoid ENOSPC errors at some time in the future). So having control over the deep copy seems useful. It's debatable whether ALLOW_REFLINK should be on/off by default for copy_file_range(). I'd be inclined to have such a setting off by default, but cp(1) at least will work with whatever is chosen. thanks, Pádraig.