DomainKey-Signature: a=rsa-sha1; c=nofws; d=newdream.net; h=date:from:to:cc
	:subject:in-reply-to:message-id:references:mime-version:
	content-type; q=dns; s=newdream.net; b=JCBs3W1teVr8dKEkWcQn16dSX
	FPJrX48jRSMr+2nUJYYEikdMT2Kiwd3mqekCmre8Lep7dAlyV3JJFm1Njd20NXGU
	oLYIiZ7kaiuyEQG3tIcDeGHAiLYjZ4y/Miy8+QUoSxI4S+wCcHfaQVvo9NVGkz8e
	ntZEEPL5RtsfU+nFkA=
Date: Tue, 15 Mar 2011 09:08:09 -0700 (PDT)
From: Sage Weil <sage@newdream.net>
To: Andreas Dilger <adilger@dilger.ca>
cc: Dave Chinner <david@fromorbit.com>, Indan Zupancic <indan@nul.nu>,
        Greg KH <greg@kroah.com>, Jonathan Nieder <jrnieder@gmail.com>,
        Arnd Bergmann <arnd@arndb.de>,
        "linux-fsdevel@vger.kernel.org" <linux-fsdevel@vger.kernel.org>,
        "linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
        "Aneesh Kumar K. V" <aneesh.kumar@linux.vnet.ibm.com>,
        "akpm@linux-foundation.org" <akpm@linux-foundation.org>,
        "linux-api@vger.kernel.org" <linux-api@vger.kernel.org>,
        "mtk.manpages@gmail.com" <mtk.manpages@gmail.com>,
        "viro@zeniv.linux.org.uk" <viro@zeniv.linux.org.uk>,
        "hch@lst.de" <hch@lst.de>, "l@jasper.es" <l@jasper.es>
Subject: Re: [PATCH v3] introduce sys_syncfs to sync a single file system
In-Reply-To: <C79DDACF-9320-424D-84F5-3DB9C5325B1D@dilger.ca>
Message-ID: <Pine.LNX.4.64.1103150901420.21727@cobra.newdream.net>
References: <Pine.LNX.4.64.1103101125150.4190@cobra.newdream.net>
 <edfa4cf081249734807e582c14253fca.squirrel@webmail.greenhost.nl>
 <201103111255.44979.arnd@arndb.de> <ffafd3ccb3fd1be7c40348c9cee84aef.squirrel@webmail.greenhost.nl>
 <20110311235607.GB15853@elie> <9446ab1a2315c0d2476c30f8315a0503.squirrel@webmail.greenhost.nl>
 <20110312021001.GA16833@elie> <20110312173217.GA24981@kroah.com>
 <1e597aedd3d7825dcc0630b1cf2399fa.squirrel@webmail.greenhost.nl>
 <Pine.LNX.4.64.1103132114380.5145@cobra.newdream.net> <20110315101155.GO15097@dastard>
 <C79DDACF-9320-424D-84F5-3DB9C5325B1D@dilger.ca>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: linux-kernel-owner@vger.kernel.org
Content-Length: 3863
Lines: 95

On Tue, 15 Mar 2011, Andreas Dilger wrote:
> Should there be a "wait" argument or flag that allows an app to start 
> the syncfs(), do something, and then call again to wait for completion?
> 
> A more advanced implementation would have the "wait=0" call return an 
> opaque wait handle (e.g. transaction ID) and then calling back with this 
> handle as an argument would only wait if that handle hadn't committed 
> yet. That allows userspace to do efficient batching without having to 
> wait for the full sync to complete.

FWIW this is what the btrfs {START,WAIT}_SYNC ioctls do.  Because it ties 
into btrfs' transactions, there's an additional semantic that operations 
after the START_SYNC are not included in the committed state (this time 
around, at least).  Absent that (which I'm not sure maps cleanly onto many 
other file systems currently, and is of limited value when not used in 
concert with btrfs snapshots), I'm not sure a wait flag is helpful.  Is it 
any different than an application doing the syncfs() in a separate thread?

sage


> 
> Cheers, Andreas
> 
> On 2011-03-15, at 4:11, Dave Chinner <david@fromorbit.com> wrote:
> 
> > On Sun, Mar 13, 2011 at 09:29:17PM -0700, Sage Weil wrote:
> >> On Mon, 14 Mar 2011, Indan Zupancic wrote:
> >>> Everyone seems to want to add this new syncfs, but it's not even defined 
> >>> what it does. "Same as sync, but only on one fs" is IMHO not good 
> >>> enough, because sync's behaviour is pretty badly documented, and that's 
> >>> a system call.
> >> 
> >> How about the man page below?  I tried to avoid the somewhat antiquated 
> >> implementation specific terminology in the sync(2) man page.
> >> 
> >> I think adding this functionality into sync_file_range(2) is forcing 
> >> unrelated functionality into an existing interface; sync_file_range 
> >> operates on _files_, not an entire file system.  With each API addition it 
> >> is more important to make the interface simple and intuitive than to 
> >> minimize the size of our patches.  IMO that's why a new syscall is 
> >> preferable to, say, an equivalent ioctl.
> >> 
> >> Thanks-
> >> sage
> >> 
> >> 
> >> .TH SYNCFS 2 2011-03-13 "Linux" "Linux Programmer's Manual"
> >> .SH NAME
> >> syncfs \- commit cached file system state to stable storage
> >> .SH SYNOPSIS
> >> .B #include <unistd.h>
> >> .sp
> >> .B void syncfs(int fd);
> >> .SH DESCRIPTION
> >> .BR syncfs ()
> >> flushes any cached data modifications to the file system containing the 
> >> file referenced by the file descriptor
> >> .I fd
> >> to stable storage (usually a disk).  This includes the results of any
> >> file modifications or other file system operations that have completed
> >> prior to the call to
> >> .BR syncfs(2).
> >> This is similar to 
> >> .BR sync(2),
> >> but will commit changes for only a single file system instead of all
> >> mounted file systems.
> >> .SH ERRORS
> >> This function is always successful.
> > 
> > Perhaps we should consider propagating errors out to the user
> > application rather than discarding them in kernel and pretending we
> > can't ever have a write error?
> > 
> > Cheers,
> > 
> > Dave.
> > -- 
> > Dave Chinner
> > david@fromorbit.com
> > --
> > To unsubscribe from this list: send the line "unsubscribe linux-fsdevel" in
> > the body of a message to majordomo@vger.kernel.org
> > More majordomo info at  http://vger.kernel.org/majordomo-info.html
> --
> To unsubscribe from this list: send the line "unsubscribe linux-fsdevel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 
> 
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/