Received: by 2002:a05:6a10:16a7:0:0:0:0 with SMTP id gp39csp261968pxb; Thu, 12 Nov 2020 03:14:40 -0800 (PST) X-Google-Smtp-Source: ABdhPJz0bMKgUzltK1BeIyIvd73I+SkjElShHY1MCcmCmHQubzpq9+cSB9HUirg6sbnmnibfBy48 X-Received: by 2002:a17:906:f84f:: with SMTP id ks15mr28632988ejb.337.1605179680624; Thu, 12 Nov 2020 03:14:40 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1605179680; cv=none; d=google.com; s=arc-20160816; b=b/La99i5CkFOlpo4xOg0PehvxvwweitCliSVO1A8BTz9iqjVeuNVUaFSsXnY5gN9Ze bLfl1RbSLtBdVQXDNwxAd33H5TwV6NXULC6iZ46TpfzB2yWxhnoQdnGJo0tW/45pRAww 1097wGVpB6kb053OtHOjIuAl/jsvoe96dG2NeEy0Ub1r5klXkMzL/6pZ0sbSmpPMTpB5 4TKdIKWHM4eFr8GFYY50/rKf/zjTpKYlxGkoOn94JFcDtkrroX6dbe5S8I0Xh42mtrYZ 2W/n/p8GdgUYtPaPkRHXJYUxbPHl4IUXTRLC9C5etbBPBmvOyZPkmc3dY2kl7SAZeYjn wCfg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-transfer-encoding :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=PAmEep+2hCC2/zlo0yUsyakhk2jE1W5Chb6kpBgI3qk=; b=QFDyyokcA/3y6xEGnAxhNmcvw/dI9iKw2yYhg1rB8e5bUAtFMKdysQST6gP3sGIoKP 3ZFKlKCpBeCROfLkEj9vRT4fP5im+wzFBdEWycyjxbmfmmdHIKkRm+q0qYtsErKY5+tI 5tAHTusnWXIpOiU7gmAeufcV6QcBMnZoL6hZRsZhum/sJfjQ0dQOQ4Y/narR55KjuQF5 MHcoJcvOt/LpBtQT/jCG4ACj5BdSaeJUfCMyyzLv6L0mo8ZfPkfUXwRC5PiFBOYuQKM2 e60Gec2kFhXz4oh+qWlvA08CXrSywmgQjKDUXBAKzYc/OjHEP0Fv7AqZTQ5sQEnOGFty LbqA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id p15si3897002edq.35.2020.11.12.03.14.16; Thu, 12 Nov 2020 03:14:40 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728091AbgKLLMm (ORCPT + 99 others); Thu, 12 Nov 2020 06:12:42 -0500 Received: from mx2.suse.de ([195.135.220.15]:44036 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728035AbgKLLMV (ORCPT ); Thu, 12 Nov 2020 06:12:21 -0500 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.221.27]) by mx2.suse.de (Postfix) with ESMTP id EB56AAEAA; Thu, 12 Nov 2020 11:12:18 +0000 (UTC) Date: Thu, 12 Nov 2020 12:12:17 +0100 From: Michal =?iso-8859-1?Q?Such=E1nek?= To: Dave Chinner Cc: "Darrick J. Wong" , linux-xfs@vger.kernel.org, linux-ext4@vger.kernel.org, Theodore Ts'o , Andreas Dilger , Ira Weiny , Jan Kara , linux-kernel@vger.kernel.org Subject: Re: [PATCH 1/2] xfs: show the dax option in mount options. Message-ID: <20201112111217.GF29778@kitsune.suse.cz> References: <20201109192419.GC9695@magnolia> <20201109202705.GZ29778@kitsune.suse.cz> <20201109210823.GI7391@dread.disaster.area> <20201111102848.GD29778@kitsune.suse.cz> <20201112014952.GL7391@dread.disaster.area> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20201112014952.GL7391@dread.disaster.area> User-Agent: Mutt/1.10.1 (2018-07-13) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Nov 12, 2020 at 12:49:52PM +1100, Dave Chinner wrote: > On Wed, Nov 11, 2020 at 11:28:48AM +0100, Michal Such?nek wrote: > > On Tue, Nov 10, 2020 at 08:08:23AM +1100, Dave Chinner wrote: > > > On Mon, Nov 09, 2020 at 09:27:05PM +0100, Michal Such?nek wrote: > > > > On Mon, Nov 09, 2020 at 11:24:19AM -0800, Darrick J. Wong wrote: > > > > > On Mon, Nov 09, 2020 at 08:10:08PM +0100, Michal Suchanek wrote: > > > > > > xfs accepts both dax and dax_enum but shows only dax_enum. Show both > > > > > > options. > > > > > > > > > > > > Fixes: 8d6c3446ec23 ("fs/xfs: Make DAX mount option a tri-state") > > > > > > Signed-off-by: Michal Suchanek > > > > > > --- > > > > > > fs/xfs/xfs_super.c | 2 +- > > > > > > 1 file changed, 1 insertion(+), 1 deletion(-) > > > > > > > > > > > > diff --git a/fs/xfs/xfs_super.c b/fs/xfs/xfs_super.c > > > > > > index e3e229e52512..a3b00003840d 100644 > > > > > > --- a/fs/xfs/xfs_super.c > > > > > > +++ b/fs/xfs/xfs_super.c > > > > > > @@ -163,7 +163,7 @@ xfs_fs_show_options( > > > > > > { XFS_MOUNT_GRPID, ",grpid" }, > > > > > > { XFS_MOUNT_DISCARD, ",discard" }, > > > > > > { XFS_MOUNT_LARGEIO, ",largeio" }, > > > > > > - { XFS_MOUNT_DAX_ALWAYS, ",dax=always" }, > > > > > > + { XFS_MOUNT_DAX_ALWAYS, ",dax,dax=always" }, > > > > > > > > > > NAK, programs that require DAX semantics for files stored on XFS must > > > > > call statx to detect the STATX_ATTR_DAX flag, as outlined in "Enabling > > > > > DAX on xfs" in Documentation/filesystems/dax.txt. > > > > statx can be used to query S_DAX. NOTE that only regular files will > > > > ever have S_DAX set and therefore statx will never indicate that S_DAX > > > > is set on directories. > > > > > > Yup, by design. > > > > > > The application doesn't need to do anything complex to make this > > > work. If the app wants to use DAX, then it should use > > > FS_IOC_FS{GS}ETXATTR to always set the on disk per inode DAX flags > > > for it's data dirs and files, and then STATX_ATTR_DAX will *always* > > > tell it whether DAX is actively in use at runtime. It's pretty > > > simple, really. > > > > > > > The filesystem may not have any files so statx cannot be used. > > > > > > Really? The app or installer is about to *write to the fs* and has > > > all the permissions it needs to modify the contents of the fs. It's > > > pretty simple to create a tmpdir, set the DAX flag on the tmpdir, > > > then create a tmpfile in the tmpdir and run STATX_ATTR_DAX on it to > > > see if DAX is active or not..... > > > > Have you ever seen a 'wizard' style installer? > > I wrote my first one in 1995 on Windows NT 3.51 using Installshield. > > > Like one that firsts asks what to install, and then presents a list of > > suitable locations that have enough space, supported filesystem features > > enabled, and whatnot? > > Hold on, 1995 is calling me. The application I was packaging used > ACLs. But the NTFS version created by windows NT 3.1 was > incompatible as ACL support didn't arrive until NT 3.51 and service > pack 4(?) for NT 3.1. Yes, I had to write code to probe the > filesystems to detect whether ACL support was available or not by > -trying to create an ACL-. > > I guess you could say "been there, done that, learnt the lesson". So we are trying to be as bad as Windows now? > > > So to present a list of mountpoints that support DAX one has to scribble > > over every mountpoint on the system? > > If you are filtering storage options presented to the user by > supported features, then you have to probe for them in some way. > And that means you have to consider that many option filesystem > features that applications use cannot be detected via mount options > checking the filesytem config. That is, there are features that can > only be discovered by actually testing whether they work or not. > > > That sounds ridiculous. > > Reality is a harsh mistress. :/ > > [snip the rest because you're being ridiculous] > > Are you aware of ndctl? > > $ ndctl list > [ > { > "dev":"namespace1.0", > "mode":"fsdax", > "map":"mem", > "size":8589934592, > "sector_size":512, > "blockdev":"pmem1" > }, > { > "dev":"namespace0.0", > "mode":"fsdax", > "map":"mem", > "size":8589934592, > "sector_size":512, > "blockdev":"pmem0" > } > ] Yes, that tells me that the device can be configured for dax. Not if the filesystem will use it. > > Oh, look there are two block devices on this machine that are > configured for filesystem DAX (fsdax). They are /dev/pmem0 and > /dev/pmem1. > > What filesytsems are on them? > > $ lsblk -o NAME,SIZE,FSTYPE,MOUNTPOINT /dev/pmem0 /dev/pmem1 > NAME SIZE FSTYPE MOUNTPOINT > pmem1 8G ext4 /mnt/test > pmem0 8G xfs /mnt/scratch > $ > > One XFs, one ext4, both of which will be using DAX capable unless > the dax=never mount option is set. Which: Or the bock size does not match page size. Or whatever other requirement the filesystem might have is not met. > > $ mount |grep pmem > /dev/pmem0 on /mnt/scratch type xfs (rw,relatime,attr2,inode64,logbufs=8,logbsize=32k,noquota) > /dev/pmem1 on /mnt/test type ext4 (rw,relatime) > $ > > is not set on either mount. > > Hence both filesystems at DAX capable and enabled, and should be > presented as options to the user as such. No, it is not the case. That is why it would make sense for the kernel to make the information about DAX availability accessible somewhere. > > And all this comes about because DAX is a property of the block > device, not the filesystem. Hence the only time a DAX capable > filesystem on a block device that is DAX capable will not be DAX > capable is if the dax=never is set... See, it is not property of the block device. It is property of the mount point. The availability on the device is one requirement but the filesystem options affect availability to the user in the end. > > Of course, this is just encoding how existing filesystems behave - > it's not a requirement for future filesytsems so they may use other > mechanisms for enabling/disabling DAX. Which leaves you with the > only reliable mechanism of creating filesystem and checking > statx(STATX_ATTR_DAX).... Or the kernel could just tell the user. But right, information is power, and keeping the user in the dark is much more entertaining. OK I guess I have exhausted sane discussion options here. Best regards Michal