2010-11-18 07:37:08

Subject: [PATCH 1/2] fs: Do not dispatch FITRIM through separate super_operation

There was concern that FITRIM ioctl is not common enough to be included
in core vfs ioctl, as Christoph Hellwig pointed out there's no real point
in dispatching this out to a separate vector instead of just through
->ioctl.

So this commit removes ioctl_fstrim() from vfs ioctl and trim_fs
from super_operation structure.

Signed-off-by: Lukas Czerner <[email protected]>
---
fs/ext4/super.c | 1 -
fs/ioctl.c | 39 ---------------------------------------
include/linux/fs.h | 1 -
3 files changed, 0 insertions(+), 41 deletions(-)

diff --git a/fs/ext4/super.c b/fs/ext4/super.c
index 61182fe..1d3c2aa 100644
--- a/fs/ext4/super.c
+++ b/fs/ext4/super.c
@@ -1197,7 +1197,6 @@ static const struct super_operations ext4_sops = {
 	.quota_write = ext4_quota_write,
 #endif
 	.bdev_try_to_free_page = bdev_try_to_free_page,
-	.trim_fs = ext4_trim_fs
 };

static const struct super_operations ext4_nojournal_sops = {
diff --git a/fs/ioctl.c b/fs/ioctl.c
index e92fdbb..f855ea4 100644
--- a/fs/ioctl.c
+++ b/fs/ioctl.c
@@ -530,41 +530,6 @@ static int ioctl_fsthaw(struct file *filp)
 	return thaw_super(sb);
 }

-static int ioctl_fstrim(struct file *filp, void __user *argp)
-{
-	struct super_block *sb = filp->f_path.dentry->d_inode->i_sb;
-	struct fstrim_range range;
-	int ret = 0;
-
-	if (!capable(CAP_SYS_ADMIN))
-		return -EPERM;
-
-	/* If filesystem doesn't support trim feature, return. */
-	if (sb->s_op->trim_fs == NULL)
-		return -EOPNOTSUPP;
-
-	/* If a blockdevice-backed filesystem isn't specified, return EINVAL. */
-	if (sb->s_bdev == NULL)
-		return -EINVAL;
-
-	if (argp == NULL) {
-		range.start = 0;
-		range.len = ULLONG_MAX;
-		range.minlen = 0;
-	} else if (copy_from_user(&range, argp, sizeof(range)))
-		return -EFAULT;
-
-	ret = sb->s_op->trim_fs(sb, &range);
-	if (ret < 0)
-		return ret;
-
-	if ((argp != NULL) &&
-	    (copy_to_user(argp, &range, sizeof(range))))
-		return -EFAULT;
-
-	return 0;
-}
-
 /*
  * When you add any new common ioctls to the switches above and below
  * please update compat_sys_ioctl() too.
@@ -615,10 +580,6 @@ int do_vfs_ioctl(struct file *filp, unsigned int fd, unsigned int cmd,
 		error = ioctl_fsthaw(filp);
 		break;

-	case FITRIM:
-		error = ioctl_fstrim(filp, argp);
-		break;
-
 	case FS_IOC_FIEMAP:
 		return ioctl_fiemap(filp, arg);

diff --git a/include/linux/fs.h b/include/linux/fs.h
index 334d68a..eedc00b 100644
--- a/include/linux/fs.h
+++ b/include/linux/fs.h
@@ -1612,7 +1612,6 @@ struct super_operations {
 	ssize_t (*quota_write)(struct super_block *, int, const char *, size_t, loff_t);
 #endif
 	int (*bdev_try_to_free_page)(struct super_block*, struct page*, gfp_t);
-	int (*trim_fs) (struct super_block *, struct fstrim_range *);
 };

/*
--
1.7.2.3

2010-11-18 07:37:07

by Lukas Czerner

Subject: [PATCH 2/2] ext4: Add EXT4_IOC_TRIM ioctl to handle batched discard

Filesystem independent ioctl was rejected as not common enough to be in
core vfs ioctl. Since we still need access to this functionality, this
commit adds the ext4-specific ioctl EXT4_IOC_TRIM to dispatch
ext4_trim_fs().

It takes an fstrim_range structure as an argument. fstrim_range is defined
in include/linux/fs.h and its definition is as follows.

struct fstrim_range {
	__u64 start;
	__u64 len;
	__u64 minlen;
};

start  - first byte to trim
len    - number of bytes to trim from start
minlen - minimum extent length to trim; free extents shorter than this
         number of bytes will be ignored. This will be rounded up to fs
         block size.

After the FITRIM is done, the number of actually discarded bytes is stored
in fstrim_range.len to give the user better insight into how much storage
space has really been released for wear-leveling.

Signed-off-by: Lukas Czerner <[email protected]>
---
fs/ext4/ext4.h | 1 +
fs/ext4/ioctl.c | 24 ++++++++++++++++++++++++
2 files changed, 25 insertions(+), 0 deletions(-)

diff --git a/fs/ext4/ext4.h b/fs/ext4/ext4.h
index 6a5edea..2af5042 100644
--- a/fs/ext4/ext4.h
+++ b/fs/ext4/ext4.h
@@ -541,6 +541,7 @@ struct ext4_new_group_data {
/* note ioctl 11 reserved for filesystem-independent FIEMAP ioctl */
#define EXT4_IOC_ALLOC_DA_BLKS _IO('f', 12)
#define EXT4_IOC_MOVE_EXT _IOWR('f', 15, struct move_extent)
+#define EXT4_IOC_TRIM FITRIM

#if defined(__KERNEL__) && defined(CONFIG_COMPAT)
/*
diff --git a/fs/ext4/ioctl.c b/fs/ext4/ioctl.c
index bf5ae88..e07944a 100644
--- a/fs/ext4/ioctl.c
+++ b/fs/ext4/ioctl.c
@@ -331,6 +331,30 @@ mext_out:
 		return err;
 	}

+	case EXT4_IOC_TRIM:
+	{
+		struct super_block *sb = inode->i_sb;
+		struct fstrim_range range;
+		int ret = 0;
+
+		if (!capable(CAP_SYS_ADMIN))
+			return -EPERM;
+
+		if (copy_from_user(&range, (struct fstrim_range *)arg,
+		    sizeof(range)))
+			return -EFAULT;
+
+		ret = ext4_trim_fs(sb, &range);
+		if (ret < 0)
+			return ret;
+
+		if (copy_to_user((struct fstrim_range *)arg, &range,
+		    sizeof(range)))
+			return -EFAULT;
+
+		return 0;
+	}
+
 	default:
 		return -ENOTTY;
 	}
--
1.7.2.3
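
For readers who want to see the calling side, here is a minimal userspace
sketch of the interface described in the commit message of patch 2/2. It
assumes kernel headers that export FITRIM and struct fstrim_range through
<linux/fs.h>. The helper names trim_range_init() and trim_mounted_fs() are
made up for illustration and are not part of the patch. Note that, unlike
the removed ioctl_fstrim(), the ext4 handler in this patch always copies the
range from userspace, so the caller has to pass a filled-in structure.

```c
/* Sketch only: both helpers below are hypothetical, not from the patch. */
#include <assert.h>
#include <errno.h>
#include <fcntl.h>
#include <limits.h>      /* ULLONG_MAX, handy for "trim everything" */
#include <linux/fs.h>    /* FITRIM, struct fstrim_range */
#include <stddef.h>
#include <sys/ioctl.h>
#include <unistd.h>

/* Fill in a trim request. All three values are in bytes. */
void trim_range_init(struct fstrim_range *r, unsigned long long start,
                     unsigned long long len, unsigned long long minlen)
{
    assert(r != NULL);
    r->start = start;
    r->len = len;
    r->minlen = minlen;
}

/*
 * Issue FITRIM on the filesystem that contains `path` (usually the mount
 * point). Returns 0 on success, in which case r->len has been overwritten
 * with the number of bytes actually discarded, or a negative errno value.
 */
int trim_mounted_fs(const char *path, struct fstrim_range *r)
{
    int fd = open(path, O_RDONLY);
    int err = 0;

    if (fd < 0)
        return -errno;
    if (ioctl(fd, FITRIM, r) < 0)
        err = -errno;    /* save errno before close() can change it */
    close(fd);
    return err;
}
```

A caller would normally run this with CAP_SYS_ADMIN (the handler checks for
it), initialise the range with start = 0, len = ULLONG_MAX and minlen = 0 to
cover the whole filesystem, and read the len field afterwards to see how many
bytes were discarded.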

2010-11-18 13:06:33

by Matthew Wilcox

Subject: Re: [PATCH 1/2] fs: Do not dispatch FITRIM through separate super_operation

2010-11-18 13:48:51

by Josef Bacik

Subject: Re: [PATCH 1/2] fs: Do not dispatch FITRIM through separate super_operation

On Thu, Nov 18, 2010 at 06:06:30AM -0700, Matthew Wilcox wrote:
> On Thu, Nov 18, 2010 at 08:36:48AM +0100, Lukas Czerner wrote:
> > There was concern that FITRIM ioctl is not common enough to be included
> > in core vfs ioctl, as Christoph Hellwig pointed out there's no real point
> > in dispatching this out to a separate vector instead of just through
> > ->ioctl.
>
> Um, are you and Josef working independently of each other? You don't
> seem to be cc'ing each other on your patches, and you're basically doing
> the same thing.
>

I guess they are the same thing in that we're both dealing with free'ing up
space, but that's about where the similarities end. Lukas' work is in TRIM'ing
already free'd space, mine is in creating free'd space. Plus I don't know
anything nor wish to ever know anything about TRIM ;). Thanks,

Josef

2010-11-18 14:20:01

by Matthew Wilcox

Subject: Re: [PATCH 1/2] fs: Do not dispatch FITRIM through separate super_operation

On Thu, Nov 18, 2010 at 08:48:04AM -0500, Josef Bacik wrote:
> On Thu, Nov 18, 2010 at 06:06:30AM -0700, Matthew Wilcox wrote:
> > On Thu, Nov 18, 2010 at 08:36:48AM +0100, Lukas Czerner wrote:
> > > There was concern that FITRIM ioctl is not common enough to be included
> > > in core vfs ioctl, as Christoph Hellwig pointed out there's no real point
> > > in dispatching this out to a separate vector instead of just through
> > > ->ioctl.
> >
> > Um, are you and Josef working independently of each other? You don't
> > seem to be cc'ing each other on your patches, and you're basically doing
> > the same thing.
> >
>
> I guess they are the same thing in that we're both dealing with free'ing up
> space, but that's about where the similarities end. Lukas' work is in TRIM'ing
> already free'd space, mine is in creating free'd space. Plus I don't know
> anything nor wish to ever know anything about TRIM ;). Thanks,

I guess I was assuming that, on receiving a FALLOC_FL_PUNCH_HOLE, a
filesystem that was TRIM-aware would pass that information down to the
block device that it's mounted on. I strongly feel that we shouldn't
have two interfaces to do essentially the same thing.

I guess I'm saying that you're going to have to learn about TRIM :-)

--
Matthew Wilcox Intel Open Source Technology Centre
"Bill, look, we understand that you're interested in selling us this
operating system, but compare it to ours. We can't possibly take such
a retrograde step."

2010-11-18 14:29:22

by Christoph Hellwig

Subject: Re: [PATCH 1/2] fs: Do not dispatch FITRIM through separate super_operation

On Thu, Nov 18, 2010 at 07:19:58AM -0700, Matthew Wilcox wrote:
> I guess I was assuming that, on receiving a FALLOC_FL_PUNCH_HOLE, a
> filesystem that was TRIM-aware would pass that information down to the
> block device that it's mounted on. I strongly feel that we shouldn't
> have two interfaces to do essentially the same thing.
>
> I guess I'm saying that you're going to have to learn about TRIM :-)

Did you actually look at Lukas' FITRIM code (not the slight reordering here,
but the original one)? It's the ext4 version of the batched discard
model, that is a userspace ioctl to discard free space in the
filesystem.

hole punching will free the blocks into the free space pool. If you do
online discard it will also get discarded, but a filesystem that has
online discard enabled doesn't need FITRIM.

2010-11-18 14:31:45

by Josef Bacik

Subject: Re: [PATCH 1/2] fs: Do not dispatch FITRIM through separate super_operation

On Thu, Nov 18, 2010 at 07:19:58AM -0700, Matthew Wilcox wrote:
> On Thu, Nov 18, 2010 at 08:48:04AM -0500, Josef Bacik wrote:
> > On Thu, Nov 18, 2010 at 06:06:30AM -0700, Matthew Wilcox wrote:
> > > On Thu, Nov 18, 2010 at 08:36:48AM +0100, Lukas Czerner wrote:
> > > > There was concern that FITRIM ioctl is not common enough to be included
> > > > in core vfs ioctl, as Christoph Hellwig pointed out there's no real point
> > > > in dispatching this out to a separate vector instead of just through
> > > > ->ioctl.
> > >
> > > Um, are you and Josef working independently of each other? You don't
> > > seem to be cc'ing each other on your patches, and you're basically doing
> > > the same thing.
> > >
> >
> > I guess they are the same thing in that we're both dealing with free'ing up
> > space, but that's about where the similarities end. Lukas' work is in TRIM'ing
> > already free'd space, mine is in creating free'd space. Plus I don't know
> > anything nor wish to ever know anything about TRIM ;). Thanks,
>
> I guess I was assuming that, on receiving a FALLOC_FL_PUNCH_HOLE, a
> filesystem that was TRIM-aware would pass that information down to the
> block device that it's mounted on. I strongly feel that we shouldn't
> have two interfaces to do essentially the same thing.

But they aren't doing the same thing, his is discarding already free'd space,
I'm enabling people to de-allocate space in the middle of files, they are two
separate things. Of course if the filesystem is TRIM aware the de-allocation
would lead to a TRIM, but not if the filesystem isn't mounted with -o discard.
Hole punching is useful independently of the ability to do TRIM. Thanks,

Josef
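
To make the distinction concrete: hole punching acts on a byte range inside
one file, whereas FITRIM acts on the filesystem's free space. Below is a
minimal sketch of the file-side call. It assumes the fallocate(2) interface
with FALLOC_FL_PUNCH_HOLE as it exists in later mainline kernels and glibc
(at the time of this thread the flag was still a proposal), and punch_hole()
is a hypothetical wrapper name.

```c
#define _GNU_SOURCE          /* needed for the fallocate() declaration */
#include <assert.h>
#include <errno.h>
#include <fcntl.h>
#include <linux/falloc.h>    /* FALLOC_FL_PUNCH_HOLE, FALLOC_FL_KEEP_SIZE */
#include <sys/types.h>

/*
 * Deallocate the byte range [offset, offset + len) of the file behind fd.
 * The file size is left unchanged; reads from the hole return zeroes.
 * Returns 0 on success or a negative errno value, for example -EOPNOTSUPP
 * when the filesystem cannot punch holes.
 */
int punch_hole(int fd, off_t offset, off_t len)
{
    assert(len > 0);
    /* PUNCH_HOLE has to be combined with KEEP_SIZE. */
    if (fallocate(fd, FALLOC_FL_PUNCH_HOLE | FALLOC_FL_KEEP_SIZE,
                  offset, len) < 0)
        return -errno;
    return 0;
}
```

Whether the freed blocks are also discarded on the device is a separate
question, as discussed in the thread: that happens either through online
discard (-o discard) or through a later batched pass such as FITRIM.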

2010-11-18 14:38:34

by Tao Ma

Subject: Re: [PATCH 1/2] fs: Do not dispatch FITRIM through separate super_operation

On 11/18/2010 10:31 PM, Josef Bacik wrote:
> On Thu, Nov 18, 2010 at 07:19:58AM -0700, Matthew Wilcox wrote:
>> On Thu, Nov 18, 2010 at 08:48:04AM -0500, Josef Bacik wrote:
>>> On Thu, Nov 18, 2010 at 06:06:30AM -0700, Matthew Wilcox wrote:
>>>> On Thu, Nov 18, 2010 at 08:36:48AM +0100, Lukas Czerner wrote:
>>>>> There was concern that FITRIM ioctl is not common enough to be included
>>>>> in core vfs ioctl, as Christoph Hellwig pointed out there's no real point
>>>>> in dispatching this out to a separate vector instead of just through
>>>>> ->ioctl.
>>>>
>>>> Um, are you and Josef working independently of each other? You don't
>>>> seem to be cc'ing each other on your patches, and you're basically doing
>>>> the same thing.
>>>>
>>>
>>> I guess they are the same thing in that we're both dealing with free'ing up
>>> space, but that's about where the similarities end. Lukas' work is in TRIM'ing
>>> already free'd space, mine is in creating free'd space. Plus I don't know
>>> anything nor wish to ever know anything about TRIM ;). Thanks,
>>
>> I guess I was assuming that, on receiving a FALLOC_FL_PUNCH_HOLE, a
>> filesystem that was TRIM-aware would pass that information down to the
>> block device that it's mounted on. I strongly feel that we shouldn't
>> have two interfaces to do essentially the same thing.
>
> But they aren't doing the same thing, his is discarding already free'd space,
> I'm enabling people to de-allocate space in the middle of files, they are two
> separate things. Of course if the filesystem is TRIM aware the de-allocation
> would lead to a TRIM, but not if the filesystem isn't mounted with -o discard.
> Hole punching is useful independently of the ability to do TRIM. Thanks,
yeah, actually, ocfs2 has implemented punching holes while we don't have
TRIM support enabled yet.

Regards,
Tao

2010-11-18 17:19:18

by James Bottomley

Subject: Re: [PATCH 1/2] fs: Do not dispatch FITRIM through separate super_operation

On Thu, 2010-11-18 at 09:29 -0500, Christoph Hellwig wrote:
> On Thu, Nov 18, 2010 at 07:19:58AM -0700, Matthew Wilcox wrote:
> > I guess I was assuming that, on receiving a FALLOC_FL_PUNCH_HOLE, a
> > filesystem that was TRIM-aware would pass that information down to the
> > block device that it's mounted on. I strongly feel that we shouldn't
> > have two interfaces to do essentially the same thing.
> >
> > I guess I'm saying that you're going to have to learn about TRIM :-)
>
> Did you actually look at Lukas' FITRIM code (not the slight reordering here,
> but the original one)? It's the ext4 version of the batched discard
> model, that is a userspace ioctl to discard free space in the
> filesystem.
>
> hole punching will free the blocks into the free space pool. If you do
> online discard it will also get discarded, but a filesystem that has
> online discard enabled doesn't need FITRIM.

Not stepping into the debate: I'm happy to see punch go to the mapping
data and FITRIM pick it up later.

However, I think it's time to question whether we actually still want to
allow online discard at all. Most of the benchmarks show it to be a net
lose to almost everything (either SSD or Thinly Provisioned arrays), so
it's become an "enable this to degrade performance" option with no
upside.

James

2010-11-18 17:27:20

by Jeff Moyer

Subject: Re: [PATCH 1/2] fs: Do not dispatch FITRIM through separate super_operation

James Bottomley <[email protected]> writes:

> Not stepping into the debate: I'm happy to see punch go to the mapping
> data and FITRIM pick it up later.
>
> However, I think it's time to question whether we actually still want to
> allow online discard at all. Most of the benchmarks show it to be a net

Define online discard, please.

> lose to almost everything (either SSD or Thinly Provisioned arrays), so
> it's become an "enable this to degrade performance" option with no
> upside.

Some SSDs very much require TRIMming to perform well as they age. If
you're suggesting that we move from doing discards in journal commits to
a batched discard, like the one Lukas implemented, then I think that's
fine. If we need to reintroduce the finer-grained discards due to some
hardware changes in the future, we can always do that.

Cheers,
Jeff

2010-11-18 17:35:57

by Lukas Czerner

Subject: Re: [PATCH 1/2] fs: Do not dispatch FITRIM through separate super_operation

On Thu, 18 Nov 2010, James Bottomley wrote:

> On Thu, 2010-11-18 at 09:29 -0500, Christoph Hellwig wrote:
> > On Thu, Nov 18, 2010 at 07:19:58AM -0700, Matthew Wilcox wrote:
> > > I guess I was assuming that, on receiving a FALLOC_FL_PUNCH_HOLE, a
> > > filesystem that was TRIM-aware would pass that information down to the
> > > block device that it's mounted on. I strongly feel that we shouldn't
> > > have two interfaces to do essentially the same thing.
> > >
> > > I guess I'm saying that you're going to have to learn about TRIM :-)
> >
> > Did you actually look at Lukas' FITRIM code (not the slight reordering here,
> > but the original one)? It's the ext4 version of the batched discard
> > model, that is a userspace ioctl to discard free space in the
> > filesystem.
> >
> > hole punching will free the blocks into the free space pool. If you do
> > online discard it will also get discarded, but a filesystem that has
> > online discard enabled doesn't need FITRIM.
>
> Not stepping into the debate: I'm happy to see punch go to the mapping
> data and FITRIM pick it up later.
>
> However, I think it's time to question whether we actually still want to
> allow online discard at all. Most of the benchmarks show it to be a net
> lose to almost everything (either SSD or Thinly Provisioned arrays), so
> it's become an "enable this to degrade performance" option with no
> upside.
>
> James
>

This time began a long time ago :) That is why I originally created
batched discard for ext4 (ext3), accessible through the FITRIM ioctl. Ext4
performance with the -o discard mount option goes down on most of the
SSDs and every thinly provisioned storage I have had a chance to benchmark.

But, for example, SSDs are getting better and as time goes by we might
see devices that do not suffer terrible performance loss with discard
enabled (discard on unlink in ext4 etc...), so this "online" discard
probably still does make sense.

-Lukas

2010-11-18 17:41:46

by James Bottomley

Subject: Re: [PATCH 1/2] fs: Do not dispatch FITRIM through separate super_operation

On Thu, 2010-11-18 at 12:22 -0500, Jeff Moyer wrote:
> James Bottomley <[email protected]> writes:
>
> > Not stepping into the debate: I'm happy to see punch go to the mapping
> > data and FITRIM pick it up later.
> >
> > However, I think it's time to question whether we actually still want to
> > allow online discard at all. Most of the benchmarks show it to be a net
>
> Define online discard, please.

Trims emitted inline as the FS operates (mount option -o discard)

> > lose to almost everything (either SSD or Thinly Provisioned arrays), so
> > it's become an "enable this to degrade performance" option with no
> > upside.
>
> Some SSDs very much require TRIMming to perform well as they age. If
> you're suggesting that we move from doing discards in journal commits to
> a batched discard, like the one Lukas implemented, then I think that's
> fine. If we need to reintroduce the finer-grained discards due to some
> hardware changes in the future, we can always do that.

Right, I'm suggesting we just rely on offline methods. Regardless of
what happens to FITRIM, we have wiper.sh now (although it does require
unmounted use for most of the less than modern fs).

James

2010-11-18 17:56:40

by Chris Mason

Subject: Re: [PATCH 1/2] fs: Do not dispatch FITRIM through separate super_operation

Excerpts from James Bottomley's message of 2010-11-18 12:19:10 -0500:
> On Thu, 2010-11-18 at 09:29 -0500, Christoph Hellwig wrote:
> > On Thu, Nov 18, 2010 at 07:19:58AM -0700, Matthew Wilcox wrote:
> > > I guess I was assuming that, on receiving a FALLOC_FL_PUNCH_HOLE, a
> > > filesystem that was TRIM-aware would pass that information down to the
> > > block device that it's mounted on. I strongly feel that we shouldn't
> > > have two interfaces to do essentially the same thing.
> > >
> > > I guess I'm saying that you're going to have to learn about TRIM :-)
> >
> > Did you actually look at Lukas' FITRIM code (not the slight reordering here,
> > but the original one)? It's the ext4 version of the batched discard
> > model, that is a userspace ioctl to discard free space in the
> > filesystem.
> >
> > hole punching will free the blocks into the free space pool. If you do
> > online discard it will also get discarded, but a filesystem that has
> > online discard enabled doesn't need FITRIM.
>
> Not stepping into the debate: I'm happy to see punch go to the mapping
> data and FITRIM pick it up later.
>
> However, I think it's time to question whether we actually still want to
> allow online discard at all. Most of the benchmarks show it to be a net
> lose to almost everything (either SSD or Thinly Provisioned arrays), so
> it's become an "enable this to degrade performance" option with no
> upside.

I think we want to keep it. In general we've (except for hch) spent
almost zero time actually tuning online discard, and the benchmarking
needs to be redone with the shiny new barrier code.

-chris

2010-11-18 18:50:31

by Jamie Lokier

Subject: Re: [PATCH 1/2] fs: Do not dispatch FITRIM through separate super_operation

Jeff Moyer wrote:
> James Bottomley <[email protected]> writes:
>
> > Not stepping into the debate: I'm happy to see punch go to the mapping
> > data and FITRIM pick it up later.
> >
> > However, I think it's time to question whether we actually still want to
> > allow online discard at all. Most of the benchmarks show it to be a net
>
> Define online discard, please.
>
> > lose to almost everything (either SSD or Thinly Provisioned arrays), so
> > it's become an "enable this to degrade performance" option with no
> > upside.
>
> Some SSDs very much require TRIMming to perform well as they age. If
> you're suggesting that we move from doing discards in journal commits to
> a batched discard, like the one Lukas implemented, then I think that's
> fine. If we need to reintroduce the finer-grained discards due to some
> hardware changes in the future, we can always do that.

"Growable" virtual disks benefit from it too, if it frees up a lot of space.

Windows has some ability to trim unused space in NTFS on virtual disks
for this reason; I'm not sure if it's an online or offline procedure.

Online trim may be slow, but offline would be awfully inconvenient
when an fs is big and needed for a live system, or when it's your root fs.

-- Jamie

2010-11-18 19:33:00

by Markus Trippelsdorf

Subject: Re: [PATCH 1/2] fs: Do not dispatch FITRIM through separate super_operation

On 2010.11.18 at 18:05 +0000, Jamie Lokier wrote:
> Jeff Moyer wrote:
> > James Bottomley <[email protected]> writes:
> >
> > > Not stepping into the debate: I'm happy to see punch go to the mapping
> > > data and FITRIM pick it up later.
> > >
> > > However, I think it's time to question whether we actually still want to
> > > allow online discard at all. Most of the benchmarks show it to be a net
> >
> > Define online discard, please.
> >
> > > lose to almost everything (either SSD or Thinly Provisioned arrays), so
> > > it's become an "enable this to degrade performance" option with no
> > > upside.
> >
> > Some SSDs very much require TRIMming to perform well as they age. If
> > you're suggesting that we move from doing discards in journal commits to
> > a batched discard, like the one Lukas implemented, then I think that's
> > fine. If we need to reintroduce the finer-grained discards due to some
> > hardware changes in the future, we can always do that.
>
> "Growable" virtual disks benefit from it too, if it frees up a lot of space.
>
> Windows has some ability to trim unused space in NTFS on virtual disks
> for this reason; I'm not sure if it's an online or offline procedure.
>
> Online trim may be slow, but offline would be awfully inconvenient
> when an fs is big and needed for a live system, or when it's your root fs.

You can call FITRIM from a running system. In fact I run it once per week
as a cron job on my (mounted) root fs.

--
Markus

2010-11-18 20:11:33

by Greg Freemyer

Subject: Re: [PATCH 1/2] fs: Do not dispatch FITRIM through separate super_operation

Adding Mark Lord in CC:

On Thu, Nov 18, 2010 at 9:41 AM, James Bottomley
<[email protected]> wrote:
> On Thu, 2010-11-18 at 12:22 -0500, Jeff Moyer wrote:
>> James Bottomley <[email protected]> writes:
>>
>> > Not stepping into the debate: I'm happy to see punch go to the mapping
>> > data and FITRIM pick it up later.
>> >
>> > However, I think it's time to question whether we actually still want to
>> > allow online discard at all. Most of the benchmarks show it to be a net
>>
>> Define online discard, please.
>
> Trims emitted inline as the FS operates (mount option -o discard)
>
>> > lose to almost everything (either SSD or Thinly Provisioned arrays), so
>> > it's become an "enable this to degrade performance" option with no
>> > upside.
>>
>> Some SSDs very much require TRIMming to perform well as they age. If
>> you're suggesting that we move from doing discards in journal commits to
>> a batched discard, like the one Lukas implemented, then I think that's
>> fine. If we need to reintroduce the finer-grained discards due to some
>> hardware changes in the future, we can always do that.
>
> Right, I'm suggesting we just rely on offline methods. Regardless of
> what happens to FITRIM, we have wiper.sh now (although it does require
> unmounted use for most of the less than modern fs).
>
> James

I'm a fan of wiper.sh, but afaik it still cannot address a
multi-spindle LVM setup, nor an MDraid setup, etc.

That's because it bypasses the block stack and talks directly to the
devices. Thus it doesn't get the benefit of all the logical to
physical sector remapping handled via the block stack.

Mark, please correct me if I'm wrong.

The LVM and MDraid setups are important to support, and only "mount -o
discard" and Lukas's FITRIM support them.

So afaik we have 3 options, each with an opportunity for improvement:

1) mount -o discard - needs kernel tuning / new hardware to be a
performance win.

2) FITRIM doesn't leverage the fact that a TRIM command can handle
multiple ranges per TRIM command payload. I haven't seen any FITRIM
vs. wiper.sh benchmarks, so I don't know what impact that has in
practice. Mark Lord thought that this missing feature would cause
FITRIM to take minutes or hours with some hardware, especially early
generation SSDs.

3) wiper.sh does leverage the multiple ranges per TRIM command, but it
really needs a new block layer interface that would allow it to push
discard commands into the kernel via the block layer, not just down at
the physical drive layer. The interface should accept multiple ranges
per invocation and trigger TRIM commands to the SSD that have a
multi-range discard payload.

So it seems that for now keeping all 3 is best. My personal hope is
that the block layer grows the ability to accept multirange discard
requests and FITRIM is updated to leverage it.

Greg
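
Points 2 and 3 above turn on the fact that a single TRIM command can carry a
payload of several ranges. As a rough illustration only, assuming the ATA
DATA SET MANAGEMENT layout of 8-byte little-endian entries (48-bit starting
LBA, 16-bit sector count, 64 entries per 512-byte block), packing ranges into
one payload could look like the sketch below. struct lba_range and
trim_payload_pack() are hypothetical names, and neither FITRIM nor wiper.sh
is claimed to be implemented this way.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define TRIM_BLOCK_BYTES  512u
#define TRIM_ENTRY_BYTES  8u
#define TRIM_MAX_ENTRIES  (TRIM_BLOCK_BYTES / TRIM_ENTRY_BYTES)  /* 64 */

/* Hypothetical input type: one free extent, in device sectors. */
struct lba_range {
    uint64_t lba;        /* first sector, must fit in 48 bits */
    uint16_t nsectors;   /* 1..65535 sectors */
};

/*
 * Pack up to 64 ranges into one 512-byte payload block.
 * Each entry is stored little-endian: bits 47:0 hold the starting LBA,
 * bits 63:48 hold the sector count. Unused entries are left as zero.
 * Returns how many ranges were packed; the caller sends the rest in a
 * further payload.
 */
size_t trim_payload_pack(uint8_t payload[TRIM_BLOCK_BYTES],
                         const struct lba_range *ranges, size_t nranges)
{
    size_t n = nranges < TRIM_MAX_ENTRIES ? nranges : TRIM_MAX_ENTRIES;

    assert(payload != NULL);
    memset(payload, 0, TRIM_BLOCK_BYTES);

    for (size_t i = 0; i < n; i++) {
        assert(ranges[i].lba < (UINT64_C(1) << 48));
        assert(ranges[i].nsectors >= 1);

        uint64_t entry = ranges[i].lba |
                         ((uint64_t)ranges[i].nsectors << 48);

        /* Emit the 64-bit entry byte by byte, least significant first. */
        for (unsigned b = 0; b < TRIM_ENTRY_BYTES; b++)
            payload[i * TRIM_ENTRY_BYTES + b] = (uint8_t)(entry >> (8 * b));
    }
    return n;
}
```

With one range per command, trimming N free extents costs N commands; with
packing as sketched here it costs roughly N/64, which is the difference the
FITRIM versus wiper.sh comparison above is about.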

2010-11-18 21:37:40
