2022-11-02 09:05:35

by Christoph Hellwig

[permalink] [raw]
Subject: Re: [RESEND PATCH 0/4] Implement File-Based optimization functionality

On Wed, Nov 02, 2022 at 01:30:54PM +0800, Jiaming Li wrote:
> 1) The host let the device know of lba range(s) of interest. Those
> ranges are typically associated with a specific file. One can
> obtain it from the iNode of the file and some offset calculations.

This is completely and utter madness. Files are a logic concept, that
is non-unique (reflinks, snapshot) and can change at any time
(defragmentation, GC, dedup). Whoever came up with this scheme is on
crack and the it has no business being in the Linux kernel

NAK.


2022-11-03 06:24:39

by Juhyung Park

[permalink] [raw]
Subject: Re: [RESEND PATCH 0/4] Implement File-Based optimization functionality

On 11/2/22 17:47, Christoph Hellwig wrote:
> On Wed, Nov 02, 2022 at 01:30:54PM +0800, Jiaming Li wrote:
>> 1) The host let the device know of lba range(s) of interest. Those
>> ranges are typically associated with a specific file. One can
>> obtain it from the iNode of the file and some offset calculations.
>
> This is completely and utter madness. Files are a logic concept, that
> is non-unique (reflinks, snapshot) and can change at any time
> (defragmentation, GC, dedup). Whoever came up with this scheme is on
> crack and the it has no business being in the Linux kernel
>
> NAK.
>
>

Is the idea really an utter madness? Majority of regular files that may
be of interest from the perspective of UFS aren't reflinked or
snapshotted (let alone the lack of support from ext4 or f2fs).

Device-side fragmentation is a real issue [1] and it makes more than
enough sense to defrag LBAs of interests to improve performance. This
was long overdue, unless the block interface itself changes somehow.

The question is how to implement it correctly without creating a mess
with mismatched/outdated LBAs as you've mentioned, preferably through
file-system's integration: If the LBAs in questions are indeed
reflinked, how do we handle it?, If the LBAs are moved/invalidated from
defrag or GC, how do we make sure that UFS is up-to-date?, etc.

>
> From: lijiaming3 <[email protected]>
>
> add fbo analysis and defrag function
>
> We can send LBA info to the device as a comma separated string. Each
> adjacent pair represents a range:<open-lba>,<close-lba>.
> e.g. The LBA range of the file is 0x1234,0x3456;0x4567,0x5678
> echo 0x1234,0x3456,0x4567,0x5678 > fbo_send_lba
>

Like, ew. Why would we ever want *the userspace* to be able to
manipulate this directly?

[1]
https://www.usenix.org/conference/atc17/technical-sessions/presentation/hahn
- Section 3.3: "For example, even if a file was not fragmented at all in
the logical space (DoFL=1), if the file had a DoFP value of 0.5, the I/O
throughput became only 48% of that with DoFP=0."

2022-11-04 13:39:22

by Matias Bjørling

[permalink] [raw]
Subject: Re: [RESEND PATCH 0/4] Implement File-Based optimization functionality

On 03/11/2022 07.11, Juhyung Park wrote:
...
>
> Is the idea really an utter madness? Majority of regular files that may
> be of interest from the perspective of UFS aren't reflinked or
> snapshotted (let alone the lack of support from ext4 or f2fs).
>
> Device-side fragmentation is a real issue [1] and it makes more than
> enough sense to defrag LBAs of interests to improve performance. This
> was long overdue, unless the block interface itself changes somehow.

There are ongoing work with UFS to extend the block interface with
zones. This approach eliminates the mismatch between the device-side
mapping and host-side mapping and lets the host and device collaborate
on the data placement.

>
> The question is how to implement it correctly without creating a mess
> with mismatched/outdated LBAs as you've mentioned, preferably through
> file-system's integration: If the LBAs in questions are indeed
> reflinked, how do we handle it?, If the LBAs are moved/invalidated from
> defrag or GC, how do we make sure that UFS is up-to-date?, etc.

If using zoned UFS, the file-system can use zones for LBA tracking,
eliminating the mismatched/outdated LBA issue. f2fs already supports
this approach (works today with SMR HDDs and ZNS SSDs). It'll extend to
UFS when zone support is added/implemented.


2022-11-05 05:39:03

by Juhyung Park

[permalink] [raw]
Subject: Re: [RESEND PATCH 0/4] Implement File-Based optimization functionality

On Fri, Nov 4, 2022 at 9:37 PM Matias Bjørling <[email protected]> wrote:
>
> On 03/11/2022 07.11, Juhyung Park wrote:
> ...
> >
> > Is the idea really an utter madness? Majority of regular files that may
> > be of interest from the perspective of UFS aren't reflinked or
> > snapshotted (let alone the lack of support from ext4 or f2fs).
> >
> > Device-side fragmentation is a real issue [1] and it makes more than
> > enough sense to defrag LBAs of interests to improve performance. This
> > was long overdue, unless the block interface itself changes somehow.
>
> There are ongoing work with UFS to extend the block interface with
> zones. This approach eliminates the mismatch between the device-side
> mapping and host-side mapping and lets the host and device collaborate
> on the data placement.
>
> >
> > The question is how to implement it correctly without creating a mess
> > with mismatched/outdated LBAs as you've mentioned, preferably through
> > file-system's integration: If the LBAs in questions are indeed
> > reflinked, how do we handle it?, If the LBAs are moved/invalidated from
> > defrag or GC, how do we make sure that UFS is up-to-date?, etc.
>
> If using zoned UFS, the file-system can use zones for LBA tracking,
> eliminating the mismatched/outdated LBA issue. f2fs already supports
> this approach (works today with SMR HDDs and ZNS SSDs). It'll extend to
> UFS when zone support is added/implemented.
>

More reasons to have this functionality integrated with the
file-system instead of allowing users to specify random LBA ranges.

2022-11-09 13:17:26

by Christoph Hellwig

[permalink] [raw]
Subject: Re: [RESEND PATCH 0/4] Implement File-Based optimization functionality

On Thu, Nov 03, 2022 at 03:11:16PM +0900, Juhyung Park wrote:
> Is the idea really an utter madness?

Yes.

> Majority of regular files that may be
> of interest from the perspective of UFS aren't reflinked or snapshotted (let
> alone the lack of support from ext4 or f2fs).

Linux does not require you in any way to use obsolete file systems
desings only on any given block device.

> Device-side fragmentation is a real issue [1] and it makes more than enough
> sense to defrag LBAs of interests to improve performance. This was long
> overdue, unless the block interface itself changes somehow.

Or maybe random writes to flash aren't a good idea if you FTL sucks?
Full blown FTLs tend to not do any extent based mappings, so
fragmentation does not matter. The price paid for that is much larger
FTL tables. If you stop pretending flash is random writable through
saner interfaces like ZNS you automatically solve this fragmentation
problem as well.

> The question is how to implement it correctly without creating a mess with
> mismatched/outdated LBAs as you've mentioned, preferably through
> file-system's integration: If the LBAs in questions are indeed reflinked,
> how do we handle it?, If the LBAs are moved/invalidated from defrag or GC,
> how do we make sure that UFS is up-to-date?, etc.

The fix is to plug the leaking abtractions in UFS. If it wants to look
like a random writable block device it better perform when doing that.
And if it doesn't want to pay the prize for that it'd better expose
an abstraction that actually fits the underlying media. It's not like
some of us haven't worked on that for the last decade.