2024-06-10 08:40:47

by Johannes Thumshirn

[permalink] [raw]
Subject: [PATCH 0/3] btrfs: rst: updates for RAID stripe tree

Three independent updates for RAID stripe tree.
The 1st one removes pointless space from the on-disk format. As the
feature itself is still experimental I'd like to get rid of that as early
as possible.

Patch 2 replaces stripe extents in case we hit a EEXIST when inserting a
stripe extent on a write. This can happen i.e. on device-replace.

Patch 3 splits a stripe extent on partial delete of a stripe.

---
Johannes Thumshirn (2):
btrfs: rst: remove encoding field from stripe_extent
btrfs: replace stripe extents

JohnnesThumshirn (1):
btrfs: split RAID stripes on deletion

fs/btrfs/accessors.h | 3 -
fs/btrfs/ctree.c | 1 +
fs/btrfs/print-tree.c | 5 --
fs/btrfs/raid-stripe-tree.c | 148 ++++++++++++++++++++++++++++++----------
fs/btrfs/raid-stripe-tree.h | 3 +-
fs/btrfs/tree-checker.c | 19 ------
include/uapi/linux/btrfs_tree.h | 14 +---
7 files changed, 114 insertions(+), 79 deletions(-)
---
base-commit: e361635b966fca48f92263277ff38cd5a1971d39
change-id: 20240610-b4-rst-updates-d0aa696b9d5a

Best regards,
--
Johannes Thumshirn <[email protected]>



2024-06-10 08:41:06

by Johannes Thumshirn

[permalink] [raw]
Subject: [PATCH 1/3] btrfs: rst: remove encoding field from stripe_extent

From: Johannes Thumshirn <[email protected]>

Remove the encoding field from 'struct btrfs_stripe_extent'. It was
originally intended to encode the RAID type as well as if we're a data
or a parity stripe.

But the RAID type can be inferred form the block-group and the data vs.
parity differentiation can be done easier with adding a new key type
for parity stripes in the RAID stripe tree.

Signed-off-by: Johannes Thumshirn <[email protected]>
---
fs/btrfs/accessors.h | 3 ---
fs/btrfs/print-tree.c | 5 -----
fs/btrfs/raid-stripe-tree.c | 13 -------------
fs/btrfs/raid-stripe-tree.h | 3 +--
fs/btrfs/tree-checker.c | 19 -------------------
include/uapi/linux/btrfs_tree.h | 14 +-------------
6 files changed, 2 insertions(+), 55 deletions(-)

diff --git a/fs/btrfs/accessors.h b/fs/btrfs/accessors.h
index 6c3deaa3e878..b2eb9cde2c5d 100644
--- a/fs/btrfs/accessors.h
+++ b/fs/btrfs/accessors.h
@@ -315,11 +315,8 @@ BTRFS_SETGET_FUNCS(timespec_nsec, struct btrfs_timespec, nsec, 32);
BTRFS_SETGET_STACK_FUNCS(stack_timespec_sec, struct btrfs_timespec, sec, 64);
BTRFS_SETGET_STACK_FUNCS(stack_timespec_nsec, struct btrfs_timespec, nsec, 32);

-BTRFS_SETGET_FUNCS(stripe_extent_encoding, struct btrfs_stripe_extent, encoding, 8);
BTRFS_SETGET_FUNCS(raid_stride_devid, struct btrfs_raid_stride, devid, 64);
BTRFS_SETGET_FUNCS(raid_stride_physical, struct btrfs_raid_stride, physical, 64);
-BTRFS_SETGET_STACK_FUNCS(stack_stripe_extent_encoding,
- struct btrfs_stripe_extent, encoding, 8);
BTRFS_SETGET_STACK_FUNCS(stack_raid_stride_devid, struct btrfs_raid_stride, devid, 64);
BTRFS_SETGET_STACK_FUNCS(stack_raid_stride_physical, struct btrfs_raid_stride, physical, 64);

diff --git a/fs/btrfs/print-tree.c b/fs/btrfs/print-tree.c
index 7e46aa8a0444..9f1e5e11bf71 100644
--- a/fs/btrfs/print-tree.c
+++ b/fs/btrfs/print-tree.c
@@ -208,11 +208,6 @@ static void print_raid_stripe_key(const struct extent_buffer *eb, u32 item_size,
struct btrfs_stripe_extent *stripe)
{
const int num_stripes = btrfs_num_raid_stripes(item_size);
- const u8 encoding = btrfs_stripe_extent_encoding(eb, stripe);
-
- pr_info("\t\t\tencoding: %s\n",
- (encoding && encoding < BTRFS_NR_RAID_TYPES) ?
- btrfs_raid_array[encoding].raid_name : "unknown");

for (int i = 0; i < num_stripes; i++)
pr_info("\t\t\tstride %d devid %llu physical %llu\n",
diff --git a/fs/btrfs/raid-stripe-tree.c b/fs/btrfs/raid-stripe-tree.c
index 6af6b4b9a32e..e6f7a234b8f6 100644
--- a/fs/btrfs/raid-stripe-tree.c
+++ b/fs/btrfs/raid-stripe-tree.c
@@ -80,7 +80,6 @@ static int btrfs_insert_one_raid_extent(struct btrfs_trans_handle *trans,
struct btrfs_key stripe_key;
struct btrfs_root *stripe_root = fs_info->stripe_root;
const int num_stripes = btrfs_bg_type_to_factor(bioc->map_type);
- u8 encoding = btrfs_bg_flags_to_raid_index(bioc->map_type);
struct btrfs_stripe_extent *stripe_extent;
const size_t item_size = struct_size(stripe_extent, strides, num_stripes);
int ret;
@@ -94,7 +93,6 @@ static int btrfs_insert_one_raid_extent(struct btrfs_trans_handle *trans,

trace_btrfs_insert_one_raid_extent(fs_info, bioc->logical, bioc->size,
num_stripes);
- btrfs_set_stack_stripe_extent_encoding(stripe_extent, encoding);
for (int i = 0; i < num_stripes; i++) {
u64 devid = bioc->stripes[i].dev->devid;
u64 physical = bioc->stripes[i].physical;
@@ -159,7 +157,6 @@ int btrfs_get_raid_extent_offset(struct btrfs_fs_info *fs_info,
struct extent_buffer *leaf;
const u64 end = logical + *length;
int num_stripes;
- u8 encoding;
u64 offset;
u64 found_logical;
u64 found_length;
@@ -222,16 +219,6 @@ int btrfs_get_raid_extent_offset(struct btrfs_fs_info *fs_info,

num_stripes = btrfs_num_raid_stripes(btrfs_item_size(leaf, slot));
stripe_extent = btrfs_item_ptr(leaf, slot, struct btrfs_stripe_extent);
- encoding = btrfs_stripe_extent_encoding(leaf, stripe_extent);
-
- if (encoding != btrfs_bg_flags_to_raid_index(map_type)) {
- ret = -EUCLEAN;
- btrfs_handle_fs_error(fs_info, ret,
- "on-disk stripe encoding %d doesn't match RAID index %d",
- encoding,
- btrfs_bg_flags_to_raid_index(map_type));
- goto out;
- }

for (int i = 0; i < num_stripes; i++) {
struct btrfs_raid_stride *stride = &stripe_extent->strides[i];
diff --git a/fs/btrfs/raid-stripe-tree.h b/fs/btrfs/raid-stripe-tree.h
index c9c258f84903..1ac1c21aac2f 100644
--- a/fs/btrfs/raid-stripe-tree.h
+++ b/fs/btrfs/raid-stripe-tree.h
@@ -48,8 +48,7 @@ static inline bool btrfs_need_stripe_tree_update(struct btrfs_fs_info *fs_info,

static inline int btrfs_num_raid_stripes(u32 item_size)
{
- return (item_size - offsetof(struct btrfs_stripe_extent, strides)) /
- sizeof(struct btrfs_raid_stride);
+ return item_size / sizeof(struct btrfs_raid_stride);
}

#endif
diff --git a/fs/btrfs/tree-checker.c b/fs/btrfs/tree-checker.c
index a2c3651a3d8f..1e140f6dabc6 100644
--- a/fs/btrfs/tree-checker.c
+++ b/fs/btrfs/tree-checker.c
@@ -1682,9 +1682,6 @@ static int check_inode_ref(struct extent_buffer *leaf,
static int check_raid_stripe_extent(const struct extent_buffer *leaf,
const struct btrfs_key *key, int slot)
{
- struct btrfs_stripe_extent *stripe_extent =
- btrfs_item_ptr(leaf, slot, struct btrfs_stripe_extent);
-
if (unlikely(!IS_ALIGNED(key->objectid, leaf->fs_info->sectorsize))) {
generic_err(leaf, slot,
"invalid key objectid for raid stripe extent, have %llu expect aligned to %u",
@@ -1698,22 +1695,6 @@ static int check_raid_stripe_extent(const struct extent_buffer *leaf,
return -EUCLEAN;
}

- switch (btrfs_stripe_extent_encoding(leaf, stripe_extent)) {
- case BTRFS_STRIPE_RAID0:
- case BTRFS_STRIPE_RAID1:
- case BTRFS_STRIPE_DUP:
- case BTRFS_STRIPE_RAID10:
- case BTRFS_STRIPE_RAID5:
- case BTRFS_STRIPE_RAID6:
- case BTRFS_STRIPE_RAID1C3:
- case BTRFS_STRIPE_RAID1C4:
- break;
- default:
- generic_err(leaf, slot, "invalid raid stripe encoding %u",
- btrfs_stripe_extent_encoding(leaf, stripe_extent));
- return -EUCLEAN;
- }
-
return 0;
}

diff --git a/include/uapi/linux/btrfs_tree.h b/include/uapi/linux/btrfs_tree.h
index d24e8e121507..cb103c76d398 100644
--- a/include/uapi/linux/btrfs_tree.h
+++ b/include/uapi/linux/btrfs_tree.h
@@ -747,21 +747,9 @@ struct btrfs_raid_stride {
__le64 physical;
} __attribute__ ((__packed__));

-/* The stripe_extent::encoding, 1:1 mapping of enum btrfs_raid_types. */
-#define BTRFS_STRIPE_RAID0 1
-#define BTRFS_STRIPE_RAID1 2
-#define BTRFS_STRIPE_DUP 3
-#define BTRFS_STRIPE_RAID10 4
-#define BTRFS_STRIPE_RAID5 5
-#define BTRFS_STRIPE_RAID6 6
-#define BTRFS_STRIPE_RAID1C3 7
-#define BTRFS_STRIPE_RAID1C4 8
-
struct btrfs_stripe_extent {
- __u8 encoding;
- __u8 reserved[7];
/* An array of raid strides this stripe is composed of. */
- struct btrfs_raid_stride strides[];
+ __DECLARE_FLEX_ARRAY(struct btrfs_raid_stride, strides);
} __attribute__ ((__packed__));

#define BTRFS_HEADER_FLAG_WRITTEN (1ULL << 0)

--
2.43.0


2024-06-10 08:41:12

by Johannes Thumshirn

[permalink] [raw]
Subject: [PATCH 2/3] btrfs: replace stripe extents

From: Johannes Thumshirn <[email protected]>

If we can't insert a stripe extent in the RAID stripe tree, because
the key that points to the specific position in the stripe tree is
already existing, we have to remove the item and then replace it by a
new item.

This can happen for example on device replace operations.

Signed-off-by: Johannes Thumshirn <[email protected]>
---
fs/btrfs/ctree.c | 1 +
fs/btrfs/raid-stripe-tree.c | 34 ++++++++++++++++++++++++++++++++++
2 files changed, 35 insertions(+)

diff --git a/fs/btrfs/ctree.c b/fs/btrfs/ctree.c
index 1a49b9232990..ad934c5469c4 100644
--- a/fs/btrfs/ctree.c
+++ b/fs/btrfs/ctree.c
@@ -3844,6 +3844,7 @@ static noinline int setup_leaf_for_split(struct btrfs_trans_handle *trans,
btrfs_item_key_to_cpu(leaf, &key, path->slots[0]);

BUG_ON(key.type != BTRFS_EXTENT_DATA_KEY &&
+ key.type != BTRFS_RAID_STRIPE_KEY &&
key.type != BTRFS_EXTENT_CSUM_KEY);

if (btrfs_leaf_free_space(leaf) >= ins_len)
diff --git a/fs/btrfs/raid-stripe-tree.c b/fs/btrfs/raid-stripe-tree.c
index e6f7a234b8f6..3020820dd6e2 100644
--- a/fs/btrfs/raid-stripe-tree.c
+++ b/fs/btrfs/raid-stripe-tree.c
@@ -73,6 +73,37 @@ int btrfs_delete_raid_extent(struct btrfs_trans_handle *trans, u64 start, u64 le
return ret;
}

+static int replace_raid_extent_item(struct btrfs_trans_handle *trans,
+ struct btrfs_key *key,
+ struct btrfs_stripe_extent *stripe_extent,
+ const size_t item_size)
+{
+ struct btrfs_fs_info *fs_info = trans->fs_info;
+ struct btrfs_root *stripe_root = fs_info->stripe_root;
+ struct btrfs_path *path;
+ int ret;
+
+ path = btrfs_alloc_path();
+ if (!path)
+ return -ENOMEM;
+
+ ret = btrfs_search_slot(trans, stripe_root, key, path, -1, 1);
+ if (ret)
+ goto err;
+
+ ret = btrfs_del_item(trans, stripe_root, path);
+ if (ret)
+ goto err;
+
+ btrfs_free_path(path);
+
+ return btrfs_insert_item(trans, stripe_root, key, stripe_extent,
+ item_size);
+ err:
+ btrfs_free_path(path);
+ return ret;
+}
+
static int btrfs_insert_one_raid_extent(struct btrfs_trans_handle *trans,
struct btrfs_io_context *bioc)
{
@@ -112,6 +143,9 @@ static int btrfs_insert_one_raid_extent(struct btrfs_trans_handle *trans,

ret = btrfs_insert_item(trans, stripe_root, &stripe_key, stripe_extent,
item_size);
+ if (ret == -EEXIST)
+ ret = replace_raid_extent_item(trans, &stripe_key,
+ stripe_extent, item_size);
if (ret)
btrfs_abort_transaction(trans, ret);


--
2.43.0


2024-06-10 08:45:40

by Johannes Thumshirn

[permalink] [raw]
Subject: [PATCH 3/3] btrfs: split RAID stripes on deletion

From: JohnnesThumshirn <[email protected]>

The current RAID stripe code assumes, that we will always remove a
whole stripe entry.

But ff we're only removing a part of a RAID stripe we're hitting the
ASSERT()ion checking for this condition.

Instead of assuming the complete deletion of a RAID stripe, split the
stripe if we need to.

Signed-off-by: Johnnes Thumshirn <[email protected]>
---
fs/btrfs/raid-stripe-tree.c | 101 +++++++++++++++++++++++++++++++++-----------
1 file changed, 77 insertions(+), 24 deletions(-)

diff --git a/fs/btrfs/raid-stripe-tree.c b/fs/btrfs/raid-stripe-tree.c
index 3020820dd6e2..41403217c3e6 100644
--- a/fs/btrfs/raid-stripe-tree.c
+++ b/fs/btrfs/raid-stripe-tree.c
@@ -33,42 +33,95 @@ int btrfs_delete_raid_extent(struct btrfs_trans_handle *trans, u64 start, u64 le
if (!path)
return -ENOMEM;

- while (1) {
- key.objectid = start;
- key.type = BTRFS_RAID_STRIPE_KEY;
- key.offset = length;
+again:
+ key.objectid = start;
+ key.type = BTRFS_RAID_STRIPE_KEY;
+ key.offset = length;

- ret = btrfs_search_slot(trans, stripe_root, &key, path, -1, 1);
- if (ret < 0)
- break;
- if (ret > 0) {
- ret = 0;
- if (path->slots[0] == 0)
- break;
- path->slots[0]--;
- }
+ ret = btrfs_search_slot(trans, stripe_root, &key, path, -1, 1);
+ if (ret < 0)
+ goto out;
+ if (ret > 0) {
+ ret = 0;
+ if (path->slots[0] == 0)
+ goto out;
+ path->slots[0]--;
+ }
+
+ leaf = path->nodes[0];
+ slot = path->slots[0];
+ btrfs_item_key_to_cpu(leaf, &key, slot);
+ found_start = key.objectid;
+ found_end = found_start + key.offset;
+
+ /* That stripe ends before we start, we're done. */
+ if (found_end <= start)
+ goto out;
+
+ trace_btrfs_raid_extent_delete(fs_info, start, end,
+ found_start, found_end);
+
+ if (found_start < start) {
+ u64 diff = start - found_start;
+ struct btrfs_key new_key;
+ int num_stripes;
+ struct btrfs_stripe_extent *stripe_extent;
+
+ new_key.objectid = start;
+ new_key.type = BTRFS_RAID_STRIPE_KEY;
+ new_key.offset = length - diff;
+
+ ret = btrfs_duplicate_item(trans, stripe_root, path,
+ &new_key);
+ if (ret)
+ goto out;

leaf = path->nodes[0];
slot = path->slots[0];
- btrfs_item_key_to_cpu(leaf, &key, slot);
- found_start = key.objectid;
- found_end = found_start + key.offset;

- /* That stripe ends before we start, we're done. */
- if (found_end <= start)
- break;
+ num_stripes =
+ btrfs_num_raid_stripes(btrfs_item_size(leaf, slot));
+ stripe_extent =
+ btrfs_item_ptr(leaf, slot, struct btrfs_stripe_extent);

- trace_btrfs_raid_extent_delete(fs_info, start, end,
- found_start, found_end);
+ for (int i = 0; i < num_stripes; i++) {
+ struct btrfs_raid_stride *raid_stride =
+ &stripe_extent->strides[i];
+ u64 physical =
+ btrfs_raid_stride_physical(leaf, raid_stride);

- ASSERT(found_start >= start && found_end <= end);
- ret = btrfs_del_item(trans, stripe_root, path);
+ btrfs_set_stack_raid_stride_physical(raid_stride,
+ physical + diff);
+ }
+
+ btrfs_mark_buffer_dirty(trans, leaf);
+ btrfs_release_path(path);
+ goto again;
+ }
+
+ if (found_end > end) {
+ u64 diff = found_end - end;
+ struct btrfs_key new_key;
+
+ new_key.objectid = found_start;
+ new_key.type = BTRFS_RAID_STRIPE_KEY;
+ new_key.offset = length - diff;
+
+ ret = btrfs_duplicate_item(trans, stripe_root, path,
+ &new_key);
if (ret)
- break;
+ goto out;

+ btrfs_mark_buffer_dirty(trans, leaf);
btrfs_release_path(path);
+ goto again;
+
}

+ if (found_start == start && found_end == end)
+ ret = btrfs_del_item(trans, stripe_root, path);
+
+ out:
btrfs_free_path(path);
return ret;
}

--
2.43.0


2024-06-10 19:44:02

by Josef Bacik

[permalink] [raw]
Subject: Re: [PATCH 2/3] btrfs: replace stripe extents

On Mon, Jun 10, 2024 at 10:40:26AM +0200, Johannes Thumshirn wrote:
> From: Johannes Thumshirn <[email protected]>
>
> If we can't insert a stripe extent in the RAID stripe tree, because
> the key that points to the specific position in the stripe tree is
> already existing, we have to remove the item and then replace it by a
> new item.
>
> This can happen for example on device replace operations.
>
> Signed-off-by: Johannes Thumshirn <[email protected]>
> ---
> fs/btrfs/ctree.c | 1 +
> fs/btrfs/raid-stripe-tree.c | 34 ++++++++++++++++++++++++++++++++++
> 2 files changed, 35 insertions(+)
>
> diff --git a/fs/btrfs/ctree.c b/fs/btrfs/ctree.c
> index 1a49b9232990..ad934c5469c4 100644
> --- a/fs/btrfs/ctree.c
> +++ b/fs/btrfs/ctree.c
> @@ -3844,6 +3844,7 @@ static noinline int setup_leaf_for_split(struct btrfs_trans_handle *trans,
> btrfs_item_key_to_cpu(leaf, &key, path->slots[0]);
>
> BUG_ON(key.type != BTRFS_EXTENT_DATA_KEY &&
> + key.type != BTRFS_RAID_STRIPE_KEY &&

This seems unrelated. Thanks,

Josef

2024-06-10 19:45:26

by Josef Bacik

[permalink] [raw]
Subject: Re: [PATCH 3/3] btrfs: split RAID stripes on deletion

On Mon, Jun 10, 2024 at 10:40:27AM +0200, Johannes Thumshirn wrote:
> From: JohnnesThumshirn <[email protected]>
>
> The current RAID stripe code assumes, that we will always remove a
> whole stripe entry.
>
> But ff we're only removing a part of a RAID stripe we're hitting the
> ASSERT()ion checking for this condition.
>
> Instead of assuming the complete deletion of a RAID stripe, split the
> stripe if we need to.
>
> Signed-off-by: Johnnes Thumshirn <[email protected]>

I'd like a selftest for this helper, should be relatively straightforward to do,
just to test edgecases and such. Thanks,

Josef

2024-06-11 06:33:00

by Johannes Thumshirn

[permalink] [raw]
Subject: Re: [PATCH 2/3] btrfs: replace stripe extents

On 10.06.24 21:43, Josef Bacik wrote:
> On Mon, Jun 10, 2024 at 10:40:26AM +0200, Johannes Thumshirn wrote:
>> From: Johannes Thumshirn <[email protected]>
>>
>> If we can't insert a stripe extent in the RAID stripe tree, because
>> the key that points to the specific position in the stripe tree is
>> already existing, we have to remove the item and then replace it by a
>> new item.
>>
>> This can happen for example on device replace operations.
>>
>> Signed-off-by: Johannes Thumshirn <[email protected]>
>> ---
>> fs/btrfs/ctree.c | 1 +
>> fs/btrfs/raid-stripe-tree.c | 34 ++++++++++++++++++++++++++++++++++
>> 2 files changed, 35 insertions(+)
>>
>> diff --git a/fs/btrfs/ctree.c b/fs/btrfs/ctree.c
>> index 1a49b9232990..ad934c5469c4 100644
>> --- a/fs/btrfs/ctree.c
>> +++ b/fs/btrfs/ctree.c
>> @@ -3844,6 +3844,7 @@ static noinline int setup_leaf_for_split(struct btrfs_trans_handle *trans,
>> btrfs_item_key_to_cpu(leaf, &key, path->slots[0]);
>>
>> BUG_ON(key.type != BTRFS_EXTENT_DATA_KEY &&
>> + key.type != BTRFS_RAID_STRIPE_KEY &&
>
> This seems unrelated. Thanks,
>
> Josef
>

Oops it should go into 3/3

2024-06-11 06:40:19

by Johannes Thumshirn

[permalink] [raw]
Subject: Re: [PATCH 3/3] btrfs: split RAID stripes on deletion

On 10.06.24 21:45, Josef Bacik wrote:
> On Mon, Jun 10, 2024 at 10:40:27AM +0200, Johannes Thumshirn wrote:
>> From: JohnnesThumshirn <[email protected]>
>>
>> The current RAID stripe code assumes, that we will always remove a
>> whole stripe entry.
>>
>> But ff we're only removing a part of a RAID stripe we're hitting the
>> ASSERT()ion checking for this condition.
>>
>> Instead of assuming the complete deletion of a RAID stripe, split the
>> stripe if we need to.
>>
>> Signed-off-by: Johnnes Thumshirn <[email protected]>
>
> I'd like a selftest for this helper, should be relatively straightforward to do,
> just to test edgecases and such. Thanks,

Sure. Let me see what I can cook up.

2024-06-11 14:46:58

by David Sterba

[permalink] [raw]
Subject: Re: [PATCH 1/3] btrfs: rst: remove encoding field from stripe_extent

On Mon, Jun 10, 2024 at 10:40:25AM +0200, Johannes Thumshirn wrote:
> -#define BTRFS_STRIPE_RAID5 5
> -#define BTRFS_STRIPE_RAID6 6
> -#define BTRFS_STRIPE_RAID1C3 7
> -#define BTRFS_STRIPE_RAID1C4 8
> -
> struct btrfs_stripe_extent {
> - __u8 encoding;
> - __u8 reserved[7];
> /* An array of raid strides this stripe is composed of. */
> - struct btrfs_raid_stride strides[];
> + __DECLARE_FLEX_ARRAY(struct btrfs_raid_stride, strides);

Is there a reason to use the __ underscore macro? I see no difference
between that and DECLARE_FLEX_ARRAY and underscore usually means that
it's special in some way.

2024-06-11 16:34:40

by Johannes Thumshirn

[permalink] [raw]
Subject: Re: [PATCH 1/3] btrfs: rst: remove encoding field from stripe_extent

On 11.06.24 16:37, David Sterba wrote:
> On Mon, Jun 10, 2024 at 10:40:25AM +0200, Johannes Thumshirn wrote:
>> -#define BTRFS_STRIPE_RAID5 5
>> -#define BTRFS_STRIPE_RAID6 6
>> -#define BTRFS_STRIPE_RAID1C3 7
>> -#define BTRFS_STRIPE_RAID1C4 8
>> -
>> struct btrfs_stripe_extent {
>> - __u8 encoding;
>> - __u8 reserved[7];
>> /* An array of raid strides this stripe is composed of. */
>> - struct btrfs_raid_stride strides[];
>> + __DECLARE_FLEX_ARRAY(struct btrfs_raid_stride, strides);
>
> Is there a reason to use the __ underscore macro? I see no difference
> between that and DECLARE_FLEX_ARRAY and underscore usually means that
> it's special in some way.
>

Yes, the __ version is for UAPI, like __u8 or __le32 and so on.

2024-06-13 21:24:19

by David Sterba

[permalink] [raw]
Subject: Re: [PATCH 1/3] btrfs: rst: remove encoding field from stripe_extent

On Tue, Jun 11, 2024 at 04:33:19PM +0000, Johannes Thumshirn wrote:
> On 11.06.24 16:37, David Sterba wrote:
> > On Mon, Jun 10, 2024 at 10:40:25AM +0200, Johannes Thumshirn wrote:
> >> -#define BTRFS_STRIPE_RAID5 5
> >> -#define BTRFS_STRIPE_RAID6 6
> >> -#define BTRFS_STRIPE_RAID1C3 7
> >> -#define BTRFS_STRIPE_RAID1C4 8
> >> -
> >> struct btrfs_stripe_extent {
> >> - __u8 encoding;
> >> - __u8 reserved[7];
> >> /* An array of raid strides this stripe is composed of. */
> >> - struct btrfs_raid_stride strides[];
> >> + __DECLARE_FLEX_ARRAY(struct btrfs_raid_stride, strides);
> >
> > Is there a reason to use the __ underscore macro? I see no difference
> > between that and DECLARE_FLEX_ARRAY and underscore usually means that
> > it's special in some way.
> >
>
> Yes, the __ version is for UAPI, like __u8 or __le32 and so on.

I see, though I'd rather keep the on-disk definitions free of wrappers
that hide the types. We use the __ int types but that's all and quite
clear what it means.

There already are flexible members (btrfs_leaf, btrfs_node,
btrfs_inode_extref), using the empty[] syntax. The macro wraps the
distinction that c++ needs but so far the existing declarations have't
been problematic. So I'd rather keep the declarations consistent.

2024-06-14 09:36:59

by Johannes Thumshirn

[permalink] [raw]
Subject: Re: [PATCH 1/3] btrfs: rst: remove encoding field from stripe_extent

On 13.06.24 23:23, David Sterba wrote:
> On Tue, Jun 11, 2024 at 04:33:19PM +0000, Johannes Thumshirn wrote:
>> On 11.06.24 16:37, David Sterba wrote:
>>> On Mon, Jun 10, 2024 at 10:40:25AM +0200, Johannes Thumshirn wrote:
>>>> -#define BTRFS_STRIPE_RAID5 5
>>>> -#define BTRFS_STRIPE_RAID6 6
>>>> -#define BTRFS_STRIPE_RAID1C3 7
>>>> -#define BTRFS_STRIPE_RAID1C4 8
>>>> -
>>>> struct btrfs_stripe_extent {
>>>> - __u8 encoding;
>>>> - __u8 reserved[7];
>>>> /* An array of raid strides this stripe is composed of. */
>>>> - struct btrfs_raid_stride strides[];
>>>> + __DECLARE_FLEX_ARRAY(struct btrfs_raid_stride, strides);
>>>
>>> Is there a reason to use the __ underscore macro? I see no difference
>>> between that and DECLARE_FLEX_ARRAY and underscore usually means that
>>> it's special in some way.
>>>
>>
>> Yes, the __ version is for UAPI, like __u8 or __le32 and so on.
>
> I see, though I'd rather keep the on-disk definitions free of wrappers
> that hide the types. We use the __ int types but that's all and quite
> clear what it means.
>
> There already are flexible members (btrfs_leaf, btrfs_node,
> btrfs_inode_extref), using the empty[] syntax. The macro wraps the
> distinction that c++ needs but so far the existing declarations have't
> been problematic. So I'd rather keep the declarations consistent.
>

Yes but all these examples have other members as well. After this patch,
btrfs_stripe_extent is a container for btrfs_raid_stride, and C doesn't
allow a flexmember only struct:

In file included from fs/btrfs/ctree.h:18,
from fs/btrfs/delayed-inode.h:19,
from fs/btrfs/super.c:32:
./include/uapi/linux/btrfs_tree.h:753:34: error: flexible array member
in a struct with no named members
753 | struct btrfs_raid_stride strides[];
| ^~~~~~~