2010-02-23 05:43:46

by Theodore Ts'o

[permalink] [raw]
Subject: [PATCH] e2fsck: Fix bug which can cause e2fsck -fD to corrupt non-indexed directories

E2fsprogs 1.41.10 introduced a regression (in commit b71e018) where
e2fsck -fD can corrupt non-indexed directories when are exists one or
more file names which alphabetically sort before ".". This can happen
with ext2 filesystems or for small directories (take less than a
block) which contain filenames that begin with a space or some other
punctuation mark.

Fix this by making sure we never reorder the '.' or '..' entry in the
directory, since they must be first.

Signed-off-by: "Theodore Ts'o" <[email protected]>
---
e2fsck/rehash.c | 7 ++++++-
1 files changed, 6 insertions(+), 1 deletions(-)

diff --git a/e2fsck/rehash.c b/e2fsck/rehash.c
index 780742e..ceb8543 100644
--- a/e2fsck/rehash.c
+++ b/e2fsck/rehash.c
@@ -763,7 +763,12 @@ retry_nohash:

/* Sort the list */
resort:
- qsort(fd.harray, fd.num_array, sizeof(struct hash_entry), hash_cmp);
+ if (fd.compress)
+ qsort(fd.harray+2, fd.num_array-2, sizeof(struct hash_entry),
+ hash_cmp);
+ else
+ qsort(fd.harray, fd.num_array, sizeof(struct hash_entry),
+ hash_cmp);

/*
* Look for duplicates
--
1.6.6.1.1.g974db.dirty



2010-02-24 08:25:19

by Andreas Dilger

[permalink] [raw]
Subject: Re: [PATCH] e2fsck: Fix bug which can cause e2fsck -fD to corrupt non-indexed directories

On 2010-02-22, at 22:43, Theodore Ts'o wrote:
> E2fsprogs 1.41.10 introduced a regression (in commit b71e018) where
> e2fsck -fD can corrupt non-indexed directories when are exists one or
> more file names which alphabetically sort before ".". This can happen
> with ext2 filesystems or for small directories (take less than a
> block) which contain filenames that begin with a space or some other
> punctuation mark.
>
> Fix this by making sure we never reorder the '.' or '..' entry in the
> directory, since they must be first.

Ted, thanks for getting this out so fast. We were just building our
release based on 1.41.10 and are able to add this in.

Do you have a regression test for this case? e2fsck itself will
detect the corruption ("." and ".." not at the start of the directory)
after the fact, but I guess it means that there are no existing tests
where there is a directory entry that sorts before "." or it would
have been noticed earlier.

Looking at the ASCII characters before "." it seems that CVS "old
version" files with a leading ".#", "(", and "%gconf.xml" would
probably be the most likely cause of problems.

In any case, I can reproduce this easily on my filesystem with my mp3
collection (on a backup, fortunately). Unfortunately, it seems that
re-running e2fsck after such a corruption causes all (thousands) of
the entries that were sorted to the beginning of the directory to be
deleted, and moved into /lost+found.

Have you pulled this release from Sourceforge and any downstream
releases already (Debian, FC, etc)? It seems like a pretty serious
problem, even though "-fD" is likely not run very often.

> Signed-off-by: "Theodore Ts'o" <[email protected]>
> ---
> e2fsck/rehash.c | 7 ++++++-
> 1 files changed, 6 insertions(+), 1 deletions(-)
>
> diff --git a/e2fsck/rehash.c b/e2fsck/rehash.c
> index 780742e..ceb8543 100644
> --- a/e2fsck/rehash.c
> +++ b/e2fsck/rehash.c
> @@ -763,7 +763,12 @@ retry_nohash:
>
> /* Sort the list */
> resort:
> - qsort(fd.harray, fd.num_array, sizeof(struct hash_entry), hash_cmp);
> + if (fd.compress)
> + qsort(fd.harray+2, fd.num_array-2, sizeof(struct hash_entry),
> + hash_cmp);
> + else
> + qsort(fd.harray, fd.num_array, sizeof(struct hash_entry),
> + hash_cmp);
>
> /*
> * Look for duplicates
> --
> 1.6.6.1.1.g974db.dirty
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-
> ext4" in
> the body of a message to [email protected]
> More majordomo info at http://vger.kernel.org/majordomo-info.html


Cheers, Andreas
--
Andreas Dilger
Sr. Staff Engineer, Lustre Group
Sun Microsystems of Canada, Inc.


2010-02-24 15:11:07

by Theodore Ts'o

[permalink] [raw]
Subject: Re: [PATCH] e2fsck: Fix bug which can cause e2fsck -fD to corrupt non-indexed directories

On Wed, Feb 24, 2010 at 01:25:16AM -0700, Andreas Dilger wrote:
> Do you have a regression test for this case? e2fsck itself will
> detect the corruption ("." and ".." not at the start of the
> directory) after the fact, but I guess it means that there are no
> existing tests where there is a directory entry that sorts before
> "." or it would have been noticed earlier.

Yes, I'll add a regression test; binary files just don't work well in
patch sets, so I tend to put those in separate commits, for ease in
cherry picking. Basically it's just a 100k ext2 filesystem with a
directory which happens to contain a name that begins with a open
parenthesis, i.e., "(oops)".

> Have you pulled this release from Sourceforge and any downstream
> releases already (Debian, FC, etc)? It seems like a pretty serious
> problem, even though "-fD" is likely not run very often.

I was just going to accelerate getting 1.41.11 out the door, as
opposed to going to the effort of trying to deprecate 1.41.10. In the
case of Debian, and Ubuntu, it's too late already since 1.41.10 has
already propagated out to bleeding-edge users. So the only way to
pull it back would be to get a new release out the door, quickly...

I guess I can easily enough pull it from kernel.org and make
1.41.9 the default release to download on sourceforge.net.

- Ted

2010-02-24 16:28:40

by Eric Sandeen

[permalink] [raw]
Subject: Re: [PATCH] e2fsck: Fix bug which can cause e2fsck -fD to corrupt non-indexed directories

[email protected] wrote:
> On Wed, Feb 24, 2010 at 01:25:16AM -0700, Andreas Dilger wrote:
>> Do you have a regression test for this case? e2fsck itself will
>> detect the corruption ("." and ".." not at the start of the
>> directory) after the fact, but I guess it means that there are no
>> existing tests where there is a directory entry that sorts before
>> "." or it would have been noticed earlier.
>
> Yes, I'll add a regression test; binary files just don't work well in
> patch sets, so I tend to put those in separate commits, for ease in
> cherry picking. Basically it's just a 100k ext2 filesystem with a
> directory which happens to contain a name that begins with a open
> parenthesis, i.e., "(oops)".
>
>> Have you pulled this release from Sourceforge and any downstream
>> releases already (Debian, FC, etc)? It seems like a pretty serious
>> problem, even though "-fD" is likely not run very often.
>
> I was just going to accelerate getting 1.41.11 out the door, as
> opposed to going to the effort of trying to deprecate 1.41.10. In the
> case of Debian, and Ubuntu, it's too late already since 1.41.10 has
> already propagated out to bleeding-edge users.

Ditto for fedora, but I pushed this patch to rawhide yesterday, thanks.

-Eric

> So the only way to
> pull it back would be to get a new release out the door, quickly...
>
> I guess I can easily enough pull it from kernel.org and make
> 1.41.9 the default release to download on sourceforge.net.
>
> - Ted
> --
> To unsubscribe from this list: send the line "unsubscribe linux-ext4" in
> the body of a message to [email protected]
> More majordomo info at http://vger.kernel.org/majordomo-info.html