2005-02-21 05:26:01

by Alex Adriaanse

Subject: Odd data corruption problem with LVM/ReiserFS

As of this morning I've been experiencing a very odd data corruption
problem on my server. Let me post some background information first.

For the past few years I've been running this server under Linux 2.4.x
and Debian Woody. It has two software RAID 1 partitions, one for the
ReiserFS root filesystem (md0), and one for LVM running on top of RAID
1 (md1). Under LVM I have three logical volumes, one for /usr, one
for /var, and one for /home. All of them run ReiserFS. Also, during
daily backups I'd create a snapshot of /var and /home and back that
up. I haven't experienced any problems with this, other than
occasional power outages which might've corrupted some log files by
adding a bunch of NULs to them, but that's never caused problems for me.

A few weeks ago I decided to upgrade to Debian Sarge. This was a
fairly smooth process, and I haven't seen any problems with it (and I
don't think it is related to the problem I describe below). Also,
last week I decided to upgrade from the 2.4.22 kernel to 2.6.10-ac12.
This has been a pretty smooth ride too (until this morning). One
exception is that I might not have had swap turned on due to device
name changes, so yesterday I saw some big processes getting killed due
to out-of-memory conditions (my server has 256MB non-ECC RAM, normally
with 512MB of swap). That swap issue was not fixed until this
afternoon, after the crash/corruption. Yesterday afternoon I also
updated the metadata of my LVM volume group from version 1 to version
2. I had temporarily stopped taking snapshots once I upgraded to
2.6.10-ac12, since it didn't like taking snapshots inside LVM1 volume
groups. This morning was the first time my backup script took a
snapshot since upgrading to 2.6.10-ac12 (yesterday I had taken a few
snapshots myself for testing purposes, which seemed to work fine).
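
For reference, the commands/checks involved were roughly these (typed
from memory, and the volume group name "vg0" is just a placeholder):

    cat /proc/swaps    # shows whether any swap is actually active
    swapon -a          # re-enable swap after fixing the device name in /etc/fstab
    vgconvert -M2 vg0  # convert the volume group metadata from LVM1 to LVM2 format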

This morning, when I tried to log in after the backup process (which
takes snapshots) had started, I couldn't get in. SSH would just hang
after I sent my username. After a while I gave up waiting and tried
to reboot the server by attaching a keyboard and hitting Ctrl-Alt-Del,
which started the shutdown process. I can't fully remember if that
successfully rebooted the server, but I believe I ended up having to
press the reset button because the shutdown process would hang at some
point. The server came back up but some processes wouldn't start due
to some corrupted files in the /var partition.

I checked the logs, and saw a bunch of the messages below. On a
side note: when my backup script isn't able to mount a snapshot, it
removes it, waits a minute, then tries creating/mounting the snapshot
again, supposedly up to 10 times. However, those messages, spaced one
minute apart, occurred far more than 10 times, so that might be a bug
in my script. The retry logic is there because of occasional problems
I had with older kernels, which sometimes failed to mount the snapshot
but succeeded when trying again later.
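
The retry logic in the script is roughly the following (paraphrased;
the volume, snapshot and mount point names are just placeholders, not
the actual ones):

    #!/bin/sh
    # try up to 10 times to create and mount a snapshot of /var
    for i in $(seq 1 10); do
        lvcreate --snapshot --size 1G --name varsnap /dev/vg0/var &&
            mount /dev/vg0/varsnap /mnt/varsnap && break
        # clean up whatever half-created state is left, wait, try again
        umount /mnt/varsnap 2>/dev/null
        lvremove -f /dev/vg0/varsnap 2>/dev/null
        sleep 60
    done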

These are the messages I saw:

Feb 20 09:59:16 homer kernel: lvcreate: page allocation failure. order:0, mode:0xd0
Feb 20 09:59:16 homer kernel: [__alloc_pages+440/864] __alloc_pages+0x1b8/0x360
Feb 20 09:59:16 homer kernel: [alloc_pl+51/96] alloc_pl+0x33/0x60
Feb 20 09:59:16 homer kernel: [client_alloc_pages+28/96] client_alloc_pages+0x1c/0x60
Feb 20 09:59:16 homer kernel: [vmalloc+32/48] vmalloc+0x20/0x30
Feb 20 09:59:16 homer kernel: [kcopyd_client_create+104/192] kcopyd_client_create+0x68/0xc0
Feb 20 09:59:16 homer kernel: [dm_create_persistent+199/320] dm_create_persistent+0xc7/0x140
Feb 20 09:59:16 homer kernel: [snapshot_ctr+680/880] snapshot_ctr+0x2a8/0x370
Feb 20 09:59:16 homer kernel: [dm_table_add_target+262/432] dm_table_add_target+0x106/0x1b0
Feb 20 09:59:16 homer kernel: [populate_table+130/224] populate_table+0x82/0xe0
Feb 20 09:59:16 homer kernel: [table_load+103/368] table_load+0x67/0x170
Feb 20 09:59:16 homer kernel: [ctl_ioctl+241/336] ctl_ioctl+0xf1/0x150
Feb 20 09:59:16 homer kernel: [table_load+0/368] table_load+0x0/0x170
Feb 20 09:59:16 homer kernel: [sys_ioctl+173/528] sys_ioctl+0xad/0x210
Feb 20 09:59:16 homer kernel: [syscall_call+7/11] syscall_call+0x7/0xb
Feb 20 09:59:16 homer kernel: device-mapper: error adding target to table
Feb 20 09:59:16 homer kernel: lvremove: page allocation failure. order:0, mode:0xd0
Feb 20 09:59:16 homer kernel: [__alloc_pages+440/864] __alloc_pages+0x1b8/0x360
Feb 20 09:59:16 homer kernel: [alloc_pl+51/96] alloc_pl+0x33/0x60
Feb 20 09:59:16 homer kernel: [client_alloc_pages+28/96] client_alloc_pages+0x1c/0x60
Feb 20 09:59:16 homer kernel: [vmalloc+32/48] vmalloc+0x20/0x30
Feb 20 09:59:16 homer kernel: [kcopyd_client_create+104/192] kcopyd_client_create+0x68/0xc0
Feb 20 09:59:16 homer kernel: [dm_create_persistent+199/320] dm_create_persistent+0xc7/0x140
Feb 20 09:59:16 homer kernel: [snapshot_ctr+680/880] snapshot_ctr+0x2a8/0x370
Feb 20 09:59:16 homer kernel: [dm_table_add_target+262/432] dm_table_add_target+0x106/0x1b0
Feb 20 09:59:16 homer kernel: [populate_table+130/224] populate_table+0x82/0xe0
Feb 20 09:59:16 homer kernel: [table_load+103/368] table_load+0x67/0x170
Feb 20 09:59:16 homer kernel: [ctl_ioctl+241/336] ctl_ioctl+0xf1/0x150
Feb 20 09:59:16 homer kernel: [table_load+0/368] table_load+0x0/0x170
Feb 20 09:59:16 homer kernel: [sys_ioctl+173/528] sys_ioctl+0xad/0x210
Feb 20 09:59:16 homer kernel: [syscall_call+7/11] syscall_call+0x7/0xb
Feb 20 09:59:16 homer kernel: device-mapper: error adding target to table
...

As far as I can tell all the directories are still intact, but there
were a good number of files that had been corrupted. Those files
looked like they had some chunks removed, and some had a bunch of NUL
characters (in blocks of 4096 bytes). Some files even had chunks of
other files inside of them! I suspect that some of the contents of
the other files came from different partitions (e.g. /home contents in
a /var file). I also believe I saw contents from my /root/.viminfo in
one of the files in /var or /home (and keep in mind that my root
partition, which holds /root, does not use LVM).
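
If it helps, this is roughly how I've been spotting the NUL damage (a
quick hack; it will also flag files that legitimately contain NUL
bytes, and the path is just an example):

    # flag files whose size changes when NUL bytes are stripped out
    find /var -type f | while read f; do
        a=$(wc -c < "$f")
        b=$(tr -d '\0' < "$f" | wc -c)
        [ "$a" -ne "$b" ] && echo "NUL bytes in: $f"
    done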

/var was not the only volume that was corrupted; /home was corrupted
as well. I first thought only files that had been written to within
the past 24 hours were corrupted, but later I realized that some files
that hadn't been changed for months were corrupted too.

I did not test /usr for corruption. Also, from some quick checks I
did on my root (non-LVM) partition I did not find any corrupted files
there, as far as I can remember.

I did a reiserfsck (3.6.19) on /var, which did not report any problems.

So, I figured I'd just restore those corrupted files from my backups.
I restored a few corrupted files, verified that they were in good
shape, moved on to other parts... only to find out that the files I
restored first had become corrupted again (with a similar type of
corruption as I saw before)!

Another thing to keep in mind is that I never removed those snapshots
after rebooting. I'd be curious to see if the continual corruption
goes away if I remove those snapshots, but I'll wait to make sure you
guys don't want me to try anything else first.

Also, I did a preliminary memory check to make sure my memory hadn't
gone bad: a single pass of memtest86+ did not find any problems.

Anyway, what do you guys think could be the problem? Could it be that
the LVM / Device Mapper snapshot feature is solely responsible for
this corruption? (I'm sure there's a reason it's marked
Experimental).

I know it might be hard to pinpoint without more details. If you want
me to provide more details (e.g. LVM data, debugreiserfs data, kernel
config, more system details, etc.) or run some experiments, I can do
so, but please let me know ASAP because I'm planning on scrapping my
entire LVM volume group and restoring it from my backups tomorrow
unless I'm told otherwise.

Thanks a lot,

Alex


2005-02-21 10:49:50

by Vladimir V. Saveliev

Subject: Re: Odd data corruption problem with LVM/ReiserFS

Hello

On Mon, 2005-02-21 at 08:25, Alex Adriaanse wrote:
> [...]
>
> So, I figured I'd just restore those corrupted files from my backups.
> I restored a few corrupted files, verified that they were in good
> shape, moved on to other parts... only to find out that the files I restored
> first had become corrupted again (with a similar type of corruption as
> I saw before)!
>
You should ask the LVM mailing list, I guess.


2005-02-21 11:37:55

by Andreas Steinmetz

Subject: Re: Odd data corruption problem with LVM/ReiserFS

Alex Adriaanse wrote:
> As far as I can tell all the directories are still intact, but there
> was a good number of files that had been corrupted. Those files
> looked like they had some chunks removed, and some had a bunch of NUL
> characters (in blocks of 4096 characters). Some files even had chunks
> of other files inside of them!

I can second that. I had the same experience this weekend on a
md/dm/reiserfs setup. The funny thing is that e.g. find reports I/O
errors but if you then run tar on the tree you eventually get the
correct data from tar. Then run find again and you'll again get I/O errors.

> I did a reiserfsck (3.6.19) on /var, which did not report any problems.

You need to run 'reiserfsck --rebuild-tree' and see what happens :-(
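
Something along these lines, on the unmounted device (the device name
is just an example, and --rebuild-tree rewrites the whole tree, so copy
the device somewhere safe first):

    umount /var
    reiserfsck --check /dev/vg0/var
    # only if --check reports tree damage:
    reiserfsck --rebuild-tree /dev/vg0/var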

> Anyway, what do you guys think could be the problem? Could it be that
> the LVM / Device Mapper snapshot feature is solely responsible for
> this corruption? (I'm sure there's a reason it's marked
> Experimental).

I don't think so - I changed from reiserfs to ext3 without changing the
underlying dm/raid5 and this seems to work properly.

I can furthermore state that reiserfs without dm/md does work correctly,
as I use reiserfs on an IEEE 1394 backup disk (which saved me from
terrible trouble).

Currently I can only warn against using reiserfs with dm/md on 2.6.
--
Andreas Steinmetz SPAMmers use [email protected]

2005-02-21 15:19:05

by Alasdair G Kergon

Subject: Re: Odd data corruption problem with LVM/ReiserFS

On Sun, Feb 20, 2005 at 11:25:37PM -0600, Alex Adriaanse wrote:
> This morning was the first time my backup script took
> a snapshot since upgrading to 2.6.10-ac12 (yesterday I had taken a few
> snapshots myself for testing purposes, this seemed to work fine).

a) Activating a snapshot requires a lot of memory;

b) If a snapshot can't get the memory it needs, you have to back it
out manually (using dmsetup - some combination of resume, remove &
possibly reload) to avoid locking up the volume; what you have to do
depends on how far it got before it failed (see the rough sketch
below);

c) You should be OK once a snapshot is active and its origin has
successfully had a block written to it.
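
Very roughly, the manual back-out looks something like this (which
steps apply depends on how far the failed lvcreate got, and the
"vg0-varsnap" device names are only examples):

    dmsetup table                   # see which mappings were left behind
    dmsetup info -c                 # check which of them are suspended
    dmsetup resume vg0-var          # un-suspend the origin so I/O can proceed
    dmsetup remove vg0-varsnap      # remove the half-created snapshot device
    dmsetup remove vg0-varsnap-cow  # ...and its COW device, if present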

Work is underway to address the various problems with snapshot activation
- we think we understand them all - but until the fixes have worked their
way through, unless you've enough memory in the machine it's best to avoid
them.

Suggestions:
Only do one snapshot+backup at once;
Make sure logging in as root and using dmsetup does not depend on access
to anything in /var or /home (similar to the case of hard NFS mounts with
the server down) so you can still log in;

BTW, also never snapshot the root filesystem unless you've mounted it
noatime or disabled hotplug etc. - e.g. the machine can lock up
attempting to update the atime on /sbin/hotplug while writes to the
filesystem are blocked.
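
e.g. something like this (illustrative only; adjust the device and
filesystem names to your setup):

    mount -o remount,noatime /
    # or permanently, in /etc/fstab:
    # /dev/md0  /  reiserfs  defaults,noatime  0  1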

Alasdair
--
[email protected]

2005-02-21 16:44:45

by Alex Adriaanse

Subject: Re: Odd data corruption problem with LVM/ReiserFS

On Mon, 21 Feb 2005 12:37:53 +0100, Andreas Steinmetz <[email protected]> wrote:
> Alex Adriaanse wrote:
> > As far as I can tell all the directories are still intact, but there
> > was a good number of files that had been corrupted. Those files
> > looked like they had some chunks removed, and some had a bunch of NUL
> > characters (in blocks of 4096 characters). Some files even had chunks
> > of other files inside of them!
>
> I can second that. I had the same experience this weekend on a
> md/dm/reiserfs setup. The funny thing is that e.g. find reports I/O
> errors but if you then run tar on the tree you eventually get the
> correct data from tar. Then run find again and you'll again get I/O errors.

The weird thing is I did not see any I/O errors in my logs, and
running find on /var worked without a problem. By the way, did you
take any DM snapshots when you experienced that corruption?

2005-02-21 18:02:18

by Andreas Steinmetz

Subject: Re: Odd data corruption problem with LVM/ReiserFS

Alex Adriaanse wrote:
> The weird thing is I did not see any I/O errors in my logs, and
> running find on /var worked without a problem. By the way, did you
> take any DM snapshots when you experienced that corruption?

No, no snapshots. Just find running on a large dataset (a source tree,
about 16GB). The fun part is that I got the I/O errors for varying
directories, and 'ls'-ing these directories after find failed gave
errors too. However, a follow-up tar to the IEEE 1394 disk to salvage
the data could actually access all data correctly. The day before I
had experienced the same symptom and rebooted. That caused actual
damage all over the place and I had to restore from the last
checkpoint I made.
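
A sketch of the kind of check that shows the effect (the paths are just
placeholders; nothing fancy):

    # read the whole tree several times, logging checksums and read errors
    for pass in 1 2 3; do
        find /data -type f -print0 | sort -z |
            xargs -0 md5sum > /tmp/pass$pass.md5 2> /tmp/pass$pass.err
    done
    # checksums or error logs that differ between passes point at
    # transient read problems rather than stable on-disk damage
    diff /tmp/pass1.md5 /tmp/pass2.md5
    wc -l /tmp/pass*.err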

--
Andreas Steinmetz SPAMmers use [email protected]

2005-02-21 21:28:55

by Alex Adriaanse

Subject: Re: Odd data corruption problem with LVM/ReiserFS

Alasdair,

Thanks for the tips. Do you think it's possible DM's snapshots
could've caused this corruption, or do you think the problem lies
elsewhere?

Alex



2005-02-22 03:19:54

by Alex Adriaanse

Subject: Re: Odd data corruption problem with LVM/ReiserFS

I found out some interesting things tonight. I removed my /var and
/home snapshots, and all the corruption had disappeared, with the
exception of files I had changed while /var and /home were in their
corrupted state!

I overwrote several files on /var that were corrupt with clean copies
from my backups, and verified that they were OK. I then created a new
/var snapshot, mounted it, only to find out that the files on that
snapshot were still corrupt, but the files under the real /var were
still in good shape. I umounted, lvremoved, lvcreated, and mounted
the /var snapshot, and saw the same results. Even after removing the
snapshot, rebooting, and recreating the snapshot I saw the same thing
(real /var had correct file, snapshot /var had corrupt file).
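
For reference, the test I keep repeating is essentially this (the file,
volume and mount point names are just examples):

    # restore a known-corrupt file from backup and verify the origin is clean
    cp /backup/var/lib/somefile /var/lib/somefile
    md5sum /backup/var/lib/somefile /var/lib/somefile

    # take a fresh snapshot of /var and compare the same file on it
    lvcreate --snapshot --size 1G --name varsnap /dev/vg0/var
    mount -o ro /dev/vg0/varsnap /mnt/varsnap
    md5sum /var/lib/somefile /mnt/varsnap/lib/somefile   # these differ here

    umount /mnt/varsnap
    lvremove -f /dev/vg0/varsnap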

Do you think my volume group has simply become corrupt and will need
to be recreated, or do you guys think this is a bug in dm-snapshot?
If so, please let me know what I can do to help you guys debug this.

Thanks,

Alex



2005-02-22 19:03:27

by Marc Lehmann

Subject: Re: Odd data corruption problem with LVM/ReiserFS

On Mon, Feb 21, 2005 at 12:37:53PM +0100, Andreas Steinmetz <[email protected]> wrote:
> >Anyway, what do you guys think could be the problem? Could it be that
> >the LVM / Device Mapper snapshot feature is solely responsible for
> >this corruption? (I'm sure there's a reason it's marked
> >Experimental).
>
> I don't think so - I changed from reiserfs to ext3 without changing the
> underlying dm/raid5 and this seems to work properly.

I use both reiserfs and ext3 on lvm/dm on raid.

Both filesystems have issues when restoring from backup (i.e. very heavy
write activity).

I did report this to the linux-kernel list, and got as a reply that
there are indeed races *somewhere*, but as of yet there is no fix.

The symptoms are _not_ I/O errors (and until I see logs I wouldn't
believe that there are real I/O errors), but usually too-high block
numbers.

A reboot fixes this for both ext3 and reiserfs (i.e. the error is gone).

You might want to explore this problem and decide for yourself if it's caused
by I/O errors (in which case you have a disk problem) or "just" filesystem
corruption.

--
The choice of a
-----==- _GNU_
----==-- _ generation Marc Lehmann
---==---(_)__ __ ____ __ [email protected]
--==---/ / _ \/ // /\ \/ / http://schmorp.de/
-=====/_/_//_/\_,_/ /_/\_\ XX11-RIPE

2005-02-22 19:43:12

by Andreas Steinmetz

Subject: Re: Odd data corruption problem with LVM/ReiserFS

pcg( Marc)@goof(A.).(Lehmann )com wrote:
> I use both reiserfs and ext3 on lvm/dm on raid.
>
> Both filesystems have issues when restoring from backup (i.e. very heavy
> write activity).
>
> I did report this to the linux kernel, and got as reply that there are
> indeed races *somewhere*, but as of yet there is no fix.
>
> The symptoms are _not_ I/O errors (but until I see logs I wouldn't believe
> you that there are real I/O errors), but usually too-high block numbers.
>

To clarify: there were no disk I/O errors; the only I/O errors were
reported by find during operation, so it is definitely filesystem
corruption that is going on here.
Though find performs heavy read activity, there could well be heavy
write activity involved due to atime updates, so this fits your
description.

> A reboot fixes this for both ext3 and reiserfs (i.e. the error is gone).
>

Well, it didn't fix it for me. The fs was trashed for good. The major
question for me now is the usability of md/dm for any purpose with
2.6.x. For me this is a showstopper for any kind of 2.6 production use.
--
Andreas Steinmetz SPAMmers use [email protected]

2005-02-22 19:49:29

by Marc Lehmann

Subject: Re: Odd data corruption problem with LVM/ReiserFS

On Tue, Feb 22, 2005 at 08:39:21PM +0100, Andreas Steinmetz <[email protected]> wrote:
> To clarify: there were no disk I/O errors, only I/O errors were reported
> by find during operation so it is definitely filesystem corruption
> that is going on here.
> Though find performs heavy read activity there could well be heavy write
> activity be involved due to atime updates so this fits your description.

That wouldn't fit my definition of "heavy", but as it is a race
somewhere, it can happen under many circumstances, I guess.

> >A reboot fixes this for both ext3 and reiserfs (i.e. the error is gone).
> >
>
> Well, it didn't fix it for me. The fs was trashed for good. The major
> question for me is now usability of md/dm for any purpose with 2.6.x.
> For me this is a showstopper for any kind of 2.6 production use.

Well, I do use reiserfs->aes-loop->lvm/dm->md/raid5, and it never failed
for me, except once, and the error is likely to be outside reiserfs, and
possibly outside lvm.

However, your case *is* different, as the corruption wasn't permanent
for me, so chances are that you are hitting something else.

Of course, many people find 2.6 too unstable for production use (remember
2.4.x, it was the same), and the most important rule is: if it fails for
you, it's a showstopper.

Also, mild corruption is likely to be more disastrous to reiserfs as it
uses a much denser representation on disk (in other words: it uses space
more efficiently, but that comes at the price of redundancy).

--
The choice of a
-----==- _GNU_
----==-- _ generation Marc Lehmann
---==---(_)__ __ ____ __ [email protected]
--==---/ / _ \/ // /\ \/ / http://schmorp.de/
-=====/_/_//_/\_,_/ /_/\_\ XX11-RIPE

2005-02-22 20:46:53

by Alex Adriaanse

Subject: Re: Odd data corruption problem with LVM/ReiserFS

On Tue, 22 Feb 2005 20:49:00 +0100, Marc A. Lehmann <[email protected]> wrote:
> > >A reboot fixes this for both ext3 and reiserfs (i.e. the error is gone).
> > >
> >
> > Well, it didn't fix it for me. The fs was trashed for good. The major
> > question for me is now usability of md/dm for any purpose with 2.6.x.
> > For me this is a showstopper for any kind of 2.6 production use.
>
> Well, I do use reiserfs->aes-loop->lvm/dm->md/raid5, and it never failed
> for me, except once, and the error is likely to be outside reiserfs, and
> possibly outside lvm.

Marc, what about you, were you using dm-snapshot when you experienced
temporary corruption?

Alex

2005-02-22 20:54:26

by Marc Lehmann

Subject: Re: Odd data corruption problem with LVM/ReiserFS

On Tue, Feb 22, 2005 at 02:46:44PM -0600, Alex Adriaanse <[email protected]> wrote:
> On Tue, 22 Feb 2005 20:49:00 +0100, Marc A. Lehmann <[email protected]> wrote:
> > Well, I do use reiserfs->aes-loop->lvm/dm->md/raid5, and it never failed
> > for me, except once, and the error is likely to be outside reiserfs, and
> > possibly outside lvm.
>
> Marc, what about you, were you using dm-snapshot when you experienced
> temporary corruption?

No snapshots either.

--
The choice of a
-----==- _GNU_
----==-- _ generation Marc Lehmann
---==---(_)__ __ ____ __ [email protected]
--==---/ / _ \/ // /\ \/ / http://schmorp.de/
-=====/_/_//_/\_,_/ /_/\_\ XX11-RIPE