2011-12-24 08:17:12

by Sandon Van Ness

[permalink] [raw]
Subject: [email protected]

Most of our machines are ext3 and have seen the card get reset on ext3
and it never went read-only like it always does in ext4 now. The I/O
goes unresponsive for a few minutes as it detects I/O is unresponsive
and then the controller is reset and the machine would recover (on
ext3/jfs, and other fs's) on ext4 the journal is aborted and it goes
into read-only:

Anyone ever see something like this?


[605458.429395] scsi cmnd aborted, scsi_cmnd(0xffff88041c4dac80),
cmnd[0x28,0x 0,0xcb,0x 4,0x13,0x40,0x 0,0x 0,0x 8,0x 0,0x 0,0x 0],
scsi_id = 0x 0, scsi_lun = 0x 1.
[605458.444011] arcmsr1: executing eh bus reset .....num_resets = 0,
num_aborts = 114
[605458.451724] arcmsr1: executing hw bus reset .....
[605480.472827] arcmsr1: waiting for hw bus reset return, retry=1
[605500.486408] arcmsr1: waiting for hw bus reset return, retry=2
[605520.516017] Areca RAID Controller1: F/W V1.49 2010-12-02 & Model
ARC-1222
[606900.929121] EXT4-fs error (device sdb1): mb_free_blocks:1397: group
39137block 1282445473:freeing already freed block (bit 4257)
[606900.941415] Aborting journal on device sdb1-8.
[606900.941561] EXT4-fs error (device sdb1) in ext4_setattr:5462:
Readonly filesystem
[606900.955051] Aborting journal on device sdb1-8.

Seconds after the card is reset and recovers the journal is aborted and
read-only. Here is another case where it happens even before the card is
reset:

[574763.342694] scsi cmnd aborted, scsi_cmnd(0xffff8803d5fff1c0),
cmnd[0x2a,0x 0,0x 0,0x 0,0x a,0xd0,0x 0,0x 0,0x 8,0x 0,0x 0,0x 0],
scsi_id = 0x 0, scsi_lun = 0x 1.
[574763.357267] scsi cmnd aborted, scsi_cmnd(0xffff8800712a1480),
cmnd[0x2a,0x 0,0x 0,0x 0,0x a,0xf0,0x 0,0x 0,0x 8,0x 0,0x 0,0x 0],
scsi_id = 0x 0, scsi_lun = 0x 1.
--------------------------SNIP----------------------------------------
[584376.272002] scsi cmnd aborted, scsi_cmnd(0xffff8802407f63c0),
cmnd[0x88,0x 0,0x 0,0x 0,0x 0,0x 2,0x62,0x44,0xe7,0x28,0x 0,0x 0],
scsi_id = 0x 0, scsi_lun = 0x 1.
[584376.286524] arcmsr1: executing eh bus reset .....num_resets = 2,
num_aborts = 497
[584376.612598] arcmsr1: wait 'abort all outstanding command' timeout
[587971.898239] EXT4-fs error (device sdb1): ext4_mb_generate_buddy:731:
group 4216017413 blocks in bitmap, 17416 in gd
[587971.908788] Aborting journal on device sdb1-8.
[587972.072416] EXT4-fs (sdb1): Remounting filesystem read-only
[587972.072513] EXT4-fs error (device sdb1): ext4_journal_start_sb:260:
Detected aborted journal
[587972.072518] EXT4-fs (sdb1): Remounting filesystem read-only
[587972.092489] EXT4-fs error (device sdb1) in
ext4_reserve_inode_write:5657: Journal has aborted
[587974.432255] EXT4-fs error (device sdb1) in ext4_evict_inode:210:
Journal has aborted
[587974.443945] EXT4-fs (sdb1): ext4_da_writepages: jbd2_start: 778
pages, ino 76182150; err -30
[587974.446615] EXT4-fs (sdb1): ext4_da_writepages: jbd2_start:
9223372036854775773 pages, ino 88873701; err -30

Any ideas why ext4 has this behavior when ext3 did not?


2011-12-24 08:21:34

by Sandon Van Ness

[permalink] [raw]
Subject: Re: [email protected] --- ext4 going read-only/journal abort when raid controller resets itself

Sorry I didn't initially send it in plaintext and had to start a new
email and somehow messed up the subject it should have been:

ext4 going read-only/journal abort when raid controller resets itself

On 12/24/2011 12:18 AM, Sandon Van Ness wrote:
> Most of our machines are ext3 and have seen the card get reset on ext3
> and it never went read-only like it always does in ext4 now. The I/O
> goes unresponsive for a few minutes as it detects I/O is unresponsive
> and then the controller is reset and the machine would recover (on
> ext3/jfs, and other fs's) on ext4 the journal is aborted and it goes
> into read-only:
>
> Anyone ever see something like this?
>
>
> [605458.429395] scsi cmnd aborted, scsi_cmnd(0xffff88041c4dac80),
> cmnd[0x28,0x 0,0xcb,0x 4,0x13,0x40,0x 0,0x 0,0x 8,0x 0,0x 0,0x 0],
> scsi_id = 0x 0, scsi_lun = 0x 1.
> [605458.444011] arcmsr1: executing eh bus reset .....num_resets = 0,
> num_aborts = 114
> [605458.451724] arcmsr1: executing hw bus reset .....
> [605480.472827] arcmsr1: waiting for hw bus reset return, retry=1
> [605500.486408] arcmsr1: waiting for hw bus reset return, retry=2
> [605520.516017] Areca RAID Controller1: F/W V1.49 2010-12-02 & Model
> ARC-1222
> [606900.929121] EXT4-fs error (device sdb1): mb_free_blocks:1397:
> group 39137block 1282445473:freeing already freed block (bit 4257)
> [606900.941415] Aborting journal on device sdb1-8.
> [606900.941561] EXT4-fs error (device sdb1) in ext4_setattr:5462:
> Readonly filesystem
> [606900.955051] Aborting journal on device sdb1-8.
>
> Seconds after the card is reset and recovers the journal is aborted
> and read-only. Here is another case where it happens even before the
> card is reset:
>
> [574763.342694] scsi cmnd aborted, scsi_cmnd(0xffff8803d5fff1c0),
> cmnd[0x2a,0x 0,0x 0,0x 0,0x a,0xd0,0x 0,0x 0,0x 8,0x 0,0x 0,0x 0],
> scsi_id = 0x 0, scsi_lun = 0x 1.
> [574763.357267] scsi cmnd aborted, scsi_cmnd(0xffff8800712a1480),
> cmnd[0x2a,0x 0,0x 0,0x 0,0x a,0xf0,0x 0,0x 0,0x 8,0x 0,0x 0,0x 0],
> scsi_id = 0x 0, scsi_lun = 0x 1.
> --------------------------SNIP----------------------------------------
> [584376.272002] scsi cmnd aborted, scsi_cmnd(0xffff8802407f63c0),
> cmnd[0x88,0x 0,0x 0,0x 0,0x 0,0x 2,0x62,0x44,0xe7,0x28,0x 0,0x 0],
> scsi_id = 0x 0, scsi_lun = 0x 1.
> [584376.286524] arcmsr1: executing eh bus reset .....num_resets = 2,
> num_aborts = 497
> [584376.612598] arcmsr1: wait 'abort all outstanding command' timeout
> [587971.898239] EXT4-fs error (device sdb1):
> ext4_mb_generate_buddy:731: group 4216017413 blocks in bitmap, 17416
> in gd
> [587971.908788] Aborting journal on device sdb1-8.
> [587972.072416] EXT4-fs (sdb1): Remounting filesystem read-only
> [587972.072513] EXT4-fs error (device sdb1):
> ext4_journal_start_sb:260: Detected aborted journal
> [587972.072518] EXT4-fs (sdb1): Remounting filesystem read-only
> [587972.092489] EXT4-fs error (device sdb1) in
> ext4_reserve_inode_write:5657: Journal has aborted
> [587974.432255] EXT4-fs error (device sdb1) in ext4_evict_inode:210:
> Journal has aborted
> [587974.443945] EXT4-fs (sdb1): ext4_da_writepages: jbd2_start: 778
> pages, ino 76182150; err -30
> [587974.446615] EXT4-fs (sdb1): ext4_da_writepages: jbd2_start:
> 9223372036854775773 pages, ino 88873701; err -30
>
> Any ideas why ext4 has this behavior when ext3 did not?
> --
> To unsubscribe from this list: send the line "unsubscribe linux-ext4" in
> the body of a message to [email protected]
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>


2011-12-24 17:03:22

by Theodore Ts'o

[permalink] [raw]
Subject: Re: [email protected] --- ext4 going read-only/journal abort when raid controller resets itself

On 12/24/2011 12:18 AM, Sandon Van Ness wrote:
>Most of our machines are ext3 and have seen the card get reset on
>ext3 and it never went read-only like it always does in ext4 now.
>The I/O goes unresponsive for a few minutes as it detects I/O is
>unresponsive and then the controller is reset and the machine
>would recover (on ext3/jfs, and other fs's) on ext4 the journal is
>aborted and it goes into read-only:

Both ext3 and ext4 will go abort the journal and remount the file
system read/only if it detects an inconsistency in the metadata. This
is the default behavior, and it is intended to protect the file system
from further damage leading to data loss.

So for example, if a RAID card hiccups and returns all zero's for a
block allocation bitmap, if ext3 or ext4 then tries to delete a file
and it discovers that when it tries to deallocate a block, that the
block bitmap already shows that the block is not in use, that's
considered a file system inconsistency. At that point, the default
behavior is that the file system will be remounted read-only, to
prevent the corrupted information from being written back to the disk,
or if the corruption was already on the disk, to prevent things from
getting worse.

That's what this is all about:

>[606900.929121] EXT4-fs error (device sdb1): mb_free_blocks:1397:
>group 39137block 1282445473:freeing already freed block (bit 4257)

Now, why wasn't this happening before on ext3? I can think of two
possible reasons. One is that the layout of a freshly created ext4
file system is different from that of a freshly created ext3 file
system. Specifically, the block allocation bitmaps for adjacent block
groups are laid out slightly differently. That may have caused some
*other* data or metadata block to be corrupted, which wasn't noticed
by the file system.

The other possibility is that the older tune2fs had the default
behavior when file system errors are discovered changed to something
else. For example, via the command "tune2fs -e continue /dev/sdXX".
This will put the file system in what I call, "Don't worry, be happy"
mode. It's NOT safe, but if uptime is more import than data
consistency, that's your decision....

In any case, the real issue seems to be that you have a hardware
problem. If your hardware raid card is aborting SCSI commands,
something is wrong, and you should fix this. The fact that ext4 is
remounting the file system read-only is because it's trying to protect
you. Complaining about that is like complaining about why the air
bags went off after your car suffers a head-on collision....

- Ted