From: bugme-daemon@bugzilla.kernel.org
Subject: [Bug 12821] filesystem corrupts on heavy I/O
Date: Fri, 13 Mar 2009 08:03:45 -0700 (PDT)
Message-ID: <20090313150345.7BA4B108042@picon.linux-foundation.org>
References:
To: linux-ext4@vger.kernel.org
Return-path:
Received: from smtp1.linux-foundation.org ([140.211.169.13]:43504 "EHLO smtp1.linux-foundation.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752191AbZCMPEw (ORCPT ); Fri, 13 Mar 2009 11:04:52 -0400
Received: from picon.linux-foundation.org (picon.linux-foundation.org [140.211.169.79]) by smtp1.linux-foundation.org (8.14.2/8.13.5/Debian-3ubuntu1.1) with ESMTP id n2DF3jXW001002 for ; Fri, 13 Mar 2009 08:04:21 -0700
In-Reply-To:
Sender: linux-ext4-owner@vger.kernel.org
List-ID:

http://bugzilla.kernel.org/show_bug.cgi?id=12821

------- Comment #15 from sandeen@redhat.com 2009-03-13 08:03 -------
(In reply to comment #14)
> I'm seeing a similar error with heavy read/write I/O on a 1TB ext4 volume. It's
> not clear what behavior triggers the error for me. Occasionally I see the error
> in dmesg...
>
> [ 7829.004269] EXT4-fs error (device sdb1): ext4_ext_search_right: bad header
> in inode #2491097: unexpected eh_depth - magic f30a, entries 78, max 340(0),
> depth 1(2)
> [ 7829.012197] mpage_da_map_blocks block allocation failed for inode 2491097 at
> logical offset 2788227 with max blocks 7 with error -5
> [ 7829.012220] This should not happen.!! Data will be lost
>
> ...and sync does not complete. It's not clear from the discussion if e2image
> needs to be done before the error occurs on a clean mount, or afterward while
> the system is up.

If you like, it's probably sufficient to point debugfs at the system and do

debugfs> stat <2491097>

to give us an idea of the layout of that file.

> When this happens the reboot procedure does not complete, and I power cycle the
> machine.

the fs may have gone readonly after the error, but I'm not sure it should have
hung up the system.

> After a power cycle, the journal replays and fsck completes, no
> inconsistencies. The files affected by data loss are part of a BitTorrent
> network download and after a Torrent data consistency check, I confirm that
> data has been lost. If I leave the Torrent active downloading for more than an
> hour or so, the ext4 errors occur.

Great, if you can reproduce it w/ bittorrent, can you please try the patch in
the attachments?

Thanks,
-Eric

--
Configure bugmail: http://bugzilla.kernel.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.