From: Eric Whitney
Subject: [PATCH] ext4: fix loss of delalloc extent info in ext4_zero_range()
Date: Fri, 20 Mar 2015 19:53:50 -0400
Message-ID: <20150320235350.GA10101@wallace>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Cc: tytso@mit.edu
To: linux-ext4@vger.kernel.org
Sender: linux-ext4-owner@vger.kernel.org

In ext4_zero_range(), removing a file's entire block range from the
extent status tree removes all records of that file's delalloc extents.
The delalloc accounting code uses this information, and its loss can
then lead to accounting errors and kernel warnings at writeback time and
subsequent file system damage.  This is most noticeable on bigalloc
file systems where code in ext4_ext_map_blocks() handles cases where
delalloc extents share clusters with a newly allocated extent.

Because we're not deleting a block range and are correctly updating the
status of its associated extent, there is no need to remove anything
from the extent status tree.

When this patch is combined with an unrelated bug fix for
ext4_zero_range(), kernel warnings and e2fsck errors reported during
xfstests runs on bigalloc filesystems are greatly reduced without
introducing regressions on other xfstests-bld test scenarios.

Signed-off-by: Eric Whitney
---
 fs/ext4/extents.c | 13 -------------
 1 file changed, 13 deletions(-)

diff --git a/fs/ext4/extents.c b/fs/ext4/extents.c
index bed4308..c187cc3 100644
--- a/fs/ext4/extents.c
+++ b/fs/ext4/extents.c
@@ -4847,19 +4847,6 @@ static long ext4_zero_range(struct file *file, loff_t offset,
 					     flags, mode);
 		if (ret)
 			goto out_dio;
-		/*
-		 * Remove entire range from the extent status tree.
-		 *
-		 * ext4_es_remove_extent(inode, lblk, max_blocks) is
-		 * NOT sufficient.  I'm not sure why this is the case,
-		 * but let's be conservative and remove the extent
-		 * status tree for the entire inode.  There should be
-		 * no outstanding delalloc extents thanks to the
-		 * filemap_write_and_wait_range() call above.
-		 */
-		ret = ext4_es_remove_extent(inode, 0, EXT_MAX_BLOCKS);
-		if (ret)
-			goto out_dio;
 	}
 	if (!partial_begin && !partial_end)
 		goto out_dio;
-- 
2.1.0