Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754308AbXJWKHy (ORCPT ); Tue, 23 Oct 2007 06:07:54 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752195AbXJWKHV (ORCPT ); Tue, 23 Oct 2007 06:07:21 -0400 Received: from pentafluge.infradead.org ([213.146.154.40]:54580 "EHLO pentafluge.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751658AbXJWKHQ (ORCPT ); Tue, 23 Oct 2007 06:07:16 -0400 Subject: Re: [PATCH] reiserfs: don't drop PG_dirty when releasing sub-page-sized dirty file From: Peter Zijlstra To: Fengguang Wu Cc: Maxim Levitsky , linux-kernel@vger.kernel.org, Fengguang Wu , Andrew Morton , Jeff Mahoney , reiserfs-dev@namesys.com In-Reply-To: <393126119.26275@ustc.edu.cn> References: <200710220822.52370.maximlevitsky@gmail.com> <200710221258.11384.maximlevitsky@gmail.com> <393051953.24752@ustc.edu.cn> <200710221421.21439.maximlevitsky@gmail.com> <393126119.26275@ustc.edu.cn> Content-Type: text/plain Date: Tue, 23 Oct 2007 12:07:07 +0200 Message-Id: <1193134027.7406.1.camel@twins> Mime-Version: 1.0 X-Mailer: Evolution 2.10.1 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3895 Lines: 106 [ adding reiserfs devs to the CC ] On Tue, 2007-10-23 at 15:55 +0800, Fengguang Wu wrote: > This is not a new problem in 2.6.23-git17. > 2.6.22/2.6.23 is buggy in the same way. > > Reiserfs could leave newly created sub-page-size files in dirty state > for ever. They cannot be synced to disk by pdflush routines or > explicit `sync' commands. Only `umount' can do the trick. > > The direct cause is: the dirty page's PG_dirty is wrongly _cleared_. > Call trace: > [] cancel_dirty_page+0xd0/0xf0 > [] :reiserfs:reiserfs_cut_from_item+0x660/0x710 > [] :reiserfs:reiserfs_do_truncate+0x271/0x530 > [] :reiserfs:reiserfs_truncate_file+0xfd/0x3b0 > [] :reiserfs:reiserfs_file_release+0x1e0/0x340 > [] __fput+0xcc/0x1b0 > [] fput+0x16/0x20 > [] filp_close+0x56/0x90 > [] sys_close+0xad/0x110 > [] system_call+0x7e/0x83 > > Fix the bug by removing the cancel_dirty_page() call. Tests show that > it causes no bad behaviors on various write sizes. > > > === for the patient === > Here are more detailed demonstrations of the problem. > > 1) the page has both PG_dirty(D)/PAGECACHE_TAG_DIRTY(d) after being written to; > and then only PAGECACHE_TAG_DIRTY(d) remains after the file is closed. > > ------------------------------ screen 0 ------------------------------ > [T0] root /home/wfg# cat > /test/tiny > [T1] hi > [T2] root /home/wfg# > > ------------------------------ screen 1 ------------------------------ > [T1] root /home/wfg# echo /test/tiny > /proc/filecache > [T1] root /home/wfg# cat /proc/filecache > # file /test/tiny > # flags R:referenced A:active M:mmap U:uptodate D:dirty W:writeback O:owner B:buffer d:dirty w:writeback > # idx len state refcnt > 0 1 ___UD__Bd_ 2 > [T2] root /home/wfg# cat /proc/filecache > # file /test/tiny > # flags R:referenced A:active M:mmap U:uptodate D:dirty W:writeback O:owner B:buffer d:dirty w:writeback > # idx len state refcnt > 0 1 ___U___Bd_ 2 > > 2) note the non-zero 'cancelled_write_bytes' after /tmp/hi is copied. > > ------------------------------ screen 0 ------------------------------ > [T0] root /home/wfg# echo hi > /tmp/hi > [T1] root /home/wfg# cp /tmp/hi /dev/stdin /test > [T2] hi > [T3] root /home/wfg# > > ------------------------------ screen 1 ------------------------------ > [T1] root /proc/4397# cd /proc/`pidof cp` > [T1] root /proc/4713# cat io > rchar: 8396 > wchar: 3 > syscr: 20 > syscw: 1 > read_bytes: 0 > write_bytes: 20480 > cancelled_write_bytes: 4096 > [T2] root /proc/4713# cat io > rchar: 8399 > wchar: 6 > syscr: 21 > syscw: 2 > read_bytes: 0 > write_bytes: 24576 > cancelled_write_bytes: 4096 > > //Question: the 'write_bytes' is a bit more than expected ;-) > > Cc: Maxim Levitsky > Cc: Peter Zijlstra > Signed-off-by: Fengguang Wu > --- > fs/reiserfs/stree.c | 3 --- > 1 file changed, 3 deletions(-) > > --- linux-2.6.24-git17.orig/fs/reiserfs/stree.c > +++ linux-2.6.24-git17/fs/reiserfs/stree.c > @@ -1458,9 +1458,6 @@ static void unmap_buffers(struct page *p > } > bh = next; > } while (bh != head); > - if (PAGE_SIZE == bh->b_size) { > - cancel_dirty_page(page, PAGE_CACHE_SIZE); > - } > } > } > } > - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/