Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754050AbbL3AFU (ORCPT ); Tue, 29 Dec 2015 19:05:20 -0500 Received: from mail.kernel.org ([198.145.29.136]:50815 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753493AbbL3AFQ (ORCPT ); Tue, 29 Dec 2015 19:05:16 -0500 Date: Tue, 29 Dec 2015 16:05:13 -0800 From: Jaegeuk Kim To: Chao Yu Cc: linux-f2fs-devel@lists.sourceforge.net, linux-kernel@vger.kernel.org Subject: Re: [PATCH 2/2] f2fs: support revoking atomic written pages Message-ID: <20151230000513.GA13809@jaegeuk.local> References: <00a901d141e6$e42ec950$ac8c5bf0$@samsung.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <00a901d141e6$e42ec950$ac8c5bf0$@samsung.com> User-Agent: Mutt/1.5.23 (2014-03-12) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3071 Lines: 87 Hi Chao, On Tue, Dec 29, 2015 at 11:12:36AM +0800, Chao Yu wrote: > f2fs support atomic write with following semantics: > 1. open db file > 2. ioctl start atomic write > 3. (write db file) * n > 4. ioctl commit atomic write > 5. close db file > > With this flow we can avoid file becoming corrupted when abnormal power > cut, because we hold data of transaction in referenced pages linked in > inmem_pages list of inode, but without setting them dirty, so these data > won't be persisted unless we commit them in step 4. > > But we should still hold journal db file in memory by using volatile write, > because our semantics of 'atomic write support' is not full, in step 4, we > could be fail to submit all dirty data of transaction, once partial dirty > data was committed in storage, db file should be corrupted, in this case, > we should use journal db to recover the original data in db file. Originally, IOC_ABORT_VOLATILE_WRITE was supposed to handle commit failures, since database should get its error literally. So, the only thing that we need to do is keeping journal data for further db recovery. But, unfortunately, it seems that something is missing in the current implementation. So simply how about this? A possible flow would be: 1. write journal data to volatile space 2. write db data to atomic space 3. in the error case, call ioc_abort_volatile_writes for both journal and db - flush/fsync journal data to disk - drop atomic data, and will be recovered by database with journal >From cb33fc8bc30981c370ec70fe68871130109793ec Mon Sep 17 00:00:00 2001 From: Jaegeuk Kim Date: Tue, 29 Dec 2015 15:46:33 -0800 Subject: [PATCH] f2fs: fix f2fs_ioc_abort_volatile_write There are two rules to handle aborting volatile or atomic writes. 1. drop atomic writes - we don't need to keep any stale db data. 2. write journal data - we should keep the journal data with fsync for db recovery. Signed-off-by: Jaegeuk Kim --- fs/f2fs/file.c | 13 ++++++++++--- 1 file changed, 10 insertions(+), 3 deletions(-) diff --git a/fs/f2fs/file.c b/fs/f2fs/file.c index 91f576a..d16438a 100644 --- a/fs/f2fs/file.c +++ b/fs/f2fs/file.c @@ -1433,9 +1433,16 @@ static int f2fs_ioc_abort_volatile_write(struct file *filp) if (ret) return ret; - clear_inode_flag(F2FS_I(inode), FI_ATOMIC_FILE); - clear_inode_flag(F2FS_I(inode), FI_VOLATILE_FILE); - commit_inmem_pages(inode, true); + if (f2fs_is_atomic_file(inode)) { + clear_inode_flag(F2FS_I(inode), FI_ATOMIC_FILE); + commit_inmem_pages(inode, true); + } + if (f2fs_is_volatile_file(inode)) { + clear_inode_flag(F2FS_I(inode), FI_VOLATILE_FILE); + ret = commit_inmem_pages(inode, false); + if (!ret) + ret = f2fs_sync_file(filp, 0, LLONG_MAX, 0); + } mnt_drop_write_file(filp); return ret; -- 2.6.3 -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/