Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752696AbbHUMtk (ORCPT ); Fri, 21 Aug 2015 08:49:40 -0400 Received: from mailout2.samsung.com ([203.254.224.25]:58179 "EHLO mailout2.samsung.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751812AbbHUMti (ORCPT ); Fri, 21 Aug 2015 08:49:38 -0400 X-AuditID: cbfee61a-f79a06d000005c6f-1f-55d71e605ba0 From: Chao Yu To: "'Jaegeuk Kim'" Cc: linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org, linux-f2fs-devel@lists.sourceforge.net References: <1439887589-35190-1-git-send-email-jaegeuk@kernel.org> <1439887589-35190-5-git-send-email-jaegeuk@kernel.org> <01bb01d0db28$61844ea0$248cebe0$@samsung.com> <20150820153523.GA42028@jaegeuk-mac02.mot-mobility.com> In-reply-to: <20150820153523.GA42028@jaegeuk-mac02.mot-mobility.com> Subject: RE: [f2fs-dev] [PATCH 5/5] f2fs: check the node block address of newly allocated nid Date: Fri, 21 Aug 2015 20:48:48 +0800 Message-id: <021501d0dc0f$d365f900$7a31eb00$@samsung.com> MIME-version: 1.0 Content-type: text/plain; charset=us-ascii Content-transfer-encoding: 7bit X-Mailer: Microsoft Outlook 14.0 Thread-index: AQMbK+U27TEaBuzOkuL9EYWgZcyS+gL3ywKSAcOoX9wC3KaX2JtE94sQ Content-language: zh-cn X-Brightmail-Tracker: H4sIAAAAAAAAA+NgFnrOLMWRmVeSWpSXmKPExsVy+t9jQd0EueuhBtd/CVk8WT+L2eLSIneL PXtPslhc3jWHzYHFY9OqTjaP3Qs+M3l83iQXwBzFZZOSmpNZllqkb5fAlfFs5mfGgsfKFWvW XWdrYGyR6WLk5JAQMJFYNvMeG4QtJnHh3nogm4tDSGAWo0T7/TuMXYwcQM4rRokdESA1bAIq Ess7/jOB2CICahK9+6aA2cwCmRIT+l+wQ/Q+ZZSYufcQO0iCU8BF4uH0B0wgc4QF4iXeXlAC CbMIqEosv/qLBcTmFbCUOLZ3JROELSjxY/I9FoiZWhLrdx6Hmi8vsXnNW2aIOxUkdpx9zQhx g5vE8ac3mSFqxCU2HrnFMoFRaBaSUbOQjJqFZNQsJC0LGFlWMUqkFiQXFCel5xrmpZbrFSfm Fpfmpesl5+duYgQH/zOpHYwHd7kfYhTgYFTi4Z0ReS1UiDWxrLgy9xCjBAezkgiv9QugEG9K YmVValF+fFFpTmrxIUZpDhYlcV7ZDZtDhQTSE0tSs1NTC1KLYLJMHJxSDYwRD/z1vvM+erjo uedEycP8rtyf7eKutzA9fTGfM8Tv38zZWsn8SS8mlnKuClSWjLLp4lp/TrTy1TX2a+LdqzoZ pBkv2ha9C70gb7HP5k2ay/a5/U6rg7+f1uh5X6fZ/Cc1giuuYsnFlVPqmpaVOgueLlfuPLrC lFv7ysls2xcyocmqUyMOtyuxFGckGmoxFxUnAgCHLzOFegIAAA== Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 4697 Lines: 132 Hi Jaegeuk, > -----Original Message----- > From: Jaegeuk Kim [mailto:jaegeuk@kernel.org] > Sent: Thursday, August 20, 2015 11:35 PM > To: Chao Yu > Cc: linux-kernel@vger.kernel.org; linux-fsdevel@vger.kernel.org; > linux-f2fs-devel@lists.sourceforge.net > Subject: Re: [f2fs-dev] [PATCH 5/5] f2fs: check the node block address of newly allocated nid > > On Thu, Aug 20, 2015 at 05:12:03PM +0800, Chao Yu wrote: > > Hi Jaegeuk, > > > > > -----Original Message----- > > > From: Jaegeuk Kim [mailto:jaegeuk@kernel.org] > > > Sent: Tuesday, August 18, 2015 4:46 PM > > > To: linux-kernel@vger.kernel.org; linux-fsdevel@vger.kernel.org; > > > linux-f2fs-devel@lists.sourceforge.net > > > Cc: Jaegeuk Kim > > > Subject: [f2fs-dev] [PATCH 5/5] f2fs: check the node block address of newly allocated nid > > > > > > This patch adds a routine which checks the block address of newly allocated nid. > > > If an nid has already allocated by other thread due to subtle data races, it > > > will result in filesystem corruption. > > > So, it needs to check whether its block address was already allocated or not > > > in prior to nid allocation as the last chance. > > > > > > Signed-off-by: Jaegeuk Kim > > > --- > > > fs/f2fs/node.c | 9 +++++++++ > > > 1 file changed, 9 insertions(+) > > > > > > diff --git a/fs/f2fs/node.c b/fs/f2fs/node.c > > > index 3cc32b8..6bef5a2 100644 > > > --- a/fs/f2fs/node.c > > > +++ b/fs/f2fs/node.c > > > @@ -1573,6 +1573,8 @@ retry: > > > > > > /* We should not use stale free nids created by build_free_nids */ > > > if (nm_i->fcnt && !on_build_free_nids(nm_i)) { > > > + struct node_info ni; > > > + > > > f2fs_bug_on(sbi, list_empty(&nm_i->free_nid_list)); > > > list_for_each_entry(i, &nm_i->free_nid_list, list) > > > if (i->state == NID_NEW) > > > @@ -1583,6 +1585,13 @@ retry: > > > i->state = NID_ALLOC; > > > nm_i->fcnt--; > > > spin_unlock(&nm_i->free_nid_list_lock); > > > + > > > + /* check nid is allocated already */ > > > + get_node_info(sbi, *nid, &ni); > > > + if (ni.blk_addr != NULL_ADDR) { > > > > I didn't get it, why free nid is with non-NULL blkaddr? > > Could you please explain more about this? > > As I wrote in the description, I've been suffering from wrongly added free nids > which results in fs corruption. I suspect somewhat race condition in > build_free_nids, but it is very subtle to figure out exactly. > So, I wrote this patch to fix that. > > The concern would be performance regarding to cold cache miss at an NAT entry. > However, I expect that it would be tolerable since get_node_info will be called > after alloc_nid later. After investigating, I think I can reproduce this bug: 1. touch a (nid = 4) & touch b (nid = 5) 2. sync 3. rm a & rm b a) rm a to make next_scan_nid = 4. b) I change the logical of f2fs code making remove_inode_page failed when file b is being removed, so file b's nat entry is not set dirty; 4. sync 5. touch 1815 files 6. echo 3 > /proc/sys/vm/drop_caches drop clean nat entry of inode (nid:5), it makes we can pass blkaddr verification in add_free_nid: if (build) { /* do not add allocated nids */ 7. touch c because there is no free nids in cache, we try to build cache by two steps: a) build nids by loading from nat pages; b) build nids by loading from curseg and try to unload nids which has valid blkaddr in curseg. unfortunately, our build operation is not atomic, so after step a), nid:5 should be in free nids cache and it should be removed in step b). So all free nids allocated between step a) and step b) can be risky of incorrect allocation. If I'm not miss something, the root casue looks like our recent change: allocate free nid aggressively. Thanks, > > > > > > + alloc_nid_done(sbi, *nid); > > > > Will another thread call alloc_nid_done too, making this free nid being > > released again? > > No, its state became NID_ALLOC, so no other thread can pick this up till > alloc_nid_done is called. > > Thanks, > > > > > Thanks, > > > > > + goto retry; > > > + } > > > return true; > > > } > > > spin_unlock(&nm_i->free_nid_list_lock); > > > -- > > > 2.1.1 > > > > > > > > > ------------------------------------------------------------------------------ > > > _______________________________________________ > > > Linux-f2fs-devel mailing list > > > Linux-f2fs-devel@lists.sourceforge.net > > > https://lists.sourceforge.net/lists/listinfo/linux-f2fs-devel -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/