Received: by 2002:a25:ad19:0:0:0:0:0 with SMTP id y25csp8481255ybi; Tue, 23 Jul 2019 09:18:31 -0700 (PDT) X-Google-Smtp-Source: APXvYqy7WsbmpPV0gvZjXuE9zGrquavkW0Ep9qIRccjVmSI3oDMWBdiPATIBHpoXecoQnPlzyMc5 X-Received: by 2002:a62:2784:: with SMTP id n126mr6713037pfn.61.1563898710879; Tue, 23 Jul 2019 09:18:30 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1563898710; cv=none; d=google.com; s=arc-20160816; b=er+d/Mkd4jEGRzgpoeZKXsVbCzIaa/4mEksm7z/9rXyH5n7iVyIKT7xgnAM5qgmPCt 9lD7xMaa/Xi+5Ra6Ux+y/20EkNHzfpAAt/mJac3n1GvHLHHHQzCAEQXYUgp10BI7Tu8u vVQkvD/hII55+r9YXdXBEJPk4YkTKjKsXIjG6w+yZ3Clo/7vKl4hg6MYeGUvuWjg5qks gv+zZ1ZluGPdUENCu1MeaFGAOhC/3fHk/8KSKcqxNwxit1aH+1XUJls5o+EAO3KvLhpV bwPg0ql3I2KklOyNMrOku3BbRVYpGdc3ByXo5EefQXLw4emBMKyCxILPo1+JBWvwVzLI otEw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=Nu57GhlQQ2nUGqsOU60xpotERbvCWFvWDGGMcpdteJs=; b=dG8K/xXeEUdGsZ+HeuR1nFMMESV/MT7HDQQvn1kbF2M0dpfPHMfI6K4aLwJNgSv0LX u0ZjWrGwzLkqbpaYcsZbBQe+qMVPXeE3snmLX1BqeknAxlVMMqui2RtuW7DNhEHymdlV s+akd5pFlFVikmUk1ANcTcI6qEqdtj2hDY8QkoTdAKu9JroP4/yYXeqtYyKAW4tjaqHl 98gGnp6dq6+SdyzmOntVGX+k6NBnB+26VKvbW2RLNUBci6NVKlVvsJxz4/c7wfBOpo7U vt0pSO6S0GVt8//FGiKd9K7LU1VVTKIy4l5D2b8BtNpJ7CU9IrIxeKV7vxADBSgcSUIs tuYA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 3si1662523pfn.170.2019.07.23.09.18.14; Tue, 23 Jul 2019 09:18:30 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730433AbfGWGqz (ORCPT + 99 others); Tue, 23 Jul 2019 02:46:55 -0400 Received: from szxga07-in.huawei.com ([45.249.212.35]:55418 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1729552AbfGWGqz (ORCPT ); Tue, 23 Jul 2019 02:46:55 -0400 Received: from DGGEMS411-HUB.china.huawei.com (unknown [172.30.72.58]) by Forcepoint Email with ESMTP id CD062DD72BF5750FBEDE; Tue, 23 Jul 2019 14:46:52 +0800 (CST) Received: from [10.134.22.195] (10.134.22.195) by smtp.huawei.com (10.3.19.211) with Microsoft SMTP Server (TLS) id 14.3.439.0; Tue, 23 Jul 2019 14:46:48 +0800 Subject: Re: [f2fs-dev] [PATCH v2] f2fs: fix to read source block before invalidating it To: Jaegeuk Kim CC: , References: <20190718013718.70335-1-jaegeuk@kernel.org> <20190718031214.GA78336@jaegeuk-macbookpro.roam.corp.google.com> <19a25101-da74-de98-6ca4-a9fd9fa09ef2@huawei.com> <20190718040005.GA81995@jaegeuk-macbookpro.roam.corp.google.com> <91dbfa33-cda0-e6e7-d62f-6604939142d4@huawei.com> <20190723012721.GA60134@jaegeuk-macbookpro.roam.corp.google.com> From: Chao Yu Message-ID: <45a9f47b-337c-527c-420d-b29e77a8ed22@huawei.com> Date: Tue, 23 Jul 2019 14:46:47 +0800 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: <20190723012721.GA60134@jaegeuk-macbookpro.roam.corp.google.com> Content-Type: text/plain; charset="windows-1252" Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.134.22.195] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2019/7/23 9:27, Jaegeuk Kim wrote: > On 07/18, Chao Yu wrote: >> On 2019/7/18 12:00, Jaegeuk Kim wrote: >>> On 07/18, Chao Yu wrote: >>>> On 2019/7/18 11:12, Jaegeuk Kim wrote: >>>>> f2fs_allocate_data_block() invalidates old block address and enable new block >>>>> address. Then, if we try to read old block by f2fs_submit_page_bio(), it will >>>>> give WARN due to reading invalid blocks. >>>>> >>>>> Let's make the order sanely back. >>>> >>>> Hmm.. to avoid WARM, we may suffer one more memcpy, I suspect this can reduce >>>> online resize or foreground gc ioctl performance... >>> >>> I worried about performance tho, more concern came to me that there may exist a >>> chance that other thread can allocate and write something in old block address. >> >> Me too, however, previous invalid block address should be reused after a >> checkpoint, and checkpoint should have invalidated meta cache already, so there >> shouldn't be any race here. > > I think SSR can reuse that before checkpoint. Yes, I should have considered that when I introduced readahead feature for migration of block, we've kept invalidating meta page cache in old block address whenever the block address is not valid. quoted from ("f2fs: readahead encrypted block during GC") "Note that for OPU, truncation, deletion, we need to invalid meta page after we invalid old block address, to make sure we won't load invalid data from target meta page during encrypted block migration." But to avoid potential issue, how about just enable meta page cache during GC? that is saying we should truncate all valid meta cache after one section has been moved. One more concern is whether below case exists during SSR? - write 4k to fileA; - fsync fileA, 4k data is writebacked to lbaA; - write 4k to fileA; - kworker flushs 4k to lbaB; dnode contain lbaB didn't be persisted yet; - write 4k to fileB; - kworker flush 4k to lbaA due to SSR; - SPOR -> dnode with lbaA will be recovered, however lbaA contains fileB's data.. Thanks, > >> >> /* >> * invalidate intermediate page cache borrowed from meta inode >> * which are used for migration of encrypted inode's blocks. >> */ >> if (f2fs_sb_has_encrypt(sbi)) >> invalidate_mapping_pages(META_MAPPING(sbi), >> MAIN_BLKADDR(sbi), MAX_BLKADDR(sbi) - 1); >> >> Thanks, >> >>> >>>> >>>> Can we just relief to use DATA_GENERIC_ENHANCE_READ for this case...? >>> >>> We need to keep consistency for this api. >>> >>> Thanks, >>> >>>> >>>>> >>>>> Signed-off-by: Jaegeuk Kim >>>> >>>> Except performance, I'm okay with this change. >>>> >>>> Reviewed-by: Chao Yu >>>> >>>> Thanks, >>>> >>>>> --- >>>>> v2: >>>>> I was fixing the comments. :) >>>>> >>>>> fs/f2fs/gc.c | 70 +++++++++++++++++++++++++--------------------------- >>>>> 1 file changed, 34 insertions(+), 36 deletions(-) >>>>> >>>>> diff --git a/fs/f2fs/gc.c b/fs/f2fs/gc.c >>>>> index 6691f526fa40..8974672db78f 100644 >>>>> --- a/fs/f2fs/gc.c >>>>> +++ b/fs/f2fs/gc.c >>>>> @@ -796,6 +796,29 @@ static int move_data_block(struct inode *inode, block_t bidx, >>>>> if (lfs_mode) >>>>> down_write(&fio.sbi->io_order_lock); >>>>> >>>>> + mpage = f2fs_grab_cache_page(META_MAPPING(fio.sbi), >>>>> + fio.old_blkaddr, false); >>>>> + if (!mpage) >>>>> + goto up_out; >>>>> + >>>>> + fio.encrypted_page = mpage; >>>>> + >>>>> + /* read source block in mpage */ >>>>> + if (!PageUptodate(mpage)) { >>>>> + err = f2fs_submit_page_bio(&fio); >>>>> + if (err) { >>>>> + f2fs_put_page(mpage, 1); >>>>> + goto up_out; >>>>> + } >>>>> + lock_page(mpage); >>>>> + if (unlikely(mpage->mapping != META_MAPPING(fio.sbi) || >>>>> + !PageUptodate(mpage))) { >>>>> + err = -EIO; >>>>> + f2fs_put_page(mpage, 1); >>>>> + goto up_out; >>>>> + } >>>>> + } >>>>> + >>>>> f2fs_allocate_data_block(fio.sbi, NULL, fio.old_blkaddr, &newaddr, >>>>> &sum, CURSEG_COLD_DATA, NULL, false); >>>>> >>>>> @@ -803,44 +826,18 @@ static int move_data_block(struct inode *inode, block_t bidx, >>>>> newaddr, FGP_LOCK | FGP_CREAT, GFP_NOFS); >>>>> if (!fio.encrypted_page) { >>>>> err = -ENOMEM; >>>>> - goto recover_block; >>>>> - } >>>>> - >>>>> - mpage = f2fs_pagecache_get_page(META_MAPPING(fio.sbi), >>>>> - fio.old_blkaddr, FGP_LOCK, GFP_NOFS); >>>>> - if (mpage) { >>>>> - bool updated = false; >>>>> - >>>>> - if (PageUptodate(mpage)) { >>>>> - memcpy(page_address(fio.encrypted_page), >>>>> - page_address(mpage), PAGE_SIZE); >>>>> - updated = true; >>>>> - } >>>>> f2fs_put_page(mpage, 1); >>>>> - invalidate_mapping_pages(META_MAPPING(fio.sbi), >>>>> - fio.old_blkaddr, fio.old_blkaddr); >>>>> - if (updated) >>>>> - goto write_page; >>>>> - } >>>>> - >>>>> - err = f2fs_submit_page_bio(&fio); >>>>> - if (err) >>>>> - goto put_page_out; >>>>> - >>>>> - /* write page */ >>>>> - lock_page(fio.encrypted_page); >>>>> - >>>>> - if (unlikely(fio.encrypted_page->mapping != META_MAPPING(fio.sbi))) { >>>>> - err = -EIO; >>>>> - goto put_page_out; >>>>> - } >>>>> - if (unlikely(!PageUptodate(fio.encrypted_page))) { >>>>> - err = -EIO; >>>>> - goto put_page_out; >>>>> + goto recover_block; >>>>> } >>>>> >>>>> -write_page: >>>>> + /* write target block */ >>>>> f2fs_wait_on_page_writeback(fio.encrypted_page, DATA, true, true); >>>>> + memcpy(page_address(fio.encrypted_page), >>>>> + page_address(mpage), PAGE_SIZE); >>>>> + f2fs_put_page(mpage, 1); >>>>> + invalidate_mapping_pages(META_MAPPING(fio.sbi), >>>>> + fio.old_blkaddr, fio.old_blkaddr); >>>>> + >>>>> set_page_dirty(fio.encrypted_page); >>>>> if (clear_page_dirty_for_io(fio.encrypted_page)) >>>>> dec_page_count(fio.sbi, F2FS_DIRTY_META); >>>>> @@ -871,11 +868,12 @@ static int move_data_block(struct inode *inode, block_t bidx, >>>>> put_page_out: >>>>> f2fs_put_page(fio.encrypted_page, 1); >>>>> recover_block: >>>>> - if (lfs_mode) >>>>> - up_write(&fio.sbi->io_order_lock); >>>>> if (err) >>>>> f2fs_do_replace_block(fio.sbi, &sum, newaddr, fio.old_blkaddr, >>>>> true, true); >>>>> +up_out: >>>>> + if (lfs_mode) >>>>> + up_write(&fio.sbi->io_order_lock); >>>>> put_out: >>>>> f2fs_put_dnode(&dn); >>>>> out: >>>>> >>>> >>>> >>>> _______________________________________________ >>>> Linux-f2fs-devel mailing list >>>> Linux-f2fs-devel@lists.sourceforge.net >>>> https://lists.sourceforge.net/lists/listinfo/linux-f2fs-devel >>> . >>> > . >