Received: by 2002:a25:ef43:0:0:0:0:0 with SMTP id w3csp33473ybm; Wed, 27 May 2020 18:23:08 -0700 (PDT) X-Google-Smtp-Source: ABdhPJy1Uf+pJ9aLhSqnpvU4GuroZ/KCSctcGR5AFs7FNTKwvAo6bKIPAOnb/t7T9hkN0DbrmxY3 X-Received: by 2002:a17:907:1002:: with SMTP id ox2mr882506ejb.189.1590628988006; Wed, 27 May 2020 18:23:08 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1590628987; cv=none; d=google.com; s=arc-20160816; b=HNQnTsAZ42i2EM7NDEvY3h7svb6VZe0A73GR+jZEuOy4J6N1VhqSycPVo5WpPOqNhO SWoIY0sOvhu2shgOTglvY+2CLAWt0zwIPQBY6HbJvt/qYV7deYRxO+F1CIeAOdtJ0OzP j4NXFRCUMzCfDD8ZlSz0qLbxMcmlwfe7yTAgFrSFYSipkx/nb1nZUxTqA/cL7D70kcML pPcsIHWJM+qrtkz/2qsY8QbVBEkZCHF/OzmT86hTkoJ7bmqLgiVaesbX6CZkzY5nDNkU b/UNJrxBLFsr7KC5oBjIQs5xhM6ewFqTrwJTaSTDo01dlPle4xQyBT+ehJAPu/zTYlrC naRw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=SXVPZPkrLna7Av7PfWfJsBR/39ZNN5K2ATCRvnThCXI=; b=bEkNNDCz87IyT/nUH1mi9pzBSjk/xOhTxNepSPsUo1XeCN9RqdPO5v9ThAuvOiIt3l UdkmEF72QQwl7dD/fSg8Y0u5ViRREnemYPtm9PoCoCNIfaXOND0MwjqC52S05tsjQuXf 6bv+QHz7nUiD6X1sZ3eXWUSSz55BdDRlhZ/tDSBlzU6hW9REcsDtz1n7ziJBQ9RRNwRX 8/p51izI7mirT3zEBnhxIrT0z9tD987QSoODbAqrsP2YY1LpA8Ni3Qn2YNKY07Y07MDD KJG7pdzoObilXb3PFyoTZ8apmF1rGy7UcTwk9QfHR+8IF0KXLkdqo9YDEOm3VUJiEK9o TTxA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id sb15si2988318ejb.478.2020.05.27.18.22.44; Wed, 27 May 2020 18:23:07 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726555AbgE1BUu (ORCPT + 99 others); Wed, 27 May 2020 21:20:50 -0400 Received: from szxga07-in.huawei.com ([45.249.212.35]:48924 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1725294AbgE1BUu (ORCPT ); Wed, 27 May 2020 21:20:50 -0400 Received: from DGGEMS406-HUB.china.huawei.com (unknown [172.30.72.58]) by Forcepoint Email with ESMTP id 1F5C510104ED80D4744C; Thu, 28 May 2020 09:20:47 +0800 (CST) Received: from [10.134.22.195] (10.134.22.195) by smtp.huawei.com (10.3.19.206) with Microsoft SMTP Server (TLS) id 14.3.487.0; Thu, 28 May 2020 09:20:41 +0800 Subject: Re: [f2fs-dev] [PATCH v3] f2fs: avoid inifinite loop to wait for flushing node pages at cp_error To: Jaegeuk Kim CC: , , References: <20200522144752.216197-1-jaegeuk@kernel.org> <20200522233243.GA94020@google.com> <20200525035655.GA135148@google.com> <565af47c-8364-d910-8d1c-93645c12e660@huawei.com> <20200525150608.GA55033@google.com> <92afae8b-2dd3-171a-562c-404a67f9aab2@huawei.com> <20200526015650.GA207949@google.com> <765a1ac5-a318-14d6-666f-eab46f892d01@huawei.com> <20200527205603.GB206249@google.com> From: Chao Yu Message-ID: Date: Thu, 28 May 2020 09:20:37 +0800 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: <20200527205603.GB206249@google.com> Content-Type: text/plain; charset="windows-1252" Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.134.22.195] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2020/5/28 4:56, Jaegeuk Kim wrote: > On 05/27, Chao Yu wrote: >> On 2020/5/26 9:56, Jaegeuk Kim wrote: >>> On 05/26, Chao Yu wrote: >>>> On 2020/5/26 9:11, Chao Yu wrote: >>>>> On 2020/5/25 23:06, Jaegeuk Kim wrote: >>>>>> On 05/25, Chao Yu wrote: >>>>>>> On 2020/5/25 11:56, Jaegeuk Kim wrote: >>>>>>>> Shutdown test is somtimes hung, since it keeps trying to flush dirty node pages >> >> 71.07% 0.01% kworker/u256:1+ [kernel.kallsyms] [k] wb_writeback >> | >> --71.06%--wb_writeback >> | >> |--68.96%--__writeback_inodes_wb >> | | >> | --68.95%--writeback_sb_inodes >> | | >> | |--65.08%--__writeback_single_inode >> | | | >> | | --64.35%--do_writepages >> | | | >> | | |--59.83%--f2fs_write_node_pages >> | | | | >> | | | --59.74%--f2fs_sync_node_pages >> | | | | >> | | | |--27.91%--pagevec_lookup_range_tag >> | | | | | >> | | | | --27.90%--find_get_pages_range_tag >> >> Before umount, kworker will always hold one core, that looks not reasonable, >> to avoid that, could we just allow node write, since it's out-place-update, >> and cp is not allowed, we don't need to worry about its effect on data on >> previous checkpoint, and it can decrease memory footprint cost by node pages. > > It can cause some roll-forward recovery? Yup, recovery should be considered, Later fsync() will fail due to: int f2fs_sync_file(struct file *file, loff_t start, loff_t end, int datasync) { if (unlikely(f2fs_cp_error(F2FS_I_SB(file_inode(file))))) return -EIO; And we need to adjust f2fs_fsync_node_pages() to handle in-process fsyncing node pages as well: if (f2fs_cp_error()) { set_fsync_mark(page, 0); set_dentry_mark(page, 0); if (atomic) { unlock & put page; ret = -EIO; break; } } ret = __write_node_page(); Thanks, > >> >> Thanks, >> >>>>>>> >>>>>>> IMO, for umount case, we should drop dirty reference and dirty pages on meta/data >>>>>>> pages like we change for node pages to avoid potential dead loop... >>>>>> >>>>>> I believe we're doing for them. :P >>>>> >>>>> Actually, I mean do we need to drop dirty meta/data pages explicitly as below: >>>>> >>>>> diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c >>>>> index 3dc3ac6fe143..4c08fd0a680a 100644 >>>>> --- a/fs/f2fs/checkpoint.c >>>>> +++ b/fs/f2fs/checkpoint.c >>>>> @@ -299,8 +299,15 @@ static int __f2fs_write_meta_page(struct page *page, >>>>> >>>>> trace_f2fs_writepage(page, META); >>>>> >>>>> - if (unlikely(f2fs_cp_error(sbi))) >>>>> + if (unlikely(f2fs_cp_error(sbi))) { >>>>> + if (is_sbi_flag_set(sbi, SBI_IS_CLOSE)) { >>>>> + ClearPageUptodate(page); >>>>> + dec_page_count(sbi, F2FS_DIRTY_META); >>>>> + unlock_page(page); >>>>> + return 0; >>>>> + } >>>>> goto redirty_out; >>>>> + } >>>>> if (unlikely(is_sbi_flag_set(sbi, SBI_POR_DOING))) >>>>> goto redirty_out; >>>>> if (wbc->for_reclaim && page->index < GET_SUM_BLOCK(sbi, 0)) >>>>> diff --git a/fs/f2fs/data.c b/fs/f2fs/data.c >>>>> index 48a622b95b76..94b342802513 100644 >>>>> --- a/fs/f2fs/data.c >>>>> +++ b/fs/f2fs/data.c >>>>> @@ -2682,6 +2682,12 @@ int f2fs_write_single_data_page(struct page *page, int *submitted, >>>>> >>>>> /* we should bypass data pages to proceed the kworkder jobs */ >>>>> if (unlikely(f2fs_cp_error(sbi))) { >>>>> + if (is_sbi_flag_set(sbi, SBI_IS_CLOSE)) { >>>>> + ClearPageUptodate(page); >>>>> + inode_dec_dirty_pages(inode); >>>>> + unlock_page(page); >>>>> + return 0; >>>>> + } >>>> >>>> Oh, I notice previously, we will drop non-directory inode's dirty pages directly, >>>> however, during umount, we'd better drop directory inode's dirty pages as well, right? >>> >>> Hmm, I remember I dropped them before. Need to double check. >>> >>>> >>>>> mapping_set_error(page->mapping, -EIO); >>>>> /* >>>>> * don't drop any dirty dentry pages for keeping lastest >>>>> >>>>>> >>>>>>> >>>>>>> Thanks, >>>>>>> >>>>>>>> in an inifinite loop. Let's drop dirty pages at umount in that case. >>>>>>>> >>>>>>>> Signed-off-by: Jaegeuk Kim >>>>>>>> --- >>>>>>>> v3: >>>>>>>> - fix wrong unlock >>>>>>>> >>>>>>>> v2: >>>>>>>> - fix typos >>>>>>>> >>>>>>>> fs/f2fs/node.c | 9 ++++++++- >>>>>>>> 1 file changed, 8 insertions(+), 1 deletion(-) >>>>>>>> >>>>>>>> diff --git a/fs/f2fs/node.c b/fs/f2fs/node.c >>>>>>>> index e632de10aedab..e0bb0f7e0506e 100644 >>>>>>>> --- a/fs/f2fs/node.c >>>>>>>> +++ b/fs/f2fs/node.c >>>>>>>> @@ -1520,8 +1520,15 @@ static int __write_node_page(struct page *page, bool atomic, bool *submitted, >>>>>>>> >>>>>>>> trace_f2fs_writepage(page, NODE); >>>>>>>> >>>>>>>> - if (unlikely(f2fs_cp_error(sbi))) >>>>>>>> + if (unlikely(f2fs_cp_error(sbi))) { >>>>>>>> + if (is_sbi_flag_set(sbi, SBI_IS_CLOSE)) { >>>>>>>> + ClearPageUptodate(page); >>>>>>>> + dec_page_count(sbi, F2FS_DIRTY_NODES); >>>>>>>> + unlock_page(page); >>>>>>>> + return 0; >>>>>>>> + } >>>>>>>> goto redirty_out; >>>>>>>> + } >>>>>>>> >>>>>>>> if (unlikely(is_sbi_flag_set(sbi, SBI_POR_DOING))) >>>>>>>> goto redirty_out; >>>>>>>> >>>>>> . >>>>>> >>>>> >>>>> >>>>> _______________________________________________ >>>>> Linux-f2fs-devel mailing list >>>>> Linux-f2fs-devel@lists.sourceforge.net >>>>> https://lists.sourceforge.net/lists/listinfo/linux-f2fs-devel >>>>> . >>>>> >>> . >>> > . >