Received: by 2002:a05:6902:102b:0:0:0:0 with SMTP id x11csp692176ybt; Wed, 24 Jun 2020 08:56:53 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxkHcVbshEOHYzNs7O5BtoZLiftW8HCto0tI8LTWkgvVhLpNxaXalJ2ZGdT3hCBq+FLOiWN X-Received: by 2002:a17:906:4ecf:: with SMTP id i15mr26566958ejv.515.1593014213261; Wed, 24 Jun 2020 08:56:53 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1593014213; cv=none; d=google.com; s=arc-20160816; b=g7kYxdXppg7JOd+PPbpDzxjgmKSsgwVHJIlitY9H5Qx+80WBA1N9Q9LR/DGTxJFK3x vsj94c7W3MwPKlWSM6mo1kpHymUrpqZH/XI1MyK0tqQa+nELfGFeNQviz4ssgq4eWpdv amuucrnXfD9DLeHfxKAO54/Skq2Qpj1q6ZAbCeWvESftwG06g46jhTS53/1n7VOhCbNE wwNAP9OdxaHjLRPBUdcKvp69p6LuAb/RVhjgDdFYXBHhmZNR9+PcF3p8jFrKGfsjdtwC 4oYffKp5HlYMAZgeE2XIiCpC9V8OB5JdBtEHfuBYBa3OXCf9Z516EvJbcBf0BXfk8/r7 36Kw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :dkim-signature; bh=ghha37SW6k4gGjGXppkOBZFDXuVjaWbchATsZGSx7HE=; b=Mmt/SdEKhrS9BGIior9p6IsM4xqDru6uiimcBFLTCkgNivbrmToa70oPsI2o7R15Ky Kg9nqHMynmQ4MjmWq1RHuQ1A42EQvg6Xta4GzYXwuROA45aCThuCGjQaaFnyOIo9sJyY WcvUrSamddqaZ3G6klcmV6N+s67tsLsP94SOhMO3gnUSpgETVzrDtEL9ZPpgFyynv0ue K7WSz2QG1Dw9Uzl5WcxKb/RAbIfgI7MpUfxyAUx5FQT1JOyDaFT6LtqCHVvTXAIRQQdA +OYnXgU/1Dacd6bBn//bwQ1+sTGPrAZHfD29LxZ4LkIZna7AV2P8RdRYYIgt1YhaJjHQ +r5Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=wlOmHpkv; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id z13si4601545eju.130.2020.06.24.08.56.28; Wed, 24 Jun 2020 08:56:53 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=wlOmHpkv; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2404579AbgFXPz5 (ORCPT + 99 others); Wed, 24 Jun 2020 11:55:57 -0400 Received: from mail.kernel.org ([198.145.29.99]:44732 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2404489AbgFXPz5 (ORCPT ); Wed, 24 Jun 2020 11:55:57 -0400 Received: from localhost (unknown [104.132.1.66]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 138E4206F7; Wed, 24 Jun 2020 15:55:56 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1593014156; bh=psUwwXM9mI+SwEdZqxZWgZzspbx1X33B7G8vF8nDbDA=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=wlOmHpkvWlninN9X+L/1oaWpmeBBITfzpu68jvKmd2kfh6TOHM3dFyG62D8v7jQD3 LKrG90OISpZD3F63ydSJpNZdshrLGSfLT9PLb5D4W18We73P1X06qUu/7tRXkBQwe4 DuuZ2+NT3pnpp8qzt78OYv2+4ETqFBQ7bx1rOOxQ= Date: Wed, 24 Jun 2020 08:55:55 -0700 From: Jaegeuk Kim To: Chao Yu Cc: linux-f2fs-devel@lists.sourceforge.net, linux-kernel@vger.kernel.org, chao@kernel.org Subject: Re: [PATCH 1/5] f2fs: fix to wait page writeback before update Message-ID: <20200624155555.GA215264@google.com> References: <20200618063625.110273-1-yuchao0@huawei.com> <20200618235932.GA227771@google.com> <20200619054922.GC227771@google.com> <3634ef79-5903-449d-0d52-3d5566481863@huawei.com> <20200619224755.GA60059@google.com> <3f49539a-7be1-be90-d13a-2f66a8483458@huawei.com> <20200621163834.GA36924@google.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 06/22, Chao Yu wrote: > On 2020/6/22 0:38, Jaegeuk Kim wrote: > > On 06/20, Chao Yu wrote: > >> On 2020/6/20 6:47, Jaegeuk Kim wrote: > >>> On 06/19, Chao Yu wrote: > >>>> On 2020/6/19 13:49, Jaegeuk Kim wrote: > >>>>> On 06/19, Chao Yu wrote: > >>>>>> Hi Jaegeuk, > >>>>>> > >>>>>> On 2020/6/19 7:59, Jaegeuk Kim wrote: > >>>>>>> Hi Chao, > >>>>>>> > >>>>>>> On 06/18, Chao Yu wrote: > >>>>>>>> to make page content stable for special device like raid. > >>>>>>> > >>>>>>> Could you elaborate the problem a bit? > >>>>>> > >>>>>> Some devices like raid5 wants page content to be stable, because > >>>>>> it will calculate parity info based page content, if page is not > >>>>>> stable, parity info could be corrupted, result in data inconsistency > >>>>>> in stripe. > >>>>> > >>>>> I don't get the point, since those pages are brand new pages which were not > >>>>> modified before. If it's on writeback, we should not modify them regardless > >>>>> of whatever raid configuration. For example, f2fs_new_node_page() waits for > >>>>> writeback. Am I missing something? > >>>> > >>>> I think we should use f2fs_bug_on(, PageWriteback()) rather than > >>>> f2fs_wait_on_page_writeback() for brand new page which is allocated just now. > >>>> For other paths, we can keep rule that waiting for writeback before updating. > >>>> > >>>> How do you think? > >>>> > >>>> Thanks, > >>>> > >>>>> > >>>>>> > >>>>>> Thanks, > >>>>>> > >>>>>>> > >>>>>>>> > >>>>>>>> Signed-off-by: Chao Yu > >>>>>>>> --- > >>>>>>>> fs/f2fs/dir.c | 2 ++ > >>>>>>>> fs/f2fs/extent_cache.c | 18 +++++++++--------- > >>>>>>>> fs/f2fs/f2fs.h | 2 +- > >>>>>>>> fs/f2fs/file.c | 1 + > >>>>>>>> fs/f2fs/inline.c | 2 ++ > >>>>>>>> fs/f2fs/inode.c | 3 +-- > >>>>>>>> 6 files changed, 16 insertions(+), 12 deletions(-) > >>>>>>>> > >>>>>>>> diff --git a/fs/f2fs/dir.c b/fs/f2fs/dir.c > >>>>>>>> index d35976785e8c..91e86747a604 100644 > >>>>>>>> --- a/fs/f2fs/dir.c > >>>>>>>> +++ b/fs/f2fs/dir.c > >>>>>>>> @@ -495,6 +495,8 @@ static int make_empty_dir(struct inode *inode, > >>>>>>>> if (IS_ERR(dentry_page)) > >>>>>>>> return PTR_ERR(dentry_page); > >>>>>>>> > >>>>>>>> + f2fs_bug_on(F2FS_I_SB(inode), PageWriteback(dentry_page)); > >>>>>>>> + > >>>>>>>> dentry_blk = page_address(dentry_page); > >>>>>>>> > >>>>>>>> make_dentry_ptr_block(NULL, &d, dentry_blk); > >>>>>>>> diff --git a/fs/f2fs/extent_cache.c b/fs/f2fs/extent_cache.c > >>>>>>>> index e60078460ad1..686c68b98610 100644 > >>>>>>>> --- a/fs/f2fs/extent_cache.c > >>>>>>>> +++ b/fs/f2fs/extent_cache.c > >>>>>>>> @@ -325,9 +325,10 @@ static void __drop_largest_extent(struct extent_tree *et, > >>>>>>>> } > >>>>>>>> > >>>>>>>> /* return true, if inode page is changed */ > >>>>>>>> -static bool __f2fs_init_extent_tree(struct inode *inode, struct f2fs_extent *i_ext) > >>>>>>>> +static void __f2fs_init_extent_tree(struct inode *inode, struct page *ipage) > >>>>>>>> { > >>>>>>>> struct f2fs_sb_info *sbi = F2FS_I_SB(inode); > >>>>>>>> + struct f2fs_extent *i_ext = ipage ? &F2FS_INODE(ipage)->i_ext : NULL; > >>>>>>>> struct extent_tree *et; > >>>>>>>> struct extent_node *en; > >>>>>>>> struct extent_info ei; > >>>>>>>> @@ -335,16 +336,18 @@ static bool __f2fs_init_extent_tree(struct inode *inode, struct f2fs_extent *i_e > >>>>>>>> if (!f2fs_may_extent_tree(inode)) { > >>>>>>>> /* drop largest extent */ > >>>>>>>> if (i_ext && i_ext->len) { > >>>>>>>> + f2fs_wait_on_page_writeback(ipage, NODE, true, true); > >>>>>>>> i_ext->len = 0; > >>>>>>>> - return true; > >>>>>>>> + set_page_dirty(ipage); > >>>>>>>> + return; > >>>>>>>> } > >>>>>>>> - return false; > >>>>>>>> + return; > >>>>>>>> } > >>>>>>>> > >>>>>>>> et = __grab_extent_tree(inode); > >>>>>>>> > >>>>>>>> if (!i_ext || !i_ext->len) > >>>>>>>> - return false; > >>>>>>>> + return; > >>>>>>>> > >>>>>>>> get_extent_info(&ei, i_ext); > >>>>>>>> > >>>>>>>> @@ -360,17 +363,14 @@ static bool __f2fs_init_extent_tree(struct inode *inode, struct f2fs_extent *i_e > >>>>>>>> } > >>>>>>>> out: > >>>>>>>> write_unlock(&et->lock); > >>>>>>>> - return false; > >>>>>>>> } > >>>>>>>> > >>>>>>>> -bool f2fs_init_extent_tree(struct inode *inode, struct f2fs_extent *i_ext) > >>>>>>>> +void f2fs_init_extent_tree(struct inode *inode, struct page *ipage) > >>>>>>>> { > >>>>>>>> - bool ret = __f2fs_init_extent_tree(inode, i_ext); > >>>>>>>> + __f2fs_init_extent_tree(inode, ipage); > >>>>>>>> > >>>>>>>> if (!F2FS_I(inode)->extent_tree) > >>>>>>>> set_inode_flag(inode, FI_NO_EXTENT); > >>>>>>>> - > >>>>>>>> - return ret; > >>>>>>>> } > >>>>>>>> > >>>>>>>> static bool f2fs_lookup_extent_tree(struct inode *inode, pgoff_t pgofs, > >>>>>>>> diff --git a/fs/f2fs/f2fs.h b/fs/f2fs/f2fs.h > >>>>>>>> index b35a50f4953c..326c12fa0da5 100644 > >>>>>>>> --- a/fs/f2fs/f2fs.h > >>>>>>>> +++ b/fs/f2fs/f2fs.h > >>>>>>>> @@ -3795,7 +3795,7 @@ struct rb_entry *f2fs_lookup_rb_tree_ret(struct rb_root_cached *root, > >>>>>>>> bool f2fs_check_rb_tree_consistence(struct f2fs_sb_info *sbi, > >>>>>>>> struct rb_root_cached *root); > >>>>>>>> unsigned int f2fs_shrink_extent_tree(struct f2fs_sb_info *sbi, int nr_shrink); > >>>>>>>> -bool f2fs_init_extent_tree(struct inode *inode, struct f2fs_extent *i_ext); > >>>>>>>> +void f2fs_init_extent_tree(struct inode *inode, struct page *ipage); > >>>>>>>> void f2fs_drop_extent_tree(struct inode *inode); > >>>>>>>> unsigned int f2fs_destroy_extent_node(struct inode *inode); > >>>>>>>> void f2fs_destroy_extent_tree(struct inode *inode); > >>>>>>>> diff --git a/fs/f2fs/file.c b/fs/f2fs/file.c > >>>>>>>> index 3268f8dd59bb..1862073b96d2 100644 > >>>>>>>> --- a/fs/f2fs/file.c > >>>>>>>> +++ b/fs/f2fs/file.c > >>>>>>>> @@ -1250,6 +1250,7 @@ static int __clone_blkaddrs(struct inode *src_inode, struct inode *dst_inode, > >>>>>>>> f2fs_put_page(psrc, 1); > >>>>>>>> return PTR_ERR(pdst); > >>>>>>>> } > >>>>>>>> + f2fs_wait_on_page_writeback(pdst, DATA, true, true); > >>> > >>> Do you mean pdst can be under writeback? > >> > >> Use f2fs_bug_on(, dirty || writeback) here? > >> > >>> > >>>>>>>> f2fs_copy_page(psrc, pdst); > >>>>>>>> set_page_dirty(pdst); > >>>>>>>> f2fs_put_page(pdst, 1); > >>>>>>>> diff --git a/fs/f2fs/inline.c b/fs/f2fs/inline.c > >>>>>>>> index dbade310dc79..4bcbc486c9e2 100644 > >>>>>>>> --- a/fs/f2fs/inline.c > >>>>>>>> +++ b/fs/f2fs/inline.c > >>>>>>>> @@ -340,6 +340,8 @@ int f2fs_make_empty_inline_dir(struct inode *inode, struct inode *parent, > >>>>>>>> struct f2fs_dentry_ptr d; > >>>>>>>> void *inline_dentry; > >>>>>>>> > >>>>>>>> + f2fs_wait_on_page_writeback(ipage, NODE, true, true); > >> > >> f2fs_bug_on(, writeback)? > > > > So, which case do you suspect unstable page for raid? > > - gc_node_segment > - f2fs_move_node_page > - __write_node_page > - set_page_writeback > > - do_read_inode > - f2fs_init_extent_tree > - __f2fs_init_extent_tree > i_ext->len = 0; Could you please add wait_on_writeback on this specific case only with this backtrace in the description? Thanks, > > > > >> > >> Thanks, > >> > >>>>>>>> + > >>>>>>>> inline_dentry = inline_data_addr(inode, ipage); > >>>>>>>> > >>>>>>>> make_dentry_ptr_inline(inode, &d, inline_dentry); > >>>>>>>> diff --git a/fs/f2fs/inode.c b/fs/f2fs/inode.c > >>>>>>>> index 44582a4db513..7c156eb26dd7 100644 > >>>>>>>> --- a/fs/f2fs/inode.c > >>>>>>>> +++ b/fs/f2fs/inode.c > >>>>>>>> @@ -367,8 +367,7 @@ static int do_read_inode(struct inode *inode) > >>>>>>>> fi->i_pino = le32_to_cpu(ri->i_pino); > >>>>>>>> fi->i_dir_level = ri->i_dir_level; > >>>>>>>> > >>>>>>>> - if (f2fs_init_extent_tree(inode, &ri->i_ext)) > >>>>>>>> - set_page_dirty(node_page); > >>>>>>>> + f2fs_init_extent_tree(inode, node_page); > >>>>>>>> > >>>>>>>> get_inline_info(inode, ri); > >>>>>>>> > >>>>>>>> -- > >>>>>>>> 2.18.0.rc1 > >>>>>>> . > >>>>>>> > >>>>> . > >>>>> > >>> . > >>> > > . > >