Received: by 2002:a05:6a10:9afc:0:0:0:0 with SMTP id t28csp344300pxm; Tue, 1 Mar 2022 22:44:21 -0800 (PST) X-Google-Smtp-Source: ABdhPJwsNMg9zhuKIa721GIi/yrc8qBqkw9aDe3O55Vo40E7ox6PgdqFwlJ1ZhdZKFontW/pZTdn X-Received: by 2002:a50:e08c:0:b0:407:cdc6:809e with SMTP id f12-20020a50e08c000000b00407cdc6809emr27472123edl.162.1646203460848; Tue, 01 Mar 2022 22:44:20 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1646203460; cv=none; d=google.com; s=arc-20160816; b=EFYXrckaDtavc+tTV4fP8i64UUnqfw+mYqiW4aFV15EvHuUqW17Vf0Ann4Mv4dnLHr Q2AXJyllBgbJgWIVgIx+bXodjdi4DhwqyfLI9HiFYsW7Yz9VDpAljP69iqHllsyqQ94E RrQ3ony9eGLQR1/3Om3AqSy8zgCpI5CGcHEKSUHHMZj9Tn2vJoYiVwUubOXQZajeiQDO gQPxJBYg4CkXUXii0bKy81YXATsRzMBpT7xlhZlRMdtrJCzRq2o5YBdf2GxsvhmiyuWF pbRVcLqKB2wk9dPIMp/aChm8vDUGMhgxZk7+gDyiZa0hmNsFnKQ8Xg+mg0t1j2HqwIpl 7DgQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:references :cc:to:from:content-language:subject:user-agent:mime-version:date :message-id:dkim-signature; bh=5ZsmwPmVly2Iw/MIDcVTsM5XFmYyhdBXWuGIAcyfWzY=; b=KgZihjWM9FWobV3TG+nhsxiFz96hDF9bur4B50RafMH4SsG3HNvFeVkdwSuzkKpPkM yFS12oHXnIp2UBvwoZAVZVrylkcpcfErX6EfUbxvCOVb8Pic9lK6jaaPLmaBVd/lMhar vyA8ezaCQAwub4m0F315G1/sqXHig3Ln2j7W68N8CI42F8mjhQVYe13ydLfmCpNJHVEB pDFKexM2SrlV1LIihoqmXVkrzhwMDf2VLDV0nhMQxyWaBiz4Gq6KQyRjfT7zXIYYn1cf dEtHzXtEZzExr5wcCHnX7m4Xx4LTHQrxSfh8O66X/8UeX7OKhl18pcwy86o2yaJpV27Y mLig== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=HekcxkIT; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id v5-20020aa7dbc5000000b00412e5aa522bsi9865988edt.434.2022.03.01.22.43.57; Tue, 01 Mar 2022 22:44:20 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=HekcxkIT; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S239350AbiCBDdS (ORCPT + 99 others); Tue, 1 Mar 2022 22:33:18 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:34506 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229911AbiCBDdQ (ORCPT ); Tue, 1 Mar 2022 22:33:16 -0500 Received: from dfw.source.kernel.org (dfw.source.kernel.org [139.178.84.217]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id E34D114096 for ; Tue, 1 Mar 2022 19:32:32 -0800 (PST) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id 8AF1761763 for ; Wed, 2 Mar 2022 03:32:31 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 744ACC340EE; Wed, 2 Mar 2022 03:32:29 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1646191950; bh=tw2FLwUKv/LJosyP7l26hDKZUXqpREWG4hH9qMm6hVQ=; h=Date:Subject:From:To:Cc:References:In-Reply-To:From; b=HekcxkIT5USwEq4gDBMDMEHvLjLfpBwrolf7gAibCI46vKz0PpqfZOHrUcLfARs8c q5z1nFCueunfJa2qs2TFVeff/FFiRQm970lTHm5oaOl1p+GdNA2YVBPkyRAv7Jx6gS nYPRnPrkZQzr7YTaxIcyj3Vdo6EmPr5q8Kp5eKWG6ULadFpGX9G7z4M/e0uBsXEoHI W2PwNFe9eeREdY8oV/d4ujh54DpjXYa+GBZiXGpsWlImQ2vNU6L9E6qg7Yp2AhUkz2 ltF0xoSQZssaAt2ztY4vBI/2hlzHDDeNKSdANoi/3PDStXpibyjRZFsU5/p4ZxU/bs fw4kh31K0RhJg== Message-ID: <51826b5f-e480-994a-4a72-39ff4572bb3f@kernel.org> Date: Wed, 2 Mar 2022 11:32:27 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Thunderbird/91.6.1 Subject: Re: [f2fs-dev] [PATCH] f2fs: fix to avoid potential deadlock Content-Language: en-US From: Chao Yu To: Jaegeuk Kim Cc: Jing Xia , linux-f2fs-devel@lists.sourceforge.net, Zhiguo Niu , linux-kernel@vger.kernel.org References: <20220127054449.24711-1-chao@kernel.org> <51be77f1-6e85-d46d-d0d3-c06d2055a190@kernel.org> <86a175d3-c438-505b-1dbc-4ef6e8b5adcb@kernel.org> <5b5e20d1-877f-b321-b341-c0f233ee976c@kernel.org> In-Reply-To: <5b5e20d1-877f-b321-b341-c0f233ee976c@kernel.org> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-7.5 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,NICE_REPLY_A, RCVD_IN_DNSWL_HI,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org ping, On 2022/2/25 11:02, Chao Yu wrote: > On 2022/2/3 22:57, Chao Yu wrote: >> On 2022/2/3 9:51, Jaegeuk Kim wrote: >>> On 01/29, Chao Yu wrote: >>>> On 2022/1/29 8:37, Jaegeuk Kim wrote: >>>>> On 01/28, Chao Yu wrote: >>>>>> On 2022/1/28 5:59, Jaegeuk Kim wrote: >>>>>>> On 01/27, Chao Yu wrote: >>>>>>>> Quoted from Jing Xia's report, there is a potential deadlock may happen >>>>>>>> between kworker and checkpoint as below: >>>>>>>> >>>>>>>> [T:writeback]                [T:checkpoint] >>>>>>>> - wb_writeback >>>>>>>>     - blk_start_plug >>>>>>>> bio contains NodeA was plugged in writeback threads >>>>>>> >>>>>>> I'm still trying to understand more precisely. So, how is it possible to >>>>>>> have bio having node write in this current context? >>>>>> >>>>>> IMO, after above blk_start_plug(), it may plug some inode's node page in kworker >>>>>> during writebacking node_inode's data page (which should be node page)? >>>>> >>>>> Wasn't that added into a different task->plug? >>>> >>>> I'm not sure I've got your concern correctly... >>>> >>>> Do you mean NodeA and other IOs from do_writepages() were plugged in >>>> different local plug variables? >>> >>> I think so. >> >> I guess block plug helper says it doesn't allow to use nested plug, so there >> is only one plug in kworker thread? >> >> void blk_start_plug_nr_ios(struct blk_plug *plug, unsigned short nr_ios) >> { >>      struct task_struct *tsk = current; >> >>      /* >>       * If this is a nested plug, don't actually assign it. >>       */ >>      if (tsk->plug) >>          return; >> ... >> } > > Any further comments? > > Thanks, > >> >> Thanks, >> >>> >>>> >>>> Thanks, >>>> >>>>> >>>>>> >>>>>> Thanks, >>>>>> >>>>>>> >>>>>>>>                     - do_writepages  -- sync write inodeB, inc wb_sync_req[DATA] >>>>>>>>                      - f2fs_write_data_pages >>>>>>>>                       - f2fs_write_single_data_page -- write last dirty page >>>>>>>>                        - f2fs_do_write_data_page >>>>>>>>                         - set_page_writeback  -- clear page dirty flag and >>>>>>>>                         PAGECACHE_TAG_DIRTY tag in radix tree >>>>>>>>                         - f2fs_outplace_write_data >>>>>>>>                          - f2fs_update_data_blkaddr >>>>>>>>                           - f2fs_wait_on_page_writeback -- wait NodeA to writeback here >>>>>>>>                        - inode_dec_dirty_pages >>>>>>>>     - writeback_sb_inodes >>>>>>>>      - writeback_single_inode >>>>>>>>       - do_writepages >>>>>>>>        - f2fs_write_data_pages -- skip writepages due to wb_sync_req[DATA] >>>>>>>>         - wbc->pages_skipped += get_dirty_pages() -- PAGECACHE_TAG_DIRTY is not set but get_dirty_pages() returns one >>>>>>>>      - requeue_inode -- requeue inode to wb->b_dirty queue due to non-zero.pages_skipped >>>>>>>>     - blk_finish_plug >>>>>>>> >>>>>>>> Let's try to avoid deadlock condition by forcing unplugging previous bio via >>>>>>>> blk_finish_plug(current->plug) once we'v skipped writeback in writepages() >>>>>>>> due to valid sbi->wb_sync_req[DATA/NODE]. >>>>>>>> >>>>>>>> Fixes: 687de7f1010c ("f2fs: avoid IO split due to mixed WB_SYNC_ALL and WB_SYNC_NONE") >>>>>>>> Signed-off-by: Zhiguo Niu >>>>>>>> Signed-off-by: Jing Xia >>>>>>>> Signed-off-by: Chao Yu >>>>>>>> --- >>>>>>>>     fs/f2fs/data.c | 6 +++++- >>>>>>>>     fs/f2fs/node.c | 6 +++++- >>>>>>>>     2 files changed, 10 insertions(+), 2 deletions(-) >>>>>>>> >>>>>>>> diff --git a/fs/f2fs/data.c b/fs/f2fs/data.c >>>>>>>> index 76d6fe7b0c8f..932a4c81acaf 100644 >>>>>>>> --- a/fs/f2fs/data.c >>>>>>>> +++ b/fs/f2fs/data.c >>>>>>>> @@ -3174,8 +3174,12 @@ static int __f2fs_write_data_pages(struct address_space *mapping, >>>>>>>>         /* to avoid spliting IOs due to mixed WB_SYNC_ALL and WB_SYNC_NONE */ >>>>>>>>         if (wbc->sync_mode == WB_SYNC_ALL) >>>>>>>>             atomic_inc(&sbi->wb_sync_req[DATA]); >>>>>>>> -    else if (atomic_read(&sbi->wb_sync_req[DATA])) >>>>>>>> +    else if (atomic_read(&sbi->wb_sync_req[DATA])) { >>>>>>>> +        /* to avoid potential deadlock */ >>>>>>>> +        if (current->plug) >>>>>>>> +            blk_finish_plug(current->plug); >>>>>>>>             goto skip_write; >>>>>>>> +    } >>>>>>>>         if (__should_serialize_io(inode, wbc)) { >>>>>>>>             mutex_lock(&sbi->writepages); >>>>>>>> diff --git a/fs/f2fs/node.c b/fs/f2fs/node.c >>>>>>>> index 556fcd8457f3..69c6bcaf5aae 100644 >>>>>>>> --- a/fs/f2fs/node.c >>>>>>>> +++ b/fs/f2fs/node.c >>>>>>>> @@ -2106,8 +2106,12 @@ static int f2fs_write_node_pages(struct address_space *mapping, >>>>>>>>         if (wbc->sync_mode == WB_SYNC_ALL) >>>>>>>>             atomic_inc(&sbi->wb_sync_req[NODE]); >>>>>>>> -    else if (atomic_read(&sbi->wb_sync_req[NODE])) >>>>>>>> +    else if (atomic_read(&sbi->wb_sync_req[NODE])) { >>>>>>>> +        /* to avoid potential deadlock */ >>>>>>>> +        if (current->plug) >>>>>>>> +            blk_finish_plug(current->plug); >>>>>>>>             goto skip_write; >>>>>>>> +    } >>>>>>>>         trace_f2fs_writepages(mapping->host, wbc, NODE); >>>>>>>> -- >>>>>>>> 2.32.0 >> >> >> _______________________________________________ >> Linux-f2fs-devel mailing list >> Linux-f2fs-devel@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/linux-f2fs-devel > > > _______________________________________________ > Linux-f2fs-devel mailing list > Linux-f2fs-devel@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/linux-f2fs-devel