Received: by 2002:a25:c593:0:0:0:0:0 with SMTP id v141csp339777ybe; Wed, 18 Sep 2019 18:28:54 -0700 (PDT) X-Google-Smtp-Source: APXvYqzaL7joA7fX3VX4+hUp4PjC6qdUOSgPVmWAvlaxv+mSc+BlajE8xb3fziUpEElz0YtD2Pn5 X-Received: by 2002:a50:ed17:: with SMTP id j23mr5312959eds.248.1568856534860; Wed, 18 Sep 2019 18:28:54 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1568856534; cv=none; d=google.com; s=arc-20160816; b=I/S2jOtkPKubLWKGjQfMANo7SwVEByh6vRDJuuFXxcQkaJAYx2LvnOf+rIkqWUDyVa 7mg+jVNLhQaLTS3SzeFy1CAEbFQKnaEYWjL+b8O+AJe9D8IZbwRRu259MC1YdAzVtrDA GSOnPD4LKZWBHvyu5pZifAocsAEUPTORxUtGJcAQ1vHSs3J1g1a+1SAWGTH7fwIcSqCP tIdZKmPBPZ3V0cFCES8m/5yLV427IiJRnOj4u27UZ4ySMwTpPrxSl0NHHPRC4CmGRDjo T0Bk+jMzM+hDHDoNPrQ1s+aAJ23spvJiWrvNdp2sllKZKMCR0/5mOgrjTpxlWRaJDLmY 8rrA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=LuzFsO4KqFryUH4sR6VX8BfDY6xpIfu8zoUJgZ9MEv4=; b=CwvS4OssPHnar7dn9McaVf4mbmvf4y1aVEjXvTIUEYj4Xa9xr8nc+Bt0PFc/F4sSnB 5stZ725P5ZFUAY3KUWgTRq0/zSzG63Hysr6T+QEIL/R7MJAK49qk0x9MnN9s/mikqJt/ iL5pkb/i0zDfI8tV/Qh9sY50GCHKdie3c4dpAShTVw0dUGDbwJu7DXDUHgn8lqA6ML95 TI/VefjDlehmick+5hU3PMY0igTRUmHiciv5eFb2VTTai4m/+ZYblAxzqQkvYB2RRmXm GEAV+72p753YhIL63B/7wfdmHPo9rW5FBg5/MUtTUOVEUMrKde/BFW1bJ4iOFZQ4i3gC KH1g== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id qw8si3676547ejb.74.2019.09.18.18.28.32; Wed, 18 Sep 2019 18:28:54 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730740AbfISAx5 (ORCPT + 99 others); Wed, 18 Sep 2019 20:53:57 -0400 Received: from szxga04-in.huawei.com ([45.249.212.190]:2674 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S2387547AbfISAx5 (ORCPT ); Wed, 18 Sep 2019 20:53:57 -0400 Received: from DGGEMS403-HUB.china.huawei.com (unknown [172.30.72.60]) by Forcepoint Email with ESMTP id D63DBB24693A9CBEFC49; Thu, 19 Sep 2019 08:53:53 +0800 (CST) Received: from [10.134.22.195] (10.134.22.195) by smtp.huawei.com (10.3.19.203) with Microsoft SMTP Server (TLS) id 14.3.439.0; Thu, 19 Sep 2019 08:53:50 +0800 Subject: Re: [f2fs-dev] [PATCH 1/2] f2fs: do not select same victim right again To: Jaegeuk Kim CC: , References: <20190909080654.GD21625@jaegeuk-macbookpro.roam.corp.google.com> <97237da2-897a-8420-94de-812e94aa751f@huawei.com> <20190909120443.GA31108@jaegeuk-macbookpro.roam.corp.google.com> <27725e65-53fe-5731-0201-9959b8ef6b49@huawei.com> <20190916153736.GA2493@jaegeuk-macbookpro.roam.corp.google.com> <20190917205501.GA60683@jaegeuk-macbookpro.roam.corp.google.com> <20190918031257.GA82722@jaegeuk-macbookpro.roam.corp.google.com> <20190918164754.GA88624@jaegeuk-macbookpro.roam.corp.google.com> From: Chao Yu Message-ID: <7d7e8e46-0261-ddec-881a-e720ca2badac@huawei.com> Date: Thu, 19 Sep 2019 08:53:34 +0800 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: <20190918164754.GA88624@jaegeuk-macbookpro.roam.corp.google.com> Content-Type: text/plain; charset="windows-1252" Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.134.22.195] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2019/9/19 0:47, Jaegeuk Kim wrote: > On 09/18, Chao Yu wrote: >> On 2019/9/18 11:12, Jaegeuk Kim wrote: >>> On 09/18, Chao Yu wrote: >>>> On 2019/9/18 4:55, Jaegeuk Kim wrote: >>>>> On 09/17, Chao Yu wrote: >>>>>> On 2019/9/16 23:37, Jaegeuk Kim wrote: >>>>>>> On 09/16, Chao Yu wrote: >>>>>>>> On 2019/9/9 20:04, Jaegeuk Kim wrote: >>>>>>>>> On 09/09, Chao Yu wrote: >>>>>>>>>> On 2019/9/9 16:06, Jaegeuk Kim wrote: >>>>>>>>>>> On 09/09, Chao Yu wrote: >>>>>>>>>>>> On 2019/9/9 9:25, Jaegeuk Kim wrote: >>>>>>>>>>>>> GC must avoid select the same victim again. >>>>>>>>>>>> >>>>>>>>>>>> Blocks in previous victim will occupy addition free segment, I doubt after this >>>>>>>>>>>> change, FGGC may encounter out-of-free space issue more frequently. >>>>>>>>>>> >>>>>>>>>>> Hmm, actually this change seems wrong by sec_usage_check(). >>>>>>>>>>> We may be able to avoid this only in the suspicious loop? >>>>>>>>>>> >>>>>>>>>>> --- >>>>>>>>>>> fs/f2fs/gc.c | 2 +- >>>>>>>>>>> 1 file changed, 1 insertion(+), 1 deletion(-) >>>>>>>>>>> >>>>>>>>>>> diff --git a/fs/f2fs/gc.c b/fs/f2fs/gc.c >>>>>>>>>>> index e88f98ddf396..5877bd729689 100644 >>>>>>>>>>> --- a/fs/f2fs/gc.c >>>>>>>>>>> +++ b/fs/f2fs/gc.c >>>>>>>>>>> @@ -1326,7 +1326,7 @@ int f2fs_gc(struct f2fs_sb_info *sbi, bool sync, >>>>>>>>>>> round++; >>>>>>>>>>> } >>>>>>>>>>> >>>>>>>>>>> - if (gc_type == FG_GC) >>>>>>>>>>> + if (gc_type == FG_GC && seg_freed) >>>>>>>>>> >>>>>>>>>> That's original solution Sahitya provided to avoid infinite loop of GC, but I >>>>>>>>>> suggest to find the root cause first, then we added .invalid_segmap for that >>>>>>>>>> purpose. >>>>>>>>> >>>>>>>>> I've checked the Sahitya's patch. So, it seems the problem can happen due to >>>>>>>>> is_alive or atomic_file. >>>>>>>> >>>>>>>> For some conditions, this doesn't help, for example, two sections contain the >>>>>>>> same fewest valid blocks, it will cause to loop selecting them if it fails to >>>>>>>> migrate blocks. >>>>>>>> >>>>>>>> How about keeping it as it is to find potential bug. >>>>>>> >>>>>>> I think it'd be fine to merge this. Could you check the above scenario in more >>>>>>> detail? >>>>>> >>>>>> I haven't saw this in real scenario yet. >>>>>> >>>>>> What I mean is if there is a bug (maybe in is_alive()) failing us to GC on one >>>>>> section, when that bug happens in two candidates, there could be the same >>>>>> condition that GC will run into loop (select A, fail to migrate; select B, fail >>>>>> to migrate, select A...). >>>>>> >>>>>> But I guess the benefit of this change is, if FGGC fails to migrate block due to >>>>>> i_gc_rwsem race, selecting another section and later retrying previous one may >>>>>> avoid lock race, right? >>>>> >>>>> In any case, I think this can avoid potenial GC loop. At least to me, it'd be >>>>> quite risky, if we remain this just for debugging purpose only. >>>> >>>> Yup, >>>> >>>> One more concern is would this cur_victim_sec remain after FGGC? then BGGC/SSR >>>> will always skip the section cur_victim_sec points to. >>> >>> Then, we can get another loop before using it by BGGC/SSR. >> >> I guess I didn't catch your point, do you mean, if we reset it in the end of >> FGGC, we may encounter the loop during BGGC/SSR? > > FGGC failed in a loop and last victim was remained in cur_victim_sec. It won't run into a loop because we keep below condition? + if (gc_type == FG_GC && seg_freed) + sbi->cur_victim_sec = NULL_SEGNO; if (sync) goto stop; I meant add below logic in addition: + if (gc_type == FG_GC) + sbi->cur_victim_sec = NULL_SEGNO; mutex_unlock(&sbi->gc_mutex); Thanks, > Next FGGC kicked in and did the same thing again. I don't expect BGGC/SSR > wants to select this victim much, since it will have CB policy. > >> >> I meant: >> >> f2fs_gc() >> ... >> >> + if (gc_type == FG_GC) >> + sbi->cur_victim_sec = NULL_SEGNO; >> >> mutex_unlock(&sbi->gc_mutex); >> >> put_gc_inode(&gc_list); >> ... >> >> Thanks, >> >>> >>>> >>>> So could we reset cur_victim_sec in the end of FGGC? >>>> >>>> Thanks, >>>> >>>>> >>>>>> >>>>>> Thanks, >>>>>> >>>>>>> >>>>>>> Thanks, >>>>>>> >>>>>>>> >>>>>>>> Thanks, >>>>>>>> >>>>>>>>> >>>>>>>>>> >>>>>>>>>> Thanks, >>>>>>>>>> >>>>>>>>>>> sbi->cur_victim_sec = NULL_SEGNO; >>>>>>>>>>> >>>>>>>>>>> if (sync) >>>>>>>>>>> >>>>>>>>> . >>>>>>>>> >>>>>>> . >>>>>>> >>>>> . >>>>> >>> . >>> > . >