Received: by 2002:a25:c593:0:0:0:0:0 with SMTP id v141csp6309013ybe; Wed, 18 Sep 2019 01:04:45 -0700 (PDT) X-Google-Smtp-Source: APXvYqw0DSS6E/ZyVFuAV8V/DbA0YF2+Zy0MptbBRJKZgjdardoOSgDobK3Fm3sRK8WTnsIgjrUf X-Received: by 2002:a17:906:20c7:: with SMTP id c7mr8144924ejc.248.1568793885013; Wed, 18 Sep 2019 01:04:45 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1568793885; cv=none; d=google.com; s=arc-20160816; b=DEStreemBwZvyt1hmsiK6/s9JFYzDb4n8vsupHNnZ3xYk0dm2YmVhnOGLyhdjeABQZ Y8Belwg30oWNHYyhUQZfozBNbMBAqaAsxftg35RoQngmrRUPtWKwFVdd0CWY00n+ZSp/ nBJ1Su1LEz+FmmgazIfT5TQGznkp16wJ7T1LWZ6yS/eCFxs6QNHKX9GGS0xeX3zW8AIp RafVnEDqBQS2a9epqyJ8hIfdPr38K+yAGxI//Q8AfegLkmIRf9PdA0KofO1Wyq/I8Au4 klSPN1V5QxshiZpKzT1N7b3lceypexxU+Id6HbP+2yxQ4vWRR3zqcStJeRs7vb2mnCSq oIMw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=ZxFJD8io4wpxYH4YPdyBEUiVo2wIJkbQuWzab6Tth9Y=; b=LCq/4nFCh2YYA5RHNAUlbLV+ATB73mxWbMRt9Mfuov0t/J8W10aQMjYvgahj1pl1Wd 1lzWMEnTPvihcU24wFhmBuTPX/cV7/p09Pzzu+GYsj+yvkcoj3ovg6bMrxHfsjbq6g8J 5uI2UixnIsE1i2iAfQwCLpApqwwtXMlP2BHiCf2meZjLkj9Zb0F23q4p+FIEMB5K/EbP dkLM8XEfcytv/Xk0fT8knPRlo+yyzO9gHIxhYD0lPmeHUQPi7BGJUQykjhXmxkk1ATrc TDqerx2k3t0RxXxEw3hBPni7RTgk4Pb/PtkGX4eJqs0HKIW/DAykEW+tFAg8NGLWRb4s T9pw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id n5si2409562eje.103.2019.09.18.01.04.21; Wed, 18 Sep 2019 01:04:45 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728296AbfIRD1K (ORCPT + 99 others); Tue, 17 Sep 2019 23:27:10 -0400 Received: from szxga07-in.huawei.com ([45.249.212.35]:51792 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726538AbfIRD1K (ORCPT ); Tue, 17 Sep 2019 23:27:10 -0400 Received: from DGGEMS407-HUB.china.huawei.com (unknown [172.30.72.60]) by Forcepoint Email with ESMTP id CBBD1E842C024FB08F9E; Wed, 18 Sep 2019 11:27:08 +0800 (CST) Received: from [10.134.22.195] (10.134.22.195) by smtp.huawei.com (10.3.19.207) with Microsoft SMTP Server (TLS) id 14.3.439.0; Wed, 18 Sep 2019 11:27:03 +0800 Subject: Re: [f2fs-dev] [PATCH 1/2] f2fs: do not select same victim right again To: Jaegeuk Kim CC: , References: <20190909012532.20454-1-jaegeuk@kernel.org> <69933b7f-48cc-47f9-ba6f-b5ca8f733cba@huawei.com> <20190909080654.GD21625@jaegeuk-macbookpro.roam.corp.google.com> <97237da2-897a-8420-94de-812e94aa751f@huawei.com> <20190909120443.GA31108@jaegeuk-macbookpro.roam.corp.google.com> <27725e65-53fe-5731-0201-9959b8ef6b49@huawei.com> <20190916153736.GA2493@jaegeuk-macbookpro.roam.corp.google.com> <20190917205501.GA60683@jaegeuk-macbookpro.roam.corp.google.com> <20190918031257.GA82722@jaegeuk-macbookpro.roam.corp.google.com> From: Chao Yu Message-ID: Date: Wed, 18 Sep 2019 11:26:33 +0800 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: <20190918031257.GA82722@jaegeuk-macbookpro.roam.corp.google.com> Content-Type: text/plain; charset="windows-1252" Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.134.22.195] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2019/9/18 11:12, Jaegeuk Kim wrote: > On 09/18, Chao Yu wrote: >> On 2019/9/18 4:55, Jaegeuk Kim wrote: >>> On 09/17, Chao Yu wrote: >>>> On 2019/9/16 23:37, Jaegeuk Kim wrote: >>>>> On 09/16, Chao Yu wrote: >>>>>> On 2019/9/9 20:04, Jaegeuk Kim wrote: >>>>>>> On 09/09, Chao Yu wrote: >>>>>>>> On 2019/9/9 16:06, Jaegeuk Kim wrote: >>>>>>>>> On 09/09, Chao Yu wrote: >>>>>>>>>> On 2019/9/9 9:25, Jaegeuk Kim wrote: >>>>>>>>>>> GC must avoid select the same victim again. >>>>>>>>>> >>>>>>>>>> Blocks in previous victim will occupy addition free segment, I doubt after this >>>>>>>>>> change, FGGC may encounter out-of-free space issue more frequently. >>>>>>>>> >>>>>>>>> Hmm, actually this change seems wrong by sec_usage_check(). >>>>>>>>> We may be able to avoid this only in the suspicious loop? >>>>>>>>> >>>>>>>>> --- >>>>>>>>> fs/f2fs/gc.c | 2 +- >>>>>>>>> 1 file changed, 1 insertion(+), 1 deletion(-) >>>>>>>>> >>>>>>>>> diff --git a/fs/f2fs/gc.c b/fs/f2fs/gc.c >>>>>>>>> index e88f98ddf396..5877bd729689 100644 >>>>>>>>> --- a/fs/f2fs/gc.c >>>>>>>>> +++ b/fs/f2fs/gc.c >>>>>>>>> @@ -1326,7 +1326,7 @@ int f2fs_gc(struct f2fs_sb_info *sbi, bool sync, >>>>>>>>> round++; >>>>>>>>> } >>>>>>>>> >>>>>>>>> - if (gc_type == FG_GC) >>>>>>>>> + if (gc_type == FG_GC && seg_freed) >>>>>>>> >>>>>>>> That's original solution Sahitya provided to avoid infinite loop of GC, but I >>>>>>>> suggest to find the root cause first, then we added .invalid_segmap for that >>>>>>>> purpose. >>>>>>> >>>>>>> I've checked the Sahitya's patch. So, it seems the problem can happen due to >>>>>>> is_alive or atomic_file. >>>>>> >>>>>> For some conditions, this doesn't help, for example, two sections contain the >>>>>> same fewest valid blocks, it will cause to loop selecting them if it fails to >>>>>> migrate blocks. >>>>>> >>>>>> How about keeping it as it is to find potential bug. >>>>> >>>>> I think it'd be fine to merge this. Could you check the above scenario in more >>>>> detail? >>>> >>>> I haven't saw this in real scenario yet. >>>> >>>> What I mean is if there is a bug (maybe in is_alive()) failing us to GC on one >>>> section, when that bug happens in two candidates, there could be the same >>>> condition that GC will run into loop (select A, fail to migrate; select B, fail >>>> to migrate, select A...). >>>> >>>> But I guess the benefit of this change is, if FGGC fails to migrate block due to >>>> i_gc_rwsem race, selecting another section and later retrying previous one may >>>> avoid lock race, right? >>> >>> In any case, I think this can avoid potenial GC loop. At least to me, it'd be >>> quite risky, if we remain this just for debugging purpose only. >> >> Yup, >> >> One more concern is would this cur_victim_sec remain after FGGC? then BGGC/SSR >> will always skip the section cur_victim_sec points to. > > Then, we can get another loop before using it by BGGC/SSR. I guess I didn't catch your point, do you mean, if we reset it in the end of FGGC, we may encounter the loop during BGGC/SSR? I meant: f2fs_gc() ... + if (gc_type == FG_GC) + sbi->cur_victim_sec = NULL_SEGNO; mutex_unlock(&sbi->gc_mutex); put_gc_inode(&gc_list); ... Thanks, > >> >> So could we reset cur_victim_sec in the end of FGGC? >> >> Thanks, >> >>> >>>> >>>> Thanks, >>>> >>>>> >>>>> Thanks, >>>>> >>>>>> >>>>>> Thanks, >>>>>> >>>>>>> >>>>>>>> >>>>>>>> Thanks, >>>>>>>> >>>>>>>>> sbi->cur_victim_sec = NULL_SEGNO; >>>>>>>>> >>>>>>>>> if (sync) >>>>>>>>> >>>>>>> . >>>>>>> >>>>> . >>>>> >>> . >>> > . >