Received: by 2002:a25:c593:0:0:0:0:0 with SMTP id v141csp6296214ybe; Wed, 18 Sep 2019 00:49:41 -0700 (PDT) X-Google-Smtp-Source: APXvYqz7ddiygBJPDb2fRLrAc5sDlBby6yc19aIf0E9yY4/m/41Z44JYx+4RJBomaWY9OAngZu4H X-Received: by 2002:a17:906:318d:: with SMTP id 13mr7989072ejy.281.1568792981577; Wed, 18 Sep 2019 00:49:41 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1568792981; cv=none; d=google.com; s=arc-20160816; b=hQ0pEwhOO69WHJF1VbdZi5QntFSK38tYI/1O2Dom5zeO806b5Y4MzWvcaTfhuGvwCW f+BVpr6nNkgGnHmGzs/Y2aMafOW4KEpMlUcLrAH5nH5PFIvOY9cbmlfPLfW09e9+Wnh7 pkRYmW1l6AowvEH/xC5DwIkkWAf5nQT7d6iy4pg5XdK2vjU4SghgPekFXwuzZiwGHegV ALLgY4l8fzD+3wecRaSZcsWXC/g679XKF+tE52yi6mUieTakuadH54q4kIH16DKkI7cp IkeF5cvvg7jrwhyEPvoSIwCgElqr2UQxJSIcg2tBe8m7yWaDfxTPhx9ZYWUrQuPzJmJw rRXg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature; bh=lilIzO3FCSjO3+LFSH7//5aqHS/sm7ZDOfKkZ/w6i0M=; b=Fus1CJQZ7tTaNQaAKkIs/eFM/Mc7JxGJhUeHD6km1gpoaSFAFq8gbHkjw+bzaTCLUp 3twrRLlF8sOezawamVfO/HcEn5pdT6SgLORwa2Yo+E0U/6WweBJR73BkDv5s9z1Mt3+G RSnT6kGBUsGqyxcuT6on7okKyC3S2XeidD+YJ6TusbhjkQlk70qKGIsKjv0XMiNc09hM Xk6N2/avwP7e2Eu4dFn3zaFt4kpzQrFob1j6MpA8LMfigsiPVryoPJphx6i1GHVPrZCj vwv3Fo6b8pYmh1MNxtx5rinqunn6H73ajopYB+MDkbz/z3/HK70bow2kdFz9hVjuO7rZ 6y7A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=crVCuysu; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id y20si2761359edo.436.2019.09.18.00.49.18; Wed, 18 Sep 2019 00:49:41 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=crVCuysu; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727057AbfIRDNA (ORCPT + 99 others); Tue, 17 Sep 2019 23:13:00 -0400 Received: from mail.kernel.org ([198.145.29.99]:51624 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726786AbfIRDNA (ORCPT ); Tue, 17 Sep 2019 23:13:00 -0400 Received: from localhost (unknown [104.132.0.81]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 9985F21855; Wed, 18 Sep 2019 03:12:58 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1568776378; bh=2nL15fdb42znZn7Z0uLezA9HuQwulYK2zVQeGOjDF7o=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=crVCuysuqQ+TWaYTS224Ytoemrl6mkiBBkbsaqjqxs/PqOviAmadvOAW4qV8lKsR5 pAU1m5zqurS9rw5WJwzUdu98mLAnz5y4Y7zD3OSSrwDH3aAyuyYLKi/GYmvajftRT2 Eq89aVkVQgCc4sBHgE9zaWHcqSXWUTl3gaXMjQRc= Date: Tue, 17 Sep 2019 20:12:57 -0700 From: Jaegeuk Kim To: Chao Yu Cc: linux-kernel@vger.kernel.org, linux-f2fs-devel@lists.sourceforge.net Subject: Re: [f2fs-dev] [PATCH 1/2] f2fs: do not select same victim right again Message-ID: <20190918031257.GA82722@jaegeuk-macbookpro.roam.corp.google.com> References: <20190909012532.20454-1-jaegeuk@kernel.org> <69933b7f-48cc-47f9-ba6f-b5ca8f733cba@huawei.com> <20190909080654.GD21625@jaegeuk-macbookpro.roam.corp.google.com> <97237da2-897a-8420-94de-812e94aa751f@huawei.com> <20190909120443.GA31108@jaegeuk-macbookpro.roam.corp.google.com> <27725e65-53fe-5731-0201-9959b8ef6b49@huawei.com> <20190916153736.GA2493@jaegeuk-macbookpro.roam.corp.google.com> <20190917205501.GA60683@jaegeuk-macbookpro.roam.corp.google.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.8.2 (2017-04-18) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 09/18, Chao Yu wrote: > On 2019/9/18 4:55, Jaegeuk Kim wrote: > > On 09/17, Chao Yu wrote: > >> On 2019/9/16 23:37, Jaegeuk Kim wrote: > >>> On 09/16, Chao Yu wrote: > >>>> On 2019/9/9 20:04, Jaegeuk Kim wrote: > >>>>> On 09/09, Chao Yu wrote: > >>>>>> On 2019/9/9 16:06, Jaegeuk Kim wrote: > >>>>>>> On 09/09, Chao Yu wrote: > >>>>>>>> On 2019/9/9 9:25, Jaegeuk Kim wrote: > >>>>>>>>> GC must avoid select the same victim again. > >>>>>>>> > >>>>>>>> Blocks in previous victim will occupy addition free segment, I doubt after this > >>>>>>>> change, FGGC may encounter out-of-free space issue more frequently. > >>>>>>> > >>>>>>> Hmm, actually this change seems wrong by sec_usage_check(). > >>>>>>> We may be able to avoid this only in the suspicious loop? > >>>>>>> > >>>>>>> --- > >>>>>>> fs/f2fs/gc.c | 2 +- > >>>>>>> 1 file changed, 1 insertion(+), 1 deletion(-) > >>>>>>> > >>>>>>> diff --git a/fs/f2fs/gc.c b/fs/f2fs/gc.c > >>>>>>> index e88f98ddf396..5877bd729689 100644 > >>>>>>> --- a/fs/f2fs/gc.c > >>>>>>> +++ b/fs/f2fs/gc.c > >>>>>>> @@ -1326,7 +1326,7 @@ int f2fs_gc(struct f2fs_sb_info *sbi, bool sync, > >>>>>>> round++; > >>>>>>> } > >>>>>>> > >>>>>>> - if (gc_type == FG_GC) > >>>>>>> + if (gc_type == FG_GC && seg_freed) > >>>>>> > >>>>>> That's original solution Sahitya provided to avoid infinite loop of GC, but I > >>>>>> suggest to find the root cause first, then we added .invalid_segmap for that > >>>>>> purpose. > >>>>> > >>>>> I've checked the Sahitya's patch. So, it seems the problem can happen due to > >>>>> is_alive or atomic_file. > >>>> > >>>> For some conditions, this doesn't help, for example, two sections contain the > >>>> same fewest valid blocks, it will cause to loop selecting them if it fails to > >>>> migrate blocks. > >>>> > >>>> How about keeping it as it is to find potential bug. > >>> > >>> I think it'd be fine to merge this. Could you check the above scenario in more > >>> detail? > >> > >> I haven't saw this in real scenario yet. > >> > >> What I mean is if there is a bug (maybe in is_alive()) failing us to GC on one > >> section, when that bug happens in two candidates, there could be the same > >> condition that GC will run into loop (select A, fail to migrate; select B, fail > >> to migrate, select A...). > >> > >> But I guess the benefit of this change is, if FGGC fails to migrate block due to > >> i_gc_rwsem race, selecting another section and later retrying previous one may > >> avoid lock race, right? > > > > In any case, I think this can avoid potenial GC loop. At least to me, it'd be > > quite risky, if we remain this just for debugging purpose only. > > Yup, > > One more concern is would this cur_victim_sec remain after FGGC? then BGGC/SSR > will always skip the section cur_victim_sec points to. Then, we can get another loop before using it by BGGC/SSR. > > So could we reset cur_victim_sec in the end of FGGC? > > Thanks, > > > > >> > >> Thanks, > >> > >>> > >>> Thanks, > >>> > >>>> > >>>> Thanks, > >>>> > >>>>> > >>>>>> > >>>>>> Thanks, > >>>>>> > >>>>>>> sbi->cur_victim_sec = NULL_SEGNO; > >>>>>>> > >>>>>>> if (sync) > >>>>>>> > >>>>> . > >>>>> > >>> . > >>> > > . > >