Received: by 2002:a25:6193:0:0:0:0:0 with SMTP id v141csp1529402ybb; Thu, 2 Apr 2020 02:34:01 -0700 (PDT) X-Google-Smtp-Source: APiQypJ7hQcr8rhNgHGE+qIaTUbhfwz4OtuYgFYOn6PvuJ8y/w6LclqKG3Oc60waaYtit0uuYcXy X-Received: by 2002:aca:eb4c:: with SMTP id j73mr1446878oih.145.1585820041468; Thu, 02 Apr 2020 02:34:01 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1585820041; cv=none; d=google.com; s=arc-20160816; b=kMeeJwEB1wm0JYO3TIRPRHdldPMwaBfmV41/gf7tu2cAXHDTv4Ou4hkwNfQjldRW3t 0lFs/FcXM/2UO+ST69VDwYxHVJsRDA9vOrgBYOGVd+dp7ZEp/Ki4uMp32lXiVRjTeYDA Wh0tyvMH6qiZqBXZRjXZQcXKhrODdx7/EkKCu5g3ikJBhelQGFRc7rcwLS1kQEOcVHis 8Bwsc4r7zcjZqBQ3sal5wrRrEwRGpATdiLBczKotdLad0QGDVdVtwID8pHgOV50ThRsw dy6kLd3crszAm6O6oge4vFYWfYHbmMw9Qo7jdWQPhG7wbzeF62Ke/HeGW2XfsBhnppvA 7n1g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=od5cS126zl4YMaii9/MAIHQPQepVRdkPT4j+xTzLmd4=; b=xJDdSQF0f8le3LQtUYdVcuqrxy91nIcJSCLemGGU3Gk2MATvSiBLmt+Yl1PcPJmMiP na3lAiWQiz8/SPN6U9iEKNBj64J8JRV1YVYCivPaEsmL+qQdiksDUm5hCS85GO/55vB6 aCVJLiZAa0UR0dOTxUkeBxqTZml7wZsnTSFydRc4zgUSPF22yyqaCaOKjkDuR1SL/whP IqK7eTe7ZwFom/zKuyqnFPsTwSUW9e4U737HhkNY4gV172+W0Bo7OcIcP8tktzgz3fe7 W1W/jRbZGzjyEaJwd9njtSefp52WXm5ntOtTQQew7LeZsRMxsBSnafTBzp3frIIQb9xA Zffg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id f6si2043341oom.58.2020.04.02.02.33.48; Thu, 02 Apr 2020 02:34:01 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2387723AbgDBJcx (ORCPT + 99 others); Thu, 2 Apr 2020 05:32:53 -0400 Received: from szxga06-in.huawei.com ([45.249.212.32]:60456 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1725965AbgDBJcw (ORCPT ); Thu, 2 Apr 2020 05:32:52 -0400 Received: from DGGEMS410-HUB.china.huawei.com (unknown [172.30.72.59]) by Forcepoint Email with ESMTP id 2444568442345BB538E0; Thu, 2 Apr 2020 17:32:19 +0800 (CST) Received: from [10.134.22.195] (10.134.22.195) by smtp.huawei.com (10.3.19.210) with Microsoft SMTP Server (TLS) id 14.3.487.0; Thu, 2 Apr 2020 17:32:14 +0800 Subject: Re: [PATCH v3] f2fs: fix long latency due to discard during umount To: Sahitya Tummala , Jaegeuk Kim CC: , References: <1585550730-1858-1-git-send-email-stummala@codeaurora.org> <20200331184655.GB198665@google.com> <20200401092201.GB20234@codeaurora.org> From: Chao Yu Message-ID: <417fffb8-0638-e674-cf39-e54665080c36@huawei.com> Date: Thu, 2 Apr 2020 17:32:13 +0800 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: <20200401092201.GB20234@codeaurora.org> Content-Type: text/plain; charset="windows-1252" Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.134.22.195] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2020/4/1 17:22, Sahitya Tummala wrote: > Hi Jaegeuk, > > On Tue, Mar 31, 2020 at 11:46:55AM -0700, Jaegeuk Kim wrote: >> On 03/30, Sahitya Tummala wrote: >>> F2FS already has a default timeout of 5 secs for discards that >>> can be issued during umount, but it can take more than the 5 sec >>> timeout if the underlying UFS device queue is already full and there >>> are no more available free tags to be used. In that case, submit_bio() >>> will wait for the already queued discard requests to complete to get >>> a free tag, which can potentially take way more than 5 sec. >>> >>> Fix this by submitting the discard requests with REQ_NOWAIT >>> flags during umount. This will return -EAGAIN for UFS queue/tag full >>> scenario without waiting in the context of submit_bio(). The FS can >>> then handle these requests by retrying again within the stipulated >>> discard timeout period to avoid long latencies. >> >> Sorry, Sahitya, but, do we really need to do like this? How about just >> controlling # of outstanding discarding bios in __issue_discard_cmd()? > > Do you mean something like this? > > diff --git a/fs/f2fs/segment.c b/fs/f2fs/segment.c > index 1a62b27..860dd43 100644 > --- a/fs/f2fs/segment.c > +++ b/fs/f2fs/segment.c > @@ -1099,7 +1099,7 @@ static void __init_discard_policy(struct f2fs_sb_info *sbi, > } else if (discard_type == DPOLICY_FSTRIM) { > dpolicy->io_aware = false; > } else if (discard_type == DPOLICY_UMOUNT) { > - dpolicy->max_requests = UINT_MAX; > + dpolicy->max_requests = 30; 8 or 16? It looks more simple than previous implementation. Thanks, > dpolicy->io_aware = false; > /* we need to issue all to keep CP_TRIMMED_FLAG */ > dpolicy->granularity = 1; > @@ -1470,12 +1470,14 @@ static int __issue_discard_cmd(struct f2fs_sb_info *sbi, > struct list_head *pend_list; > struct discard_cmd *dc, *tmp; > struct blk_plug plug; > - int i, issued = 0; > + int i, issued; > bool io_interrupted = false; > > if (dpolicy->timeout != 0) > f2fs_update_time(sbi, dpolicy->timeout); > > +retry: > + issued = 0; > for (i = MAX_PLIST_NUM - 1; i >= 0; i--) { > if (dpolicy->timeout != 0 && > f2fs_time_over(sbi, dpolicy->timeout)) > @@ -1522,6 +1524,11 @@ static int __issue_discard_cmd(struct f2fs_sb_info *sbi, > break; > } > > + if (dpolicy->type == DPOLICY_UMOUNT && issued) { > + __wait_all_discard_cmd(sbi, dpolicy); > + goto retry; > + } > + > if (!issued && io_interrupted) > issued = -1; > > Thanks, > >> >>> >>> Signed-off-by: Sahitya Tummala >>> --- >>> v3: >>> -Handle the regression reported by Chao with v2. >>> -simplify the logic to split the dc with multiple bios incase any bio returns >>> EAGAIN and retry those new dc within 5 sec timeout. >>> >>> fs/f2fs/segment.c | 65 +++++++++++++++++++++++++++++++++++++++++++------------ >>> 1 file changed, 51 insertions(+), 14 deletions(-) >>> >>> diff --git a/fs/f2fs/segment.c b/fs/f2fs/segment.c >>> index fb3e531..55d18c7 100644 >>> --- a/fs/f2fs/segment.c >>> +++ b/fs/f2fs/segment.c >>> @@ -1029,13 +1029,16 @@ static void f2fs_submit_discard_endio(struct bio *bio) >>> struct discard_cmd *dc = (struct discard_cmd *)bio->bi_private; >>> unsigned long flags; >>> >>> - dc->error = blk_status_to_errno(bio->bi_status); >>> - >>> spin_lock_irqsave(&dc->lock, flags); >>> + if (!dc->error) >>> + dc->error = blk_status_to_errno(bio->bi_status); >>> + >>> dc->bio_ref--; >>> - if (!dc->bio_ref && dc->state == D_SUBMIT) { >>> - dc->state = D_DONE; >>> - complete_all(&dc->wait); >>> + if (!dc->bio_ref) { >>> + if (dc->error || dc->state == D_SUBMIT) { >>> + dc->state = D_DONE; >>> + complete_all(&dc->wait); >>> + } >>> } >>> spin_unlock_irqrestore(&dc->lock, flags); >>> bio_put(bio); >>> @@ -1124,10 +1127,13 @@ static int __submit_discard_cmd(struct f2fs_sb_info *sbi, >>> struct discard_cmd_control *dcc = SM_I(sbi)->dcc_info; >>> struct list_head *wait_list = (dpolicy->type == DPOLICY_FSTRIM) ? >>> &(dcc->fstrim_list) : &(dcc->wait_list); >>> - int flag = dpolicy->sync ? REQ_SYNC : 0; >>> + int flag; >>> block_t lstart, start, len, total_len; >>> int err = 0; >>> >>> + flag = dpolicy->sync ? REQ_SYNC : 0; >>> + flag |= dpolicy->type == DPOLICY_UMOUNT ? REQ_NOWAIT : 0; >>> + >>> if (dc->state != D_PREP) >>> return 0; >>> >>> @@ -1192,10 +1198,6 @@ static int __submit_discard_cmd(struct f2fs_sb_info *sbi, >>> dc->bio_ref++; >>> spin_unlock_irqrestore(&dc->lock, flags); >>> >>> - atomic_inc(&dcc->queued_discard); >>> - dc->queued++; >>> - list_move_tail(&dc->list, wait_list); >>> - >>> /* sanity check on discard range */ >>> __check_sit_bitmap(sbi, lstart, lstart + len); >>> >>> @@ -1203,6 +1205,29 @@ static int __submit_discard_cmd(struct f2fs_sb_info *sbi, >>> bio->bi_end_io = f2fs_submit_discard_endio; >>> bio->bi_opf |= flag; >>> submit_bio(bio); >>> + if (flag & REQ_NOWAIT) { >>> + if (dc->error == -EAGAIN) { >>> + spin_lock_irqsave(&dc->lock, flags); >>> + dc->len -= len; >>> + if (!dc->len) { >>> + dc->len = total_len; >>> + dc->state = D_PREP; >>> + reinit_completion(&dc->wait); >>> + } else { >>> + dcc->undiscard_blks -= total_len; >>> + if (dc->state == D_PARTIAL) >>> + dc->state = D_SUBMIT; >>> + } >>> + err = dc->error; >>> + dc->error = 0; >>> + spin_unlock_irqrestore(&dc->lock, flags); >>> + break; >>> + } >>> + } >>> + >>> + atomic_inc(&dcc->queued_discard); >>> + dc->queued++; >>> + list_move_tail(&dc->list, wait_list); >>> >>> atomic_inc(&dcc->issued_discard); >>> >>> @@ -1214,8 +1239,9 @@ static int __submit_discard_cmd(struct f2fs_sb_info *sbi, >>> len = total_len; >>> } >>> >>> - if (!err && len) >>> - __update_discard_tree_range(sbi, bdev, lstart, start, len); >>> + if ((!err || err == -EAGAIN) && total_len && dc->start != start) >>> + __update_discard_tree_range(sbi, bdev, lstart, start, >>> + total_len); >>> return err; >>> } >>> >>> @@ -1470,12 +1496,15 @@ static int __issue_discard_cmd(struct f2fs_sb_info *sbi, >>> struct list_head *pend_list; >>> struct discard_cmd *dc, *tmp; >>> struct blk_plug plug; >>> - int i, issued = 0; >>> + int i, err, issued = 0; >>> bool io_interrupted = false; >>> + bool retry; >>> >>> if (dpolicy->timeout != 0) >>> f2fs_update_time(sbi, dpolicy->timeout); >>> >>> +retry: >>> + retry = false; >>> for (i = MAX_PLIST_NUM - 1; i >= 0; i--) { >>> if (dpolicy->timeout != 0 && >>> f2fs_time_over(sbi, dpolicy->timeout)) >>> @@ -1509,7 +1538,12 @@ static int __issue_discard_cmd(struct f2fs_sb_info *sbi, >>> break; >>> } >>> >>> - __submit_discard_cmd(sbi, dpolicy, dc, &issued); >>> + err = __submit_discard_cmd(sbi, dpolicy, dc, &issued); >>> + if (err == -EAGAIN) { >>> + congestion_wait(BLK_RW_ASYNC, >>> + DEFAULT_IO_TIMEOUT); >>> + retry = true; >>> + } >>> >>> if (issued >= dpolicy->max_requests) >>> break; >>> @@ -1522,6 +1556,9 @@ static int __issue_discard_cmd(struct f2fs_sb_info *sbi, >>> break; >>> } >>> >>> + if (retry) >>> + goto retry; >>> + >>> if (!issued && io_interrupted) >>> issued = -1; >>> >>> -- >>> Qualcomm India Private Limited, on behalf of Qualcomm Innovation Center, Inc. >>> Qualcomm Innovation Center, Inc. is a member of Code Aurora Forum, a Linux Foundation Collaborative Project. >