Received: by 2002:a25:e7d8:0:0:0:0:0 with SMTP id e207csp1368720ybh; Thu, 12 Mar 2020 23:32:26 -0700 (PDT) X-Google-Smtp-Source: ADFU+vvHsOidYqHUB2FF74zbNh8mAmMLwuo9nxHEJqHSY0vxCTeI9YJ6+N6a2voCTga7bt4ttbeF X-Received: by 2002:a9d:6418:: with SMTP id h24mr9641735otl.172.1584081146362; Thu, 12 Mar 2020 23:32:26 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1584081146; cv=none; d=google.com; s=arc-20160816; b=CPPbV8BxWKQly1aBbmRQAYTcvyBwp7pq8uRUVfs6DoelQP+B6D0NuLrkLYo8ynOD5H MP6wb4LSYAg74Hyx08OvSCi9Y7U84e5RlIE4VnclXeuoaZ+0IMQ9IhxswKjhxhaF7zD0 imcvHL9lqs0M1SnQDP640lf8cbE1PILcF6vcn5lvoYBbQW8lXw/+yE5mgH57OylhdJg/ nV7HI747+hzMCX72R3IhxRP3GZbuSSkpRUpHyE6RK54ZQgc7o31iR9rqgK6AMmXyLlcK AR1yoGoD7ZmGfCpSC4+vQmm2vY5diiZ8HXWkW9Wf3Ml9mE6Mi+L3XdUPWknDWzNpXElH mnmw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=v9wInTI2i5k/8i/w0ZRHkGQhaHNvHd/8AIV3eMb+fHg=; b=h0ivSggB1xMqii+V2fn0Yg/L7NSwMacicT3aKe4a2jSQRjlEUFoNMcyko39hHFa4o+ CrBlQVU2oFdEQNRYvFlnVDYKdlxr3aLegt7ti2KgRSGezBkN7y0jtuRRJfeubd5kr4z3 njybuzfKBnjTnhDlu5qaktK8TW6HnC57dv725YGkzkeNKojKkhhRsYCeM3Q+c6u9pTmR k+SVF7nCchcQT+39Rkdt51qrQ6MefLCOZDQqOzpEavTlQl1si9QYoKy/n5c93j/ne5+7 iFtWMEIuZG55lUof5MG4M40GTb712nTG7aTerSvpl8gqPHd8iVGO0LPf9rfeL/41BvLq XbBg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r23si4114037otd.203.2020.03.12.23.32.10; Thu, 12 Mar 2020 23:32:26 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726299AbgCMGbv (ORCPT + 99 others); Fri, 13 Mar 2020 02:31:51 -0400 Received: from szxga05-in.huawei.com ([45.249.212.191]:11670 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726001AbgCMGbv (ORCPT ); Fri, 13 Mar 2020 02:31:51 -0400 Received: from DGGEMS402-HUB.china.huawei.com (unknown [172.30.72.59]) by Forcepoint Email with ESMTP id 0ED352C1F36F6B23641E; Fri, 13 Mar 2020 14:31:01 +0800 (CST) Received: from [10.134.22.195] (10.134.22.195) by smtp.huawei.com (10.3.19.202) with Microsoft SMTP Server (TLS) id 14.3.487.0; Fri, 13 Mar 2020 14:30:56 +0800 Subject: Re: [PATCH] f2fs: fix long latency due to discard during umount To: Sahitya Tummala CC: Jaegeuk Kim , , References: <1584011671-20939-1-git-send-email-stummala@codeaurora.org> <20200313033912.GJ20234@codeaurora.org> From: Chao Yu Message-ID: Date: Fri, 13 Mar 2020 14:30:55 +0800 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: <20200313033912.GJ20234@codeaurora.org> Content-Type: text/plain; charset="windows-1252" Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.134.22.195] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2020/3/13 11:39, Sahitya Tummala wrote: > On Fri, Mar 13, 2020 at 10:20:04AM +0800, Chao Yu wrote: >> On 2020/3/12 19:14, Sahitya Tummala wrote: >>> F2FS already has a default timeout of 5 secs for discards that >>> can be issued during umount, but it can take more than the 5 sec >>> timeout if the underlying UFS device queue is already full and there >>> are no more available free tags to be used. In that case, submit_bio() >>> will wait for the already queued discard requests to complete to get >>> a free tag, which can potentially take way more than 5 sec. >>> >>> Fix this by submitting the discard requests with REQ_NOWAIT >>> flags during umount. This will return -EAGAIN for UFS queue/tag full >>> scenario without waiting in the context of submit_bio(). The FS can >>> then handle these requests by retrying again within the stipulated >>> discard timeout period to avoid long latencies. >>> >>> Signed-off-by: Sahitya Tummala >>> --- >>> fs/f2fs/segment.c | 14 +++++++++++++- >>> 1 file changed, 13 insertions(+), 1 deletion(-) >>> >>> diff --git a/fs/f2fs/segment.c b/fs/f2fs/segment.c >>> index fb3e531..a06bbac 100644 >>> --- a/fs/f2fs/segment.c >>> +++ b/fs/f2fs/segment.c >>> @@ -1124,10 +1124,13 @@ static int __submit_discard_cmd(struct f2fs_sb_info *sbi, >>> struct discard_cmd_control *dcc = SM_I(sbi)->dcc_info; >>> struct list_head *wait_list = (dpolicy->type == DPOLICY_FSTRIM) ? >>> &(dcc->fstrim_list) : &(dcc->wait_list); >>> - int flag = dpolicy->sync ? REQ_SYNC : 0; >>> + int flag; >>> block_t lstart, start, len, total_len; >>> int err = 0; >>> >>> + flag = dpolicy->sync ? REQ_SYNC : 0; >>> + flag |= dpolicy->type == DPOLICY_UMOUNT ? REQ_NOWAIT : 0; >>> + >>> if (dc->state != D_PREP) >>> return 0; >>> >>> @@ -1203,6 +1206,11 @@ static int __submit_discard_cmd(struct f2fs_sb_info *sbi, >>> bio->bi_end_io = f2fs_submit_discard_endio; >>> bio->bi_opf |= flag; >>> submit_bio(bio); >>> + if ((flag & REQ_NOWAIT) && (dc->error == -EAGAIN)) { >> >> If we want to update dc->state, we need to cover it with dc->lock. > > Sure, will update it. > >> >>> + dc->state = D_PREP; >> >> BTW, one dc can be referenced by multiple bios, so dc->state could be updated to >> D_DONE later by f2fs_submit_discard_endio(), however we just relocate it to >> pending list... which is inconsistent status. > > In that case dc->bio_ref will reflect it and until it becomes 0, the dc->state > will not be updated to D_DONE in f2fs_submit_discard_endio()? __submit_discard_cmd() lock() dc->state = D_SUBMIT; dc->bio_ref++; unlock() ... submit_bio() f2fs_submit_discard_endio() dc->error = -EAGAIN; lock() dc->bio_ref--; dc->state = D_PREP; dc->state = D_DONE; unlock() So finally, dc's state is D_DONE, and it's in wait list, then will be relocated to pending list. > > Thanks, > >> >> Thanks, >> >>> + err = dc->error; >>> + break; >>> + } >>> >>> atomic_inc(&dcc->issued_discard); >>> >>> @@ -1510,6 +1518,10 @@ static int __issue_discard_cmd(struct f2fs_sb_info *sbi, >>> } >>> >>> __submit_discard_cmd(sbi, dpolicy, dc, &issued); >>> + if (dc->error == -EAGAIN) { >>> + congestion_wait(BLK_RW_ASYNC, HZ/50); >>> + __relocate_discard_cmd(dcc, dc); >>> + } >>> >>> if (issued >= dpolicy->max_requests) >>> break; >>> >