Subject: Re: [f2fs-dev] [PATCH] f2fs: remove a waiter for checkpoint completion
To: Jaegeuk Kim
CC: Eric Biggers
References: <20200803172825.4077289-1-jaegeuk@kernel.org> <9638d2c5-cfd0-359f-187a-8e23bc6d822d@huawei.com> <20200804010412.GA866340@google.com>
From: Chao Yu
Message-ID: <98ac9355-bb6c-5109-da73-4ab7cdbbf8d5@huawei.com>
Date: Tue, 4 Aug 2020 09:17:16 +0800
In-Reply-To: <20200804010412.GA866340@google.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On 2020/8/4 9:04, Jaegeuk Kim wrote:
> On 08/04, Chao Yu wrote:
>> On 2020/8/4 1:28, Jaegeuk Kim wrote:
>>> It doesn't need to wait for checkpoint being completed triggered by end_io.
>>>
>>> [ 20.157753] ------------[ cut here ]------------
>>> [ 20.158393] do not call blocking ops when !TASK_RUNNING; state=2 set at [<0000000096354225>] prepare_to_wait+0xcd/0x430
>>> [ 20.159858] WARNING: CPU: 1 PID: 1152 at kernel/sched/core.c:7142 __might_sleep+0x149/0x1a0
>>> ...
>>> [ 20.176110] __submit_merged_write_cond+0x191/0x310
>>> [ 20.176739] f2fs_submit_merged_write+0x18/0x20
>>> [ 20.177323] f2fs_wait_on_all_pages+0x269/0x2d0
>>> [ 20.177899] ? block_operations+0x980/0x980
>>> [ 20.178441] ? __kasan_check_read+0x11/0x20
>>> [ 20.178975] ? finish_wait+0x260/0x260
>>> [ 20.179488] ? percpu_counter_set+0x147/0x230
>>> [ 20.180049] do_checkpoint+0x1757/0x2a50
>>> [ 20.180558] f2fs_write_checkpoint+0x840/0xaf0
>>> [ 20.181126] f2fs_sync_fs+0x287/0x4a0
>>>
>>> Reported-by: Eric Biggers
>>> Signed-off-by: Jaegeuk Kim
>>> ---
>>>  fs/f2fs/checkpoint.c | 6 +-----
>>>  fs/f2fs/data.c       | 4 ----
>>>  fs/f2fs/f2fs.h       | 1 -
>>>  fs/f2fs/super.c      | 1 -
>>>  4 files changed, 1 insertion(+), 11 deletions(-)
>>>
>>> diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c
>>> index 99c8061da55b9..2bdddc725e677 100644
>>> --- a/fs/f2fs/checkpoint.c
>>> +++ b/fs/f2fs/checkpoint.c
>>> @@ -1255,11 +1255,7 @@ static void unblock_operations(struct f2fs_sb_info *sbi)
>>>  void f2fs_wait_on_all_pages(struct f2fs_sb_info *sbi, int type)
>>>  {
>>> -	DEFINE_WAIT(wait);
>>> -
>>>  	for (;;) {
>>> -		prepare_to_wait(&sbi->cp_wait, &wait, TASK_UNINTERRUPTIBLE);
>>
>> Wouldn't that cause high cpu usage before io end?
>
> This is a critical path to wait for IO completion in checkpoint, which would be
> better to wait for it to avoid long latency to continue filesystem operations.

Yup, in the previous implementation, the last end_io wakes up the checkpoint()
waiter, so we didn't waste any time there.

> Moreover, I expect io_schedule_timeout() can mitigate such CPU overhead and
> actually we don't need to make there-in context switches as well.

Then io_schedule_timeout() in this loop may give the CPU time slice to another
thread until the scheduler reselects checkpoint(); wouldn't that cause longer
latency?

>
>>
>>> -
>>>  		if (!get_pages(sbi, type))
>>>  			break;
>>> @@ -1271,9 +1267,9 @@ void f2fs_wait_on_all_pages(struct f2fs_sb_info *sbi, int type)
>>>  							FS_CP_META_IO);
>>>  		else if (type == F2FS_WB_CP_DATA)
>>>  			f2fs_submit_merged_write(sbi, DATA);
>>> +
>>>  		io_schedule_timeout(DEFAULT_IO_TIMEOUT);
>>>  	}
>>> -	finish_wait(&sbi->cp_wait, &wait);
>>>  }
>>>  static void update_ckpt_flags(struct f2fs_sb_info *sbi, struct cp_control *cpc)
>>> diff --git a/fs/f2fs/data.c b/fs/f2fs/data.c
>>> index c1b676be67b9a..588d2871bbe69 100644
>>> --- a/fs/f2fs/data.c
>>> +++ b/fs/f2fs/data.c
>>> @@ -383,10 +383,6 @@ static void f2fs_write_end_io(struct bio *bio)
>>>  			clear_cold_data(page);
>>>  		end_page_writeback(page);
>>>  	}
>>> -	if (!get_pages(sbi, F2FS_WB_CP_DATA) &&
>>> -				wq_has_sleeper(&sbi->cp_wait))
>>> -		wake_up(&sbi->cp_wait);
>>> -
>>>  	bio_put(bio);
>>>  }
>>> diff --git a/fs/f2fs/f2fs.h b/fs/f2fs/f2fs.h
>>> index 02811ce15059b..a036539363030 100644
>>> --- a/fs/f2fs/f2fs.h
>>> +++ b/fs/f2fs/f2fs.h
>>> @@ -1437,7 +1437,6 @@ struct f2fs_sb_info {
>>>  	struct rw_semaphore cp_rwsem;		/* blocking FS operations */
>>>  	struct rw_semaphore node_write;		/* locking node writes */
>>>  	struct rw_semaphore node_change;	/* locking node change */
>>> -	wait_queue_head_t cp_wait;
>>>  	unsigned long last_time[MAX_TIME];	/* to store time in jiffies */
>>>  	long interval_time[MAX_TIME];		/* to store thresholds */
>>> diff --git a/fs/f2fs/super.c b/fs/f2fs/super.c
>>> index 83bf9a02f83f2..5f4d9abc079d6 100644
>>> --- a/fs/f2fs/super.c
>>> +++ b/fs/f2fs/super.c
>>> @@ -3540,7 +3540,6 @@ static int f2fs_fill_super(struct super_block *sb, void *data, int silent)
>>>  	init_rwsem(&sbi->cp_rwsem);
>>>  	init_rwsem(&sbi->quota_sem);
>>> -	init_waitqueue_head(&sbi->cp_wait);
>>>  	init_sb_info(sbi);
>>>  	err = init_percpu_info(sbi);
>>>
> .
>
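
The trade-off discussed in this thread (being woken by the last end_io versus
re-checking on a timer) can be sketched in userspace. This is only an analogy,
not f2fs code: a pthread condition variable stands in for sbi->cp_wait, a fixed
sleep stands in for io_schedule_timeout(DEFAULT_IO_TIMEOUT), and every name and
constant below (end_io, wait_on_all_pages, IO_DELAY_MS, POLL_MS) is
hypothetical. It also assumes the poller really sleeps for the whole interval,
which may not match how io_schedule_timeout() behaves in the patched loop.

```c
/* Userspace analogy only; all names and constants are hypothetical. */
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <pthread.h>
#include <stdbool.h>
#include <time.h>

#define IO_DELAY_MS	20	/* simulated time until the last "end_io" */
#define POLL_MS		100	/* assumed stand-in for DEFAULT_IO_TIMEOUT */

static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;
static pthread_cond_t cp_wait = PTHREAD_COND_INITIALIZER;
static int pending_pages;

static double now_ms(void)
{
	struct timespec ts;

	clock_gettime(CLOCK_MONOTONIC, &ts);
	return ts.tv_sec * 1e3 + ts.tv_nsec / 1e6;
}

/* Sleep for ms milliseconds (signal interruption is ignored in this sketch). */
static void sleep_ms(long ms)
{
	struct timespec ts = {
		.tv_sec = ms / 1000,
		.tv_nsec = (ms % 1000) * 1000000L,
	};

	nanosleep(&ts, NULL);
}

/* Simulated I/O completion: clear the counter and signal the waiter,
 * playing the role of the wake_up(&sbi->cp_wait) that the patch removes. */
static void *end_io(void *arg)
{
	(void)arg;
	sleep_ms(IO_DELAY_MS);
	pthread_mutex_lock(&lock);
	pending_pages = 0;
	pthread_cond_signal(&cp_wait);
	pthread_mutex_unlock(&lock);
	return NULL;
}

/* Returns how long the waiter was blocked, in milliseconds.
 * use_wakeup == true:  sleep until end_io signals the condition variable.
 * use_wakeup == false: ignore the signal and re-check every POLL_MS. */
static double wait_on_all_pages(bool use_wakeup)
{
	pthread_t io;
	double start, waited;
	int rc;

	pending_pages = 1;	/* set before the I/O thread exists */
	start = now_ms();
	rc = pthread_create(&io, NULL, end_io, NULL);
	assert(rc == 0);
	(void)rc;

	pthread_mutex_lock(&lock);
	while (pending_pages) {
		if (use_wakeup) {
			pthread_cond_wait(&cp_wait, &lock);
		} else {
			pthread_mutex_unlock(&lock);
			sleep_ms(POLL_MS);
			pthread_mutex_lock(&lock);
		}
	}
	pthread_mutex_unlock(&lock);

	waited = now_ms() - start;
	pthread_join(io, NULL);
	return waited;
}
```

Built with -pthread, wait_on_all_pages(true) should return shortly after
IO_DELAY_MS has passed, while wait_on_all_pages(false) cannot return before one
full POLL_MS has elapsed. That gap is the extra latency the question above is
about; the sketch says nothing about the CPU cost of either approach.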