From: Yu Kuai
To: axboe@kernel.dk
Cc: linux-block@vger.kernel.org, linux-kernel@vger.kernel.org,
    yukuai3@huawei.com, yukuai1@huaweicloud.com, yi.zhang@huawei.com
Subject: [PATCH -next] blk-wbt: call rq_qos_add() after wb_normal is initialized
Date: Tue, 13 Sep 2022 18:57:49 +0800
Message-Id: <20220913105749.3086243-1-yukuai1@huaweicloud.com>

Our test found a problem where the wbt inflight counter goes negative,
which causes an io hang (note that this problem doesn't exist in
mainline):

t1: device create                 t2: issue io
add_disk
 blk_register_queue
  wbt_enable_default
   wbt_init
    rq_qos_add
    // wb_normal is still 0
                                  /*
                                   * in mainline, disk can't be opened before
                                   * bdev_add(), however, in old kernels, disk
                                   * can be opened before blk_register_queue().
                                   */
                                  blkdev_issue_flush
                                  // disk size is 0, however, it's not checked
                                   submit_bio_wait
                                    submit_bio
                                     blk_mq_submit_bio
                                      rq_qos_throttle
                                       wbt_wait
                                        bio_to_wbt_flags
                                         rwb_enabled
                                         // wb_normal is 0, inflight is not increased

    wbt_queue_depth_changed(&rwb->rqos);
     wbt_update_limits
     // wb_normal is initialized
                                      rq_qos_track
                                       wbt_track
                                        rq->wbt_flags |= bio_to_wbt_flags(rwb, bio);
                                        // wb_normal is not 0, wbt_flags will be set

t3: io completion
blk_mq_free_request
 rq_qos_done
  wbt_done
   wbt_is_tracked
   // return true
   __wbt_done
    wbt_rqw_done
     atomic_dec_return(&rqw->inflight);
     // inflight is decreased

commit 8235b5c1e8c1 ("block: call bdev_add later in device_add_disk")
can avoid this problem, however it's better to fix this problem in wbt:

1) Older kernels can't backport that commit because of the large amount
   of refactoring it depends on.
2) The root cause is that wbt calls rq_qos_add() before wb_normal is
   initialized.

Fixes: e34cbd307477 ("blk-wbt: add general throttling mechanism")
Cc:
Signed-off-by: Yu Kuai
---
 block/blk-wbt.c | 9 ++++-----
 1 file changed, 4 insertions(+), 5 deletions(-)

diff --git a/block/blk-wbt.c b/block/blk-wbt.c
index a9982000b667..246467926253 100644
--- a/block/blk-wbt.c
+++ b/block/blk-wbt.c
@@ -843,6 +843,10 @@ int wbt_init(struct request_queue *q)
 	rwb->enable_state = WBT_STATE_ON_DEFAULT;
 	rwb->wc = 1;
 	rwb->rq_depth.default_depth = RWB_DEF_DEPTH;
+	rwb->min_lat_nsec = wbt_default_latency_nsec(q);
+
+	wbt_queue_depth_changed(&rwb->rqos);
+	wbt_set_write_cache(q, test_bit(QUEUE_FLAG_WC, &q->queue_flags));
 
 	/*
 	 * Assign rwb and add the stats callback.
@@ -853,11 +857,6 @@ int wbt_init(struct request_queue *q)
 
 	blk_stat_add_callback(q, rwb->cb);
 
-	rwb->min_lat_nsec = wbt_default_latency_nsec(q);
-
-	wbt_queue_depth_changed(&rwb->rqos);
-	wbt_set_write_cache(q, test_bit(QUEUE_FLAG_WC, &q->queue_flags));
-
 	return 0;
 
 err_free:
-- 
2.31.1
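
Illustrative note, not part of the patch: the ordering problem in the trace can be modelled in userspace with a toy counter. This is only a sketch under simplifying assumptions. The `toy_*` names are hypothetical and do not exist in the kernel, the two threads are collapsed into one sequential interleaving, and the real code also looks at the bio type, which is ignored here.

```c
#include <assert.h>	/* only needed for the checks in the note below */
#include <stdbool.h>

/* Toy stand-in for struct rq_wb: just the two fields the race touches. */
struct toy_rwb {
	unsigned int wb_normal;	/* 0 until the limits have been computed */
	int inflight;		/* plays the role of rqw->inflight */
};

/* Like rwb_enabled() in the trace: enabled only once wb_normal != 0. */
static bool toy_enabled(const struct toy_rwb *rwb)
{
	return rwb->wb_normal != 0;
}

/* Throttle step (t2): the I/O is counted only if wbt looks enabled. */
static void toy_throttle(struct toy_rwb *rwb)
{
	if (toy_enabled(rwb))
		rwb->inflight++;
}

/* Track step (t2): the request is flagged as tracked only if enabled. */
static bool toy_track(const struct toy_rwb *rwb)
{
	return toy_enabled(rwb);
}

/* Completion step (t3): a tracked request drops the counter. */
static void toy_done(struct toy_rwb *rwb, bool tracked)
{
	if (tracked)
		rwb->inflight--;
}

/*
 * Replays the interleaving from the trace and returns the final counter.
 * init_before_add == false: the hooks are visible before wb_normal is set.
 * init_before_add == true:  wb_normal is set first, as in the patched order.
 */
int toy_simulate(bool init_before_add)
{
	struct toy_rwb rwb = { 0, 0 };
	bool tracked;

	if (init_before_add)
		rwb.wb_normal = 1;	/* limits ready before the hooks are visible */

	/* From here on the hooks are reachable (the rq_qos_add() point). */
	toy_throttle(&rwb);

	if (!init_before_add)
		rwb.wb_normal = 1;	/* t1 sets the limits between throttle and track */

	tracked = toy_track(&rwb);
	toy_done(&rwb, tracked);

	return rwb.inflight;
}
```

In this model `assert(toy_simulate(false) == -1)` and `assert(toy_simulate(true) == 0)` both hold. With the old ordering the request is never counted at throttle time but is still flagged and decremented at completion. Once wb_normal is set before the hooks become reachable, the increment and decrement pair up.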