Received: by 2002:a05:6a10:8c0a:0:0:0:0 with SMTP id go10csp2779138pxb; Sun, 24 Jan 2021 20:43:55 -0800 (PST) X-Google-Smtp-Source: ABdhPJwTuo7U/GEpiqudy1HPxmWZ9pHuHRiB2/qsaNS8qEpmNqWkwKCZmpRSdpMjj7UEpuybxxcl X-Received: by 2002:a17:906:eb46:: with SMTP id mc6mr399151ejb.184.1611549835571; Sun, 24 Jan 2021 20:43:55 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1611549835; cv=none; d=google.com; s=arc-20160816; b=bG5FAJ7GtDj+E00ENIiVXIKsiVKj1j38EeYOY6KcEbcWN8AlbcXCn/FEPhm/B2C8D3 pCppl82cG2txisUiEqIJQTY5bZ6PtCX3YZG4IBzedanXgzje3PBY24bH6dicEuQKxrFS MF6FiiWpzOn2CH7Qo/guhr+lxVbeKvcQwF8ZVEpsCC/q68ObMifF/oXbMm877M8fZJQi yEH+y1XICOz9smUHjbxf4LYWSD3QC4H9jy1gmd51eqc4EG/PS/Jri6ogZK7X57veXW50 1n5qmIMdb8paI2n0XxslLsTbpEa/8mtP5fZQTDBWpwnxTfihE+4pz8QrmZpdGls985jd 74bQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from; bh=6TSn2HJLj3NmrJIiupi1gsn3/VG/R5CIB2t8LWB5wio=; b=m0Koqo6yPWlfUtWqll8csGHGJqZ9Ja97KaESmCuGwonxOTjFyoBP0GeARmi34e89TO 9cCo9qxb9sX/lFQQxW33NkvUxTrtSroRSSdohGC4nNFd2knGHht/qTDiF5xlDRGCpTp6 QT6mi4wFMbGSsWbi9Q0jzauPsZK1EStmrbRZstDFVCq3OmTX5RcElakv3Dab8YSO6HpS dgfde+NexOwk8WNttiSiOwoYUT7lt5ukW/T6Wb9w6YLSzJ387XU3AJs0dYI+dje9AGIb X1o6/3xir2N8jVyZWpnuZ7/3S/EMXAZ6+xGyeZ60mtD+nPuB129A9zQ6GzwaF8Jn+LJ7 JiQA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=easystack.cn Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id bo15si6703317edb.320.2021.01.24.20.43.32; Sun, 24 Jan 2021 20:43:55 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=easystack.cn Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727146AbhAYEjt (ORCPT + 99 others); Sun, 24 Jan 2021 23:39:49 -0500 Received: from m97179.mail.qiye.163.com ([220.181.97.179]:56421 "EHLO m97179.mail.qiye.163.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727145AbhAYEjm (ORCPT ); Sun, 24 Jan 2021 23:39:42 -0500 X-Greylist: delayed 541 seconds by postgrey-1.27 at vger.kernel.org; Sun, 24 Jan 2021 23:39:38 EST Received: from localhost.localdomain (unknown [218.94.118.90]) by m97179.mail.qiye.163.com (Hmail) with ESMTPA id C069BE02845; Mon, 25 Jan 2021 12:29:44 +0800 (CST) From: Dongsheng Yang To: colyli@suse.de, linux-bcache@vger.kernel.org, linux-kernel@vger.kernel.org, mchristi@redhat.com Cc: Dongsheng Yang Subject: [PATCH] bcache: dont reset bio opf in bch_data_insert_start Date: Mon, 25 Jan 2021 12:29:42 +0800 Message-Id: <20210125042942.1087170-1-dongsheng.yang@easystack.cn> X-Mailer: git-send-email 2.25.1 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-HM-Spam-Status: e1kfGhgUHx5ZQUtXWQgYFAkeWUFZS1VLWVdZKFlBSUI3V1ktWUFJV1kPCR oVCBIfWUFZH0xDGR0eTRhKTkxOVkpNSkpOT0NCQ05LT0tVGRETFhoSFyQUDg9ZV1kWGg8SFR0UWU FZT0tIVUpKS0hNSlVLWQY+ X-HM-Sender-Digest: e1kMHhlZQR0aFwgeV1kSHx4VD1lBWUc6PVE6DBw*Aj08PgkoLDY4AT48 PSoaCz1VSlVKTUpKTk9DQkNOSUlJVTMWGhIXVR8UFRwIEx4VHFUCGhUcOx4aCAIIDxoYEFUYFUVZ V1kSC1lBWUlKQ1VCT1VKSkNVQktZV1kIAVlBSE9OSDcG X-HM-Tid: 0a7737ce068e20bdkuqyc069be02845 Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org commit ad0d9e76(bcache: use bio op accessors) makes the bi_opf modified by bio_set_op_attrs(). But there is a logical problem in this commit: trace_bcache_cache_insert(k); bch_keylist_push(&op->insert_keys); - n->bi_rw |= REQ_WRITE; + bio_set_op_attrs(n, REQ_OP_WRITE, 0); bch_submit_bbio(n, op->c, k, 0); } while (n != bio); The old code add REQ_WRITE into bio n and keep other flags; the new code set REQ_OP_WRITE to bi_opf, but reset all other flags. This problem is discoverd in our performance testing: (1) start a fio with 1M x 128depth for read in /dev/nvme0n1p1 (2) start a fio with 1M x 128depth for write in /dev/escache0 (cache device is /dev/nvme0n1p2) We found the BW of reading is 2000+M/s, but the BW of writing is 0-100M/s. After some debugging, we found the problem is io submit in writting is very slow. bch_data_insert_start() insert a bio to /dev/nvme0n1p1, but as cached_dev submit stack bio will be added into current->bio_list, and return.Then __submit_bio_noacct() will submit the new bio in bio_list into /dev/nvme0n1p1. This operation would be slow in blk_mq_submit_bio() -> rq_qos_throttle(q, bio); The rq_qos_throttle() will call wbt_should_throttle(), static inline bool wbt_should_throttle(struct rq_wb *rwb, struct bio *bio) { switch (bio_op(bio)) { case REQ_OP_WRITE: /* * Don't throttle WRITE_ODIRECT */ if ((bio->bi_opf & (REQ_SYNC | REQ_IDLE)) == (REQ_SYNC | REQ_IDLE)) return false; ... ... } As the bio_set_op_attrs() reset the (REQ_SYNC | REQ_IDLE), so this write bio will be considered as non-direct write. After this fix, bio to nvme will flaged as (REQ_SYNC | REQ_IDLE), then fio for writing will get about 1000M/s bandwidth. Fixes: ad0d9e76a4124708dddd00c04fc4b56fc86c02d6 Signed-off-by: Dongsheng Yang --- drivers/md/bcache/request.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/drivers/md/bcache/request.c b/drivers/md/bcache/request.c index c7cadaafa947..eb734f7ddaac 100644 --- a/drivers/md/bcache/request.c +++ b/drivers/md/bcache/request.c @@ -244,7 +244,7 @@ static void bch_data_insert_start(struct closure *cl) trace_bcache_cache_insert(k); bch_keylist_push(&op->insert_keys); - bio_set_op_attrs(n, REQ_OP_WRITE, 0); + n->bi_opf |= REQ_OP_WRITE; bch_submit_bbio(n, op->c, k, 0); } while (n != bio); -- 2.25.1