Received: by 2002:a05:6a10:8c0a:0:0:0:0 with SMTP id go10csp3722399pxb; Tue, 26 Jan 2021 03:07:23 -0800 (PST) X-Google-Smtp-Source: ABdhPJynck9jv7uCmwN0sxM8M2A62lkjNSGcS8K/Q7kPdwY41O+H39+zpDLtVQ743uGcZMw3fdeh X-Received: by 2002:aa7:d88e:: with SMTP id u14mr4038046edq.72.1611659242977; Tue, 26 Jan 2021 03:07:22 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1611659242; cv=none; d=google.com; s=arc-20160816; b=TiVwFrHNUVOpxhqGfoZZHIHXE8sQ8wsjKb2bN9RsF9f1AKwe1JOXQwtX0lS92AuaVC Db27f/JBvglbfV75DJ+d8f/IBI8txbuJU9e3UhntDqixPwmGEShEbUQ6i+sAwGr3LtPC MTlYK13vBRI0+yAFsE0jYqln107e2cCcms4hlfx99QSpIupyjtGC5S82+z5bnLyxwnkt wpjmhpEFUA+ioefyFR+IYzY6oELS6lWqQZIuCmICEYc5wbUHgDSHAXS8LajVFjve3bHr peWUbda22PM6tcl1a/k9ts3Lx5L37FrxvjrysL0HQdZEFYH8YZJmjQTAAmwLPrqGDZvN kdUQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to :mime-version:user-agent:date:message-id:references:to:subject:from; bh=RBwVne/h9a3jvyLpt8UAWYTwIEGPxXTwNxoN3ykJHk8=; b=pNlgTWaH9o2WZjaHSD02dWCriiQ0T/eQyu3fpMEdHAsEBT50W2v1yWzb0LryWDuIW9 nHvLCpJ1UOwSMTa5CqR6ussAeDo8rI52djqUZp0JV/CHQoDKhPsMqaph9Tn7Ei+X8KHp QwlN9YMynixPfJ2yg5s3qnUc0zIBJNdFTXe5ERrERZt1m5FlToGQ9fs+DBsLBB0LhFL2 twgNDQW3NrW+0uLQ9PrtpjkV/fG0sSqn292WUP8vxinWX+OpxxPih/S5gWS5WXnYbkAe LIi0P3UJ9I8HqfiN7k5RXPnHuw2tK8ONvBrJJk41WbCgwGpri55QteBWE+X1KYVrdh5a J/Ug== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=easystack.cn Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id q13si8120312edd.419.2021.01.26.03.06.58; Tue, 26 Jan 2021 03:07:22 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=easystack.cn Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2404871AbhAZLEv (ORCPT + 99 others); Tue, 26 Jan 2021 06:04:51 -0500 Received: from m97179.mail.qiye.163.com ([220.181.97.179]:4991 "EHLO m97179.mail.qiye.163.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1729438AbhAZEcw (ORCPT ); Mon, 25 Jan 2021 23:32:52 -0500 Received: from [192.168.0.213] (unknown [218.94.118.90]) by m97179.mail.qiye.163.com (Hmail) with ESMTPA id A7F78E02884; Tue, 26 Jan 2021 12:31:55 +0800 (CST) From: Dongsheng Yang Subject: Re: [PATCH] bcache: dont reset bio opf in bch_data_insert_start To: Coly Li , linux-bcache@vger.kernel.org, linux-kernel@vger.kernel.org, mchristi@redhat.com References: <20210125042942.1087170-1-dongsheng.yang@easystack.cn> <7569abf3-3e54-986e-8307-751fa5e00828@suse.de> Message-ID: <17578d50-4113-8f25-827e-840fafb09d6f@easystack.cn> Date: Tue, 26 Jan 2021 12:32:01 +0800 User-Agent: Mozilla/5.0 (Windows NT 6.1; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.6.1 MIME-Version: 1.0 In-Reply-To: <7569abf3-3e54-986e-8307-751fa5e00828@suse.de> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit X-HM-Spam-Status: e1kfGhgUHx5ZQUtXWQgYFAkeWUFZS1VLWVdZKFlBSUI3V1ktWUFJV1kPCR oVCBIfWUFZGR5CT0lJQh4dHk0aVkpNSkpNSE5OSk5DTEJVGRETFhoSFyQUDg9ZV1kWGg8SFR0UWU FZT0tIVUpKS0hNSlVLWQY+ X-HM-Sender-Digest: e1kMHhlZQR0aFwgeV1kSHx4VD1lBWUc6NS46NTo5GT05FAgYHA8*IgEf IjFPCS1VSlVKTUpKTUhOTkpNSk9JVTMWGhIXVR8UFRwIEx4VHFUCGhUcOx4aCAIIDxoYEFUYFUVZ V1kSC1lBWUlKQ1VCT1VKSkNVQktZV1kIAVlBT01IQzcG X-HM-Tid: 0a773cf6618520bdkuqya7f78e02884 Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org 在 2021/1/25 星期一 下午 12:53, Coly Li 写道: > On 1/25/21 12:29 PM, Dongsheng Yang wrote: >> commit ad0d9e76(bcache: use bio op accessors) makes the bi_opf >> modified by bio_set_op_attrs(). But there is a logical >> problem in this commit: >> >> trace_bcache_cache_insert(k); >> bch_keylist_push(&op->insert_keys); >> >> - n->bi_rw |= REQ_WRITE; >> + bio_set_op_attrs(n, REQ_OP_WRITE, 0); >> bch_submit_bbio(n, op->c, k, 0); >> } while (n != bio); >> >> The old code add REQ_WRITE into bio n and keep other flags; the >> new code set REQ_OP_WRITE to bi_opf, but reset all other flags. >> >> This problem is discoverd in our performance testing: >> (1) start a fio with 1M x 128depth for read in /dev/nvme0n1p1 >> (2) start a fio with 1M x 128depth for write in /dev/escache0 (cache >> device is /dev/nvme0n1p2) >> >> We found the BW of reading is 2000+M/s, but the BW of writing is >> 0-100M/s. After some debugging, we found the problem is io submit in >> writting is very slow. >> >> bch_data_insert_start() insert a bio to /dev/nvme0n1p1, but as >> cached_dev submit stack bio will be added into current->bio_list, and >> return.Then __submit_bio_noacct() will submit the new bio in bio_list >> into /dev/nvme0n1p1. This operation would be slow in >> blk_mq_submit_bio() -> rq_qos_throttle(q, bio); >> >> The rq_qos_throttle() will call wbt_should_throttle(), >> static inline bool wbt_should_throttle(struct rq_wb *rwb, struct bio *bio) >> { >> switch (bio_op(bio)) { >> case REQ_OP_WRITE: >> /* >> * Don't throttle WRITE_ODIRECT >> */ >> if ((bio->bi_opf & (REQ_SYNC | REQ_IDLE)) == >> (REQ_SYNC | REQ_IDLE)) >> return false; >> ... ... >> } >> >> As the bio_set_op_attrs() reset the (REQ_SYNC | REQ_IDLE), so this write >> bio will be considered as non-direct write. >> >> After this fix, bio to nvme will flaged as (REQ_SYNC | REQ_IDLE), >> then fio for writing will get about 1000M/s bandwidth. >> >> Fixes: ad0d9e76a4124708dddd00c04fc4b56fc86c02d6 > It should be, > Fixes: ad0d9e76a412 ("bcache: use bio op accessors") > >> Signed-off-by: Dongsheng Yang > Please CC the fixed patch author Mike Christie. Hi Coly,     Should I send a V2 for commit message update? Or you can help to fix it when you take it from maillist? Thanx Yang >> --- >> drivers/md/bcache/request.c | 2 +- >> 1 file changed, 1 insertion(+), 1 deletion(-) >> >> diff --git a/drivers/md/bcache/request.c b/drivers/md/bcache/request.c >> index c7cadaafa947..eb734f7ddaac 100644 >> --- a/drivers/md/bcache/request.c >> +++ b/drivers/md/bcache/request.c >> @@ -244,7 +244,7 @@ static void bch_data_insert_start(struct closure *cl) >> trace_bcache_cache_insert(k); >> bch_keylist_push(&op->insert_keys); >> >> - bio_set_op_attrs(n, REQ_OP_WRITE, 0); >> + n->bi_opf |= REQ_OP_WRITE; >> bch_submit_bbio(n, op->c, k, 0); >> } while (n != bio); > The fix is OK to me, I'd like to see opinion from Mike Christie too. > > Thanks for the catch. > > Coly Li