Received: by 2002:ac0:a874:0:0:0:0:0 with SMTP id c49csp260400ima; Fri, 15 Mar 2019 02:10:21 -0700 (PDT) X-Google-Smtp-Source: APXvYqwD5lHZ7XgmzcszQZSL4LlcfmGAhxyEcAitFsBL5HjIGYAI25sU0/m1YB/xpu6lpOG5F+J9 X-Received: by 2002:a17:902:7c8f:: with SMTP id y15mr3070037pll.44.1552641021420; Fri, 15 Mar 2019 02:10:21 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1552641021; cv=none; d=google.com; s=arc-20160816; b=PAYzkR+wEceQHRph5MD6rLtQ1GluqgTFBUcSmlOjAY2odlos/0zno9y5chZkpl3CZc iWuXVEOWFRqQykvlqS/jCrOkU0O09Ct5R0/fTAlZ8VMfRcJBc6Jc+MvbfBkgtJMfZygD rirWW8WxDubPDF0JQE7+W526853O2UYpIDf+gj0dt5aPfUqYQ/Z/CAKrmXJeVsOecyQM 55lkXip8CM/2HKrkGWcziLdwLBDYI7+SvOeU2q7fM9rIvEwYs9XHAW1+jnzVa/5alQWK 8lmqe4Z/22S5f07gnlGss+tQj52qAw8iF/r9dxSyv8KC6s5ac1d5wWusd3O8I+Br8Pgb LS+A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:date:subject:cc:to:from :dkim-signature; bh=ooKzMHXjxB1hW4mS22uZ/3/Gq8wyTQbRb/0fnDdxmdE=; b=dTzpqCzvTlEbwIo9UoTo/fdd373aB9NbZG4SEuQzHsOeGbs8kU2Bz4L9udGhz+Gbz+ NTLXD2IfjhCHTkYAYE6XKnrv9RSEm8SHFD64A2WxNCLdBojFAdLpF/wMOKYk/L/M8oCm Vvrc97Opu6e2yT9enku12TWb/tJ4h5YjsvxaxbZywXth6XXGb6L5GTL2HKLw8LLiNUwx rsycDXhKOyBqqucDZOgrIvHUa87qi73nb5zVBBP0qT7z3c9E/fYFgtCcjwDMmeDEKEn7 6omlSSiLiQQW0Ikuzy54jTp0mLdf0h+L9jrAO25Hrk0m0eqEvPhVoe36ib8crZvFbZtI /evA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@oracle.com header.s=corp-2018-07-02 header.b="afodna/o"; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=oracle.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id f30si1369418pgl.340.2019.03.15.02.10.06; Fri, 15 Mar 2019 02:10:21 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@oracle.com header.s=corp-2018-07-02 header.b="afodna/o"; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=oracle.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728564AbfCOJGh (ORCPT + 99 others); Fri, 15 Mar 2019 05:06:37 -0400 Received: from userp2120.oracle.com ([156.151.31.85]:40494 "EHLO userp2120.oracle.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727501AbfCOJGh (ORCPT ); Fri, 15 Mar 2019 05:06:37 -0400 Received: from pps.filterd (userp2120.oracle.com [127.0.0.1]) by userp2120.oracle.com (8.16.0.27/8.16.0.27) with SMTP id x2F8xHq8027924; Fri, 15 Mar 2019 09:06:20 GMT DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=oracle.com; h=from : to : cc : subject : date : message-id; s=corp-2018-07-02; bh=ooKzMHXjxB1hW4mS22uZ/3/Gq8wyTQbRb/0fnDdxmdE=; b=afodna/opPv022Ix7KYXTzEzAm00+2z1BrogluY1JNaUEaEt0/RWrcDCD8omkfmLna1X P2VC9FKonyna4rKupgiy4aLSlA/UP5UQcJjyy13AEnDj2PrEkQsbOrazBS4JykfymguQ JY5B9ptt9z1z2mNSgsp6kBhetadJheDvfFtFylfFlAaqox4aM9AL3lpbhcKUQz8fxskd 4HXmIMc7t1JjMW9BIhG7b07hpZ8IqZQafBzEBn/cXQs3S+uPO2H2OdOmqCO13/+fbBDh ufPIOfOz8ZdyqjcciwdCBzcsqWjxRdYgkKcGEdYH769Wuz4+cXGTfZgVyY8IXBDoX7DH gg== Received: from aserv0021.oracle.com (aserv0021.oracle.com [141.146.126.233]) by userp2120.oracle.com with ESMTP id 2r464rwhsu-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Fri, 15 Mar 2019 09:06:19 +0000 Received: from userv0121.oracle.com (userv0121.oracle.com [156.151.31.72]) by aserv0021.oracle.com (8.14.4/8.14.4) with ESMTP id x2F96DmS007350 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Fri, 15 Mar 2019 09:06:13 GMT Received: from abhmp0015.oracle.com (abhmp0015.oracle.com [141.146.116.21]) by userv0121.oracle.com (8.14.4/8.13.8) with ESMTP id x2F968CE005175; Fri, 15 Mar 2019 09:06:09 GMT Received: from will-ThinkCentre-M93p.cn.oracle.com (/10.182.71.12) by default (Oracle Beehive Gateway v4.0) with ESMTP ; Fri, 15 Mar 2019 09:06:08 +0000 From: Jianchao Wang To: axboe@kernel.dk Cc: hch@lst.de, jthumshirn@suse.de, hare@suse.de, josef@toxicpanda.com, bvanassche@acm.org, sagi@grimberg.me, keith.busch@intel.com, jsmart2021@gmail.com, linux-block@vger.kernel.org, linux-nvme@lists.infradead.org, linux-kernel@vger.kernel.org Subject: [PATCH 0/8]: blk-mq: use static_rqs to iterate busy tags Date: Fri, 15 Mar 2019 16:57:36 +0800 Message-Id: <1552640264-26101-1-git-send-email-jianchao.w.wang@oracle.com> X-Mailer: git-send-email 2.7.4 X-Proofpoint-Virus-Version: vendor=nai engine=5900 definitions=9195 signatures=668685 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 priorityscore=1501 malwarescore=0 suspectscore=0 phishscore=0 bulkscore=0 spamscore=0 clxscore=1015 lowpriorityscore=0 mlxscore=0 impostorscore=0 mlxlogscore=868 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1810050000 definitions=main-1903150067 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Jens As we know, there is a risk of accesing stale requests when iterate in-flight requests with tags->rqs[] and this has been talked in following thread, [1] https://marc.info/?l=linux-scsi&m=154511693912752&w=2 [2] https://marc.info/?l=linux-block&m=154526189023236&w=2 A typical sence could be blk_mq_get_request blk_mq_queue_tag_busy_iter -> blk_mq_get_tag -> bt_for_each -> bt_iter -> rq = taags->rqs[] -> rq->q -> blk_mq_rq_ctx_init -> data->hctx->tags->rqs[rq->tag] = rq; The root cause is that there is a window between set bit on tag sbitmap and set tags->rqs[]. This patch would fix this issue by iterating requests with tags->static_rqs[] instead of tags->rqs[] which would be changed dynamically. Moreover, we will try to get a non-zero q_usage_counter before access hctxs and tags and thus could avoid the race with updating nr_hw_queues, switching io scheduler and even queue clean up which are all under a frozen and drained queue. The 1st patch get rid of the useless of synchronize_rcu in __blk_mq_update_nr_hw_queues The 2nd patch modify the blk_mq_queue_tag_busy_iter to use tags->static_rqs[] instead of tags->rqs[] to iterate the busy tags. The 3rd ~ 7th patch change the blk_mq_tagset_busy_iter to blk_mq_queue_tag_busy_iter which is safer The 8th patch get rid of the blk_mq_tagset_busy_iter. Jianchao Wang(8) blk-mq: get rid of the synchronize_rcu in blk-mq: change the method of iterating busy tags of a blk-mq: use blk_mq_queue_tag_busy_iter in debugfs mtip32xx: use blk_mq_queue_tag_busy_iter nbd: use blk_mq_queue_tag_busy_iter skd: use blk_mq_queue_tag_busy_iter nvme: use blk_mq_queue_tag_busy_iter blk-mq: remove blk_mq_tagset_busy_iter diff stat block/blk-mq-debugfs.c | 4 +- block/blk-mq-tag.c | 173 +++++++++++++++++++++++++------------------------------------------------------------- block/blk-mq-tag.h | 2 - block/blk-mq.c | 35 ++++++------------ drivers/block/mtip32xx/mtip32xx.c | 8 ++-- drivers/block/nbd.c | 2 +- drivers/block/skd_main.c | 4 +- drivers/nvme/host/core.c | 12 ++++++ drivers/nvme/host/fc.c | 12 +++--- drivers/nvme/host/nvme.h | 2 + drivers/nvme/host/pci.c | 5 ++- drivers/nvme/host/rdma.c | 6 +-- drivers/nvme/host/tcp.c | 5 ++- drivers/nvme/target/loop.c | 6 +-- include/linux/blk-mq.h | 7 ++-- 15 files changed, 105 insertions(+), 178 deletions(- Thanks Jianchao