Received: by 2002:a05:6a10:a0d1:0:0:0:0 with SMTP id j17csp337128pxa; Wed, 5 Aug 2020 02:24:44 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyVqVfQQ0WxK3qNwTh/N2Haa8whgszGABV3Dd2P2+J7SNoe9AD/x68DoNLnH9cSquFR/NL+ X-Received: by 2002:a50:cd5c:: with SMTP id d28mr1891632edj.176.1596619484514; Wed, 05 Aug 2020 02:24:44 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1596619484; cv=none; d=google.com; s=arc-20160816; b=uwqOHMp97ambXfHNZ29WN7sFTZdKcu/nhcFLmB6kb/fXBCl/jR9fS3MIM8PMpddMCO 3P2/Q4UoERzT2G7T/2YrAsN+OSqDCpihBr4WPuu9j9CkVyZKhSqFi7M8sUTBolyEl4Kt BNPWXEtC8Vlc2BgBLQB3CeZ5gDuYhYqo79FYXeX/iVU+KmSIRVX7X8CcMfW2HA+f2Wig NxI5iQ5VSa842ssFC6pI0KHVyxxhsEzOLAQN4R77hxeUZ8gXkoVOY7zOMRYAOPXefPev q3pT6e+S2iCmeIApSt1yPRIB3h3qgQ5b0g1YXk7IiBtngkx0RGPjQZa5qTUPTdd+4oHm KI9Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:cc:to:subject :message-id:date:from:in-reply-to:references:mime-version :dkim-signature; bh=k3yjqFYKdJesN5lripMjBQkzUelOUOp0oDH31apSxZA=; b=qMPUJmldwtx+h0KwtBZo+PRQWxBUf0B/7GgVRdydIjhZHZK6W/wjTLN96RutydMJsP GnKb201zGTys/DTdpdJzF1vz/teRes/wr6hgApjbn2nyapZADB6iHex5HQLeF3fLVf2D duO6cQF9DIGGptF9hpBk59xxerXieDBnEuOs4q3h5yfhYiPcIHueldGJ350nzY9RaWQ5 IR7BQEy/V/HEuuij1l7s9ecOZ5iqe5R0+2fIeQvyvPYchhvs2fEqOB6WrIGS2HqehKft ZPe4fpvA+dWPNXRBbAWIUA9AFrJjL5pAkxBbLXSvgUZGgRUOEaiMynEpcw0f0Ap76r6p 6YVw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=okZSXdw6; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id lt11si798677ejb.436.2020.08.05.02.24.21; Wed, 05 Aug 2020 02:24:44 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=okZSXdw6; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728170AbgHEJXr (ORCPT + 99 others); Wed, 5 Aug 2020 05:23:47 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:34096 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725809AbgHEJXD (ORCPT ); Wed, 5 Aug 2020 05:23:03 -0400 Received: from mail-lj1-x242.google.com (mail-lj1-x242.google.com [IPv6:2a00:1450:4864:20::242]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id A1AD1C06174A; Wed, 5 Aug 2020 02:23:02 -0700 (PDT) Received: by mail-lj1-x242.google.com with SMTP id v4so37180396ljd.0; Wed, 05 Aug 2020 02:23:02 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc:content-transfer-encoding; bh=k3yjqFYKdJesN5lripMjBQkzUelOUOp0oDH31apSxZA=; b=okZSXdw6Y0ttXICPSCmW6GXQBbfYHEVTDCOJQarVL0Y1p6OaYWConGm15iukUXTi7v z6XGAoeP2We2TueeUobi5Z1hm0fx/RJA07FQbZWcZklj/XLTyO/UtfdR7OPGyPyuekiN XNYHeP5aWq5NixbJosMl+3cPRJn30BEUb6+IBVYa4FFdS5uFvZWNovkJJLNkCMiZdi0D 5UVs/Jumc8C3tYBAJiFRMdv1Zo+vYj4N3dfealzZc92le6EHTsNpgbi4vG+XU6tNPi8j MsXDUUohVqbcEttpwJvdEV8yLhKtlX+loH4y8tPBJTqsNl61orKP14NbYB30pYxRCtQf oF1w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc:content-transfer-encoding; bh=k3yjqFYKdJesN5lripMjBQkzUelOUOp0oDH31apSxZA=; b=uWcZ3mXBBqQGkVFtGnB96FX86vpVXYzIzUGG1hN+jdb2kR9sgLxhl+5SHiMe2qxfgp mqVbRQZMQzjm8WYAEO39jKXUdSKbLf9sfKekpepY7fZF6TvIdP24GCBU97iJ24iBr+sB bQxScDaEYUnR0PXxmOsSa/+BNa9Frf3CDblwunG/p7u9opZoc6GitBJBAlh+s0+IVo+k c5jju2eql9XS843+Rnc+1IlNeYWG3UyIr98gCyVXajGFnO6kb4OCu4KKOY8YOhOi95rv 3iS92CaqeThvzm+CoTxf0bhAR/X7Nm6YFH2k8TMMyApx5NetoDe+iFsY+KHAj+wtIB/T hmlw== X-Gm-Message-State: AOAM5313F/TWmqev5xBOeSWRKJldCXDdAhFKv2KeY4A4a/y41rZ+k6mw +BLT42yyi9VrnokWlZixa0X+PMRfrhJCuQbIj+kUcQ== X-Received: by 2002:a2e:b4e5:: with SMTP id s5mr936003ljm.153.1596619381093; Wed, 05 Aug 2020 02:23:01 -0700 (PDT) MIME-Version: 1.0 References: <20200805063150.41037-1-colyli@suse.de> In-Reply-To: <20200805063150.41037-1-colyli@suse.de> From: Jinpu Wang Date: Wed, 5 Aug 2020 11:22:49 +0200 Message-ID: Subject: Re: [PATCH v3] block: check queue's limits.discard_granularity in __blkdev_issue_discard() To: Coly Li Cc: linux-block@vger.kernel.org, linux-bcache@vger.kernel.org, LKML , Ming Lei , Bart Van Assche , Christoph Hellwig , Enzo Matsumiya , Evan Green , Hannes Reinecke , Jens Axboe , "Martin K . Petersen" , Xiao Ni Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Coly, one question below: Coly Li =E4=BA=8E2020=E5=B9=B48=E6=9C=885=E6=97=A5=E5=91= =A8=E4=B8=89 =E4=B8=8A=E5=8D=888:36=E5=86=99=E9=81=93=EF=BC=9A > > If create a loop device with a backing NVMe SSD, current loop device > driver doesn't correctly set its queue's limits.discard_granularity and > leaves it as 0. If a discard request at LBA 0 on this loop device, in > __blkdev_issue_discard() the calculated req_sects will be 0, and a zero > length discard request will trigger a BUG() panic in generic block layer > code at block/blk-mq.c:563. > > [ 955.565006][ C39] ------------[ cut here ]------------ > [ 955.559660][ C39] invalid opcode: 0000 [#1] SMP NOPTI > [ 955.622171][ C39] CPU: 39 PID: 248 Comm: ksoftirqd/39 Tainted: G = E 5.8.0-default+ #40 > [ 955.622171][ C39] Hardware name: Lenovo ThinkSystem SR650 -[7X05CTO1= WW]-/-[7X05CTO1WW]-, BIOS -[IVE160M-2.70]- 07/17/2020 > [ 955.622175][ C39] RIP: 0010:blk_mq_end_request+0x107/0x110 > [ 955.622177][ C39] Code: 48 8b 03 e9 59 ff ff ff 48 89 df 5b 5d 41 5c= e9 9f ed ff ff 48 8b 35 98 3c f4 00 48 83 c7 10 48 83 c6 19 e8 cb 56 c9 ff= eb cb <0f> 0b 0f 1f 80 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 41 56 41 54 > [ 955.622179][ C39] RSP: 0018:ffffb1288701fe28 EFLAGS: 00010202 > [ 955.749277][ C39] RAX: 0000000000000001 RBX: ffff956fffba5080 RCX: 0= 000000000004003 > [ 955.749278][ C39] RDX: 0000000000000003 RSI: 0000000000000000 RDI: 0= 000000000000000 > [ 955.749279][ C39] RBP: 0000000000000000 R08: 0000000000000000 R09: 0= 000000000000000 > [ 955.749279][ C39] R10: ffffb1288701fd28 R11: 0000000000000001 R12: f= fffffffa8e05160 > [ 955.749280][ C39] R13: 0000000000000004 R14: 0000000000000004 R15: f= fffffffa7ad3a1e > [ 955.749281][ C39] FS: 0000000000000000(0000) GS:ffff95bfbda00000(00= 00) knlGS:0000000000000000 > [ 955.749282][ C39] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [ 955.749282][ C39] CR2: 00007f6f0ef766a8 CR3: 0000005a37012002 CR4: 0= 0000000007606e0 > [ 955.749283][ C39] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0= 000000000000000 > [ 955.749284][ C39] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0= 000000000000400 > [ 955.749284][ C39] PKRU: 55555554 > [ 955.749285][ C39] Call Trace: > [ 955.749290][ C39] blk_done_softirq+0x99/0xc0 > [ 957.550669][ C39] __do_softirq+0xd3/0x45f > [ 957.550677][ C39] ? smpboot_thread_fn+0x2f/0x1e0 > [ 957.550679][ C39] ? smpboot_thread_fn+0x74/0x1e0 > [ 957.550680][ C39] ? smpboot_thread_fn+0x14e/0x1e0 > [ 957.550684][ C39] run_ksoftirqd+0x30/0x60 > [ 957.550687][ C39] smpboot_thread_fn+0x149/0x1e0 > [ 957.886225][ C39] ? sort_range+0x20/0x20 > [ 957.886226][ C39] kthread+0x137/0x160 > [ 957.886228][ C39] ? kthread_park+0x90/0x90 > [ 957.886231][ C39] ret_from_fork+0x22/0x30 > [ 959.117120][ C39] ---[ end trace 3dacdac97e2ed164 ]--- > > This is the procedure to reproduce the panic, > # modprobe scsi_debug delay=3D0 dev_size_mb=3D2048 max_queue=3D1 Why do you need scsi_debug here, it's not relevant and not used? > # losetup -f /dev/nvme0n1 --direct-io=3Don > # blkdiscard /dev/loop0 -o 0 -l 0x200 > > This patch fixes the issue by checking q->limits.discard_granularity in > __blkdev_issue_discard() before composing the discard bio. If the value > is 0, then prints a warning oops information and returns -EOPNOTSUPP to > the caller to indicate that this buggy device driver doesn't support > discard request. The patch itself looks correct to me. > > Fixes: 9b15d109a6b2 ("block: improve discard bio alignment in __blkdev_is= sue_discard()") > Fixes: c52abf563049 ("loop: Better discard support for block devices") > Reported-and-suggested-by: Ming Lei > Signed-off-by: Coly Li > Reviewed-by: Ming Lei > Cc: Bart Van Assche > Cc: Christoph Hellwig > Cc: Enzo Matsumiya > Cc: Evan Green > Cc: Hannes Reinecke > Cc: Jens Axboe > Cc: Martin K. Petersen > Cc: Xiao Ni Reviewed-by: Jack Wang Thanks > --- > Changelog: > v3: print device name assocated with the buggy driver. > v2: fix typo of the wrong return error code. > v1: first version. > > block/blk-lib.c | 9 +++++++++ > 1 file changed, 9 insertions(+) > > diff --git a/block/blk-lib.c b/block/blk-lib.c > index 019e09bb9c0e..d3bbb3d9fac3 100644 > --- a/block/blk-lib.c > +++ b/block/blk-lib.c > @@ -47,6 +47,15 @@ int __blkdev_issue_discard(struct block_device *bdev, = sector_t sector, > op =3D REQ_OP_DISCARD; > } > > + /* In case the discard granularity isn't set by buggy device driv= er */ > + if (WARN_ON_ONCE(!q->limits.discard_granularity)) { > + char dev_name[BDEVNAME_SIZE]; > + > + bdevname(bdev, dev_name); > + pr_err("%s: Error: discard_granularity is 0.\n", dev_name= ); > + return -EOPNOTSUPP; > + } > + > bs_mask =3D (bdev_logical_block_size(bdev) >> 9) - 1; > if ((sector | nr_sects) & bs_mask) > return -EINVAL; > -- > 2.26.2 >