Date: Thu, 18 Jan 2018 16:23:27 -0500
From: Mike Snitzer
To: Bart Van Assche
Cc: axboe@kernel.dk, dm-devel@redhat.com, hch@infradead.org, linux-kernel@vger.kernel.org, linux-block@vger.kernel.org, osandov@fb.com, ming.lei@redhat.com
Subject: Re: [RFC PATCH] blk-mq: fixup RESTART when queue becomes idle
Message-ID: <20180118212327.GB31679@redhat.com>
In-Reply-To: <1516309128.2676.38.camel@wdc.com>

On Thu, Jan 18 2018 at 3:58P -0500,
Bart Van Assche wrote:

> On Thu, 2018-01-18 at 15:48 -0500, Mike Snitzer wrote:
> > For Bart's test the underlying scsi-mq driver is what is regularly
> > hitting this case in __blk_mq_try_issue_directly():
> >
> >         if (blk_mq_hctx_stopped(hctx) || blk_queue_quiesced(q))
>
> Hello Mike,
>
> That code path is not the code path that triggered the lockups that I reported
> during the past days.

If you're hitting blk_mq_sched_insert_request() then you most certainly
are hitting that code path.

If you aren't then what was your earlier email going on about?
https://www.redhat.com/archives/dm-devel/2018-January/msg00372.html

If you were just focusing on that as one possible reason, that isn't
very helpful. By this point you really should _know_ what is triggering
the stall based on the code paths taken. Please use ftrace's
function_graph tracer if need be.

> These lockups were all triggered by incorrect handling of
> .queue_rq() returning BLK_STS_RESOURCE.

Please be precise, dm_mq_queue_rq()'s return of BLK_STS_RESOURCE?
"Incorrect" because it no longer runs blk_mq_delay_run_hw_queue()?

Please try to do more work analyzing the test case that only you can
easily run (due to srp_test being a PITA). And less time lobbying for
a change that you don't understand to _really_ be correct.

We have time to get this right, please stop hyperventilating about
"regressions".

Thanks,
Mike