Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp1989877imm; Thu, 2 Aug 2018 04:36:18 -0700 (PDT) X-Google-Smtp-Source: AAOMgpenXSz0hJ8PYq6Qkp3lT8V7rEub3qgya5oGesooccXhf0nP6D1mPih4eczqdck6YS+uARec X-Received: by 2002:a62:a649:: with SMTP id t70-v6mr2577842pfe.149.1533209778850; Thu, 02 Aug 2018 04:36:18 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1533209778; cv=none; d=google.com; s=arc-20160816; b=FHrp9zOjqF3Vxup8SfcZUyk3JykGfNRdSS5Krj6C6/gbu48uj9oqWXFzsBrvPFKM0I 9bQAPgBdLQ3SCs+CP+MHstiaFCSrpDp3HIz7yTh7aSoK5bJcb5Gkr0/eyusqibFwI6lw fVVyWYVQ0ShqwqigopgJF+JJAKdA+12Haii5kTrbKecv5QSR1gqLQjvrfZVuNZhqIDtR kSpHfirGGnTL91K8RaBPksZc1m+z6Ib1991A1f/0x0rE4oqcgCxxWfiKWo3/5sua4Ye0 IsItm1HtQ1O/o9Igof2BcD6pE89ds03SUb2A15fg+E1lSIPHEtuc2rUWCMoutj9hPtAy qXEg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :references:in-reply-to:mime-version:dkim-signature :arc-authentication-results; bh=/gmNL6tAXvmgXdEE7bSQLMRVYhDXkHJ9oDbrJtcrpc8=; b=Mg1CU5EIEQqfpQ5lK2EWcWWYGC2f1GX3j59EQiLHme3Cc53lpx9Nc4Yac2Il9RDe54 O0BzNc+ygiDNnfYC/bs1s8V6xkALt+DSnsCxpcT7VtE445xt3ZDxc3Dl7K34TOnVFmQm qNoiU+Ni8gJqTfV6c1csq8BW3YbBCFjOBjFMDcv+u6U5NO64YsaQdg15Hi3KpFHcje1h 6sjQ0Pl4RFNdaJUVajTTP1MpnI4bxH44YHzwii/+RAHFp3kXER31ioMaePqTraLD6B80 +fwbnigUnRTEJhrWftobg7yD5rtQH90pp0fiLwKUQaHZ/u6OlGk9vjaBEYhMs2cEdaId g5EQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=tjSBYJyQ; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 83-v6si2276836pgg.588.2018.08.02.04.36.03; Thu, 02 Aug 2018 04:36:18 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=tjSBYJyQ; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1732191AbeHBN0A (ORCPT + 99 others); Thu, 2 Aug 2018 09:26:00 -0400 Received: from mail-wr1-f67.google.com ([209.85.221.67]:37812 "EHLO mail-wr1-f67.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1731895AbeHBN0A (ORCPT ); Thu, 2 Aug 2018 09:26:00 -0400 Received: by mail-wr1-f67.google.com with SMTP id u12-v6so1772681wrr.4; Thu, 02 Aug 2018 04:35:14 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc; bh=/gmNL6tAXvmgXdEE7bSQLMRVYhDXkHJ9oDbrJtcrpc8=; b=tjSBYJyQP6n/o8tBfcrUNtRxUveSqVe3i8v3sZd9893rfUckIIaX60iwpqbPWABdR8 PdIQknywx2YtX0YNAZ8eqwfnn6CqrGXf0N1sdaRHthU0G5J1laOALjmQm4KY9JTNHOgV 35BciS8j6gnd4mCByyBZB6XB7PeYLMlySSmpASgaQO1LyNfr0NxZx3pJkB0c036YcVGj AEEKT3APOFq+Rnn9DStl9bd8lnlvCycifz+cApwKzW7IjbZfqwaumy7Dmq2zlc8uHaGl Mku5jYMBJo6Fg1c5F3n4L7o6E3//3TagjQk3Wr9ox0M2GGQdSslnZj478CCK+qY0WqLm sSaw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=/gmNL6tAXvmgXdEE7bSQLMRVYhDXkHJ9oDbrJtcrpc8=; b=gMj6HKM1jttj15F0rsk2pKNfatiW/+F9gM4ZG9bNiiJ4uUHkb5ShdLrkOtWSrV6hYX H5NevEPPma5XhxLCXyOTfFc5gqoCJx93u8/eHdeAxmE/i+jC/nTpt1pF6qckikpDDl1i hQuGt2FT4bccygWDOlx1DVPaHTOvsROuHsiQwO6BZmZ4b06Kx5AB8tKM5ImRTIt2T+Uu DbDIcTu1JC2+u3wvRKcaxQDxmp9d1vjZHULtecELgn+MF6MI4TlXQ2xa0xgSmay0x8Zl CeE9WoMxiLJH/PqeyN12DN1t8bpBwd0+Bh+DxKCvlP2f7SxE0iQKlhi20dgftlPJx+zy TsiA== X-Gm-Message-State: AOUpUlE2iYOTXV6YE2THdCLNJBgucTudJzeyG2S7WU9xSmp3dCeuV0eL YUhU3CZU+WkpQfFwCNWPAZd/MbyrVv4yMn4/t2w= X-Received: by 2002:a5d:5201:: with SMTP id j1-v6mr1691569wrv.198.1533209713441; Thu, 02 Aug 2018 04:35:13 -0700 (PDT) MIME-Version: 1.0 Received: by 2002:a1c:9c8:0:0:0:0:0 with HTTP; Thu, 2 Aug 2018 04:35:12 -0700 (PDT) In-Reply-To: <171b2cdc-2e74-2b3c-e5f5-c656a196601a@roeck-us.net> References: <20180801175852.36549130@canb.auug.org.au> <20180801224813.GA13074@roeck-us.net> <1533163965.3158.1.camel@HansenPartnership.com> <20180801234727.GA3762@roeck-us.net> <1533168205.3158.12.camel@HansenPartnership.com> <171b2cdc-2e74-2b3c-e5f5-c656a196601a@roeck-us.net> From: Ming Lei Date: Thu, 2 Aug 2018 19:35:12 +0800 Message-ID: Subject: Re: linux-next: Tree for Aug 1 To: Guenter Roeck , linux-ide@vger.kernel.org, Tejun Heo Cc: James Bottomley , Stephen Rothwell , Linux-Next Mailing List , Linux Kernel Mailing List , linux-scsi , Ming Lei Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Aug 2, 2018 at 12:58 PM, Guenter Roeck wrote: > On 08/01/2018 05:03 PM, James Bottomley wrote: >> >> On Thu, 2018-08-02 at 07:57 +0800, Ming Lei wrote: >>> >>> On Thu, Aug 2, 2018 at 7:47 AM, Guenter Roeck >>> wrote: >>>> >>>> On Wed, Aug 01, 2018 at 03:52:45PM -0700, James Bottomley wrote: >>>>> >>>>> On Wed, 2018-08-01 at 15:48 -0700, Guenter Roeck wrote: >>>>>> >>>>>> On Wed, Aug 01, 2018 at 05:58:52PM +1000, Stephen Rothwell >>>>>> wrote: >>>>>>> >>>>>>> Hi all, >>>>>>> >>>>>>> Changes since 20180731: >>>>>>> >>>>>>> The pci tree gained a conflict against the pci-current tree. >>>>>>> >>>>>>> The net-next tree gained a conflict against the bpf tree. >>>>>>> >>>>>>> The block tree lost its build failure. >>>>>>> >>>>>>> The staging tree still had its build failure due to an >>>>>>> interaction >>>>>>> with >>>>>>> the vfs tree for which I disabled CONFIG_EROFS_FS. >>>>>>> >>>>>>> The kspp tree lost its build failure. >>>>>>> >>>>>>> Non-merge commits (relative to Linus' tree): 10070 >>>>>>> 9137 files changed, 417605 insertions(+), 179996 deletions(- >>>>>>> ) >>>>>>> >>>>>>> ----------------------------------------------------------- >>>>>>> ------ >>>>>>> ----------- >>>>>>> >>>>>> >>>>>> The widespread kernel hang issues are still seen. I managed >>>>>> to bisect it after working around the transient build failures. >>>>>> Bisect log is attached below. Unfortunately, it doesn't help >>>>>> much. >>>>>> The culprit is reported as: >>>>>> >>>>>> 2d542828c5e9 Merge remote-tracking branch 'scsi/for-next' >>>>>> >>>>>> The preceding merge, >>>>>> >>>>>> 453f1d821165 Merge remote-tracking branch 'cgroup/for-next' >>>>>> >>>>>> checks out fine, as does the tip of scsi-next (commit >>>>>> 103c7b7e0184, >>>>>> "Merge branch 'misc' into for-next"). No idea how to proceed. >>>>> >>>>> >>>>> This sounds like you may have a problem with this patch: >>>>> >>>>> commit d5038a13eca72fb216c07eb717169092e92284f1 >>>>> Author: Johannes Thumshirn >>>>> Date: Wed Jul 4 10:53:56 2018 +0200 >>>>> >>>>> scsi: core: switch to scsi-mq by default >>>>> >>>>> To verify, boot with the additional kernel parameter >>>>> >>>>> scsi_mod.use_blk_mq=0 >>>>> >>>>> Which will reverse the effect of the above patch. >>>>> >>>> >>>> Yes, that fixes the problem. >>> >>> >>> That may not the root cause, given this issue is only started to >>> see from next-20180731, but d5038a13eca7 (scsi: core: switch to >>> scsi-mq by default) >>> has been in -next for quite a while. >>> >>> Seems something new causes this issue. >> >> >> Read my other email about how to find this. >> >> https://marc.info/?l=linux-scsi&m=153316446223676 >> >> Now that we've confirmed the issue, Gunter, could you attempt to bisect >> it as that email describes? >> > > So, I am more and more baffled. > > I ran another round of bisect, this time each test executing twice, > once with "scsi_mod.use_blk_mq=1" and once with "scsi_mod.use_blk_mq=0", > requiring both to pass. Bisect still points to the merge as culprit. > > Ok, one step further: Actually _revert_ commit d5038a13eca72 before running > each test, meaning the default is use_blk_mq=0. Still run both tests. > Bisect _still_ points to the merge of scsi-next as culprit. > > So, to me it looks like the problem is triggered by _something_ in > scsi-next, combined with _something_ in -next prior to the merge, > not specifically associated with use_blk_mq=[0|1] or d5038a13eca72, > but to a combination of some patch in scsi-next and some other patch. Today I am a bit busy, and not trace it much. So far, I found the code hangs in scsi_test_unit_ready() <-get_capabilities()<-sr_probe(), and scsi_queue_rq()/ata_scsi_queuecmd() has queued the command successfully, but never completed. Also tried to revert commits merged to ata tree on 30th, 31th, but no difference. Thanks, Ming Lei