Received: by 2002:a05:6a10:d5a5:0:0:0:0 with SMTP id gn37csp131625pxb; Wed, 6 Oct 2021 01:15:45 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxOsoKEL2ZA5gu0uyRyXhHw9Rmw1wjICh8+jJ9IT6hzLYYSnbeWpM8A2OgWYvMQg01tV4GD X-Received: by 2002:a63:4766:: with SMTP id w38mr7464587pgk.104.1633508145812; Wed, 06 Oct 2021 01:15:45 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1633508145; cv=none; d=google.com; s=arc-20160816; b=Y554zyln8MtNYimj3Jzt4bzHJA40R0Y0syHZoCXhWSXb17S1gDuFcwwRIXJ0K9IrbF y35VjhErElVFtYMdH+MgBH99hfCL0zFh8gkIbXPS/utE163xnwxcyeg9x/MXUPO+nmLe 46g8OCGLo9ZBLNsvJ3bhIExvjGpLoUeo6BoVRdG0LUEcO7EYL6MZrrOlVnsOB/8F5/Nz eDai67mmk4q321Q29qRMOXzHhCtTb6boVKplv63b8EZTB/n20nu6P5xuCIbHzAz4ATZO DkKr3wRsh5XLB+l21FJQrdNWhxc1mlVggzn+nkbo2QUwh2O4L3f0Td/YE+rVwx/fny33 x3mg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject; bh=LMSIooSNnkU1lxAhYBQkhA9NoX3lypLxa1UlaastGKU=; b=MW5BNfA4+z6zrPLjdE1eyZTbVOgww66Nj/cwS/cI5ysRPozavFaFiNvwj2J32XHowg Gg+6Y44GxOMrk9nZ9nyBMPqHV3GalpzQoUpMV67SzwvlRphKLi+cD3izxLome9RjCLvZ cbgnrWKFdM0QYdTt2imIrzH8QCmXzBaI65U3fc0njRzp8M/cQEwgLRGAFI/5vE5CUb68 4Wo18p6GsvwAwpyqioePMZBqu91jTqwLiu6OO6HxoCMfhJZ/bnP6MJMMOFegeRKdbSYp fc9UCMBziY1EaxX4HnXPR2IwgV16MTo/H9Onbb5L2eph7P8R85KD7+9TtU//kuWZcZdf 0lzw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=huawei.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id c15si25072532pgi.9.2021.10.06.01.15.28; Wed, 06 Oct 2021 01:15:45 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=huawei.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S237687AbhJFIQa (ORCPT + 99 others); Wed, 6 Oct 2021 04:16:30 -0400 Received: from frasgout.his.huawei.com ([185.176.79.56]:3936 "EHLO frasgout.his.huawei.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S237638AbhJFIQ3 (ORCPT ); Wed, 6 Oct 2021 04:16:29 -0400 Received: from fraeml738-chm.china.huawei.com (unknown [172.18.147.207]) by frasgout.his.huawei.com (SkyGuard) with ESMTP id 4HPRx70hmjz67WRt; Wed, 6 Oct 2021 16:11:23 +0800 (CST) Received: from lhreml724-chm.china.huawei.com (10.201.108.75) by fraeml738-chm.china.huawei.com (10.206.15.219) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2308.8; Wed, 6 Oct 2021 10:14:33 +0200 Received: from [10.47.95.252] (10.47.95.252) by lhreml724-chm.china.huawei.com (10.201.108.75) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_CBC_SHA256) id 15.1.2308.8; Wed, 6 Oct 2021 09:14:32 +0100 Subject: Re: [PATCH] scsi: storvsc: Cap scsi_driver.can_queue to fix a hang issue during boot To: , , , , , , , , , , CC: , References: <20211006070345.51713-1-decui@microsoft.com> From: John Garry Message-ID: Date: Wed, 6 Oct 2021 09:17:09 +0100 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101 Thunderbird/68.12.1 MIME-Version: 1.0 In-Reply-To: <20211006070345.51713-1-decui@microsoft.com> Content-Type: text/plain; charset="utf-8"; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.47.95.252] X-ClientProxiedBy: lhreml709-chm.china.huawei.com (10.201.108.58) To lhreml724-chm.china.huawei.com (10.201.108.75) X-CFilter-Loop: Reflected Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 06/10/2021 08:03, Dexuan Cui wrote: > After commit ea2f0f77538c, a 416-CPU VM running on Hyper-V hangs during > boot because scsi_add_host_with_dma() sets shost->cmd_per_lun to a > negative number: > 'max_outstanding_req_per_channel' is 352, > 'max_sub_channels' is (416 - 1) / 4 = 103, so in storvsc_probe(), > scsi_driver.can_queue = 352 * (103 + 1) * (100 - 10) / 100 = 32947, which > is bigger than SHRT_MAX (i.e. 32767). Out of curiosity, are these values realistic? You're capping can_queue just because of a data size issue, so, if these values are realistic, seems a weak reason. > > Fix the hang issue by capping scsi_driver.can_queue. > > Add the below Fixed tag though ea2f0f77538c itself is good. > > Fixes: ea2f0f77538c ("scsi: core: Cap scsi_host cmd_per_lun at can_queue") > Cc: stable@vger.kernel.org > Signed-off-by: Dexuan Cui > --- > drivers/scsi/storvsc_drv.c | 10 ++++++++++ > 1 file changed, 10 insertions(+) > > diff --git a/drivers/scsi/storvsc_drv.c b/drivers/scsi/storvsc_drv.c > index ebbbc1299c62..ba374908aec2 100644 > --- a/drivers/scsi/storvsc_drv.c > +++ b/drivers/scsi/storvsc_drv.c > @@ -1976,6 +1976,16 @@ static int storvsc_probe(struct hv_device *device, > (max_sub_channels + 1) * > (100 - ring_avail_percent_lowater) / 100; > > + /* > + * v5.14 (see commit ea2f0f77538c) implicitly requires that > + * scsi_driver.can_queue should not exceed SHRT_MAX, otherwise > + * scsi_add_host_with_dma() sets shost->cmd_per_lun to a negative > + * number (note: the type of the "cmd_per_lun" field is "short"), and > + * the system may hang during early boot. > + */ The different data sizes for cmd_per_lun and can_queue are problematic here. I'd be more inclined to set cmd_per_lun to the same data size as can_queue. We did discuss this when ea2f0f77538c was upstreamed (actually it was the other way around - setting can_queue to 16b). Thanks, John > + if (scsi_driver.can_queue > SHRT_MAX) > + scsi_driver.can_queue = SHRT_MAX; > + > host = scsi_host_alloc(&scsi_driver, > sizeof(struct hv_host_device)); > if (!host) >