From: Sasha Levin
To: linux-kernel@vger.kernel.org, stable@vger.kernel.org
Cc: Qi Liu, John Garry, "Martin K . Petersen", Sasha Levin, jejb@linux.ibm.com, linux-scsi@vger.kernel.org
Subject: [PATCH AUTOSEL 5.16 37/52] scsi: hisi_sas: Prevent parallel FLR and controller reset
Date: Mon, 17 Jan 2022 11:58:38 -0500
Message-Id: <20220117165853.1470420-37-sashal@kernel.org>
X-Mailer: git-send-email 2.34.1
In-Reply-To: <20220117165853.1470420-1-sashal@kernel.org>
References: <20220117165853.1470420-1-sashal@kernel.org>
MIME-Version: 1.0
X-stable: review
X-Patchwork-Hint: Ignore
Content-Transfer-Encoding: 8bit
Precedence: bulk
X-Mailing-List: linux-kernel@vger.kernel.org

From: Qi Liu

[ Upstream commit 16775db613c2bdea09705dcb876942c0641a1098 ]

If we issue a controller reset command while an FLR is executing, a hung
task may be found:

 Call trace:
  __switch_to+0x158/0x1cc
  __schedule+0x2e8/0x85c
  schedule+0x7c/0x110
  schedule_timeout+0x190/0x1cc
  __down+0x7c/0xd4
  down+0x5c/0x7c
  hisi_sas_task_exec+0x510/0x680 [hisi_sas_main]
  hisi_sas_queue_command+0x24/0x30 [hisi_sas_main]
  smp_execute_task_sg+0xf4/0x23c [libsas]
  sas_smp_phy_control+0x110/0x1e0 [libsas]
  transport_sas_phy_reset+0xc8/0x190 [libsas]
  phy_reset_work+0x2c/0x40 [libsas]
  process_one_work+0x1dc/0x48c
  worker_thread+0x15c/0x464
  kthread+0x160/0x170
  ret_from_fork+0x10/0x18

This is a race condition which occurs when the FLR completes first.

Here the host HISI_SAS_RESETTING_BIT flag gets out of sync, as
HISI_SAS_RESETTING_BIT is not always cleared with hisi_hba.sem held. So
now only set/clear HISI_SAS_RESETTING_BIT under hisi_hba.sem.

Link: https://lore.kernel.org/r/1639579061-179473-7-git-send-email-john.garry@huawei.com
Signed-off-by: Qi Liu
Signed-off-by: John Garry
Signed-off-by: Martin K. Petersen
Signed-off-by: Sasha Levin
---
 drivers/scsi/hisi_sas/hisi_sas_main.c  | 8 +++++---
 drivers/scsi/hisi_sas/hisi_sas_v3_hw.c | 1 +
 2 files changed, 6 insertions(+), 3 deletions(-)

diff --git a/drivers/scsi/hisi_sas/hisi_sas_main.c b/drivers/scsi/hisi_sas/hisi_sas_main.c
index f206c433de325..8a13bc08d6575 100644
--- a/drivers/scsi/hisi_sas/hisi_sas_main.c
+++ b/drivers/scsi/hisi_sas/hisi_sas_main.c
@@ -1581,7 +1581,6 @@ void hisi_sas_controller_reset_prepare(struct hisi_hba *hisi_hba)
 {
 	struct Scsi_Host *shost = hisi_hba->shost;
 
-	down(&hisi_hba->sem);
 	hisi_hba->phy_state = hisi_hba->hw->get_phys_state(hisi_hba);
 
 	scsi_block_requests(shost);
@@ -1606,9 +1605,9 @@ void hisi_sas_controller_reset_done(struct hisi_hba *hisi_hba)
 	if (hisi_hba->reject_stp_links_msk)
 		hisi_sas_terminate_stp_reject(hisi_hba);
 	hisi_sas_reset_init_all_devices(hisi_hba);
-	up(&hisi_hba->sem);
 	scsi_unblock_requests(shost);
 	clear_bit(HISI_SAS_RESETTING_BIT, &hisi_hba->flags);
+	up(&hisi_hba->sem);
 
 	hisi_sas_rescan_topology(hisi_hba, hisi_hba->phy_state);
 }
@@ -1619,8 +1618,11 @@ static int hisi_sas_controller_prereset(struct hisi_hba *hisi_hba)
 	if (!hisi_hba->hw->soft_reset)
 		return -1;
 
-	if (test_and_set_bit(HISI_SAS_RESETTING_BIT, &hisi_hba->flags))
+	down(&hisi_hba->sem);
+	if (test_and_set_bit(HISI_SAS_RESETTING_BIT, &hisi_hba->flags)) {
+		up(&hisi_hba->sem);
 		return -1;
+	}
 
 	if (hisi_sas_debugfs_enable && hisi_hba->debugfs_itct[0].itct)
 		hisi_hba->hw->debugfs_snapshot_regs(hisi_hba);
diff --git a/drivers/scsi/hisi_sas/hisi_sas_v3_hw.c b/drivers/scsi/hisi_sas/hisi_sas_v3_hw.c
index 0ef6c21bf0811..11a44d9dd9b2d 100644
--- a/drivers/scsi/hisi_sas/hisi_sas_v3_hw.c
+++ b/drivers/scsi/hisi_sas/hisi_sas_v3_hw.c
@@ -4848,6 +4848,7 @@ static void hisi_sas_reset_prepare_v3_hw(struct pci_dev *pdev)
 	int rc;
 
 	dev_info(dev, "FLR prepare\n");
+	down(&hisi_hba->sem);
 	set_bit(HISI_SAS_RESETTING_BIT, &hisi_hba->flags);
 	hisi_sas_controller_reset_prepare(hisi_hba);
 
-- 
2.34.1
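
For readers who want to see the locking rule in isolation, here is a minimal
userspace sketch of the ordering this patch establishes: take the semaphore
first, set the resetting flag only while holding it, and clear the flag before
releasing it. This is not driver code. Every name below (hba_model, toy_down,
toy_up, model_controller_prereset, model_flr_prepare, model_reset_done,
model_demo) is hypothetical, and the semaphore is modelled as a spinning
C11 atomic, whereas the kernel's down() sleeps.

```c
/* Userspace model of the locking rule only; all names are hypothetical. */
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

struct hba_model {
	atomic_int sem;        /* toy binary semaphore: 1 = free, 0 = held */
	atomic_bool resetting; /* stands in for HISI_SAS_RESETTING_BIT */
};

static void hba_model_init(struct hba_model *hba)
{
	atomic_init(&hba->sem, 1);
	atomic_init(&hba->resetting, false);
}

/* Toy down(): spins until the count is 1, then takes it (assumption:
 * spinning replaces the sleep that the kernel's down() would do). */
static void toy_down(struct hba_model *hba)
{
	int expected = 1;

	while (!atomic_compare_exchange_weak(&hba->sem, &expected, 0))
		expected = 1;
}

/* Toy up(): marks the semaphore free again. */
static void toy_up(struct hba_model *hba)
{
	atomic_store(&hba->sem, 1);
}

/* Controller reset entry: semaphore first, then test-and-set the flag. */
static int model_controller_prereset(struct hba_model *hba)
{
	toy_down(hba);
	if (atomic_exchange(&hba->resetting, true)) {
		/* Mirrors the patch: back out and release the semaphore. */
		toy_up(hba);
		return -1;
	}
	return 0; /* semaphore stays held until model_reset_done() */
}

/* FLR entry: same order, semaphore before flag. */
static void model_flr_prepare(struct hba_model *hba)
{
	toy_down(hba);
	atomic_store(&hba->resetting, true);
}

/* Shared completion path: clear the flag, then release the semaphore. */
static void model_reset_done(struct hba_model *hba)
{
	atomic_store(&hba->resetting, false);
	toy_up(hba);
}

/* Runs an FLR followed by a controller reset; returns 0 if the flag and
 * the semaphore stayed in step throughout. */
static int model_demo(void)
{
	struct hba_model hba;

	hba_model_init(&hba);

	model_flr_prepare(&hba);
	assert(atomic_load(&hba.sem) == 0 && atomic_load(&hba.resetting));
	model_reset_done(&hba);
	assert(atomic_load(&hba.sem) == 1 && !atomic_load(&hba.resetting));

	if (model_controller_prereset(&hba) != 0)
		return -1;
	model_reset_done(&hba);
	return atomic_load(&hba.resetting) ? -1 : 0;
}
```

In this model the flag can only be observed set by a path that is also holding
the semaphore, so a second reset path blocks in toy_down() instead of racing
on the flag. Whether the real driver has other users of the flag is outside
what this sketch covers.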