Received: by 2002:a05:6359:c8b:b0:c7:702f:21d4 with SMTP id go11csp528426rwb; Tue, 27 Sep 2022 00:28:18 -0700 (PDT) X-Google-Smtp-Source: AMsMyM7TzJRg++OHnDjNinhir17OSSOPHJlFCfSbwoDgXu1KMapBKOHoLw5j8yfr58C18k/t6RF/ X-Received: by 2002:a05:6402:35c5:b0:450:4b7d:9c49 with SMTP id z5-20020a05640235c500b004504b7d9c49mr26384026edc.149.1664263698218; Tue, 27 Sep 2022 00:28:18 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1664263698; cv=none; d=google.com; s=arc-20160816; b=xxqNCHOHwSrP4bj/Bnunh/vBAOu91OqmDa7u5srA0rwTW/Dpvpq4iIQaQp2Xi96Npd WwGWqcy2NPZe9XwSFs/i3DFTlm/eLUJGp/2Siz903W8XoyUHtv4LMFdX3Mjj7hmnA6ke W67Sf9jEZu4Dvlyujl7IiHG2hXWhs4x8/GVJNFvBXdCYNAedR6xIIqj/0uKtHIkIBi/7 yXogB6zBLQAezwwTk+IRvcLGxTXxClKimOuv1YuBVgwRG1glpK4FbuVwR6mdtT1QXsb5 TTCCBGEmcFeQgczY7beeVQWXgo+ZUB8ThJYk1RWTV/dO5Y3+Lfq+9X4Po7cHJX3rIBar GMlw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:mime-version:message-id:date:subject:cc:to:from; bh=ZgeJUBFkqD8rhMX45MJrX6fTdeazGmK6IWjdMA2PsfU=; b=N+Hk7JwrlqpXAUyNxuF7N8GbXw/+dByQ1XfVj9WAzRH1XAezxBMCjqkU8Eq0pNGp9h w83cb6gnGNJltybLe/sMddhjF3CbDQJY77VDGdLnRVBx5tmMPL+7K8QY9rbgDiGf+g1/ 1Lm53+RGVhMlqGs0QbpGDeOHy3kqhG1MWZORmo4zMNKNtLB2usrWm3FDGL33RW8bz3j7 yvy08vI09SDVzh/afTkAqctq+XqQHzsCrJ7dqemLtSFD3vlcsFL/HsY5DBilUhBQ6tdy R1CtuZE1CtgabJ6IH9kmIevRIQ/L/EklZmqLW2T2DkbCJWSyK8hFPQXmu+LQj2b0iUO9 cP3Q== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id t16-20020a056402525000b00457c123d802si408857edd.476.2022.09.27.00.27.53; Tue, 27 Sep 2022 00:28:18 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230243AbiI0HLe (ORCPT + 99 others); Tue, 27 Sep 2022 03:11:34 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:39504 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229967AbiI0HLc (ORCPT ); Tue, 27 Sep 2022 03:11:32 -0400 Received: from frasgout.his.huawei.com (frasgout.his.huawei.com [185.176.79.56]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id B6BC2A6C65; Tue, 27 Sep 2022 00:11:31 -0700 (PDT) Received: from fraeml709-chm.china.huawei.com (unknown [172.18.147.201]) by frasgout.his.huawei.com (SkyGuard) with ESMTP id 4Mc9lh4qPJz689NN; Tue, 27 Sep 2022 15:11:28 +0800 (CST) Received: from lhrpeml500003.china.huawei.com (7.191.162.67) by fraeml709-chm.china.huawei.com (10.206.15.37) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2375.31; Tue, 27 Sep 2022 09:11:29 +0200 Received: from localhost.localdomain (10.69.192.58) by lhrpeml500003.china.huawei.com (7.191.162.67) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2375.31; Tue, 27 Sep 2022 08:11:26 +0100 From: John Garry To: , , , CC: , , , , , "John Garry" Subject: [PATCH v5 0/7] libsas and drivers: NCQ error handling Date: Tue, 27 Sep 2022 15:04:51 +0800 Message-ID: <1664262298-239952-1-git-send-email-john.garry@huawei.com> X-Mailer: git-send-email 2.8.1 MIME-Version: 1.0 Content-Type: text/plain X-Originating-IP: [10.69.192.58] X-ClientProxiedBy: dggems702-chm.china.huawei.com (10.3.19.179) To lhrpeml500003.china.huawei.com (7.191.162.67) X-CFilter-Loop: Reflected X-Spam-Status: No, score=-4.2 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_MED, RCVD_IN_MSPIKE_H2,SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org As reported in [0], the pm8001 driver NCQ error handling more or less duplicates what libata does in link error handling, as follows: - abort all commands - do autopsy with read log ext 10 command - reset the target to recover, if necessary Indeed for the hisi_sas driver we want to add similar handling for NCQ errors. This series add a new libsas API - sas_ata_device_link_abort() - to handle host NCQ errors, and fixes up pm8001 and hisi_sas drivers to use it. A difference in the pm8001 driver NCQ error handling is that we send SATA_ABORT per-task prior to read log ext10, but I feel that this should not make a difference to the error handling. Damien kindly tested previous the series for pm8001, but any further pm8001 testing would be appreciated as I have since tweaked pm8001 handling again. This is because the pm8001 driver hangs on my arm64 machine read log ext10 command. Finally with these changes we can make the libsas task alloc/free APIs private, which they should always have been. Based on mkp-scsi @ 6.1/scsi-staging 57569c37f0ad ("scsi: iscsi: iscsi_tcp: Fix null-ptr-deref while calling getpeername()") [0] https://lore.kernel.org/linux-scsi/8fb3b093-55f0-1fab-81f4-e8519810a978@huawei.com/ Changes since v4: - Add Jason's tags (thanks) - Rebase Changes since v3: - Add Damien's tags (thanks) - Modify hisi_sas processing as follows: - use sas_task_abort() for rejected IO - Modify abort task processing to issue softreset in certain circumstances - rebase Changes since v2: - Stop sending SATA_ABORT all for pm8001 handling - Make "reset" optional in sas_ata_device_link_abort() - Drop Jack's ACK John Garry (5): scsi: libsas: Add sas_ata_device_link_abort() scsi: hisi_sas: Move slot variable definition in hisi_sas_abort_task() scsi: pm8001: Modify task abort handling for SATA task scsi: pm8001: Use sas_ata_device_link_abort() to handle NCQ errors scsi: libsas: Make sas_{alloc, alloc_slow, free}_task() private Xingui Yang (2): scsi: hisi_sas: Add SATA_DISK_ERR bit handling for v3 hw scsi: hisi_sas: Modify v3 HW SATA disk error state completion processing drivers/scsi/hisi_sas/hisi_sas.h | 1 + drivers/scsi/hisi_sas/hisi_sas_main.c | 26 +++- drivers/scsi/hisi_sas/hisi_sas_v3_hw.c | 53 ++++++- drivers/scsi/libsas/sas_ata.c | 12 ++ drivers/scsi/libsas/sas_init.c | 3 - drivers/scsi/libsas/sas_internal.h | 4 + drivers/scsi/pm8001/pm8001_hwi.c | 186 ++++--------------------- drivers/scsi/pm8001/pm8001_sas.c | 8 ++ drivers/scsi/pm8001/pm8001_sas.h | 4 - drivers/scsi/pm8001/pm80xx_hwi.c | 177 +++-------------------- include/scsi/libsas.h | 4 - include/scsi/sas_ata.h | 6 + 12 files changed, 143 insertions(+), 341 deletions(-) -- 2.35.3