From: Wenchao Hao
To: "James E . J . Bottomley", "Martin K .
Petersen"
CC: Hannes Reinecke, Wenchao Hao
Subject: [RFC PATCH v2 00/18] scsi: scsi_error: Introduce new error handle mechanism
Date: Fri, 1 Sep 2023 17:41:08 +0800
Message-ID: <20230901094127.2010873-1-haowenchao2@huawei.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On systems where a large number of SCSI devices share HBAs, it is
unacceptable to block I/O to every device while error commands are
handled. We need a new error handling mechanism to address this.

I raised this issue a year ago; a link to that discussion is in the
references. Hannes kindly explained why we have to block the SCSI host
before performing error recovery. I think not every driver needs to
block the SCSI host, and that we can try a smaller-scope recovery (LUN
based, for example) first to avoid blocking the SCSI host.

The new error handling mechanism introduced in this patchset has been
developed and tested on our self-developed hardware for about a year.
We would now like it to be usable by more drivers.

Drivers can decide, when a scsi_device is scanned, whether to use the
new mechanism and how to handle error commands. This makes SCSI error
handling more flexible.

After the host's I/O has been blocked, the SCSI error recovery strategy
consists mainly of the following steps:

- LUN reset
- Target reset
- Bus reset
- Host reset

Some drivers do not implement a host reset callback, so blocking the
host's I/O is unnecessary for them.
For example, smartpqi only registers a device reset callback; if device
reset fails, falling back to target reset, bus reset or host reset is
pointless, because those steps would fail as well.

Here are the drivers we are concerned with (there are too many drivers
to survey them all, so I only list the ones I am familiar with):

+-------------+--------------+--------------+-----------+------------+
| drivers     | device_reset | target_reset | bus_reset | host_reset |
+-------------+--------------+--------------+-----------+------------+
| mpt3sas     | Y            | Y            | N         | Y          |
+-------------+--------------+--------------+-----------+------------+
| smartpqi    | Y            | N            | N         | N          |
+-------------+--------------+--------------+-----------+------------+
| megaraidsas | N            | Y            | N         | Y          |
+-------------+--------------+--------------+-----------+------------+
| virtioscsi  | Y            | N            | N         | N          |
+-------------+--------------+--------------+-----------+------------+
| iscsi_tcp   | Y            | Y            | N         | N          |
+-------------+--------------+--------------+-----------+------------+
| hisisas     | Y            | Y            | N         | N          |
+-------------+--------------+--------------+-----------+------------+

For LUN based error handling, when a SCSI command is classified as an
error, we block that SCSI device's I/O and try to recover the device.
If not all error commands can be recovered, recovery may fall back to
the target or host level.

Target based error handling works the same way, except that it blocks
the SCSI target's I/O and then tries to recover the error commands of
that target.

The first patch defines the basic framework that supports the
LUN/target based error handling mechanism. Three key operations are
abstracted:

- add an error command
- wake up the error handler
- block I/O once an error command has been added and while recovery is
  in progress

Drivers can implement these three callbacks and register them with the
SCSI mid-layer. I have also added a general LUN/target based error
handling strategy that drivers can call directly to implement
LUN/target based error handling.

The changes to the SCSI mid-layer's error handling were tested with
scsi_debug, which supports single-LUN error injection (the scsi_debug
patches can be found in the references). The following scenarios were
tested.

Scenario 1: LUN based error handling is enabled:
+-----------+---------+-------------------------------------------------------+
| lun reset | TUR     | Desired result                                        |
+-----------+---------+-------------------------------------------------------+
| success   | success | retry or finish with EIO(may offline disk)            |
+-----------+---------+-------------------------------------------------------+
| success   | fail    | fallback to host recovery, retry or finish with       |
|           |         | EIO(may offline disk)                                 |
+-----------+---------+-------------------------------------------------------+
| fail      | NA      | fallback to host recovery, retry or finish with       |
|           |         | EIO(may offline disk)                                 |
+-----------+---------+-------------------------------------------------------+

Scenario 2: target based error handling is enabled:
+-----------+---------+--------------+---------+------------------------------+
| lun reset | TUR     | target reset | TUR     | Desired result               |
+-----------+---------+--------------+---------+------------------------------+
| success   | success | NA           | NA      | retry or finish with         |
|           |         |              |         | EIO(may offline disk)        |
+-----------+---------+--------------+---------+------------------------------+
| success   | fail    | success      | success | retry or finish with         |
|           |         |              |         | EIO(may offline disk)        |
+-----------+---------+--------------+---------+------------------------------+
| fail      | NA      | success      | success | retry or finish with         |
|           |         |              |         | EIO(may offline disk)        |
+-----------+---------+--------------+---------+------------------------------+
| fail      | NA      | success      | fail    | fallback to host recovery,   |
|           |         |              |         | retry or finish with EIO(may |
|           |         |              |         | offline disk)                |
+-----------+---------+--------------+---------+------------------------------+
| fail      | NA      | fail         | NA      | fallback to host recovery,   |
|           |         |              |         | retry or finish with EIO(may |
|           |         |              |         | offline disk)                |
+-----------+---------+--------------+---------+------------------------------+

Scenario 3: both LUN and target based error handling are enabled:
+-----------+---------+--------------+---------+------------------------------+
| lun reset | TUR     | target reset | TUR     | Desired result               |
+-----------+---------+--------------+---------+------------------------------+
| success   | success | NA           | NA      | retry or finish with         |
|           |         |              |         | EIO(may offline disk)        |
+-----------+---------+--------------+---------+------------------------------+
| success   | fail    | success      | success | lun recovery fallback to     |
|           |         |              |         | target recovery, retry or    |
|           |         |              |         | finish with EIO(may offline  |
|           |         |              |         | disk)                        |
+-----------+---------+--------------+---------+------------------------------+
| fail      | NA      | success      | success | lun recovery fallback to     |
|           |         |              |         | target recovery, retry or    |
|           |         |              |         | finish with EIO(may offline  |
|           |         |              |         | disk)                        |
+-----------+---------+--------------+---------+------------------------------+
| fail      | NA      | success      | fail    | lun recovery fallback to     |
|           |         |              |         | target recovery, then fall   |
|           |         |              |         | back to host recovery, retry |
|           |         |              |         | or finish with EIO(may       |
|           |         |              |         | offline disk)                |
+-----------+---------+--------------+---------+------------------------------+
| fail      | NA      | fail         | NA      | lun recovery fallback to     |
|           |         |              |         | target recovery, then fall   |
|           |         |              |         | back to host recovery, retry |
|           |         |              |         | or finish with EIO(may       |
|           |         |              |         | offline disk)                |
+-----------+---------+--------------+---------+------------------------------+

References: https://lore.kernel.org/linux-scsi/20230815122316.4129333-1-haowenchao2@huawei.com/
References: https://lore.kernel.org/linux-scsi/71e09bb4-ff0a-23fe-38b4-fe6425670efa@huawei.com/

Wenchao Hao (19):
  scsi: scsi_error: Define framework for LUN/target based error handle
  scsi: scsi_error: Move complete variable eh_action from shost to sdevice
  scsi: scsi_error: Check if to do reset in scsi_try_xxx_reset
  scsi: scsi_error: Add helper scsi_eh_sdev_stu to do START_UNIT
  scsi: scsi_error: Add helper scsi_eh_sdev_reset to do lun reset
  scsi: scsi_error: Add flags to mark error handle steps has done
  scsi: scsi_error: Add helper to handle scsi device's error command list
  scsi: scsi_error: Add a general LUN based error handler
  scsi: core: increase/decrease target_busy without check can_queue
  scsi: scsi_error: Add helper to handle scsi target's error command list
  scsi: scsi_error: Add a general target based error handler
  scsi: scsi_debug: Add param to control LUN bassed error handler
  scsi: scsi_debug: Add param to control target based error handle
  scsi: mpt3sas: Add param to control LUN based error handle
  scsi: mpt3sas: Add param to control target based error handle
  scsi: smartpqi: Add param to control LUN based error handle
  scsi: megaraid_sas: Add param to control target based error handle
  scsi: virtio_scsi: Add param to control LUN based error handle
  scsi: iscsi_tcp: Add param to control LUN based error handle

 drivers/scsi/iscsi_tcp.c                  |  20 +
 drivers/scsi/megaraid/megaraid_sas_base.c |  20 +
 drivers/scsi/mpt3sas/mpt3sas_scsih.c      |  28 +
 drivers/scsi/scsi_debug.c                 |  24 +
 drivers/scsi/scsi_error.c                 | 756 ++++++++++++++++++++--
 drivers/scsi/scsi_lib.c                   |  23 +-
 drivers/scsi/scsi_priv.h                  |  18 +
 drivers/scsi/smartpqi/smartpqi_init.c     |  14 +
 drivers/scsi/virtio_scsi.c                |  16 +-
 include/scsi/scsi_device.h                |  97 +++
 include/scsi/scsi_eh.h                    |   8 +
 include/scsi/scsi_host.h                  |   2 -
 12 files changed, 963 insertions(+), 63 deletions(-)

-- 
2.35.3