Received: by 2002:a05:7412:f584:b0:e2:908c:2ebd with SMTP id eh4csp2070120rdb; Tue, 5 Sep 2023 13:29:13 -0700 (PDT) X-Google-Smtp-Source: AGHT+IG1R463UBkvOzxBEI4YUMzf2RtWa0zccRvrZRpwLGtOtD/k9TIXj/NaiI5aYno6uYOUQruQ X-Received: by 2002:a17:90a:6447:b0:25c:1ad3:a4a1 with SMTP id y7-20020a17090a644700b0025c1ad3a4a1mr12979501pjm.1.1693945753263; Tue, 05 Sep 2023 13:29:13 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1693945753; cv=none; d=google.com; s=arc-20160816; b=DoKQKNL9Azidg8EkJMkT2qqUkGb5KrCIRXT6BfdpuJg725kyej3IPV5TpwVPZUPFZu 1uYwvIzql8KAp9rJJvf2y5myrJeyi+Xk4xy0JfsiqCeI2Gp/b9wCgEfQZgfMz44ueCNe KiAwtK1nfyuaF493psXbq/uxuOtpZvRVPV+TAWctfAEMhSeL2rJDJe0LUhgp3/42blXu ZrOB1rZ9ff+EE7pIYEMjT2bD92mtJuW+sw5Dyxn2ZzG5mLtDlKj6N93+2lNyrYqxLcV1 zyS7shgBFszRgqLL0/D2GXjqaQC2Gs4J+jdmW8SoXf/cgWYyYgzEQDHXBM45O9px8Shk QpHg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from; bh=Ji06nZco+1a84o/lrjFQWKK1LyGMB5SJ8177ESaFiAo=; fh=o9MSuLaV/UnKc+jDaMmvjnHs44JowOk6CDc3e40IQrg=; b=Vda/zIs9DSC99jypBw6BFfZ9+HgTM8jSuDq6AvXL1xwmJucRwubIU3F4Ys1BRS6Hx+ NkIkTgUptgJw3eCddN8XTt16QTADWBZjs2u4DPkJ0IAQ6rzxZPcPRPHIfyLRMM2hIo+Z a1NqcuA50FtN2jD/aPqNRpEVDfc63IOCVldz0uomRk11fR6RUjVed/WbGXEEsv3gxViN V8Ztarqz03cUj2rVHXB0JiIC1bxjGJMJAwedde9VHneumf5f/M2HplUBOKdYTZ9mmuSj yW36J3d+6gFQX+H+vRWCNAh1Z8+5S7uOz2Cygk6uXEkSFmwVkPwA8Apb0xIo7pXbTeer Irog== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id n127-20020a632785000000b005694a49d062si9993384pgn.314.2023.09.05.13.28.57; Tue, 05 Sep 2023 13:29:13 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1352534AbjIEQ1p (ORCPT + 99 others); Tue, 5 Sep 2023 12:27:45 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:52788 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1344772AbjIEDxj (ORCPT ); Mon, 4 Sep 2023 23:53:39 -0400 Received: from dggsgout11.his.huawei.com (dggsgout11.his.huawei.com [45.249.212.51]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 32A77CCB; Mon, 4 Sep 2023 20:53:34 -0700 (PDT) Received: from mail02.huawei.com (unknown [172.30.67.143]) by dggsgout11.his.huawei.com (SkyGuard) with ESMTP id 4Rfs6y0QmDz4f3wt3; Tue, 5 Sep 2023 11:53:30 +0800 (CST) Received: from huaweicloud.com (unknown [10.175.104.67]) by APP4 (Coremail) with SMTP id gCh0CgDHVqk5pvZkQLWYCQ--.57068S4; Tue, 05 Sep 2023 11:53:31 +0800 (CST) From: linan666@huaweicloud.com To: dlemoal@kernel.org Cc: linux-ide@vger.kernel.org, linux-kernel@vger.kernel.org, linan122@huawei.com, yukuai3@huawei.com, yi.zhang@huawei.com, houtao1@huawei.com, yangerkun@huawei.com Subject: [PATCH v2] scsi: ata: Fix a race condition between scsi error handler and ahci interrupt Date: Tue, 5 Sep 2023 11:48:40 +0800 Message-Id: <20230905034840.478332-1-linan666@huaweicloud.com> X-Mailer: git-send-email 2.39.2 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit X-CM-TRANSID: gCh0CgDHVqk5pvZkQLWYCQ--.57068S4 X-Coremail-Antispam: 1UD129KBjvJXoW7ZFy7XF4DWFWxGr1ftw1DZFb_yoW5JFy7pF Z8Zw1DWryUtry2qr4qq3WrXryrGaykK3y2gryDGw1Svr4qka4rt39rCF90gFy3Kr97XF13 uan0g3sxCF18Zr7anT9S1TB71UUUUUUqnTZGkaVYY2UrUUUUjbIjqfuFe4nvWSU5nxnvy2 9KBjDU0xBIdaVrnRJUUUkCb4IE77IF4wAFF20E14v26r4j6ryUM7CY07I20VC2zVCF04k2 6cxKx2IYs7xG6rWj6s0DM7CIcVAFz4kK6r1j6r18M28lY4IEw2IIxxk0rwA2F7IY1VAKz4 vEj48ve4kI8wA2z4x0Y4vE2Ix0cI8IcVAFwI0_tr0E3s1l84ACjcxK6xIIjxv20xvEc7Cj xVAFwI0_Gr1j6F4UJwA2z4x0Y4vEx4A2jsIE14v26rxl6s0DM28EF7xvwVC2z280aVCY1x 0267AKxVW0oVCq3wAac4AC62xK8xCEY4vEwIxC4wAS0I0E0xvYzxvE52x082IY62kv0487 Mc02F40EFcxC0VAKzVAqx4xG6I80ewAv7VC0I7IYx2IY67AKxVWUJVWUGwAv7VC2z280aV AFwI0_Jr0_Gr1lOx8S6xCaFVCjc4AY6r1j6r4UM4x0Y48IcxkI7VAKI48JM4kE6xkIj40E w7xC0wCF04k20xvY0x0EwIxGrwCFx2IqxVCFs4IE7xkEbVWUJVW8JwC20s026c02F40E14 v26r1j6r18MI8I3I0E7480Y4vE14v26r106r1rMI8E67AF67kF1VAFwI0_Jw0_GFylIxkG c2Ij64vIr41lIxAIcVC0I7IYx2IY67AKxVWUJVWUCwCI42IY6xIIjxv20xvEc7CjxVAFwI 0_Jr0_Gr1lIxAIcVCF04k26cxKx2IYs7xG6rW3Jr0E3s1lIxAIcVC2z280aVAFwI0_Jr0_ Gr1lIxAIcVC2z280aVCY1x0267AKxVWUJVW8JbIYCTnIWIevJa73UjIFyTuYvjxUwc_TUU UUU X-CM-SenderInfo: polqt0awwwqx5xdzvxpfor3voofrz/ X-CFilter-Loop: Reflected X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,SPF_HELO_NONE, SPF_NONE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Li Nan If a disk is removed and quickly inserted when an I/O error is processing, the disk may not be able to be re-added. The function call timeline is as follows: interrupt scsi_eh ahci_error_intr ata_port_freeze __ata_port_freeze =>ahci_freeze (turn IRQ off) ata_port_abort ata_do_link_abort ata_port_schedule_eh =>ata_std_sched_eh ata_eh_set_pending set EH_PENDING scsi_schedule_eh shost->host_eh_scheduled++ (=1) scsi_error_handler =>ata_scsi_error ata_scsi_port_error_handler clear EH_PENDING =>ahci_error_handler . sata_pmp_error_handler . ata_eh_reset . ata_eh_thaw_port . . =>ahci_thaw (turn IRQ on) ahci_error_intr . . ata_port_freeze . . __ata_port_freeze . . =>ahci_freeze (turn IRQ off) . . ... . . ata_eh_set_pending . . set EH_PENDING . . scsi_schedule_eh . . shost->host_eh_scheduled++ (=2) . . . clear EH_PENDING check EH_PENDING =>ata_std_end_eh host->host_eh_scheduled = 0; 'host_eh_scheduled' is 0 and scsi eh thread will not be scheduled again. The ata port remain freeze and will never be enabled. Decrease ’host_eh_scheduled‘ instead of set it to 0 roughly and move WARN_ON of nr_active_links to ata_scsi_port_error_handler(). Reported-by: luojian Signed-off-by: Li Nan --- Changes in v2: - fix the bug by decrease 'host_eh_scheduled’ - improve commit message. drivers/ata/libata-eh.c | 12 +++++++----- 1 file changed, 7 insertions(+), 5 deletions(-) diff --git a/drivers/ata/libata-eh.c b/drivers/ata/libata-eh.c index 159ba6ba19eb..b9454756ecde 100644 --- a/drivers/ata/libata-eh.c +++ b/drivers/ata/libata-eh.c @@ -735,6 +735,12 @@ void ata_scsi_port_error_handler(struct Scsi_Host *host, struct ata_port *ap) */ ap->ops->end_eh(ap); + if (!ap->scsi_host->host_eh_scheduled) { + /* make sure nr_active_links is zero after EH */ + WARN_ON(ap->nr_active_links); + ap->nr_active_links = 0; + } + spin_unlock_irqrestore(ap->lock, flags); ata_eh_release(ap); @@ -948,7 +954,7 @@ void ata_std_end_eh(struct ata_port *ap) { struct Scsi_Host *host = ap->scsi_host; - host->host_eh_scheduled = 0; + host->host_eh_scheduled--; } EXPORT_SYMBOL(ata_std_end_eh); @@ -3922,10 +3928,6 @@ void ata_eh_finish(struct ata_port *ap) } } } - - /* make sure nr_active_links is zero after EH */ - WARN_ON(ap->nr_active_links); - ap->nr_active_links = 0; } /** -- 2.39.2