Received: by 2002:a25:4158:0:0:0:0:0 with SMTP id o85csp493931yba; Mon, 1 Apr 2019 10:24:50 -0700 (PDT) X-Google-Smtp-Source: APXvYqxWj2KRc+l9Ao1dxuDxiDkIX2y6jcQ5Rs6i0aVuMhZ7edhdOpC/qut5fkaOeXSclPMwe0pr X-Received: by 2002:a63:6ac1:: with SMTP id f184mr43424135pgc.25.1554139490795; Mon, 01 Apr 2019 10:24:50 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1554139490; cv=none; d=google.com; s=arc-20160816; b=f5LitsYobZeV+uHbKG6FPdiEPXqRyg29CKsmskV8fgTG/B5IcKJ7OncgTjEJp2IbAX A0SdR5crwGBjC+mCQrKODQ/yN79XRL4evtQlBcJizPifPz1qdeBEQXiP6YRuMC1HeG6D WTdXiw/MpNqtMW1LwWrRLnbbERfDpfCM6ox0U2p+NI8atkLzRABb3JX/nTWaffAqj/vP cPtwEdOJO196JZ267hGMUUunmaJs1jovdgSqSYYY/TvhfEnewjitOzjpucSFQktt5vpu B0gYh+DTXfWaefIi+pHX3bEzWedlR1zNrjf2GIrseKqI2Wy5ujKMJNfn8jwoykrAOt0y 6MAg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature; bh=VNAKAAARFhe1kdwQlQlZ2fXRt+UR9Nok3tVjLeO1P7M=; b=BxjIW9qNA3+yOk0su7aYD9swkab9WGmuYEWQoluSs4urBQUjQwrRcxFlFdGocpFoFO PZ6CxLwhznTMktwVp54y+93SCxiShJlkp2avYZNXJ0lcB5iQt5r47vVgFX4bCxgJihe/ SiX5TkGN+gPwSgwdpm1RekR6YROSOoLgJ3Q9uRjLZyzWizrV0Tn+/Od83HNwC/jSqCHd 0cuHEG6C9IpBnjmHncHsx7Iwjo/r2Q5tdsHrveZMvv//L3VADiNtsenxH367tjPyAsWi eO8v8vJiWeBX+tz99aLXbbPYnjoPSvAdfd0keqG6zEBJxmK9ouQfiqgd3EcmMfVaA0fn UGCQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=kfqz98RR; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id c14si1460583pls.106.2019.04.01.10.24.35; Mon, 01 Apr 2019 10:24:50 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=kfqz98RR; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1732020AbfDARYD (ORCPT + 99 others); Mon, 1 Apr 2019 13:24:03 -0400 Received: from mail.kernel.org ([198.145.29.99]:54190 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1731748AbfDARYA (ORCPT ); Mon, 1 Apr 2019 13:24:00 -0400 Received: from localhost (83-86-89-107.cable.dynamic.v4.ziggo.nl [83.86.89.107]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 435A5206C0; Mon, 1 Apr 2019 17:23:59 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1554139439; bh=6Mzfk2YHY6l2UEKvM3ywN9E3R7D9xXLnzy86E5sJwa4=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=kfqz98RR+rPE3a1vZmIFDE6m7PWPTjOzlx0iUlIZX2aFhHzVlsilo2eT1CKliRLzJ ffCB2RHyc7pXVcmfWcF5nbAJeybqTrpx8xzGP+ZgwSskWnkb/3OlxTEMdmcmpwHxp5 atObBMLp1jkAcl1UancpHqqRGQszcGrRpgY7qfEk= From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Steffen Maier , Jens Remus , Benjamin Block , "Martin K. Petersen" Subject: [PATCH 4.14 075/107] scsi: zfcp: fix scsi_eh host reset with port_forced ERP for non-NPIV FCP devices Date: Mon, 1 Apr 2019 19:02:30 +0200 Message-Id: <20190401170052.460583598@linuxfoundation.org> X-Mailer: git-send-email 2.21.0 In-Reply-To: <20190401170045.246405031@linuxfoundation.org> References: <20190401170045.246405031@linuxfoundation.org> User-Agent: quilt/0.65 X-stable: review X-Patchwork-Hint: ignore MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org 4.14-stable review patch. If anyone has any objections, please let me know. ------------------ From: Steffen Maier commit 242ec1455151267fe35a0834aa9038e4c4670884 upstream. Suppose more than one non-NPIV FCP device is active on the same channel. Send I/O to storage and have some of the pending I/O run into a SCSI command timeout, e.g. due to bit errors on the fibre. Now the error situation stops. However, we saw FCP requests continue to timeout in the channel. The abort will be successful, but the subsequent TUR fails. Scsi_eh starts. The LUN reset fails. The target reset fails. The host reset only did an FCP device recovery. However, for non-NPIV FCP devices, this does not close and reopen ports on the SAN-side if other non-NPIV FCP device(s) share the same open ports. In order to resolve the continuing FCP request timeouts, we need to explicitly close and reopen ports on the SAN-side. This was missing since the beginning of zfcp in v2.6.0 history commit ea127f975424 ("[PATCH] s390 (7/7): zfcp host adapter."). Note: The FSF requests for forced port reopen could run into FSF request timeouts due to other reasons. This would trigger an internal FCP device recovery. Pending forced port reopen recoveries would get dismissed. So some ports might not get fully reopened during this host reset handler. However, subsequent I/O would trigger the above described escalation and eventually all ports would be forced reopen to resolve any continuing FCP request timeouts due to earlier bit errors. Signed-off-by: Steffen Maier Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2") Cc: #3.0+ Reviewed-by: Jens Remus Reviewed-by: Benjamin Block Signed-off-by: Martin K. Petersen Signed-off-by: Greg Kroah-Hartman --- drivers/s390/scsi/zfcp_erp.c | 14 ++++++++++++++ drivers/s390/scsi/zfcp_ext.h | 2 ++ drivers/s390/scsi/zfcp_scsi.c | 4 ++++ 3 files changed, 20 insertions(+) --- a/drivers/s390/scsi/zfcp_erp.c +++ b/drivers/s390/scsi/zfcp_erp.c @@ -652,6 +652,20 @@ static void zfcp_erp_strategy_memwait(st add_timer(&erp_action->timer); } +void zfcp_erp_port_forced_reopen_all(struct zfcp_adapter *adapter, + int clear, char *dbftag) +{ + unsigned long flags; + struct zfcp_port *port; + + write_lock_irqsave(&adapter->erp_lock, flags); + read_lock(&adapter->port_list_lock); + list_for_each_entry(port, &adapter->port_list, list) + _zfcp_erp_port_forced_reopen(port, clear, dbftag); + read_unlock(&adapter->port_list_lock); + write_unlock_irqrestore(&adapter->erp_lock, flags); +} + static void _zfcp_erp_port_reopen_all(struct zfcp_adapter *adapter, int clear, char *id) { --- a/drivers/s390/scsi/zfcp_ext.h +++ b/drivers/s390/scsi/zfcp_ext.h @@ -68,6 +68,8 @@ extern void zfcp_erp_clear_port_status(s extern int zfcp_erp_port_reopen(struct zfcp_port *, int, char *); extern void zfcp_erp_port_shutdown(struct zfcp_port *, int, char *); extern void zfcp_erp_port_forced_reopen(struct zfcp_port *, int, char *); +extern void zfcp_erp_port_forced_reopen_all(struct zfcp_adapter *adapter, + int clear, char *dbftag); extern void zfcp_erp_set_lun_status(struct scsi_device *, u32); extern void zfcp_erp_clear_lun_status(struct scsi_device *, u32); extern void zfcp_erp_lun_reopen(struct scsi_device *, int, char *); --- a/drivers/s390/scsi/zfcp_scsi.c +++ b/drivers/s390/scsi/zfcp_scsi.c @@ -327,6 +327,10 @@ static int zfcp_scsi_eh_host_reset_handl struct zfcp_adapter *adapter = zfcp_sdev->port->adapter; int ret = SUCCESS, fc_ret; + if (!(adapter->connection_features & FSF_FEATURE_NPIV_MODE)) { + zfcp_erp_port_forced_reopen_all(adapter, 0, "schrh_p"); + zfcp_erp_wait(adapter); + } zfcp_erp_adapter_reopen(adapter, 0, "schrh_1"); zfcp_erp_wait(adapter); fc_ret = fc_block_scsi_eh(scpnt);