Received: by 2002:ac0:a5b6:0:0:0:0:0 with SMTP id m51-v6csp1741691imm; Wed, 6 Jun 2018 23:01:52 -0700 (PDT) X-Google-Smtp-Source: ADUXVKKIwRpzO6PAHccRi8sAowx0mk3fdcVndsLOUqJVl/78zJagoq95LYSiIT62qtY/wPEqsMJA X-Received: by 2002:a65:65d6:: with SMTP id y22-v6mr455135pgv.270.1528351312565; Wed, 06 Jun 2018 23:01:52 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1528351312; cv=none; d=google.com; s=arc-20160816; b=IDMUjNLhw1EO2Af8up3lf5z7woeczxyT1JepKj6zKaO57Q+LizsVXvoUOw4rnBcJH+ jhdOyyuYK7rm9+R+5eNEc96RhozrTgCiXJFzMXi+A8FTHmiTItyCJOlxnZAAKOZ7vjLY 5Vdes7ANzkCUweNZuwDoPNS8A3Vn+2dJzJZ7UVRjBMmUNmdardBZpIwlVAxaTNBq0YyE +GEzGIiwVXy7EAthau7OvH2JsptRzACeRi/AlyovBY8llKRnDYMTAfMGVxwMibWin0l8 3vdjoEt4LDi2D7fngGsAgdluWqWl2B1affw5Hl2e9ZK07ujhcPLazxfJdiyXnjDXIQy0 wG/A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:references:in-reply-to:message-id:date :subject:cc:to:from:arc-authentication-results; bh=lUoBBiliVqZcGnbasSUd7tpW6QY1wRVVbPbjy2DBt0E=; b=mXhA1GBYkRrO4t1BQoGh9z8P27UgoNUIPTNa+Z85NNIyUjb1ckUErFBLNmnYt0OxwZ odLnL+OFySFrSADktKQNDlw/6kOvLiNNV2BBZ2vTYhHp53Cct6EM2YVjWjDwlpbRj2/P qrEiFnXP39B2TS3N+DyCQntcXD+/dxyI/jInXz5wIWQzV6jdya5da8R9EES9UBVI6YFR ouHEWby90HdAPZ2tuDz3vpVQJ//V7CV5ooeZ9HDIgJG2C1I3a3Zzj9ACrUKiPj44IVK2 OI9gQnIMvNg0WqvL8WoObFuMNenH0uPXtS0FrCe5G4KkmPh/Af7n9ZzVAGUTjqY/efDU F01Q== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id y13-v6si15591299pfc.302.2018.06.06.23.01.37; Wed, 06 Jun 2018 23:01:52 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752966AbeFGGAl (ORCPT + 99 others); Thu, 7 Jun 2018 02:00:41 -0400 Received: from wolverine01.qualcomm.com ([199.106.114.254]:25158 "EHLO wolverine01.qualcomm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751472AbeFGGAi (ORCPT ); Thu, 7 Jun 2018 02:00:38 -0400 X-IronPort-AV: E=Sophos;i="5.49,486,1520924400"; d="scan'208";a="343895036" Received: from unknown (HELO ironmsg05-sd.qualcomm.com) ([10.53.140.145]) by wolverine01.qualcomm.com with ESMTP; 06 Jun 2018 23:00:37 -0700 X-IronPort-AV: E=McAfee;i="5900,7806,8916"; a="103689610" Received: from westreach.qualcomm.com ([10.228.196.125]) by ironmsg05-sd.qualcomm.com with ESMTP; 06 Jun 2018 23:00:36 -0700 Received: by westreach.qualcomm.com (Postfix, from userid 467151) id E7C641F30; Thu, 7 Jun 2018 02:00:35 -0400 (EDT) From: Oza Pawandeep To: Bjorn Helgaas , Philippe Ombredanne , Thomas Gleixner , Greg Kroah-Hartman , Kate Stewart , linux-pci@vger.kernel.org, linux-kernel@vger.kernel.org, Dongdong Liu , Keith Busch , Wei Zhang , Sinan Kaya , Timur Tabi Cc: Oza Pawandeep Subject: [PATCH NEXT 4/6] PCI/AER: Clear device status error bits during ERR_FATAL and ERR_NONFATAL Date: Thu, 7 Jun 2018 02:00:32 -0400 Message-Id: <1528351234-26914-4-git-send-email-poza@codeaurora.org> X-Mailer: git-send-email 2.7.4 In-Reply-To: <1528351234-26914-1-git-send-email-poza@codeaurora.org> References: <1528351234-26914-1-git-send-email-poza@codeaurora.org> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org We are handling ERR_FATAL by resetting the Link in software,skipping the driver pci_error_handlers callbacks, removing the devices from the PCI subsystem, and re-enumerating, the device status has to be cleared, which fixes BUG existed before. Signed-off-by: Oza Pawandeep diff --git a/drivers/pci/pcie/aer/aerdrv.c b/drivers/pci/pcie/aer/aerdrv.c index 8cbc62b..0d9eaba 100644 --- a/drivers/pci/pcie/aer/aerdrv.c +++ b/drivers/pci/pcie/aer/aerdrv.c @@ -341,12 +341,8 @@ static pci_ers_result_t aer_root_reset(struct pci_dev *dev) */ static void aer_error_resume(struct pci_dev *dev) { - u16 reg16; - /* Clean up Root device status */ - pcie_capability_read_word(dev, PCI_EXP_DEVSTA, ®16); - pcie_capability_write_word(dev, PCI_EXP_DEVSTA, reg16); - + pci_cleanup_aer_error_device_status(dev); /* Clean AER Root Error Status */ pci_cleanup_aer_uncorrect_error_status(dev); } diff --git a/drivers/pci/pcie/aer/aerdrv_core.c b/drivers/pci/pcie/aer/aerdrv_core.c index 6745e37..95e9828 100644 --- a/drivers/pci/pcie/aer/aerdrv_core.c +++ b/drivers/pci/pcie/aer/aerdrv_core.c @@ -47,6 +47,17 @@ int pci_disable_pcie_error_reporting(struct pci_dev *dev) } EXPORT_SYMBOL_GPL(pci_disable_pcie_error_reporting); +int pci_cleanup_aer_error_device_status(struct pci_dev *dev) +{ + u16 reg16; + + /* Clean up Root device status */ + pcie_capability_read_word(dev, PCI_EXP_DEVSTA, ®16); + pcie_capability_write_word(dev, PCI_EXP_DEVSTA, reg16); + + return 0; +} + int pci_cleanup_aer_uncorrect_error_status(struct pci_dev *dev) { int pos; diff --git a/drivers/pci/pcie/err.c b/drivers/pci/pcie/err.c index 3998ed7..e1e642c 100644 --- a/drivers/pci/pcie/err.c +++ b/drivers/pci/pcie/err.c @@ -252,6 +252,7 @@ static pci_ers_result_t broadcast_error_message(struct pci_dev *dev, dev->error_state = state; pci_walk_bus(dev->subordinate, cb, &result_data); if (cb == report_resume) { + pci_cleanup_aer_error_device_status(dev); pci_cleanup_aer_uncorrect_error_status(dev); dev->error_state = pci_channel_io_normal; } @@ -312,6 +313,7 @@ void pcie_do_fatal_recovery(struct pci_dev *dev, u32 service) * do error recovery on all subordinates of the bridge instead * of the bridge and clear the error status of the bridge. */ + pci_cleanup_aer_error_device_status(dev); pci_cleanup_aer_uncorrect_error_status(dev); } diff --git a/include/linux/aer.h b/include/linux/aer.h index 514bffa..165a147 100644 --- a/include/linux/aer.h +++ b/include/linux/aer.h @@ -44,6 +44,7 @@ struct aer_capability_regs { /* PCIe port driver needs this function to enable AER */ int pci_enable_pcie_error_reporting(struct pci_dev *dev); int pci_disable_pcie_error_reporting(struct pci_dev *dev); +int pci_cleanup_aer_error_device_status(struct pci_dev *dev); int pci_cleanup_aer_uncorrect_error_status(struct pci_dev *dev); int pci_cleanup_aer_error_status_regs(struct pci_dev *dev); #else @@ -55,6 +56,10 @@ static inline int pci_disable_pcie_error_reporting(struct pci_dev *dev) { return -EINVAL; } +static inline int pci_cleanup_aer_error_device_status(struct pci_dev *dev) +{ + return -EINVAL; +} static inline int pci_cleanup_aer_uncorrect_error_status(struct pci_dev *dev) { return -EINVAL; -- 2.7.4