Date: Mon, 11 Jun 2018 15:31:44 +0530
From: poza@codeaurora.org
To: Bjorn Helgaas
Cc: Bjorn Helgaas, Philippe Ombredanne, Thomas Gleixner,
    Greg Kroah-Hartman, Kate Stewart, linux-pci@vger.kernel.org,
    linux-kernel@vger.kernel.org, Dongdong Liu, Keith Busch, Wei Zhang,
    Sinan Kaya, Timur Tabi
Subject: Re: [PATCH NEXT 6/6] PCI/PORTDRV: Remove ERR_FATAL handling from pcie_portdrv_slot_reset()
In-Reply-To: <20180607213448.GB37077@bhelgaas-glaptop.roam.corp.google.com>
References: <1528351234-26914-1-git-send-email-poza@codeaurora.org>
    <1528351234-26914-6-git-send-email-poza@codeaurora.org>
    <94661add3e71e3694aa22c2a9cabf503@codeaurora.org>
    <20180607213448.GB37077@bhelgaas-glaptop.roam.corp.google.com>
Message-ID: <50f38cb058b29f84ca60a19825efd28a@codeaurora.org>
X-Mailing-List: linux-kernel@vger.kernel.org

On 2018-06-08 03:04, Bjorn Helgaas wrote:
> On Thu, Jun 07, 2018 at 07:18:03PM +0530, poza@codeaurora.org wrote:
>> On 2018-06-07 11:30, Oza Pawandeep wrote:
>> > We are handling ERR_FATAL by resetting the Link in software,skipping the
>> > driver pci_error_handlers callbacks, removing the devices from the PCI
>> > subsystem, and re-enumerating, as a result of that, no more calling
>> > pcie_portdrv_slot_reset in ERR_FATAL case.
>> >
>> > Signed-off-by: Oza Pawandeep
>> >
>> > diff --git a/drivers/pci/pcie/portdrv_pci.c
>> > b/drivers/pci/pcie/portdrv_pci.c
>> > index 973f1b8..92f5d330 100644
>> > --- a/drivers/pci/pcie/portdrv_pci.c
>> > +++ b/drivers/pci/pcie/portdrv_pci.c
>> > @@ -42,17 +42,6 @@ __setup("pcie_ports=", pcie_port_setup);
>> >
>> >  /* global data */
>> >
>> > -static int pcie_portdrv_restore_config(struct pci_dev *dev)
>> > -{
>> > -	int retval;
>> > -
>> > -	retval = pci_enable_device(dev);
>> > -	if (retval)
>> > -		return retval;
>> > -	pci_set_master(dev);
>> > -	return 0;
>> > -}
>> > -
>> >  #ifdef CONFIG_PM
>> >  static int pcie_port_runtime_suspend(struct device *dev)
>> >  {
>> > @@ -162,14 +151,6 @@ static pci_ers_result_t
>> > pcie_portdrv_mmio_enabled(struct pci_dev *dev)
>> >
>> >  static pci_ers_result_t pcie_portdrv_slot_reset(struct pci_dev *dev)
>> >  {
>> > -	/* If fatal, restore cfg space for possible link reset at upstream */
>> > -	if (dev->error_state == pci_channel_io_frozen) {
>> > -		dev->state_saved = true;
>> > -		pci_restore_state(dev);
>> > -		pcie_portdrv_restore_config(dev);
>> > -		pci_enable_pcie_error_reporting(dev);
>> > -	}
>> > -
>> >  	return PCI_ERS_RESULT_RECOVERED;
>> >  }
>>
>>
>> Hi Bjorn,
>>
>> the above patch removes ERR_FATAL handling from
>> pcie_portdrv_slot_reset()
>> because now we are handling ERR_FATAL differently than before.
>>
>> I tried to dig into pcie_portdrv_slot_reset() handling for ERR_FATAL
>> case
>> where it
>> restores the config space, enable device, set master and enable error
>> reporting....
>> and as far as I understand this is being done for upstream link
>> (bridges
>> etc..)
>>
>> why was it done at the first point (I checked the commit description,
>> but
>> could not really get it)
>> and do we need to handle the same thing in ERR_FATAL now ?
>
> You mean 4bf3392e0bf5 ("PCI-Express AER implemetation: pcie_portdrv
> error handler"), which added pcie_portdrv_slot_reset()?
> I agree, that commit log has no useful information. I don't know any
> of the history behind it.

Hi Bjorn and Keith,

broadcast_error_message()

	if (dev->hdr_type == PCI_HEADER_TYPE_BRIDGE) {
		.....
		pci_walk_bus(dev->subordinate, cb, &result_data);

So in the ERR_FATAL case, the bus walk happens on the subordinates, and
if I understand the walk right, pcie_portdrv_slot_reset() is called
only on bridges/switches. It is never called on Root Ports.

Having said that, since we are now removing the devices (compared to
the previous error callback handling in ERR_FATAL), I don't see the
need for the above code anymore: there is nothing left to restore,
because we are initiating re-enumeration.

Regards,
Oza.
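
P.S. To make the walk argument above concrete, here is a small userspace
model of it. This is only a sketch, not kernel code: every model_* name is
hypothetical, the topology (Root Port -> switch port -> endpoint) is made
up for illustration, and the walk is assumed to visit each device on a bus
and then descend below any bridge it finds. Only the bridge branch quoted
above is modelled; the non-bridge path is left out.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical userspace model; none of these are the kernel's types. */
enum model_hdr_type { MODEL_HDR_NORMAL, MODEL_HDR_BRIDGE };

struct model_bus;

struct model_dev {
	enum model_hdr_type hdr_type;
	struct model_bus *bus;         /* bus the device sits on */
	struct model_bus *subordinate; /* secondary bus, bridges only */
	struct model_dev *next;        /* next device on the same bus */
	int slot_reset_calls;          /* how often the callback ran */
};

struct model_bus {
	struct model_dev *devices;
};

typedef void (*model_cb)(struct model_dev *dev, void *data);

/* Visit every device on the bus, descending below each bridge. */
static void model_walk_bus(struct model_bus *bus, model_cb cb, void *data)
{
	if (!bus)
		return;
	for (struct model_dev *d = bus->devices; d; d = d->next) {
		cb(d, data);
		if (d->subordinate)
			model_walk_bus(d->subordinate, cb, data);
	}
}

/* Stand-in for a slot_reset callback: just counts invocations. */
static void model_slot_reset(struct model_dev *dev, void *data)
{
	(void)data;
	dev->slot_reset_calls++;
}

/* Bridge branch only: the walk starts below the reporting device. */
static void model_broadcast(struct model_dev *dev, model_cb cb, void *data)
{
	if (dev->hdr_type == MODEL_HDR_BRIDGE)
		model_walk_bus(dev->subordinate, cb, data);
}

struct model_topology {
	struct model_bus bus0, bus1, bus2;
	struct model_dev root_port, switch_port, endpoint;
};

/* Build Root Port (bus0) -> switch port (bus1) -> endpoint (bus2). */
static void model_init_topology(struct model_topology *t)
{
	memset(t, 0, sizeof(*t));
	t->root_port.hdr_type = MODEL_HDR_BRIDGE;
	t->root_port.bus = &t->bus0;
	t->root_port.subordinate = &t->bus1;
	t->switch_port.hdr_type = MODEL_HDR_BRIDGE;
	t->switch_port.bus = &t->bus1;
	t->switch_port.subordinate = &t->bus2;
	t->endpoint.hdr_type = MODEL_HDR_NORMAL;
	t->endpoint.bus = &t->bus2;
	t->bus0.devices = &t->root_port;
	t->bus1.devices = &t->switch_port;
	t->bus2.devices = &t->endpoint;
}

/* Error reported at the Root Port: only devices below it are visited. */
static void model_demo(void)
{
	struct model_topology t;

	model_init_topology(&t);
	model_broadcast(&t.root_port, model_slot_reset, NULL);
	assert(t.root_port.slot_reset_calls == 0);
	assert(t.switch_port.slot_reset_calls == 1);
	assert(t.endpoint.slot_reset_calls == 1);
}
```

In this model the reporting Root Port never sees the callback, while the
switch port and endpoint below it each see it once, which matches my
reading of the walk.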