From: Rajat Jain
To: Bjorn Helgaas, Jonathan Corbet, Philippe Ombredanne, Kate Stewart,
    Thomas Gleixner, Greg Kroah-Hartman, Rajat Jain, Frederick Lawler,
    Oza Pawandeep, Keith Busch, Gabriele Paoloni, Alexandru Gagniuc,
    Thomas Tai, "Steven Rostedt (VMware)", linux-pci@vger.kernel.org,
    linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org,
    Jes Sorensen, Kyle McMartin
Cc: rajatxjain@gmail.com
Subject: [PATCH v2 5/5] Documentation/ABI: Add details of PCI AER statistics
Date: Wed, 23 May 2018 10:58:08 -0700
Message-Id: <20180523175808.28030-6-rajatja@google.com>
In-Reply-To: <20180523175808.28030-1-rajatja@google.com>
References: <20180522222805.80314-1-rajatja@google.com>
 <20180523175808.28030-1-rajatja@google.com>

Add the PCI AER statistics details to
Documentation/ABI/testing/sysfs-bus-pci-devices-aer_stats
and provide a pointer to it in
Documentation/PCI/pcieaer-howto.txt

Signed-off-by: Rajat Jain
---
v2: Move the documentation to Documentation/ABI/

 .../testing/sysfs-bus-pci-devices-aer_stats   | 103 ++++++++++++++++++
 Documentation/PCI/pcieaer-howto.txt           |   5 +
 2 files changed, 108 insertions(+)
 create mode 100644 Documentation/ABI/testing/sysfs-bus-pci-devices-aer_stats

diff --git a/Documentation/ABI/testing/sysfs-bus-pci-devices-aer_stats b/Documentation/ABI/testing/sysfs-bus-pci-devices-aer_stats
new file mode 100644
index 000000000000..f55c389290ac
--- /dev/null
+++ b/Documentation/ABI/testing/sysfs-bus-pci-devices-aer_stats
@@ -0,0 +1,103 @@
+==========================
+PCIe Device AER statistics
+==========================
+These attributes show up under all the devices that are AER capable. These
+statistical counters indicate the errors "as seen/reported by the device".
+Note that this may mean that if an end point is causing problems, the AER
+counters may increment at its link partner (e.g. root port) because the
+errors will be "seen" / reported by the link partner and not the
+problematic end point itself (which may report all counters as 0 as it never
+saw any problems).
+
+Where:          /sys/bus/pci/devices//aer_stats/dev_total_cor_errs
+Date:           May 2018
+Kernel Version: 4.17.0
+Contact:        linux-pci@vger.kernel.org, rajatja@google.com
+Description:    Total number of correctable errors seen and reported by this
+                PCI device using ERR_COR.
+
+Where:          /sys/bus/pci/devices//aer_stats/dev_total_fatal_errs
+Date:           May 2018
+Kernel Version: 4.17.0
+Contact:        linux-pci@vger.kernel.org, rajatja@google.com
+Description:    Total number of uncorrectable fatal errors seen and reported
+                by this PCI device using ERR_FATAL.
+
+Where:          /sys/bus/pci/devices//aer_stats/dev_total_nonfatal_errs
+Date:           May 2018
+Kernel Version: 4.17.0
+Contact:        linux-pci@vger.kernel.org, rajatja@google.com
+Description:    Total number of uncorrectable non-fatal errors seen and reported
+                by this PCI device using ERR_NONFATAL.
+
+Where:          /sys/bus/pci/devices//aer_stats/dev_breakdown_correctable
+Date:           May 2018
+Kernel Version: 4.17.0
+Contact:        linux-pci@vger.kernel.org, rajatja@google.com
+Description:    Breakdown of correctable errors seen and reported by this
+                PCI device using ERR_COR. A sample result looks like this:
+-----------------------------------------
+Receiver Error = 0x174
+Bad TLP = 0x19
+Bad DLLP = 0x3
+RELAY_NUM Rollover = 0x0
+Replay Timer Timeout = 0x1
+Advisory Non-Fatal = 0x0
+Corrected Internal Error = 0x0
+Header Log Overflow = 0x0
+-----------------------------------------
+
+Where:          /sys/bus/pci/devices//aer_stats/dev_breakdown_uncorrectable
+Date:           May 2018
+Kernel Version: 4.17.0
+Contact:        linux-pci@vger.kernel.org, rajatja@google.com
+Description:    Breakdown of uncorrectable errors seen and reported by this
+                PCI device using ERR_FATAL or ERR_NONFATAL. A sample result
+                looks like this:
+-----------------------------------------
+Undefined = 0x0
+Data Link Protocol = 0x0
+Surprise Down Error = 0x0
+Poisoned TLP = 0x0
+Flow Control Protocol = 0x0
+Completion Timeout = 0x0
+Completer Abort = 0x0
+Unexpected Completion = 0x0
+Receiver Overflow = 0x0
+Malformed TLP = 0x0
+ECRC = 0x0
+Unsupported Request = 0x0
+ACS Violation = 0x0
+Uncorrectable Internal Error = 0x0
+MC Blocked TLP = 0x0
+AtomicOp Egress Blocked = 0x0
+TLP Prefix Blocked Error = 0x0
+-----------------------------------------
+
+============================
+PCIe Rootport AER statistics
+============================
+These attributes show up only under the rootports that are AER capable. These
+indicate the number of error messages as "reported to" the rootport. Please note
+that the rootports also transmit (internally) the ERR_* messages for errors seen
+by the internal rootport PCI device, so these counters include them and are
+thus cumulative of all the error messages on the PCI hierarchy originating
+at that root port.
+
+Where:          /sys/bus/pci/devices//aer_stats/rootport_total_cor_errs
+Date:           May 2018
+Kernel Version: 4.17.0
+Contact:        linux-pci@vger.kernel.org, rajatja@google.com
+Description:    Total number of ERR_COR messages reported to rootport.
+
+Where:          /sys/bus/pci/devices//aer_stats/rootport_total_fatal_errs
+Date:           May 2018
+Kernel Version: 4.17.0
+Contact:        linux-pci@vger.kernel.org, rajatja@google.com
+Description:    Total number of ERR_FATAL messages reported to rootport.
+
+Where:          /sys/bus/pci/devices//aer_stats/rootport_total_nonfatal_errs
+Date:           May 2018
+Kernel Version: 4.17.0
+Contact:        linux-pci@vger.kernel.org, rajatja@google.com
+Description:    Total number of ERR_NONFATAL messages reported to rootport.
diff --git a/Documentation/PCI/pcieaer-howto.txt b/Documentation/PCI/pcieaer-howto.txt
index acd0dddd6bb8..91b6e677cb8c 100644
--- a/Documentation/PCI/pcieaer-howto.txt
+++ b/Documentation/PCI/pcieaer-howto.txt
@@ -73,6 +73,11 @@ In the example, 'Requester ID' means the ID of the device who sends
 the error message to root port. Pls. refer to pci express specs for
 other fields.
 
+2.4 AER Statistics / Counters
+
+When PCIe AER errors are captured, the counters / statistics are also exposed
+in the form of sysfs attributes which are documented at
+Documentation/ABI/testing/sysfs-bus-pci-devices-aer_stats
 
 3. Developer Guide
 
-- 
2.17.0.441.gb46fe60e1d-goog
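
As an illustration of how userspace might consume the dev_breakdown_* attributes described in this patch, here is a minimal Python sketch. It assumes the "Name = 0xVALUE" line format shown in the sample results; the helper name parse_breakdown and the SAMPLE constant are hypothetical and not part of the patch. Lines without a "name = value" pair (such as the dashed separators) are skipped.

```python
def parse_breakdown(text):
    """Parse 'Name = 0xVALUE' lines into a dict mapping name -> int.

    Hypothetical helper: assumes the line format shown in the sample
    results of the dev_breakdown_* attributes.
    """
    counters = {}
    for line in text.splitlines():
        # Split on the last '=' so the counter name is kept whole.
        name, sep, value = line.rpartition("=")
        if not sep or not name.strip() or not value.strip():
            continue  # separator line, blank line, or no value
        # Base 0 lets int() accept the 0x prefix used in the sample.
        counters[name.strip()] = int(value.strip(), 0)
    return counters


# Excerpt copied from the sample result for dev_breakdown_correctable.
SAMPLE = """\
-----------------------------------------
Receiver Error = 0x174
Bad TLP = 0x19
Bad DLLP = 0x3
RELAY_NUM Rollover = 0x0
Replay Timer Timeout = 0x1
-----------------------------------------
"""

if __name__ == "__main__":
    # Print only the counters that have recorded at least one error.
    for name, count in parse_breakdown(SAMPLE).items():
        if count:
            print(f"{name}: {count}")
```

On a real system the text would be read from the dev_breakdown_correctable or dev_breakdown_uncorrectable attribute of an AER-capable device instead of the SAMPLE string.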