PCIe Device AER statistics
--------------------------

These attributes show up under all the devices that are AER capable. These
statistical counters indicate the errors "as seen/reported by the device".
Note that this may mean that if an endpoint is causing problems, the AER
counters may increment at its link partner (e.g. root port) because the
errors may be "seen" / reported by the link partner and not the
problematic endpoint itself (which may report all counters as 0 as it never
saw any problems).

What:           /sys/bus/pci/devices/<dev>/aer_dev_correctable
Contact:        linux-pci@vger.kernel.org, rajatja@google.com
Description:    List of correctable errors seen and reported by this
                PCI device using ERR_COR. Because multiple errors may be
                reported using a single ERR_COR message, TOTAL_ERR_COR at
                the end of the file may not match the actual total of all
                the errors in the file. Sample output::

                    localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_correctable
                    Replay Timer Timeout 0
                    Corrected Internal Error 0

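The counter files use a simple "name count" layout, one error type per line, so they are easy to consume from a script. The following is a minimal sketch, not an official tool: the helper names `parse_aer_counters` and `split_total` are hypothetical, and the code assumes the count is always the last whitespace-separated token on the line. It keeps the TOTAL_ERR_* entry apart from the per-error counters because, as noted above, the two need not add up.

```python
def parse_aer_counters(text):
    """Parse 'name count' lines into a dict; the count is the last token."""
    counters = {}
    for line in text.splitlines():
        parts = line.rsplit(None, 1)  # error names may themselves contain spaces
        if len(parts) != 2 or not parts[1].isdigit():
            continue  # skip blank lines and anything that is not 'name count'
        counters[parts[0]] = int(parts[1])
    return counters


def split_total(counters):
    """Separate the TOTAL_ERR_* entry from the per-error counters."""
    per_error = {}
    totals = {}
    for name, count in counters.items():
        target = totals if name.startswith("TOTAL_ERR_") else per_error
        target[name] = count
    return per_error, totals


# Two lines taken from the sample output above.
sample = "Replay Timer Timeout 0\nCorrected Internal Error 0\n"
per_error, totals = split_total(parse_aer_counters(sample))
print(per_error, totals)
```

Callers should compare `sum(per_error.values())` with the TOTAL_ERR_* value only as a hint, never as an invariant.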
What:           /sys/bus/pci/devices/<dev>/aer_dev_fatal
Contact:        linux-pci@vger.kernel.org, rajatja@google.com
Description:    List of uncorrectable fatal errors seen and reported by this
                PCI device using ERR_FATAL. Because multiple errors may be
                reported using a single ERR_FATAL message, TOTAL_ERR_FATAL
                at the end of the file may not match the actual total of
                all the errors in the file. Sample output::

                    localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_fatal
                    Flow Control Protocol 0
                    Unexpected Completion 0
                    Uncorrectable Internal Error 0
                    AtomicOp Egress Blocked 0
                    TLP Prefix Blocked Error 0

What:           /sys/bus/pci/devices/<dev>/aer_dev_nonfatal
Contact:        linux-pci@vger.kernel.org, rajatja@google.com
Description:    List of uncorrectable nonfatal errors seen and reported by this
                PCI device using ERR_NONFATAL. Because multiple errors may
                be reported using a single ERR_NONFATAL message,
                TOTAL_ERR_NONFATAL at the end of the file may not match the
                actual total of all the errors in the file. Sample output::

                    localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_nonfatal
                    Flow Control Protocol 0
                    Unexpected Completion 0
                    Uncorrectable Internal Error 0
                    AtomicOp Egress Blocked 0
                    TLP Prefix Blocked Error 0

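Because errors may be counted at the link partner rather than at the misbehaving endpoint, it can help to look at every AER-capable device at once instead of a single one. The sketch below walks a devices directory and collects only the nonzero counters from the three per-device files. It is an illustration under assumptions: the function name `nonzero_aer_counters` is hypothetical, the default path is the `/sys/bus/pci/devices` location named in the entries above, and a missing attribute file is taken to mean the device is not AER capable.

```python
from pathlib import Path

AER_DEV_FILES = ("aer_dev_correctable", "aer_dev_fatal", "aer_dev_nonfatal")


def nonzero_aer_counters(devices_dir="/sys/bus/pci/devices"):
    """Return {device: {file: {error_name: count}}} for nonzero counters only."""
    report = {}
    root = Path(devices_dir)
    if not root.is_dir():
        return report
    for dev in sorted(root.iterdir()):
        for fname in AER_DEV_FILES:
            try:
                text = (dev / fname).read_text()
            except OSError:
                continue  # attribute absent or unreadable: skip this file
            for line in text.splitlines():
                parts = line.rsplit(None, 1)  # count is the last token
                if len(parts) == 2 and parts[1].isdigit() and int(parts[1]) > 0:
                    per_file = report.setdefault(dev.name, {}).setdefault(fname, {})
                    per_file[parts[0]] = int(parts[1])
    return report
```

A device that appears in the result is the one that saw or reported the errors. When it is a root port or switch port, the device on the other end of its link is also worth inspecting.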
PCIe Rootport AER statistics
----------------------------

These attributes show up only under the rootports (or root complex event
collectors) that are AER capable. They indicate the number of error messages
"reported to" the rootport. Note that the rootports also transmit
(internally) the ERR_* messages for errors seen by the internal rootport PCI
device, so these counters include them and are thus cumulative of all the error
messages on the PCI hierarchy originating at that root port.

What:           /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_cor
KernelVersion:  4.19.0
Contact:        linux-pci@vger.kernel.org, rajatja@google.com
Description:    Total number of ERR_COR messages reported to rootport.

What:           /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_fatal
KernelVersion:  4.19.0
Contact:        linux-pci@vger.kernel.org, rajatja@google.com
Description:    Total number of ERR_FATAL messages reported to rootport.

What:           /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_nonfatal
KernelVersion:  4.19.0
Contact:        linux-pci@vger.kernel.org, rajatja@google.com
Description:    Total number of ERR_NONFATAL messages reported to rootport.
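
The three rootport totals can be read together to get a per-hierarchy summary. The sketch below is a hedged example rather than a reference implementation: `read_rootport_totals` is a hypothetical helper, the `aer_stats` subdirectory follows the paths given in the entries above, and the code assumes each file holds a single decimal number, which the descriptions imply but do not state.

```python
from pathlib import Path

ROOTPORT_FILES = {
    "ERR_COR": "aer_rootport_total_err_cor",
    "ERR_FATAL": "aer_rootport_total_err_fatal",
    "ERR_NONFATAL": "aer_rootport_total_err_nonfatal",
}


def read_rootport_totals(dev_dir, subdir="aer_stats"):
    """Return {message_type: count} for the rootport totals found under dev_dir."""
    totals = {}
    base = Path(dev_dir) / subdir
    for msg, fname in ROOTPORT_FILES.items():
        try:
            # Assumption: the file contains one decimal integer.
            totals[msg] = int((base / fname).read_text().strip())
        except (OSError, ValueError):
            continue  # not a rootport, not AER capable, or unexpected content
    return totals
```

An empty result means none of the files could be read, which is expected for any device that is not an AER-capable rootport or root complex event collector.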