2020-12-07 13:11:31

by Paul Menzel

[permalink] [raw]
Subject: pci 0000:00:07.0: DPC: RP PIO log size 0 is invalid

[Bringing the issue up on the list in case the Linux Bugzilla is not
monitored/used.]


Dear Linux folks,


On Intel Tiger Lake Dell laptop, Linux logs the error below [1].

[ 0.507307] pci 0000:00:07.0: DPC: RP PIO log size 0 is invalid
[ 0.508835] pci 0000:00:07.2: DPC: RP PIO log size 0 is invalid

$ lspci -nn -s 00:07
00:07.0 PCI bridge [0604]: Intel Corporation Tiger Lake-LP
Thunderbolt PCI Express Root Port #0 [8086:9a23] (rev 01)
00:07.2 PCI bridge [0604]: Intel Corporation Tiger Lake-LP
Thunderbolt PCI Express Root Port #2 [8086:9a27] (rev 01)

Commit 2700561817 (PCI/DPC: Cache DPC capabilities in
pci_init_capabilities()) [1] probably introduced it in Linux 5.7.

What does this error actually mean?

pdev->dpc_rp_log_size = (cap & PCI_EXP_DPC_RP_PIO_LOG_SIZE) >> 8;
if (pdev->dpc_rp_log_size < 4 || pdev->dpc_rp_log_size > 9) {
pci_err(pdev, "RP PIO log size %u is invalid\n",
pdev->dpc_rp_log_size);
pdev->dpc_rp_log_size = 0;
}

(I guess `cap & PCI_EXP_DPC_RP_PIO_LOG_SIZE` is zero too?)

Is it a firmware issue or a hardware issue?


Kind regards,

Paul


[1]: https://bugzilla.kernel.org/show_bug.cgi?id=209943
"pci 0000:00:07.0: DPC: RP PIO log size 0 is invalid"
[2]:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=27005618178ef9e9bf9c42fd91101771c92e9308


Subject: Re: pci 0000:00:07.0: DPC: RP PIO log size 0 is invalid

Hi,

On 12/7/20 5:08 AM, Paul Menzel wrote:
> [Bringing the issue up on the list in case the Linux Bugzilla is not monitored/used.]
>
>
> Dear Linux folks,
>
>
> On Intel Tiger Lake Dell laptop, Linux logs the error below [1].
>
>     [    0.507307] pci 0000:00:07.0: DPC: RP PIO log size 0 is invalid
>     [    0.508835] pci 0000:00:07.2: DPC: RP PIO log size 0 is invalid
>
>     $ lspci -nn -s 00:07
>     00:07.0 PCI bridge [0604]: Intel Corporation Tiger Lake-LP Thunderbolt PCI Express Root Port #0
> [8086:9a23] (rev 01)
>     00:07.2 PCI bridge [0604]: Intel Corporation Tiger Lake-LP Thunderbolt PCI Express Root Port #2
> [8086:9a27] (rev 01)
>
> Commit 2700561817 (PCI/DPC: Cache DPC capabilities in pci_init_capabilities()) [1] probably
> introduced it in Linux 5.7.
>
> What does this error actually mean?
>
>     pdev->dpc_rp_log_size = (cap & PCI_EXP_DPC_RP_PIO_LOG_SIZE) >> 8;
>     if (pdev->dpc_rp_log_size < 4 || pdev->dpc_rp_log_size > 9) {
>         pci_err(pdev, "RP PIO log size %u is invalid\n",
>             pdev->dpc_rp_log_size);
>         pdev->dpc_rp_log_size = 0;
As per PCIe spec r5.0, sec 7.9.15.2, valid RP log size is 4 or greater. Please see
the text copied from spec

- - - -
RP PIO Log Size - This field indicates how many DWORDs are allocated for the RP
PIO log registers, comprised by the RP PIO Header Log, the RP PIO ImpSpec Log,
and RP PIO TLP Prefix Log. If the Root Port supports RP Extensions for DPC, the
value of this field must be 4 or greater; otherwise, the value of
this field must be 0. See Section 7.9.15.11 , Section 7.9.15.12 , and Section 7.9.15.13 .
- - - -

In this case, since "(!(cap & PCI_EXP_DPC_CAP_RP_EXT))" condition is false, RP
EXT is supported. If RP EXT is supported, valid log size should be at-least 4.


>     }
>
> (I guess `cap & PCI_EXP_DPC_RP_PIO_LOG_SIZE` is zero too?)
>
> Is it a firmware issue or a hardware issue?
I think this could be hardware issue.
>
>
> Kind regards,
>
> Paul
>
>
> [1]: https://bugzilla.kernel.org/show_bug.cgi?id=209943
>      "pci 0000:00:07.0: DPC: RP PIO log size 0 is invalid"
> [2]:
> https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=27005618178ef9e9bf9c42fd91101771c92e9308
>

--
Sathyanarayanan Kuppuswamy
Linux Kernel Developer

2020-12-07 23:19:21

by Bjorn Helgaas

[permalink] [raw]
Subject: Re: pci 0000:00:07.0: DPC: RP PIO log size 0 is invalid

On Mon, Dec 07, 2020 at 09:44:20AM -0800, Kuppuswamy, Sathyanarayanan wrote:
> On 12/7/20 5:08 AM, Paul Menzel wrote:
> > [Bringing the issue up on the list in case the Linux Bugzilla is not monitored/used.]
> >
> >
> > Dear Linux folks,
> >
> >
> > On Intel Tiger Lake Dell laptop, Linux logs the error below [1].
> >
> > ??? [??? 0.507307] pci 0000:00:07.0: DPC: RP PIO log size 0 is invalid
> > ??? [??? 0.508835] pci 0000:00:07.2: DPC: RP PIO log size 0 is invalid
> >
> > ??? $ lspci -nn -s 00:07
> > ??? 00:07.0 PCI bridge [0604]: Intel Corporation Tiger Lake-LP
> > Thunderbolt PCI Express Root Port #0 [8086:9a23] (rev 01)
> > ??? 00:07.2 PCI bridge [0604]: Intel Corporation Tiger Lake-LP
> > Thunderbolt PCI Express Root Port #2 [8086:9a27] (rev 01)
> >
> > Commit 2700561817 (PCI/DPC: Cache DPC capabilities in
> > pci_init_capabilities()) [1] probably introduced it in Linux 5.7.
> >
> > What does this error actually mean?
> >
> > ????pdev->dpc_rp_log_size = (cap & PCI_EXP_DPC_RP_PIO_LOG_SIZE) >> 8;
> > ????if (pdev->dpc_rp_log_size < 4 || pdev->dpc_rp_log_size > 9) {
> > ??????? pci_err(pdev, "RP PIO log size %u is invalid\n",
> > ??????????? pdev->dpc_rp_log_size);
> > ??????? pdev->dpc_rp_log_size = 0;
> As per PCIe spec r5.0, sec 7.9.15.2, valid RP log size is 4 or greater. Please see
> the text copied from spec
>
> - - - -
> RP PIO Log Size - This field indicates how many DWORDs are allocated for the RP
> PIO log registers, comprised by the RP PIO Header Log, the RP PIO ImpSpec Log,
> and RP PIO TLP Prefix Log. If the Root Port supports RP Extensions for DPC, the
> value of this field must be 4 or greater; otherwise, the value of
> this field must be 0. See Section 7.9.15.11 , Section 7.9.15.12 , and Section 7.9.15.13 .
> - - - -
>
> In this case, since "(!(cap & PCI_EXP_DPC_CAP_RP_EXT))" condition is false, RP
> EXT is supported. If RP EXT is supported, valid log size should be at-least 4.
>
>
> > ????}
> >
> > (I guess `cap & PCI_EXP_DPC_RP_PIO_LOG_SIZE` is zero too?)
> >
> > Is it a firmware issue or a hardware issue?
> I think this could be hardware issue.

I agree, it looks like a hardware issue. I posted this to the
bugzilla before I saw this email:

I assume the only problem is the "DPC: RP PIO log size 0 is invalid"
message itself?

It sure looks like the device is out of spec because PCIe r5.0, sec
7.9.15.2, says

RP PIO Log Size - This field indicates how many DWORDs are
allocated for the RP PIO log registers, comprised by the RP PIO
Header Log, the RP PIO ImpSpec Log, and RP PIO TLP Prefix Log. If
the Root Port supports RP Extensions for DPC, the value of this
field must be 4 or greater; otherwise, the value of this field
must be 0.

Maybe we just need to tone down the message to pci_info()? It looks
like dpc_process_rp_pio_error() would do the right thing even when
dpc_rp_log_size == 0.

In the attached messages, the firmware retains control of AER, so
Linux never tries to use DPC itself anyway.

> > [1]: https://bugzilla.kernel.org/show_bug.cgi?id=209943
> > ???? "pci 0000:00:07.0: DPC: RP PIO log size 0 is invalid"
> > [2]: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=27005618178ef9e9bf9c42fd91101771c92e9308
> >
>
> --
> Sathyanarayanan Kuppuswamy
> Linux Kernel Developer