Received: by 2002:a05:6a10:22f:0:0:0:0 with SMTP id 15csp3541058pxk; Mon, 7 Sep 2020 16:55:17 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwv06Qfq7Xmeyc/Wsiz7AeigeJQB5e5Hfr2wsj7fPIVLglVs6XRlJFbZnOAKq6GPZaC/A7g X-Received: by 2002:a17:906:43c9:: with SMTP id j9mr23018433ejn.542.1599522916920; Mon, 07 Sep 2020 16:55:16 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1599522916; cv=none; d=google.com; s=arc-20160816; b=YCSp8RuUixW1dP+qwpSp3vAmArulMzBFiu5aeTLUXmDKeTzdXGadi0bD6WUOcoTGBm 2Lqm2d+eKXq4qPqnhU/bReD83rEa6JGHFCOZCRJ/j9rxGjSr4z/AqM3bsZyIpnU++aDR cE4Ch6Egxc2jY5K8bPOFXmn8FqaHIxjJAJ1SEGq6gQLzmwgv3HIhHtEDELUyhjrAEE6R 274EdqIMvlS7e6Hc1cAGvshlHibtP7ZVBvEs6X5CVc6LYCTyimq4HT27pEvg4uv3BIF1 1mWZLK+IjJT3j7Rmf2pa6en2M6Q5PK9cYYi+pXQtZd/qoJEa0qkL3xBXjrEcAkuPzPUh X8gA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=110Unz5qrB7/Ny3qdZoUTzTeS4D4CUQMCdJdAAxtNLA=; b=epdArAcXJL7qpeqsYcotZS5/8ulICHj1WONROTRD+jH4P7b3ZZdvjULjWdazPsTCJr gKLDu+fd0dHvGQ+JPfHIGUKhJDpVjYZjOGR9kDf2GBnviyrEvMDb7fPmDCdHKsCVPBh5 4bkVv6yRXGL494MDXVKvOcv2s4tURrbRNSn7kTLlRdDra8FaEB/A1bFhW1Ilug6DdyIK gfXMoPj9TveKvJ0f/SCvlQZAwwo6uX7jX6kcDIsLwdq0UqboWZia6vrGrOI2iHh8NxGa OY2iXjA116sMiNVEV1w2L6n2lCXT3hwOVRwqrGW2xGOBaMh8aurvPQNu4lQRRQ3Od/fC wjqQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=gYEF+MeN; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id p18si3771007edy.100.2020.09.07.16.54.51; Mon, 07 Sep 2020 16:55:16 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=gYEF+MeN; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728035AbgIGXvn (ORCPT + 99 others); Mon, 7 Sep 2020 19:51:43 -0400 Received: from us-smtp-delivery-124.mimecast.com ([63.128.21.124]:49435 "EHLO us-smtp-delivery-124.mimecast.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727058AbgIGXvj (ORCPT ); Mon, 7 Sep 2020 19:51:39 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1599522695; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=110Unz5qrB7/Ny3qdZoUTzTeS4D4CUQMCdJdAAxtNLA=; b=gYEF+MeNGCctuIQ6DyNQpP0xJPUfZmruee1zHZLR1JBiWJcbxrU4nGlY5tdlPel9T59lKS sVMdbzGM96vLCN1pCnVX8itJEDFUDyBfGRMeM/Zq1pdwakGo0Md1b6ZskV63I4JQM+Tgyh gFwj9+rnPg/S+kiL6zMulfneThpqQnQ= Received: from mail-qk1-f200.google.com (mail-qk1-f200.google.com [209.85.222.200]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-48-wWIR-XU2PtWY9RSa0UyW8A-1; Mon, 07 Sep 2020 19:51:31 -0400 X-MC-Unique: wWIR-XU2PtWY9RSa0UyW8A-1 Received: by mail-qk1-f200.google.com with SMTP id v16so8169959qka.18 for ; Mon, 07 Sep 2020 16:51:31 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=110Unz5qrB7/Ny3qdZoUTzTeS4D4CUQMCdJdAAxtNLA=; b=r3epofOu67T7avanmh6U/V20DSt7XvFsTu3J83Tcsogcnc73Uu0WHXQqWCpsXsTP+N ZJ1ik3EmT+kxvDGRCkBa6rEz/wyawTAXJM1AXx/PcY4Fsmy1CRRinuc7uq42lo/zuorA IvUHXeaJebxo4usqb9f6lJQuxcUd3h2IVhPANBwcjuQeewF6Pqcvkm8yeBmYf87UV41G htLsQJwwGbOJpiP4o3CLn/b6KpIrfcX/ekrlvoYkEAyN36c3FjTtLlYH+UcOSnWnHxQQ uZULQbGwOmzvyjuA28WV/wspixtUCcSMyPCoXQHpYKdoXEg0bOvQZriO22F1q7hi4NaK KTwg== X-Gm-Message-State: AOAM530CIh/JkkD3vlzu50HcEWJbtTZ/SCXyAT/PqtjzUI0r5FOht4BD DfDR2EcZv0ea59vHsyjOKaeemq9xZyUnt3UqM957kGzXCuWbBCOKkEvHjqAyJOes0rSY0WB2Jpx TgszVULsMABm/VHcyZYMDFuurgg7yWobsmuPX151s X-Received: by 2002:a37:7946:: with SMTP id u67mr7195212qkc.381.1599522690935; Mon, 07 Sep 2020 16:51:30 -0700 (PDT) X-Received: by 2002:a37:7946:: with SMTP id u67mr7195180qkc.381.1599522690419; Mon, 07 Sep 2020 16:51:30 -0700 (PDT) MIME-Version: 1.0 References: <20191004123947.11087-1-mika.westerberg@linux.intel.com> <20191004123947.11087-2-mika.westerberg@linux.intel.com> <20200808202202.GA12007@merlins.org> <20200906181852.GC13955@merlins.org> <20200907205825.GB20064@merlins.org> In-Reply-To: <20200907205825.GB20064@merlins.org> From: Karol Herbst Date: Tue, 8 Sep 2020 01:51:19 +0200 Message-ID: Subject: Re: [Nouveau] pcieport 0000:00:01.0: PME: Spurious native interrupt (nvidia with nouveau and thunderbolt on thinkpad P73) To: Marc MERLIN Cc: Mika Westerberg , Kai-Heng Feng , Nicholas Johnson , nouveau , LKML , Linux PCI , Bjorn Helgaas , Len Brown , Lyude Paul , Ben Skeggs Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Sep 7, 2020 at 10:58 PM Marc MERLIN wrote: > > On Mon, Sep 07, 2020 at 09:14:03PM +0200, Karol Herbst wrote: > > > - changes in the nouveau driver. Mika told me the PCIe regression > > > "pcieport 0000:00:01.0: PME: Spurious native interrupt!" is supposed > > > to be fixed in 5.8, but I still get a 4mn hang or so during boot and > > > with 5.8, removing the USB key, didn't help make the boot faster > > > > that's the root port the GPU is attached to, no? I saw that message on > > the Thinkpad P1G2 when runtime resuming the Nvidia GPU, but it does > > seem to come from the root port. > > Hi Karol, thanks for your answer. > > 00:01.0 PCI bridge: Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor PCIe Controller (x16) (rev 0d) > 01:00.0 VGA compatible controller: NVIDIA Corporation TU104GLM [Quadro RTX 4000 Mobile / Max-Q] (rev a1) > > > Well, you'd also need it when attaching external displays. > > Indeed. I just don't need that on this laptop, but familiar with the not > so seemless procedure to turn on both GPUs, and mirror the intel one into > the nvidia one for external output. > > > > [ 11.262985] nvidia-gpu 0000:01:00.3: PME# enabled > > > [ 11.303060] nvidia-gpu 0000:01:00.3: PME# disabled > > > > mhh, interesting. I heard some random comments that the Nvidia > > USB-C/UCSI driver is a bit broken and can cause various issues. Mind > > blacklisting i2c-nvidia-gpu and typec_nvidia (and verify they don't > > get loaded) and see if that helps? > > Right, this one: > 01:00.3 Serial bus controller [0c80]: NVIDIA Corporation TU104 USB Type-C UCSI Controller (rev a1) > Sure, I'll blacklist it. Ok, just did that, removed from initrd, > rebooted, and it was no better. > > From initrd (before root gets mounted), I have this: > nouveau 1961984 0 > mxm_wmi 16384 1 nouveau > hwmon 32768 1 nouveau > ttm 102400 1 nouveau > wmi 32768 2 nouveau,mxm_wmi > > I still got a 2mn hang. and a nouveau probe error > [ 189.124530] nouveau: probe of 0000:01:00.0 failed with error -12 > > > Here's what it looks like: > [ 9.693230] hid: raw HID events driver (C) Jiri Kosina > [ 9.694988] usbcore: registered new interface driver usbhid > [ 9.694989] usbhid: USB HID core driver > [ 9.696700] hid-generic 0003:1050:0200.0001: hiddev0,hidraw0: USB HID v1.00 Device [Yubico Yubico Gnubby (gnubby1)] on usb-0000:00:14.0-2/input0 > [ 9.784456] Console: switching to colour frame buffer device 240x67 > [ 9.816297] i915 0000:00:02.0: fb0: i915drmfb frame buffer device > [ 25.087400] thunderbolt 0000:06:00.0: saving config space at offset 0x0 (reading 0x15eb8086) > [ 25.087414] thunderbolt 0000:06:00.0: saving config space at offset 0x4 (reading 0x100406) > [ 25.087419] thunderbolt 0000:06:00.0: saving config space at offset 0x8 (reading 0x8800006) > [ 25.087424] thunderbolt 0000:06:00.0: saving config space at offset 0xc (reading 0x20) > [ 25.087430] thunderbolt 0000:06:00.0: saving config space at offset 0x10 (reading 0xcc100000) > [ 25.087435] thunderbolt 0000:06:00.0: saving config space at offset 0x14 (reading 0xcc140000) > [ 25.087440] thunderbolt 0000:06:00.0: saving config space at offset 0x18 (reading 0x0) > [ 25.087445] thunderbolt 0000:06:00.0: saving config space at offset 0x1c (reading 0x0) > [ 25.087450] thunderbolt 0000:06:00.0: saving config space at offset 0x20 (reading 0x0) > [ 25.087455] thunderbolt 0000:06:00.0: saving config space at offset 0x24 (reading 0x0) > [ 25.087460] thunderbolt 0000:06:00.0: saving config space at offset 0x28 (reading 0x0) > [ 25.087466] thunderbolt 0000:06:00.0: saving config space at offset 0x2c (reading 0x229b17aa) > [ 25.087471] thunderbolt 0000:06:00.0: saving config space at offset 0x30 (reading 0x0) > [ 25.087476] thunderbolt 0000:06:00.0: saving config space at offset 0x34 (reading 0x80) > [ 25.087481] thunderbolt 0000:06:00.0: saving config space at offset 0x38 (reading 0x0) > [ 25.087486] thunderbolt 0000:06:00.0: saving config space at offset 0x3c (reading 0x1ff) > [ 25.087571] thunderbolt 0000:06:00.0: PME# enabled > [ 25.105353] pcieport 0000:05:00.0: saving config space at offset 0x0 (reading 0x15ea8086) > [ 25.105364] pcieport 0000:05:00.0: saving config space at offset 0x4 (reading 0x100407) > [ 25.105370] pcieport 0000:05:00.0: saving config space at offset 0x8 (reading 0x6040006) > [ 25.105375] pcieport 0000:05:00.0: saving config space at offset 0xc (reading 0x10020) > [ 25.105380] pcieport 0000:05:00.0: saving config space at offset 0x10 (reading 0x0) > [ 25.105384] pcieport 0000:05:00.0: saving config space at offset 0x14 (reading 0x0) > [ 25.105389] pcieport 0000:05:00.0: saving config space at offset 0x18 (reading 0x60605) > [ 25.105394] pcieport 0000:05:00.0: saving config space at offset 0x1c (reading 0x1f1) > [ 25.105399] pcieport 0000:05:00.0: saving config space at offset 0x20 (reading 0xcc10cc10) > [ 25.105404] pcieport 0000:05:00.0: saving config space at offset 0x24 (reading 0x1fff1) > [ 25.105409] pcieport 0000:05:00.0: saving config space at offset 0x28 (reading 0x0) > [ 25.105413] pcieport 0000:05:00.0: saving config space at offset 0x2c (reading 0x0) > [ 25.105418] pcieport 0000:05:00.0: saving config space at offset 0x30 (reading 0x0) > [ 25.105423] pcieport 0000:05:00.0: saving config space at offset 0x34 (reading 0x80) > [ 25.105428] pcieport 0000:05:00.0: saving config space at offset 0x38 (reading 0x0) > [ 25.105432] pcieport 0000:05:00.0: saving config space at offset 0x3c (reading 0x201ff) > [ 25.105517] pcieport 0000:05:00.0: PME# enabled > [ 25.125367] pcieport 0000:04:00.0: saving config space at offset 0x0 (reading 0x15ea8086) > [ 25.125378] pcieport 0000:04:00.0: saving config space at offset 0x4 (reading 0x100007) > [ 25.125383] pcieport 0000:04:00.0: saving config space at offset 0x8 (reading 0x6040006) > [ 25.125388] pcieport 0000:04:00.0: saving config space at offset 0xc (reading 0x10020) > [ 25.125393] pcieport 0000:04:00.0: saving config space at offset 0x10 (reading 0x0) > [ 25.125398] pcieport 0000:04:00.0: saving config space at offset 0x14 (reading 0x0) > [ 25.125403] pcieport 0000:04:00.0: saving config space at offset 0x18 (reading 0x510504) > [ 25.125407] pcieport 0000:04:00.0: saving config space at offset 0x1c (reading 0x5141) > [ 25.125412] pcieport 0000:04:00.0: saving config space at offset 0x20 (reading 0xcc10b400) > [ 25.125417] pcieport 0000:04:00.0: saving config space at offset 0x24 (reading 0x3ff10001) > [ 25.125422] pcieport 0000:04:00.0: saving config space at offset 0x28 (reading 0x60) > [ 25.125427] pcieport 0000:04:00.0: saving config space at offset 0x2c (reading 0x60) > [ 25.125431] pcieport 0000:04:00.0: saving config space at offset 0x30 (reading 0x0) > [ 25.125436] pcieport 0000:04:00.0: saving config space at offset 0x34 (reading 0x80) > [ 25.125441] pcieport 0000:04:00.0: saving config space at offset 0x38 (reading 0x0) > [ 25.125446] pcieport 0000:04:00.0: saving config space at offset 0x3c (reading 0x201ff) > [ 25.125528] pcieport 0000:04:00.0: PME# enabled > [ 25.145423] pcieport 0000:00:1c.0: saving config space at offset 0x0 (reading 0xa3388086) > [ 25.145437] pcieport 0000:00:1c.0: saving config space at offset 0x4 (reading 0x100407) > [ 25.145445] pcieport 0000:00:1c.0: saving config space at offset 0x8 (reading 0x60400f0) > [ 25.145453] pcieport 0000:00:1c.0: saving config space at offset 0xc (reading 0x810000) > [ 25.145460] pcieport 0000:00:1c.0: saving config space at offset 0x10 (reading 0x0) > [ 25.145464] pcieport 0000:00:1c.0: saving config space at offset 0x14 (reading 0x0) > [ 25.145469] pcieport 0000:00:1c.0: saving config space at offset 0x18 (reading 0x510400) > [ 25.145476] pcieport 0000:00:1c.0: saving config space at offset 0x1c (reading 0x20006040) > [ 25.145484] pcieport 0000:00:1c.0: saving config space at offset 0x20 (reading 0xcc10b400) > [ 25.145488] pcieport 0000:00:1c.0: saving config space at offset 0x24 (reading 0x3ff10001) > [ 25.145493] pcieport 0000:00:1c.0: saving config space at offset 0x28 (reading 0x60) > [ 25.145497] pcieport 0000:00:1c.0: saving config space at offset 0x2c (reading 0x60) > [ 25.145502] pcieport 0000:00:1c.0: saving config space at offset 0x30 (reading 0x0) > [ 25.145506] pcieport 0000:00:1c.0: saving config space at offset 0x34 (reading 0x40) > [ 25.145510] pcieport 0000:00:1c.0: saving config space at offset 0x38 (reading 0x0) > [ 25.145515] pcieport 0000:00:1c.0: saving config space at offset 0x3c (reading 0x201ff) > [ 25.145604] pcieport 0000:00:1c.0: PME# enabled > [ 26.265697] pcieport 0000:00:1c.0: power state changed by ACPI to D3cold > [ 45.468365] random: crng init done > [ 105.032727] usb 1-2: USB disconnect, device number 2 <= I removed a usb key, didn't help > [ 128.495144] async_tx: api initialized (async) > [ 128.514820] device-mapper: uevent: version 1.0.3 > [ 128.518186] device-mapper: ioctl: 4.42.0-ioctl (2020-02-27) initialised: dm-devel@redhat.com > [ 144.869445] e1000e 0000:00:1f.6 eth0: NIC Link is Down > [ 172.851384] BTRFS: device label btrfs_pool4 devid 1 transid 78270 /dev/sdb4 scanned by btrfs (1293) > [ 172.851648] BTRFS: device label btrfs_pool3 devid 1 transid 27410 /dev/sda5 scanned by btrfs (1293) > [ 172.852030] BTRFS: device fsid de9694f8-9c0d-4e9d-bd12-57adc4381cd7 devid 1 transid 41 /dev/sda3 scanned by btrfs (1293) > [ 172.852224] BTRFS: device fsid 23e1398d-e462-41aa-b85e-f574906ddc03 devid 1 transid 585 /dev/nvme0n1p4 scanned by btrfs (1293) > [ 189.124291] nouveau 0000:01:00.0: disp ctor failed, -12 > [ 189.124530] nouveau: probe of 0000:01:00.0 failed with error -12 > > The next boot looks similar: > [ 25.161759] pcieport 0000:00:1c.0: PME# enabled > [ 26.297810] pcieport 0000:00:1c.0: power state changed by ACPI to D3cold > [ 128.427270] async_tx: api initialized (async) > [ 128.446525] device-mapper: uevent: version 1.0.3 > [ 128.446691] device-mapper: ioctl: 4.42.0-ioctl (2020-02-27) initialised: dm-devel@redhat.com > [ 128.458120] random: cryptsetup: uninitialized urandom read (4 bytes read) > [ 138.507373] random: cryptsetup: uninitialized urandom read (4 bytes read) > [ 144.793573] e1000e 0000:00:1f.6 eth0: NIC Link is Down > [ 159.627780] random: crng init done > [ 171.814064] process '/usr/bin/fstype' started with executable stack > [ 181.949989] BTRFS: device label btrfs_boot devid 1 transid 394687 /dev/mapper/cryptroot scanned by btrfs (1063) > [ 181.953437] BTRFS: device label btrfs_pool4 devid 1 transid 78270 /dev/sdb4 scanned by btrfs (1063) > [ 181.956989] BTRFS: device label btrfs_pool3 devid 1 transid 27410 /dev/sda5 scanned by btrfs (1063) > [ 181.960473] BTRFS: device fsid de9694f8-9c0d-4e9d-bd12-57adc4381cd7 devid 1 transid 41 /dev/sda3 scanned by btrfs (1063) > [ 181.964097] BTRFS: device fsid 23e1398d-e462-41aa-b85e-f574906ddc03 devid 1 transid 585 /dev/nvme0n1p4 scanned by btrfs (1063) > [ 188.733645] nouveau 0000:01:00.0: disp ctor failed, -12 > [ 188.740653] nouveau: probe of 0000:01:00.0 failed with error -12 oh, I somehow missed that "disp ctor failed" message. I think that might explain why things are a bit hanging. From the top of my head I am not sure if that's something known or something new. But just in case I CCed Lyude and Ben. And I think booting with nouveau.debug=disp=trace could already show something relevant. > [ 188.901070] PM: Image not found (code -22) > > Does that help? > > Thanks, > Marc > -- > "A mouse is a device used to point at the xterm you want to type in" - A.S.R. > > Home page: http://marc.merlins.org/ | PGP 7F55D5F27AAF9D08 >