Received: by 2002:a25:7ec1:0:0:0:0:0 with SMTP id z184csp5423281ybc; Wed, 27 Nov 2019 03:53:44 -0800 (PST) X-Google-Smtp-Source: APXvYqwsw0jnwGzOtWRMOap2gRteSVBpHHnWv1nsX+4Da4XM8MJH1HlmAfl4DvDkUWaa3E+5mGVJ X-Received: by 2002:a17:906:da04:: with SMTP id fi4mr29141617ejb.24.1574855624826; Wed, 27 Nov 2019 03:53:44 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1574855624; cv=none; d=google.com; s=arc-20160816; b=YTDgOG49JXif6C5/hjNaAf0tPoMT/4Qjz4+436Xe71c9bxFs4hGP8aXYH+qMw4FtTK R2DP2QoGiD+JP+inmzg+/1RxRNhQw0Xy0zrvFdCDqaHgS014w9pehdRMZrr6qHsthPZr RjchM+x8yWdqsXDzK7f9GWlSRmFRNp7FkiWd0+ElksnHGDkGCV+gUhC3P/xpfEilIUhU ZSHGlfBi1tXPCYppNCV4spFcKAJgK6/FlisNV2CWL1LbcC0Q7lsv0dR774jvhzgzTPS8 qlfTFkgzdcWzoFseXgdmB1gym5nwnbUrmW0IS8CIXeR05WDt+8kny7mP8Xz44rXkr7Sw k0Vg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:cc:to:subject :message-id:date:from:in-reply-to:references:mime-version :dkim-signature; bh=/XhIGMtsi7KjLuMFks8eIn7a7OR+crqiPZAFOU/Qxss=; b=Oi0e9+BhxDXvfJkQWjwkgwIdo2Pvvuzjbdra46JGmCkp4G+gvY77JQU0apK4mrv20v Oem4BsTmqoW6lw0jM0z0OthB5gZZajhTxgaXhXYSZGZMA0loYLiifBQ9k+JLEk/UkDm5 /6cy2KwvcwDSA3NAfceh+xxRoY2IG2BzmfsUFOVLdx3WFtZDbuJHL2iXzZRlY7WLoCt2 yvK0xyVfOVC8V+sWcN96y9eM2ErO857BSkcF0r/dZVgOMT5PXR9iJgsrMG/efMO2MfrS DsSYF15FzPyQ1+16ulZrCkYIiMdLRxDqE1JNaSKtNlTRahfZQJC4gceoP784IUobMFjA edRg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=iFRDAi7X; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id jp18si8888532ejb.318.2019.11.27.03.53.20; Wed, 27 Nov 2019 03:53:44 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=iFRDAi7X; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727007AbfK0Lv6 (ORCPT + 99 others); Wed, 27 Nov 2019 06:51:58 -0500 Received: from us-smtp-1.mimecast.com ([207.211.31.81]:52734 "EHLO us-smtp-delivery-1.mimecast.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726556AbfK0Lv6 (ORCPT ); Wed, 27 Nov 2019 06:51:58 -0500 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1574855517; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=/XhIGMtsi7KjLuMFks8eIn7a7OR+crqiPZAFOU/Qxss=; b=iFRDAi7XhSv2bcOJ+Ffw0LowCrf4yFK4HhrIA73B7bKD8SE1LqZVID+ilrKeX0QLAu0BdY RbvvGZmX4WAEXIG4nXqlq4SFOJDImwK65lEAQrMkOPecvTKLqCXfYT6h3y2i6+QKOUAWf/ GunixtYqp8VOss5xH1bWyY/hOGHle48= Received: from mail-qt1-f198.google.com (mail-qt1-f198.google.com [209.85.160.198]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-39-lyJ-O4l_OLaQEc0p2RyDOg-1; Wed, 27 Nov 2019 06:51:54 -0500 Received: by mail-qt1-f198.google.com with SMTP id x21so14644838qtp.1 for ; Wed, 27 Nov 2019 03:51:54 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=zfYAUmGvWJZJSRiVsjUka5mWwDiEaeMITYpQOX7mwXY=; b=MDcsB/K7LsorhhpG6DHZI04rQra7jCcbXwLZr6kSQ4tPheMsbTDBq3TpHp3yNg6bm+ gsAvSHSxPzZUdWgshB6nFR/5V3NN5+GP4Dr0fGsjjlBY3eCysdXhfen+RPCZ5nw9hvl/ TsDxWYPi0YjPl2DD4hwWJIqEaZygrju4u74bZhSqqIrwBQ7o/V+gZAIywtM3v4mAW/hZ SKsBec64F445BIDpkrqb92XlEbeMPLtQERe7F2DGAJMr9muBFKdfmgTkqi+oRbJ6sfD8 6MaisNMG1c+ZjhwLmAp5q4zEBWurk7X0C9qBeFCBponWbbZ7qxxJGpVHrguIoPmqU5wk VlPg== X-Gm-Message-State: APjAAAU9AssXAVML6Px0Z28AiCvu12gzAHgwcYkiiUaATHtPYQtawrr/ U0YIZZYtCYHt7NVSaASPNoGjPAAoQuWHM+uN4D4aIfNMsBwiXwl/oCT7GUvJ7TF9tFr8SOlyCZN nHSDOfafX0biwoHzlD5C0t11X0KLwwHMHjsEEdW0X X-Received: by 2002:a37:9083:: with SMTP id s125mr3828282qkd.192.1574855513971; Wed, 27 Nov 2019 03:51:53 -0800 (PST) X-Received: by 2002:a37:9083:: with SMTP id s125mr3828257qkd.192.1574855513663; Wed, 27 Nov 2019 03:51:53 -0800 (PST) MIME-Version: 1.0 References: <20191121112821.GU11621@lahna.fi.intel.com> <20191121114610.GW11621@lahna.fi.intel.com> <20191127114856.GZ11621@lahna.fi.intel.com> In-Reply-To: <20191127114856.GZ11621@lahna.fi.intel.com> From: Karol Herbst Date: Wed, 27 Nov 2019 12:51:42 +0100 Message-ID: Subject: Re: [PATCH v4] pci: prevent putting nvidia GPUs into lower device states on certain intel bridges To: Mika Westerberg Cc: Lyude Paul , "Rafael J. Wysocki" , Bjorn Helgaas , LKML , "Rafael J . Wysocki" , Linux PCI , Linux PM , dri-devel , nouveau , Dave Airlie , Mario Limonciello X-MC-Unique: lyJ-O4l_OLaQEc0p2RyDOg-1 X-Mimecast-Spam-Score: 0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Nov 27, 2019 at 12:49 PM Mika Westerberg wrote: > > On Tue, Nov 26, 2019 at 06:10:36PM -0500, Lyude Paul wrote: > > Hey-this is almost certainly not the right place in this thread to resp= ond, > > but this thread has gotten so deep evolution can't push the subject fur= ther to > > the right, heh. So I'll just respond here. > > :) > > > I've been following this and helping out Karol with testing here and th= ere. > > They had me test Bjorn's PCI branch on the X1 Extreme 2nd generation, w= hich > > has a turing GPU and 8086:1901 PCI bridge. > > > > I was about to say "the patch fixed things, hooray!" but it seems that = after > > trying runtime suspend/resume a couple times things fall apart again: > > You mean $subject patch, no? > no, I told Lyude to test the pci/pm branch as the runpm errors we saw on that machine looked different. Some BAR error the GPU reported after it got resumed, so I was wondering if the delays were helping with that. But after some cycles it still caused the same issue, that the GPU disappeared. Later testing also showed that my patch also didn't seem to help with this error sadly :/ > > [ 686.883247] nouveau 0000:01:00.0: DRM: suspending object tree... > > [ 752.866484] ACPI Error: Aborting method \_SB.PCI0.PEG0.PEGP.NVPO due= to previous error (AE_AML_LOOP_TIMEOUT) (20190816/psparse-529) > > [ 752.866508] ACPI Error: Aborting method \_SB.PCI0.PGON due to previo= us error (AE_AML_LOOP_TIMEOUT) (20190816/psparse-529) > > [ 752.866521] ACPI Error: Aborting method \_SB.PCI0.PEG0.PG00._ON due = to previous error (AE_AML_LOOP_TIMEOUT) (20190816/psparse-529) > > This is probably the culprit. The same AML code fails to properly turn > on the device. > > Is acpidump from this system available somewhere? >