Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp4082540ybl; Mon, 13 Jan 2020 07:33:55 -0800 (PST) X-Google-Smtp-Source: APXvYqwSaDIHBRv57m/X7dXfG0sr1t8Clf/ynrVZZ6RbnZxF4oEfG2XZT4jubxcAXu8US6Xnxqfa X-Received: by 2002:aca:cdd5:: with SMTP id d204mr13185195oig.134.1578929635087; Mon, 13 Jan 2020 07:33:55 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1578929635; cv=none; d=google.com; s=arc-20160816; b=E+CaPx/wLK9C+z18VTqwQBt5MVEuBrFWoCOqbHub7qywpuKJB4X3823fi/9ub38yu1 r/zN67YPHH36hucoBNjfWQt+/QurSOimbj5zcL9z0TNl2pAvV3AlHYz2neATN4SwYbBx y+bWzI+Wt2NXRMcwT69KmLTNtdLRtTqup6rVruVroYbGeZV8iZPVPoTMTlsSbn1dm+Ta cS45Iq6Ky7zA3aCZE2C0/oQKCmBXBT0OdJSDsBbHHfXSpjZRrF9v4ETgh+ZDaX+utkQc 8lBNmM6Y7L3xHfVovhHjc2BaHI5bz0CtH6r966zbwUQMDzxT0LEG1glK9c0t7lYJzFUv sfGg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=ElsdEkFyMX9PyqBKUyWZEu82iD4Q0YddLseGJka/eAI=; b=Jy4lVn0makXc0djU1LnMa7xaSinu5VmcmEyLtg4haI4dEBb2yiHEJ7iZTMBDII/v5q Y+RRno/UgeLgmWo8YdcSn7nZQ9MOb2UaAF1ywDa7JKsVuPQpZWGRfs0IzzgBSA+blD10 7uYJXm0pz5MX30Cv768mkS+iHyNm0MVb5G1TPSaRmfV6B9q8rrm9SzAh46dHndyrVgnc DHDg1rM7IHq2C9zP4yJm7isF9GrtodiRQajqIQVYYMXnY33dDHsv1cVzCxZixz6Xkk+I pAAL39DljLGYY2+YZ1TX0/qWbvUA3L65X0OBTq1nRNJ6vKkr8I6Z2+2ZHMQjFnjwIlQo UwOQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=Ooaz6XlF; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id t1si5881607oic.140.2020.01.13.07.33.42; Mon, 13 Jan 2020 07:33:55 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=Ooaz6XlF; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728757AbgAMPcp (ORCPT + 99 others); Mon, 13 Jan 2020 10:32:45 -0500 Received: from us-smtp-delivery-1.mimecast.com ([205.139.110.120]:59670 "EHLO us-smtp-1.mimecast.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1727222AbgAMPco (ORCPT ); Mon, 13 Jan 2020 10:32:44 -0500 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1578929563; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=ElsdEkFyMX9PyqBKUyWZEu82iD4Q0YddLseGJka/eAI=; b=Ooaz6XlFmNrVJKmco00BzFbmOs7B+48bssYuHOkNAAc1PzDKN0hdLlgJvgLnBUc/n1xz/M X3Jd4I/iD9dQvNez1/gNr12YOpMEuhCeozBKDOTQUgMZO0e5vbYWqxsSrIh6mdfWt1wNtM KKArvcBTW+5ngXyzgAwsDm+UXV30GfM= Received: from mail-qt1-f197.google.com (mail-qt1-f197.google.com [209.85.160.197]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-166-PlcLqXMtPVCMZaGERpaRRw-1; Mon, 13 Jan 2020 10:32:40 -0500 X-MC-Unique: PlcLqXMtPVCMZaGERpaRRw-1 Received: by mail-qt1-f197.google.com with SMTP id m30so6768583qtb.2 for ; Mon, 13 Jan 2020 07:32:40 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=ElsdEkFyMX9PyqBKUyWZEu82iD4Q0YddLseGJka/eAI=; b=IYm+OyL17B/+2r7oH8DBoeXhGSql/Nmr/33PZYjsT1i1UpuJsxf6ZjWfJy/zRg8wzm 21TUjpkOtAZokPIm1TDjN9pycXlac7JV8p9LTwNcKMO6OZfpAY4Ac0M4bTKxDuxloIT+ CGntPB8uFibJG/0VGapiZZW4KNfUq2JacQbDjFinKZl8pcXhF5jAbFFKcQ6N29+pJkRD jdclfQo2ILeUH0YrOJRX/N/l01bsdUlzW9X/8l6pygKIH9Serx1/Tj+Rn678yowmsT2S C5219WDGLfRGyVqeF0wMOtq/25wYKs8rkszZI+PUeMbPdRtU4lfn9iHl4M2uZfiEg6q2 6ihg== X-Gm-Message-State: APjAAAUyAkayUnzdhGDJtsnXGgKNFf/HQxHYi8SNjg+5Au7hWQXTjtn/ aiAIQllLvY2KJZNbhlkd0lbiie1yUPyfG+JjhEEFdZACmsauo+UpbAoaYf4/0VOXVCweYsGsBZj Y4s9qjcZV/pChLzhrPcBwx64DZ4vppYuHeUMDfXOG X-Received: by 2002:a37:9245:: with SMTP id u66mr12325311qkd.102.1578929560037; Mon, 13 Jan 2020 07:32:40 -0800 (PST) X-Received: by 2002:a37:9245:: with SMTP id u66mr12325263qkd.102.1578929559700; Mon, 13 Jan 2020 07:32:39 -0800 (PST) MIME-Version: 1.0 References: <20191121112821.GU11621@lahna.fi.intel.com> <20191121114610.GW11621@lahna.fi.intel.com> <20191127114856.GZ11621@lahna.fi.intel.com> In-Reply-To: From: Karol Herbst Date: Mon, 13 Jan 2020 16:31:50 +0100 Message-ID: Subject: Re: [PATCH v4] pci: prevent putting nvidia GPUs into lower device states on certain intel bridges To: Dave Airlie Cc: "Rafael J. Wysocki" , Lyude Paul , Mika Westerberg , Bjorn Helgaas , LKML , "Rafael J . Wysocki" , Linux PCI , Linux PM , dri-devel , nouveau , Mario Limonciello Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org okay.. so checking whatever is the difference with _REV being 5 (meaning the firmware uses the legacy paths) doesn't help in any way. It's using a different method to turn the link of and the other ACPI variables touched either point to undocumented registers on the PCI bridge or internal ACPI memory... so, anybody with any other ideas? I really wished the nvidia driver would enable runpm on pre turing GPUs, but that's sadly not the case and on Turing things seem to be totally different, so it wouldn't help to check there as well... *sigh* On Tue, Dec 10, 2019 at 9:49 PM Karol Herbst wrote: > > On Tue, Dec 10, 2019 at 8:58 PM Dave Airlie wrote: > > > > On Mon, 9 Dec 2019 at 21:39, Rafael J. Wysocki wrote: > > > > > > On Mon, Dec 9, 2019 at 12:17 PM Karol Herbst wrote: > > > > > > > > anybody any other ideas? > > > > > > Not yet, but I'm trying to collect some more information. > > > > > > > It seems that both patches don't really fix > > > > the issue and I have no idea left on my side to try out. The only > > > > thing left I could do to further investigate would be to reverse > > > > engineer the Nvidia driver as they support runpm on Turing+ GPUs now, > > > > but I've heard users having similar issues to the one Lyude told us > > > > about... and I couldn't verify that the patches help there either in a > > > > reliable way. > > > > > > It looks like the newer (8+) versions of Windows expect the GPU driver > > > to prepare the GPU for power removal in some specific way and the > > > latter fails if the GPU has not been prepared as expected. > > > > > > Because testing indicates that the Windows 7 path in the platform > > > firmware works, it may be worth trying to do what it does to the PCIe > > > link before invoking the _OFF method for the power resource > > > controlling the GPU power. > > > > > > > Remember the pre Win8 path required calling a DSM method to actually > > power the card down, I think by the time we reach these methods in > > those cases the card is already gone. > > > > Dave. > > > > The point was that the firmware seems to do more in the legacy paths > and maybe we just have to do those things inside the driver instead > when using the new method. Also the _DSM call just wraps around the > interfaces on newer firmware anyway. The OS check is usually what > makes the difference. I might be wrong about the _DSM call just > wrapping though, but I think I saw it at least in some firmware at > some point.