Received: by 10.223.185.116 with SMTP id b49csp1425726wrg; Sun, 11 Feb 2018 11:42:51 -0800 (PST) X-Google-Smtp-Source: AH8x226RrRioGUOwg5V6kdAcQPRCAlk3S1Fh8Ash/WHLOZT0S1U5G7A3tzy7IJgTaJu9zcDcxp3J X-Received: by 10.98.161.7 with SMTP id b7mr9049060pff.68.1518378171780; Sun, 11 Feb 2018 11:42:51 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1518378171; cv=none; d=google.com; s=arc-20160816; b=KJI7EvKniJqehVE8WxeP9wKi0u6oSj/e36qS8grkPaIPfh4ox58Bj3410bz7V9zUMZ IaCr2nPRy1ZKHN9fQGiYMdrYkchZnuTqMDPl0InP0VYV6Dv0mAUPGFsWtuEMNIiyPzzo oUYn70T7sKSaSCADTjiaD5ly+tDG0NrJUqibCeb3PLMm4IIah8TrBeNOz+fRULj9/jIm hA0OSegBeHfjt9Pcq4L6XuKbaH/Z8XCKdhg33pSs1KPBF8055H4xho4nM2ZPpO9s6xuS 20z/fQDNXjgRY3ObEXdJC0eY5rIGcY8k6FfD+L/Oke1BUpqQwIDPpqTmoczOZa+768WP XS5Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=f4TQcX4pRc05FSBNYa91vfZY47tS5fH59xMbKDqOk1k=; b=nwCodwaJyjqjlkEUikUsr0najCL3iBnMsppmOZosxl8IWGxq0/CUioZhaN8Zogykxa bBryDjPSaMucY5yXX+TXtLRGri3zYAlxcS12G20eLC8EpMWEsFajNlRuGtaqlBAsyR/z oGJuPNTjCYUPr8EiQjGHeDLF2IOmrCK7jOYJka4/0DpjpploB2RUgQs02jbf/6P2vWhS Y1ZRvBcHDcHm/XPab6AMCz4FWuV1DzDM46fXDgNNij3iy1kZP5daaNwPZWdQ9bepJByG ik+hBVbIw3/Xknc72E65olzGzr0K9+k7abkpU4kvVaxWbNh8weiAS1lmb9M025hSdWHs 6rWA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id j11-v6si2776154pll.485.2018.02.11.11.42.37; Sun, 11 Feb 2018 11:42:51 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932102AbeBKTl5 (ORCPT + 99 others); Sun, 11 Feb 2018 14:41:57 -0500 Received: from bmailout2.hostsharing.net ([83.223.90.240]:41553 "EHLO bmailout2.hostsharing.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753459AbeBKTl4 (ORCPT ); Sun, 11 Feb 2018 14:41:56 -0500 Received: from h08.hostsharing.net (h08.hostsharing.net [83.223.95.28]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client CN "*.hostsharing.net", Issuer "COMODO RSA Domain Validation Secure Server CA" (not verified)) by bmailout2.hostsharing.net (Postfix) with ESMTPS id D13082800B48B; Sun, 11 Feb 2018 20:41:54 +0100 (CET) Received: by h08.hostsharing.net (Postfix, from userid 100393) id 89A5EF2D4; Sun, 11 Feb 2018 20:41:54 +0100 (CET) Date: Sun, 11 Feb 2018 20:41:54 +0100 From: Lukas Wunner To: Mike Lothian Cc: Tejun Heo , Lai Jiangshan , Alex Deucher , Dave Airlie , Ben Skeggs , Ismo Toijala , nouveau@lists.freedesktop.org, Intel Graphics Development , Liviu Dudau , Linux Kernel Mailing List , Maling list - DRI developers , Hans de Goede , Peter Wu Subject: Re: [PATCH 0/5] Fix deadlock on runtime suspend in DRM drivers Message-ID: <20180211194154.GB22869@wunner.de> References: <20180211192314.GA22869@wunner.de> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180211192314.GA22869@wunner.de> User-Agent: Mutt/1.5.23 (2014-03-12) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sun, Feb 11, 2018 at 08:23:14PM +0100, Lukas Wunner wrote: > On Sun, Feb 11, 2018 at 06:58:11PM +0000, Mike Lothian wrote: > > On 11 February 2018 at 09:38, Lukas Wunner wrote: > > > The patches for radeon and amdgpu are compile-tested only, I only have a > > > MacBook Pro with an Nvidia GK107 to test. To test the patches, add an > > > "msleep(12*1000);" at the top of the driver's ->runtime_suspend hook. > > > This ensures that the poll worker runs after ->runtime_suspend has begun. > > > Wait 12 sec after the GPU has begun runtime suspend, then check > > > /sys/bus/pci/devices/0000:01:00.0/power/runtime_status. Without this > > > series, the status will be stuck at "suspending" and you'll get hung task > > > errors in dmesg after a few minutes. > > > > I wasn't quite sure where to add that msleep. I've tested the patches > > as is on top of agd5f's wip branch without ill effects > > > > I've had a radeon and now a amdgpu PRIME setup and don't believe I've > > ever seen this issue > > > > If you could pop a patch together for the msleep I'll give it a test on > > amdgpu > > Here you go, this is for all 3 drivers. > Should deadlock without the series. > Thanks! Sorry, I missed that amdgpu_drv.c and radeon_drv.c don't include delay.h, rectified testing patch below: diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c index 50afcf6..beaaf2c 100644 --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c @@ -36,6 +36,7 @@ #include #include +#include #include #include #include @@ -718,6 +719,9 @@ static int amdgpu_pmops_runtime_suspend(struct device *dev) return -EBUSY; } + printk("waiting 12 sec\n"); + msleep(12*1000); + printk("done waiting 12 sec\n"); drm_dev->switch_power_state = DRM_SWITCH_POWER_CHANGING; drm_kms_helper_poll_disable(drm_dev); vga_switcheroo_set_dynamic_switch(pdev, VGA_SWITCHEROO_OFF); diff --git a/drivers/gpu/drm/drm_probe_helper.c b/drivers/gpu/drm/drm_probe_helper.c index 555fbe5..ee7cf0d 100644 --- a/drivers/gpu/drm/drm_probe_helper.c +++ b/drivers/gpu/drm/drm_probe_helper.c @@ -586,6 +586,7 @@ static void output_poll_execute(struct work_struct *work) repoll = true; goto out; } + dev_info(&dev->pdev->dev, "begin poll\n"); drm_connector_list_iter_begin(dev, &conn_iter); drm_for_each_connector_iter(connector, &conn_iter) { @@ -651,6 +652,7 @@ static void output_poll_execute(struct work_struct *work) if (repoll) schedule_delayed_work(delayed_work, DRM_OUTPUT_POLL_PERIOD); + dev_info(&dev->pdev->dev, "end poll\n"); } /** diff --git a/drivers/gpu/drm/nouveau/nouveau_drm.c b/drivers/gpu/drm/nouveau/nouveau_drm.c index 3e29302..f9da5bc 100644 --- a/drivers/gpu/drm/nouveau/nouveau_drm.c +++ b/drivers/gpu/drm/nouveau/nouveau_drm.c @@ -855,6 +855,9 @@ static int nouveau_drm_probe(struct pci_dev *pdev, return -EBUSY; } + printk("waiting 12 sec\n"); + msleep(12*1000); + printk("done waiting 12 sec\n"); drm_kms_helper_poll_disable(drm_dev); vga_switcheroo_set_dynamic_switch(pdev, VGA_SWITCHEROO_OFF); nouveau_switcheroo_optimus_dsm(); diff --git a/drivers/gpu/drm/radeon/radeon_drv.c b/drivers/gpu/drm/radeon/radeon_drv.c index 31dd04f..2b4e7e0 100644 --- a/drivers/gpu/drm/radeon/radeon_drv.c +++ b/drivers/gpu/drm/radeon/radeon_drv.c @@ -35,6 +35,7 @@ #include #include +#include #include #include #include @@ -413,6 +414,9 @@ static int radeon_pmops_runtime_suspend(struct device *dev) return -EBUSY; } + printk("waiting 12 sec\n"); + msleep(12*1000); + printk("done waiting 12 sec\n"); drm_dev->switch_power_state = DRM_SWITCH_POWER_CHANGING; drm_kms_helper_poll_disable(drm_dev); vga_switcheroo_set_dynamic_switch(pdev, VGA_SWITCHEROO_OFF);