Received: by 2002:a25:8b12:0:0:0:0:0 with SMTP id i18csp2918746ybl; Thu, 29 Aug 2019 15:05:16 -0700 (PDT) X-Google-Smtp-Source: APXvYqzMVYORlVqh8E5KqiM2mHsFpSDbCE0kcYvhrJhdsKqyMM3wjWFSxo9i6bSzIxxKwrugmchH X-Received: by 2002:a17:90a:d792:: with SMTP id z18mr12320826pju.36.1567116316851; Thu, 29 Aug 2019 15:05:16 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1567116316; cv=none; d=google.com; s=arc-20160816; b=xiNX3Mu5TtQenEAvE89XYj4Av52ScqHIFHjOFahJwUltLfoEsoIoJLHJWZhpWXSV95 +nU087ZPrk8vCN6cuVHT82JieC8M572Z8zs0TejD1FzRFSezR6HTlqVT6BIJYcZV8+0V 0Pc40F90jgHkb1IUqtrD8oztRl9gjvjZKN193HqA3V/WdB3b9swDKY5cnFg9lhvCksVZ bRM1ZMIv8GDXW1nLNMjb0i4vfzH4TxgqrkoiW8aDAD5cR4r1SpJPxG87I3LGgarId7im H02gA716aPzzuZfBqGuiuSOc8nd4TyAgfu7o3mTfOgF3JoSImvmIukVsFCQc6UDf9d81 IjXg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:date:cc:to:from:subject :message-id:dkim-signature; bh=qmQAdDRYulUDjSpMspTEPnnb4TRIYFsZQNmhOKS11AU=; b=bNID+22aU7pZAqFcYjDvZLiPXCzELdYZ2/8Z6GjolZ1F0C+o5TYCDhJ/dufrumX6BI jNI4nSAaohn/5QvXuWlfmv8nfKlFqeMNGqEbDP9RdgxMjyY0VajNc57B9dQh2U/UIHYJ d60vLFo4YuKN4NfpOrT3VU4Hzvy1jGoQmnOnfIAsZq5fdcTWsQ+q3PSo4B4BXdeKWIOi 2xBuhbQRN8hoymm/1aqwQJtUu7R+SYttk5eB60Hmq+exBw5uRDE1MRCFLwfdTcd80flr lH5YZHWsTRkzlDBhc80DJbqixelCReRR3qoqnQ63MkByfyo6lDV+hTvECgjLpDoMCfD4 wNDA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=olk4Y4iZ; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r9si3042998pli.405.2019.08.29.15.05.00; Thu, 29 Aug 2019 15:05:16 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=olk4Y4iZ; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728017AbfH2WEF (ORCPT + 99 others); Thu, 29 Aug 2019 18:04:05 -0400 Received: from mail-lf1-f49.google.com ([209.85.167.49]:37910 "EHLO mail-lf1-f49.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726894AbfH2WEF (ORCPT ); Thu, 29 Aug 2019 18:04:05 -0400 Received: by mail-lf1-f49.google.com with SMTP id c12so3737869lfh.5 for ; Thu, 29 Aug 2019 15:04:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=message-id:subject:from:to:cc:date:in-reply-to:references :user-agent:mime-version:content-transfer-encoding; bh=qmQAdDRYulUDjSpMspTEPnnb4TRIYFsZQNmhOKS11AU=; b=olk4Y4iZag61wP1hPA3Br9s19D+SI48mrDe97epo/a7ja3nmIUJ1UW8dDTQEudnCot 1rnKEtiyP81Axb4ndf5Ni20lVk7FBuW1LOu7/V5wvu5s8axkfpD2k8Its1nqKDucVCrS S8yMPZ8Nw6GPrdw3UkBvomVkp1MQw1CgvQh8WBMnqa5qO0lAfXIzXAi50aOm1UCbg2ai GJ+LzdGBPdyjQ5Eb8a60J9SAQ2dB5ZNFSuvTeLhDZSArZX6uf03nNmd4KBwPQli0UcBZ +8Ce2PdTca8oaDWDIc6NF2csNQVP0JO+LkIGg0U7Q/tgFjsOMYp40upVHb1DBhO4Ok42 ZRNA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:message-id:subject:from:to:cc:date:in-reply-to :references:user-agent:mime-version:content-transfer-encoding; bh=qmQAdDRYulUDjSpMspTEPnnb4TRIYFsZQNmhOKS11AU=; b=MgaWRYJlbOtLzbIdk8Oj5IULgnd24L5VJFMZA5iyuqkh9hcT6ueu1RENZ9Ed4Yqdxc Z7MJYCR6WUfY5Wzl2JENCyfNNMjbgO7xy+z1t3RQS4lx/lwLNfkceTSH6S/tVT8OPcu1 03T22FqY2dCqUKKtZ7DmVLucsCoRZCZdw6ovjzXt/bUoCdROrlx4HXkOYfPZgmJcQd7f KJ3bep9VdFcONze3sc2/FYVqx36ez/uWgG4zWYDFvfpqJ0pCg9SD0Y0gZXmC2+ghLmK9 nnejQHw0Ic+PJtqi25kllxQVXhMuKNyY4/xnfICvMfnM2YHbAoOEpKH2b+t0ArwgA/91 ZvQw== X-Gm-Message-State: APjAAAUcRKfFiMWxfeOCAHXOI7U6l3ZJ/r0uxpRqdQuwzMayG/+bt9Cz H/Ra6LOdQ36kjCmWapq1/6A= X-Received: by 2002:a19:6a12:: with SMTP id u18mr7345008lfu.133.1567116242773; Thu, 29 Aug 2019 15:04:02 -0700 (PDT) Received: from localhost.localdomain ([178.214.243.78]) by smtp.gmail.com with ESMTPSA id e21sm575402lfj.10.2019.08.29.15.04.01 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 29 Aug 2019 15:04:02 -0700 (PDT) Message-ID: Subject: Re: gnome-shell stuck because of amdgpu driver [5.3 RC5] From: mikhail.v.gavrilov@gmail.com To: Daniel Vetter , Hillf Danton Cc: dri-devel , amd-gfx list , Linux List Kernel Mailing Date: Fri, 30 Aug 2019 03:03:58 +0500 In-Reply-To: <20190826092408.GA2112@phenom.ffwll.local> References: <20190825141305.13984-1-hdanton@sina.com> <20190826092408.GA2112@phenom.ffwll.local> Content-Type: text/plain; charset="UTF-8" User-Agent: Evolution 3.33.91 (3.33.91-1.fc32) MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sun, Aug 25, 2019 at 10:13:05PM +0800, Hillf Danton wrote: > Can we try to add the fallback timer manually? > > --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_fence.c > +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_fence.c > @@ -322,6 +322,10 @@ int amdgpu_fence_wait_empty(struct amdgp > } > rcu_read_unlock(); > > + if (!timer_pending(&ring->fence_drv.fallback_timer)) > + mod_timer(&ring->fence_drv.fallback_timer, > + jiffies + (AMDGPU_FENCE_JIFFIES_TIMEOUT << > 1)); > + > r = dma_fence_wait(fence, false); > dma_fence_put(fence); > return r; > -- > > Or simply wait with an ear on signal and timeout if adding timer > seems to go a bit too far? > > --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_fence.c > +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_fence.c > @@ -322,7 +322,12 @@ int amdgpu_fence_wait_empty(struct amdgp > } > rcu_read_unlock(); > > - r = dma_fence_wait(fence, false); > + if (0 < dma_fence_wait_timeout(fence, true, > + AMDGPU_FENCE_JIFFIES_TIMEOUT + > + (AMDGPU_FENCE_JIFFIES_TIMEOUT >> 3))) > + r = 0; > + else > + r = -EINVAL; > dma_fence_put(fence); > return r; > } I tested both patches on top of 5.3 RC6. Each patch I was tested more than 24 hours and I don't seen any regressions or problems with them. On Mon, 2019-08-26 at 11:24 +0200, Daniel Vetter wrote: > > This will paper over the issue, but won't fix it. dma_fences have to > complete, at least for normal operations, otherwise your desktop will > start feeling like the gpu hangs all the time. > > I think would be much more interesting to dump which fence isn't > completing here in time, i.e. not just the timeout, but lots of debug > printks. > -Daniel As I am understood none of these patches couldn't be merged because they do not fix the root cause they eliminate only the consequences? Eliminating consequences has any negative effects? And we will never know the root cause because not having enough debugging information.