Received: by 2002:a25:4158:0:0:0:0:0 with SMTP id o85csp5773486yba; Thu, 11 Apr 2019 05:37:56 -0700 (PDT) X-Google-Smtp-Source: APXvYqyVqPuT19S9kDbydBO4qU5zt6ojVvOjpi+kCddswQ42rcz5hUld18jUql6rfVMIhmAYiPdt X-Received: by 2002:aa7:9ab7:: with SMTP id x23mr29356201pfi.27.1554986276744; Thu, 11 Apr 2019 05:37:56 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1554986276; cv=none; d=google.com; s=arc-20160816; b=AOSg5xXqwJDnWE8mIWZ/ei3GjMeC8BPGO3eCT4saWUzkDB1z8b1fdDPhur/0mvoc5C i+TGYH5p6fn5YhXVg+nlkLPSSNXx6hMTSyvgFlT5bmc1JPGIHlEz+qPYAECh800VqmAN pFcWV+9LnxyTaqVDMTHs7bSpjtJFpaR3p/Dc0geN90N9inGt929SVuZ+ptdX4cZj2PqW jDStJo+KfxgsAod1feuDVTsOR6eaXo70hs7wqtprb0AuPeX3b7zvlkZ3jbbvx5/f3cFi 1xX1QfubzZSA/9frHfOcXimL1RQFa+hw2djX2raD07u2c8wh+Q6MpfBT+JGFuv3avVc4 8ExA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=jgazQ/T4O4Kx/7oYdaSrsBMdBsoF5ucsxN9pg/4OW4U=; b=0lertk0uL8Ab/nyKb3xNDAcE5Fy+E86+0FG6WtdWrl4E1H9JN+HUaqhHRUyOjXUkEC BI22qTia4ZsBSRxSVD6YNZ8aUnTnNEo1IrFhIZCZxMZkZ7Z8yPha0y9oPYsSq+zt8bgw Zi76uVOQMJcWxiK559PUCsag4q/5LuKoRoWX4HhJzEwo4kZ/w8ayptbPrdBhA7FAqjy3 di4CYCDWB8GRaeofuE5y9gVw2WWOP2V6+1/arWDew+GRWZ8A7zi8/BTyfWgf454V0T3V vAdwj69vye/7mIIEwqtdnLUk0AjiAunFwepeg5n72ZT86rK8d83+dihg6F/DbNFNEzSB NCoA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id w22si12352673pgj.174.2019.04.11.05.37.40; Thu, 11 Apr 2019 05:37:56 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726599AbfDKMhA (ORCPT + 99 others); Thu, 11 Apr 2019 08:37:00 -0400 Received: from mx2.suse.de ([195.135.220.15]:36478 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726014AbfDKMg7 (ORCPT ); Thu, 11 Apr 2019 08:36:59 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id 9BAA4AC4C; Thu, 11 Apr 2019 12:36:58 +0000 (UTC) Date: Thu, 11 Apr 2019 14:36:56 +0200 From: "jroedel@suse.de" To: "Deucher, Alexander" Cc: Bjorn Helgaas , Nikolai Kostrigin , "Suthikulpanit, Suravee" , "Lendacky, Thomas" , "Kuehling, Felix" , "Koenig, Christian" , "linux-pci@vger.kernel.org" , "linux-kernel@vger.kernel.org" Subject: Re: [PATCH RESEND 1/1] PCI: Add ATS-disable quirk for AMD Radeon R7 GPUs Message-ID: <20190411123656.GE3349@suse.de> References: <20190408103725.30426-1-nickel@altlinux.org> <20190408103725.30426-2-nickel@altlinux.org> <20190409215927.GC256045@google.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Apr 10, 2019 at 03:59:57PM +0000, Deucher, Alexander wrote: > > + a few AMD people > > > > Seeing this bug makes it more clear. I don't think this is a problem with the > > GPU. I think it's a problem with either the sbios or iommu. I think the original > > quirk added for stoney (0x98e4) is probably wrong as well. I suspect we > > need a quirk for a particular laptop or sbios versions. We validated ATS > > extensively with Carrizo based systems (the system in the bug report above > > is Carrizo based) since it is the basis of our ROCm support on APUs. We have > > also been involved in tons of Linux OEM preloads with both Carrizo and > > Stoney based APUs in combination with TOPAZ dGPUs (0x6900) and haven't > > seen this issue in those programs. We also have TOPAZ dGPUs used in OEM > > programs with Intel chipsets and haven't seen the issue. I suspect since > > windows does not use the IOMMU by default, the sbios settings may not be > > well validated on certain windows only skus. I'd rather make these DMI > > matches or something like that for the platform or at the very least match > > the SSIDs as well. > > Reading through these bugs again it seems to be an issue with Stoney > APUs, not the dGPU specifically. I think it would be better to > disable ATS in general if a stoney based platform was detected rather > than adding ATS quirks for devices then someone may put in a Stoney > based platform. It also seems to be related to runtime pm on the > dGPU. Disabling runtime pm also seem to fix the issue. On these > systems runtime pm for the dGPU is controlled via ACPI (either ATPX or > _PR3 depending on the platform). Maybe something doesn't get restored > properly on runtime resume which cases the ATS issues? This seems all pretty much possible, but we lack the ability to debug this further on our side. So until we have a real root-cause with a more specific quirk that only targets systems with a broken sbios or whatever, we need to catch-all approach. We can remove these quirks again when AMD sends more specific quirks upstream. Regards, Joerg