Received: by 2002:ac0:a5b6:0:0:0:0:0 with SMTP id m51-v6csp825322imm; Wed, 6 Jun 2018 06:35:51 -0700 (PDT) X-Google-Smtp-Source: ADUXVKIDxwsLMgrYings07f6g8O1qRBccI1cmUeZB1G2IqHlon0O5jx8O8pq3M8dMajKclL3kWJy X-Received: by 2002:a17:902:9b83:: with SMTP id y3-v6mr3224516plp.358.1528292151785; Wed, 06 Jun 2018 06:35:51 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1528292151; cv=none; d=google.com; s=arc-20160816; b=tYLikTwmIS/OuJQVu4a85UYUCFlXqZffpDOMLKrOCf5BrI/YMIfloQwon5TjiemfiB ZmNh3pzaaHmJccVG+8lUVWYUD9oMTAeTh+bGn37jOlEhWYKhwc7u5KUG+F0NAfIwF3o6 uLOFH6yGEDymgm0mA+cSD3nMmirr8BG958Jos5K73/cABJ+vo8VTgqUUfrGTCU+y3tl9 fx9aqerprYzL4H41SYw48nWtdWlfDN7yvBI4AX5hqyd07aFK/2WtrTpHDykqZxfN9f4X ArD0uj9EI78cT+3Tdgp6yfUW6AIj7lP66rA+4y4LpJa1FrYFxKTtj3AegK3nKUVrMSCO Cbxg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:cc:to:subject :message-id:date:from:mime-version:dkim-signature :arc-authentication-results; bh=Fl31pIqXxCk8S8UMDwAERUU7z+YgeasMxFErwQw2Gxo=; b=A9FDx5RDeEmluZJ3OeCyXM7Xw73Kky0eK/C7OiO4AkjJKLuDS2li7IRd9z1DkfgW61 uvzyQLrk03ZR+CfYwDHCPXnNRRSlX8o/sIT1VBcrw+RmP7NTTimQ1gr59k7SMOqfO2OX cH9T4uPVCLVYvaR3Evb+9iZJDSQ9lollQj27twDnq5gNvg6F6NwRCYRZAlHK3eQUeBCt MSerrGzcVJbauauJKhvluxTnRFfVgMr8a3XQAPUP5SFH2lRYyib0mJZ9UUfdq0nr6mKa fIuL2JZjFvHQBeFIfSHsR6QRd+OnxLeoUKoWaQxN2qiC0lBh6GC2z/nLB84C85jP9HOl QmDQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=ht60gzYS; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id c10-v6si22446644pll.275.2018.06.06.06.35.37; Wed, 06 Jun 2018 06:35:51 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=ht60gzYS; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752246AbeFFNeW (ORCPT + 99 others); Wed, 6 Jun 2018 09:34:22 -0400 Received: from mail-lf0-f67.google.com ([209.85.215.67]:38330 "EHLO mail-lf0-f67.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752205AbeFFNeU (ORCPT ); Wed, 6 Jun 2018 09:34:20 -0400 Received: by mail-lf0-f67.google.com with SMTP id i83-v6so9161786lfh.5 for ; Wed, 06 Jun 2018 06:34:19 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:from:date:message-id:subject:to:cc :content-transfer-encoding; bh=Fl31pIqXxCk8S8UMDwAERUU7z+YgeasMxFErwQw2Gxo=; b=ht60gzYSJLGjJevhOQRVSNfTQR+r50nVCwOpi9GTfJsH+1paeuxKwwMMqaqyC+69i+ QnNwA+RQKTj6cZZUT5jKgG6fKC0BdCsT3WQThvy+ak9d1bVT1UULHO2j9cFDNKlfUyN4 IyjUw7Xtc0C6C/DdanSIKhV6NJ/UpDA4aRPnomF9W0RmeIqR6iWsIn+rrD12jI1K/Phr kaiLFRIUdWswU8yroV1JgotFOB81bLq/ETeEBwi+v2DFvsWOyIzGNc+XaMDobia4rZaG H2uiGNJveJPOX5o/4j8/V5qa3TUKoUWRvlTiULlHJDqXKSMNZOapPiAE8D4R6QIuFMSZ wEKA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:from:date:message-id:subject:to:cc :content-transfer-encoding; bh=Fl31pIqXxCk8S8UMDwAERUU7z+YgeasMxFErwQw2Gxo=; b=I4BQYB+LzDzNL12cSb2bLzB2oNXJTVwIMvtux5ziQKf76NS2u1ktS5l27oo1Pxny2H QLLx9Cbh3GJ6Bz4jrepTYsq+AK/pE87kf9whhfeQV7Z+LvuihfGNQG71O3p8UfBcq+hu 0op1UU8CGbuIutOsR97EW2yf5tGE+GWrgd+k9Kruc+UznkwKok4GDn6H/zm1nsIdyN4r ysauWn47glJP8TEBJ16cG8oY0W7ZnKDnI8Tkn1yW9jK+9tI9bFxDloDQ2+tLbuelxU/k uqt2lOtwq225tKNGJjZFHPfz1RFZyfDhSdhub0Y4RsbNkQvzsETxmLI8KA53vFVOZUS3 32+A== X-Gm-Message-State: APt69E2cTJ7KO1lCX0Mn109EEkEWzXRXpdCraKyQmkFSRL4KCT4nWPik bt79XiSuKQm9beEUwxOSkLnSO0XkNhQEHDjvNA== X-Received: by 2002:a19:9101:: with SMTP id t1-v6mr1864289lfd.29.1528292058264; Wed, 06 Jun 2018 06:34:18 -0700 (PDT) MIME-Version: 1.0 Received: by 2002:a19:e45c:0:0:0:0:0 with HTTP; Wed, 6 Jun 2018 06:33:47 -0700 (PDT) From: Gabriel C Date: Wed, 6 Jun 2018 15:33:47 +0200 Message-ID: Subject: Re: Kernel and ADM hardware roulette ( was AMD graphics performance regression in 4.15 and later ) To: =?UTF-8?Q?Christian_K=C3=B6nig?= Cc: Jean-Marc Valin , Dave Airlie , alexander.deucher@amd.com, Felix Kuehling , Laura Abbott , Andrew Morton , michel.daenzer@amd.com, dri-devel@lists.freedesktop.org, LKML , Linus Torvalds Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org 2018-06-06 14:19 GMT+02:00 Christian K=C3=B6nig : > Am 06.06.2018 um 14:08 schrieb Gabriel C: >> >> 2018-06-06 13:33 GMT+02:00 Christian K=C3=B6nig : >>> >>> Am 06.06.2018 um 13:28 schrieb Gabriel C: >>>> >>>> 2018-04-11 7:02 GMT+02:00 Gabriel C : >>>>>> >>>>>> 2018-04-11 6:00 GMT+02:00 Gabriel C : >>>>>> 2018-04-09 11:42 GMT+02:00 Christian K=C3=B6nig >>>>>> : >>>>>>> >>>>>>> Am 07.04.2018 um 00:00 schrieb Jean-Marc Valin: >>>>> >>>>> ... >>>>>> >>>>>> I can help testing code for 4.17/++ if you wish but that is >>>>>> *different* >>>>>> storry. >>>>>> >>>>> Quick tested an 4.16.0-11490-gb284d4d5a678 , amdgpu and radeon driver >>>>> are broken now in this one. >>>>> >>>>> radeon tells: >>>>> >>>>> ... >>>>> >>>>> [ 6.337838] [drm] PCIE GART of 2048M enabled (table at >>>>> 0x00000000001D6000). >>>>> [ 6.338210] radeon 0000:21:00.0: (-12) create WB bo failed >>>>> [ 6.338214] radeon 0000:21:00.0: disabling GPU acceleration >>>>> >>>>> ... >>>>> >>>> I have the same Issue now on final 4.17. >>> >>> >>> Actually Michel came up with a fix for the performance regression which >>> is >>> now backported to older kernels as well. >>> >>> So the original issue of this mail thread should be fixed by now. >> >> Ok , will test as soon I get the GPU to work :)) >> >>>> Also I played with BIOS options also which does not fix anything but >>>> changes the error message. >>>> >>>> IOMMU && SR-IOV disabled the error changes to this : >>>> >>>> [ 7.092044] [drm:r600_ring_test [radeon]] *ERROR* radeon: ring 0 >>>> test failed (scratch(0x850C)=3D0xCAFEDEAD) >>>> [ 7.092059] radeon 0000:21:00.0: disabling GPU acceleration >>>> >>>> >>>> While I could workaround SWIOTLB bugs in 4.15 and 4.16 , 4.17 seems to >>>> kill the GPU with no way >>>> for me to make it work ( at least I could not find any workaround by n= ow >>>> ) >>> >>> >>> That actually sounds like something completely different. Can you provi= de >>> a >>> full dmesg of radeon and/or amdgpu? >> >> Sure here from boot with IOMMU/SR-IOV ON/OFF in BIOS : >> >> >> http://ftp.frugalware.org/pub/other/people/crazy/radeon/dmesg-iommu-sr-i= ov-off.txt >> >> http://ftp.frugalware.org/pub/other/people/crazy/radeon/dmesg-iommu-sr-i= ov-on.txt >> >> Also nothing else changed in that setup just testing kernel 4.17. > > > That has nothing TODO with the driver nor the original bug you reported. = The > problem is that SME is active and that is currently not supported at all > with a that hardware. Ok .. so are we playing now kernel an AMD Hardware roulette on each release= ? SME was like this in kernel 4.16.x here and all worked. Also if you don't support SME at all now on that Hardware while worked befo= re please add proper error handling and proper dmesg messages letting the user know. radeon: xxxx : SME not supported on that Hardware anymore , please disable SME... radeon: xxxx: Update your GPU < or whatever > How hard would be that ? No one but developers , can guess from these error messges why his hardware suddenly isn't working anymore by just updating the kernel. > > Try to disable SME either in the BIOS or on the kernel command line. Yes that works but is not the point. Really you just can't break users setups like this.