Received: by 2002:a05:6a10:413:0:0:0:0 with SMTP id 19csp2366965pxp; Fri, 18 Mar 2022 09:06:08 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwBScECSJqHtlgsEULGqOTRj01r/1OoSYqkcFxs7ZdlTzaXdH53/YSFgMjVsVX2VAwSShty X-Received: by 2002:a05:6808:1814:b0:2da:6acf:a06a with SMTP id bh20-20020a056808181400b002da6acfa06amr4668003oib.112.1647619568419; Fri, 18 Mar 2022 09:06:08 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1647619568; cv=none; d=google.com; s=arc-20160816; b=Ks9DHnNOIh4676GW1/EC02EORKg5C0TTxVuB4WCQQuB+iDHsXoWd2JKTrx/gY3fzaw aPZsMh3JpgmytiDNu87KiMHbscwR/eGlzmjlzFXjcP3t3ooLSuqOxIBwfFxI7dNgxn/V QTDVKVaaNt7WUAp9ArhItR0mh3lHkZug/BKcojXuanqPul1f20Xig4VX5AvOmaL/yfhO lbYGrwmGaikWI042aPv0TqQuE+kNKDdrERMFm13HvNrbUBXUEhy+tG99s4Cz6LUIBnIC 4P0mkSdAS98hfGk+h4JrJ69XPRvo9vUU7O6z47rW1kc2IGWDg7TyklK/WxpdiowumSxl vCvw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :organization:references:in-reply-to:message-id:subject:cc:to:from :date:dkim-signature; bh=TNP/cfaogRB9CjpdHZApi9zDHXNx0ATc+DZIQOXTyTo=; b=UR6z087L4yEm1SglBFWEYHio9T/8lfoVH/gCGefGroudEf8hZ7Y5PmTnOBZ79QOmC+ SfdavPszy9xG51AHUrjhG70V75tBpFRYEJVFV9C7vlnqKdaF7MC6vGiZ29GVtAXDslRv R+dgnjgiEOhT/A73yNaFpA0NBUF1j2+eNM0ktBbrjtkGNWgAHhcu569cuLRbk6I05aYe 7m84X6/tm9+WWVEKz9dI6UaoOPFKp+UiRYrBbpQfA29jFprYzK6xRWbVn263u9G7Y9UH MiRDwL3qeNIA2w7eSw+BUqEtJ415Y7C0oHfyp2i9RXi6Fn8eIKFggC7URjXBJfUEynBF OwsQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=H5+6DWtN; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id lx9-20020a0568704b8900b000db3d4d4c72si2307525oab.192.2022.03.18.09.05.54; Fri, 18 Mar 2022 09:06:08 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=H5+6DWtN; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S237642AbiCROtH (ORCPT + 99 others); Fri, 18 Mar 2022 10:49:07 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:34130 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S237710AbiCROsC (ORCPT ); Fri, 18 Mar 2022 10:48:02 -0400 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.129.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 844672EAF53 for ; Fri, 18 Mar 2022 07:46:31 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1647614790; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=TNP/cfaogRB9CjpdHZApi9zDHXNx0ATc+DZIQOXTyTo=; b=H5+6DWtNzk2hJYLxAOqgOGZNOVS4w9gFbAa+TIWc35vvcodBpZBxOmilsuEAInE6rO5SKC +gsm/olYGRguFKPMYgfacq3jE6zXy0JDcipBvEPtUzCJikN+5IXwZ+8qN19wE/OIdm33cj e7dFHFAgm9z/52SPoH6fMF9Xekfdvq8= Received: from mail-il1-f198.google.com (mail-il1-f198.google.com [209.85.166.198]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-437-WxVzzgh4NoWIROAKUZJ4xw-1; Fri, 18 Mar 2022 10:46:29 -0400 X-MC-Unique: WxVzzgh4NoWIROAKUZJ4xw-1 Received: by mail-il1-f198.google.com with SMTP id m17-20020a923f11000000b002c10e8f4c44so4941458ila.1 for ; Fri, 18 Mar 2022 07:46:29 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:date:from:to:cc:subject:message-id:in-reply-to :references:organization:mime-version:content-transfer-encoding; bh=TNP/cfaogRB9CjpdHZApi9zDHXNx0ATc+DZIQOXTyTo=; b=3xg6sJVb6mT9kRtTl56WRyJGIPY8imao0SIalbMD4hVd8A/5sl0Kll2UUgEuPBlrHf OcFWOEGBfypBmNjNuSvQby+tMqRvRJ48iaBp2x7bjcpvIIxc9laY0UGTiQVVOjp5XFI5 KcJhqo1Tu5W20jVWbXM09FunZxpFIjlMjFZ6VpfRcLtllOTmfxQm/1Uokh4yxOjCZ7VO cO3+5MPji6x7Slh0MHumPuJYHE56btG2l9QtSq4zBijYRgg8NfsD5qM+KRKcjJGw9g+N awhaEaDAgAEkb0iXFYzlb7rdDmytCSdCOwArYUe0sqbTzxXhfSPwgkI8em7vTe0CQ7Wz NZug== X-Gm-Message-State: AOAM533W5ne+iklOAcduPBQz8wsZFdnGUgtCq3Z1TJdRQlbOwLDAxbzY NmdU5G9yNY8g1tOPGcmKLDOVw/qD1Of28VjaAc6JP2M9WIxjoWIFyuAp8VxyH3oJMmf15vJh4Ks yMsSW7weeZ1bfPmbWP673X/cf X-Received: by 2002:a05:6602:1414:b0:63d:d5fd:c16e with SMTP id t20-20020a056602141400b0063dd5fdc16emr4602197iov.39.1647614788629; Fri, 18 Mar 2022 07:46:28 -0700 (PDT) X-Received: by 2002:a05:6602:1414:b0:63d:d5fd:c16e with SMTP id t20-20020a056602141400b0063dd5fdc16emr4602182iov.39.1647614788331; Fri, 18 Mar 2022 07:46:28 -0700 (PDT) Received: from redhat.com ([98.55.18.59]) by smtp.gmail.com with ESMTPSA id q7-20020a5d87c7000000b0064132d5bd73sm4433845ios.4.2022.03.18.07.46.27 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 18 Mar 2022 07:46:27 -0700 (PDT) Date: Fri, 18 Mar 2022 08:46:25 -0600 From: Alex Williamson To: Thorsten Leemhuis Cc: Paul Menzel , James Turner , Xinhui Pan , regressions@lists.linux.dev, kvm@vger.kernel.org, Greg KH , Lijo Lazar , LKML , amd-gfx@lists.freedesktop.org, Alexander Deucher , Alex Deucher , Christian =?UTF-8?B?S8O2bmln?= Subject: Re: [REGRESSION] Too-low frequency limit for AMD GPU PCI-passed-through to Windows VM Message-ID: <20220318084625.27d42a51.alex.williamson@redhat.com> In-Reply-To: References: <87ee57c8fu.fsf@turner.link> <87czkk1pmt.fsf@dmarc-none.turner.link> <87sftfqwlx.fsf@dmarc-none.turner.link> <87ee4wprsx.fsf@turner.link> <4b3ed7f6-d2b6-443c-970e-d963066ebfe3@amd.com> <87pmo8r6ob.fsf@turner.link> <5a68afe4-1e9e-c683-e06d-30afc2156f14@leemhuis.info> <87pmnnpmh5.fsf@dmarc-none.turner.link> <092b825a-10ff-e197-18a1-d3e3a097b0e3@leemhuis.info> <877d96to55.fsf@dmarc-none.turner.link> <87lexdw8gd.fsf@turner.link> <40b3084a-11b8-0962-4b33-34b56d3a87a3@molgen.mpg.de> Organization: Red Hat MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-3.6 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H4,RCVD_IN_MSPIKE_WL,SPF_HELO_NONE,SPF_NONE, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, 18 Mar 2022 08:01:31 +0100 Thorsten Leemhuis wrote: > On 18.03.22 06:43, Paul Menzel wrote: > > > > Am 17.03.22 um 13:54 schrieb Thorsten Leemhuis: > >> On 13.03.22 19:33, James Turner wrote: > >>> > >>>> My understanding at this point is that the root problem is probably > >>>> not in the Linux kernel but rather something else (e.g. the machine > >>>> firmware or AMD Windows driver) and that the change in f9b7f3703ff9 > >>>> ("drm/amdgpu/acpi: make ATPX/ATCS structures global (v2)") simply > >>>> exposed the underlying problem. > >> > >> FWIW: that in the end is irrelevant when it comes to the Linux kernel's > >> 'no regressions' rule. For details see: > >> > >> https://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git/tree/Documentation/admin-guide/reporting-regressions.rst > >> > >> https://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git/tree/Documentation/process/handling-regressions.rst > >> > >> > >> That being said: sometimes for the greater good it's better to not > >> insist on that. And I guess that might be the case here. > > > > But who decides that? > > In the end afaics: Linus. But he can't watch each and every discussion, > so it partly falls down to people discussing a regression, as they can > always decide to get him involved in case they are unhappy with how a > regression is handled. That obviously includes me in this case. I simply > use my best judgement in such situations. I'm still undecided if that > path is appropriate here, that's why I wrote above to see what James > would say, as he afaics was the only one that reported this regression. > > > Running stuff in a virtual machine is not that uncommon. > > No, it's about passing through a GPU to a VM, which is a lot less common > -- and afaics an area where blacklisting GPUs on the host to pass them > through is not uncommon (a quick internet search confirmed that, but I > might be wrong there). Right, interference from host drivers and pre-boot environments is always a concern with GPU assignment in particular. AMD GPUs have a long history of poor behavior relative to things like PCI secondary bus resets which we use to try to get devices to clean, reusable states for assignment. Here a device is being bound to a host driver that initiates some sort of power control, unbound from that driver and exposed to new drivers far beyond the scope of the kernel's regression policy. Perhaps it's possible to undo such power control when unbinding the device, but it's not necessarily a given that such a thing is possible for this device without a cold reset. IMO, it's not fair to restrict the kernel from such advancements. If the use case is within a VM, don't bind host drivers. It's difficult to make promises when dynamically switching between host and userspace drivers for devices that don't have functional reset mechanisms. Thanks, Alex