Date: Wed, 28 Oct 2020 16:52:46 -0500
From: Bjorn Helgaas
To: "Boris V."
Cc: linux-pci@vger.kernel.org, Rajat Jain, linux-kernel@vger.kernel.org
Subject: Re: Kernel 5.9 IOMMU groups regression/change
Message-ID: <20201028215246.GA351595@bjorn-Precision-5520>

[+cc Rajat, LKML]

On Tue, Oct 27, 2020 at 08:31:09PM +0100, Boris V. wrote:
> On 25/10/2020 20:45, Boris V. wrote:
> > With upgrade to kernel 5.9 my VMs stopped working, because some devices
> > can't be passed through.
> > This is caused by different IOMMU groups and devices being in the same
> > group.
> >
> > For ex. with kernel 5.8 this are IOMMU groups:
> > IOMMU Group 40:
> >         08:01.0 PCI bridge [0604]: ASMedia Technology Inc. Device [1b21:118f]
> >         09:00.0 Ethernet controller [0200]: Intel Corporation I211 Gigabit Network Connection [8086:1539] (rev 03)
> > IOMMU Group 43:
> >         0c:00.0 SATA controller [0106]: ASMedia Technology Inc. ASM1062 Serial ATA Controller [1b21:0612] (rev 02)
> > IOMMU Group 44:
> >         0d:00.0 USB controller [0c03]: ASMedia Technology Inc. ASM1042A USB 3.0 Host Controller [1b21:1142]
> >
> > Ethernet, SATA and USB controller in its own group.
> >
> > And with 5.9, everything is in one group:
> > IOMMU Group 29:
> >         00:1c.0 PCI bridge [0604]: Intel Corporation C610/X99 series chipset PCI Express Root Port #1 [8086:8d10] (rev d5)
> >         00:1c.3 PCI bridge [0604]: Intel Corporation C610/X99 series chipset PCI Express Root Port #4 [8086:8d16] (rev d5)
> >         00:1c.4 PCI bridge [0604]: Intel Corporation C610/X99 series chipset PCI Express Root Port #5 [8086:8d18] (rev d5)
> >         00:1c.6 PCI bridge [0604]: Intel Corporation C610/X99 series chipset PCI Express Root Port #7 [8086:8d1c] (rev d5)
> >         07:00.0 PCI bridge [0604]: ASMedia Technology Inc. Device [1b21:118f]
> >         08:01.0 PCI bridge [0604]: ASMedia Technology Inc. Device [1b21:118f]
> >         08:03.0 PCI bridge [0604]: ASMedia Technology Inc. Device [1b21:118f]
> >         08:04.0 PCI bridge [0604]: ASMedia Technology Inc. Device [1b21:118f]
> >         09:00.0 Ethernet controller [0200]: Intel Corporation I211 Gigabit Network Connection [8086:1539] (rev 03)
> >         0c:00.0 SATA controller [0106]: ASMedia Technology Inc. ASM1062 Serial ATA Controller [1b21:0612] (rev 02)
> >         0d:00.0 USB controller [0c03]: ASMedia Technology Inc. ASM1042A USB 3.0 Host Controller [1b21:1142]
> >
> >
> > This seems to be caused by commit
> > 52fbf5bdeeef415b28b8e6cdade1e48927927f60.
> > commit 52fbf5bdeeef415b28b8e6cdade1e48927927f60
> > Author: Rajat Jain
> > Date:   Tue Jul 7 15:46:02 2020 -0700
> >
> >     PCI: Cache ACS capability offset in device
> >
> >     Currently the ACS capability is being looked up at a number of places. Read
> >     and store it once at enumeration so that it can be used by all later.  No
> >     functional change intended.
> >
> >     Link: https://lore.kernel.org/r/20200707224604.3737893-2-rajatja@google.com
> >     Signed-off-by: Rajat Jain
> >     Signed-off-by: Bjorn Helgaas
> >
> >  drivers/pci/p2pdma.c |  2 +-
> >  drivers/pci/pci.c    | 20 ++++++++++++++++----
> >  drivers/pci/pci.h    |  2 +-
> >  drivers/pci/probe.c  |  2 +-
> >  drivers/pci/quirks.c |  8 ++++----
> >  include/linux/pci.h  |  1 +
> >  6 files changed, 24 insertions(+), 11 deletions(-)
> >
> >
> > If I revert this commit, I get back old groups.
> >
> > In commit log there is message 'No functional change intended'. But
> > there is functional change.
> >
> > This is Intel Core i7-5930K CPU and X99 chipset. But I see the same
> > thing on other Intel systems (didn't test on AMD).
>
> Some more info.
> Problem seems to be that pci_dev_specific_enable_acs() is not called
> anymore.
> Before, pci_enable_acs() was called from pci_init_capabilities() and in
> pci_enable_acs(), pci_dev_specific_enable_acs() was called.
> I don't know anything about PCI and this stuff, but I'm guessing that this
> function enable ACS for some Intel devices.
> But after this commit, pci_acs_init() is called from pci_init_capabilities()
> and if pci_find_ext_capability(dev, PCI_EXT_CAP_ID_ACS) returns 0,
> pci_enable_acs() and pci_dev_specific_enable_acs() is not called anymore.
> If I apply for ex. this patch bellow, groups are right again and everything
> works as before.

Thanks very much for the report and the debugging.  Maybe we can get
this sorted and fixed for v5.10-rc2 or -rc3.

> diff -ur linux-5.9.1.orig/drivers/pci/pci.c linux-5.9.1/drivers/pci/pci.c
> --- linux-5.9.1.orig/drivers/pci/pci.c  2020-10-17 08:31:22.000000000 +0200
> +++ linux-5.9.1/drivers/pci/pci.c       2020-10-27 19:01:32.650010803 +0100
> @@ -3502,9 +3502,7 @@
>  void pci_acs_init(struct pci_dev *dev)
>  {
>         dev->acs_cap = pci_find_ext_capability(dev, PCI_EXT_CAP_ID_ACS);
> -
> -       if (dev->acs_cap)
> -               pci_enable_acs(dev);
> +       pci_enable_acs(dev);
>  }
>
>  /**
>
>
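
For anyone following along, here is a minimal userspace sketch of the control flow described in the report. It is not kernel code: struct fake_dev, its fields, and the model_* functions are hypothetical stand-ins, and the assumption that the device-specific path is tried before the standard capability is mine. The sketch only shows why a device with no standard ACS capability but with a device-specific enable path gets skipped when the call is guarded by acs_cap.

```c
#include <stdbool.h>

/* Hypothetical stand-in for struct pci_dev; not the kernel type. */
struct fake_dev {
	int  acs_cap;        /* offset of the standard ACS capability, 0 if absent */
	bool has_acs_quirk;  /* assumed: a device-specific ACS enable path exists */
	bool acs_enabled;    /* outcome we want to observe */
};

/*
 * Models pci_enable_acs(): assumed to try the device-specific path
 * (pci_dev_specific_enable_acs() in the report) before falling back
 * to the standard capability.
 */
static void model_enable_acs(struct fake_dev *dev)
{
	if (dev->has_acs_quirk) {
		dev->acs_enabled = true;
		return;
	}
	if (dev->acs_cap)
		dev->acs_enabled = true;
}

/* 5.9 behaviour as reported: enable only if the standard capability was found. */
static void model_acs_init_5_9(struct fake_dev *dev)
{
	if (dev->acs_cap)
		model_enable_acs(dev);
}

/* Behaviour with the patch quoted above: always call the enable path. */
static void model_acs_init_patched(struct fake_dev *dev)
{
	model_enable_acs(dev);
}
```

Calling model_acs_init_5_9() on a device with acs_cap == 0 and has_acs_quirk == true leaves acs_enabled false, while model_acs_init_patched() sets it, which matches the grouping difference reported between 5.9 and the patched kernel.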