Date: Wed, 28 Feb 2018 15:30:26 -0600
From: Bjorn Helgaas
To: KarimAllah Ahmed
Cc: linux-pci@vger.kernel.org, Bjorn Helgaas, linux-kernel@vger.kernel.org
Subject: Re: [PATCH] pci: Store more data about VFs into the SRIOV struct
Message-ID: <20180228213026.GN127842@bhelgaas-glaptop.roam.corp.google.com>
References: <1516211063-1488-1-git-send-email-karahmed@amazon.de>
In-Reply-To: <1516211063-1488-1-git-send-email-karahmed@amazon.de>
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed, Jan 17, 2018 at 06:44:23PM +0100, KarimAllah Ahmed wrote:
> ... to avoid reading them from the config space of all the PCI VFs. This is
> specially a useful optimization when bringing up thousands of VFs.
>
> Cc: Bjorn Helgaas
> Cc: linux-pci@vger.kernel.org
> Cc: linux-kernel@vger.kernel.org
> Signed-off-by: KarimAllah Ahmed

What does this patch apply to? It doesn't apply to v4.16-rc1 (my
"master" branch). I don't see anything in the history of
drivers/pci/iov.c about pci_iov_wq_fn().

> ---
>  drivers/pci/iov.c   | 20 ++++++++++++++++++--
>  drivers/pci/pci.h   |  6 +++++-
>  drivers/pci/probe.c | 42 ++++++++++++++++++++++++++++++++----------
>  3 files changed, 55 insertions(+), 13 deletions(-)
>
> diff --git a/drivers/pci/iov.c b/drivers/pci/iov.c
> index 168328a..78e9595 100644
> --- a/drivers/pci/iov.c
> +++ b/drivers/pci/iov.c
> @@ -129,7 +129,7 @@ resource_size_t pci_iov_resource_size(struct pci_dev *dev, int resno)
>  	if (!dev->is_physfn)
>  		return 0;
>
> -	return dev->sriov->barsz[resno - PCI_IOV_RESOURCES];
> +	return dev->sriov->vf_barsz[resno - PCI_IOV_RESOURCES];
>  }
>
>  int batch_pci_iov_add_virtfn(struct pci_dev *dev, struct pci_bus **bus,
> @@ -325,6 +325,20 @@ static void pci_iov_wq_fn(struct work_struct *work)
>  	kfree(req);
>  }
>
> +static void pci_read_vf_config_common(struct pci_bus *bus,
> +				      struct pci_dev *dev)
> +{
> +	int devfn = pci_iov_virtfn_devfn(dev, 0);
> +
> +	pci_bus_read_config_dword(bus, devfn, PCI_CLASS_REVISION,
> +				  &dev->sriov->vf_class);
> +	pci_bus_read_config_word(bus, devfn, PCI_SUBSYSTEM_ID,
> +				 &dev->sriov->vf_subsystem_device);
> +	pci_bus_read_config_word(bus, devfn, PCI_SUBSYSTEM_VENDOR_ID,
> +				 &dev->sriov->vf_subsystem_vendor);
> +	pci_bus_read_config_byte(bus, devfn, PCI_HEADER_TYPE, &dev->sriov->vf_hdr_type);
> +}
> +
>  static struct workqueue_struct *pci_iov_wq;
>
>  static int __init init_pci_iov_wq(void)
> @@ -361,6 +375,8 @@ static int enable_vfs(struct pci_dev *dev, int nr_vfs)
>  		goto add_bus_fail;
>  	}
>
> +	pci_read_vf_config_common(bus[0], dev);
> +
>  	while (remaining_vfs > 0) {
>  		bool ret;
>  		struct pci_iov_wq_item *req;
> @@ -617,7 +633,7 @@ static int sriov_init(struct pci_dev *dev, int pos)
>  			rc = -EIO;
>  			goto failed;
>  		}
> -		iov->barsz[i] = resource_size(res);
> +		iov->vf_barsz[i] = resource_size(res);
>  		res->end = res->start + resource_size(res) * total - 1;
>  		dev_info(&dev->dev, "VF(n) BAR%d space: %pR (contains BAR%d for %d VFs)\n",
>  			 i, res, i, total);
> diff --git a/drivers/pci/pci.h b/drivers/pci/pci.h
> index f6b58b3..3264c9e 100644
> --- a/drivers/pci/pci.h
> +++ b/drivers/pci/pci.h
> @@ -271,7 +271,11 @@ struct pci_sriov {
>  	u16 driver_max_VFs; /* max num VFs driver supports */
>  	struct pci_dev *dev; /* lowest numbered PF */
>  	struct pci_dev *self; /* this PF */
> -	resource_size_t barsz[PCI_SRIOV_NUM_BARS]; /* VF BAR size */
> +	u8 vf_hdr_type; /* VF header type */
> +	u32 vf_class; /* VF device */
> +	u16 vf_subsystem_vendor; /* VF subsystem vendor */
> +	u16 vf_subsystem_device; /* VF subsystem device */
> +	resource_size_t vf_barsz[PCI_SRIOV_NUM_BARS]; /* VF BAR size */
>  	bool drivers_autoprobe; /* auto probing of VFs by driver */
>  };
>
> diff --git a/drivers/pci/probe.c b/drivers/pci/probe.c
> index 14e0ea1..65099d0 100644
> --- a/drivers/pci/probe.c
> +++ b/drivers/pci/probe.c
> @@ -175,6 +175,7 @@ static inline unsigned long decode_bar(struct pci_dev *dev, u32 bar)
>  int __pci_read_base(struct pci_dev *dev, enum pci_bar_type type,
>  		    struct resource *res, unsigned int pos)
>  {
> +	int bar = res - dev->resource;
>  	u32 l = 0, sz = 0, mask;
>  	u64 l64, sz64, mask64;
>  	u16 orig_cmd;
> @@ -194,9 +195,13 @@ int __pci_read_base(struct pci_dev *dev, enum pci_bar_type type,
>  	res->name = pci_name(dev);
>
>  	pci_read_config_dword(dev, pos, &l);
> -	pci_write_config_dword(dev, pos, l | mask);
> -	pci_read_config_dword(dev, pos, &sz);
> -	pci_write_config_dword(dev, pos, l);
> +	if (dev->is_virtfn) {
> +		sz = dev->physfn->sriov->vf_barsz[bar] & 0xffffffff;
> +	} else {
> +		pci_write_config_dword(dev, pos, l | mask);
> +		pci_read_config_dword(dev, pos, &sz);
> +		pci_write_config_dword(dev, pos, l);
> +	}
>
>  	/*
>  	 * All bits set in sz means the device isn't working properly.
> @@ -236,9 +241,14 @@ int __pci_read_base(struct pci_dev *dev, enum pci_bar_type type,
>
>  	if (res->flags & IORESOURCE_MEM_64) {
>  		pci_read_config_dword(dev, pos + 4, &l);
> -		pci_write_config_dword(dev, pos + 4, ~0);
> -		pci_read_config_dword(dev, pos + 4, &sz);
> -		pci_write_config_dword(dev, pos + 4, l);
> +
> +		if (dev->is_virtfn) {
> +			sz = (dev->physfn->sriov->vf_barsz[bar] >> 32) & 0xffffffff;
> +		} else {
> +			pci_write_config_dword(dev, pos + 4, ~0);
> +			pci_read_config_dword(dev, pos + 4, &sz);
> +			pci_write_config_dword(dev, pos + 4, l);
> +		}
>
>  		l64 |= ((u64)l << 32);
>  		sz64 |= ((u64)sz << 32);
> @@ -327,6 +337,8 @@ static void pci_read_bases(struct pci_dev *dev, unsigned int howmany, int rom)
>  	for (pos = 0; pos < howmany; pos++) {
>  		struct resource *res = &dev->resource[pos];
>  		reg = PCI_BASE_ADDRESS_0 + (pos << 2);
> +		if (dev->is_virtfn && dev->physfn->sriov->vf_barsz[pos] == 0)
> +			continue;
>  		pos += __pci_read_base(dev, pci_bar_unknown, res, reg);
>  	}
>
> @@ -1444,7 +1456,9 @@ int pci_setup_device(struct pci_dev *dev)
>  	struct pci_bus_region region;
>  	struct resource *res;
>
> -	if (pci_read_config_byte(dev, PCI_HEADER_TYPE, &hdr_type))
> +	if (dev->is_virtfn)
> +		hdr_type = dev->physfn->sriov->vf_hdr_type;
> +	else if (pci_read_config_byte(dev, PCI_HEADER_TYPE, &hdr_type))
>  		return -EIO;
>
>  	dev->sysdata = dev->bus->sysdata;
> @@ -1464,7 +1478,10 @@ int pci_setup_device(struct pci_dev *dev)
>  		     dev->bus->number, PCI_SLOT(dev->devfn),
>  		     PCI_FUNC(dev->devfn));
>
> -	pci_read_config_dword(dev, PCI_CLASS_REVISION, &class);
> +	if (dev->is_virtfn)
> +		class = dev->physfn->sriov->vf_class;
> +	else
> +		pci_read_config_dword(dev, PCI_CLASS_REVISION, &class);
>  	dev->revision = class & 0xff;
>  	dev->class = class >> 8; /* upper 3 bytes */
>
> @@ -1503,8 +1520,13 @@ int pci_setup_device(struct pci_dev *dev)
>  			goto bad;
>  		pci_read_irq(dev);
>  		pci_read_bases(dev, 6, PCI_ROM_ADDRESS);
> -		pci_read_config_word(dev, PCI_SUBSYSTEM_VENDOR_ID, &dev->subsystem_vendor);
> -		pci_read_config_word(dev, PCI_SUBSYSTEM_ID, &dev->subsystem_device);
> +		if (dev->is_virtfn) {
> +			dev->subsystem_vendor = dev->physfn->sriov->vf_subsystem_vendor;
> +			dev->subsystem_device = dev->physfn->sriov->vf_subsystem_device;
> +		} else {
> +			pci_read_config_word(dev, PCI_SUBSYSTEM_VENDOR_ID, &dev->subsystem_vendor);
> +			pci_read_config_word(dev, PCI_SUBSYSTEM_ID, &dev->subsystem_device);
> +		}
>
>  		/*
>  		 * Do the ugly legacy mode stuff here rather than broken chip
> --
> 2.7.4
>
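
For anyone reading the quoted __pci_read_base() hunks in isolation: on the
VF path the patch takes the low and high 32-bit halves of the cached
vf_barsz[] entry in place of the two config-space dword reads. Below is a
minimal standalone sketch of only that split-and-recombine arithmetic, in
userspace C. struct vf_cache, NUM_VF_BARS and the helper names are
hypothetical stand-ins for illustration and are not kernel code; the sketch
says nothing about how the kernel later interprets the value.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical: number of cached VF BAR slots in this sketch. */
#define NUM_VF_BARS 6

/* Hypothetical stand-in for the per-PF cache; not struct pci_sriov. */
struct vf_cache {
	uint64_t vf_barsz[NUM_VF_BARS];	/* cached value per VF BAR */
};

/* Low 32 bits, mirroring "vf_barsz[bar] & 0xffffffff" in the patch. */
static uint32_t cached_barsz_lo(const struct vf_cache *c, int bar)
{
	assert(bar >= 0 && bar < NUM_VF_BARS);
	return (uint32_t)(c->vf_barsz[bar] & 0xffffffff);
}

/* High 32 bits, mirroring "(vf_barsz[bar] >> 32) & 0xffffffff". */
static uint32_t cached_barsz_hi(const struct vf_cache *c, int bar)
{
	assert(bar >= 0 && bar < NUM_VF_BARS);
	return (uint32_t)((c->vf_barsz[bar] >> 32) & 0xffffffff);
}

/* Recombine the halves, mirroring "sz64 |= ((u64)sz << 32)". */
static uint64_t combine_barsz(uint32_t lo, uint32_t hi)
{
	return (uint64_t)lo | ((uint64_t)hi << 32);
}
```

Feeding the two halves back through combine_barsz() returns the cached
64-bit value unchanged, which is all this sketch demonstrates.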