Received: by 2002:a05:6a10:206:0:0:0:0 with SMTP id 6csp692297pxj; Thu, 17 Jun 2021 11:31:42 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxJ99xZJlyhmCANl65wCfKDvFhl4wzKuo2AMx/ljcSN6QV7gNtyQFo/NyIKxW4j63CVSRb0 X-Received: by 2002:a17:906:7f0e:: with SMTP id d14mr6624946ejr.103.1623954702426; Thu, 17 Jun 2021 11:31:42 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1623954702; cv=none; d=google.com; s=arc-20160816; b=iZxT6wDzgHhFM12K0mC3Q9Z9fnTqoMIs+SnHT8wFOGwDufu9YRkeKeadLZLPdpQTNU 9izWgBxVFtIexU+OKM/2U3f5cu1snpvXvh9AOhXD4n3Vb9iHk6F5vv/cy9NjRcMCJwHI fx0UGXtvXCIKOsDk1uxLAXkE9hu97X6EY5DNUfRTcYR4+uy7DHgwSWc4PrkxNYSK9jRH C7xjYo1yBfVP10cT+EuzFfO6yugsHGUMd0xE5YW+nZe4iWxceON47/WJX9OjMkTN6pJh CSth6KaO0RwTInz4e1lAGOvf4W7um/2iqZooIIDPlLVR4gmbXrHlNBVOaxInRBHttrqk 3fIA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :message-id:subject:cc:to:from:date:dkim-signature; bh=pZ+I8MEDzBQfPtXQx+70BenZAOgupuWrwI/zA+ZDmsc=; b=bUwBCPZU+ugePf4Iyo6xh7Wmr6nQh1UeSwOqnDIXuIOoJqeFxdlvFdeJA/o0lvxGTZ fPYiAPJvmVvneiUQ0JLErLea6KNREelIgoAPzyb6+zplWe21nR1bCJwB8pYg4v+/KFR6 LMUVdGkyKprVGkhtPgXl+31TDS8MjZw1wy6rB77Evu7v4h4aqzB98xLbsqAlaLS+mCuh T0vvmoWdF1UtfPeTWcsIX6a2tPtalx2mKgmrk/IABNC8ocRtmzJFh3q+Jb2FE1tILnH2 5ldxVDwZsAOeYXmtVhpX6bxlS46Jiy4dIK0HqytwxEKmwfGOLG+fSmwhi/dMvKq4C7po syjA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=Y26Ymu84; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id q11si6489316ejs.90.2021.06.17.11.31.19; Thu, 17 Jun 2021 11:31:42 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=Y26Ymu84; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232704AbhFQQAd (ORCPT + 99 others); Thu, 17 Jun 2021 12:00:33 -0400 Received: from mail.kernel.org ([198.145.29.99]:35558 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231941AbhFQQAc (ORCPT ); Thu, 17 Jun 2021 12:00:32 -0400 Received: by mail.kernel.org (Postfix) with ESMTPSA id 5758361351; Thu, 17 Jun 2021 15:58:24 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1623945504; bh=Q6/JU/E0btuKwbG5TetUX1vsUfsVbPrY8xkHBDYqwIc=; h=Date:From:To:Cc:Subject:In-Reply-To:From; b=Y26Ymu84AfR2wvDLO2NKU00KsE3y8ppLK6t25Mr+96EKQMjfAyQue3hVQAfa5cq8A 77mwrdQIEjuudC7Egcuw+ovgpEFNEK1EbmFhlyldkQldKq98S6R4LPW8i3HPrxb9Cd rKRlLTlmFISd1+JEJmGPVw5Em8MPVJ/vActCqEV1PHsckAjDNDk4Clpd8rVnOI/Pf1 gHZ5Edw1o7iVo+bpsClulUKqWKtP5JpdPBBRF0M0n7VMHc71brfUNkd7guxN5TdSgs kkeUZbcrQuN7j6hee4H0jLVUzwRThTJSFCL12b9vE9UPa2eqaLAHuEn22dzKFPdYI+ n27q+2q17Oi/g== Date: Thu, 17 Jun 2021 10:58:23 -0500 From: Bjorn Helgaas To: Andy Shevchenko Cc: Christoph Hellwig , Dejin Zheng , corbet@lwn.net, jarkko.nikula@linux.intel.com, mika.westerberg@linux.intel.com, rric@kernel.org, bhelgaas@google.com, wsa@kernel.org, Sanket.Goswami@amd.com, linux-doc@vger.kernel.org, linux-i2c@vger.kernel.org, linux-pci@vger.kernel.org, linux-kernel@vger.kernel.org, kernel test robot Subject: Re: [PATCH v7 1/4] PCI: Introduce pcim_alloc_irq_vectors() Message-ID: <20210617155823.GA3077181@bjorn-Precision-5520> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Jun 17, 2021 at 04:20:16PM +0300, Andy Shevchenko wrote: > On Wed, Jun 16, 2021 at 02:25:43PM -0500, Bjorn Helgaas wrote: > > On Fri, Jun 11, 2021 at 12:37:22PM +0300, Andy Shevchenko wrote: > > > On Thu, Jun 10, 2021 at 05:41:43PM -0500, Bjorn Helgaas wrote: > > > > On Mon, Jun 07, 2021 at 11:39:13PM +0800, Dejin Zheng wrote: > > > > > Introduce pcim_alloc_irq_vectors(), a device-managed version of > > > > > pci_alloc_irq_vectors(). Introducing this function can simplify > > > > > the error handling path in many drivers. > > > > > > > > > > And use pci_free_irq_vectors() to replace some code in pcim_release(), > > > > > they are equivalent, and no functional change. It is more explicit > > > > > that pcim_alloc_irq_vectors() is a device-managed function. > > > > > > ... > > > > > > > > @@ -1989,10 +1989,7 @@ static void pcim_release(struct device *gendev, void *res) > > > > > struct pci_devres *this = res; > > > > > int i; > > > > > > > > > > - if (dev->msi_enabled) > > > > > - pci_disable_msi(dev); > > > > > - if (dev->msix_enabled) > > > > > - pci_disable_msix(dev); > > > > > + pci_free_irq_vectors(dev); > > > > > > > > If I understand correctly, this hunk is a nice simplification, but > > > > actually has nothing to do with making pcim_alloc_irq_vectors(). I > > > > have it split to a separate patch in my local tree. Or am I wrong > > > > about that? > > > > > > It's a good simplification that had to be done when pci_free_irq_vectors() > > > appeared. > > > > Sorry to be pedantic. You say the simplification "had to be done," > > but AFAICT there was no actual *requirement* for this simplification > > to be done since pci_free_irq_vectors() is functionally identical to > > the previous code. > > I think we should do it because it's a little > > simpler, but not because it *fixes* anything. > > It makes things more straightforward. So it definitely "fixes" something, but > not the code in this case, rather how we maintain this code. > > > > But here is the fact that indirectly it's related to the pcim_*() > > > APIs, i.e. pcim_alloc_irq_vectors(), because you may noticed this is inside > > > pcim_release(). > > > > Yes. For posterity, my notes about the call chain (after applying > > this patch): > > > > pci_alloc_irq_vectors > > pci_alloc_irq_vectors_affinity > > __pci_enable_msix_range # MSI-X path > > __pci_enable_msix > > msix_capability_init > > msix_setup_entries > > for (...) > > entry = alloc_msi_entry > > kzalloc(msi_desc) <--- alloc > > kmemdup(msi_desc->affinity) <--- alloc > > dev->msix_enabled = 1 # MSI-X enabled > > __pci_enable_msi_range # MSI path > > msi_capability_init > > msi_setup_entry > > alloc_msi_entry <--- alloc > > dev->msi_enabled = 1 # MSI enabled > > > > pcim_release > > pci_free_irq_vectors > > pci_disable_msix # MSI-X > > if (!dev->msix_enabled) > > return > > pci_msix_shutdown > > dev->msix_enabled = 0 # MSI-X disabled > > free_msi_irqs > > list_for_each_entry_safe(..., msi_list, ...) > > free_msi_entry > > kfree(msi_desc->affinity) <--- free > > kfree(msi_desc) <--- free > > pci_disable_msi # MSI > > if (!dev->msi_enabled) > > return > > pci_msi_shutdown > > dev->msi_enabled = 0 # MSI disabled > > free_msi_irqs <--- free > > > > So I *think* (correct me if I'm wrong): > > > > - If a driver calls pcim_enable_device(), we will call > > pcim_release() when the last reference to the device is dropped. > > > > - pci_alloc_irq_vectors() allocates msi_desc and irq_affinity_desc > > structures via msix_setup_entries() or msi_setup_entry(). > > > > - pcim_release() will free those msi_desc and irq_affinity_desc > > structures. > > > > - Even before this series, pcim_release() frees msi_desc and > > irq_affinity_desc structures by calling pci_disable_msi() and > > pci_disable_msix(). > > > > - Calling pci_free_irq_vectors() (or pci_disable_msi() or > > pci_disable_msix()) twice is unnecessary but probably harmless > > because they bail out early. > > > So this series actually does not fix any problems whatsoever. > > I tend to disagree. > > The PCI managed API is currently inconsistent and what you got is > what I already know and had been using until (see below) Christoph > told not to do [1]. > > Even do you as PCI maintainer it took some time to figure this out. > But current APIs make it hard for mere users who wants to use it in > the drivers. > > So, main point of fix here is _API inconsistency_ [0]. > > But hey, I believe you have been Cc'ed to the initial submission of > the pci_*_irq_vector*() rework done by Christoph [2] (hmm... don't > see your name there). And he updated documentation as well [3]. > > Moreover, he insisted to use pci_free_irq_vectors() whenever we are > using pci_alloc_irq_vectors(). And he suggested if we want to avoid > this we have to make pcim_ variant of the API (see [1] again). I'd like to consider this, but it's hard without a reference :) I do think it would be helpful to have clear guidance about when drivers need to use pci_free_irq_vectors(). The existing text in msi-howto.rst doesn't address pcim_ at all. > Maybe you, guys, should got some agreement and clarify it in the > documentation? I agree that the pcim_*() API is confusing at best and it would be nice to improve it and document it, but I don't think this series really does it. There are several MSI-related interfaces that use alloc_msi_entry() and hence magically become managed if we call pcim_enable_device(): pci_alloc_irq_vectors() # ~150 callers pci_alloc_irq_vectors_affinity() # ~10 callers pci_enable_msix_exact() # ~20 callers (deprecated) pci_enable_msix_range() # ~50 callers (deprecated) pci_enable_msi() # ~100 callers (deprecated) This series adds pcim_alloc_irq_vectors(), which sort of fixes *one* of them and makes this sequence look nice: pcim_enable_device(); pcim_alloc_irq_vectors(); but all the others are still potentially managed even though the name doesn't indicate it. And it really doesn't improve the documentation. Possible steps forward: - Add comments in include/linux/pci.h to indicate deprecation (AFAICS, deprecation is currently only mentioned in msi-howto.rst). - Migrate callers away from deprecated interfaces (a lot of work). - Remove deprecated interfaces. - Add pcim_ variants of remaining interfaces (I think only pci_alloc_*()). Consider returning error for pci_alloc_*() usage by managed drivers. - Convert managed callers from pci_alloc_*() to pcim_alloc_*() and remove usage of pci_free_irq_vectors(), pci_disable_msi(), pci_disable_msix(). > [0]: We have a few functions with pcim_ prefix, few without and some from the > latter group imply to behave _differently_ when pcim_enable_device() had > been called. > [1]: I'm not able to find the archive of the mailing, but I remember that it > was something like that IIRC during 8250_lpss.c development. > [2]: https://lore.kernel.org/linux-pci/1467621574-8277-1-git-send-email-hch@lst.de/ > [3]: https://www.kernel.org/doc/html/latest/PCI/msi-howto.html#using-msi > > > It *does* remove unnecessary pci_free_irq_vectors() calls from > > i2c-designware-pcidrv.c. > > > > But because pci_alloc_irq_vectors() and related interfaces are > > *already* managed as soon as a driver calls pcim_enable_device(), > > we can simply remove the pci_free_irq_vectors() without doing anything > > else. > > > > I don't think we *should* do anything else. > > See above. > > > There are many callers of > > pcim_enable_device() that also call pci_alloc_irq_vectors(), > > pci_enable_msix_range(), etc. We don't have pcim_enable_msix_range(), > > pcim_enable_msi(), pcim_alloc_irq_vectors_affinity(), etc. I don't > > think it's worth the churn of adding all those and changing all the > > callers to use pcim_*() (as in patch 4/4 here). > > > > Browsing the output of this: > > > > git grep -En "pcim_enable_device|pci_alloc_irq_vectors|pci_enable_msix_|pci_free_irq_vectors|pci_disable_msi" > > > > leads me to believe there are similar calls of pci_free_irq_vectors() > > that could be removed here: > > > > mtip_pci_probe > > sp_pci_probe > > dw_edma_pcie_probe > > hisi_dma_probe > > ioat_pci_probe > > plx_dma_probe > > cci_pci_probe > > hibmc_pci_probe > > ... > > > > and many more, but I got tired of looking. > > > > > > > +/** > > > > > + * pcim_alloc_irq_vectors - a device-managed pci_alloc_irq_vectors() > > > > > + * @dev: PCI device to operate on > > > > > + * @min_vecs: minimum number of vectors required (must be >= 1) > > > > > + * @max_vecs: maximum (desired) number of vectors > > > > > + * @flags: flags or quirks for the allocation > > > > > + * > > > > > + * Return the number of vectors allocated, (which might be smaller than > > > > > + * @max_vecs) if successful, or a negative error code on error. If less > > > > > + * than @min_vecs interrupt vectors are available for @dev the function > > > > > + * will fail with -ENOSPC. > > > > > + * > > > > > + * It depends on calling pcim_enable_device() to make IRQ resources > > > > > + * manageable. > > > > > + */ > > > > > +static inline int > > > > > +pcim_alloc_irq_vectors(struct pci_dev *dev, unsigned int min_vecs, > > > > > + unsigned int max_vecs, unsigned int flags) > > > > > +{ > > > > > + if (!pci_is_managed(dev)) > > > > > + return -EINVAL; > > > > > + return pci_alloc_irq_vectors(dev, min_vecs, max_vecs, flags); > > > > > > > > This is great, but can you explain how pci_alloc_irq_vectors() > > > > magically becomes a managed interface if we've already called > > > > pcim_enable_device()? > > > > > > > > I certainly believe it does; I'd just like to put a hint in the commit > > > > log since my 5 minutes of grepping around didn't make it obvious to > > > > me. > > > > > > > > I see that pcim_enable_device() sets pdev->is_managed, but I didn't > > > > find the connection between that and pci_alloc_irq_vectors(). > > > > > > One needs to read and understand the code, I agree. The explanation is spread > > > between pcim_release() and __pci_enable_msi/x_range(). > > > > > > The call chain is > > > > > > msi_capability_init() / msix_capability_init() > > > ... > > > <- __pci_enable_msi/x_range() > > > <- pci_alloc_irq_vectors_affinity() > > > <- pci_alloc_irq_vectors() > > > > > > where device msi_enabled / msix_enabled is set. > > > > > > So, it may deserve to be explained in the commit message. > > > > > > > > +} > > -- > With Best Regards, > Andy Shevchenko > >