Received: by 2002:a25:1985:0:0:0:0:0 with SMTP id 127csp712987ybz; Wed, 15 Apr 2020 17:18:08 -0700 (PDT) X-Google-Smtp-Source: APiQypKEq+xH4ZMp22hT7ylmtw1V9/F5Kiw+zijdw4lgM4dgZUu5AZ5CsmRd7NfSmCGDhxxH1aXG X-Received: by 2002:a05:6402:17f0:: with SMTP id t16mr26902261edy.157.1586996287868; Wed, 15 Apr 2020 17:18:07 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1586996287; cv=none; d=google.com; s=arc-20160816; b=XXqUN0bnk9QEddvjAEtfdCiRH+xzN/ZIOFTM1fApgooMcSgdW0XPN6nAi8XynfWjw8 rf0/Mlzp526GhQk7e/YfhnM1P/B96J9DIHagBAn+v2j4D7lyIpDfK+JB8w10nK1rZO0m 5+Vsd8OOONvDB0ta/5hfS0v//Gz4hSsoViL2mCiFMQDFXQqBQ0w778KNA6d6ducZg5AU 9J7dl+iGElikrFhU5WeVrEZXdCxn6Qos754K76etwRodP5TBdWom8bWV40YMG/SZCnAO HdS7FvAFORvu1PhMDCU+Wqtepb9Rv3eSv70wFyO4/XhnEDdUYVOkpS0f6FsjH1f/ej5A 4nMw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :organization:references:in-reply-to:message-id:subject:cc:to:from :date:ironport-sdr:ironport-sdr; bh=5VQ0/vbzCZO4E3HhsF4QYjhc6Rf34ptd+ZGLtg6pF8w=; b=qtDHSIrdEuLPNHM9sWNCtVC8XrUzUCYAAxwKqFrEkLfm5ujRwgp/X5y2j51Le+zYlx rdXDXwusONXxufl+eQHXk25D6mV6t4UpNsA8Y1p+Lks+ychVzjlmr+or46betMNGbdvM A0IMSTdmC/XFR0auWUrvHj/KHKsGvYazX4+YKHnil9V+k5ugMyi2CySwvAu1DpP/UuCR txxDL5qVkNafbe6636jtqVWsJuLRdrZooK3Dpr02w1H8b+eC2ymm9n0hrrNLHlo/qS7r C0n+L1M2KR/bxmpgvxFQe3pNzmFl8kXy4RwL93SJPDpmvAyJeiTMAIYztpiycuZAxQkg iFTQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id u3si12991275edp.278.2020.04.15.17.17.44; Wed, 15 Apr 2020 17:18:07 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1414926AbgDOPxU (ORCPT + 99 others); Wed, 15 Apr 2020 11:53:20 -0400 Received: from mga07.intel.com ([134.134.136.100]:60400 "EHLO mga07.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2404240AbgDOPxP (ORCPT ); Wed, 15 Apr 2020 11:53:15 -0400 IronPort-SDR: UK7pOp3orln/hRGlJaf1yIvcD3F9TmmcY7crDVAb0Ys8zjfmtPOH7bT2zpn4WNwpOytqC5jX/N YB/wC5eMNdzg== X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga003.jf.intel.com ([10.7.209.27]) by orsmga105.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 15 Apr 2020 08:53:13 -0700 IronPort-SDR: F6uUaqyd6cceLSV0Kq4PASCZhpm64sTmBxjVMAAcmV6plfo2xf/+A5Rr/5MG5sM7lvlIP+oEsC oMslcndCbN5A== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.72,387,1580803200"; d="scan'208";a="253561163" Received: from jacob-builder.jf.intel.com (HELO jacob-builder) ([10.7.199.155]) by orsmga003.jf.intel.com with ESMTP; 15 Apr 2020 08:53:13 -0700 Date: Wed, 15 Apr 2020 08:59:08 -0700 From: Jacob Pan To: Auger Eric Cc: eric.auger.pro@gmail.com, iommu@lists.linux-foundation.org, linux-kernel@vger.kernel.org, kvm@vger.kernel.org, kvmarm@lists.cs.columbia.edu, will@kernel.org, joro@8bytes.org, maz@kernel.org, robin.murphy@arm.com, jean-philippe@linaro.org, zhangfei.gao@linaro.org, shameerali.kolothum.thodi@huawei.com, alex.williamson@redhat.com, yi.l.liu@intel.com, peter.maydell@linaro.org, zhangfei.gao@gmail.com, tn@semihalf.com, zhangfei.gao@foxmail.com, bbhushan2@marvell.com, jacob.jun.pan@linux.intel.com Subject: Re: [PATCH v11 01/13] iommu: Introduce attach/detach_pasid_table API Message-ID: <20200415085908.0e1803b7@jacob-builder> In-Reply-To: References: <20200414150607.28488-1-eric.auger@redhat.com> <20200414150607.28488-2-eric.auger@redhat.com> <20200414151548.658a0401@jacob-builder> Organization: OTC X-Mailer: Claws Mail 3.13.2 (GTK+ 2.24.30; x86_64-pc-linux-gnu) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, 15 Apr 2020 16:52:10 +0200 Auger Eric wrote: > Hi Jacob, > On 4/15/20 12:15 AM, Jacob Pan wrote: > > Hi Eric, > > > > There are some discussions about how to size the uAPI data. > > https://lkml.org/lkml/2020/4/14/939 > > > > I think the problem with the current scheme is that when uAPI data > > gets extended, if VFIO continue to use: > > > > minsz = offsetofend(struct vfio_iommu_type1_set_pasid_table, > > config); if (copy_from_user(&spt, (void __user *)arg, minsz)) > > > > It may copy more data from user than what was setup by the user. > > > > So, as suggested by Alex, we could add argsz to the IOMMU uAPI > > struct. So if argsz > minsz, then fail the attach_table since > > kernel might be old, doesn't know about the extra data. > > If argsz <= minsz, kernel can support the attach_table but must > > process the data based on flags or config. > > So I guess we would need both an argsz _u32 + a new flag _u32 right? > Yes. > I am ok with that idea. Besides how will you manage for existing IOMMU > UAPIs? I plan to add argsz and flags (if not already have one) > At some point you envisionned to have a getter at iommu api > level to retrieve the size of a structure for a given version, right? > This idea is shot down. There is no version-size lookup. So the current plan is for user to fill out argsz in each IOMMU uAPI struct. VFIO does the copy_from_user() based on argsz (sanitized against the size of current kernel struct). IOMMU vendor driver process the data based on flags which indicates new capability/extensions. > Thanks > > Eric > > > > Does it make sense to you? > > > > > > On Tue, 14 Apr 2020 17:05:55 +0200 > > Eric Auger wrote: > > > >> From: Jacob Pan > >> > >> In virtualization use case, when a guest is assigned > >> a PCI host device, protected by a virtual IOMMU on the guest, > >> the physical IOMMU must be programmed to be consistent with > >> the guest mappings. If the physical IOMMU supports two > >> translation stages it makes sense to program guest mappings > >> onto the first stage/level (ARM/Intel terminology) while the host > >> owns the stage/level 2. > >> > >> In that case, it is mandated to trap on guest configuration > >> settings and pass those to the physical iommu driver. > >> > >> This patch adds a new API to the iommu subsystem that allows > >> to set/unset the pasid table information. > >> > >> A generic iommu_pasid_table_config struct is introduced in > >> a new iommu.h uapi header. This is going to be used by the VFIO > >> user API. > >> > >> Signed-off-by: Jean-Philippe Brucker > >> Signed-off-by: Liu, Yi L > >> Signed-off-by: Ashok Raj > >> Signed-off-by: Jacob Pan > >> Signed-off-by: Eric Auger > >> Reviewed-by: Jean-Philippe Brucker > >> --- > >> drivers/iommu/iommu.c | 19 ++++++++++++++ > >> include/linux/iommu.h | 18 ++++++++++++++ > >> include/uapi/linux/iommu.h | 51 > >> ++++++++++++++++++++++++++++++++++++++ 3 files changed, 88 > >> insertions(+) > >> > >> diff --git a/drivers/iommu/iommu.c b/drivers/iommu/iommu.c > >> index 2b471419e26c..b71ad56f8c99 100644 > >> --- a/drivers/iommu/iommu.c > >> +++ b/drivers/iommu/iommu.c > >> @@ -1723,6 +1723,25 @@ int iommu_sva_unbind_gpasid(struct > >> iommu_domain *domain, struct device *dev, } > >> EXPORT_SYMBOL_GPL(iommu_sva_unbind_gpasid); > >> > >> +int iommu_attach_pasid_table(struct iommu_domain *domain, > >> + struct iommu_pasid_table_config *cfg) > >> +{ > >> + if (unlikely(!domain->ops->attach_pasid_table)) > >> + return -ENODEV; > >> + > >> + return domain->ops->attach_pasid_table(domain, cfg); > >> +} > >> +EXPORT_SYMBOL_GPL(iommu_attach_pasid_table); > >> + > >> +void iommu_detach_pasid_table(struct iommu_domain *domain) > >> +{ > >> + if (unlikely(!domain->ops->detach_pasid_table)) > >> + return; > >> + > >> + domain->ops->detach_pasid_table(domain); > >> +} > >> +EXPORT_SYMBOL_GPL(iommu_detach_pasid_table); > >> + > >> static void __iommu_detach_device(struct iommu_domain *domain, > >> struct device *dev) > >> { > >> diff --git a/include/linux/iommu.h b/include/linux/iommu.h > >> index 7ef8b0bda695..3e1057c3585a 100644 > >> --- a/include/linux/iommu.h > >> +++ b/include/linux/iommu.h > >> @@ -248,6 +248,8 @@ struct iommu_iotlb_gather { > >> * @cache_invalidate: invalidate translation caches > >> * @sva_bind_gpasid: bind guest pasid and mm > >> * @sva_unbind_gpasid: unbind guest pasid and mm > >> + * @attach_pasid_table: attach a pasid table > >> + * @detach_pasid_table: detach the pasid table > >> * @pgsize_bitmap: bitmap of all possible supported page sizes > >> * @owner: Driver module providing these ops > >> */ > >> @@ -307,6 +309,9 @@ struct iommu_ops { > >> void *drvdata); > >> void (*sva_unbind)(struct iommu_sva *handle); > >> int (*sva_get_pasid)(struct iommu_sva *handle); > >> + int (*attach_pasid_table)(struct iommu_domain *domain, > >> + struct iommu_pasid_table_config > >> *cfg); > >> + void (*detach_pasid_table)(struct iommu_domain *domain); > >> > >> int (*page_response)(struct device *dev, > >> struct iommu_fault_event *evt, > >> @@ -446,6 +451,9 @@ extern int iommu_sva_bind_gpasid(struct > >> iommu_domain *domain, struct device *dev, struct > >> iommu_gpasid_bind_data *data); extern int > >> iommu_sva_unbind_gpasid(struct iommu_domain *domain, struct device > >> *dev, ioasid_t pasid); +extern int iommu_attach_pasid_table(struct > >> iommu_domain *domain, > >> + struct > >> iommu_pasid_table_config *cfg); +extern void > >> iommu_detach_pasid_table(struct iommu_domain *domain); extern > >> struct iommu_domain *iommu_get_domain_for_dev(struct device *dev); > >> extern struct iommu_domain *iommu_get_dma_domain(struct device > >> *dev); extern int iommu_map(struct iommu_domain *domain, unsigned > >> long iova, @@ -1048,6 +1056,16 @@ iommu_aux_get_pasid(struct > >> iommu_domain *domain, struct device *dev) return -ENODEV; } > >> > >> +static inline > >> +int iommu_attach_pasid_table(struct iommu_domain *domain, > >> + struct iommu_pasid_table_config *cfg) > >> +{ > >> + return -ENODEV; > >> +} > >> + > >> +static inline > >> +void iommu_detach_pasid_table(struct iommu_domain *domain) {} > >> + > >> static inline struct iommu_sva * > >> iommu_sva_bind_device(struct device *dev, struct mm_struct *mm, > >> void *drvdata) { > >> diff --git a/include/uapi/linux/iommu.h > >> b/include/uapi/linux/iommu.h index 4ad3496e5c43..8d00be10dc6d > >> 100644 --- a/include/uapi/linux/iommu.h > >> +++ b/include/uapi/linux/iommu.h > >> @@ -321,4 +321,55 @@ struct iommu_gpasid_bind_data { > >> }; > >> }; > >> > >> +/** > >> + * struct iommu_pasid_smmuv3 - ARM SMMUv3 Stream Table Entry > >> stage 1 related > >> + * information > >> + * @version: API version of this structure > >> + * @s1fmt: STE s1fmt (format of the CD table: single CD, linear > >> table > >> + * or 2-level table) > >> + * @s1dss: STE s1dss (specifies the behavior when @pasid_bits != 0 > >> + * and no PASID is passed along with the incoming > >> transaction) > >> + * @padding: reserved for future use (should be zero) > >> + * > >> + * The PASID table is referred to as the Context Descriptor (CD) > >> table on ARM > >> + * SMMUv3. Please refer to the ARM SMMU 3.x spec (ARM IHI 0070A) > >> for full > >> + * details. > >> + */ > >> +struct iommu_pasid_smmuv3 { > >> +#define PASID_TABLE_SMMUV3_CFG_VERSION_1 1 > >> + __u32 version; > >> + __u8 s1fmt; > >> + __u8 s1dss; > >> + __u8 padding[2]; > >> +}; > >> + > >> +/** > >> + * struct iommu_pasid_table_config - PASID table data used to bind > >> guest PASID > >> + * table to the host IOMMU > >> + * @version: API version to prepare for future extensions > >> + * @format: format of the PASID table > >> + * @base_ptr: guest physical address of the PASID table > >> + * @pasid_bits: number of PASID bits used in the PASID table > >> + * @config: indicates whether the guest translation stage must > >> + * be translated, bypassed or aborted. > >> + * @padding: reserved for future use (should be zero) > >> + * @smmuv3: table information when @format is > >> %IOMMU_PASID_FORMAT_SMMUV3 > >> + */ > >> +struct iommu_pasid_table_config { > >> +#define PASID_TABLE_CFG_VERSION_1 1 > >> + __u32 version; > >> +#define IOMMU_PASID_FORMAT_SMMUV3 1 > >> + __u32 format; > >> + __u64 base_ptr; > >> + __u8 pasid_bits; > >> +#define IOMMU_PASID_CONFIG_TRANSLATE 1 > >> +#define IOMMU_PASID_CONFIG_BYPASS 2 > >> +#define IOMMU_PASID_CONFIG_ABORT 3 > >> + __u8 config; > >> + __u8 padding[6]; > >> + union { > >> + struct iommu_pasid_smmuv3 smmuv3; > >> + }; > >> +}; > >> + > >> #endif /* _UAPI_IOMMU_H */ > > > > [Jacob Pan] > > > [Jacob Pan]