Received: by 2002:a25:4158:0:0:0:0:0 with SMTP id o85csp4934811yba; Mon, 13 May 2019 02:19:36 -0700 (PDT) X-Google-Smtp-Source: APXvYqyU1rDUXQ4tqvFPVEgEVp0t9+ZsQ81hVZRmvxEmJC15r2NhGnmpRXvY2ut2GfbVuPVkHnvt X-Received: by 2002:a63:ce43:: with SMTP id r3mr11904055pgi.368.1557739175917; Mon, 13 May 2019 02:19:35 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1557739175; cv=none; d=google.com; s=arc-20160816; b=Yw0MRYUVqnEIcytjXZJeoIxfqMtjGbM4DzkUAUR2rx0K6uho9dLWivOTmIIQL36/0A vCtD2AmuyIdmBPoaCqW70D5Hf7WN/5Z7v3bI0TH9cMViYJJtAYaL/v+zCzCXIcA/G/d6 y5Ec9Gva5fTWpSNiT0oiNaWv1Sl9zFIBqFmzX5DsXrXPKD3uwU4gW9DviDqKjTZkjoZL 81oQBrqg35160VKrVQkyENGmqkSf/YPWJAml0LVcJFE1LHexraXvzFlxXbneiyaXORWp aLgKUpEtGLOsqVa9z7IJH7YMdbsAjqJBBCYyQCHSLTaxVbumITpb09AJR3c1uSu+HTkn 1Qtg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=v2B3pgLuEHv6i1B7nVGrRR+9VrIx8u+ZR2TKCwqNXMs=; b=w5fYLsyVOeHf2iypvU7n0Od88DuiJPL096YIHvHY3pCMk9uZn0NTg9QEHqL2aVzOzp aclDj+OIrW5b936BCzvYvL/bOpv0h/rd6gebbDqbqqh1JH21XtXFqhhaEtFOS3qwccZM CAltWZx817GnQ9Yw5jYMWrTJ7ajY6xB9p/VPXdI03qvze/KqVZmD8kW8zdZtS62LCjaZ o5PoNwVPKAhv5NXqi4HasRanLK6ss0YhOi9GPNeiV+sSEfjkIb0wW4vEXD1hBZSOm7fE 3Xncj9p0ivhhywbWiaJCDwYrnJkmjvcffc3SfQdQDyAHtzKbIZ9KLx+Q1rGUR+qViLfi hGmA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id m19si4610276pgl.241.2019.05.13.02.19.20; Mon, 13 May 2019 02:19:35 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728844AbfEMJOz (ORCPT + 99 others); Mon, 13 May 2019 05:14:55 -0400 Received: from mx1.redhat.com ([209.132.183.28]:50842 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728829AbfEMJOx (ORCPT ); Mon, 13 May 2019 05:14:53 -0400 Received: from smtp.corp.redhat.com (int-mx06.intmail.prod.int.phx2.redhat.com [10.5.11.16]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id 5F73D3086205; Mon, 13 May 2019 09:14:52 +0000 (UTC) Received: from [10.36.116.17] (ovpn-116-17.ams2.redhat.com [10.36.116.17]) by smtp.corp.redhat.com (Postfix) with ESMTPS id 1E1CF68D9B; Mon, 13 May 2019 09:14:46 +0000 (UTC) Subject: Re: [PATCH v3 02/16] iommu: Introduce cache_invalidate API To: Jacob Pan , iommu@lists.linux-foundation.org, LKML , Joerg Roedel , David Woodhouse , Alex Williamson , Jean-Philippe Brucker Cc: Yi Liu , "Tian, Kevin" , Raj Ashok , Christoph Hellwig , Lu Baolu , Andriy Shevchenko References: <1556922737-76313-1-git-send-email-jacob.jun.pan@linux.intel.com> <1556922737-76313-3-git-send-email-jacob.jun.pan@linux.intel.com> From: Auger Eric Message-ID: Date: Mon, 13 May 2019 11:14:45 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.4.0 MIME-Version: 1.0 In-Reply-To: <1556922737-76313-3-git-send-email-jacob.jun.pan@linux.intel.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit X-Scanned-By: MIMEDefang 2.79 on 10.5.11.16 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.42]); Mon, 13 May 2019 09:14:52 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Jacob, Jean-Philippe, On 5/4/19 12:32 AM, Jacob Pan wrote: > From: "Liu, Yi L" > > In any virtualization use case, when the first translation stage > is "owned" by the guest OS, the host IOMMU driver has no knowledge > of caching structure updates unless the guest invalidation activities > are trapped by the virtualizer and passed down to the host. > > Since the invalidation data are obtained from user space and will be > written into physical IOMMU, we must allow security check at various > layers. Therefore, generic invalidation data format are proposed here, > model specific IOMMU drivers need to convert them into their own format. > > Signed-off-by: Liu, Yi L > Signed-off-by: Jean-Philippe Brucker > Signed-off-by: Jacob Pan > Signed-off-by: Ashok Raj > Signed-off-by: Eric Auger > > --- > v6 -> v7: > - detail which fields are used for each invalidation type > - add a comment about multiple cache invalidation > > v5 -> v6: > - fix merge issue > > v3 -> v4: > - full reshape of the API following Alex' comments > > v1 -> v2: > - add arch_id field > - renamed tlb_invalidate into cache_invalidate as this API allows > to invalidate context caches on top of IOTLBs > > v1: > renamed sva_invalidate into tlb_invalidate and add iommu_ prefix in > header. Commit message reworded. > --- > drivers/iommu/iommu.c | 14 ++++++++ > include/linux/iommu.h | 15 ++++++++- > include/uapi/linux/iommu.h | 80 ++++++++++++++++++++++++++++++++++++++++++++++ > 3 files changed, 108 insertions(+), 1 deletion(-) > > diff --git a/drivers/iommu/iommu.c b/drivers/iommu/iommu.c > index 8df9d34..a2f6f3e 100644 > --- a/drivers/iommu/iommu.c > +++ b/drivers/iommu/iommu.c > @@ -1645,6 +1645,20 @@ void iommu_detach_pasid_table(struct iommu_domain *domain) > } > EXPORT_SYMBOL_GPL(iommu_detach_pasid_table); > > +int iommu_cache_invalidate(struct iommu_domain *domain, struct device *dev, > + struct iommu_cache_invalidate_info *inv_info) > +{ > + int ret = 0; > + > + if (unlikely(!domain->ops->cache_invalidate)) > + return -ENODEV; > + > + ret = domain->ops->cache_invalidate(domain, dev, inv_info); > + > + return ret; > +} > +EXPORT_SYMBOL_GPL(iommu_cache_invalidate); > + > static void __iommu_detach_device(struct iommu_domain *domain, > struct device *dev) > { > diff --git a/include/linux/iommu.h b/include/linux/iommu.h > index ab4d922..d182525 100644 > --- a/include/linux/iommu.h > +++ b/include/linux/iommu.h > @@ -266,6 +266,7 @@ struct page_response_msg { > * @page_response: handle page request response > * @attach_pasid_table: attach a pasid table > * @detach_pasid_table: detach the pasid table > + * @cache_invalidate: invalidate translation caches > * @pgsize_bitmap: bitmap of all possible supported page sizes > */ > struct iommu_ops { > @@ -328,8 +329,9 @@ struct iommu_ops { > int (*attach_pasid_table)(struct iommu_domain *domain, > struct iommu_pasid_table_config *cfg); > void (*detach_pasid_table)(struct iommu_domain *domain); > - > int (*page_response)(struct device *dev, struct page_response_msg *msg); > + int (*cache_invalidate)(struct iommu_domain *domain, struct device *dev, > + struct iommu_cache_invalidate_info *inv_info); > > unsigned long pgsize_bitmap; > }; > @@ -442,6 +444,9 @@ extern void iommu_detach_device(struct iommu_domain *domain, > extern int iommu_attach_pasid_table(struct iommu_domain *domain, > struct iommu_pasid_table_config *cfg); > extern void iommu_detach_pasid_table(struct iommu_domain *domain); > +extern int iommu_cache_invalidate(struct iommu_domain *domain, > + struct device *dev, > + struct iommu_cache_invalidate_info *inv_info); > extern struct iommu_domain *iommu_get_domain_for_dev(struct device *dev); > extern struct iommu_domain *iommu_get_dma_domain(struct device *dev); > extern int iommu_map(struct iommu_domain *domain, unsigned long iova, > @@ -982,6 +987,14 @@ static inline int iommu_sva_get_pasid(struct iommu_sva *handle) > static inline > void iommu_detach_pasid_table(struct iommu_domain *domain) {} > > +static inline int > +iommu_cache_invalidate(struct iommu_domain *domain, > + struct device *dev, > + struct iommu_cache_invalidate_info *inv_info) > +{ > + return -ENODEV; > +} > + > #endif /* CONFIG_IOMMU_API */ > > #ifdef CONFIG_IOMMU_DEBUGFS > diff --git a/include/uapi/linux/iommu.h b/include/uapi/linux/iommu.h > index 8848514..fa96ecb 100644 > --- a/include/uapi/linux/iommu.h > +++ b/include/uapi/linux/iommu.h > @@ -162,4 +162,84 @@ struct iommu_pasid_table_config { > }; > }; I noticed my qemu integration was currently incorrectly using PASID invalidation for ASID based invalidation (SMMUV3 Stage1 CMD_TLBI_NH_ASID invalidation command). So I think we also need ARCHID invalidation. Sorry for the late notice. > > +/* defines the granularity of the invalidation */ > +enum iommu_inv_granularity { > + IOMMU_INV_GRANU_DOMAIN, /* domain-selective invalidation */ IOMMU_INV_GRANU_ARCHID, /* archid-selective invalidation */ > + IOMMU_INV_GRANU_PASID, /* pasid-selective invalidation */ > + IOMMU_INV_GRANU_ADDR, /* page-selective invalidation */ > + IOMMU_INVAL_GRANU_NR, /* number of invalidation granularities */ > +}; > + > +/** > + * Address Selective Invalidation Structure > + * > + * @flags indicates the granularity of the address-selective invalidation > + * - if PASID bit is set, @pasid field is populated and the invalidation > + * relates to cache entries tagged with this PASID and matching the > + * address range. > + * - if ARCHID bit is set, @archid is populated and the invalidation relates > + * to cache entries tagged with this architecture specific id and matching > + * the address range. > + * - Both PASID and ARCHID can be set as they may tag different caches. > + * - if neither PASID or ARCHID is set, global addr invalidation applies > + * - LEAF flag indicates whether only the leaf PTE caching needs to be > + * invalidated and other paging structure caches can be preserved. > + * @pasid: process address space id > + * @archid: architecture-specific id > + * @addr: first stage/level input address > + * @granule_size: page/block size of the mapping in bytes > + * @nb_granules: number of contiguous granules to be invalidated > + */ > +struct iommu_inv_addr_info { > +#define IOMMU_INV_ADDR_FLAGS_PASID (1 << 0) > +#define IOMMU_INV_ADDR_FLAGS_ARCHID (1 << 1) > +#define IOMMU_INV_ADDR_FLAGS_LEAF (1 << 2) > + __u32 flags; > + __u32 archid; > + __u64 pasid; > + __u64 addr; > + __u64 granule_size; > + __u64 nb_granules; > +}; > + > +/** > + * First level/stage invalidation information > + * @cache: bitfield that allows to select which caches to invalidate > + * @granularity: defines the lowest granularity used for the invalidation: > + * domain > pasid > addr > + * > + * Not all the combinations of cache/granularity make sense: > + * > + * type | DEV_IOTLB | IOTLB | PASID | > + * granularity | | | cache | > + * -------------+---------------+---------------+---------------+ > + * DOMAIN | N/A | Y | Y | * ARCHID | N/A | Y | N/A | > + * PASID | Y | Y | Y | > + * ADDR | Y | Y | N/A | > + * > + * Invalidations by %IOMMU_INV_GRANU_ADDR use field @addr_info. * Invalidations by %IOMMU_INV_GRANU_ARCHID use field @archid. > + * Invalidations by %IOMMU_INV_GRANU_PASID use field @pasid. > + * Invalidations by %IOMMU_INV_GRANU_DOMAIN don't take any argument. > + * > + * If multiple cache types are invalidated simultaneously, they all > + * must support the used granularity. > + */ > +struct iommu_cache_invalidate_info { > +#define IOMMU_CACHE_INVALIDATE_INFO_VERSION_1 1 > + __u32 version; > +/* IOMMU paging structure cache */ > +#define IOMMU_CACHE_INV_TYPE_IOTLB (1 << 0) /* IOMMU IOTLB */ > +#define IOMMU_CACHE_INV_TYPE_DEV_IOTLB (1 << 1) /* Device IOTLB */ > +#define IOMMU_CACHE_INV_TYPE_PASID (1 << 2) /* PASID cache */ > +#define IOMMU_CACHE_TYPE_NR (3) > + __u8 cache; > + __u8 granularity; > + __u8 padding[2]; > + union { > + __u64 pasid; __u32 archid; Thanks Eric > + struct iommu_inv_addr_info addr_info; > + }; > +}; > + > + > #endif /* _UAPI_IOMMU_H */ >