Received: by 2002:ab2:6d45:0:b0:1fb:d597:ff75 with SMTP id d5csp36883lqr; Tue, 4 Jun 2024 20:04:13 -0700 (PDT) X-Forwarded-Encrypted: i=3; AJvYcCXA89xBUdpEyWH18JFfM4OfJHb1+XgCp/T34+OvYG5b/DZo7RLOjnf5x4KMSIGFyoeF617jzOZb4JM1yOovAZrS5udqtwKg5ME++hJmZg== X-Google-Smtp-Source: AGHT+IE09EIvTDWXboqQH8otZNnXu0GcDOvxL6u8d6iGMiziw7IzEvl1YGTBCASjaNVz8BLAL5b/ X-Received: by 2002:a05:6a00:b93:b0:6ed:d8d2:5040 with SMTP id d2e1a72fcca58-703e5a1d57dmr1789020b3a.21.1717556652719; Tue, 04 Jun 2024 20:04:12 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1717556652; cv=pass; d=google.com; s=arc-20160816; b=vkWfk94swH0PMbJJs1knDmnT+h8Bos1eM7KaNNupiGJM1AE9mjgHVS2Vj6Y/B6t2VO VbLUnh0SVvXrbfo5JZ39qc64VpbBJ4Z7AAe5vL2Ol0wY5vbDY/mV6qugrxLrLldYBAJ7 x/d44pNfsvsY9fULiNChTRgwXxAUAgnO06u8z0dtrr+1LFQ0eahFlzkQuWKwJx2KsDob qiGDfqaAtNqP6fJ+crv/bJvc9ylxc75ONsIYSZ2z0IToRUXxXX4e7mu3ytUGW/PTBgFQ 4RRbRSVyIbg3HtvZf2lATG5nrP5lVY8eOlCRvJ27tyhtd3S/YZeluSTlFoFi5+qAQlEc /rog== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:in-reply-to:from:content-language :references:to:subject:cc:user-agent:mime-version:list-unsubscribe :list-subscribe:list-id:precedence:date:message-id:dkim-signature; bh=DsyQ0aFuujQprEsGpbUXi0MkXTn+JQVAP2AvmkRXJyo=; fh=nBgjHH8zp1vFX69aJLdYlgP555nTnI/jA+2imT6oIts=; b=GGRITR0JWqlLNRk+v8YhYF4wmAGIXAUI9vUS22GUTm/9GtWZimOFnVpNr1a3ytq3b6 W5xh2pLxaFYvSvY/Gwe9kXFcNaR7mweBPAU+RAcjc/JqdVH8NMgRDb0z9x5AqwyWh7eA Y4ITlvFLnEGcJKVC/w0029MjQRVyM1KavCXWtwlZBQ+dc4pSkopk6oXlZpFRIU4STIdP Fzd3HGHcq9DBRifuFvg4L/Q3+Ehr/mdmQvnaz1tCEGQENCSEB4OwLit/DUmBtjkbHfQl VqeBb4MkOT7sH53HpccgbPV8XY+P9/YfNjvPLz9uey21RIOKQqsp0+6EPhbNyZLG8ZSN 2/Zg==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=BqR2CXy1; arc=pass (i=1 dkim=pass dkdomain=intel.com dmarc=pass fromdomain=linux.intel.com); spf=pass (google.com: domain of linux-kernel+bounces-201664-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.48.161 as permitted sender) smtp.mailfrom="linux-kernel+bounces-201664-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from sy.mirrors.kernel.org (sy.mirrors.kernel.org. [147.75.48.161]) by mx.google.com with ESMTPS id 41be03b00d2f7-6c35c17bb84si9003795a12.615.2024.06.04.20.04.12 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 04 Jun 2024 20:04:12 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel+bounces-201664-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.48.161 as permitted sender) client-ip=147.75.48.161; Authentication-Results: mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=BqR2CXy1; arc=pass (i=1 dkim=pass dkdomain=intel.com dmarc=pass fromdomain=linux.intel.com); spf=pass (google.com: domain of linux-kernel+bounces-201664-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.48.161 as permitted sender) smtp.mailfrom="linux-kernel+bounces-201664-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by sy.mirrors.kernel.org (Postfix) with ESMTPS id 20878B22B69 for ; Wed, 5 Jun 2024 02:02:32 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id E95CB28379; Wed, 5 Jun 2024 02:02:25 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=intel.com header.i=@intel.com header.b="BqR2CXy1" Received: from mgamail.intel.com (mgamail.intel.com [192.198.163.19]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 477134AEE7 for ; Wed, 5 Jun 2024 02:02:23 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=192.198.163.19 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1717552944; cv=none; b=nAIwSUj5hLQLUpr7X7fPOKAumuBpuiCXDANfTFHe3uA/mCCYFvIDxHvfkkqfMjsZS1NZhrJRWaDG0KthfajJjdzKrhQ6I3YpNAq7B4GHKcBRNu+rTB6DbhVOwDG9aB+JyfXhViljAax4Y9y5Zfglr/VbYjQxyuXjehud9Eq1h1k= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1717552944; c=relaxed/simple; bh=DVudwJDdRkFCO/7DezhJu4C8/HNl1KHfIYtmD/xTOpU=; h=Message-ID:Date:MIME-Version:Cc:Subject:To:References:From: In-Reply-To:Content-Type; b=McW/Sgrm+iGycWh9wmtaDkhCQTpa12tPkFMXGJk2BvEMFBmH3UuDD2txVQjJtZdcMZX4yHY9rOsisWPtFM7Tc2yn4VZpaFyvnRu8oMHPUIWdumEwqDpj5ECmM1Jcsevf0xJUw+sAeJaR8QvKhMdVqG84S8b75ezt0X5ATbH6E+A= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=linux.intel.com; spf=none smtp.mailfrom=linux.intel.com; dkim=pass (2048-bit key) header.d=intel.com header.i=@intel.com header.b=BqR2CXy1; arc=none smtp.client-ip=192.198.163.19 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=linux.intel.com Authentication-Results: smtp.subspace.kernel.org; spf=none smtp.mailfrom=linux.intel.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1717552943; x=1749088943; h=message-id:date:mime-version:cc:subject:to:references: from:in-reply-to:content-transfer-encoding; bh=DVudwJDdRkFCO/7DezhJu4C8/HNl1KHfIYtmD/xTOpU=; b=BqR2CXy1+3Jj04Lpa1KPl87ywlpCBAs79nr4RVvkm5njy/0wXhH0hhNl gH14gFdCsCGsIzYyCU5IDoxsTq2hJQH0x6jyw+VWVZ7hJ+f/0M0kDFDXQ fp1ZDdX56owIHQYqFw9qw4zS/Juv+A0x6RmxO6uOz7uzEzmD7sHgH/85o ljhiI/25Le8JXx+Jwa1exWIWkGBZe4vD5qpQEMiQtm1yzXygDUjbAklUI 7CBS35Mey2dOzNaIo03mpLv2h42JuBjrk+ajpGlCgOAiawGxmcJW9RMoB +TX+0RRwqcY50evKDTMAllxA63SDIgCzcQDotYWDznYiqdq4m5cMW/DQU w==; X-CSE-ConnectionGUID: gPwMVJZHTFWHGl29UBXAew== X-CSE-MsgGUID: llb4Z1KMSF2xk/i1ShVz8w== X-IronPort-AV: E=McAfee;i="6600,9927,11093"; a="13942511" X-IronPort-AV: E=Sophos;i="6.08,215,1712646000"; d="scan'208";a="13942511" Received: from orviesa006.jf.intel.com ([10.64.159.146]) by fmvoesa113.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 04 Jun 2024 19:02:23 -0700 X-CSE-ConnectionGUID: RItW8yqwTuq9l9wOE2V1jA== X-CSE-MsgGUID: e3RctwFJShOkhwUisZR8pg== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="6.08,215,1712646000"; d="scan'208";a="37870969" Received: from unknown (HELO [10.239.159.127]) ([10.239.159.127]) by orviesa006.jf.intel.com with ESMTP; 04 Jun 2024 19:02:17 -0700 Message-ID: <91a42f46-060a-4430-a10a-7bfded67c4a9@linux.intel.com> Date: Wed, 5 Jun 2024 10:00:09 +0800 Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Cc: baolu.lu@linux.intel.com, David Airlie , Daniel Vetter , Kalle Valo , Bjorn Andersson , Mathieu Poirier , Alex Williamson , mst@redhat.com, Jason Wang , Thierry Reding , Jonathan Hunter , Mikko Perttunen , iommu@lists.linux.dev, dri-devel@lists.freedesktop.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH v2 01/22] iommu: Add iommu_user_domain_alloc() interface To: Yi Liu , Joerg Roedel , Will Deacon , Robin Murphy , Jason Gunthorpe , Kevin Tian References: <20240604015134.164206-1-baolu.lu@linux.intel.com> <20240604015134.164206-2-baolu.lu@linux.intel.com> <445fae9f-ea1e-4864-9f0e-f348c51146a1@intel.com> Content-Language: en-US From: Baolu Lu In-Reply-To: <445fae9f-ea1e-4864-9f0e-f348c51146a1@intel.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit On 6/4/24 4:03 PM, Yi Liu wrote: > On 2024/6/4 09:51, Lu Baolu wrote: >> Commit <909f4abd1097> ("iommu: Add new iommu op to create domains owned >> by userspace") added a dedicated iommu op to allocate a user domain. >> While IOMMUFD has already made use of this callback, other frameworks >> like vfio/type1 and vDPA still use the paging domain allocation >> interface. >> >> Add a new interface named iommu_user_domain_alloc(), which indicates the >> allocation of a domain for device DMA managed by user space driver. All >> device passthrough frameworks could use this interface for their domain >> allocation. >> >> Although it is expected that all iommu drivers could implement their own >> domain_alloc_user ops, most drivers haven't implemented it yet. Rollback >> to the paging domain allocation interface if the iommu driver hasn't >> implemented this op yet. >> >> Signed-off-by: Lu Baolu >> --- >>   include/linux/iommu.h |  6 ++++++ >>   drivers/iommu/iommu.c | 42 ++++++++++++++++++++++++++++++++++++++++++ >>   2 files changed, 48 insertions(+) >> >> diff --git a/include/linux/iommu.h b/include/linux/iommu.h >> index 7bc8dff7cf6d..6648b2415474 100644 >> --- a/include/linux/iommu.h >> +++ b/include/linux/iommu.h >> @@ -780,6 +780,7 @@ extern bool iommu_present(const struct bus_type >> *bus); >>   extern bool device_iommu_capable(struct device *dev, enum iommu_cap >> cap); >>   extern bool iommu_group_has_isolated_msi(struct iommu_group *group); >>   extern struct iommu_domain *iommu_domain_alloc(const struct bus_type >> *bus); >> +struct iommu_domain *iommu_user_domain_alloc(struct device *dev, u32 >> flags); >>   extern void iommu_domain_free(struct iommu_domain *domain); >>   extern int iommu_attach_device(struct iommu_domain *domain, >>                      struct device *dev); >> @@ -1086,6 +1087,11 @@ static inline struct iommu_domain >> *iommu_domain_alloc(const struct bus_type *bus >>       return NULL; >>   } >> +static inline struct iommu_domain *iommu_user_domain_alloc(struct >> device *dev, u32 flags) >> +{ >> +    return ERR_PTR(-ENODEV); >> +} >> + >>   static inline void iommu_domain_free(struct iommu_domain *domain) >>   { >>   } >> diff --git a/drivers/iommu/iommu.c b/drivers/iommu/iommu.c >> index 9df7cc75c1bc..f1416892ef8e 100644 >> --- a/drivers/iommu/iommu.c >> +++ b/drivers/iommu/iommu.c >> @@ -2032,6 +2032,48 @@ struct iommu_domain *iommu_domain_alloc(const >> struct bus_type *bus) >>   } >>   EXPORT_SYMBOL_GPL(iommu_domain_alloc); >> +/** >> + * iommu_user_domain_alloc() - Allocate a user domain >> + * @dev: device for which the domain is allocated >> + * @flags: iommufd_hwpt_alloc_flags defined in uapi/linux/iommufd.h >> + * >> + * Allocate a user domain which will be managed by a userspace >> driver. Return >> + * allocated domain if successful, or a ERR pointer for failure. > > do you want to mention that this is for paging domain allocation? You are worrying about its confusion with nesting domain allocation, right? My understanding is that if we want a common interface for nesting domain allocation, we should make it in another interface. Here, the user domain is a paging domain for GVA->HPA mapping, which is common across iommufd, vfio, and vdpa. > >> + */ >> +struct iommu_domain *iommu_user_domain_alloc(struct device *dev, u32 >> flags) >> +{ >> +    struct iommu_domain *domain; >> +    const struct iommu_ops *ops; >> + >> +    if (!dev_has_iommu(dev)) >> +        return ERR_PTR(-ENODEV); >> + >> +    ops = dev_iommu_ops(dev); >> +    if (ops->domain_alloc_user) { >> +        domain = ops->domain_alloc_user(dev, flags, NULL, NULL); >> +        if (IS_ERR(domain)) >> +            return domain; >> + >> +        domain->type = IOMMU_DOMAIN_UNMANAGED; >> +        domain->owner = ops; >> +        domain->pgsize_bitmap = ops->pgsize_bitmap; > > this seems to break the iommufd selftest as the mock driver sets extra > bits in the domain->pgsize_bitmap in allocation. Override it may fail > something in the testing. you may need to check if domain->pgsize_bitmap > is set or use &=. > > static struct iommu_domain *mock_domain_alloc_paging(struct device *dev) > { >     struct mock_dev *mdev = container_of(dev, struct mock_dev, dev); >     struct mock_iommu_domain *mock; > >     mock = kzalloc(sizeof(*mock), GFP_KERNEL); >     if (!mock) >         return NULL; >     mock->domain.geometry.aperture_start = MOCK_APERTURE_START; >     mock->domain.geometry.aperture_end = MOCK_APERTURE_LAST; >     mock->domain.pgsize_bitmap = MOCK_IO_PAGE_SIZE; >     if (dev && mdev->flags & MOCK_FLAGS_DEVICE_HUGE_IOVA) >         mock->domain.pgsize_bitmap |= MOCK_HUGE_PAGE_SIZE; >     mock->domain.ops = mock_ops.default_domain_ops; >     mock->domain.type = IOMMU_DOMAIN_UNMANAGED; >     xa_init(&mock->pfns); >     return &mock->domain; > } You are right. I should follow the code in __iommu_domain_alloc() /* * If not already set, assume all sizes by default; the driver * may override this later */ if (!domain->pgsize_bitmap) domain->pgsize_bitmap = ops->pgsize_bitmap; Does it work? Best regards, baolu