From: Lu Baolu
To: Jason Gunthorpe, Kevin Tian, Joerg Roedel, Will Deacon, Robin Murphy,
    Jean-Philippe Brucker, Nicolin Chen, Yi Liu, Jacob Pan, Joel Granados
Cc: iommu@lists.linux.dev, virtualization@lists.linux-foundation.org,
    linux-kernel@vger.kernel.org, Lu Baolu
Subject: [PATCH v6 08/10] iommufd: Associate fault object with iommufd_hw_pgtable
Date: Mon, 27 May 2024 12:05:15 +0800
Message-Id: <20240527040517.38561-9-baolu.lu@linux.intel.com>
In-Reply-To: <20240527040517.38561-1-baolu.lu@linux.intel.com>
References: <20240527040517.38561-1-baolu.lu@linux.intel.com>

When allocating a user iommufd_hw_pagetable, user space may associate a
fault object with the hw_pagetable by specifying the fault object ID in
the page table allocation data and setting the IOMMU_HWPT_FAULT_ID_VALID
flag bit.

After a successful hwpt allocation, the user can retrieve and respond to
page faults by reading and writing the file interface of the fault
object.

Once a fault object has been associated with a hwpt, the hwpt is
iopf-capable, as indicated by hwpt->fault being non-NULL.
Attaching, detaching, or replacing an iopf-capable hwpt on a RID or PASID
differs from doing so for a hwpt that is not iopf-capable.

Signed-off-by: Lu Baolu
---
 drivers/iommu/iommufd/iommufd_private.h |  9 ++++++
 include/uapi/linux/iommufd.h            |  8 +++++
 drivers/iommu/iommufd/fault.c           | 17 ++++++++++
 drivers/iommu/iommufd/hw_pagetable.c    | 41 +++++++++++++++++++------
 4 files changed, 66 insertions(+), 9 deletions(-)

diff --git a/drivers/iommu/iommufd/iommufd_private.h b/drivers/iommu/iommufd/iommufd_private.h
index ba89c86e1af7..db50881e76f6 100644
--- a/drivers/iommu/iommufd/iommufd_private.h
+++ b/drivers/iommu/iommufd/iommufd_private.h
@@ -457,8 +457,17 @@ struct iommufd_attach_handle {
 /* Convert an iommu attach handle to iommufd handle. */
 #define to_iommufd_handle(hdl)	container_of(hdl, struct iommufd_attach_handle, handle)
 
+static inline struct iommufd_fault *
+iommufd_get_fault(struct iommufd_ucmd *ucmd, u32 id)
+{
+	return container_of(iommufd_get_object(ucmd->ictx, id,
+					       IOMMUFD_OBJ_FAULT),
+			    struct iommufd_fault, obj);
+}
+
 int iommufd_fault_alloc(struct iommufd_ucmd *ucmd);
 void iommufd_fault_destroy(struct iommufd_object *obj);
+int iommufd_fault_iopf_handler(struct iopf_group *group);
 
 int iommufd_fault_domain_attach_dev(struct iommufd_hw_pagetable *hwpt,
 				    struct iommufd_device *idev);
diff --git a/include/uapi/linux/iommufd.h b/include/uapi/linux/iommufd.h
index eba452d4344e..5391db3a7180 100644
--- a/include/uapi/linux/iommufd.h
+++ b/include/uapi/linux/iommufd.h
@@ -357,10 +357,13 @@ struct iommu_vfio_ioas {
  *                                the parent HWPT in a nesting configuration.
  * @IOMMU_HWPT_ALLOC_DIRTY_TRACKING: Dirty tracking support for device IOMMU is
  *                                   enforced on device attachment
+ * @IOMMU_HWPT_FAULT_ID_VALID: The fault_id field of hwpt allocation data is
+ *                             valid.
  */
 enum iommufd_hwpt_alloc_flags {
 	IOMMU_HWPT_ALLOC_NEST_PARENT = 1 << 0,
 	IOMMU_HWPT_ALLOC_DIRTY_TRACKING = 1 << 1,
+	IOMMU_HWPT_FAULT_ID_VALID = 1 << 2,
 };
 
 /**
@@ -412,6 +415,9 @@ enum iommu_hwpt_data_type {
  * @data_type: One of enum iommu_hwpt_data_type
  * @data_len: Length of the type specific data
  * @data_uptr: User pointer to the type specific data
+ * @fault_id: The ID of IOMMUFD_FAULT object. Valid only if flags field of
+ *            IOMMU_HWPT_FAULT_ID_VALID is set.
+ * @__reserved2: Padding to 64-bit alignment. Must be 0.
  *
  * Explicitly allocate a hardware page table object. This is the same object
  * type that is returned by iommufd_device_attach() and represents the
@@ -442,6 +448,8 @@ struct iommu_hwpt_alloc {
 	__u32 data_type;
 	__u32 data_len;
 	__aligned_u64 data_uptr;
+	__u32 fault_id;
+	__u32 __reserved2;
 };
 #define IOMMU_HWPT_ALLOC _IO(IOMMUFD_TYPE, IOMMUFD_CMD_HWPT_ALLOC)
 
diff --git a/drivers/iommu/iommufd/fault.c b/drivers/iommu/iommufd/fault.c
index 94dde1f57cfc..dd07e3b1b4c1 100644
--- a/drivers/iommu/iommufd/fault.c
+++ b/drivers/iommu/iommufd/fault.c
@@ -416,3 +416,20 @@ int iommufd_fault_alloc(struct iommufd_ucmd *ucmd)
 
 	return rc;
 }
+
+int iommufd_fault_iopf_handler(struct iopf_group *group)
+{
+	struct iommufd_hw_pagetable *hwpt;
+	struct iommufd_fault *fault;
+
+	hwpt = group->attach_handle->domain->fault_data;
+	fault = hwpt->fault;
+
+	mutex_lock(&fault->mutex);
+	list_add_tail(&group->node, &fault->deliver);
+	mutex_unlock(&fault->mutex);
+
+	wake_up_interruptible(&fault->wait_queue);
+
+	return 0;
+}
diff --git a/drivers/iommu/iommufd/hw_pagetable.c b/drivers/iommu/iommufd/hw_pagetable.c
index 33d142f8057d..14a32bf80549 100644
--- a/drivers/iommu/iommufd/hw_pagetable.c
+++ b/drivers/iommu/iommufd/hw_pagetable.c
@@ -8,6 +8,15 @@
 #include "../iommu-priv.h"
 #include "iommufd_private.h"
 
+static void __iommufd_hwpt_destroy(struct iommufd_hw_pagetable *hwpt)
+{
+	if (hwpt->domain)
+		iommu_domain_free(hwpt->domain);
+
+	if (hwpt->fault)
+		iommufd_put_object(hwpt->fault->ictx, &hwpt->fault->obj);
+}
+
 void iommufd_hwpt_paging_destroy(struct iommufd_object *obj)
 {
 	struct iommufd_hwpt_paging *hwpt_paging =
@@ -22,9 +31,7 @@ void iommufd_hwpt_paging_destroy(struct iommufd_object *obj)
 					 hwpt_paging->common.domain);
 	}
 
-	if (hwpt_paging->common.domain)
-		iommu_domain_free(hwpt_paging->common.domain);
-
+	__iommufd_hwpt_destroy(&hwpt_paging->common);
 	refcount_dec(&hwpt_paging->ioas->obj.users);
 }
 
@@ -49,9 +56,7 @@ void iommufd_hwpt_nested_destroy(struct iommufd_object *obj)
 	struct iommufd_hwpt_nested *hwpt_nested =
 		container_of(obj, struct iommufd_hwpt_nested, common.obj);
 
-	if (hwpt_nested->common.domain)
-		iommu_domain_free(hwpt_nested->common.domain);
-
+	__iommufd_hwpt_destroy(&hwpt_nested->common);
 	refcount_dec(&hwpt_nested->parent->common.obj.users);
 }
 
@@ -213,7 +218,8 @@ iommufd_hwpt_nested_alloc(struct iommufd_ctx *ictx,
 	struct iommufd_hw_pagetable *hwpt;
 	int rc;
 
-	if (flags || !user_data->len || !ops->domain_alloc_user)
+	if ((flags & ~IOMMU_HWPT_FAULT_ID_VALID) ||
+	    !user_data->len || !ops->domain_alloc_user)
 		return ERR_PTR(-EOPNOTSUPP);
 	if (parent->auto_domain || !parent->nest_parent)
 		return ERR_PTR(-EINVAL);
@@ -227,7 +233,8 @@ iommufd_hwpt_nested_alloc(struct iommufd_ctx *ictx,
 	refcount_inc(&parent->common.obj.users);
 	hwpt_nested->parent = parent;
 
-	hwpt->domain = ops->domain_alloc_user(idev->dev, flags,
+	hwpt->domain = ops->domain_alloc_user(idev->dev,
+					      flags & ~IOMMU_HWPT_FAULT_ID_VALID,
 					      parent->common.domain, user_data);
 	if (IS_ERR(hwpt->domain)) {
 		rc = PTR_ERR(hwpt->domain);
@@ -308,13 +315,29 @@ int iommufd_hwpt_alloc(struct iommufd_ucmd *ucmd)
 		goto out_put_pt;
 	}
 
+	if (cmd->flags & IOMMU_HWPT_FAULT_ID_VALID) {
+		struct iommufd_fault *fault;
+
+		fault = iommufd_get_fault(ucmd, cmd->fault_id);
+		if (IS_ERR(fault)) {
+			rc = PTR_ERR(fault);
+			goto out_hwpt;
+		}
+		hwpt->fault = fault;
+		hwpt->domain->iopf_handler = iommufd_fault_iopf_handler;
+		hwpt->domain->fault_data = hwpt;
+	}
+
 	cmd->out_hwpt_id = hwpt->obj.id;
 	rc = iommufd_ucmd_respond(ucmd, sizeof(*cmd));
 	if (rc)
-		goto out_hwpt;
+		goto out_put_fault;
 	iommufd_object_finalize(ucmd->ictx, &hwpt->obj);
 	goto out_unlock;
 
+out_put_fault:
+	if (cmd->flags & IOMMU_HWPT_FAULT_ID_VALID)
+		iommufd_put_object(ucmd->ictx, &hwpt->fault->obj);
 out_hwpt:
 	iommufd_object_abort_and_destroy(ucmd->ictx, &hwpt->obj);
 out_unlock:
-- 
2.34.1
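
For illustration only, the user-space side of the uAPI change in this patch could look roughly like the sketch below. It is not part of the patch. struct sketch_hwpt_alloc is a local stand-in that assumes the field order of struct iommu_hwpt_alloc in include/uapi/linux/iommufd.h with the two fields added here, and SKETCH_HWPT_FAULT_ID_VALID and both helpers are hypothetical names. Real code would take the definitions from <linux/iommufd.h> and pass the struct to the IOMMU_HWPT_ALLOC ioctl.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Mirrors IOMMU_HWPT_FAULT_ID_VALID (1 << 2) as added by this patch. */
#define SKETCH_HWPT_FAULT_ID_VALID (1u << 2)

/*
 * Local stand-in for struct iommu_hwpt_alloc. Assumption: the leading
 * fields follow the upstream uAPI header; fault_id and reserved2
 * (__reserved2 in the patch) are the fields this patch appends.
 */
struct sketch_hwpt_alloc {
	uint32_t size;
	uint32_t flags;
	uint32_t dev_id;
	uint32_t pt_id;
	uint32_t out_hwpt_id;
	uint32_t reserved;
	uint32_t data_type;
	uint32_t data_len;
	uint64_t data_uptr;
	uint32_t fault_id;
	uint32_t reserved2;
};

/* The two new __u32 fields keep the struct a multiple of 8 bytes. */
static_assert(sizeof(struct sketch_hwpt_alloc) == 48,
	      "unexpected layout for the stand-in struct");

/* Hypothetical helper: zero the command and fill the common fields. */
static inline void sketch_hwpt_alloc_init(struct sketch_hwpt_alloc *cmd,
					  uint32_t dev_id, uint32_t pt_id)
{
	memset(cmd, 0, sizeof(*cmd));
	cmd->size = sizeof(*cmd);
	cmd->dev_id = dev_id;
	cmd->pt_id = pt_id;
}

/*
 * Hypothetical helper: request an iopf-capable hwpt by naming the fault
 * object and setting the flag that marks fault_id as valid.
 */
static inline void sketch_hwpt_alloc_set_fault(struct sketch_hwpt_alloc *cmd,
					       uint32_t fault_id)
{
	cmd->flags |= SKETCH_HWPT_FAULT_ID_VALID;
	cmd->fault_id = fault_id;
	cmd->reserved2 = 0;	/* padding, must be 0 per the kernel-doc */
}
```

A caller would initialise the command, call sketch_hwpt_alloc_set_fault() with the ID returned when the fault object was allocated, and then issue the allocation. Leaving the flag clear keeps the previous behaviour, since the kernel only looks at fault_id when IOMMU_HWPT_FAULT_ID_VALID is set.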