Received: by 2002:a05:6358:9144:b0:117:f937:c515 with SMTP id r4csp128021rwr; Thu, 4 May 2023 16:05:11 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ7nxd3Wwd4udnnJeLR/fqMTlfob1gWrXjO/eTrxZYS5d118FW4LpI1XylXlV0E5LSSNHa2k X-Received: by 2002:a05:6a00:4206:b0:63b:6373:786b with SMTP id cd6-20020a056a00420600b0063b6373786bmr3880392pfb.2.1683241511502; Thu, 04 May 2023 16:05:11 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1683241511; cv=none; d=google.com; s=arc-20160816; b=psRANpA1qVmeEBOTdgVcnci5BsP30He4QDozsFfaHcWT+Aq5Xc22khPMMkM2cWC7ds gT6iBm8MMaeeAoIs6hbW7eIoYBANa5kZJTYKfM2F6Xu00rR0f4nSm9TFG1l42ZrwU0Ki khtBJ4Fxs+G6i2eWQaQES9LHmTbkj+473vnQJuM1O+z6rslotQeFbKacSYIwu73jhmRB lQKNFq6mwJrI9NoChklrwilB2ttUD+3GF2T92DZ6aIElUzVS/Vm8nq/GirxIgaPKBx3W WzVPtwCG1dno7JQf09C7L09tnKLuWT21SYxC96kxDSQXHAUFu2c+5GUDz5/8MRrxiN4M xkCg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :organization:references:in-reply-to:message-id:subject:cc:to:from :date:dkim-signature; bh=tHlUJkaaMMi794mdoguBQ2SnB4pHv7Ngx9HOAe9gT6I=; b=WaQWHJXKi50DKrjv3ADZNZ7vCN+XAG/8XhUVRlJ/kcTp3XjIdv73s77TR1sWtl7mkA uN8nTmFwqCNmmbsSFcCRKsHT+Eyr6aXxsJgr1HlQ9NMpz2HVTWZSEcws7wrR/OBb3H1t EsRFQRdSEpl95UUJ3iV7wO2k7lioxrlbNj0btpvuYrdZbGsgBtnj916ujdFnC4T2Gn1H UO2tgGSdnExqP/DHfztIhN96yJlDa87dolDQR/Tzbnv/VmU+0ocFloZQpxLBUDxdMeVR nq1zh8la2vgK31x7kI/WRH2fjao0ZCkzCseFePT0y/n+P9LVBPawb10AZOw5jDm5fDzR xyrg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=NMHyBS3r; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id t191-20020a6381c8000000b0052877366da6si517504pgd.104.2023.05.04.16.04.53; Thu, 04 May 2023 16:05:11 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=NMHyBS3r; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229535AbjEDW7P (ORCPT + 99 others); Thu, 4 May 2023 18:59:15 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:46346 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229446AbjEDW7N (ORCPT ); Thu, 4 May 2023 18:59:13 -0400 Received: from mga11.intel.com (mga11.intel.com [192.55.52.93]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id C8915114; Thu, 4 May 2023 15:59:12 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1683241152; x=1714777152; h=date:from:to:cc:subject:message-id:in-reply-to: references:mime-version:content-transfer-encoding; bh=xjHzN3TESceP/PzYSNJRFwhqxoyO0lB2D8Fa0Xfoc/Q=; b=NMHyBS3rNh0X5ycoTooZSIF56anx5IJfNc91yE+uXxLeW+NBs6se8mJX aKm89Q3fREzg6/42GOvlDUlL6hWycbpHhHnxSaq3JH+UBqTNswJ707SKG V8AZ/+hqjwgHBqZQbVudVReVEFlv97IKbqqDBkarLjBTvru1sBgbxaFE2 QiI6Qi5t3eglHAI2Msl0sYqswm6fTeepFakT6oK8cFugP21hoKbPTrar5 NVNRt7OpoglFSEnUuaEbbZZrS5PrbkPYjsoRYH7RK1cvEmblIB1uDshyF IDDsJZbpNr3EKrBa/ouXVjcKgecY40ZVTUJRJw68Y1jURkEuW2NCxdVcI Q==; X-IronPort-AV: E=McAfee;i="6600,9927,10700"; a="346549517" X-IronPort-AV: E=Sophos;i="5.99,250,1677571200"; d="scan'208";a="346549517" Received: from orsmga001.jf.intel.com ([10.7.209.18]) by fmsmga102.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 04 May 2023 15:59:12 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=McAfee;i="6600,9927,10700"; a="730016588" X-IronPort-AV: E=Sophos;i="5.99,250,1677571200"; d="scan'208";a="730016588" Received: from jacob-builder.jf.intel.com (HELO jacob-builder) ([10.24.100.114]) by orsmga001-auth.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 04 May 2023 15:59:12 -0700 Date: Thu, 4 May 2023 16:03:34 -0700 From: Jacob Pan To: Baolu Lu Cc: LKML , iommu@lists.linux.dev, Robin Murphy , Jason Gunthorpe , Joerg Roedel , dmaengine@vger.kernel.org, vkoul@kernel.org, Will Deacon , David Woodhouse , Raj Ashok , "Tian, Kevin" , Yi Liu , "Yu, Fenghua" , Dave Jiang , Tony Luck , "Zanussi, Tom" , narayan.ranganathan@intel.com, jacob.jun.pan@linux.intel.com Subject: Re: [PATCH v5 6/7] iommu/vt-d: Implement set_dev_pasid domain op Message-ID: <20230504160334.496085db@jacob-builder> In-Reply-To: <76c98e62-1cac-2ab6-7721-08ec2c1fceb8@linux.intel.com> References: <20230427174937.471668-1-jacob.jun.pan@linux.intel.com> <20230427174937.471668-7-jacob.jun.pan@linux.intel.com> <76c98e62-1cac-2ab6-7721-08ec2c1fceb8@linux.intel.com> Organization: OTC X-Mailer: Claws Mail 3.17.5 (GTK+ 2.24.32; x86_64-pc-linux-gnu) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-4.5 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_EF,RCVD_IN_DNSWL_MED,SPF_HELO_NONE, SPF_NONE,T_SCC_BODY_TEXT_LINE,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Baolu, On Wed, 3 May 2023 15:26:00 +0800, Baolu Lu wrote: > On 4/28/23 1:49 AM, Jacob Pan wrote: > > Devices that use ENQCMDS to submit work on buffers mapped by DMA API > > must attach a PASID to the default domain of the device. In preparation > > for this use case, this patch implements set_dev_pasid() for the > > default_domain_ops. > > > > If the device context has not been set up prior to this call, this will > > set up the device context in addition to PASID attachment. > > > > Signed-off-by: Jacob Pan > > --- > > drivers/iommu/intel/iommu.c | 92 ++++++++++++++++++++++++++++++------- > > 1 file changed, 76 insertions(+), 16 deletions(-) > > > > diff --git a/drivers/iommu/intel/iommu.c b/drivers/iommu/intel/iommu.c > > index 388453a7415e..f9d6c31cdc8e 100644 > > --- a/drivers/iommu/intel/iommu.c > > +++ b/drivers/iommu/intel/iommu.c > > @@ -278,6 +278,8 @@ static LIST_HEAD(dmar_satc_units); > > list_for_each_entry(rmrr, &dmar_rmrr_units, list) > > > > static void device_block_translation(struct device *dev); > > +static void intel_iommu_detach_device_pasid(struct iommu_domain > > *domain, > > + struct device *dev, > > ioasid_t pasid); static void intel_iommu_domain_free(struct > > iommu_domain *domain); > > int dmar_disabled = !IS_ENABLED(CONFIG_INTEL_IOMMU_DEFAULT_ON); > > @@ -4091,8 +4093,7 @@ static void device_block_translation(struct > > device *dev) iommu_disable_pci_caps(info); > > if (!dev_is_real_dma_subdevice(dev)) { > > if (sm_supported(iommu)) > > - intel_pasid_tear_down_entry(iommu, dev, > > - > > IOMMU_DEF_RID_PASID, false); > > + > > intel_iommu_detach_device_pasid(&info->domain->domain, dev, > > IOMMU_DEF_RID_PASID); > > device_block_translation() is called when switch RID's domain or release > the device. I assume that we don't need to touch this path when we add > the attach_dev_pasid support. > > Blocking DMA translation through RID/PASID should be done in > remove_dev_pasid path. > > Or, I overlooked anything? > > [...] > > > > > +static int intel_iommu_attach_device_pasid(struct iommu_domain *domain, > > + struct device *dev, > > ioasid_t pasid) +{ > > + struct device_domain_info *info = dev_iommu_priv_get(dev); > > + struct dmar_domain *dmar_domain = to_dmar_domain(domain); > > + struct intel_iommu *iommu = info->iommu; > > + int ret; > > + > > + if (!pasid_supported(iommu)) > > + return -ENODEV; > > + > > + ret = prepare_domain_attach_device(domain, dev); > > + if (ret) > > + return ret; > > + > > + /* > > + * Most likely the device context has already been set up, > > will only > > + * take a domain ID reference. Otherwise, device context will > > be set > > + * up here. > > The "otherwise" case is only default domain deferred attaching case, > right? it might be the only case so far, but my intention is to be general. i.e. no ordering requirements. I believe it is more future proof in case device_attach_pasid called before device_attach. > When the device driver starts to call attach_dev_pasid api, it means > that the bus and device DMA configuration have been done. We could do > the deferred default domain attaching now. So, perhaps we should add > below code in the core: > > diff --git a/drivers/iommu/iommu.c b/drivers/iommu/iommu.c > index f1dcfa3f1a1b..633b5ca53606 100644 > --- a/drivers/iommu/iommu.c > +++ b/drivers/iommu/iommu.c > @@ -3296,6 +3296,12 @@ int iommu_attach_device_pasid(struct iommu_domain > *domain, > if (!group) > return -ENODEV; > > + ret = iommu_deferred_attach(dev, group->default_domain); > + if (ret) { > + iommu_group_put(group); > + return ret; > + } it will cover the device_attach, but adding a special case. > mutex_lock(&group->mutex); > curr = xa_cmpxchg(&group->pasid_array, pasid, NULL, domain, > GFP_KERNEL); > if (curr) { > > Perhaps need to call iommu_deferred_attach() inside the group->mutex > critical region? i agree, attach RID_PASID should also be on the group's pasid_array. > > + * The upper layer APIs make no assumption about the ordering > > between > > + * device attachment and the PASID attachment. > > + */ > > + ret = dmar_domain_attach_device(to_dmar_domain(domain), dev); > > Calling attach_device on the attach_dev_pasid path is not right. I think it comes down to the philosophical differences in terms of who is responsible for ensuring device ctx is set up prior to device pasid attach: 1. vt-d driver 2. upper layer API > > + if (ret) { > > + dev_err(dev, "Attach device failed\n"); > > + return ret; > > + } > > + return dmar_domain_attach_device_pasid(dmar_domain, iommu, > > dev, pasid); +} > > + > > + > > + > > const struct iommu_ops intel_iommu_ops = { > > .capable = intel_iommu_capable, > > .domain_alloc = intel_iommu_domain_alloc, > > @@ -4802,6 +4861,7 @@ const struct iommu_ops intel_iommu_ops = { > > #endif > > .default_domain_ops = &(const struct iommu_domain_ops) { > > .attach_dev = > > intel_iommu_attach_device, > > + .set_dev_pasid = > > intel_iommu_attach_device_pasid, .map_pages = > > intel_iommu_map_pages, .unmap_pages = > > intel_iommu_unmap_pages, .iotlb_sync_map = > > intel_iommu_iotlb_sync_map, > > Best regards, > baolu Thanks, Jacob