Received: by 2002:a05:6a10:17d3:0:0:0:0 with SMTP id hz19csp212905pxb; Wed, 14 Apr 2021 13:23:06 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwLqGXPGJnw6pYcM9a+XAVFGfl883fl/8GouGRkQYGjg8SRD+64Q5ErEb/7Re+xBhgrTGiY X-Received: by 2002:a05:6402:344e:: with SMTP id l14mr30273edc.184.1618431786559; Wed, 14 Apr 2021 13:23:06 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1618431786; cv=none; d=google.com; s=arc-20160816; b=stX+tvIUwxe9rK14JjYwZxKNgYA9GiwfEF50V11uGdBqe7TotwWYhVbA1QDoXDX5m9 FdcHfeuDckTzJlWtwZdm/KbBjuVLuBlbdy4oBUdIm2xTiTt2WfsJAYwzrKvbF7LkfDhA fhqfJKMyxymNfDiCd/QYvz9EoLtU+FlooQ+FDVNX92ci6HM+gYmUq98loBquX2td3MjB yoMGTXWbF/YMsf4DACQHJUn6IYfGULefhTQv/8hiVahaKXBmUdYOYCzzyWnHj/AL3r1t x+0w6ejO5Q87jJTqr6xvOKZC1FhW7YPNof1RQU3VtDb1wOA6E9uXElA7gXrj1unhfF39 E02A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :to:subject:cc:ironport-sdr:ironport-sdr; bh=Zo4KnmUUCGNxZ/YV/nBWBlWMusO2xTIc3RBRpskkwRw=; b=qF22BQYCNmY2mxjdn/K4GmCVDCCEGh8VJVmN4BxsLpjQUrECLKp2PUXOsK+fcomGJf MEiX8Q4XDVm6qmcdqv5kPtMxhHUB/EPuznWJKLJOqdjrmhPEmRlQ9z7xCIKrI7SE6C3f of5fF58DY4+wss68avE0v904ZC4RvmY/kwAQ/+JoDzForCqag2g8nu7ZXaS9Qe/oyA3w 1zVvfgdzjqvT1YvP5T5JtPiXt8k53pLtRe3c0sT21lMZRPOFNuK7iZa5rm86pETR6+oT VMgQ0r7+tCAHaaj0ojfKFmu/aHaQZVIVqpTKKHk+MWFt704ONxA0dD/fpJAlZzrBbFKw ZAsg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id t13si322529ejs.195.2021.04.14.13.22.43; Wed, 14 Apr 2021 13:23:06 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1347471AbhDNHY5 (ORCPT + 99 others); Wed, 14 Apr 2021 03:24:57 -0400 Received: from mga07.intel.com ([134.134.136.100]:38252 "EHLO mga07.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1349710AbhDNHYg (ORCPT ); Wed, 14 Apr 2021 03:24:36 -0400 IronPort-SDR: zYFUc1Z3Bhf+6/AqwZTL1v89T6AkF1AuPrynnURcFUyybNuDY9vOV34m6X2G4DTeOAhknHKBFA xfDXn1KbSLfg== X-IronPort-AV: E=McAfee;i="6200,9189,9953"; a="258552405" X-IronPort-AV: E=Sophos;i="5.82,221,1613462400"; d="scan'208";a="258552405" Received: from fmsmga004.fm.intel.com ([10.253.24.48]) by orsmga105.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 14 Apr 2021 00:24:13 -0700 IronPort-SDR: 35lROPy9KaI3sWZf4uqKtjH/Qunpt9OTUZ5AFPasffdrQiGF7IKnLpGAMJwBXoXQVlETsZ32D6 mVt3msA3+Tgw== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.82,221,1613462400"; d="scan'208";a="443720741" Received: from allen-box.sh.intel.com (HELO [10.239.159.128]) ([10.239.159.128]) by fmsmga004.fm.intel.com with ESMTP; 14 Apr 2021 00:24:09 -0700 Cc: baolu.lu@linux.intel.com, Alex Williamson , Cornelia Huck , Kirti Wankhede , wanghaibin.wang@huawei.com, jiangkunkun@huawei.com, yuzenghui@huawei.com, lushenming@huawei.com Subject: Re: [PATCH v3 02/12] iommu: Add iommu_split_block interface To: Keqian Zhu , linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org, iommu@lists.linux-foundation.org, Robin Murphy , Will Deacon , Joerg Roedel , Yi Sun , Jean-Philippe Brucker , Jonathan Cameron , Tian Kevin References: <20210413085457.25400-1-zhukeqian1@huawei.com> <20210413085457.25400-3-zhukeqian1@huawei.com> From: Lu Baolu Message-ID: Date: Wed, 14 Apr 2021 15:14:32 +0800 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.7.1 MIME-Version: 1.0 In-Reply-To: <20210413085457.25400-3-zhukeqian1@huawei.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 4/13/21 4:54 PM, Keqian Zhu wrote: > Block(largepage) mapping is not a proper granule for dirty log tracking. > Take an extreme example, if DMA writes one byte, under 1G mapping, the > dirty amount reported is 1G, but under 4K mapping, the dirty amount is > just 4K. > > This adds a new interface named iommu_split_block in IOMMU base layer. > A specific IOMMU driver can invoke it during start dirty log. If so, the > driver also need to realize the split_block iommu ops. > > We flush all iotlbs after the whole procedure is completed to ease the > pressure of IOMMU, as we will hanle a huge range of mapping in general. > > Signed-off-by: Keqian Zhu > Signed-off-by: Kunkun Jiang > --- > drivers/iommu/iommu.c | 41 +++++++++++++++++++++++++++++++++++++++++ > include/linux/iommu.h | 11 +++++++++++ > 2 files changed, 52 insertions(+) > > diff --git a/drivers/iommu/iommu.c b/drivers/iommu/iommu.c > index 667b2d6d2fc0..bb413a927870 100644 > --- a/drivers/iommu/iommu.c > +++ b/drivers/iommu/iommu.c > @@ -2721,6 +2721,47 @@ int iommu_domain_set_attr(struct iommu_domain *domain, > } > EXPORT_SYMBOL_GPL(iommu_domain_set_attr); > > +int iommu_split_block(struct iommu_domain *domain, unsigned long iova, > + size_t size) > +{ > + const struct iommu_ops *ops = domain->ops; > + unsigned int min_pagesz; > + size_t pgsize; > + bool flush = false; > + int ret = 0; > + > + if (unlikely(!ops || !ops->split_block)) > + return -ENODEV; > + > + min_pagesz = 1 << __ffs(domain->pgsize_bitmap); > + if (!IS_ALIGNED(iova | size, min_pagesz)) { > + pr_err("unaligned: iova 0x%lx size 0x%zx min_pagesz 0x%x\n", > + iova, size, min_pagesz); > + return -EINVAL; > + } > + > + while (size) { > + flush = true; > + > + pgsize = iommu_pgsize(domain, iova, size); > + > + ret = ops->split_block(domain, iova, pgsize); > + if (ret) > + break; > + > + pr_debug("split handled: iova 0x%lx size 0x%zx\n", iova, pgsize); > + > + iova += pgsize; > + size -= pgsize; > + } > + > + if (flush) > + iommu_flush_iotlb_all(domain); > + > + return ret; > +} > +EXPORT_SYMBOL_GPL(iommu_split_block); Do you really have any consumers of this interface other than the dirty bit tracking? If not, I don't suggest to make this as a generic IOMMU interface. There is an implicit requirement for such interfaces. The iommu_map/unmap(iova, size) shouldn't be called at the same time. Currently there's no such sanity check in the iommu core. A poorly written driver could mess up the kernel by misusing this interface. This also applies to iommu_merge_page(). Best regards, baolu