Received: by 2002:a25:6193:0:0:0:0:0 with SMTP id v141csp463622ybb; Wed, 8 Apr 2020 03:33:29 -0700 (PDT) X-Google-Smtp-Source: APiQypKgQkuXFcIYvRCySyQ5/d2v5XHBwoCMpNlN1iaQBePVkCp4kyWmqWsR1apKgxGeAITHFW9E X-Received: by 2002:a05:6808:605:: with SMTP id y5mr1966521oih.172.1586342009373; Wed, 08 Apr 2020 03:33:29 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1586342009; cv=none; d=google.com; s=arc-20160816; b=mKss+7BNN2e+4yXE7gsnz9MapibQZvP8WkONnEVTVaYoEo1OtDjG4TZ1Ytp9n3xWiO /A6OJvmOcv8ViHebnOAkumS5S6hbAD7fBD81EAvLWd3yyIQ+M1EbnUB8zp+oAJaXB/5l ezPDzSFBfrqb1hoKwicPPKQc9o5KU+WbMg2DJz76aTG59HqPKecI8uM/Bw0Nzf8nH9oo hgl74pqoHifku2rdOgdzvXB6Y9FWELPEA8247TQGHvIy7L5gGb/6q607hy2/l9LWDD9V +rBNd1mGEnpAdunXAYkAW2RBMEJPs6lNvdL8EwpGK+nyMW8TH4UzdRq4L3lualFyM0s2 xz8A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:dkim-signature; bh=EC/D1f86N5vzsKE44T+7aeogRJCUDLjHigmziVHut9I=; b=o0hyzMqlg2dVowFLJonoWCSEvrnM0zkRWAYcXdxIn+iZwV5TntLJTZjJPHMeqeM5Zx zIbeGZiJ5CqN1VnLIDCn+CUsF2Kh+O9CkNpD9oJtRFDvYo+f8926RsA3dSFDdqvPPrsN jmBnbLKRgHnz1XtWBQSSLBfl7YitOpoV/cVT3IRoXTvOGqSOxRjEMZpavVO7UNNcqws/ Uhy2QWbwowXs+H9b9YmbQBZa1d2q85loIJZg1nXCW4eKqeC7h0ow7lOlarxPavGQbedp 9Ovga2PdvPmLlCqhs2djrNYrESWO3zNstg3mhAX9jKDgXHjEZVKi7jvpb3SAnSIokX+T x5HQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=KA4+i1+d; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id a5si1850658otd.116.2020.04.08.03.33.13; Wed, 08 Apr 2020 03:33:29 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=KA4+i1+d; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728080AbgDHK2R (ORCPT + 99 others); Wed, 8 Apr 2020 06:28:17 -0400 Received: from us-smtp-delivery-1.mimecast.com ([207.211.31.120]:37664 "EHLO us-smtp-1.mimecast.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1728099AbgDHK2R (ORCPT ); Wed, 8 Apr 2020 06:28:17 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1586341695; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=EC/D1f86N5vzsKE44T+7aeogRJCUDLjHigmziVHut9I=; b=KA4+i1+dkewsQ02CHXNDydg0TiWld8bvHdvUady9wgqykpwR9w7MfelLAHoc+Xe6Ggz7zb mTdU4a5IYOd5S2Fydwzzm7qZ4XrrMFgpsPmOzbK9677ci3/owMcdlDs9VJXiyS6SP7Hquz zQtfAacBmFMaMtuOISZiayG4JyL/nqE= Received: from mimecast-mx01.redhat.com (mimecast-mx01.redhat.com [209.132.183.4]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-381-KVkFh0gbOm2Hxe1TEk_Ytw-1; Wed, 08 Apr 2020 06:28:13 -0400 X-MC-Unique: KVkFh0gbOm2Hxe1TEk_Ytw-1 Received: from smtp.corp.redhat.com (int-mx04.intmail.prod.int.phx2.redhat.com [10.5.11.14]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx01.redhat.com (Postfix) with ESMTPS id 1ECA7801E5E; Wed, 8 Apr 2020 10:28:11 +0000 (UTC) Received: from [10.36.115.53] (ovpn-115-53.ams2.redhat.com [10.36.115.53]) by smtp.corp.redhat.com (Postfix) with ESMTPS id 7591C5D9CD; Wed, 8 Apr 2020 10:28:00 +0000 (UTC) Subject: Re: [PATCH v1 5/8] vfio/type1: Report 1st-level/stage-1 format to userspace To: "Liu, Yi L" , Jean-Philippe Brucker Cc: "alex.williamson@redhat.com" , "Tian, Kevin" , "jacob.jun.pan@linux.intel.com" , "joro@8bytes.org" , "Raj, Ashok" , "Tian, Jun J" , "Sun, Yi Y" , "peterx@redhat.com" , "iommu@lists.linux-foundation.org" , "kvm@vger.kernel.org" , "linux-kernel@vger.kernel.org" , "Wu, Hao" References: <1584880325-10561-1-git-send-email-yi.l.liu@intel.com> <1584880325-10561-6-git-send-email-yi.l.liu@intel.com> <20200403082305.GA1269501@myrica> From: Auger Eric Message-ID: Date: Wed, 8 Apr 2020 12:27:58 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.4.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=windows-1252 Content-Language: en-US Content-Transfer-Encoding: 7bit X-Scanned-By: MIMEDefang 2.79 on 10.5.11.14 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Yi, On 4/7/20 11:43 AM, Liu, Yi L wrote: > Hi Jean, > >> From: Jean-Philippe Brucker >> Sent: Friday, April 3, 2020 4:23 PM >> To: Auger Eric >> userspace >> >> On Wed, Apr 01, 2020 at 03:01:12PM +0200, Auger Eric wrote: >>>>>> header = vfio_info_cap_add(caps, sizeof(*nesting_cap), >>>>>> VFIO_IOMMU_TYPE1_INFO_CAP_NESTING, 1); >> @@ -2254,6 +2309,7 >>>>>> @@ static int vfio_iommu_info_add_nesting_cap(struct >>>>> vfio_iommu *iommu, >>>>>> /* nesting iommu type supports PASID requests (alloc/free) */ >>>>>> nesting_cap->nesting_capabilities |= VFIO_IOMMU_PASID_REQS; >>>>> What is the meaning for ARM? >>>> >>>> I think it's just a software capability exposed to userspace, on >>>> userspace side, it has a choice to use it or not. :-) The reason >>>> define it and report it in cap nesting is that I'd like to make the >>>> pasid alloc/free be available just for IOMMU with type >>>> VFIO_IOMMU_TYPE1_NESTING. Please feel free tell me if it is not good >>>> for ARM. We can find a proper way to report the availability. >>> >>> Well it is more a question for jean-Philippe. Do we have a system wide >>> PASID allocation on ARM? >> >> We don't, the PASID spaces are per-VM on Arm, so this function should consult the >> IOMMU driver before setting flags. As you said on patch 3, nested doesn't >> necessarily imply PASID support. The SMMUv2 does not support PASID but does >> support nesting stages 1 and 2 for the IOVA space. >> SMMUv3 support of PASID depends on HW capabilities. So I think this needs to be >> finer grained: >> >> Does the container support: >> * VFIO_IOMMU_PASID_REQUEST? >> -> Yes for VT-d 3 >> -> No for Arm SMMU >> * VFIO_IOMMU_{,UN}BIND_GUEST_PGTBL? >> -> Yes for VT-d 3 >> -> Sometimes for SMMUv2 >> -> No for SMMUv3 (if we go with BIND_PASID_TABLE, which is simpler due to >> PASID tables being in GPA space.) >> * VFIO_IOMMU_BIND_PASID_TABLE? >> -> No for VT-d >> -> Sometimes for SMMUv3 >> >> Any bind support implies VFIO_IOMMU_CACHE_INVALIDATE support. > > good summary. do you expect to see any > >> >>>>>> + nesting_cap->stage1_formats = formats; >>>>> as spotted by Kevin, since a single format is supported, rename >>>> >>>> ok, I was believing it may be possible on ARM or so. :-) will rename >>>> it. >> >> Yes I don't think an u32 is going to cut it for Arm :( We need to describe all sorts of >> capabilities for page and PASID tables (granules, GPA size, ASID/PASID size, HW >> access/dirty, etc etc.) Just saying "Arm stage-1 format" wouldn't mean much. I >> guess we could have a secondary vendor capability for these? > > Actually, I'm wondering if we can define some formats to stands for a set of > capabilities. e.g. VTD_STAGE1_FORMAT_V1 which may indicates the 1st level > page table related caps (aw, a/d, SRE, EA and etc.). And vIOMMU can parse > the capabilities. But eventually do we really need all those capability getters? I mean can't we simply rely on the actual call to VFIO_IOMMU_BIND_GUEST_PGTBL() to detect any mismatch? Definitively the error handling may be heavier on userspace but can't we manage. My fear is we end up with an overly complex series. This capability getter may be interesting if we can switch to a fallback implementation but here I guess we don't have any fallback. With smmuv3 nested stage we don't have any fallback solution either. For the versions, it is different because the userspace shall be able to adapt (or not) to the max version supported by the kernel. Thanks Eric > > Regards, > Yi Liu >