Received: by 2002:a25:4158:0:0:0:0:0 with SMTP id o85csp2535587yba; Mon, 22 Apr 2019 08:31:16 -0700 (PDT) X-Google-Smtp-Source: APXvYqxLxQNCFGaQgpQHnOdAYiuBHKc6c+xhW5QYRcn8UyTF8+H+4VqbGUUvlM9QwMcwsGWT6l5O X-Received: by 2002:aa7:87c5:: with SMTP id i5mr21205800pfo.20.1555947076683; Mon, 22 Apr 2019 08:31:16 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1555947076; cv=none; d=google.com; s=arc-20160816; b=lFENEdVDmLspTE9168jfuuoC1clmDBUyDlsvkbYqMCq6dwXV9eMI8+BR+M/NmhWcBB 3AvqJ0s6/UlaDbeUjn28tvCNuWQNCgoizvsNzJRyElpfaAbsKMzFlUNuI0y/taZ+etxA 3je5f9VKS8PKf9DOC71prpEBGetZPFcM9r8byPm4sDtn0PL277X6gl9m1n+G8HvzwTv3 wb13oDBf8FaLoYTys2qu5XJY+LOTNsM8+2yflrTsB/LIkbmBXX+QErsJ7p20CXa62Nxw 6q6sUkQ1/Z8qacTyav6vonmuQb7IC0oAxIcRpJ7zLOTLwXpqSkweH7GLK425B3MUCD+J reMg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :organization:references:in-reply-to:message-id:subject:cc:to:from :date; bh=XYwX765pfhpSceJFzAKFWDY5sgFpf7XquB9o+rusFmo=; b=xj85Qxj2dyNYKJ2moh6hNFC3W7TIRJW+SVtGeVz0Qw3pQCVorfmvz+imizLFszDRz5 zcbOyqJRmGWDTtw72Hz2CKAAAlXGl9KVtxEgztipoWvDXN+vgkurSkf8KqusBivkKrdP dF/rlRQRjIDiKlxBUYlXIJdu2t+frvwPWYrOHZ9iQQu9O2bUCcahue/ETLzvxYerCANc y/pgoZEurlUmTd/Qhup3LWFoLVnJtL3JIDE0b3tg4+csrRH2ORxJzE6BCwSWttzmvL8v TEG22pvBbQkBJv3ZnI7cR5b5ocALady4GBDlxAJR6BBygWBeHsDhSiXFykWDGSggyVwP AW7w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id m17si14353659pff.130.2019.04.22.08.31.01; Mon, 22 Apr 2019 08:31:16 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727714AbfDVOkB (ORCPT + 99 others); Mon, 22 Apr 2019 10:40:01 -0400 Received: from mx1.redhat.com ([209.132.183.28]:60460 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726132AbfDVOkA (ORCPT ); Mon, 22 Apr 2019 10:40:00 -0400 Received: from smtp.corp.redhat.com (int-mx07.intmail.prod.int.phx2.redhat.com [10.5.11.22]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id 7B3C881DEC; Mon, 22 Apr 2019 14:39:58 +0000 (UTC) Received: from x1.home (ovpn-116-122.phx2.redhat.com [10.3.116.122]) by smtp.corp.redhat.com (Postfix) with ESMTP id 33DF21001E97; Mon, 22 Apr 2019 14:39:51 +0000 (UTC) Date: Mon, 22 Apr 2019 08:39:50 -0600 From: Alex Williamson To: Yan Zhao Cc: intel-gvt-dev@lists.freedesktop.org, arei.gonglei@huawei.com, aik@ozlabs.ru, Zhengxiao.zx@alibaba-inc.com, shuangtai.tst@alibaba-inc.com, qemu-devel@nongnu.org, eauger@redhat.com, yi.l.liu@intel.com, ziye.yang@intel.com, mlevitsk@redhat.com, pasic@linux.ibm.com, felipe@nutanix.com, changpeng.liu@intel.com, Ken.Xue@amd.com, jonathan.davies@nutanix.com, shaopeng.he@intel.com, kvm@vger.kernel.org, linux-kernel@vger.kernel.org, libvir-list@redhat.com, eskultet@redhat.com, dgilbert@redhat.com, cohuck@redhat.com, kevin.tian@intel.com, zhenyuw@linux.intel.com, zhi.a.wang@intel.com, cjia@nvidia.com, kwankhede@nvidia.com Subject: Re: [PATCH 1/2] vfio/mdev: add version field as mandatory attribute for mdev device Message-ID: <20190422083950.25eb9e7c@x1.home> In-Reply-To: <20190419083505.19654-1-yan.y.zhao@intel.com> References: <20190419083258.19580-1-yan.y.zhao@intel.com> <20190419083505.19654-1-yan.y.zhao@intel.com> Organization: Red Hat MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Scanned-By: MIMEDefang 2.84 on 10.5.11.22 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.25]); Mon, 22 Apr 2019 14:39:59 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, 19 Apr 2019 04:35:04 -0400 Yan Zhao wrote: > device version attribute in mdev sysfs is used by user space software > (e.g. libvirt) to query device compatibility for live migration of VFIO > mdev devices. This attribute is mandatory if a mdev device supports live > migration. The Subject: doesn't quite match what's being proposed here. > It consists of two parts: common part and vendor proprietary part. > common part: 32 bit. lower 16 bits is vendor id and higher 16 bits > identifies device type. e.g., for pci device, it is > "pci vendor id" | (VFIO_DEVICE_FLAGS_PCI << 16). What purpose does this serve? If it's intended as some sort of namespace feature, shouldn't we first assume that we can only support migration to devices of the same type? Therefore each type would already have its own namespace. Also that would make the trailing bit of the version string listed below in the example redundant. A vendor is still welcome to include this in their version string if they wish, but I think the string should be entirely vendor defined. > vendor proprietary part: this part is varied in length. vendor driver can > specify any string to identify a device. > > When reading this attribute, it should show device version string of the > device of type . If a device does not support live migration, it > should return errno. > When writing a string to this attribute, it returns errno for > incompatibility or returns written string length in compatibility case. > If a device does not support live migration, it always returns errno. > > For user space software to use: > 1. > Before starting live migration, user space software first reads source side > mdev device's version. e.g. > "#cat \ > /sys/bus/pci/devices/0000\:00\:02.0/5ac1fb20-2bbf-4842-bb7e-36c58c3be9cd/mdev_type/version" > 00028086-193b-i915-GVTg_V5_4 > > 2. > Then, user space software writes the source side returned version string > to device version attribute in target side, and checks the return value. > If a negative errno is returned in the target side, then mdev devices in > source and target sides are not compatible; > If a positive number is returned and it equals to the length of written > string, then the two mdev devices in source and target side are compatible. > e.g. > (a) compatibility case > "# echo 00028086-193b-i915-GVTg_V5_4 > > /sys/bus/pci/devices/0000\:00\:02.0/882cc4da-dede-11e7-9180-078a62063ab1/mdev_type/version" > > (b) incompatibility case > "#echo 00028086-193b-i915-GVTg_V5_1 > > /sys/bus/pci/devices/0000\:00\:02.0/882cc4da-dede-11e7-9180-078a62063ab1/mdev_type/version" > -bash: echo: write error: Invalid argument > > 3. if two mdev devices are compatible, user space software can start > live migration, and vice versa. > > Note: if a mdev device does not support live migration, it either does > not provide a version attribute, or always returns errno when its version > attribute is read/written. I think it would be cleaner to do the former, not supply the attribute. This seems to do the latter in the sample drivers. Thanks, Alex > Cc: Alex Williamson > Cc: Erik Skultety > Cc: "Dr. David Alan Gilbert" > Cc: Cornelia Huck > Cc: "Tian, Kevin" > Cc: Zhenyu Wang > Cc: "Wang, Zhi A" > Cc: Neo Jia > Cc: Kirti Wankhede > > Signed-off-by: Yan Zhao > --- > Documentation/vfio-mediated-device.txt | 36 ++++++++++++++++++++++++++ > samples/vfio-mdev/mbochs.c | 17 ++++++++++++ > samples/vfio-mdev/mdpy.c | 16 ++++++++++++ > samples/vfio-mdev/mtty.c | 16 ++++++++++++ > 4 files changed, 85 insertions(+) > > diff --git a/Documentation/vfio-mediated-device.txt b/Documentation/vfio-mediated-device.txt > index c3f69bcaf96e..bc28471c0667 100644 > --- a/Documentation/vfio-mediated-device.txt > +++ b/Documentation/vfio-mediated-device.txt > @@ -202,6 +202,7 @@ Directories and files under the sysfs for Each Physical Device > | | |--- available_instances > | | |--- device_api > | | |--- description > + | | |--- version > | | |--- [devices] > | |--- [] > | | |--- create > @@ -209,6 +210,7 @@ Directories and files under the sysfs for Each Physical Device > | | |--- available_instances > | | |--- device_api > | | |--- description > + | | |--- version > | | |--- [devices] > | |--- [] > | |--- create > @@ -216,6 +218,7 @@ Directories and files under the sysfs for Each Physical Device > | |--- available_instances > | |--- device_api > | |--- description > + | |--- version > | |--- [devices] > > * [mdev_supported_types] > @@ -225,6 +228,8 @@ Directories and files under the sysfs for Each Physical Device > [], device_api, and available_instances are mandatory attributes > that should be provided by vendor driver. > > + version is a mandatory attribute if a mdev device supports live migration. > + > * [] > > The [] name is created by adding the device driver string as a prefix > @@ -246,6 +251,35 @@ Directories and files under the sysfs for Each Physical Device > This attribute should show the number of devices of type that can be > created. > > +* version > + > + This attribute is rw. It is used to check whether two devices are compatible > + for live migration. If this attribute is missing, then the corresponding mdev > + device is regarded as not supporting live migration. > + > + It consists of two parts: common part and vendor proprietary part. > + common part: 32 bit. lower 16 bits is vendor id and higher 16 bits identifies > + device type. e.g., for pci device, it is > + "pci vendor id" | (VFIO_DEVICE_FLAGS_PCI << 16). > + vendor proprietary part: this part is varied in length. vendor driver can > + specify any string to identify a device. > + > + When reading this attribute, it should show device version string of the device > + of type . If a device does not support live migration, it should > + return errno. > + When writing a string to this attribute, it returns errno for incompatibility > + or returns written string length in compatibility case. If a device does not > + support live migration, it always returns errno. > + > + for example. > + # cat \ > + /sys/bus/pci/devices/0000\:00\:02.0/mdev_supported_types/i915-GVTg_V5_2/version > + 00028086-193b-i915-GVTg_V5_2 > + > + #echo 00028086-193b-i915-GVTg_V5_2 > \ > + /sys/bus/pci/devices/0000\:00\:02.0/mdev_supported_types/i915-GVTg_V5_4/version > + -bash: echo: write error: Invalid argument > + > * [device] > > This directory contains links to the devices of type that have been > @@ -327,12 +361,14 @@ card. > | | |-- available_instances > | | |-- create > | | |-- device_api > + | | |-- version > | | |-- devices > | | `-- name > | `-- mtty-2 > | |-- available_instances > | |-- create > | |-- device_api > + | |-- version > | |-- devices > | `-- name > |-- mtty_dev > diff --git a/samples/vfio-mdev/mbochs.c b/samples/vfio-mdev/mbochs.c > index b038aa9f5a70..2f5ba96b91a2 100644 > --- a/samples/vfio-mdev/mbochs.c > +++ b/samples/vfio-mdev/mbochs.c > @@ -1391,11 +1391,28 @@ static ssize_t device_api_show(struct kobject *kobj, struct device *dev, > } > MDEV_TYPE_ATTR_RO(device_api); > > +static ssize_t version_show(struct kobject *kobj, struct device *dev, > + char *buf) > +{ > + /* do not support live migration */ > + return -EINVAL; > +} > + > +static ssize_t version_store(struct kobject *kobj, struct device *dev, > + const char *buf, size_t count) > +{ > + /* do not support live migration */ > + return -EINVAL; > +} > + > +static MDEV_TYPE_ATTR_RW(version); > + > static struct attribute *mdev_types_attrs[] = { > &mdev_type_attr_name.attr, > &mdev_type_attr_description.attr, > &mdev_type_attr_device_api.attr, > &mdev_type_attr_available_instances.attr, > + &mdev_type_attr_version.attr, > NULL, > }; > > diff --git a/samples/vfio-mdev/mdpy.c b/samples/vfio-mdev/mdpy.c > index cc86bf6566e4..ff15fdfc7d46 100644 > --- a/samples/vfio-mdev/mdpy.c > +++ b/samples/vfio-mdev/mdpy.c > @@ -695,11 +695,27 @@ static ssize_t device_api_show(struct kobject *kobj, struct device *dev, > } > MDEV_TYPE_ATTR_RO(device_api); > > +static ssize_t version_show(struct kobject *kobj, struct device *dev, > + char *buf) > +{ > + /* do not support live migration */ > + return -EINVAL; > +} > + > +static ssize_t version_store(struct kobject *kobj, struct device *dev, > + const char *buf, size_t count) > +{ > + /* do not support live migration */ > + return -EINVAL; > +} > +static MDEV_TYPE_ATTR_RW(version); > + > static struct attribute *mdev_types_attrs[] = { > &mdev_type_attr_name.attr, > &mdev_type_attr_description.attr, > &mdev_type_attr_device_api.attr, > &mdev_type_attr_available_instances.attr, > + &mdev_type_attr_version.attr, > NULL, > }; > > diff --git a/samples/vfio-mdev/mtty.c b/samples/vfio-mdev/mtty.c > index 1c77c370c92f..4ae3aad3474d 100644 > --- a/samples/vfio-mdev/mtty.c > +++ b/samples/vfio-mdev/mtty.c > @@ -1390,10 +1390,26 @@ static ssize_t device_api_show(struct kobject *kobj, struct device *dev, > > MDEV_TYPE_ATTR_RO(device_api); > > +static ssize_t version_show(struct kobject *kobj, struct device *dev, > + char *buf) > +{ > + /* do not support live migration */ > + return -EINVAL; > +} > + > +static ssize_t version_store(struct kobject *kobj, struct device *dev, > + const char *buf, size_t count) > +{ > + /* do not support live migration */ > + return -EINVAL; > +} > + > +static MDEV_TYPE_ATTR_RW(version); > static struct attribute *mdev_types_attrs[] = { > &mdev_type_attr_name.attr, > &mdev_type_attr_device_api.attr, > &mdev_type_attr_available_instances.attr, > + &mdev_type_attr_version.attr, > NULL, > }; >