Received: by 2002:a25:b323:0:0:0:0:0 with SMTP id l35csp1668039ybj; Fri, 20 Sep 2019 14:28:14 -0700 (PDT) X-Google-Smtp-Source: APXvYqxCedEWdX2l5BvQd12qc+svBRXSVEBYZ832OyPJ4YS41FS5vIQ7MXem/HJXmXWq5jvcZ5Vx X-Received: by 2002:a50:d718:: with SMTP id t24mr8959071edi.168.1569014894347; Fri, 20 Sep 2019 14:28:14 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1569014894; cv=none; d=google.com; s=arc-20160816; b=UFai0yrSnKeULKjfG2x8aKhfNaEp0c5D50+bT/3N0P3EuMHJflTO69kq+i+vGK8jTn oB83y9D5WJA+a1hyrdj8d6jPYJiwdL3T4s77x58VUB5VXLGmgPhDx9iueVo8rj7F8ij1 xZ58MReWE3N1XkR9q//FUazgdq0XzOwCWiW00ja2I3kiENbC0KFzulOgZGmR5yBIBfNz HUEp2/VjaeAAYmDARh6YwxbuuMr1pKrCGMEx+T+al0lGI8MzzLJK99vP+pZJWy+Qwzs+ ReLcWdnF/7axlrQ9hj7ktCAtgwBr8+TPJraPpGUsYZ2yFDT3EW1PjO+L7FMYvw5QcTAy T/4g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=DTsaTH2E0x2kvaTL5gnoypCsLGTSqYAz6zmm16pya94=; b=Y2H9woiA0zVeaAu1YL2a4cMFm8xMcahdrogJaOBMzLp3pxj6IJ8wHkrx3eytiPjatb 54Axqo/WtD4s3Lfy5Cn3WNa30+K6pR/yzes+yrS0We9y0a13XSz646q7n/c8qnxNIu9J DGOd5MpTA/7dq5uuoobl3fmAyynmJJJPDOUDc3zYrLF5+3Aqm9UbYVg/RvLYJtf6Pp3C It5ka4RM+MBnZaYhd59jFOxp0S9kqrb9LBZkyRKBZ4SVFskZzwzfec8siHseNcizDzak KXz9XTcRqtg1Ld1inGVYcx/TktbijDC8KTQqE3ArGmy6A8zcmTw+FCZEWI/q9+mRQpcz L9qg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e20si1726679ejl.225.2019.09.20.14.27.51; Fri, 20 Sep 2019 14:28:14 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2394534AbfITA7q (ORCPT + 99 others); Thu, 19 Sep 2019 20:59:46 -0400 Received: from mx1.redhat.com ([209.132.183.28]:43612 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2389293AbfITA7q (ORCPT ); Thu, 19 Sep 2019 20:59:46 -0400 Received: from smtp.corp.redhat.com (int-mx04.intmail.prod.int.phx2.redhat.com [10.5.11.14]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id 2C55B81DE7; Fri, 20 Sep 2019 00:59:46 +0000 (UTC) Received: from [10.72.12.88] (ovpn-12-88.pek2.redhat.com [10.72.12.88]) by smtp.corp.redhat.com (Postfix) with ESMTP id 4F6D75D9CD; Fri, 20 Sep 2019 00:59:34 +0000 (UTC) Subject: Re: [RFC v4 0/3] vhost: introduce mdev based hardware backend To: Tiwei Bie Cc: "Michael S. Tsirkin" , alex.williamson@redhat.com, maxime.coquelin@redhat.com, linux-kernel@vger.kernel.org, kvm@vger.kernel.org, virtualization@lists.linux-foundation.org, netdev@vger.kernel.org, dan.daly@intel.com, cunming.liang@intel.com, zhihong.wang@intel.com, lingshan.zhu@intel.com References: <20190917010204.30376-1-tiwei.bie@intel.com> <993841ed-942e-c90b-8016-8e7dc76bf13a@redhat.com> <20190917105801.GA24855@___> <20190918102923-mutt-send-email-mst@kernel.org> <20190919154552.GA27657@___> From: Jason Wang Message-ID: <11bc30a9-1cf5-4a5f-109a-f307d70c35fa@redhat.com> Date: Fri, 20 Sep 2019 08:59:32 +0800 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0 MIME-Version: 1.0 In-Reply-To: <20190919154552.GA27657@___> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Content-Language: en-US X-Scanned-By: MIMEDefang 2.79 on 10.5.11.14 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.25]); Fri, 20 Sep 2019 00:59:46 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2019/9/19 下午11:45, Tiwei Bie wrote: > On Thu, Sep 19, 2019 at 09:08:11PM +0800, Jason Wang wrote: >> On 2019/9/18 下午10:32, Michael S. Tsirkin wrote: >>>>>> So I have some questions: >>>>>> >>>>>> 1) Compared to method 2, what's the advantage of creating a new vhost char >>>>>> device? I guess it's for keep the API compatibility? >>>>> One benefit is that we can avoid doing vhost ioctls on >>>>> VFIO device fd. >>>> Yes, but any benefit from doing this? >>> It does seem a bit more modular, but it's certainly not a big deal. >> Ok, if we go this way, it could be as simple as provide some callback to >> vhost, then vhost can just forward the ioctl through parent_ops. >> >>>>>> 2) For method 2, is there any easy way for user/admin to distinguish e.g >>>>>> ordinary vfio-mdev for vhost from ordinary vfio-mdev? >>>>> I think device-api could be a choice. >>>> Ok. >>>> >>>> >>>>>> I saw you introduce >>>>>> ops matching helper but it's not friendly to management. >>>>> The ops matching helper is just to check whether a given >>>>> vfio-device is based on a mdev device. >>>>> >>>>>> 3) A drawback of 1) and 2) is that it must follow vfio_device_ops that >>>>>> assumes the parameter comes from userspace, it prevents support kernel >>>>>> virtio drivers. >>>>>> >>>>>> 4) So comes the idea of method 3, since it register a new vhost-mdev driver, >>>>>> we can use device specific ops instead of VFIO ones, then we can have a >>>>>> common API between vDPA parent and vhost-mdev/virtio-mdev drivers. >>>>> As the above draft shows, this requires introducing a new >>>>> VFIO device driver. I think Alex's opinion matters here. >> Just to clarify, a new type of mdev driver but provides dummy >> vfio_device_ops for VFIO to make container DMA ioctl work. > I see. Thanks! IIUC, you mean we can provide a very tiny > VFIO device driver in drivers/vhost/mdev.c, e.g.: > > static int vfio_vhost_mdev_open(void *device_data) > { > if (!try_module_get(THIS_MODULE)) > return -ENODEV; > return 0; > } > > static void vfio_vhost_mdev_release(void *device_data) > { > module_put(THIS_MODULE); > } > > static const struct vfio_device_ops vfio_vhost_mdev_dev_ops = { > .name = "vfio-vhost-mdev", > .open = vfio_vhost_mdev_open, > .release = vfio_vhost_mdev_release, > }; > > static int vhost_mdev_probe(struct device *dev) > { > struct mdev_device *mdev = to_mdev_device(dev); > > ... Check the mdev device_id proposed in ... > ... https://lkml.org/lkml/2019/9/12/151 ... > > return vfio_add_group_dev(dev, &vfio_vhost_mdev_dev_ops, mdev); > } > > static void vhost_mdev_remove(struct device *dev) > { > vfio_del_group_dev(dev); > } > > static struct mdev_driver vhost_mdev_driver = { > .name = "vhost_mdev", > .probe = vhost_mdev_probe, > .remove = vhost_mdev_remove, > }; > > So we can bind above mdev driver to the virtio-mdev compatible > mdev devices when we want to use vhost-mdev. > > After binding above driver to the mdev device, we can setup IOMMU > via VFIO and get VFIO device fd of this mdev device, and pass it > to vhost fd (/dev/vhost-mdev) with a SET_BACKEND ioctl. > > Thanks, > Tiwei Yes, something like this. Thanks >> Thanks >> >> >>>> Yes, it is. >>>> >>>> Thanks >>>> >>>>