Received: by 2002:a25:824b:0:0:0:0:0 with SMTP id d11csp5555644ybn; Sun, 29 Sep 2019 00:40:59 -0700 (PDT) X-Google-Smtp-Source: APXvYqxD3yBmcNdb4oU21RjSnF6n9iA1MBXTydjrYIxHB9C6hO5VE+C/dmwyW/EKfX5J67ht+3C4 X-Received: by 2002:a17:906:f145:: with SMTP id gw5mr1983909ejb.68.1569742858922; Sun, 29 Sep 2019 00:40:58 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1569742858; cv=none; d=google.com; s=arc-20160816; b=Hip1VcLANm1KFCkrYyzkFQsl+zf2c4cSX2iZgezGlqPefMr/3Rf82yVEf5Sk0qyDV/ gUhvzZzUhVDOq4n6vVbSLxkbTJF2I3ERKZDJanT8NbkG08B3dI7QjDKgkwiAqWe0/Zlg GQ6Szv2/UiituUMCrexdR28TaMgU2GfQRINlcWu96ryzbvWjEy5f4t5a626v+xY45RcS g97QjD1VRYcWKLkEtI6RsdpmRFJ94DB9xIC6QHbzO6SiOHN+LzBe+hqI3oCpk/Vio+TC y5xq3yNHSJiQXEErP6LYDWIjEFuG1HdRZ9hmF+ui05aPA/7ls9CjPaArKbWk2KX6pJYT UXqQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=UfuYrcavWaNx5z2xiawASuLduyTNfams6QrVHHeoL48=; b=ZfJr6J1NyNhuPMuMSct+TX3hNBRCZbqxZbuAaWziIQhWaYOGCoLiHIkKl1I1iuzgMO t4n3w4StL1WdU8taJH3mbOrb3bOGd78Xt6pLuT8HSEo8k0mw6EDLJh87cnhfTImxRJ/4 Up4Q8KDRoW7wxnZ+l7bPXHfC+gsfwFHiHQesbjMczDT5NpMjr+NRvyIkwWaz1c+ATq7P 48WV6LMdL5sHKm8TgqfltAnMwGbJHkySjvu8brJWAhxdx1bO/qIpbkPxaHqwyPGC9mbk W7PfkSo7UUio+6P1b8KLz154ItHggwti0k8LeKAgSjekfNX+FO2Qq5TFFAVwcykefgx/ 3XDw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id a94si4659524edf.110.2019.09.29.00.40.22; Sun, 29 Sep 2019 00:40:58 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728696AbfI2HjV (ORCPT + 99 others); Sun, 29 Sep 2019 03:39:21 -0400 Received: from mx1.redhat.com ([209.132.183.28]:57164 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725906AbfI2HjV (ORCPT ); Sun, 29 Sep 2019 03:39:21 -0400 Received: from smtp.corp.redhat.com (int-mx01.intmail.prod.int.phx2.redhat.com [10.5.11.11]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id 812D485360; Sun, 29 Sep 2019 07:39:20 +0000 (UTC) Received: from [10.72.12.202] (ovpn-12-202.pek2.redhat.com [10.72.12.202]) by smtp.corp.redhat.com (Postfix) with ESMTP id 3F877600D1; Sun, 29 Sep 2019 07:39:06 +0000 (UTC) Subject: Re: [PATCH] vhost: introduce mdev based hardware backend To: "Michael S. Tsirkin" Cc: Tiwei Bie , alex.williamson@redhat.com, maxime.coquelin@redhat.com, linux-kernel@vger.kernel.org, kvm@vger.kernel.org, virtualization@lists.linux-foundation.org, netdev@vger.kernel.org, dan.daly@intel.com, cunming.liang@intel.com, zhihong.wang@intel.com, lingshan.zhu@intel.com References: <20190926045427.4973-1-tiwei.bie@intel.com> <20190926042156-mutt-send-email-mst@kernel.org> <20190926131439.GA11652@___> <8ab5a8d9-284d-bba5-803d-08523c0814e1@redhat.com> <20190927053935-mutt-send-email-mst@kernel.org> <20190927084408-mutt-send-email-mst@kernel.org> <20190927092219-mutt-send-email-mst@kernel.org> From: Jason Wang Message-ID: <1aba781c-d776-422d-2629-86b11990a9b5@redhat.com> Date: Sun, 29 Sep 2019 15:39:05 +0800 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0 MIME-Version: 1.0 In-Reply-To: <20190927092219-mutt-send-email-mst@kernel.org> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Content-Language: en-US X-Scanned-By: MIMEDefang 2.79 on 10.5.11.11 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.25]); Sun, 29 Sep 2019 07:39:20 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2019/9/27 下午9:23, Michael S. Tsirkin wrote: > On Fri, Sep 27, 2019 at 09:17:56PM +0800, Jason Wang wrote: >> On 2019/9/27 下午8:46, Michael S. Tsirkin wrote: >>> On Fri, Sep 27, 2019 at 08:17:47PM +0800, Jason Wang wrote: >>>> On 2019/9/27 下午5:41, Michael S. Tsirkin wrote: >>>>> On Fri, Sep 27, 2019 at 11:27:12AM +0800, Jason Wang wrote: >>>>>> On 2019/9/26 下午9:14, Tiwei Bie wrote: >>>>>>> On Thu, Sep 26, 2019 at 04:35:18AM -0400, Michael S. Tsirkin wrote: >>>>>>>> On Thu, Sep 26, 2019 at 12:54:27PM +0800, Tiwei Bie wrote: >>>>>>> [...] >>>>>>>>> diff --git a/include/uapi/linux/vhost.h b/include/uapi/linux/vhost.h >>>>>>>>> index 40d028eed645..5afbc2f08fa3 100644 >>>>>>>>> --- a/include/uapi/linux/vhost.h >>>>>>>>> +++ b/include/uapi/linux/vhost.h >>>>>>>>> @@ -116,4 +116,12 @@ >>>>>>>>> #define VHOST_VSOCK_SET_GUEST_CID _IOW(VHOST_VIRTIO, 0x60, __u64) >>>>>>>>> #define VHOST_VSOCK_SET_RUNNING _IOW(VHOST_VIRTIO, 0x61, int) >>>>>>>>> +/* VHOST_MDEV specific defines */ >>>>>>>>> + >>>>>>>>> +#define VHOST_MDEV_SET_STATE _IOW(VHOST_VIRTIO, 0x70, __u64) >>>>>>>>> + >>>>>>>>> +#define VHOST_MDEV_S_STOPPED 0 >>>>>>>>> +#define VHOST_MDEV_S_RUNNING 1 >>>>>>>>> +#define VHOST_MDEV_S_MAX 2 >>>>>>>>> + >>>>>>>>> #endif >>>>>>>> So assuming we have an underlying device that behaves like virtio: >>>>>>> I think they are really good questions/suggestions. Thanks! >>>>>>> >>>>>>>> 1. Should we use SET_STATUS maybe? >>>>>>> I like this idea. I will give it a try. >>>>>>> >>>>>>>> 2. Do we want a reset ioctl? >>>>>>> I think it is helpful. If we use SET_STATUS, maybe we >>>>>>> can use it to support the reset. >>>>>>> >>>>>>>> 3. Do we want ability to enable rings individually? >>>>>>> I will make it possible at least in the vhost layer. >>>>>> Note the API support e.g set_vq_ready(). >>>>> virtio spec calls this "enabled" so let's stick to that. >>>> Ok. >>>> >>>> >>>>>>>> 4. Does device need to limit max ring size? >>>>>>>> 5. Does device need to limit max number of queues? >>>>>>> I think so. It's helpful to have ioctls to report the max >>>>>>> ring size and max number of queues. >>>>>> An issue is the max number of queues is done through a device specific way, >>>>>> usually device configuration space. This is supported by the transport API, >>>>>> but how to expose it to userspace may need more thought. >>>>>> >>>>>> Thanks >>>>> an ioctl for device config? But for v1 I'd be quite happy to just have >>>>> a minimal working device with 2 queues. >>>> I'm fully agree, and it will work as long as VIRTIO_NET_F_MQ and >>>> VIRTIO_NET_F_CTRL_VQ is not advertised by the mdev device. >>>> >>>> Thanks >>> Hmm this means we need to validate the features bits, >>> not just pass them through to the hardware. >>> Problem is, how do we add more feature bits later, >>> without testing all hardware? >>> I guess this means the device specific driver must do it. >>> >> That looks not good, maybe a virtio device id based features blacklist in >> vhost-mdev. Then MQ and CTRL_VQ could be filtered out by vhost-mdev. >> >> Thanks > Two implementations of e.g. virtio net can have different > features whitelisted. I meant for kernel driver, we won't blacklist any feature, but for vhost-mdev, we need to do that. > So I think there's no way but let > the driver do it. We should probably provide a standard place > in the ops for driver to supply the whitelist, to make sure > drivers don't forget. For 'driver' do you mean userspace driver of  the mdev device? Thanks >