Subject: Re: [RFC PATCH] vdpa: mandate 1.0 device
From: Jason Wang
To: "Michael S. Tsirkin"
Cc: virtualization@lists.linux-foundation.org, linux-kernel@vger.kernel.org,
    parav@nvidia.com, elic@nvidia.com, "Zhu, Lingshan"
Date: Wed, 21 Apr 2021 16:17:08 +0800
Message-ID: <3d5754f3-c012-67ad-5f01-fc16ec53df4e@redhat.com>
In-Reply-To: <20210421035331-mutt-send-email-mst@kernel.org>
References: <20210408082648.20145-1-jasowang@redhat.com>
    <20210408115834-mutt-send-email-mst@kernel.org>
    <20210409115343-mutt-send-email-mst@kernel.org>
    <42891807-cb24-5352-f8cb-798e9d1a1854@redhat.com>
    <20210412050730-mutt-send-email-mst@kernel.org>
    <01918e14-7f7a-abf2-5864-292a32f0233c@redhat.com>
    <20210421035331-mutt-send-email-mst@kernel.org>
X-Mailing-List: linux-kernel@vger.kernel.org

On 2021/4/21 4:03 PM, Michael S. Tsirkin wrote:
> On Wed, Apr 21, 2021 at 03:41:36PM +0800, Jason Wang wrote:
>> On 2021/4/12 5:23 PM, Jason Wang wrote:
>>> On 2021/4/12 5:09 PM, Michael S. Tsirkin wrote:
>>>> On Mon, Apr 12, 2021 at 02:35:07PM +0800, Jason Wang wrote:
>>>>> On 2021/4/10 12:04 AM, Michael S. Tsirkin wrote:
>>>>>> On Fri, Apr 09, 2021 at 12:47:55PM +0800, Jason Wang wrote:
>>>>>>> On 2021/4/8 11:59 PM, Michael S. Tsirkin wrote:
>>>>>>>> On Thu, Apr 08, 2021 at 04:26:48PM +0800, Jason Wang wrote:
>>>>>>>>> This patch mandates 1.0 for vDPA devices.
>>>>>>>>> The goal is to have the semantics of the normative statements
>>>>>>>>> in the virtio spec and eliminate the burden of transitional
>>>>>>>>> devices for both the vDPA bus and the vDPA parent.
>>>>>>>>>
>>>>>>>>> uAPI seems fine since all the vDPA parents mandate
>>>>>>>>> VIRTIO_F_ACCESS_PLATFORM, which implies 1.0 devices.
>>>>>>>>>
>>>>>>>>> For legacy guests, it can still work since QEMU will mediate
>>>>>>>>> when necessary (e.g. doing the endian conversion).
>>>>>>>>>
>>>>>>>>> Signed-off-by: Jason Wang
>>>>>>>> Hmm. If we do this, don't we still have a problem with
>>>>>>>> legacy drivers which don't ack 1.0?
>>>>>>> Yes, but it's not something that is introduced in this
>>>>>>> commit. The legacy driver never worked ...
>>>>>> My point is this neither fixes nor prevents this.
>>>>>>
>>>>>> So my suggestion is to finally add ioctls along the lines
>>>>>> of PROTOCOL_FEATURES of vhost-user.
>>>>>>
>>>>>> Then that one can have bits for legacy LE, legacy BE and modern.
>>>>>>
>>>>>> BTW I looked at vhost-user and it does not look like that
>>>>>> has a solution for this problem either, right?
>>>>> Right.
>>>>>
>>>>>
>>>>>>>> Note 1.0 affects ring endianness, which is not mediated in QEMU,
>>>>>>>> so QEMU can't pretend to the device that the guest is 1.0.
>>>>>>> Right, I plan to send patches to do mediation in QEMU to
>>>>>>> unbreak legacy drivers.
>>>>>>>
>>>>>>> Thanks
>>>>>> I frankly think we'll need PROTOCOL_FEATURES anyway, it's
>>>>>> too useful ...
>>>>>> so why not teach drivers about it and be done with it? You
>>>>>> can't emulate legacy on modern in a cross-endian situation
>>>>>> because of vring endianness ...
>>>>> So the problem is still there. This can only work when the
>>>>> hardware can support legacy vring endianness.
>>>>>
>>>>> Consider:
>>>>>
>>>>> 1) legacy driver support is non-normative in the spec
>>>>> 2) supporting a transitional device in the kernel may require
>>>>>    hardware support and adds a burden of kernel code
>>>>>
>>>>> I'd rather simply drop the legacy driver support
>>>> My point is this patch does not drop legacy support. It merely
>>>> mandates modern support.
>>>
>>> I am not sure I get it here. This patch fails the set_feature if
>>> VERSION_1 is not negotiated. This means:
>>>
>>> 1) vDPA presents a modern device instead of a transitional device
>>> 2) a legacy driver can't be probed
>>>
>>> What am I missing?
>>
>> Hi Michael:
>>
>> Do you agree to find a way to present a modern device? We need a
>> conclusion to make the netlink API work move forward.
>>
>> Thanks
> I think we need a way to support legacy with no data path overhead. qemu
> setting VERSION_1 for a legacy guest affects the ring format so it does
> not really work. This seems to rule out emulating config space entirely
> in userspace.

So I'd rather drop the legacy support in this case. It never worked for
vDPA in the past and virtio-vDPA doesn't even need that. Note that
ACCESS_PLATFORM is mandated for all the vDPA parents right now, which
implies a modern device and LE. I wonder what the value of supporting
legacy is in this case, or do we really want to encourage vendors to ship
cards with legacy support (e.g. endian support in the hardware)?

>
> So I think we should add an ioctl along the lines of
> protocol features. Then I think we can reserve feature bits
> for config space format: legacy LE, legacy BE, modern.

We had VHOST_SET_VRING_ENDIAN, but this will complicate both the vDPA
parent and management. More importantly, legacy behaviour is not
restricted by the spec.

>
> Querying the feature bits will provide us with info about
> what the device supports. Acking them will tell the device
> what the guest needs.

I think this can work, but I wonder how much we can gain from such
complexity.

Thanks

>
>
>>>
>>>>> to have a simple and easy abstraction in the kernel. For a legacy
>>>>> driver in the guest, the hypervisor is in charge of the mediation:
>>>>>
>>>>> 1) config space access endian conversion
>>>>> 2) using a shadow virtqueue to change the endianness in the vring
>>>>>
>>>>> Thanks
>>>> I'd like to avoid shadow virtqueue hacks if at all possible.
>>>> Last I checked performance wasn't much better than just emulating
>>>> virtio in software.
>>>
>>> I think the legacy driver support is just a nice-to-have. Or do you
>>> see any value in that? I guess for Mellanox and Intel, only modern
>>> devices are supported in the hardware.
>>>
>>> Thanks
>>>
>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>> ---
>>>>>>>>>  include/linux/vdpa.h | 6 ++++++
>>>>>>>>>  1 file changed, 6 insertions(+)
>>>>>>>>>
>>>>>>>>> diff --git a/include/linux/vdpa.h b/include/linux/vdpa.h
>>>>>>>>> index 0fefeb976877..cfde4ec999b4 100644
>>>>>>>>> --- a/include/linux/vdpa.h
>>>>>>>>> +++ b/include/linux/vdpa.h
>>>>>>>>> @@ -6,6 +6,7 @@
>>>>>>>>>  #include
>>>>>>>>>  #include
>>>>>>>>>  #include
>>>>>>>>> +#include
>>>>>>>>>
>>>>>>>>>  /**
>>>>>>>>>   * vDPA callback definition.
>>>>>>>>> @@ -317,6 +318,11 @@ static inline int vdpa_set_features(struct vdpa_device *vdev, u64 features)
>>>>>>>>>  {
>>>>>>>>>          const struct vdpa_config_ops *ops = vdev->config;
>>>>>>>>>
>>>>>>>>> +        /* Mandating 1.0 to have semantics of normative statements in
>>>>>>>>> +         * the spec. */
>>>>>>>>> +        if (!(features & BIT_ULL(VIRTIO_F_VERSION_1)))
>>>>>>>>> +                return -EINVAL;
>>>>>>>>> +
>>>>>>>>>          vdev->features_valid = true;
>>>>>>>>>          return ops->set_features(vdev, features);
>>>>>>>>>  }
>>>>>>>>> --
>>>>>>>>> 2.25.1
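
For readers who want to see the effect of the quoted check in isolation, here is a minimal userspace sketch of it. It is not the kernel code: `struct mock_vdpa_device` and `mock_vdpa_set_features()` are hypothetical stand-ins for `struct vdpa_device` and `vdpa_set_features()`, the parent's `set_features` callback is replaced by simply storing the value, and `BIT_ULL` is redefined locally. The only outside fact assumed is that `VIRTIO_F_VERSION_1` is feature bit 32 in the virtio spec.

```c
#include <errno.h>
#include <stdbool.h>
#include <stdint.h>

/* Feature bit number of VIRTIO_F_VERSION_1 in the virtio spec. */
#define VIRTIO_F_VERSION_1 32

/* Local userspace equivalent of the kernel's BIT_ULL() helper. */
#define BIT_ULL(n) (1ULL << (n))

/* Hypothetical stand-in for struct vdpa_device; only what the check needs. */
struct mock_vdpa_device {
	bool features_valid;
	uint64_t features;
};

/*
 * Hypothetical mirror of the patched vdpa_set_features(): reject any
 * feature set that does not include VERSION_1, so a legacy driver that
 * never acks 1.0 cannot complete feature negotiation.
 */
int mock_vdpa_set_features(struct mock_vdpa_device *vdev, uint64_t features)
{
	if (!(features & BIT_ULL(VIRTIO_F_VERSION_1)))
		return -EINVAL;

	vdev->features_valid = true;
	/* Stand-in for ops->set_features(vdev, features) in the real code. */
	vdev->features = features;
	return 0;
}
```

In this sketch, a feature set without bit 32 returns -EINVAL and leaves `features_valid` untouched, which corresponds to the "legacy driver can't be probed" outcome described in the thread; a feature set that includes bit 32 is accepted.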