Date: Fri, 8 Oct 2021 15:51:56 +0200
From: Halil Pasic
To: "Michael S. Tsirkin"
Cc: Jason Wang, Xie Yongji, virtualization@lists.linux-foundation.org,
 linux-kernel@vger.kernel.org, stable@vger.kernel.org, markver@us.ibm.com,
 Cornelia Huck, Christian Borntraeger, linux-s390@vger.kernel.org,
 stefanha@redhat.com, Raphael Norwitz, qemu-devel@nongnu.org, Halil Pasic
Subject: Re: [PATCH v2 1/1] virtio: write back F_VERSION_1 before validate
Message-ID: <20211008155156.626e78b5.pasic@linux.ibm.com>
In-Reply-To: <20211008085839-mutt-send-email-mst@kernel.org>
References: <20211008123422.1415577-1-pasic@linux.ibm.com>
 <20211008085839-mutt-send-email-mst@kernel.org>
Organization: IBM
X-Mailing-List: linux-kernel@vger.kernel.org

On Fri, 8 Oct 2021 09:05:03 -0400
"Michael S. Tsirkin" wrote:

> On Fri, Oct 08, 2021 at 02:34:22PM +0200, Halil Pasic wrote:
> > The virtio specification virtio-v1.1-cs01 states: "Transitional devices
> > MUST detect Legacy drivers by detecting that VIRTIO_F_VERSION_1 has not
> > been acknowledged by the driver." This is exactly what QEMU as of 6.1
> > has done relying solely on VIRTIO_F_VERSION_1 for detecting that.
> >
> > However, the specification also says: "... the driver MAY read (but MUST
> > NOT write) the device-specific configuration fields to check that it can
> > support the device ..." before setting FEATURES_OK.
> >
> > In that case, any transitional device relying solely on
> > VIRTIO_F_VERSION_1 for detecting legacy drivers will return data in
> > legacy format. In particular, this implies that it is in big endian
> > format for big endian guests. This naturally confuses the driver which
> > expects little endian in the modern mode.
> >
> > It is probably a good idea to amend the spec to clarify that
> > VIRTIO_F_VERSION_1 can only be relied on after the feature negotiation
> > is complete. However, we already have a regression so let's try to address
>
> actually, regressions. and we can add
> "since originally before validate callback existed
> config space was only read after
> FEATURES_OK. See Fixes tags for relevant commits"
>
> > it.

How about replacing the paragraph above with the following?

"It is probably a good idea to amend the spec to clarify that
VIRTIO_F_VERSION_1 can only be relied on after the feature negotiation
is complete. Before validate callback existed, config space was only
read after FEATURES_OK. However, we already have two regressions, so
let's address this here as well."

> >
> > The regressions affect the VIRTIO_NET_F_MTU feature of virtio-net and
> > the VIRTIO_BLK_F_BLK_SIZE feature of virtio-blk for BE guests when
> > virtio 1.0 is used on both sides. The latter renders virtio-blk
> > unusable with DASD backing, because things simply don't work with
> > the default.

and add "See Fixes tags for relevant commits." here.

>
> Let's add a work around description now:
>
>
> For QEMU, we can work around the issue by writing out the features
> register with VIRTIO_F_VERSION_1 bit set. We (ab) use the

s/features register/feature bits/

Rationale: ccw does not have a features register, and qemu does not
really act as if its behavior was controlled by the values in a features
register. I.e.
when we read the register we see VIRTIO_F_VERSION_1 because the feature
is offered. In QEMU we basically read host_features but write the
guest_features. And what drives device behavior is mostly
guest_features.

s/(ab) use/(ab)use/

> finalize_features config op for this. It's not enough to address vhost

s/It's/This is/

> user and vhost block devices since these do not get the features until

s/vhost user and vhost block/some vhost-user and vhost-vdpa/ ?

Rationale: I think vhost block is just a vhost-user device. On the other
hand vhost-user-fs works like a charm because the config space is
implemented in qemu and not in the vhost-user device. I didn't check
vhost_net. I'm not even sure qemu offers a vhost_net implementation.
Anyway, I wouldn't like to make any false statements here.

> FEATURES_OK, however it looks like these two actually never handled the
> endian-ness for legacy mode correctly, so at least that's not a
> regression.
>
> No devices except virtio net and virtio blk seem to be affected.
>
> Long term the right thing to do is to fix the hypervisors.
>

Sounds good. Thanks! Are you OK with my changes proposed to your
changes?

Regards,
Halil

>
> >
> > Cc: #v4.11
> > Signed-off-by: Halil Pasic
> > Fixes: 82e89ea077b9 ("virtio-blk: Add validation for block size in config space")
> > Fixes: fe36cbe0671e ("virtio_net: clear MTU when out of range")
> > Reported-by: markver@us.ibm.com
> > ---
> >  drivers/virtio/virtio.c | 11 +++++++++++
> >  1 file changed, 11 insertions(+)
> >
> > diff --git a/drivers/virtio/virtio.c b/drivers/virtio/virtio.c
> > index 0a5b54034d4b..236081afe9a2 100644
> > --- a/drivers/virtio/virtio.c
> > +++ b/drivers/virtio/virtio.c
> > @@ -239,6 +239,17 @@ static int virtio_dev_probe(struct device *_d)
> >  		driver_features_legacy = driver_features;
> >  	}
> >
> > +	/*
> > +	 * Some devices detect legacy solely via F_VERSION_1. Write
> > +	 * F_VERSION_1 to force LE config space accesses before FEATURES_OK for
> > +	 * these when needed.
> > +	 */
> > +	if (drv->validate && !virtio_legacy_is_little_endian()
> > +			  && device_features & BIT_ULL(VIRTIO_F_VERSION_1)) {
> > +		dev->features = BIT_ULL(VIRTIO_F_VERSION_1);
> > +		dev->config->finalize_features(dev);
> > +	}
> > +
> >  	if (device_features & (1ULL << VIRTIO_F_VERSION_1))
> >  		dev->features = driver_features & device_features;
> >  	else
> >
> > base-commit: 60a9483534ed0d99090a2ee1d4bb0b8179195f51
> > --
> > 2.25.1
>
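
For readers who want to see the byte-order problem in isolation, here is
a rough user-space sketch. It is not kernel or QEMU code: struct
toy_device and all toy_* helpers are hypothetical names made up for this
illustration, and the device model is an assumption (a transitional
device that picks the config space format purely from whether the driver
has written VIRTIO_F_VERSION_1 so far). Only the feature bit number 32
comes from the virtio spec.

```c
#include <stdbool.h>
#include <stdint.h>

#define VIRTIO_F_VERSION_1 32 /* feature bit number from the virtio spec */

/* Hypothetical toy model of a transitional device (not QEMU code). */
struct toy_device {
	uint64_t guest_features;  /* feature bits the driver has written so far */
	bool guest_is_big_endian; /* native byte order of the guest */
	uint16_t mtu;             /* one device-specific config field */
};

/*
 * Present the 16-bit mtu field the way this toy device would: legacy
 * (guest-native) byte order unless F_VERSION_1 was already written,
 * little endian otherwise.
 */
static void toy_config_read_mtu(const struct toy_device *dev, uint8_t out[2])
{
	bool modern = (dev->guest_features & (1ULL << VIRTIO_F_VERSION_1)) != 0;
	bool big_endian = !modern && dev->guest_is_big_endian;

	if (big_endian) {
		out[0] = (uint8_t)(dev->mtu >> 8);
		out[1] = (uint8_t)(dev->mtu & 0xff);
	} else {
		out[0] = (uint8_t)(dev->mtu & 0xff);
		out[1] = (uint8_t)(dev->mtu >> 8);
	}
}

/* A modern driver always decodes config fields as little endian. */
static uint16_t toy_driver_decode_le16(const uint8_t in[2])
{
	return (uint16_t)(in[0] | (in[1] << 8));
}

/*
 * Read mtu before FEATURES_OK, as a validate callback would. With
 * write_back_version_1 set, the driver first writes F_VERSION_1, which
 * mirrors the idea of the workaround discussed in this thread.
 */
static uint16_t toy_probe_mtu(struct toy_device *dev, bool write_back_version_1)
{
	uint8_t raw[2];

	if (write_back_version_1)
		dev->guest_features = 1ULL << VIRTIO_F_VERSION_1;
	toy_config_read_mtu(dev, raw);
	return toy_driver_decode_le16(raw);
}
```

In this toy model, a big endian guest with mtu = 1500 (0x05DC) that
probes without the write-back decodes 0xDC05, while the same probe with
the write-back decodes 1500. A little endian guest sees 1500 either way,
which matches the patch only doing the extra write when
virtio_legacy_is_little_endian() is false.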