Received: by 2002:ac0:8c8e:0:0:0:0:0 with SMTP id r14csp911825ima; Wed, 6 Feb 2019 10:19:49 -0800 (PST) X-Google-Smtp-Source: AHgI3IapfTP8F4xL0Mlcfq0D7VseRQChscE2oEBwEI3xmq1ZSNwG+CLu1GkUHR2S2chS8FMvMaQA X-Received: by 2002:a17:902:33c1:: with SMTP id b59mr11908730plc.220.1549477189684; Wed, 06 Feb 2019 10:19:49 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1549477189; cv=none; d=google.com; s=arc-20160816; b=AtCZR3gKnxTT65I+y2A/dK+md1v8ok6Qo1jRkm6GpMOzpcQMR4LeBu5eegUVul36ns 1MsNpLES/kg0WsGaYbiIIILZgnkrFGlgdYX6bh/TWhlrsMNaA6JwmK7nAhaX9D2Wg5sP Dh7YvXWCaxlwN+RIW3pGGNWJ1eYgfZVbMZxz/irf84qCEdWiEaks+RLjRx8chzzCsaEv 3cW28qYkb7DnvfO3TP/OeRzdLzduZWsCb7pkVGcJIcQmBR3DX+LmxoVCuuQ/b+EKXtse 1apeNxDig30l7XR+4w9h0lZqNMMRz3U2Z2cYLG85bZ1sWuwN6DljeBObhXupFCBCiBI5 Qszw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:mime-version:user-agent:date :message-id:organization:references:subject:cc:to:from; bh=kWRqiOBfpH1JzjhDG/mBexEA6kqiZaHfh+noyYNUTik=; b=CSlb75UmkFJFI6Bd55HXP4FGrVFCAhHfLRAB15bx0pMVRzsLuWTU4100eg6w/B45vZ HBXRvgEHQj+osapNH3TWwvkn7ekQrE0dGxirec6LRafu+gLVRXAkF6KDda2Vk/jdPlhm LZjIEbLDJ2lLRchKFlNwEqawU5bHiJCQvv3xO1nJJ5y/ytlVNxbcQc6YthLoZ7jvSIGL nA+SPwtmI6WlatyV9Tc7H4PyzltQdTQ6e63mRkT9Gi43uIO46r4Uvc4eSmWImNsy1+/M 2hDpfetqfoBPeMdioFQ9R7m2qaaXWkSrp5QOzORAsiv27Y8Yvrt2fBouEVCey28Z6PAW LB0g== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id n21si6106600pgm.456.2019.02.06.10.19.33; Wed, 06 Feb 2019 10:19:49 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728599AbfBFSTV (ORCPT + 99 others); Wed, 6 Feb 2019 13:19:21 -0500 Received: from mx1.redhat.com ([209.132.183.28]:48714 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726270AbfBFSTV (ORCPT ); Wed, 6 Feb 2019 13:19:21 -0500 Received: from smtp.corp.redhat.com (int-mx07.intmail.prod.int.phx2.redhat.com [10.5.11.22]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id 76E012D4B6E; Wed, 6 Feb 2019 18:19:20 +0000 (UTC) Received: from [10.18.17.32] (dhcp-17-32.bos.redhat.com [10.18.17.32]) by smtp.corp.redhat.com (Postfix) with ESMTPS id 047A01048117; Wed, 6 Feb 2019 18:19:10 +0000 (UTC) From: Nitesh Narayan Lal To: "Michael S. Tsirkin" Cc: kvm@vger.kernel.org, linux-kernel@vger.kernel.org, pbonzini@redhat.com, lcapitulino@redhat.com, pagupta@redhat.com, wei.w.wang@intel.com, yang.zhang.wz@gmail.com, riel@surriel.com, david@redhat.com, dodgen@google.com, konrad.wilk@oracle.com, dhildenb@redhat.com, aarcange@redhat.com Subject: Re: [RFC][Patch v8 5/7] virtio: Enables to add a single descriptor to the host References: <20190204201854.2328-1-nitesh@redhat.com> <20190204201854.2328-6-nitesh@redhat.com> <20190205154545-mutt-send-email-mst@kernel.org> <26a36489-2289-f970-3362-60547b268a76@redhat.com> <20190206130201-mutt-send-email-mst@kernel.org> Organization: Red Hat Inc, Message-ID: <85a799bf-93f1-3b4b-c4a0-db7af2aef292@redhat.com> Date: Wed, 6 Feb 2019 13:19:09 -0500 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.4.0 MIME-Version: 1.0 In-Reply-To: <20190206130201-mutt-send-email-mst@kernel.org> Content-Type: multipart/signed; micalg=pgp-sha256; protocol="application/pgp-signature"; boundary="Q6sZGAJ51pl5TzrWDpkYMI19Gl7KcJ3h8" X-Scanned-By: MIMEDefang 2.84 on 10.5.11.22 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.29]); Wed, 06 Feb 2019 18:19:20 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org This is an OpenPGP/MIME signed message (RFC 4880 and 3156) --Q6sZGAJ51pl5TzrWDpkYMI19Gl7KcJ3h8 Content-Type: multipart/mixed; boundary="O2qf5b9opplM6CBL1D4fEoPN7vjh41AiG"; protected-headers="v1" From: Nitesh Narayan Lal To: "Michael S. Tsirkin" Cc: kvm@vger.kernel.org, linux-kernel@vger.kernel.org, pbonzini@redhat.com, lcapitulino@redhat.com, pagupta@redhat.com, wei.w.wang@intel.com, yang.zhang.wz@gmail.com, riel@surriel.com, david@redhat.com, dodgen@google.com, konrad.wilk@oracle.com, dhildenb@redhat.com, aarcange@redhat.com Message-ID: <85a799bf-93f1-3b4b-c4a0-db7af2aef292@redhat.com> Subject: Re: [RFC][Patch v8 5/7] virtio: Enables to add a single descriptor to the host --O2qf5b9opplM6CBL1D4fEoPN7vjh41AiG Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable On 2/6/19 1:03 PM, Michael S. Tsirkin wrote: > On Wed, Feb 06, 2019 at 07:56:37AM -0500, Nitesh Narayan Lal wrote: >> On 2/5/19 3:49 PM, Michael S. Tsirkin wrote: >>> On Mon, Feb 04, 2019 at 03:18:52PM -0500, Nitesh Narayan Lal wrote: >>>> This patch enables the caller to expose a single buffers to the >>>> other end using vring descriptor. It also allows the caller to >>>> perform this action in synchornous manner by using virtqueue_kick_sy= nc. >>>> >>>> Signed-off-by: Nitesh Narayan Lal >>> I am not sure why do we need this API. Polling in guest >>> until host runs isn't great either since these >>> might be running on the same host CPU. >> True. >> >> However, my understanding is that the existing API such as >> virtqueue_add_outbuf() requires an allocation which will be problemati= c >> for my implementation. > Not with a single s/g entry, no. Didn't know this. I will re-check. > >> Although I am not blocking the allocation path during normal Linux >> kernel usage as even if one of the zone is locked the other zone could= >> be used to get free pages. > > I am a bit confused about locking,=20 My bad, I think I created the confusion. > I was under the impression > that you are not calling virtio under a zone lock. Yeap. Your understanding is correct. I will re-visit this and correct it in the next version. > FYI doing that was nacked by Linus. > > >> But during the initial boot time (device initialization), in certain >> situations the allocation can only come from a single zone, acquiring = a >> lock on it may result in a deadlock situation. >> >>> >>> >>>> --- >>>> drivers/virtio/virtio_ring.c | 72 +++++++++++++++++++++++++++++++++= +++ >>>> include/linux/virtio.h | 4 ++ >>>> 2 files changed, 76 insertions(+) >>>> >>>> diff --git a/drivers/virtio/virtio_ring.c b/drivers/virtio/virtio_ri= ng.c >>>> index cd7e755484e3..93c161ac6a28 100644 >>>> --- a/drivers/virtio/virtio_ring.c >>>> +++ b/drivers/virtio/virtio_ring.c >>>> @@ -1695,6 +1695,52 @@ static inline int virtqueue_add(struct virtqu= eue *_vq, >>>> out_sgs, in_sgs, data, ctx, gfp); >>>> } >>>> =20 >>>> +/** >>>> + * virtqueue_add_desc - add a buffer to a chain using a vring desc >>>> + * @vq: the struct virtqueue we're talking about. >>>> + * @addr: address of the buffer to add. >>>> + * @len: length of the buffer. >>>> + * @in: set if the buffer is for the device to write. >>>> + * >>>> + * Returns zero or a negative error (ie. ENOSPC, ENOMEM, EIO). >>>> + */ >>>> +int virtqueue_add_desc(struct virtqueue *_vq, u64 addr, u32 len, in= t in) >>>> +{ >>>> + struct vring_virtqueue *vq =3D to_vvq(_vq); >>>> + struct vring_desc *desc =3D vq->split.vring.desc; >>>> + u16 flags =3D in ? VRING_DESC_F_WRITE : 0; >>>> + unsigned int i; >>>> + void *data =3D (void *)addr; >>>> + int avail_idx; >>>> + >>>> + /* Sanity check */ >>>> + if (!_vq) >>>> + return -EINVAL; >>>> + >>>> + START_USE(vq); >>>> + if (unlikely(vq->broken)) { >>>> + END_USE(vq); >>>> + return -EIO; >>>> + } >>>> + >>>> + i =3D vq->free_head; >>>> + flags &=3D ~VRING_DESC_F_NEXT; >>>> + desc[i].flags =3D cpu_to_virtio16(_vq->vdev, flags); >>>> + desc[i].addr =3D cpu_to_virtio64(_vq->vdev, addr); >>>> + desc[i].len =3D cpu_to_virtio32(_vq->vdev, len); >>>> + >>>> + vq->vq.num_free--; >>>> + vq->free_head =3D virtio16_to_cpu(_vq->vdev, desc[i].next); >>>> + vq->split.desc_state[i].data =3D data; >>>> + vq->split.avail_idx_shadow =3D 1; >>>> + avail_idx =3D vq->split.avail_idx_shadow; >>>> + vq->split.vring.avail->idx =3D cpu_to_virtio16(_vq->vdev, avail_id= x); >>>> + vq->num_added =3D 1; >>>> + END_USE(vq); >>>> + return 0; >>>> +} >>>> +EXPORT_SYMBOL_GPL(virtqueue_add_desc); >>>> + >>>> /** >>>> * virtqueue_add_sgs - expose buffers to other end >>>> * @vq: the struct virtqueue we're talking about. >>>> @@ -1842,6 +1888,32 @@ bool virtqueue_notify(struct virtqueue *_vq) >>>> } >>>> EXPORT_SYMBOL_GPL(virtqueue_notify); >>>> =20 >>>> +/** >>>> + * virtqueue_kick_sync - update after add_buf and busy wait till up= date is done >>>> + * @vq: the struct virtqueue >>>> + * >>>> + * After one or more virtqueue_add_* calls, invoke this to kick >>>> + * the other side. Busy wait till the other side is done with the u= pdate. >>>> + * >>>> + * Caller must ensure we don't call this with other virtqueue >>>> + * operations at the same time (except where noted). >>>> + * >>>> + * Returns false if kick failed, otherwise true. >>>> + */ >>>> +bool virtqueue_kick_sync(struct virtqueue *vq) >>>> +{ >>>> + u32 len; >>>> + >>>> + if (likely(virtqueue_kick(vq))) { >>>> + while (!virtqueue_get_buf(vq, &len) && >>>> + !virtqueue_is_broken(vq)) >>>> + cpu_relax(); >>>> + return true; >>>> + } >>>> + return false; >>>> +} >>>> +EXPORT_SYMBOL_GPL(virtqueue_kick_sync); >>>> + >>>> /** >>>> * virtqueue_kick - update after add_buf >>>> * @vq: the struct virtqueue >>>> diff --git a/include/linux/virtio.h b/include/linux/virtio.h >>>> index fa1b5da2804e..58943a3a0e8d 100644 >>>> --- a/include/linux/virtio.h >>>> +++ b/include/linux/virtio.h >>>> @@ -57,6 +57,10 @@ int virtqueue_add_sgs(struct virtqueue *vq, >>>> unsigned int in_sgs, >>>> void *data, >>>> gfp_t gfp); >>>> +/* A desc with this init id is treated as an invalid desc */ >>>> +int virtqueue_add_desc(struct virtqueue *_vq, u64 addr, u32 len, in= t in); >>>> + >>>> +bool virtqueue_kick_sync(struct virtqueue *vq); >>>> =20 >>>> bool virtqueue_kick(struct virtqueue *vq); >>>> =20 >>>> --=20 >>>> 2.17.2 >> --=20 >> Regards >> Nitesh >> > > --=20 Regards Nitesh --O2qf5b9opplM6CBL1D4fEoPN7vjh41AiG-- --Q6sZGAJ51pl5TzrWDpkYMI19Gl7KcJ3h8 Content-Type: application/pgp-signature; name="signature.asc" Content-Description: OpenPGP digital signature Content-Disposition: attachment; filename="signature.asc" -----BEGIN PGP SIGNATURE----- iQIzBAEBCAAdFiEEkXcoRVGaqvbHPuAGo4ZA3AYyozkFAlxbJR0ACgkQo4ZA3AYy ozlk/w//QOrzz18+1WBn/AxrF6Zmt/PTMWVeiFVRVs1G9V4n/JTvT4Q/ZoEBtwhs K196jOD5uUtoidZr1xTFiuG9RDcVbY8IEHo+qC6dLEGl6qvtpyEUpach0lwothsF /PmuuDhOCMp3b4uzgkub12dY1i75X4rho3cxBW5Btysik28r6o06SsWb+pf0/S3I GKXGKwlusX5ZGpJAu8992QgnL09DqBfVLWtq2zsOUBd9Ls80+DcpDSQ7ss/G7ZNh bfM38r1jPdNJrMBxSYo1rr2rceg0L0E8qfCjGvVebuJQeQuj02xjWuWBErfKSGEL mvfuXuIx3px3UuDY/dkTIcLvy/hjbNhxJdbM8pej1y/N9gbG+MNeBl1HfXIOkn5y Hv6vpK9Xz1iY9UxXUrc3VN6UdP0q8xM5befm358SRQZbJ4onCTNrfNkWmhECiI0Z z3yyO5JXXRVqpGdAhjYAQePMf18svrDZuV4y/TW0fGx6bN11rBgvSFJ9L8QcyyGH SYDyr56obc+jQ8XIhgwcKvbn84nL+k1k3TO9PPQvCOg6kpzfZauCHm6yi5zwlSuP vMAPboZleCvK0AObrWThvImuUriblesNC1RYSU95DVise0NUy5vlbZOuLOOmOSHP nepsTB2g94XdlMbK+Y7wKsOaHuMjWJn0FZk+OipCHI8a+FqgLiA= =wtiu -----END PGP SIGNATURE----- --Q6sZGAJ51pl5TzrWDpkYMI19Gl7KcJ3h8--