Received: by 2002:ac0:8c8e:0:0:0:0:0 with SMTP id r14csp904497ima; Wed, 6 Feb 2019 10:12:15 -0800 (PST) X-Google-Smtp-Source: AHgI3Ibz6FDHTbMaYApDePrZev7ZSJjDKQEts4KSFjR0oI43VEPNBlVBbIiFTc068ypI6GznHpN5 X-Received: by 2002:a62:b511:: with SMTP id y17mr11853321pfe.199.1549476735892; Wed, 06 Feb 2019 10:12:15 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1549476735; cv=none; d=google.com; s=arc-20160816; b=W/Tjn+M8A5QoIrf70FsLXoyN2ZmwIMPuCCqDkECHA3XnW/dfyk93vKh+KMKeymyR2w tFODfQINQINc0X8IgUbc4sONr+7Np4VZ4trljB3DuPT8PXUPD5+8qiX4AT+ktg5MWchq 8e4PgoGYBO+DJvt0Rs5NQZ8i17BhQ/FG7ookoX0i142dwzyTk+vfatc5MNhhJcUMm/tv hSmmOLlsUPTwtEuXsSviwgIFjpppcKouXHv9fbUayRYIjmoMjSLh4NK3Kaknn8l9YNB/ gaq+86SS1Pm7MqtseJ5OV4Yuh0i3R5zEVVV2B7nhS1lgDfe19pgWATb3RNTCQZuSRGE/ l5JQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date; bh=XuzqngKu8KUPfIRV5jmn8hSEcP8BvxXaGvPKOwbXGeA=; b=nzlrwx2+dTdZaNo798CpzFwnoM1zKFFWtrX53WtQXDgOfBhI/FYE0hk5FLYHmjTRRG ngTwU4bSjjpbW81XobfQtdrSiLzY77ypOU9H3Jfn02jfPqnLjFQ3SU9W4o/06Xxf2Vyi GoLws6EH/HDVSBdTNezPRvGGjKIMZ5qD2zOrmdc4HcOjD+lyU2Nc/Hp2RrM0DNAXWie3 gwS3nQYnljkRFZFM724VUoKZ/MSyOe7gAHZ1cxA1uaU7ctSv5bTbGPx7Kxp9PGduD1xX nZDHeFtoo68xzRK+SEFKB3ZL/UpxfEHqLWAYmMHJee6fHf+UxQDJFAhVbDyLXtQdZCUW PzcA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id d1si6756348pla.412.2019.02.06.10.11.59; Wed, 06 Feb 2019 10:12:15 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729107AbfBFSDq (ORCPT + 99 others); Wed, 6 Feb 2019 13:03:46 -0500 Received: from mx1.redhat.com ([209.132.183.28]:51928 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726480AbfBFSDp (ORCPT ); Wed, 6 Feb 2019 13:03:45 -0500 Received: from smtp.corp.redhat.com (int-mx04.intmail.prod.int.phx2.redhat.com [10.5.11.14]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id B4D2D80F6D; Wed, 6 Feb 2019 18:03:44 +0000 (UTC) Received: from redhat.com (ovpn-122-237.rdu2.redhat.com [10.10.122.237]) by smtp.corp.redhat.com (Postfix) with SMTP id 1217F67619; Wed, 6 Feb 2019 18:03:22 +0000 (UTC) Date: Wed, 6 Feb 2019 13:03:21 -0500 From: "Michael S. Tsirkin" To: Nitesh Narayan Lal Cc: kvm@vger.kernel.org, linux-kernel@vger.kernel.org, pbonzini@redhat.com, lcapitulino@redhat.com, pagupta@redhat.com, wei.w.wang@intel.com, yang.zhang.wz@gmail.com, riel@surriel.com, david@redhat.com, dodgen@google.com, konrad.wilk@oracle.com, dhildenb@redhat.com, aarcange@redhat.com Subject: Re: [RFC][Patch v8 5/7] virtio: Enables to add a single descriptor to the host Message-ID: <20190206130201-mutt-send-email-mst@kernel.org> References: <20190204201854.2328-1-nitesh@redhat.com> <20190204201854.2328-6-nitesh@redhat.com> <20190205154545-mutt-send-email-mst@kernel.org> <26a36489-2289-f970-3362-60547b268a76@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <26a36489-2289-f970-3362-60547b268a76@redhat.com> X-Scanned-By: MIMEDefang 2.79 on 10.5.11.14 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.27]); Wed, 06 Feb 2019 18:03:44 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Feb 06, 2019 at 07:56:37AM -0500, Nitesh Narayan Lal wrote: > > On 2/5/19 3:49 PM, Michael S. Tsirkin wrote: > > On Mon, Feb 04, 2019 at 03:18:52PM -0500, Nitesh Narayan Lal wrote: > >> This patch enables the caller to expose a single buffers to the > >> other end using vring descriptor. It also allows the caller to > >> perform this action in synchornous manner by using virtqueue_kick_sync. > >> > >> Signed-off-by: Nitesh Narayan Lal > > I am not sure why do we need this API. Polling in guest > > until host runs isn't great either since these > > might be running on the same host CPU. > True. > > However, my understanding is that the existing API such as > virtqueue_add_outbuf() requires an allocation which will be problematic > for my implementation. Not with a single s/g entry, no. > Although I am not blocking the allocation path during normal Linux > kernel usage as even if one of the zone is locked the other zone could > be used to get free pages. I am a bit confused about locking, I was under the impression that you are not calling virtio under a zone lock. FYI doing that was nacked by Linus. > But during the initial boot time (device initialization), in certain > situations the allocation can only come from a single zone, acquiring a > lock on it may result in a deadlock situation. > > > > > > > > >> --- > >> drivers/virtio/virtio_ring.c | 72 ++++++++++++++++++++++++++++++++++++ > >> include/linux/virtio.h | 4 ++ > >> 2 files changed, 76 insertions(+) > >> > >> diff --git a/drivers/virtio/virtio_ring.c b/drivers/virtio/virtio_ring.c > >> index cd7e755484e3..93c161ac6a28 100644 > >> --- a/drivers/virtio/virtio_ring.c > >> +++ b/drivers/virtio/virtio_ring.c > >> @@ -1695,6 +1695,52 @@ static inline int virtqueue_add(struct virtqueue *_vq, > >> out_sgs, in_sgs, data, ctx, gfp); > >> } > >> > >> +/** > >> + * virtqueue_add_desc - add a buffer to a chain using a vring desc > >> + * @vq: the struct virtqueue we're talking about. > >> + * @addr: address of the buffer to add. > >> + * @len: length of the buffer. > >> + * @in: set if the buffer is for the device to write. > >> + * > >> + * Returns zero or a negative error (ie. ENOSPC, ENOMEM, EIO). > >> + */ > >> +int virtqueue_add_desc(struct virtqueue *_vq, u64 addr, u32 len, int in) > >> +{ > >> + struct vring_virtqueue *vq = to_vvq(_vq); > >> + struct vring_desc *desc = vq->split.vring.desc; > >> + u16 flags = in ? VRING_DESC_F_WRITE : 0; > >> + unsigned int i; > >> + void *data = (void *)addr; > >> + int avail_idx; > >> + > >> + /* Sanity check */ > >> + if (!_vq) > >> + return -EINVAL; > >> + > >> + START_USE(vq); > >> + if (unlikely(vq->broken)) { > >> + END_USE(vq); > >> + return -EIO; > >> + } > >> + > >> + i = vq->free_head; > >> + flags &= ~VRING_DESC_F_NEXT; > >> + desc[i].flags = cpu_to_virtio16(_vq->vdev, flags); > >> + desc[i].addr = cpu_to_virtio64(_vq->vdev, addr); > >> + desc[i].len = cpu_to_virtio32(_vq->vdev, len); > >> + > >> + vq->vq.num_free--; > >> + vq->free_head = virtio16_to_cpu(_vq->vdev, desc[i].next); > >> + vq->split.desc_state[i].data = data; > >> + vq->split.avail_idx_shadow = 1; > >> + avail_idx = vq->split.avail_idx_shadow; > >> + vq->split.vring.avail->idx = cpu_to_virtio16(_vq->vdev, avail_idx); > >> + vq->num_added = 1; > >> + END_USE(vq); > >> + return 0; > >> +} > >> +EXPORT_SYMBOL_GPL(virtqueue_add_desc); > >> + > >> /** > >> * virtqueue_add_sgs - expose buffers to other end > >> * @vq: the struct virtqueue we're talking about. > >> @@ -1842,6 +1888,32 @@ bool virtqueue_notify(struct virtqueue *_vq) > >> } > >> EXPORT_SYMBOL_GPL(virtqueue_notify); > >> > >> +/** > >> + * virtqueue_kick_sync - update after add_buf and busy wait till update is done > >> + * @vq: the struct virtqueue > >> + * > >> + * After one or more virtqueue_add_* calls, invoke this to kick > >> + * the other side. Busy wait till the other side is done with the update. > >> + * > >> + * Caller must ensure we don't call this with other virtqueue > >> + * operations at the same time (except where noted). > >> + * > >> + * Returns false if kick failed, otherwise true. > >> + */ > >> +bool virtqueue_kick_sync(struct virtqueue *vq) > >> +{ > >> + u32 len; > >> + > >> + if (likely(virtqueue_kick(vq))) { > >> + while (!virtqueue_get_buf(vq, &len) && > >> + !virtqueue_is_broken(vq)) > >> + cpu_relax(); > >> + return true; > >> + } > >> + return false; > >> +} > >> +EXPORT_SYMBOL_GPL(virtqueue_kick_sync); > >> + > >> /** > >> * virtqueue_kick - update after add_buf > >> * @vq: the struct virtqueue > >> diff --git a/include/linux/virtio.h b/include/linux/virtio.h > >> index fa1b5da2804e..58943a3a0e8d 100644 > >> --- a/include/linux/virtio.h > >> +++ b/include/linux/virtio.h > >> @@ -57,6 +57,10 @@ int virtqueue_add_sgs(struct virtqueue *vq, > >> unsigned int in_sgs, > >> void *data, > >> gfp_t gfp); > >> +/* A desc with this init id is treated as an invalid desc */ > >> +int virtqueue_add_desc(struct virtqueue *_vq, u64 addr, u32 len, int in); > >> + > >> +bool virtqueue_kick_sync(struct virtqueue *vq); > >> > >> bool virtqueue_kick(struct virtqueue *vq); > >> > >> -- > >> 2.17.2 > -- > Regards > Nitesh >