Received: by 10.192.165.148 with SMTP id m20csp1317050imm; Wed, 2 May 2018 19:09:35 -0700 (PDT) X-Google-Smtp-Source: AB8JxZo3uaooI/gbFbmTmPOqafcEXrvy4usbbHuX1Y9poqcc5e9CdW1a1mpsMci71ZQVHtHEBs2a X-Received: by 2002:a65:5844:: with SMTP id s4-v6mr17980790pgr.232.1525313375325; Wed, 02 May 2018 19:09:35 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1525313375; cv=none; d=google.com; s=arc-20160816; b=KuhMsDwGMvcQd/gwBfrtMu4VYqAgBClpUHlX76RiqX86P3RmIiDXuCKd/Zm8ebnrPO FxxGV16o0YuTdIj9kxT2BDD9GTx9PiKMyKrn8eywcaf9GbKFQ6ZvwxuvnVKzLMa/Kpal 1fIlTdW2k6NvkTko19aPH5U7Kg79LatsaQ1KWRfJz2I6Q8hir02V9OhJ8O7SePDl3aZD ozYDOdwVoHgd5ZsuPBGF4Hdh/1RqhYUfdBOKNKLCl4C788EGA7J3kmDAUVReDoavzZtz Q1ufGGacIeZF2TsjxlScQlV1Tf4mCvPQYTuUzCtNMAvmU0FtVYxmI6pnEXgMqZJHgsTf eA3A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-transfer-encoding:content-disposition:mime-version :references:message-id:subject:cc:to:from:date :arc-authentication-results; bh=JuirfSLnfHH+rWZRDAJ1nGiLYEpsKU/BG708F0jQ4Mk=; b=zUCufmUTtIf9KdlaCTXn/FT1DxfixlyeJ1QIaM58EEIPh5/b++1zr81F6iCmVmTb6Z HCnjf8VRyBQYWbnGafkeU4MR39BRxKGfbnFhIcQjMfuWmPod0fSHXoXcxVU5NccwQE58 m0I3vQqvB2fakx1NKgkv/cCcgmKcHe8kB2SLY/MorInsMNXcQ0TX+eCNvmDLApnmr2kK ykV2vDQsfOt+fqfDOMg8z1EGKrelFVZ/RyRN5BqNulLtwoL2pSxWKG9M1zCttfj33Phs nvfUKDn4k8ABfQeAzqTBNKYA7UJyHk/QzmTI4DNSMSsIWtzOuVTpBkkrQrZV0+wYan9c 4Eig== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id u133-v6si10864416pgc.555.2018.05.02.19.09.20; Wed, 02 May 2018 19:09:35 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751985AbeECCJK (ORCPT + 99 others); Wed, 2 May 2018 22:09:10 -0400 Received: from mga18.intel.com ([134.134.136.126]:3122 "EHLO mga18.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751837AbeECCJH (ORCPT ); Wed, 2 May 2018 22:09:07 -0400 X-Amp-Result: UNSCANNABLE X-Amp-File-Uploaded: False Received: from orsmga002.jf.intel.com ([10.7.209.21]) by orsmga106.jf.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 02 May 2018 19:09:06 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.49,356,1520924400"; d="scan'208";a="55539037" Received: from debian.sh.intel.com (HELO debian) ([10.67.104.164]) by orsmga002.jf.intel.com with ESMTP; 02 May 2018 19:09:04 -0700 Date: Thu, 3 May 2018 10:09:49 +0800 From: Tiwei Bie To: "Michael S. Tsirkin" Cc: Jason Wang , virtualization@lists.linux-foundation.org, linux-kernel@vger.kernel.org, netdev@vger.kernel.org, wexu@redhat.com, jfreimann@redhat.com Subject: Re: [RFC v3 4/5] virtio_ring: add event idx support in packed ring Message-ID: <20180503020949.5u3qz32gsk33z6vk@debian> References: <20180425051550.24342-1-tiwei.bie@intel.com> <20180425051550.24342-5-tiwei.bie@intel.com> <34781052-df9f-e505-cd3f-08e460b34dcc@redhat.com> <20180502072819.mf5l3dypk6dwx2s7@debian> <20180502164828-mutt-send-email-mst@kernel.org> <20180502151255.h3x6rhszxa3euinl@debian> <20180502184015-mutt-send-email-mst@kernel.org> <20180503011116.qvoyblcpklinrk26@debian> <20180503044218-mutt-send-email-mst@kernel.org> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20180503044218-mutt-send-email-mst@kernel.org> User-Agent: NeoMutt/20170113 (1.7.2) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, May 03, 2018 at 04:44:39AM +0300, Michael S. Tsirkin wrote: > On Thu, May 03, 2018 at 09:11:16AM +0800, Tiwei Bie wrote: > > On Wed, May 02, 2018 at 06:42:57PM +0300, Michael S. Tsirkin wrote: > > > On Wed, May 02, 2018 at 11:12:55PM +0800, Tiwei Bie wrote: > > > > On Wed, May 02, 2018 at 04:51:01PM +0300, Michael S. Tsirkin wrote: > > > > > On Wed, May 02, 2018 at 03:28:19PM +0800, Tiwei Bie wrote: > > > > > > On Wed, May 02, 2018 at 10:51:06AM +0800, Jason Wang wrote: > > > > > > > On 2018年04月25日 13:15, Tiwei Bie wrote: > > > > > > > > This commit introduces the event idx support in packed > > > > > > > > ring. This feature is temporarily disabled, because the > > > > > > > > implementation in this patch may not work as expected, > > > > > > > > and some further discussions on the implementation are > > > > > > > > needed, e.g. do we have to check the wrap counter when > > > > > > > > checking whether a kick is needed? > > > > > > > > > > > > > > > > Signed-off-by: Tiwei Bie > > > > > > > > --- > > > > > > > > drivers/virtio/virtio_ring.c | 53 ++++++++++++++++++++++++++++++++++++++++---- > > > > > > > > 1 file changed, 49 insertions(+), 4 deletions(-) > > > > > > > > > > > > > > > > diff --git a/drivers/virtio/virtio_ring.c b/drivers/virtio/virtio_ring.c > > > > > > > > index 0181e93897be..b1039c2985b9 100644 > > > > > > > > --- a/drivers/virtio/virtio_ring.c > > > > > > > > +++ b/drivers/virtio/virtio_ring.c > > > > > > > > @@ -986,7 +986,7 @@ static inline int virtqueue_add_packed(struct virtqueue *_vq, > > > > > > > > static bool virtqueue_kick_prepare_packed(struct virtqueue *_vq) > > > > > > > > { > > > > > > > > struct vring_virtqueue *vq = to_vvq(_vq); > > > > > > > > - u16 flags; > > > > > > > > + u16 new, old, off_wrap, flags; > > > > > > > > bool needs_kick; > > > > > > > > u32 snapshot; > > > > > > > > @@ -995,7 +995,12 @@ static bool virtqueue_kick_prepare_packed(struct virtqueue *_vq) > > > > > > > > * suppressions. */ > > > > > > > > virtio_mb(vq->weak_barriers); > > > > > > > > + old = vq->next_avail_idx - vq->num_added; > > > > > > > > + new = vq->next_avail_idx; > > > > > > > > + vq->num_added = 0; > > > > > > > > + > > > > > > > > snapshot = *(u32 *)vq->vring_packed.device; > > > > > > > > + off_wrap = virtio16_to_cpu(_vq->vdev, snapshot & 0xffff); > > > > > > > > flags = cpu_to_virtio16(_vq->vdev, snapshot >> 16) & 0x3; > > > > > > > > #ifdef DEBUG > > > > > > > > @@ -1006,7 +1011,10 @@ static bool virtqueue_kick_prepare_packed(struct virtqueue *_vq) > > > > > > > > vq->last_add_time_valid = false; > > > > > > > > #endif > > > > > > > > - needs_kick = (flags != VRING_EVENT_F_DISABLE); > > > > > > > > + if (flags == VRING_EVENT_F_DESC) > > > > > > > > + needs_kick = vring_need_event(off_wrap & ~(1<<15), new, old); > > > > > > > > > > > > > > I wonder whether or not the math is correct. Both new and event are in the > > > > > > > unit of descriptor ring size, but old looks not. > > > > > > > > > > > > What vring_need_event() cares is the distance between > > > > > > `new` and `old`, i.e. vq->num_added. So I think there > > > > > > is nothing wrong with `old`. But the calculation of the > > > > > > distance between `new` and `event_idx` isn't right when > > > > > > `new` wraps. How do you think about the below code: > > > > > > > > > > > > wrap_counter = off_wrap >> 15; > > > > > > event_idx = off_wrap & ~(1<<15); > > > > > > if (wrap_counter != vq->wrap_counter) > > > > > > event_idx -= vq->vring_packed.num; > > > > > > > > > > > > needs_kick = vring_need_event(event_idx, new, old); > > > > > > > > > > I suspect this hack won't work for non power of 2 ring. > > > > > > > > Above code doesn't require the ring size to be a power of 2. > > > > > > > > For (__u16)(new_idx - old), what we want to get is vq->num_added. > > > > > > > > old = vq->next_avail_idx - vq->num_added; > > > > new = vq->next_avail_idx; > > > > > > > > When vq->next_avail_idx >= vq->num_added, it's obvious that, > > > > (__u16)(new_idx - old) is vq->num_added. > > > > > > > > And when vq->next_avail_idx < vq->num_added, new will be smaller > > > > than old (old will be a big unsigned number), but (__u16)(new_idx > > > > - old) is still vq->num_added. > > > > > > > > For (__u16)(new_idx - event_idx - 1), when new wraps and event_idx > > > > doesn't wrap, the most straightforward way to calculate it is: > > > > (new + vq->vring_packed.num) - event_idx - 1. > > > > > > So how about we use the straightforward way then? > > > > You mean we do new += vq->vring_packed.num instead > > of event_idx -= vq->vring_packed.num before calling > > vring_need_event()? > > > > The problem is that, the second param (new_idx) of > > vring_need_event() will be used for: > > > > (__u16)(new_idx - event_idx - 1) > > (__u16)(new_idx - old) > > > > So if we change new, we will need to change old too. > > I think that since we have a branch there anyway, > we are better off just special-casing if (wrap_counter != vq->wrap_counter). > Treat is differenty and avoid casts. > > > And that would be an ugly hack.. > > > > Best regards, > > Tiwei Bie > > I consider casts and huge numbers with two's complement > games even uglier. The dependency on two's complement game is introduced since the split ring. In packed ring, old is calculated via: old = vq->next_avail_idx - vq->num_added; In split ring, old is calculated via: old = vq->avail_idx_shadow - vq->num_added; In both cases, when vq->num_added is bigger, old will be a big number. Best regards, Tiwei Bie > > > > > > > > But we can also calculate it in this way: > > > > > > > > event_idx -= vq->vring_packed.num; > > > > (event_idx will be a big unsigned number) > > > > > > > > Then (__u16)(new_idx - event_idx - 1) will be the value we want. > > > > > > > > Best regards, > > > > Tiwei Bie > > > > > > > > > > > > > > > > > > > > > > Best regards, > > > > > > Tiwei Bie > > > > > > > > > > > > > > > > > > > > > > > > > > Thanks > > > > > > > > > > > > > > > + else > > > > > > > > + needs_kick = (flags != VRING_EVENT_F_DISABLE); > > > > > > > > END_USE(vq); > > > > > > > > return needs_kick; > > > > > > > > } > > > > > > > > @@ -1116,6 +1124,15 @@ static void *virtqueue_get_buf_ctx_packed(struct virtqueue *_vq, > > > > > > > > if (vq->last_used_idx >= vq->vring_packed.num) > > > > > > > > vq->last_used_idx -= vq->vring_packed.num; > > > > > > > > + /* If we expect an interrupt for the next entry, tell host > > > > > > > > + * by writing event index and flush out the write before > > > > > > > > + * the read in the next get_buf call. */ > > > > > > > > + if (vq->event_flags_shadow == VRING_EVENT_F_DESC) > > > > > > > > + virtio_store_mb(vq->weak_barriers, > > > > > > > > + &vq->vring_packed.driver->off_wrap, > > > > > > > > + cpu_to_virtio16(_vq->vdev, vq->last_used_idx | > > > > > > > > + (vq->wrap_counter << 15))); > > > > > > > > + > > > > > > > > #ifdef DEBUG > > > > > > > > vq->last_add_time_valid = false; > > > > > > > > #endif > > > > > > > > @@ -1143,10 +1160,17 @@ static unsigned virtqueue_enable_cb_prepare_packed(struct virtqueue *_vq) > > > > > > > > /* We optimistically turn back on interrupts, then check if there was > > > > > > > > * more to do. */ > > > > > > > > + /* Depending on the VIRTIO_RING_F_USED_EVENT_IDX feature, we need to > > > > > > > > + * either clear the flags bit or point the event index at the next > > > > > > > > + * entry. Always update the event index to keep code simple. */ > > > > > > > > + > > > > > > > > + vq->vring_packed.driver->off_wrap = cpu_to_virtio16(_vq->vdev, > > > > > > > > + vq->last_used_idx | (vq->wrap_counter << 15)); > > > > > > > > if (vq->event_flags_shadow == VRING_EVENT_F_DISABLE) { > > > > > > > > virtio_wmb(vq->weak_barriers); > > > > > > > > - vq->event_flags_shadow = VRING_EVENT_F_ENABLE; > > > > > > > > + vq->event_flags_shadow = vq->event ? VRING_EVENT_F_DESC : > > > > > > > > + VRING_EVENT_F_ENABLE; > > > > > > > > vq->vring_packed.driver->flags = cpu_to_virtio16(_vq->vdev, > > > > > > > > vq->event_flags_shadow); > > > > > > > > } > > > > > > > > @@ -1172,15 +1196,34 @@ static bool virtqueue_poll_packed(struct virtqueue *_vq, unsigned last_used_idx) > > > > > > > > static bool virtqueue_enable_cb_delayed_packed(struct virtqueue *_vq) > > > > > > > > { > > > > > > > > struct vring_virtqueue *vq = to_vvq(_vq); > > > > > > > > + u16 bufs, used_idx, wrap_counter; > > > > > > > > START_USE(vq); > > > > > > > > /* We optimistically turn back on interrupts, then check if there was > > > > > > > > * more to do. */ > > > > > > > > + /* Depending on the VIRTIO_RING_F_USED_EVENT_IDX feature, we need to > > > > > > > > + * either clear the flags bit or point the event index at the next > > > > > > > > + * entry. Always update the event index to keep code simple. */ > > > > > > > > + > > > > > > > > + /* TODO: tune this threshold */ > > > > > > > > + bufs = (u16)(vq->next_avail_idx - vq->last_used_idx) * 3 / 4; > > > > > > > > + > > > > > > > > + used_idx = vq->last_used_idx + bufs; > > > > > > > > + wrap_counter = vq->wrap_counter; > > > > > > > > + > > > > > > > > + if (used_idx >= vq->vring_packed.num) { > > > > > > > > + used_idx -= vq->vring_packed.num; > > > > > > > > + wrap_counter ^= 1; > > > > > > > > + } > > > > > > > > + > > > > > > > > + vq->vring_packed.driver->off_wrap = cpu_to_virtio16(_vq->vdev, > > > > > > > > + used_idx | (wrap_counter << 15)); > > > > > > > > if (vq->event_flags_shadow == VRING_EVENT_F_DISABLE) { > > > > > > > > virtio_wmb(vq->weak_barriers); > > > > > > > > - vq->event_flags_shadow = VRING_EVENT_F_ENABLE; > > > > > > > > + vq->event_flags_shadow = vq->event ? VRING_EVENT_F_DESC : > > > > > > > > + VRING_EVENT_F_ENABLE; > > > > > > > > vq->vring_packed.driver->flags = cpu_to_virtio16(_vq->vdev, > > > > > > > > vq->event_flags_shadow); > > > > > > > > } > > > > > > > > @@ -1822,8 +1865,10 @@ void vring_transport_features(struct virtio_device *vdev) > > > > > > > > switch (i) { > > > > > > > > case VIRTIO_RING_F_INDIRECT_DESC: > > > > > > > > break; > > > > > > > > +#if 0 > > > > > > > > case VIRTIO_RING_F_EVENT_IDX: > > > > > > > > break; > > > > > > > > +#endif > > > > > > > > case VIRTIO_F_VERSION_1: > > > > > > > > break; > > > > > > > > case VIRTIO_F_IOMMU_PLATFORM: > > > > > > >