Received: by 10.192.165.148 with SMTP id m20csp5039414imm; Tue, 8 May 2018 20:43:54 -0700 (PDT) X-Google-Smtp-Source: AB8JxZqSIiDdBmZ7xe++j62knessKKCWUTppcflF7CFgbmcNqaDoN9BRHQ/shLYu6lpeJhfwGv0s X-Received: by 2002:a17:902:6b44:: with SMTP id g4-v6mr2352335plt.390.1525837434516; Tue, 08 May 2018 20:43:54 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1525837434; cv=none; d=google.com; s=arc-20160816; b=CJe+jEzZ7Ex+/8tBxGJ9+hXKq2tRiIXewU0OKryriJHbLjQWM8Apdopy82Vs7i9bWd xx6ER/AMVxLtD4fS1j88TuLHPZb0enfkHdj1Y2Op2+/MLkXBI+qXgm41TWrgKAgx7mlm gla+Z4xA1tApmAx9aN0pW8NT40gLfMrCMiru9weyqxW19P2pVdHy+8967nNeUCK9p/e/ 0nxxSAcSn2017DyGrR6MGLUq6z0jXGExUHEz7ntuWTjGL6Lj1Z+bmIM7pT+litfE95/x wjc5r6/xJEDWCCYb5XB000kHDEBg/PquJsWOT2M/B1Wcr0VpIpesc3j0KR7BpgTRwxMq cWbA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:arc-authentication-results; bh=AF46JjfqEwJSkEBNdwGemcWIV9fe+etmHkxnz/nufMQ=; b=nhyArHeQCpmm+ZvZssNRz/q0Uhkt78/jNQrUOB+RkJNe9KrAnmtafzG0rDY/sjzHwI NoxzyGNkLnP7zv/RO5meLTPdg1RFeCPlw7ASmHDC64iU2CwuA9sNT9psOF45PFacjb9H 2fgQsVtHie5Mk67N7MkaydyuqSMmjq6e8pLq934jDP+lE1IKuYH83CJE1rOAa6VyV/QO M+ockX8EOHK9oVXz3KV1tiTV+5CIz38zJm/0SuVYHuB+VkL4H1ybIV5UTaViE2/EWJ+Z 1136bwhMOMYxCYb69jKUdrmGLaOYwcV5FzG0BRrclir62aIogM0SrN+qO4q2QG1JSeDz EdEg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id q4-v6si25262598plb.251.2018.05.08.20.43.39; Tue, 08 May 2018 20:43:54 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933706AbeEIDnY (ORCPT + 99 others); Tue, 8 May 2018 23:43:24 -0400 Received: from mx3-rdu2.redhat.com ([66.187.233.73]:59368 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S932115AbeEIDnW (ORCPT ); Tue, 8 May 2018 23:43:22 -0400 Received: from smtp.corp.redhat.com (int-mx04.intmail.prod.int.rdu2.redhat.com [10.11.54.4]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id 299C0406F8B0; Wed, 9 May 2018 03:43:22 +0000 (UTC) Received: from [10.72.12.103] (ovpn-12-103.pek2.redhat.com [10.72.12.103]) by smtp.corp.redhat.com (Postfix) with ESMTPS id 5B8842022C00; Wed, 9 May 2018 03:43:17 +0000 (UTC) Subject: Re: [RFC v3 4/5] virtio_ring: add event idx support in packed ring To: Tiwei Bie Cc: "Michael S. Tsirkin" , virtualization@lists.linux-foundation.org, linux-kernel@vger.kernel.org, netdev@vger.kernel.org, wexu@redhat.com, jfreimann@redhat.com References: <20180503044218-mutt-send-email-mst@kernel.org> <20180503020949.5u3qz32gsk33z6vk@debian> <9f0b4e37-63ff-42f9-f2e6-3747a19a0206@redhat.com> <20180503135430.lbtvn4p4lyu3ksqo@debian> <12ede490-f674-2b89-d639-266b5fe15466@redhat.com> <20180508064409.kcn6amhsxu7nkuuc@debian> <34f2c690-7cb2-f9ea-2ce9-40f4ccb594c9@redhat.com> <20180508091628.d7jzpopqopq4abhy@debian> <122277c6-d103-e1f6-d695-4d64e6934a51@redhat.com> <20180508094406.qjlaism3hqy4hvjd@debian> From: Jason Wang Message-ID: Date: Wed, 9 May 2018 11:43:15 +0800 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.7.0 MIME-Version: 1.0 In-Reply-To: <20180508094406.qjlaism3hqy4hvjd@debian> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Content-Language: en-US X-Scanned-By: MIMEDefang 2.78 on 10.11.54.4 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.7]); Wed, 09 May 2018 03:43:22 +0000 (UTC) X-Greylist: inspected by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.7]); Wed, 09 May 2018 03:43:22 +0000 (UTC) for IP:'10.11.54.4' DOMAIN:'int-mx04.intmail.prod.int.rdu2.redhat.com' HELO:'smtp.corp.redhat.com' FROM:'jasowang@redhat.com' RCPT:'' Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2018年05月08日 17:44, Tiwei Bie wrote: > On Tue, May 08, 2018 at 05:34:40PM +0800, Jason Wang wrote: >> On 2018年05月08日 17:16, Tiwei Bie wrote: >>> On Tue, May 08, 2018 at 03:16:53PM +0800, Jason Wang wrote: >>>> On 2018年05月08日 14:44, Tiwei Bie wrote: >>>>> On Tue, May 08, 2018 at 01:40:40PM +0800, Jason Wang wrote: >>>>>> On 2018年05月08日 11:05, Jason Wang wrote: >>>>>>>> Because in virtqueue_enable_cb_delayed(), we may set an >>>>>>>> event_off which is bigger than new and both of them have >>>>>>>> wrapped. And in this case, although new is smaller than >>>>>>>> event_off (i.e. the third param -- old), new shouldn't >>>>>>>> add vq->num, and actually we are expecting a very big >>>>>>>> idx diff. >>>>>>> Yes, so to calculate distance correctly between event and new, we just >>>>>>> need to compare the warp counter and return false if it doesn't match >>>>>>> without the need to try to add vq.num here. >>>>>>> >>>>>>> Thanks >>>>>> Sorry, looks like the following should work, we need add vq.num if >>>>>> used_wrap_counter does not match: >>>>>> >>>>>> static bool vhost_vring_packed_need_event(struct vhost_virtqueue *vq, >>>>>>                       __u16 off_wrap, __u16 new, >>>>>>                       __u16 old) >>>>>> { >>>>>>     bool wrap = off_wrap >> 15; >>>>>>     int off = off_wrap & ~(1 << 15); >>>>>>     __u16 d1, d2; >>>>>> >>>>>>     if (wrap != vq->used_wrap_counter) >>>>>>         d1 = new + vq->num - off - 1; >>>>> Just to draw your attention (maybe you have already >>>>> noticed this). >>>> I miss this, thanks! >>>> >>>>> In this case (i.e. wrap != vq->used_wrap_counter), >>>>> it's also possible that (off < new) is true. Because, >>>>> >>>>> when virtqueue_enable_cb_delayed_packed() is used, >>>>> `off` is calculated in driver in a way like this: >>>>> >>>>> off = vq->last_used_idx + bufs; >>>>> if (off >= vq->vring_packed.num) { >>>>> off -= vq->vring_packed.num; >>>>> wrap_counter ^= 1; >>>>> } >>>>> >>>>> And when `new` (in vhost) is close to vq->num. The >>>>> vq->last_used_idx + bufs (in driver) can be bigger >>>>> than vq->vring_packed.num, and: >>>>> >>>>> 1. `off` will wrap; >>>>> 2. wrap counters won't match; >>>>> 3. off < new; >>>>> >>>>> And d1 (i.e. new + vq->num - off - 1) will be a value >>>>> bigger than vq->num. I'm okay with this, although it's >>>>> a bit weird. >>>> So I'm considering something more compact by reusing vring_need_event() by >>>> pretending a larger queue size and adding vq->num back when necessary: >>>> >>>> static bool vhost_vring_packed_need_event(struct vhost_virtqueue *vq, >>>>                       __u16 off_wrap, __u16 new, >>>>                       __u16 old) >>>> { >>>>     bool wrap = vq->used_wrap_counter; >>> If the wrap counter is obtained from the vq, >>> I think `new` should also be obtained from >>> the vq. Or the wrap counter should be carried >>> in `new`. >>> >>>>     int off = off_wrap & ~(1 << 15); >>>>     __u16 d1, d2; >>>> >>>>     if (new < old) { >>>>         new += vq->num; >>>>         wrap ^= 1; >>>>     } >>>> >>>>     if (wrap != off_wrap >> 15) >>>>         off += vq->num; >>> When `new` and `old` wraps, and `off` doesn't wrap, >>> wrap != (off_wrap >> 15) will be true. In this case, >>> `off` is bigger than `new`, and what we should do >>> is `off -= vq->num` instead of `off += vq->num`. >> If I understand this correctly, if we track old correctly, it won't happen >> if guest driver behave correctly. That means it should only happen for a >> buggy driver (e.g trying to move off_wrap back). > If vhost is faster than virtio driver, I guess above > case may happen. The `old` and `new` will be updated > each time we want to notify the driver. If the driver > is slower, `old` and `new` in vhost may wrap before > the `off` which is set by driver wraps. > > Best regards, > Tiwei Bie > Oh, right. But the code still work (in this case new - event_idx - 1 will underflow). (And I admit it still looks ugly). Thanks