Received: by 2002:ad5:4acb:0:0:0:0:0 with SMTP id n11csp3356355imw; Wed, 6 Jul 2022 23:42:58 -0700 (PDT) X-Google-Smtp-Source: AGRyM1soWc/2SXBSFxO392QTAqEHUFt5XzhtR6K2DMlFnfBP5vUcnj2ez8uR/kbGt3UxzkYzPOGb X-Received: by 2002:a17:907:869e:b0:72a:a008:82fc with SMTP id qa30-20020a170907869e00b0072aa00882fcmr28630042ejc.549.1657176178299; Wed, 06 Jul 2022 23:42:58 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1657176178; cv=none; d=google.com; s=arc-20160816; b=eBzjUYG4gfWcVJxNrOiV/ET7jaxClx6y3rUIGzYslSbUaQKFlho84wdQQSjmULPeIc GoEawK9keQ5vbP5A8TKpW0qFJSQgy1aDw9BoR/xH7wRM5WxJBUPPsNVLN7YkWcu0qPXO Wi7aHWWdIczsjaKycmq9flPJWf6FsX8cqLMU8604rswsx2W9TX5UNa0pwbBTRuei2GCf R/2MgKt4hVXrvVNZn8MFCDFjlFs3eadCEPv7AsqZrxv+EQ6ONEW+3xbgKzeMD6ZwuX7v TJsXj5rLx+gH99hqURk1OeG/L+S0g9/8o3lD09WVTU7VMzXPPHOkFp0MfMRmKM71sk+a Nr3A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=2q7+TPvdFuP3InqW/d0k42xLZ4iHApQghMEZG4X8Q3Y=; b=YWha7bo9FqJVB2s8turtzuU7bFUSrc48Q/N75TTzOTYdSHV4wvfKJmn5t4QCp/e5bV HyKNyDnX8CIjyEqoIV/PPORrkerMfTPgUSKON0yRCbBytOlJJForjxHbSlSsT++xlsAt RWZbgqfLlpDLofA1FX5qR+NXqGOtl411gUdj5aqS5uw9pf7Xk5g5AZ5TjILWFVCZZ331 DYg1+/wQzNNFiNIJuvXdDBmzM3hG1uHfg4phJ3w3ZAKrEmDLkjPKTt7Ej6vKC7G6TnJd DnZSuGZYPYSBa+4EjuraIZrbmSRsyMuxK7iGI9ilXSqmYRmb8ytPbL1ApuYiEl5KeTeF LR8Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=CWXzXyR4; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id hr10-20020a1709073f8a00b006fec3023067si24897724ejc.343.2022.07.06.23.42.32; Wed, 06 Jul 2022 23:42:58 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=CWXzXyR4; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234377AbiGGF71 (ORCPT + 99 others); Thu, 7 Jul 2022 01:59:27 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:42184 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232893AbiGGF7Z (ORCPT ); Thu, 7 Jul 2022 01:59:25 -0400 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.129.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 26CD331920 for ; Wed, 6 Jul 2022 22:59:23 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1657173563; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=2q7+TPvdFuP3InqW/d0k42xLZ4iHApQghMEZG4X8Q3Y=; b=CWXzXyR4IDvti3hon4k1qlXK8bsIwbZr+jUtbLRsmbvigGRAufeLxIF3GYOd82t8HMweLJ cLo2MShKHQE0ZjS9BnlTBV7F5v1JU89LHdvt48omsihiG6rz5gqHyX3BLq53fXDCxF1YIf EZ7pJYaJWU041doHjfl6Fr5v3Vkf4XE= Received: from mail-wr1-f69.google.com (mail-wr1-f69.google.com [209.85.221.69]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-557-I8qScZNwPICjd91bStWIdA-1; Thu, 07 Jul 2022 01:59:21 -0400 X-MC-Unique: I8qScZNwPICjd91bStWIdA-1 Received: by mail-wr1-f69.google.com with SMTP id e7-20020adfa447000000b0021d669c6a76so2227636wra.19 for ; Wed, 06 Jul 2022 22:59:21 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=2q7+TPvdFuP3InqW/d0k42xLZ4iHApQghMEZG4X8Q3Y=; b=q4j60X+D+Bd957b3g7D4dGRomBcm9Zu3noEZ5e+X83cbvR7vmde1mVbQ9z3ebkDspc YcV7/8CVSUIgQdSgRGiJIK41oQhhjrlK1q9a8SsD2rR8q+Ohwa+BAjsnYhkLRDhld+y7 J6hi7X3b0LvifAKpPJkWh4VlxnbYm9X7/IhZR/JJgMMJB6Yk1kH5hOQj5Kqf7jUjwnUu Qbq1xh/W4Qo/oQTOFaZV2ug0yMXaRpYBWzY4EUZ8Q79v1O8810flUImBBAuY8Wx3htdl ohA6K51GOzNXppfWs5bW/JBSRNQdVQI6fLXGCKgdJRvvNwDciTrYwBkMEkJpnUcJey+p 5xDQ== X-Gm-Message-State: AJIora/4EqxfCXc1KSisItc6hZVQVUl9f7BMQgmMeXOh2JFrmVZqfZgm ZvQ+U5i0EUD6Ol71KiY7lCBm8/Ly1HYNDL467GHkeX80/zx4h+PmKElex7XN+zIuEuS2RslVzmM lXlAdQ0OYZfj5HpHSkorvw7P3 X-Received: by 2002:a7b:cb10:0:b0:3a0:312c:fd3c with SMTP id u16-20020a7bcb10000000b003a0312cfd3cmr2423652wmj.52.1657173560284; Wed, 06 Jul 2022 22:59:20 -0700 (PDT) X-Received: by 2002:a7b:cb10:0:b0:3a0:312c:fd3c with SMTP id u16-20020a7bcb10000000b003a0312cfd3cmr2423633wmj.52.1657173560015; Wed, 06 Jul 2022 22:59:20 -0700 (PDT) Received: from redhat.com ([2.52.154.68]) by smtp.gmail.com with ESMTPSA id o4-20020a05600c378400b003a2b708c26dsm7219527wmr.40.2022.07.06.22.59.18 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 06 Jul 2022 22:59:19 -0700 (PDT) Date: Thu, 7 Jul 2022 01:59:15 -0400 From: "Michael S. Tsirkin" To: Guo Zhi Cc: jasowang@redhat.com, eperezma@redhat.com, virtualization@lists.linux-foundation.org, linux-kernel@vger.kernel.org, sgarzare@redhat.com Subject: Re: [PATCH v2 4/4] virtio_test: enable indirection feature Message-ID: <20220707015717-mutt-send-email-mst@kernel.org> References: <20220707024409.1869-1-qtxuning1999@sjtu.edu.cn> <20220707024409.1869-5-qtxuning1999@sjtu.edu.cn> <20220707005311-mutt-send-email-mst@kernel.org> <2ff418ad-0ab8-1474-4308-2d163c1e6652@sjtu.edu.cn> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <2ff418ad-0ab8-1474-4308-2d163c1e6652@sjtu.edu.cn> X-Spam-Status: No, score=-3.5 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_LOW, SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Jul 07, 2022 at 01:56:37PM +0800, Guo Zhi wrote: > On 2022/7/7 12:59, Michael S. Tsirkin wrote: > > On Thu, Jul 07, 2022 at 10:44:09AM +0800, Guo Zhi wrote: > > > Prior implementation don't use indirection feature because there is only > > > one descriptor for every io event, actually prior implementation don't > > > support indirection because vhost can't translate and find the indirect > > > descriptors. This commit enable virtio_test malloc indirect descriptors > > > in a indirect buffer and map this buffer to vhost, thus resolve this > > > problem. > > > > > > Signed-off-by: Guo Zhi > > > --- > > > tools/virtio/virtio_test.c | 50 ++++++++++++++++++++++++++++++++------ > > > 1 file changed, 42 insertions(+), 8 deletions(-) > > > > > > diff --git a/tools/virtio/virtio_test.c b/tools/virtio/virtio_test.c > > > index 363695b33..dca408a5c 100644 > > > --- a/tools/virtio/virtio_test.c > > > +++ b/tools/virtio/virtio_test.c > > > @@ -25,7 +25,7 @@ > > > #define RINGSIZE 256 > > > #define TEST_BUF_NUM 0x100000 > > > #define BUF_SIZE 1024 > > > -/* Unused */ > > > +#define INDIRECTS_SIZE (RINGSIZE * sizeof(struct vring_desc) * 8) > > > void *__kmalloc_fake, *__kfree_ignore_start, *__kfree_ignore_end; > > > struct vq_info { > > > @@ -47,6 +47,8 @@ struct vdev_info { > > > int nvqs; > > > void *buf; > > > size_t buf_size; > > > + void *indirects; > > > + size_t indirects_size; > > What are these exactly? > The buffer is used to put indirect descriptors, and the buffer will be added > in the vhost iotlb through VHOST_SET_MEM_TABLE, so that vhost can translate > the descriptors in user mode correctly. Pls come up with a better name. indirect_buf and indirect_buf_size? And add a comment. > > > > > struct vhost_memory *mem; > > > }; > > > @@ -131,6 +133,8 @@ static void vq_info_add(struct vdev_info *dev, int num) > > > static void vdev_info_init(struct vdev_info* dev, unsigned long long features) > > > { > > > int r; > > > + int nregions = 2; > > > + > > > memset(dev, 0, sizeof *dev); > > > dev->vdev.features = features; > > > INIT_LIST_HEAD(&dev->vdev.vqs); > > > @@ -138,19 +142,25 @@ static void vdev_info_init(struct vdev_info* dev, unsigned long long features) > > > dev->buf_size = BUF_SIZE; > > > dev->buf = malloc(dev->buf_size); > > > assert(dev->buf); > > > - dev->control = open("/dev/vhost-test", O_RDWR); > > > + dev->indirects_size = INDIRECTS_SIZE; > > > + dev->indirects = malloc(dev->indirects_size); > > > + assert(dev->indirects); > > > + dev->control = open("/dev/vhost-test", O_RDWR); > > > assert(dev->control >= 0); > > > r = ioctl(dev->control, VHOST_SET_OWNER, NULL); > > > assert(r >= 0); > > > dev->mem = malloc(offsetof(struct vhost_memory, regions) + > > > - sizeof dev->mem->regions[0]); > > > + (sizeof(dev->mem->regions[0])) * nregions); > > > assert(dev->mem); > > > memset(dev->mem, 0, offsetof(struct vhost_memory, regions) + > > > - sizeof dev->mem->regions[0]); > > > - dev->mem->nregions = 1; > > > + (sizeof(dev->mem->regions[0])) * nregions); > > > + dev->mem->nregions = nregions; > > > dev->mem->regions[0].guest_phys_addr = (long)dev->buf; > > > dev->mem->regions[0].userspace_addr = (long)dev->buf; > > > dev->mem->regions[0].memory_size = dev->buf_size; > > > + dev->mem->regions[1].guest_phys_addr = (long)dev->indirects; > > > + dev->mem->regions[1].userspace_addr = (long)dev->indirects; > > > + dev->mem->regions[1].memory_size = dev->indirects_size; > > > r = ioctl(dev->control, VHOST_SET_MEM_TABLE, dev->mem); > > > assert(r >= 0); > > > } > > > @@ -170,6 +180,19 @@ static void wait_for_interrupt(struct vdev_info *dev) > > > } > > > } > > > +static int test_virtqueue_add_outbuf(struct virtqueue *vq, > > > + struct scatterlist *sg, unsigned int num, > > > + void *data, void *indirects) > > > +{ > > > + int r; > > > + > > > + __kmalloc_fake = indirects; > > > + r = virtqueue_add_outbuf(vq, sg, num, data, > > > + GFP_ATOMIC); > > > + __kmalloc_fake = NULL; > > > + return r; > > > +} > > > + > > Quite a hack. Please add comments with documentation. Also - no way to > > avoid hacks? > > The __kmalloc_fake is refered from vringh_test. > > If not using __kmalloc_fake here, the vhost doesn't know how to translate > the indirects descriptors(user address). > > We may could register a single region as large as the whole virtual space in > the vhost iotlb using 1:1 mapping. > > But since they are tests, IMHO, better here to use VHOST_SET_MEM_TABLE with > more regions. But why do we need to do this on each buffer add? > > > > > static void run_test(struct vdev_info *dev, struct vq_info *vq, > > > bool delayed, int batch, int reset_n, int bufs) > > > { > > > @@ -181,6 +204,7 @@ static void run_test(struct vdev_info *dev, struct vq_info *vq, > > > unsigned len; > > > long long spurious = 0; > > > const bool random_batch = batch == RANDOM_BATCH; > > > + void *indirects; > > > r = ioctl(dev->control, VHOST_TEST_RUN, &test); > > > assert(r >= 0); > > > @@ -188,10 +212,15 @@ static void run_test(struct vdev_info *dev, struct vq_info *vq, > > > next_reset = INT_MAX; > > > } > > > + /* Don't kfree indirects. */ > > > + __kfree_ignore_start = dev->indirects; > > > + __kfree_ignore_end = dev->indirects + dev->indirects_size; > > > + > > > for (;;) { > > > virtqueue_disable_cb(vq->vq); > > > completed_before = completed; > > > started_before = started; > > > + indirects = dev->indirects; > > > do { > > > const bool reset = completed > next_reset; > > > if (random_batch) > > > @@ -203,9 +232,13 @@ static void run_test(struct vdev_info *dev, struct vq_info *vq, > > > sg_init_table(sg, sg_size); > > > for (int i = 0; i < sg_size; ++i) > > > sg_set_buf(&sg[i], dev->buf + i, 0x1); > > > - r = virtqueue_add_outbuf(vq->vq, sg, sg_size, > > > - dev->buf + started, > > > - GFP_ATOMIC); > > > + > > > + // use indirects buffer repeatedly > > > > C style comments please. > It will be modified. > > > > > + if (indirects + sg_size * sizeof(struct vring_desc) > > > > + dev->indirects + dev->indirects_size) > > > + indirects = dev->indirects; > > > + r = test_virtqueue_add_outbuf(vq->vq, sg, sg_size, > > > + dev->buf + started, indirects); > > > if (unlikely(r != 0)) { > > > if (r == -ENOSPC && > > > started > started_before) > > > @@ -216,6 +249,7 @@ static void run_test(struct vdev_info *dev, struct vq_info *vq, > > > } > > > ++started; > > > + indirects += sg_size * sizeof(struct vring_desc); > > > } > > > if (unlikely(!virtqueue_kick(vq->vq))) { > > > r = -1; > > > -- > > > 2.17.1 >