Received: by 2002:a05:7412:da14:b0:e2:908c:2ebd with SMTP id fe20csp1963219rdb; Mon, 9 Oct 2023 08:18:38 -0700 (PDT) X-Google-Smtp-Source: AGHT+IG9w0FkK43hpoVbYrZSfK32rbGPipu7Q1NDTcCw3eQKf0HOwmtTAosllSEsADlgbuAHxjuU X-Received: by 2002:a17:90a:fe94:b0:262:ba7f:30cd with SMTP id co20-20020a17090afe9400b00262ba7f30cdmr12528951pjb.31.1696864717616; Mon, 09 Oct 2023 08:18:37 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1696864717; cv=none; d=google.com; s=arc-20160816; b=gs5MjtDpWQz6AIdBQufTPN0CoeCgSLSpjfCpnnhU5Ft/M1OtuT0bEeVaGXXjRZvWZU /XuGTMT1c0d0nzhbUZb/HQk1Yd5tDuQYuZGDxjc5HVnlLi0Ow4x8Ss8q6mfNQtjMc5jO iHHzpGmgWu9+xEXeVMkE71c9FVL5hGRVyKdXcXZ72H1YVe8j9dPDlxNVvLg7J8zl74/e qC4F+pxTry7Nxk020kCdXceGch/14lE949YbmLkEFaZo02MaZ034aqKOT0LRzWkIcrPB Avv31WZBtBLM4ymelGyeCNyuKW/T5SKiS8qfJIhdkunCbWmvnnplrYCDlHpfSje1At/d Wvcw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-transfer-encoding :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature; bh=xDBWSVPFzuu4QHrut7TRW4v5VVDIvv3bRJmPztxHrTA=; fh=JEq7ZyqHwnKMAbPjHbEHIOO1u1YxTWxtQlCbJmwG+xw=; b=HU7CDnG7oICPx3rIMkCLjqNZxk5fTqwulpd85+j8NVFN+QZfMBFvC0GW/RweozimbP Ye1jb3z8b4GLEd8g10qQT3ioUyEgXeat7GEixGQY5JYwcZhBwkB70yZdJzjiJTQmY+u1 b3oM/R9gtRHT26J35VPmT8KIL37Wq7ZgptX/JmXqqtqjk7filxnc87XJ7oNTtgeU4hPb 2DN583+j4EZA7W/tUl2F1r6D31Mja8j/kTBXuWKq+08bVuT5uuNyoHoafsB6mV/vddbf iomB6h4I6/bOaUq1g1kiq1XOlHYf/+puyiqpS7a6CXgj0HR+ziKy36FODTjYcaM0Nmt3 Rb3Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b="E5/xNa0O"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:3 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from lipwig.vger.email (lipwig.vger.email. [2620:137:e000::3:3]) by mx.google.com with ESMTPS id fh24-20020a17090b035800b00274ac850fbbsi4177266pjb.132.2023.10.09.08.18.37 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 09 Oct 2023 08:18:37 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:3 as permitted sender) client-ip=2620:137:e000::3:3; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b="E5/xNa0O"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:3 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by lipwig.vger.email (Postfix) with ESMTP id 5969580465D8; Mon, 9 Oct 2023 08:18:33 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at lipwig.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1376903AbjJIPSS (ORCPT + 99 others); Mon, 9 Oct 2023 11:18:18 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:58538 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1376698AbjJIPSO (ORCPT ); Mon, 9 Oct 2023 11:18:14 -0400 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.129.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 5585FDA for ; Mon, 9 Oct 2023 08:17:29 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1696864648; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=xDBWSVPFzuu4QHrut7TRW4v5VVDIvv3bRJmPztxHrTA=; b=E5/xNa0ORcNyyzDqpCCnKqNmId5Vl/4i778MnHWB9r6/NAwwR09/J5WbQpkk5nNXZQ9p4C +1K79LXHpx85j2VdK199dyuVhlF9p57gg8kBN4tDVGrNvSwefCnKQr/3GNDEBvRvG4q8UJ IiKy3I3kxKhczOMps7h4srvQCpEBlzc= Received: from mail-ed1-f72.google.com (mail-ed1-f72.google.com [209.85.208.72]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_256_GCM_SHA384) id us-mta-219-2Cj9pQmRNvmh44eYMxWplw-1; Mon, 09 Oct 2023 11:17:24 -0400 X-MC-Unique: 2Cj9pQmRNvmh44eYMxWplw-1 Received: by mail-ed1-f72.google.com with SMTP id 4fb4d7f45d1cf-538128e18e9so3770801a12.2 for ; Mon, 09 Oct 2023 08:17:24 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1696864643; x=1697469443; h=in-reply-to:content-transfer-encoding:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=xDBWSVPFzuu4QHrut7TRW4v5VVDIvv3bRJmPztxHrTA=; b=ZTap0Lea9QaOM4NQeFV+BZ7RvtUWPxcSFniA9BxLfI2rp1trSi8JYhrSUW7NQQeVFf B/OsvLu3cD3/rUbfHwZRFL83TREQLWhEgZcExgXPOuXK0vOjTk3Kgj/bA4YdYgTJGfWc nuy8OjDTJrHgZnEk3EOslsSEUy3iJgM1s11dEk9XtqfuGG2qMbuKM2N6bkFocD3UJ0Az FnUXhOrr06OboKQ7fZ94ngMcAqpka5YbWAmGnHk0SQ2UKFQ2pIFAdI5ZTUz0fuiPap3E JL4cM1x7OkPwVjDXwq9al7wdW5GPiIHDFEkvECD/1ySMf40O2zHIZ1kOGBKOUFA/iNQ4 jNNw== X-Gm-Message-State: AOJu0YxdLCU0WfqsiTK8YPV2CV5TKxb1IjmpDr05ppsAR0lfMH7xxmDv nMpFzrvupER/7Bdy1nAj32kSdB1a9k8LDzvi23+Uic1Ui6GdqV/Le1frpjhfXG4/ABe/29952C0 5Wy39QW4fli8i8T+G75hlPKPy X-Received: by 2002:aa7:d94e:0:b0:530:bd6b:7a94 with SMTP id l14-20020aa7d94e000000b00530bd6b7a94mr15374208eds.24.1696864643555; Mon, 09 Oct 2023 08:17:23 -0700 (PDT) X-Received: by 2002:aa7:d94e:0:b0:530:bd6b:7a94 with SMTP id l14-20020aa7d94e000000b00530bd6b7a94mr15374194eds.24.1696864643232; Mon, 09 Oct 2023 08:17:23 -0700 (PDT) Received: from sgarzare-redhat (host-82-57-51-114.retail.telecomitalia.it. [82.57.51.114]) by smtp.gmail.com with ESMTPSA id o14-20020aa7c50e000000b0052595b17fd4sm6146625edq.26.2023.10.09.08.17.21 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 09 Oct 2023 08:17:22 -0700 (PDT) Date: Mon, 9 Oct 2023 17:17:20 +0200 From: Stefano Garzarella To: Arseniy Krasnov Cc: Stefan Hajnoczi , "David S. Miller" , Eric Dumazet , Jakub Kicinski , Paolo Abeni , "Michael S. Tsirkin" , Jason Wang , Bobby Eshleman , kvm@vger.kernel.org, virtualization@lists.linux-foundation.org, netdev@vger.kernel.org, linux-kernel@vger.kernel.org, kernel@sberdevices.ru, oxffffaa@gmail.com Subject: Re: [PATCH net-next v3 10/12] test/vsock: MSG_ZEROCOPY flag tests Message-ID: References: <20231007172139.1338644-1-avkrasnov@salutedevices.com> <20231007172139.1338644-11-avkrasnov@salutedevices.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20231007172139.1338644-11-avkrasnov@salutedevices.com> X-Spam-Status: No, score=2.7 required=5.0 tests=DKIMWL_WL_HIGH,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,RCVD_IN_SBL_CSS,SPF_HELO_NONE,SPF_PASS autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lipwig.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (lipwig.vger.email [0.0.0.0]); Mon, 09 Oct 2023 08:18:33 -0700 (PDT) X-Spam-Level: ** On Sat, Oct 07, 2023 at 08:21:37PM +0300, Arseniy Krasnov wrote: >This adds three tests for MSG_ZEROCOPY feature: >1) SOCK_STREAM tx with different buffers. >2) SOCK_SEQPACKET tx with different buffers. >3) SOCK_STREAM test to read empty error queue of the socket. > >Patch also works as preparation for the next patches for tools in this >patchset: vsock_perf and vsock_uring_test: >1) Adds several new functions to util.c - they will be also used by > vsock_uring_test. >2) Adds two new functions for MSG_ZEROCOPY handling to a new header > file - such header will be shared between vsock_test, vsock_perf and > vsock_uring_test, thus avoiding code copy-pasting. > >Signed-off-by: Arseniy Krasnov >--- > Changelog: > v1 -> v2: > * Move 'SOL_VSOCK' and 'VSOCK_RECVERR' from 'util.c' to 'util.h'. > v2 -> v3: > * Patch was reworked. Now it is also preparation patch (see commit > message). Shared stuff for 'vsock_perf' and tests is placed to a > new header file, while shared code between current test tool and > future uring test is placed to the 'util.c'. I think, that making > this patch as preparation allows to reduce number of changes in the > next patches in this patchset. > * Make 'struct vsock_test_data' private by placing it to the .c file. > Also add comments to this struct to clarify sense of its fields. > > tools/testing/vsock/Makefile | 2 +- > tools/testing/vsock/msg_zerocopy_common.h | 92 ++++++ > tools/testing/vsock/util.c | 110 +++++++ > tools/testing/vsock/util.h | 5 + > tools/testing/vsock/vsock_test.c | 16 + > tools/testing/vsock/vsock_test_zerocopy.c | 367 ++++++++++++++++++++++ > tools/testing/vsock/vsock_test_zerocopy.h | 15 + > 7 files changed, 606 insertions(+), 1 deletion(-) > create mode 100644 tools/testing/vsock/msg_zerocopy_common.h > create mode 100644 tools/testing/vsock/vsock_test_zerocopy.c > create mode 100644 tools/testing/vsock/vsock_test_zerocopy.h > >diff --git a/tools/testing/vsock/Makefile b/tools/testing/vsock/Makefile >index 21a98ba565ab..1a26f60a596c 100644 >--- a/tools/testing/vsock/Makefile >+++ b/tools/testing/vsock/Makefile >@@ -1,7 +1,7 @@ > # SPDX-License-Identifier: GPL-2.0-only > all: test vsock_perf > test: vsock_test vsock_diag_test >-vsock_test: vsock_test.o timeout.o control.o util.o >+vsock_test: vsock_test.o vsock_test_zerocopy.o timeout.o control.o util.o > vsock_diag_test: vsock_diag_test.o timeout.o control.o util.o > vsock_perf: vsock_perf.o > >diff --git a/tools/testing/vsock/msg_zerocopy_common.h b/tools/testing/vsock/msg_zerocopy_common.h >new file mode 100644 >index 000000000000..ce89f1281584 >--- /dev/null >+++ b/tools/testing/vsock/msg_zerocopy_common.h >@@ -0,0 +1,92 @@ >+/* SPDX-License-Identifier: GPL-2.0-only */ >+#ifndef MSG_ZEROCOPY_COMMON_H >+#define MSG_ZEROCOPY_COMMON_H >+ >+#include >+#include >+#include >+#include >+#include >+ >+#ifndef SOL_VSOCK >+#define SOL_VSOCK 287 >+#endif >+ >+#ifndef VSOCK_RECVERR >+#define VSOCK_RECVERR 1 >+#endif >+ >+static void enable_so_zerocopy(int fd) >+{ >+ int val = 1; >+ >+ if (setsockopt(fd, SOL_SOCKET, SO_ZEROCOPY, &val, sizeof(val))) { >+ perror("setsockopt"); >+ exit(EXIT_FAILURE); >+ } >+} >+ >+static void vsock_recv_completion(int fd, const bool *zerocopied) __maybe_unused; To avoid this, maybe we can implement those functions in .c file and link the object. WDYT? Ah, here (cc (GCC) 13.2.1 20230728 (Red Hat 13.2.1-1)) the build is failing: In file included from vsock_perf.c:23: msg_zerocopy_common.h: In function ‘vsock_recv_completion’: msg_zerocopy_common.h:29:67: error: expected declaration specifiers before ‘__maybe_unused’ 29 | static void vsock_recv_completion(int fd, const bool *zerocopied) __maybe_unused; | ^~~~~~~~~~~~~~ msg_zerocopy_common.h:31:1: error: expected ‘=’, ‘,’, ‘;’, ‘asm’ or ‘__attribute__’ before ‘{’ token 31 | { | ^ >+static void vsock_recv_completion(int fd, const bool *zerocopied) >+{ >+ struct sock_extended_err *serr; >+ struct msghdr msg = { 0 }; >+ char cmsg_data[128]; >+ struct cmsghdr *cm; >+ ssize_t res; >+ >+ msg.msg_control = cmsg_data; >+ msg.msg_controllen = sizeof(cmsg_data); >+ >+ res = recvmsg(fd, &msg, MSG_ERRQUEUE); >+ if (res) { >+ fprintf(stderr, "failed to read error queue: %zi\n", res); >+ exit(EXIT_FAILURE); >+ } >+ >+ cm = CMSG_FIRSTHDR(&msg); >+ if (!cm) { >+ fprintf(stderr, "cmsg: no cmsg\n"); >+ exit(EXIT_FAILURE); >+ } >+ >+ if (cm->cmsg_level != SOL_VSOCK) { >+ fprintf(stderr, "cmsg: unexpected 'cmsg_level'\n"); >+ exit(EXIT_FAILURE); >+ } >+ >+ if (cm->cmsg_type != VSOCK_RECVERR) { >+ fprintf(stderr, "cmsg: unexpected 'cmsg_type'\n"); >+ exit(EXIT_FAILURE); >+ } >+ >+ serr = (void *)CMSG_DATA(cm); >+ if (serr->ee_origin != SO_EE_ORIGIN_ZEROCOPY) { >+ fprintf(stderr, "serr: wrong origin: %u\n", serr->ee_origin); >+ exit(EXIT_FAILURE); >+ } >+ >+ if (serr->ee_errno) { >+ fprintf(stderr, "serr: wrong error code: %u\n", serr->ee_errno); >+ exit(EXIT_FAILURE); >+ } >+ >+ /* This flag is used for tests, to check that transmission was >+ * performed as expected: zerocopy or fallback to copy. If NULL >+ * - don't care. >+ */ >+ if (!zerocopied) >+ return; >+ >+ if (*zerocopied && (serr->ee_code & SO_EE_CODE_ZEROCOPY_COPIED)) { >+ fprintf(stderr, "serr: was copy instead of zerocopy\n"); >+ exit(EXIT_FAILURE); >+ } >+ >+ if (!*zerocopied && !(serr->ee_code & SO_EE_CODE_ZEROCOPY_COPIED)) { >+ fprintf(stderr, "serr: was zerocopy instead of copy\n"); >+ exit(EXIT_FAILURE); >+ } >+} >+ >+#endif /* MSG_ZEROCOPY_COMMON_H */ >diff --git a/tools/testing/vsock/util.c b/tools/testing/vsock/util.c >index 6779d5008b27..b1770edd8cc1 100644 >--- a/tools/testing/vsock/util.c >+++ b/tools/testing/vsock/util.c >@@ -11,10 +11,12 @@ > #include > #include > #include >+#include > #include > #include > #include > #include >+#include > > #include "timeout.h" > #include "control.h" >@@ -444,3 +446,111 @@ unsigned long hash_djb2(const void *data, size_t len) > > return hash; > } >+ >+size_t iovec_bytes(const struct iovec *iov, size_t iovnum) >+{ >+ size_t bytes; >+ int i; >+ >+ for (bytes = 0, i = 0; i < iovnum; i++) >+ bytes += iov[i].iov_len; >+ >+ return bytes; >+} >+ >+unsigned long iovec_hash_djb2(const struct iovec *iov, size_t iovnum) >+{ >+ unsigned long hash; >+ size_t iov_bytes; >+ size_t offs; >+ void *tmp; >+ int i; >+ >+ iov_bytes = iovec_bytes(iov, iovnum); >+ >+ tmp = malloc(iov_bytes); >+ if (!tmp) { >+ perror("malloc"); >+ exit(EXIT_FAILURE); >+ } >+ >+ for (offs = 0, i = 0; i < iovnum; i++) { >+ memcpy(tmp + offs, iov[i].iov_base, iov[i].iov_len); >+ offs += iov[i].iov_len; >+ } >+ >+ hash = hash_djb2(tmp, iov_bytes); >+ free(tmp); >+ >+ return hash; >+} >+ >+struct iovec *iovec_from_test_data(const struct iovec *test_iovec, int >iovnum) From the name this function seems related to vsock_test_data, so I'd suggest to move this and free_iovec_test_data() in vsock_test_zerocopy.c >+{ >+ struct iovec *iovec; >+ int i; >+ >+ iovec = malloc(sizeof(*iovec) * iovnum); >+ if (!iovec) { >+ perror("malloc"); >+ exit(EXIT_FAILURE); >+ } >+ >+ for (i = 0; i < iovnum; i++) { >+ iovec[i].iov_len = test_iovec[i].iov_len; >+ >+ iovec[i].iov_base = mmap(NULL, iovec[i].iov_len, >+ PROT_READ | PROT_WRITE, >+ MAP_PRIVATE | MAP_ANONYMOUS | MAP_POPULATE, >+ -1, 0); >+ if (iovec[i].iov_base == MAP_FAILED) { >+ perror("mmap"); >+ exit(EXIT_FAILURE); >+ } >+ >+ if (test_iovec[i].iov_base != MAP_FAILED) >+ iovec[i].iov_base += (uintptr_t)test_iovec[i].iov_base; >+ } >+ >+ /* Unmap "invalid" elements. */ >+ for (i = 0; i < iovnum; i++) { >+ if (test_iovec[i].iov_base == MAP_FAILED) { >+ if (munmap(iovec[i].iov_base, iovec[i].iov_len)) { >+ perror("munmap"); >+ exit(EXIT_FAILURE); >+ } >+ } >+ } >+ >+ for (i = 0; i < iovnum; i++) { >+ int j; >+ >+ if (test_iovec[i].iov_base == MAP_FAILED) >+ continue; >+ >+ for (j = 0; j < iovec[i].iov_len; j++) >+ ((uint8_t *)iovec[i].iov_base)[j] = rand() & 0xff; >+ } >+ >+ return iovec; >+} >+ >+void free_iovec_test_data(const struct iovec *test_iovec, >+ struct iovec *iovec, int iovnum) >+{ >+ int i; >+ >+ for (i = 0; i < iovnum; i++) { >+ if (test_iovec[i].iov_base != MAP_FAILED) { >+ if (test_iovec[i].iov_base) >+ iovec[i].iov_base -= (uintptr_t)test_iovec[i].iov_base; >+ >+ if (munmap(iovec[i].iov_base, iovec[i].iov_len)) { >+ perror("munmap"); >+ exit(EXIT_FAILURE); >+ } >+ } >+ } >+ >+ free(iovec); >+} >diff --git a/tools/testing/vsock/util.h b/tools/testing/vsock/util.h >index e5407677ce05..4cacb8d804c1 100644 >--- a/tools/testing/vsock/util.h >+++ b/tools/testing/vsock/util.h >@@ -53,4 +53,9 @@ void list_tests(const struct test_case *test_cases); > void skip_test(struct test_case *test_cases, size_t test_cases_len, > const char *test_id_str); > unsigned long hash_djb2(const void *data, size_t len); >+size_t iovec_bytes(const struct iovec *iov, size_t iovnum); >+unsigned long iovec_hash_djb2(const struct iovec *iov, size_t iovnum); >+struct iovec *iovec_from_test_data(const struct iovec *test_iovec, int iovnum); >+void free_iovec_test_data(const struct iovec *test_iovec, >+ struct iovec *iovec, int iovnum); > #endif /* UTIL_H */ >diff --git a/tools/testing/vsock/vsock_test.c b/tools/testing/vsock/vsock_test.c >index da4cb819a183..c1f7bc9abd22 100644 >--- a/tools/testing/vsock/vsock_test.c >+++ b/tools/testing/vsock/vsock_test.c >@@ -21,6 +21,7 @@ > #include > #include > >+#include "vsock_test_zerocopy.h" > #include "timeout.h" > #include "control.h" > #include "util.h" >@@ -1269,6 +1270,21 @@ static struct test_case test_cases[] = { > .run_client = test_stream_shutrd_client, > .run_server = test_stream_shutrd_server, > }, >+ { >+ .name = "SOCK_STREAM MSG_ZEROCOPY", >+ .run_client = test_stream_msgzcopy_client, >+ .run_server = test_stream_msgzcopy_server, >+ }, >+ { >+ .name = "SOCK_SEQPACKET MSG_ZEROCOPY", >+ .run_client = test_seqpacket_msgzcopy_client, >+ .run_server = test_seqpacket_msgzcopy_server, >+ }, >+ { >+ .name = "SOCK_STREAM MSG_ZEROCOPY empty MSG_ERRQUEUE", >+ .run_client = test_stream_msgzcopy_empty_errq_client, >+ .run_server = test_stream_msgzcopy_empty_errq_server, >+ }, > {}, > }; > >diff --git a/tools/testing/vsock/vsock_test_zerocopy.c b/tools/testing/vsock/vsock_test_zerocopy.c >new file mode 100644 >index 000000000000..af14efdf334b >--- /dev/null >+++ b/tools/testing/vsock/vsock_test_zerocopy.c >@@ -0,0 +1,367 @@ >+// SPDX-License-Identifier: GPL-2.0-only >+/* MSG_ZEROCOPY feature tests for vsock >+ * >+ * Copyright (C) 2023 SberDevices. >+ * >+ * Author: Arseniy Krasnov >+ */ >+ >+#include >+#include >+#include >+#include >+#include >+#include >+#include >+#include >+#include >+ >+#include "control.h" >+#include "vsock_test_zerocopy.h" >+#include "msg_zerocopy_common.h" >+ >+#define PAGE_SIZE 4096 In some tests I saw `sysconf(_SC_PAGESIZE)` is used, e.g. in selftests/ptrace/peeksiginfo.c: #ifndef PAGE_SIZE #define PAGE_SIZE sysconf(_SC_PAGESIZE) #endif WDYT? >+ >+#define VSOCK_TEST_DATA_MAX_IOV 3 >+ >+struct vsock_test_data { >+ /* This test case if for SOCK_STREAM only. */ >+ bool stream_only; >+ /* Data must be zerocopied. This field is checked against >+ * field 'ee_code' of the 'struct sock_extended_err', which >+ * contains bit to detect that zerocopy transmission was >+ * fallbacked to copy mode. >+ */ >+ bool zerocopied; >+ /* Enable SO_ZEROCOPY option on the socket. Without enabled >+ * SO_ZEROCOPY, every MSG_ZEROCOPY transmission will behave >+ * like without MSG_ZEROCOPY flag. >+ */ >+ bool so_zerocopy; >+ /* 'errno' after 'sendmsg()' call. */ >+ int sendmsg_errno; >+ /* Number of valid elements in 'vecs'. */ >+ int vecs_cnt; >+ /* Array how to allocate buffers for test. >+ * 'iov_base' == NULL -> valid buf: mmap('iov_len'). >+ * >+ * 'iov_base' == MAP_FAILED -> invalid buf: >+ * mmap('iov_len'), then munmap('iov_len'). >+ * 'iov_base' still contains result of >+ * mmap(). >+ * >+ * 'iov_base' == number -> unaligned valid buf: >+ * mmap('iov_len') + number. >+ */ >+ struct iovec vecs[VSOCK_TEST_DATA_MAX_IOV]; >+}; >+ >+static struct vsock_test_data test_data_array[] = { >+ /* Last element has non-page aligned size. */ >+ { >+ .zerocopied = true, >+ .so_zerocopy = true, >+ .sendmsg_errno = 0, >+ .vecs_cnt = 3, >+ { >+ { NULL, PAGE_SIZE }, >+ { NULL, PAGE_SIZE }, >+ { NULL, 200 } >+ } >+ }, >+ /* All elements have page aligned base and size. */ >+ { >+ .zerocopied = true, >+ .so_zerocopy = true, >+ .sendmsg_errno = 0, >+ .vecs_cnt = 3, >+ { >+ { NULL, PAGE_SIZE }, >+ { NULL, PAGE_SIZE * 2 }, >+ { NULL, PAGE_SIZE * 3 } >+ } >+ }, >+ /* All elements have page aligned base and size. But >+ * data length is bigger than 64Kb. >+ */ >+ { >+ .zerocopied = true, >+ .so_zerocopy = true, >+ .sendmsg_errno = 0, >+ .vecs_cnt = 3, >+ { >+ { NULL, PAGE_SIZE * 16 }, >+ { NULL, PAGE_SIZE * 16 }, >+ { NULL, PAGE_SIZE * 16 } >+ } >+ }, >+ /* Middle element has both non-page aligned base and size. */ >+ { >+ .zerocopied = true, >+ .so_zerocopy = true, >+ .sendmsg_errno = 0, >+ .vecs_cnt = 3, >+ { >+ { NULL, PAGE_SIZE }, >+ { (void *)1, 100 }, >+ { NULL, PAGE_SIZE } >+ } >+ }, >+ /* Middle element is unmapped. */ >+ { >+ .zerocopied = false, >+ .so_zerocopy = true, >+ .sendmsg_errno = ENOMEM, >+ .vecs_cnt = 3, >+ { >+ { NULL, PAGE_SIZE }, >+ { MAP_FAILED, PAGE_SIZE }, >+ { NULL, PAGE_SIZE } >+ } >+ }, >+ /* Valid data, but SO_ZEROCOPY is off. This >+ * will trigger fallback to copy. >+ */ >+ { >+ .zerocopied = false, >+ .so_zerocopy = false, >+ .sendmsg_errno = 0, >+ .vecs_cnt = 1, >+ { >+ { NULL, PAGE_SIZE } >+ } >+ }, >+ /* Valid data, but message is bigger than peer's >+ * buffer, so this will trigger fallback to copy. >+ * This test is for SOCK_STREAM only, because >+ * for SOCK_SEQPACKET, 'sendmsg()' returns EMSGSIZE. >+ */ >+ { >+ .stream_only = true, >+ .zerocopied = false, >+ .so_zerocopy = true, >+ .sendmsg_errno = 0, >+ .vecs_cnt = 1, >+ { >+ { NULL, 100 * PAGE_SIZE } >+ } >+ }, >+}; >+ >+#define POLL_TIMEOUT_MS 100 >+ >+static void test_client(const struct test_opts *opts, >+ const struct vsock_test_data *test_data, >+ bool sock_seqpacket) >+{ >+ struct pollfd fds = { 0 }; >+ struct msghdr msg = { 0 }; >+ ssize_t sendmsg_res; >+ struct iovec *iovec; >+ int fd; >+ >+ if (sock_seqpacket) >+ fd = vsock_seqpacket_connect(opts->peer_cid, 1234); >+ else >+ fd = vsock_stream_connect(opts->peer_cid, 1234); >+ >+ if (fd < 0) { >+ perror("connect"); >+ exit(EXIT_FAILURE); >+ } >+ >+ if (test_data->so_zerocopy) >+ enable_so_zerocopy(fd); >+ >+ iovec = iovec_from_test_data(test_data->vecs, test_data->vecs_cnt); >+ >+ msg.msg_iov = iovec; >+ msg.msg_iovlen = test_data->vecs_cnt; >+ >+ errno = 0; >+ >+ sendmsg_res = sendmsg(fd, &msg, MSG_ZEROCOPY); >+ if (errno != test_data->sendmsg_errno) { >+ fprintf(stderr, "expected 'errno' == %i, got %i\n", >+ test_data->sendmsg_errno, errno); >+ exit(EXIT_FAILURE); >+ } >+ >+ if (!errno) { >+ if (sendmsg_res != iovec_bytes(iovec, test_data->vecs_cnt)) { >+ fprintf(stderr, "expected 'sendmsg()' == %li, got %li\n", >+ iovec_bytes(iovec, test_data->vecs_cnt), >+ sendmsg_res); >+ exit(EXIT_FAILURE); >+ } >+ } >+ >+ fds.fd = fd; >+ fds.events = 0; >+ >+ if (poll(&fds, 1, POLL_TIMEOUT_MS) < 0) { >+ perror("poll"); >+ exit(EXIT_FAILURE); >+ } >+ >+ if (fds.revents & POLLERR) { >+ vsock_recv_completion(fd, &test_data->zerocopied); >+ } else if (test_data->so_zerocopy && !test_data->sendmsg_errno) { >+ /* If we don't have data in the error queue, but >+ * SO_ZEROCOPY was enabled and 'sendmsg()' was >+ * successful - this is an error. >+ */ >+ fprintf(stderr, "POLLERR expected\n"); >+ exit(EXIT_FAILURE); >+ } >+ >+ if (!test_data->sendmsg_errno) >+ control_writeulong(iovec_hash_djb2(iovec, test_data->vecs_cnt)); >+ else >+ control_writeulong(0); >+ >+ control_writeln("DONE"); >+ free_iovec_test_data(test_data->vecs, iovec, test_data->vecs_cnt); >+ close(fd); >+} >+ >+void test_stream_msgzcopy_client(const struct test_opts *opts) >+{ >+ int i; >+ >+ for (i = 0; i < ARRAY_SIZE(test_data_array); i++) >+ test_client(opts, &test_data_array[i], false); >+} >+ >+void test_seqpacket_msgzcopy_client(const struct test_opts *opts) >+{ >+ int i; >+ >+ for (i = 0; i < ARRAY_SIZE(test_data_array); i++) { >+ if (test_data_array[i].stream_only) >+ continue; >+ >+ test_client(opts, &test_data_array[i], true); >+ } >+} >+ >+static void test_server(const struct test_opts *opts, >+ const struct vsock_test_data *test_data, >+ bool sock_seqpacket) >+{ >+ unsigned long remote_hash; >+ unsigned long local_hash; >+ ssize_t total_bytes_rec; >+ unsigned char *data; >+ size_t data_len; >+ int fd; >+ >+ if (sock_seqpacket) >+ fd = vsock_seqpacket_accept(VMADDR_CID_ANY, 1234, NULL); >+ else >+ fd = vsock_stream_accept(VMADDR_CID_ANY, 1234, NULL); >+ >+ if (fd < 0) { >+ perror("accept"); >+ exit(EXIT_FAILURE); >+ } >+ >+ data_len = iovec_bytes(test_data->vecs, test_data->vecs_cnt); >+ >+ data = malloc(data_len); >+ if (!data) { >+ perror("malloc"); >+ exit(EXIT_FAILURE); >+ } >+ >+ total_bytes_rec = 0; >+ >+ while (total_bytes_rec != data_len) { >+ ssize_t bytes_rec; >+ >+ bytes_rec = read(fd, data + total_bytes_rec, >+ data_len - total_bytes_rec); >+ if (bytes_rec <= 0) >+ break; >+ >+ total_bytes_rec += bytes_rec; >+ } >+ >+ if (test_data->sendmsg_errno == 0) >+ local_hash = hash_djb2(data, data_len); >+ else >+ local_hash = 0; >+ >+ free(data); >+ >+ /* Waiting for some result. */ >+ remote_hash = control_readulong(); >+ if (remote_hash != local_hash) { >+ fprintf(stderr, "hash mismatch\n"); >+ exit(EXIT_FAILURE); >+ } >+ >+ control_expectln("DONE"); >+ close(fd); >+} >+ >+void test_stream_msgzcopy_server(const struct test_opts *opts) >+{ >+ int i; >+ >+ for (i = 0; i < ARRAY_SIZE(test_data_array); i++) >+ test_server(opts, &test_data_array[i], false); >+} >+ >+void test_seqpacket_msgzcopy_server(const struct test_opts *opts) >+{ >+ int i; >+ >+ for (i = 0; i < ARRAY_SIZE(test_data_array); i++) { >+ if (test_data_array[i].stream_only) >+ continue; >+ >+ test_server(opts, &test_data_array[i], true); >+ } >+} >+ >+void test_stream_msgzcopy_empty_errq_client(const struct test_opts *opts) >+{ >+ struct msghdr msg = { 0 }; >+ char cmsg_data[128]; >+ ssize_t res; >+ int fd; >+ >+ fd = vsock_stream_connect(opts->peer_cid, 1234); >+ if (fd < 0) { >+ perror("connect"); >+ exit(EXIT_FAILURE); >+ } >+ >+ msg.msg_control = cmsg_data; >+ msg.msg_controllen = sizeof(cmsg_data); >+ >+ res = recvmsg(fd, &msg, MSG_ERRQUEUE); >+ if (res != -1) { >+ fprintf(stderr, "expected 'recvmsg(2)' failure, got %zi\n", >+ res); >+ exit(EXIT_FAILURE); >+ } >+ >+ control_writeln("DONE"); >+ close(fd); >+} >+ >+void test_stream_msgzcopy_empty_errq_server(const struct test_opts *opts) >+{ >+ int fd; >+ >+ fd = vsock_stream_accept(VMADDR_CID_ANY, 1234, NULL); >+ if (fd < 0) { >+ perror("accept"); >+ exit(EXIT_FAILURE); >+ } >+ >+ control_expectln("DONE"); >+ close(fd); >+} >diff --git a/tools/testing/vsock/vsock_test_zerocopy.h b/tools/testing/vsock/vsock_test_zerocopy.h >new file mode 100644 >index 000000000000..3ef2579e024d >--- /dev/null >+++ b/tools/testing/vsock/vsock_test_zerocopy.h >@@ -0,0 +1,15 @@ >+/* SPDX-License-Identifier: GPL-2.0-only */ >+#ifndef VSOCK_TEST_ZEROCOPY_H >+#define VSOCK_TEST_ZEROCOPY_H >+#include "util.h" >+ >+void test_stream_msgzcopy_client(const struct test_opts *opts); >+void test_stream_msgzcopy_server(const struct test_opts *opts); >+ >+void test_seqpacket_msgzcopy_client(const struct test_opts *opts); >+void test_seqpacket_msgzcopy_server(const struct test_opts *opts); >+ >+void test_stream_msgzcopy_empty_errq_client(const struct test_opts *opts); >+void test_stream_msgzcopy_empty_errq_server(const struct test_opts *opts); >+ >+#endif /* VSOCK_TEST_ZEROCOPY_H */ >-- >2.25.1 >