Received: by 2002:a05:6358:3188:b0:123:57c1:9b43 with SMTP id q8csp2820709rwd; Fri, 9 Jun 2023 18:17:44 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ7z9s6/7Ckoq9WLS652p7LZV7RTvCA9+NhcOaK4yJ8noJLXXy3HJN7GnE7+Us7j4gzfifnX X-Received: by 2002:a17:90b:4c49:b0:256:23f:abc5 with SMTP id np9-20020a17090b4c4900b00256023fabc5mr2261792pjb.10.1686359864156; Fri, 09 Jun 2023 18:17:44 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1686359864; cv=none; d=google.com; s=arc-20160816; b=0TEvNFR+Eluyb2GMGOlnXzvM5uekk9HtzV39ufddKqqvtqE2GsS8LxY1I/ZugHeagp WZWhoEyd2M9+EwkkUQj3B3XnaBGLSSlXaTTBFOHvJw6Cxo/lr1zprK1Y8FOdGpdL6cx1 hrvEeOC+VR4VL4hK3uKsYq2OnoGr2qj+pepTf4FBOaSU2gbIa2QGHoCpVBDbv3E7LvsZ sZv2aq9SjeLfLqSkVdr5Da/6NrQXor79VJGxG/pIRUz2CShD5wbFI3nQaCjm6llFfxhB YffRl0cca77N5ZqLSbqEAkyxpKy7WUr2EnaXxRLGlZBqeWUU6yPj+Hr7oHIK+CJI5/5X 4TbQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:content-transfer-encoding:mime-version :message-id:date:subject:from:dkim-signature; bh=2cTgA2QhKCUmFzlxsj9JpU4bWG7UYDuiM5WyYDPIkg4=; b=HZXFepSPkz37wXQz0V95bXgDSrTgh/Kq5Ge5mAyA1aMCGvGyW3MVLYoBdpezgh8Hum Ls+796blm9RByUSA8wiSDMqhEde+O+kkStBJMPixVmFZTIHjAOtGpTkSrJ/G+qUvYLeR kpS0WKpOpIf+5m6QlxIqKOqyx6GfbTYRjePOnNDs/GT/NasppsUWqYrxxbEDgGeeRQfQ xHXJ+lGdLtOvcPnpiKWYIo4oklxSzu829FpLldFufzRlxjw2Tbaxf8pyZGz6RQxKePfN oyIU0U7sDtDyRPBSl7/G0cKyl3wo2H6KukVa5cf4DspsNdDf5TE+mqJ54maS20TkhFNn UHwg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@bytedance.com header.s=google header.b=dLF7z+tZ; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=bytedance.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id z70-20020a633349000000b00534dbd0d219si3563227pgz.301.2023.06.09.18.17.32; Fri, 09 Jun 2023 18:17:44 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@bytedance.com header.s=google header.b=dLF7z+tZ; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=bytedance.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233009AbjFJA7O (ORCPT + 99 others); Fri, 9 Jun 2023 20:59:14 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:55138 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230044AbjFJA7M (ORCPT ); Fri, 9 Jun 2023 20:59:12 -0400 Received: from mail-qv1-xf35.google.com (mail-qv1-xf35.google.com [IPv6:2607:f8b0:4864:20::f35]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 550CE269A for ; Fri, 9 Jun 2023 17:58:45 -0700 (PDT) Received: by mail-qv1-xf35.google.com with SMTP id 6a1803df08f44-6261a25e9b6so17412156d6.0 for ; Fri, 09 Jun 2023 17:58:45 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=bytedance.com; s=google; t=1686358724; x=1688950724; h=cc:to:content-transfer-encoding:mime-version:message-id:date :subject:from:from:to:cc:subject:date:message-id:reply-to; bh=2cTgA2QhKCUmFzlxsj9JpU4bWG7UYDuiM5WyYDPIkg4=; b=dLF7z+tZblaKj8qNrPX91Y7xjsg+J5JkLmiZYvjhDAXHtXewod39Dz83gAjfKdwWNt YQkx7tRWBTLL9HOucP6hTOnK134IGM8NSI2eF0IrjfochR1UMseQwUeZPoJGou56FT5p GhjlIKASDPw7MoZ47M31OxTpwpdGJsXToAtCtMaFfTVHL1+7jiY7rEmhsrYakDRN2r1c BAO7cFSt6UXTpfo73gWG5JjedCC2o6wD8GiNDdUHmXr+AV8LhOVaUeNcDMfHL+SRlMR7 fYrPc0ZDQqAJWP51llTEpj9IeD+KgBhI8QmBUCU/uHLFs4H9rlej/tdhDAWqtLSq8ZR4 9/lw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1686358724; x=1688950724; h=cc:to:content-transfer-encoding:mime-version:message-id:date :subject:from:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=2cTgA2QhKCUmFzlxsj9JpU4bWG7UYDuiM5WyYDPIkg4=; b=PvMNXFH5Zx/uNbfKT2avVwLt0ib3MqgFI4z8B3wgJiYEvrTvfDwYMqClL0k0JN/HwN /IPhz63ZvBZ1o0zkfTfSPjq7BWTFGmKxm7inGkIO6og065Ynt745bQrK7+Gylnrr5UMb wPon/oi61nmZGkb3kjs4/Vg/O3m9cMREb2vhPsqTCFyGgx6GWw3kL7/mjgdSDJBwNp3b xP16+UAt4gbM3sFizS7iRe7linvMabBLuNBWCdJoNjDAez1CzBHrNAhMLmOoT2ePadqq NvzIIBbdmBI6vfapMakBLdyKTWAevjDe5+JwcnXuQ9EK+4zFFb30ns5cJS7sHiV1VXIu thBw== X-Gm-Message-State: AC+VfDxb9WErxK3EXGTuhqC1xKBRuEiYaOVExW7Qu6LE04fi5924q9+4 dPQL/TzNgxqTVZjkCdZbGn5iMg== X-Received: by 2002:ad4:5aee:0:b0:626:2527:54a6 with SMTP id c14-20020ad45aee000000b00626252754a6mr3741754qvh.8.1686358724421; Fri, 09 Jun 2023 17:58:44 -0700 (PDT) Received: from [172.17.0.4] ([130.44.212.126]) by smtp.gmail.com with ESMTPSA id x17-20020a0ce251000000b00606750abaf9sm1504075qvl.136.2023.06.09.17.58.43 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 09 Jun 2023 17:58:44 -0700 (PDT) From: Bobby Eshleman Subject: [PATCH RFC net-next v4 0/8] virtio/vsock: support datagrams Date: Sat, 10 Jun 2023 00:58:27 +0000 Message-Id: <20230413-b4-vsock-dgram-v4-0-0cebbb2ae899@bytedance.com> MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 7bit X-B4-Tracking: v=1; b=H4sIALPKg2QC/3WOQQrCMBBFryKzdiSmMUVXguAB3EoXSTq2QZpIE kJL6d1Nu3Dn8s3/nzczRAqWIlx2MwTKNlrvCoj9DkyvXEdo28LAGa+YOFaoBebozRvbLqgBKy1 JK6mkoBOUkVaRUAflTL/OHCV0NKY1+gR62XFzPeFxv623X94U6G1MPkzbL5lvtX/azJEhq8/G1 IZIcrrqKVFbtHQwfoBmWZYvssY4TtwAAAA= To: Stefan Hajnoczi , Stefano Garzarella , "Michael S. Tsirkin" , Jason Wang , Xuan Zhuo , "David S. Miller" , Eric Dumazet , Jakub Kicinski , Paolo Abeni , "K. Y. Srinivasan" , Haiyang Zhang , Wei Liu , Dexuan Cui , Bryan Tan , Vishnu Dasa , VMware PV-Drivers Reviewers Cc: Dan Carpenter , Simon Horman , Krasnov Arseniy , kvm@vger.kernel.org, virtualization@lists.linux-foundation.org, netdev@vger.kernel.org, linux-kernel@vger.kernel.org, linux-hyperv@vger.kernel.org, bpf@vger.kernel.org, Bobby Eshleman , Jiang Wang X-Mailer: b4 0.12.2 X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hey all! This series introduces support for datagrams to virtio/vsock. It is a spin-off (and smaller version) of this series from the summer: https://lore.kernel.org/all/cover.1660362668.git.bobby.eshleman@bytedance.com/ Please note that this is an RFC and should not be merged until associated changes are made to the virtio specification, which will follow after discussion from this series. Another aside, the v4 of the series has only been mildly tested with a run of tools/testing/vsock/vsock_test. Some code likely needs cleaning up, but I'm hoping to get some of the design choices agreed upon before spending too much time making it pretty. This series first supports datagrams in a basic form for virtio, and then optimizes the sendpath for all datagram transports. The result is a very fast datagram communication protocol that outperforms even UDP on multi-queue virtio-net w/ vhost on a variety of multi-threaded workload samples. For those that are curious, some summary data comparing UDP and VSOCK DGRAM (N=5): vCPUS: 16 virtio-net queues: 16 payload size: 4KB Setup: bare metal + vm (non-nested) UDP: 287.59 MB/s VSOCK DGRAM: 509.2 MB/s Some notes about the implementation... This datagram implementation forces datagrams to self-throttle according to the threshold set by sk_sndbuf. It behaves similar to the credits used by streams in its effect on throughput and memory consumption, but it is not influenced by the receiving socket as credits are. The device drops packets silently. As discussed previously, this series introduces datagrams and defers fairness to future work. See discussion in v2 for more context around datagrams, fairness, and this implementation. Signed-off-by: Bobby Eshleman --- Changes in v4: - style changes - vsock: use sk_vsock(vsk) in vsock_dgram_recvmsg instead of &sk->vsk - vsock: fix xmas tree declaration - vsock: fix spacing issues - virtio/vsock: virtio_transport_recv_dgram returns void because err unused - sparse analysis warnings/errors - virtio/vsock: fix unitialized skerr on destroy - virtio/vsock: fix uninitialized err var on goto out - vsock: fix declarations that need static - vsock: fix __rcu annotation order - bugs - vsock: fix null ptr in remote_info code - vsock/dgram: make transport_dgram a fallback instead of first priority - vsock: remove redundant rcu read lock acquire in getname() - tests - add more tests (message bounds and more) - add vsock_dgram_bind() helper - add vsock_dgram_connect() helper Changes in v3: - Support multi-transport dgram, changing logic in connect/bind to support VMCI case - Support per-pkt transport lookup for sendto() case - Fix dgram_allow() implementation - Fix dgram feature bit number (now it is 3) - Fix binding so dgram and connectible (cid,port) spaces are non-overlapping - RCU protect transport ptr so connect() calls never leave a lockless read of the transport and remote_addr are always in sync - Link to v2: https://lore.kernel.org/r/20230413-b4-vsock-dgram-v2-0-079cc7cee62e@bytedance.com --- Bobby Eshleman (7): vsock/dgram: generalize recvmsg and drop transport->dgram_dequeue vsock: refactor transport lookup code vsock: support multi-transport datagrams vsock: make vsock bind reusable virtio/vsock: add VIRTIO_VSOCK_F_DGRAM feature bit virtio/vsock: support dgrams vsock: Add lockless sendmsg() support Jiang Wang (1): tests: add vsock dgram tests drivers/vhost/vsock.c | 44 ++- include/linux/virtio_vsock.h | 13 +- include/net/af_vsock.h | 52 ++- include/uapi/linux/virtio_vsock.h | 2 + net/vmw_vsock/af_vsock.c | 616 ++++++++++++++++++++++++++------ net/vmw_vsock/diag.c | 10 +- net/vmw_vsock/hyperv_transport.c | 42 ++- net/vmw_vsock/virtio_transport.c | 28 +- net/vmw_vsock/virtio_transport_common.c | 226 +++++++++--- net/vmw_vsock/vmci_transport.c | 152 ++++---- net/vmw_vsock/vsock_bpf.c | 10 +- net/vmw_vsock/vsock_loopback.c | 13 +- tools/testing/vsock/util.c | 141 +++++++- tools/testing/vsock/util.h | 6 + tools/testing/vsock/vsock_test.c | 432 ++++++++++++++++++++++ 15 files changed, 1533 insertions(+), 254 deletions(-) --- base-commit: 28cfea989d6f55c3d10608eba2a2bae609c5bf3e change-id: 20230413-b4-vsock-dgram-3b6eba6a64e5 Best regards, -- Bobby Eshleman