Received: by 2002:a05:6a10:206:0:0:0:0 with SMTP id 6csp4801021pxj; Wed, 12 May 2021 13:38:00 -0700 (PDT) X-Google-Smtp-Source: ABdhPJx49hHaM0rISL6HcUWttMan9qoQA08nxPmpCq8ScuVf+BS5z0wgDA7w/zHnixcYk9e4gtim X-Received: by 2002:a50:f9cc:: with SMTP id a12mr46251361edq.24.1620851880091; Wed, 12 May 2021 13:38:00 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1620851880; cv=none; d=google.com; s=arc-20160816; b=xd+5e/FUBUf/K3BMvPbNwKBdeVUQWn9d9Xj9zJ9UNxaFTraDDWZGfIU5zeHSezgQL2 zIGpAxy+vRfJxxgg+8zonZ1uTJrzmFN5Z6JJ8JDiLo+OR+VGUeCKLREoQGmvV/NJuuMH w3FTu2/C/XSeTk887zm/A6IRLewtEWt9xvpUov8t6eqS7/CMIZDgSmuU+BT0ThxzHtnN +xtzbqHJi5pvef9H7JTLfknNRGtY3wln23YSY44ZuxH5c3q3lq07kIJAs6Dm7KIHCrDV HnyjKmInvqTNPz4Aun2A4gvVfjAd8cH/OZMPTp6qrrosoAUrnH3OyPhBdABkzmvT54+Y 2Kiw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=syGZF7+djubx9pa7ZPTidnBUE+ock6AOR0GRlEeKssA=; b=e8r5zHoc5SePGDBs2xCsllN1wVfvn8/PYX5sFYgBz/yQIw23g8/wijcF4ffKIM7rVg IuQz5C14LOjOQn0GOJJtCp2utnBDcR0Y8UgEJzHZP5mnNDGftPy+wuFl+M+9WdOJc+e3 0V/6e0gcDg/qYfUdZwS1HankIsv0oJeBE7UUXWsls8nYLJrv/b/zg05lJ5SJOhwvmNyw JTf2M4ovZjSr5mBRN0HiiCU3+uLzMoBKDHF9TDqSLRzhhqrBrmmJjBlcnFAjaULKqgry YxicS/2ZmbLcFBKSwRXC8dOYpcQlOi+1Vmq4WtoPjwCXGh6isVnO3/A7Sn1Vb2t8m5vF SZ3w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b="Y7If/mRb"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id g13si497791edu.140.2021.05.12.13.36.40; Wed, 12 May 2021 13:38:00 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b="Y7If/mRb"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1386450AbhELUVK (ORCPT + 99 others); Wed, 12 May 2021 16:21:10 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:49918 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1384484AbhELT62 (ORCPT ); Wed, 12 May 2021 15:58:28 -0400 Received: from mail-qt1-x835.google.com (mail-qt1-x835.google.com [IPv6:2607:f8b0:4864:20::835]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 33F5FC06137C; Wed, 12 May 2021 12:53:56 -0700 (PDT) Received: by mail-qt1-x835.google.com with SMTP id t7so18177950qtn.3; Wed, 12 May 2021 12:53:56 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=syGZF7+djubx9pa7ZPTidnBUE+ock6AOR0GRlEeKssA=; b=Y7If/mRbIPQabIipvrYs+XR6pg0h45N6t3fp+NE68nX1LUDvs9tkhx3O9N61NQBagf E+DmWYzqEyvaCIwx0kFYhQAPFN/FCaDfoloPozKlx2M/CYZS8hYi/xTA6hVdJQVDZ/Ma tK/HUWkhjFUOBcI22gXsCCHz7aXuqGlj6/gI9sSllp3mRrsbg20y4RuhWMynXWLcPBKJ 4JZN0KzBwKJsgIhS0Yer/pzi2KN4ly0qNS8nM9QclLr9Rn/6QCAovFzRAcFxYlQM5acj rAiSYvzjfT1Ac2FRzpRbPGR75Dum8zk+1aZF+npfYQk4jaRKv4bO/3pmtBdarEyhuc28 HKfQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=syGZF7+djubx9pa7ZPTidnBUE+ock6AOR0GRlEeKssA=; b=tfBKEooSZGZ+OuKMf5A1yHicMoliahjRVKTsZKNSMkFFm0tXLBO6DCfNgRdl/UjxoK S12u3dtxNcrh/PdCUUXxtIehpHWXPiK9p2YgRJxr0Fmv6jSVZMrJMn1P83znMUCnxx+L NnrqGUDPQcQ7Y8E88+XkskGUbIj2mZEhE7xsW7E/F6oW1J1U0/fqZuVmJxLdbiKwxtOB 0IbWrRxvWkN+/TwYQzgJzcWCbou2MFFLnBo/r+sabNwM/lumlXNoXz2a3whnPgA2Gzxy DvSHGDD6vQoSXdSWUls7a9V/yyQLcSI2FcF1FpJPZlXxnmoj9XNWcPHyA0gdyOpW3dxV hiRw== X-Gm-Message-State: AOAM53228eFZjyu5x5yv8uDODnN2dxWfaYnibCosFiMc5CA8gSgkgQva ALZZ60saVVQGvcbd0jHOv4fPR3ioGu4cwNLjoUTXN3WVhdY= X-Received: by 2002:ac8:5850:: with SMTP id h16mr23134269qth.355.1620849234999; Wed, 12 May 2021 12:53:54 -0700 (PDT) MIME-Version: 1.0 References: <20210511044253.469034-1-yuri.benditovich@daynix.com> <20210511044253.469034-3-yuri.benditovich@daynix.com> In-Reply-To: From: Willem de Bruijn Date: Wed, 12 May 2021 15:53:17 -0400 Message-ID: Subject: Re: [PATCH 2/4] virtio-net: add support of UDP segmentation (USO) on the host To: Yuri Benditovich Cc: David Miller , Jakub Kicinski , "Michael S. Tsirkin" , Jason Wang , Network Development , linux-kernel , virtualization , Yan Vugenfirer Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, May 12, 2021 at 2:56 PM Yuri Benditovich wrote: > > On Wed, May 12, 2021 at 5:33 PM Willem de Bruijn > wrote: > > > > On Wed, May 12, 2021 at 2:10 AM Yuri Benditovich > > wrote: > > > > > > On Tue, May 11, 2021 at 8:48 PM Willem de Bruijn > > > wrote: > > > > > > > > On Tue, May 11, 2021 at 12:43 AM Yuri Benditovich > > > > wrote: > > > > > > > > > > Large UDP packet provided by the guest with GSO type set to > > > > > VIRTIO_NET_HDR_GSO_UDP_L4 will be divided to several UDP > > > > > packets according to the gso_size field. > > > > > > > > > > Signed-off-by: Yuri Benditovich > > > > > --- > > > > > include/linux/virtio_net.h | 5 +++++ > > > > > 1 file changed, 5 insertions(+) > > > > > > > > > > diff --git a/include/linux/virtio_net.h b/include/linux/virtio_net.h > > > > > index b465f8f3e554..4ecf9a1ca912 100644 > > > > > --- a/include/linux/virtio_net.h > > > > > +++ b/include/linux/virtio_net.h > > > > > @@ -51,6 +51,11 @@ static inline int virtio_net_hdr_to_skb(struct sk_buff *skb, > > > > > ip_proto = IPPROTO_UDP; > > > > > thlen = sizeof(struct udphdr); > > > > > break; > > > > > + case VIRTIO_NET_HDR_GSO_UDP_L4: > > > > > + gso_type = SKB_GSO_UDP_L4; > > > > > + ip_proto = IPPROTO_UDP; > > > > > + thlen = sizeof(struct udphdr); > > > > > + break; > > > > > > > > If adding a new VIRTIO_NET_HDR type I suggest adding separate IPv4 and > > > > IPv6 variants, analogous to VIRTIO_NET_HDR_GSO_TCPV[46]. To avoid > > > > having to infer protocol again, as for UDP fragmentation offload (the > > > > retry case below this code). > > > > > > Thank you for denoting this important point of distinguishing between v4 and v6. > > > Let's try to take a deeper look to see what is the correct thing to do > > > and please correct me if I'm wrong: > > > 1. For USO we do not need to guess the protocol as it is used with > > > VIRTIO_NET_HDR_F_NEEDS_CSUM (unlike UFO) > > > > Enforcing that is a good start. We should also enforce that > > skb->protocol is initialized to one of htons(ETH_P_IP) or > > htons(ETH_P_IPV6), so that it does not have to be inferred by parsing. > > As this feature is new and is not used in any public release of any > misbehaving driver, probably it is enough to state in the spec that > VIRTIO_NET_HDR_F_NEEDS_CSUM is required for USO packets. > The spec states that the USO feature requires checksumming feature. The spec is not sufficient. These rules need to be enforced in the kernel code, too. > > > > These requirements were not enforced for previous values, and cannot > > be introduced afterwards, which has led to have to add that extra code > > to handle these obscure edge cases. > > > > I agree that with well behaved configurations, the need for separate > > _V4 and _V6 variants is not needed. > > > > > and the USO packets > > > transmitted by the guest are under the same clause as both > > > VIRTIO_NET_HDR_GSO_TCP, i.e. under if (hdr->flags & > > > VIRTIO_NET_HDR_F_NEEDS_CSUM) { > > > 2. If we even define VIRTIO_NET_HDR_GSO_UDPv4_L4 and > > > VIRTIO_NET_HDR_GSO_UDPv6_L4 - both will be translated to > > > SKB_GSO_UDP_L4, so this information is immediately lost (the code will > > > look like: > > > case VIRTIO_NET_HDR_GSO_UDP4_L4: case VIRTIO_NET_HDR_GSO_UDP6_L4 > > > gso_type = SKB_GSO_UDP; > > > > > > 3. When we will define the respective guest features (like > > > VIRTIO_NET_F_HOST_USO4 VIRTIO_NET_F_HOST_USO6) we will need to > This is my typo: VIRTIO_NET_F_GUEST_USO4... > > > recreate the virtio_net header from the skb when both v4 and v6 have > > > the same SKB_GSO_UDP_L4, (see virtio_net_hdr_from_skb) and I'm not > > > sure whether somebody needs the exact v4 or v6 information on guest RX > > > path. > > > > FWIW, it is good to keep in mind that virtio_net_hdr is also used > > outside virtio, in both ingress and egress paths. > > Can you please elaborate in which scenarios we do not have any virtio > device in path but need virtio_net_hdr? Packet sockets, tuntap. > > > 4. What is completely correct is that when we will start working with > > > the guest RX path we will need to define something like NETIF_F_USO4 > > > and NETIF_F_USO6 and configure them according to exact guest offload > > > capabilities. > > > Do you agree? > > > > I don't immediately see the need for advertising this device feature > > on a per-protocol basis. Can you elaborate? > > Separate offload setting (controlled by the guest) for v4 and v6 in > guest RX path is mandatory, at least Windows always requires this for > any offload. > In this case it seems easy to have also virtio-net device features to > be indicated separately (the TAP/TUN should report its capabilities). Ah, ok.