From: Greg Kroah-Hartman
To: linux-kernel@vger.kernel.org
Cc: Greg Kroah-Hartman, stable@vger.kernel.org, Xuan Zhuo, Eric Dumazet, "Michael S. Tsirkin", Jason Wang, virtualization@lists.linux-foundation.org, "David S.
 Miller"
Subject: [PATCH 5.11 062/210] virtio_net: Do not pull payload in skb->head
Date: Mon, 12 Apr 2021 10:39:27 +0200
Message-Id: <20210412084018.062277755@linuxfoundation.org>
X-Mailer: git-send-email 2.31.1
In-Reply-To: <20210412084016.009884719@linuxfoundation.org>
References: <20210412084016.009884719@linuxfoundation.org>
User-Agent: quilt/0.66
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Precedence: bulk
X-Mailing-List: linux-kernel@vger.kernel.org

From: Eric Dumazet

commit 0f6925b3e8da0dbbb52447ca8a8b42b371aac7db upstream.

Xuan Zhuo reported that commit 3226b158e67c ("net: avoid 32 x truesize
under-estimation for tiny skbs") brought a ~10% performance drop.

The reason for the performance drop was that GRO was forced
to chain sk_buff (using skb_shinfo(skb)->frag_list), which
not only uses more memory but also causes packet consumers to
incur a lot of overhead handling all the tiny skbs.

It turns out that virtio_net page_to_skb() has a wrong strategy:
it allocates skbs with GOOD_COPY_LEN (128) bytes in skb->head, then
copies 128 bytes from the page, before feeding the packet to the GRO
stack.

This was suboptimal before commit 3226b158e67c ("net: avoid 32 x truesize
under-estimation for tiny skbs") because GRO was using 2 frags per MSS,
meaning we were not packing MSS with 100% efficiency.

The fix is to pull only the Ethernet header in page_to_skb().

Then, we change virtio_net_hdr_to_skb() to pull the missing
headers, instead of assuming they were already pulled by callers.

This fixes the performance regression, but could also allow virtio_net
to accept packets with more than 128 bytes of headers.

Many thanks to Xuan Zhuo for his report, and his tests/help.

Fixes: 3226b158e67c ("net: avoid 32 x truesize under-estimation for tiny skbs")
Reported-by: Xuan Zhuo
Link: https://www.spinics.net/lists/netdev/msg731397.html
Co-Developed-by: Xuan Zhuo
Signed-off-by: Xuan Zhuo
Signed-off-by: Eric Dumazet
Cc: "Michael S. Tsirkin"
Cc: Jason Wang
Cc: virtualization@lists.linux-foundation.org
Acked-by: Jason Wang
Signed-off-by: David S. Miller
Signed-off-by: Greg Kroah-Hartman
---
 drivers/net/virtio_net.c   |   10 +++++++---
 include/linux/virtio_net.h |   14 +++++++++-----
 2 files changed, 16 insertions(+), 8 deletions(-)

--- a/drivers/net/virtio_net.c
+++ b/drivers/net/virtio_net.c
@@ -406,9 +406,13 @@ static struct sk_buff *page_to_skb(struc
 	offset += hdr_padded_len;
 	p += hdr_padded_len;
 
-	copy = len;
-	if (copy > skb_tailroom(skb))
-		copy = skb_tailroom(skb);
+	/* Copy all frame if it fits skb->head, otherwise
+	 * we let virtio_net_hdr_to_skb() and GRO pull headers as needed.
+	 */
+	if (len <= skb_tailroom(skb))
+		copy = len;
+	else
+		copy = ETH_HLEN + metasize;
 	skb_put_data(skb, p, copy);
 
 	if (metasize) {
--- a/include/linux/virtio_net.h
+++ b/include/linux/virtio_net.h
@@ -65,14 +65,18 @@ static inline int virtio_net_hdr_to_skb(
 	skb_reset_mac_header(skb);
 
 	if (hdr->flags & VIRTIO_NET_HDR_F_NEEDS_CSUM) {
-		u16 start = __virtio16_to_cpu(little_endian, hdr->csum_start);
-		u16 off = __virtio16_to_cpu(little_endian, hdr->csum_offset);
+		u32 start = __virtio16_to_cpu(little_endian, hdr->csum_start);
+		u32 off = __virtio16_to_cpu(little_endian, hdr->csum_offset);
+		u32 needed = start + max_t(u32, thlen, off + sizeof(__sum16));
+
+		if (!pskb_may_pull(skb, needed))
+			return -EINVAL;
 
 		if (!skb_partial_csum_set(skb, start, off))
 			return -EINVAL;
 
 		p_off = skb_transport_offset(skb) + thlen;
-		if (p_off > skb_headlen(skb))
+		if (!pskb_may_pull(skb, p_off))
 			return -EINVAL;
 	} else {
 		/* gso packets without NEEDS_CSUM do not set transport_offset.
@@ -102,14 +106,14 @@ retry:
 			}
 
 			p_off = keys.control.thoff + thlen;
-			if (p_off > skb_headlen(skb) ||
+			if (!pskb_may_pull(skb, p_off) ||
 			    keys.basic.ip_proto != ip_proto)
 				return -EINVAL;
 
 			skb_set_transport_header(skb, keys.control.thoff);
 		} else if (gso_type) {
 			p_off = thlen;
-			if (p_off > skb_headlen(skb))
+			if (!pskb_may_pull(skb, p_off))
 				return -EINVAL;
 		}
 	}
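
For readers who want to check the arithmetic outside the kernel, the two
decisions this patch changes can be sketched as plain userspace C. This is
only an illustration, not kernel code: the helper names (head_copy_old,
head_copy_new, csum_needed) and the SKETCH_* constants are made up for this
note, and skb_tailroom(), metasize and thlen are passed in as plain integers
rather than read from a real sk_buff. The constants are assumed to match the
kernel's ETH_HLEN (14) and sizeof(__sum16) (2).

```c
#include <stddef.h>
#include <stdint.h>

/* Assumed to match the kernel's ETH_HLEN and sizeof(__sum16). */
#define SKETCH_ETH_HLEN  14u
#define SKETCH_CSUM_SIZE 2u

/* Pre-patch policy in page_to_skb(): copy as much of the frame as fits
 * in skb->head, so a large frame always has 'tailroom' bytes pulled.
 */
static inline size_t head_copy_old(size_t len, size_t tailroom)
{
	return len > tailroom ? tailroom : len;
}

/* Post-patch policy: copy the whole frame only if it fits; otherwise
 * copy just the metadata plus the Ethernet header and leave the rest
 * in the page fragment for later pulls.
 */
static inline size_t head_copy_new(size_t len, size_t tailroom,
				   size_t metasize)
{
	if (len <= tailroom)
		return len;
	return SKETCH_ETH_HLEN + metasize;
}

/* Mirrors the 'needed' expression added to virtio_net_hdr_to_skb():
 * start + max(thlen, off + sizeof(__sum16)).
 * The 16-bit header fields are widened to 32 bits before the addition,
 * so the sum cannot wrap the way a 16-bit sum could.
 */
static inline uint32_t csum_needed(uint16_t start, uint16_t off,
				   uint32_t thlen)
{
	uint32_t csum_end = (uint32_t)off + SKETCH_CSUM_SIZE;
	uint32_t span = thlen > csum_end ? thlen : csum_end;

	return (uint32_t)start + span;
}
```

With a 1500-byte frame, 128 bytes of tailroom and no metadata, the old policy
copies 128 bytes into skb->head while the new one copies 14. For a TCP/IPv4
frame with csum_start 34, csum_offset 16 and thlen 20, csum_needed() gives
34 + max(20, 18) = 54 bytes, which is the length handed to pskb_may_pull()
in the patch.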