Received: by 2002:ab2:710b:0:b0:1ef:a325:1205 with SMTP id z11csp1846027lql; Wed, 13 Mar 2024 09:40:05 -0700 (PDT) X-Forwarded-Encrypted: i=3; AJvYcCWlKo5ck3WXg8fZSa8ieIkpcew0ren8udJCzlTJSAWQ9hkqYmwIjfjRx05E3iOdhLe0zdUynubWp5UvaQZsMlSlDIppLgP0vLac6JXOSw== X-Google-Smtp-Source: AGHT+IFwnziuAT9QE+FD2/SZIeKLDzzMSLG+02Zq4lmn+kHyXvF3ME7RRFDdgB8crpIuBOz5KBMG X-Received: by 2002:ac8:5ac1:0:b0:42e:d6cb:669d with SMTP id d1-20020ac85ac1000000b0042ed6cb669dmr7514838qtd.60.1710348005742; Wed, 13 Mar 2024 09:40:05 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1710348005; cv=pass; d=google.com; s=arc-20160816; b=lxpkCP8W0wNQGDOlXmG9Sc3C+L1HqSKmcXQUZ9Q4euCwbfUNtP6xiSNv9iczgAjSUi ZaDG87w1edbZsRvnUAZjyGBNm3iKdA1dgMJ4dsAtCOJiLScoiIrvfrJg/nDjGKi4ob7L XT4SurssmkPkfn3cwwkVgkzq4rE9ypmdvq+GiINEFcB/KGhB7BebE0XOHDf5kFqKaZuh Q0r+7yavphFvxC57qh1RQfps+iCevWrdSnQ17DDef8blWo3p5b9EV23FNwGvrL9s0tEy 88dEFYIrmWEEEyM4qOPkx9YCKEFUEyUeLX1XGD1znFr90MGsJ/DrUy9Z5j6qa29a4M20 2lTA== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:mime-version:list-unsubscribe :list-subscribe:list-id:precedence:references:in-reply-to:message-id :date:subject:cc:to:from:dkim-signature; bh=unW3cpRaiI6vqyLwQ1tep+g3qDBVbou61wtFw53Wgas=; fh=LmQDM/ecgqarGnmytzKj+4NLExy/IosUpa7xx3iFziU=; b=uTkWP/Rcqm7VtvilNo4T1vjZ4TjxHlk5BIm/fQ7cuR/Cm+AZAYJi6ybXzXOY8Yq17m aUCQrtKgvrLMx8VI3C3pWwYI7gaJreKtHK+HAljM3RAFgD/XHgLWhV5RnMtZnP89Q2vj lwk7FuzR80rbHVHxxDb/SV2OIA5k4hX6XItabppA44dgB06EwHj6sRoxZfdXjBr9+lSC DjAUuynLFBf0gXa9EEeLv228Veb+hHFzlXpG2htD/T2R5s9PaYeeTFa0VuwlC75OWx1L xu4f1FC6iF1VYHl8pxw/XwDDhng6H1XPW6KsrflKbmQC2zqpAwjJJP4UE7fg1poMPNPC hdOQ==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=dPVFhVbr; arc=pass (i=1 dkim=pass dkdomain=kernel.org); spf=pass (google.com: domain of linux-kernel+bounces-101771-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-101771-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from ny.mirrors.kernel.org (ny.mirrors.kernel.org. [2604:1380:45d1:ec00::1]) by mx.google.com with ESMTPS id w19-20020ac857d3000000b0042eef656654si10160871qta.724.2024.03.13.09.40.05 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 13 Mar 2024 09:40:05 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel+bounces-101771-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) client-ip=2604:1380:45d1:ec00::1; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=dPVFhVbr; arc=pass (i=1 dkim=pass dkdomain=kernel.org); spf=pass (google.com: domain of linux-kernel+bounces-101771-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-101771-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ny.mirrors.kernel.org (Postfix) with ESMTPS id 692971C20EE3 for ; Wed, 13 Mar 2024 16:40:05 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 78AAF612C3; Wed, 13 Mar 2024 16:33:19 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b="dPVFhVbr" Received: from smtp.kernel.org (aws-us-west-2-korg-mail-1.web.codeaurora.org [10.30.226.201]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 7AD2560EDE; Wed, 13 Mar 2024 16:33:18 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=10.30.226.201 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1710347598; cv=none; b=Nk9dgaFfhY6sKvf4NQ2U+2SAJSRBxpmwmxtLMGPqaQqgEExoXW0lJo6Th0s7+Evo3xaoYxtw2u9tjV6jAu1hjKwa95nKuOuejAsan1gEsPpqRU/KQEbty+Qsm7ZeJkfC4W3WmKw0Rg+cX0ffHzb+13Jk7j+kB9BJD4xPX9hZvC0= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1710347598; c=relaxed/simple; bh=DdhodGuJ/Hd78C+APs5YBJ6Mhf3aD4i2/yO0SCt0KsY=; h=From:To:Cc:Subject:Date:Message-ID:In-Reply-To:References: MIME-Version; b=iCC083lYC2qCWPb87gDLbaK8LCkmH5U+NvXK7uQIB3lB/tyjaMyG8jKVDIM0TPSyE1jYP5DrrmJVhMfEy67gc0/agMJtveRZ6edpfHdK01Jtn2eXsTFaesJ/LaYhgLWI+sgcxKry1OESGvFDkMxU5/qcVRp63NZiqdigy3iBKnQ= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b=dPVFhVbr; arc=none smtp.client-ip=10.30.226.201 Received: by smtp.kernel.org (Postfix) with ESMTPSA id 1DE12C43394; Wed, 13 Mar 2024 16:33:17 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1710347598; bh=DdhodGuJ/Hd78C+APs5YBJ6Mhf3aD4i2/yO0SCt0KsY=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=dPVFhVbrS3k/qTHIAdN19wLyDQThGrm7OLjkDnaPkieC220M824bwr4OxiT1yZ5Zd 3hO0khUqog6UypxpvRQRDJF1znd5p9ms+s9DV94+DahMfDjync2H9mSFOkH3rVpggT zmu0wJ0QmwcabaxhDXPwJHKDkJ/uKhGv9x0W648oMIpKi3OcjVAhzsyTZn54o+bRgU P/3aWs3Fhh1Bsco+msZvlYdIoUmxrGNKqlfMLThjacRlrSgLRmVkNXSppRGbcgu6M1 7he9MKC4PgkIDSiBH2tAbIX4HJZlzCKaJ2KZUGjMkkYTa+XqFKldO3tcCjjp1UH0OI M1t4WnQzYAu0g== From: Sasha Levin To: linux-kernel@vger.kernel.org, stable@vger.kernel.org Cc: Florian Kauer , Maciej Fijalkowski , Naama Meir , Tony Nguyen , Sasha Levin Subject: [PATCH 6.7 32/61] igc: avoid returning frame twice in XDP_REDIRECT Date: Wed, 13 Mar 2024 12:32:07 -0400 Message-ID: <20240313163236.613880-33-sashal@kernel.org> X-Mailer: git-send-email 2.43.0 In-Reply-To: <20240313163236.613880-1-sashal@kernel.org> References: <20240313163236.613880-1-sashal@kernel.org> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 X-KernelTest-Patch: http://kernel.org/pub/linux/kernel/v6.x/stable-review/patch-6.7.10-rc1.gz X-KernelTest-Tree: git://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-stable-rc.git X-KernelTest-Branch: linux-6.7.y X-KernelTest-Patches: git://git.kernel.org/pub/scm/linux/kernel/git/stable/stable-queue.git X-KernelTest-Version: 6.7.10-rc1 X-KernelTest-Deadline: 2024-03-15T16:32+00:00 X-stable: review X-Patchwork-Hint: Ignore Content-Transfer-Encoding: 8bit From: Florian Kauer [ Upstream commit ef27f655b438bed4c83680e4f01e1cde2739854b ] When a frame can not be transmitted in XDP_REDIRECT (e.g. due to a full queue), it is necessary to free it by calling xdp_return_frame_rx_napi. However, this is the responsibility of the caller of the ndo_xdp_xmit (see for example bq_xmit_all in kernel/bpf/devmap.c) and thus calling it inside igc_xdp_xmit (which is the ndo_xdp_xmit of the igc driver) as well will lead to memory corruption. In fact, bq_xmit_all expects that it can return all frames after the last successfully transmitted one. Therefore, break for the first not transmitted frame, but do not call xdp_return_frame_rx_napi in igc_xdp_xmit. This is equally implemented in other Intel drivers such as the igb. There are two alternatives to this that were rejected: 1. Return num_frames as all the frames would have been transmitted and release them inside igc_xdp_xmit. While it might work technically, it is not what the return value is meant to represent (i.e. the number of SUCCESSFULLY transmitted packets). 2. Rework kernel/bpf/devmap.c and all drivers to support non-consecutively dropped packets. Besides being complex, it likely has a negative performance impact without a significant gain since it is anyway unlikely that the next frame can be transmitted if the previous one was dropped. The memory corruption can be reproduced with the following script which leads to a kernel panic after a few seconds. It basically generates more traffic than a i225 NIC can transmit and pushes it via XDP_REDIRECT from a virtual interface to the physical interface where frames get dropped. #!/bin/bash INTERFACE=enp4s0 INTERFACE_IDX=`cat /sys/class/net/$INTERFACE/ifindex` sudo ip link add dev veth1 type veth peer name veth2 sudo ip link set up $INTERFACE sudo ip link set up veth1 sudo ip link set up veth2 cat << EOF > redirect.bpf.c SEC("prog") int redirect(struct xdp_md *ctx) { return bpf_redirect($INTERFACE_IDX, 0); } char _license[] SEC("license") = "GPL"; EOF clang -O2 -g -Wall -target bpf -c redirect.bpf.c -o redirect.bpf.o sudo ip link set veth2 xdp obj redirect.bpf.o cat << EOF > pass.bpf.c SEC("prog") int pass(struct xdp_md *ctx) { return XDP_PASS; } char _license[] SEC("license") = "GPL"; EOF clang -O2 -g -Wall -target bpf -c pass.bpf.c -o pass.bpf.o sudo ip link set $INTERFACE xdp obj pass.bpf.o cat << EOF > trafgen.cfg { /* Ethernet Header */ 0xe8, 0x6a, 0x64, 0x41, 0xbf, 0x46, 0xFF, 0xFF, 0xFF, 0xFF, 0xFF, 0xFF, const16(ETH_P_IP), /* IPv4 Header */ 0b01000101, 0, # IPv4 version, IHL, TOS const16(1028), # IPv4 total length (UDP length + 20 bytes (IP header)) const16(2), # IPv4 ident 0b01000000, 0, # IPv4 flags, fragmentation off 64, # IPv4 TTL 17, # Protocol UDP csumip(14, 33), # IPv4 checksum /* UDP Header */ 10, 0, 1, 1, # IP Src - adapt as needed 10, 0, 1, 2, # IP Dest - adapt as needed const16(6666), # UDP Src Port const16(6666), # UDP Dest Port const16(1008), # UDP length (UDP header 8 bytes + payload length) csumudp(14, 34), # UDP checksum /* Payload */ fill('W', 1000), } EOF sudo trafgen -i trafgen.cfg -b3000MB -o veth1 --cpp Fixes: 4ff320361092 ("igc: Add support for XDP_REDIRECT action") Signed-off-by: Florian Kauer Reviewed-by: Maciej Fijalkowski Tested-by: Naama Meir Signed-off-by: Tony Nguyen Signed-off-by: Sasha Levin --- drivers/net/ethernet/intel/igc/igc_main.c | 13 ++++++------- 1 file changed, 6 insertions(+), 7 deletions(-) diff --git a/drivers/net/ethernet/intel/igc/igc_main.c b/drivers/net/ethernet/intel/igc/igc_main.c index e9bb403bbacf9..58ffddc6419ad 100644 --- a/drivers/net/ethernet/intel/igc/igc_main.c +++ b/drivers/net/ethernet/intel/igc/igc_main.c @@ -6489,7 +6489,7 @@ static int igc_xdp_xmit(struct net_device *dev, int num_frames, int cpu = smp_processor_id(); struct netdev_queue *nq; struct igc_ring *ring; - int i, drops; + int i, nxmit; if (unlikely(!netif_carrier_ok(dev))) return -ENETDOWN; @@ -6505,16 +6505,15 @@ static int igc_xdp_xmit(struct net_device *dev, int num_frames, /* Avoid transmit queue timeout since we share it with the slow path */ txq_trans_cond_update(nq); - drops = 0; + nxmit = 0; for (i = 0; i < num_frames; i++) { int err; struct xdp_frame *xdpf = frames[i]; err = igc_xdp_init_tx_descriptor(ring, xdpf); - if (err) { - xdp_return_frame_rx_napi(xdpf); - drops++; - } + if (err) + break; + nxmit++; } if (flags & XDP_XMIT_FLUSH) @@ -6522,7 +6521,7 @@ static int igc_xdp_xmit(struct net_device *dev, int num_frames, __netif_tx_unlock(nq); - return num_frames - drops; + return nxmit; } static void igc_trigger_rxtxq_interrupt(struct igc_adapter *adapter, -- 2.43.0