Received: by 2002:a05:6358:1087:b0:cb:c9d3:cd90 with SMTP id j7csp1563500rwi; Wed, 19 Oct 2022 12:15:28 -0700 (PDT) X-Google-Smtp-Source: AMsMyM5S4PebuOg9LSveqoIHEdRS4AVhcj4+SkV8dez7sFdM46cYjY403iO/3OPJuekTlEm+XjU5 X-Received: by 2002:a63:2b41:0:b0:46e:9364:eb07 with SMTP id r62-20020a632b41000000b0046e9364eb07mr341896pgr.46.1666206927981; Wed, 19 Oct 2022 12:15:27 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1666206927; cv=none; d=google.com; s=arc-20160816; b=sOv5aB+wIabyZTned8xRJsw24hO95LDoGJwDK8LkD+f87g32fHPaqb3axKTAJGF6Kj 0xNV3vqb9nQdSp7xcP9/ZJ4k+WkOHfrIbnmx0GenmYBMX1oc7woRCzeBc4mckszaeO9h /s4r2GbkuL23qg3g+d1sARaOU4je1s3EacHmh+98TBeaxLoihObCLTjAKiGv4D+brmT+ xeE15fm7oF4sGK51OGUFythPo45r6b9nywF6BrZaEqb/OyeQ1l0zDdHbD42OlEdln07A 5lUBLDjsgb/aKwtJWwRq4XvkNjmNP9oSu2YJcTnD3hYSz+NqmdMC1uxmNZk7RJHi3HO1 sRJQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=Ktkbq0iHmuOGr8YKybw9yRJ3NFz4+wIdXshxJ8vlzvI=; b=Veed8NldtvMvtQtVe2IczYNtZ9rvh/gDrTK0HcU+lwA2Wh6lflEd3e9ZSBqR53UU9f dkM2zPLylZG5UVaSGb6AEXl5QDmHBddHwPVOjEX2bATvR1ODtl5kM5EX9OKg/V0nDKhe c6ZGI9zZFi+cfFG1O31iq3J6ZAQ5Q1x3gAV57TLeVw5x5Mfu9g5HPkmYR6hnODYSHTdV o7+lQ1EbL/eH2HCDH3ArMpnGeX0pNFbzp4hQn7EtUScIBdUA8bjgXAPtL+0m3e2Q2EoN PPONNhYfhkvj+/yTuDOh+0x/a68krfPZI6Wjunef2nbgh07Qxt2MlPLBH9Zv9Z2KWNsh 13vg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@tigera.io header.s=google header.b=hlq0t5O4; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=tigera.io Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id j4-20020a170902da8400b0017f6557bf88si23108119plx.588.2022.10.19.12.15.16; Wed, 19 Oct 2022 12:15:27 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@tigera.io header.s=google header.b=hlq0t5O4; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=tigera.io Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230160AbiJSTKp (ORCPT + 99 others); Wed, 19 Oct 2022 15:10:45 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:48772 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229920AbiJSTKm (ORCPT ); Wed, 19 Oct 2022 15:10:42 -0400 Received: from mail-pf1-x429.google.com (mail-pf1-x429.google.com [IPv6:2607:f8b0:4864:20::429]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 7EC251814A9 for ; Wed, 19 Oct 2022 12:10:41 -0700 (PDT) Received: by mail-pf1-x429.google.com with SMTP id g28so18121138pfk.8 for ; Wed, 19 Oct 2022 12:10:41 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=tigera.io; s=google; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=Ktkbq0iHmuOGr8YKybw9yRJ3NFz4+wIdXshxJ8vlzvI=; b=hlq0t5O46kYRtoorj9rCVBuGvthMsQoJWM0Ekd+3w7Pidjgzzod8b/q1klsnYEsIDl CVnnlT7m+Bx3X9IgGerr/YH0Uspii3p6hwQXNHKN3ZBEK4143xhBRCSGKeG0XBF3D9Xo hN093Khih0GclxrQeK2qHp+jamrB+4BgiNECw= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=Ktkbq0iHmuOGr8YKybw9yRJ3NFz4+wIdXshxJ8vlzvI=; b=8QdEA0B1AZ3VKbOha0fssNdDK/YALsJNh7xWOHwefotN0p/tBRenQzadvaNaJAiS4P qyPnX954VOenr9jTpr1ZBj3ygtgaXkFQ24+d8DTeHNDpAEtI3LT9vUvTpbHp7OjDMVNO q62AtksmE/rPve1XHeQ6jEqZmTJVZRgUBOEGVdT+7/NHF4wj6eOWdETgl7uzfUsj4RJI bgQgOCf5eq3xihTTKBx5pjPLH4vn8PXYIOX6Jiqat893H8DUa1/RCr6DNa3qRQBzsEo2 pA5GadUCmZrP1gzCHlmy5odIk6ruDpmgD6MrSvYoteIE2DqkiDlCoMttIgnEyjC2vxuD oosQ== X-Gm-Message-State: ACrzQf0IfNP+gT5WNG/I0gYbfAh2hQS9fIktQUSX4d5FWhp1EJnl+B2c NAkuCzZk8plxYR7556rLwP5Zxy+UR5g80OZu2fb5LaD7ElSmYw== X-Received: by 2002:a63:1608:0:b0:45a:355a:9420 with SMTP id w8-20020a631608000000b0045a355a9420mr8420755pgl.354.1666206640880; Wed, 19 Oct 2022 12:10:40 -0700 (PDT) MIME-Version: 1.0 References: In-Reply-To: From: Tomas Hruby Date: Wed, 19 Oct 2022 12:10:30 -0700 Message-ID: Subject: Re: kernel BUG at net/core/skbuff.c:4219 To: eric.dumazet@gmail.com Cc: edumazet@google.com, herbert@gondor.apana.org.au, jpiotrowski@linux.microsoft.com, linux-kernel@vger.kernel.org, netdev@vger.kernel.org, seh@panix.com Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org > > Hi, > > > > One of our Flatcar users has been hitting the kernel BUG in the subject line > > for the past year (https://github.com/flatcar/Flatcar/issues/378). This was > > first reported when on 5.10.25, but has been happening across kernel updates, > > most recently with 5.15.63. The nodes where this happens are AWS EC2 instances, > > using ENA and calico networking in eBPF mode with VXLAN encapsulation. When > > GRO/GSO is enabled, the host hits this bug and prints the following stacktrace: > > > I suspect eBPF code lowers gso_size ? > > gso stack is not able to arbitrarily segment a GRO packet after gso_size > being changed. Based on the stack trace, it happens for a tcp packet. Since it seems like it is on egress from a host, I suspect that it is encapsulating a tcp packet into a vxlan tunnel and useds bpf_skb_adjust_room() here https://github.com/projectcalico/calico/blob/master/felix/bpf-gpl/nat.h#L77-L80 Not sure if it should use BPF_F_ADJ_ROOM_FIXED_GSO in that case. Calico uses the flag when decapsulating packets here https://github.com/projectcalico/calico/blob/master/felix/bpf-gpl/nat.h#L143 and it uses the flag as the packet is UDP. Any of that could cause the BUG? Tomas