Received: by 2002:a05:6a10:22f:0:0:0:0 with SMTP id 15csp2367694pxk; Mon, 14 Sep 2020 11:22:15 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwAcc3CbfPoFWZJHtfqykvlpuU/zj5lt0P+ZcqBqHCQFQau8uA0x3AIxa2BWtRVkLCvfIpw X-Received: by 2002:a17:906:e24d:: with SMTP id gq13mr15480615ejb.152.1600107735704; Mon, 14 Sep 2020 11:22:15 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1600107735; cv=none; d=google.com; s=arc-20160816; b=UIowg7jEMMwSzV5kpd+xtK+ckLLxQs0DZsz6anxd0HhYErCTPXwr7HjEcBjQpJQQ65 7sh78Z37advmHYp6OxGqGmygmv8kjvjj4RJ38TNnxewNjKtFWcV10tLZqA2wsSIRhfoq BPuZ5TTwm8q3uT8YKBuLDuEEre0CNHwRa4jmvLdS0u6fKDtjQOWk7XkdUjD2O/zDIQBs nYfqLun2/jUUj9KzA7IuaJb/9jVOowDbN8/gJYNTySW2pgJ2djEjvTLtxNyHpwLTB6bX 6qT3hCUHUYKk8rZR+0x5S+xvNku8Dqe9XaKB/C6lm9lLQQqzuAKh1t/Xl08pshNo51cp cuIA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=PRS9hPGTzNqWYQCnhRACZ0J9stQvEN+CXJ0aPUWYmeI=; b=tCOzpzBDn12aktnSGXLMYw3BzdLq2ysQ4XfEVY1z4htKHk4jfhzajeSLosIAQrtu6I NFndkz7DPT3KY9P93/ckboD/GKPiBA4VFcHuQHKE1DlHNl21T7+cX01pPdx49vlDjtea jaFJIVTEz38/zhxnpUW4sLog7QNp1gp5YkHFXKaeok4HxKXUajNWnHnimBm6NPb8AyH7 AgcyvXgJf3pWm910YB5NGeA9ApEJhJeB4W0tS+uMgB0S0nrXp2NSQQ3XpWnSkPe0eksu uorAxcXEzg3rf3GyICQBmeYlb0679qMApB/zIUQxVzqea2JaI0lobNf29s8iaHaWXhUP RXZg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=kPWTPtFm; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id d19si8534126edj.40.2020.09.14.11.21.52; Mon, 14 Sep 2020 11:22:15 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=kPWTPtFm; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726069AbgINSU5 (ORCPT + 99 others); Mon, 14 Sep 2020 14:20:57 -0400 Received: from mail.kernel.org ([198.145.29.99]:45150 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725944AbgINSUy (ORCPT ); Mon, 14 Sep 2020 14:20:54 -0400 Received: from mail-lf1-f47.google.com (mail-lf1-f47.google.com [209.85.167.47]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 67E9321D7B; Mon, 14 Sep 2020 18:20:53 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1600107653; bh=BAjJTlDtrbfSxx3WpXifRRlUlkkL6nq/6bBLXSRWPJM=; h=References:In-Reply-To:From:Date:Subject:To:Cc:From; b=kPWTPtFmGeAcUwV2ZzcSWRLVUn+chuLd0NhGULBRQ+mOorNLZ19F5q/V2RhCj2MRj vN3hZ1c3c87HkfKtvIQpoAf/F7rrxQgnJ5PTgfWTl9lNtg4pqyVtSwWPubgfqOnNCI +zRYSUDpw8rpBpZQ/xk8gLHFcEZgPE+EE9CWAb5g= Received: by mail-lf1-f47.google.com with SMTP id d15so266429lfq.11; Mon, 14 Sep 2020 11:20:53 -0700 (PDT) X-Gm-Message-State: AOAM533aiOb13/7WyyO9G5jKLmSQHASmgMA6xp2fBHTuSKdSB4SlPY/M fGKSPkMoPt3UKHXR0B0u60htp59IdV1r96R3V8o= X-Received: by 2002:a19:992:: with SMTP id 140mr4464849lfj.273.1600107651627; Mon, 14 Sep 2020 11:20:51 -0700 (PDT) MIME-Version: 1.0 References: <20200911143022.414783-1-nicolas.rybowski@tessares.net> In-Reply-To: <20200911143022.414783-1-nicolas.rybowski@tessares.net> From: Song Liu Date: Mon, 14 Sep 2020 11:20:40 -0700 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: [PATCH bpf-next v2 1/5] bpf: expose is_mptcp flag to bpf_tcp_sock To: Nicolas Rybowski Cc: Alexei Starovoitov , Daniel Borkmann , Martin KaFai Lau , Song Liu , Yonghong Song , Andrii Nakryiko , John Fastabend , KP Singh , "David S. Miller" , Jakub Kicinski , Matthieu Baerts , Mat Martineau , Networking , bpf , open list Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Sep 11, 2020 at 8:07 AM Nicolas Rybowski wrote: > > is_mptcp is a field from struct tcp_sock used to indicate that the > current tcp_sock is part of the MPTCP protocol. > > In this protocol, a first socket (mptcp_sock) is created with > sk_protocol set to IPPROTO_MPTCP (=262) for control purpose but it > isn't directly on the wire. This is the role of the subflow (kernel) > sockets which are classical tcp_sock with sk_protocol set to > IPPROTO_TCP. The only way to differentiate such sockets from plain TCP > sockets is the is_mptcp field from tcp_sock. > > Such an exposure in BPF is thus required to be able to differentiate > plain TCP sockets from MPTCP subflow sockets in BPF_PROG_TYPE_SOCK_OPS > programs. > > The choice has been made to silently pass the case when CONFIG_MPTCP is > unset by defaulting is_mptcp to 0 in order to make BPF independent of > the MPTCP configuration. Another solution is to make the verifier fail > in 'bpf_tcp_sock_is_valid_ctx_access' but this will add an additional > '#ifdef CONFIG_MPTCP' in the BPF code and a same injected BPF program > will not run if MPTCP is not set. > > An example use-case is provided in > https://github.com/multipath-tcp/mptcp_net-next/tree/scripts/bpf/examples > > Suggested-by: Matthieu Baerts > Acked-by: Matthieu Baerts > Acked-by: Mat Martineau > Signed-off-by: Nicolas Rybowski > --- > include/uapi/linux/bpf.h | 1 + > net/core/filter.c | 9 ++++++++- > tools/include/uapi/linux/bpf.h | 1 + > 3 files changed, 10 insertions(+), 1 deletion(-) > > diff --git a/include/uapi/linux/bpf.h b/include/uapi/linux/bpf.h > index 7dd314176df7..7d179eada1c3 100644 > --- a/include/uapi/linux/bpf.h > +++ b/include/uapi/linux/bpf.h > @@ -4060,6 +4060,7 @@ struct bpf_tcp_sock { > __u32 delivered; /* Total data packets delivered incl. rexmits */ > __u32 delivered_ce; /* Like the above but only ECE marked packets */ > __u32 icsk_retransmits; /* Number of unrecovered [RTO] timeouts */ > + __u32 is_mptcp; /* Is MPTCP subflow? */ Shall we have an __u32 flags, and make is_mptcp a bit of it? Thanks, Song [...]