From: Florent Revest
Date: Tue, 26 Jan 2021 19:00:32 +0100
Subject: Re: [PATCH bpf-next v5 4/4] selftests/bpf: Add a selftest for the tracing bpf_get_socket_cookie
To: Yonghong Song
Cc: Alexei Starovoitov, KP Singh, bpf, Alexei Starovoitov, Daniel Borkmann, Andrii Nakryiko, Florent Revest, open list, Brendan Jackman
References: <20210119155953.803818-1-revest@chromium.org> <20210119155953.803818-4-revest@chromium.org> <93be5434-58ca-1e3b-d7dc-7ae079381104@fb.com>
In-Reply-To: <93be5434-58ca-1e3b-d7dc-7ae079381104@fb.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On Sat, Jan 23, 2021 at 9:45 PM Yonghong Song wrote:
> On 1/22/21 7:34 AM, Florent Revest wrote:
> > On Wed, Jan 20, 2021 at 8:06 PM Florent Revest wrote:
> >>
> >> On Wed, Jan 20, 2021 at 8:04 PM Alexei Starovoitov wrote:
> >>>
> >>> On Wed, Jan 20, 2021 at 9:08 AM KP Singh wrote:
> >>>>
> >>>> On Tue, Jan 19, 2021 at 5:00 PM Florent Revest wrote:
> >>>>>
> >>>>> This builds up on the existing socket cookie test which checks whether
> >>>>> the bpf_get_socket_cookie helpers provide the same value in
> >>>>> cgroup/connect6 and sockops programs for a socket created by the
> >>>>> userspace part of the test.
> >>>>>
> >>>>> Adding a tracing program to the existing objects requires a different
> >>>>> attachment strategy and different headers.
> >>>>>
> >>>>> Signed-off-by: Florent Revest
> >>>>
> >>>> Acked-by: KP Singh
> >>>>
> >>>> (one minor note, doesn't really need fixing as a part of this though)
> >>>>
> >>>>> ---
> >>>>>  .../selftests/bpf/prog_tests/socket_cookie.c  | 24 +++++++----
> >>>>>  .../selftests/bpf/progs/socket_cookie_prog.c  | 41 ++++++++++++++++---
> >>>>>  2 files changed, 52 insertions(+), 13 deletions(-)
> >>>>>
> >>>>> diff --git a/tools/testing/selftests/bpf/prog_tests/socket_cookie.c b/tools/testing/selftests/bpf/prog_tests/socket_cookie.c
> >>>>> index 53d0c44e7907..e5c5e2ea1deb 100644
> >>>>> --- a/tools/testing/selftests/bpf/prog_tests/socket_cookie.c
> >>>>> +++ b/tools/testing/selftests/bpf/prog_tests/socket_cookie.c
> >>>>> @@ -15,8 +15,8 @@ struct socket_cookie {
> >>>>>
> >>>>>  void test_socket_cookie(void)
> >>>>>  {
> >>>>> +	struct bpf_link *set_link, *update_sockops_link, *update_tracing_link;
> >>>>>  	socklen_t addr_len = sizeof(struct sockaddr_in6);
> >>>>> -	struct bpf_link *set_link, *update_link;
> >>>>>  	int server_fd, client_fd, cgroup_fd;
> >>>>>  	struct socket_cookie_prog *skel;
> >>>>>  	__u32 cookie_expected_value;
> >>>>> @@ -39,15 +39,21 @@ void test_socket_cookie(void)
> >>>>>  		  PTR_ERR(set_link)))
> >>>>>  		goto close_cgroup_fd;
> >>>>>
> >>>>> -	update_link = bpf_program__attach_cgroup(skel->progs.update_cookie,
> >>>>> -						 cgroup_fd);
> >>>>> -	if (CHECK(IS_ERR(update_link), "update-link-cg-attach", "err %ld\n",
> >>>>> -		  PTR_ERR(update_link)))
> >>>>> +	update_sockops_link = bpf_program__attach_cgroup(
> >>>>> +		skel->progs.update_cookie_sockops, cgroup_fd);
> >>>>> +	if (CHECK(IS_ERR(update_sockops_link), "update-sockops-link-cg-attach",
> >>>>> +		  "err %ld\n", PTR_ERR(update_sockops_link)))
> >>>>>  		goto free_set_link;
> >>>>>
> >>>>> +	update_tracing_link = bpf_program__attach(
> >>>>> +		skel->progs.update_cookie_tracing);
> >>>>> +	if (CHECK(IS_ERR(update_tracing_link), "update-tracing-link-attach",
> >>>>> +		  "err %ld\n", PTR_ERR(update_tracing_link)))
> >>>>> +		goto free_update_sockops_link;
> >>>>> +
> >>>>>  	server_fd = start_server(AF_INET6, SOCK_STREAM, "::1", 0, 0);
> >>>>>  	if (CHECK(server_fd < 0, "start_server", "errno %d\n", errno))
> >>>>> -		goto free_update_link;
> >>>>> +		goto free_update_tracing_link;
> >>>>>
> >>>>>  	client_fd = connect_to_fd(server_fd, 0);
> >>>>>  	if (CHECK(client_fd < 0, "connect_to_fd", "errno %d\n", errno))
> >>>>> @@ -71,8 +77,10 @@ void test_socket_cookie(void)
> >>>>>  	close(client_fd);
> >>>>>  close_server_fd:
> >>>>>  	close(server_fd);
> >>>>> -free_update_link:
> >>>>> -	bpf_link__destroy(update_link);
> >>>>> +free_update_tracing_link:
> >>>>> +	bpf_link__destroy(update_tracing_link);
> >>>>
> >>>> I don't think this need to block submission unless there are other
> >>>> issues but the bpf_link__destroy can just be called in a single
> >>>> cleanup label because it handles null or erroneous inputs:
> >>>>
> >>>> int bpf_link__destroy(struct bpf_link *link)
> >>>> {
> >>>>         int err = 0;
> >>>>
> >>>>         if (IS_ERR_OR_NULL(link))
> >>>>                 return 0;
> >>>> [...]
> >>>
> >>> +1 to KP's point.
> >>>
> >>> Also Florent, how did you test it?
> >>> This test fails in CI and in my manual run:
> >>> ./test_progs -t cook
> >>> libbpf: load bpf program failed: Permission denied
> >>> libbpf: -- BEGIN DUMP LOG ---
> >>> libbpf:
> >>> ; int update_cookie_sockops(struct bpf_sock_ops *ctx)
> >>> 0: (bf) r6 = r1
> >>> ; if (ctx->family != AF_INET6)
> >>> 1: (61) r1 = *(u32 *)(r6 +20)
> >>> ; if (ctx->family != AF_INET6)
> >>> 2: (56) if w1 != 0xa goto pc+21
> >>> R1_w=inv10 R6_w=ctx(id=0,off=0,imm=0) R10=fp0
> >>> ; if (ctx->op != BPF_SOCK_OPS_TCP_CONNECT_CB)
> >>> 3: (61) r1 = *(u32 *)(r6 +0)
> >>> ; if (ctx->op != BPF_SOCK_OPS_TCP_CONNECT_CB)
> >>> 4: (56) if w1 != 0x3 goto pc+19
> >>> R1_w=inv3 R6_w=ctx(id=0,off=0,imm=0) R10=fp0
> >>> ; if (!ctx->sk)
> >>> 5: (79) r1 = *(u64 *)(r6 +184)
> >>> ; if (!ctx->sk)
> >>> 6: (15) if r1 == 0x0 goto pc+17
> >>> R1_w=sock(id=0,ref_obj_id=0,off=0,imm=0) R6_w=ctx(id=0,off=0,imm=0) R10=fp0
> >>> ; p = bpf_sk_storage_get(&socket_cookies, ctx->sk, 0, 0);
> >>> 7: (79) r2 = *(u64 *)(r6 +184)
> >>> ; p = bpf_sk_storage_get(&socket_cookies, ctx->sk, 0, 0);
> >>> 8: (18) r1 = 0xffff888106e41400
> >>> 10: (b7) r3 = 0
> >>> 11: (b7) r4 = 0
> >>> 12: (85) call bpf_sk_storage_get#107
> >>> R2 type=sock_or_null expected=sock_common, sock, tcp_sock, xdp_sock, ptr_
> >>> processed 12 insns (limit 1000000) max_states_per_insn 0 total_states 0 peak_states 0 mark_read 0
> >>>
> >>> libbpf: -- END LOG --
> >>> libbpf: failed to load program 'update_cookie_sockops'
> >>> libbpf: failed to load object 'socket_cookie_prog'
> >>> libbpf: failed to load BPF skeleton 'socket_cookie_prog': -4007
> >>> test_socket_cookie:FAIL:socket_cookie_prog__open_and_load skeleton open_and_load failed
> >>> #95 socket_cookie:FAIL
> >>> Summary: 0/0 PASSED, 0 SKIPPED, 1 FAILED
> >>
> >> Oh :| I must have missed something in the rebase, I will fix this and
> >> address KP's comment then.
> >> Thanks for the review and sorry for the waste of time :)
> >
> > So this is actually an interesting one I think. :) The failure was
> > triggered by the combination of an LLVM update and this change:
> >
> > -#include <linux/bpf.h>
> > +#include "vmlinux.h"
> >
> > With an older LLVM, this used to work.
> > With a recent LLVM, the change of header causes those 3 lines to get
> > compiled differently:
> >
> > if (!ctx->sk)
> >         return 1;
> > p = bpf_sk_storage_get(&socket_cookies, ctx->sk, 0, 0);
> >
> > When including linux/bpf.h
> > ; if (!ctx->sk)
> > 5: 79 62 b8 00 00 00 00 00 r2 = *(u64 *)(r6 + 184)
> > 6: 15 02 10 00 00 00 00 00 if r2 == 0 goto +16
> > ; p = bpf_sk_storage_get(&socket_cookies, ctx->sk, 0, 0);
> > 7: 18 01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 r1 = 0 ll
> > 9: b7 03 00 00 00 00 00 00 r3 = 0
> > 10: b7 04 00 00 00 00 00 00 r4 = 0
> > 11: 85 00 00 00 6b 00 00 00 call 107
> > 12: bf 07 00 00 00 00 00 00 r7 = r0
> >
> > When including vmlinux.h
> > ; if (!ctx->sk)
> > 5: 79 61 b8 00 00 00 00 00 r1 = *(u64 *)(r6 + 184)
> > 6: 15 01 11 00 00 00 00 00 if r1 == 0 goto +17
> > ; p = bpf_sk_storage_get(&socket_cookies, ctx->sk, 0, 0);
> > 7: 79 62 b8 00 00 00 00 00 r2 = *(u64 *)(r6 + 184)
> > 8: 18 01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 r1 = 0 ll
> > 10: b7 03 00 00 00 00 00 00 r3 = 0
> > 11: b7 04 00 00 00 00 00 00 r4 = 0
> > 12: 85 00 00 00 6b 00 00 00 call 107
> > 13: bf 07 00 00 00 00 00 00 r7 = r0
> >
> > Note that ctx->sk gets fetched once in the first case (l5), and twice
> > in the second case (l5 and l7).
> > I'm assuming that struct bpf_sock_ops gets defined with different
> > attributes in vmlinux.h and causes LLVM to assume that ctx->sk could
> > have changed between the time of check and the time of use so it
> > yields two fetches and the verifier can't track that r2 is non null.
> >
> > If I save ctx->sk in a temporary variable first:
> >
> > struct bpf_sock *sk = ctx->sk;
> > if (!sk)
> >         return 1;
> > p = bpf_sk_storage_get(&socket_cookies, sk, 0, 0);
> >
> > this loads correctly. However, Brendan pointed out that this is also a
> > weak guarantee and that LLVM could still decide to compile this into
> > two fetches, Brendan suggested that we save sk out of an access to a
> > volatile pointer to ctx, what do you think ?
>
> Your above workaround should be good. Compiler should not do *bad*
> optimizations to have two fetches if the source code just has one
> in the above case. If this happens, it will be a llvm bug.
>
> The latest llvm trunk can reproduce the above issue. It is due to
> (1). llvm's partial (not complete) CSE and (2). this partial CSE
> resulted in llvm BPF backend generating two CORE_MEM operations (for
> CORE relocations) instead of one. If llvm will do complete cse like the
> above your code, we will be fine.
>
> Although fixing llvm CSE is desired, it may take
> some time. At the same time, please use your above source workaround.
> Thanks.

Good to know! Thank you Yonghong :)
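
For the archives, here is a rough sketch of what the volatile variant
Brendan suggested could look like. It is plain userspace C, not the
selftest itself: struct sock_stub, struct ctx_stub, load_sk_once() and
cookie_or_zero() are hypothetical stand-ins I made up for illustration
(the real types would be struct bpf_sock and struct bpf_sock_ops). The
cast is the same idiom the kernel's READ_ONCE() relies on, so the
compiler is expected to emit exactly one load of ctx->sk.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-ins for struct bpf_sock and struct bpf_sock_ops. */
struct sock_stub {
	unsigned long cookie;
};

struct ctx_stub {
	struct sock_stub *sk;
};

/*
 * Read ctx->sk once through a volatile-qualified lvalue. Compilers do not
 * duplicate or elide volatile accesses, so the NULL check and the later
 * use are guaranteed to see the same fetched value.
 */
static struct sock_stub *load_sk_once(struct ctx_stub *ctx)
{
	assert(ctx != NULL);
	return ((volatile struct ctx_stub *)ctx)->sk;
}

/* Same shape as the BPF program: check and use the one local copy. */
static unsigned long cookie_or_zero(struct ctx_stub *ctx)
{
	struct sock_stub *sk = load_sk_once(ctx);

	if (!sk)
		return 0;
	return sk->cookie;
}
```

Given the answer above, the plain temporary variable is what goes into
the patch; the volatile form would only matter if the compiler were ever
seen splitting that single source-level load again.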