Received: by 2002:a05:6a10:413:0:0:0:0 with SMTP id 19csp1707315pxp; Mon, 21 Mar 2022 03:20:10 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxoEatd6147Ikhr3CkaVHBX+YntE7SLYIqj/ot9IG9aEF77ZJHNu3IlAqDQxlZySt7ZIK7M X-Received: by 2002:a05:6a00:130e:b0:4cc:3c7d:4dec with SMTP id j14-20020a056a00130e00b004cc3c7d4decmr23292779pfu.32.1647858010183; Mon, 21 Mar 2022 03:20:10 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1647858010; cv=none; d=google.com; s=arc-20160816; b=QjdhPRnc12P8KaR18b9+Le0SZr/WC/RkBdtefOxWhaokCQpF7/CgcfZbx6q8UnXttZ JflqWzIaUF8WjgZ0FE55xGQc3KPvVgdF3M+YBZzLhJ0Z7rPVZ6JW4Ee1suj2FYaXS/3D /q6mf9WhsuwHV+nH6k+fqCjrZlSWVuyuqTvQBTV1dQqYl2sRDRYAWzS93LdMyNNiR0gk ANci9/MtM3YRSMlMykvCy8UetOUss98aCf2MIxw2Hx2IFpSIGgqDMFZjgV/QgwoVxKxD luICSTG5fKQ0XScZam4rqVC4UqdaIZaeexbJIBfik1IYpFnhOcNYmhkI9demxCNWLSir zIJg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=ThtTW+jDr+eNDTEVCZ1dafLPsPyl9A84KoG84EmfQD0=; b=lKOWvEiLOApSDJoCwLjDdIwlm+Dm4XuQcLXfP0+Z6Pe22pWHvIvY0v5+dv4DeTs5NN h6AIt/CcUU4o13wFWiDnCjXEvpLy70MCDPZFzIgCscFls837UBHdvi+dVI8gDoYDyNTx Gswu35MhL2oXPUNshKkLsNSfRpMbeU9TeBbBCMy2I0QRlUvN9IuKA9VBK1hxwuH4L8m3 vaBb11HMrF/Kk6Yo+Gy5kGBJHlvrDqp45p6IHBiwp/OOpP5DqFJrAxtNBOceTodqgSFD L1N/ZKaPpjwz8K3hge8O5SPyvluiiJYhXrXx/6VadWgVMIgpI0rnE01UyA0bUJnkCQLm dXXQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=ax7X6G7j; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id kk16-20020a17090b4a1000b001c648d1fb6esi3836143pjb.132.2022.03.21.03.19.56; Mon, 21 Mar 2022 03:20:10 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=ax7X6G7j; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S242902AbiCSM3V (ORCPT + 99 others); Sat, 19 Mar 2022 08:29:21 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:45050 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S242882AbiCSM3U (ORCPT ); Sat, 19 Mar 2022 08:29:20 -0400 Received: from mail-ed1-x52b.google.com (mail-ed1-x52b.google.com [IPv6:2a00:1450:4864:20::52b]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 6C0DA6AA66; Sat, 19 Mar 2022 05:27:58 -0700 (PDT) Received: by mail-ed1-x52b.google.com with SMTP id w25so13022897edi.11; Sat, 19 Mar 2022 05:27:58 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to; bh=ThtTW+jDr+eNDTEVCZ1dafLPsPyl9A84KoG84EmfQD0=; b=ax7X6G7jNeaPVjwbIlOqtZ6ek2P0J1VGeZ3Cnb+fFOphvPa6VT2/lH8HOAwRnAa2Pm 9gFdCNmUn2eROdcYdlcLhrEiNIniu9Ulr6U5dQ5f/Ea2vbiLXHW2Tu5ZJ15d3Bqsfiac ygBr/p5XJb9ZWEXMzZYQQUGoNrruTXoohunw8xcamutVwztYwKAe+YrNxCfW3V99cw1/ 0AeUQDhoF1d7kcdO+5cvipKenKU0TeJ5SBvP6iIzOs8hBn3rqlj1Cm/zX4hcBCe30wwq HfLHpzN1vOEE4/cVb6P1/ep1kpHULyHI7jDFZHPxn3f05H9yJ0n1EJ1FPdqZNrqrj50D 6kZA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=ThtTW+jDr+eNDTEVCZ1dafLPsPyl9A84KoG84EmfQD0=; b=2mFPoneqEGG92LubUwY+mr6dR3oM7VeWnj5yXBxvDfxW8ZcXfu//5V2+GvyBBfc0aR 5spyehsWZ44WsW9R/fMfvXC5ijo+OW+fkW9+FfvtsQHJXMeLTqYqSX1kW1uOcT8TJAzD tmawuGEUWD14Ha3/e7WUtl9++7rDFDndtTNBM9yFuH42hjuse2g4ypiUuJ01xrCuxV9L Itr1yBX0X3I/1K1ACXwUaP6Ilnm1xhXQJy3pUDHg5h7GJ5xGOxeEqEA5Z2oLRzmSMbgM aTy/z3zpcqbjF5C5a6Hzms9ijVTuLjwRhwTheYr1IBitS9cadEl04rpt81HCE0v/I96s Ncmg== X-Gm-Message-State: AOAM531gQCnfY50HBI8v9H+PnMgkDEr+gW05bUPX2pm54WaGu2VtNKHq 7nhSW+Pr2Xg9xioaE2oSWQY= X-Received: by 2002:a05:6402:1650:b0:418:fbfc:971b with SMTP id s16-20020a056402165000b00418fbfc971bmr12030883edx.351.1647692876647; Sat, 19 Mar 2022 05:27:56 -0700 (PDT) Received: from krava ([83.240.61.119]) by smtp.gmail.com with ESMTPSA id f1-20020a056402194100b00416b174987asm5672374edz.35.2022.03.19.05.27.55 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sat, 19 Mar 2022 05:27:56 -0700 (PDT) Date: Sat, 19 Mar 2022 13:27:54 +0100 From: Jiri Olsa To: Andrii Nakryiko Cc: Jiri Olsa , Alexei Starovoitov , Daniel Borkmann , Andrii Nakryiko , Masami Hiramatsu , Networking , bpf , lkml , Martin KaFai Lau , Song Liu , Yonghong Song , John Fastabend , KP Singh , Steven Rostedt Subject: Re: [PATCHv3 bpf-next 00/13] bpf: Add kprobe multi link Message-ID: References: <20220316122419.933957-1-jolsa@kernel.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,FREEMAIL_FROM, RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Mar 18, 2022 at 10:50:46PM -0700, Andrii Nakryiko wrote: > On Wed, Mar 16, 2022 at 5:24 AM Jiri Olsa wrote: > > > > hi, > > this patchset adds new link type BPF_TRACE_KPROBE_MULTI that attaches > > kprobe program through fprobe API [1] instroduced by Masami. > > > > The fprobe API allows to attach probe on multiple functions at once very > > fast, because it works on top of ftrace. On the other hand this limits > > the probe point to the function entry or return. > > > > > > With bpftrace support I see following attach speed: > > > > # perf stat --null -r 5 ./src/bpftrace -e 'kprobe:x* { } i:ms:1 { exit(); } ' > > Attaching 2 probes... > > Attaching 3342 functions > > ... > > > > 1.4960 +- 0.0285 seconds time elapsed ( +- 1.91% ) > > > > v3 changes: > > - based on latest fprobe post from Masami [2] > > - add acks > > - add extra comment to kprobe_multi_link_handler wrt entry ip setup [Masami] > > - keep swap_words_64 static and swap values directly in > > bpf_kprobe_multi_cookie_swap [Andrii] > > - rearrange locking/migrate setup in kprobe_multi_link_prog_run [Andrii] > > - move uapi fields [Andrii] > > - add bpf_program__attach_kprobe_multi_opts function [Andrii] > > - many small test changes [Andrii] > > - added tests for bpf_program__attach_kprobe_multi_opts > > - make kallsyms_lookup_name check for empty string [Andrii] > > > > v2 changes: > > - based on latest fprobe changes [1] > > - renaming the uapi interface to kprobe multi > > - adding support for sort_r to pass user pointer for swap functions > > and using that in cookie support to keep just single functions array > > - moving new link to kernel/trace/bpf_trace.c file > > - using single fprobe callback function for entry and exit > > - using kvzalloc, libbpf_ensure_mem functions > > - adding new k[ret]probe.multi sections instead of using current kprobe > > - used glob_match from test_progs.c, added '?' matching > > - move bpf_get_func_ip verifier inline change to seprate change > > - couple of other minor fixes > > > > > > Also available at: > > https://git.kernel.org/pub/scm/linux/kernel/git/jolsa/perf.git > > bpf/kprobe_multi > > > > thanks, > > jirka > > > > > > [1] https://lore.kernel.org/bpf/164458044634.586276.3261555265565111183.stgit@devnote2/ > > [2] https://lore.kernel.org/bpf/164735281449.1084943.12438881786173547153.stgit@devnote2/ > > --- > > Jiri Olsa (13): > > lib/sort: Add priv pointer to swap function > > kallsyms: Skip the name search for empty string > > bpf: Add multi kprobe link > > bpf: Add bpf_get_func_ip kprobe helper for multi kprobe link > > bpf: Add support to inline bpf_get_func_ip helper on x86 > > bpf: Add cookie support to programs attached with kprobe multi link > > libbpf: Add libbpf_kallsyms_parse function > > libbpf: Add bpf_link_create support for multi kprobes > > libbpf: Add bpf_program__attach_kprobe_multi_opts function > > selftests/bpf: Add kprobe_multi attach test > > selftests/bpf: Add kprobe_multi bpf_cookie test > > selftests/bpf: Add attach test for bpf_program__attach_kprobe_multi_opts > > selftests/bpf: Add cookie test for bpf_program__attach_kprobe_multi_opts > > > > Ok, so I've integrated multi-attach kprobes into retsnoop. It was > pretty straightforward. Here are some numbers for the speed of > attaching and, even more importantly, detaching for a set of almost > 400 functions. It's a bit less functions for fentry-based mode due to > more limited BTF information for static functions. Note that retsnoop > attaches two BPF programs for each kernel function, so it's actually > two multi-attach kprobes, each attaching to 397 functions. For > single-attach kprobe, we perform 397 * 2 = 794 attachments. > > I've been invoking retsnoop with the following specified set of > functions: -e '*sys_bpf*' -a ':kernel/bpf/syscall.c' -a > ':kernel/bpf/verifier.c' -a ':kernel/bpf/btf.c' -a > ':kernel/bpf/core.c'. So basically bpf syscall entry functions and all > the discoverable functions from syscall.c, verifier.c, core.c and > btf.c from kernel/bpf subdirectory. > > Results: > > fentry attach/detach (263 kfuncs): 2133 ms / 31671 ms (33 seconds) > kprobe attach/detach (397 kfuncs): 3121 ms / 13195 ms (16 seconds) > multi-kprobe attach/detach (397 kfuncs): 9 ms / 248 ms (0.25 seconds) > > So as you can see, the speed up is tremendous! API is also very > convenient, I didn't have to modify retsnoop internals much to > accommodate multi-attach API. Great job! nice! thanks for doing that so quickly > > Now for the bad news. :( > > Stack traces from multi-attach kretprobe are broken, which makes all > this way less useful. > > Below, see stack traces captured with multi- and single- kretprobes > for two different use cases. Single kprobe stack traces make much more > sense. Ignore that last function doesn't have actual address > associated with it (i.e. for __sys_bpf and bpf_tracing_prog_attach, > respectively). That's just how retsnoop is doing things, I think. We > actually were capturing stack traces from inside __sys_bpf (first > case) and bpf_tracing_prog_attach (second case). > > MULTI KPROBE: > ffffffff81185a80 __sys_bpf+0x0 > (kernel/bpf/syscall.c:4622:1) > > SINGLE KPROBE: > ffffffff81e0007c entry_SYSCALL_64_after_hwframe+0x44 > (arch/x86/entry/entry_64.S:113:0) > ffffffff81cd2b15 do_syscall_64+0x35 > (arch/x86/entry/common.c:80:7) > . do_syscall_x64 > (arch/x86/entry/common.c:50:12) > ffffffff811881aa __x64_sys_bpf+0x1a > (kernel/bpf/syscall.c:4765:1) > __sys_bpf > > > MULTI KPROBE: > ffffffff811851b0 bpf_tracing_prog_attach+0x0 > (kernel/bpf/syscall.c:2708:1) > > SINGLE KPROBE: > ffffffff81e0007c entry_SYSCALL_64_after_hwframe+0x44 > (arch/x86/entry/entry_64.S:113:0) > ffffffff81cd2b15 do_syscall_64+0x35 > (arch/x86/entry/common.c:80:7) > . do_syscall_x64 > (arch/x86/entry/common.c:50:12) > ffffffff811881aa __x64_sys_bpf+0x1a > (kernel/bpf/syscall.c:4765:1) > ffffffff81185e79 __sys_bpf+0x3f9 > (kernel/bpf/syscall.c:4705:9) > ffffffff8118583a bpf_raw_tracepoint_open+0x19a > (kernel/bpf/syscall.c:3069:6) > bpf_tracing_prog_attach > > You can see that in multi-attach kprobe we only get one entry, which > is the very last function in the stack trace. We have no parent > function captured whatsoever. Jiri, Masami, any ideas what's wrong and > how to fix this? Let's try to figure this out and fix it before the > feature makes it into the kernel release. Thanks in advance! oops, I should have tried kstack with the bpftrace's kretprobe, I see the same: # ./src/bpftrace -e 'kretprobe:x* { @[kstack] = count(); }' Attaching 1 probe... Attaching 3340probes ^C @[ xfs_trans_apply_dquot_deltas+0 ]: 22 @[ xlog_cil_commit+0 ]: 22 @[ xlog_grant_push_threshold+0 ]: 22 @[ xfs_trans_add_item+0 ]: 22 @[ xfs_log_reserve+0 ]: 22 @[ xlog_space_left+0 ]: 22 @[ xfs_buf_offset+0 ]: 22 @[ xfs_trans_free_dqinfo+0 ]: 22 @[ xlog_ticket_alloc+0 xfs_log_reserve+5 ]: 22 @[ xfs_cil_prepare_item+0 I think it's because we set original ip for return probe to have bpf_get_func_ip working properly, but it breaks backtrace of course I'm not sure we could bring along the original regs for return probe, but I have an idea how to workaround the bpf_get_func_ip issue and keep the registers intact for other helpers jirka