Received: by 2002:a05:6358:16cc:b0:ea:6187:17c9 with SMTP id r12csp9067990rwl; Wed, 11 Jan 2023 00:41:32 -0800 (PST) X-Google-Smtp-Source: AMrXdXuAAv7PPtEm+gUatZUfd6JEN9ZhBWup0TsW4H8rwV+cokQDvRVdOW6s5MpO36rfVtjbX7D1 X-Received: by 2002:a17:907:9814:b0:857:1e36:3b7b with SMTP id ji20-20020a170907981400b008571e363b7bmr5917404ejc.11.1673426492053; Wed, 11 Jan 2023 00:41:32 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1673426492; cv=none; d=google.com; s=arc-20160816; b=Dh5Q4X7fTd/ZeBmqi4Yd9iN5ubg1DpAKEHH1W7P9QfeQzDqcT9EE3U8pvtPV6yJHCV /RznyDRQmYy+i0J9R/3ItpjO8QPXLkqzfCh5Gzy/4MQBYz8eHQ8mb/Z/2UFu3k6/05VH 1TpGW4rk7GD7HNNLEN5lIB1QcteV+dr92JeDdoqiwbUtfZz6lua5TX0z/JVoj0HMgKt8 CCrQk41W9vHKplhT1BaEOwZEHbyfVMddFC6DpRE1hGiqd9V6Tk6XaM2kcvMDaDrh3Ypr AiY3/N0Ddu/JUc1BE7y2tgrmwQCB/PgGhRSZWhNrP4bR5q30L2LMDu7Zj/C2+jwgzo/Q F9Fg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=6HaL8IXePBXJSNiav3hT/6nMubJ71nXLJedFvCkAqm4=; b=JxlDoDun+MMgBx6KJ1DoNElxYD0SNP5R8fEJa5Pn8jCWmNFwmStccWIazP+VjVWErn grio6wpHaPZtATe9ViV6KyGZnhweE4hLTo2rZSV5XZhFljqynT/uKAf7VB6ZOgCzX8x2 Jiu8V4dCMWn8aBU9JcbkUOVRoi+J4QJwQNp7S3Uubs7GpQdoruYFpsicOZlc+NfuVu83 q1QTWcdPLWFOIuyEoezAA//u4ifDI0mx22FHfXY8ltIZZyJUWSuPKGatEFOx6lbdBeS/ 7lgMZ9LYL6conbf6HRRCBD+VPJGR6VNzEDGU6W/uHcrGnifmGT4YxK14yNiZQAal0QHg M60A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=VhDFu52Z; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id gt33-20020a1709072da100b007c151fc73d0si17869791ejc.653.2023.01.11.00.41.19; Wed, 11 Jan 2023 00:41:32 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=VhDFu52Z; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235800AbjAKIYJ (ORCPT + 53 others); Wed, 11 Jan 2023 03:24:09 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:41748 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231994AbjAKIXm (ORCPT ); Wed, 11 Jan 2023 03:23:42 -0500 Received: from dfw.source.kernel.org (dfw.source.kernel.org [139.178.84.217]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 05A8662CF for ; Wed, 11 Jan 2023 00:23:41 -0800 (PST) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id 91D4061AE0 for ; Wed, 11 Jan 2023 08:23:40 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 02E07C433A1 for ; Wed, 11 Jan 2023 08:23:40 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1673425420; bh=vYClw1nBfnvksGxzNm9Sirm21Qd0CVm/YxCjF5jLLvo=; h=References:In-Reply-To:From:Date:Subject:To:Cc:From; b=VhDFu52ZiV+et4fFWtIbU1mKMJ/X4w3MqUX3iqQBXpYP2agKEz3dZR++tvCLpYL6h r/RDcldDZ241I6X7NtfF9fbRJLc4JGrIBBZXqEMJ7pK9aXdXy0srVRjWhAVq5UIdmU c52e6yCI+8qE8VVAgyOCKDImRDPtMpuA8ycfqTFWJQA4BHpSoid1UZUSItk6ZwROLz vel2ldzBaNOGlum45kJjoE7YtCBYxJX0lO4H1WoogKwIshLXIDRJoUUdDUcVyBps2a IDEzyOz50DyevAjJsys7FGTZL1vebFg4vDaN86foq5fDasp4XiODv7pL6vhyu0YrV8 Utu0vKfJDAstA== Received: by mail-ej1-f41.google.com with SMTP id fy8so34826006ejc.13 for ; Wed, 11 Jan 2023 00:23:39 -0800 (PST) X-Gm-Message-State: AFqh2kodbaK6H8eb1kVYyl90LGXp8CYvZderT4z5a8YqGG3EGJez3HwM L5WG4v382QIu6hQbK3G/gEyrVYnEuf269NvA+RU= X-Received: by 2002:a17:906:66d8:b0:7c4:efcf:3bc8 with SMTP id k24-20020a17090666d800b007c4efcf3bc8mr9059542ejp.702.1673425418007; Wed, 11 Jan 2023 00:23:38 -0800 (PST) MIME-Version: 1.0 References: <20230107133549.4192639-1-guoren@kernel.org> <20230107133549.4192639-6-guoren@kernel.org> In-Reply-To: From: Guo Ren Date: Wed, 11 Jan 2023 16:23:26 +0800 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: [PATCH -next V6 5/7] riscv: ftrace: Make ftrace_caller call ftrace_graph_func To: Evgenii Shatokhin Cc: linux-riscv@lists.infradead.org, linux-kernel@vger.kernel.org, anup@brainfault.org, paul.walmsley@sifive.com, palmer@dabbelt.com, conor.dooley@microchip.com, heiko@sntech.de, rostedt@goodmis.org, mhiramat@kernel.org, jolsa@redhat.com, bp@suse.de, jpoimboe@kernel.org, suagrfillet@gmail.com, andy.chiu@sifive.com, linux@yadro.com Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-7.1 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_HI, SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Jan 11, 2023 at 1:16 AM Evgenii Shatokhin wrote: > > On 07.01.2023 16:35, guoren@kernel.org wrote: > > From: Song Shuai > > > > In order to make the function graph use ftrace directly, ftrace_caller > > should be adjusted to save the necessary regs against the pt_regs layout > > so it can call ftrace_graph_func reasonably. > > > > SAVE_ALL now saves all the regs according to the pt_regs struct. Here > > supersedes SAVE_ALL by SAVE_ABI_REGS which has an extra option to allow > > saving only the necessary ABI-related regs for ftrace_caller. > > > > ftrace_caller and ftrace_regs_caller save their regs with the respective > > option of SAVE_ABI_REGS, then call the tracing function, especially > > graph_ops's ftrace_graph_func. So the ftrace_graph_[regs]_call labels > > aren't needed anymore if FTRACE_WITH_REGS is defined. > > > > As the previous patch described, the ftrace_caller remains with its > > ftrace_graph_call if FTRACE_WITH_REGS isn't defined, > > > > For convenience, the original argument setup for the tracing function in > > ftrace_[regs]_caller is separated as PREPARE_ARGS. > > > > Signed-off-by: Song Shuai > > Tested-by: Guo Ren > > Signed-off-by: Guo Ren > > --- > > arch/riscv/kernel/mcount-dyn.S | 142 ++++++++++++++++++++++++--------- > > 1 file changed, 104 insertions(+), 38 deletions(-) > > > > diff --git a/arch/riscv/kernel/mcount-dyn.S b/arch/riscv/kernel/mcount-dyn.S > > index b75332ced757..d7d4d51b4bd7 100644 > > --- a/arch/riscv/kernel/mcount-dyn.S > > +++ b/arch/riscv/kernel/mcount-dyn.S > > @@ -57,19 +57,52 @@ > > .endm > > > > #ifdef CONFIG_DYNAMIC_FTRACE_WITH_REGS > > - .macro SAVE_ALL > > + > > +/** > > +* SAVE_ABI_REGS - save regs against the pt_regs struct > > +* > > +* @all: tell if saving all the regs > > +* > > +* If all is set, all the regs will be saved, otherwise only ABI > > +* related regs (a0-a7,epc,ra and optional s0) will be saved. > > +* > > +* After the stack is established, > > +* > > +* 0(sp) stores the PC of the traced function which can be accessed > > +* by &(fregs)->regs->epc in tracing function. Note that the real > > +* function entry address should be computed with -FENTRY_RA_OFFSET. > > +* > > +* 8(sp) stores the function return address (i.e. parent IP) that > > +* can be accessed by &(fregs)->regs->ra in tracing function. > > +* > > +* The other regs are saved at the respective localtion and accessed > > +* by the respective pt_regs member. > > +* > > +* Here is the layout of stack for your reference. > > +* > > +* PT_SIZE_ON_STACK -> +++++++++ > > +* + ..... + > > +* + t3-t6 + > > +* + s2-s11+ > > +* + a0-a7 + --++++-> ftrace_caller saved > > +* + s1 + + > > +* + s0 + --+ > > +* + t0-t2 + + > > +* + tp + + > > +* + gp + + > > +* + sp + + > > +* + ra + --+ // parent IP > > +* sp -> + epc + --+ // PC > > +* +++++++++ > > +**/ > > + .macro SAVE_ABI_REGS, all=0 > > addi sp, sp, -PT_SIZE_ON_STACK > > > > REG_S t0, PT_EPC(sp) > > REG_S x1, PT_RA(sp) > > - REG_S x2, PT_SP(sp) > > - REG_S x3, PT_GP(sp) > > - REG_S x4, PT_TP(sp) > > - REG_S x5, PT_T0(sp) > > - REG_S x6, PT_T1(sp) > > - REG_S x7, PT_T2(sp) > > - REG_S x8, PT_S0(sp) > > - REG_S x9, PT_S1(sp) > > + > > + // always save the ABI regs > > + > > REG_S x10, PT_A0(sp) > > REG_S x11, PT_A1(sp) > > REG_S x12, PT_A2(sp) > > @@ -78,6 +111,18 @@ > > REG_S x15, PT_A5(sp) > > REG_S x16, PT_A6(sp) > > REG_S x17, PT_A7(sp) > > + > > + // save the leftover regs > > + > > + .if \all == 1 > > + REG_S x2, PT_SP(sp) > > + REG_S x3, PT_GP(sp) > > + REG_S x4, PT_TP(sp) > > + REG_S x5, PT_T0(sp) > > + REG_S x6, PT_T1(sp) > > + REG_S x7, PT_T2(sp) > > + REG_S x8, PT_S0(sp) > > + REG_S x9, PT_S1(sp) > > REG_S x18, PT_S2(sp) > > REG_S x19, PT_S3(sp) > > REG_S x20, PT_S4(sp) > > @@ -92,19 +137,19 @@ > > REG_S x29, PT_T4(sp) > > REG_S x30, PT_T5(sp) > > REG_S x31, PT_T6(sp) > > + > > + // save s0 if FP_TEST defined > > + > > + .else > > +#ifdef HAVE_FUNCTION_GRAPH_FP_TEST > > + REG_S x8, PT_S0(sp) > > +#endif > > + .endif > > .endm > > > > - .macro RESTORE_ALL > > + .macro RESTORE_ABI_REGS, all=0 > > REG_L t0, PT_EPC(sp) > > REG_L x1, PT_RA(sp) > > - REG_L x2, PT_SP(sp) > > - REG_L x3, PT_GP(sp) > > - REG_L x4, PT_TP(sp) > > - REG_L x5, PT_T0(sp) > > - REG_L x6, PT_T1(sp) > > - REG_L x7, PT_T2(sp) > > - REG_L x8, PT_S0(sp) > > - REG_L x9, PT_S1(sp) > > REG_L x10, PT_A0(sp) > > REG_L x11, PT_A1(sp) > > REG_L x12, PT_A2(sp) > > @@ -113,6 +158,16 @@ > > REG_L x15, PT_A5(sp) > > REG_L x16, PT_A6(sp) > > REG_L x17, PT_A7(sp) > > + > > + .if \all == 1 > > + REG_L x2, PT_SP(sp) > > + REG_L x3, PT_GP(sp) > > + REG_L x4, PT_TP(sp) > > + REG_L x5, PT_T0(sp) > > Same as for the patch #3, please skip "REG_L x5, PT_T0(sp)" here. The > correct value of t0/x5 has already been read from PT_EPC(sp) at this point. Oh, I don't want to do that here. It's a common macro. Because it's a continuous load within the cacheline, I don't think it would cause a performance gap. > > > + REG_L x6, PT_T1(sp) > > + REG_L x7, PT_T2(sp) > > + REG_L x8, PT_S0(sp) > > + REG_L x9, PT_S1(sp) > > REG_L x18, PT_S2(sp) > > REG_L x19, PT_S3(sp) > > REG_L x20, PT_S4(sp) > > @@ -128,10 +183,25 @@ > > REG_L x30, PT_T5(sp) > > REG_L x31, PT_T6(sp) > > > > + .else > > +#ifdef HAVE_FUNCTION_GRAPH_FP_TEST > > + REG_L x8, PT_S0(sp) > > +#endif > > + .endif > > addi sp, sp, PT_SIZE_ON_STACK > > .endm > > + > > + .macro PREPARE_ARGS > > + addi a0, t0, -FENTRY_RA_OFFSET // ip > > + la a1, function_trace_op > > + REG_L a2, 0(a1) // op > > + mv a1, ra // parent_ip > > + mv a3, sp // fregs > > + .endm > > + > > #endif /* CONFIG_DYNAMIC_FTRACE_WITH_REGS */ > > > > +#ifndef CONFIG_DYNAMIC_FTRACE_WITH_REGS > > ENTRY(ftrace_caller) > > SAVE_ABI > > > > @@ -160,33 +230,29 @@ ftrace_graph_call: > > jr t0 > > ENDPROC(ftrace_caller) > > > > -#ifdef CONFIG_DYNAMIC_FTRACE_WITH_REGS > > +#else /* CONFIG_DYNAMIC_FTRACE_WITH_REGS */ > > ENTRY(ftrace_regs_caller) > > - SAVE_ALL > > - > > - addi a0, t0, -FENTRY_RA_OFFSET > > - la a1, function_trace_op > > - REG_L a2, 0(a1) > > - mv a1, ra > > - mv a3, sp > > + SAVE_ABI_REGS 1 > > + PREPARE_ARGS > > > > ftrace_regs_call: > > .global ftrace_regs_call > > call ftrace_stub > > > > -#ifdef CONFIG_FUNCTION_GRAPH_TRACER > > - addi a0, sp, PT_RA > > - REG_L a1, PT_T0(sp) > > - addi a1, a1, -FENTRY_RA_OFFSET > > -#ifdef HAVE_FUNCTION_GRAPH_FP_TEST > > - mv a2, s0 > > -#endif > > -ftrace_graph_regs_call: > > - .global ftrace_graph_regs_call > > - call ftrace_stub > > -#endif > > > > - RESTORE_ALL > > + RESTORE_ABI_REGS 1 > > jr t0 > > ENDPROC(ftrace_regs_caller) > > + > > +ENTRY(ftrace_caller) > > + SAVE_ABI_REGS 0 > > + PREPARE_ARGS > > + > > +ftrace_call: > > + .global ftrace_call > > + call ftrace_stub > > + > > + RESTORE_ABI_REGS 0 > > + jr t0 > > +ENDPROC(ftrace_caller) > > #endif /* CONFIG_DYNAMIC_FTRACE_WITH_REGS */ > > -- > > 2.36.1 > > > > > Regards, > Evgenii > > -- Best Regards Guo Ren