Received: by 2002:a05:6358:d09b:b0:dc:cd0c:909e with SMTP id jc27csp231402rwb; Tue, 15 Nov 2022 22:50:57 -0800 (PST) X-Google-Smtp-Source: AA0mqf650OBp79w5JQTfd5tjtYfW96RiZmjsO/cgosQpUIOYYipPXcGDxJvuKzBvJn9SazBZNfgu X-Received: by 2002:a17:907:8c06:b0:78d:f3f4:b1a0 with SMTP id ta6-20020a1709078c0600b0078df3f4b1a0mr16588337ejc.489.1668581457192; Tue, 15 Nov 2022 22:50:57 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1668581457; cv=none; d=google.com; s=arc-20160816; b=DRaM8TR2LSKHA4pkBobA3TkYuXMqZFm2OGlNAYrrZTCFR/vTbeeZR7CQUQgTbRGpeu gmFD/l5STCBeh0/NPNfk1eVnPSqkesWMwjm34vOFdIobhpSnxx46L2twQoogO56nPv6s +ste41ORTAmT1JWcy1l33clfVMnCacm1WMTxW9uYIGpDajh+iyhrnQ9YZFtjJGoxQX9j yOrfeCI/9uqW+M+exDzV67nG+m/AdoW8JV6QwVgzLBtdLoBRb+0HsnIVh4G9PNLew0FY AN1m5RIuig/EicEy/EHrjUE/Scxq01dmUtbe7oxDaBtIqNcniLGBdfNIANN/zqxHGNTe Zieg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=pLtcBCGQN71GjgpsxnNgWyg1OzsqYwxTfhladZ91Ipk=; b=vASjgSzj6xjDN34xb1fp1ZNTACfMwQdpj16E/3wcPmLxTXOD0fWa+xCcNwCspiASpu /vcOq3qaMQYkuNqZZuESAVwQYRrBkM2d/iFMmQsV86h1Tbg5Aktv+dIIx7+I0IaGQmf7 86euT1tA1DFj1NGYTicfLPxTdo98NOhddW5z9goTqLm49x/V6qEbDeKk1Wyqsrxg1HYS VZ33JmQD9kgoXI4loqdFodHM1dIMX9K2/iQAf22CykLWW+s2K4aMfW8pl25Y1jWYTGxc aSn7HJt4AOdCbUppfehgW5Y7cpn5cZpjctg2LPMJCppQ3Pd44IqGoOffgWj0p3H2pewL ZWtQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=SXvogX1Z; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id w10-20020a056402070a00b0045cf3759408si12781449edx.49.2022.11.15.22.50.35; Tue, 15 Nov 2022 22:50:57 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=SXvogX1Z; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232623AbiKPGau (ORCPT + 90 others); Wed, 16 Nov 2022 01:30:50 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:47238 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232447AbiKPGaX (ORCPT ); Wed, 16 Nov 2022 01:30:23 -0500 Received: from dfw.source.kernel.org (dfw.source.kernel.org [IPv6:2604:1380:4641:c500::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 6BF501EEE5 for ; Tue, 15 Nov 2022 22:30:21 -0800 (PST) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id 0353B6185D for ; Wed, 16 Nov 2022 06:30:21 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 56A8EC433B5 for ; Wed, 16 Nov 2022 06:30:20 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1668580220; bh=qJbwjn+qU2PFGQkwTNsWF6gjGd7xuEf1eyUCDUVmfco=; h=References:In-Reply-To:From:Date:Subject:To:Cc:From; b=SXvogX1ZFKley+fp3nLCc4gpojR+08/s1hxgetU930+zhYJJXZWOAu4WVUjM4rRWE YGxJ+rmiJfMwN0iLmTvNIN4gho0j1NIvpJ74j2QeWN7VrIyFq6xQD95kK3+5TQFvwd h1WXQqkMsVjgIxfQc97JDTIaj+Hd+7VnuMdpyglBkaQtDXT0ZRNOaHflaF39aUgjW/ nQCNF4GRHUD09ZTPU1YqW3WEE8QAaryN0BrGG5NS+YvczkjJrqA0G1b4dPeMYAxZBW 6aQFex57oTZznWbOD+IQ/5SwJuK0sVgt2V9wOiM7cQR77+Hu8ng0zd6vJa2JDUfaUv xOMDmfZ+WR4Xg== Received: by mail-ed1-f47.google.com with SMTP id s12so25089956edd.5 for ; Tue, 15 Nov 2022 22:30:20 -0800 (PST) X-Gm-Message-State: ANoB5pl4B0eDRwS7G/36VPmJ0OKCxuTM4YZhLi8CXIEr8XF/6MDMntcc MvELgtURKUicxfRc/80kWQ9+v78mPOuIveEvab8= X-Received: by 2002:a05:6402:2903:b0:467:65a2:f635 with SMTP id ee3-20020a056402290300b0046765a2f635mr17510237edb.106.1668580218523; Tue, 15 Nov 2022 22:30:18 -0800 (PST) MIME-Version: 1.0 References: <20221116025324.1624-1-zhangqing@loongson.cn> <20221116025324.1624-2-zhangqing@loongson.cn> In-Reply-To: <20221116025324.1624-2-zhangqing@loongson.cn> From: Huacai Chen Date: Wed, 16 Nov 2022 14:30:06 +0800 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: [PATCH v7 1/9] LoongArch/ftrace: Add basic support To: Qing Zhang Cc: Steven Rostedt , Ingo Molnar , loongarch@lists.linux.dev, linux-kernel@vger.kernel.org, Jinyang He Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-7.1 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_HI, SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, Qing, On Wed, Nov 16, 2022 at 10:53 AM Qing Zhang wrote: > > This patch contains basic ftrace support for LoongArch. > Specifically, function tracer (HAVE_FUNCTION_TRACER), function graph > tracer (HAVE_FUNCTION_GRAPH_TRACER) are implemented following the > instructions in Documentation/trace/ftrace-design.txt. > > Use `-pg` makes stub like a child function `void _mcount(void *ra)`. > Thus, it can be seen store RA and open stack before `call _mcount`. > Find `open stack` at first, and then find `store RA` > > Note that the functions in both inst.c and time.c should not be > hooked with the compiler's -pg option: to prevent infinite self- > referencing for the former, and to ignore early setup stuff for the > latter. > > Co-developed-by: Jinyang He > Signed-off-by: Jinyang He > Signed-off-by: Qing Zhang > --- > arch/loongarch/Kconfig | 2 + > arch/loongarch/Makefile | 5 ++ > arch/loongarch/include/asm/ftrace.h | 18 ++++++ > arch/loongarch/kernel/Makefile | 8 +++ > arch/loongarch/kernel/ftrace.c | 74 +++++++++++++++++++++++ > arch/loongarch/kernel/mcount.S | 94 +++++++++++++++++++++++++++++ > 6 files changed, 201 insertions(+) > create mode 100644 arch/loongarch/include/asm/ftrace.h > create mode 100644 arch/loongarch/kernel/ftrace.c > create mode 100644 arch/loongarch/kernel/mcount.S > > diff --git a/arch/loongarch/Kconfig b/arch/loongarch/Kconfig > index 1943f840e494..92c4ec8c8527 100644 > --- a/arch/loongarch/Kconfig > +++ b/arch/loongarch/Kconfig > @@ -91,6 +91,8 @@ config LOONGARCH > select HAVE_EBPF_JIT > select HAVE_EXIT_THREAD > select HAVE_FAST_GUP > + select HAVE_FUNCTION_GRAPH_TRACER > + select HAVE_FUNCTION_TRACER > select HAVE_GENERIC_VDSO > select HAVE_IOREMAP_PROT > select HAVE_IRQ_EXIT_ON_IRQ_STACK > diff --git a/arch/loongarch/Makefile b/arch/loongarch/Makefile > index a0fc1f9980e3..6832a8f891fd 100644 > --- a/arch/loongarch/Makefile > +++ b/arch/loongarch/Makefile > @@ -36,6 +36,11 @@ ifneq ($(SUBARCH),$(ARCH)) > endif > endif > > +ifdef CONFIG_DYNAMIC_FTRACE > +KBUILD_CPPFLAGS += -DCC_USING_PATCHABLE_FUNCTION_ENTRY > +CC_FLAGS_FTRACE := -fpatchable-function-entry=2 > +endif I think these lines should be moved to Patch-3. Huacai > + > ifdef CONFIG_64BIT > ld-emul = $(64bit-emul) > cflags-y += -mabi=lp64s > diff --git a/arch/loongarch/include/asm/ftrace.h b/arch/loongarch/include/asm/ftrace.h > new file mode 100644 > index 000000000000..6a3e76234618 > --- /dev/null > +++ b/arch/loongarch/include/asm/ftrace.h > @@ -0,0 +1,18 @@ > +/* SPDX-License-Identifier: GPL-2.0 */ > +/* > + * Copyright (C) 2022 Loongson Technology Corporation Limited > + */ > + > +#ifndef _ASM_LOONGARCH_FTRACE_H > +#define _ASM_LOONGARCH_FTRACE_H > + > +#ifdef CONFIG_FUNCTION_TRACER > +#define MCOUNT_INSN_SIZE 4 /* sizeof mcount call */ > + > +#ifndef __ASSEMBLY__ > +extern void _mcount(void); > +#define mcount _mcount > + > +#endif /* __ASSEMBLY__ */ > +#endif /* CONFIG_FUNCTION_TRACER */ > +#endif /* _ASM_LOONGARCH_FTRACE_H */ > diff --git a/arch/loongarch/kernel/Makefile b/arch/loongarch/kernel/Makefile > index 86744531b100..3f71bce1c7ce 100644 > --- a/arch/loongarch/kernel/Makefile > +++ b/arch/loongarch/kernel/Makefile > @@ -15,6 +15,14 @@ obj-$(CONFIG_EFI) += efi.o > > obj-$(CONFIG_CPU_HAS_FPU) += fpu.o > > +ifdef CONFIG_FUNCTION_TRACER > +obj-y += mcount.o ftrace.o > +CFLAGS_REMOVE_ftrace.o = $(CC_FLAGS_FTRACE) > +CFLAGS_REMOVE_inst.o = $(CC_FLAGS_FTRACE) > +CFLAGS_REMOVE_time.o = $(CC_FLAGS_FTRACE) > +CFLAGS_REMOVE_perf_event.o = $(CC_FLAGS_FTRACE) > +endif > + > obj-$(CONFIG_MODULES) += module.o module-sections.o > obj-$(CONFIG_STACKTRACE) += stacktrace.o > > diff --git a/arch/loongarch/kernel/ftrace.c b/arch/loongarch/kernel/ftrace.c > new file mode 100644 > index 000000000000..c8ddc5f11f32 > --- /dev/null > +++ b/arch/loongarch/kernel/ftrace.c > @@ -0,0 +1,74 @@ > +// SPDX-License-Identifier: GPL-2.0 > +/* > + * Copyright (C) 2022 Loongson Technology Corporation Limited > + */ > + > +#include > +#include > +#include > +#include > + > +#include > +#include > +#include > +#include > +#include > +#include > +#include > + > +#include > + > +#ifdef CONFIG_FUNCTION_GRAPH_TRACER > + > +/* > + * As `call _mcount` follows LoongArch psABI, ra-saved operation and > + * stack operation can be found before this insn. > + */ > + > +static int ftrace_get_parent_ra_addr(unsigned long insn_addr, int *ra_off) > +{ > + union loongarch_instruction *insn; > + int limit = 32; > + > + insn = (union loongarch_instruction *)insn_addr; > + > + do { > + insn--; > + limit--; > + > + if (is_ra_save_ins(insn)) > + *ra_off = -((1 << 12) - insn->reg2i12_format.immediate); > + > + } while (!is_stack_alloc_ins(insn) && limit); > + > + if (!limit) > + return -EINVAL; > + > + return 0; > +} > + > +void prepare_ftrace_return(unsigned long self_addr, > + unsigned long callsite_sp, unsigned long old) > +{ > + int ra_off; > + unsigned long return_hooker = (unsigned long)&return_to_handler; > + > + if (unlikely(ftrace_graph_is_dead())) > + return; > + > + if (unlikely(atomic_read(¤t->tracing_graph_pause))) > + return; > + > + if (ftrace_get_parent_ra_addr(self_addr, &ra_off)) > + goto out; > + > + if (!function_graph_enter(old, self_addr, 0, NULL)) > + *(unsigned long *)(callsite_sp + ra_off) = return_hooker; > + > + return; > + > +out: > + ftrace_graph_stop(); > + WARN_ON(1); > +} > +#endif /* CONFIG_FUNCTION_GRAPH_TRACER */ > diff --git a/arch/loongarch/kernel/mcount.S b/arch/loongarch/kernel/mcount.S > new file mode 100644 > index 000000000000..3de7c2d7fd12 > --- /dev/null > +++ b/arch/loongarch/kernel/mcount.S > @@ -0,0 +1,94 @@ > +/* SPDX-License-Identifier: GPL-2.0 */ > +/* > + * LoongArch specific _mcount support > + * > + * Copyright (C) 2022 Loongson Technology Corporation Limited > + */ > + > +#include > +#include > +#include > +#include > + > + .text > + > +#define MCOUNT_STACK_SIZE (2 * SZREG) > +#define MCOUNT_S0_OFFSET (0) > +#define MCOUNT_RA_OFFSET (SZREG) > + > + .macro MCOUNT_SAVE_REGS > + PTR_ADDI sp, sp, -MCOUNT_STACK_SIZE > + PTR_S s0, sp, MCOUNT_S0_OFFSET > + PTR_S ra, sp, MCOUNT_RA_OFFSET > + move s0, a0 > + .endm > + > + .macro MCOUNT_RESTORE_REGS > + move a0, s0 > + PTR_L ra, sp, MCOUNT_RA_OFFSET > + PTR_L s0, sp, MCOUNT_S0_OFFSET > + PTR_ADDI sp, sp, MCOUNT_STACK_SIZE > + .endm > + > + > +SYM_FUNC_START(_mcount) > + la.pcrel t1, ftrace_stub > + la.pcrel t2, ftrace_trace_function /* Prepare t2 for (1) */ > + PTR_L t2, t2, 0 > + beq t1, t2, fgraph_trace > + > + MCOUNT_SAVE_REGS > + > + move a0, ra /* arg0: self return address */ > + move a1, s0 /* arg1: parent's return address */ > + jirl ra, t2, 0 /* (1) call *ftrace_trace_function */ > + > + MCOUNT_RESTORE_REGS > + > +fgraph_trace: > +#ifdef CONFIG_FUNCTION_GRAPH_TRACER > + la.pcrel t1, ftrace_stub > + la.pcrel t3, ftrace_graph_return > + PTR_L t3, t3, 0 > + bne t1, t3, ftrace_graph_caller > + la.pcrel t1, ftrace_graph_entry_stub > + la.pcrel t3, ftrace_graph_entry > + PTR_L t3, t3, 0 > + bne t1, t3, ftrace_graph_caller > +#endif > + > + .globl ftrace_stub > +ftrace_stub: > + jr ra > +SYM_FUNC_END(_mcount) > +EXPORT_SYMBOL(_mcount) > + > +#ifdef CONFIG_FUNCTION_GRAPH_TRACER > +SYM_FUNC_START(ftrace_graph_caller) > + MCOUNT_SAVE_REGS > + > + PTR_ADDI a0, ra, -4 /* arg0: Callsite self return addr */ > + PTR_ADDI a1, sp, MCOUNT_STACK_SIZE /* arg1: Callsite sp */ > + move a2, s0 /* arg2: Callsite parent ra */ > + bl prepare_ftrace_return > + > + MCOUNT_RESTORE_REGS > + jr ra > +SYM_FUNC_END(ftrace_graph_caller) > + > +SYM_FUNC_START(return_to_handler) > + PTR_ADDI sp, sp, -2 * SZREG > + PTR_S a0, sp, 0 > + PTR_S a1, sp, SZREG > + > + bl ftrace_return_to_handler > + > + /* restore the real parent address: a0 -> ra */ > + move ra, a0 > + > + PTR_L a0, sp, 0 > + PTR_L a1, sp, SZREG > + PTR_ADDI sp, sp, 2 * SZREG > + jr ra > +SYM_FUNC_END(return_to_handler) > +#endif /* CONFIG_FUNCTION_GRAPH_TRACER */ > -- > 2.36.0 > >