Received: by 2002:a25:e74b:0:0:0:0:0 with SMTP id e72csp1677018ybh; Tue, 14 Jul 2020 04:33:37 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyhF+Pv9waKgofu06ynCg5gMpN4Rv11/UtkAUCRrWsBjjycLjSxr2JnT1yZQqRpq4BdYvaY X-Received: by 2002:a17:906:1356:: with SMTP id x22mr4164447ejb.429.1594726417556; Tue, 14 Jul 2020 04:33:37 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1594726417; cv=none; d=google.com; s=arc-20160816; b=WQ398T8Plo2Rg36kEELicDwl9xGOZCvbJN4IG1JevhNgJwyf0xiWxE7bD25xPJyBcH OesaLmaKTcJ89d7ZE9JpkW3WUEQAdaY3OmrLlRBBhvNzsdxNOueIGVpVRE8Pg6oZF6SJ WVuAS7n1OZA1wwyCQr5AuoI7hk3n9epCARFM1sQKdo4NUx6raMNGLAByNthd/iRKlnbg HWYZlHm8TWt/lLQzzTAGHD/9+HTsdcbgujxcZVddyOEhrS6xGcTjxk4J5OTdybxYKwVz //w5ui/1AwkCnxF2N6pfQM6zV8aS0Lkwr337RY/Rcpv0eBq5XGRbEYCHxJhe98StjFCk ghOg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:subject:cc:to:from:date :dkim-signature; bh=Ov9nlABnnVV10+XQxDIZAaHrUxgsKgdWORvJq3UttGw=; b=J1ZBA5UJZTGM53GuMjO/7nAaNQgz1p/AAymJXxLco6ic/si5e1bH2Pi7Jrv06naDdh dAIHW6iDtxbywuVZmS2C9WGq6XKm+9GQNWcYaHVTzvgFnPP9nHfpGnoiV0nK9hgF5h1q mlz3AbhF+3hcv5eHdRWblzRyUS7Ta8Dy6CGuCtbYR8GGJjEhHA8xmn+oTBfAqZlXkhlS SjFizJYq23ZNcLVzgKF1hZleb82cQnEaFxjdkT7rX2QKtL0s0TQtpB0JJdjzzPWdxLMW wN9YnooSOfZESXqhb54WiRMNRKz7+vUnEaweHWKbTWYwq3pQwwgqMlydk75mRrK9gyRB CIPg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=K9OxxVXF; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id m16si4572063ejb.364.2020.07.14.04.33.10; Tue, 14 Jul 2020 04:33:37 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=K9OxxVXF; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727121AbgGNLcR (ORCPT + 99 others); Tue, 14 Jul 2020 07:32:17 -0400 Received: from mail.kernel.org ([198.145.29.99]:46808 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726332AbgGNLcQ (ORCPT ); Tue, 14 Jul 2020 07:32:16 -0400 Received: from devnote2 (NE2965lan1.rev.em-net.ne.jp [210.141.244.193]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id ED7E622203; Tue, 14 Jul 2020 11:32:11 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1594726335; bh=TtYXJCaAfVAvmXpndjUmq8mNbDnr+kPGfgYNHKOGZfw=; h=Date:From:To:Cc:Subject:In-Reply-To:References:From; b=K9OxxVXF0L5Tjod5E1qIyBeJa5zwBm4VZFHIxJUcNK2+Gu0v0neorOgsLghlxU5XY URaH5+u4Sz2oza/Acl3WTQ47DDv7ccQ1D6TA44pgArxCcjzH3E7g2hJIohyyOwRvhB BWbYBouIxFi4uOYS+4p1f8P7YxXfwPMztvQAITgk= Date: Tue, 14 Jul 2020 20:32:09 +0900 From: Masami Hiramatsu To: Guo Ren Cc: Palmer Dabbelt , Paul Walmsley , Anup Patel , Greentime Hu , Zong Li , Patrick =?UTF-8?B?U3TDpGhsaW4=?= , =?UTF-8?B?QmrDtnJuIFTDtnBlbA==?= , Atish Patra , penberg@kernel.org, linux-riscv , Linux Kernel Mailing List , linux-csky@vger.kernel.org, Guo Ren Subject: Re: [PATCH v2 6/6] riscv: Add KPROBES_ON_FTRACE supported Message-Id: <20200714203209.29d3a3b967238f0cf04c4c6e@kernel.org> In-Reply-To: References: <1594261154-69745-1-git-send-email-guoren@kernel.org> <1594261154-69745-7-git-send-email-guoren@kernel.org> <20200710225017.5ce329485e911f99e17cd483@kernel.org> <20200712223748.9cb00fdf7938fbf7353c11cc@kernel.org> X-Mailer: Sylpheed 3.7.0 (GTK+ 2.24.32; x86_64-pc-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Guo, On Tue, 14 Jul 2020 07:47:20 +0800 Guo Ren wrote: > Hi Masami, > > On Sun, Jul 12, 2020 at 9:37 PM Masami Hiramatsu wrote: > > > > On Sat, 11 Jul 2020 09:32:01 +0800 > > Guo Ren wrote: > > > > > Thx Masami, > > > > > > On Fri, Jul 10, 2020 at 9:50 PM Masami Hiramatsu wrote: > > > > > > > > Hi Guo, > > > > > > > > On Thu, 9 Jul 2020 02:19:14 +0000 > > > > guoren@kernel.org wrote: > > > > > > > > > +/* Ftrace callback handler for kprobes -- called under preepmt disabed */ > > > > > +void kprobe_ftrace_handler(unsigned long ip, unsigned long parent_ip, > > > > > + struct ftrace_ops *ops, struct pt_regs *regs) > > > > > +{ > > > > > + struct kprobe *p; > > > > > + struct kprobe_ctlblk *kcb; > > > > > + > > > > > + p = get_kprobe((kprobe_opcode_t *)ip); > > > > > + if (unlikely(!p) || kprobe_disabled(p)) > > > > > + return; > > > > > + > > > > > + kcb = get_kprobe_ctlblk(); > > > > > + if (kprobe_running()) { > > > > > + kprobes_inc_nmissed_count(p); > > > > > + } else { > > > > > + /* > > > > > + * The regs->epc hasn't been saved by SAVE_ALL in mcount-dyn.S > > > > > + * So no need to resume it, just for kprobe handler. > > > > > + */ > > > > > + instruction_pointer_set(regs, ip); > > > > > + __this_cpu_write(current_kprobe, p); > > > > > + kcb->kprobe_status = KPROBE_HIT_ACTIVE; > > > > > + if (!p->pre_handler || !p->pre_handler(p, regs)) { > > > > > + /* > > > > > + * Emulate singlestep (and also recover regs->pc) > > > > > + * as if there is a nop > > > > > + */ > > > > > + instruction_pointer_set(regs, > > > > > + (unsigned long)p->addr + MCOUNT_INSN_SIZE); > > > > > + if (unlikely(p->post_handler)) { > > > > > + kcb->kprobe_status = KPROBE_HIT_SSDONE; > > > > > + p->post_handler(p, regs, 0); > > > > > + } > > > > > > > > Hmm, don't you need restoring the previous instruction pointer here? > > > look at riscv mcount-dyn.S SAVE_ALL function, sp frame lay out like this: > > > ----------------------- > > > | return address | > > > ----------------------- > > > | frame pointer | > > > ----------------------- > > > | pt_regs x1-x31| > > > ----------------------- > > > It's not a complete pt_regs for the handler, so modifing regs->ip is no use. > > > > Yes, that is same on x86. But ftrace regs_caller on x86-64 modifies the > > return address on the stack by regs->ip. > > > > See arch/x86/kernel/ftrace_64.S: > > > > ----- > > SYM_INNER_LABEL(ftrace_regs_call, SYM_L_GLOBAL) > > call ftrace_stub > > > > /* Copy flags back to SS, to restore them */ > > movq EFLAGS(%rsp), %rax > > movq %rax, MCOUNT_REG_SIZE(%rsp) > > > > /* Handlers can change the RIP */ > > movq RIP(%rsp), %rax > > movq %rax, MCOUNT_REG_SIZE+8(%rsp) > > ----- > Yes, and in save_mcount_regs, it prepare the origin pt_regs RIP. > > /* Move RIP to its proper location */ > movq MCOUNT_REG_SIZE+\added(%rsp), %rdi > movq %rdi, RIP(%rsp) > > I've fixed up riscv's: [1] > [1]: https://lore.kernel.org/linux-riscv/1594683562-68149-4-git-send-email-guoren@kernel.org/T/#u OK, looks good :) > > > > The last part update the return address on the stack. > > > > > > If you don't support modifying the instruction pointer in the handler, > > > We can modify ip like this if necessary: > > > *(unsigned long *)((unsigned long)regs + sizeof(struct pt_regs) + 8) = xxx; > > > > > > > it must not be compatible with kprobes. > > > Why, can you show related codes? thank you very much. > > > > The "Changing Execution Path" section in the Documentation/kprobes.txt said > > > > ----- > > Since kprobes can probe into a running kernel code, it can change the > > register set, including instruction pointer. > > ----- > Got it, thx for correct me. > > > > > As you can see, this is the expected behavior on kprobes. > > Since user will not know the kprobes is working on a breakpoint or > > ftrace, we have to support this behavior transparently. So, could you > > enable this feature at first on ftrace@RISCV? > Ditto > > > > > > > > > > > > > > Now BPF function override and function error injection depends on > > > > this behevior, so could you consider to support it in the "ftrace" > > > > implementation at first? (And if it is enabled, you can enable the > > > > livepatch on RISCV too) > > > Great message! > > > > > > But can you show me codes that bpf and err-jnject using the behavior? Thx > > > > In kernel/fail_function.c > Nice tip, thx. And I've tested err-jnject: [2] > [2]: https://lore.kernel.org/linux-riscv/1594683562-68149-4-git-send-email-guoren@kernel.org/T/#md66e4e58f3463c1369e11ab9e5d646343f7d250d Good! Thanks for porting it! > > > > > ----- > > static int fei_kprobe_handler(struct kprobe *kp, struct pt_regs *regs) > > { > > struct fei_attr *attr = container_of(kp, struct fei_attr, kp); > > > > if (should_fail(&fei_fault_attr, 1)) { > > regs_set_return_value(regs, attr->retval); > > override_function_with_return(regs); > > return 1; > > } > > > > return 0; > > } > > ----- > > > > And follow the code related to CONFIG_BPF_KPROBE_OVERRIDE in > > kernel/trace/bpf_trace.c. (doing similar thing, but fail_function.c > > is simpler.) > Nice tip, thx > > -- > Best Regards > Guo Ren > > ML: https://lore.kernel.org/linux-csky/ -- Masami Hiramatsu