Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753651AbZCUBcf (ORCPT ); Fri, 20 Mar 2009 21:32:35 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751735AbZCUBc0 (ORCPT ); Fri, 20 Mar 2009 21:32:26 -0400 Received: from mx2.redhat.com ([66.187.237.31]:37087 "EHLO mx2.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751674AbZCUBcZ (ORCPT ); Fri, 20 Mar 2009 21:32:25 -0400 Message-ID: <49C443E1.6070400@redhat.com> Date: Fri, 20 Mar 2009 21:33:21 -0400 From: Masami Hiramatsu User-Agent: Thunderbird 2.0.0.19 (X11/20090105) MIME-Version: 1.0 To: Ingo Molnar , Steven Rostedt , Ananth N Mavinakayanahalli , Frederic Weisbecker CC: LKML , systemtap-ml Subject: [RFC][PATCH -tip 0/5 V2] tracing: kprobe-based event tracer X-Enigmail-Version: 0.95.7 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 4516 Lines: 119 Hi, This is a series of patches which introduce a proof-of concept of kprobe-based event tracer for x86. I decided to start it from minimum support, so I shrank down the patchset to support only x86(-32/-64). (If someone is interested in porting this to other architectures, I'd happy to help :)) This can be applied on the linux-2.6-tip tree. This patchset includes following changes: - Add kprobe-tracer plugin [1/5, 3/5, 5/5] - Fix kernel_trap_sp() on x86 according to systemtap runtime. [2/5] - Add module_*probe api for repawning/removing kprobes when target module is coming/going. [4/5] Future items: - Enhance module_*probe for batch unregistering support. - Add kernel_trap_sp() and fetch_*() on other archs. - Support name-based register fetching (ax, bx, and so on) - Support symbol-based memory fetching (for global variables) - Support primitive types(long, ulong, int, uint, etc) for args. - Support indirect memory fetch from registers etc. - Check insertion point safety by using instruction decoder. We may need to separate above arch-dependent fetching infrastructure. kprobe-based event tracer --------------------------- This tracer is similar to the events tracer which is based on Tracepoint infrastructure. Instead of Tracepoint, this tracer is based on kprobes(kprobe and kretprobe). It probes anywhere where kprobes can probe(this means, all functions body except for __kprobes functions). Unlike the function tracer, this tracer can probe instructions inside of kernel functions. It allows you to check which instruction has been executed. Unlike the Tracepoint based events tracer, this tracer can add new probe points on the fly. Similar to the events tracer, this tracer doesn't need to be activated via current_tracer, instead of that, just set probe points via /debug/tracing/kprobe_probes. Synopsis of kprobe_probes: p SYMBOL[+offs|-offs]|MEMADDR [FETCHARGS] : set a probe r SYMBOL[+0] [FETCHARGS] : set a return probe FETCHARGS: rN : Fetch Nth register (N >= 0) sN : Fetch Nth entry of stack (N >= 0) mADDR : Fetch memory at ADDR (ADDR should be in kernel) aN : Fetch function argument. (N >= 1)(*) rv : Fetch return value.(**) rp : Fetch return address.(**) (*) aN may not correct on asmlinkaged functions and at function body. (**) only for return probe. E.g. echo p do_sys_open a1 a2 a3 a4 > /debug/tracing/kprobe_probes This sets a kprobe on the top of do_sys_open() function with recording 1st to 4th arguments. echo r do_sys_open rv rp >> /debug/tracing/kprobe_probes This sets a kretprobe on the return point of do_sys_open() function with recording return value and return address. echo > /debug/tracing/kprobe_probes This clears all probe points. and you can see the traced information via /debug/tracing/trace. echo /debug/tracing/trace # tracer: nop # # TASK-PID CPU# TIMESTAMP FUNCTION # | | | | | <...>-2376 [001] 262.389131: do_sys_open: @do_sys_open+0 0xffffff9c 0x98db83e 0x8880 0x0 <...>-2376 [001] 262.391166: sys_open: <-do_sys_open+0 0x5 0xc06e8ebb <...>-2376 [001] 264.384876: do_sys_open: @do_sys_open+0 0xffffff9c 0x98db83e 0x8880 0x0 <...>-2376 [001] 264.386880: sys_open: <-do_sys_open+0 0x5 0xc06e8ebb <...>-2084 [001] 265.380330: do_sys_open: @do_sys_open+0 0xffffff9c 0x804be3e 0x0 0x1b6 <...>-2084 [001] 265.380399: sys_open: <-do_sys_open+0 0x3 0xc06e8ebb @SYMBOL means that kernel hits a probe, and <-SYMBOL means kernel returns from SYMBOL(e.g. "sys_open: <-do_sys_open+0" means kernel returns from do_sys_open to sys_open). Documentation/ftrace.txt | 66 ++++ arch/x86/include/asm/ptrace.h | 4 +- include/linux/kprobes.h | 44 +++ kernel/kprobes.c | 252 +++++++++++++++ kernel/trace/Kconfig | 9 + kernel/trace/Makefile | 1 + kernel/trace/trace_kprobe.c | 693 +++++++++++++++++++++++++++++++++++++++++ 7 files changed, 1067 insertions(+), 2 deletions(-) Thank you, -- Masami Hiramatsu Software Engineer Hitachi Computer Products (America) Inc. Software Solutions Division e-mail: mhiramat@redhat.com -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/