Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1758777AbZCYKp3 (ORCPT ); Wed, 25 Mar 2009 06:45:29 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1755193AbZCYKpU (ORCPT ); Wed, 25 Mar 2009 06:45:20 -0400 Received: from metis.ext.pengutronix.de ([92.198.50.35]:36770 "EHLO metis.ext.pengutronix.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754880AbZCYKpS (ORCPT ); Wed, 25 Mar 2009 06:45:18 -0400 Date: Wed, 25 Mar 2009 11:45:05 +0100 From: Uwe =?iso-8859-1?Q?Kleine-K=F6nig?= To: Russell King - ARM Linux Cc: Ingo Molnar , Frederic Weisbecker , Abhishek Sagar , Tim Bird , linux-arm-kernel , linux kernel , Steven Rostedt , Ingo Molnar Subject: Re: Anyone working on ftrace function graph support on ARM? Message-ID: <20090325104505.GA27803@pengutronix.de> References: <49C936CA.8070800@am.sony.com> <20090324213618.GC5975@nowhere> <49C95EAF.7030901@gmail.com> <20090324224857.GE5975@nowhere> <20090325084248.GF4697@n2100.arm.linux.org.uk> <20090325085418.GA2341@elte.hu> <20090325095751.GA31464@n2100.arm.linux.org.uk> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20090325095751.GA31464@n2100.arm.linux.org.uk> User-Agent: Mutt/1.5.18 (2008-05-17) X-SA-Exim-Connect-IP: 2001:6f8:1178:2:215:17ff:fe12:23b0 X-SA-Exim-Mail-From: ukl@pengutronix.de X-SA-Exim-Scanned: No (on metis.ext.pengutronix.de); SAEximRunCond expanded to false X-PTX-Original-Recipient: linux-kernel@vger.kernel.org Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3726 Lines: 106 Hi Russell, On Wed, Mar 25, 2009 at 09:57:51AM +0000, Russell King - ARM Linux wrote: > On Wed, Mar 25, 2009 at 09:54:18AM +0100, Ingo Molnar wrote: > > Unwinding is not realistic or desired for the function tracer - it > > runs in every kernel function so performance is paramount. > > Which would also include the unwinder itself as well. > > > So, if i understood you correctly, an OABI_COMPAT and FRAME_POINTERS > > dependency has to be added to the ARM HAVE_FUNCTION_GRAPH_TRACER > > Kconfig rule. > > If we have frame pointers enabled with EABI, then it looks like it will > work as well. So the dependency should be on FRAME_POINTERS for _every_ > feature using the mcount code. > > Hmm, and it looks like the ftrace code is rather crap: > > ENTRY(mcount) > stmdb sp!, {r0-r3, lr} > ldr r0, =ftrace_trace_function > ldr r2, [r0] > adr r0, ftrace_stub > cmp r0, r2 > bne trace > ldr lr, [fp, #-4] @ restore lr > ldmia sp!, {r0-r3, pc} > > trace: > ldr r1, [fp, #-4] @ lr of instrumented routine > mov r0, lr > sub r0, r0, #MCOUNT_INSN_SIZE > mov lr, pc > mov pc, r2 > XXX calling a C function results in r0-r3,ip,lr being clobbered XXX > > mov lr, r1 @ restore lr > XXX not necessarily, r1 might be some other random value > > ldmia sp!, {r0-r3, pc} > > In fact, to me the above code looks totally crap, because it's checking > whether the caller is 'ftrace_stub'. It can never be 'ftrace_stub' > because that is an assembly function: > > .globl ftrace_stub > ftrace_stub: > mov pc, lr > > and therefore gcc has no hand in adding a mcount call to it. Hhhm. Isn't the equivalent C-Code ~ as follows: if (ftrace_trace_function != ftrace_stub) trace(some, args); return; ? ftrace_trace_function is initialised to ftrace_stub at compile time and is changed when a tracerfunction is registered. > Moreover, the _dynamic_ ftrace code does this: > > ENTRY(mcount) > stmdb sp!, {r0-r3, lr} > mov r0, lr > sub r0, r0, #MCOUNT_INSN_SIZE > > .globl mcount_call > mcount_call: > bl ftrace_stub > ldr lr, [fp, #-4] @ restore lr > ldmia sp!, {r0-r3, pc} > > ENTRY(ftrace_caller) > stmdb sp!, {r0-r3, lr} > ldr r1, [fp, #-4] > mov r0, lr > sub r0, r0, #MCOUNT_INSN_SIZE > > .globl ftrace_call > ftrace_call: > bl ftrace_stub > ldr lr, [fp, #-4] @ restore lr > ldmia sp!, {r0-r3, pc} > > In other words, it pushes some words onto the stack, sets r0, calls > an assembly function which does nothing but just returns, reloads lr, > restores the stack and returns. This ftrace implementation looks like > an exercise in slowing down execution to me with no added value. The idea is that the instruction at address mcount_call (and ftrace_call IIRC) is rewritten at run time. Still the code is not active currently (because CONFIG_DYNAMIC_FTRACE isn't selectable for ARM) and needs some care anyhow on reactivation because the way how dynamic ftrace works changed somehow. Didn't look at it up to now though. Best regards Uwe -- Pengutronix e.K. | Uwe Kleine-K?nig | Industrial Linux Solutions | http://www.pengutronix.de/ | -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/