Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752802Ab0KYPC0 (ORCPT ); Thu, 25 Nov 2010 10:02:26 -0500 Received: from casper.infradead.org ([85.118.1.10]:52195 "EHLO casper.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751490Ab0KYPCZ convert rfc822-to-8bit (ORCPT ); Thu, 25 Nov 2010 10:02:25 -0500 Subject: Re: [PATCH 1/2] perf_events: add support for per-cpu per-cgroup monitoring (v5) From: Peter Zijlstra To: Stephane Eranian Cc: linux-kernel@vger.kernel.org, mingo@elte.hu, paulus@samba.org, davem@davemloft.net, fweisbec@gmail.com, perfmon2-devel@lists.sf.net, eranian@gmail.com, robert.richter@amd.com, acme@redhat.com, lizf@cn.fujitsu.com In-Reply-To: References: <4ce510d4.0fedd80a.742c.10b9@mx.google.com> <1290684033.2145.25.camel@laptop> Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 8BIT Date: Thu, 25 Nov 2010 16:02:24 +0100 Message-ID: <1290697344.2145.56.camel@laptop> Mime-Version: 1.0 X-Mailer: Evolution 2.30.3 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2714 Lines: 80 On Thu, 2010-11-25 at 15:51 +0100, Stephane Eranian wrote: > > > On Thu, Nov 25, 2010 at 12:20 PM, Peter Zijlstra wrote: > On Thu, 2010-11-18 at 12:40 +0200, Stephane Eranian wrote: > > @@ -919,6 +945,10 @@ static inline void perf_event_task_sched_in(struct task_struct *task) > > static inline > > void perf_event_task_sched_out(struct task_struct *task, struct task_struct *next) > > { > > +#ifdef CONFIG_CGROUPS > > + atomic_t *cgroup_events = &__get_cpu_var(perf_cgroup_events); > > + COND_STMT(cgroup_events, perf_cgroup_switch(task, next)); > > +#endif > > COND_STMT(&perf_task_events, __perf_event_task_sched_out(task, next)); > > } > > > I don't think that'll actually work, the jump label stuff needs a static > address. > > I did not know that. Yeah, its unfortunate the fallback code doesn't mandate this :/ > Why not simply: s/perf_task_events/perf_sched_events/ and > increment it > for cgroup events as well? > > But you would need to demultiplex. that's not because perf_sched_events is > set that you want BOTH perf_cgroup_switch() AND perf_event_task_sched_out(). The main purpose of the jump-label stuff is to optimize the function call and conditional into the perf code away, the moment we a function call we might as well do everything, at that point its only a single conditional. Jump labels are supposed to work like (they don't actually work like this yet): my_func: asm-foo addr_of_nop: nop5 after_nop: more-asm-foo iret out_of_line: do-special-foo jmp after_nop We then keep a section of tuples: __jump_labels: &perf_task_events,addr_of_nop Then when we flip perf_task_events from 0 -> !0 we rewrite the nop5 at addr_of_nop to "jmp out_of_line" (5 bytes on x86, hence nop5), or the reverse on !0 -> 0. So 1) we need the 'key' (&perf_task_events) to be a static address because the compiler needs to place the address in the special section -- otherwise we can never find the nop location again, this also means per-cpu variables don't make sense, there's only 1 copy of the text. and 2) the moment we take the out-of-line branch we incur the icache hit and already set up a call, so optimizing away another conditional at the cost of an extra function call doesn't really make sense. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/