Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752397AbdHDOxc (ORCPT ); Fri, 4 Aug 2017 10:53:32 -0400 Received: from bombadil.infradead.org ([65.50.211.133]:44397 "EHLO bombadil.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751921AbdHDOxa (ORCPT ); Fri, 4 Aug 2017 10:53:30 -0400 Date: Fri, 4 Aug 2017 16:53:24 +0200 From: Peter Zijlstra To: Alexey Budankov Cc: Ingo Molnar , Arnaldo Carvalho de Melo , Alexander Shishkin , Andi Kleen , Kan Liang , Dmitri Prokhorov , Valery Cherepennikov , Mark Rutland , Stephane Eranian , David Carrillo-Cisneros , linux-kernel Subject: Re: [PATCH v6 1/3] perf/core: use rb trees for pinned/flexible groups Message-ID: <20170804145324.tb3d377f4kkeqhho@hirez.programming.kicks-ass.net> References: <96c7776f-1f17-a39e-23e9-658596216d6b@linux.intel.com> <20170803130002.oatczvnaalplrsep@hirez.programming.kicks-ass.net> <86cbe0b0-a1ec-4d5f-addc-87bccf2e97d7@linux.intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <86cbe0b0-a1ec-4d5f-addc-87bccf2e97d7@linux.intel.com> User-Agent: NeoMutt/20170609 (1.8.3) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1220 Lines: 27 On Thu, Aug 03, 2017 at 11:30:09PM +0300, Alexey Budankov wrote: > On 03.08.2017 16:00, Peter Zijlstra wrote: > > On Wed, Aug 02, 2017 at 11:13:54AM +0300, Alexey Budankov wrote: > >> @@ -2759,13 +2932,13 @@ static void ctx_sched_out(struct perf_event_context *ctx, > >> > >> perf_pmu_disable(ctx->pmu); > >> if (is_active & EVENT_PINNED) { > >> - list_for_each_entry(event, &ctx->pinned_groups, group_entry) > >> - group_sched_out(event, cpuctx, ctx); > >> + perf_event_groups_iterate(&ctx->pinned_groups, > >> + group_sched_out_callback, ¶ms); > > > > So here I would expect to not iterate events where event->cpu != > > smp_processor_id() (and ideally not where event->pmu != ctx->pmu). > > > > We still need to iterate thru all groups on thread context switch in > and out as well as iterate thru cpu == -1 list (software events) additionally > to smp_processor_id() list from multiplexing timer interrupt handler. Well, just doing the @cpu=-1 and @cpu=this_cpu subtrees is less work than iterating _everything_, right? The rest will not survive event_filter_match() anyway, so iterating them is complete waste of time, and once we have them in a tree, its actually easy to find this subset.