Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751484AbdHaKMq (ORCPT ); Thu, 31 Aug 2017 06:12:46 -0400 Received: from mga02.intel.com ([134.134.136.20]:22110 "EHLO mga02.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750918AbdHaKMp (ORCPT ); Thu, 31 Aug 2017 06:12:45 -0400 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.41,451,1498546800"; d="scan'208";a="146010003" Subject: Re: [PATCH v7 0/2] perf/core: addressing 4x slowdown during per-process profiling of STREAM benchmark on Intel Xeon Phi To: Peter Zijlstra Cc: Ingo Molnar , Arnaldo Carvalho de Melo , Alexander Shishkin , Andi Kleen , Kan Liang , Dmitri Prokhorov , Valery Cherepennikov , Mark Rutland , Stephane Eranian , David Carrillo-Cisneros , linux-kernel References: <20170822202106.GP32112@worktop.programming.kicks-ass.net> From: Alexey Budankov Organization: Intel Corp. Message-ID: <20a0366a-80ac-4e78-9b0d-277cf79cdd8b@linux.intel.com> Date: Thu, 31 Aug 2017 13:12:40 +0300 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.3.0 MIME-Version: 1.0 In-Reply-To: <20170822202106.GP32112@worktop.programming.kicks-ass.net> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1176 Lines: 31 Hi, On 22.08.2017 23:21, Peter Zijlstra wrote: > On Fri, Aug 18, 2017 at 08:17:15AM +0300, Alexey Budankov wrote: >> Hi, > > Please don't post new versions in reply to old versions, that gets them > lost in thread sorted views. > >> This patch set v7 moves event groups into rb trees and implements >> skipping to the current CPU's list on hrtimer interrupt. > > Does this depend on your timekeeping rework posted in that v6 thread? > If so, I would have expected to see that as part of these patches, if > not, I'm confused, because part of the problem was that we currently > need to update times for events we don't want to schedule etc.. > >> Events allocated for the same CPU are still kept in a linked list >> of the event directly attached to the tree because it is unclear >> how to implement fast iteration thru events allocated for >> the same CPU when they are all attached to a tree employing >> additional 64bit index as a secondary treee key. > > Finding the CPU subtree and rb_next() wasn't good? I eventually managed to overcome difficulties with implementation of rb_tree indexed by {cpu,index} for event groups so please see patches v9. > > >