Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756738AbcKXE1I (ORCPT ); Wed, 23 Nov 2016 23:27:08 -0500 Received: from mail-wj0-f196.google.com ([209.85.210.196]:33244 "EHLO mail-wj0-f196.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754622AbcKXE1G (ORCPT ); Wed, 23 Nov 2016 23:27:06 -0500 Date: Thu, 24 Nov 2016 05:27:02 +0100 From: Ingo Molnar To: kan.liang@intel.com Cc: peterz@infradead.org, mingo@redhat.com, acme@kernel.org, linux-kernel@vger.kernel.org, alexander.shishkin@linux.intel.com, tglx@linutronix.de, namhyung@kernel.org, jolsa@kernel.org, adrian.hunter@intel.com, wangnan0@huawei.com, mark.rutland@arm.com, andi@firstfloor.org Subject: Re: [PATCH 00/14] export perf overheads information Message-ID: <20161124042702.GA10321@gmail.com> References: <1479894292-16277-1-git-send-email-kan.liang@intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1479894292-16277-1-git-send-email-kan.liang@intel.com> User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2369 Lines: 52 * kan.liang@intel.com wrote: > From: Kan Liang > > Profiling brings additional overhead. High overhead may impacts the > behavior of the profiling object, impacts the accuracy of the > profiling result, and even hang the system. > Currently, perf has dynamic interrupt throttle mechanism to lower the > sample rate and overhead. But it has limitations. > - The mechanism only focus in the overhead from NMI. However, there > are other parts which bring big overhead. E.g, multiplexing. > - The hint from the mechanism doesn't work on fixed period. > - The system changes which caused by the mechanism are not recorded > in the perf.data. Users have no idea about the overhead and its > impact. > Acctually, any passive ways like dynamic interrupt throttle mechanism > are only palliative. The best way is to export overheads information, > provide more hints, and help the users design more proper perf command. > > According to our test, there are four parts which can bring big overhead. > They include NMI handler, multiplexing handler, iterate side-band events, > and write data in file. Two new perf record type PERF_RECORD_OVERHEAD and > PERF_RECORD_USER_OVERHEAD are introduced to record the overhead > information in kernel and user space respectively. > The overhead information is the system per-CPU overhead, not per-event > overhead. The implementation takes advantage of the existing event log > mechanism. > To reduce the additional overhead from logging overhead information, the > overhead information only be output when the event is going to be > disabled or task is scheduling out. > > In perf report, the overhead will be checked automatically. If the > overhead rate is larger than 10%. A warning will be displayed. > A new option is also introduced to display detial per-CPU overhead > information. > > Current implementation only include four overhead sources. There could be > more in other parts. The new overhead source can be easily added as a > new type. Please include sample output of the new instrumentation! Not even the tooling patches show any of the output, nor is it clear anywhere what kind of 'overhead' measurement it is, what the units are, what the metrics are, how users can _use_ this information, etc. This is totally inadequate description. Thanks, Ingo