Received: by 2002:a25:868d:0:0:0:0:0 with SMTP id z13csp1343508ybk; Thu, 14 May 2020 06:46:33 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzAVQCCCIdBFqjuYlH7lOlRmZB9txqAaAIpUsWfagTTlwIhmwsTvtHVxWtOSIm9UMK0yVBV X-Received: by 2002:a05:6402:1681:: with SMTP id a1mr1879301edv.116.1589463993218; Thu, 14 May 2020 06:46:33 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1589463993; cv=none; d=google.com; s=arc-20160816; b=lAroqwfQWVbCiGlR2i1M2b8AsyZNJnMKEhWToZ4JM3Rk+D2KxiysyncYrW8WhMcD9a YkUtvexQSEipMsqDscNBJZtzJ7joBtzcr4yebUXVlKosz/If2sBw3hYrWPQO2UfqNgqG Jr52+MyiQRwl0WRgJA9PJi4m3YmXde/tjHPC/6jKrbtawyWvdlSP0s1aQDrU2uJHSfIL +qmog2d50pquv6RWF5tXvaDzkOlVNSlOSWRiSodu+QaFC+O5b6+XFK1ldoZZeDJutb+W 0dX+ZTx6/0NADgVc+hYMs4MdqBtPaomiCqc0PhtV7HREONIYztxVAslVCh018NMC/vkF lSTQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:ironport-sdr:ironport-sdr; bh=HlzVNAQEQPepXYFRrutWhpN1GH+HJI39PiCd9OGHPfs=; b=pcII138yeaMxfSLaIzoZn0LVEJSOkjdhMBwKYY5V29W3A2v5PU3DRAyKMn+z6vJo+g O3cEl3fs9Bwp6q4VLPewjHLyqF7TJtZ1J3yD+Lp5WnacR5eH+NBJFNYYcGDPcQkbVLed I2OgoIDIIBNDPeErMtH6qBDk/KKYRZdS7Fc1ojIxUKMs/OWMk7eZUsfLa9PFLKIYSGJa 2IrHj4j5VCusSz9a/9fqZBCgC6+NsrbO3noBgVOJfYdH0FA3L9WFW+GH4SBfvfY3QuuJ i5PDQBxQ/Rk3TzOk3PIGKC+vJk8ZJel1REz5qnzPUyVF4h65TnL2l1evtb2ifydeNkl4 K98w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id z2si1985628edp.280.2020.05.14.06.46.10; Thu, 14 May 2020 06:46:33 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726294AbgENNo4 (ORCPT + 99 others); Thu, 14 May 2020 09:44:56 -0400 Received: from mga14.intel.com ([192.55.52.115]:10632 "EHLO mga14.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726066AbgENNo4 (ORCPT ); Thu, 14 May 2020 09:44:56 -0400 IronPort-SDR: PhvGVwElh4R+kU4ItTvBd4aNwsYZ3JoN1oyZk6aYRL3fMCWNRxlZVG1yoJ3ysWf7mzokcf4Luh Ox+XNqzFvJtQ== X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga002.jf.intel.com ([10.7.209.21]) by fmsmga103.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 14 May 2020 06:44:55 -0700 IronPort-SDR: mx54nfpiWUd1dviUVhZ5ABosByxREm8wGxC+DCRsHbIOM0ivoyVvu4cERj9sCi6PoHv+FlmOKC EHGiQWE0Ctiw== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.73,391,1583222400"; d="scan'208";a="280859330" Received: from yjin15-mobl1.ccr.corp.intel.com (HELO [10.254.211.6]) ([10.254.211.6]) by orsmga002.jf.intel.com with ESMTP; 14 May 2020 06:44:51 -0700 Subject: Re: [PATCH v5 0/5] perf stat: Support overall statistics for interval mode To: kajoljain , acme@kernel.org, jolsa@kernel.org, peterz@infradead.org, mingo@redhat.com, alexander.shishkin@linux.intel.com Cc: Linux-kernel@vger.kernel.org, ak@linux.intel.com, kan.liang@intel.com, yao.jin@intel.com References: <20200514053638.3736-1-yao.jin@linux.intel.com> <104037dd-5e2e-a6fd-51ae-2e0a1cc2860a@linux.ibm.com> From: "Jin, Yao" Message-ID: Date: Thu, 14 May 2020 21:44:50 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101 Thunderbird/68.8.0 MIME-Version: 1.0 In-Reply-To: <104037dd-5e2e-a6fd-51ae-2e0a1cc2860a@linux.ibm.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Kajoljain, On 5/14/2020 5:53 PM, kajoljain wrote: > > > On 5/14/20 11:06 AM, Jin Yao wrote: >> Currently perf-stat supports to print counts at regular interval (-I), >> but it's not very easy for user to get the overall statistics. >> >> With this patchset, it supports to report the summary at the end of >> interval output. >> >> For example, >> >> root@kbl-ppc:~# perf stat -e cycles -I1000 --interval-count 2 >> # time counts unit events >> 1.000412064 2,281,114 cycles >> 2.001383658 2,547,880 cycles >> >> Performance counter stats for 'system wide': >> >> 4,828,994 cycles >> >> 2.002860349 seconds time elapsed >> >> root@kbl-ppc:~# perf stat -e cycles,instructions -I1000 --interval-count 2 >> # time counts unit events >> 1.000389902 1,536,093 cycles >> 1.000389902 420,226 instructions # 0.27 insn per cycle >> 2.001433453 2,213,952 cycles >> 2.001433453 735,465 instructions # 0.33 insn per cycle >> >> Performance counter stats for 'system wide': >> >> 3,750,045 cycles >> 1,155,691 instructions # 0.31 insn per cycle >> >> 2.003023361 seconds time elapsed >> >> root@kbl-ppc:~# perf stat -M CPI,IPC -I1000 --interval-count 2 >> # time counts unit events >> 1.000435121 905,303 inst_retired.any # 2.9 CPI >> 1.000435121 2,663,333 cycles >> 1.000435121 914,702 inst_retired.any # 0.3 IPC >> 1.000435121 2,676,559 cpu_clk_unhalted.thread >> 2.001615941 1,951,092 inst_retired.any # 1.8 CPI >> 2.001615941 3,551,357 cycles >> 2.001615941 1,950,837 inst_retired.any # 0.5 IPC >> 2.001615941 3,551,044 cpu_clk_unhalted.thread >> >> Performance counter stats for 'system wide': >> >> 2,856,395 inst_retired.any # 2.2 CPI >> 6,214,690 cycles >> 2,865,539 inst_retired.any # 0.5 IPC >> 6,227,603 cpu_clk_unhalted.thread >> >> 2.003403078 seconds time elapsed > > Hi Jin, > Reporting the summary will be great for understanding overall stats. So, Before the > patch where we are reseting rt_stat before read_counters to make sure, whatever printing > in final aggregate is as per counts on that interval, > Yes, I had similar thoughts, so I posted following patch. https://lore.kernel.org/lkml/20200420145417.6864-1-yao.jin@linux.intel.com/ > we used to update stats->means and other info as described in > > RFC: https://lkml.org/lkml/2020/3/24/158 > I've checked your patch but sorry I'm also not very sure if it's the expected behavior. > Now, stats->means is same as counts which we are using in generic_metric function. Is this expected behavior? > I am not sure, if data like stats->means and all suppose to update per interval or we are using it somewhere else. > I just think it's easy to understand, that is the metric calculated by the counts per interval. > So, As we call update_stats for each event and for each interval, can we somehow use that > to print overall stats maybe by adding some var in `struct stats` to keep count of total counts for that event. > Please let me know if my understanding is fine. > Adding var in 'struct stats' looks not enough (or more complicated), because perf-stat also needs to report some counts according to different aggregation modes (not only the metric). I just think copying total counts to current counts is a easy way because we can reuse most of existing non-interval processing code. Thanks Jin Yao > Thanks, > Kajol Jain > > > >> >> v5: >> --- >> 1. Create new patch "perf stat: Save aggr value to first member >> of prev_raw_counts". >> >> 2. Call perf_evlist__save_aggr_prev_raw_counts to save aggr value >> to first member of prev_raw_counts for AGGR_GLOBAL. Then next, >> perf_stat_process_counter can create aggr values from per cpu >> values. >> >> Following patches are impacted in v5: >> perf stat: Copy counts from prev_raw_counts to evsel->counts >> perf stat: Save aggr value to first member of prev_raw_counts >> perf stat: Report summary for interval mode >> >> v4: >> --- >> 1. Create runtime_stat_reset. >> >> 2. Zero the aggr in perf_counts__reset and use it to reset >> prev_raw_counts. >> >> 3. Move affinity setup and read_counter_cpu to a new function >> read_affinity_counters. It's only called when stat_config.summary >> is not set. >> >> v3: >> --- >> 1. 'perf stat: Fix wrong per-thread runtime stat for interval mode' >> is a new patch which fixes an existing issue found in test. >> >> 2. We use the prev_raw_counts for summary counts. Drop the summary_counts in v2. >> >> 3. Fix some issues. >> >> v2: >> --- >> Rebase to perf/core branch >> >> Jin Yao (5): >> perf stat: Fix wrong per-thread runtime stat for interval mode >> perf counts: Reset prev_raw_counts counts >> perf stat: Copy counts from prev_raw_counts to evsel->counts >> perf stat: Save aggr value to first member of prev_raw_counts >> perf stat: Report summary for interval mode >> >> tools/perf/builtin-stat.c | 101 ++++++++++++++++++++++++++------------ >> tools/perf/util/counts.c | 4 +- >> tools/perf/util/counts.h | 1 + >> tools/perf/util/stat.c | 43 +++++++++++++--- >> tools/perf/util/stat.h | 3 ++ >> 5 files changed, 113 insertions(+), 39 deletions(-) >>