Received: by 2002:a5d:925a:0:0:0:0:0 with SMTP id e26csp396165iol; Thu, 9 Jun 2022 06:08:07 -0700 (PDT) X-Google-Smtp-Source: ABdhPJx6XUshOlh0NOs+Y8kyCsiqYtaGOxt0V+YSi8LU3ntAV4GMb1WHVrOIJs0/sBr+xOeLehs/ X-Received: by 2002:a17:90a:c7d7:b0:1e8:3d2d:dd67 with SMTP id gf23-20020a17090ac7d700b001e83d2ddd67mr3431521pjb.89.1654780087639; Thu, 09 Jun 2022 06:08:07 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1654780087; cv=none; d=google.com; s=arc-20160816; b=VMm2gjkpDJGFgcCRCrV09Di7JF2qxzT5zWuff8LbuDqWOqhZp3veathcnusZwGNJVm m66rSGTwV7jOLc0iSZfBMkz12u/ZfR9PPatDDfmVFQoToiy2HsXRFiqDYkQ5MIS9NGt2 r72TyVcwLcMEghVQEZzdjtw7C4MdPvhEtaNJrykeQ892PCr0gYult+G4e5r+lmbxT2QP mkklAIAMmlJ4w0s2lrY9gydbXZYIw5Hdh8ELOmr9sJROH4QnEtfVADcznOjioLuo0XKm 6zeCS2UJxJ7U4lpLG3fkIiwssGg+TU+L79G3wYsMLaVUF3UftY7M1ykegamAMRVsoeW5 Rspw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id:dkim-signature; bh=Uv9BP6QW0lQrx3OGQumQQsNgcBQAH7uNxTWKqTrnGXQ=; b=zuI280sFuZMS73SBG92Hz7AsKXdJuUDjovM5/XWQcyZkrIlV/uZW+cCyNdFn0vgHJS OEq5rEV4yjatRti9jrf0aiMRLzQpWBDEhChrO8imPHPoBamMfomSO5QQqbZLVvMwnoxo RSqHzLWqBrmWrw4e0hra+3TQrwkGXL7snwdSulKPYevFui5uXqKEV1qWRbzj1yE4XvF0 274adHLDc1M+M+6AqEEK+Tr2bnsxfqtHiqAMy/uvKCd3qFdS/oUkXHDYq4DZmRaYY816 IHMPQeEG3QcPMXs3PQCqe4XqKKzsQiDBFwWChRhcymbNWaAfZqwq+vGg3F5KD2n/YV3Z YRLg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=XqLiX3uV; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id s23-20020a63e817000000b003fdc5a65a32si14544562pgh.406.2022.06.09.06.07.47; Thu, 09 Jun 2022 06:08:07 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=XqLiX3uV; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S236131AbiFIMre (ORCPT + 99 others); Thu, 9 Jun 2022 08:47:34 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:58694 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230490AbiFIMra (ORCPT ); Thu, 9 Jun 2022 08:47:30 -0400 Received: from mga11.intel.com (mga11.intel.com [192.55.52.93]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 613C02E4C80; Thu, 9 Jun 2022 05:47:29 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1654778849; x=1686314849; h=message-id:date:mime-version:subject:to:cc:references: from:in-reply-to:content-transfer-encoding; bh=UGnNiLF3y4lS3+1gFKfQOUrGaju9xL60XoOfxI27cb4=; b=XqLiX3uVK1VORKFz2mgjmHj7ciVFwVaBtZg8QEFQEUHh0R8mZYldba3E UQpoAHr9071fxjn65RzPfXx8TwcKpu7+3dciU+oR71P6mmjK6Q149tthT Kj95qqbWO3YZZ0N5nAaBQkACLmRivCeC6DYoPk0JxV9EM75G4DkoRZQzG Do5q9rg2Ijs1alqrCuC5R1cySIqJwFrkpz+QIvgZrltbks9rNQ3UwP4ZC Y7heLziNi1AcNXoQ2/4wx4UZ4U1tL6T9niqXMayzaa93KCruyRzLJ2Srb jRTJ1nDQDKtBNvYNx+k7PYOX4GeANy0LHuQkPeSUfS04VmAA6u6065P7j A==; X-IronPort-AV: E=McAfee;i="6400,9594,10372"; a="274790464" X-IronPort-AV: E=Sophos;i="5.91,287,1647327600"; d="scan'208";a="274790464" Received: from fmsmga005.fm.intel.com ([10.253.24.32]) by fmsmga102.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 09 Jun 2022 05:47:20 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.91,287,1647327600"; d="scan'208";a="908296017" Received: from linux.intel.com ([10.54.29.200]) by fmsmga005.fm.intel.com with ESMTP; 09 Jun 2022 05:47:18 -0700 Received: from [10.252.211.126] (kliang2-MOBL.ccr.corp.intel.com [10.252.211.126]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by linux.intel.com (Postfix) with ESMTPS id 8E9B45808F0; Thu, 9 Jun 2022 05:47:16 -0700 (PDT) Message-ID: <3d687182-f189-870b-f0b4-dc80e48845af@linux.intel.com> Date: Thu, 9 Jun 2022 08:47:14 -0400 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Thunderbird/91.9.1 Subject: Re: [PATCH 4/5] perf x86 evlist: Add default hybrid events for perf stat Content-Language: en-US To: Namhyung Kim , Xing Zhengjun Cc: Arnaldo Carvalho de Melo , Peter Zijlstra , Ingo Molnar , alexander.shishkin@intel.com, Jiri Olsa , linux-kernel , linux-perf-users , Ian Rogers , Adrian Hunter , Andi Kleen References: <20220607013315.1956301-1-zhengjun.xing@linux.intel.com> <20220607013315.1956301-5-zhengjun.xing@linux.intel.com> From: "Liang, Kan" In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-9.4 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_EF,NICE_REPLY_A,RCVD_IN_DNSWL_HI, SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 6/8/2022 8:04 PM, Namhyung Kim wrote: > Hello, > > On Tue, Jun 7, 2022 at 12:31 AM wrote: >> >> From: Kan Liang >> >> Provide a new solution to replace the reverted commit ac2dc29edd21 >> ("perf stat: Add default hybrid events"). >> >> For the default software attrs, nothing is changed. >> For the default hardware attrs, create a new evsel for each hybrid pmu. >> >> With the new solution, adding a new default attr will not require the >> special support for the hybrid platform anymore. >> >> Also, the "--detailed" is supported on the hybrid platform >> >> With the patch, >> >> ./perf stat -a -ddd sleep 1 >> >> Performance counter stats for 'system wide': >> >> 32,231.06 msec cpu-clock # 32.056 CPUs utilized >> 529 context-switches # 16.413 /sec >> 32 cpu-migrations # 0.993 /sec >> 69 page-faults # 2.141 /sec >> 176,754,151 cpu_core/cycles/ # 5.484 M/sec (41.65%) >> 161,695,280 cpu_atom/cycles/ # 5.017 M/sec (49.92%) >> 48,595,992 cpu_core/instructions/ # 1.508 M/sec (49.98%) >> 32,363,337 cpu_atom/instructions/ # 1.004 M/sec (58.26%) >> 10,088,639 cpu_core/branches/ # 313.010 K/sec (58.31%) >> 6,390,582 cpu_atom/branches/ # 198.274 K/sec (58.26%) >> 846,201 cpu_core/branch-misses/ # 26.254 K/sec (66.65%) >> 676,477 cpu_atom/branch-misses/ # 20.988 K/sec (58.27%) >> 14,290,070 cpu_core/L1-dcache-loads/ # 443.363 K/sec (66.66%) >> 9,983,532 cpu_atom/L1-dcache-loads/ # 309.749 K/sec (58.27%) >> 740,725 cpu_core/L1-dcache-load-misses/ # 22.982 K/sec (66.66%) >> cpu_atom/L1-dcache-load-misses/ >> 480,441 cpu_core/LLC-loads/ # 14.906 K/sec (66.67%) >> 326,570 cpu_atom/LLC-loads/ # 10.132 K/sec (58.27%) >> 329 cpu_core/LLC-load-misses/ # 10.208 /sec (66.68%) >> 0 cpu_atom/LLC-load-misses/ # 0.000 /sec (58.32%) >> cpu_core/L1-icache-loads/ >> 21,982,491 cpu_atom/L1-icache-loads/ # 682.028 K/sec (58.43%) >> 4,493,189 cpu_core/L1-icache-load-misses/ # 139.406 K/sec (33.34%) >> 4,711,404 cpu_atom/L1-icache-load-misses/ # 146.176 K/sec (50.08%) >> 13,713,090 cpu_core/dTLB-loads/ # 425.462 K/sec (33.34%) >> 9,384,727 cpu_atom/dTLB-loads/ # 291.170 K/sec (50.08%) >> 157,387 cpu_core/dTLB-load-misses/ # 4.883 K/sec (33.33%) >> 108,328 cpu_atom/dTLB-load-misses/ # 3.361 K/sec (50.08%) >> cpu_core/iTLB-loads/ >> cpu_atom/iTLB-loads/ >> 37,655 cpu_core/iTLB-load-misses/ # 1.168 K/sec (33.32%) >> 61,661 cpu_atom/iTLB-load-misses/ # 1.913 K/sec (50.03%) >> cpu_core/L1-dcache-prefetches/ >> cpu_atom/L1-dcache-prefetches/ >> cpu_core/L1-dcache-prefetch-misses/ >> cpu_atom/L1-dcache-prefetch-misses/ >> >> 1.005466919 seconds time elapsed >> >> Signed-off-by: Kan Liang >> Signed-off-by: Zhengjun Xing >> --- >> tools/perf/arch/x86/util/evlist.c | 52 ++++++++++++++++++++++++++++++- >> tools/perf/util/evlist.c | 2 +- >> tools/perf/util/evlist.h | 2 ++ >> 3 files changed, 54 insertions(+), 2 deletions(-) >> >> diff --git a/tools/perf/arch/x86/util/evlist.c b/tools/perf/arch/x86/util/evlist.c >> index 777bdf182a58..1b3f9e1a2287 100644 >> --- a/tools/perf/arch/x86/util/evlist.c >> +++ b/tools/perf/arch/x86/util/evlist.c >> @@ -4,16 +4,66 @@ >> #include "util/evlist.h" >> #include "util/parse-events.h" >> #include "topdown.h" >> +#include "util/event.h" >> +#include "util/pmu-hybrid.h" >> >> #define TOPDOWN_L1_EVENTS "{slots,topdown-retiring,topdown-bad-spec,topdown-fe-bound,topdown-be-bound}" >> #define TOPDOWN_L2_EVENTS "{slots,topdown-retiring,topdown-bad-spec,topdown-fe-bound,topdown-be-bound,topdown-heavy-ops,topdown-br-mispredict,topdown-fetch-lat,topdown-mem-bound}" >> >> +static int ___evlist__add_default_attrs(struct evlist *evlist, struct perf_event_attr *attrs, size_t nr_attrs) >> +{ >> + struct perf_cpu_map *cpus; >> + struct evsel *evsel, *n; >> + struct perf_pmu *pmu; >> + LIST_HEAD(head); >> + size_t i, j = 0; >> + >> + for (i = 0; i < nr_attrs; i++) >> + event_attr_init(attrs + i); >> + >> + if (!perf_pmu__has_hybrid()) >> + return evlist__add_attrs(evlist, attrs, nr_attrs); >> + >> + for (i = 0; i < nr_attrs; i++) { >> + if (attrs[i].type == PERF_TYPE_SOFTWARE) { >> + evsel = evsel__new_idx(attrs + i, evlist->core.nr_entries + j); > > Probably no need to calculate index (j) as it's updated > later when it goes to the evlist... > > >> + if (evsel == NULL) >> + goto out_delete_partial_list; >> + j++; >> + list_add_tail(&evsel->core.node, &head); >> + continue; >> + } >> + >> + perf_pmu__for_each_hybrid_pmu(pmu) { >> + evsel = evsel__new_idx(attrs + i, evlist->core.nr_entries + j); >> + if (evsel == NULL) >> + goto out_delete_partial_list; >> + j++; >> + evsel->core.attr.config |= (__u64)pmu->type << PERF_PMU_TYPE_SHIFT; >> + cpus = perf_cpu_map__get(pmu->cpus); >> + evsel->core.cpus = cpus; >> + evsel->core.own_cpus = perf_cpu_map__get(cpus); >> + evsel->pmu_name = strdup(pmu->name); >> + list_add_tail(&evsel->core.node, &head); >> + } >> + } >> + >> + evlist__splice_list_tail(evlist, &head); > > ... like here. Yes, the index of all new evsel will be updated when adding to the evlist. Zhengjun, could you please handle the patch? Just set 0 idx for the new evsel should be good enough. Thanks, Kan > > Thanks, > Namhyung > > >> + >> + return 0; >> + >> +out_delete_partial_list: >> + __evlist__for_each_entry_safe(&head, n, evsel) >> + evsel__delete(evsel); >> + return -1; >> +}