Received: by 2002:a6b:fb09:0:0:0:0:0 with SMTP id h9csp1232347iog; Thu, 16 Jun 2022 01:48:02 -0700 (PDT) X-Google-Smtp-Source: AGRyM1s+F7Xfr+xdydOpZ1+adhNDdX/7NBrBCEiCjIYm1Hn1Zy8gTBBCxJDTl8d2V+sB/QzxauHv X-Received: by 2002:a17:906:7290:b0:714:d36b:d6f with SMTP id b16-20020a170906729000b00714d36b0d6fmr3487459ejl.39.1655369281877; Thu, 16 Jun 2022 01:48:01 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1655369281; cv=none; d=google.com; s=arc-20160816; b=JoLNtNXuI7GUWtL55kNp2eOLdRa12AIEDYUo80qWVf8HIGg00sSjDWBrkDBW0kJOaj 94kzlTBGYksmriVHWGeGXwuiRDkzX7SBUewUXf+Qd9Jpl5vSsL9IuELX5ldxGibUS3tT toqvVCg9ZcmeUvW+nhaVY8nyRipB3ZyDFne/gxDvB59NH5PYY0mplmB1ZqQdgOPF3t0c +UT6u/UjfN+9Myyh4F/NRTL6GEsmJUK6bWUQMUCvfU92SExqENJURjWeozRJMcpYtG7g YxZPqjzNIZgYexI3m91CaqvfpkflEyiToJQ7GbX+9imd/dVI7g6k8b0EUQAppvQBeour GTQg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id:dkim-signature; bh=JyPKZhH6FP8LWLTIVSth91+VZly+3eVvSbLGrg8SyLs=; b=dIIfqLawz2CYDLMrTrGMj44diZwk+L1T4bEqoGZlmc13eClZmlg74wh0Tv6fkLH6pq SpveoR0y5EOaW/ivkmAcIy0MWewnySi11KmJXp4T+Nh8mYRa3z9+xjFMlrfZD6ulo1KQ LE43JTjWTAhz4UpncK3qbl+IkCeGKGrm4zWuP5XF2FxU7I1sRWHZbRqAx/Xmi3Usyqv/ 5LZn15l/G36k74q3hbD65iAUp0xt6J/yv8HYaXykoiDVbpk9N1XRNBVVbtFdG+CCEeLn AT2i2ZdDzfJ66G5yKuI577g12wLBegqQP6d6xY7vcpU0Y9ofdfbiUDNvoENoYShdJBsp biXA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=UBO56ep8; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id l1-20020a170907914100b0071208640567si1107115ejs.529.2022.06.16.01.47.25; Thu, 16 Jun 2022 01:48:01 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=UBO56ep8; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1348539AbiFPIbr (ORCPT + 99 others); Thu, 16 Jun 2022 04:31:47 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:50618 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1348251AbiFPIbp (ORCPT ); Thu, 16 Jun 2022 04:31:45 -0400 Received: from mga02.intel.com (mga02.intel.com [134.134.136.20]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 2771C5DA5F; Thu, 16 Jun 2022 01:31:44 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1655368304; x=1686904304; h=message-id:date:mime-version:subject:to:cc:references: from:in-reply-to:content-transfer-encoding; bh=LLQKWpBbbH2gL+fmhc1WDqRYlnfOMa7XL0kc10V7SSQ=; b=UBO56ep8LpHuDcJZLaMw4ubNY605XQIutJypwJis3s7QrHCmDVPjBycf fnCwLaVyRmd2sMhtAEk3ilFLoi5ATBphVsTRXvjQbJAhA1pwwbx9mmvkG XJenprUusTprm7PVzbBAxR/uFz4K/KLhlIz0XxvOJz5S5UImuubQPd2Yx lVm5qT/9i8kJGXyFG9Zju2fHLIFnoq8W6cCkg+cZb/8DPOrJYR0xP2NhS asILvQOQngT3tpYlkwuYTzYRomu6N5b9ULcfwhbQe6N5tukjnNOZ8t74Y 1VFNXFZomwfPGo8E3FHboyV/ua73LzC1wDZwir7wr/v2F64TBIqscFC7D Q==; X-IronPort-AV: E=McAfee;i="6400,9594,10379"; a="267887691" X-IronPort-AV: E=Sophos;i="5.91,304,1647327600"; d="scan'208";a="267887691" Received: from fmsmga008.fm.intel.com ([10.253.24.58]) by orsmga101.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 16 Jun 2022 01:31:42 -0700 X-IronPort-AV: E=Sophos;i="5.91,304,1647327600"; d="scan'208";a="641437890" Received: from xingzhen-mobl.ccr.corp.intel.com (HELO [10.238.6.2]) ([10.238.6.2]) by fmsmga008-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 16 Jun 2022 01:31:39 -0700 Message-ID: Date: Thu, 16 Jun 2022 16:31:37 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Thunderbird/91.10.0 Subject: Re: [PATCH] perf record: Support "--cputype" option for hybrid events Content-Language: en-US To: Ian Rogers Cc: acme@kernel.org, peterz@infradead.org, mingo@redhat.com, alexander.shishkin@intel.com, jolsa@kernel.org, namhyung@kernel.org, linux-kernel@vger.kernel.org, linux-perf-users@vger.kernel.org, ak@linux.intel.com, kan.liang@linux.intel.com References: <20220615150823.2230349-1-zhengjun.xing@linux.intel.com> From: Xing Zhengjun In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-7.4 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_EF,NICE_REPLY_A,RCVD_IN_DNSWL_MED, RCVD_IN_MSPIKE_H3,RCVD_IN_MSPIKE_WL,SPF_HELO_NONE,SPF_NONE, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Ian, On 6/15/2022 11:16 PM, Ian Rogers wrote: > On Wed, Jun 15, 2022 at 8:07 AM wrote: >> >> From: Zhengjun Xing >> >> perf stat already has the "--cputype" option to enable events only on the >> specified PMU for the hybrid platform, this commit extends the "--cputype" >> support to perf record. >> >> Without "--cputype", it reports events for both cpu_core and cpu_atom. >> >> # ./perf record -e cycles -a sleep 1 | ./perf report >> >> # To display the perf.data header info, please use --header/--header-only options. >> # >> [ perf record: Woken up 1 times to write data ] >> [ perf record: Captured and wrote 0.000 MB (null) ] >> # >> # Total Lost Samples: 0 >> # >> # Samples: 335 of event 'cpu_core/cycles/' >> # Event count (approx.): 35855267 >> # >> # Overhead Command Shared Object Symbol >> # ........ ............... ................. ......................................... >> # >> 10.31% swapper [kernel.kallsyms] [k] poll_idle >> 9.42% swapper [kernel.kallsyms] [k] menu_select >> ... ... ... ... ... >> >> # Samples: 61 of event 'cpu_atom/cycles/' >> # Event count (approx.): 16453825 >> # >> # Overhead Command Shared Object Symbol >> # ........ ............. ................. ...................................... >> # >> 26.36% snapd [unknown] [.] 0x0000563cc6d03841 >> 7.43% migration/13 [kernel.kallsyms] [k] update_sd_lb_stats.constprop.0 >> ... ... ... ... ... >> >> With "--cputype", it reports events only for the specified PMU. >> >> # ./perf record --cputype core -e cycles -a sleep 1 | ./perf report >> >> # To display the perf.data header info, please use --header/--header-only options. >> # >> [ perf record: Woken up 1 times to write data ] >> [ perf record: Captured and wrote 0.000 MB (null) ] >> # >> # Total Lost Samples: 0 >> # >> # Samples: 221 of event 'cpu_core/cycles/' >> # Event count (approx.): 27121818 >> # >> # Overhead Command Shared Object Symbol >> # ........ ............... ................. ......................................... >> # >> 11.24% swapper [kernel.kallsyms] [k] e1000_irq_enable >> 7.77% swapper [kernel.kallsyms] [k] mwait_idle_with_hints.constprop.0 >> ... ... ... ... ... > > This is already possible by having the PMU name as part of the event, > cpu_atom/cycles/ or cpu_core/cycles/. I don't know why we're adding > "hybrid" all over the code base, I wish it would stop. You are asking > for an option here for an implied PMU for events that don't specify a > PMU. The option should be called --pmutype and consider all PMUs. We > should remove the "hybrid" PMU list and make all of the "hybrid" code > generic. > I can change the option name from "cputype" to "pmutype". We have planned to clean up the hybrid code, and try to reduce the code with "if perf_pmu__has_hybrid()", this will be done step by step, from this patch, we have begun to do some clean-up, move the hybrid API from the common file to the hybrid-specific files. For the clean-up, Do you have any ideas? Thanks. > Thanks, > Ian > >> Signed-off-by: Zhengjun Xing >> Reviewed-by: Kan Liang >> --- >> tools/perf/Documentation/perf-record.txt | 4 ++++ >> tools/perf/builtin-record.c | 3 +++ >> tools/perf/builtin-stat.c | 20 -------------------- >> tools/perf/util/pmu-hybrid.c | 19 +++++++++++++++++++ >> tools/perf/util/pmu-hybrid.h | 2 ++ >> 5 files changed, 28 insertions(+), 20 deletions(-) >> >> diff --git a/tools/perf/Documentation/perf-record.txt b/tools/perf/Documentation/perf-record.txt >> index cf8ad50f3de1..ba8d680da1ac 100644 >> --- a/tools/perf/Documentation/perf-record.txt >> +++ b/tools/perf/Documentation/perf-record.txt >> @@ -402,6 +402,10 @@ Enable weightened sampling. An additional weight is recorded per sample and can >> displayed with the weight and local_weight sort keys. This currently works for TSX >> abort events and some memory events in precise mode on modern Intel CPUs. >> >> +--cputype:: >> +Only enable events on applying cpu with this type for hybrid platform(e.g. core or atom). >> +For non-hybrid events, it should be no effect. >> + >> --namespaces:: >> Record events of type PERF_RECORD_NAMESPACES. This enables 'cgroup_id' sort key. >> >> diff --git a/tools/perf/builtin-record.c b/tools/perf/builtin-record.c >> index 9a71f0330137..e1edd4e98358 100644 >> --- a/tools/perf/builtin-record.c >> +++ b/tools/perf/builtin-record.c >> @@ -3183,6 +3183,9 @@ static struct option __record_options[] = { >> OPT_INCR('v', "verbose", &verbose, >> "be more verbose (show counter open errors, etc)"), >> OPT_BOOLEAN('q', "quiet", &quiet, "don't print any message"), >> + OPT_CALLBACK(0, "cputype", &record.evlist, "hybrid cpu type", >> + "Only enable events on applying cpu with this type for hybrid platform (e.g. core or atom)", >> + parse_hybrid_type), >> OPT_BOOLEAN('s', "stat", &record.opts.inherit_stat, >> "per thread counts"), >> OPT_BOOLEAN('d', "data", &record.opts.sample_address, "Record the sample addresses"), >> diff --git a/tools/perf/builtin-stat.c b/tools/perf/builtin-stat.c >> index 4ce87a8eb7d7..0d95b29273f4 100644 >> --- a/tools/perf/builtin-stat.c >> +++ b/tools/perf/builtin-stat.c >> @@ -1184,26 +1184,6 @@ static int parse_stat_cgroups(const struct option *opt, >> return parse_cgroups(opt, str, unset); >> } >> >> -static int parse_hybrid_type(const struct option *opt, >> - const char *str, >> - int unset __maybe_unused) >> -{ >> - struct evlist *evlist = *(struct evlist **)opt->value; >> - >> - if (!list_empty(&evlist->core.entries)) { >> - fprintf(stderr, "Must define cputype before events/metrics\n"); >> - return -1; >> - } >> - >> - evlist->hybrid_pmu_name = perf_pmu__hybrid_type_to_pmu(str); >> - if (!evlist->hybrid_pmu_name) { >> - fprintf(stderr, "--cputype %s is not supported!\n", str); >> - return -1; >> - } >> - >> - return 0; >> -} >> - >> static struct option stat_options[] = { >> OPT_BOOLEAN('T', "transaction", &transaction_run, >> "hardware transaction statistics"), >> diff --git a/tools/perf/util/pmu-hybrid.c b/tools/perf/util/pmu-hybrid.c >> index f51ccaac60ee..5c490b5201b7 100644 >> --- a/tools/perf/util/pmu-hybrid.c >> +++ b/tools/perf/util/pmu-hybrid.c >> @@ -13,6 +13,7 @@ >> #include >> #include >> #include >> +#include "util/evlist.h" >> #include "fncache.h" >> #include "pmu-hybrid.h" >> >> @@ -87,3 +88,21 @@ char *perf_pmu__hybrid_type_to_pmu(const char *type) >> free(pmu_name); >> return NULL; >> } >> + >> +int parse_hybrid_type(const struct option *opt, const char *str, int unset __maybe_unused) >> +{ >> + struct evlist *evlist = *(struct evlist **)opt->value; >> + >> + if (!list_empty(&evlist->core.entries)) { >> + fprintf(stderr, "Must define cputype before events/metrics\n"); >> + return -1; >> + } >> + >> + evlist->hybrid_pmu_name = perf_pmu__hybrid_type_to_pmu(str); >> + if (!evlist->hybrid_pmu_name) { >> + fprintf(stderr, "--cputype %s is not supported!\n", str); >> + return -1; >> + } >> + >> + return 0; >> +} >> diff --git a/tools/perf/util/pmu-hybrid.h b/tools/perf/util/pmu-hybrid.h >> index 2b186c26a43e..26101f134a3a 100644 >> --- a/tools/perf/util/pmu-hybrid.h >> +++ b/tools/perf/util/pmu-hybrid.h >> @@ -5,6 +5,7 @@ >> #include >> #include >> #include >> +#include >> #include >> #include "pmu.h" >> >> @@ -18,6 +19,7 @@ bool perf_pmu__hybrid_mounted(const char *name); >> struct perf_pmu *perf_pmu__find_hybrid_pmu(const char *name); >> bool perf_pmu__is_hybrid(const char *name); >> char *perf_pmu__hybrid_type_to_pmu(const char *type); >> +int parse_hybrid_type(const struct option *opt, const char *str, int unset __maybe_unused); >> >> static inline int perf_pmu__hybrid_pmu_num(void) >> { >> -- >> 2.25.1 >> -- Zhengjun Xing