Received: by 2002:a05:6358:700f:b0:131:369:b2a3 with SMTP id 15csp3229227rwo; Fri, 4 Aug 2023 01:10:35 -0700 (PDT) X-Google-Smtp-Source: AGHT+IGJprlCeSnU7F0+sgMgh4Eu8Y/iFwJ7lSte4hqpxUPfsU7mc3lKX03++pOnODfwXOePFEdo X-Received: by 2002:a05:620a:c05:b0:76c:9391:6922 with SMTP id l5-20020a05620a0c0500b0076c93916922mr1109644qki.24.1691136634867; Fri, 04 Aug 2023 01:10:34 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1691136634; cv=none; d=google.com; s=arc-20160816; b=iIcEPKxwZT5Zn5ZXZ4VepSEn/A0eyyjde/kqLkTaR3uoMFq8DdFTQVgrgvhBoPDVmz 7SUZwgXryLvIA8l/URaLImJpX2lrr2/Y1I6auNwjvfJQdlJKWhw+SaPaSjDSXciajAYR a8S9/j67WCbrmQRc1rwy2Q9z/J3drjlAV8FTXpKRAfs7zW1JsT1WBHgn+PaBgav4gkcS eCIF368/McTPeMHiQtd5s7+tNc4ky/U7KvG33TpKJOIFY1/l8uGopsl1mMOeslavV5W+ XhD8O0hHfk2xoOwvccvCF+N+YrWYLX4gelcXlPbytF76LYIFuOE7bHAYY6awSfRHfOtq PZAA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to :organization:from:references:to:content-language:subject:user-agent :mime-version:date:message-id:dkim-signature; bh=u3l27d905nmyYmd368tPEYPuVg0KEKiNfpzpcWTKGhY=; fh=joQbV4ge3gkgVpjJGAt6OT/s6tPyuJlgL4Af+wgTNoA=; b=Qo97KQxNQOi1gtsq9X0rJPakZFzGwDRhcahezJ+xXsrMblOqOt6upoqVL2QOu7FjH7 qOZWnTckkI2IJVLRhWJDQ9BWXlFONZp17nX4V8u5D2imi1BQvEpaCmOo2QIk2c6f7MC2 qiZIW8R1fz89A5v7wJUzCAG8rQK8FWfvNkEws9WxIhk7wYiSC7xLQWilE6s1d6zob30x l0cYRKAP7H0xoWVSNoar8BeYBIP9x3a5OPb3B0faGm5d+JgNt5GW9z6V+cLJpt7WTW9S Sa6RK1LgnZVu/5Z7kGAwhTIncEKPrlFjvafGu7WIMpRHnDlbbV9qU+QhDnjLSX4p9A5O 20dg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=XSFWbZYC; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id l8-20020a635b48000000b0055c1fb5a4a9si1331568pgm.661.2023.08.04.01.10.21; Fri, 04 Aug 2023 01:10:34 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=XSFWbZYC; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233937AbjHDG7R (ORCPT + 99 others); Fri, 4 Aug 2023 02:59:17 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:51510 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231540AbjHDG7O (ORCPT ); Fri, 4 Aug 2023 02:59:14 -0400 Received: from mgamail.intel.com (mgamail.intel.com [192.55.52.136]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 00FB01990; Thu, 3 Aug 2023 23:59:12 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1691132352; x=1722668352; h=message-id:date:mime-version:subject:to:references:from: in-reply-to:content-transfer-encoding; bh=ZxbbUiVhbIzhenDDUn8PesZ9SavUa1xp5dnHc+C7s6A=; b=XSFWbZYCVVsc+olJVqI3VTyVmep9+z8tQV99agqZsmkDVGFTFPBniqYa PGI1ycz9sCzT1V2AbMgrDqgpIpJ6mX77Fk/lZ5tI5eNCcoyGoUlCnnnT/ M1i8itA17q8WWKVEW0daEAolPh7NYCHaWBE5sHsxckGrCYOwzFz8860ug k/a4h6ZWNcYMy+yfasAK+vVh4MWeL8LqTdjoeQO73hxfHhGQSOZHRHHB+ Dh/Um1GzsdCnzU+Vbhr6P+4zN5qGVhsa7FlyjunjYB/fx7qQaV27C4qX0 zStj/0+GSNNlCAef9MpVxdMT8Elase3j4Tw38RpgSpmbrQiX0YTMEAk9h A==; X-IronPort-AV: E=McAfee;i="6600,9927,10791"; a="349679982" X-IronPort-AV: E=Sophos;i="6.01,254,1684825200"; d="scan'208";a="349679982" Received: from fmsmga005.fm.intel.com ([10.253.24.32]) by fmsmga106.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 03 Aug 2023 23:59:11 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=McAfee;i="6600,9927,10791"; a="1060612022" X-IronPort-AV: E=Sophos;i="6.01,254,1684825200"; d="scan'208";a="1060612022" Received: from ahunter6-mobl1.ger.corp.intel.com (HELO [10.0.2.15]) ([10.252.37.129]) by fmsmga005-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 03 Aug 2023 23:59:07 -0700 Message-ID: Date: Fri, 4 Aug 2023 09:59:04 +0300 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Firefox/102.0 Thunderbird/102.13.1 Subject: Re: [PATCH v5 4/7] perf record: Track sideband events for all CPUs when tracing selected CPUs Content-Language: en-US To: Yang Jihong , peterz@infradead.org, mingo@redhat.com, acme@kernel.org, mark.rutland@arm.com, alexander.shishkin@linux.intel.com, jolsa@kernel.org, namhyung@kernel.org, irogers@google.com, kan.liang@linux.intel.com, james.clark@arm.com, tmricht@linux.ibm.com, ak@linux.intel.com, anshuman.khandual@arm.com, linux-kernel@vger.kernel.org, linux-perf-users@vger.kernel.org References: <20230804020741.99806-1-yangjihong1@huawei.com> <20230804020741.99806-5-yangjihong1@huawei.com> From: Adrian Hunter Organization: Intel Finland Oy, Registered Address: PL 281, 00181 Helsinki, Business Identity Code: 0357606 - 4, Domiciled in Helsinki In-Reply-To: <20230804020741.99806-5-yangjihong1@huawei.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-4.5 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,NICE_REPLY_A, RCVD_IN_DNSWL_MED,SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE, URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 4/08/23 05:07, Yang Jihong wrote: > User space tasks can migrate between CPUs, we need to track side-band > events for all CPUs. > > The specific scenarios are as follows: > > CPU0 CPU1 > perf record -C 0 start > taskA starts to be created and executed > -> PERF_RECORD_COMM and PERF_RECORD_MMAP > events only deliver to CPU1 > ...... > | > migrate to CPU0 > | > Running on CPU0 <----------/ > ... > > perf record -C 0 stop > > Now perf samples the PC of taskA. However, perf does not record the > PERF_RECORD_COMM and PERF_RECORD_MMAP events of taskA. > Therefore, the comm and symbols of taskA cannot be parsed. > > The solution is to record sideband events for all CPUs when tracing > selected CPUs. Because this modifies the default behavior, add related > comments to the perf record man page. > > The sys_perf_event_open invoked is as follows: > > # perf --debug verbose=3 record -e cpu-clock -C 1 true > > Opening: cpu-clock > ------------------------------------------------------------ > perf_event_attr: > type 1 (PERF_TYPE_SOFTWARE) > size 136 > config 0 (PERF_COUNT_SW_CPU_CLOCK) > { sample_period, sample_freq } 4000 > sample_type IP|TID|TIME|CPU|PERIOD|IDENTIFIER > read_format ID|LOST > disabled 1 > inherit 1 > freq 1 > sample_id_all 1 > exclude_guest 1 > ------------------------------------------------------------ > sys_perf_event_open: pid -1 cpu 1 group_fd -1 flags 0x8 = 5 > Opening: dummy:u > ------------------------------------------------------------ > perf_event_attr: > type 1 (PERF_TYPE_SOFTWARE) > size 136 > config 0x9 (PERF_COUNT_SW_DUMMY) > { sample_period, sample_freq } 1 > sample_type IP|TID|TIME|CPU|IDENTIFIER > read_format ID|LOST > inherit 1 > exclude_kernel 1 > exclude_hv 1 > mmap 1 > comm 1 > task 1 > sample_id_all 1 > exclude_guest 1 > mmap2 1 > comm_exec 1 > ksymbol 1 > bpf_event 1 > ------------------------------------------------------------ > sys_perf_event_open: pid -1 cpu 0 group_fd -1 flags 0x8 = 6 > sys_perf_event_open: pid -1 cpu 1 group_fd -1 flags 0x8 = 7 > sys_perf_event_open: pid -1 cpu 2 group_fd -1 flags 0x8 = 9 > sys_perf_event_open: pid -1 cpu 3 group_fd -1 flags 0x8 = 10 > sys_perf_event_open: pid -1 cpu 4 group_fd -1 flags 0x8 = 11 > sys_perf_event_open: pid -1 cpu 5 group_fd -1 flags 0x8 = 12 > sys_perf_event_open: pid -1 cpu 6 group_fd -1 flags 0x8 = 13 > sys_perf_event_open: pid -1 cpu 7 group_fd -1 flags 0x8 = 14 > > > Signed-off-by: Yang Jihong Acked-by: Adrian Hunter > --- > tools/perf/Documentation/perf-record.txt | 3 ++ > tools/perf/builtin-record.c | 44 +++++++++++++++++++++++- > 2 files changed, 46 insertions(+), 1 deletion(-) > > diff --git a/tools/perf/Documentation/perf-record.txt b/tools/perf/Documentation/perf-record.txt > index 680396c56bd1..dac53ece51ab 100644 > --- a/tools/perf/Documentation/perf-record.txt > +++ b/tools/perf/Documentation/perf-record.txt > @@ -388,6 +388,9 @@ comma-separated list with no space: 0,1. Ranges of CPUs are specified with -: 0- > In per-thread mode with inheritance mode on (default), samples are captured only when > the thread executes on the designated CPUs. Default is to monitor all CPUs. > > +User space tasks can migrate between CPUs, so when tracing selected CPUs, > +a dummy event is created to track sideband for all CPUs. > + > -B:: > --no-buildid:: > Do not save the build ids of binaries in the perf.data files. This skips > diff --git a/tools/perf/builtin-record.c b/tools/perf/builtin-record.c > index 3ff9d972225e..e575b5e295a2 100644 > --- a/tools/perf/builtin-record.c > +++ b/tools/perf/builtin-record.c > @@ -908,10 +908,44 @@ static int record__config_off_cpu(struct record *rec) > return off_cpu_prepare(rec->evlist, &rec->opts.target, &rec->opts); > } > > +static bool record__tracking_system_wide(struct record *rec) > +{ > + struct record_opts *opts = &rec->opts; > + struct evlist *evlist = rec->evlist; > + struct evsel *evsel; > + > + /* > + * If all (non-dummy) evsel have exclude_user, > + * system_wide is not needed. > + * > + * all_kernel and all_user will overwrite exclude_kernel and > + * exclude_user of attr in evsel__config(), here need to check > + * all the three items. > + * > + * Sideband system wide if one of the following conditions is met: > + * > + * - all_user is set, and there is a non-dummy event > + * - all_user and all_kernel are not set, and there is > + * a non-dummy event without exclude_user > + */ > + if (opts->all_kernel) > + return false; > + > + evlist__for_each_entry(evlist, evsel) { > + if (!evsel__is_dummy_event(evsel)) { > + if (opts->all_user || !evsel->core.attr.exclude_user) > + return true; > + } > + } > + > + return false; > +} > + > static int record__config_tracking_events(struct record *rec) > { > struct record_opts *opts = &rec->opts; > struct evlist *evlist = rec->evlist; > + bool system_wide = false; > struct evsel *evsel; > > /* > @@ -921,7 +955,15 @@ static int record__config_tracking_events(struct record *rec) > */ > if (opts->target.initial_delay || target__has_cpu(&opts->target) || > perf_pmus__num_core_pmus() > 1) { > - evsel = evlist__findnew_tracking_event(evlist, false); > + > + /* > + * User space tasks can migrate between CPUs, so when tracing > + * selected CPUs, sideband for all CPUs is still needed. > + */ > + if (!!opts->target.cpu_list && record__tracking_system_wide(rec)) > + system_wide = true; > + > + evsel = evlist__findnew_tracking_event(evlist, system_wide); > if (!evsel) > return -ENOMEM; >