Subject: Re: [PATCH 3/5] perf record: Tracking side-band events for all CPUs when tracing selected CPUs
From: Yang Jihong
To: Adrian Hunter
Date: Wed, 12 Jul 2023 22:44:22 +0800
Message-ID: <747a2780-10d8-8094-3251-8e2c15f961b0@huawei.com>
In-Reply-To: <206972a3-d44d-1c75-3fbc-426427614543@intel.com>
References: <20230704074217.240939-1-yangjihong1@huawei.com> <20230704074217.240939-4-yangjihong1@huawei.com> <206972a3-d44d-1c75-3fbc-426427614543@intel.com>
X-Mailing-List: linux-kernel@vger.kernel.org

Hello,

On 2023/7/11 21:13, Adrian Hunter wrote:
> On 4/07/23 10:42, Yang Jihong wrote:
>> User space tasks can migrate between CPUs, so we need to track side-band
>> events for all CPUs.
>>
>> The specific scenarios are as follows:
>>
>>          CPU0                                 CPU1
>>   perf record -C 0 start
>>                               taskA starts to be created and executed
>>                                 -> PERF_RECORD_COMM and PERF_RECORD_MMAP
>>                                    events are delivered only to CPU1
>>                               ......
>>                                 |
>>                           migrate to CPU0
>>                                 |
>>   Running on CPU0    <----------/
>>   ...
>>
>>   perf record -C 0 stop
>>
>> Now perf samples the PC of taskA. However, perf does not record the
>> PERF_RECORD_COMM and PERF_RECORD_MMAP events of taskA.
>> Therefore, the comm and symbols of taskA cannot be parsed.
>>
>> The sys_perf_event_open invoked is as follows:
>>
>>   # perf --debug verbose=3 record -e cpu-clock -C 1 true
>>
>>   Opening: cpu-clock
>>   ------------------------------------------------------------
>>   perf_event_attr:
>>     type                             1
>>     size                             136
>>     { sample_period, sample_freq }   4000
>>     sample_type                      IP|TID|TIME|ID|CPU|PERIOD
>>     read_format                      ID|LOST
>>     disabled                         1
>>     inherit                          1
>>     freq                             1
>>     sample_id_all                    1
>>     exclude_guest                    1
>>   ------------------------------------------------------------
>>   sys_perf_event_open: pid -1  cpu 1  group_fd -1  flags 0x8 = 5
>>   Opening: dummy:HG
>>   ------------------------------------------------------------
>>   perf_event_attr:
>>     type                             1
>>     size                             136
>>     config                           0x9
>>     { sample_period, sample_freq }   4000
>>     sample_type                      IP|TID|TIME|ID|CPU|PERIOD
>>     read_format                      ID|LOST
>>     inherit                          1
>>     mmap                             1
>>     comm                             1
>>     freq                             1
>>     task                             1
>>     sample_id_all                    1
>>     mmap2                            1
>>     comm_exec                        1
>>     ksymbol                          1
>>     bpf_event                        1
>>   ------------------------------------------------------------
>>   sys_perf_event_open: pid -1  cpu 0  group_fd -1  flags 0x8 = 6
>>   sys_perf_event_open: pid -1  cpu 1  group_fd -1  flags 0x8 = 7
>>   sys_perf_event_open: pid -1  cpu 2  group_fd -1  flags 0x8 = 9
>>   sys_perf_event_open: pid -1  cpu 3  group_fd -1  flags 0x8 = 10
>>   sys_perf_event_open: pid -1  cpu 4  group_fd -1  flags 0x8 = 11
>>   sys_perf_event_open: pid -1  cpu 5  group_fd -1  flags 0x8 = 12
>>   sys_perf_event_open: pid -1  cpu 6  group_fd -1  flags 0x8 = 13
>>   sys_perf_event_open: pid -1  cpu 7  group_fd -1  flags 0x8 = 14
>>
>>
>> Signed-off-by: Yang Jihong
>> ---
>>  tools/perf/builtin-record.c | 31 +++++++++++++++++++++++++++++++
>>  1 file changed, 31 insertions(+)
>>
>> diff --git a/tools/perf/builtin-record.c b/tools/perf/builtin-record.c
>> index 8872cd037f2c..69e0d8c75aab 100644
>> --- a/tools/perf/builtin-record.c
>> +++ b/tools/perf/builtin-record.c
>> @@ -908,6 +908,31 @@ static int record__config_off_cpu(struct record *rec)
>>  	return off_cpu_prepare(rec->evlist, &rec->opts.target, &rec->opts);
>>  }
>>
>> +static int record__config_tracking_events(struct record *rec)
>> +{
>> +	struct evsel *evsel;
>> +	struct evlist *evlist = rec->evlist;
>> +	struct record_opts *opts = &rec->opts;
>> +
>> +	/*
>> +	 * User space tasks can migrate between CPUs, so when tracing
>> +	 * selected CPUs, sideband for all CPUs is still needed.
>> +	 */
>> +	if (opts->target.cpu_list) {
>
> I am not sure if anyone minds doing this by default, but perhaps
> we should say something about it on the perf record man page.
>

Okay, I will add a note to the man page.

>> +		evsel = evlist__findnew_tracking_event(evlist);
>> +		if (!evsel)
>> +			return -ENOMEM;
>> +
>> +		if (!evsel->core.system_wide) {
>> +			evsel->core.system_wide = true;
>> +			evsel__set_sample_bit(evsel, TIME);
>> +			perf_evlist__propagate_maps(&evlist->core, &evsel->core);
>> +		}
>
> Perhaps better to export via internel/evsel.h
>
> void perf_evsel__go_system_wide(struct perf_evlist *evlist, struct perf_evsel *evsel)
> {
> 	if (!evsel->system_wide) {
> 		evsel->system_wide = true;
> 		if (evlist->needs_map_propagation)
> 			__perf_evlist__propagate_maps(evlist, evsel);
> 	}
> }
>
> As suggested in response to patch 2, perhaps deal with system_wide
> inside evlist__findnew_tracking_event()
>

Okay, I'll modify it as above, which probably means we need to export
perf_evlist__propagate_maps(). However, as mentioned in the discussion of
patch 1, __perf_evlist__propagate_maps() is low-level and we should avoid
exporting it.

Alternatively, can we export perf_evsel__go_system_wide() through
internal/evlist.h? That way, we would not need to export
perf_evlist__propagate_maps(). If so, would it be more appropriate to
name it perf_evlist__go_system_wide()?

Thanks,
Yang
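
P.S. To make the alternative concrete, here is a minimal sketch of the evlist-side variant. It is only an illustration under assumptions: the two structs are cut-down stand-ins rather than the real libperf definitions, nr_propagations is a hypothetical counter added so the effect can be observed, and the body of __perf_evlist__propagate_maps() is a placeholder. The only point is that the low-level helper can stay file-local while a single entry point is exported.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Stand-in types: NOT the real libperf structs, only the fields used here. */
struct perf_evsel {
	bool system_wide;
	int nr_propagations;	/* hypothetical, makes propagation observable */
};

struct perf_evlist {
	bool needs_map_propagation;
};

/* Placeholder for the low-level helper; it stays static (unexported). */
static void __perf_evlist__propagate_maps(struct perf_evlist *evlist,
					  struct perf_evsel *evsel)
{
	(void)evlist;
	evsel->nr_propagations++;
}

/*
 * Candidate exported entry point, named from the evlist side because it
 * would live next to the static helper above.
 */
void perf_evlist__go_system_wide(struct perf_evlist *evlist,
				 struct perf_evsel *evsel)
{
	if (!evsel->system_wide) {
		evsel->system_wide = true;
		if (evlist->needs_map_propagation)
			__perf_evlist__propagate_maps(evlist, evsel);
	}
}
```

With this shape, a caller such as evlist__findnew_tracking_event() would only need the one exported function, and calling it a second time on the same evsel does nothing because system_wide is already set.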