Received: by 2002:ac0:a594:0:0:0:0:0 with SMTP id m20-v6csp3517719imm; Fri, 25 May 2018 07:05:48 -0700 (PDT) X-Google-Smtp-Source: AB8JxZq47fc5+kydvLWZ+J81AlTw+KNb/ab2UMPiQ+lOtpl3KZLwJg/WPifaUaZ3v/+VCp+ztJlO X-Received: by 2002:aa7:8254:: with SMTP id e20-v6mr1513404pfn.140.1527257148574; Fri, 25 May 2018 07:05:48 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1527257148; cv=none; d=google.com; s=arc-20160816; b=U1DbabvZh7RZCjCGafkkvSZLqZwJr6fPIyUOoBZs2dB5qN5hMI+oLrHbw2QXzdzyrx BXf83Jgrr7ddndlXCaMP6w+6MLg6OArduBYlw6p17ueDiqMcUpSEj6aNaeJTzMwli6OB VzhGOqKpIgEflZK9pNppZKm3thOT53SYBE9wUN37W04qRBV9svSF8YUjEQdZ0Ur/PD6k l4iBKUp/asJa3SJ2dN+9ponqdog8LdBgUsKlBLxmdSA1tpnvLwqu09anQD5daBR/Reb2 EhK24QxCAVX3v9XkeEubIUQrMsRHS0RYexTC24E0Ae7gnsk6+X9xTPw2EKupfyUbrEXu nXmA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:arc-authentication-results; bh=ddA85mfzVCN9FJd7Z9VFUNZ7w74FZbqJnC3+j74lqR4=; b=pZoBs0bvg+Ozb3gOVK+H8+zVGrecFHQ9KvlR1RzpbTeXIVwyG697Cq45BVuujEr9Rx NxrmA/oseKWVVJQAYQ3c+8YBZTxjBIUKFHT4LQPrHOtI4ZHGvbJXn/k8G0WgYU6ClfnU KzXLA1Dh6hKENaMGrRTtRO8/BY9jyN4g5tOru6l9TrPAfOdr9DmoBoah0gdny0T0nZ4P Btzv/8+8FqSwOxBomA/NTfcaoCsqJidFhHdMvDJZi0fU4tWTxZ2pqO4mwJc7hHDUt/3+ 2mX4AWl6MoBJKqUnaDNt6Q1TPjgqkWuiU6sMziQah/rcI3kKIbv5tN46o1GPW7rM/1YM Z3qw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id x24-v6si23572261pfk.311.2018.05.25.07.05.29; Fri, 25 May 2018 07:05:48 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S936162AbeEYODy (ORCPT + 99 others); Fri, 25 May 2018 10:03:54 -0400 Received: from usa-sjc-mx-foss1.foss.arm.com ([217.140.101.70]:34246 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S935792AbeEYODx (ORCPT ); Fri, 25 May 2018 10:03:53 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.72.51.249]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 731CA80D; Fri, 25 May 2018 07:03:52 -0700 (PDT) Received: from [10.0.2.15] (unknown [10.37.12.89]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 285243F557; Fri, 25 May 2018 07:03:48 -0700 (PDT) Subject: Re: [RFT v2 1/4] perf cs-etm: Generate sample for missed packets To: Leo Yan Cc: Arnaldo Carvalho de Melo , Mathieu Poirier , Jonathan Corbet , Peter Zijlstra , Ingo Molnar , Alexander Shishkin , Jiri Olsa , Namhyung Kim , linux-arm-kernel@lists.infradead.org, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, Tor Jeremiassen , mike.leach@linaro.org, kim.phillips@arm.com, coresight@lists.linaro.org, Mike Leach References: <1526892748-326-1-git-send-email-leo.yan@linaro.org> <1526892748-326-2-git-send-email-leo.yan@linaro.org> <20180522083920.GD31075@leoy-ThinkPad-X240s> From: Robert Walker Message-ID: <03a53a72-8af5-02de-21a8-fd02a0974ed6@arm.com> Date: Fri, 25 May 2018 15:03:47 +0100 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.8.0 MIME-Version: 1.0 In-Reply-To: <20180522083920.GD31075@leoy-ThinkPad-X240s> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Content-Language: en-US Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Leo, Following the discussions from your reply to this with a simplified patch, this version of the patch works better as you also need to emit a branch sample when handling a CS_ETM_TRACE_ON packet to indicate the end of a block of trace. This patch does not break the output from perf inject to generate instruction samples for AutoFDO, so I am happy with that. Regards Rob Reviewed-by: Robert Walker On 22/05/18 09:39, Leo Yan wrote: > Hi Rob, > > On Mon, May 21, 2018 at 12:27:42PM +0100, Robert Walker wrote: >> Hi Leo, >> >> On 21/05/18 09:52, Leo Yan wrote: >>> Commit e573e978fb12 ("perf cs-etm: Inject capabilitity for CoreSight >>> traces") reworks the samples generation flow from CoreSight trace to >>> match the correct format so Perf report tool can display the samples >>> properly. But the change has side effect for packet handling, it only >>> generate samples when 'prev_packet->last_instr_taken_branch' is true, >>> this results in the start tracing packet and exception packets are >>> dropped. >>> >>> This patch checks extra two conditions for complete samples: >>> >>> - If 'prev_packet->sample_type' is zero we can use this condition to >>> get to know this is the start tracing packet; for this case, the start >>> packet's end_addr is zero as well so we need to handle it in the >>> function cs_etm__last_executed_instr(); >>> >> I think you also need to add something in to handle discontinuities in >> trace - for example it is possible to configure the ETM to only trace >> execution in specific code regions or to trace a few cycles every so >> often. In these cases, prev_packet->sample_type will not be zero, but >> whatever the previous packet was. You will get a CS_ETM_TRACE_ON packet in >> such cases, generated by an I_TRACE_ON element in the trace stream. >> You also get this on exception return. >> >> However, you should also keep the test for prev_packet->sample_type == 0 >> as you may not see a CS_ETM_TRACE_ON when decoding a buffer that has >> wrapped. > Thanks for reviewing. Let's dig more detailed into this issue, > especially for handling packet CS_ETM_TRACE_ON, I'd like divide into two > sub cases. > > - The first case is for using python script: > > I use python script to analyze packets with below command: > ./perf script --itrace=ril128 -s arm-cs-trace-disasm.py -F cpu,event,ip,addr,sym -- -v -d objdump -k ./vmlinux > > What I observe is after we pass python script with parameter '-s > arm-cs-trace-disasm.py', then instruction tracing options > '--itrace=ril128' isn't really used; the perf tool creates another > new process for launch python script and re-enter cmd_script() > function, but at the second time when invoke cmd_script() for python > script execution the option '--itrace=ril128' is dropped and all > parameters are only valid defined by the python script. > > As result, I can the variable 'etmq->etm->synth_opts.last_branch' is > always FALSE for running python script. So all CS_ETM_TRACE_ON > packets will be ignored in the function cs_etm__flush(). > > Even the CS_ETM_TRACE_ON packets are missed to handle, the program > flow still can work well. The reason is without the interference by > CS_ETM_TRACE_ON, the CS_ETM_RANGE packets can smoothly create > instruction range by ignore the middle CS_ETM_TRACE_ON packet. > > Please see below example, in this example there have 3 packets, the > first one packet is CS_ETM_RANGE packet which is labelled with > 'PACKET_1', the first one packet can properly generate branch sample > data with previous packet as expected; the second packet is > PACKET_2 which is CS_ETM_TRACE_ON, but > 'etmq->etm->synth_opts.last_branch' is false so function > cs_etm__flush() doesn't handle it and skip the swap operation > "etmq->prev_packet = tmp"; the third packet is PACKET_3, which is > CS_ETM_RANGE packet and we can see it's smoontly to create > continous instruction range between PACKET_1 and PACKET_3. > > cs_etm__sample: prev_packet: sample_type=1 exc=0 exc_ret=0 cpu=1 start_addr=0xffff000008a5f79c end_addr=0xffff000008a5f7bc last_instr_taken_branch=1 > PACKET_1: cs_etm__sample: packet: sample_type=1 exc=0 exc_ret=0 cpu=1 start_addr=0xffff000008a5f858 end_addr=0xffff000008a5f864 last_instr_taken_branch=1 > cs_etm__synth_branch_sample: ip=0xffff000008a5f7b8 addr=0xffff000008a5f858 pid=2290 tid=2290 id=1000000021 stream_id=1000000021 period=1 cpu=1 flags=0 cpumode=2 > > cs_etm__flush: prev_packet: sample_type=1 exc=0 exc_ret=0 cpu=1 start_addr=0xffff000008a5f858 end_addr=0xffff000008a5f864 last_instr_taken_branch=1 > PACKET_2: cs_etm__flush: packet: sample_type=2 exc=0 exc_ret=0 cpu=2 start_addr=0xdeadbeefdeadbeef end_addr=0xdeadbeefdeadbeef last_instr_taken_branch=1 > > cs_etm__sample: prev_packet: sample_type=1 exc=0 exc_ret=0 cpu=1 start_addr=0xffff000008a5f858 end_addr=0xffff000008a5f864 last_instr_taken_branch=1 > PACKET_3: cs_etm__sample: packet: sample_type=1 exc=0 exc_ret=0 cpu=2 start_addr=0xffff000008be7528 end_addr=0xffff000008be7538 last_instr_taken_branch=1 > cs_etm__synth_branch_sample: ip=0xffff000008a5f860 addr=0xffff000008be7528 pid=2290 tid=2290 id=1000000021 stream_id=1000000021 period=1 cpu=2 flags=0 cpumode=2 > > So seems to me, the CS_ETM_TRACE_ON packet doesn't introduce trouble > for the program flow analysis if we can handle all CS_ETM_RANGE > packets and without handling CS_ETM_TRACE_ON packet for branch > samples. > > - The second case is for --itrace option without python script: > ./perf script --itrace=ril -F cpu,event,ip,addr,sym -k ./vmlinux > > In this case, the flag 'etmq->etm->synth_opts.last_branch' is true > so CS_ETM_TRACE_ON packet will be handled; but I can observe the > CS_ETM_RANGE packet in etmq->prev_packet isn't handled in the > function cs_etm__flush() for branch sample, so actually we miss some > branch sample for this case. > > So I think we also need handle CS_ETM_RANGE packet in function > cs_etm__flush() to generate branch samples. But this has side > effect, we introduce the extra track for CS_ETM_TRACE_ON packet for > branch samples, so we will see one branch range like: > [ 0xdeadbeefdeadbeef .. 0xdeadbeefdeadbeef ]. > > Please reivew below change is okay for you? Thanks a lot for > suggestions. > > diff --git a/tools/perf/util/cs-etm.c b/tools/perf/util/cs-etm.c > index 822ba91..37d3722 100644 > --- a/tools/perf/util/cs-etm.c > +++ b/tools/perf/util/cs-etm.c > @@ -495,6 +495,13 @@ static inline void cs_etm__reset_last_branch_rb(struct cs_etm_queue *etmq) > static inline u64 cs_etm__last_executed_instr(struct cs_etm_packet *packet) > { > /* > + * The packet is the start tracing packet if the end_addr is zero, > + * returns 0 for this case. > + */ > + if (!packet->end_addr) > + return 0; > + > + /* > * The packet records the execution range with an exclusive end address > * > * A64 instructions are constant size, so the last executed > @@ -897,13 +904,28 @@ static int cs_etm__sample(struct cs_etm_queue *etmq) > etmq->period_instructions = instrs_over; > } > > - if (etm->sample_branches && > - etmq->prev_packet && > - etmq->prev_packet->sample_type == CS_ETM_RANGE && > - etmq->prev_packet->last_instr_taken_branch) { > - ret = cs_etm__synth_branch_sample(etmq); > - if (ret) > - return ret; > + if (etm->sample_branches && etmq->prev_packet) { > + bool generate_sample = false; > + > + /* Generate sample for start tracing packet */ > + if (etmq->prev_packet->sample_type == 0 || > + etmq->prev_packet->sample_type == CS_ETM_TRACE_ON) > + generate_sample = true; > + > + /* Generate sample for exception packet */ > + if (etmq->prev_packet->exc == true) > + generate_sample = true; > + > + /* Generate sample for normal branch packet */ > + if (etmq->prev_packet->sample_type == CS_ETM_RANGE && > + etmq->prev_packet->last_instr_taken_branch) > + generate_sample = true; > + > + if (generate_sample) { > + ret = cs_etm__synth_branch_sample(etmq); > + if (ret) > + return ret; > + } > } > > if (etm->sample_branches || etm->synth_opts.last_branch) { > @@ -921,12 +943,17 @@ static int cs_etm__sample(struct cs_etm_queue *etmq) > > static int cs_etm__flush(struct cs_etm_queue *etmq) > { > + struct cs_etm_auxtrace *etm = etmq->etm; > int err = 0; > struct cs_etm_packet *tmp; > > - if (etmq->etm->synth_opts.last_branch && > - etmq->prev_packet && > - etmq->prev_packet->sample_type == CS_ETM_RANGE) { > + if (!etmq->prev_packet) > + return 0; > + > + if (etmq->prev_packet->sample_type != CS_ETM_RANGE) > + return 0; > + > + if (etmq->etm->synth_opts.last_branch) { > /* > * Generate a last branch event for the branches left in the > * circular buffer at the end of the trace. > @@ -939,18 +966,25 @@ static int cs_etm__flush(struct cs_etm_queue *etmq) > err = cs_etm__synth_instruction_sample( > etmq, addr, > etmq->period_instructions); > + if (err) > + return err; > etmq->period_instructions = 0; > + } > > - /* > - * Swap PACKET with PREV_PACKET: PACKET becomes PREV_PACKET for > - * the next incoming packet. > - */ > - tmp = etmq->packet; > - etmq->packet = etmq->prev_packet; > - etmq->prev_packet = tmp; > + if (etm->sample_branches) { > + err = cs_etm__synth_branch_sample(etmq); > + if (err) > + return err; > } > > - return err; > + /* > + * Swap PACKET with PREV_PACKET: PACKET becomes PREV_PACKET for > + * the next incoming packet. > + */ > + tmp = etmq->packet; > + etmq->packet = etmq->prev_packet; > + etmq->prev_packet = tmp; > + return 0; > } > > static int cs_etm__run_decoder(struct cs_etm_queue *etmq)