Received: by 2002:ac0:a594:0:0:0:0:0 with SMTP id m20-v6csp1300413imm; Tue, 22 May 2018 01:40:08 -0700 (PDT) X-Google-Smtp-Source: AB8JxZrKOreUekJeslYH+4ohW6JAAL4R9D0Ay6/eDFN92rzha1WQaG2yRRigZBYDy0IX5dC2yRY5 X-Received: by 2002:a62:98cb:: with SMTP id d72-v6mr23150471pfk.98.1526978408137; Tue, 22 May 2018 01:40:08 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1526978408; cv=none; d=google.com; s=arc-20160816; b=EmYwdf7NdyHaqXamUJSrD+WbIHslkj3ptQTu/OZErLiJlzyrgEU3xWS8lkduxLqccr ZKKiND4b2BYwZt2WKQcDeITcVCc0MPHRBNrjswo9YkppHqJpL6MpsVQW/OCvFKsILJmK jGfDWJam61BrzcRt2BsXNMf7xNeEmEgv2eFwKXG+PxuYqNxIIZ7K7LLNooKRTTlTNRtc Jham/aql6NpuiXlE9IwnCDwfvjKG5urvrKl9U1rKaal9DNzZ1l4vA39/IIyfgFeCSmrk 8xvhAlFH24GnIkp2oHe4yvSPJHdGrhvuRkNGLA1aqCpbpNDNgE4TY2FwWAQdXSwgqHwH k54A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature:arc-authentication-results; bh=kAUwDb0EkBkcp4j9Xj3XkPMa1zxX5DcL2CzrAbGqey4=; b=UU17JwskCfU/cLUqZ7QUpozQVlawJ67E3uIv2KJz/fry5lDoKr0qUFpk2+EOsKiqYt XEvsNOA/D2Xu0Um4CzNkukghD1bw28dGs9pfg+++vo+VcA4yXCyMxkTLBSL2GnKkQM5f 6RMMi5bTdSrCz1SRtD2f8mHV2CZuZv7r2cAI9QZ4TrcIRsmTq3CMQM0N6NbBgp97RShl Chj7Ml75y9B9Blp8L82SGaoYU7z6+CmQyD19Q9QsFZKvVlhKMhwNjNAlL0YyEp6dYoP2 QLTJNF+kD7w+fzpNbpLsQSb2nylv9e/pRBbElprSyXqJBcrPmkwqyFyKz3gZGsZUOkz+ 94vQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=LN0R4UOo; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id u6-v6si12160521pgv.420.2018.05.22.01.39.53; Tue, 22 May 2018 01:40:08 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=LN0R4UOo; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751346AbeEVIji (ORCPT + 99 others); Tue, 22 May 2018 04:39:38 -0400 Received: from mail-pg0-f68.google.com ([74.125.83.68]:36049 "EHLO mail-pg0-f68.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750709AbeEVIje (ORCPT ); Tue, 22 May 2018 04:39:34 -0400 Received: by mail-pg0-f68.google.com with SMTP id 63-v6so5403829pgg.3 for ; Tue, 22 May 2018 01:39:34 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=kAUwDb0EkBkcp4j9Xj3XkPMa1zxX5DcL2CzrAbGqey4=; b=LN0R4UOouE46Vmjh5QI556F27x+k2InaQfA4yqayDiDhPOdmCxxqyZUjWlVkGVV/lG 3efPP3qNQaaQwdirbo8kLkiNLDkCTgoI/6Nu40E9Cm6Vwi+IwnYRVBIxqcELndz2YxNo X/EO1TFQztA7JrV0qx+bvRJNDR8BzGat8N1fM= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to:user-agent; bh=kAUwDb0EkBkcp4j9Xj3XkPMa1zxX5DcL2CzrAbGqey4=; b=pAorQGLejYD79ry9Nj8o7lgSx1zwqwc92rkRfzVmquEXkr0iiWH6gXAO4w9Jm9ks05 nSvdWcwmstUg7+cuTmJy3ZI6O1uPW2zArrDX1vVV9jBaGO4h96QGsWpypOEyJiUqhuTJ 8yaXwxac/DMkvKnLSy6l5jYVkQDj2fb9XZC7E6ePnX9MMyKip+A+nMCeL4MdnZ3B/0d1 5T8Kvmwpey1U06E1RwLfNeYfsOU2SQGUuml7ajwDXX+xZ4+acNNjONDSoaK0cSEJVi/V iLXHexlBPkMPJGo0UJPq/owzfLRmzbtg7GjgrdqomP8c26ET5nRG8HvLB4943ZZx6v2m AUEg== X-Gm-Message-State: ALKqPwfed9ncCmPgotHitOtDDcVG8Yu6KN0uK+mPR4vLIM+3ojovt0hV +2dxUSIZvCK2+eVt69dyekXRQQ== X-Received: by 2002:a65:4502:: with SMTP id n2-v6mr17875951pgq.95.1526978373815; Tue, 22 May 2018 01:39:33 -0700 (PDT) Received: from leoy-ThinkPad-X240s (li1170-132.members.linode.com. [45.79.71.132]) by smtp.gmail.com with ESMTPSA id z25-v6sm32018142pfi.171.2018.05.22.01.39.28 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Tue, 22 May 2018 01:39:32 -0700 (PDT) Date: Tue, 22 May 2018 16:39:20 +0800 From: Leo Yan To: Robert Walker Cc: Arnaldo Carvalho de Melo , Mathieu Poirier , Jonathan Corbet , Peter Zijlstra , Ingo Molnar , Alexander Shishkin , Jiri Olsa , Namhyung Kim , linux-arm-kernel@lists.infradead.org, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, Tor Jeremiassen , mike.leach@linaro.org, kim.phillips@arm.com, coresight@lists.linaro.org, Mike Leach Subject: Re: [RFT v2 1/4] perf cs-etm: Generate sample for missed packets Message-ID: <20180522083920.GD31075@leoy-ThinkPad-X240s> References: <1526892748-326-1-git-send-email-leo.yan@linaro.org> <1526892748-326-2-git-send-email-leo.yan@linaro.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Rob, On Mon, May 21, 2018 at 12:27:42PM +0100, Robert Walker wrote: > Hi Leo, > > On 21/05/18 09:52, Leo Yan wrote: > >Commit e573e978fb12 ("perf cs-etm: Inject capabilitity for CoreSight > >traces") reworks the samples generation flow from CoreSight trace to > >match the correct format so Perf report tool can display the samples > >properly. But the change has side effect for packet handling, it only > >generate samples when 'prev_packet->last_instr_taken_branch' is true, > >this results in the start tracing packet and exception packets are > >dropped. > > > >This patch checks extra two conditions for complete samples: > > > >- If 'prev_packet->sample_type' is zero we can use this condition to > > get to know this is the start tracing packet; for this case, the start > > packet's end_addr is zero as well so we need to handle it in the > > function cs_etm__last_executed_instr(); > > > > I think you also need to add something in to handle discontinuities in > trace - for example it is possible to configure the ETM to only trace > execution in specific code regions or to trace a few cycles every so > often. In these cases, prev_packet->sample_type will not be zero, but > whatever the previous packet was. You will get a CS_ETM_TRACE_ON packet in > such cases, generated by an I_TRACE_ON element in the trace stream. > You also get this on exception return. > > However, you should also keep the test for prev_packet->sample_type == 0 > as you may not see a CS_ETM_TRACE_ON when decoding a buffer that has > wrapped. Thanks for reviewing. Let's dig more detailed into this issue, especially for handling packet CS_ETM_TRACE_ON, I'd like divide into two sub cases. - The first case is for using python script: I use python script to analyze packets with below command: ./perf script --itrace=ril128 -s arm-cs-trace-disasm.py -F cpu,event,ip,addr,sym -- -v -d objdump -k ./vmlinux What I observe is after we pass python script with parameter '-s arm-cs-trace-disasm.py', then instruction tracing options '--itrace=ril128' isn't really used; the perf tool creates another new process for launch python script and re-enter cmd_script() function, but at the second time when invoke cmd_script() for python script execution the option '--itrace=ril128' is dropped and all parameters are only valid defined by the python script. As result, I can the variable 'etmq->etm->synth_opts.last_branch' is always FALSE for running python script. So all CS_ETM_TRACE_ON packets will be ignored in the function cs_etm__flush(). Even the CS_ETM_TRACE_ON packets are missed to handle, the program flow still can work well. The reason is without the interference by CS_ETM_TRACE_ON, the CS_ETM_RANGE packets can smoothly create instruction range by ignore the middle CS_ETM_TRACE_ON packet. Please see below example, in this example there have 3 packets, the first one packet is CS_ETM_RANGE packet which is labelled with 'PACKET_1', the first one packet can properly generate branch sample data with previous packet as expected; the second packet is PACKET_2 which is CS_ETM_TRACE_ON, but 'etmq->etm->synth_opts.last_branch' is false so function cs_etm__flush() doesn't handle it and skip the swap operation "etmq->prev_packet = tmp"; the third packet is PACKET_3, which is CS_ETM_RANGE packet and we can see it's smoontly to create continous instruction range between PACKET_1 and PACKET_3. cs_etm__sample: prev_packet: sample_type=1 exc=0 exc_ret=0 cpu=1 start_addr=0xffff000008a5f79c end_addr=0xffff000008a5f7bc last_instr_taken_branch=1 PACKET_1: cs_etm__sample: packet: sample_type=1 exc=0 exc_ret=0 cpu=1 start_addr=0xffff000008a5f858 end_addr=0xffff000008a5f864 last_instr_taken_branch=1 cs_etm__synth_branch_sample: ip=0xffff000008a5f7b8 addr=0xffff000008a5f858 pid=2290 tid=2290 id=1000000021 stream_id=1000000021 period=1 cpu=1 flags=0 cpumode=2 cs_etm__flush: prev_packet: sample_type=1 exc=0 exc_ret=0 cpu=1 start_addr=0xffff000008a5f858 end_addr=0xffff000008a5f864 last_instr_taken_branch=1 PACKET_2: cs_etm__flush: packet: sample_type=2 exc=0 exc_ret=0 cpu=2 start_addr=0xdeadbeefdeadbeef end_addr=0xdeadbeefdeadbeef last_instr_taken_branch=1 cs_etm__sample: prev_packet: sample_type=1 exc=0 exc_ret=0 cpu=1 start_addr=0xffff000008a5f858 end_addr=0xffff000008a5f864 last_instr_taken_branch=1 PACKET_3: cs_etm__sample: packet: sample_type=1 exc=0 exc_ret=0 cpu=2 start_addr=0xffff000008be7528 end_addr=0xffff000008be7538 last_instr_taken_branch=1 cs_etm__synth_branch_sample: ip=0xffff000008a5f860 addr=0xffff000008be7528 pid=2290 tid=2290 id=1000000021 stream_id=1000000021 period=1 cpu=2 flags=0 cpumode=2 So seems to me, the CS_ETM_TRACE_ON packet doesn't introduce trouble for the program flow analysis if we can handle all CS_ETM_RANGE packets and without handling CS_ETM_TRACE_ON packet for branch samples. - The second case is for --itrace option without python script: ./perf script --itrace=ril -F cpu,event,ip,addr,sym -k ./vmlinux In this case, the flag 'etmq->etm->synth_opts.last_branch' is true so CS_ETM_TRACE_ON packet will be handled; but I can observe the CS_ETM_RANGE packet in etmq->prev_packet isn't handled in the function cs_etm__flush() for branch sample, so actually we miss some branch sample for this case. So I think we also need handle CS_ETM_RANGE packet in function cs_etm__flush() to generate branch samples. But this has side effect, we introduce the extra track for CS_ETM_TRACE_ON packet for branch samples, so we will see one branch range like: [ 0xdeadbeefdeadbeef .. 0xdeadbeefdeadbeef ]. Please reivew below change is okay for you? Thanks a lot for suggestions. diff --git a/tools/perf/util/cs-etm.c b/tools/perf/util/cs-etm.c index 822ba91..37d3722 100644 --- a/tools/perf/util/cs-etm.c +++ b/tools/perf/util/cs-etm.c @@ -495,6 +495,13 @@ static inline void cs_etm__reset_last_branch_rb(struct cs_etm_queue *etmq) static inline u64 cs_etm__last_executed_instr(struct cs_etm_packet *packet) { /* + * The packet is the start tracing packet if the end_addr is zero, + * returns 0 for this case. + */ + if (!packet->end_addr) + return 0; + + /* * The packet records the execution range with an exclusive end address * * A64 instructions are constant size, so the last executed @@ -897,13 +904,28 @@ static int cs_etm__sample(struct cs_etm_queue *etmq) etmq->period_instructions = instrs_over; } - if (etm->sample_branches && - etmq->prev_packet && - etmq->prev_packet->sample_type == CS_ETM_RANGE && - etmq->prev_packet->last_instr_taken_branch) { - ret = cs_etm__synth_branch_sample(etmq); - if (ret) - return ret; + if (etm->sample_branches && etmq->prev_packet) { + bool generate_sample = false; + + /* Generate sample for start tracing packet */ + if (etmq->prev_packet->sample_type == 0 || + etmq->prev_packet->sample_type == CS_ETM_TRACE_ON) + generate_sample = true; + + /* Generate sample for exception packet */ + if (etmq->prev_packet->exc == true) + generate_sample = true; + + /* Generate sample for normal branch packet */ + if (etmq->prev_packet->sample_type == CS_ETM_RANGE && + etmq->prev_packet->last_instr_taken_branch) + generate_sample = true; + + if (generate_sample) { + ret = cs_etm__synth_branch_sample(etmq); + if (ret) + return ret; + } } if (etm->sample_branches || etm->synth_opts.last_branch) { @@ -921,12 +943,17 @@ static int cs_etm__sample(struct cs_etm_queue *etmq) static int cs_etm__flush(struct cs_etm_queue *etmq) { + struct cs_etm_auxtrace *etm = etmq->etm; int err = 0; struct cs_etm_packet *tmp; - if (etmq->etm->synth_opts.last_branch && - etmq->prev_packet && - etmq->prev_packet->sample_type == CS_ETM_RANGE) { + if (!etmq->prev_packet) + return 0; + + if (etmq->prev_packet->sample_type != CS_ETM_RANGE) + return 0; + + if (etmq->etm->synth_opts.last_branch) { /* * Generate a last branch event for the branches left in the * circular buffer at the end of the trace. @@ -939,18 +966,25 @@ static int cs_etm__flush(struct cs_etm_queue *etmq) err = cs_etm__synth_instruction_sample( etmq, addr, etmq->period_instructions); + if (err) + return err; etmq->period_instructions = 0; + } - /* - * Swap PACKET with PREV_PACKET: PACKET becomes PREV_PACKET for - * the next incoming packet. - */ - tmp = etmq->packet; - etmq->packet = etmq->prev_packet; - etmq->prev_packet = tmp; + if (etm->sample_branches) { + err = cs_etm__synth_branch_sample(etmq); + if (err) + return err; } - return err; + /* + * Swap PACKET with PREV_PACKET: PACKET becomes PREV_PACKET for + * the next incoming packet. + */ + tmp = etmq->packet; + etmq->packet = etmq->prev_packet; + etmq->prev_packet = tmp; + return 0; } static int cs_etm__run_decoder(struct cs_etm_queue *etmq) -- 2.7.4