Received: by 2002:a25:ab43:0:0:0:0:0 with SMTP id u61csp8919643ybi; Thu, 6 Jun 2019 23:47:21 -0700 (PDT) X-Google-Smtp-Source: APXvYqwELpbIf29V87u0JUvuDibxfNsgljZGfFyOKCQqT12zdqseyW3O1MmZ1xmdAtJqm9jVI93v X-Received: by 2002:a17:902:24d:: with SMTP id 71mr55490161plc.166.1559890041727; Thu, 06 Jun 2019 23:47:21 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1559890041; cv=none; d=google.com; s=arc-20160816; b=iDCMO7M0DPVBC5MDt+xFth56eySl5Kr0eM1YZEQHV1FvuU9UINP5gEHBImtuBiYuGH w+vWpMh/pXUn6n6W1gMt+OFpIRkJ4oedPtYRzay66VtWBWBOvHD7UX5tsQaQZpyXz5/a KESPx7kzwkkEY/32GcqiTpjt3jQXjRT82FwvB3P2vHl2ZHb+3wrIhJuLLczKF0VTEC16 CKQVZmpmMEh7Lbg6p2RJmSU4TuR4zcl6AV3SJx2ngtqivz1kx4MDSzX41Y+L9wI6ctIw /ZWfFzQ2cI7cMZ/KGgGZ8CaRBMNJ08yisGeCYNx3wcw4HCg3YKotng/OWxUeWKxFyve+ m5uA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature; bh=iFhT8liOalqkQhykuttRe8yLU3IavzSomEkEekLlsnY=; b=djdgNkdEAd+Qs92BRyLxW0NItLoIN2e4j719lwsdSeB1Uoi58mZtlzqEko1hLKh6Da tvE020wY54ePzu9u13uH17yl/qNtShLVltVAemsOd07uPxUqYk6n8KPO8UYV5JB8tR7G UCpGmsnyQKRwUOF/f5c+0UWqUlYWIHORlISILtHKsfgiWK1tiKK/Lx+FjdYMcDG4Y1Mr 3/AtC7iVTLVZ4R98l44OZDZD2/QdoSvBcEJwlr6o8qMIshc7bDgAGzhj58BuHS6X9Jqg VWEs3piOgn5d6/vEmVGpqI2xtdbOQ/wBog5gc1nMaF6hTWSdsKu6ssr52bllyP+39lZ0 aJ1w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b="ByrD0tV/"; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id j11si982665pjs.73.2019.06.06.23.47.04; Thu, 06 Jun 2019 23:47:21 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b="ByrD0tV/"; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727337AbfFGGof (ORCPT + 99 others); Fri, 7 Jun 2019 02:44:35 -0400 Received: from mail-yw1-f68.google.com ([209.85.161.68]:41645 "EHLO mail-yw1-f68.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726761AbfFGGoe (ORCPT ); Fri, 7 Jun 2019 02:44:34 -0400 Received: by mail-yw1-f68.google.com with SMTP id y185so308584ywy.8 for ; Thu, 06 Jun 2019 23:44:34 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=iFhT8liOalqkQhykuttRe8yLU3IavzSomEkEekLlsnY=; b=ByrD0tV/Hxjb3oer9FyKqjVTDHRG9yBaOrPEzhNWnerouMDdM+5XcBU+FUa4wpPXMd ZJZBOwwbnYq8UY5H9rlSnbKvq1H4eZu3agdEIveNKdj4PDVl6PhhP/z99YTUECPRwdJJ fwx2IrBE9zoOuhQKp+AwwOvRPD7RoZ3gUl9CpoiDLqGyXJNUeeZ7Hv16qtkPmdTfLcwM XaT9DNGxMTBIHUvasgsaggQTXtkOV4TbEA/lvTjV+nIfRVl9hlivWvBnnLDu3Q70xyFm EI6pYMcYebOkq7sbKe/s2I36fUI2qCWSpmsv3YTjEuOa6aBz6NbiTz/XWugTuyWv0oCL n1BA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to:user-agent; bh=iFhT8liOalqkQhykuttRe8yLU3IavzSomEkEekLlsnY=; b=VK6lLn2T50/+2mJa8dTVJoBrhoRPLDn/pS19XPKXkQUw+fbLuVbkTnlZStlVI76by6 nm0xG4VYcTWMIeifwy4pPWTFEF40O5LoqR18Bn0V+dMleIKo9bOeUBHRbkd1TCLBp5IP IgLseo/mT5GaUlluAPKkt6HF8vKAR9R/ad7djiPk9uaAslI+JkE6yW6g/52Rb9EghYYP X1OooZ0MDMon7SVa4YgtctJ92SUNheTsxvyXfeuci03N6+iZnLTOH2jASmpaKJq2mr5C 3q/fqclV1PdofYnsqsw7zacYUmf3/LiigViUrNVU2DDTwXQd8OSgBFfpSWw2JKOfJGyp H2eg== X-Gm-Message-State: APjAAAU4ugibc4amIq1OZYHVyribooOdSnles6ni0pOdghnFXioRdhNQ Tu5cMYwfNxm99FFFDntMu+IOGw== X-Received: by 2002:a81:5747:: with SMTP id l68mr13390457ywb.180.1559889873710; Thu, 06 Jun 2019 23:44:33 -0700 (PDT) Received: from leoy-ThinkPad-X240s (li1322-146.members.linode.com. [45.79.223.146]) by smtp.gmail.com with ESMTPSA id c140sm329922ywb.45.2019.06.06.23.44.29 (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Thu, 06 Jun 2019 23:44:33 -0700 (PDT) Date: Fri, 7 Jun 2019 14:44:25 +0800 From: Leo Yan To: Arnaldo Carvalho de Melo Cc: Mathieu Poirier , suzuki.poulose@arm.com, peterz@infradead.org, mingo@redhat.com, alexander.shishkin@linux.intel.com, jolsa@redhat.com, linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH] perf tools: Properly set the value of 'old' and 'head' in snapshot mode Message-ID: <20190607064425.GF5970@leoy-ThinkPad-X240s> References: <20190605161633.12245-1-mathieu.poirier@linaro.org> <20190606201056.GJ21245@kernel.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20190606201056.GJ21245@kernel.org> User-Agent: Mutt/1.9.4 (2018-02-28) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Jun 06, 2019 at 05:10:56PM -0300, Arnaldo Carvalho de Melo wrote: > Em Wed, Jun 05, 2019 at 10:16:33AM -0600, Mathieu Poirier escreveu: > > This patch adds the necessay intelligence to properly compute the value > > of 'old' and 'head' when operating in snapshot mode. That way we can get > > the latest information in the AUX buffer and be compatible with the > > generic AUX ring buffer mechanic. > > Leo, have you had the chance to test/review this one? Suzuki? Sure. I applied this patch on the perf/core branch (with latest commit 3e4fbf36c1e3 'perf augmented_raw_syscalls: Move reading filename to the loop') and passed testing with below steps: # perf record -e cs_etm/@tmc_etr0/ -S -m,64 --per-thread ./sort & [1] 19097 Bubble sorting array of 30000 elements # kill -USR2 19097 # kill -USR2 19097 # kill -USR2 19097 [ perf record: Woken up 4 times to write data ] [ perf record: Captured and wrote 0.753 MB perf.data ] FWIW: Tested-by: Leo Yan > I also changed the subject to: > > [PATCH] perf cs-etm: Properly set the value of 'old' and 'head' in snapshot mode > > So that when looking at a 'git log --oneline' one can have the proper > context and know that its about cs-etm. > > - Arnaldo > > > Signed-off-by: Mathieu Poirier > > --- > > tools/perf/arch/arm/util/cs-etm.c | 127 +++++++++++++++++++++++++++++- > > 1 file changed, 123 insertions(+), 4 deletions(-) > > > > diff --git a/tools/perf/arch/arm/util/cs-etm.c b/tools/perf/arch/arm/util/cs-etm.c > > index 911426721170..0a278bbcaba6 100644 > > --- a/tools/perf/arch/arm/util/cs-etm.c > > +++ b/tools/perf/arch/arm/util/cs-etm.c > > @@ -31,6 +31,8 @@ struct cs_etm_recording { > > struct auxtrace_record itr; > > struct perf_pmu *cs_etm_pmu; > > struct perf_evlist *evlist; > > + int wrapped_cnt; > > + bool *wrapped; > > bool snapshot_mode; > > size_t snapshot_size; > > }; > > @@ -536,16 +538,131 @@ static int cs_etm_info_fill(struct auxtrace_record *itr, > > return 0; > > } > > > > -static int cs_etm_find_snapshot(struct auxtrace_record *itr __maybe_unused, > > +static int cs_etm_alloc_wrapped_array(struct cs_etm_recording *ptr, int idx) > > +{ > > + bool *wrapped; > > + int cnt = ptr->wrapped_cnt; > > + > > + /* Make @ptr->wrapped as big as @idx */ > > + while (cnt <= idx) > > + cnt++; > > + > > + /* > > + * Free'ed in cs_etm_recording_free(). Using realloc() to avoid > > + * cross compilation problems where the host's system supports > > + * reallocarray() but not the target. > > + */ > > + wrapped = realloc(ptr->wrapped, cnt * sizeof(bool)); > > + if (!wrapped) > > + return -ENOMEM; > > + > > + wrapped[cnt - 1] = false; > > + ptr->wrapped_cnt = cnt; > > + ptr->wrapped = wrapped; > > + > > + return 0; > > +} > > + > > +static bool cs_etm_buffer_has_wrapped(unsigned char *buffer, > > + size_t buffer_size, u64 head) > > +{ > > + u64 i, watermark; > > + u64 *buf = (u64 *)buffer; > > + size_t buf_size = buffer_size; > > + > > + /* > > + * We want to look the very last 512 byte (chosen arbitrarily) in > > + * the ring buffer. > > + */ > > + watermark = buf_size - 512; > > + > > + /* > > + * @head is continuously increasing - if its value is equal or greater > > + * than the size of the ring buffer, it has wrapped around. > > + */ > > + if (head >= buffer_size) > > + return true; > > + > > + /* > > + * The value of @head is somewhere within the size of the ring buffer. > > + * This can be that there hasn't been enough data to fill the ring > > + * buffer yet or the trace time was so long that @head has numerically > > + * wrapped around. To find we need to check if we have data at the very > > + * end of the ring buffer. We can reliably do this because mmap'ed > > + * pages are zeroed out and there is a fresh mapping with every new > > + * session. > > + */ > > + > > + /* @head is less than 512 byte from the end of the ring buffer */ > > + if (head > watermark) > > + watermark = head; > > + > > + /* > > + * Speed things up by using 64 bit transactions (see "u64 *buf" above) > > + */ > > + watermark >>= 3; > > + buf_size >>= 3; > > + > > + /* > > + * If we find trace data at the end of the ring buffer, @head has > > + * been there and has numerically wrapped around at least once. > > + */ > > + for (i = watermark; i < buf_size; i++) > > + if (buf[i]) > > + return true; > > + > > + return false; > > +} > > + > > +static int cs_etm_find_snapshot(struct auxtrace_record *itr, > > int idx, struct auxtrace_mmap *mm, > > - unsigned char *data __maybe_unused, > > + unsigned char *data, > > u64 *head, u64 *old) > > { > > + int err; > > + bool wrapped; > > + struct cs_etm_recording *ptr = > > + container_of(itr, struct cs_etm_recording, itr); > > + > > + /* > > + * Allocate memory to keep track of wrapping if this is the first > > + * time we deal with this *mm. > > + */ > > + if (idx >= ptr->wrapped_cnt) { > > + err = cs_etm_alloc_wrapped_array(ptr, idx); > > + if (err) > > + return err; > > + } > > + > > + /* > > + * Check to see if *head has wrapped around. If it hasn't only the > > + * amount of data between *head and *old is snapshot'ed to avoid > > + * bloating the perf.data file with zeros. But as soon as *head has > > + * wrapped around the entire size of the AUX ring buffer it taken. > > + */ > > + wrapped = ptr->wrapped[idx]; > > + if (!wrapped && cs_etm_buffer_has_wrapped(data, mm->len, *head)) { > > + wrapped = true; > > + ptr->wrapped[idx] = true; > > + } > > + > > pr_debug3("%s: mmap index %d old head %zu new head %zu size %zu\n", > > __func__, idx, (size_t)*old, (size_t)*head, mm->len); > > > > - *old = *head; > > - *head += mm->len; > > + /* No wrap has occurred, we can just use *head and *old. */ > > + if (!wrapped) > > + return 0; > > + > > + /* > > + * *head has wrapped around - adjust *head and *old to pickup the > > + * entire content of the AUX buffer. > > + */ > > + if (*head >= mm->len) { > > + *old = *head - mm->len; > > + } else { > > + *head += mm->len; > > + *old = *head - mm->len; > > + } > > > > return 0; > > } > > @@ -586,6 +703,8 @@ static void cs_etm_recording_free(struct auxtrace_record *itr) > > { > > struct cs_etm_recording *ptr = > > container_of(itr, struct cs_etm_recording, itr); > > + > > + zfree(&ptr->wrapped); > > free(ptr); > > } > > > > -- > > 2.17.1 > > -- > > - Arnaldo