From: Arnaldo Carvalho de Melo
Date: Fri, 7 Jun 2019 15:23:25 -0300
To: Leo Yan
Cc: Arnaldo Carvalho de Melo, Mathieu Poirier, suzuki.poulose@arm.com, peterz@infradead.org, mingo@redhat.com, alexander.shishkin@linux.intel.com, jolsa@redhat.com, linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH] perf tools: Properly set the value of 'old' and 'head' in snapshot mode
Message-ID: <20190607182325.GL21245@kernel.org>
References: <20190605161633.12245-1-mathieu.poirier@linaro.org> <20190606201056.GJ21245@kernel.org> <20190607064425.GF5970@leoy-ThinkPad-X240s>
In-Reply-To: <20190607064425.GF5970@leoy-ThinkPad-X240s>
X-Mailing-List: linux-kernel@vger.kernel.org

On Fri, Jun 07, 2019 at 02:44:25PM +0800, Leo Yan wrote:
> On Thu, Jun 06, 2019 at 05:10:56PM -0300, Arnaldo Carvalho de Melo wrote:
> > On Wed, Jun 05, 2019 at 10:16:33AM -0600, Mathieu Poirier wrote:
> > > This patch adds the necessary intelligence to properly compute the value
> > > of 'old' and 'head' when operating in snapshot mode. That way we can get
> > > the latest information in the AUX buffer and be compatible with the
> > > generic AUX ring buffer mechanic.
> >
> > Leo, have you had the chance to test/review this one? Suzuki?
>
> Sure. I applied this patch on the perf/core branch (with latest
> commit 3e4fbf36c1e3 'perf augmented_raw_syscalls: Move reading
> filename to the loop') and passed testing with below steps:
>
> # perf record -e cs_etm/@tmc_etr0/ -S -m,64 --per-thread ./sort &
> [1] 19097
> Bubble sorting array of 30000 elements
>
> # kill -USR2 19097
> # kill -USR2 19097
> # kill -USR2 19097
> [ perf record: Woken up 4 times to write data ]
> [ perf record: Captured and wrote 0.753 MB perf.data ]
>
> FWIW:
>
> Tested-by: Leo Yan

Thanks a lot, I've added your "Tester notes:" and also your Tested-by:.

As I don't have hardware (yet) to test these patches, tests by people
who can test on real hardware are always super appreciated.

Any suggestions for a SBC that I could buy to be able to do so?

Regards,

- Arnaldo

> > I also changed the subject to:
> >
> > [PATCH] perf cs-etm: Properly set the value of 'old' and 'head' in snapshot mode
> >
> > So that when looking at a 'git log --oneline' one can have the proper
> > context and know that it's about cs-etm.
> >
> > - Arnaldo
> >
> > > Signed-off-by: Mathieu Poirier
> > > ---
> > >  tools/perf/arch/arm/util/cs-etm.c | 127 +++++++++++++++++++++++++++++-
> > >  1 file changed, 123 insertions(+), 4 deletions(-)
> > >
> > > diff --git a/tools/perf/arch/arm/util/cs-etm.c b/tools/perf/arch/arm/util/cs-etm.c
> > > index 911426721170..0a278bbcaba6 100644
> > > --- a/tools/perf/arch/arm/util/cs-etm.c
> > > +++ b/tools/perf/arch/arm/util/cs-etm.c
> > > @@ -31,6 +31,8 @@ struct cs_etm_recording {
> > >  	struct auxtrace_record itr;
> > >  	struct perf_pmu *cs_etm_pmu;
> > >  	struct perf_evlist *evlist;
> > > +	int wrapped_cnt;
> > > +	bool *wrapped;
> > >  	bool snapshot_mode;
> > >  	size_t snapshot_size;
> > >  };
> > > @@ -536,16 +538,131 @@ static int cs_etm_info_fill(struct auxtrace_record *itr,
> > >  	return 0;
> > >  }
> > >
> > > -static int cs_etm_find_snapshot(struct auxtrace_record *itr __maybe_unused,
> > > +static int cs_etm_alloc_wrapped_array(struct cs_etm_recording *ptr, int idx)
> > > +{
> > > +	bool *wrapped;
> > > +	int cnt = ptr->wrapped_cnt;
> > > +
> > > +	/* Make @ptr->wrapped as big as @idx */
> > > +	while (cnt <= idx)
> > > +		cnt++;
> > > +
> > > +	/*
> > > +	 * Free'ed in cs_etm_recording_free(). Using realloc() to avoid
> > > +	 * cross compilation problems where the host's system supports
> > > +	 * reallocarray() but not the target.
> > > +	 */
> > > +	wrapped = realloc(ptr->wrapped, cnt * sizeof(bool));
> > > +	if (!wrapped)
> > > +		return -ENOMEM;
> > > +
> > > +	wrapped[cnt - 1] = false;
> > > +	ptr->wrapped_cnt = cnt;
> > > +	ptr->wrapped = wrapped;
> > > +
> > > +	return 0;
> > > +}
> > > +
> > > +static bool cs_etm_buffer_has_wrapped(unsigned char *buffer,
> > > +				      size_t buffer_size, u64 head)
> > > +{
> > > +	u64 i, watermark;
> > > +	u64 *buf = (u64 *)buffer;
> > > +	size_t buf_size = buffer_size;
> > > +
> > > +	/*
> > > +	 * We want to look the very last 512 byte (chosen arbitrarily) in
> > > +	 * the ring buffer.
> > > +	 */
> > > +	watermark = buf_size - 512;
> > > +
> > > +	/*
> > > +	 * @head is continuously increasing - if its value is equal or greater
> > > +	 * than the size of the ring buffer, it has wrapped around.
> > > +	 */
> > > +	if (head >= buffer_size)
> > > +		return true;
> > > +
> > > +	/*
> > > +	 * The value of @head is somewhere within the size of the ring buffer.
> > > +	 * This can be that there hasn't been enough data to fill the ring
> > > +	 * buffer yet or the trace time was so long that @head has numerically
> > > +	 * wrapped around. To find we need to check if we have data at the very
> > > +	 * end of the ring buffer. We can reliably do this because mmap'ed
> > > +	 * pages are zeroed out and there is a fresh mapping with every new
> > > +	 * session.
> > > +	 */
> > > +
> > > +	/* @head is less than 512 byte from the end of the ring buffer */
> > > +	if (head > watermark)
> > > +		watermark = head;
> > > +
> > > +	/*
> > > +	 * Speed things up by using 64 bit transactions (see "u64 *buf" above)
> > > +	 */
> > > +	watermark >>= 3;
> > > +	buf_size >>= 3;
> > > +
> > > +	/*
> > > +	 * If we find trace data at the end of the ring buffer, @head has
> > > +	 * been there and has numerically wrapped around at least once.
> > > +	 */
> > > +	for (i = watermark; i < buf_size; i++)
> > > +		if (buf[i])
> > > +			return true;
> > > +
> > > +	return false;
> > > +}
> > > +
> > > +static int cs_etm_find_snapshot(struct auxtrace_record *itr,
> > >  				int idx, struct auxtrace_mmap *mm,
> > > -				unsigned char *data __maybe_unused,
> > > +				unsigned char *data,
> > >  				u64 *head, u64 *old)
> > >  {
> > > +	int err;
> > > +	bool wrapped;
> > > +	struct cs_etm_recording *ptr =
> > > +			container_of(itr, struct cs_etm_recording, itr);
> > > +
> > > +	/*
> > > +	 * Allocate memory to keep track of wrapping if this is the first
> > > +	 * time we deal with this *mm.
> > > +	 */
> > > +	if (idx >= ptr->wrapped_cnt) {
> > > +		err = cs_etm_alloc_wrapped_array(ptr, idx);
> > > +		if (err)
> > > +			return err;
> > > +	}
> > > +
> > > +	/*
> > > +	 * Check to see if *head has wrapped around. If it hasn't only the
> > > +	 * amount of data between *head and *old is snapshot'ed to avoid
> > > +	 * bloating the perf.data file with zeros. But as soon as *head has
> > > +	 * wrapped around the entire size of the AUX ring buffer it taken.
> > > +	 */
> > > +	wrapped = ptr->wrapped[idx];
> > > +	if (!wrapped && cs_etm_buffer_has_wrapped(data, mm->len, *head)) {
> > > +		wrapped = true;
> > > +		ptr->wrapped[idx] = true;
> > > +	}
> > > +
> > >  	pr_debug3("%s: mmap index %d old head %zu new head %zu size %zu\n",
> > >  		  __func__, idx, (size_t)*old, (size_t)*head, mm->len);
> > >
> > > -	*old = *head;
> > > -	*head += mm->len;
> > > +	/* No wrap has occurred, we can just use *head and *old. */
> > > +	if (!wrapped)
> > > +		return 0;
> > > +
> > > +	/*
> > > +	 * *head has wrapped around - adjust *head and *old to pickup the
> > > +	 * entire content of the AUX buffer.
> > > +	 */
> > > +	if (*head >= mm->len) {
> > > +		*old = *head - mm->len;
> > > +	} else {
> > > +		*head += mm->len;
> > > +		*old = *head - mm->len;
> > > +	}
> > >
> > >  	return 0;
> > >  }
> > > @@ -586,6 +703,8 @@ static void cs_etm_recording_free(struct auxtrace_record *itr)
> > >  {
> > >  	struct cs_etm_recording *ptr =
> > >  			container_of(itr, struct cs_etm_recording, itr);
> > > +
> > > +	zfree(&ptr->wrapped);
> > >  	free(ptr);
> > >  }
> > >
> > > --
> > > 2.17.1
> >
> > --
> >
> > - Arnaldo

--

- Arnaldo
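
For readers following the quoted patch, the wrap detection and the 'old'/'head' adjustment can be sketched outside the perf tree roughly as below. This is only an illustration under stated assumptions, not the code that was applied: tail_has_data() and snapshot_window() are hypothetical names, the scan is done byte by byte rather than in 64-bit words, and the guard for buffers smaller than 512 bytes is an addition that the patch does not have.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Hypothetical stand-in for cs_etm_buffer_has_wrapped(): decide whether a
 * zero-initialised ring buffer of @size bytes has been filled at least once.
 * Byte-wise scan for clarity; the patch scans u64 words instead.
 */
static bool tail_has_data(const unsigned char *buf, size_t size, uint64_t head)
{
	/* Assumption: clamp instead of underflowing when size < 512. */
	size_t start = size > 512 ? size - 512 : 0;
	size_t i;

	/* head never decreases, so head >= size means it went past the end. */
	if (head >= size)
		return true;

	/* Only look at bytes beyond head, in the last 512 bytes at most. */
	if (head > start)
		start = (size_t)head;

	/* Non-zero bytes past head mean head has been there before. */
	for (i = start; i < size; i++)
		if (buf[i])
			return true;

	return false;
}

/*
 * Hypothetical stand-in for the tail of cs_etm_find_snapshot(): once the
 * buffer has wrapped, make [*old, *head) span exactly @len bytes so the
 * whole AUX buffer is collected; otherwise leave both values untouched.
 */
static void snapshot_window(bool wrapped, size_t len,
			    uint64_t *head, uint64_t *old)
{
	if (!wrapped)
		return;

	if (*head < len)
		*head += len;

	*old = *head - len;
}
```

With a 1024-byte buffer that has wrapped and head at 100, the window becomes old = 100 and head = 1124, so a consumer reading from old to head, modulo the buffer size, picks up the entire buffer. Before any wrap, old and head are used as they are, which avoids writing the still-zeroed part of the buffer to perf.data.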