Received: by 2002:a05:6a10:6d25:0:0:0:0 with SMTP id gq37csp515521pxb; Sat, 11 Sep 2021 12:05:43 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwiTY+fBlpVBbfvQR7lHBtUkm6hrMvMMBJ58qY9ko8VkpMZF1XVtAb9sW1CI3Zo1u6+YkoL X-Received: by 2002:a17:906:1615:: with SMTP id m21mr4278079ejd.279.1631387143738; Sat, 11 Sep 2021 12:05:43 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1631387143; cv=none; d=google.com; s=arc-20160816; b=sXAQjd8t7Anzj40cZDDFIi18nJ5w103UIIjV/bYoBdWJc0ZAehc+iKHjOVOpSiyte0 lG0pI2aV/Sh9KZrlJCuUEPepRlHH+OU/fpfrpfbjb9FmgXul/h7R/bIVCNkMn0lYkp+7 O+V8bS1NqGrCp0xfnSpzoRymR4QpKIDMQNn/QhpNRnGqsZhkylaP2KIofYIiRkdk3dkC sN+RdxKxHfgsnGdtkZwc2ROlFvi9Htg+pGpwTA8jNr3Rktyf6fgz303FoaYvyejsQwEC yCsWZLj8MjD4Z2J8WG99vAvOZ9AjXtVEFNiUgeH4olztXgnY1sFFIs9PPWkQzVbXa5N5 b/Aw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-transfer-encoding :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature; bh=u45U4wBHotuIAGtmYzw9rGAJTUP+bEzf0VqEMQL7e00=; b=WIkBza/BrEKeecygH1JbDkPsBZZGwXcwtFKPH3mHN2y6PcFY2H/Oy8ScxvT/M+CtAP GIkS3RvL76H4RUl/IEa6N6LZ+2qQEoLq5pxx6IVwU3qRVp1xcS+ClHsmMWTW0KTPcaGf HdrZIzJRjnOnmDXtugEuaqnjL7u17ufsnpDNdoejZUYY8fFzPEVfoztDIBPfPHoQWUkz 0NaJnetuW0WMYjIJCwiEiZZQmDocaYNEBnMOFHZMifURi7DP9q/Wd+OYpOR6DxfF0vFK 8Q8V4pxTy4mVRs1p4dnk9GKPY5GfpOHgv3ETCRUzYCLM1lspaGCVBkybVfOt3TACZtxb KoZw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=HNjO2aev; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id v9si2616755edw.524.2021.09.11.12.05.20; Sat, 11 Sep 2021 12:05:43 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=HNjO2aev; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233573AbhIKTFP (ORCPT + 99 others); Sat, 11 Sep 2021 15:05:15 -0400 Received: from mail.kernel.org ([198.145.29.99]:51364 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233408AbhIKTFN (ORCPT ); Sat, 11 Sep 2021 15:05:13 -0400 Received: by mail.kernel.org (Postfix) with ESMTPSA id ABBA7611AD; Sat, 11 Sep 2021 19:04:00 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1631387040; bh=2eadlDRtpQV1TdvJOkIKZ7stwiEATGAMi1Djp66poZ8=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=HNjO2aevbWeI6qzm+rnpluzKv7DXDM+1/EmKXpxE1NUyYCnd00/KdhaD6tbYAfA6/ MY08R9ZlyMs1lgeEXa0cn10M0Zq1yfPE8fgtsN8xx0TnBuzPZVFFIncJuXLCaj6U3C clXuw5u07hebQh8Xm1DCyk59WGd0rO6VsA0lakemy9Zx5Z+48fdT6mImawp86G0Egd xIf80vW536lZBYW8SvEb4d2ksSEIqbbBwHATRpyYl2eU6DyqQPIdWOHIEtkt0ZHSa5 TfkJVr2XkkEx8zlO2RoUz8OMMvudd84STYqUE2ye8UkmYWFpr/GNSBLWD0v8yr5w6L v66KI0p75t/8g== Received: by quaco.ghostprotocols.net (Postfix, from userid 1000) id 6BFB34038F; Sat, 11 Sep 2021 16:03:57 -0300 (-03) Date: Sat, 11 Sep 2021 16:03:57 -0300 From: Arnaldo Carvalho de Melo To: Jin Yao , Ravi Bangoria Cc: mark.rutland@arm.com, alexander.shishkin@linux.intel.com, jolsa@redhat.com, namhyung@kernel.org, kim.phillips@amd.com, linux-perf-users@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH v2 1/2] perf annotate: Fix fused instr logic for assembly functions Message-ID: References: <20210911043854.8373-1-ravi.bangoria@amd.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20210911043854.8373-1-ravi.bangoria@amd.com> X-Url: http://acmel.wordpress.com Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Em Sat, Sep 11, 2021 at 10:08:53AM +0530, Ravi Bangoria escreveu: > Some x86 microarchitectures fuse a subset of cmp/test/ALU instructions > with branch instructions, and thus perf annotate highlight such valid > pairs as fused. Jin, are you ok with this? Can I have your reviewed-by? - Arnaldo > When annotated with source, perf uses struct disasm_line to contain > either source or instruction line from objdump output. Usually, a C > statement generates multiple instructions which include such > cmp/test/ALU + branch instruction pairs. But in case of assembly > function, each individual assembly source line generate one > instruction. Perf annotate instruction fusion logic assumes previous > disasm_line as previous instruction line, which is wrong because, > for assembly function, previous disasm_line contains source line. > And thus perf fails to highlight valid fused instruction pairs for > assembly functions. > > Fix it by searching backward until we find an instruction line and > consider that disasm_line as fused with current branch instruction. > > Before: > │ cmpq %rcx, RIP+8(%rsp) > 0.00 │ cmp %rcx,0x88(%rsp) > │ je .Lerror_bad_iret <--- Source line > 0.14 │ ┌──je b4 <--- Instruction line > │ │movl %ecx, %eax > > After: > │ cmpq %rcx, RIP+8(%rsp) > 0.00 │ ┌──cmp %rcx,0x88(%rsp) > │ │je .Lerror_bad_iret > 0.14 │ ├──je b4 > │ │movl %ecx, %eax > > Signed-off-by: Ravi Bangoria > --- > tools/perf/ui/browser.c | 33 ++++++++++++++++++++++--------- > tools/perf/ui/browser.h | 2 +- > tools/perf/ui/browsers/annotate.c | 24 +++++++++++++++------- > 3 files changed, 42 insertions(+), 17 deletions(-) > > diff --git a/tools/perf/ui/browser.c b/tools/perf/ui/browser.c > index 781afe42e90e..fa5bd5c20e96 100644 > --- a/tools/perf/ui/browser.c > +++ b/tools/perf/ui/browser.c > @@ -757,25 +757,40 @@ void __ui_browser__line_arrow(struct ui_browser *browser, unsigned int column, > } > > void ui_browser__mark_fused(struct ui_browser *browser, unsigned int column, > - unsigned int row, bool arrow_down) > + unsigned int row, int diff, bool arrow_down) > { > - unsigned int end_row; > + int end_row; > > - if (row >= browser->top_idx) > - end_row = row - browser->top_idx; > - else > + if (diff <= 0) > return; > > SLsmg_set_char_set(1); > > if (arrow_down) { > + if (row + diff <= browser->top_idx) > + return; > + > + end_row = row + diff - browser->top_idx; > ui_browser__gotorc(browser, end_row, column - 1); > - SLsmg_write_char(SLSMG_ULCORN_CHAR); > - ui_browser__gotorc(browser, end_row, column); > - SLsmg_draw_hline(2); > - ui_browser__gotorc(browser, end_row + 1, column - 1); > SLsmg_write_char(SLSMG_LTEE_CHAR); > + > + while (--end_row >= 0 && end_row > (int)(row - browser->top_idx)) { > + ui_browser__gotorc(browser, end_row, column - 1); > + SLsmg_draw_vline(1); > + } > + > + end_row = (int)(row - browser->top_idx); > + if (end_row >= 0) { > + ui_browser__gotorc(browser, end_row, column - 1); > + SLsmg_write_char(SLSMG_ULCORN_CHAR); > + ui_browser__gotorc(browser, end_row, column); > + SLsmg_draw_hline(2); > + } > } else { > + if (row < browser->top_idx) > + return; > + > + end_row = row - browser->top_idx; > ui_browser__gotorc(browser, end_row, column - 1); > SLsmg_write_char(SLSMG_LTEE_CHAR); > ui_browser__gotorc(browser, end_row, column); > diff --git a/tools/perf/ui/browser.h b/tools/perf/ui/browser.h > index 3678eb88f119..510ce4554050 100644 > --- a/tools/perf/ui/browser.h > +++ b/tools/perf/ui/browser.h > @@ -51,7 +51,7 @@ void ui_browser__write_graph(struct ui_browser *browser, int graph); > void __ui_browser__line_arrow(struct ui_browser *browser, unsigned int column, > u64 start, u64 end); > void ui_browser__mark_fused(struct ui_browser *browser, unsigned int column, > - unsigned int row, bool arrow_down); > + unsigned int row, int diff, bool arrow_down); > void __ui_browser__show_title(struct ui_browser *browser, const char *title); > void ui_browser__show_title(struct ui_browser *browser, const char *title); > int ui_browser__show(struct ui_browser *browser, const char *title, > diff --git a/tools/perf/ui/browsers/annotate.c b/tools/perf/ui/browsers/annotate.c > index ef4da4295bf7..e81c2493efdf 100644 > --- a/tools/perf/ui/browsers/annotate.c > +++ b/tools/perf/ui/browsers/annotate.c > @@ -125,13 +125,20 @@ static void annotate_browser__write(struct ui_browser *browser, void *entry, int > ab->selection = al; > } > > -static bool is_fused(struct annotate_browser *ab, struct disasm_line *cursor) > +static int is_fused(struct annotate_browser *ab, struct disasm_line *cursor) > { > struct disasm_line *pos = list_prev_entry(cursor, al.node); > const char *name; > + int diff = 1; > + > + while (pos && pos->al.offset == -1) { > + pos = list_prev_entry(pos, al.node); > + if (!ab->opts->hide_src_code) > + diff++; > + } > > if (!pos) > - return false; > + return 0; > > if (ins__is_lock(&pos->ins)) > name = pos->ops.locked.ins.name; > @@ -139,9 +146,11 @@ static bool is_fused(struct annotate_browser *ab, struct disasm_line *cursor) > name = pos->ins.name; > > if (!name || !cursor->ins.name) > - return false; > + return 0; > > - return ins__is_fused(ab->arch, name, cursor->ins.name); > + if (ins__is_fused(ab->arch, name, cursor->ins.name)) > + return diff; > + return 0; > } > > static void annotate_browser__draw_current_jump(struct ui_browser *browser) > @@ -155,6 +164,7 @@ static void annotate_browser__draw_current_jump(struct ui_browser *browser) > struct annotation *notes = symbol__annotation(sym); > u8 pcnt_width = annotation__pcnt_width(notes); > int width; > + int diff = 0; > > /* PLT symbols contain external offsets */ > if (strstr(sym->name, "@plt")) > @@ -205,11 +215,11 @@ static void annotate_browser__draw_current_jump(struct ui_browser *browser) > pcnt_width + 2 + notes->widths.addr + width, > from, to); > > - if (is_fused(ab, cursor)) { > + diff = is_fused(ab, cursor); > + if (diff > 0) { > ui_browser__mark_fused(browser, > pcnt_width + 3 + notes->widths.addr + width, > - from - 1, > - to > from); > + from - diff, diff, to > from); > } > } > > -- > 2.27.0 -- - Arnaldo