Received: by 2002:a05:6602:18e:0:0:0:0 with SMTP id m14csp1892472ioo; Mon, 23 May 2022 05:47:02 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzZVEPpfD9oSKkGqODO74oDuumOQwKfmQ6cq4b/H0CUcYZ8+PgRooWfGnmC4sSa0niYo2MJ X-Received: by 2002:a17:902:bb0f:b0:161:f859:bec7 with SMTP id im15-20020a170902bb0f00b00161f859bec7mr15239649plb.0.1653310022025; Mon, 23 May 2022 05:47:02 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1653310022; cv=none; d=google.com; s=arc-20160816; b=UFl5d4/oUxPkM8APfzYXKC+GbdT1U8akr/o/bteQZnpEId5SkaxT5xN5g14TpKNZK9 ntIe8sEyPuivx51MYlMRgxrKXlaPJ54d5bUBMlrMURXo0cmZfwxcj83PcywXnaIPHLMd GO3CIDLStCmww1pxP7w71Dbb7fRs+2hoBg7E6CbFxpyfe9FPdtu1ykqP9C4uFrY4TG9L euwfD8Xg1xwW7FFSHwSSYE/EBK4dVlHcebwGC/aUZ3ota75zwi57SyTLcuUnpL/GJaUS q6XqAN6uQmCpps3Gocv5t/tbpX2tUTH25IDQPwRZij7iQO9gordfJ2tGJppO59HY0PwG ZmhA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-transfer-encoding :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature; bh=Vcn3rR8OhIbrRzbAa+Qr8Yx0KMgEM9B6pQzGYHEkRII=; b=LHFpMdNq7XtoebEwvBmYvpdGpZ8tr41mRsdFamAbEc4T9oj/Rrg19YGobEYWHhH8sp Zw4Tuvvi2xnBiedTfkQ4JWUFX5OYqQ+e3YBhExloBHSegSvQEL2V2E+33Y4coaRukUGm cCdeoTWnMefnR9qTbN9ElSYK2V9E94ScMKUsWOJOHE/Kx/dDUfVPiCpmTPtM+hJfFmtd YwIq1zUS66E0kx9PvMKWIh9BDSyeL5phes9JZdbinT57YIQVjgxs41wC22pHVjWdFDLZ 5TBWsBYMc2pvrMNO1bT2J76oDGSLoINfneoZmZHBWKXqzL/YDioeaJhMeb8mQkikv6Sc 3XoQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=dXdMkL8P; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [23.128.96.19]) by mx.google.com with ESMTPS id g12-20020a170902d5cc00b0015ec0b44cd1si9329274plh.199.2022.05.23.05.47.01 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 23 May 2022 05:47:02 -0700 (PDT) Received-SPF: softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) client-ip=23.128.96.19; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=dXdMkL8P; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 2B5C34FC41; Mon, 23 May 2022 05:47:00 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235740AbiEWMqt (ORCPT + 99 others); Mon, 23 May 2022 08:46:49 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:40570 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S235659AbiEWMqr (ORCPT ); Mon, 23 May 2022 08:46:47 -0400 Received: from dfw.source.kernel.org (dfw.source.kernel.org [IPv6:2604:1380:4641:c500::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 41282527FD; Mon, 23 May 2022 05:46:45 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id C2A5061336; Mon, 23 May 2022 12:46:44 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id CEEADC385A9; Mon, 23 May 2022 12:46:43 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1653310004; bh=n362nAZJp3aj4NcxskSm616SiABVTuWrekUeDWsgm3A=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=dXdMkL8P/85JhD6HxonwoORKJUbAJ+0Q8xKxRSB7qgk13+QpOZSL9zExrylTX/Wgp mkOk6eSUsZXxdDeRG3bBHhJ8mEJQH/g48d9SwF1TM52+iir/jyf4FoGV/xaaOrqsAq Zeqah/xMchem7xnZmwQWna3iFx/GHKYfsADGAp6IEQVk4bb6A/uyS6joPP/32IetcP awRHWUOBR3Jsb69uPWiW1XOMWrTNdjlZacjmEiV/9Zxkmd7nIR0rln7gqHW0h10a/m FAEgIilBPc+vMhmTdPZFizvrJY47wS8tLcIbgbJ+0x7Wrnz9jw1PmUnTYvDiVnXaYM vWAeNfR4EJ55w== Received: by quaco.ghostprotocols.net (Postfix, from userid 1000) id 5E492400B1; Mon, 23 May 2022 09:46:41 -0300 (-03) Date: Mon, 23 May 2022 09:46:41 -0300 From: Arnaldo Carvalho de Melo To: Leo Yan Cc: Peter Zijlstra , Ingo Molnar , Mark Rutland , Alexander Shishkin , Jiri Olsa , Namhyung Kim , Like Xu , Alyssa Ross , Ian Rogers , Kajol Jain , Adam Li , Li Huafei , German Gomez , James Clark , Kan Liang , Ali Saidi , Joe Mario , linux-perf-users@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH v3 04/11] perf mem: Add statistics for peer snooping Message-ID: References: <20220518055729.1869566-1-leo.yan@linaro.org> <20220518055729.1869566-5-leo.yan@linaro.org> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: X-Url: http://acmel.wordpress.com X-Spam-Status: No, score=-3.0 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,MAILING_LIST_MULTI, RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Em Mon, May 23, 2022 at 09:38:31AM -0300, Arnaldo Carvalho de Melo escreveu: > Em Wed, May 18, 2022 at 01:57:22PM +0800, Leo Yan escreveu: > > Since the flag PERF_MEM_SNOOPX_PEER is added to support cache snooping > > from peer cache line, it can come from a peer core, a peer cluster, or > > a remote NUMA node. > > > > This patch adds statistics for the flag PERF_MEM_SNOOPX_PEER. Note, we > > take PERF_MEM_SNOOPX_PEER as an affiliated info, it needs to cooperate > > with cache level statistics. Therefore, we account the load operations > > for both the cache level's metrics (e.g. ld_l2hit, ld_llchit, etc.) and > > the metric 'ld_peer' when flag PERF_MEM_SNOOPX_PEER is set. > > > > Signed-off-by: Leo Yan > > Tested-by: Ali Saidi > > util/mem-events.c: In function ‘c2c_decode_stats’: > util/mem-events.c:536:17: error: ‘PERF_MEM_SNOOPX_PEER’ undeclared (first use in this function); did you mean ‘PERF_MEM_SNOOPX_FWD’? So I kept the first three patches: ⬢[acme@toolbox perf]$ git log --oneline -5 12aeaaba087d6d92 (HEAD -> perf/core) perf c2c: Update documentation for store metric 'N/A' 550b4d6f9a7e5ddc perf c2c: Add dimensions for 'N/A' metrics of store operation 9845063710725424 perf mem: Add stats for store operation with no available memory level 508c9fbce0d30da3 perf build: Error for BPF skeletons without LIBBPF 0869331fbaa2c11c Merge remote-tracking branch 'torvalds/master' into perf/core ⬢[acme@toolbox perf]$ With Jiri's ack, now waiting for clarification about the v9 discussion on the base patchkit. - Arnaldo > > Should I fix this as suggested by the compiler? > > 536 | #define P(a, b) PERF_MEM_##a##_##b > | ^~~~~~~~~ > util/mem-events.c:562:46: note: in expansion of macro ‘P’ > 562 | if (snoopx & P(SNOOPX, PEER)) > | ^ > util/mem-events.c:536:17: note: each undeclared identifier is reported only once for each function it appears in > 536 | #define P(a, b) PERF_MEM_##a##_##b > | ^~~~~~~~~ > util/mem-events.c:562:46: note: in expansion of macro ‘P’ > 562 | if (snoopx & P(SNOOPX, PEER)) > | ^ > make[4]: *** [/var/home/acme/git/perf/tools/build/Makefile.build:96: /tmp/build/perf/util/mem-events.o] Error 1 > make[4]: *** Waiting for unfinished jobs.... > LD /tmp/build/perf/util/scripting-engines/perf-in.o > make[3]: *** [/var/home/acme/git/perf/tools/build/Makefile.build:139: util] Error 2 > make[2]: *** [Makefile.perf:664: /tmp/build/perf/perf-in.o] Error 2 > make[1]: *** [Makefile.perf:240: sub-make] Error 2 > make: *** [Makefile:113: install-bin] Error 2 > make: Leaving directory '/var/home/acme/git/perf/tools/perf' > > Performance counter stats for 'make -k BUILD_BPF_SKEL=1 CORESIGHT=1 PYTHON=python3 O=/tmp/build/perf -C tools/perf install-bin': > > 31,749,639,340 cycles:u > 57,052,398,827 instructions:u # 1.80 insn per cycle > > 2.123830023 seconds time elapsed > > 7.146520000 seconds user > 1.707080000 seconds sys > > > ⬢[acme@toolbox perf]$ > > > --- > > tools/perf/util/mem-events.c | 22 +++++++++++++++++++--- > > tools/perf/util/mem-events.h | 1 + > > 2 files changed, 20 insertions(+), 3 deletions(-) > > > > diff --git a/tools/perf/util/mem-events.c b/tools/perf/util/mem-events.c > > index 5dca1882c284..9de0eb3a1200 100644 > > --- a/tools/perf/util/mem-events.c > > +++ b/tools/perf/util/mem-events.c > > @@ -525,6 +525,7 @@ int c2c_decode_stats(struct c2c_stats *stats, struct mem_info *mi) > > u64 op = data_src->mem_op; > > u64 lvl = data_src->mem_lvl; > > u64 snoop = data_src->mem_snoop; > > + u64 snoopx = data_src->mem_snoopx; > > u64 lock = data_src->mem_lock; > > u64 blk = data_src->mem_blk; > > /* > > @@ -567,18 +568,28 @@ do { \ > > if (lvl & P(LVL, IO)) stats->ld_io++; > > if (lvl & P(LVL, LFB)) stats->ld_fbhit++; > > if (lvl & P(LVL, L1 )) stats->ld_l1hit++; > > - if (lvl & P(LVL, L2 )) stats->ld_l2hit++; > > + if (lvl & P(LVL, L2)) { > > + stats->ld_l2hit++; > > + > > + if (snoopx & P(SNOOPX, PEER)) > > + stats->ld_peer++; > > + } > > if (lvl & P(LVL, L3 )) { > > if (snoop & P(SNOOP, HITM)) > > HITM_INC(lcl_hitm); > > else > > stats->ld_llchit++; > > + > > + if (snoopx & P(SNOOPX, PEER)) > > + stats->ld_peer++; > > } > > > > if (lvl & P(LVL, LOC_RAM)) { > > stats->lcl_dram++; > > if (snoop & P(SNOOP, HIT)) > > stats->ld_shared++; > > + else if (snoopx & P(SNOOPX, PEER)) > > + stats->ld_peer++; > > else > > stats->ld_excl++; > > } > > @@ -597,10 +608,14 @@ do { \ > > if ((lvl & P(LVL, REM_CCE1)) || > > (lvl & P(LVL, REM_CCE2)) || > > mrem) { > > - if (snoop & P(SNOOP, HIT)) > > + if (snoop & P(SNOOP, HIT)) { > > stats->rmt_hit++; > > - else if (snoop & P(SNOOP, HITM)) > > + } else if (snoop & P(SNOOP, HITM)) { > > HITM_INC(rmt_hitm); > > + } else if (snoopx & P(SNOOPX, PEER)) { > > + stats->rmt_hit++; > > + stats->ld_peer++; > > + } > > } > > > > if ((lvl & P(LVL, MISS))) > > @@ -661,6 +676,7 @@ void c2c_add_stats(struct c2c_stats *stats, struct c2c_stats *add) > > stats->ld_l1hit += add->ld_l1hit; > > stats->ld_l2hit += add->ld_l2hit; > > stats->ld_llchit += add->ld_llchit; > > + stats->ld_peer += add->ld_peer; > > stats->lcl_hitm += add->lcl_hitm; > > stats->rmt_hitm += add->rmt_hitm; > > stats->tot_hitm += add->tot_hitm; > > diff --git a/tools/perf/util/mem-events.h b/tools/perf/util/mem-events.h > > index 8a8b568baeee..4879b841c841 100644 > > --- a/tools/perf/util/mem-events.h > > +++ b/tools/perf/util/mem-events.h > > @@ -75,6 +75,7 @@ struct c2c_stats { > > u32 ld_l1hit; /* count of loads that hit L1D */ > > u32 ld_l2hit; /* count of loads that hit L2D */ > > u32 ld_llchit; /* count of loads that hit LLC */ > > + u32 ld_peer; /* count of loads that hit peer core or cluster cache */ > > u32 lcl_hitm; /* count of loads with local HITM */ > > u32 rmt_hitm; /* count of loads with remote HITM */ > > u32 tot_hitm; /* count of loads with local and remote HITM */ > > -- > > 2.25.1 > > -- > > - Arnaldo -- - Arnaldo