Date: Wed, 16 Nov 2022 23:26:36 +0800
Subject: Re: [PATCH RFC 0/6] Add metrics for neoverse-n2
To: James Clark, Nick Forrington, Jumana MP, John Garry
Cc: Will Deacon, Mike Leach, Leo Yan, Peter Zijlstra, Ingo Molnar,
 Arnaldo Carvalho de Melo, Mark Rutland, Alexander Shishkin, Jiri Olsa,
 Namhyung Kim, Andrew Kilroy, Shuai Xue,
 Zhuo Song, linux-arm-kernel@lists.infradead.org,
 linux-perf-users@vger.kernel.org, linux-kernel@vger.kernel.org
References: <1667214694-89839-1-git-send-email-renyu.zj@linux.alibaba.com>
From: Jing Zhang
X-Mailing-List: linux-kernel@vger.kernel.org

On 2022/11/16 at 7:19 PM, James Clark wrote:
>
>
> On 31/10/2022 11:11, Jing Zhang wrote:
>> This series adds six metricgroups for neoverse-n2, among which, the
>> formula of topdown L1 is from the document:
>> https://documentation-service.arm.com/static/60250c7395978b529036da86?token=
>>
>> Since neoverse-n2 does not yet support topdown L2, metricgroups such
>> as Cache, TLB, Branch, InstructionMix, and PEutilization are added to
>> help further analysis of performance bottlenecks.
>>
>
> Hi Jing,
>
> Thanks for working on this, these metrics look ok to me in general,
> although we're currently working on publishing standardised metrics
> across all new cores as part of a new project in Arm. This will include
> N2, and our ones are very similar (or almost identical) to yours,
> barring slightly different group names, metric names, and differences in
> things like outputting topdown metrics as percentages.
>
> We plan to publish our standard metrics some time in the next 2 months.
> Would you consider holding off on merging this change so that we have
> consistent group names and units going forward? Otherwise N2 would be
> the odd one out. I will send you the metrics when they are ready, and we
> will have a script to generate perf jsons from them, so you can review.
>

Do you mean that once you release the new standard metrics, I should
rework my patch to follow them, for example by using consistent group
names and units?

> We also have a slightly different formula for one of the top down
> metrics which I think would be slightly more accurate. We don't have

v2 of the patchset updates the topdown L1 formula.
Link: https://lore.kernel.org/all/1668411720-3581-1-git-send-email-renyu.zj@linux.alibaba.com/

The v2 formula is more accurate than the v1 one, and it has been
verified in our test environment. Could you share your formula first so
that we can discuss it together? :)

Thanks,
Jing

> anything for your "PE utilization" metrics, which I can raise
> internally. It could always be added to perf on top of the standardised
> ones if we don't add it to our standard ones.
>
> Thanks
> James
>
>> with this series on neoverse-n2:
>>
>> $./perf list metricgroup
>>
>> List of pre-defined events (to be used in -e):
>>
>>
>> Metric Groups:
>>
>> Branch
>> Cache
>> InstructionMix
>> PEutilization
>> TLB
>> TopDownL1
>>
>>
>> $./perf list
>>
>> ...
>> Metric Groups:
>>
>> Branch:
>>   branch_miss_pred_rate
>>        [The rate of branches mis-predicted to the overall branches]
>>   branch_mpki
>>        [The rate of branches mis-predicted per kilo instructions]
>>   branch_pki
>>        [The rate of branches retired per kilo instructions]
>> Cache:
>>   l1d_cache_miss_rate
>>        [The rate of L1 D-Cache misses to the overall L1 D-Cache]
>>   l1d_cache_mpki
>>        [The rate of L1 D-Cache misses per kilo instructions]
>> ...
>>
>>
>> $sudo ./perf stat -a -M TLB sleep 1
>>
>>  Performance counter stats for 'system wide':
>>
>>         35,861,936      L1I_TLB       #    0.00 itlb_walk_rate  (74.91%)
>>              5,661      ITLB_WALK                               (74.91%)
>>         97,279,240      INST_RETIRED  #    0.07 itlb_mpki       (74.91%)
>>              6,851      ITLB_WALK                               (74.91%)
>>             26,391      DTLB_WALK     #    0.00 dtlb_walk_rate  (75.07%)
>>         35,585,545      L1D_TLB                                 (75.07%)
>>         85,923,244      INST_RETIRED  #    0.35 dtlb_mpki       (75.11%)
>>             29,992      DTLB_WALK                               (75.11%)
>>
>>        1.003450755 seconds time elapsed
>>
>>
>> Jing Zhang (6):
>>   perf vendor events arm64: Add topdown L1 metrics for neoverse-n2
>>   perf vendor events arm64: Add TLB metrics for neoverse-n2
>>   perf vendor events arm64: Add cache metrics for neoverse-n2
>>   perf vendor events arm64: Add branch metrics for neoverse-n2
>>   perf vendor events arm64: Add PE utilization metrics for neoverse-n2
>>   perf vendor events arm64: Add instruction mix metrics for neoverse-n2
>>
>>  .../arch/arm64/arm/neoverse-n2/metrics.json | 247 +++++++++++++++++++++
>>  1 file changed, 247 insertions(+)
>>  create mode 100644 tools/perf/pmu-events/arch/arm64/arm/neoverse-n2/metrics.json
>>
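
As a sanity check on the TLB numbers quoted above, here is a rough
sketch that recomputes the four values from the raw counts. The formulas
are an assumption inferred from how perf pairs the events in that output
(walks divided by TLB accesses for the "rate" metrics, walks per 1000
retired instructions for the "mpki" metrics); they are not copied from
metrics.json, and the helper names are made up for illustration.

```python
# Counter values copied from the quoted `perf stat -a -M TLB sleep 1` run.
# ASSUMPTION: the formulas below are inferred from the event pairs shown
# next to each metric, not taken from the neoverse-n2 metrics.json.


def rate(numerator: int, denominator: int) -> float:
    """Plain ratio of two event counts; 0.0 if the denominator is zero."""
    return numerator / denominator if denominator else 0.0


def per_kilo_instructions(events: int, instructions: int) -> float:
    """Events per 1000 retired instructions (the 'pki'/'mpki' style)."""
    return 1000.0 * rate(events, instructions)


samples = {
    # ITLB_WALK / L1I_TLB
    "itlb_walk_rate": rate(5_661, 35_861_936),
    # ITLB_WALK / INST_RETIRED * 1000
    "itlb_mpki": per_kilo_instructions(6_851, 97_279_240),
    # DTLB_WALK / L1D_TLB
    "dtlb_walk_rate": rate(26_391, 35_585_545),
    # DTLB_WALK / INST_RETIRED * 1000
    "dtlb_mpki": per_kilo_instructions(29_992, 85_923_244),
}

for name, value in samples.items():
    # Two decimals, matching the precision perf stat prints.
    print(f"{value:8.2f}  {name}")
```

At two decimals these come out the same as the values perf printed. The
walk rates only show up as 0.00 because they are plain ratios on the
order of 1e-4, which is one place where printing them as percentages
would be easier to read.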