Received: by 2002:a05:6358:d09b:b0:dc:cd0c:909e with SMTP id jc27csp9155249rwb; Thu, 24 Nov 2022 08:46:57 -0800 (PST) X-Google-Smtp-Source: AA0mqf6uJgK+923wAbaiHEAmhvL95HCE4JjGMa6dKc8tsrVhEt+RgiGkz/f4njk1Pz0MT0sF/+i4 X-Received: by 2002:a17:90a:708a:b0:20a:eaab:137 with SMTP id g10-20020a17090a708a00b0020aeaab0137mr36796452pjk.206.1669308416807; Thu, 24 Nov 2022 08:46:56 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1669308416; cv=none; d=google.com; s=arc-20160816; b=AAmXPFNpLrslMDk1jG32bsp+IQsW6nAMyExvhZSCGj067U3hsz3WcoY87yAIUZEAj1 58IP8MReVcz7VJ3+6sP5fgENJ9A1gtrD6C32zztdQMqEHfAl6ITAiOa6lOlutpfDL9oQ PJ2THhxJ6mPlMZHNBRwMWwPLdYOzFUKUNvBvEUjXhcbFC4I1zW5U/flY3vtlYsUqkg/1 RDY5okmg7+Jwst2wFzGpQlrcweeciBw1ug6GCBEguaRs4oey8naJbpn/UacdgAddcQgK iZTMqYy9Socfjev/DYsf1p8CfTbaPf+izdk9cxoS9kIuMVxdVJlsHsiENBJYlwra3BfN ZWAg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:subject:user-agent:mime-version:date:message-id; bh=pNltCZEZPBi5o2K3XxfnsKAfGNdjGL2eGIkUYo7WWpM=; b=St2ypOHiNteJb6vGgcdIQX/koghLUpnfxGVZC2nXWDfC5QAT1x3WkQMzrZJT9qk2lI QrMx9pnU9MhdkbdQBUAWF0U+7P9VLmwPPu1oiB+j6xei2uvevt/TwyB+hOmzWzB79UAg GMnz8PnLWadchVUtknnrYSS9EuWbgS2KZR7B7BcTHHfrOLqosmpUQ/aQN6qap34mAdJ6 TxZhwMApuEzUKS+ZBvsvItnZSd+6KQyU9ihQBUQhs27dPqEHl36XcbDMD4zLM7fdK4kT 8L1ZakbBSWZiv162T1SSFQBM7gg44nVT5rtfVFI3kw2DvrZvk2E6wAO1LF5bziinR4Qs OcAg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id lb10-20020a17090b4a4a00b00213c2f26cb5si5199443pjb.126.2022.11.24.08.46.45; Thu, 24 Nov 2022 08:46:56 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229544AbiKXQdI (ORCPT + 87 others); Thu, 24 Nov 2022 11:33:08 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:35054 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229499AbiKXQdG (ORCPT ); Thu, 24 Nov 2022 11:33:06 -0500 Received: from out30-45.freemail.mail.aliyun.com (out30-45.freemail.mail.aliyun.com [115.124.30.45]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 609E1769E9; Thu, 24 Nov 2022 08:33:04 -0800 (PST) X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R831e4;CH=green;DM=||false|;DS=||;FP=0|-1|-1|-1|0|-1|-1|-1;HT=ay29a033018046050;MF=renyu.zj@linux.alibaba.com;NM=1;PH=DS;RN=20;SR=0;TI=SMTPD_---0VVbikvs_1669307577; Received: from 30.0.180.161(mailfrom:renyu.zj@linux.alibaba.com fp:SMTPD_---0VVbikvs_1669307577) by smtp.aliyun-inc.com; Fri, 25 Nov 2022 00:32:59 +0800 Message-ID: <50d46dcf-5e30-cd3d-82a2-51e527ff641f@linux.alibaba.com> Date: Fri, 25 Nov 2022 00:32:55 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:102.0) Gecko/20100101 Thunderbird/102.4.1 Subject: Re: [External] : [RFC PATCH v2 1/6] perf vendor events arm64: Add topdown L1 metrics for neoverse-n2 To: James Clark , John Garry Cc: Peter Zijlstra , Ingo Molnar , Arnaldo Carvalho de Melo , Mark Rutland , Alexander Shishkin , Jiri Olsa , Namhyung Kim , Andrew Kilroy , Shuai Xue , Zhuo Song , linux-arm-kernel@lists.infradead.org, linux-perf-users@vger.kernel.org, linux-kernel@vger.kernel.org, Will Deacon , Mike Leach , Leo Yan , Ian Rogers , Nick Forrington References: <1667214694-89839-1-git-send-email-renyu.zj@linux.alibaba.com> <1668411720-3581-2-git-send-email-renyu.zj@linux.alibaba.com> <590ff032-d271-48ee-a4d8-141cc070c335@oracle.com> <75c4f0e6-3f28-a748-e891-7be6016ca28e@oracle.com> <57315669-e6e7-08b8-a252-bc35d4fecc01@arm.com> <180a34c2-f68d-6f4d-da74-7bbb80e9e65c@linux.alibaba.com> <279545ee-1758-c60d-fdc3-2b15bcc4be6d@arm.com> From: Jing Zhang In-Reply-To: <279545ee-1758-c60d-fdc3-2b15bcc4be6d@arm.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-9.9 required=5.0 tests=BAYES_00, ENV_AND_HDR_SPF_MATCH,NICE_REPLY_A,RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2,SPF_HELO_NONE,SPF_PASS,UNPARSEABLE_RELAY, USER_IN_DEF_SPF_WL autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org 在 2022/11/23 下午10:26, James Clark 写道: > > > On 22/11/2022 15:41, Jing Zhang wrote: >> >> >> 在 2022/11/22 下午10:00, James Clark 写道: >>> >>> >>> On 21/11/2022 17:55, John Garry wrote: >>>> On 21/11/2022 15:17, Jing Zhang wrote: >>>>> I'm sorry that I misunderstood the purpose of putting metric as >>>>> arch_std_event at first, >>>>> and now it works after the modification over your suggestion. >>>>> >>>>> But there are also a few questions: >>>>> >>>>> 1. The value of the slot in the topdownL1 is various in different >>>>> architectures, for example, >>>>> the slot is 5 on neoverse-n2. If I put topdownL1 metric as >>>>> arch_std_event, then I need to >>>>> specify the slot to 5 in n2. I can specify slot values in metric like >>>>> below, but is there any >>>>> other concise way to do this? >>>>> >>>>> diff --git >>>>> a/tools/perf/pmu-events/arch/arm64/arm/neoverse-n2/metrics.json >>>>> b/tools/perf/pmu-events/arch/arm64/arm/neoverse-n2/metrics.json >>>>> index 8ff1dfe..b473baf 100644 >>>>> --- a/tools/perf/pmu-events/arch/arm64/arm/neoverse-n2/metrics.json >>>>> +++ b/tools/perf/pmu-events/arch/arm64/arm/neoverse-n2/metrics.json >>>>> @@ -1,4 +1,23 @@ >>>>> [ >>>>> +       { >>>>> +               "MetricExpr": "5", >>>>> +               "PublicDescription": "A pipeline slot represents the >>>>> hardware resources needed to process one uOp", >>>>> +               "BriefDescription": "A pipeline slot represents the >>>>> hardware resources needed to process one uOp", >>>>> +               "MetricName": "slot" >>>> >>>> Ehhh....I'm not sure if that is a good idea. Ian or anyone else have an >>>> opinion on this? It is possible to reuse metrics, so it should work, but... >>>> >>>> One problem is that "slot" would show up as a metric, which you would >>>> not want. >>>> >>>> Alternatively I was going to suggest that you can overwrite specific std >>>> arch event attributes. So for example of frontend_bound, you could have: >>> >>> I would agree with not having this and just hard coding the 5 wherever >>> it's needed. Once we have a few different sets of metrics in place maybe >>> we can start to look at deduplication, but for now I don't see the value. >>> >>>> >>>> + b/tools/perf/pmu-events/arch/arm64/arm/neoverse-n2/metrics.json >>>> @@ -0,0 +1,30 @@ >>>> [ >>>>     { >>>>     "ArchStdEvent": "FRONTEND_BOUND", >>>>         "MetricExpr": "(stall_slot_frontend - cpu_cycles) / (5 * >>>> cpu_cycles)", >>>>     }, >>>> >>>>> +       } >>>>> +       { >>>>> +               "ArchStdEvent": "FRONTEND_BOUND" >>>>> +       }, >>>>> +       { >>>>> +               "ArchStdEvent": "BACKEND_BOUND" >>>>> +       }, >>>>> +       { >>>>> +               "ArchStdEvent": "WASTED" >>>>> +       }, >>>>> +       { >>>>> +               "ArchStdEvent": "RETIRING" >>>>> +       }, >>>>> >>>>> >>>>> 2. Should I add the topdownL1 metric to >>>>> tools/perf/pmu-event/recommended.json, >>>>> or create a new json file to place the general metric? >>>> >>>> It would not belong in recommended.json as that is specifically for >>>> arch-recommended events. It would really just depend on where the value >>>> comes from, i.e. arm arm or sbsa. >>>> >>> >>> For what we're going to publish shortly we'll be generating a >>> metrics.json file for each CPU. It will be autogenerated so I don't >>> think duplication will be an issue and I'm expecting that there will be >>> differences in the topdown metrics between CPUs anyway. So I would also >>> vote to not put it in recommended.json >>> >> >> I will create a new sbsa.json file in tools/perf/pmu-events/arch/arm64/ >> to place metrics that may be common between some CPUs, just like arch_std_event. > > Because this would apply to all CPUs rather than just N2, I still think > it's best to wait for our metrics repo to be published. Otherwise Arm > will start publishing metrics with names and group names for all future > CPUs that have different names to the common ones added as part of this > change. > > It's something that we've been working on for quite a while and we've > taken care to make sure that it applies to future products and is scalable. > > It would be easier to add these right now only for N2, and then > afterwards we can start to look at what is common and could be factored > out into the top level folder. > >> If the topdown metrics are different in other CPUs, we can overwrite the >> metric expression. > > True, but with different group names and metric names and units it could > get slightly complicated. > >> >> For example: >> >> +++ b/tools/perf/pmu-events/arch/arm64/sbsa.json >> @@ -0,0 +1,9 @@ >> +[ >> + { >> + "MetricExpr": "stall_slot_frontend / (slot * cpu_cycles)", >> + "PublicDescription": "Frontend bound L1 topdown metric", >> + "BriefDescription": "Frontend bound L1 topdown metric", >> + "MetricGroup": "TopDownL1", >> + "MetricName": "FRONTEND_BOUND" >> + } >> +] >> >> + b/tools/perf/pmu-events/arch/arm64/arm/neoverse-n2/metrics.json >> @@ -0,0 +1,30 @@ >> +[ >> + { >> + "ArchStdEvent": "FRONTEND_BOUND", >> + "MetricExpr": "(stall_slot_frontend - cpu_cycles) / (5 * cpu_cycles)", >> + } >> +] >> > > With the auto generation of metrics file I don't really see too much > benefit of doing it this way. > > You also run into the issue where if a platform happens to define all of > the events required by a metric, will that metric appear automatically, > even if it's not valid? > Ok, I agree to put the topdown metric in the n2 metric instead of arch_std_event. There is no unified formula for the topdown metric currently, and the slots of each CPU may be different. After the standard are pubulished in the future, please consider what John said, and use the general metric as arch_std_event.