Received: by 2002:a05:6a10:206:0:0:0:0 with SMTP id 6csp1007895pxj; Wed, 2 Jun 2021 18:00:44 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzTjS4t/8aZV7k/Zqisw8TyulrTlpIdAqGThwHyDeNys9jkNpdzAx9l1ueGG0XxSpAR2ZIa X-Received: by 2002:a17:906:1dd1:: with SMTP id v17mr36654015ejh.31.1622682044530; Wed, 02 Jun 2021 18:00:44 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1622682044; cv=none; d=google.com; s=arc-20160816; b=EtMnR2n1/mfpbturlM8RmRpAxRNzz/dSb7srWa28rEc0PwLHnofkLKpof7I4sGJ2fb M7PaO2/vFEH0EvXPl1HPJKXFGkkZ7dIt/KSHcN13yHLf5KgAwGU5X3ZS6do79EXVNgf6 6DdrpZL6CvW0tIH1b1EsF9Pycag6keRd4rwXOfQbVly0A/kJrRXNDWXdgEqdj5+5/iz4 CHNg91hibhlT2UfnEzFMhXXQHuKDrlIaMMPE4HiJxA5fpbrOxLDnidfSCI6OTPlYxmdB FiXqRBimCwXd/JP0JEmUsScppbBE3yW4DOGqxrdToP6VkD4bFolzAhlicL4EsuxAbPUj fmVw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject:ironport-sdr:ironport-sdr; bh=O0cc09/XVjGuUV1NGNip2B0QIhbvDnENcTP+62tGxKg=; b=JMJFiWlVImgV0uG4Fr+OAwDco6W+39lzL1+CmCkSviHCrZlSwkvNwkNI4NJnSrAuCv igIPvYenQRQOOJc97LfwTrlq9nnFnd9SzB+BBZKTWZ0dlIEJ6ypLXxZZiOrC0dyPHLb9 j9MjbmIu9SfmE0ARl9e32zgo3H+AFd1mzNPYEB06lbUqk93iPlj88jvSX/WMld30F9Qw 1nzmg7O3MpdgujWFQS3CK0Tk9zlTZ5mnJuxI+3etj7CuyVRt/isHdx24FhPSVxDC05tj mgNpYoYNaim0w7024kRn33WpwkIhT3PsANibslKbxEouv9C9zqhTjKg34awl9HjZbG2P zqbg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id eb13si1257229edb.312.2021.06.02.18.00.21; Wed, 02 Jun 2021 18:00:44 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229839AbhFCA6F (ORCPT + 99 others); Wed, 2 Jun 2021 20:58:05 -0400 Received: from mga11.intel.com ([192.55.52.93]:55983 "EHLO mga11.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229831AbhFCA6F (ORCPT ); Wed, 2 Jun 2021 20:58:05 -0400 IronPort-SDR: 6OKl+/H85NV0bS+fPBjIBB2evj1QnYGKGCY82EpOCB8bKsGkgxNBUHmBkhj+QLAp6blZQTT125 iTHsRMEEo1fg== X-IronPort-AV: E=McAfee;i="6200,9189,10003"; a="200920919" X-IronPort-AV: E=Sophos;i="5.83,244,1616482800"; d="scan'208";a="200920919" Received: from orsmga001.jf.intel.com ([10.7.209.18]) by fmsmga102.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 02 Jun 2021 17:56:21 -0700 IronPort-SDR: 1az+SYywtqQi5jVrJsPeQe8TLNl5OJBBpNVWmIhDYfn0N7rkZiaDP4QhNjHRUZe1zGaIa8X9WQ kI/AjSE8nsgA== X-IronPort-AV: E=Sophos;i="5.83,244,1616482800"; d="scan'208";a="479949442" Received: from yjin15-mobl1.ccr.corp.intel.com (HELO [10.238.4.82]) ([10.238.4.82]) by orsmga001-auth.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 02 Jun 2021 17:56:18 -0700 Subject: Re: [PATCH 1/4] perf vendor events: Add core event list for Icelake Server To: Arnaldo Carvalho de Melo Cc: Ian Rogers , Jiri Olsa , Peter Zijlstra , Ingo Molnar , Alexander Shishkin , LKML , Andi Kleen , Kan Liang , "Jin, Yao" References: <20210510012438.6293-1-yao.jin@linux.intel.com> <20210510012438.6293-2-yao.jin@linux.intel.com> <99bc4c45-d67f-d53f-7ca7-1e770ef585a6@linux.intel.com> From: "Jin, Yao" Message-ID: <9079d1a6-2e88-2d25-20db-9fec3e0e1bf0@linux.intel.com> Date: Thu, 3 Jun 2021 08:56:16 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.10.2 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Arnaldo, On 6/2/2021 11:12 PM, Arnaldo Carvalho de Melo wrote: > Em Wed, Jun 02, 2021 at 09:55:49PM +0800, Jin, Yao escreveu: >> Hi Arnaldo, >> >> On 6/2/2021 7:26 PM, Arnaldo Carvalho de Melo wrote: >>> Em Tue, May 25, 2021 at 09:42:55AM -0300, Arnaldo Carvalho de Melo escreveu: >>>> Em Mon, May 24, 2021 at 09:08:12AM +0800, Jin, Yao escreveu: >>>>> Could you pull the top 4 patches from "https://github.com/yaoj/icx-events.git"? >>> >>>>> perf vendor events: Update event list for Icelake Client >>>>> perf vendor events: Add metrics for Icelake Server >>>>> perf vendor events: Add uncore event list for Icelake Server >>> >>>>> The patch is too big and it's possibly corrupted by mailing system. >>>> Thanks, applied. >>> >>> So, this is failing 'perf test 10', see details below, please run 'perf >>> test' before pushing patches upstream. >>> >>> Triple checking: >>> >>> ⬢[acme@toolbox perf]$ git cherry-pick 8f74f0f4dbf6361f0a5d21c5da260fbbf7597286 >>> Removing tools/perf/pmu-events/arch/x86/icelakex/icx-metrics.json >>> [perf/core 6971d24f4d04ccfa] Revert "perf vendor events intel: Add metrics for Icelake Server" >>> Date: Wed Jun 2 08:16:20 2021 -0300 >>> 1 file changed, 327 deletions(-) >>> delete mode 100644 tools/perf/pmu-events/arch/x86/icelakex/icx-metrics.json >>> ⬢[acme@toolbox perf]$ git log --oneline -1 >>> 6971d24f4d04ccfa (HEAD -> perf/core) Revert "perf vendor events intel: Add metrics for Icelake Server" >>> ⬢[acme@toolbox perf]$ (rm -rf /tmp/build/perf ; mkdir -p /tmp/build/perf ; make -k CORESIGHT=1 BUILD_BPF_SKEL=1 PYTHON=python3 O=/tmp/build/perf -C tools/perf install-bin) > /dev/null 2>&1 ; perf test 10 >>> 10: PMU events : >>> 10.1: PMU event table sanity : Ok >>> 10.2: PMU event map aliases : Ok >>> 10.3: Parsing of PMU event table metrics : Ok >>> 10.4: Parsing of PMU event table metrics with fake PMUs : Ok >>> ⬢[acme@toolbox perf]$ git reset --hard HEAD~ >>> HEAD is now at 0ab8009b3e8dd6ba Merge remote-tracking branch 'torvalds/master' into perf/core >>> ⬢[acme@toolbox perf]$ (rm -rf /tmp/build/perf ; mkdir -p /tmp/build/perf ; make -k CORESIGHT=1 BUILD_BPF_SKEL=1 PYTHON=python3 O=/tmp/build/perf -C tools/perf install-bin) > /dev/null 2>&1 ; perf test 10 >>> 10: PMU events : >>> 10.1: PMU event table sanity : Ok >>> 10.2: PMU event map aliases : Ok >>> 10.3: Parsing of PMU event table metrics : Ok >>> 10.4: Parsing of PMU event table metrics with fake PMUs : FAILED! >>> ⬢[acme@toolbox perf]$ >>> >>> - Arnaldo >>> >>> ⬢[acme@toolbox perf]$ git bisect bad >>> d89bf9cab1f613e4496f929d89477b2baaad1ea9 is the first bad commit >>> commit d89bf9cab1f613e4496f929d89477b2baaad1ea9 >>> Author: Jin Yao >>> Date: Sat May 8 13:06:20 2021 +0800 >>> >>> perf vendor events intel: Add metrics for Icelake Server >>> >>> Add JSON metrics for Icelake Server to perf. >>> >>> Based on TMA metrics 4.21 at 01.org.: >>> >>> https://download.01.org/perfmon/ >>> >>> Signed-off-by: Jin Yao >>> Reviewed-by: Andi Kleen >>> Acked-by: Ian Rogers >>> Cc: Alexander Shishkin >>> Cc: Andi Kleen >>> Cc: Ingo Molnar >>> Cc: Jiri Olsa >>> Cc: Kan Liang >>> Cc: Peter Zijlstra >>> Link: http://lore.kernel.org/lkml/c0f27643-bebb-2912-56ed-f7abec7dbde3@linux.intel.com >>> Signed-off-by: Arnaldo Carvalho de Melo >>> >>> .../pmu-events/arch/x86/icelakex/icx-metrics.json | 327 +++++++++++++++++++++ >>> 1 file changed, 327 insertions(+) >>> create mode 100644 tools/perf/pmu-events/arch/x86/icelakex/icx-metrics.json >>> ⬢[acme@toolbox perf]$ >>> >>> >>> ⬢[acme@toolbox perf]$ perf test -v 10 |& tail -40 >>> parsing 'inst_retired.any / cpu_clk_unhalted.distributed' >>> parsing '( 1 * ( fp_arith_inst_retired.scalar_single + fp_arith_inst_retired.scalar_double ) + 2 * fp_arith_inst_retired.128b_packed_double + 4 * ( fp_arith_inst_retired.128b_packed_single + fp_arith_inst_retired.256b_packed_double ) + 8 * ( fp_arith_inst_retired.256b_packed_single + fp_arith_inst_retired.512b_packed_double ) + 16 * fp_arith_inst_retired.512b_packed_single ) / cpu_clk_unhalted.distributed' >>> parsing 'uops_executed.thread / ( uops_executed.core_cycles_ge_1 / 2 )' >>> parsing 'cpu_clk_unhalted.distributed' >>> parsing 'inst_retired.any / mem_inst_retired.all_loads' >>> parsing 'inst_retired.any / mem_inst_retired.all_stores' >>> parsing 'inst_retired.any / br_inst_retired.all_branches' >>> parsing 'inst_retired.any / br_inst_retired.near_call' >>> parsing 'br_inst_retired.all_branches / br_inst_retired.near_taken' >>> parsing 'inst_retired.any / ( 1 * ( fp_arith_inst_retired.scalar_single + fp_arith_inst_retired.scalar_double ) + 2 * fp_arith_inst_retired.128b_packed_double + 4 * ( fp_arith_inst_retired.128b_packed_single + fp_arith_inst_retired.256b_packed_double ) + 8 * ( fp_arith_inst_retired.256b_packed_single + fp_arith_inst_retired.512b_packed_double ) + 16 * fp_arith_inst_retired.512b_packed_single )' >>> parsing 'inst_retired.any' >>> parsing 'lsd.uops / (idq.dsb_uops + lsd.uops + idq.mite_uops + idq.ms_uops)' >>> parsing 'idq.dsb_uops / (idq.dsb_uops + lsd.uops + idq.mite_uops + idq.ms_uops)' >>> parsing 'l1d_pend_miss.pending / ( mem_load_retired.l1_miss + mem_load_retired.fb_hit )' >>> parsing 'l1d_pend_miss.pending / l1d_pend_miss.pending_cycles' >>> parsing '( itlb_misses.walk_pending + dtlb_load_misses.walk_pending + dtlb_store_misses.walk_pending ) / ( 2 * cpu_clk_unhalted.distributed )' >>> parsing '64 * l1d.replacement / 1000000000 / duration_time' >>> parsing '64 * l2_lines_in.all / 1000000000 / duration_time' >>> parsing '64 * longest_lat_cache.miss / 1000000000 / duration_time' >>> parsing '64 * offcore_requests.all_requests / 1000000000 / duration_time' >>> parsing '1000 * mem_load_retired.l1_miss / inst_retired.any' >>> parsing '1000 * mem_load_retired.l2_miss / inst_retired.any' >>> parsing '1000 * ( ( offcore_requests.all_data_rd - offcore_requests.demand_data_rd ) + l2_rqsts.all_demand_miss + l2_rqsts.swpf_miss ) / inst_retired.any' >>> parsing '1000 * mem_load_retired.l3_miss / inst_retired.any' >>> parsing '1000 * l2_lines_out.silent / inst_retired.any' >>> parsing '1000 * l2_lines_out.non_silent / inst_retired.any' >>> parsing 'cpu_clk_unhalted.ref_tsc / msr@tsc@' >>> parsing '(cpu_clk_unhalted.thread / cpu_clk_unhalted.ref_tsc) * msr@tsc@ / 1000000000 / duration_time' >>> parsing '( ( 1 * ( fp_arith_inst_retired.scalar_single + fp_arith_inst_retired.scalar_double ) + 2 * fp_arith_inst_retired.128b_packed_double + 4 * ( fp_arith_inst_retired.128b_packed_single + fp_arith_inst_retired.256b_packed_double ) + 8 * ( fp_arith_inst_retired.256b_packed_single + fp_arith_inst_retired.512b_packed_double ) + 16 * fp_arith_inst_retired.512b_packed_single ) / 1000000000 ) / duration_time' >>> parsing 'cpu_clk_unhalted.thread / cpu_clk_unhalted.ref_tsc' >>> parsing '1 - cpu_clk_unhalted.one_thread_active / cpu_clk_unhalted.ref_distributed' >>> parsing 'cpu_clk_unhalted.thread:k / cpu_clk_unhalted.thread' >>> parsing '( 64 * ( uncore_imc@cas_count_read@ + uncore_imc@cas_count_write@ ) / 1000000000 ) / duration_time' >>> parsing '1000000000 * ( cha@event\=0x36\,umask\=0x21\,config\=0x40433@ / cha@event\=0x35\,umask\=0x21\,config\=0x40433@ ) / ( cha_0@event\=0x0@ / duration_time )' >>> parsing 'cha@event\=0x36\,umask\=0x21\,config\=0x40433@ / cha@event\=0x36\,umask\=0x21\,config\=0x40433\,thresh\=1@' >>> parsing '( 1000000000 * ( cha@event\=0x36\,umask\=0x21\,config\=0x40433@_pmm / cha@event\=0x35\,umask\=0x21\,config\=0x40433@_pmm ) / cha_0@event\=0x0@ )' >>> check_parse_fake failed >>> test child finished with -1 >>> ---- end ---- >>> PMU events subtest 4: FAILED! >>> ⬢[acme@toolbox perf]$ >>> >> >> Very sorry about the "Parsing of PMU event table metrics with fake PMUs" >> failure! I will resubmit the patch also with other c-state metrics update. > > So have you figure out what was wrong from the verbose output above? > > - Arnaldo > Yes, thanks Arnaldo! The issue was at 'config\=0x40433@_pmm' in MetricExpr of "MEM_PMM_Read_Latency". Now I'm thinking to use more direct MetricExpr for "MEM_PMM_Read_Latency", such as. { "MetricExpr": "( 1000000000 * ( UNC_CHA_TOR_OCCUPANCY.IA_MISS_DRD_PMM / UNC_CHA_TOR_INSERTS.IA_MISS_DRD_PMM ) / cha_0@event\\=0x0@ )", "BriefDescription": "Average latency of data read request to external 3D X-Point memory [in nanoseconds]. Accounts for demand loads and L1/L2 data-read prefetches", "MetricGroup": "MemoryLat;SoC;Server", "MetricName": "MEM_PMM_Read_Latency" }, Now the test is passed. Thanks Jin Yao