Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp21847693ybl; Mon, 6 Jan 2020 12:31:11 -0800 (PST) X-Google-Smtp-Source: APXvYqxEjSzmVHuSpIh++3bBbm3tY0y/kSSWJ+gG0hUbylG+ucfjLW5Du8gpH07dVB/cgGs/8d8R X-Received: by 2002:a9d:7ad9:: with SMTP id m25mr111030540otn.13.1578342671220; Mon, 06 Jan 2020 12:31:11 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1578342671; cv=none; d=google.com; s=arc-20160816; b=x3bafdf+l9G+Vy3av/oKBLPrJ6DGnZJ3CNQmTHcnu7QLNUNzl+f9JHBCLkehGFAHm6 5sL294bcxaI7OHzeP7yQBxSW3b42+FoidMXTQloTGO3D+cpu980DBGpCkLB12oNsRHBf g7xs7l3jzsyQXQBgUuIF7omku9ZoK6E+2NMM03pP5oN4Uu13in8Yfro6wLz6vgrG9cpq kvXJHOuEpiq1ZZ/crAClZBAraiRLuoI1B219rimrdq2sCKp8qcVR1Ui1eWyCBH4uGDV+ akKWafcjqJ92GsBPHcicwxBHN7KiRaPcf294glfpK/6kvB9IWlfu7e4XlfUpn47Mfmx0 IdSQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:date:subject:cc:to:from; bh=oTemC+FMCa9xn4tpzVy2uolItEhWy6Of+O0G6ycMK24=; b=vLZmjLlc7sgj6VB2Hc2tB7mIj8u36S0fEyF64j2ybvXkXo9Ola3/AaUqth5VGU4Sn/ tphH5f+Q++EvyXwMFjoGO7hcTPC1ezHtzxb1TnpTvVd3VHyMueREY1ewTsGX2fHpTgss Jc86wsyHlGMEhBM0I84asRZQFx608losMEepzVwEX1k0k892AttX3jX9B+8vMzwHqSsw rFjFVtpaQIe/aBDvLnhBgi2tYUx4+VfTgr5YNlOD7XVEhWJvDD8Dx3tLKuyFNtNC2p8p byC6FYDM9gpP+9qjpUmiujmKWS4xIKy3iaVdm3BV69CsYPiJKVfNu3s3YYB3aiNaPZo6 qZtA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id q2si33870438otn.220.2020.01.06.12.30.58; Mon, 06 Jan 2020 12:31:11 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726893AbgAFUaR (ORCPT + 99 others); Mon, 6 Jan 2020 15:30:17 -0500 Received: from mga03.intel.com ([134.134.136.65]:10708 "EHLO mga03.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726739AbgAFUaQ (ORCPT ); Mon, 6 Jan 2020 15:30:16 -0500 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from fmsmga004.fm.intel.com ([10.253.24.48]) by orsmga103.jf.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 06 Jan 2020 12:30:16 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.69,403,1571727600"; d="scan'208";a="245699382" Received: from labuser-ice-lake-client-platform.jf.intel.com ([10.54.55.50]) by fmsmga004.fm.intel.com with ESMTP; 06 Jan 2020 12:30:15 -0800 From: kan.liang@linux.intel.com To: peterz@infradead.org, acme@redhat.com, mingo@kernel.org, linux-kernel@vger.kernel.org Cc: tglx@linutronix.de, jolsa@kernel.org, eranian@google.com, alexander.shishkin@linux.intel.com, ak@linux.intel.com, Kan Liang Subject: [PATCH V5 RESEND 00/14] TopDown metrics support for Icelake Date: Mon, 6 Jan 2020 12:29:05 -0800 Message-Id: <20200106202919.2943-1-kan.liang@linux.intel.com> X-Mailer: git-send-email 2.17.1 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Kan Liang Icelake has support for measuring the level 1 TopDown metrics directly in hardware. This is implemented by an additional METRICS register, and a new Fixed Counter 3 that measures pipeline SLOTS. New in Icelake - Do not require generic counters. This allows to collect TopDown always in addition to other events. - Measuring TopDown per thread/process instead of only per core For the Ice Lake implementation of performance metrics, the values in PERF_METRICS MSR are derived from fixed counter 3. Software should start both registers, PERF_METRICS and fixed counter 3, from zero. Additionally, software is recommended to periodically clear both registers in order to maintain accurate measurements. The latter is required for certain scenarios that involve sampling metrics at high rates. Software should always write fixed counter 3 before write to PERF_METRICS. IA32_PERF_GLOBAL_STATUS. OVF_PERF_METRICS[48]: If this bit is set, it indicates that some PERF_METRICS-related counter has overflowed and a PMI is triggered. Software has to synchronize, e.g. re-start, PERF_METRICS as well as fixed counter 3. Otherwise, PERF_METRICS may return invalid values. Limitation - To get accurate result and avoid reading the METRICS register multiple times, the TopDown metrics events and SLOTS event have to be in the same group. - METRICS and SLOTS registers have to be cleared after each read by SW. That is to prevent the lose of precision. - Cannot do sampling read SLOTS and TopDown metric events Please refer SDM Vol3, 18.3.9.3 Performance Metrics for the details of TopDown metrics. Changes since V4: - Add description regarding to event-code naming for fixed counters - Fix add_nr_metric_event(). For leader event, we have to take the accepted metrics events into account. For sibling event, it doesn't need to count accepted metrics events again. - Remove is_first_topdown_event_in_group(). Force slots in topdown group. Only update topdown events with slots event. - Re-use last_period and period_left for saved_metric and saved_slots. Changes since V3: - Separate fixed counter3 definition patch - Separate BTS index patch - Apply Peter's cleanup patch - Fix the name of perf capabilities for perf METRICS - Apply patch for mul_u64_u32_div() x86_64 implementation - Fix unconditionally allows collecting 4 extra events - Add patch to clean up NMI handler by naming global status bit - Add patch to reuse event_base_rdpmc for RDPMC userspace support Changes since V2: - Rebase on top of v5.3-rc1 Key changes since V1: - Remove variables for reg_idx and enabled_events[] array. The reg_idx can be calculated by idx in runtime. Using existing active_mask to replace enabled_events. - Choose value 47 for the fixed index of BTS. - Support OVF_PERF_METRICS overflow bit in PMI handler - Drops the caching mechanism and related variables New mechanism is to update all active slots/metrics events for the first slots/metrics events in a group. For each group reading, it still only read the slots/perf_metrics MSR once - Disable PMU for read of topdown events to avoid the NMI issue - Move RDPMC support to a separate patch - Using event=0x00,umask=0x1X for topdown metrics events - Drop the patch which add REMOVE transaction We can indicate x86_pmu_stop() by checking (event && !test_bit(event->hw.idx, cpuc->active_mask)), which is a good place to save the slots/metrics MSR value Andi Kleen (2): perf, tools, stat: Support new per thread TopDown metrics perf, tools: Add documentation for topdown metrics Kan Liang (12): perf/x86/intel: Introduce the fourth fixed counter perf/x86/intel: Set correct mask for TOPDOWN.SLOTS perf/x86/intel: Move BTS index to 47 perf/x86/intel: Basic support for metrics counters perf/x86/intel: Fix the name of perf capabilities for perf METRICS perf/x86/intel: Support hardware TopDown metrics perf/x86/intel: Support per thread RDPMC TopDown metrics perf/x86/intel: Export TopDown events for Icelake perf/x86/intel: Disable sampling read slots and topdown perf/x86/intel: Name global status bit in NMI handler perf/x86: Use event_base_rdpmc for RDPMC userspace support perf, tools, stat: Check Topdown Metric group arch/x86/events/core.c | 86 +++++- arch/x86/events/intel/core.c | 399 ++++++++++++++++++++++--- arch/x86/events/perf_event.h | 57 +++- arch/x86/include/asm/msr-index.h | 3 + arch/x86/include/asm/perf_event.h | 60 +++- include/linux/perf_event.h | 29 +- tools/perf/Documentation/perf-stat.txt | 9 +- tools/perf/Documentation/topdown.txt | 235 +++++++++++++++ tools/perf/builtin-stat.c | 97 ++++++ tools/perf/util/stat-shadow.c | 89 ++++++ tools/perf/util/stat.c | 4 + tools/perf/util/stat.h | 8 + 12 files changed, 1007 insertions(+), 69 deletions(-) create mode 100644 tools/perf/Documentation/topdown.txt -- 2.17.1