Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp9053149ybl; Fri, 17 Jan 2020 05:39:32 -0800 (PST) X-Google-Smtp-Source: APXvYqx27aREPH1uzOYX5PmsBezIetTRdJOczGacdPsSungdjcxvIFZD2Wlgj24kaSRPW2E4h8pr X-Received: by 2002:aca:5490:: with SMTP id i138mr3536302oib.34.1579268372644; Fri, 17 Jan 2020 05:39:32 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1579268372; cv=none; d=google.com; s=arc-20160816; b=LU5k9e60MNiBQMnFnjmN09p66LjMy11tk+CaUc4b03Cy4q6GB3sjHfPlKqBALQv+TU SzvKIc0u6ed1+BW3G6YHu9MnMGv2p1L0Hw72OhSP+qPkCnyEMPlG4wERnfO21oLLV5sU ucuP7XaTPK3GA0zmkkf4x1MRsIYi9rAqaTqMCGTczROxZWzCaOBcqmQqIlZHodxhks/7 o0NFt29NDJIdOdVCg0emzi6CgYN9TTsOxcykJcKMCh1fzqpBHRl+Y6jZT6sjc2Q8SQ8W eO5bnmURyHKfdBcRhgDlD2+Eg4Dlpw8vMRZKn+0+EdcOE0YNqEtCxgOD1rms6DVLF2wl GaLg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from; bh=YwqINYCqMp+BaJFsq9HnCpJ8JXBsjZXkpqqI0YkWoaY=; b=FpK8YCfaWQ6ycC0XyQ2wWotRPz/BToZTqHGHfdvsVmXPGp7HOgU+YgZ7FrtwJvj0y2 uVPvuFdxmPDRgP/5ymD1e2G+O5k+ld5RmE9/RgaB+9YhN1yT3SSJVAG643Zt4DxgL98T VkoxW2PuWIpao5pWqoq6Tx930AX3fSTLpaJbrKIPVrQLvK0gONxG7Rhj7HUHUyb6skae MAUlpL2MGk/fbdIu0j8ONWiryZ33VE2OatIJV/h39rqYLkU/DSC5VeXjFlwUNlk341Bm 4F0h1eJvRlXw1BvA7ju1mDrR48epeUfqLih9yDp48kY1ofyf+clEUvNxFauad4IjPUzs 3MnA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 19si13368044oiq.128.2020.01.17.05.39.20; Fri, 17 Jan 2020 05:39:32 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729108AbgAQNiO (ORCPT + 99 others); Fri, 17 Jan 2020 08:38:14 -0500 Received: from mga02.intel.com ([134.134.136.20]:44152 "EHLO mga02.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726974AbgAQNiO (ORCPT ); Fri, 17 Jan 2020 08:38:14 -0500 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga004.jf.intel.com ([10.7.209.38]) by orsmga101.jf.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 17 Jan 2020 05:38:13 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.70,330,1574150400"; d="scan'208";a="373653222" Received: from nntpdsd52-183.inn.intel.com ([10.125.52.183]) by orsmga004.jf.intel.com with ESMTP; 17 Jan 2020 05:38:09 -0800 From: roman.sudarikov@linux.intel.com To: peterz@infradead.org, mingo@redhat.com, acme@kernel.org, mark.rutland@arm.com, alexander.shishkin@linux.intel.com, jolsa@redhat.com, namhyung@kernel.org, linux-kernel@vger.kernel.org, eranian@google.com, bgregg@netflix.com, ak@linux.intel.com, kan.liang@linux.intel.com, gregkh@linuxfoundation.org Cc: alexander.antonov@intel.com, roman.sudarikov@linux.intel.com Subject: [PATCH v4 2/2] =?UTF-8?q?perf=20x86:=20Exposing=20an=20Uncore=20u?= =?UTF-8?q?nit=20to=20PMON=20for=20Intel=20Xeon=C2=AE=20server=20platform?= Date: Fri, 17 Jan 2020 16:37:59 +0300 Message-Id: <20200117133759.5729-3-roman.sudarikov@linux.intel.com> X-Mailer: git-send-email 2.19.1 In-Reply-To: <20200117133759.5729-1-roman.sudarikov@linux.intel.com> References: <20200117133759.5729-1-roman.sudarikov@linux.intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Roman Sudarikov Current version supports a server line starting Intel® Xeon® Processor Scalable Family and introduces mapping for IIO Uncore units only. Other units can be added on demand. IIO stack to PMON mapping is exposed through: /sys/devices/uncore_iio_/mapping in the following format: domain:bus For example, on a 4-die Intel Xeon® server platform: $ cat /sys/devices/uncore_iio_0/mapping 0000:00,0000:40,0000:80,0000:c0 Which means: IIO PMON block 0 on die 0 belongs to IIO stack on bus 0x00, domain 0x0000 IIO PMON block 0 on die 1 belongs to IIO stack on bus 0x40, domain 0x0000 IIO PMON block 0 on die 2 belongs to IIO stack on bus 0x80, domain 0x0000 IIO PMON block 0 on die 3 belongs to IIO stack on bus 0xc0, domain 0x0000 Reviewed-by: Kan Liang Co-developed-by: Alexander Antonov Signed-off-by: Alexander Antonov Signed-off-by: Roman Sudarikov --- arch/x86/events/intel/uncore_snbep.c | 160 +++++++++++++++++++++++++++ 1 file changed, 160 insertions(+) diff --git a/arch/x86/events/intel/uncore_snbep.c b/arch/x86/events/intel/uncore_snbep.c index b10a5ec79e48..813009b48a0f 100644 --- a/arch/x86/events/intel/uncore_snbep.c +++ b/arch/x86/events/intel/uncore_snbep.c @@ -273,6 +273,30 @@ #define SKX_CPUNODEID 0xc0 #define SKX_GIDNIDMAP 0xd4 +/* + * The CPU_BUS_NUMBER MSR returns the values of the respective CPUBUSNO CSR + * that BIOS programmed. MSR has package scope. + * | Bit | Default | Description + * | [63] | 00h | VALID - When set, indicates the CPU bus + * numbers have been initialized. (RO) + * |[62:48]| --- | Reserved + * |[47:40]| 00h | BUS_NUM_5 — Return the bus number BIOS assigned + * CPUBUSNO(5). (RO) + * |[39:32]| 00h | BUS_NUM_4 — Return the bus number BIOS assigned + * CPUBUSNO(4). (RO) + * |[31:24]| 00h | BUS_NUM_3 — Return the bus number BIOS assigned + * CPUBUSNO(3). (RO) + * |[23:16]| 00h | BUS_NUM_2 — Return the bus number BIOS assigned + * CPUBUSNO(2). (RO) + * |[15:8] | 00h | BUS_NUM_1 — Return the bus number BIOS assigned + * CPUBUSNO(1). (RO) + * | [7:0] | 00h | BUS_NUM_0 — Return the bus number BIOS assigned + * CPUBUSNO(0). (RO) + */ +#define SKX_MSR_CPU_BUS_NUMBER 0x300 +#define SKX_MSR_CPU_BUS_VALID_BIT (1ULL << 63) +#define BUS_NUM_STRIDE 8 + /* SKX CHA */ #define SKX_CHA_MSR_PMON_BOX_FILTER_TID (0x1ffULL << 0) #define SKX_CHA_MSR_PMON_BOX_FILTER_LINK (0xfULL << 9) @@ -3580,6 +3604,9 @@ static struct intel_uncore_ops skx_uncore_iio_ops = { .read_counter = uncore_msr_read_counter, }; +static int skx_iio_get_topology(struct intel_uncore_type *type, int max_dies); +static int skx_iio_set_mapping(struct intel_uncore_type *type, int max_dies); + static struct intel_uncore_type skx_uncore_iio = { .name = "iio", .num_counters = 4, @@ -3594,6 +3621,8 @@ static struct intel_uncore_type skx_uncore_iio = { .constraints = skx_uncore_iio_constraints, .ops = &skx_uncore_iio_ops, .format_group = &skx_uncore_iio_format_group, + .get_topology = skx_iio_get_topology, + .set_mapping = skx_iio_set_mapping, }; enum perf_uncore_iio_freerunning_type_id { @@ -3780,6 +3809,137 @@ static int skx_count_chabox(void) return hweight32(val); } +static inline int skx_msr_cpu_bus_read(int cpu, u64 *topology) +{ + u64 msr_value; + + if (rdmsrl_on_cpu(cpu, SKX_MSR_CPU_BUS_NUMBER, &msr_value) || + !(msr_value & SKX_MSR_CPU_BUS_VALID_BIT)) + return -ENXIO; + + *topology = msr_value; + + return 0; +} + +static int skx_iio_get_topology(struct intel_uncore_type *type, int max_dies) +{ + int ret, cpu, die, current_die; + struct pci_bus *bus = NULL; + + /* + * Verified single-segment environments only; disabled for multiple + * segment topologies for now. + */ + while ((bus = pci_find_next_bus(bus)) && !pci_domain_nr(bus)) + ; + if (bus) + return -EPERM; + + type->topology = kcalloc(max_dies, sizeof(u64), GFP_KERNEL); + if (!type->topology) + return -ENOMEM; + + /* + * Using cpus_read_lock() to ensure cpu is not going down between + * looking at cpu_online_mask. + */ + cpus_read_lock(); + /* Invalid value to start loop.*/ + current_die = -1; + for_each_online_cpu(cpu) { + die = topology_logical_die_id(cpu); + if (current_die == die) + continue; + ret = skx_msr_cpu_bus_read(cpu, &type->topology[die]); + if (ret) { + kfree(type->topology); + break; + } + current_die = die; + } + cpus_read_unlock(); + + return ret; +} + +static inline u8 skx_iio_stack_bus(struct intel_uncore_pmu *pmu, int die) +{ + return pmu->type->topology[die] >> (pmu->pmu_idx * BUS_NUM_STRIDE); +} + +static int skx_iio_set_box_mapping(struct intel_uncore_pmu *pmu, int max_dies) +{ + char *buf; + int die = 0; + /* Length of template "%04x:%02x," without null character. */ + const int template_len = 8; + + /* + * Root bus 0x00 is valid only for die 0 AND pmu_idx = 0. + * Set "0" platform mapping for PMUs which have zero stack bus and + * non-zero index. + */ + if (!skx_iio_stack_bus(pmu, die) && pmu->pmu_idx) { + pmu->mapping = kzalloc(2, GFP_KERNEL); + if (!pmu->mapping) + return -ENOMEM; + sprintf(pmu->mapping, "0"); + return 0; + } + + pmu->mapping = kzalloc(max_dies * template_len + 1, GFP_KERNEL); + if (!pmu->mapping) + return -ENOMEM; + + buf = pmu->mapping; + for (; die < max_dies; die++) { + buf += snprintf(buf, template_len + 1, "%04x:%02x,", 0, + skx_iio_stack_bus(pmu, die)); + } + + *(--buf) = '\0'; + + return 0; +} + +static int skx_iio_set_mapping(struct intel_uncore_type *type, int max_dies) +{ + /* + * Each IIO stack (PCIe root port) has its own IIO PMON block, so each + * "mapping" holds bus number(s) of PCIe root port(s), which can + * be monitored by that IIO PMON block. + * + * For example, on 4-die Xeon platform with up to 6 IIO stacks per die + * and, therefore, 6 IIO PMON blocks per die, the "mapping" + * of IIO PMON block 0 holds "0000:00,0000:40,0000:80,0000:c0": + * + * $ cat /sys/devices/uncore_iio_0/mapping + * 0000:00,0000:40,0000:80,0000:c0 + * + * Which means: + * IIO PMON 0 on die 0 belongs to PCIe RP on bus 0x00, domain 0x0000 + * IIO PMON 0 on die 1 belongs to PCIe RP on bus 0x40, domain 0x0000 + * IIO PMON 0 on die 2 belongs to PCIe RP on bus 0x80, domain 0x0000 + * IIO PMON 0 on die 3 belongs to PCIe RP on bus 0xc0, domain 0x0000 + */ + + int ret; + struct intel_uncore_pmu *pmu = type->pmus; + + for (; pmu - type->pmus < type->num_boxes; pmu++) { + ret = skx_iio_set_box_mapping(pmu, max_dies); + if (ret) { + for (; pmu->pmu_idx > 0; --pmu) + kfree(pmu->mapping); + break; + } + } + + kfree(type->topology); + return ret; +} + void skx_uncore_cpu_init(void) { skx_uncore_chabox.num_boxes = skx_count_chabox(); -- 2.19.1