Received: by 2002:a05:7412:419a:b0:f3:1519:9f41 with SMTP id i26csp86931rdh; Wed, 22 Nov 2023 19:42:58 -0800 (PST) X-Google-Smtp-Source: AGHT+IH8vKxKVpnMiXP4FGEt67dtXDGOaFxtZw8j0vDVlMFy1ylFSweBQNmNBjC1C3b+nXLm25K3 X-Received: by 2002:a05:6358:63a2:b0:16d:d0e4:acbd with SMTP id k34-20020a05635863a200b0016dd0e4acbdmr3534779rwh.32.1700710978593; Wed, 22 Nov 2023 19:42:58 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1700710978; cv=none; d=google.com; s=arc-20160816; b=oIdDVRlL44H3sxsiVE7boPjb3n7FikxNaZnKPguUO4QQkiUVT/R0Opi6HEuLxkeI2W BhheGthmL7V90dJM8uAsNL/MpB6dW0PJ9XBaoNGZ0wt4DeaclsJ/0NpDhC+d28k9V7gk fVvD2v3uDkWJJ9Y9l4MrA3arMxUK9w/HXICIZQezhB+2pmvpuG2pFpaUIAEO2yi5Mg7U FntphI4heT2lOlVUDOyJJOY9Gemd/Yh2NFK3DlS4iRyMUc5FLrmjjEg77DjrHhNS8ean Bm8ntL9ERs/UhKop02hopMEVJ0a1ZNLbKyhhHWmjG7wOHQSaQzd9FvsNs4O/1fGfcmuo MR4A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id; bh=rQzmsQ+7f42xVtfyY9du+CsQcDWtVvBFX3penvpQgiY=; fh=nOqhHfnJd6kSgHfO8J5Q8P/GaGiVNrZWuBtWbv+aPnM=; b=yH5hweRAFP9FfPbzmYqiDB5vIpEFVn5pXc8F23Db7iZ6kWMbmp4QSPOMEjruKeFRA5 mhCOiMxW2ODax6J9wKxfFWsACdXOQXhQfVj38s7HdtLz3bvzEZ8WVhlJ/YTqm6haWgZJ gmX8oxl8U7JCX198a/L7XGDLW+LpnvG/Itl3nAIApccSWcXmFMcFU0KajAPyBCQGyX1U czEs/lzVRM5r832hsE5tL1HBBiK5IWkLykH1dP1kzHunFdy0sdiJCigmKVnfsA82E2mZ r1R4Xs7Tm/J4U8V49VGkVd335z5nuQHrIqtYBL+KjcokjVraVC+rLxV3LWmwaDIO3+sQ TyXw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.38 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from fry.vger.email (fry.vger.email. [23.128.96.38]) by mx.google.com with ESMTPS id z3-20020a63e103000000b005b106cd44casi491860pgh.145.2023.11.22.19.42.58 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 22 Nov 2023 19:42:58 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.38 as permitted sender) client-ip=23.128.96.38; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.38 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by fry.vger.email (Postfix) with ESMTP id 29D9D837FA1D; Wed, 22 Nov 2023 19:42:55 -0800 (PST) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.11 at fry.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232840AbjKWDmU (ORCPT + 99 others); Wed, 22 Nov 2023 22:42:20 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:34554 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232584AbjKWDmT (ORCPT ); Wed, 22 Nov 2023 22:42:19 -0500 Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 0FCE61AB; Wed, 22 Nov 2023 19:42:25 -0800 (PST) Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id EA6E01042; Wed, 22 Nov 2023 19:43:10 -0800 (PST) Received: from [10.163.39.193] (unknown [10.163.39.193]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 386953F73F; Wed, 22 Nov 2023 19:42:20 -0800 (PST) Message-ID: <54142157-5547-4b65-8168-46e68361045c@arm.com> Date: Thu, 23 Nov 2023 09:12:18 +0530 MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH v5 2/3] arm64: perf: Add support for event counting threshold Content-Language: en-US To: James Clark , linux-arm-kernel@lists.infradead.org, linux-perf-users@vger.kernel.org, suzuki.poulose@arm.com, will@kernel.org, mark.rutland@arm.com Cc: Catalin Marinas , Jonathan Corbet , linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org References: <20231113112507.917107-1-james.clark@arm.com> <20231113112507.917107-3-james.clark@arm.com> From: Anshuman Khandual In-Reply-To: <20231113112507.917107-3-james.clark@arm.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-0.8 required=5.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on fry.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (fry.vger.email [0.0.0.0]); Wed, 22 Nov 2023 19:42:55 -0800 (PST) On 11/13/23 16:55, James Clark wrote: > FEAT_PMUv3_TH (Armv8.8) permits a PMU counter to increment only on > events whose count meets a specified threshold condition. For example if > PMEVTYPERn.TC (Threshold Control) is set to 0b101 (Greater than or > equal, count), and the threshold is set to 2, then the PMU counter will > now only increment by 1 when an event would have previously incremented > the PMU counter by 2 or more on a single processor cycle. > > Three new Perf event config fields, 'threshold', 'threshold_compare' and > 'threshold_count' have been added to control the feature. > threshold_compare maps to the upper two bits of PMEVTYPERn.TC and > threshold_count maps to the first bit of TC. These separate attributes > have been picked rather than enumerating all the possible combinations > of the TC field as in the Arm ARM. The attributes would be used on a > Perf command line like this: > > $ perf stat -e stall_slot/threshold=2,threshold_compare=2/ If threshold_count = 0, then threshold and threshold_compare should just be ignored ? > > A new capability for reading out the maximum supported threshold value > has also been added: > > $ cat /sys/bus/event_source/devices/armv8_pmuv3/caps/threshold_max > > 0x000000ff Makes sense. > > If a threshold higher than threshold_max is provided, then no error is > generated but the threshold is clamped to the max value. If > FEAT_PMUv3_TH isn't implemented or a 32 bit kernel is running, then > threshold_max reads zero, and neither the 'threshold' nor > 'threshold_control' parameters will be used. > > The threshold is per PMU counter, and there are potentially different > threshold_max values per PMU type on heterogeneous systems. > > Bits higher than 32 now need to be written into PMEVTYPER, so > armv8pmu_write_evtype() has to be updated to take an unsigned long value > rather than u32 which gives the correct behavior on both aarch32 and 64. Makes sense. > > Signed-off-by: James Clark > --- > drivers/perf/arm_pmuv3.c | 79 +++++++++++++++++++++++++++++++++- > include/linux/perf/arm_pmuv3.h | 1 + > 2 files changed, 79 insertions(+), 1 deletion(-) > > diff --git a/drivers/perf/arm_pmuv3.c b/drivers/perf/arm_pmuv3.c > index 1d40d794f5e4..694d914ffc08 100644 > --- a/drivers/perf/arm_pmuv3.c > +++ b/drivers/perf/arm_pmuv3.c > @@ -15,6 +15,7 @@ > #include > > #include > +#include > #include > #include > #include > @@ -294,9 +295,18 @@ static const struct attribute_group armv8_pmuv3_events_attr_group = { > .is_visible = armv8pmu_event_attr_is_visible, > }; > > +#define TH_LO 2 > +#define TH_HI 13 > +#define TH_CNT 14 > +#define TH_CMP_LO 15 > +#define TH_CMP_HI 16 TH_ prefix sounds too cryptic. I had suggested some clean up earlier https://lore.kernel.org/all/20231009051753.179355-1-anshuman.khandual@arm.com/ But for now, something like the following might be bit better instead ? THRESHOLD_LOW THRESHOLD_HIGH THRESHOLD_CNT THRESHOLD_CMP_LOW THRESHOLD_CMP_HIGH > + > PMU_FORMAT_ATTR(event, "config:0-15"); > PMU_FORMAT_ATTR(long, "config1:0"); > PMU_FORMAT_ATTR(rdpmc, "config1:1"); > +PMU_FORMAT_ATTR(threshold, "config1:" __stringify(TH_LO) "-" __stringify(TH_HI)); > +PMU_FORMAT_ATTR(threshold_compare, "config1:" __stringify(TH_CMP_LO) "-" __stringify(TH_CMP_HI)); > +PMU_FORMAT_ATTR(threshold_count, "config1:" __stringify(TH_CNT)); > > static int sysctl_perf_user_access __read_mostly; > > @@ -310,10 +320,32 @@ static inline bool armv8pmu_event_want_user_access(struct perf_event *event) > return event->attr.config1 & 0x2; > } > > +static inline u32 armv8pmu_event_threshold(struct perf_event_attr *attr) > +{ > + return FIELD_GET(GENMASK(TH_HI, TH_LO), attr->config1); > +} > + > +static inline u8 armv8pmu_event_threshold_control(struct perf_event_attr *attr) > +{ > + u8 th_compare = FIELD_GET(GENMASK(TH_CMP_HI, TH_CMP_LO), attr->config1); > + u8 th_count = FIELD_GET(BIT(TH_CNT), attr->config1); > + > + /* > + * The count bit is always the bottom bit of the full control field, and > + * the comparison is the upper two bits, but it's not explicitly > + * labelled in the Arm ARM. For the Perf interface we split it into two > + * fields, so reconstruct it here. > + */ > + return (th_compare << 1) | th_count; If user provides 'th_count = 0', then all these threshold control code can be skipped as if FEAT_PMUv3_TH was never implemented ? Also what happens when threshold = 0 ? > +} > + > static struct attribute *armv8_pmuv3_format_attrs[] = { > &format_attr_event.attr, > &format_attr_long.attr, > &format_attr_rdpmc.attr, > + &format_attr_threshold.attr, > + &format_attr_threshold_compare.attr, > + &format_attr_threshold_count.attr, > NULL, > }; > > @@ -365,10 +397,38 @@ static ssize_t bus_width_show(struct device *dev, struct device_attribute *attr, > > static DEVICE_ATTR_RO(bus_width); > > +static u32 threshold_max(struct arm_pmu *cpu_pmu) > +{ > + /* > + * PMMIR.WIDTH is readable and non-zero on aarch32, but it would be PMMIR.THWIDTH ^^^^^^^^ ? > + * impossible to write the threshold in the upper 32 bits of PMEVTYPER. > + */ Could aarch32 support FEAT_PMUv3_TH ? if not, how can the PMMIR.THWIDTH value here be non-zero ? Also wondering if just a non-zero PMMIR.THWIDTH indicates the presence for FEAT_PMUv3_TH on a given ARM PMU. > + if (IS_ENABLED(CONFIG_ARM)) > + return 0; Small nit - would a negative check on CONFIG_ARM64 be better ? > + > + /* > + * The largest value that can be written to PMEVTYPER_EL0.TH is > + * (2 ^ PMMIR.THWIDTH) - 1. > + */ > + return (1 << FIELD_GET(ARMV8_PMU_THWIDTH, cpu_pmu->reg_pmmir)) - 1; Makes sense to return the adjusted value. > +} > + > +static ssize_t threshold_max_show(struct device *dev, > + struct device_attribute *attr, char *page) > +{ > + struct pmu *pmu = dev_get_drvdata(dev); > + struct arm_pmu *cpu_pmu = container_of(pmu, struct arm_pmu, pmu); > + > + return sysfs_emit(page, "0x%08x\n", threshold_max(cpu_pmu)); > +} > + > +static DEVICE_ATTR_RO(threshold_max); > + > static struct attribute *armv8_pmuv3_caps_attrs[] = { > &dev_attr_slots.attr, > &dev_attr_bus_slots.attr, > &dev_attr_bus_width.attr, > + &dev_attr_threshold_max.attr, > NULL, > }; > > @@ -552,7 +612,7 @@ static void armv8pmu_write_counter(struct perf_event *event, u64 value) > armv8pmu_write_hw_counter(event, value); > } > > -static inline void armv8pmu_write_evtype(int idx, u32 val) > +static inline void armv8pmu_write_evtype(int idx, unsigned long val) > { > u32 counter = ARMV8_IDX_TO_COUNTER(idx); > unsigned long mask = ARMV8_PMU_EVTYPE_EVENT | > @@ -921,6 +981,10 @@ static int armv8pmu_set_event_filter(struct hw_perf_event *event, > struct perf_event_attr *attr) > { > unsigned long config_base = 0; > + struct perf_event *perf_event = container_of(attr, struct perf_event, > + attr); > + struct arm_pmu *cpu_pmu = to_arm_pmu(perf_event->pmu); > + u32 th, th_max; > > if (attr->exclude_idle) > return -EPERM; > @@ -952,6 +1016,19 @@ static int armv8pmu_set_event_filter(struct hw_perf_event *event, > if (attr->exclude_user) > config_base |= ARMV8_PMU_EXCLUDE_EL0; > > + /* > + * Insert event counting threshold (FEAT_PMUv3_TH) values. If > + * FEAT_PMUv3_TH isn't implemented, then THWIDTH (threshold_max) will be > + * 0 and no values will be written. > + */ > + th_max = threshold_max(cpu_pmu); > + if (IS_ENABLED(CONFIG_ARM64) && th_max) { As mentioned above, using negative check on CONFIG_ARM64 in threshold_max() will complement this condition here, making it clear that these threshold configurations are applicable only on 64 bit platforms. > + th = min(armv8pmu_event_threshold(attr), th_max); > + config_base |= FIELD_PREP(ARMV8_PMU_EVTYPE_TH, th); > + config_base |= FIELD_PREP(ARMV8_PMU_EVTYPE_TC, > + armv8pmu_event_threshold_control(attr)); Small nit - armv8pmu_event_threshold_control() might also be captured before into a 'tc' local variable before adjusting the config_base similar to 'th'. Also better to add a small comment before 'th = min(..., ..) ' regarding the clamping user input to platform max_threshold. th = min(armv8pmu_event_threshold(attr), th_max); tc = armv8pmu_event_threshold_control(attr)); config_base |= FIELD_PREP(ARMV8_PMU_EVTYPE_TH, th); config_base |= FIELD_PREP(ARMV8_PMU_EVTYPE_TC, tc); > + } > + > /* > * Install the filter into config_base as this is used to > * construct the event type. > diff --git a/include/linux/perf/arm_pmuv3.h b/include/linux/perf/arm_pmuv3.h > index ddd1fec86739..ccbc0f9a74d8 100644 > --- a/include/linux/perf/arm_pmuv3.h > +++ b/include/linux/perf/arm_pmuv3.h > @@ -258,6 +258,7 @@ > #define ARMV8_PMU_BUS_SLOTS_MASK 0xff > #define ARMV8_PMU_BUS_WIDTH_SHIFT 16 > #define ARMV8_PMU_BUS_WIDTH_MASK 0xf > +#define ARMV8_PMU_THWIDTH GENMASK(23, 20) Small nit - may be ARMV8_PMU_TH_WIDTH instead ? > > /* > * This code is really good