Received: by 2002:a05:7412:8d10:b0:f3:1519:9f41 with SMTP id bj16csp407767rdb; Tue, 5 Dec 2023 08:35:08 -0800 (PST) X-Google-Smtp-Source: AGHT+IFtkwLfhhxBrk3iZuvStAFk/jtCUACrsuNr745v5RpOLxIl7uhEWS0CTdzR+FNiua4nJCgr X-Received: by 2002:a05:6a20:9146:b0:18b:6f9:dc22 with SMTP id x6-20020a056a20914600b0018b06f9dc22mr4185834pzc.41.1701794108395; Tue, 05 Dec 2023 08:35:08 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1701794108; cv=none; d=google.com; s=arc-20160816; b=m1MDTUm3XyS4BcQ1rK2BVQf8QHiASxB6Jiahy52coRfYDsVp+6bxzYFtxmJh1VM/xv sf5ZjxJ8uHRBDrglKHvdAh+DaWgMUjrPYx3ADvHogv22SkJxlEMw480epgNqKF/IaAcy 0J4EcND7uYtwXz24l5pk94L923k76Quoj/GxcQA+9adKrw9o0Z0QKik2xLgmb/DkqpAm fcDYF5bSiuVLgpW6dcSQ/ShQPRkRzo2B0YQToIYbexPMV38vltDWsLNAc3liSr6BpCjM C+L5IdGae2s/RTogJYEMMzH6mGgGRY04kRt6sWpkQZBDfLKgNLUO/n1sXGkqHp3wowtg rE9A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id; bh=bi/sGZ6/y8jg0H+Vman2RQYIdZ6Gnz41EoPwkxPvtOU=; fh=lDDQL8Ty2JJP7Zirm+hn7LN083XHCW0tMYtmg/XYjXE=; b=DGT+Ox8n4WvWh4BwzTzsz8PYAS3pc8GjZwTQ24G8fzKPtJSipm2L1emCsUc+DyK24y a49C8/OcwTSkBEVTGnx3XwvU21KVkc731f+a4/5tojiqw3nqi+YChgQFwF2LGo5evT+y a7gRfLCcEsMK+XvXCVuXep6FK8MKdYbI46vjbgGviJI7KGpb1QNsf9UjLuObn5c4btbP JZR13VGUUZxA42AmZ8kAByfaMnU5sM9pATPI5j1HYwnnfd5IXWOVR2+GwOfUo4Xts8yI DGNYp/dzdlLFxCSioryM7dIXlJ4bMv8Yl1V+Vayxr92aIeqUICQzhn8QZBSz8ynbormA kc6A== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:7 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from snail.vger.email (snail.vger.email. [2620:137:e000::3:7]) by mx.google.com with ESMTPS id l6-20020a63ba46000000b005c67bf3805fsi4762639pgu.309.2023.12.05.08.35.07 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 05 Dec 2023 08:35:08 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:7 as permitted sender) client-ip=2620:137:e000::3:7; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:7 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by snail.vger.email (Postfix) with ESMTP id B9F7380B1D30; Tue, 5 Dec 2023 08:35:04 -0800 (PST) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.11 at snail.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232088AbjLEQev (ORCPT + 99 others); Tue, 5 Dec 2023 11:34:51 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:56626 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229462AbjLEQeu (ORCPT ); Tue, 5 Dec 2023 11:34:50 -0500 Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id A573312C; Tue, 5 Dec 2023 08:34:55 -0800 (PST) Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 753F0139F; Tue, 5 Dec 2023 08:35:41 -0800 (PST) Received: from [10.57.43.244] (unknown [10.57.43.244]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 7DC363F6C4; Tue, 5 Dec 2023 08:34:53 -0800 (PST) Message-ID: <0c423476-3126-4bc5-a6df-03c37f107717@arm.com> Date: Tue, 5 Dec 2023 16:34:52 +0000 MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH v6 2/3] arm64: perf: Add support for event counting threshold Content-Language: en-GB To: James Clark , Will Deacon Cc: linux-arm-kernel@lists.infradead.org, linux-perf-users@vger.kernel.org, mark.rutland@arm.com, anshuman.khandual@arm.com, namhyung@gmail.com, Catalin Marinas , Jonathan Corbet , linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org References: <20231124102857.1106453-1-james.clark@arm.com> <20231124102857.1106453-3-james.clark@arm.com> <20231205131414.GA18119@willie-the-truck> <1a74fef6-7f51-a518-4258-5f693b56add9@arm.com> From: Suzuki K Poulose In-Reply-To: <1a74fef6-7f51-a518-4258-5f693b56add9@arm.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00, RCVD_IN_DNSWL_BLOCKED,SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (snail.vger.email [0.0.0.0]); Tue, 05 Dec 2023 08:35:04 -0800 (PST) On 05/12/2023 15:05, James Clark wrote: > > > On 05/12/2023 13:14, Will Deacon wrote: >> On Fri, Nov 24, 2023 at 10:28:56AM +0000, James Clark wrote: >>> diff --git a/drivers/perf/arm_pmuv3.c b/drivers/perf/arm_pmuv3.c >>> index 1d40d794f5e4..eb1ef84e1dbb 100644 >>> --- a/drivers/perf/arm_pmuv3.c >>> +++ b/drivers/perf/arm_pmuv3.c >>> @@ -15,6 +15,7 @@ >>> #include >>> >>> #include >>> +#include >>> #include >>> #include >>> #include >>> @@ -294,9 +295,20 @@ static const struct attribute_group armv8_pmuv3_events_attr_group = { >>> .is_visible = armv8pmu_event_attr_is_visible, >>> }; >>> >>> +#define THRESHOLD_LOW 2 >>> +#define THRESHOLD_HIGH 13 >>> +#define THRESHOLD_CNT 14 >>> +#define THRESHOLD_CMP_LO 15 >>> +#define THRESHOLD_CMP_HI 16 >> >> Do you think THWIDTH might extend beyond 12 bits in future? If so, it might >> be worth juggling these bits around a bit so it's not sandwiched between >> 'rdpmc' and 'threshold_compare'. I defer to your judgement, however. >> > > It's possible, both PMMIR.THWIDTH and PMEVTYPER.TH currently have unused > bits above them. I can easily move threshold to the end, I suppose that > covers us at least until someone adds a new field above that. > >>> PMU_FORMAT_ATTR(event, "config:0-15"); >>> PMU_FORMAT_ATTR(long, "config1:0"); >>> PMU_FORMAT_ATTR(rdpmc, "config1:1"); >>> +PMU_FORMAT_ATTR(threshold, "config1:" __stringify(THRESHOLD_LOW) "-" >>> + __stringify(THRESHOLD_HIGH)); >>> +PMU_FORMAT_ATTR(threshold_compare, "config1:" __stringify(THRESHOLD_CMP_LO) "-" >>> + __stringify(THRESHOLD_CMP_HI)); >>> +PMU_FORMAT_ATTR(threshold_count, "config1:" __stringify(THRESHOLD_CNT)); >>> >>> static int sysctl_perf_user_access __read_mostly; >>> >>> @@ -310,10 +322,33 @@ static inline bool armv8pmu_event_want_user_access(struct perf_event *event) >>> return event->attr.config1 & 0x2; >>> } >>> >>> +static inline u32 armv8pmu_event_threshold(struct perf_event_attr *attr) >>> +{ >>> + return FIELD_GET(GENMASK(THRESHOLD_HIGH, THRESHOLD_LOW), attr->config1); >>> +} >>> + >>> +static inline u8 armv8pmu_event_threshold_control(struct perf_event_attr *attr) >> >> You can drop the 'inline's for these functions (and, in fact, this whole >> file could do with that cleanup :) >> > > Will do. > >>> +{ >>> + u8 th_compare = FIELD_GET(GENMASK(THRESHOLD_CMP_HI, THRESHOLD_CMP_LO), >>> + attr->config1); >>> + u8 th_count = FIELD_GET(BIT(THRESHOLD_CNT), attr->config1); >> >> I think this is correct, but you might want to look at how we handle this >> in the SPE driver as I think it ends up looking cleaner and makes it pretty >> obvious which bits correspond to the user ABI (i.e. config fields) and which >> bits are part of architectural registers. I'm not saying you have to do it >> that way, but please take a look if you haven't already. >> > > Yeah I could take the GEN_PMU_FORMAT_ATTR() etc macros out of there and > re-use them here too. And also for the other existing configs in > arm_pmuv3.c. > >>> + /* >>> + * The count bit is always the bottom bit of the full control field, and >>> + * the comparison is the upper two bits, but it's not explicitly >>> + * labelled in the Arm ARM. For the Perf interface we split it into two >>> + * fields, so reconstruct it here. >>> + */ >>> + return (th_compare << 1) | th_count; >>> +} >>> + >>> static struct attribute *armv8_pmuv3_format_attrs[] = { >>> &format_attr_event.attr, >>> &format_attr_long.attr, >>> &format_attr_rdpmc.attr, >>> + &format_attr_threshold.attr, >>> + &format_attr_threshold_compare.attr, >>> + &format_attr_threshold_count.attr, >>> NULL, >>> }; >>> >>> @@ -365,10 +400,38 @@ static ssize_t bus_width_show(struct device *dev, struct device_attribute *attr, >>> >>> static DEVICE_ATTR_RO(bus_width); >>> >>> +static u32 threshold_max(struct arm_pmu *cpu_pmu) >>> +{ >>> + /* >>> + * PMMIR.THWIDTH is readable and non-zero on aarch32, but it would be >>> + * impossible to write the threshold in the upper 32 bits of PMEVTYPER. >>> + */ >>> + if (IS_ENABLED(CONFIG_ARM)) >>> + return 0; >>> + >>> + /* >>> + * The largest value that can be written to PMEVTYPER_EL0.TH is >>> + * (2 ^ PMMIR.THWIDTH) - 1. >>> + */ >>> + return (1 << FIELD_GET(ARMV8_PMU_THWIDTH, cpu_pmu->reg_pmmir)) - 1; >>> +} >>> + >>> +static ssize_t threshold_max_show(struct device *dev, >>> + struct device_attribute *attr, char *page) >>> +{ >>> + struct pmu *pmu = dev_get_drvdata(dev); >>> + struct arm_pmu *cpu_pmu = container_of(pmu, struct arm_pmu, pmu); >>> + >>> + return sysfs_emit(page, "0x%08x\n", threshold_max(cpu_pmu)); >>> +} >>> + >>> +static DEVICE_ATTR_RO(threshold_max); >>> + >>> static struct attribute *armv8_pmuv3_caps_attrs[] = { >>> &dev_attr_slots.attr, >>> &dev_attr_bus_slots.attr, >>> &dev_attr_bus_width.attr, >>> + &dev_attr_threshold_max.attr, >>> NULL, >>> }; >>> >>> @@ -552,7 +615,7 @@ static void armv8pmu_write_counter(struct perf_event *event, u64 value) >>> armv8pmu_write_hw_counter(event, value); >>> } >>> >>> -static inline void armv8pmu_write_evtype(int idx, u32 val) >>> +static inline void armv8pmu_write_evtype(int idx, unsigned long val) >>> { >>> u32 counter = ARMV8_IDX_TO_COUNTER(idx); >>> unsigned long mask = ARMV8_PMU_EVTYPE_EVENT | >>> @@ -921,6 +984,10 @@ static int armv8pmu_set_event_filter(struct hw_perf_event *event, >>> struct perf_event_attr *attr) >>> { >>> unsigned long config_base = 0; >>> + struct perf_event *perf_event = container_of(attr, struct perf_event, >>> + attr); >>> + struct arm_pmu *cpu_pmu = to_arm_pmu(perf_event->pmu); >>> + u32 th, th_max; >>> >>> if (attr->exclude_idle) >>> return -EPERM; >>> @@ -952,6 +1019,21 @@ static int armv8pmu_set_event_filter(struct hw_perf_event *event, >>> if (attr->exclude_user) >>> config_base |= ARMV8_PMU_EXCLUDE_EL0; >>> >>> + /* >>> + * Insert event counting threshold (FEAT_PMUv3_TH) values. If >>> + * FEAT_PMUv3_TH isn't implemented, then THWIDTH (threshold_max) will be >>> + * 0 and no values will be written. >>> + */ >>> + th_max = threshold_max(cpu_pmu); >>> + if (IS_ENABLED(CONFIG_ARM64) && th_max) { >> >> Why is the IS_ENABLED() check needed here? >> > > The FIELD_PREP() below would cause a compilation error on arm32 because > TH and TC are above 32 bits. I can add a comment. > >>> + th = min(armv8pmu_event_threshold(attr), th_max); >>> + if (th) { >> >> Why is it useful to take the minimum here? If userspace asks for a value >> bigger than the maximum support threshold, shouldn't we return an error >> rather than silently clamp it? >> > > I think it probably was just so I didn't have to think about what would > happen when the value varied between cores. > > But you're right, it looks like I can add a validation function to > struct arm_pmu and call it from armpmu_event_init(). armpmu->map_event() > and armpmu->set_event_filter() are already called from there so I think > the validation could technically be added to one of those, but that's > probably a bit hacky. I don't know if you have any preference for where > the threshold validation should happen? I think the other reason was to make sure we never set those fields for something that doesn't support the feature ? (Given th_max is set to 0 in that case). But I agree that validation is a better approach than silently masking it. Suzuki > >>> + config_base |= FIELD_PREP(ARMV8_PMU_EVTYPE_TH, th); >>> + config_base |= FIELD_PREP(ARMV8_PMU_EVTYPE_TC, >>> + armv8pmu_event_threshold_control(attr)); >>> + } >>> + } >>> + >>> /* >>> * Install the filter into config_base as this is used to >>> * construct the event type. >>> diff --git a/include/linux/perf/arm_pmuv3.h b/include/linux/perf/arm_pmuv3.h >>> index ddd1fec86739..ccbc0f9a74d8 100644 >>> --- a/include/linux/perf/arm_pmuv3.h >>> +++ b/include/linux/perf/arm_pmuv3.h >>> @@ -258,6 +258,7 @@ >>> #define ARMV8_PMU_BUS_SLOTS_MASK 0xff >>> #define ARMV8_PMU_BUS_WIDTH_SHIFT 16 >>> #define ARMV8_PMU_BUS_WIDTH_MASK 0xf >>> +#define ARMV8_PMU_THWIDTH GENMASK(23, 20) >> >> It's a bit messy having a mixture of GENMASK and MASK/SHIFT pairs. Please >> can you either update what's there to use GENMASK, or use SHIFT/MASK for the >> new addition? >> >> Will >> > > Yep will do. Thanks for the review. > > James