Received: by 2002:a6b:500f:0:0:0:0:0 with SMTP id e15csp4847134iob; Mon, 9 May 2022 03:05:14 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxrBuUAjlH2UjFkKY941J8n4o2My8dVrC4u1UuNGnRFQ20K+t+JfOvJAXeIDOktQ/ttiCNN X-Received: by 2002:a63:564c:0:b0:3c1:976d:9f03 with SMTP id g12-20020a63564c000000b003c1976d9f03mr12739329pgm.354.1652090714271; Mon, 09 May 2022 03:05:14 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1652090714; cv=none; d=google.com; s=arc-20160816; b=MfXMdAw5rWSfp7TTYEpQttIbmTWQ2hor83bLhAOyRTRWln/Btr5zNSpC8UNrnJh9dX dokz13VZzAZVCEgxZhaFW1NO/v+o8IAxT+Sj/rxlRnkK9T+fS2Mm/rpTPvDXbTbr4z7e 6N1igqVwSwwvVsulQNYxk0pk0v7rM/PwiAmcV7W5wTCySpiy8gLqWoMZzCYkZsQItcGa fWCJ/KdUzqBFRBXOP70+X/ikc1x9vnOH+dPLjUhfr3o5pkF4Y7EO/s1v3DjNl/AQaHJe 3r9/oHwUgjLzTDs3xEkEUltRx2KJvR0edsB/Y3/Ojvjbo4gxVaxqHbFBwy5blo136IVB D3xg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=Sgeqc5+6lUb+9xfdT4wfaAHvYTUYEwM44Jw+nzAyvBo=; b=L1hoqYG6yqiOjBAFn4tnoVfk22zed0ZvvuSSbSEv8T9AoqSmCkdJcQbSxMUoRQVUrC FZGFQzI2MwB13BVK+S8uFVp2yc2MdXlsII6ZK3jepMPA7rgrgwXLx8WXqYE0D8gKk2TS CtfllxH8kidye/+Kj/i7yMjVPzwJ50ztW9P7bLPeMjLUJCC2H/nEphvlKcyYrW96/kr0 IARHwDsHuyMFRYfAMaGAq+vliNK9kPYaE12K7GJYaWfLLzl2X0MolDOIjurWnPozotFG /jj+OLd6quvwT/IziP/jJBQRyob3ePodCahIKubPCTwS7utX2kGR462VYpnHdR7ndBDn Y3Yg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@ventanamicro.com header.s=google header.b=FGQqvl29; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [2620:137:e000::1:18]) by mx.google.com with ESMTPS id q10-20020a056a00088a00b00510a30ef4c6si5130093pfj.370.2022.05.09.03.05.13 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 09 May 2022 03:05:14 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) client-ip=2620:137:e000::1:18; Authentication-Results: mx.google.com; dkim=pass header.i=@ventanamicro.com header.s=google header.b=FGQqvl29; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 9AE21AD12A; Mon, 9 May 2022 02:45:46 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232041AbiEIDzJ (ORCPT + 99 others); Sun, 8 May 2022 23:55:09 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:35018 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233262AbiEIDvQ (ORCPT ); Sun, 8 May 2022 23:51:16 -0400 Received: from mail-lf1-x136.google.com (mail-lf1-x136.google.com [IPv6:2a00:1450:4864:20::136]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 8F38E13D60 for ; Sun, 8 May 2022 20:47:23 -0700 (PDT) Received: by mail-lf1-x136.google.com with SMTP id h29so21663793lfj.2 for ; Sun, 08 May 2022 20:47:23 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ventanamicro.com; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=Sgeqc5+6lUb+9xfdT4wfaAHvYTUYEwM44Jw+nzAyvBo=; b=FGQqvl29lyENg2WaeeCsobSFPhHfkBy8zJqrxTjHVxd8yEbKReBV3WSC20VFRouFtx +X+HU7SF3kltlgAror42H0bWIunz0QlcNKUfX+6zuuH2r41OfGToaiwpdgfix6kOXJrz JhhVl+zhKtGXjHigiwtQQkBMHiIrP9WbTdxf4c52Tfl2ZzWFDVFCCvayrNcD04Q28h1l GYMwyFwU/HfOxwSPsJiZ65T68Fck3XlTOPV/5dvW7G0z/AZUM36X83+RO6DHm7H58UwQ y6b0fkwh6mncx87Pj0X1N8jD+aalEclY73TSKdpyJyPjk8YWYgkKDe73IA2ATlzoNQDT 8Yrg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=Sgeqc5+6lUb+9xfdT4wfaAHvYTUYEwM44Jw+nzAyvBo=; b=3qfx7tQ58DqdNcugJMx1lnBUmZLg9ZAetg/TLNt7KNZPU7dE7bZ9RT6JN7bzXNj+BX FpbzcqYfZBXkkgwV5Ig1cCKrxIk30QUXcOqQ9ZvprJP4YfB6jp479N++a20kiuBETxQv ieX6H5LNsB4w8S+qCe+jGWvChaQ8i4v34ZIDAgr1TbwTnEGzdPV7BAjkoGNffJnFP4CK f7FuFmjYsiYmt0HoADPBKxpfvW0k8Gt/61VUyMDGrXeePOS1yFgFlpRF7qP6lIVLw8c5 dGqJYrTJ+MXeHjo45nugHDJvs95S+e/9qtixnsXixDGpA4yXCW+dauxlW4rGpoQgtTnp A8Vg== X-Gm-Message-State: AOAM530f0uFX+eFkJlmogKy3Jhfx7EP9YTjOUxbnAG2eA6Q/lpgvpcxo PdfYiPKNfUnhd89CKxKCKoZKZrCtMrCBGHyLCaKtiQ== X-Received: by 2002:a05:6512:e9e:b0:473:be54:ba7b with SMTP id bi30-20020a0565120e9e00b00473be54ba7bmr11177760lfb.419.1652068041785; Sun, 08 May 2022 20:47:21 -0700 (PDT) MIME-Version: 1.0 References: <20220508160749.984-1-jszhang@kernel.org> <20220508160749.984-3-jszhang@kernel.org> In-Reply-To: <20220508160749.984-3-jszhang@kernel.org> From: Anup Patel Date: Mon, 9 May 2022 09:17:10 +0530 Message-ID: Subject: Re: [PATCH v2 2/4] riscv: introduce unified static key mechanism for CPU features To: Jisheng Zhang Cc: Paul Walmsley , Palmer Dabbelt , Albert Ou , Andrey Ryabinin , Alexander Potapenko , Andrey Konovalov , Dmitry Vyukov , Vincenzo Frascino , Alexandre Ghiti , linux-riscv , "linux-kernel@vger.kernel.org List" , kasan-dev@googlegroups.com Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-2.0 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sun, May 8, 2022 at 9:47 PM Jisheng Zhang wrote: > > Currently, riscv has several features why may not be supported on all > riscv platforms, for example, FPU, SV48 and so on. To support unified > kernel Image style, we need to check whether the feature is suportted > or not. If the check sits at hot code path, then performance will be > impacted a lot. static key can be used to solve the issue. In the past > FPU support has been converted to use static key mechanism. I believe > we will have similar cases in the future. It's not just FPU and Sv48. There are several others such as Svinval, Vector, Svnapot, Svpbmt, and many many others. Overall, I agree with the approach of using static key array but I disagree with the semantics and the duplicate stuff being added. Please see more comments below .. > > Similar as arm64 does(in fact, some code is borrowed from arm64), this > patch tries to add an unified mechanism to use static keys for all > the cpu features by implementing an array of default-false static keys > and enabling them when detected. The cpus_have_*_cap() check uses the > static keys if riscv_const_caps_ready is finalized, otherwise the > compiler generates the bitmap test. First of all, we should stop calling this a feature (like ARM does). Rather, we should call these as isa extensions ("isaext") to align with the RISC-V priv spec and RISC-V profiles spec. For all the ISA optionalities which do not have distinct extension name, the RISC-V profiles spec is assigning names to all such optionalities. Another issue with semantics is that this patch assumes all features are enabled by default and we selectively disable it. This contrary to the approach taken by existing arch/riscv/kernel/cpufeature.c which assumes nothing is enabled by default and we selectively enable it. > > Signed-off-by: Jisheng Zhang > --- > arch/riscv/Makefile | 3 + > arch/riscv/include/asm/cpufeature.h | 94 +++++++++++++++++++++++++++++ > arch/riscv/kernel/cpufeature.c | 23 +++++++ > arch/riscv/tools/Makefile | 22 +++++++ > arch/riscv/tools/cpucaps | 5 ++ > arch/riscv/tools/gen-cpucaps.awk | 40 ++++++++++++ > 6 files changed, 187 insertions(+) > create mode 100644 arch/riscv/include/asm/cpufeature.h > create mode 100644 arch/riscv/tools/Makefile > create mode 100644 arch/riscv/tools/cpucaps > create mode 100755 arch/riscv/tools/gen-cpucaps.awk > > diff --git a/arch/riscv/Makefile b/arch/riscv/Makefile > index 7d81102cffd4..f4df67369d84 100644 > --- a/arch/riscv/Makefile > +++ b/arch/riscv/Makefile > @@ -154,3 +154,6 @@ PHONY += rv64_randconfig > rv64_randconfig: > $(Q)$(MAKE) KCONFIG_ALLCONFIG=$(srctree)/arch/riscv/configs/64-bit.config \ > -f $(srctree)/Makefile randconfig > + > +archprepare: > + $(Q)$(MAKE) $(build)=arch/riscv/tools kapi > diff --git a/arch/riscv/include/asm/cpufeature.h b/arch/riscv/include/asm/cpufeature.h > new file mode 100644 > index 000000000000..d80ddd2f3b49 > --- /dev/null > +++ b/arch/riscv/include/asm/cpufeature.h We don't need a separate header for this. All this belongs to arch/riscv/include/asm/hwcap.h > @@ -0,0 +1,94 @@ > +/* SPDX-License-Identifier: GPL-2.0-only */ > +/* > + * Copyright (C) 2014 Linaro Ltd. > + * Copyright (C) 2022 Jisheng Zhang > + */ > + > +#ifndef __ASM_CPUFEATURE_H > +#define __ASM_CPUFEATURE_H > + > +#include > + > +#include > +#include > +#include > + > +extern DECLARE_BITMAP(cpu_hwcaps, RISCV_NCAPS); This is a redundant bitmap. Please re-use "riscv_isa" bitmap for this the ISA extensions. > +extern struct static_key_false cpu_hwcap_keys[RISCV_NCAPS]; This should be called "riscv_isa_keys" > +extern struct static_key_false riscv_const_caps_ready; This should be called "riscv_isa_keys_ready". > + > +static __always_inline bool system_capabilities_finalized(void) Another misaligned name. This should be called "riscv_isa_keys_finalized()". > +{ > + return static_branch_likely(&riscv_const_caps_ready); > +} > + > +/* > + * Test for a capability with a runtime check. > + * > + * Before the capability is detected, this returns false. > + */ > +static inline bool cpus_have_cap(unsigned int num) > +{ > + if (num >= RISCV_NCAPS) > + return false; > + return test_bit(num, cpu_hwcaps); > +} This should be called riscv_isa_have_extension() and it should internally call "__riscv_isa_extension_available(NULL, num)". > + > +/* > + * Test for a capability without a runtime check. > + * > + * Before capabilities are finalized, this returns false. > + * After capabilities are finalized, this is patched to avoid a runtime check. > + * > + * @num must be a compile-time constant. > + */ > +static __always_inline bool __cpus_have_const_cap(int num) This should be named "__riscv_isa_have_const_extension()" > +{ > + if (num >= RISCV_NCAPS) > + return false; > + return static_branch_unlikely(&cpu_hwcap_keys[num]); > +} > + > +/* > + * Test for a capability without a runtime check. > + * > + * Before capabilities are finalized, this will BUG(). > + * After capabilities are finalized, this is patched to avoid a runtime check. > + * > + * @num must be a compile-time constant. > + */ > +static __always_inline bool cpus_have_final_cap(int num) This should be called "riscv_isa_have_final_extension()" > +{ > + if (system_capabilities_finalized()) > + return __cpus_have_const_cap(num); > + else > + BUG(); > +} > + > +/* > + * Test for a capability, possibly with a runtime check. > + * > + * Before capabilities are finalized, this behaves as cpus_have_cap(). > + * After capabilities are finalized, this is patched to avoid a runtime check. > + * > + * @num must be a compile-time constant. > + */ > +static __always_inline bool cpus_have_const_cap(int num) Same comment as above. > +{ > + if (system_capabilities_finalized()) > + return __cpus_have_const_cap(num); > + else > + return cpus_have_cap(num); > +} > + > +static inline void cpus_set_cap(unsigned int num) Same comment as above. > +{ > + if (num >= RISCV_NCAPS) { > + pr_warn("Attempt to set an illegal CPU capability (%d >= %d)\n", > + num, RISCV_NCAPS); > + } else { > + __set_bit(num, cpu_hwcaps); > + } > +} > + > +#endif > diff --git a/arch/riscv/kernel/cpufeature.c b/arch/riscv/kernel/cpufeature.c > index 1b2d42d7f589..e6c72cad0c1c 100644 > --- a/arch/riscv/kernel/cpufeature.c > +++ b/arch/riscv/kernel/cpufeature.c > @@ -9,6 +9,7 @@ > #include > #include > #include > +#include > #include > #include > #include > @@ -25,6 +26,15 @@ static DECLARE_BITMAP(riscv_isa, RISCV_ISA_EXT_MAX) __read_mostly; > __ro_after_init DEFINE_STATIC_KEY_FALSE(cpu_hwcap_fpu); > #endif > > +DECLARE_BITMAP(cpu_hwcaps, RISCV_NCAPS); > +EXPORT_SYMBOL(cpu_hwcaps); Just like the previous comment. This is a redundant bitmap. Please use "riscv_isa" bitmap for this purpose. > + > +DEFINE_STATIC_KEY_ARRAY_FALSE(cpu_hwcap_keys, RISCV_NCAPS); > +EXPORT_SYMBOL(cpu_hwcap_keys); > + > +DEFINE_STATIC_KEY_FALSE(riscv_const_caps_ready); > +EXPORT_SYMBOL(riscv_const_caps_ready); Please see comments above. > + > /** > * riscv_isa_extension_base() - Get base extension word > * > @@ -62,6 +72,17 @@ bool __riscv_isa_extension_available(const unsigned long *isa_bitmap, int bit) > } > EXPORT_SYMBOL_GPL(__riscv_isa_extension_available); > > +static void __init enable_cpu_capabilities(void) > +{ > + int i; > + > + for (i = 0; i < RISCV_NCAPS; i++) { > + if (!cpus_have_cap(i)) > + continue; > + static_branch_enable(&cpu_hwcap_keys[i]); > + } > +} > + > void __init riscv_fill_hwcap(void) > { > struct device_node *node; > @@ -236,4 +257,6 @@ void __init riscv_fill_hwcap(void) > if (elf_hwcap & (COMPAT_HWCAP_ISA_F | COMPAT_HWCAP_ISA_D)) > static_branch_enable(&cpu_hwcap_fpu); > #endif > + enable_cpu_capabilities(); > + static_branch_enable(&riscv_const_caps_ready); > } > diff --git a/arch/riscv/tools/Makefile b/arch/riscv/tools/Makefile > new file mode 100644 > index 000000000000..932b4fe5c768 > --- /dev/null > +++ b/arch/riscv/tools/Makefile > @@ -0,0 +1,22 @@ > +# SPDX-License-Identifier: GPL-2.0 > + > +gen := arch/$(ARCH)/include/generated > +kapi := $(gen)/asm > + > +kapi-hdrs-y := $(kapi)/cpucaps.h > + > +targets += $(addprefix ../../../,$(gen-y) $(kapi-hdrs-y)) > + > +PHONY += kapi > + > +kapi: $(kapi-hdrs-y) $(gen-y) > + > +# Create output directory if not already present > +_dummy := $(shell [ -d '$(kapi)' ] || mkdir -p '$(kapi)') > + > +quiet_cmd_gen_cpucaps = GEN $@ > + cmd_gen_cpucaps = mkdir -p $(dir $@) && \ > + $(AWK) -f $(filter-out $(PHONY),$^) > $@ > + > +$(kapi)/cpucaps.h: $(src)/gen-cpucaps.awk $(src)/cpucaps FORCE > + $(call if_changed,gen_cpucaps) > diff --git a/arch/riscv/tools/cpucaps b/arch/riscv/tools/cpucaps > new file mode 100644 > index 000000000000..cb1ff2747859 > --- /dev/null > +++ b/arch/riscv/tools/cpucaps > @@ -0,0 +1,5 @@ > +# SPDX-License-Identifier: GPL-2.0 > +# > +# Internal CPU capabilities constants, keep this list sorted > + > +HAS_NO_FPU How can "No FPU" be a CPU capability ? We have ISA extensions 'F' and 'D' which tells us whether an FPU is available or not. I think this file should be a table with two columns " " I this this file should look like this: i yes m yes a yes c yes f yes d yes h yes sv48 no sv57 no sstc yes svinval yes svpbmt yes svnapot yes sscofpmf yes ... > diff --git a/arch/riscv/tools/gen-cpucaps.awk b/arch/riscv/tools/gen-cpucaps.awk > new file mode 100755 > index 000000000000..52a1e1b064ad > --- /dev/null > +++ b/arch/riscv/tools/gen-cpucaps.awk > @@ -0,0 +1,40 @@ > +#!/bin/awk -f > +# SPDX-License-Identifier: GPL-2.0 > +# gen-cpucaps.awk: riscv cpucaps header generator > +# > +# Usage: awk -f gen-cpucaps.awk cpucaps.txt > + > +# Log an error and terminate > +function fatal(msg) { > + print "Error at line " NR ": " msg > "/dev/stderr" > + exit 1 > +} > + > +# skip blank lines and comment lines > +/^$/ { next } > +/^#/ { next } > + > +BEGIN { > + print "#ifndef __ASM_CPUCAPS_H" > + print "#define __ASM_CPUCAPS_H" > + print "" > + print "/* Generated file - do not edit */" > + cap_num = 0 > + print "" > +} > + > +/^[vA-Z0-9_]+$/ { > + printf("#define RISCV_%-30s\t%d\n", $0, cap_num++) > + next > +} > + > +END { > + printf("#define RISCV_NCAPS\t\t\t\t%d\n", cap_num) > + print "" > + print "#endif /* __ASM_CPUCAPS_H */" > +} This script need to change refer capabilities as extensions. For every extension, there should be two defines. For e.g. "sstc" extension should have following defines #define RISCV_ISA_EXT_sstc <#num> #define RISCV_ISA_EXT_FROMSTR_sstc <1|0> > + > +# Any lines not handled by previous rules are unexpected > +{ > + fatal("unhandled statement") > +} > -- > 2.34.1 > Regards, Anup