From: Conor Dooley
To: linux-riscv@lists.infradead.org
Cc: conor@kernel.org, Conor Dooley, xiao.w.wang@intel.com, Andrew Jones,
 pulehui@huawei.com, Charlie Jenkins, Paul Walmsley, Palmer Dabbelt,
 linux-kernel@vger.kernel.org
Subject: [PATCH v1] RISC-V: separate Zbb optimisations requiring and not requiring toolchain support
Date: Wed, 15 May 2024 16:27:40 +0100
Message-ID: <20240515-hedging-passage-44fd394ab1be@spud>

From: Conor Dooley

It seems a bit ridiculous to require toolchain support for BPF to
assemble Zbb instructions, so introduce a hidden Kconfig option that
controls the use of any toolchain-requiring optimisations while support
is available.

Zbb support has always depended on alternatives, so while adjusting the
config options guarding optimisations, remove any checks for whether or
not alternatives are enabled.

Signed-off-by: Conor Dooley
---
This patch stems out of a conversation about Zba optimisations in BPF.
I'm not super sold on the approach in all honesty, even though we
recently had a conversation about respecting the Kconfig options - but
at this point I'd be convinced to just add some wording to the Kconfig
options mentioning that BPF optimisations are excluded. Having hidden
options that mean someone can turn on what they think are Zbb
optimisations but not actually get any cos their toolchain doesn't
support it seems crap to me. I don't wanna add another user-visible
option for that situation cos I wanna try to minimise the extent of our
extension-related Kconfig options, not blow them up like Augustus Gloop!

Cheers,
Conor.

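For anyone trying to picture how the two options interact, here is a rough
sketch in plain C. It is only a model of the Kconfig logic in this patch,
not kernel code: the struct and the helper names are made up for
illustration, and the "no toolchain needed" case is my reading of the
commit message (code such as the BPF JIT that emits Zbb instructions
itself), since this patch does not touch BPF.

```c
#include <stdbool.h>

/* Hypothetical stand-ins for the Kconfig symbols; names are illustrative. */
struct zbb_config {
	bool riscv_isa_zbb;     /* user-visible RISCV_ISA_ZBB */
	bool toolchain_has_zbb; /* TOOLCHAIN_HAS_ZBB, probed from the assembler */
	bool riscv_alternative; /* RISCV_ALTERNATIVE */
};

/*
 * Models the hidden RISCV_ISA_ZBB_ALT symbol:
 *   def_bool RISCV_ISA_ZBB
 *   depends on TOOLCHAIN_HAS_ZBB
 *   depends on RISCV_ALTERNATIVE
 * This is the guard used by hweight, bitops, csum and the string routines.
 */
static bool zbb_alt_enabled(const struct zbb_config *c)
{
	return c->riscv_isa_zbb && c->toolchain_has_zbb &&
	       c->riscv_alternative;
}

/*
 * Models an optimisation that does not need the assembler to understand
 * Zbb (assumed here to be the BPF case): only the visible option matters.
 */
static bool zbb_no_toolchain_enabled(const struct zbb_config *c)
{
	return c->riscv_isa_zbb;
}
```

With an old assembler (toolchain_has_zbb false) and RISCV_ISA_ZBB=y, the
first helper returns false and the second returns true, which is exactly
the "turned it on but got nothing from the toolchain-dependent paths"
situation described above.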
CC: xiao.w.wang@intel.com
CC: Andrew Jones
CC: pulehui@huawei.com
CC: Charlie Jenkins
CC: Paul Walmsley
CC: Palmer Dabbelt
CC: Conor Dooley
CC: linux-riscv@lists.infradead.org
CC: linux-kernel@vger.kernel.org
---
 arch/riscv/Kconfig                    | 15 ++++++++++++---
 arch/riscv/include/asm/arch_hweight.h |  4 ++--
 arch/riscv/include/asm/bitops.h       |  4 ++--
 arch/riscv/include/asm/checksum.h     |  3 +--
 arch/riscv/lib/csum.c                 |  9 +++------
 arch/riscv/lib/strcmp.S               |  4 ++--
 arch/riscv/lib/strlen.S               |  4 ++--
 arch/riscv/lib/strncmp.S              |  4 ++--
 8 files changed, 26 insertions(+), 21 deletions(-)

diff --git a/arch/riscv/Kconfig b/arch/riscv/Kconfig
index e927b52b420c..f216a52ed181 100644
--- a/arch/riscv/Kconfig
+++ b/arch/riscv/Kconfig
@@ -605,14 +605,23 @@ config TOOLCHAIN_HAS_VECTOR_CRYPTO
 	def_bool $(as-instr, .option arch$(comma) +v$(comma) +zvkb)
 	depends on AS_HAS_OPTION_ARCH
 
-config RISCV_ISA_ZBB
-	bool "Zbb extension support for bit manipulation instructions"
+config RISCV_ISA_ZBB_ALT
+	def_bool RISCV_ISA_ZBB
 	depends on TOOLCHAIN_HAS_ZBB
 	depends on RISCV_ALTERNATIVE
+	help
+	  This option controls whether or not we build optimisations that
+	  depend on toolchain support. It's automatically enabled whenever the
+	  toolchain in use supports assembling Zbb instructions and
+	  RISCV_ISA_ZBB is set.
+
+config RISCV_ISA_ZBB
+	bool "Zbb extension support for bit manipulation instructions"
 	default y
 	help
 	  Add support for enabling optimisations in the kernel when the
-	  Zbb extension is detected at boot.
+	  Zbb extension is detected at boot. Some optimisations may
+	  additionally depend on toolchain support for Zbb.
 
 	  The Zbb extension provides instructions to accelerate a number
 	  of bit-specific operations (count bit population, sign extending,
diff --git a/arch/riscv/include/asm/arch_hweight.h b/arch/riscv/include/asm/arch_hweight.h
index 85b2c443823e..a677f6b82228 100644
--- a/arch/riscv/include/asm/arch_hweight.h
+++ b/arch/riscv/include/asm/arch_hweight.h
@@ -19,7 +19,7 @@
 
 static __always_inline unsigned int __arch_hweight32(unsigned int w)
 {
-#ifdef CONFIG_RISCV_ISA_ZBB
+#ifdef CONFIG_RISCV_ISA_ZBB_ALT
 	asm goto(ALTERNATIVE("j %l[legacy]", "nop", 0,
 				      RISCV_ISA_EXT_ZBB, 1)
 			  : : : : legacy);
@@ -50,7 +50,7 @@ static inline unsigned int __arch_hweight8(unsigned int w)
 #if BITS_PER_LONG == 64
 static __always_inline unsigned long __arch_hweight64(__u64 w)
 {
-# ifdef CONFIG_RISCV_ISA_ZBB
+# ifdef CONFIG_RISCV_ISA_ZBB_ALT
 	asm goto(ALTERNATIVE("j %l[legacy]", "nop", 0,
 				      RISCV_ISA_EXT_ZBB, 1)
 			  : : : : legacy);
diff --git a/arch/riscv/include/asm/bitops.h b/arch/riscv/include/asm/bitops.h
index 880606b0469a..3ed810a6123d 100644
--- a/arch/riscv/include/asm/bitops.h
+++ b/arch/riscv/include/asm/bitops.h
@@ -15,7 +15,7 @@
 #include
 #include
 
-#if !defined(CONFIG_RISCV_ISA_ZBB) || defined(NO_ALTERNATIVE)
+#if !defined(CONFIG_RISCV_ISA_ZBB_ALT) || defined(NO_ALTERNATIVE)
 #include
 #include
 #include
@@ -175,7 +175,7 @@ static __always_inline int variable_fls(unsigned int x)
 	variable_fls(x_);					\
 })
 
-#endif /* !defined(CONFIG_RISCV_ISA_ZBB) || defined(NO_ALTERNATIVE) */
+#endif /* !defined(CONFIG_RISCV_ISA_ZBB_ALT) || defined(NO_ALTERNATIVE) */
 
 #include
 #include
diff --git a/arch/riscv/include/asm/checksum.h b/arch/riscv/include/asm/checksum.h
index 88e6f1499e88..956224ea8199 100644
--- a/arch/riscv/include/asm/checksum.h
+++ b/arch/riscv/include/asm/checksum.h
@@ -49,8 +49,7 @@ static inline __sum16 ip_fast_csum(const void *iph, unsigned int ihl)
 	 * ZBB only saves three instructions on 32-bit and five on 64-bit so not
 	 * worth checking if supported without Alternatives.
 	 */
-	if (IS_ENABLED(CONFIG_RISCV_ISA_ZBB) &&
-	    IS_ENABLED(CONFIG_RISCV_ALTERNATIVE)) {
+	if (IS_ENABLED(CONFIG_RISCV_ISA_ZBB_ALT)) {
 		unsigned long fold_temp;
 
 		asm goto(ALTERNATIVE("j %l[no_zbb]", "nop", 0,
diff --git a/arch/riscv/lib/csum.c b/arch/riscv/lib/csum.c
index 7fb12c59e571..7a97394c252b 100644
--- a/arch/riscv/lib/csum.c
+++ b/arch/riscv/lib/csum.c
@@ -44,8 +44,7 @@ __sum16 csum_ipv6_magic(const struct in6_addr *saddr,
 	 * Zbb support saves 4 instructions, so not worth checking without
 	 * alternatives if supported
 	 */
-	if (IS_ENABLED(CONFIG_RISCV_ISA_ZBB) &&
-	    IS_ENABLED(CONFIG_RISCV_ALTERNATIVE)) {
+	if (IS_ENABLED(CONFIG_RISCV_ISA_ZBB_ALT)) {
 		unsigned long fold_temp;
 
 		/*
@@ -161,8 +160,7 @@ do_csum_with_alignment(const unsigned char *buff, int len)
 	 * Zbb support saves 6 instructions, so not worth checking without
 	 * alternatives if supported
 	 */
-	if (IS_ENABLED(CONFIG_RISCV_ISA_ZBB) &&
-	    IS_ENABLED(CONFIG_RISCV_ALTERNATIVE)) {
+	if (IS_ENABLED(CONFIG_RISCV_ISA_ZBB_ALT)) {
 		unsigned long fold_temp;
 
 		/*
@@ -248,8 +246,7 @@ do_csum_no_alignment(const unsigned char *buff, int len)
 	 * Zbb support saves 6 instructions, so not worth checking without
 	 * alternatives if supported
 	 */
-	if (IS_ENABLED(CONFIG_RISCV_ISA_ZBB) &&
-	    IS_ENABLED(CONFIG_RISCV_ALTERNATIVE)) {
+	if (IS_ENABLED(CONFIG_RISCV_ISA_ZBB_ALT)) {
 		unsigned long fold_temp;
 
 		/*
diff --git a/arch/riscv/lib/strcmp.S b/arch/riscv/lib/strcmp.S
index 687b2bea5c43..a4dd2ac306f1 100644
--- a/arch/riscv/lib/strcmp.S
+++ b/arch/riscv/lib/strcmp.S
@@ -8,7 +8,7 @@
 /* int strcmp(const char *cs, const char *ct) */
 SYM_FUNC_START(strcmp)
 
-	ALTERNATIVE("nop", "j strcmp_zbb", 0, RISCV_ISA_EXT_ZBB, CONFIG_RISCV_ISA_ZBB)
+	ALTERNATIVE("nop", "j strcmp_zbb", 0, RISCV_ISA_EXT_ZBB, CONFIG_RISCV_ISA_ZBB_ALT)
 
 	/*
 	 * Returns
@@ -43,7 +43,7 @@ SYM_FUNC_START(strcmp)
  * The code was published as part of the bitmanip manual
  * in Appendix A.
  */
-#ifdef CONFIG_RISCV_ISA_ZBB
+#ifdef CONFIG_RISCV_ISA_ZBB_ALT
 strcmp_zbb:
 
 .option push
diff --git a/arch/riscv/lib/strlen.S b/arch/riscv/lib/strlen.S
index 8ae3064e45ff..3ab1310a7b83 100644
--- a/arch/riscv/lib/strlen.S
+++ b/arch/riscv/lib/strlen.S
@@ -8,7 +8,7 @@
 /* int strlen(const char *s) */
 SYM_FUNC_START(strlen)
 
-	ALTERNATIVE("nop", "j strlen_zbb", 0, RISCV_ISA_EXT_ZBB, CONFIG_RISCV_ISA_ZBB)
+	ALTERNATIVE("nop", "j strlen_zbb", 0, RISCV_ISA_EXT_ZBB, CONFIG_RISCV_ISA_ZBB_ALT)
 
 	/*
 	 * Returns
@@ -33,7 +33,7 @@ SYM_FUNC_START(strlen)
 /*
  * Variant of strlen using the ZBB extension if available
  */
-#ifdef CONFIG_RISCV_ISA_ZBB
+#ifdef CONFIG_RISCV_ISA_ZBB_ALT
 strlen_zbb:
 
 #ifdef CONFIG_CPU_BIG_ENDIAN
diff --git a/arch/riscv/lib/strncmp.S b/arch/riscv/lib/strncmp.S
index aba5b3148621..aeed830804d7 100644
--- a/arch/riscv/lib/strncmp.S
+++ b/arch/riscv/lib/strncmp.S
@@ -8,7 +8,7 @@
 /* int strncmp(const char *cs, const char *ct, size_t count) */
 SYM_FUNC_START(strncmp)
 
-	ALTERNATIVE("nop", "j strncmp_zbb", 0, RISCV_ISA_EXT_ZBB, CONFIG_RISCV_ISA_ZBB)
+	ALTERNATIVE("nop", "j strncmp_zbb", 0, RISCV_ISA_EXT_ZBB, CONFIG_RISCV_ISA_ZBB_ALT)
 
 	/*
 	 * Returns
@@ -46,7 +46,7 @@ SYM_FUNC_START(strncmp)
 /*
  * Variant of strncmp using the ZBB extension if available
  */
-#ifdef CONFIG_RISCV_ISA_ZBB
+#ifdef CONFIG_RISCV_ISA_ZBB_ALT
 strncmp_zbb:
 
 .option push
-- 
2.43.0