Received: by 2002:a6b:fb09:0:0:0:0:0 with SMTP id h9csp1576473iog; Tue, 14 Jun 2022 08:43:37 -0700 (PDT) X-Google-Smtp-Source: AGRyM1u5M7oaQjWySOrrj8TE6Rfg4VwsX0fik6M1s6wGTGcAnBHcZCs7X8nzgRLpbObzkB0GfnF+ X-Received: by 2002:a17:90b:3b45:b0:1e8:8688:219d with SMTP id ot5-20020a17090b3b4500b001e88688219dmr5155419pjb.231.1655221417583; Tue, 14 Jun 2022 08:43:37 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1655221417; cv=none; d=google.com; s=arc-20160816; b=ISofpKO53XssRg2hBMYFqb/W59Xqsz9E7KX7PuhOJ9rFfTwmV4Tin+kIo+MltOdGCp DMoXt8Kpxf7ZZxi5j9NQoaCQ4Av8rqHwlEhjqm3V41hOzeux23KffggpRXts7AVjTVA6 XG/BRjwBt3SJC9TOD7VvO4Doo9W57/RvH3HsMN/LqfaWDijyTL+mIuX5NeEgw9hNCsdY 4hwMSokvIBhjsLMwWFEDXSDCSjOcjl9rIKyK+f3VDZNDXFQq/gslUBcl7feFJE2BbB5r IP8H3jszxtbZ0aNu6Ti+LWT/0bLsF2PslCwA7er1Ely1E8m9+jWkrnIGktD+PcGJk2To Q//A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=kQM8S+pM1PYYc1VPdpiDIud+0asVPfRB4C7ECP+ubNc=; b=MI39JVogtt8X+U7Ygj4Dd7PSjp3c42Y0yfBN8A3vTxyztecXf95UVD9DexnYyqv0AN 2tqIni8LCqKdJupNcGlQeyOh1vF62QfkLknNRJ0xN9ppd6zy0Bb8gRWsH1yxRg2IT57X j0wMVq8ZGDIGNQ1tkashPWuM/lQbaE5CP8x+r8OS4VF3Xzh4NkZ/YAAFba7EDeA3PBkF 6LJnuCsdv22uQD9E47YXfNKAPOSsE6yg9CYFgCG8BCrkzZhWj9qJDJtaWl+L8AWMHv6E iLUszhh3RIcEBB0w2zJutc9v8c9sjNYeYUehWwD/2AjYVIvU6S88FLOdwiqjDGa4JSOL dFSQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@atishpatra.org header.s=google header.b=R5goSh56; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id o23-20020a17090a421700b001e2bf2a53ccsi12080490pjg.72.2022.06.14.08.43.25; Tue, 14 Jun 2022 08:43:37 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@atishpatra.org header.s=google header.b=R5goSh56; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1349086AbiFNPfC (ORCPT + 99 others); Tue, 14 Jun 2022 11:35:02 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:42668 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S243618AbiFNPfB (ORCPT ); Tue, 14 Jun 2022 11:35:01 -0400 Received: from mail-yb1-xb30.google.com (mail-yb1-xb30.google.com [IPv6:2607:f8b0:4864:20::b30]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 14A2011466 for ; Tue, 14 Jun 2022 08:35:00 -0700 (PDT) Received: by mail-yb1-xb30.google.com with SMTP id e184so15763932ybf.8 for ; Tue, 14 Jun 2022 08:35:00 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=atishpatra.org; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=kQM8S+pM1PYYc1VPdpiDIud+0asVPfRB4C7ECP+ubNc=; b=R5goSh56V1xQ9hqhvWR/MPKlDpaaNS2GcHKsmq3WZSacQvw9elUD191K+AdvWMOCUA UyU0xEbVT/NYPzi22NeyYfKtf7Gw3+zCa7AJFPVDr06gCH6K10Zza9gxo0P8nFyqcmXe EkqwWwssXf8sc8T2HD0Tvu9qMLg877yRFh494= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=kQM8S+pM1PYYc1VPdpiDIud+0asVPfRB4C7ECP+ubNc=; b=K5ei02WKM8DppnT7uWO2L5H2jIGmM/idRV6lUDXwvzUG4rg8wtKo9LHzNvrHQ3/lTL VeLKmghdFtCKkuiHQFrx05jRAkHTuwf0OAIz4KYQXApKO7QYhZ0VEmYTenRjvjlISAfe JKvWasBwQktXlXZMhB/hpvzkflyK/qBHY3CL5mow5C6HCeMkRMczA66DhKptZIvPjfvU 7aQ22hsVO3fm/aVSm+KDDtJBNI3XPJ3U5hygnTzICsAaaHVtjpESU/4r5DD7cikm+sfF ZlS67XUk+LX7SqqNPZdL9l4YS4CVsaeZP5KzUIMJTsxUtONcgEOM1SzVA6CUe8DuE9aP MwIA== X-Gm-Message-State: AJIora94gfnU6+BBsl61kcPTy3LapXUBqsoitMMS+hrk9KuW1Zny6eLm FNAR40a0+99u6k4Ha5gvRRUOohQF5NlEejX7Los0 X-Received: by 2002:a05:6902:729:b0:660:77c3:d81a with SMTP id l9-20020a056902072900b0066077c3d81amr6002603ybt.473.1655220899030; Tue, 14 Jun 2022 08:34:59 -0700 (PDT) MIME-Version: 1.0 References: <20220602141032.169907-1-christoph.muellner@vrull.io> In-Reply-To: <20220602141032.169907-1-christoph.muellner@vrull.io> From: Atish Patra Date: Tue, 14 Jun 2022 08:34:48 -0700 Message-ID: Subject: Re: [RFC PATCH] riscv: Add Zawrs support for spinlocks To: Christoph Muellner Cc: Palmer Dabbelt , Paul Walmsley , Albert Ou , Christoph Muellner , Heiko Stuebner , Philipp Tomsich , Aaron Durbin , linux-riscv , "linux-kernel@vger.kernel.org List" Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Jun 2, 2022 at 7:11 AM Christoph Muellner wrote: > > The current RISC-V code uses the generic ticket lock implementation, > that calls the macros smp_cond_load_relaxed() and smp_cond_load_acquire(). > Currently, RISC-V uses the generic implementation of these macros. > This patch introduces a RISC-V specific implementation, of these > macros, that peels off the first loop iteration and modifies the waiting > loop such, that it is possible to use the WRS instruction of the Zawrs > ISA extension to stall the CPU. > > The resulting implementation of smp_cond_load_*() will only work for > 32-bit or 64-bit types for RV64 and 32-bit types for RV32. > This is caused by the restrictions of the LR instruction (RISC-V only > has LR.W and LR.D). Compiler assertions guard this new restriction. > > This patch uses the existing RISC-V ISA extension framework > to detect the presents of Zawrs at run-time. > If available a NOP instruction will be replaced by WRS. > A similar patch could add support for the PAUSE instruction of > the Zihintpause ISA extension. > FYI..The initial Zihintpause support patch https://www.spinics.net/lists/kernel/msg4380326.html > The whole mechanism is gated by Kconfig setting, which defaults to Y. > > The Zawrs specification can be found here: > https://github.com/riscv/riscv-zawrs/blob/main/zawrs.adoc > > Note, that the Zawrs extension is not frozen or ratified yet. > Therefore this patch is an RFC and not intended to get merged. > > Signed-off-by: Christoph Muellner > --- > arch/riscv/Kconfig | 10 +++ > arch/riscv/include/asm/barrier.h | 97 ++++++++++++++++++++++++++++ > arch/riscv/include/asm/errata_list.h | 12 +++- > arch/riscv/include/asm/hwcap.h | 3 +- > arch/riscv/kernel/cpu.c | 1 + > arch/riscv/kernel/cpufeature.c | 13 ++++ > 6 files changed, 133 insertions(+), 3 deletions(-) > > diff --git a/arch/riscv/Kconfig b/arch/riscv/Kconfig > index 905e550e0fd3..054872317d4a 100644 > --- a/arch/riscv/Kconfig > +++ b/arch/riscv/Kconfig > @@ -358,6 +358,16 @@ config RISCV_ISA_C > > If you don't know what to do here, say Y. > > +config RISCV_ISA_ZAWRS > + bool "Zawrs extension support" > + select RISCV_ALTERNATIVE > + default y > + help > + Adds support to dynamically detect the presence of the Zawrs extension > + (wait for reservation set) and enable its usage. > + > + If you don't know what to do here, say Y. > + > config RISCV_ISA_SVPBMT > bool "SVPBMT extension support" > depends on 64BIT && MMU > diff --git a/arch/riscv/include/asm/barrier.h b/arch/riscv/include/asm/barrier.h > index d0e24aaa2aa0..69b8f1f4b80c 100644 > --- a/arch/riscv/include/asm/barrier.h > +++ b/arch/riscv/include/asm/barrier.h > @@ -12,6 +12,8 @@ > > #ifndef __ASSEMBLY__ > > +#include > + > #define nop() __asm__ __volatile__ ("nop") > > #define RISCV_FENCE(p, s) \ > @@ -42,6 +44,69 @@ do { \ > ___p1; \ > }) > > +#if __riscv_xlen == 64 > + > +#define __riscv_lrsc_word(t) \ > + (sizeof(t) == sizeof(int) || \ > + sizeof(t) == sizeof(long)) > + > +#define __riscv_lr(ptr) \ > + sizeof(*ptr) == sizeof(int) ? "lr.w" : "lr.d" > + > +#elif __riscv_xlen == 32 > + > +#define __riscv_lrsc_word(ptr) \ > + (sizeof(*ptr) == sizeof(int)) > + > +#define __riscv_lr(t) "lr.w" > + > +#else > +#error "Unexpected __riscv_xlen" > +#endif /* __riscv_xlen */ > + > +#define compiletime_assert_atomic_lrsc_type(t) \ > + compiletime_assert(__riscv_lrsc_word(t), \ > + "Need type compatible with LR/SC instructions.") > + > +#define ___smp_load_reservedN(pfx, ptr) \ > +({ \ > + typeof(*ptr) ___p1; \ > + __asm__ __volatile__ ("lr." pfx " %[p], %[c]\n" \ > + : [p]"=&r" (___p1), [c]"+A"(*ptr)); \ > + ___p1; \ > +}) > + > +#define ___smp_load_reserved32(ptr) \ > + ___smp_load_reservedN("w", ptr) > + > +#define ___smp_load_reserved64(ptr) \ > + ___smp_load_reservedN("d", ptr) > + > +#define __smp_load_reserved_relaxed(ptr) \ > +({ \ > + typeof(*ptr) ___p1; \ > + compiletime_assert_atomic_lrsc_type(*ptr); \ > + if (sizeof(*ptr) == 32) { \ > + ___p1 = ___smp_load_reserved32(ptr); \ > + } else { \ > + ___p1 = ___smp_load_reserved64(ptr); \ > + } \ > + ___p1; \ > +}) > + > +#define __smp_load_reserved_acquire(ptr) \ > +({ \ > + typeof(*ptr) ___p1; \ > + compiletime_assert_atomic_lrsc_type(*ptr); \ > + if (sizeof(*ptr) == 32) { \ > + ___p1 = ___smp_load_reserved32(ptr); \ > + } else { \ > + ___p1 = ___smp_load_reserved64(ptr); \ > + } \ > + RISCV_FENCE(r,rw); \ > + ___p1; \ > +}) > + > /* > * This is a very specific barrier: it's currently only used in two places in > * the kernel, both in the scheduler. See include/linux/spinlock.h for the two > @@ -69,6 +134,38 @@ do { \ > */ > #define smp_mb__after_spinlock() RISCV_FENCE(iorw,iorw) > > +#define smp_cond_load_relaxed(ptr, cond_expr) \ > +({ \ > + typeof(ptr) __PTR = (ptr); \ > + __unqual_scalar_typeof(*ptr) VAL; \ > + VAL = READ_ONCE(*__PTR); \ > + if (!cond_expr) { \ > + for (;;) { \ > + VAL = __smp_load_reserved_relaxed(__PTR); \ > + if (cond_expr) \ > + break; \ > + ALT_WRS(); \ > + } \ > + } \ > + (typeof(*ptr))VAL; \ > +}) > + > +#define smp_cond_load_acquire(ptr, cond_expr) \ > +({ \ > + typeof(ptr) __PTR = (ptr); \ > + __unqual_scalar_typeof(*ptr) VAL; \ > + VAL = smp_load_acquire(__PTR); \ > + if (!cond_expr) { \ > + for (;;) { \ > + VAL = __smp_load_reserved_acquire(__PTR); \ > + if (cond_expr) \ > + break; \ > + ALT_WRS(); \ > + } \ > + } \ > + (typeof(*ptr))VAL; \ > +}) > + > #include > > #endif /* __ASSEMBLY__ */ > diff --git a/arch/riscv/include/asm/errata_list.h b/arch/riscv/include/asm/errata_list.h > index 9e2888dbb5b1..b9aa0b346493 100644 > --- a/arch/riscv/include/asm/errata_list.h > +++ b/arch/riscv/include/asm/errata_list.h > @@ -19,8 +19,9 @@ > #define ERRATA_THEAD_NUMBER 1 > #endif > > -#define CPUFEATURE_SVPBMT 0 > -#define CPUFEATURE_NUMBER 1 > +#define CPUFEATURE_ZAWRS 0 > +#define CPUFEATURE_SVPBMT 1 > +#define CPUFEATURE_NUMBER 2 > > #ifdef __ASSEMBLY__ > > @@ -42,6 +43,13 @@ asm(ALTERNATIVE("sfence.vma %0", "sfence.vma", SIFIVE_VENDOR_ID, \ > ERRATA_SIFIVE_CIP_1200, CONFIG_ERRATA_SIFIVE_CIP_1200) \ > : : "r" (addr) : "memory") > > +#define ZAWRS_WRS ".long 0x1000073" > +#define ALT_WRS() \ > +asm volatile(ALTERNATIVE( \ > + "nop\n\t", \ > + ZAWRS_WRS "\n\t", \ > + 0, CPUFEATURE_ZAWRS, CONFIG_RISCV_ISA_ZAWRS)) > + > /* > * _val is marked as "will be overwritten", so need to set it to 0 > * in the default case. > diff --git a/arch/riscv/include/asm/hwcap.h b/arch/riscv/include/asm/hwcap.h > index 4e2486881840..c7dd8cc38bec 100644 > --- a/arch/riscv/include/asm/hwcap.h > +++ b/arch/riscv/include/asm/hwcap.h > @@ -51,7 +51,8 @@ extern unsigned long elf_hwcap; > * available logical extension id. > */ > enum riscv_isa_ext_id { > - RISCV_ISA_EXT_SSCOFPMF = RISCV_ISA_EXT_BASE, > + RISCV_ISA_EXT_ZAWRS = RISCV_ISA_EXT_BASE, > + RISCV_ISA_EXT_SSCOFPMF, > RISCV_ISA_EXT_SVPBMT, > RISCV_ISA_EXT_ID_MAX = RISCV_ISA_EXT_MAX, > }; > diff --git a/arch/riscv/kernel/cpu.c b/arch/riscv/kernel/cpu.c > index fba9e9f46a8c..6c3a10ff5358 100644 > --- a/arch/riscv/kernel/cpu.c > +++ b/arch/riscv/kernel/cpu.c > @@ -87,6 +87,7 @@ int riscv_of_parent_hartid(struct device_node *node) > * extensions by an underscore. > */ > static struct riscv_isa_ext_data isa_ext_arr[] = { > + __RISCV_ISA_EXT_DATA(zawrs, RISCV_ISA_EXT_ZAWRS), > __RISCV_ISA_EXT_DATA(sscofpmf, RISCV_ISA_EXT_SSCOFPMF), > __RISCV_ISA_EXT_DATA(svpbmt, RISCV_ISA_EXT_SVPBMT), > __RISCV_ISA_EXT_DATA("", RISCV_ISA_EXT_MAX), > diff --git a/arch/riscv/kernel/cpufeature.c b/arch/riscv/kernel/cpufeature.c > index dea3ea19deee..fc2c47a1784b 100644 > --- a/arch/riscv/kernel/cpufeature.c > +++ b/arch/riscv/kernel/cpufeature.c > @@ -199,6 +199,7 @@ void __init riscv_fill_hwcap(void) > } else { > SET_ISA_EXT_MAP("sscofpmf", RISCV_ISA_EXT_SSCOFPMF); > SET_ISA_EXT_MAP("svpbmt", RISCV_ISA_EXT_SVPBMT); > + SET_ISA_EXT_MAP("zawrs", RISCV_ISA_EXT_ZAWRS); > } > #undef SET_ISA_EXT_MAP > } > @@ -251,6 +252,14 @@ struct cpufeature_info { > bool (*check_func)(unsigned int stage); > }; > > +static bool __init_or_module cpufeature_zawrs_check_func(unsigned int stage) > +{ > + if (stage == RISCV_ALTERNATIVES_EARLY_BOOT) > + return false; > + > + return riscv_isa_extension_available(NULL, ZAWRS); > +} > + > static bool __init_or_module cpufeature_svpbmt_check_func(unsigned int stage) > { > #ifdef CONFIG_RISCV_ISA_SVPBMT > @@ -267,6 +276,10 @@ static bool __init_or_module cpufeature_svpbmt_check_func(unsigned int stage) > > static const struct cpufeature_info __initdata_or_module > cpufeature_list[CPUFEATURE_NUMBER] = { > + { > + .name = "zawrs", > + .check_func = cpufeature_zawrs_check_func > + }, > { > .name = "svpbmt", > .check_func = cpufeature_svpbmt_check_func > -- > 2.35.3 > > > _______________________________________________ > linux-riscv mailing list > linux-riscv@lists.infradead.org > http://lists.infradead.org/mailman/listinfo/linux-riscv -- Regards, Atish