Received: by 2002:a05:6a10:9848:0:0:0:0 with SMTP id x8csp295526pxf; Wed, 24 Mar 2021 05:29:43 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzMSwUmmn+xmC3+rzd1bgk4XX2po1aYRtWjHm4OcWkMlfFItv0vyj6DpoEVIL4yQu3HWgBP X-Received: by 2002:a50:fe06:: with SMTP id f6mr3177208edt.349.1616588982857; Wed, 24 Mar 2021 05:29:42 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1616588982; cv=none; d=google.com; s=arc-20160816; b=s73wyW/qssdn3LoOHhKSZAMDaS/JglileiIUTbsd8PDk0bQglpgo4bq9h0aZakoMKy IWqIIe5MrFubPkQCpFYcv5rQs+iCo/mD2bcOwZmEaVs1rB841a92W6vBhLoIQtH5NDWB BJNa8CI9PbJ4jno1VWu3ibHn194k9jeTjzTS0pbqCaXReSl5b0bvT3Nc6NV9MwiRVNQ7 hUJgZJvHptaS27+fs77DE3InWDwwQPoiRrCHxN0m3EzWCjCQ8exhycOmli657l1LVQfX h2je4xzFT97LeEn1laSJjMUI1cGasr/HpNB1de4EUlsY5bvDqs7sNsMeoTYLVBhVLuBK WYJw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=esvyG+mFXtt6T3mRAftaKzFIiWR03MFVnUAwlEUKKks=; b=Ofg+QdiinCGl/uPlwsNvNSNPsMABSVsory2PRtoEVxjLru/IulxMUdG1sXRB3YJks/ Ag3WRnBe+S9yBeQxqpl0F5KiQjQJrkfmh+qoC8KYsPdjpOp30IDCcBulLVBmQ+f09QBg 7IJOSbdEqH47vIuzFGPDDUuM7dCGLX3arOH6R8TCFSwl+og1BdBipxLMGmpvvooR0NBD NxVwxpiWBj22TrfavaR1C3MmtZpPDC0tGV+aDMMVcCBcrHV4ySXeMoU53tCc+/DlS8yl gx+DGjz6A/OTVGbwX0FyB9tXH8F2kLHBcg+JCxG811Lk24qrEipriEg+hZao9ca27hNA cOiQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=QdaYQcKc; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id ly13si1664518ejb.649.2021.03.24.05.29.18; Wed, 24 Mar 2021 05:29:42 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=QdaYQcKc; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234072AbhCXMZS (ORCPT + 99 others); Wed, 24 Mar 2021 08:25:18 -0400 Received: from mail.kernel.org ([198.145.29.99]:36160 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233734AbhCXMYt (ORCPT ); Wed, 24 Mar 2021 08:24:49 -0400 Received: by mail.kernel.org (Postfix) with ESMTPSA id 44F0061A18 for ; Wed, 24 Mar 2021 12:24:48 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1616588688; bh=BXm3zkQ5Yjf6l+F8X1T+BW05dzKb+rl9rVFAsGKCTFA=; h=References:In-Reply-To:From:Date:Subject:To:Cc:From; b=QdaYQcKcth121k/VXNlsXTvdxZYWbwhTp8vVog49HmChmpdvW9xiKLNxzoxTPF5rw kzollqGyldN5oC1Gb0wWA4J7Aljydph9xQUzwCrLn7gBUx0EcmlJHBY5r3Y/d1FXLo AXkAFiqmlli0XfneVfameVehoEDOtWpZR6CCM5ceqWSmTiHu/X4N5bvl1ALAsc4vdb insAonEzCM3bU1IAlXWf51vgHRxMFFgqeHplKNucz7dYMhG51qEf61mcFt/SP6coX/ lj6jY8OBYvRj00liXjcfWPT4x0qwWzqwqjhxk/KnFiUYNzgKeYq0UGrdtvFFBnUHhs ZtlpLBPl/7Ntw== Received: by mail-lf1-f50.google.com with SMTP id o126so22139403lfa.0 for ; Wed, 24 Mar 2021 05:24:48 -0700 (PDT) X-Gm-Message-State: AOAM530Xl31HKX0CDk5o2VUAmTxM9IsVzSJYzio/LBzUevhMUvh5tmPI LRcBrX1rYXJiCyz1STzSI53UMkLAFwgM3IPhhxg= X-Received: by 2002:a05:6512:3d1c:: with SMTP id d28mr1742491lfv.41.1616588686431; Wed, 24 Mar 2021 05:24:46 -0700 (PDT) MIME-Version: 1.0 References: <1616580892-80815-1-git-send-email-guoren@kernel.org> In-Reply-To: From: Guo Ren Date: Wed, 24 Mar 2021 20:24:34 +0800 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: [PATCH] riscv: locks: introduce ticket-based spinlock implementation To: Vitaly Wool Cc: linux-riscv , LKML , Guo Ren , Catalin Marinas , Will Deacon , Peter Zijlstra , Palmer Dabbelt , Anup Patel , Arnd Bergmann Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Mar 24, 2021 at 7:16 PM Vitaly Wool wrote: > > > > On Wed, Mar 24, 2021, 11:16 AM wrote: >> >> From: Guo Ren >> >> This patch introduces a ticket lock implementation for riscv, along the >> same lines as the implementation for arch/arm & arch/csky. > > > Could you please provide a rationale for this? Like, what is wrong with the current implementation. Ticket based spinlock's principle is here: https://lwn.net/Articles/267968/ Current implementation will cause cache line bouncing when many harts are acquiring the same spinlock. I'm seeking a solution, maybe not fitting the current RISC-V base ISA. I'll add more comments in the next version of patch. > > Thanks in advance, > > Best regards, > Vitaly >> >> >> Signed-off-by: Guo Ren >> Cc: Catalin Marinas >> Cc: Will Deacon >> Cc: Peter Zijlstra >> Cc: Palmer Dabbelt >> Cc: Anup Patel >> Cc: Arnd Bergmann >> --- >> arch/riscv/Kconfig | 1 + >> arch/riscv/include/asm/Kbuild | 1 + >> arch/riscv/include/asm/spinlock.h | 158 ++++++++++++-------------------- >> arch/riscv/include/asm/spinlock_types.h | 19 ++-- >> 4 files changed, 74 insertions(+), 105 deletions(-) >> >> diff --git a/arch/riscv/Kconfig b/arch/riscv/Kconfig >> index 87d7b52..7c56a20 100644 >> --- a/arch/riscv/Kconfig >> +++ b/arch/riscv/Kconfig >> @@ -30,6 +30,7 @@ config RISCV >> select ARCH_HAS_STRICT_KERNEL_RWX if MMU >> select ARCH_OPTIONAL_KERNEL_RWX if ARCH_HAS_STRICT_KERNEL_RWX >> select ARCH_OPTIONAL_KERNEL_RWX_DEFAULT >> + select ARCH_USE_QUEUED_RWLOCKS >> select ARCH_WANT_DEFAULT_TOPDOWN_MMAP_LAYOUT if MMU >> select ARCH_WANT_FRAME_POINTERS >> select ARCH_WANT_HUGE_PMD_SHARE if 64BIT >> diff --git a/arch/riscv/include/asm/Kbuild b/arch/riscv/include/asm/Kbuild >> index 445ccc9..e57ef80 100644 >> --- a/arch/riscv/include/asm/Kbuild >> +++ b/arch/riscv/include/asm/Kbuild >> @@ -3,5 +3,6 @@ generic-y += early_ioremap.h >> generic-y += extable.h >> generic-y += flat.h >> generic-y += kvm_para.h >> +generic-y += qrwlock.h >> generic-y += user.h >> generic-y += vmlinux.lds.h >> diff --git a/arch/riscv/include/asm/spinlock.h b/arch/riscv/include/asm/spinlock.h >> index f4f7fa1..2c81764 100644 >> --- a/arch/riscv/include/asm/spinlock.h >> +++ b/arch/riscv/include/asm/spinlock.h >> @@ -7,129 +7,91 @@ >> #ifndef _ASM_RISCV_SPINLOCK_H >> #define _ASM_RISCV_SPINLOCK_H >> >> -#include >> -#include >> -#include >> - >> /* >> - * Simple spin lock operations. These provide no fairness guarantees. >> + * Ticket-based spin-locking. >> */ >> +static inline void arch_spin_lock(arch_spinlock_t *lock) >> +{ >> + arch_spinlock_t lockval; >> + u32 tmp; >> + >> + asm volatile ( >> + "1: lr.w %0, %2 \n" >> + " mv %1, %0 \n" >> + " addw %0, %0, %3 \n" >> + " sc.w %0, %0, %2 \n" >> + " bnez %0, 1b \n" >> + : "=&r" (tmp), "=&r" (lockval), "+A" (lock->lock) >> + : "r" (1 << TICKET_NEXT) >> + : "memory"); >> >> -/* FIXME: Replace this with a ticket lock, like MIPS. */ >> - >> -#define arch_spin_is_locked(x) (READ_ONCE((x)->lock) != 0) >> + while (lockval.tickets.next != lockval.tickets.owner) { >> + /* >> + * FIXME - we need wfi/wfe here to prevent: >> + * - cache line bouncing >> + * - saving cpu pipeline in multi-harts-per-core >> + * processor >> + */ >> + lockval.tickets.owner = READ_ONCE(lock->tickets.owner); >> + } >> >> -static inline void arch_spin_unlock(arch_spinlock_t *lock) >> -{ >> - smp_store_release(&lock->lock, 0); >> + __atomic_acquire_fence(); >> } >> >> static inline int arch_spin_trylock(arch_spinlock_t *lock) >> { >> - int tmp = 1, busy; >> - >> - __asm__ __volatile__ ( >> - " amoswap.w %0, %2, %1\n" >> - RISCV_ACQUIRE_BARRIER >> - : "=r" (busy), "+A" (lock->lock) >> - : "r" (tmp) >> + u32 tmp, contended, res; >> + >> + do { >> + asm volatile ( >> + " lr.w %0, %3 \n" >> + " srliw %1, %0, %5 \n" >> + " slliw %2, %0, %5 \n" >> + " or %1, %2, %1 \n" >> + " li %2, 0 \n" >> + " sub %1, %1, %0 \n" >> + " bnez %1, 1f \n" >> + " addw %0, %0, %4 \n" >> + " sc.w %2, %0, %3 \n" >> + "1: \n" >> + : "=&r" (tmp), "=&r" (contended), "=&r" (res), >> + "+A" (lock->lock) >> + : "r" (1 << TICKET_NEXT), "I" (TICKET_NEXT) >> : "memory"); >> + } while (res); >> >> - return !busy; >> -} >> - >> -static inline void arch_spin_lock(arch_spinlock_t *lock) >> -{ >> - while (1) { >> - if (arch_spin_is_locked(lock)) >> - continue; >> - >> - if (arch_spin_trylock(lock)) >> - break; >> + if (!contended) { >> + __atomic_acquire_fence(); >> + return 1; >> + } else { >> + return 0; >> } >> } >> >> -/***********************************************************/ >> - >> -static inline void arch_read_lock(arch_rwlock_t *lock) >> +static inline void arch_spin_unlock(arch_spinlock_t *lock) >> { >> - int tmp; >> - >> - __asm__ __volatile__( >> - "1: lr.w %1, %0\n" >> - " bltz %1, 1b\n" >> - " addi %1, %1, 1\n" >> - " sc.w %1, %1, %0\n" >> - " bnez %1, 1b\n" >> - RISCV_ACQUIRE_BARRIER >> - : "+A" (lock->lock), "=&r" (tmp) >> - :: "memory"); >> + smp_store_release(&lock->tickets.owner, lock->tickets.owner + 1); >> + /* FIXME - we need ipi/sev here to notify above */ >> } >> >> -static inline void arch_write_lock(arch_rwlock_t *lock) >> +static inline int arch_spin_value_unlocked(arch_spinlock_t lock) >> { >> - int tmp; >> - >> - __asm__ __volatile__( >> - "1: lr.w %1, %0\n" >> - " bnez %1, 1b\n" >> - " li %1, -1\n" >> - " sc.w %1, %1, %0\n" >> - " bnez %1, 1b\n" >> - RISCV_ACQUIRE_BARRIER >> - : "+A" (lock->lock), "=&r" (tmp) >> - :: "memory"); >> + return lock.tickets.owner == lock.tickets.next; >> } >> >> -static inline int arch_read_trylock(arch_rwlock_t *lock) >> +static inline int arch_spin_is_locked(arch_spinlock_t *lock) >> { >> - int busy; >> - >> - __asm__ __volatile__( >> - "1: lr.w %1, %0\n" >> - " bltz %1, 1f\n" >> - " addi %1, %1, 1\n" >> - " sc.w %1, %1, %0\n" >> - " bnez %1, 1b\n" >> - RISCV_ACQUIRE_BARRIER >> - "1:\n" >> - : "+A" (lock->lock), "=&r" (busy) >> - :: "memory"); >> - >> - return !busy; >> + return !arch_spin_value_unlocked(READ_ONCE(*lock)); >> } >> >> -static inline int arch_write_trylock(arch_rwlock_t *lock) >> +static inline int arch_spin_is_contended(arch_spinlock_t *lock) >> { >> - int busy; >> - >> - __asm__ __volatile__( >> - "1: lr.w %1, %0\n" >> - " bnez %1, 1f\n" >> - " li %1, -1\n" >> - " sc.w %1, %1, %0\n" >> - " bnez %1, 1b\n" >> - RISCV_ACQUIRE_BARRIER >> - "1:\n" >> - : "+A" (lock->lock), "=&r" (busy) >> - :: "memory"); >> + struct __raw_tickets tickets = READ_ONCE(lock->tickets); >> >> - return !busy; >> + return (tickets.next - tickets.owner) > 1; >> } >> +#define arch_spin_is_contended arch_spin_is_contended >> >> -static inline void arch_read_unlock(arch_rwlock_t *lock) >> -{ >> - __asm__ __volatile__( >> - RISCV_RELEASE_BARRIER >> - " amoadd.w x0, %1, %0\n" >> - : "+A" (lock->lock) >> - : "r" (-1) >> - : "memory"); >> -} >> - >> -static inline void arch_write_unlock(arch_rwlock_t *lock) >> -{ >> - smp_store_release(&lock->lock, 0); >> -} >> +#include >> >> #endif /* _ASM_RISCV_SPINLOCK_H */ >> diff --git a/arch/riscv/include/asm/spinlock_types.h b/arch/riscv/include/asm/spinlock_types.h >> index f398e76..d7b38bf 100644 >> --- a/arch/riscv/include/asm/spinlock_types.h >> +++ b/arch/riscv/include/asm/spinlock_types.h >> @@ -10,16 +10,21 @@ >> # error "please don't include this file directly" >> #endif >> >> +#define TICKET_NEXT 16 >> + >> typedef struct { >> - volatile unsigned int lock; >> + union { >> + u32 lock; >> + struct __raw_tickets { >> + /* little endian */ >> + u16 owner; >> + u16 next; >> + } tickets; >> + }; >> } arch_spinlock_t; >> >> -#define __ARCH_SPIN_LOCK_UNLOCKED { 0 } >> - >> -typedef struct { >> - volatile unsigned int lock; >> -} arch_rwlock_t; >> +#define __ARCH_SPIN_LOCK_UNLOCKED { { 0 } } >> >> -#define __ARCH_RW_LOCK_UNLOCKED { 0 } >> +#include >> >> #endif /* _ASM_RISCV_SPINLOCK_TYPES_H */ >> -- >> 2.7.4 >> >> >> _______________________________________________ >> linux-riscv mailing list >> linux-riscv@lists.infradead.org >> http://lists.infradead.org/mailman/listinfo/linux-riscv -- Best Regards Guo Ren ML: https://lore.kernel.org/linux-csky/