Received: by 2002:a05:6a10:eb17:0:0:0:0 with SMTP id hx23csp1157629pxb; Thu, 9 Sep 2021 22:40:19 -0700 (PDT) X-Google-Smtp-Source: ABdhPJza1Sp0NZ5waHdfQCkwbuS6gkr43Ohg6H4PB0W0r/u+xggz8bs0QoiLeLvUZyyYpF38DWmP X-Received: by 2002:a02:9695:: with SMTP id w21mr3065210jai.64.1631252418922; Thu, 09 Sep 2021 22:40:18 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1631252418; cv=none; d=google.com; s=arc-20160816; b=P4aincaY9d65c0yLqGBIzc/fS3GdhmuSNMPLOLxEIc0EzzSK1gZgeI3ET+kqu4mwYt p9IclF4a+SDJ9820oCn8dwZ5r4G2yUn+3UviWNXeL5oN7yNZ71ct6XSQPCWFoQI5ujAT AWgpWh2S7hXAlJaYJd2I9XI+7A1rNbHBvcHFmz0m1Rsp6i5pNXsVACoq6XZAFUlD3gHU JLVcoczpyWq2buT7ty+zI0e38YQYIZYTy55LnYW8hTTOm0x75KUGsHHDZhJfs43mkTqv AjSAcS6L9oFG0tXmUEe6D1zy3nzqYkqmo/07v3IQpAnhAr9MjTnf8YrmpC6A3ivB91rn 4SWQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=WVOJy0zbM+TdEltsyqj+YAqbFTvB32dZ/mFRQ4t8D7g=; b=ObeEcjXwyUkQzkiAgYRcZSc5dPesFGVovn4NwXs9ofZi1rjzlKHRJEqNAI5xTU2O+s ECKl3oWvekaEomc6IdHun877BmUCEAlvSdcCGeL5qUj6l7Tp0h/l1YXZdYvysAXgENiN gogJIP+PD4Ar4F5CMDqM9FiSy3X8Qicm+5uzg82bVmX/F45dZVRoy8oUmEFwLXSve9Kp aXpkFrtoOa3Nz4hyjwnL4z66DgIgJj5yCfHCkcZcLFknF8qS8Xv/p/tUvVuIeVMy7XrR g4Nt5b/uGF7186juc78s7vMWVrtvaIoTGTTs8sF5CutLHmdjojCOqcyioPppEmjBZIUp tJbg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=p3XMS611; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id b60si4409359jai.29.2021.09.09.22.40.07; Thu, 09 Sep 2021 22:40:18 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=p3XMS611; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230417AbhIJFki (ORCPT + 99 others); Fri, 10 Sep 2021 01:40:38 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:52048 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230037AbhIJFki (ORCPT ); Fri, 10 Sep 2021 01:40:38 -0400 Received: from mail-il1-x12c.google.com (mail-il1-x12c.google.com [IPv6:2607:f8b0:4864:20::12c]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id A44C7C061574; Thu, 9 Sep 2021 22:39:27 -0700 (PDT) Received: by mail-il1-x12c.google.com with SMTP id w1so897955ilv.1; Thu, 09 Sep 2021 22:39:27 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to; bh=WVOJy0zbM+TdEltsyqj+YAqbFTvB32dZ/mFRQ4t8D7g=; b=p3XMS6111m2OC6pSC/4tmzqZctg3cE8apucLT+k4YUtJ6dfBjzyTVR0BLnpFPv5e9R +jUrTcb0BOs5q2H7lsFYljgQIJsdvAqskP20bQji7aYqX05QFA5NWRJCddtm1ikJ5sWw GiERESp9PaoPla5g9dsG4rWccoQmAi+xoom7DLYpVeu9PrwIu2ub3wPxTi+7Y0d6ukpK d2i4O54+F5A8q68FFEByFK9PTNDkJRYtOH95Zk7uvfsfEy9kkV/S9FzQZQaN3OLBlYAy zj9FAhCHNm/40TY+gDqe7MpzeEkvBmw9a1jGcvshZ5xOKa4D5PmT+f7oTfhz6/vtEunE 3hOA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=WVOJy0zbM+TdEltsyqj+YAqbFTvB32dZ/mFRQ4t8D7g=; b=XRdyqLHo1olzAJjoC1GKj/g5Ph+ITVKvHuv088BdYuz/KwZo1rMn/ZZQd5/YDVO1Kb rWrPmIVmCO5wN7JjQciSprjjQE/58K3MNtdnrVvA/yTVeflde/vd55wttk/o4E73bptm 0NBf5n4mrci6wGIm3fj9Jj7YsFFVaNhEwcdVS9CboMUe/KHFemvPuKduVrEab13hcwRi G3N6isehN3J1jJKHKYNSVDNCQrGtHgQdF4SZvXz58o7yzyrdO7lVABZzWJx7XKpZPgLO qYs35AzkmwoGNIICWmt0e+RqehWumePUNrEQz7HjOIjOeiEPZzuu82qEJRNuKy1ux1Vg dCKg== X-Gm-Message-State: AOAM530gdcIm+M7WGEAGHY3cPP9Wpu5483Jz7P3L2WTcqfAT7NFHMLPV FJc7XgMmqVwc4Ysh9jhNtzo= X-Received: by 2002:a92:d9d0:: with SMTP id n16mr5038981ilq.45.1631252366825; Thu, 09 Sep 2021 22:39:26 -0700 (PDT) Received: from auth1-smtp.messagingengine.com (auth1-smtp.messagingengine.com. [66.111.4.227]) by smtp.gmail.com with ESMTPSA id z7sm2073515ilz.25.2021.09.09.22.39.24 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 09 Sep 2021 22:39:25 -0700 (PDT) Received: from compute2.internal (compute2.nyi.internal [10.202.2.42]) by mailauth.nyi.internal (Postfix) with ESMTP id 5805C27C0054; Fri, 10 Sep 2021 01:39:24 -0400 (EDT) Received: from mailfrontend1 ([10.202.2.162]) by compute2.internal (MEProxy); Fri, 10 Sep 2021 01:39:24 -0400 X-ME-Sender: X-ME-Received: X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgedvtddrudegtddgleehucetufdoteggodetrfdotf fvucfrrhhofhhilhgvmecuhfgrshhtofgrihhlpdfqfgfvpdfurfetoffkrfgpnffqhgen uceurghilhhouhhtmecufedttdenucesvcftvggtihhpihgvnhhtshculddquddttddmne cujfgurhepfffhvffukfhfgggtuggjsehttdertddttddvnecuhfhrohhmpeeuohhquhhn ucfhvghnghcuoegsohhquhhnrdhfvghnghesghhmrghilhdrtghomheqnecuggftrfgrth htvghrnhepveeijedthfeijeefudehhedvveegudegteehgffgtddvuedtveegtedvvdef gedtnecuffhomhgrihhnpehkvghrnhgvlhdrohhrghenucevlhhushhtvghrufhiiigvpe dtnecurfgrrhgrmhepmhgrihhlfhhrohhmpegsohhquhhnodhmvghsmhhtphgruhhthhhp vghrshhonhgrlhhithihqdeiledvgeehtdeigedqudejjeekheehhedvqdgsohhquhhnrd hfvghngheppehgmhgrihhlrdgtohhmsehfihigmhgvrdhnrghmvg X-ME-Proxy: Received: by mail.messagingengine.com (Postfix) with ESMTPA; Fri, 10 Sep 2021 01:39:21 -0400 (EDT) Date: Fri, 10 Sep 2021 13:37:48 +0800 From: Boqun Feng To: Dan Lustig Cc: Will Deacon , Peter Zijlstra , Linus Torvalds , Alan Stern , Alexander Shishkin , Peter Anvin , Andrea Parri , Ingo Molnar , "Paul E. McKenney" , Vince Weaver , Thomas Gleixner , Jiri Olsa , Arnaldo Carvalho de Melo , Linux Kernel Mailing List , Stephane Eranian , linux-tip-commits@vger.kernel.org, palmer@dabbelt.com, paul.walmsley@sifive.com, mpe@ellerman.id.au Subject: Re: [tip:locking/core] tools/memory-model: Add extra ordering for locks and remove it for ordinary release/acquire Message-ID: References: <20180926182920.27644-2-paulmck@linux.ibm.com> <20210908144217.GA603644@rowland.harvard.edu> <20210909133535.GA9722@willie-the-truck> <5412ab37-2979-5717-4951-6a61366df0f2@nvidia.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Sep 10, 2021 at 08:01:14AM +0800, Boqun Feng wrote: > On Thu, Sep 09, 2021 at 01:03:18PM -0400, Dan Lustig wrote: > > On 9/9/2021 9:35 AM, Will Deacon wrote: > > > [+Palmer, PaulW, Daniel and Michael] > > > > > > On Thu, Sep 09, 2021 at 09:25:30AM +0200, Peter Zijlstra wrote: > > >> On Wed, Sep 08, 2021 at 09:08:33AM -0700, Linus Torvalds wrote: > > >> > > >>> So if this is purely a RISC-V thing, > > >> > > >> Just to clarify, I think the current RISC-V thing is stonger than > > >> PowerPC, but maybe not as strong as say ARM64, but RISC-V memory > > >> ordering is still somewhat hazy to me. > > >> > > >> Specifically, the sequence: > > >> > > >> /* critical section s */ > > >> WRITE_ONCE(x, 1); > > >> FENCE RW, W > > >> WRITE_ONCE(s.lock, 0); /* store S */ > > >> AMOSWAP %0, 1, r.lock /* store R */ > > >> FENCE R, RW > > >> WRITE_ONCE(y, 1); > > >> /* critical section r */ > > >> > > >> fully separates section s from section r, as in RW->RW ordering > > >> (possibly not as strong as smp_mb() though), while on PowerPC it would > > >> only impose TSO ordering between sections. > > >> > > >> The AMOSWAP is a RmW and as such matches the W from the RW->W fence, > > >> similarly it marches the R from the R->RW fence, yielding an: > > >> > > >> RW-> W > > >> RmW > > >> R ->RW > > >> > > >> ordering. It's the stores S and R that can be re-ordered, but not the > > >> sections themselves (same on PowerPC and many others). > > >> > > >> Clarification from a RISC-V enabled person would be appreciated. > > > > To first order, RISC-V's memory model is very similar to ARMv8's. It > > is "other-multi-copy-atomic", unlike Power, and respects dependencies. > > It also has AMOs and LR/SC with optional RCsc acquire or release > > semantics. There's no need to worry about RISC-V somehow pushing the > > boundaries of weak memory ordering in new ways. > > > > The tricky part is that unlike ARMv8, RISC-V doesn't have load-acquire > > or store-release opcodes at all. Only AMOs and LR/SC have acquire or > > release options. That means that while certain operations like swap > > can be implemented with native RCsc semantics, others like store-release > > have to fall back on fences and plain writes. > > > > That's where the complexity came up last time this was discussed, at > > least as it relates to RISC-V: how to make sure the combination of RCsc > > atomics and plain operations+fences gives the semantics everyone is > > asking for here. And to be clear there, I'm not asking for LKMM to > > weaken anything about critical section ordering just for RISC-V's sake. > > TSO/RCsc ordering between critical sections is a perfectly reasonable > > model in my opinion. I just want to make sure RISC-V gets it right > > given whatever the decision is. > > > > >>> then I think it's entirely reasonable to > > >>> > > >>> spin_unlock(&r); > > >>> spin_lock(&s); > > >>> > > >>> cannot be reordered. > > >> > > >> I'm obviously completely in favour of that :-) > > > > > > I don't think we should require the accesses to the actual lockwords to > > > be ordered here, as it becomes pretty onerous for relaxed LL/SC > > > architectures where you'd end up with an extra barrier either after the > > > unlock() or before the lock() operation. However, I remain absolutely in > > > favour of strengthening the ordering of the _critical sections_ guarded by > > > the locks to be RCsc. > > > > I agree with Will here. If the AMOSWAP above is actually implemented with > > a RISC-V AMO, then the two critical sections will be separated as if RW,RW, > > as Peter described. If instead it's implemented using LR/SC, then RISC-V > > Just out of curiosity, in the following code, can the store S and load L > be reordered? > > WRITE_ONCE(x, 1); // store S > FENCE RW, W > WRITE_ONCE(s.lock, 0); // unlock(s) > AMOSWAP %0, 1, s.lock // lock(s) > FENCE R, RW > r1 = READ_ONCE(y); // load L > > I think they can, because neither "FENCE RW, W" nor "FENCE R, RW" order > them. Note that the reordering is allowed in LKMM, because unlock-lock > only need to be as strong as RCtso. > > Moreover, how about the outcome of the following case: > > { > r1, r2 are registers (variables) on each CPU, X, Y are memory > locations, and initialized as 0 > } > > CPU 0 > ===== > AMOSWAP r1, 1, X > FENCE R, RW > r2 = READ_ONCE(Y); > > CPU 1 > ===== > WRITE_ONCE(Y, 1); > FENCE RW, RW > r2 = READ_ONCE(X); > > can we observe the result where r2 on CPU0 is 0 while r2 on CPU1 is 1? > As reminded by Andrea, what I meant to ask here is: can we observer the result where r2 on CPU0 is 0 while r2 on CPU1 is 0? Regards, Boqun > Regards, > Boqun > > > gives only TSO (R->R, R->W, W->W), because the two pieces of the AMO are > > split, and that breaks the chain. Getting full RW->RW between the critical > > sections would therefore require an extra fence. Also, the accesses to the > > lockwords themselves would not be ordered without an extra fence. > > > > > Last time this came up, I think the RISC-V folks were generally happy to > > > implement whatever was necessary for Linux [1]. The thing that was stopping > > > us was Power (see CONFIG_ARCH_WEAK_RELEASE_ACQUIRE), wasn't it? I think > > > Michael saw quite a bit of variety in the impact on benchmarks [2] across > > > different machines. So the question is whether newer Power machines are less > > > affected to the degree that we could consider making this change again. > > > > Yes, as I said above, RISC-V will implement what is needed to make this work. > > > > Dan > > > > > Will > > > > > > [1] https://lore.kernel.org/lkml/11b27d32-4a8a-3f84-0f25-723095ef1076@nvidia.com/ > > > [2] https://lore.kernel.org/lkml/87tvp3xonl.fsf@concordia.ellerman.id.au/