Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751944AbdHAMRP (ORCPT ); Tue, 1 Aug 2017 08:17:15 -0400
Received: from foss.arm.com ([217.140.101.70]:39674 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751199AbdHAMRO (ORCPT ); Tue, 1 Aug 2017 08:17:14 -0400
Date: Tue, 1 Aug 2017 13:17:13 +0100
From: Will Deacon
To: Peter Zijlstra
Cc: "Paul E. McKenney", Boqun Feng, linux-kernel@vger.kernel.org, Ingo Molnar, Thomas Gleixner, Randy Dunlap
Subject: Re: [RFC][PATCH v3]: documentation,atomic: Add new documents
Message-ID: <20170801121713.GH8702@arm.com>
References: <20170611135632.sl72klbeklelupej@tardis> <20170612144929.3wiwtbqopsfpm3qk@hirez.programming.kicks-ass.net> <20170726115328.2sxiitivlnlq64dk@hirez.programming.kicks-ass.net> <20170726124750.vktrn5zi2gmpzfru@tardis> <20170731090535.rjgnoewqg7mhzr55@hirez.programming.kicks-ass.net> <20170731110403.ou3zqsp3uviqorkz@tardis> <20170731174345.GL3730@linux.vnet.ibm.com> <20170801090121.edo7mekhw3sann4h@hirez.programming.kicks-ass.net> <20170801101900.GB8702@arm.com> <20170801114744.evjjfviqhu5kgu7v@hirez.programming.kicks-ass.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <20170801114744.evjjfviqhu5kgu7v@hirez.programming.kicks-ass.net>
User-Agent: Mutt/1.5.23 (2014-03-12)
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org
Content-Length: 2617
Lines: 61

On Tue, Aug 01, 2017 at 01:47:44PM +0200, Peter Zijlstra wrote:
> On Tue, Aug 01, 2017 at 11:19:00AM +0100, Will Deacon wrote:
> > On Tue, Aug 01, 2017 at 11:01:21AM +0200, Peter Zijlstra wrote:
> > > On Mon, Jul 31, 2017 at 10:43:45AM -0700, Paul E. McKenney wrote:
> > >
> > > > Why wouldn't the following have ACQUIRE semantics?
> > > >
> > > >     atomic_inc(&var);
> > > >     smp_mb__after_atomic();
> > > >
> > > > Is the issue that there is no actual value returned or some such?
> > >
> > > Yes, so that the inc is a load-store, and thus there is a load, we lose
> > > the value.
> > >
> > > But I see your point I think. Irrespective of still having the value,
> > > the ordering is preserved and nothing should pass across that.
> > >
> > > > So if I have something like this, the assertion really can trigger?
> > > >
> > > >     WRITE_ONCE(x, 1);            atomic_inc(&y);
> > > >     r0 = xchg_release(&y, 5);    smp_mb__after_atomic();
> > > >                                  r1 = READ_ONCE(x);
> > > >
> > > >
> > > >     WARN_ON(r0 == 0 && r1 == 0);
> > > >
> > > > I must confess that I am not seeing why we would want to allow this
> > > > outcome.
> > >
> > > No you are indeed quite right. I just wasn't creative enough. Thanks for
> > > the inspiration.
> >
> > Just to close this out, we agree that an smp_rmb() instead of
> > smp_mb__after_atomic() would *not* forbid this outcome, right?
>
> So that really hurts my brain. Per the normal rules that smp_rmb() would
> order the read of @x against the last ll of @y and per ll/sc ordering
> you then still don't get to make the WARN happen.
>
> On IRC you explained that your 8.1 LSE instructions are not in fact
> ordered by a smp_rmb, only by smp_wmb, which is 'surprising' since you
> really need to load the old value to compute the new value.

To be clear, it's only the ST* variants of the LSE instructions that are
treated as a write for the purposes of memory ordering, so these are the
non-*_return variants. It's not unlikely that other architectures will
exhibit the same behaviour (e.g. Power, RISC-V), because the CPU can treat
non-return atomics as "fire-and-forget" and have them handled elsewhere in
the memory subsystem, causing them to be treated similarly to posted writes.

For the code snippet above, the second thread has no idea about the value
of y and so smp_rmb() is the wrong thing to be using imo. It really cares
about ordering the store to y before the read of x, so needs a full mb
(i.e. the test is more like 'R' than 'MP').

Also, wouldn't this problem also arise if your atomics were built using a
spinlock where unlock had release semantics?

Will
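
For readers who want to poke at the two-thread test quoted above outside the
kernel, here is a rough userspace sketch using C11 atomics and pthreads. It is
only an analogue: the C11 memory model is not the kernel's, and the mapping of
WRITE_ONCE/READ_ONCE to relaxed accesses, xchg_release() to a release
exchange, and smp_mb__after_atomic() to a seq_cst fence is an assumption made
for illustration. run_litmus() is a hypothetical helper name, not a kernel or
library API.

```c
#include <pthread.h>
#include <stdatomic.h>

/* Shared variables of the litmus test; the names follow the thread. */
static atomic_int x, y;
static int r0, r1;

/* First column of the test: store to x, then a release exchange on y. */
static void *thread0(void *arg)
{
	(void)arg;
	/* Assumed stand-in for WRITE_ONCE(x, 1). */
	atomic_store_explicit(&x, 1, memory_order_relaxed);
	/* Assumed stand-in for r0 = xchg_release(&y, 5). */
	r0 = atomic_exchange_explicit(&y, 5, memory_order_release);
	return NULL;
}

/* Second column: non-value-returning increment, full fence, read of x. */
static void *thread1(void *arg)
{
	(void)arg;
	/* Assumed stand-in for atomic_inc(&y); the result is discarded. */
	(void)atomic_fetch_add_explicit(&y, 1, memory_order_relaxed);
	/* Assumed stand-in for smp_mb__after_atomic(). */
	atomic_thread_fence(memory_order_seq_cst);
	/* Assumed stand-in for r1 = READ_ONCE(x). */
	r1 = atomic_load_explicit(&x, memory_order_relaxed);
	return NULL;
}

/*
 * Hypothetical helper: run the test `iterations` times and return how many
 * runs ended with r0 == 0 && r1 == 0 (the WARN_ON condition), or -1 if a
 * thread could not be created.
 */
int run_litmus(int iterations)
{
	int forbidden = 0;

	for (int i = 0; i < iterations; i++) {
		pthread_t t0, t1;

		/* Reset shared state before the threads start. */
		atomic_store(&x, 0);
		atomic_store(&y, 0);

		if (pthread_create(&t0, NULL, thread0, NULL) != 0)
			return -1;
		if (pthread_create(&t1, NULL, thread1, NULL) != 0) {
			pthread_join(t0, NULL);
			return -1;
		}
		pthread_join(t0, NULL);
		pthread_join(t1, NULL);

		/* r0 and r1 are safe to read here: join orders the accesses. */
		if (r0 == 0 && r1 == 0)
			forbidden++;
	}
	return forbidden;
}
```

Under the C11 rules the counted outcome should not occur: if the exchange
reads 0, the increment reads the value the release exchange wrote, and the
fence after it then orders the store to x before the load of x. Calling
run_litmus() with a large iteration count is a smoke test at best; a zero
result on one machine says nothing about what the kernel primitives guarantee
on another architecture.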