Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751809AbdFISuN (ORCPT ); Fri, 9 Jun 2017 14:50:13 -0400 Received: from bombadil.infradead.org ([65.50.211.133]:47025 "EHLO bombadil.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751721AbdFISuM (ORCPT ); Fri, 9 Jun 2017 14:50:12 -0400 Date: Fri, 9 Jun 2017 20:49:58 +0200 From: Peter Zijlstra To: Vineet Gupta Cc: Will Deacon , Paul McKenney , Boqun Feng , linux-kernel@vger.kernel.org, Ingo Molnar , Thomas Gleixner , rkuo@codeaurora.org, james.hogan@imgtec.com, jejb@parisc-linux.org, davem@davemloft.net, cmetcalf@mellanox.com, arcml Subject: Re: [RFC][PATCH] atomic: Fix atomic_set_release() for 'funny' architectures Message-ID: <20170609184958.zkgcaxk7wezods5d@hirez.programming.kicks-ass.net> References: <20170609092450.jwmldgtli57ozxgq@hirez.programming.kicks-ass.net> <20170609110506.yod47flaav3wgoj5@hirez.programming.kicks-ass.net> <20170609111305.bn4ca4uscbp6pgxn@hirez.programming.kicks-ass.net> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: NeoMutt/20170113 (1.7.2) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1679 Lines: 34 On Fri, Jun 09, 2017 at 10:28:50AM -0700, Vineet Gupta wrote: > On 06/09/2017 04:13 AM, Peter Zijlstra wrote: > > On Fri, Jun 09, 2017 at 01:05:06PM +0200, Peter Zijlstra wrote: > > > > > The spinlock based atomics should be SC, that is, none of them appear to > > > place extra barriers in atomic_cmpxchg() or any of the other SC atomic > > > primitives and therefore seem to rely on their spinlock implementation > > > being SC (I did not fully validate all that). > > > > So I did see that ARC and PARISC have 'superfluous' smp_mb() calls > > around their spinlock implementation. > > > > That is, for spinlock semantics you only need one _after_ lock and one > > _before_ unlock. But the atomic stuff relies on being SC and thus would > > need one before and after both lock and unlock. > > Right we discussed this a while back: https://lkml.org/lkml/2015/6/11/276 > > At the time when I tried removing these extra barriers, hackbench regressed. > I'm about to get a new quad core 1GHz chip (vs. the FPGA before) and will > re-experiment. Likely we don't need it otherwise I will add a comment of > this "feature" > > > But ARC could probably optimize (if they still care about that hardware) > > by pulling out those barriers and putting it in the atomic > > implementation. > > A bit confused here. Reading the lkml posting for this thread, you posted 2 > patches, and they had to do with atomic_set() for EZChip platform which is > really special (no ll/sc). The extra smp_mb() is related to ll/sc variants. > Just tryign to make sure that we are talking 2 different things here :-) Could be I just got all my variants in a twist... wouldn't be the first time ;-)