Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S934776AbXHOUPd (ORCPT ); Wed, 15 Aug 2007 16:15:33 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1757069AbXHOUPI (ORCPT ); Wed, 15 Aug 2007 16:15:08 -0400 Received: from e2.ny.us.ibm.com ([32.97.182.142]:45098 "EHLO e2.ny.us.ibm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752020AbXHOUPF (ORCPT ); Wed, 15 Aug 2007 16:15:05 -0400 Date: Wed, 15 Aug 2007 13:15:01 -0700 From: "Paul E. McKenney" To: Nick Piggin Cc: Herbert Xu , csnook@redhat.com, dhowells@redhat.com, linux-kernel@vger.kernel.org, linux-arch@vger.kernel.org, torvalds@linux-foundation.org, netdev@vger.kernel.org, akpm@linux-foundation.org, ak@suse.de, heiko.carstens@de.ibm.com, davem@davemloft.net, schwidefsky@de.ibm.com, wensong@linux-vs.org, horms@verge.net.au, wjiang@resilience.com, cfriesen@nortel.com, zlynx@acm.org, rpjday@mindspring.com, jesper.juhl@gmail.com, Arnd Bergmann Subject: Re: [PATCH 6/24] make atomic_read() behave consistently on frv Message-ID: <20070815201501.GM9645@linux.vnet.ibm.com> Reply-To: paulmck@linux.vnet.ibm.com References: <20070811042943.GA13410@linux.vnet.ibm.com> <20070813060302.GF13410@linux.vnet.ibm.com> <46C13EE1.1000707@yahoo.com.au> <20070814170128.GA8243@linux.vnet.ibm.com> <46C2FFDD.5000600@yahoo.com.au> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <46C2FFDD.5000600@yahoo.com.au> User-Agent: Mutt/1.5.13 (2006-08-11) Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 4438 Lines: 100 On Wed, Aug 15, 2007 at 11:30:05PM +1000, Nick Piggin wrote: > Paul E. McKenney wrote: > >On Tue, Aug 14, 2007 at 03:34:25PM +1000, Nick Piggin wrote: > > >>Maybe it is the safe way to go, but it does obscure cases where there > >>is a real need for barriers. > > > > > >I prefer burying barriers into other primitives. > > When they should naturally be there, eg. locking or the RCU primitives, > I agree. > > I don't like having them scattered in various "just in case" places, > because it makes both the users and the APIs hard to understand and > change. I certainly agree that we shouldn't do volatile just for the fun of it, and also that use of volatile should be quite rare. > Especially since several big architectures don't have volatile in their > atomic_get and _set, I think it would be a step backwards to add them in > as a "just in case" thin now (unless there is a better reason). Good point, except that I would expect gcc's optimization to continue to improve. I would like the kernel to be able to take advantage of improved optimization, which means that we are going to have to make a few changes. Adding volatile to atomic_get() and atomic_set() is IMHO one of those changes. > >>Many atomic operations are allowed to be reordered between CPUs, so > >>I don't have a good idea for the rationale to order them within the > >>CPU (also loads and stores to long and ptr types are not ordered like > >>this, although we do consider those to be atomic operations too). > >> > >>barrier() in a way is like enforcing sequential memory ordering > >>between process and interrupt context, wheras volatile is just > >>enforcing coherency of a single memory location (and as such is > >>cheaper). > > > > > >barrier() is useful, but it has the very painful side-effect of forcing > >the compiler to dump temporaries. So we do need something that is > >not quite so global in effect. > > Yep. > > >>What do you think of this crazy idea? > >> > >>/* Enforce a compiler barrier for only operations to location X. > >>* Call multiple times to provide an ordering between multiple > >>* memory locations. Other memory operations can be assumed by > >>* the compiler to remain unchanged and may be reordered > >>*/ > >>#define order(x) asm volatile("" : "+m" (x)) > > > >There was something very similar discussed earlier in this thread, > >with quite a bit of debate as to exactly what the "m" flag should > >look like. I suggested something similar named ACCESS_ONCE in the > >context of RCU (http://lkml.org/lkml/2007/7/11/664): > > Oh, I missed that earlier debate. Will go have a look. > > > #define ACCESS_ONCE(x) (*(volatile typeof(x) *)&(x)) > > > >The nice thing about this is that it works for both loads and stores. > >Not clear that order() above does this -- I get compiler errors when > >I try something like "b = order(a)" or "order(a) = 1" using gcc 4.1.2. > > As Arnd ponted out, order() is not supposed to be an lvalue, but a > statement like the rest of our memory ordering API. > > As to your followup question of why to use it over ACCESS_ONCE. I > guess, aside from consistency with the rest of the barrier APIs, you > can use it in other primitives when you don't actually know what the > caller is going to do or if it even will make an access. You could > also use it between calls to _other_ primitives, etc... it just > seems more flexible to me, but I haven't actually used such a thing > in real code... > > ACCESS_ONCE doesn't seem as descriptive. What it results in is the > memory location being loaded or stored (presumably once exactly), > but I think the more general underlying idea is a barrier point. OK, first, I am not arguing that ACCESS_ONCE() can replace all current uses of barrier(). Similarly, rcu_dereference(), rcu_assign_pointer(), and the various RCU versions of the list-manipulation API in no way replaced all uses of explicit memory barriers. However, I do believe that these API are much easier to use (where they apply, of course) than were the earlier idioms involving explicit memory barriers. But we of course need to keep the explicit memory and compiler barriers for other situations. Thanx, Paul - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/