Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1757137Ab2BPGfV (ORCPT ); Thu, 16 Feb 2012 01:35:21 -0500 Received: from e33.co.us.ibm.com ([32.97.110.151]:40795 "EHLO e33.co.us.ibm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752211Ab2BPGfQ (ORCPT ); Thu, 16 Feb 2012 01:35:16 -0500 Date: Wed, 15 Feb 2012 22:35:04 -0800 From: "Paul E. McKenney" To: Peter Zijlstra Cc: linux-kernel@vger.kernel.org, mingo@elte.hu, laijs@cn.fujitsu.com, dipankar@in.ibm.com, akpm@linux-foundation.org, mathieu.desnoyers@polymtl.ca, josh@joshtriplett.org, niv@us.ibm.com, tglx@linutronix.de, rostedt@goodmis.org, Valdis.Kletnieks@vt.edu, dhowells@redhat.com, eric.dumazet@gmail.com, darren@dvhart.com, fweisbec@gmail.com, patches@linaro.org, Avi Kiviti , Chris Mason , Eric Paris Subject: Re: [PATCH RFC tip/core/rcu] rcu: direct algorithmic SRCU implementation Message-ID: <20120216063504.GE2976@linux.vnet.ibm.com> Reply-To: paulmck@linux.vnet.ibm.com References: <20120213020951.GA12138@linux.vnet.ibm.com> <1329310763.2293.78.camel@twins> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1329310763.2293.78.camel@twins> User-Agent: Mutt/1.5.21 (2010-09-15) X-Content-Scanned: Fidelis XPS MAILER x-cbid: 12021606-2398-0000-0000-00000447405E Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3111 Lines: 72 On Wed, Feb 15, 2012 at 01:59:23PM +0100, Peter Zijlstra wrote: > On Sun, 2012-02-12 at 18:09 -0800, Paul E. McKenney wrote: > > The current implementation of synchronize_srcu_expedited() can cause > > severe OS jitter due to its use of synchronize_sched(), which in turn > > invokes try_stop_cpus(), which causes each CPU to be sent an IPI. > > This can result in severe performance degradation for real-time workloads > > and especially for short-interation-length HPC workloads. Furthermore, > > because only one instance of try_stop_cpus() can be making forward progress > > at a given time, only one instance of synchronize_srcu_expedited() can > > make forward progress at a time, even if they are all operating on > > distinct srcu_struct structures. > > > > This commit, inspired by an earlier implementation by Peter Zijlstra > > (https://lkml.org/lkml/2012/1/31/211) and by further offline discussions, > > takes a strictly algorithmic bits-in-memory approach. This has the > > disadvantage of requiring one explicit memory-barrier instruction in > > each of srcu_read_lock() and srcu_read_unlock(), but on the other hand > > completely dispenses with OS jitter and furthermore allows SRCU to be > > used freely by CPUs that RCU believes to be idle or offline. > > > > The update-side implementation handles the single read-side memory > > barrier by rechecking the per-CPU counters after summing them and > > by running through the update-side state machine twice. > > Yeah, getting rid of that second memory barrier in srcu_read_lock() is > pure magic :-) > > > This implementation has passed moderate rcutorture testing on both 32-bit > > x86 and 64-bit Power. A call_srcu() function will be present in a later > > version of this patch. > > Goodness ;-) Glad you like the magic and the prospect of call_srcu(). ;-) > > @@ -131,10 +214,11 @@ int __srcu_read_lock(struct srcu_struct *sp) > > int idx; > > > > preempt_disable(); > > - idx = sp->completed & 0x1; > > - barrier(); /* ensure compiler looks -once- at sp->completed. */ > > - per_cpu_ptr(sp->per_cpu_ref, smp_processor_id())->c[idx]++; > > - srcu_barrier(); /* ensure compiler won't misorder critical section. */ > > + idx = rcu_dereference_index_check(sp->completed, > > + rcu_read_lock_sched_held()) & 0x1; > > + ACCESS_ONCE(per_cpu_ptr(sp->per_cpu_ref, smp_processor_id())->c[idx]) += > > + SRCU_USAGE_COUNT + 1; > > + smp_mb(); /* B */ /* Avoid leaking the critical section. */ > > preempt_enable(); > > return idx; > > } > > You could use __this_cpu_* muck to shorten some of that. Ah, so something like this? ACCESS_ONCE(this_cpu_ptr(sp->per_cpu_ref)->c[idx]) += SRCU_USAGE_COUNT + 1; Now that you mention it, this does look nicer, applied here and to srcu_read_unlock(). > Acked-by: Peter Zijlstra Thanx, Paul -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/