Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1758735Ab2BONAk (ORCPT ); Wed, 15 Feb 2012 08:00:40 -0500 Received: from casper.infradead.org ([85.118.1.10]:40217 "EHLO casper.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1758005Ab2BONAi convert rfc822-to-8bit (ORCPT ); Wed, 15 Feb 2012 08:00:38 -0500 Message-ID: <1329310763.2293.78.camel@twins> Subject: Re: [PATCH RFC tip/core/rcu] rcu: direct algorithmic SRCU implementation From: Peter Zijlstra To: paulmck@linux.vnet.ibm.com Cc: linux-kernel@vger.kernel.org, mingo@elte.hu, laijs@cn.fujitsu.com, dipankar@in.ibm.com, akpm@linux-foundation.org, mathieu.desnoyers@polymtl.ca, josh@joshtriplett.org, niv@us.ibm.com, tglx@linutronix.de, rostedt@goodmis.org, Valdis.Kletnieks@vt.edu, dhowells@redhat.com, eric.dumazet@gmail.com, darren@dvhart.com, fweisbec@gmail.com, patches@linaro.org, Avi Kiviti , Chris Mason , Eric Paris Date: Wed, 15 Feb 2012 13:59:23 +0100 In-Reply-To: <20120213020951.GA12138@linux.vnet.ibm.com> References: <20120213020951.GA12138@linux.vnet.ibm.com> Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7BIT X-Mailer: Evolution 3.2.2- Mime-Version: 1.0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2664 Lines: 57 On Sun, 2012-02-12 at 18:09 -0800, Paul E. McKenney wrote: > The current implementation of synchronize_srcu_expedited() can cause > severe OS jitter due to its use of synchronize_sched(), which in turn > invokes try_stop_cpus(), which causes each CPU to be sent an IPI. > This can result in severe performance degradation for real-time workloads > and especially for short-interation-length HPC workloads. Furthermore, > because only one instance of try_stop_cpus() can be making forward progress > at a given time, only one instance of synchronize_srcu_expedited() can > make forward progress at a time, even if they are all operating on > distinct srcu_struct structures. > > This commit, inspired by an earlier implementation by Peter Zijlstra > (https://lkml.org/lkml/2012/1/31/211) and by further offline discussions, > takes a strictly algorithmic bits-in-memory approach. This has the > disadvantage of requiring one explicit memory-barrier instruction in > each of srcu_read_lock() and srcu_read_unlock(), but on the other hand > completely dispenses with OS jitter and furthermore allows SRCU to be > used freely by CPUs that RCU believes to be idle or offline. > > The update-side implementation handles the single read-side memory > barrier by rechecking the per-CPU counters after summing them and > by running through the update-side state machine twice. Yeah, getting rid of that second memory barrier in srcu_read_lock() is pure magic :-) > This implementation has passed moderate rcutorture testing on both 32-bit > x86 and 64-bit Power. A call_srcu() function will be present in a later > version of this patch. Goodness ;-) > @@ -131,10 +214,11 @@ int __srcu_read_lock(struct srcu_struct *sp) > int idx; > > preempt_disable(); > - idx = sp->completed & 0x1; > - barrier(); /* ensure compiler looks -once- at sp->completed. */ > - per_cpu_ptr(sp->per_cpu_ref, smp_processor_id())->c[idx]++; > - srcu_barrier(); /* ensure compiler won't misorder critical section. */ > + idx = rcu_dereference_index_check(sp->completed, > + rcu_read_lock_sched_held()) & 0x1; > + ACCESS_ONCE(per_cpu_ptr(sp->per_cpu_ref, smp_processor_id())->c[idx]) += > + SRCU_USAGE_COUNT + 1; > + smp_mb(); /* B */ /* Avoid leaking the critical section. */ > preempt_enable(); > return idx; > } You could use __this_cpu_* muck to shorten some of that. Acked-by: Peter Zijlstra -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/