Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756350AbYH3Mij (ORCPT ); Sat, 30 Aug 2008 08:38:39 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751895AbYH3Mia (ORCPT ); Sat, 30 Aug 2008 08:38:30 -0400 Received: from casper.infradead.org ([85.118.1.10]:60753 "EHLO casper.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751548AbYH3Mi3 (ORCPT ); Sat, 30 Aug 2008 08:38:29 -0400 Subject: Re: [RT PATCH v2] seqlock: serialize against writers From: Peter Zijlstra To: Gregory Haskins Cc: mingo@elte.hu, rostedt@goodmis.org, tglx@linutronix.de, linux-kernel@vger.kernel.org, linux-rt-users@vger.kernel.org, gregory.haskins@gmail.com, andi@firstfloor.org, shemminger@vyatta.com In-Reply-To: <48B93DCD.2010503@novell.com> References: <20080829154237.1196.66825.stgit@dev.haskins.net> <20080829180135.22450.54780.stgit@dev.haskins.net> <1220095043.8426.16.camel@twins> <48B93DCD.2010503@novell.com> Content-Type: text/plain Date: Sat, 30 Aug 2008 14:38:19 +0200 Message-Id: <1220099899.8426.34.camel@twins> Mime-Version: 1.0 X-Mailer: Evolution 2.22.3.1 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3389 Lines: 90 On Sat, 2008-08-30 at 08:32 -0400, Gregory Haskins wrote: > Peter Zijlstra wrote: > > On Fri, 2008-08-29 at 14:03 -0400, Gregory Haskins wrote: > > > >> *Patch submitted for inclusion in PREEMPT_RT 26-rt4. Applies to 2.6.26.3-rt3* > >> > >> Hi Ingo, Steven, Thomas, > >> Please consider for -rt4. This fixes a nasty deadlock on my systems under > >> heavy load. > >> > >> [ > >> Changelog: > >> v2: only touch seqlock_t because raw_seqlock_t doesn't require > >> serialization and userspace cannot modify data during a read > >> > >> v1: initial release > >> ] > >> > >> -Greg > >> > >> ---- > >> seqlock: serialize against writers > >> > >> Seqlocks have always advertised that readers do not "block", but this was > >> never really true. Readers have always logically blocked at the head of > >> the critical section under contention with writers, regardless of whether > >> they were allowed to run code or not. > >> > >> Recent changes in this space (88a411c07b6fedcfc97b8dc51ae18540bd2beda0) > >> have turned this into a more explicit blocking operation in mainline. > >> However, this change highlights a short-coming in -rt because the > >> normal seqlock_ts are preemptible. This means that we can potentially > >> deadlock should a reader spin waiting for a write critical-section to end > >> while the writer is preempted. > >> > > > > Ah, the point I was missing is higher-priority realtime task, in which > > case the write side will never run because it wont preempt. > > > > Yep > > > >> This patch changes the internal implementation to use a rwlock and forces > >> the readers to serialize with the writers under contention. This will > >> have the advantage that -rt seqlocks_t will sleep the reader if deadlock > >> were imminent, and it will pi-boost the writer to prevent inversion. > >> > >> This fixes a deadlock discovered under testing where all high prioritiy > >> readers were hogging the cpus and preventing a writer from releasing the > >> lock. > >> > >> Since seqlocks are designed to be used as rarely-write locks, this should > >> not affect the performance in the fast-path > >> > > > > Still dont like this patch, once you have a rwlock you might as well go > > all the way. > > Why? Because the second point. > A full rwlock will still be much slower since the readers will > always need an atomic op. This construct only uses atomic ops in the > slow path under contention, which should be rare, and is thus still > superior when retries are permissible to the design. > > > Esp since this half-arsed construct defeats PI in certain > > cases. > > > > Ouch. While I admit that you can still get into inversion scenarios > once the reader leaves the seqbegin, this is the nature of seqlocks. > The only ways I can think of to get around this involve atomic ops in > the fast path, which I think should be avoided. What would you suggest > otherwise? Since we're talking -rt here, determinism rules, so bite the bullet and do full PI. The only reason we made all that stuff preemptable is to gain determinism, that also means we have to do the PI thing. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/