Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932262AbVLUDjP (ORCPT ); Tue, 20 Dec 2005 22:39:15 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S932270AbVLUDjP (ORCPT ); Tue, 20 Dec 2005 22:39:15 -0500 Received: from mustang.oldcity.dca.net ([216.158.38.3]:41348 "HELO mustang.oldcity.dca.net") by vger.kernel.org with SMTP id S932262AbVLUDjP (ORCPT ); Tue, 20 Dec 2005 22:39:15 -0500 Subject: Re: 2.6.14-rt22 (and mainline) excessive latency From: Lee Revell To: paulmck@us.ibm.com Cc: Ingo Molnar , linux-kernel In-Reply-To: <20051221014747.GB5741@us.ibm.com> References: <1135039244.28649.41.camel@mindpipe> <20051220042442.GA32039@elte.hu> <20051221014747.GB5741@us.ibm.com> Content-Type: text/plain Date: Tue, 20 Dec 2005 22:32:48 -0500 Message-Id: <1135135970.28229.0.camel@mindpipe> Mime-Version: 1.0 X-Mailer: Evolution 2.4.1 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1520 Lines: 36 On Tue, 2005-12-20 at 17:47 -0800, Paul E. McKenney wrote: > On Tue, Dec 20, 2005 at 05:24:42AM +0100, Ingo Molnar wrote: > > > > * Lee Revell wrote: > > > > > I captured this 3+ ms latency trace when killing a process with a few > > > thousand threads. Can a cond_resched be added to this code path? > > > > > bash-17992 0.n.1 29us : eligible_child (do_wait) > > > > > > [ 3000+ of these deleted ] > > > > > > bash-17992 0.n.1 3296us : eligible_child (do_wait) > > > > Atomicity of signal delivery is pretty much a must, so i'm not sure this > > particular latency can be fixed, short of running PREEMPT_RT. Paul E. > > McKenney is doing some excellent stuff by RCU-ifying the task lookup and > > signal code, but i'm not sure whether it could cover do_wait(). > > Took a quick break from repeatedly shooting myself in the foot with > RCU read-side priority boosting (still have a few toes left) to take > a quick look at this. The TASK_TRACED and TASK_STOPPED cases seem > non-trivial, and I am concerned about races with exit. > > Any thoughts on whether the latency is due to contention on the > tasklist lock vs. the "goto repeat" in do_wait()? It's a UP system so I'd be surprised if there were any contention. Lee - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/