Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1758391AbYBWOur (ORCPT ); Sat, 23 Feb 2008 09:50:47 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1754271AbYBWOuf (ORCPT ); Sat, 23 Feb 2008 09:50:35 -0500 Received: from wf-out-1314.google.com ([209.85.200.174]:36866 "EHLO wf-out-1314.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754221AbYBWOud (ORCPT ); Sat, 23 Feb 2008 09:50:33 -0500 DomainKey-Signature: a=rsa-sha1; c=nofws; d=googlemail.com; s=gamma; h=message-id:date:from:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=txuGgBMXRKs7XIJ5tKiv7DACB4bC8kmp79U/qfnoTAWxMNglkrcvGpx5e5oFXqSa3oM8lVhNm8P+ZxTzQaA7w924bb+hwdYrAJojp8PCNpELDIRk5aThGUD1s7Ifbb72yPhSD1t43FT01qkMIpxcz9PlfT/BEKFqSbJcHSayuMs= Message-ID: <7c86c4470802230650l1605db93o3c9b38ec52bcba89@mail.gmail.com> Date: Sat, 23 Feb 2008 15:50:33 +0100 From: "stephane eranian" To: "Peter Zijlstra" Subject: Re: runqueue locks in schedule() Cc: linux-kernel@vger.kernel.org, ia64 , "Stephane Eranian" , "Corey J Ashford" , "Ingo Molnar" In-Reply-To: <1200576266.28661.27.camel@twins> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline References: <7c86c4470801161629t3870da59hb6ac371c44126b07@mail.gmail.com> <1200576266.28661.27.camel@twins> Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2717 Lines: 60 Peter, > On Wed, 2008-01-16 at 16:29 -0800, stephane eranian wrote: > > Hello, > > > > As suggested by people on this list, I have changed perfmon2 to use > > the high resolution timers as the interface to allow timeout-based > > event set multiplexing. This works around the problems I had with > > tickless-enabled kernels. > > > > Multiplexing is supported in per-thread as well. In that case, the > > timeout measures virtual time. When the thread is context switched > > out, we need to save the remainder of the timeout and cancel the > > timer. When the thread is context switched in, we need to reinstall > > the timer. These timer save/restore operations have to be done in the > > switch_to() code near the end of schedule(). > > > > There are situations where hrtimer_start() may end up trying to > > acquire the runqueue lock. This happens on a context switch where the > > current thread is blocking (not preempted) and the new timeout happens > > to be either in the past or just expiring. We've run into such > > situations with simple tests. > > > > On all architectures, but IA-64, it seems thet the runqueue lock is > > held until the end of schedule(). On IA-64, the lock is released > > BEFORE switch_to() for some reason I don't quite remember. That may > > not even be needed anymore. > > > > The early unlocking is controlled by a macro named > > __ARCH_WANT_UNLOCKED_CTXSW. Defining this macros on X86 (or PPC) fixed > > our problem. > > > > It is not clear to me why the runqueue lock needs to be held up until > > the end of schedule() on some platforms and not on others. Not that > > releasing the lock earlier does not necessarily introduce more > > overhead because the lock is never re-acquired later in the schedule() > > function. > > > > Question: > > - is it safe to release the lock before switch_to() on all architectures? > > I had similar problem when using hrtimers from the scheduler, I extended > the HRTIMER_CB_IRQSAFE_NO_SOFTIRQ time type to run with cpu_base->lock > unlocked. > I am running into an issue when enabling this flag. Basically, the timer never fires when it gets into this situation where in hrtimer_start() the timer ends up being the next one to fire. In this mode, hrtimer_enqueue_reprogram() become a NOP. But then nobody never inserts the time into any queue. There is a comment that says "caller site takes care of this". Could you elaborate on this? Thanks. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/