Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755918Ab2KZUJf (ORCPT ); Mon, 26 Nov 2012 15:09:35 -0500 Received: from mx1.redhat.com ([209.132.183.28]:24070 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755782Ab2KZUJc (ORCPT ); Mon, 26 Nov 2012 15:09:32 -0500 Date: Mon, 26 Nov 2012 15:09:17 -0500 From: Don Zickus To: Chuansheng Liu Cc: akpm@linux-foundation.org, mingo@kernel.org, rjw@sisk.pl, linux-kernel@vger.kernel.org Subject: Re: [PATCH] watchdog: optimizing the hrtimer interval for power saving Message-ID: <20121126200917.GW1871@redhat.com> References: <1353602906.15558.1695.camel@cliu38-desktop-build> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1353602906.15558.1695.camel@cliu38-desktop-build> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3894 Lines: 108 On Fri, Nov 23, 2012 at 12:48:26AM +0800, Chuansheng Liu wrote: > > By default, the watchdog threshold is 10, it means every 4s > every CPU will receive one hrtimer interrupt, for low power > device, it will cause 4-5mV power impact when device is deep > sleep. > > So here want to optimize it as below: > 4s + 4s + 4s + 4s + 4s > == > > 12s + 2s + 2s + 2s + 2s > 3/5 1/10 1/10 1/10 1/10 > > In 5 chances, once one chance is hit, then we can start the > hrtimer with a longer period sample(12s). Until the current > chance is not hit, will start the hrtimer with a shorted > period sample(2s). Hmm. Have you tried this patch with cpuspeed (or cpupower) disabled? The reason I ask is the watchdog threshold is set to 10 which means the hardlockup watchdog will go off in 10 seconds if it isn't kicked by hrtimers. So 12 seconds will miss the window repeatedly. However, the hardlockup works by calculating the max cpu frequency and converting it to 10 seconds. Thanks to cpuspeed, most machines don't run at max frequency. Therefore a 10 second window is usually 60 seconds or more. So your initial testing might have missed the fact that 12 seconds is greater than the 10 second hardlockup period. Cheers, Don > > With this patch, in most case the hrtimer will be 12s instead > of 4s averagely. It can save the device power indeed. > > Signed-off-by: liu chuansheng > --- > kernel/watchdog.c | 30 ++++++++++++++++++++++++++++-- > 1 files changed, 28 insertions(+), 2 deletions(-) > > diff --git a/kernel/watchdog.c b/kernel/watchdog.c > index dd4b80a..6457e62 100644 > --- a/kernel/watchdog.c > +++ b/kernel/watchdog.c > @@ -125,7 +125,24 @@ static u64 get_sample_period(void) > * and hard thresholds) to increment before the > * hardlockup detector generates a warning > */ > - return get_softlockup_thresh() * ((u64)NSEC_PER_SEC / 5); > + return get_softlockup_thresh() * ((u64)NSEC_PER_SEC / 10); > +} > + > +static u64 get_long_sample_period(void) > +{ > + /* > + * convert watchdog_thresh from seconds to ns > + * We want to give 5 chances to detect softlockup, > + * for power saving, once one chance is succeeding, > + * we can set long period to avoid power consumption. > + * Currently, set the long sample period is: > + * 20s * 3/5 = 12s, once this 12s chance is not hit, > + * we will divide the left 8s into 4 pieces, give every > + * chance every 2s, so it will be likely: > + * 12s + 2s + 2s + 2s + 2s, > + * Anyway, we just use 12s is enough in normal case. > + */ > + return get_softlockup_thresh() * ((u64)NSEC_PER_SEC * 3 / 5); > } > > /* Commands for resetting the watchdog */ > @@ -267,6 +284,10 @@ static enum hrtimer_restart watchdog_timer_fn(struct hrtimer *hrtimer) > unsigned long touch_ts = __this_cpu_read(watchdog_touch_ts); > struct pt_regs *regs = get_irq_regs(); > int duration; > + bool is_touched; > + > + is_touched = (__this_cpu_read(hrtimer_interrupts) == > + __this_cpu_read(soft_lockup_hrtimer_cnt)); > > /* kick the hardlockup detector */ > watchdog_interrupt_count(); > @@ -275,7 +296,12 @@ static enum hrtimer_restart watchdog_timer_fn(struct hrtimer *hrtimer) > wake_up_process(__this_cpu_read(softlockup_watchdog)); > > /* .. and repeat */ > - hrtimer_forward_now(hrtimer, ns_to_ktime(get_sample_period())); > + if (is_touched) { > + hrtimer_forward_now(hrtimer, > + ns_to_ktime(get_long_sample_period())); > + } else { > + hrtimer_forward_now(hrtimer, ns_to_ktime(get_sample_period())); > + } > > if (touch_ts == 0) { > if (unlikely(__this_cpu_read(softlockup_touch_sync))) { > -- > 1.7.0.4 > > > -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/