Received: by 10.223.185.111 with SMTP id b44csp106013wrg; Fri, 9 Mar 2018 01:51:16 -0800 (PST) X-Google-Smtp-Source: AG47ELsExzHgu9dBCCmfhoeQ5kNaQkM+crJ2Ya7KQcfgcqt4rEVVjB06eOEXjhQnWQzkufDkeOH2 X-Received: by 10.98.10.219 with SMTP id 88mr29460801pfk.202.1520589076562; Fri, 09 Mar 2018 01:51:16 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1520589076; cv=none; d=google.com; s=arc-20160816; b=DytKSHDH/7TiLQ0lq+1L5tRTLaK6F+rZHoQkUN7WqZ3vVzraT5BTTStCJG/mKmSTiW fQz/uc6lNWgeec2f+vpeoPOp8CS4CSdlAq0YW9Rj/ILPSxdAWyHyZmIwYyqVMlJlStqu iV0Bot8c6dj1dlI1HOLu2iO3NP8Y3qYbYv+PHC4UOMT/uFaSB1DGgG3UM3OU84XeSc+D rlJjulDHIfrEe2SY1FeGvuy2gnmspttDZYyV97/OZ/9nUt3GhBfBXJqKo77JuZFc16dH 6FC03NqzFXzAdQVgYJBiKiXHpAxVOm9h9n/NB/884SOczKgAfoN+R2ULWf+O4J/bZ20Z g2iw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from :arc-authentication-results; bh=jvhP1aQtkgdbw32jO2Ea+6eJm1xARXqr5jj32oJ4+5o=; b=m1MKQXYwdpMBqQpCjJFw93jOhDIT6laJCkqJ6UmKsK1O6hPGgayFqNZHp0byXxfXsP J93lf8u2XIckT7jUVRIQnzrrXv7IGabA++ixgj939+Zho6l8fDc2XOTR3Bj+bHkaC/p7 mXY3P/wjNSJy3e+Wdt6jRZrV1oSpvrQnQYUxO2Gg4QDicrdU20qCy366H45vJ0G8eW1o nefOCFsYFK+vavL+NgRjdKwnPq1HRo3svzrHmTc+RiwlqqkOFLqKa+CbRmmETCNY5e04 fu/b59SDOYD0UMVbN+6sxkl8PLjfXpRZUHhlZfjh0xiO/XrgLpJvb6DZCRGQngrz9gXe eoew== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id t191si499025pgc.566.2018.03.09.01.51.02; Fri, 09 Mar 2018 01:51:16 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751072AbeCIJsw (ORCPT + 99 others); Fri, 9 Mar 2018 04:48:52 -0500 Received: from cloudserver094114.home.pl ([79.96.170.134]:46969 "EHLO cloudserver094114.home.pl" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750878AbeCIJst (ORCPT ); Fri, 9 Mar 2018 04:48:49 -0500 Received: from 79.184.254.228.ipv4.supernova.orange.pl (79.184.254.228) (HELO aspire.rjw.lan) by serwer1319399.home.pl (79.96.170.134) with SMTP (IdeaSmtpServer 0.83) id 909c919344401be1; Fri, 9 Mar 2018 10:48:47 +0100 From: "Rafael J. Wysocki" To: Peter Zijlstra , Linux PM , Frederic Weisbecker Cc: Thomas Gleixner , Paul McKenney , Thomas Ilsche , Doug Smythies , Rik van Riel , Aubrey Li , Mike Galbraith , LKML Subject: [RFC/RFT][PATCH v3 6/6] cpuidle: menu: Refine idle state selection for running tick Date: Fri, 09 Mar 2018 10:49:11 +0100 Message-ID: <2427176.MrJbnUnnmj@aspire.rjw.lan> In-Reply-To: <2450532.XN8DODrtDf@aspire.rjw.lan> References: <2450532.XN8DODrtDf@aspire.rjw.lan> MIME-Version: 1.0 Content-Transfer-Encoding: 7Bit Content-Type: text/plain; charset="us-ascii" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Rafael J. Wysocki If the tick isn't stopped, the target residency of the state selected by the menu governor may be greater than the actual time to the next tick and that means lost energy. To avoid that, make tick_nohz_get_sleep_length() return the current time the the next event (before stopping the tick) in addition to the estimated one via an extra pointer argument and make menu_select() use that value to refine the state selection when necessary. Signed-off-by: Rafael J. Wysocki --- drivers/cpuidle/governors/menu.c | 21 ++++++++++++++++++--- include/linux/tick.h | 2 +- kernel/time/tick-sched.c | 7 +++++-- 3 files changed, 24 insertions(+), 6 deletions(-) Index: linux-pm/include/linux/tick.h =================================================================== --- linux-pm.orig/include/linux/tick.h +++ linux-pm/include/linux/tick.h @@ -119,7 +119,7 @@ extern void tick_nohz_idle_retain_tick(v extern void tick_nohz_idle_enter(void); extern void tick_nohz_idle_exit(void); extern void tick_nohz_irq_exit(void); -extern ktime_t tick_nohz_get_sleep_length(void); +extern ktime_t tick_nohz_get_sleep_length(ktime_t *cur_ret); extern unsigned long tick_nohz_get_idle_calls(void); extern unsigned long tick_nohz_get_idle_calls_cpu(int cpu); extern u64 get_cpu_idle_time_us(int cpu, u64 *last_update_time); Index: linux-pm/kernel/time/tick-sched.c =================================================================== --- linux-pm.orig/kernel/time/tick-sched.c +++ linux-pm/kernel/time/tick-sched.c @@ -1024,10 +1024,11 @@ void tick_nohz_irq_exit(void) /** * tick_nohz_get_sleep_length - return the expected length of the current sleep + * @cur_ret: pointer for returning the current time to the next event * * Called from power state control code with interrupts disabled */ -ktime_t tick_nohz_get_sleep_length(void) +ktime_t tick_nohz_get_sleep_length(ktime_t *cur_ret) { struct clock_event_device *dev = __this_cpu_read(tick_cpu_device.evtdev); struct tick_sched *ts = this_cpu_ptr(&tick_cpu_sched); @@ -1038,6 +1039,8 @@ ktime_t tick_nohz_get_sleep_length(void) */ ktime_t now = ts->idle_entrytime; + *cur_ret = ktime_sub(dev->next_event, now); + if (can_stop_idle_tick(cpu, ts)) { ktime_t next_event = __tick_nohz_next_event(ts, cpu); @@ -1045,7 +1048,7 @@ ktime_t tick_nohz_get_sleep_length(void) return ktime_sub(next_event, now); } - return ktime_sub(dev->next_event, now); + return *cur_ret; } /** Index: linux-pm/drivers/cpuidle/governors/menu.c =================================================================== --- linux-pm.orig/drivers/cpuidle/governors/menu.c +++ linux-pm/drivers/cpuidle/governors/menu.c @@ -296,6 +296,7 @@ static int menu_select(struct cpuidle_dr unsigned int expected_interval; unsigned long nr_iowaiters, cpu_load; int resume_latency = dev_pm_qos_raw_read_value(device); + ktime_t tick_time; if (data->needs_update) { menu_update(drv, dev); @@ -313,7 +314,7 @@ static int menu_select(struct cpuidle_dr } /* determine the expected residency time, round up */ - data->next_timer_us = ktime_to_us(tick_nohz_get_sleep_length()); + data->next_timer_us = ktime_to_us(tick_nohz_get_sleep_length(&tick_time)); get_iowait_load(&nr_iowaiters, &cpu_load); data->bucket = which_bucket(data->next_timer_us, nr_iowaiters); @@ -400,8 +401,22 @@ static int menu_select(struct cpuidle_dr * within the tick period range that could be used if longer * idle duration was predicted. */ - *nohz_ret = !(first_idx > idx && - drv->states[first_idx].target_residency < TICK_USEC_HZ); + if (first_idx > idx && + drv->states[first_idx].target_residency < TICK_USEC_HZ) { + unsigned int tick_us = ktime_to_us(tick_time); + + /* + * Find a state with target residency less than the + * time to the next timer event including the tick. + */ + while (idx > 0 && + (drv->states[idx].target_residency > tick_us || + drv->states[idx].disabled || + dev->states_usage[idx].disable)) + idx--; + + *nohz_ret = false; + } } data->last_state_idx = idx;