Received: by 10.223.185.116 with SMTP id b49csp3581109wrg; Tue, 6 Mar 2018 01:14:52 -0800 (PST) X-Google-Smtp-Source: AG47ELtH/WJRThJ0a3wnHJdcXOVcIVmj8aUT/fqeuWy+nIEEIjuBrAKv8GLxBQRyjTkX8V8QnQcz X-Received: by 2002:a17:902:9882:: with SMTP id s2-v6mr16545025plp.196.1520327692562; Tue, 06 Mar 2018 01:14:52 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1520327692; cv=none; d=google.com; s=arc-20160816; b=lZAy+hbdN1LYqn5B/+tGEgn8zJ0iwo70GmmqILS/1oW9rPiosAZA3PAT0IitYOUBnV 2SCS5lIZkHCBX/ugFQ10/6ZYTlH3VevGxRVrGSiTJg6vlBPGWcVe1DQJ46AYDvx096Yd sNF1IcLOTtc3xqK581GgwEUODj2H+WDjjwQb7xUcExLw8cP1t4JZYdu+pEJu8Wi0LU7u vcYSnbBgV2hvaBWIoaozITaigXkt3uf9+fEaOzzWj5Gz8lE6XBgYiO0uHvXxQwE2MN1m tUptI7ud6l2Rp3YdjvvSDw8Q3FAOdglfGIdOh+X6ECKqRiO8DziLTfCYcuQ+Xne8jkNl h8LA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from :arc-authentication-results; bh=wNPVs34diLT8ui8wCWOXiFz223688qpQcQOsRWtNYIw=; b=ma8pTMT6vOJM8L0A3w5LS0a4Iz8MP7VzyKDoUI5e9uAFXi2U/KXNNI/7QwAS/Ta7KU mr+jbNopK6UkxNIXZl0vkKJZQ5a3aa8rgKLmeuYkLTgJNaOAklu3yLUMUncjX9C5y3dd 4gXDP02Lq5nfvgahMfvkJH/LGhrU2GRa7ii4vw2wWlldGjnOx6p+kZ3V0qaL+KkG+Rm+ b0UTH1In2gmesoRXXBNMqupHsuwYbbB5kJl4YiYXUec8Y4CmoAJhyzrTguOThCjRS32Q Tv6Oxmk4e27yMbuV8g5DgL7knTmDEb2WhqHi6vvUBc5Wpt3iQP/BfHeiAGcf3AYtWewa 4mrg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id m1si1674203pgc.757.2018.03.06.01.14.38; Tue, 06 Mar 2018 01:14:52 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753475AbeCFJM7 (ORCPT + 99 others); Tue, 6 Mar 2018 04:12:59 -0500 Received: from cloudserver094114.home.pl ([79.96.170.134]:60016 "EHLO cloudserver094114.home.pl" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753069AbeCFJMe (ORCPT ); Tue, 6 Mar 2018 04:12:34 -0500 Received: from 79.184.254.228.ipv4.supernova.orange.pl (79.184.254.228) (HELO aspire.rjw.lan) by serwer1319399.home.pl (79.96.170.134) with SMTP (IdeaSmtpServer 0.83) id 59716b4d61af81fa; Tue, 6 Mar 2018 10:12:32 +0100 From: "Rafael J. Wysocki" To: Peter Zijlstra , Linux PM Cc: Thomas Gleixner , Frederic Weisbecker , Paul McKenney , Thomas Ilsche , Doug Smythies , Rik van Riel , Aubrey Li , Mike Galbraith , LKML Subject: [RFC/RFT][PATCH v2 2/6] sched: idle: Do not stop the tick upfront in the idle loop Date: Tue, 06 Mar 2018 10:02:15 +0100 Message-ID: <3346281.BDGJiv2ZOp@aspire.rjw.lan> In-Reply-To: <2067762.1uWBf5RSRc@aspire.rjw.lan> References: <2067762.1uWBf5RSRc@aspire.rjw.lan> MIME-Version: 1.0 Content-Transfer-Encoding: 7Bit Content-Type: text/plain; charset="us-ascii" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Rafael J. Wysocki Push the decision whether or not to stop the tick somewhat deeper into the idle loop. Stopping the tick upfront leads to unpleasant outcomes in case the idle governor doesn't agree with the timekeeping code on the duration of the upcoming idle period. Specifically, if the tick has been stopped and the idle governor predicts short idle, the situation is bad regardless of whether or not the prediction is accurate. If it is accurate, the tick has been stopped unnecessarily which means excessive overhead. If it is not accurate, the CPU is likely to spend too much time in the (shallow, because short idle has been predicted) idle state selected by the governor [1]. As the first step towards addressing this problem, change the code to make the tick stopping decision inside of the loop in do_idle(). In particular, do not stop the tick in the cpu_idle_poll() code path. Also don't do that in tick_nohz_irq_exit() which doesn't really have information to whether or not to stop the tick. Link: https://marc.info/?l=linux-pm&m=150116085925208&w=2 # [1] Link: https://tu-dresden.de/zih/forschung/ressourcen/dateien/projekte/haec/powernightmares.pdf Signed-off-by: Rafael J. Wysocki --- -> v2: No changes. --- kernel/sched/idle.c | 13 ++++++++++--- kernel/time/tick-sched.c | 2 +- 2 files changed, 11 insertions(+), 4 deletions(-) Index: linux-pm/kernel/sched/idle.c =================================================================== --- linux-pm.orig/kernel/sched/idle.c +++ linux-pm/kernel/sched/idle.c @@ -220,13 +220,17 @@ static void do_idle(void) */ __current_set_polling(); - tick_nohz_idle_enter(); + tick_nohz_idle_prepare(); while (!need_resched()) { check_pgt_cache(); rmb(); if (cpu_is_offline(cpu)) { + local_irq_disable(); + tick_nohz_idle_go_idle(true); + local_irq_enable(); + cpuhp_report_idle_dead(); arch_cpu_idle_dead(); } @@ -240,10 +244,13 @@ static void do_idle(void) * broadcast device expired for us, we don't want to go deep * idle as we know that the IPI is going to arrive right away. */ - if (cpu_idle_force_poll || tick_check_broadcast_expired()) + if (cpu_idle_force_poll || tick_check_broadcast_expired()) { + tick_nohz_idle_go_idle(false); cpu_idle_poll(); - else + } else { + tick_nohz_idle_go_idle(true); cpuidle_idle_call(); + } arch_cpu_idle_exit(); } Index: linux-pm/kernel/time/tick-sched.c =================================================================== --- linux-pm.orig/kernel/time/tick-sched.c +++ linux-pm/kernel/time/tick-sched.c @@ -1007,7 +1007,7 @@ void tick_nohz_irq_exit(void) struct tick_sched *ts = this_cpu_ptr(&tick_cpu_sched); if (ts->inidle) - __tick_nohz_idle_enter(ts, true); + __tick_nohz_idle_enter(ts, false); else tick_nohz_full_update_tick(ts); }