Received: by 2002:a05:6a10:a841:0:0:0:0 with SMTP id d1csp647393pxy; Thu, 22 Apr 2021 10:06:57 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwAr5K0Dys+aDWLUIdGhX2pjCa49rgIr0Nb7xntMR4YmPEhvlDySdtKAIsTA2r371GzJgAR X-Received: by 2002:a17:906:49c1:: with SMTP id w1mr4642305ejv.178.1619111217445; Thu, 22 Apr 2021 10:06:57 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1619111217; cv=none; d=google.com; s=arc-20160816; b=Vb+jed/tUeC3RLhYPAtQkBZFif3po+v7o5jFIab72rHtrcVd8y8Jx1Qn26RAN1t/o7 YDFwIDaaL9aQPWARj6ZgOo9KvLbO9mdlG7ybkQ5qQkCM7rNDb8zXDNNAIsS2Z73g+i4j h8HUyU9XkLPlf89c1OlOKRa8LAIoYw3gkEma/n+mx+iLg+n5zuxgrm72xWGPLjEvU/HU oBCqXY/lE/HpkQX1duGYmiODwyK1MYbiRXEOg/Zw1FWtibX7N13czzbH/na4Ylm+Fftb wX48VDXUJW8ZnB+XtBsJFlDHswoVGPd4Tk/yRLUsapELrmhbqqMCtvaQdYwz52qZ6aLV MuzQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :message-id:subject:cc:to:from:date; bh=5Pffb2ZpSj9HREAUeGHlAsVKEi/sx3PVuAGoyfnB0UQ=; b=N5uUE2dvBUrbts3aGaqXasxTsntSgRurQ2kekGXFAI/XRC9BPqCM91TMI+FjGTXekT p07uRTrgKxvCsiVXgAiHBLhn346z48BPGiljHzzAhN4v6gBY5tGqWYBnRJ7bp8Sykioe 1daNKfgCHXCO4kBknoL1hK3Xq/Uc51l+v6rnT+UXV9MV77vL9zggXOJkkP37TKmu95J+ QvH+iBPy0uzO9teLQOjD9JjSX+t0mkpa/iCnvS7Cq8zS5Diov+7jxCjZdK2YwHZNgaH+ XdF9+NosHYFlYVBswz8wmp5E1uzKskRNhrGllA8Y4xkEsfVrmnTv3Q1bH0pGXAqP3IgZ uMjA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id hp30si3107240ejc.344.2021.04.22.10.06.33; Thu, 22 Apr 2021 10:06:57 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S238346AbhDVRDR (ORCPT + 99 others); Thu, 22 Apr 2021 13:03:17 -0400 Received: from shelob.surriel.com ([96.67.55.147]:35718 "EHLO shelob.surriel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S236287AbhDVRDR (ORCPT ); Thu, 22 Apr 2021 13:03:17 -0400 Received: from [2603:3005:d05:2b00:6e0b:84ff:fee2:98bb] (helo=imladris.surriel.com) by shelob.surriel.com with esmtpsa (TLS1.2) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94) (envelope-from ) id 1lZcir-0000SG-4p; Thu, 22 Apr 2021 13:02:37 -0400 Date: Thu, 22 Apr 2021 13:02:36 -0400 From: Rik van Riel To: linux-kernel@vger.kernel.org Cc: kernel-team@fb.com, Vincent Guittot , Valentin Schneider , Peter Zijlstra , Ingo Molnar , Dietmar Eggemann , Mel Gorman Subject: [PATCH v4] sched,fair: Skip newidle_balance if a wakeup is pending Message-ID: <20210422130236.0bb353df@imladris.surriel.com> X-Mailer: Claws Mail 3.17.6 (GTK+ 2.24.32; x86_64-redhat-linux-gnu) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: riel@shelob.surriel.com Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org The try_to_wake_up function has an optimization where it can queue a task for wakeup on its previous CPU, if the task is still in the middle of going to sleep inside schedule(). Once schedule() re-enables IRQs, the task will be woken up with an IPI, and placed back on the runqueue. If we have such a wakeup pending, there is no need to search other CPUs for runnable tasks. Just skip (or bail out early from) newidle balancing, and run the just woken up task. For a memcache like workload test, this reduces total CPU use by about 2%, proportionally split between user and system time, and p99 and p95 application response time by 10% on average. The schedstats run_delay number shows a similar improvement. Signed-off-by: Rik van Riel --- kernel/sched/fair.c | 11 ++++++++++- 1 file changed, 10 insertions(+), 1 deletion(-) diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c index 69680158963f..6a18688a37f8 100644 --- a/kernel/sched/fair.c +++ b/kernel/sched/fair.c @@ -10594,6 +10594,14 @@ static int newidle_balance(struct rq *this_rq, struct rq_flags *rf) u64 curr_cost = 0; update_misfit_status(NULL, this_rq); + + /* + * There is a task waiting to run. No need to search for one. + * Return 0; the task will be enqueued when switching to idle. + */ + if (this_rq->ttwu_pending) + return 0; + /* * We must set idle_stamp _before_ calling idle_balance(), such that we * measure the duration of idle_balance() as idle time. @@ -10661,7 +10669,8 @@ static int newidle_balance(struct rq *this_rq, struct rq_flags *rf) * Stop searching for tasks to pull if there are * now runnable tasks on this rq. */ - if (pulled_task || this_rq->nr_running > 0) + if (pulled_task || this_rq->nr_running > 0 || + this_rq->ttwu_pending) break; } rcu_read_unlock(); -- 2.25.4