Received: by 2002:a05:6520:4211:b029:f4:110d:56bc with SMTP id o17csp1321716lkv; Thu, 29 Apr 2021 03:21:49 -0700 (PDT) X-Google-Smtp-Source: ABdhPJz/0DGIbLG8qhtBGEZ+QBdEIMvZm9oaQuNwBJXDJxF2Whf664RuA5NKJat+UU7fHhIeVmrh X-Received: by 2002:a63:6ec3:: with SMTP id j186mr31223927pgc.249.1619691709370; Thu, 29 Apr 2021 03:21:49 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1619691709; cv=none; d=google.com; s=arc-20160816; b=pPitfeo2j2N10XRQ2w80OHSouRAOhuJNwZIikl49q/k2klsf7Rf+3oR8KKBaXlKB4j L2ajULST8cdH4RLS1QuK1dgJAY5XZtXJuVARXXTSeIJYSdu+/y8urHmAkuaoi/87vE85 XPgpea/+pe6VINTSd0suK6AexebOjZNbUN4CDzft5KpV9sDlSnr77sbMH2URNT1whqcJ X/K61zVteRVpRZV12hZVwwEVYYZwZsKdjWf/6FSy2/A2XZfCFPt/Azy3f1TyGavaVFlu KhgNEQWNEPWNVbEfzNRF4klGRnnY4yjTYhbiSQKJQ72OnkjsbBUD5XwxM2pdGIWqn84Z /72A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:references:in-reply-to:message-id:date:subject :cc:to:from; bh=4YRCMdGYTKFNDbXcTmyBcRnZo/YNP7i5eFDxWQRNhco=; b=rryfeR7WAuglBypNZpvsSsZsC0iX5m8Ctzq5inTtIfFY5ThAXFMpA4UFl84C8UsL93 2atK4TXklygrax2i9617YEKCuAqvN13E7KMJrbBIkPWIywxad2G0rMu0H6052Mdvgiwe fZZfSYMmeKG1geTkMyCeqCrxTfA4WKXybKHT5XMFY3MdNBom+g/w3mhux2xK8mZFsAHu QEuiDGCsrHfe+brtKSZWZjQNhCtth5Q33OcK+SVXPQVR+d7zxS7GWYlHH4C98NX39fLw KtkJ3l1m8jr5QdxWrUda0Bh4LLrjQIrqv460ap2MESVmJugf3mw8282hv22YtilE3Dz7 cg4g== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id r18si3008771pls.423.2021.04.29.03.21.36; Thu, 29 Apr 2021 03:21:49 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S240402AbhD2KVP (ORCPT + 99 others); Thu, 29 Apr 2021 06:21:15 -0400 Received: from foss.arm.com ([217.140.110.172]:46170 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S240322AbhD2KU4 (ORCPT ); Thu, 29 Apr 2021 06:20:56 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id CDB1E106F; Thu, 29 Apr 2021 03:20:09 -0700 (PDT) Received: from e120189.arm.com (unknown [10.57.56.207]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPA id 9DAC63F70D; Thu, 29 Apr 2021 03:20:06 -0700 (PDT) From: Pierre.Gondois@arm.com To: linux-kernel@vger.kernel.org, xuewen.yan@unisoc.com, qperret@qperret.net, dietmar.eggemann@arm.com Cc: Lukasz.Luba@arm.com, Vincent.Donnefort@arm.com, Pierre Gondois , mingo@redhat.com, peterz@infradead.org, juri.lelli@redhat.com, vincent.guittot@linaro.org, rostedt@goodmis.org, bsegall@google.com, mgorman@suse.de, bristot@redhat.com Subject: [PATCH v2 2/2] sched/fair: Fix negative energy delta in find_energy_efficient_cpu() Date: Thu, 29 Apr 2021 11:19:48 +0100 Message-Id: <20210429101948.31224-3-Pierre.Gondois@arm.com> X-Mailer: git-send-email 2.17.1 In-Reply-To: <20210429101948.31224-1-Pierre.Gondois@arm.com> References: <20210429101948.31224-1-Pierre.Gondois@arm.com> Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Pierre Gondois find_energy_efficient_cpu() (feec()) searches the best energy CPU to place a task on. To do so, compute_energy() estimates the energy impact of placing the task on a CPU, based on CPU and task utilization signals. Utilization signals can be concurrently updated while evaluating a performance domain (pd). In some cases, this leads to having a 'negative delta', i.e. placing the task in the pd is seen as an energy gain. Thus, any further energy comparison is biased. In case of a 'negative delta', return prev_cpu since: 1. a 'negative delta' happens in less than 0.5% of feec() calls, on a Juno with 6 CPUs (4 little, 2 big) 2. it is unlikely to have two consecutive 'negative delta' for a task, so if the first call fails, feec() will correctly place the task in the next feec() call 3. EAS current behavior tends to select prev_cpu if the task doesn't raise the OPP of its current pd. prev_cpu is EAS's generic decision 4. prev_cpu should be preferred to returning an error code. In the latter case, select_idle_sibling() would do the placement, selecting a big (and not energy efficient) CPU. As 3., the task would potentially reside on the big CPU for a long time Reported-by: Xuewen Yan Suggested-by: Xuewen Yan Signed-off-by: Pierre Gondois --- kernel/sched/fair.c | 28 ++++++++++++++++------------ 1 file changed, 16 insertions(+), 12 deletions(-) diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c index c5351366fb93..935f1ea267a9 100644 --- a/kernel/sched/fair.c +++ b/kernel/sched/fair.c @@ -6594,15 +6594,15 @@ static int find_energy_efficient_cpu(struct task_struct *p, int prev_cpu) { unsigned long prev_delta = ULONG_MAX, best_delta = ULONG_MAX; struct root_domain *rd = cpu_rq(smp_processor_id())->rd; + int cpu, best_energy_cpu = prev_cpu, target = -1; unsigned long cpu_cap, util, base_energy = 0; - int cpu, best_energy_cpu = prev_cpu; struct sched_domain *sd; struct perf_domain *pd; rcu_read_lock(); pd = rcu_dereference(rd->pd); if (!pd || READ_ONCE(rd->overutilized)) - goto fail; + goto unlock; /* * Energy-aware wake-up happens on the lowest sched_domain starting @@ -6612,7 +6612,9 @@ static int find_energy_efficient_cpu(struct task_struct *p, int prev_cpu) while (sd && !cpumask_test_cpu(prev_cpu, sched_domain_span(sd))) sd = sd->parent; if (!sd) - goto fail; + goto unlock; + + target = prev_cpu; sync_entity_load_avg(&p->se); if (!task_util_est(p)) @@ -6666,6 +6668,8 @@ static int find_energy_efficient_cpu(struct task_struct *p, int prev_cpu) if (compute_prev_delta) { prev_delta = compute_energy(p, prev_cpu, pd); + if (prev_delta < base_energy_pd) + goto unlock; prev_delta -= base_energy_pd; best_delta = min(best_delta, prev_delta); } @@ -6673,6 +6677,8 @@ static int find_energy_efficient_cpu(struct task_struct *p, int prev_cpu) /* Evaluate the energy impact of using this CPU. */ if (max_spare_cap_cpu >= 0) { cur_delta = compute_energy(p, max_spare_cap_cpu, pd); + if (cur_delta < base_energy_pd) + goto unlock; cur_delta -= base_energy_pd; if (cur_delta < best_delta) { best_delta = cur_delta; @@ -6680,25 +6686,23 @@ static int find_energy_efficient_cpu(struct task_struct *p, int prev_cpu) } } } -unlock: + rcu_read_unlock(); /* * Pick the best CPU if prev_cpu cannot be used, or if it saves at * least 6% of the energy used by prev_cpu. */ - if (prev_delta == ULONG_MAX) - return best_energy_cpu; - - if ((prev_delta - best_delta) > ((prev_delta + base_energy) >> 4)) - return best_energy_cpu; + if ((prev_delta == ULONG_MAX) || + (prev_delta - best_delta) > ((prev_delta + base_energy) >> 4)) + target = best_energy_cpu; - return prev_cpu; + return target; -fail: +unlock: rcu_read_unlock(); - return -1; + return target; } /* -- 2.17.1