Received: by 10.192.165.156 with SMTP id m28csp303546imm; Tue, 17 Apr 2018 10:22:33 -0700 (PDT) X-Google-Smtp-Source: AIpwx4/8z4aF7brazL91CRN/7SGR/zJqp7qdPKyaJOz8YayXZNlolKtoCaPQ7/QWx4sZ+t8zunuY X-Received: by 10.98.23.134 with SMTP id 128mr2750304pfx.120.1523985752882; Tue, 17 Apr 2018 10:22:32 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1523985752; cv=none; d=google.com; s=arc-20160816; b=t4G3NMZTwUGZzo8Qx7mJFQihtwHj+BfXM3HRzfgKPpxKVWsmEicg8ur3D15djdC2NZ TmxcFm9lzV8RuVD/JZqJR8Z/ERgyoOyiOv6q2pccuc7D7g2yA9LFm+Bpn0RmgYrGkCAY J6pJUfjoJPoDWDsxekOzM53p4OjzlfjS5KZ4wKrXIC+DsxTn23ayq/J42Zxfl7+VVFek nQNe5gYrAwb5qIXH66bUIlRwGPFlsuO0VmR7O9A63h3B8pOPtYExHMP0c6e94hJjFBRp eEOroPbJP+vvrxnBAGN2Gf+/Y6v3+qojOGNn62KyV7Bz+xAmCKrNQ0ypdipq6wURvR2o Dcsg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature:arc-authentication-results; bh=ublVbYgVxsqLrJ3N+sE6YPYuq3MntBs3H+XOBxx6P4U=; b=OLesXsbp/3g7p91SOrg5A5lpB2MDkyv+BohRepauhK9XgfYQjYY0K0Avdx3SmVDkqt InegeGBevdqQj5EmikhwtzOl7M0p6OXItQu15V20WCa08EjStC4JTeuP/ohzbm4Ekl4c gxFlKGT8C2vj5qtkDBHqYkfoTCG16lyboN/Bp9C2RnEDLULLPir3slBk+tATBnauphFt tx3Goa6TjW6bBKudEk1HaVoGv/rsZOhAMYSnOd0GMANohivn/oNqMjKyb4JsCHs3R3GY IrwL8tDmP8WN9iknd2cfSaQAlzN/hQxR04Gl5QlUxnqWUhBsvVwVAdlhOcenSCYZJySc EOQw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=P2d22Ulv; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id q12-v6si10609286pll.467.2018.04.17.10.22.18; Tue, 17 Apr 2018 10:22:32 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=P2d22Ulv; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752915AbeDQPWg (ORCPT + 99 others); Tue, 17 Apr 2018 11:22:36 -0400 Received: from mail-ot0-f194.google.com ([74.125.82.194]:45396 "EHLO mail-ot0-f194.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752826AbeDQPWd (ORCPT ); Tue, 17 Apr 2018 11:22:33 -0400 Received: by mail-ot0-f194.google.com with SMTP id w4-v6so2794141ote.12 for ; Tue, 17 Apr 2018 08:22:33 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=ublVbYgVxsqLrJ3N+sE6YPYuq3MntBs3H+XOBxx6P4U=; b=P2d22UlvE0QfP/JTmOq/Wt7oSlZoFmcyuovhjd/yAtfiMU+rQoumVdALvKLZYfLv3I shotCnSo6cLm4iZlbx3t6mTafDaTpN9VswC9QsJ/d/RrsH0Pq7TrgA+/dENImOKa1qWj RudrRamHP5ccLVCia6DPKnR0zEe3zZAmpDFOo= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to:user-agent; bh=ublVbYgVxsqLrJ3N+sE6YPYuq3MntBs3H+XOBxx6P4U=; b=uDEgsivd1DYCfoc2QieblyPx+eSbJk4TrGfkVeQJi26bvtc8f8m1OGD93WpizStfTr 1jaA/SJPOEOFs4D+yStBgKDU/qmW+njYGE2x5ehxPln2kiBcmz01FtzP+krGrDnDawfq YQ1UFJjG3FaZSxq5bM/+wUc4MhxIjE4P/DuUVj6F9X+7Dd6HPs+wk7voMgz8a1qfI1jd vO96Fio/Beeq4rsD3EOp+7J6bRcjKuIJsTkskHHXoUGCEjz5UyzQ+170y1l1daxdnwSY no8uXF5JfW6qF/S5rZH3+n2+LK9Onv9AjbrV286PrgfiGQuj9EFAdb8Qz+9K23oElUvE PWvw== X-Gm-Message-State: ALQs6tCnMeMHEHo/6Xir9Y8JTHB11XJxiaQygSBUafioz1KrGB7pUfyg xsau/aQ5vm0LCfr7aEMmmhx9zw== X-Received: by 2002:a9d:16ac:: with SMTP id c41-v6mr1599357ote.377.1523978552640; Tue, 17 Apr 2018 08:22:32 -0700 (PDT) Received: from leoy-ThinkPad-X240s (li973-238.members.linode.com. [45.33.19.238]) by smtp.gmail.com with ESMTPSA id t12-v6sm9153432otj.39.2018.04.17.08.22.20 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Tue, 17 Apr 2018 08:22:31 -0700 (PDT) Date: Tue, 17 Apr 2018 23:22:13 +0800 From: Leo Yan To: Dietmar Eggemann Cc: linux-kernel@vger.kernel.org, Peter Zijlstra , Quentin Perret , Thara Gopinath , linux-pm@vger.kernel.org, Morten Rasmussen , Chris Redpath , Patrick Bellasi , Valentin Schneider , "Rafael J . Wysocki" , Greg Kroah-Hartman , Vincent Guittot , Viresh Kumar , Todd Kjos , Joel Fernandes , Juri Lelli , Steve Muckle , Eduardo Valentin Subject: Re: [RFC PATCH v2 4/6] sched/fair: Introduce an energy estimation helper function Message-ID: <20180417152213.GC18509@leoy-ThinkPad-X240s> References: <20180406153607.17815-1-dietmar.eggemann@arm.com> <20180406153607.17815-5-dietmar.eggemann@arm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180406153607.17815-5-dietmar.eggemann@arm.com> User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Apr 06, 2018 at 04:36:05PM +0100, Dietmar Eggemann wrote: > From: Quentin Perret > > In preparation for the definition of an energy-aware wakeup path, a > helper function is provided to estimate the consequence on system energy > when a specific task wakes-up on a specific CPU. compute_energy() > estimates the OPPs to be reached by all frequency domains and estimates > the consumption of each online CPU according to its energy model and its > percentage of busy time. > > Cc: Ingo Molnar > Cc: Peter Zijlstra > Signed-off-by: Quentin Perret > Signed-off-by: Dietmar Eggemann > --- > include/linux/sched/energy.h | 20 +++++++++++++ > kernel/sched/fair.c | 68 ++++++++++++++++++++++++++++++++++++++++++++ > kernel/sched/sched.h | 2 +- > 3 files changed, 89 insertions(+), 1 deletion(-) > > diff --git a/include/linux/sched/energy.h b/include/linux/sched/energy.h > index 941071eec013..b4110b145228 100644 > --- a/include/linux/sched/energy.h > +++ b/include/linux/sched/energy.h > @@ -27,6 +27,24 @@ static inline bool sched_energy_enabled(void) > return static_branch_unlikely(&sched_energy_present); > } > > +static inline > +struct capacity_state *find_cap_state(int cpu, unsigned long util) > +{ > + struct sched_energy_model *em = *per_cpu_ptr(energy_model, cpu); > + struct capacity_state *cs = NULL; > + int i; > + > + util += util >> 2; > + > + for (i = 0; i < em->nr_cap_states; i++) { > + cs = &em->cap_states[i]; > + if (cs->cap >= util) > + break; > + } > + > + return cs; 'cs' is possible to return NULL. > +} > + > static inline struct cpumask *freq_domain_span(struct freq_domain *fd) > { > return &fd->span; > @@ -42,6 +60,8 @@ struct freq_domain; > static inline bool sched_energy_enabled(void) { return false; } > static inline struct cpumask > *freq_domain_span(struct freq_domain *fd) { return NULL; } > +static inline struct capacity_state > +*find_cap_state(int cpu, unsigned long util) { return NULL; } > static inline void init_sched_energy(void) { } > #define for_each_freq_domain(fdom) for (; fdom; fdom = NULL) > #endif > diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c > index 6960e5ef3c14..8cb9fb04fff2 100644 > --- a/kernel/sched/fair.c > +++ b/kernel/sched/fair.c > @@ -6633,6 +6633,74 @@ static int wake_cap(struct task_struct *p, int cpu, int prev_cpu) > } > > /* > + * Returns the util of "cpu" if "p" wakes up on "dst_cpu". > + */ > +static unsigned long cpu_util_next(int cpu, struct task_struct *p, int dst_cpu) > +{ > + unsigned long util, util_est; > + struct cfs_rq *cfs_rq; > + > + /* Task is where it should be, or has no impact on cpu */ > + if ((task_cpu(p) == dst_cpu) || (cpu != task_cpu(p) && cpu != dst_cpu)) > + return cpu_util(cpu); > + > + cfs_rq = &cpu_rq(cpu)->cfs; > + util = READ_ONCE(cfs_rq->avg.util_avg); > + > + if (dst_cpu == cpu) > + util += task_util(p); > + else > + util = max_t(long, util - task_util(p), 0); I tried to understand the logic at here, below code is more clear for myself: int prev_cpu = task_cpu(p); cfs_rq = &cpu_rq(cpu)->cfs; util = READ_ONCE(cfs_rq->avg.util_avg); /* Bail out if src and dst CPUs are the same one */ if (prev_cpu == cpu && dst_cpu == cpu) return util; /* Remove task utilization for src CPU */ if (cpu == prev_cpu) util = max_t(long, util - task_util(p), 0); /* Add task utilization for dst CPU */ if (dst_cpu == cpu) util += task_util(p); BTW, CPU utilization is decayed value and task_util() is not decayed value, so 'util - task_util(p)' calculates a smaller value than the prev CPU pure utilization, right? Another question is can we reuse the function cpu_util_wake() and just compenstate task util for dst cpu? > + if (sched_feat(UTIL_EST)) { > + util_est = READ_ONCE(cfs_rq->avg.util_est.enqueued); > + if (dst_cpu == cpu) > + util_est += _task_util_est(p); > + else > + util_est = max_t(long, util_est - _task_util_est(p), 0); > + util = max(util, util_est); > + } > + > + return min_t(unsigned long, util, capacity_orig_of(cpu)); > +} > + > +/* > + * Estimates the system level energy assuming that p wakes-up on dst_cpu. > + * > + * compute_energy() is safe to call only if an energy model is available for > + * the platform, which is when sched_energy_enabled() is true. > + */ > +static unsigned long compute_energy(struct task_struct *p, int dst_cpu) > +{ > + unsigned long util, max_util, sum_util; > + struct capacity_state *cs; > + unsigned long energy = 0; > + struct freq_domain *fd; > + int cpu; > + > + for_each_freq_domain(fd) { > + max_util = sum_util = 0; > + for_each_cpu_and(cpu, freq_domain_span(fd), cpu_online_mask) { > + util = cpu_util_next(cpu, p, dst_cpu); > + util += cpu_util_dl(cpu_rq(cpu)); > + max_util = max(util, max_util); > + sum_util += util; > + } > + > + /* > + * Here we assume that the capacity states of CPUs belonging to > + * the same frequency domains are shared. Hence, we look at the > + * capacity state of the first CPU and re-use it for all. > + */ > + cpu = cpumask_first(freq_domain_span(fd)); > + cs = find_cap_state(cpu, max_util); > + energy += cs->power * sum_util / cs->cap; > + } This means all CPUs will be iterated for calculation, the complexity is O(n)... Thanks, Leo Yan > + return energy; > +} > + > +/* > * select_task_rq_fair: Select target runqueue for the waking task in domains > * that have the 'sd_flag' flag set. In practice, this is SD_BALANCE_WAKE, > * SD_BALANCE_FORK, or SD_BALANCE_EXEC. > diff --git a/kernel/sched/sched.h b/kernel/sched/sched.h > index 5d552c0d7109..6eb38f41d5d9 100644 > --- a/kernel/sched/sched.h > +++ b/kernel/sched/sched.h > @@ -2156,7 +2156,7 @@ static inline void cpufreq_update_util(struct rq *rq, unsigned int flags) {} > # define arch_scale_freq_invariant() false > #endif > > -#ifdef CONFIG_CPU_FREQ_GOV_SCHEDUTIL > +#ifdef CONFIG_SMP > static inline unsigned long cpu_util_dl(struct rq *rq) > { > return (rq->dl.running_bw * SCHED_CAPACITY_SCALE) >> BW_SHIFT; > -- > 2.11.0 >