Received: by 2002:ac0:a5a6:0:0:0:0:0 with SMTP id m35-v6csp717138imm; Wed, 29 Aug 2018 10:21:37 -0700 (PDT) X-Google-Smtp-Source: ANB0Vdasd/e+PjWpo+gl5QOmSRQjH2/Rt6pVPjEliYIOevokJL3FSjs1UdjdPqjx3ODrduCy/Vvu X-Received: by 2002:a62:41d6:: with SMTP id g83-v6mr6881031pfd.219.1535563297746; Wed, 29 Aug 2018 10:21:37 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1535563297; cv=none; d=google.com; s=arc-20160816; b=MkACDLf6Df1aUXM3w3m0tcmu97UoP/KZ2GeYba+qL1EjAVOSJAab/8irulclp92BWi khIwOmhTTl0x7GpWh+Hun8JnW6gv7XsewkOtFuuKcfaFduAp4zBNHPDTBZUjXcMiUsB6 5Yow9dFXK2pMYCpIEj1Py3pWCq8KTV9o/7bwNZk23uy9OQMxBJOIO20lxNS/qxIHwbsY Kd3J7oZ1WPSzeEcTkU8kpCGxKorTozonZbCWgDeUUzZd588JkYuvYANgHSfwhyLe09ql 41jatj87+0qK3sIrGpEfUulLAr1UebtydkNY9haeXXysf1plCSQSZk4dAv0nN7b/AdA+ 9SKg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=eLg43/cNxQXSP2z5nKpFkBiFanXNQrBKkR9+vgtvhoU=; b=CWx3x3p9eAODEFkAwjTlAoH2lddQGO2dEL3q/A7YSWqa8ZuM/WeMttXV9CL94Fnekp 0qqJr7M3e0EooSp/AUtL6yN2nDYaVmioLCT+zGeCL6eB1y/MfXtOiNglGhfILZzDkOc1 zEWQxQNZ5+SAIXn5sK3NTWlX+y2uF19WE3wLWghUoJLE3s+OpDlZ/FMfZGGveMY/Qm4/ zqsz6ExUINuDm1lrPDJsggmb73JXigWQxUE2d8z3itdyOP6opdmWuzerEDUKdEiXRxGl ZcE+PUglrljpcUa58KL/r4ETryfvqwoftYKOCAHpXSmK5nYFHELfkghV97tYbdb1+Xtk +I0g== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 9-v6si3993142pgq.229.2018.08.29.10.21.22; Wed, 29 Aug 2018 10:21:37 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727859AbeH2VSG (ORCPT + 99 others); Wed, 29 Aug 2018 17:18:06 -0400 Received: from usa-sjc-mx-foss1.foss.arm.com ([217.140.101.70]:58650 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727444AbeH2VSG (ORCPT ); Wed, 29 Aug 2018 17:18:06 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.72.51.249]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 2F39C18A; Wed, 29 Aug 2018 10:20:12 -0700 (PDT) Received: from queper01-lin (queper01-lin.emea.arm.com [10.4.13.27]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 399223F721; Wed, 29 Aug 2018 10:20:08 -0700 (PDT) Date: Wed, 29 Aug 2018 18:20:06 +0100 From: Quentin Perret To: Patrick Bellasi Cc: peterz@infradead.org, rjw@rjwysocki.net, linux-kernel@vger.kernel.org, linux-pm@vger.kernel.org, gregkh@linuxfoundation.org, mingo@redhat.com, dietmar.eggemann@arm.com, morten.rasmussen@arm.com, chris.redpath@arm.com, valentin.schneider@arm.com, vincent.guittot@linaro.org, thara.gopinath@linaro.org, viresh.kumar@linaro.org, tkjos@google.com, joel@joelfernandes.org, smuckle@google.com, adharmap@codeaurora.org, skannan@codeaurora.org, pkondeti@codeaurora.org, juri.lelli@redhat.com, edubezval@gmail.com, srinivas.pandruvada@linux.intel.com, currojerez@riseup.net, javi.merino@kernel.org Subject: Re: [PATCH v6 07/14] sched/topology: Introduce sched_energy_present static key Message-ID: <20180829172004.afbe2oukprvptqs2@queper01-lin> References: <20180820094420.26590-1-quentin.perret@arm.com> <20180820094420.26590-8-quentin.perret@arm.com> <20180829165058.GR2960@e110439-lin> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180829165058.GR2960@e110439-lin> User-Agent: NeoMutt/20171215 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wednesday 29 Aug 2018 at 17:50:58 (+0100), Patrick Bellasi wrote: > > +/* > > + * The complexity of the Energy Model is defined as: nr_pd * (nr_cpus + nr_cs) > > + * with: 'nr_pd' the number of performance domains; 'nr_cpus' the number of > > + * CPUs; and 'nr_cs' the sum of the capacity states numbers of all performance > > + * domains. > > + * > > + * It is generally not a good idea to use such a model in the wake-up path on > > + * very complex platforms because of the associated scheduling overheads. The > > + * arbitrary constraint below prevents that. It makes EAS usable up to 16 CPUs > > + * with per-CPU DVFS and less than 8 capacity states each, for example. > > According to the formula above, that should give a "complexity value" of: > > 16 * (16 + 9) = 384 > > while, 2K complexity seems more like a 40xCPUs system with 8 OPPs. > > Maybe we should update either the example or the constant below ? Hmm I guess the example isn't really clear. 'nr_cs' is the _sum_ of the number of OPPs of all perf. domains. So, in the example above, if you have 16 CPUs with per-CPU DVFS, and each DVFS island has 8 OPPs, then nr_cs = 16 * 8 = 128. So if you apply the formula you get C = 16 * (16 + 128) = 2304, which is more than EM_MAX_COMPLEXITY, so EAS cannot start. If the DVFS island had 7 OPPs instead of 8 (for example) you would get nr_cs = 112, C = 2048, and so EAS could start. I can try to re-work that comment to explain things a bit better ... > > > + */ > > +#define EM_MAX_COMPLEXITY 2048 > > + > > static void build_perf_domains(const struct cpumask *cpu_map) > > { > > + int i, nr_pd = 0, nr_cs = 0, nr_cpus = cpumask_weight(cpu_map); > > struct perf_domain *pd = NULL, *tmp; > > int cpu = cpumask_first(cpu_map); > > struct root_domain *rd = cpu_rq(cpu)->rd; > > - int i; > > + > > + /* EAS is enabled for asymmetric CPU capacity topologies. */ > > + if (!per_cpu(sd_asym_cpucapacity, cpu)) { > > + if (sched_debug()) { > > + pr_info("rd %*pbl: CPUs do not have asymmetric capacities\n", > > + cpumask_pr_args(cpu_map)); > > + } > > + goto free; > > + } > > > > for_each_cpu(i, cpu_map) { > > /* Skip already covered CPUs. */ > > @@ -288,6 +318,21 @@ static void build_perf_domains(const struct cpumask *cpu_map) > > goto free; > > tmp->next = pd; > > pd = tmp; > > + > > + /* > > + * Count performance domains and capacity states for the > > + * complexity check. > > + */ > > + nr_pd++; > > A special case where EAS is not going to be used is for systems where > nr_pd matches the number of online CPUs, isn't it ? Well, it depends. Say you have only 4 CPUs with 3 OPPs each. Even with per-CPU DVFS the complexity is low enough to start EAS. I don't really see a good reason for not doing so no ? > > If that's the case, then, by caching this nr_pd you can probably check > this condition in the sched_energy_start() and bail out even faster by > avoiding to scan all the doms_new's pd ? > > > > + nr_cs += em_pd_nr_cap_states(pd->obj); > > + } > > + > > + /* Bail out if the Energy Model complexity is too high. */ > > + if (nr_pd * (nr_cs + nr_cpus) > EM_MAX_COMPLEXITY) { > > + if (sched_debug()) > > + pr_info("rd %*pbl: EM complexity is too high\n ", > > + cpumask_pr_args(cpu_map)); > > + goto free; > > } > > > > perf_domain_debug(cpu_map, pd); > > @@ -307,6 +352,35 @@ static void build_perf_domains(const struct cpumask *cpu_map) > > if (tmp) > > call_rcu(&tmp->rcu, destroy_perf_domain_rcu); > > } > > + > > +static void sched_energy_start(int ndoms_new, cpumask_var_t doms_new[]) > > +{ > > + /* > > + * The conditions for EAS to start are checked during the creation of > > + * root domains. If one of them meets all conditions, it will have a > > + * non-null list of performance domains. > > + */ > > + while (ndoms_new) { > > + if (cpu_rq(cpumask_first(doms_new[ndoms_new - 1]))->rd->pd) > > + goto enable; > > + ndoms_new--; > > + } > > + > > + if (static_branch_unlikely(&sched_energy_present)) { > ^^^^^^^^ > Is this defined unlikely to reduce overheads on systems which never > satisfy all the conditions above while still rebuild SDs from time to > time ? Something like that. I just thought that the case where EAS needs to be disabled after being enabled isn't very common. I mean, the most typical use-case is, EAS is enabled at boot and stays enabled forever, or EAS never gets enabled. Enabling/disabling EAS because of hotplug (for example) can definitely happen, but that shouldn't be the case very often in practice, I think. So we can optimize things out a bit I suppose. Thanks ! Quentin