Received: by 2002:a25:31c3:0:0:0:0:0 with SMTP id x186csp1378184ybx; Thu, 31 Oct 2019 09:43:09 -0700 (PDT) X-Google-Smtp-Source: APXvYqwL2N0GC5e2Kiim27f7NyYUnzBhRov0GjWhXKaC4f6WUXBUfqVXlTvp+2u8LwNb1iWIL4V9 X-Received: by 2002:aa7:ca0c:: with SMTP id y12mr7155012eds.224.1572540189551; Thu, 31 Oct 2019 09:43:09 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1572540189; cv=none; d=google.com; s=arc-20160816; b=u027IkPNs5i8+X7WZQ3/xSmdoOxBIKk3qbOXvqEdxXsDUmHdM77eddj+CHlsqZN/zh 1P2jpv/309/bAHOqwSskFCZdUwjZ7c6txqi9T/Hd0Jc9uRM3rHDWbgoBUVgTiHOev6y/ SH0r0mkJFGqhi8kad9sg4nil479ElWZfto2Dc/W+vlH4HqhTU6Vqe8MbSZGRQru5++ZE TO504zC4wWECtZdIwXQxUFuNtndoeylv001yOnnhNKjA2OspLm249CkriXqAeEfXeA2h kMQeGdJ08nWiJPFQNNp0nT9Bj/YTK+xjkDP3HPVOnOPq3tC7VajEX4nwI03BOXUzdZQV G0PA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=i3JA9BeNpdMvTYkpJ4Yfiu06Nv7u2cZK1AMJmkSO1JI=; b=npoAK9qEVvltpYscWG4LsSEqbGu11FwTb8Onl8+5IKyW0eOU7yfaCgdqC0VqF1wV+n EcGNgnwIFcBNFkkwOJOiLGM57HLvJP5VbVB+3MhjPHU9qHEsVuKgtNR0iqyexkofqj+s WfFCFIB3E9UfB3mfPI8Yst1bLZWH7haYI1K+ffYEiSud3sYFW1VS2qXbQSgEIpvRv0gM r0VA+kZpqvbvhzdrFNZXW2V/Tav03Z5iPYSnSqOBdFnUIrMQWOHP5ZTzbBbUUQlTae6C So2nfUxaqi+ZOwJH6JESIC0WS7PL6d3RAtWaQwQB4pdRcN4Y96xVhqUXQ47I0xIxyzs4 eP6Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=mcEHczdy; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id x37si4960310edd.228.2019.10.31.09.42.46; Thu, 31 Oct 2019 09:43:09 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=mcEHczdy; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728628AbfJaQil (ORCPT + 99 others); Thu, 31 Oct 2019 12:38:41 -0400 Received: from mail-lf1-f66.google.com ([209.85.167.66]:46464 "EHLO mail-lf1-f66.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727593AbfJaQil (ORCPT ); Thu, 31 Oct 2019 12:38:41 -0400 Received: by mail-lf1-f66.google.com with SMTP id 19so41070lft.13 for ; Thu, 31 Oct 2019 09:38:37 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=i3JA9BeNpdMvTYkpJ4Yfiu06Nv7u2cZK1AMJmkSO1JI=; b=mcEHczdyxkQ5gZdvEHh4SuqAiwcQMlKf0RBL4m0W0ShsIc428LgOrXULW53mmu1/Qg sT/2Zw4kOUfb1Nqk1ai9cS6Ogo2EdhLU/ppfa3z7TY1eLsA0VISLpllU9yNe8MjWbgUK pVPjHsUkm4iut7qutxV8ZwFC4uGQUN74qcZTSKtN6HWAPcL1hoRQi2JsFJxSETAJjmNP cGEnl9KhOk4refjQFXbUbbwTd4ZafTpr9FsDObL+pMfPn9oJdRNMG3CWZueWxvr5qChf 55CRPsuRFPNS3wxChrlkOKOabcoqSSgzs/wP1LTGLdSIjthbt/isColAtDPNIWtVhRk1 NUSQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=i3JA9BeNpdMvTYkpJ4Yfiu06Nv7u2cZK1AMJmkSO1JI=; b=aEnzrNznid0t5a1J6JHv+hm3V+nfvza1g/LO9UN7twrbtijNRdjrTW3IGzNi7zdVXm a0fqajTnSkJ8gsfweG9BBHXCQ68KTkZEzzorbqJpBejfr4iHTMjps62PSXM/zlHkRkLe znCB696dgS5WNIfqo+dxv30dIUHfuU81lZU616YdSIxymhf7JMUlD4lyKhWQdamzgNSP zRDtjv14ZsECorlmrd00+e1Vzm0UY6Khw9MXLIUy46AlCiTZuwKKf/ICePi1HfUCIVxO mZxLXN7gH3a6g6golvk22Axbku3yGbvVLJ5OOa/aINnqeBVeITsWIK5pBnJZ/zLnUDvE s7BA== X-Gm-Message-State: APjAAAX/OhrOWan6JM4goQ5e5tWXQHXfpQj9Z1jBa2V/2FTqz4vvQDzr LOEk429wBZrpuOQxBcKIzgOayPLr8Bgj25d4ho0cNg== X-Received: by 2002:ac2:4650:: with SMTP id s16mr4454635lfo.32.1572539916724; Thu, 31 Oct 2019 09:38:36 -0700 (PDT) MIME-Version: 1.0 References: <1571776465-29763-1-git-send-email-thara.gopinath@linaro.org> <1571776465-29763-6-git-send-email-thara.gopinath@linaro.org> <2b19d7da-412c-932f-7251-110eadbef3e3@arm.com> In-Reply-To: <2b19d7da-412c-932f-7251-110eadbef3e3@arm.com> From: Vincent Guittot Date: Thu, 31 Oct 2019 17:38:25 +0100 Message-ID: Subject: Re: [Patch v4 5/6] thermal/cpu-cooling: Update thermal pressure in case of a maximum frequency capping To: Dietmar Eggemann Cc: Thara Gopinath , Ingo Molnar , Peter Zijlstra , Ionela Voinescu , Zhang Rui , Eduardo Valentin , Quentin Perret , linux-kernel , Amit Kachhap , Javi Merino , Daniel Lezcano Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, 31 Oct 2019 at 17:29, Dietmar Eggemann wrote: > > On 22.10.19 22:34, Thara Gopinath wrote: > > Thermal governors can request for a cpu's maximum supported frequency > > to be capped in case of an overheat event. This in turn means that the > > maximum capacity available for tasks to run on the particular cpu is > > reduced. Delta between the original maximum capacity and capped > > maximum capacity is known as thermal pressure. Enable cpufreq cooling > > device to update the thermal pressure in event of a capped > > maximum frequency. > > > > Signed-off-by: Thara Gopinath > > --- > > drivers/thermal/cpu_cooling.c | 31 +++++++++++++++++++++++++++++-- > > 1 file changed, 29 insertions(+), 2 deletions(-) > > > > diff --git a/drivers/thermal/cpu_cooling.c b/drivers/thermal/cpu_cooling.c > > index 391f397..2e6a979 100644 > > --- a/drivers/thermal/cpu_cooling.c > > +++ b/drivers/thermal/cpu_cooling.c > > @@ -218,6 +218,23 @@ static u32 cpu_power_to_freq(struct cpufreq_cooling_device *cpufreq_cdev, > > } > > > > /** > > + * update_sched_max_capacity - update scheduler about change in cpu > > + * max frequency. > > + * @policy - cpufreq policy whose max frequency is capped. > > + */ > > +static void update_sched_max_capacity(struct cpumask *cpus, > > + unsigned int cur_max_freq, > > + unsigned int max_freq) > > +{ > > + int cpu; > > + unsigned long capacity = (cur_max_freq << SCHED_CAPACITY_SHIFT) / > > + max_freq; > > + > > + for_each_cpu(cpu, cpus) > > + update_thermal_pressure(cpu, capacity); > > +} > > + > > +/** > > * get_load() - get load for a cpu since last updated > > * @cpufreq_cdev: &struct cpufreq_cooling_device for this cpu > > * @cpu: cpu number > > @@ -320,6 +337,7 @@ static int cpufreq_set_cur_state(struct thermal_cooling_device *cdev, > > unsigned long state) > > { > > struct cpufreq_cooling_device *cpufreq_cdev = cdev->devdata; > > + int ret; > > > > /* Request state should be less than max_level */ > > if (WARN_ON(state > cpufreq_cdev->max_level)) > > @@ -331,8 +349,17 @@ static int cpufreq_set_cur_state(struct thermal_cooling_device *cdev, > > > > cpufreq_cdev->cpufreq_state = state; > > > > - return dev_pm_qos_update_request(&cpufreq_cdev->qos_req, > > - cpufreq_cdev->freq_table[state].frequency); > > + ret = dev_pm_qos_update_request > > + (&cpufreq_cdev->qos_req, > > + cpufreq_cdev->freq_table[state].frequency); > > + > > + if (ret > 0) > > + update_sched_max_capacity > > + (cpufreq_cdev->policy->cpus, > > + cpufreq_cdev->freq_table[state].frequency, > > + cpufreq_cdev->policy->cpuinfo.max_freq); > > + > > + return ret; > > } > > > > /** > > > > Why not getting rid of update_sched_max_capacity() entirely and call > update_thermal_pressure() in cpu_cooling.c directly? Saves one level in > the call chain and would mean less code for this feature. But you add complexity in update_thermal_pressure which now has to deal with a cpumask and to compute some frequency ratio IMHO, it's cleaner to keep update_thermal_pressure simple as it is now > > Just compile tested on arm64: > > diff --git a/drivers/thermal/cpu_cooling.c b/drivers/thermal/cpu_cooling.c > index 3211b4d3a899..bf36995013b0 100644 > --- a/drivers/thermal/cpu_cooling.c > +++ b/drivers/thermal/cpu_cooling.c > @@ -217,23 +217,6 @@ static u32 cpu_power_to_freq(struct > cpufreq_cooling_device *cpufreq_cdev, > return freq_table[i - 1].frequency; > } > > -/** > - * update_sched_max_capacity - update scheduler about change in cpu > - * max frequency. > - * @policy - cpufreq policy whose max frequency is capped. > - */ > -static void update_sched_max_capacity(struct cpumask *cpus, > - unsigned int cur_max_freq, > - unsigned int max_freq) > -{ > - int cpu; > - unsigned long capacity = (cur_max_freq << SCHED_CAPACITY_SHIFT) / > - max_freq; > - > - for_each_cpu(cpu, cpus) > - update_thermal_pressure(cpu, capacity); > -} > - > /** > * get_load() - get load for a cpu since last updated > * @cpufreq_cdev: &struct cpufreq_cooling_device for this cpu > @@ -353,7 +336,7 @@ static int cpufreq_set_cur_state(struct > thermal_cooling_device *cdev, > cpufreq_cdev->freq_table[state].frequency); > > if (ret > 0) > - update_sched_max_capacity > + update_thermal_pressure > (cpufreq_cdev->policy->cpus, > cpufreq_cdev->freq_table[state].frequency, > cpufreq_cdev->policy->cpuinfo.max_freq); > diff --git a/include/linux/sched.h b/include/linux/sched.h > index 55dfe9634f67..5707813c7621 100644 > --- a/include/linux/sched.h > +++ b/include/linux/sched.h > @@ -1985,9 +1985,9 @@ static inline void rseq_syscall(struct pt_regs *regs) > #endif > > #ifdef CONFIG_SMP > -void update_thermal_pressure(int cpu, u64 capacity); > +void update_thermal_pressure(struct cpumask *cpus, unsigned int cur, > unsigned int max); > #else > -static inline void update_thermal_pressure(int cpu, u64 capacity) > +static inline void update_thermal_pressure(struct cpumask *cpus, > unsigned int cur, unsigned int max); > { > } > #endif > diff --git a/kernel/sched/thermal.c b/kernel/sched/thermal.c > index 0da31e12a5ff..691bdd79597a 100644 > --- a/kernel/sched/thermal.c > +++ b/kernel/sched/thermal.c > @@ -43,17 +43,16 @@ static DEFINE_PER_CPU(unsigned long, delta_capacity); > * the arch_scale_cpu_capacity and capped capacity is stored in per cpu > * delta_capacity. > */ > -void update_thermal_pressure(int cpu, u64 capped_freq_ratio) > +void update_thermal_pressure(struct cpumask *cpus, unsigned int cur, > unsigned int max) > { > - unsigned long __capacity, delta; > + int cpu; > > - /* Normalize the capped freq ratio */ > - __capacity = (capped_freq_ratio * arch_scale_cpu_capacity(cpu)) >> > - > SCHED_CAPACITY_SHIFT; > - delta = arch_scale_cpu_capacity(cpu) - __capacity; > - pr_debug("updating cpu%d thermal pressure to %lu\n", cpu, delta); > + for_each_cpu(cpu, cpus) { > + unsigned long scale_cap = arch_scale_cpu_capacity(cpu); > + unsigned long cur_cap = cur * scale_cap / max; > > - per_cpu(delta_capacity, cpu) = delta; > + per_cpu(delta_capacity, cpu) = scale_cap - cur_cap; > + } > }