Received: by 2002:a25:1506:0:0:0:0:0 with SMTP id 6csp497323ybv; Thu, 13 Feb 2020 04:31:58 -0800 (PST) X-Google-Smtp-Source: APXvYqw5aReT3TYTzfSXkAauTXFcaxeWUghkXkOhlyHfwGgmTIKYzjzsnBQqjq1wIhqmllpVU9W+ X-Received: by 2002:a9d:3bc4:: with SMTP id k62mr13594591otc.186.1581597118468; Thu, 13 Feb 2020 04:31:58 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1581597118; cv=none; d=google.com; s=arc-20160816; b=ofYcojobfqMk+1RULGYc57XFxd9eP6CFvp1iZcct6KNiUZsaKrjx+yEbDiZwget//2 vf1ZbtmvD2iYr2F26Mg6MDT+Iu3ulKz7MHw9XNxfe4zc+in6tiSxxtbguNwA27lUjxm6 p3L6CN/pONoPdP5+ER8xdDjpbxiPmfWpvNPBdkKwFrU0Up7InRPm/uZ8AvzrRacvcyAP iRCVwexnc/n12ZNzpdv5uleMze2zjFBVNl5kW4CMahcuS1TtwIoOWGOF6PLhA0eHFUwq cduNnsmfTBpu9I9aRpnk84bJug3xpYi9Kv7fQ6WzUCRHpRp67Q0d+VqRgJVAULXKAYl+ 2NCA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=EZOpQ1PJkZ+dCWbPKJgvY1GJipZZjyyGhXZ1qG68CEU=; b=fxrlXgHX/okz1LHrGFseD4QUqSBEVZ6HOVjw1PccSNTbLrvX0NTPXwXALcWFpG8Ayn 8NbCzkEQtVTEyIvNVzp6jx3okJVckGZx34se3VnT7Ok2mBsRBGqYmz0N48WklUjTDpK0 4axi70dE39H8iTikIRFBBObMvCmp7QRWsbk4uXJzMLKH2QmLVWNFGTg4NS3tpzSYAziv G8OsqgI4cPGelzwK5VIsuV3yKEEBIaVSwoFNoHByWa8S7Q7sFaQUxlu6B8nGcb9OyLbF UDkGXj/s83ES2hDjgf2G/ZCgb91ddCPtJaVuReHYwDCMvcWYfB4fh02Ywnm4BHZWXuyz J53w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@verdurent-com.20150623.gappssmtp.com header.s=20150623 header.b=A7lnflGx; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id i6si1119504otp.5.2020.02.13.04.31.43; Thu, 13 Feb 2020 04:31:58 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@verdurent-com.20150623.gappssmtp.com header.s=20150623 header.b=A7lnflGx; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729855AbgBMMaI (ORCPT + 99 others); Thu, 13 Feb 2020 07:30:08 -0500 Received: from mail-vs1-f66.google.com ([209.85.217.66]:46941 "EHLO mail-vs1-f66.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1729531AbgBMMaI (ORCPT ); Thu, 13 Feb 2020 07:30:08 -0500 Received: by mail-vs1-f66.google.com with SMTP id t12so3429868vso.13 for ; Thu, 13 Feb 2020 04:30:05 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=verdurent-com.20150623.gappssmtp.com; s=20150623; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=EZOpQ1PJkZ+dCWbPKJgvY1GJipZZjyyGhXZ1qG68CEU=; b=A7lnflGx3LufOzmX9xh9MuPNG4eHrBdPNuQ/gEsP+8pQTd5HAKRhN5Ynr8VN2ZnU2Z epw8lUGPgvGO8ZP3Wsg+QkugfyqyZFhVlxoKzfse2ajlWZAwfrt0T6NBFkLXGalOt1ES xFaAFZI+j4RN+uN94/NcKP9iWihXfzIfIfg0kw60Uo/vu9tMk4WN+sINWdwi00HCEdqQ +zeTpOd0yZCqEpd+1EXesF4+E9XzjKCKK8o8dZiWJwudpUBssw/KmpSgOCKvKAPu9ZSj vmsha1OOAA/sXlVsTt+tUsWhHsxCup5pOiV3lTWl7M8P4wEXilOR9I2UsptSNHWoxwHz u09g== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=EZOpQ1PJkZ+dCWbPKJgvY1GJipZZjyyGhXZ1qG68CEU=; b=cUW09Qo+pviSiwll0MLTJitT+fDUamjxiqMgFSiTbQX9ibj3s4Pkt3o9z+HGfQf9BY Bh5v8YiUYNFGZdVnb5Aoc+OLGN9NL/ej93AOORL5NvQd+A8BGKKFPO8anxWvZNgkqFit kvgL9KT8Nwa9nHUYKNO9Tou7fzUb02sxrC1ZeFZdw2yL82U8VZvW3p2np6srxVVC80hb 0DtHHsbsBGV0Fa2bISp8N/qhGlI02B/B/RUch3SN1m1gGkclZxH+EmI6agy68zecZqL1 wKsKSZWNNgXxLKscCliAoKMK+/mpQvnkPA0j1EFBRbRDjkmpWgTUlutVlTHtzJUIxsdM kmDw== X-Gm-Message-State: APjAAAVg1p7qjt0Lsh7PU3Df7//J9FGN6aYpW4mALOk5O9EYp1Ibteuo 3U9BH+yQoOfRwEP/jJz+kld8uk3W47a08HPRQ2F1vw== X-Received: by 2002:a05:6102:535:: with SMTP id m21mr1780766vsa.95.1581597005286; Thu, 13 Feb 2020 04:30:05 -0800 (PST) MIME-Version: 1.0 References: <1580250967-4386-1-git-send-email-thara.gopinath@linaro.org> <1580250967-4386-2-git-send-email-thara.gopinath@linaro.org> In-Reply-To: <1580250967-4386-2-git-send-email-thara.gopinath@linaro.org> From: Amit Kucheria Date: Thu, 13 Feb 2020 17:59:54 +0530 Message-ID: Subject: Re: [Patch v9 1/8] sched/pelt: Add support to track thermal pressure To: Thara Gopinath Cc: Ingo Molnar , Peter Zijlstra , ionela.voinescu@arm.com, Vincent Guittot , Dietmar Eggemann , Zhang Rui , qperret@google.com, Daniel Lezcano , Viresh Kumar , Steven Rostedt , Will Deacon , Catalin Marinas , Sudeep Holla , Juri Lelli , corbet@lwn.net, LKML , Amit Daniel Kachhap , Javi Merino Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Jan 29, 2020 at 4:06 AM Thara Gopinath wrote: > > Extrapolating on the existing framework to track rt/dl utilization using > pelt signals, add a similar mechanism to track thermal pressure. The > difference here from rt/dl utilization tracking is that, instead of > tracking time spent by a cpu running a rt/dl task through util_avg, the > average thermal pressure is tracked through load_avg. This is because > thermal pressure signal is weighted "delta" capacity and is not > binary(util_avg is binary). "delta capacity" here means delta between the > actual capacity of a cpu and the decreased capacity a cpu due to a thermal > event. > > In order to track average thermal pressure, a new sched_avg variable > avg_thermal is introduced. Function update_thermal_load_avg can be called > to do the periodic bookkeeping (accumulate, decay and average) of the > thermal pressure. > > Signed-off-by: Thara Gopinath > Reviewed-by: Vincent Guittot > --- > > v6->v7: > - Added CONFIG_HAVE_SCHED_THERMAL_PRESSURE to stub out > update_thermal_load_avg in unsupported architectures as per > review comments from Peter, Dietmar and Quentin. > - Updated comment for update_thermal_load_avg as per review > comments from Peter and Dietmar. > v7->v8: > - Fixed typo in defining update_thermal_load_avg which was > causing build errors (reported by kbuild test report) > v8->v9: > - Defined thermal_load_avg to read rq->avg_thermal.load_avg and > avoid cacheline miss in unsupported cases as per Peter's > suggestion. > > include/trace/events/sched.h | 4 ++++ > init/Kconfig | 4 ++++ > kernel/sched/pelt.c | 31 +++++++++++++++++++++++++++++++ > kernel/sched/pelt.h | 31 +++++++++++++++++++++++++++++++ > kernel/sched/sched.h | 3 +++ > 5 files changed, 73 insertions(+) > > diff --git a/include/trace/events/sched.h b/include/trace/events/sched.h > index 420e80e..a8fb667 100644 > --- a/include/trace/events/sched.h > +++ b/include/trace/events/sched.h > @@ -613,6 +613,10 @@ DECLARE_TRACE(pelt_dl_tp, > TP_PROTO(struct rq *rq), > TP_ARGS(rq)); > > +DECLARE_TRACE(pelt_thermal_tp, > + TP_PROTO(struct rq *rq), > + TP_ARGS(rq)); > + > DECLARE_TRACE(pelt_irq_tp, > TP_PROTO(struct rq *rq), > TP_ARGS(rq)); > diff --git a/init/Kconfig b/init/Kconfig > index bd9f1fd..055c3bf 100644 > --- a/init/Kconfig > +++ b/init/Kconfig > @@ -463,6 +463,10 @@ config HAVE_SCHED_AVG_IRQ > depends on IRQ_TIME_ACCOUNTING || PARAVIRT_TIME_ACCOUNTING > depends on SMP > > +config HAVE_SCHED_THERMAL_PRESSURE > + bool "Enable periodic averaging of thermal pressure" > + depends on SMP > + > config BSD_PROCESS_ACCT > bool "BSD Process Accounting" > depends on MULTIUSER > diff --git a/kernel/sched/pelt.c b/kernel/sched/pelt.c > index bd006b7..5d1fbf0 100644 > --- a/kernel/sched/pelt.c > +++ b/kernel/sched/pelt.c > @@ -367,6 +367,37 @@ int update_dl_rq_load_avg(u64 now, struct rq *rq, int running) > return 0; > } > > +#ifdef CONFIG_HAVE_SCHED_THERMAL_PRESSURE > +/* > + * thermal: > + * > + * load_sum = \Sum se->avg.load_sum but se->avg.load_sum is not tracked > + * > + * util_avg and runnable_load_avg are not supported and meaningless. > + * > + * Unlike rt/dl utilization tracking that track time spent by a cpu > + * running a rt/dl task through util_avg, the average thermal pressure is > + * tracked through load_avg. This is because thermal pressure signal is > + * weighted "delta" capacity and is not binary(util_avg is binary). "delta May I suggest a slight rewording here and in the commit description, This is because the thermal pressure signal is weighted "delta" capacity unlike util_avg which is binary. It would also help, if you expanded on what you mean by binary in the commit description and how the delta capacity is weighted. > + * capacity" here means delta between the actual capacity of a cpu and the > + * decreased capacity a cpu due to a thermal event. Use consistent wording throughout the series - either capped or decreased capacity. > + */ This could be shortened to: Delta capacity of cpu = Actual capacity - Capped capacity due to thermal event > +int update_thermal_load_avg(u64 now, struct rq *rq, u64 capacity) > +{ > + if (___update_load_sum(now, &rq->avg_thermal, > + capacity, > + capacity, > + capacity)) { > + ___update_load_avg(&rq->avg_thermal, 1, 1); > + trace_pelt_thermal_tp(rq); > + return 1; > + } > + > + return 0; > +} > +#endif > + > #ifdef CONFIG_HAVE_SCHED_AVG_IRQ > /* > * irq: > diff --git a/kernel/sched/pelt.h b/kernel/sched/pelt.h > index afff644..916979a 100644 > --- a/kernel/sched/pelt.h > +++ b/kernel/sched/pelt.h > @@ -7,6 +7,26 @@ int __update_load_avg_cfs_rq(u64 now, struct cfs_rq *cfs_rq); > int update_rt_rq_load_avg(u64 now, struct rq *rq, int running); > int update_dl_rq_load_avg(u64 now, struct rq *rq, int running); > > +#ifdef CONFIG_HAVE_SCHED_THERMAL_PRESSURE > +int update_thermal_load_avg(u64 now, struct rq *rq, u64 capacity); > + > +static inline u64 thermal_load_avg(struct rq *rq) > +{ > + return READ_ONCE(rq->avg_thermal.load_avg); > +} > +#else > +static inline int > +update_thermal_load_avg(u64 now, struct rq *rq, u64 capacity) > +{ > + return 0; > +} > + > +static inline u64 thermal_load_avg(struct rq *rq) > +{ > + return 0; > +} > +#endif > + > #ifdef CONFIG_HAVE_SCHED_AVG_IRQ > int update_irq_load_avg(struct rq *rq, u64 running); > #else > @@ -159,6 +179,17 @@ update_dl_rq_load_avg(u64 now, struct rq *rq, int running) > } > > static inline int > +update_thermal_load_avg(u64 now, struct rq *rq, u64 capacity) > +{ > + return 0; > +} > + > +static inline u64 thermal_load_avg(struct rq *rq) > +{ > + return 0; > +} > + > +static inline int > update_irq_load_avg(struct rq *rq, u64 running) > { > return 0; > diff --git a/kernel/sched/sched.h b/kernel/sched/sched.h > index 1a88dc8..1f256cb 100644 > --- a/kernel/sched/sched.h > +++ b/kernel/sched/sched.h > @@ -944,6 +944,9 @@ struct rq { > #ifdef CONFIG_HAVE_SCHED_AVG_IRQ > struct sched_avg avg_irq; > #endif > +#ifdef CONFIG_HAVE_SCHED_THERMAL_PRESSURE > + struct sched_avg avg_thermal; > +#endif > u64 idle_stamp; > u64 avg_idle; > > -- > 2.1.4 >