Received: by 2002:a25:1506:0:0:0:0:0 with SMTP id 6csp589690ybv; Thu, 13 Feb 2020 06:12:33 -0800 (PST) X-Google-Smtp-Source: APXvYqwptC6+sxSyF2Yeq4JSjKZzmOfzNYrVCpGs5HcpiSuFpY3gRxIYYCWcAOfX8fdnO2MOhmZy X-Received: by 2002:a05:6830:1d91:: with SMTP id y17mr12737041oti.276.1581603153606; Thu, 13 Feb 2020 06:12:33 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1581603153; cv=none; d=google.com; s=arc-20160816; b=mWMNfUgHxqcyIo7pyHlUhU3D/ffBoBkNAZ6IOieNgPQltZK7f8rpMLPf3lY3qtWj7D 0YxTrv9HCc16IIVNrpIRxvexT3PttNPLDO/RnbZ7i4ulkNvVIZrhFxVOTvsAcwzK8ADS 66uvz6jVVw1fZ5gYgEKtOvirkEByqrb19n0uubNdiPNEOIXaElO6rTghvP5LKkaPP2eX omFFGQK2T74aYTWVXtGM1oQfN1BlQnoOzihiWvUR6MsBYY3V4GCAmdgZYeenvqo0nc7Z LUdr/86t8NMjRp6hRCHugyY2ICAl1rzkh3/izQ7PNjedYRKapjx56iRkJPaneVbd4c2k KCMw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:in-reply-to :mime-version:user-agent:date:message-id:from:cc:references:to :subject:dkim-signature; bh=j1SIK81ZPhXEIF166UZGxZMxUt2GaElLV0aYrlmFKR8=; b=xzc1vaAIfTdnYmHtsaB8m6CF9e4n3YQYVKNGT1wofi1Qz3ndeWaUzhj3FvEX7v54oh wR1CbHVDVmi8bkpAFN30efUeiaXcRO4zHyFAAbKbDHOPxClr1H1AUHlp/YxMEt2yicBG 3zGA4P9SOMiagncFjB0qfCztpO0hk8BDP2MNvzjF08L0B0CaBu5UyG1USMs7a5P0HgNr 94qSrSA3FLCXsiCW5STIbc25EFSAei7q2WTyNDwUTurJ4A9HnwQTXd/Edu5YoZZLPGbg xv/PxN2KcNVNHNfOJD2DrixGBZaz3RanbL1OXqHdn398zk6qFJRSAGbyc7iFW+CZDLPW VS8w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=fWyC+rRl; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e1si1123084otr.8.2020.02.13.06.12.19; Thu, 13 Feb 2020 06:12:33 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=fWyC+rRl; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730199AbgBMOLm (ORCPT + 99 others); Thu, 13 Feb 2020 09:11:42 -0500 Received: from mail-qk1-f196.google.com ([209.85.222.196]:36181 "EHLO mail-qk1-f196.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1730082AbgBMOLl (ORCPT ); Thu, 13 Feb 2020 09:11:41 -0500 Received: by mail-qk1-f196.google.com with SMTP id w25so5775935qki.3 for ; Thu, 13 Feb 2020 06:11:41 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=subject:to:references:cc:from:message-id:date:user-agent :mime-version:in-reply-to:content-transfer-encoding; bh=j1SIK81ZPhXEIF166UZGxZMxUt2GaElLV0aYrlmFKR8=; b=fWyC+rRlU75y1ZgTDdL3baIpvwpSLKDByEZiVhRm8/xubRqqcdESqcktmqnoeb/cIn 4inaXd76nUihpQyzxy0uNHOWspcRGihwSrfDHKtOi4lpEMGU7AxsZ3DGCZpo6oYxkgq2 /upDrB/u7gZZul+zxoiVl7ettRNdPJxUPvwwyDXUXeE7MYTrzT5sPBMAB7V21n0FVZOJ GFqCLV2mlTtpu005HJ1+dAs4ejof88OJGdyUBbkXaVHHkDdt709iTMGCIlzP3T93GQDK JYlZ2mkZAhdtRgEjpvErXsG/ExQBbIQ9JeGqAyI+LeOyj3oyUygMtaR6Y2BJMbJqrlP8 QjwQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:references:cc:from:message-id:date :user-agent:mime-version:in-reply-to:content-transfer-encoding; bh=j1SIK81ZPhXEIF166UZGxZMxUt2GaElLV0aYrlmFKR8=; b=XzW5ZjoMxaupoHrqpm1V6anLIibIaQooB7BwdSR9+o60iWbygEot6pim8NeM/5Qnz5 G/kaqLU86MkBKVnV7XpWl/GD5WdRd6QauKlgwRp9zuorjr/txP/iIXU1qjoUhoUgrkIF bF7TjhmlBk3FKEdhS+jauYUsSHJ5KUazA4KfUZsEYRW57bsqc2cXpBnu3F7OLwTy978E WoMjRtTOUaR7I5XQTaVc8xU+jW6eWn/eNOHZf8BaGBX+rdgM188VP3iOs/ATcu9yGqsg UmZzcf57oNpzUpCV7XJjQOLE/tGuC6WqJxrkgzIxMUlLVuKttk5vyxb4iI15CSt/oLkK mFkw== X-Gm-Message-State: APjAAAX/yxET8mwZmxe42TfmeoFt9HcGjadZaFqViqaOLypx9LH1jzHI JD7OwP2yc+IsCMMCMFYidSVbnA== X-Received: by 2002:a37:5f43:: with SMTP id t64mr15506221qkb.68.1581603100486; Thu, 13 Feb 2020 06:11:40 -0800 (PST) Received: from [192.168.1.169] (pool-71-255-246-27.washdc.fios.verizon.net. [71.255.246.27]) by smtp.gmail.com with ESMTPSA id b35sm1567396qtc.9.2020.02.13.06.11.38 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Thu, 13 Feb 2020 06:11:39 -0800 (PST) Subject: Re: [Patch v9 1/8] sched/pelt: Add support to track thermal pressure To: Amit Kucheria References: <1580250967-4386-1-git-send-email-thara.gopinath@linaro.org> <1580250967-4386-2-git-send-email-thara.gopinath@linaro.org> Cc: Ingo Molnar , Peter Zijlstra , ionela.voinescu@arm.com, Vincent Guittot , Dietmar Eggemann , Zhang Rui , qperret@google.com, Daniel Lezcano , Viresh Kumar , Steven Rostedt , Will Deacon , Catalin Marinas , Sudeep Holla , Juri Lelli , corbet@lwn.net, LKML , Amit Daniel Kachhap , Javi Merino From: Thara Gopinath Message-ID: <5E455919.5090506@linaro.org> Date: Thu, 13 Feb 2020 09:11:37 -0500 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Thunderbird/38.5.1 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 02/13/2020 07:29 AM, Amit Kucheria wrote: > On Wed, Jan 29, 2020 at 4:06 AM Thara Gopinath > wrote: >> >> Extrapolating on the existing framework to track rt/dl utilization using >> pelt signals, add a similar mechanism to track thermal pressure. The >> difference here from rt/dl utilization tracking is that, instead of >> tracking time spent by a cpu running a rt/dl task through util_avg, the >> average thermal pressure is tracked through load_avg. This is because >> thermal pressure signal is weighted "delta" capacity and is not >> binary(util_avg is binary). "delta capacity" here means delta between the >> actual capacity of a cpu and the decreased capacity a cpu due to a thermal >> event. >> >> In order to track average thermal pressure, a new sched_avg variable >> avg_thermal is introduced. Function update_thermal_load_avg can be called >> to do the periodic bookkeeping (accumulate, decay and average) of the >> thermal pressure. >> >> Signed-off-by: Thara Gopinath >> Reviewed-by: Vincent Guittot >> --- >> >> v6->v7: >> - Added CONFIG_HAVE_SCHED_THERMAL_PRESSURE to stub out >> update_thermal_load_avg in unsupported architectures as per >> review comments from Peter, Dietmar and Quentin. >> - Updated comment for update_thermal_load_avg as per review >> comments from Peter and Dietmar. >> v7->v8: >> - Fixed typo in defining update_thermal_load_avg which was >> causing build errors (reported by kbuild test report) >> v8->v9: >> - Defined thermal_load_avg to read rq->avg_thermal.load_avg and >> avoid cacheline miss in unsupported cases as per Peter's >> suggestion. >> >> include/trace/events/sched.h | 4 ++++ >> init/Kconfig | 4 ++++ >> kernel/sched/pelt.c | 31 +++++++++++++++++++++++++++++++ >> kernel/sched/pelt.h | 31 +++++++++++++++++++++++++++++++ >> kernel/sched/sched.h | 3 +++ >> 5 files changed, 73 insertions(+) >> >> diff --git a/include/trace/events/sched.h b/include/trace/events/sched.h >> index 420e80e..a8fb667 100644 >> --- a/include/trace/events/sched.h >> +++ b/include/trace/events/sched.h >> @@ -613,6 +613,10 @@ DECLARE_TRACE(pelt_dl_tp, >> TP_PROTO(struct rq *rq), >> TP_ARGS(rq)); >> >> +DECLARE_TRACE(pelt_thermal_tp, >> + TP_PROTO(struct rq *rq), >> + TP_ARGS(rq)); >> + >> DECLARE_TRACE(pelt_irq_tp, >> TP_PROTO(struct rq *rq), >> TP_ARGS(rq)); >> diff --git a/init/Kconfig b/init/Kconfig >> index bd9f1fd..055c3bf 100644 >> --- a/init/Kconfig >> +++ b/init/Kconfig >> @@ -463,6 +463,10 @@ config HAVE_SCHED_AVG_IRQ >> depends on IRQ_TIME_ACCOUNTING || PARAVIRT_TIME_ACCOUNTING >> depends on SMP >> >> +config HAVE_SCHED_THERMAL_PRESSURE >> + bool "Enable periodic averaging of thermal pressure" >> + depends on SMP >> + >> config BSD_PROCESS_ACCT >> bool "BSD Process Accounting" >> depends on MULTIUSER >> diff --git a/kernel/sched/pelt.c b/kernel/sched/pelt.c >> index bd006b7..5d1fbf0 100644 >> --- a/kernel/sched/pelt.c >> +++ b/kernel/sched/pelt.c >> @@ -367,6 +367,37 @@ int update_dl_rq_load_avg(u64 now, struct rq *rq, int running) >> return 0; >> } >> >> +#ifdef CONFIG_HAVE_SCHED_THERMAL_PRESSURE >> +/* >> + * thermal: >> + * >> + * load_sum = \Sum se->avg.load_sum but se->avg.load_sum is not tracked >> + * >> + * util_avg and runnable_load_avg are not supported and meaningless. >> + * >> + * Unlike rt/dl utilization tracking that track time spent by a cpu >> + * running a rt/dl task through util_avg, the average thermal pressure is >> + * tracked through load_avg. This is because thermal pressure signal is >> + * weighted "delta" capacity and is not binary(util_avg is binary). "delta > > May I suggest a slight rewording here and in the commit description, > > This is because the thermal pressure signal is weighted "delta" > capacity unlike util_avg which is binary. Sure! Will fix it. > > It would also help, if you expanded on what you mean by binary in the > commit description and how the delta capacity is weighted. I don't understand this. Binary means 0 or 1. delta capacity is time weighted, i will update this. > >> + * capacity" here means delta between the actual capacity of a cpu and the >> + * decreased capacity a cpu due to a thermal event. > > Use consistent wording throughout the series - either capped or > decreased capacity. > >> + */ > > This could be shortened to: > > Delta capacity of cpu = Actual capacity - Capped capacity due to thermal event Will fix this. > >> +int update_thermal_load_avg(u64 now, struct rq *rq, u64 capacity) >> +{ >> + if (___update_load_sum(now, &rq->avg_thermal, >> + capacity, >> + capacity, >> + capacity)) { >> + ___update_load_avg(&rq->avg_thermal, 1, 1); >> + trace_pelt_thermal_tp(rq); >> + return 1; >> + } >> + >> + return 0; >> +} >> +#endif >> + >> #ifdef CONFIG_HAVE_SCHED_AVG_IRQ >> /* >> * irq: >> diff --git a/kernel/sched/pelt.h b/kernel/sched/pelt.h >> index afff644..916979a 100644 >> --- a/kernel/sched/pelt.h >> +++ b/kernel/sched/pelt.h >> @@ -7,6 +7,26 @@ int __update_load_avg_cfs_rq(u64 now, struct cfs_rq *cfs_rq); >> int update_rt_rq_load_avg(u64 now, struct rq *rq, int running); >> int update_dl_rq_load_avg(u64 now, struct rq *rq, int running); >> >> +#ifdef CONFIG_HAVE_SCHED_THERMAL_PRESSURE >> +int update_thermal_load_avg(u64 now, struct rq *rq, u64 capacity); >> + >> +static inline u64 thermal_load_avg(struct rq *rq) >> +{ >> + return READ_ONCE(rq->avg_thermal.load_avg); >> +} >> +#else >> +static inline int >> +update_thermal_load_avg(u64 now, struct rq *rq, u64 capacity) >> +{ >> + return 0; >> +} >> + >> +static inline u64 thermal_load_avg(struct rq *rq) >> +{ >> + return 0; >> +} >> +#endif >> + >> #ifdef CONFIG_HAVE_SCHED_AVG_IRQ >> int update_irq_load_avg(struct rq *rq, u64 running); >> #else >> @@ -159,6 +179,17 @@ update_dl_rq_load_avg(u64 now, struct rq *rq, int running) >> } >> >> static inline int >> +update_thermal_load_avg(u64 now, struct rq *rq, u64 capacity) >> +{ >> + return 0; >> +} >> + >> +static inline u64 thermal_load_avg(struct rq *rq) >> +{ >> + return 0; >> +} >> + >> +static inline int >> update_irq_load_avg(struct rq *rq, u64 running) >> { >> return 0; >> diff --git a/kernel/sched/sched.h b/kernel/sched/sched.h >> index 1a88dc8..1f256cb 100644 >> --- a/kernel/sched/sched.h >> +++ b/kernel/sched/sched.h >> @@ -944,6 +944,9 @@ struct rq { >> #ifdef CONFIG_HAVE_SCHED_AVG_IRQ >> struct sched_avg avg_irq; >> #endif >> +#ifdef CONFIG_HAVE_SCHED_THERMAL_PRESSURE >> + struct sched_avg avg_thermal; >> +#endif >> u64 idle_stamp; >> u64 avg_idle; >> >> -- >> 2.1.4 >> -- Warm Regards Thara