Received: by 2002:a05:6358:3188:b0:123:57c1:9b43 with SMTP id q8csp8260096rwd; Tue, 20 Jun 2023 12:28:32 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ4rcLuZCDe5uC4WzKKgWTtf7w8Bl4STkS5Dv8TQI7s3rIA8bRLRGKwcuNavpaQH/NZCbA9b X-Received: by 2002:a05:6a00:16d6:b0:661:4a00:1e8b with SMTP id l22-20020a056a0016d600b006614a001e8bmr4778361pfc.23.1687289312106; Tue, 20 Jun 2023 12:28:32 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1687289312; cv=none; d=google.com; s=arc-20160816; b=QSFoihGEwLLr92wTZ9OeftJ8sp/r9z3Rlo55SXV+hgrVxoTJ5nzJQtZgc14RsT6o78 jtAnwUVtOdNrKp1F4/8tbuzFpqgVesiowySDL5Qxb1zwa7TehaWDa2A0VXzVPEI4iJg4 NRoCwYmI1MmUG9QQC8dNmbhvd9T6znW8j09pbejmmE/18SoRRoFsP33AgLWW05s4vhGC UxRw8RBjt7aY81hoykmOkk5HXouSmqYk2BqDkl6S2PZ7EYOu2GZJNnRBRYdweTHPZVK4 a2pX4CU5ioZznd7hsxpX15r/F9E3pP+czkr3llUOS8xvKHog8+ASKvPj3GZZ3AvtYrB3 Qudw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:from:subject:message-id:references :mime-version:in-reply-to:date:dkim-signature; bh=RCaOAN1lAgFX63nyqwqYs4paj62tn+2HK5slSbPJUks=; b=FA0qIy7l47eWHUwmfBIfykIISFseFP0/H6q045F7mHHv9FWcF32ZAflTko5gkYQFKs vPg99NixnRDqyL1U/sIAz7m8g9g+oY+vNhWAf6Du6+wC87+5MUtf9pWW77UzdQHbqsNn nMgKdTo3qtDJvP8HRDOeu8rgw4hyi6b5VXkpT8LuTxcVhdDamfgdb7zcfVOuyWZ7vaor jHYL4M+3G3cUKmLa1Iac/5NgXokTYPW8CMO//RkRS+gUlb2YvVdZnH1de9Fto0Xt2kvv JxdGr0dLv3UzDGPVwcyWPitZdHJ45GmQoA+uZOmfK+9XbOD/eIOvmz4CfEGmmdf2AKps TXpQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20221208 header.b=Inu+37zq; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id x3-20020a63b203000000b0054fe7b18671si2246891pge.809.2023.06.20.12.28.14; Tue, 20 Jun 2023 12:28:32 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20221208 header.b=Inu+37zq; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229567AbjFTSdQ (ORCPT + 99 others); Tue, 20 Jun 2023 14:33:16 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:60284 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229665AbjFTSdO (ORCPT ); Tue, 20 Jun 2023 14:33:14 -0400 Received: from mail-yb1-xb4a.google.com (mail-yb1-xb4a.google.com [IPv6:2607:f8b0:4864:20::b4a]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 6F1E810CE for ; Tue, 20 Jun 2023 11:33:05 -0700 (PDT) Received: by mail-yb1-xb4a.google.com with SMTP id 3f1490d57ef6-bce829aca1bso5447812276.3 for ; Tue, 20 Jun 2023 11:33:05 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20221208; t=1687285984; x=1689877984; h=cc:to:from:subject:message-id:references:mime-version:in-reply-to :date:from:to:cc:subject:date:message-id:reply-to; bh=RCaOAN1lAgFX63nyqwqYs4paj62tn+2HK5slSbPJUks=; b=Inu+37zqf+xqVO90CaUGqjRGPqQl9Wdhyj9ZNMYgA8LimfGK0MuzHhG0c9J01rXMnZ sCOlDztWnQa/zeQ7WinObNQviCcQC2hTDAeA3mlIa2T/UOf5kIpu+zfTQLrTGXOAnbu3 yuhnxbaozl/KopC8Fb4KpMnjCtSphJIlpzwiHjK3JzFF7uK1J3WK4Gk07Qp+lxY4IhuT czS35YL1cKqJLd9/bpO8k0cklSaM7ZUTNmZLJIABfFSWEgXolY4ZC6wihqGQ7Q/spMol EEay8r9zsoMhavsJXVl6sE6+CS8SuBGgbbu/lefYpY/Qgt88tbaqnV92ZkVeet5eJrNz e8Aw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1687285984; x=1689877984; h=cc:to:from:subject:message-id:references:mime-version:in-reply-to :date:x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=RCaOAN1lAgFX63nyqwqYs4paj62tn+2HK5slSbPJUks=; b=f2OVHRyqDCZUrgSrqxJAbzA2FHP8zyIUxCTp8c929rvYvuJW4TzDnpyxKN3UUXWzG8 84zuIIXfHo0UZxtKDHKnjtKMzWxs+0EdEW6GBOubr98uYzQfscP8GMuhj5XreHAJFAw0 1/3/3G3Q9ECJktZ98EvIxOgUuEd56kceFD2txFZzyqZOGlLo/b9Yrb2+1qE4pA1OmPeq N3iVdr2X5HDwwH5BbW1567oovZKqDe9nIma3Z47RYb4kmVzusaZHPs9vQ6oQfqHVvoC1 Db7zhDZw1ma2rRh+rNURIGGq8g8+adnUNPknIgZorMF15tUiYPX0g9uHrVBwgsouq6gK 3AhA== X-Gm-Message-State: AC+VfDwkygD5JPXCOmTXDvztmyxpVNxcC3UanHlimRDJrgLri8Haw0VB Uub8dYduHJ2wAaBGk4EzohIwWyFxxk83 X-Received: from joshdon-desktop.svl.corp.google.com ([2620:15c:2a3:200:7f30:1fa1:c3c6:d39a]) (user=joshdon job=sendgmr) by 2002:a25:418f:0:b0:bea:918f:2ef1 with SMTP id o137-20020a25418f000000b00bea918f2ef1mr1201669yba.7.1687285984733; Tue, 20 Jun 2023 11:33:04 -0700 (PDT) Date: Tue, 20 Jun 2023 11:32:47 -0700 In-Reply-To: <20230620183247.737942-1-joshdon@google.com> Mime-Version: 1.0 References: <20230620183247.737942-1-joshdon@google.com> X-Mailer: git-send-email 2.41.0.162.gfafddb0af9-goog Message-ID: <20230620183247.737942-2-joshdon@google.com> Subject: [PATCH v3 2/2] sched: add throttled time stat for throttled children From: Josh Don To: Peter Zijlstra Cc: Tejun Heo , Zefan Li , Johannes Weiner , Ingo Molnar , Juri Lelli , Vincent Guittot , Dietmar Eggemann , Steven Rostedt , Ben Segall , Mel Gorman , Daniel Bristot de Oliveira , Valentin Schneider , cgroups@vger.kernel.org, linux-kernel@vger.kernel.org, mkoutny@suse.com, Josh Don Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-9.6 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE,USER_IN_DEF_DKIM_WL autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org We currently export the total throttled time for cgroups that are given a bandwidth limit. This patch extends this accounting to also account the total time that each children cgroup has been throttled. This is useful to understand the degree to which children have been affected by the throttling control. Children which are not runnable during the entire throttled period, for example, will not show any self-throttling time during this period. Expose this in a new interface, 'cpu.stat.local', which is similar to how non-hierarchical events are accounted in 'memory.events.local'. Signed-off-by: Josh Don --- v3: fix bug with nested limits v2: - moved export to new cpu.stat.local file, per Tejun's recommendation include/linux/cgroup-defs.h | 2 ++ kernel/cgroup/cgroup.c | 34 ++++++++++++++++++++++++++++ kernel/sched/core.c | 44 +++++++++++++++++++++++++++++++++++++ kernel/sched/fair.c | 21 +++++++++++++++++- kernel/sched/sched.h | 2 ++ 5 files changed, 102 insertions(+), 1 deletion(-) diff --git a/include/linux/cgroup-defs.h b/include/linux/cgroup-defs.h index 8a0d5466c7be..ae20dbb885d6 100644 --- a/include/linux/cgroup-defs.h +++ b/include/linux/cgroup-defs.h @@ -661,6 +661,8 @@ struct cgroup_subsys { void (*css_rstat_flush)(struct cgroup_subsys_state *css, int cpu); int (*css_extra_stat_show)(struct seq_file *seq, struct cgroup_subsys_state *css); + int (*css_local_stat_show)(struct seq_file *seq, + struct cgroup_subsys_state *css); int (*can_attach)(struct cgroup_taskset *tset); void (*cancel_attach)(struct cgroup_taskset *tset); diff --git a/kernel/cgroup/cgroup.c b/kernel/cgroup/cgroup.c index 2d3d13e52333..bfe063225d3b 100644 --- a/kernel/cgroup/cgroup.c +++ b/kernel/cgroup/cgroup.c @@ -3726,6 +3726,36 @@ static int cpu_stat_show(struct seq_file *seq, void *v) return ret; } +static int __maybe_unused cgroup_local_stat_show(struct seq_file *seq, + struct cgroup *cgrp, int ssid) +{ + struct cgroup_subsys *ss = cgroup_subsys[ssid]; + struct cgroup_subsys_state *css; + int ret; + + if (!ss->css_local_stat_show) + return 0; + + css = cgroup_tryget_css(cgrp, ss); + if (!css) + return 0; + + ret = ss->css_local_stat_show(seq, css); + css_put(css); + return ret; +} + +static int cpu_local_stat_show(struct seq_file *seq, void *v) +{ + struct cgroup __maybe_unused *cgrp = seq_css(seq)->cgroup; + int ret = 0; + +#ifdef CONFIG_CGROUP_SCHED + ret = cgroup_local_stat_show(seq, cgrp, cpu_cgrp_id); +#endif + return ret; +} + #ifdef CONFIG_PSI static int cgroup_io_pressure_show(struct seq_file *seq, void *v) { @@ -5276,6 +5306,10 @@ static struct cftype cgroup_base_files[] = { .name = "cpu.stat", .seq_show = cpu_stat_show, }, + { + .name = "cpu.stat.local", + .seq_show = cpu_local_stat_show, + }, { } /* terminate */ }; diff --git a/kernel/sched/core.c b/kernel/sched/core.c index d9a04faa48a1..0de483190a57 100644 --- a/kernel/sched/core.c +++ b/kernel/sched/core.c @@ -11140,6 +11140,27 @@ static int cpu_cfs_stat_show(struct seq_file *sf, void *v) return 0; } + +static u64 throttled_time_self(struct task_group *tg) +{ + int i; + u64 total = 0; + + for_each_possible_cpu(i) { + total += READ_ONCE(tg->cfs_rq[i]->throttled_clock_self_time); + } + + return total; +} + +static int cpu_cfs_local_stat_show(struct seq_file *sf, void *v) +{ + struct task_group *tg = css_tg(seq_css(sf)); + + seq_printf(sf, "throttled_time %llu\n", throttled_time_self(tg)); + + return 0; +} #endif /* CONFIG_CFS_BANDWIDTH */ #endif /* CONFIG_FAIR_GROUP_SCHED */ @@ -11216,6 +11237,10 @@ static struct cftype cpu_legacy_files[] = { .name = "stat", .seq_show = cpu_cfs_stat_show, }, + { + .name = "stat.local", + .seq_show = cpu_cfs_local_stat_show, + }, #endif #ifdef CONFIG_RT_GROUP_SCHED { @@ -11272,6 +11297,24 @@ static int cpu_extra_stat_show(struct seq_file *sf, return 0; } +static int cpu_local_stat_show(struct seq_file *sf, + struct cgroup_subsys_state *css) +{ +#ifdef CONFIG_CFS_BANDWIDTH + { + struct task_group *tg = css_tg(css); + u64 throttled_self_usec; + + throttled_self_usec = throttled_time_self(tg); + do_div(throttled_self_usec, NSEC_PER_USEC); + + seq_printf(sf, "throttled_usec %llu\n", + throttled_self_usec); + } +#endif + return 0; +} + #ifdef CONFIG_FAIR_GROUP_SCHED static u64 cpu_weight_read_u64(struct cgroup_subsys_state *css, struct cftype *cft) @@ -11450,6 +11493,7 @@ struct cgroup_subsys cpu_cgrp_subsys = { .css_released = cpu_cgroup_css_released, .css_free = cpu_cgroup_css_free, .css_extra_stat_show = cpu_extra_stat_show, + .css_local_stat_show = cpu_local_stat_show, #ifdef CONFIG_RT_GROUP_SCHED .can_attach = cpu_cgroup_can_attach, #endif diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c index 72571d48559a..b1bccef78b90 100644 --- a/kernel/sched/fair.c +++ b/kernel/sched/fair.c @@ -4878,8 +4878,12 @@ enqueue_entity(struct cfs_rq *cfs_rq, struct sched_entity *se, int flags) list_add_leaf_cfs_rq(cfs_rq); } else { #ifdef CONFIG_CFS_BANDWIDTH + struct rq *rq = rq_of(cfs_rq); + if (cfs_rq_throttled(cfs_rq) && !cfs_rq->throttled_clock) - cfs_rq->throttled_clock = rq_clock(rq_of(cfs_rq)); + cfs_rq->throttled_clock = rq_clock(rq); + if (!cfs_rq->throttled_clock_self) + cfs_rq->throttled_clock_self = rq_clock(rq); #endif } } @@ -5384,6 +5388,17 @@ static int tg_unthrottle_up(struct task_group *tg, void *data) /* Add cfs_rq with load or one or more already running entities to the list */ if (!cfs_rq_is_decayed(cfs_rq)) list_add_leaf_cfs_rq(cfs_rq); + + if (cfs_rq->throttled_clock_self) { + u64 delta = rq_clock(rq) - cfs_rq->throttled_clock_self; + + cfs_rq->throttled_clock_self = 0; + + if (SCHED_WARN_ON((s64)delta < 0)) + delta = 0; + + cfs_rq->throttled_clock_self_time += delta; + } } return 0; @@ -5398,6 +5413,10 @@ static int tg_throttle_down(struct task_group *tg, void *data) if (!cfs_rq->throttle_count) { cfs_rq->throttled_clock_pelt = rq_clock_pelt(rq); list_del_leaf_cfs_rq(cfs_rq); + + SCHED_WARN_ON(cfs_rq->throttled_clock_self); + if (cfs_rq->nr_running) + cfs_rq->throttled_clock_self = rq_clock(rq); } cfs_rq->throttle_count++; diff --git a/kernel/sched/sched.h b/kernel/sched/sched.h index 556496c77dc2..46f8a01ff405 100644 --- a/kernel/sched/sched.h +++ b/kernel/sched/sched.h @@ -636,6 +636,8 @@ struct cfs_rq { u64 throttled_clock; u64 throttled_clock_pelt; u64 throttled_clock_pelt_time; + u64 throttled_clock_self; + u64 throttled_clock_self_time; int throttled; int throttle_count; struct list_head throttled_list; -- 2.41.0.162.gfafddb0af9-goog