From: Juri Lelli
To: Peter Zijlstra, Ingo Molnar, Qais Yousef, Waiman Long, Tejun Heo,
	Zefan Li, Johannes Weiner, Hao Luo
Cc: Dietmar Eggemann, Steven Rostedt, linux-kernel@vger.kernel.org,
	luca.abeni@santannapisa.it, claudio@evidence.eu.com,
	tommaso.cucinotta@santannapisa.it, bristot@redhat.com,
	mathieu.poirier@linaro.org,
	cgroups@vger.kernel.org, Vincent Guittot, Wei Wang, Rick Yiu,
	Quentin Perret, Heiko Carstens, Vasily Gorbik, Alexander Gordeev,
	Sudeep Holla, Juri Lelli
Subject: [PATCH 3/6] sched/cpuset: Keep track of SCHED_DEADLINE task in cpusets
Date: Wed, 29 Mar 2023 14:55:55 +0200
Message-Id: <20230329125558.255239-4-juri.lelli@redhat.com>
X-Mailer: git-send-email 2.39.2
In-Reply-To: <20230329125558.255239-1-juri.lelli@redhat.com>
References: <20230329125558.255239-1-juri.lelli@redhat.com>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
X-Mailing-List: linux-kernel@vger.kernel.org

Qais reported that rebuilding root domains can be a costly operation
(10+ ms delays on suspend-resume), because it iterates over all tasks to
find the DEADLINE ones whose bandwidth needs to be correctly restored on
those root domains.

To fix the problem, keep track of the number of DEADLINE tasks belonging
to each cpuset. A followup patch then uses this information to perform
the above iteration only if DEADLINE tasks are actually present in the
cpuset for which a corresponding root domain is being rebuilt.

Reported-by: Qais Yousef
Link: https://lore.kernel.org/lkml/20230206221428.2125324-1-qyousef@layalina.io/
Signed-off-by: Juri Lelli
---
 include/linux/cpuset.h  |  4 ++++
 kernel/cgroup/cgroup.c  |  4 ++++
 kernel/cgroup/cpuset.c  | 25 +++++++++++++++++++++++++
 kernel/sched/deadline.c | 14 ++++++++++++++
 4 files changed, 47 insertions(+)

diff --git a/include/linux/cpuset.h b/include/linux/cpuset.h
index 355f796c5f07..0348dba5680e 100644
--- a/include/linux/cpuset.h
+++ b/include/linux/cpuset.h
@@ -71,6 +71,8 @@ extern void cpuset_init_smp(void);
 extern void cpuset_force_rebuild(void);
 extern void cpuset_update_active_cpus(void);
 extern void cpuset_wait_for_hotplug(void);
+extern void inc_dl_tasks_cs(struct task_struct *task);
+extern void dec_dl_tasks_cs(struct task_struct *task);
 extern void cpuset_lock(void);
 extern void cpuset_unlock(void);
 extern void cpuset_cpus_allowed(struct task_struct *p, struct cpumask *mask);
@@ -196,6 +198,8 @@ static inline void cpuset_update_active_cpus(void)
 
 static inline void cpuset_wait_for_hotplug(void) { }
 
+static inline void inc_dl_tasks_cs(struct task_struct *task) { }
+static inline void dec_dl_tasks_cs(struct task_struct *task) { }
 static inline void cpuset_lock(void) { }
 static inline void cpuset_unlock(void) { }
 
diff --git a/kernel/cgroup/cgroup.c b/kernel/cgroup/cgroup.c
index 935e8121b21e..ff27b2d2bf0b 100644
--- a/kernel/cgroup/cgroup.c
+++ b/kernel/cgroup/cgroup.c
@@ -57,6 +57,7 @@
 #include
 #include
 #include
+#include
 #include
 #include
 
@@ -6673,6 +6674,9 @@ void cgroup_exit(struct task_struct *tsk)
 	list_add_tail(&tsk->cg_list, &cset->dying_tasks);
 	cset->nr_tasks--;
 
+	if (dl_task(tsk))
+		dec_dl_tasks_cs(tsk);
+
 	WARN_ON_ONCE(cgroup_task_frozen(tsk));
 	if (unlikely(!(tsk->flags & PF_KTHREAD) &&
 		     test_bit(CGRP_FREEZE, &task_dfl_cgroup(tsk)->flags)))
diff --git a/kernel/cgroup/cpuset.c b/kernel/cgroup/cpuset.c
index fbc10b494292..eb0854ef9757 100644
--- a/kernel/cgroup/cpuset.c
+++ b/kernel/cgroup/cpuset.c
@@ -193,6 +193,12 @@ struct cpuset {
 	int use_parent_ecpus;
 	int child_ecpus_count;
 
+	/*
+	 * number of SCHED_DEADLINE tasks attached to this cpuset, so that we
+	 * know when to rebuild associated root domain bandwidth information.
+	 */
+	int nr_deadline_tasks;
+
 	/* Invalid partition error code, not lock protected */
 	enum prs_errcode prs_err;
 
@@ -245,6 +251,20 @@ static inline struct cpuset *parent_cs(struct cpuset *cs)
 	return css_cs(cs->css.parent);
 }
 
+void inc_dl_tasks_cs(struct task_struct *p)
+{
+	struct cpuset *cs = task_cs(p);
+
+	cs->nr_deadline_tasks++;
+}
+
+void dec_dl_tasks_cs(struct task_struct *p)
+{
+	struct cpuset *cs = task_cs(p);
+
+	cs->nr_deadline_tasks--;
+}
+
 /* bits in struct cpuset flags field */
 typedef enum {
 	CS_ONLINE,
@@ -2477,6 +2497,11 @@ static int cpuset_can_attach(struct cgroup_taskset *tset)
 		ret = security_task_setscheduler(task);
 		if (ret)
 			goto out_unlock;
+
+		if (dl_task(task)) {
+			cs->nr_deadline_tasks++;
+			cpuset_attach_old_cs->nr_deadline_tasks--;
+		}
 	}
 
 	/*
diff --git a/kernel/sched/deadline.c b/kernel/sched/deadline.c
index 4cc7e1ca066d..8f92f0f87383 100644
--- a/kernel/sched/deadline.c
+++ b/kernel/sched/deadline.c
@@ -16,6 +16,8 @@
  * Fabio Checconi
  */
 
+#include
+
 /*
  * Default limits for DL period; on the top end we guard against small util
  * tasks still getting ridiculously long effective runtimes, on the bottom end we
@@ -2595,6 +2597,12 @@ static void switched_from_dl(struct rq *rq, struct task_struct *p)
 	if (task_on_rq_queued(p) && p->dl.dl_runtime)
 		task_non_contending(p);
 
+	/*
+	 * In case a task is setscheduled out from SCHED_DEADLINE we need to
+	 * keep track of that on its cpuset (for correct bandwidth tracking).
+	 */
+	dec_dl_tasks_cs(p);
+
 	if (!task_on_rq_queued(p)) {
 		/*
 		 * Inactive timer is armed. However, p is leaving DEADLINE and
@@ -2635,6 +2643,12 @@ static void switched_to_dl(struct rq *rq, struct task_struct *p)
 	if (hrtimer_try_to_cancel(&p->dl.inactive_timer) == 1)
 		put_task_struct(p);
 
+	/*
+	 * In case a task is setscheduled to SCHED_DEADLINE we need to keep
+	 * track of that on its cpuset (for correct bandwidth tracking).
+	 */
+	inc_dl_tasks_cs(p);
+
 	/* If p is not queued we will update its parameters at next wakeup. */
 	if (!task_on_rq_queued(p)) {
 		add_rq_bw(&p->dl, &rq->dl);
-- 
2.39.2
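
For readers who want to see the bookkeeping in isolation, the counting
scheme above can be modelled in plain userspace C. This is only a sketch
under stated assumptions, not kernel code: the struct layouts, the policy
enum, and the helpers set_policy(), move_task() and rebuild_visits() are
hypothetical, and the early return in rebuild_visits() is a guess at what
the followup patch does, based solely on the changelog. Locking is
ignored entirely.

```c
#include <assert.h>
#include <stddef.h>

/* Userspace model only: the names mimic the patch, nothing more. */
enum policy { POLICY_NORMAL, POLICY_DEADLINE };

struct cpuset {
	int nr_deadline_tasks;	/* DEADLINE tasks currently attached */
};

struct task {
	enum policy policy;
	struct cpuset *cs;
};

void inc_dl_tasks_cs(struct task *p)
{
	p->cs->nr_deadline_tasks++;
}

void dec_dl_tasks_cs(struct task *p)
{
	/* Extra sanity check for the model; the patch has no such check. */
	assert(p->cs->nr_deadline_tasks > 0);
	p->cs->nr_deadline_tasks--;
}

/* Hypothetical: stands in for switched_to_dl()/switched_from_dl(). */
void set_policy(struct task *p, enum policy new_policy)
{
	if (p->policy == new_policy)
		return;
	if (new_policy == POLICY_DEADLINE)
		inc_dl_tasks_cs(p);
	else
		dec_dl_tasks_cs(p);	/* old policy was DEADLINE */
	p->policy = new_policy;
}

/* Hypothetical: mirrors the accounting added to cpuset_can_attach(). */
void move_task(struct task *p, struct cpuset *to)
{
	if (p->policy == POLICY_DEADLINE) {
		to->nr_deadline_tasks++;
		p->cs->nr_deadline_tasks--;
	}
	p->cs = to;
}

/*
 * Hypothetical stand-in for the rebuild path. Returns how many tasks
 * were visited, so the saving is observable: zero when the cpuset has
 * no DEADLINE tasks, otherwise the full scan.
 */
size_t rebuild_visits(const struct cpuset *cs, const struct task *tasks,
		      size_t n)
{
	size_t visited = 0;

	if (cs->nr_deadline_tasks == 0)
		return 0;	/* assumed shortcut enabled by the counter */

	for (size_t i = 0; i < n; i++) {
		(void)tasks[i];	/* real code would restore bandwidth here */
		visited++;
	}
	return visited;
}
```

With three tasks and two cpusets, rebuild_visits() reports no visits
until one task is switched to POLICY_DEADLINE, after which only that
task's cpuset triggers the full scan.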