Received: by 2002:a05:6a10:f347:0:0:0:0 with SMTP id d7csp492634pxu; Sun, 22 Nov 2020 16:35:04 -0800 (PST) X-Google-Smtp-Source: ABdhPJx+W6+OJ+ZEwxOHDKnDO4CRrxtwmhlgxL7VGetiu+mMPGQjjhaP/dtrzIxDg+8oUNYjI5U5 X-Received: by 2002:a50:af21:: with SMTP id g30mr43515910edd.46.1606091704228; Sun, 22 Nov 2020 16:35:04 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1606091704; cv=none; d=google.com; s=arc-20160816; b=IhH8wu/+OpY2ExgCp8PiJJcNH039WULjs8+7dHF/o1Sx9UZ9gsqmVzJXimnvQLE2Jr CT9oydHXJWYMU8O1lfByYCcvsunxfKSALG4H9q9jPksiJN/vwk3lit5QjvRMvb4ZzOlh Nzu1IENRt69RuQbsLjZDXuYFeO9M1muYHsPTbWP1o1XRgnt6ZbwuPNvl7qHvNsgyaGVb d6wjtjRgKH7oH88nVxuhjbIP/HRHIzGG5P6DhU5Fkah6WsT+Awq3AtWlNOvR7EvDBORS pwQyXZtYnwogU0PwHHBzKylhtzZmrUr5aL5fT+TY1zTli+sUWzHxLZTOXsfe2t/YsZL6 7KEA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=sTrFkVjBTvuRzAyjMLohzxLAMHmdJ9CkOOZJnRCr2vI=; b=bYqqJ7eHufofnvHCrsCE70ha189Du7UEML6SNFo7SeI7X5DjovfRrtj4AqZiOaf8wO /Z5uVxtnySQUe/5JXJLd0Pym4bG6IB5Ung4FI5C4NDuChvgT/C6lZx8Rzpr09LPbYJ9n GUfSJM+phQCuGMb5gYjLzeImmqy2jQ3DTFOgYi3uEya+/hqMCP42iFj4+7qcx934NDHv Yr8Qst9mqk0tl5ASWDbJEmxB+JLkSxKQPe+sY2/d0mzTTQB7N9CQ2W3RZlFav79ye9yv j7uSwRSQMupWaOZbWyY8z5D+Hm2KuZR3nOTy3yqwtnhAVog3zOpBRc25aiJbHwP0kHLD Dcxw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=k6VmViGw; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id z13si5550766ejf.662.2020.11.22.16.34.42; Sun, 22 Nov 2020 16:35:04 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=k6VmViGw; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726612AbgKVXzE (ORCPT + 99 others); Sun, 22 Nov 2020 18:55:04 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:52980 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726496AbgKVXzD (ORCPT ); Sun, 22 Nov 2020 18:55:03 -0500 Received: from mail-pl1-x641.google.com (mail-pl1-x641.google.com [IPv6:2607:f8b0:4864:20::641]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 89B06C0613CF for ; Sun, 22 Nov 2020 15:55:03 -0800 (PST) Received: by mail-pl1-x641.google.com with SMTP id r2so312091pls.3 for ; Sun, 22 Nov 2020 15:55:03 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to; bh=sTrFkVjBTvuRzAyjMLohzxLAMHmdJ9CkOOZJnRCr2vI=; b=k6VmViGwYJp07rzEmdB+N8914lfq8ePmVgL5CLJgbYBMNSN6WgLYd3g3Eogu5URsKS TxoJT/UwpDw3UeaCVCRxrFn0ZbXN/3M264DctjVgL8K7eebIXa3mZpQoYq/RnsRkmQnZ qUK3BvpXBZBELA30BA6ugOhPSTHLBVAsCdBjCzscb8WF77wtSt4y+z4jMRC3q1i4SNzC UBqGq7yplJvdXQ290tc/CVIp08BQMZR9jjbFD6VYVhJSFZS3eOrZihF9M1fqj1QIYc5S TODu9lfdOAEyZLcln7GD/g2inJLJmv/jPtNLTZeXj27xWS9XnS3el0BN3+qx6g4tTQ2b Qccw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=sTrFkVjBTvuRzAyjMLohzxLAMHmdJ9CkOOZJnRCr2vI=; b=JQ7zQ88lKJNHwvJJCvNgOz0lxqoOiQ2r/ozWLPWft/la5RaSfvxuSdC6gcAYxLcmwd Yvvu4FxeLdQO2Zlq5F/ihcyD8ZCnXBmnl8CZVJ0fJ8eqwc2+XZS3oX4IZH07BPrFSIvX 6GOkocp+aREjbt3U/dzNQHdj9048ZfucZ0FU2EIGTI7WCRVpBqU89x3kd+B2Q+k2oSXJ tUE1mfqD99Z6eMqlvf0rGQiYKcpXqfqxi/z65dtNPHXM+XTCU2+6SHp8FXOtYiysmHmC Qesj+6kzooqx1B2UIrz3b5R81nWDwB+vQiWezjUZaygGBvSpdLIrIvTWt0Wc28U2fJTq U4Wg== X-Gm-Message-State: AOAM530U9F8yWGchk04iZf1oRzv9rg8I9g7YQ/cvJCjoE6h4L1byZT78 ud1xEcsHlC6KgLJGe4K76pU= X-Received: by 2002:a17:902:a607:b029:d8:ba6e:9b54 with SMTP id u7-20020a170902a607b02900d8ba6e9b54mr21937132plq.42.1606089302832; Sun, 22 Nov 2020 15:55:02 -0800 (PST) Received: from localhost (61-68-227-232.tpgi.com.au. [61.68.227.232]) by smtp.gmail.com with ESMTPSA id e18sm1892055pgr.71.2020.11.22.15.54.59 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sun, 22 Nov 2020 15:55:00 -0800 (PST) Date: Mon, 23 Nov 2020 10:54:56 +1100 From: Balbir Singh To: "Joel Fernandes (Google)" Cc: Nishanth Aravamudan , Julien Desfossez , Peter Zijlstra , Tim Chen , Vineeth Pillai , Aaron Lu , Aubrey Li , tglx@linutronix.de, linux-kernel@vger.kernel.org, mingo@kernel.org, torvalds@linux-foundation.org, fweisbec@gmail.com, keescook@chromium.org, kerrnel@google.com, Phil Auld , Valentin Schneider , Mel Gorman , Pawan Gupta , Paolo Bonzini , vineeth@bitbyteword.org, Chen Yu , Christian Brauner , Agata Gruza , Antonio Gomez Iglesias , graf@amazon.com, konrad.wilk@oracle.com, dfaggioli@suse.com, pjt@google.com, rostedt@goodmis.org, derkling@google.com, benbjiang@tencent.com, Alexandre Chartre , James.Bottomley@hansenpartnership.com, OWeisse@umich.edu, Dhaval Giani , Junaid Shahid , jsbarnes@google.com, chris.hyser@oracle.com, Ben Segall , Josh Don , Hao Luo , Tom Lendacky , Aubrey Li , Aubrey Li , "Paul E. McKenney" , Tim Chen Subject: Re: [PATCH -tip 14/32] sched: migration changes for core scheduling Message-ID: <20201122235456.GF110669@balbir-desktop> References: <20201117232003.3580179-1-joel@joelfernandes.org> <20201117232003.3580179-15-joel@joelfernandes.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20201117232003.3580179-15-joel@joelfernandes.org> Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Nov 17, 2020 at 06:19:44PM -0500, Joel Fernandes (Google) wrote: > From: Aubrey Li > > - Don't migrate if there is a cookie mismatch > Load balance tries to move task from busiest CPU to the > destination CPU. When core scheduling is enabled, if the > task's cookie does not match with the destination CPU's > core cookie, this task will be skipped by this CPU. This > mitigates the forced idle time on the destination CPU. > > - Select cookie matched idle CPU > In the fast path of task wakeup, select the first cookie matched > idle CPU instead of the first idle CPU. > > - Find cookie matched idlest CPU > In the slow path of task wakeup, find the idlest CPU whose core > cookie matches with task's cookie > > - Don't migrate task if cookie not match > For the NUMA load balance, don't migrate task to the CPU whose > core cookie does not match with task's cookie > > Tested-by: Julien Desfossez > Signed-off-by: Aubrey Li > Signed-off-by: Tim Chen > Signed-off-by: Vineeth Remanan Pillai > Signed-off-by: Joel Fernandes (Google) > --- > kernel/sched/fair.c | 64 ++++++++++++++++++++++++++++++++++++++++---- > kernel/sched/sched.h | 29 ++++++++++++++++++++ > 2 files changed, 88 insertions(+), 5 deletions(-) > > diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c > index de82f88ba98c..ceb3906c9a8a 100644 > --- a/kernel/sched/fair.c > +++ b/kernel/sched/fair.c > @@ -1921,6 +1921,15 @@ static void task_numa_find_cpu(struct task_numa_env *env, > if (!cpumask_test_cpu(cpu, env->p->cpus_ptr)) > continue; > > +#ifdef CONFIG_SCHED_CORE > + /* > + * Skip this cpu if source task's cookie does not match > + * with CPU's core cookie. > + */ > + if (!sched_core_cookie_match(cpu_rq(cpu), env->p)) > + continue; > +#endif > + Any reason this is under an #ifdef? In sched_core_cookie_match() won't the check for sched_core_enabled() do the right thing even when CONFIG_SCHED_CORE is not enabed? > env->dst_cpu = cpu; > if (task_numa_compare(env, taskimp, groupimp, maymove)) > break; > @@ -5867,11 +5876,17 @@ find_idlest_group_cpu(struct sched_group *group, struct task_struct *p, int this > > /* Traverse only the allowed CPUs */ > for_each_cpu_and(i, sched_group_span(group), p->cpus_ptr) { > + struct rq *rq = cpu_rq(i); > + > +#ifdef CONFIG_SCHED_CORE > + if (!sched_core_cookie_match(rq, p)) > + continue; > +#endif > + > if (sched_idle_cpu(i)) > return i; > > if (available_idle_cpu(i)) { > - struct rq *rq = cpu_rq(i); > struct cpuidle_state *idle = idle_get_state(rq); > if (idle && idle->exit_latency < min_exit_latency) { > /* > @@ -6129,8 +6144,18 @@ static int select_idle_cpu(struct task_struct *p, struct sched_domain *sd, int t > for_each_cpu_wrap(cpu, cpus, target) { > if (!--nr) > return -1; > - if (available_idle_cpu(cpu) || sched_idle_cpu(cpu)) > - break; > + > + if (available_idle_cpu(cpu) || sched_idle_cpu(cpu)) { > +#ifdef CONFIG_SCHED_CORE > + /* > + * If Core Scheduling is enabled, select this cpu > + * only if the process cookie matches core cookie. > + */ > + if (sched_core_enabled(cpu_rq(cpu)) && > + p->core_cookie == cpu_rq(cpu)->core->core_cookie) > +#endif > + break; > + } > } > > time = cpu_clock(this) - time; > @@ -7530,8 +7555,9 @@ int can_migrate_task(struct task_struct *p, struct lb_env *env) > * We do not migrate tasks that are: > * 1) throttled_lb_pair, or > * 2) cannot be migrated to this CPU due to cpus_ptr, or > - * 3) running (obviously), or > - * 4) are cache-hot on their current CPU. > + * 3) task's cookie does not match with this CPU's core cookie > + * 4) running (obviously), or > + * 5) are cache-hot on their current CPU. > */ > if (throttled_lb_pair(task_group(p), env->src_cpu, env->dst_cpu)) > return 0; > @@ -7566,6 +7592,15 @@ int can_migrate_task(struct task_struct *p, struct lb_env *env) > return 0; > } > > +#ifdef CONFIG_SCHED_CORE > + /* > + * Don't migrate task if the task's cookie does not match > + * with the destination CPU's core cookie. > + */ > + if (!sched_core_cookie_match(cpu_rq(env->dst_cpu), p)) > + return 0; > +#endif > + > /* Record that we found atleast one task that could run on dst_cpu */ > env->flags &= ~LBF_ALL_PINNED; > > @@ -8792,6 +8827,25 @@ find_idlest_group(struct sched_domain *sd, struct task_struct *p, int this_cpu) > p->cpus_ptr)) > continue; > > +#ifdef CONFIG_SCHED_CORE > + if (sched_core_enabled(cpu_rq(this_cpu))) { > + int i = 0; > + bool cookie_match = false; > + > + for_each_cpu(i, sched_group_span(group)) { > + struct rq *rq = cpu_rq(i); > + > + if (sched_core_cookie_match(rq, p)) { > + cookie_match = true; > + break; > + } > + } > + /* Skip over this group if no cookie matched */ > + if (!cookie_match) > + continue; > + } > +#endif > + > local_group = cpumask_test_cpu(this_cpu, > sched_group_span(group)); > > diff --git a/kernel/sched/sched.h b/kernel/sched/sched.h > index e72942a9ee11..de553d39aa40 100644 > --- a/kernel/sched/sched.h > +++ b/kernel/sched/sched.h > @@ -1135,6 +1135,35 @@ static inline raw_spinlock_t *rq_lockp(struct rq *rq) > > bool cfs_prio_less(struct task_struct *a, struct task_struct *b); > > +/* > + * Helper to check if the CPU's core cookie matches with the task's cookie > + * when core scheduling is enabled. > + * A special case is that the task's cookie always matches with CPU's core > + * cookie if the CPU is in an idle core. > + */ > +static inline bool sched_core_cookie_match(struct rq *rq, struct task_struct *p) > +{ > + bool idle_core = true; > + int cpu; > + > + /* Ignore cookie match if core scheduler is not enabled on the CPU. */ > + if (!sched_core_enabled(rq)) > + return true; > + > + for_each_cpu(cpu, cpu_smt_mask(cpu_of(rq))) { > + if (!available_idle_cpu(cpu)) { I was looking at this snippet and comparing this to is_core_idle(), the major difference is the check for vcpu_is_preempted(). Do we want to call the core as non idle if any vcpu was preempted on this CPU? > + idle_core = false; > + break; > + } > + } > + > + /* > + * A CPU in an idle core is always the best choice for tasks with > + * cookies. > + */ > + return idle_core || rq->core->core_cookie == p->core_cookie; > +} > + Balbir Singh.