2009-10-24 20:06:36

by Arjan van de Ven

Subject: [PATCH 1/3] sched: Enable wake balancing for the SMT/HT domain

Subject: sched: Enable wake balancing for the SMT/HT domain
From: Arjan van de Ven <[email protected]>

Logical CPUs that are part of a hyperthreading/SMT set are equivalent
in terms of where to execute a task; after all they share pretty much
all resources including the L1 cache.

This means that if task A wakes up task B, we should really consider
all logical CPUs in the SMT/HT set to run task B, not just the CPU that
task A is running on; in case task A keeps running, task B now gets to
execute with no latency. In the case where task A then immediately goes
to wait for a response from task B, nothing is lost due to the aforementioned
equivalency.

This patch turns on the "balance on wakeup" and turns off "affine wakeups"
for the SMT/HT scheduler domain to get this lower-latency behavior.

Signed-off-by: Arjan van de Ven <[email protected]>

diff --git a/include/linux/topology.h b/include/linux/topology.h
index fc0bf3e..3665dc2 100644
--- a/include/linux/topology.h
+++ b/include/linux/topology.h
@@ -95,8 +95,8 @@ int arch_update_cpu_topology(void);
| 1*SD_BALANCE_NEWIDLE \
| 1*SD_BALANCE_EXEC \
| 1*SD_BALANCE_FORK \
- | 0*SD_BALANCE_WAKE \
- | 1*SD_WAKE_AFFINE \
+ | 1*SD_BALANCE_WAKE \
+ | 0*SD_WAKE_AFFINE \
| 1*SD_SHARE_CPUPOWER \
| 0*SD_POWERSAVINGS_BALANCE \
| 0*SD_SHARE_PKG_RESOURCES \


--
Arjan van de Ven Intel Open Source Technology Centre
For development, discussion and tips for power savings,
visit http://www.lesswatts.org


2009-10-24 20:03:27

by Arjan van de Ven

Subject: [PATCH 2/3] sched: Add aggressive load balancing for certain situations

Subject: sched: Add aggressive load balancing for certain situations
From: Arjan van de Ven <[email protected]>

The scheduler, in its "find idlest group" function, currently has an unconditional
imbalance threshold before it will consider moving a task.

However, there are situations where this is undesirable, and we want to opt in to a
more aggressive load-balancing algorithm to minimize latencies.

This patch adds the infrastructure for this and also adds two cases for which
we select the aggressive approach:
1) From interrupt context. Events that happen in irq context are very likely,
as a heuristic, to show latency-sensitive behavior
2) When doing a wake_up() and the scheduler domain we're investigating has the
flag set that opts in to load balancing during wake_up()
(for example the SMT/HT domain)


Signed-off-by: Arjan van de Ven <[email protected]>


diff --git a/kernel/sched_fair.c b/kernel/sched_fair.c
index 4e777b4..fe9b95b 100644
--- a/kernel/sched_fair.c
+++ b/kernel/sched_fair.c
@@ -1246,7 +1246,7 @@ static int wake_affine(struct sched_domain *sd, struct task_struct *p, int sync)
*/
static struct sched_group *
find_idlest_group(struct sched_domain *sd, struct task_struct *p,
- int this_cpu, int load_idx)
+ int this_cpu, int load_idx, int agressive)
{
struct sched_group *idlest = NULL, *this = NULL, *group = sd->groups;
unsigned long min_load = ULONG_MAX, this_load = 0;
@@ -1290,7 +1290,9 @@ find_idlest_group(struct sched_domain *sd, struct task_struct *p,
}
} while (group = group->next, group != sd->groups);

- if (!idlest || 100*this_load < imbalance*min_load)
+ if (!idlest)
+ return NULL;
+ if (!agressive && 100*this_load < imbalance*min_load)
return NULL;
return idlest;
}
@@ -1412,6 +1414,7 @@ static int select_task_rq_fair(struct task_struct *p, int sd_flag, int wake_flag
int load_idx = sd->forkexec_idx;
struct sched_group *group;
int weight;
+ int agressive;

if (!(sd->flags & sd_flag)) {
sd = sd->child;
@@ -1421,7 +1424,13 @@ static int select_task_rq_fair(struct task_struct *p, int sd_flag, int wake_flag
if (sd_flag & SD_BALANCE_WAKE)
load_idx = sd->wake_idx;

- group = find_idlest_group(sd, p, cpu, load_idx);
+ agressive = 0;
+ if (in_irq())
+ agressive = 1;
+ if (sd_flag & SD_BALANCE_WAKE)
+ agressive = 1;
+
+ group = find_idlest_group(sd, p, cpu, load_idx, agressive);
if (!group) {
sd = sd->child;
continue;
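The threshold test this patch modifies can be sketched as a standalone C function. This is a toy model, not kernel API: the helper name `pick_idlest` and the sample imbalance value are made up for illustration. It returns 1 when the remote idlest group should be used and 0 when the task should stay in the local group, and shows how the new aggressive mode simply bypasses the imbalance check:

```c
#include <assert.h>

/*
 * Toy model of the return test in find_idlest_group() above.
 * pick_idlest() and the example imbalance value (125, i.e. the remote
 * group must be ~25% less loaded) are illustrative, not kernel code.
 */
static int pick_idlest(unsigned long this_load, unsigned long min_load,
                       unsigned long imbalance, int aggressive)
{
        /* the aggressive mode added by this patch skips the threshold */
        if (aggressive)
                return 1;
        /* default: stay local unless the remote group is clearly less loaded */
        if (100 * this_load < imbalance * min_load)
                return 0;
        return 1;
}
```

With this_load=100 and min_load=90 the default path stays local, while the aggressive path always takes the idlest group.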




2009-10-24 20:06:23

by Arjan van de Ven

Subject: [PATCH 3/3] sched: Disable affine wakeups by default

Subject: sched: Disable affine wakeups by default
From: Arjan van de Ven <[email protected]>

The global affine-wakeup scheduler feature sounds nice, but there is a problem:
it is ALSO a per-scheduler-domain feature already.
By having the global scheduler feature enabled by default, the scheduler domains
no longer have the option to opt out.

There are domains (for example the HT/SMT domain) that have good reason to want
to opt out of this feature.

With this patch they can opt out, while all other domains currently default to
the affine setting anyway.

Signed-off-by: Arjan van de Ven <[email protected]>

diff --git a/kernel/sched_features.h b/kernel/sched_features.h
index 0d94083..58c2ea7 100644
--- a/kernel/sched_features.h
+++ b/kernel/sched_features.h
@@ -72,7 +72,7 @@ SCHED_FEAT(SYNC_WAKEUPS, 1)
* improve cache locality. Typically used with SYNC wakeups as
* generated by pipes and the like, see also SYNC_WAKEUPS.
*/
-SCHED_FEAT(AFFINE_WAKEUPS, 1)
+SCHED_FEAT(AFFINE_WAKEUPS, 0)

/*
* Weaken SYNC hint based on overlap




2009-10-25 06:55:25

by Mike Galbraith

Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Sat, 2009-10-24 at 13:07 -0700, Arjan van de Ven wrote:
> Subject: sched: Disable affine wakeups by default
> From: Arjan van de Ven <[email protected]>
>
> The global affine-wakeup scheduler feature sounds nice, but there is a problem:
> it is ALSO a per-scheduler-domain feature already.
> By having the global scheduler feature enabled by default, the scheduler domains
> no longer have the option to opt out.

? The affine decision is qualified by SD_WAKE_AFFINE.

if (want_affine && (tmp->flags & SD_WAKE_AFFINE) &&
cpumask_test_cpu(prev_cpu, sched_domain_span(tmp))) {

affine_sd = tmp;
want_affine = 0;
}

> There are domains (for example the HT/SMT domain) that have good reason to want
> to opt out of this feature.

Even if you're sharing a cache, there are reasons to wake affine. If
the wakee can preempt the waker while it's still eligible to run, wakee
not only eats toasty warm data, it can hand the cpu back to the waker so
it can make more and repeat this procedure for a while without someone
else getting in between, and trashing cache. Also, for a task which
wakes another, then checks to see if it has more work, sleeps if not,
this preemption can keep that task running, saving wakeups. If you put
the wakee on a runqueue where it may have to wait even a tiny bit, buddy
goes to sleep, so that benefit is gone. These things have a HUGE effect
on scalability, as you can see below.

There are times when not waking affine is good, e.g. immediately after
fork(), it's _generally_ a good idea to not wake affine, because there
may be more on the way, a work generator like make, for example, doing
its thing, and fork() also frequently means an exec is on the way.
That's not usually a producer/consumer situation.

At low load, with producer/consumer, iff you can hit a shared cache,
it's a good idea to not wake affine; any waker/wakee overlap is pure
performance loss in that case. On my Q6600, there's a 1:3 chance of
hitting if left to random chance. You can see that case happening in
the pgsql+oltp numbers below. That wants further examination.
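One way to read that 1:3 figure can be enumerated directly: on a Q6600-like package, cores 0-1 share one L2 and cores 2-3 share the other, so exactly one of the three non-waker cores shares the waker's cache. A throwaway sketch, with the topology assumed rather than queried from the hardware:

```c
#include <assert.h>

/* Q6600-like topology assumed: cores 0-1 share one L2, cores 2-3 the other. */
static int l2_id(int cpu)
{
        return cpu / 2;
}

/* Count how many of the other three cores share the waker's L2. */
static int shared_l2_cores(int waker)
{
        int cpu, hits = 0;

        for (cpu = 0; cpu < 4; cpu++) {
                if (cpu != waker && l2_id(cpu) == l2_id(waker))
                        hits++;
        }
        return hits;
}
```

For any waker, one core out of the three candidates shares the cache, so a random placement hits it one time in three.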

> With this patch they can opt out, while all other domains currently default to
> the affine setting anyway.

The patch globally disables affine wakeups. Not good.

Oh, btw, wrt affinity vs interrupt: a long time ago, I tried disabling
affine wakeups in hardirq, softirq, and both contexts. In all cases, it
was a losing proposition here.

One thing that would be nice for some mixed loads, including the desktop,
is: if a cpu is doing high-frequency sync/affine wakeups, try to keep
other things away from that cpu by counting synchronous tasks as two
instead of one, load-balancing wise.
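That weighting idea could look something like the sketch below. This is purely an illustration of the proposed heuristic; effective_load() is a made-up helper, nothing like it exists in the scheduler:

```c
#include <assert.h>

/*
 * Sketch of the heuristic proposed above: on a cpu doing high-frequency
 * sync/affine wakeups, count each synchronous task twice so the load
 * balancer steers other work away from that cpu.
 * effective_load() is a made-up name, not kernel code.
 */
static unsigned long effective_load(unsigned long nr_running,
                                    unsigned long nr_sync)
{
        /* each sync task contributes its normal weight plus one extra */
        return nr_running + nr_sync;
}
```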

(damn, i'm rambling.. time to shut up;)

Sorry for verbosity, numbers probably would have sufficed. I've been
overdosing on boring affinity/scalability testing ;-)

tip v2.6.32-rc5-1691-g9a8523b

tbench 4
tip 936.314 MB/sec 8 procs
tip+patches 869.153 MB/sec 8 procs
.928

vmark
tip 125307 messages per second
tip+patches 103743 messages per second
.827

mysql+oltp
clients 1 2 4 8 16 32 64 128 256
tip 10013.90 18526.84 34900.38 34420.14 33069.83 32083.40 30578.30 28010.71 25605.47
tip+patches 8436.34 17826.34 34524.32 31471.92 29188.59 27896.10 26036.43 23774.57 19524.33
.842 .962 .989 .914 .882 .869 .851 .848 .762

pgsql+oltp
clients 1 2 4 8 16 32 64 128 256
tip 13907.85 27135.87 52951.98 52514.04 51742.52 50705.43 49947.97 48374.19 46227.94
tip+patches 15277.63 23050.99 51943.13 51937.16 42246.60 38397.86 34998.71 31154.21 26335.68
1.098 .849 .980 .989 .816 .757 .700 .644 .569

2009-10-25 08:01:27

by Peter Zijlstra

Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Sat, 2009-10-24 at 13:07 -0700, Arjan van de Ven wrote:
> Subject: sched: Disable affine wakeups by default
> From: Arjan van de Ven <[email protected]>
>
> The global affine-wakeup scheduler feature sounds nice, but there is a problem:
> it is ALSO a per-scheduler-domain feature already.
> By having the global scheduler feature enabled by default, the scheduler domains
> no longer have the option to opt out.
>
> There are domains (for example the HT/SMT domain) that have good reason to want
> to opt out of this feature.
>
> With this patch they can opt out, while all other domains currently default to
> the affine setting anyway.
>
> Signed-off-by: Arjan van de Ven <[email protected]>
>

Hell no, that'll destroy many workloads. What you could possibly do is
disable it for sched domains that are known to share cache, maybe.

2009-10-25 08:01:40

by Peter Zijlstra

Subject: Re: [PATCH 2/3] sched: Add aggressive load balancing for certain situations

On Sat, 2009-10-24 at 13:04 -0700, Arjan van de Ven wrote:
> Subject: sched: Add aggressive load balancing for certain situations
> From: Arjan van de Ven <[email protected]>
>
> The scheduler, in its "find idlest group" function, currently has an unconditional
> imbalance threshold before it will consider moving a task.
>
> However, there are situations where this is undesirable, and we want to opt in to a
> more aggressive load-balancing algorithm to minimize latencies.
>
> This patch adds the infrastructure for this and also adds two cases for which
> we select the aggressive approach:
> 1) From interrupt context. Events that happen in irq context are very likely,
> as a heuristic, to show latency-sensitive behavior
> 2) When doing a wake_up() and the scheduler domain we're investigating has the
> flag set that opts in to load balancing during wake_up()
> (for example the SMT/HT domain)
>
>
> Signed-off-by: Arjan van de Ven <[email protected]>



> diff --git a/kernel/sched_fair.c b/kernel/sched_fair.c
> index 4e777b4..fe9b95b 100644
> --- a/kernel/sched_fair.c
> +++ b/kernel/sched_fair.c
> @@ -1246,7 +1246,7 @@ static int wake_affine(struct sched_domain *sd, struct task_struct *p, int sync)
> */
> static struct sched_group *
> find_idlest_group(struct sched_domain *sd, struct task_struct *p,
> - int this_cpu, int load_idx)
> + int this_cpu, int load_idx, int agressive)
> {

can't we fold that into load_idx? like -1 or something?

2009-10-25 08:03:57

by Peter Zijlstra

Subject: Re: [PATCH 1/3] sched: Enable wake balancing for the SMT/HT domain

On Sat, 2009-10-24 at 12:58 -0700, Arjan van de Ven wrote:
> Subject: sched: Enable wake balancing for the SMT/HT domain
> From: Arjan van de Ven <[email protected]>
>
> Logical CPUs that are part of a hyperthreading/SMT set are equivalent
> in terms of where to execute a task; after all they share pretty much
> all resources including the L1 cache.
>
> This means that if task A wakes up task B, we should really consider
> all logical CPUs in the SMT/HT set to run task B, not just the CPU that
> task A is running on; in case task A keeps running, task B now gets to
> execute with no latency. In the case where task A then immediately goes
> to wait for a response from task B, nothing is lost due to the aforementioned
> equivalency.
>
> This patch turns on the "balance on wakeup" and turns off "affine wakeups"
> for the SMT/HT scheduler domain to get this lower-latency behavior.
>
> Signed-off-by: Arjan van de Ven <[email protected]>
>
> diff --git a/include/linux/topology.h b/include/linux/topology.h
> index fc0bf3e..3665dc2 100644
> --- a/include/linux/topology.h
> +++ b/include/linux/topology.h
> @@ -95,8 +95,8 @@ int arch_update_cpu_topology(void);
> | 1*SD_BALANCE_NEWIDLE \
> | 1*SD_BALANCE_EXEC \
> | 1*SD_BALANCE_FORK \
> - | 0*SD_BALANCE_WAKE \
> - | 1*SD_WAKE_AFFINE \
> + | 1*SD_BALANCE_WAKE \
> + | 0*SD_WAKE_AFFINE \
> | 1*SD_SHARE_CPUPOWER \
> | 0*SD_POWERSAVINGS_BALANCE \
> | 0*SD_SHARE_PKG_RESOURCES \
>

So you're poking at SD_SIBLING_INIT, right?

That seems to make sense. Now doing the same for a cache level domain
(MC is almost that) might also make sense.

2009-10-25 11:48:19

by Peter Zijlstra

Subject: Re: [PATCH 2/3] sched: Add aggressive load balancing for certain situations

On Sun, 2009-10-25 at 09:01 +0100, Peter Zijlstra wrote:
>
> > diff --git a/kernel/sched_fair.c b/kernel/sched_fair.c
> > index 4e777b4..fe9b95b 100644
> > --- a/kernel/sched_fair.c
> > +++ b/kernel/sched_fair.c
> > @@ -1246,7 +1246,7 @@ static int wake_affine(struct sched_domain *sd, struct task_struct *p, int sync)
> > */
> > static struct sched_group *
> > find_idlest_group(struct sched_domain *sd, struct task_struct *p,
> > - int this_cpu, int load_idx)
> > + int this_cpu, int load_idx, int agressive)
> > {
>
> can't we fold that into load_idx? like -1 or something?

A better alternative might be passing imbalance along instead.

2009-10-25 16:50:05

by Arjan van de Ven

Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Sun, 25 Oct 2009 07:55:25 +0100
Mike Galbraith <[email protected]> wrote:
> Even if you're sharing a cache, there are reasons to wake affine. If
> the wakee can preempt the waker while it's still eligible to run,
> wakee not only eats toasty warm data, it can hand the cpu back to the
> waker so it can make more and repeat this procedure for a while
> without someone else getting in between, and trashing cache.

and on the flipside, and this is the workload I'm looking at,
this is halving your performance roughly due to one core being totally
busy while the other one is idle.

My workload is a relatively simple situation: firefox is starting up
and talking to X. I suspect this is representative for many X using
applications in the field. The application sends commands to X, but is
not (yet) going to wait for a response, it has more work to do.
In this case the affine behavior does not only cause latency, but it
also eats the throughput performance.

This is due to a few things that compound, but a key one is this code:

if (sd_flag & SD_BALANCE_WAKE) {
if (sched_feat(AFFINE_WAKEUPS) &&
cpumask_test_cpu(cpu, &p->cpus_allowed))
want_affine = 1;
new_cpu = prev_cpu;
}

the problem is that

if (affine_sd && wake_affine(affine_sd, p, sync)) {
new_cpu = cpu;
goto out;
}

this then will trigger later, as long as there is any domain that has
SD_WAKE_AFFINE set ;(

(part of that problem is that the code that sets affine_sd is done
before the
if (!(tmp->flags & sd_flag))
continue;
test)
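That ordering problem can be shown with a toy model of the domain walk. The flag values and the helper below are made up; the real loop lives in select_task_rq_fair(). Because affine_sd is latched before the sd_flag filter runs, a wider domain with SD_WAKE_AFFINE set captures the wakeup even though the SMT domain opted out:

```c
#include <assert.h>

#define SD_WAKE_AFFINE  0x01
#define SD_BALANCE_WAKE 0x02

/*
 * Toy model of the domain walk in select_task_rq_fair(); flag values
 * are made up.  flags[] lists domains from narrowest (SMT) to widest.
 * Returns 1 when some domain latches the affine path.
 */
static int wakeup_goes_affine(const int *flags, int n, int sd_flag)
{
        int i, have_affine_sd = 0, want_affine = 1;

        for (i = 0; i < n; i++) {
                /* affine_sd is latched here, for ANY domain ... */
                if (want_affine && (flags[i] & SD_WAKE_AFFINE)) {
                        have_affine_sd = 1;
                        want_affine = 0;
                }
                /* ... before this filter ever gets to reject it */
                if (!(flags[i] & sd_flag))
                        continue;
        }
        return have_affine_sd;
}
```

With an SMT domain that cleared SD_WAKE_AFFINE (but set SD_BALANCE_WAKE) below an MC domain that kept SD_WAKE_AFFINE, the wakeup still goes affine.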


The numbers you posted are for a database, and only measure throughput.
There's more to the world than just databases / throughput-only
computing, and I'm trying to find low impact ways to reduce the latency
aspect of things. One obvious candidate is hyperthreading/SMT where it
IS basically free to switch to a sibling, so wake-affine does not
really make sense there.


2009-10-25 17:38:26

by Mike Galbraith

Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Sun, 2009-10-25 at 09:51 -0700, Arjan van de Ven wrote:
> On Sun, 25 Oct 2009 07:55:25 +0100
> Mike Galbraith <[email protected]> wrote:
> > Even if you're sharing a cache, there are reasons to wake affine. If
> > the wakee can preempt the waker while it's still eligible to run,
> > wakee not only eats toasty warm data, it can hand the cpu back to the
> > waker so it can make more and repeat this procedure for a while
> > without someone else getting in between, and trashing cache.
>
> and on the flipside, and this is the workload I'm looking at,
> this is halving your performance roughly due to one core being totally
> busy while the other one is idle.

Yeah, the "one pgsql+oltp pair" in the numbers I posted show that
problem really well. If you can hit an idle shared cache at low load,
go for it every time. The rest of the numbers just show how big the
penalty is if you solve affinity problems with an 8" howitzer :)

> My workload is a relatively simple situation: firefox is starting up
> and talking to X. I suspect this is representative for many X using
> applications in the field. The application sends commands to X, but is
> not (yet) going to wait for a response, it has more work to do.
> In this case the affine behavior does not only cause latency, but it
> also eats the throughput performance.

Yeah. Damned if you do, damned if you don't.

> This is due to a few things that compound, but a key one is this code:
>
> if (sd_flag & SD_BALANCE_WAKE) {
> if (sched_feat(AFFINE_WAKEUPS) &&
> cpumask_test_cpu(cpu, &p->cpus_allowed))
> want_affine = 1;
> new_cpu = prev_cpu;
> }
>
> the problem is that
>
> if (affine_sd && wake_affine(affine_sd, p, sync)) {
> new_cpu = cpu;
> goto out;
> }
>
> this then will trigger later, as long as there is any domain that has
> SD_WAKE_AFFINE set ;(

And the task looks like a synchronous task.

> (part of that problem is that the code that sets affine_sd is done
> before the
> if (!(tmp->flags & sd_flag))
> continue;
> test)

Hm. That looks like a bug, but after any task has scheduled a few
times, if it looks like a synchronous task, it'll glue itself to its
waker's runqueue regardless. Initial wakeup may disperse, but it will
come back if it's not overlapping.

> The numbers you posted are for a database, and only measure throughput.
> There's more to the world than just databases / throughput-only
> computing, and I'm trying to find low impact ways to reduce the latency
> aspect of things. One obvious candidate is hyperthreading/SMT where it
> IS basically free to switch to a sibling, so wake-affine does not
> really make sense there.

It's also almost free on my Q6600 if we aimed for idle shared cache.

I agree fully that affinity decisions could be more perfect than they
are. Getting it wrong is very expensive either way.

-Mike

2009-10-25 19:32:17

by Arjan van de Ven

Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Sun, 25 Oct 2009 18:38:09 +0100
Mike Galbraith <[email protected]> wrote:
> > > Even if you're sharing a cache, there are reasons to wake
> > > affine. If the wakee can preempt the waker while it's still
> > > eligible to run, wakee not only eats toasty warm data, it can
> > > hand the cpu back to the waker so it can make more and repeat
> > > this procedure for a while without someone else getting in
> > > between, and trashing cache.
> >
> > and on the flipside, and this is the workload I'm looking at,
> > this is halving your performance roughly due to one core being
> > totally busy while the other one is idle.
>
> Yeah, the "one pgsql+oltp pair" in the numbers I posted show that
> problem really well. If you can hit an idle shared cache at low load,
> go for it every time.

sadly the current code does not do this ;(
my patch might be too big an axe for it, but it does solve this part ;)

I'll keep digging to see if we can do a more micro-incursion.

> Hm. That looks like a bug, but after any task has scheduled a few
> times, if it looks like a synchronous task, it'll glue itself to its
> waker's runqueue regardless. Initial wakeup may disperse, but it will
> come back if it's not overlapping.

the problem is the "synchronous to WHAT" question.
It may be synchronous to the disk for example; in the testcase I'm
looking at, we get "send message to X. do some more code. hit a page
cache miss and do IO" quite a bit.

> > The numbers you posted are for a database, and only measure
> > throughput. There's more to the world than just databases /
> > throughput-only computing, and I'm trying to find low impact ways
> > to reduce the latency aspect of things. One obvious candidate is
> > hyperthreading/SMT where it IS basically free to switch to a
> > sibling, so wake-affine does not really make sense there.
>
> It's also almost free on my Q6600 if we aimed for idle shared cache.

yeah multicore with shared cache falls for me in the same bucket.

> I agree fully that affinity decisions could be more perfect than they
> are. Getting it wrong is very expensive either way.

Looks like we agree on a key principle:
If there is a free cpu "close enough" (SMT or MC basically), the
wakee should just run on that.

we may not agree on what to do if there's no completely free logical
cpu, but a much lighter loaded one instead.
but first we need to let code speak ;)


2009-10-25 22:04:46

by Mike Galbraith

Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Sun, 2009-10-25 at 12:33 -0700, Arjan van de Ven wrote:
> On Sun, 25 Oct 2009 18:38:09 +0100
> Mike Galbraith <[email protected]> wrote:
> > > > Even if you're sharing a cache, there are reasons to wake
> > > > affine. If the wakee can preempt the waker while it's still
> > > > eligible to run, wakee not only eats toasty warm data, it can
> > > > hand the cpu back to the waker so it can make more and repeat
> > > > this procedure for a while without someone else getting in
> > > > between, and trashing cache.
> > >
> > > and on the flipside, and this is the workload I'm looking at,
> > > this is halving your performance roughly due to one core being
> > > totally busy while the other one is idle.
> >
> > Yeah, the "one pgsql+oltp pair" in the numbers I posted show that
> > problem really well. If you can hit an idle shared cache at low load,
> > go for it every time.
>
> sadly the current code does not do this ;(
> my patch might be too big an axe for it, but it does solve this part ;)

The below fixed up the pgsql+oltp low end, but has a negative effect on the
high end. Must be some stuttering going on.

> I'll keep digging to see if we can do a more micro-incursion.
>
> > Hm. That looks like a bug, but after any task has scheduled a few
> > times, if it looks like a synchronous task, it'll glue itself to its
> > waker's runqueue regardless. Initial wakeup may disperse, but it will
> > come back if it's not overlapping.
>
> the problem is the "synchronous to WHAT" question.
> It may be synchronous to the disk for example; in the testcase I'm
> looking at, we get "send message to X. do some more code. hit a page
> cache miss and do IO" quite a bit.

Hm. Yes, disk could be problematic. It's going to be exactly what the
affinity code looks for: you wake somebody, and almost immediately go to
sleep. OTOH, even housekeeper threads make warm data.

> > > The numbers you posted are for a database, and only measure
> > > throughput. There's more to the world than just databases /
> > > throughput-only computing, and I'm trying to find low impact ways
> > > to reduce the latency aspect of things. One obvious candidate is
> > > hyperthreading/SMT where it IS basically free to switch to a
> > > sibling, so wake-affine does not really make sense there.
> >
> > It's also almost free on my Q6600 if we aimed for idle shared cache.
>
> yeah multicore with shared cache falls for me in the same bucket.

Anyone with a non-shared cache multicore would be most unhappy with my
little test hack.

> > I agree fully that affinity decisions could be more perfect than they
> > are. Getting it wrong is very expensive either way.
>
> Looks like we agree on a key principle:
> If there is a free cpu "close enough" (SMT or MC basically), the
> wakee should just run on that.
>
> we may not agree on what to do if there's no completely free logical
> cpu, but a much lighter loaded one instead.
> but first we need to let code speak ;)

mysql+oltp
clients 1 2 4 8 16 32 64 128 256
tip 10013.90 18526.84 34900.38 34420.14 33069.83 32083.40 30578.30 28010.71 25605.47 3x avg
tip+ 10071.16 18498.33 34697.17 34275.20 32761.96 31657.10 30223.70 27363.50 24698.71
9971.57 18290.17 34632.46 34204.59 32588.94 31513.19 30081.51 27504.66 24832.24
9884.04 18502.26 34650.08 34250.13 32707.81 31566.86 29954.19 27417.09 24811.75


pgsql+oltp
clients 1 2 4 8 16 32 64 128 256
tip 13907.85 27135.87 52951.98 52514.04 51742.52 50705.43 49947.97 48374.19 46227.94 3x avg
tip+ 15163.56 28882.70 52374.32 52469.79 51739.79 50602.02 49827.18 48029.84 46191.90
15258.65 28778.77 52716.46 52405.32 51434.21 50440.66 49718.89 48082.22 46124.56
15278.02 28178.55 52815.82 52609.98 51729.17 50652.10 49800.19 48126.95 46286.58


diff --git a/kernel/sched_fair.c b/kernel/sched_fair.c
index 37087a7..fa534f0 100644
--- a/kernel/sched_fair.c
+++ b/kernel/sched_fair.c
@@ -1374,6 +1374,8 @@ static int select_task_rq_fair(struct task_struct *p, int sd_flag, int wake_flag

rcu_read_lock();
for_each_domain(cpu, tmp) {
+ int level = tmp->level;
+
/*
* If power savings logic is enabled for a domain, see if we
* are not overloaded, if so, don't balance wider.
@@ -1398,11 +1400,28 @@ static int select_task_rq_fair(struct task_struct *p, int sd_flag, int wake_flag
want_sd = 0;
}

+ /*
+ * look for an idle shared cache before looking at last CPU.
+ */
if (want_affine && (tmp->flags & SD_WAKE_AFFINE) &&
- cpumask_test_cpu(prev_cpu, sched_domain_span(tmp))) {
+ (level == SD_LV_SIBLING || level == SD_LV_MC)) {
+ int i;

+ for_each_cpu(i, sched_domain_span(tmp)) {
+ if (!cpu_rq(i)->cfs.nr_running) {
+ affine_sd = tmp;
+ want_affine = 0;
+ cpu = i;
+ }
+ }
+ } else if (want_affine && (tmp->flags & SD_WAKE_AFFINE) &&
+ cpumask_test_cpu(prev_cpu, sched_domain_span(tmp))) {
affine_sd = tmp;
want_affine = 0;
+
+ if ((level == SD_LV_SIBLING || level == SD_LV_MC) &&
+ !cpu_rq(prev_cpu)->cfs.nr_running)
+ cpu = prev_cpu;
}

if (!want_sd && !want_affine)

2009-10-26 01:54:04

by Peter Zijlstra

Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Sun, 2009-10-25 at 23:04 +0100, Mike Galbraith wrote:
> if (want_affine && (tmp->flags & SD_WAKE_AFFINE) &&
> - cpumask_test_cpu(prev_cpu, sched_domain_span(tmp))) {
> + (level == SD_LV_SIBLING || level == SD_LV_MC)) {

quick comment without actually having looked at the patch, we should
really get rid of sd->level and encode properties of the sched domains
in sd->flags.

2009-10-26 04:38:27

by Mike Galbraith

Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Mon, 2009-10-26 at 02:53 +0100, Peter Zijlstra wrote:
> On Sun, 2009-10-25 at 23:04 +0100, Mike Galbraith wrote:
> > if (want_affine && (tmp->flags & SD_WAKE_AFFINE) &&
> > - cpumask_test_cpu(prev_cpu, sched_domain_span(tmp))) {
> > + (level == SD_LV_SIBLING || level == SD_LV_MC)) {
>
> quick comment without actually having looked at the patch, we should
> really get rid of sd->level and encode properties of the sched domains
> in sd->flags.

Yeah, sounds right, while writing that, it looked kinda ugly. I suppose
arch land needs to encode cache property somehow if I really want to be
able to target cache on multicore. Booting becomes.. exciting when I
tinker down there.

While tinkering with this, I noticed that when mysql+oltp starts
tripping over itself, moving to any momentarily idle cpu helps get the
load moving again, and the tail improves. Not hugely, but quite
measurably. There seems to be benefit to be had throughout the load
spectrum, just gotta figure out how to retrieve it without losing
anything.

-Mike

2009-10-26 04:51:21

by Arjan van de Ven

Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Mon, 26 Oct 2009 05:38:27 +0100
Mike Galbraith <[email protected]> wrote:

> On Mon, 2009-10-26 at 02:53 +0100, Peter Zijlstra wrote:
> > On Sun, 2009-10-25 at 23:04 +0100, Mike Galbraith wrote:
> > > if (want_affine && (tmp->flags & SD_WAKE_AFFINE)
> > > &&
> > > - cpumask_test_cpu(prev_cpu,
> > > sched_domain_span(tmp))) {
> > > + (level == SD_LV_SIBLING || level
> > > == SD_LV_MC)) {
> >
> > quick comment without actually having looked at the patch, we should
> > really get rid of sd->level and encode properties of the sched
> > domains in sd->flags.
>
> Yeah, sounds right, while writing that, it looked kinda ugly. I
> suppose arch land needs to encode cache property somehow if I really
> want to be able to target cache on multicore. Booting becomes..
> exciting when I tinker down there.

or we just use SD_WAKE_AFFINE / SD_BALANCE_WAKE for this...




2009-10-26 05:08:54

by Mike Galbraith

Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Sun, 2009-10-25 at 21:52 -0700, Arjan van de Ven wrote:
> On Mon, 26 Oct 2009 05:38:27 +0100
> Mike Galbraith <[email protected]> wrote:
>
> > On Mon, 2009-10-26 at 02:53 +0100, Peter Zijlstra wrote:
> > > On Sun, 2009-10-25 at 23:04 +0100, Mike Galbraith wrote:
> > > > if (want_affine && (tmp->flags & SD_WAKE_AFFINE)
> > > > &&
> > > > - cpumask_test_cpu(prev_cpu,
> > > > sched_domain_span(tmp))) {
> > > > + (level == SD_LV_SIBLING || level
> > > > == SD_LV_MC)) {
> > >
> > > quick comment without actually having looked at the patch, we should
> > > really get rid of sd->level and encode properties of the sched
> > > domains in sd->flags.
> >
> > Yeah, sounds right, while writing that, it looked kinda ugly. I
> > suppose arch land needs to encode cache property somehow if I really
> > want to be able to target cache on multicore. Booting becomes..
> > exciting when I tinker down there.
>
> or we just use SD_WAKE_AFFINE / SD_BALANCE_WAKE for this...

I don't see how. Oh, you mean another domain level, top level being
cache property, and turn off when degenerating? That looks like it'd be
a problem, but adding SD_CACHE_SIBLING or whatnot should work. Problem
is how to gain knowledge of whether multicores share a cache or not.

-Mike

2009-10-26 05:21:55

by Mike Galbraith

Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Sun, 2009-10-25 at 12:33 -0700, Arjan van de Ven wrote:

> we may not agree on what to do if there's no completely free logical
> cpu, but a much lighter loaded one instead.
> but first we need to let code speak ;)

BTW, we agree here too, it's just that cache traffic generated by
ripping high frequency buddies apart can be hugely expensive, so much
caution required. Someone needs to get off his duff, and invent ram
that doesn't suck.

-Mike

2009-10-26 05:35:50

by Arjan van de Ven

[permalink] [raw]
Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Mon, 26 Oct 2009 06:08:54 +0100
Mike Galbraith <[email protected]> wrote:

> On Sun, 2009-10-25 at 21:52 -0700, Arjan van de Ven wrote:
> > On Mon, 26 Oct 2009 05:38:27 +0100
> > Mike Galbraith <[email protected]> wrote:
> >
> > > On Mon, 2009-10-26 at 02:53 +0100, Peter Zijlstra wrote:
> > > > On Sun, 2009-10-25 at 23:04 +0100, Mike Galbraith wrote:
> > > > > if (want_affine && (tmp->flags & SD_WAKE_AFFINE) &&
> > > > > - cpumask_test_cpu(prev_cpu, sched_domain_span(tmp))) {
> > > > > + (level == SD_LV_SIBLING || level == SD_LV_MC)) {
> > > >
> > > > quick comment without actually having looked at the patch, we
> > > > should really get rid of sd->level and encode properties of the
> > > > sched domains in sd->flags.
> > >
> > > Yeah, sounds right, while writing that, it looked kinda ugly. I
> > > suppose arch land needs to encode cache property somehow if I
> > > really want to be able to target cache on multicore. Booting
> > > becomes.. exciting when I tinker down there.
> >
> > or we just use SD_WAKE_AFFINE / SD_BALANCE_WAKE for this...
>
> I don't see how. Oh, you mean another domain level, top level being
> cache property, and turn off when degenerating? That looks like it'd
> be a problem, but adding SD_CACHE_SIBLING or whatnot should work.
> Problem is how to gain knowledge of whether multicores share a cache
> or not.

Actually I meant setting the SD_BALANCE_WAKE flag for the SMT and MC
domains (and then making sure that "MC" really means "shares LLC" in
the arch code), and then using this as indication in the sched code..

if you're a multicore domain you better have a shared cache.. that's
what it should mean. If it does not we should fix that.

--
Arjan van de Ven Intel Open Source Technology Centre
For development, discussion and tips for power savings,
visit http://www.lesswatts.org
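
[Editorial aside: Arjan's scheme -- SD_BALANCE_WAKE on for the SMT and MC
domains, with "MC" guaranteed to mean shared LLC -- comes down to the flag
composition used by the SD_*_INIT macros in include/linux/topology.h, as in
patch 1/3 of this thread. A minimal userspace sketch, with made-up flag
values:]

```c
/* Illustrative flag values only; the real ones live in linux/sched.h. */
#include <assert.h>

enum {
    SD_BALANCE_WAKE   = 1 << 0,
    SD_WAKE_AFFINE    = 1 << 1,
    SD_SHARE_CPUPOWER = 1 << 2,
};

/* SMT/HT domain after patch 1/3: wake balancing on, affine wakeups off.
 * The 1*FLAG / 0*FLAG style mirrors SD_SIBLING_INIT: every flag is
 * listed exactly once, so toggling one is a one-character diff. */
#define SMT_DOMAIN_FLAGS ( 1*SD_BALANCE_WAKE   \
                         | 0*SD_WAKE_AFFINE    \
                         | 1*SD_SHARE_CPUPOWER )
```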

2009-10-26 05:47:58

by Mike Galbraith

[permalink] [raw]
Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Sun, 2009-10-25 at 22:36 -0700, Arjan van de Ven wrote:
> On Mon, 26 Oct 2009 06:08:54 +0100
> Mike Galbraith <[email protected]> wrote:

> > >
> > > or we just use SD_WAKE_AFFINE / SD_BALANCE_WAKE for this...
> >
> > I don't see how. Oh, you mean another domain level, top level being
> > cache property, and turn off when degenerating? That looks like it'd
> > be a problem, but adding SD_CACHE_SIBLING or whatnot should work.
> > Problem is how to gain knowledge of whether multicores share a cache
> > or not.
>
> Actually I meant setting the SD_BALANCE_WAKE flag for the SMT and MC
> domains (and then making sure that "MC" really means "shares LLC" in
> the arch code), and then using this as indication in the sched code..

I don't think we can do that, because SD_WAKE_BALANCE already has a
different meaning. SD_WAKE_AFFINE could be used though, affine wakeups
have always been a cache thing, and for trying to keep things affine to
a package or whatnot, we have SD_PREFER_LOCAL. Sounds clean to me.

> if you're a multicore domain you better have a shared cache.. that's
> what it should mean. If it does not we should fix that.

Sounds reasonable to me. I'll go make explosions.

-Mike

2009-10-26 05:57:39

by Mike Galbraith

[permalink] [raw]
Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Mon, 2009-10-26 at 06:47 +0100, Mike Galbraith wrote:
> On Sun, 2009-10-25 at 22:36 -0700, Arjan van de Ven wrote:

> > if you're a multicore domain you better have a shared cache.. that's
> > what it should mean. If it does not we should fix that.
>
> Sounds reasonable to me. I'll go make explosions.

(Actually, if multicore and sibling do indeed mean shared cache, no
arch tinkering should be necessary; just resetting SD_WAKE_AFFINE when
degenerating should work fine. Only thing is multicore with siblings..
and test test test)

-Mike

2009-10-26 07:01:21

by Ingo Molnar

[permalink] [raw]
Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default


* Mike Galbraith <[email protected]> wrote:

> On Mon, 2009-10-26 at 06:47 +0100, Mike Galbraith wrote:
> > On Sun, 2009-10-25 at 22:36 -0700, Arjan van de Ven wrote:
>
> > > if you're a multicore domain you better have a shared cache..
> > > that's what it should mean. If it does not we should fix that.
> >
> > Sounds reasonable to me. I'll go make explosions.
>
> (Actually, if multicore and sibling do indeed mean shared cache, no
> arch tinkering should be necessary; just resetting SD_WAKE_AFFINE when
> degenerating should work fine. Only thing is multicore with
> siblings.. and test test test)

Correct. There are a few CPUs where multicore means separate caches, but
all modern CPUs have shared caches for cores, so we want to tune for
that.

Ingo

2009-10-26 07:04:10

by Arjan van de Ven

[permalink] [raw]
Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Mon, 26 Oct 2009 08:01:12 +0100
Ingo Molnar <[email protected]> wrote:

>
> * Mike Galbraith <[email protected]> wrote:
>
> > On Mon, 2009-10-26 at 06:47 +0100, Mike Galbraith wrote:
> > > On Sun, 2009-10-25 at 22:36 -0700, Arjan van de Ven wrote:
> >
> > > > if you're a multicore domain you better have a shared cache..
> > > > that's what it should mean. If it does not we should fix that.
> > >
> > > Sounds reasonable to me. I'll go make explosions.
> >
> > (Actually, if multicore and sibling do indeed mean shared cache,
> > no arch tinkering should be necessary; just resetting SD_WAKE_AFFINE
> > when degenerating should work fine. Only thing is multicore with
> > siblings.. and test test test)
>
> Correct. There's a few cpus where multicore means separate caches but
> all modern CPUs have shared caches for cores so we want to tune for
> that.

for those cpus where mc means separate caches we should fix the arch
code to set up separate MC domains to be honest..
I can look into that in a bit..


--
Arjan van de Ven Intel Open Source Technology Centre
For development, discussion and tips for power savings,
visit http://www.lesswatts.org

2009-10-26 11:33:42

by Suresh Siddha

[permalink] [raw]
Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Mon, 2009-10-26 at 00:05 -0700, Arjan van de Ven wrote:
> On Mon, 26 Oct 2009 08:01:12 +0100
> Ingo Molnar <[email protected]> wrote:
>
> > Correct. There's a few cpus where multicore means separate caches but
> > all modern CPUs have shared caches for cores so we want to tune for
> > that.
>
> for those cpus where mc means separate caches we should fix the arch
> code to set up separate MC domains to be honest..
> I can look into that in a bit..

In the default performance mode, multi-core domain is populated with
only cores sharing last-level cache. In the case where the cores don't
share caches, we represent them in the smp domain.

thanks,
suresh

2009-10-27 14:35:38

by Mike Galbraith

[permalink] [raw]
Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Mon, 2009-10-26 at 02:53 +0100, Peter Zijlstra wrote:
> On Sun, 2009-10-25 at 23:04 +0100, Mike Galbraith wrote:
> > if (want_affine && (tmp->flags & SD_WAKE_AFFINE) &&
> > - cpumask_test_cpu(prev_cpu, sched_domain_span(tmp))) {
> > + (level == SD_LV_SIBLING || level == SD_LV_MC)) {
>
> quick comment without actually having looked at the patch, we should
> really get rid of sd->level and encode properties of the sched domains
> in sd->flags.

I used SD_PREFER_SIBLING in the below. Did I break anything?

(wonder what it does for pgsql+oltp on beefy box with siblings)

tip v2.6.32-rc5-1724-g77a088c

mysql+oltp
clients 1 2 4 8 16 32 64 128 256
tip 9999.77 18472.11 34931.60 34412.09 33006.76 32104.36 30700.47 28111.31 25535.09
10082.75 18625.12 34928.17 34476.91 33088.70 32002.36 30695.77 28173.94 25551.05
9949.05 18466.54 34942.66 34420.74 33092.45 32041.10 30666.43 28090.90 25467.63
tip avg 10010.52 18521.25 34934.14 34436.58 33062.63 32049.27 30687.55 28125.38 25517.92

tip+ 9622.23 18297.65 34496.12 34230.85 32704.20 31796.54 30480.45 27740.20 25394.12
10207.79 18275.83 34622.39 34222.47 32996.69 31936.48 30551.29 28144.48 25616.62
10225.32 18515.02 34538.41 34278.06 33014.14 31965.31 30363.90 28089.41 25531.81
tip+ avg 10018.44 18362.83 34552.30 34243.79 32905.01 31899.44 30465.21 27991.36 25514.18
vs tip 1.000 .991 .989 .994 .995 .995 .992 .995 .999

pgsql+oltp
clients 1 2 4 8 16 32 64 128 256
tip 13945.42 26973.91 52504.18 52613.32 51310.82 50442.61 49826.52 48760.62 45570.45
13921.41 27021.48 52722.64 52565.16 51483.19 50638.83 49499.51 48621.31 46115.77
13924.94 26961.02 52624.45 52365.49 51384.91 50499.44 49622.83 48065.03 45743.14
tip avg 13930.59 26985.47 52617.09 52514.65 51392.97 50526.96 49649.62 48482.32 45809.78

tip+ 15259.79 29162.31 52609.01 52562.16 51578.48 50631.90 49537.41 48376.23 46058.95
15156.54 29114.10 52760.02 52524.86 51412.94 50656.30 48774.34 47968.77 45905.02
15118.64 29190.73 52929.34 52503.58 51574.34 50232.27 49599.15 48283.42 45766.74
tip+ avg 15178.32 29155.71 52766.12 52530.20 51521.92 50506.82 49303.63 48209.47 45910.23
vs tip 1.089 1.080 1.002 1.000 1.002 .999 .993 .994 1.002

sched: check for an idle shared cache in select_task_rq_fair()

When waking affine, check for an idle shared cache, and if found, wake to
that CPU/sibling instead of the waker's CPU. This improves pgsql+oltp
ramp up by roughly 8%. Possibly more for other loads, depending on overlap.
The trade-off is a roughly 1% peak downturn if tasks are truly synchronous.

Signed-off-by: Mike Galbraith <[email protected]>
Cc: Ingo Molnar <[email protected]>
Cc: Peter Zijlstra <[email protected]>
LKML-Reference: <new-submission>

---
kernel/sched_fair.c | 33 +++++++++++++++++++++++++++++----
1 file changed, 29 insertions(+), 4 deletions(-)

Index: linux-2.6/kernel/sched_fair.c
===================================================================
--- linux-2.6.orig/kernel/sched_fair.c
+++ linux-2.6/kernel/sched_fair.c
@@ -1398,11 +1398,36 @@ static int select_task_rq_fair(struct ta
want_sd = 0;
}

- if (want_affine && (tmp->flags & SD_WAKE_AFFINE) &&
- cpumask_test_cpu(prev_cpu, sched_domain_span(tmp))) {
+ if (want_affine && (tmp->flags & SD_WAKE_AFFINE)) {
+ int candidate = -1, i;

- affine_sd = tmp;
- want_affine = 0;
+ if (cpumask_test_cpu(prev_cpu, sched_domain_span(tmp)))
+ candidate = cpu;
+
+ /*
+ * Check for an idle shared cache.
+ */
+ if (tmp->flags & SD_PREFER_SIBLING) {
+ if (candidate == cpu) {
+ if (!cpu_rq(prev_cpu)->cfs.nr_running)
+ candidate = prev_cpu;
+ }
+
+ if (candidate == -1 || candidate == cpu) {
+ for_each_cpu(i, sched_domain_span(tmp)) {
+ if (!cpu_rq(i)->cfs.nr_running) {
+ candidate = i;
+ break;
+ }
+ }
+ }
+ }
+
+ if (candidate >= 0) {
+ affine_sd = tmp;
+ want_affine = 0;
+ cpu = candidate;
+ }
}

if (!want_sd && !want_affine)

2009-10-28 07:25:45

by Mike Galbraith

[permalink] [raw]
Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Tue, 2009-10-27 at 15:35 +0100, Mike Galbraith wrote:
> On Mon, 2009-10-26 at 02:53 +0100, Peter Zijlstra wrote:
> > On Sun, 2009-10-25 at 23:04 +0100, Mike Galbraith wrote:
> > > if (want_affine && (tmp->flags & SD_WAKE_AFFINE) &&
> > > - cpumask_test_cpu(prev_cpu, sched_domain_span(tmp))) {
> > > + (level == SD_LV_SIBLING || level == SD_LV_MC)) {
> >
> > quick comment without actually having looked at the patch, we should
> > really get rid of sd->level and encode properties of the sched domains
> > in sd->flags.
>
> I used SD_PREFER_SIBLING in the below. Did I break anything?

Um, other than taskset.

> + if (candidate == -1 || candidate == cpu) {
> + for_each_cpu(i, sched_domain_span(tmp)) {

if (!cpumask_test_cpu(i, &p->cpus_allowed))
continue;

(i think i'm going to need that domain flag in a few minutes anyway)

-Mike

2009-10-28 18:36:20

by Mike Galbraith

[permalink] [raw]
Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Wed, 2009-10-28 at 08:25 +0100, Mike Galbraith wrote:

> (i think i'm going to need that domain flag in a few minutes anyway)

Dirty trick didn't work out.

As long as I let wake_affine() decide whether an affine wakeup should
succeed based on the original cpu, and then took the switched-out cpu
instead, it worked ok. Trying to be cleaner, it turned to crud.

-Mike

2009-11-04 19:34:25

by Mike Galbraith

[permalink] [raw]
Subject: [tip:sched/core] sched: Check for an idle shared cache in select_task_rq_fair()

Commit-ID: a1f84a3ab8e002159498814eaa7e48c33752b04b
Gitweb: http://git.kernel.org/tip/a1f84a3ab8e002159498814eaa7e48c33752b04b
Author: Mike Galbraith <[email protected]>
AuthorDate: Tue, 27 Oct 2009 15:35:38 +0100
Committer: Ingo Molnar <[email protected]>
CommitDate: Wed, 4 Nov 2009 18:46:22 +0100

sched: Check for an idle shared cache in select_task_rq_fair()

When waking affine, check for an idle shared cache, and if
found, wake to that CPU/sibling instead of the waker's CPU.

This improves pgsql+oltp ramp up by roughly 8%. Possibly more
for other loads, depending on overlap. The trade-off is a
roughly 1% peak downturn if tasks are truly synchronous.

Signed-off-by: Mike Galbraith <[email protected]>
Cc: Arjan van de Ven <[email protected]>
Cc: Peter Zijlstra <[email protected]>
Cc: <[email protected]>
LKML-Reference: <[email protected]>
Signed-off-by: Ingo Molnar <[email protected]>
---
kernel/sched_fair.c | 33 +++++++++++++++++++++++++++++----
1 files changed, 29 insertions(+), 4 deletions(-)

diff --git a/kernel/sched_fair.c b/kernel/sched_fair.c
index 4e777b4..da87385 100644
--- a/kernel/sched_fair.c
+++ b/kernel/sched_fair.c
@@ -1372,11 +1372,36 @@ static int select_task_rq_fair(struct task_struct *p, int sd_flag, int wake_flag
want_sd = 0;
}

- if (want_affine && (tmp->flags & SD_WAKE_AFFINE) &&
- cpumask_test_cpu(prev_cpu, sched_domain_span(tmp))) {
+ if (want_affine && (tmp->flags & SD_WAKE_AFFINE)) {
+ int candidate = -1, i;

- affine_sd = tmp;
- want_affine = 0;
+ if (cpumask_test_cpu(prev_cpu, sched_domain_span(tmp)))
+ candidate = cpu;
+
+ /*
+ * Check for an idle shared cache.
+ */
+ if (tmp->flags & SD_PREFER_SIBLING) {
+ if (candidate == cpu) {
+ if (!cpu_rq(prev_cpu)->cfs.nr_running)
+ candidate = prev_cpu;
+ }
+
+ if (candidate == -1 || candidate == cpu) {
+ for_each_cpu(i, sched_domain_span(tmp)) {
+ if (!cpu_rq(i)->cfs.nr_running) {
+ candidate = i;
+ break;
+ }
+ }
+ }
+ }
+
+ if (candidate >= 0) {
+ affine_sd = tmp;
+ want_affine = 0;
+ cpu = candidate;
+ }
}

if (!want_sd && !want_affine)

2009-11-04 20:37:36

by Mike Galbraith

[permalink] [raw]
Subject: Re: [tip:sched/core] sched: Check for an idle shared cache in select_task_rq_fair()

On Wed, 2009-11-04 at 19:33 +0000, tip-bot for Mike Galbraith wrote:
> Commit-ID: a1f84a3ab8e002159498814eaa7e48c33752b04b
> Gitweb: http://git.kernel.org/tip/a1f84a3ab8e002159498814eaa7e48c33752b04b
> Author: Mike Galbraith <[email protected]>
> AuthorDate: Tue, 27 Oct 2009 15:35:38 +0100
> Committer: Ingo Molnar <[email protected]>
> CommitDate: Wed, 4 Nov 2009 18:46:22 +0100
>
> sched: Check for an idle shared cache in select_task_rq_fair()
>
> When waking affine, check for an idle shared cache, and if
> found, wake to that CPU/sibling instead of the waker's CPU.

This one still wants some work. It's not playing well with 1b9508f.

tip v2.6.32-rc6-1731-gc5bb4b1

mysql+oltp
clients 1 2 4 8 16 32 64 128 256
tip 10447.14 19734.58 36038.18 35776.85 34662.76 33682.30 32256.22 28770.99 25323.23
10462.61 19580.14 36050.48 35942.63 35054.84 33988.40 32423.89 29259.65 25892.24
10501.02 19231.27 36007.03 35985.32 35060.79 33945.47 32400.42 29140.84 25716.16
tip avg 10470.25 19515.33 36031.89 35901.60 34926.13 33872.05 32360.17 29057.16 25643.87

tip v2.6.32-rc6-1731-gc5bb4b1 + 1b9508f (Rate-limit newidle)
tip+ 10095.64 19989.67 36449.85 35923.98 35024.24 34026.61 32411.75 28965.77 27287.96
10764.98 19864.03 36632.78 36338.00 35610.52 34439.88 32942.57 29889.18 27426.12
10734.48 19572.63 36507.90 36236.36 35486.51 34443.86 32938.90 30136.39 27186.61
tip+ avg 10531.70 19808.77 36530.17 36166.11 35373.75 34303.45 32764.40 29663.78 27300.23
vs tip 1.005 1.015 1.013 1.007 1.012 1.012 1.012 1.020 1.064

v2.6.32-rc6-1796-gd995f1d (a1f84a3 not playing well with 1b9508f)
10578.44 19358.96 36184.70 35549.19 34625.40 33658.72 31947.51 27504.11 20400.97
10610.76 19748.84 36401.08 36021.75 34830.67 33566.52 31989.97 27622.25 21348.00
10568.87 19600.60 36190.48 35959.96 34805.45 33641.46 32005.30 27591.48 22509.01

2009-11-04 21:45:30

by Mike Galbraith

[permalink] [raw]
Subject: Re: [tip:sched/core] sched: Check for an idle shared cache in select_task_rq_fair()

On Wed, 2009-11-04 at 21:37 +0100, Mike Galbraith wrote:
> On Wed, 2009-11-04 at 19:33 +0000, tip-bot for Mike Galbraith wrote:
> > Commit-ID: a1f84a3ab8e002159498814eaa7e48c33752b04b
> > Gitweb: http://git.kernel.org/tip/a1f84a3ab8e002159498814eaa7e48c33752b04b
> > Author: Mike Galbraith <[email protected]>
> > AuthorDate: Tue, 27 Oct 2009 15:35:38 +0100
> > Committer: Ingo Molnar <[email protected]>
> > CommitDate: Wed, 4 Nov 2009 18:46:22 +0100
> >
> > sched: Check for an idle shared cache in select_task_rq_fair()
> >
> > When waking affine, check for an idle shared cache, and if
> > found, wake to that CPU/sibling instead of the waker's CPU.
>
> This one still wants some work. It's not playing well with 1b9508f.

Still does want some work (on the tail), but those numbers were from a
failed experiment kernel, not d995f1d. Below are the correct numbers.

tip v2.6.32-rc6-1731-gc5bb4b1

mysql+oltp
clients 1 2 4 8 16 32 64 128 256
tip 10447.14 19734.58 36038.18 35776.85 34662.76 33682.30 32256.22 28770.99 25323.23
10462.61 19580.14 36050.48 35942.63 35054.84 33988.40 32423.89 29259.65 25892.24
10501.02 19231.27 36007.03 35985.32 35060.79 33945.47 32400.42 29140.84 25716.16
tip avg 10470.25 19515.33 36031.89 35901.60 34926.13 33872.05 32360.17 29057.16 25643.87

tip v2.6.32-rc6-1731-gc5bb4b1 + 1b9508f (Rate-limit newidle)
tip+ 10095.64 19989.67 36449.85 35923.98 35024.24 34026.61 32411.75 28965.77 27287.96
10764.98 19864.03 36632.78 36338.00 35610.52 34439.88 32942.57 29889.18 27426.12
10734.48 19572.63 36507.90 36236.36 35486.51 34443.86 32938.90 30136.39 27186.61
tip+ avg 10531.70 19808.77 36530.17 36166.11 35373.75 34303.45 32764.40 29663.78 27300.23
vs tip 1.005 1.015 1.013 1.007 1.012 1.012 1.012 1.020 1.064

v2.6.32-rc6-1796-gd995f1d
10745.30 19684.71 36367.25 35890.25 34598.08 33672.37 32327.99 28744.09 25393.00
10549.41 19747.85 36778.96 36410.58 35531.90 34292.03 32603.66 29487.77 25351.09
10678.30 19944.74 36789.44 36403.78 35523.40 34267.10 32659.60 29359.59 25751.93
avg 10657.67 19792.43 36645.21 36234.87 35217.79 34077.16 32530.41 29197.15 25498.67
vs c5bb4b1 1.017 1.014 1.017 1.009 1.008 1.006 1.005 1.004 .994

Lost the tail gain, gained peak. Drink one, spill one, give one away.

-Mike

2009-11-05 09:31:05

by Ingo Molnar

[permalink] [raw]
Subject: Re: [tip:sched/core] sched: Check for an idle shared cache in select_task_rq_fair()


* tip-bot for Mike Galbraith <[email protected]> wrote:

> Commit-ID: a1f84a3ab8e002159498814eaa7e48c33752b04b
> Gitweb: http://git.kernel.org/tip/a1f84a3ab8e002159498814eaa7e48c33752b04b
> Author: Mike Galbraith <[email protected]>
> AuthorDate: Tue, 27 Oct 2009 15:35:38 +0100
> Committer: Ingo Molnar <[email protected]>
> CommitDate: Wed, 4 Nov 2009 18:46:22 +0100
>
> sched: Check for an idle shared cache in select_task_rq_fair()

-tip testing found that this causes problems:

[ 26.804000] BUG: using smp_processor_id() in preemptible [00000000] code: events/1/10
[ 26.808000] caller is vmstat_update+0x26/0x70
[ 26.812000] Pid: 10, comm: events/1 Not tainted 2.6.32-rc5 #6887
[ 26.816000] Call Trace:
[ 26.820000] [<c1924a24>] ? printk+0x28/0x3c
[ 26.824000] [<c13258a0>] debug_smp_processor_id+0xf0/0x110
[ 26.824000] mount used greatest stack depth: 1464 bytes left
[ 26.828000] [<c111d086>] vmstat_update+0x26/0x70
[ 26.832000] [<c1086418>] worker_thread+0x188/0x310
[ 26.836000] [<c10863b7>] ? worker_thread+0x127/0x310
[ 26.840000] [<c108d310>] ? autoremove_wake_function+0x0/0x60
[ 26.844000] [<c1086290>] ? worker_thread+0x0/0x310
[ 26.848000] [<c108cf0c>] kthread+0x7c/0x90
[ 26.852000] [<c108ce90>] ? kthread+0x0/0x90
[ 26.856000] [<c100c0a7>] kernel_thread_helper+0x7/0x10
[ 26.860000] BUG: using smp_processor_id() in preemptible [00000000] code: events/1/10
[ 26.864000] caller is vmstat_update+0x3c/0x70

oh ... doesn't it break cpus_allowed?

Ingo

2009-11-05 09:57:47

by Mike Galbraith

[permalink] [raw]
Subject: Re: [tip:sched/core] sched: Check for an idle shared cache in select_task_rq_fair()

On Thu, 2009-11-05 at 10:30 +0100, Ingo Molnar wrote:
> * tip-bot for Mike Galbraith <[email protected]> wrote:
>
> > Commit-ID: a1f84a3ab8e002159498814eaa7e48c33752b04b
> > Gitweb: http://git.kernel.org/tip/a1f84a3ab8e002159498814eaa7e48c33752b04b
> > Author: Mike Galbraith <[email protected]>
> > AuthorDate: Tue, 27 Oct 2009 15:35:38 +0100
> > Committer: Ingo Molnar <[email protected]>
> > CommitDate: Wed, 4 Nov 2009 18:46:22 +0100
> >
> > sched: Check for an idle shared cache in select_task_rq_fair()
>
> -tip testing found that this causes problems:
>
> [ 26.804000] BUG: using smp_processor_id() in preemptible [00000000] code: events/1/10
> [ 26.808000] caller is vmstat_update+0x26/0x70
> [ 26.812000] Pid: 10, comm: events/1 Not tainted 2.6.32-rc5 #6887
> [ 26.816000] Call Trace:
> [ 26.820000] [<c1924a24>] ? printk+0x28/0x3c
> [ 26.824000] [<c13258a0>] debug_smp_processor_id+0xf0/0x110
> [ 26.824000] mount used greatest stack depth: 1464 bytes left
> [ 26.828000] [<c111d086>] vmstat_update+0x26/0x70
> [ 26.832000] [<c1086418>] worker_thread+0x188/0x310
> [ 26.836000] [<c10863b7>] ? worker_thread+0x127/0x310
> [ 26.840000] [<c108d310>] ? autoremove_wake_function+0x0/0x60
> [ 26.844000] [<c1086290>] ? worker_thread+0x0/0x310
> [ 26.848000] [<c108cf0c>] kthread+0x7c/0x90
> [ 26.852000] [<c108ce90>] ? kthread+0x0/0x90
> [ 26.856000] [<c100c0a7>] kernel_thread_helper+0x7/0x10
> [ 26.860000] BUG: using smp_processor_id() in preemptible [00000000] code: events/1/10
> [ 26.864000] caller is vmstat_update+0x3c/0x70
>
> oh ... doesnt it break cpus_allowed?

Erk, indeed. You didn't apply the follow-up fix, and I forgot as well.


sched: select_task_rq_fair(): add missing cpus_allowed check in commit a1f84a3.

Signed-off-by: Mike Galbraith <[email protected]>
Cc: Ingo Molnar <[email protected]>
Cc: Peter Zijlstra <[email protected]>
LKML-Reference: <new-submission>

diff --git a/kernel/sched_fair.c b/kernel/sched_fair.c
index 32f06ed..5488a5d 100644
--- a/kernel/sched_fair.c
+++ b/kernel/sched_fair.c
@@ -1415,6 +1415,8 @@ static int select_task_rq_fair(struct task_struct *p, int sd_flag, int wake_flag

if (candidate == -1 || candidate == cpu) {
for_each_cpu(i, sched_domain_span(tmp)) {
+ if (!cpumask_test_cpu(i, &p->cpus_allowed))
+ continue;
if (!cpu_rq(i)->cfs.nr_running) {
candidate = i;
break;

2009-11-05 10:00:40

by Mike Galbraith

[permalink] [raw]
Subject: Re: [tip:sched/core] sched: Check for an idle shared cache in select_task_rq_fair()

On Thu, 2009-11-05 at 10:57 +0100, Mike Galbraith wrote:

> Erk, indeed. You didn't apply the follow-up fix, and I forgot as well.

But maybe you should just back it out until I've fiddled/tested a bit
more. Dunno.

-Mike

2009-11-06 07:10:23

by Mike Galbraith

[permalink] [raw]
Subject: [tip:sched/core] sched: Fix affinity logic in select_task_rq_fair()

Commit-ID: fd210738f6601d0fb462df9a2fe5a41896ff6a8f
Gitweb: http://git.kernel.org/tip/fd210738f6601d0fb462df9a2fe5a41896ff6a8f
Author: Mike Galbraith <[email protected]>
AuthorDate: Thu, 5 Nov 2009 10:57:46 +0100
Committer: Ingo Molnar <[email protected]>
CommitDate: Thu, 5 Nov 2009 11:01:39 +0100

sched: Fix affinity logic in select_task_rq_fair()

Ingo Molnar reported:

[ 26.804000] BUG: using smp_processor_id() in preemptible [00000000] code: events/1/10
[ 26.808000] caller is vmstat_update+0x26/0x70
[ 26.812000] Pid: 10, comm: events/1 Not tainted 2.6.32-rc5 #6887
[ 26.816000] Call Trace:
[ 26.820000] [<c1924a24>] ? printk+0x28/0x3c
[ 26.824000] [<c13258a0>] debug_smp_processor_id+0xf0/0x110
[ 26.824000] mount used greatest stack depth: 1464 bytes left
[ 26.828000] [<c111d086>] vmstat_update+0x26/0x70
[ 26.832000] [<c1086418>] worker_thread+0x188/0x310
[ 26.836000] [<c10863b7>] ? worker_thread+0x127/0x310
[ 26.840000] [<c108d310>] ? autoremove_wake_function+0x0/0x60
[ 26.844000] [<c1086290>] ? worker_thread+0x0/0x310
[ 26.848000] [<c108cf0c>] kthread+0x7c/0x90
[ 26.852000] [<c108ce90>] ? kthread+0x0/0x90
[ 26.856000] [<c100c0a7>] kernel_thread_helper+0x7/0x10
[ 26.860000] BUG: using smp_processor_id() in preemptible [00000000] code: events/1/10
[ 26.864000] caller is vmstat_update+0x3c/0x70

Because this commit:

a1f84a3: sched: Check for an idle shared cache in select_task_rq_fair()

broke ->cpus_allowed.

Signed-off-by: Mike Galbraith <[email protected]>
Cc: Peter Zijlstra <[email protected]>
Cc: [email protected]
Cc: <[email protected]>
LKML-Reference: <[email protected]>
Signed-off-by: Ingo Molnar <[email protected]>
---
kernel/sched_fair.c | 2 ++
1 files changed, 2 insertions(+), 0 deletions(-)

diff --git a/kernel/sched_fair.c b/kernel/sched_fair.c
index da87385..e4d4483 100644
--- a/kernel/sched_fair.c
+++ b/kernel/sched_fair.c
@@ -1389,6 +1389,8 @@ static int select_task_rq_fair(struct task_struct *p, int sd_flag, int wake_flag

if (candidate == -1 || candidate == cpu) {
for_each_cpu(i, sched_domain_span(tmp)) {
+ if (!cpumask_test_cpu(i, &p->cpus_allowed))
+ continue;
if (!cpu_rq(i)->cfs.nr_running) {
candidate = i;
break;

2009-11-10 21:59:35

by Peter Zijlstra

[permalink] [raw]
Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Sun, 2009-10-25 at 22:36 -0700, Arjan van de Ven wrote:
>
> if you're a multicore domain you better have a shared cache.. that's
> what it should mean. If it does not we should fix that.

Fully agreed.. I've been dying to do the below for ages :-)

---
arch/x86/kernel/smpboot.c | 10 +---------
1 files changed, 1 insertions(+), 9 deletions(-)

diff --git a/arch/x86/kernel/smpboot.c b/arch/x86/kernel/smpboot.c
index 213a7a3..297b307 100644
--- a/arch/x86/kernel/smpboot.c
+++ b/arch/x86/kernel/smpboot.c
@@ -433,15 +433,7 @@ void __cpuinit set_cpu_sibling_map(int cpu)
const struct cpumask *cpu_coregroup_mask(int cpu)
{
struct cpuinfo_x86 *c = &cpu_data(cpu);
- /*
- * For perf, we return last level cache shared map.
- * And for power savings, we return cpu_core_map
- */
- if ((sched_mc_power_savings || sched_smt_power_savings) &&
- !(cpu_has(c, X86_FEATURE_AMD_DCM)))
- return cpu_core_mask(cpu);
- else
- return c->llc_shared_map;
+ return c->llc_shared_map;
}

static void impress_friends(void)

2009-11-11 05:59:36

by Arjan van de Ven

[permalink] [raw]
Subject: Re: [PATCH 3/3] sched: Disable affine wakeups by default

On Tue, 10 Nov 2009 22:59:29 +0100
Peter Zijlstra <[email protected]> wrote:

> On Sun, 2009-10-25 at 22:36 -0700, Arjan van de Ven wrote:
> >
> > if you're a multicore domain you better have a shared cache.. that's
> > what it should mean. If it does not we should fix that.
>
> Fully agreed.. I've been dying to do the below for ages :-)
>

completely sane.

Acked-by: Arjan van de Ven <[email protected]>