2022-11-30 13:06:56

by Yair Podemsky

[permalink] [raw]
Subject: [PATCH] sched/core: Fix arch_scale_freq_tick() on tickless systems

In order for the scheduler to be frequency invariant we measure the
ratio between the maximum cpu frequency and the actual cpu frequency.
During long tickless periods of time the calculations that keep track
of that might overflow, in the function scale_freq_tick():

if (check_shl_overflow(acnt, 2*SCHED_CAPACITY_SHIFT, &acnt))
» goto error;

eventually forcing the kernel to disable the feature for all cpus,
and show the warning message
"Scheduler frequency invariance went wobbly, disabling!".
Let's avoid that by limiting the frequency invariant calculations
to cpus with regular tick.

Fixes: e2b0d619b400 ("x86, sched: check for counters overflow in frequency invariant accounting")
Signed-off-by: Yair Podemsky <[email protected]>
Suggested-by: "Peter Zijlstra (Intel)" <[email protected]>
---
V1 -> V2: solution approach was changed from detecting long tickless periods
to frequency invariant measurements on housekeeping cpus only.
Link: [email protected]
---
kernel/sched/core.c | 4 +++-
1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/kernel/sched/core.c b/kernel/sched/core.c
index daff72f00385..1bb0a840c817 100644
--- a/kernel/sched/core.c
+++ b/kernel/sched/core.c
@@ -5469,7 +5469,9 @@ void scheduler_tick(void)
unsigned long thermal_pressure;
u64 resched_latency;

- arch_scale_freq_tick();
+ if (housekeeping_cpu(cpu, HK_TYPE_TICK))
+ arch_scale_freq_tick();
+
sched_clock_tick();

rq_lock(rq, &rf);
--
2.31.1


2022-12-14 10:07:03

by Valentin Schneider

[permalink] [raw]
Subject: Re: [PATCH] sched/core: Fix arch_scale_freq_tick() on tickless systems

On 30/11/22 14:51, Yair Podemsky wrote:
> In order for the scheduler to be frequency invariant we measure the
> ratio between the maximum cpu frequency and the actual cpu frequency.
> During long tickless periods of time the calculations that keep track
> of that might overflow, in the function scale_freq_tick():
>
> if (check_shl_overflow(acnt, 2*SCHED_CAPACITY_SHIFT, &acnt))
> » goto error;
>
> eventually forcing the kernel to disable the feature for all cpus,
> and show the warning message
> "Scheduler frequency invariance went wobbly, disabling!".
> Let's avoid that by limiting the frequency invariant calculations
> to cpus with regular tick.
>
> Fixes: e2b0d619b400 ("x86, sched: check for counters overflow in frequency invariant accounting")
> Signed-off-by: Yair Podemsky <[email protected]>
> Suggested-by: "Peter Zijlstra (Intel)" <[email protected]>

Reviewed-by: Valentin Schneider <[email protected]>

> ---
> V1 -> V2: solution approach was changed from detecting long tickless periods
> to frequency invariant measurements on housekeeping cpus only.
> Link: [email protected]
> ---
> kernel/sched/core.c | 4 +++-
> 1 file changed, 3 insertions(+), 1 deletion(-)
>
> diff --git a/kernel/sched/core.c b/kernel/sched/core.c
> index daff72f00385..1bb0a840c817 100644
> --- a/kernel/sched/core.c
> +++ b/kernel/sched/core.c
> @@ -5469,7 +5469,9 @@ void scheduler_tick(void)
> unsigned long thermal_pressure;
> u64 resched_latency;
>
> - arch_scale_freq_tick();
> + if (housekeeping_cpu(cpu, HK_TYPE_TICK))
> + arch_scale_freq_tick();
> +
> sched_clock_tick();
>
> rq_lock(rq, &rf);
> --
> 2.31.1

2022-12-14 15:49:53

by Giovanni Gherdovich

[permalink] [raw]
Subject: Re: [PATCH] sched/core: Fix arch_scale_freq_tick() on tickless systems

On Wed, 2022-11-30 at 14:51 +0200, Yair Podemsky wrote:
> In order for the scheduler to be frequency invariant we measure the
> ratio between the maximum cpu frequency and the actual cpu frequency.
> During long tickless periods of time the calculations that keep track
> of that might overflow, in the function scale_freq_tick():
>
> if (check_shl_overflow(acnt, 2*SCHED_CAPACITY_SHIFT, &acnt))
> » goto error;
>
> eventually forcing the kernel to disable the feature for all cpus,
> and show the warning message
> "Scheduler frequency invariance went wobbly, disabling!".
> Let's avoid that by limiting the frequency invariant calculations
> to cpus with regular tick.
>
> Fixes: e2b0d619b400 ("x86, sched: check for counters overflow in frequency invariant accounting")
> Signed-off-by: Yair Podemsky <[email protected]>
> Suggested-by: "Peter Zijlstra (Intel)" <[email protected]>

Acked-by: Giovanni Gherdovich <[email protected]>

> ---
> V1 -> V2: solution approach was changed from detecting long tickless periods
> to frequency invariant measurements on housekeeping cpus only.
> Link: [email protected]
> ---
> kernel/sched/core.c | 4 +++-
> 1 file changed, 3 insertions(+), 1 deletion(-)
>
> diff --git a/kernel/sched/core.c b/kernel/sched/core.c
> index daff72f00385..1bb0a840c817 100644
> --- a/kernel/sched/core.c
> +++ b/kernel/sched/core.c
> @@ -5469,7 +5469,9 @@ void scheduler_tick(void)
> unsigned long thermal_pressure;
> u64 resched_latency;
>
> - arch_scale_freq_tick();
> + if (housekeeping_cpu(cpu, HK_TYPE_TICK))
> + arch_scale_freq_tick();
> +
> sched_clock_tick();
>
> rq_lock(rq, &rf);

2023-01-04 16:03:01

by Yair Podemsky

[permalink] [raw]
Subject: Re: [PATCH] sched/core: Fix arch_scale_freq_tick() on tickless systems

Friendly ping

On Wed, 2022-12-14 at 16:35 +0100, Giovanni Gherdovich wrote:
> On Wed, 2022-11-30 at 14:51 +0200, Yair Podemsky wrote:
> > In order for the scheduler to be frequency invariant we measure the
> > ratio between the maximum cpu frequency and the actual cpu
> > frequency.
> > During long tickless periods of time the calculations that keep
> > track
> > of that might overflow, in the function scale_freq_tick():
> >
> > if (check_shl_overflow(acnt, 2*SCHED_CAPACITY_SHIFT, &acnt))
> > » goto error;
> >
> > eventually forcing the kernel to disable the feature for all cpus,
> > and show the warning message
> > "Scheduler frequency invariance went wobbly, disabling!".
> > Let's avoid that by limiting the frequency invariant calculations
> > to cpus with regular tick.
> >
> > Fixes: e2b0d619b400 ("x86, sched: check for counters overflow in
> > frequency invariant accounting")
> > Signed-off-by: Yair Podemsky <[email protected]>
> > Suggested-by: "Peter Zijlstra (Intel)" <[email protected]>
>
> Acked-by: Giovanni Gherdovich <[email protected]>
>
> > ---
> > V1 -> V2: solution approach was changed from detecting long
> > tickless periods
> > to frequency invariant measurements on housekeeping cpus only.
> > Link: [email protected]
> > ---
> > kernel/sched/core.c | 4 +++-
> > 1 file changed, 3 insertions(+), 1 deletion(-)
> >
> > diff --git a/kernel/sched/core.c b/kernel/sched/core.c
> > index daff72f00385..1bb0a840c817 100644
> > --- a/kernel/sched/core.c
> > +++ b/kernel/sched/core.c
> > @@ -5469,7 +5469,9 @@ void scheduler_tick(void)
> > unsigned long thermal_pressure;
> > u64 resched_latency;
> >
> > - arch_scale_freq_tick();
> > + if (housekeeping_cpu(cpu, HK_TYPE_TICK))
> > + arch_scale_freq_tick();
> > +
> > sched_clock_tick();
> >
> > rq_lock(rq, &rf);

Subject: [tip: sched/urgent] sched/core: Fix arch_scale_freq_tick() on tickless systems

The following commit has been merged into the sched/urgent branch of tip:

Commit-ID: 7fb3ff22ad8772bbf0e3ce1ef3eb7b09f431807f
Gitweb: https://git.kernel.org/tip/7fb3ff22ad8772bbf0e3ce1ef3eb7b09f431807f
Author: Yair Podemsky <[email protected]>
AuthorDate: Wed, 30 Nov 2022 14:51:21 +02:00
Committer: Ingo Molnar <[email protected]>
CommitterDate: Sat, 07 Jan 2023 12:25:50 +01:00

sched/core: Fix arch_scale_freq_tick() on tickless systems

In order for the scheduler to be frequency invariant we measure the
ratio between the maximum CPU frequency and the actual CPU frequency.

During long tickless periods of time the calculations that keep track
of that might overflow, in the function scale_freq_tick():

if (check_shl_overflow(acnt, 2*SCHED_CAPACITY_SHIFT, &acnt))
goto error;

eventually forcing the kernel to disable the feature for all CPUs,
and show the warning message:

"Scheduler frequency invariance went wobbly, disabling!".

Let's avoid that by limiting the frequency invariant calculations
to CPUs with regular tick.

Fixes: e2b0d619b400 ("x86, sched: check for counters overflow in frequency invariant accounting")
Suggested-by: "Peter Zijlstra (Intel)" <[email protected]>
Signed-off-by: Yair Podemsky <[email protected]>
Signed-off-by: Ingo Molnar <[email protected]>
Reviewed-by: Valentin Schneider <[email protected]>
Acked-by: Giovanni Gherdovich <[email protected]>
Link: https://lore.kernel.org/r/[email protected]
---
kernel/sched/core.c | 4 +++-
1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/kernel/sched/core.c b/kernel/sched/core.c
index 25b582b..965d813 100644
--- a/kernel/sched/core.c
+++ b/kernel/sched/core.c
@@ -5504,7 +5504,9 @@ void scheduler_tick(void)
unsigned long thermal_pressure;
u64 resched_latency;

- arch_scale_freq_tick();
+ if (housekeeping_cpu(cpu, HK_TYPE_TICK))
+ arch_scale_freq_tick();
+
sched_clock_tick();

rq_lock(rq, &rf);