LinuxLists.cc - [PATCH] sched: Fix adverse effects of NFS client on interactive response

2006-01-04 12:02:30

Con Kolivas wrote:
> On Fri, 6 Jan 2006 11:02 am, Peter Williams wrote:
>
>>Con Kolivas wrote:
>>
>>>On Fri, 6 Jan 2006 10:13 am, Peter Williams wrote:
>>>
>>>>If the plugsched patches were included in -mm we could get wider testing
>>>>of alternative scheduling mechanisms. But I think it will take a lot of
>>>>testing of the new schedulers to allay fears that they may introduce new
>>>>problems of their own.
>>>
>>>When I first generated plugsched and posted it to lkml for inclusion in
>>>-mm it was blocked as having no chance of being included by both Ingo and
>>>Linus and I doubt they've changed their position since then. As you're
>>>well aware this is why I gave up working on it and let you maintain it
>>>since then. Obviously I thought it was a useful feature or I wouldn't
>>>have worked on it.
>>
>>I've put a lot of effort into reducing code duplication and reducing the
>>size of the interface and making it completely orthogonal to load
>>balancing so I'm hopeful (perhaps mistakenly) that this makes it more
>>acceptable (at least in -mm).
>
>
> The objection was to dilution of developer effort towards one cpu scheduler to
> rule them all.

I think that I've partially addressed that objection by narrowing the
focus of the alternative schedulers so that the dilution of effort is
reduced. The dichotomy between the dual array schedulers (ingosched and
nicksched) and the single array schedulers (staircase and the SPA
schedulers) is the main stumbling block to narrowing the focus further.

> Linus' objection was against specialisation - he preferred one
> cpu scheduler that could do everything rather than unique cpu schedulers for
> NUMA, SMP, UP, embedded...

kernbench results show that the penalties for an all purpose scheduler
aren't very big so it's probably not a bad philosophy. In spite of this
I think specialization is worth pursuing if it can be achieved with very
small configurable differences to the mechanism. If the configuration
change can be done at boot time or on a running system then it's even
better e.g. your "compute" switch in staircase.

> Each approach has its own arguments and there
> isn't much point bringing them up again. We shall use Linux as the
> "steamroller to crack a nut" no matter what that nut is.
>

Even if plugsched has no hope of getting into the mainline kernel, I see
it as a useful tool for the practical evaluation of the various
approaches. If it could go into -mm for a while this evaluation could
be more widespread.

In it's current state it should not interfere with other scheduling
related development such as the load balancing changes, cpusets etc.

>
>>My testing shows that there's no observable difference in performance
>>between a stock kernel and plugsched with ingosched selected at the
>>total system level (although micro benchmarking may show slight
>>increases in individual operations).
>
>
> I could find no difference either, but IA64 which does not cope with
> indirection well would probably suffer a demonstrable performance hit I have
> been told.

I wasn't aware of that.

> I do not have access to such hardware.

Nor do I.

Peter
--
Peter Williams [email protected]

"Learning, n. The kind of ignorance distinguishing the studious."
-- Ambrose Bierce

2006-01-06 07:39:49

On Sunday 08 January 2006 10:31, Peter Williams wrote:
> In any case and in the meantime, perhaps the solution is to use
> TASK_NONINTERACTIVE where needed but treat
> TASK_INTERRUPTIBLE|TASK_NONINTERACTIVE sleep the same as
> TASK_UNINTERRUPTIBLE sleep instead of ignoring it?

That's how I would tackle it.

Con

2006-01-08 05:51:52

by Mike Galbraith

[permalink] [raw]

Subject: Re: [PATCH] sched: Fix adverse effects of NFS client on interactive response

At 10:40 AM 1/8/2006 +1100, Peter Williams wrote:
>Mike Galbraith wrote:
>>> One slight variation of your scheme would be to measure the average
>>> length of the CPU runs that the task does (i.e. how long it runs
>>> without voluntarily relinquishing the CPU) and not allowing them to
>>> defer the shift to the expired array if this average run length is
>>> greater than some specified value. The length of this average for each
>>> task shouldn't change with system load. (This is more or less saying
>>> that it's ok for a task to stay on the active array provided it's
>>> unlikely to delay the switch between the active and expired arrays for
>>> very long.)
>>
>>Average burn time would indeed probably be a better metric, but that
>>would require doing bookkeeping is the fast path.
>
>Most of the infrastructure is already there and the cost of doing the
>extra bits required to get this metric would be extremely small. The
>hardest bit would be deciding on the "limit" to be applied when deciding
>whether to let a supposed interactive task stay on the active array.

Yeah, I noticed run_time when I started implementing my first cut. (which
is of course buggy)

>By the way, it seems you have your own scheduler versions? If so are you
>interested in adding them to the collection in PlugSched?

No, I used to do a bunch of experimentation in fairness vs interactivity,
but they all ended up just trading one weakness for an other.

-Mike