by Guest section DW

[permalink] [raw]

Subject: Re: OOM killer???

On Thu, Mar 29, 2001 at 11:29:34AM +0200, Dr. Michael Weller wrote:
> On Wed, 28 Mar 2001, Andreas Dilger wrote:
>
> > Szaka writes:
> > > On Tue, 27 Mar 2001, Andreas Dilger wrote:
> > > > Every time this subject comes up, I point to AIX and SIGDANGER - a signal
> > > > sent to processes when the system gets OOM.
> > >
> > > And every time the SIGDANGER comes up, the issue that AIX provides
> > > *both* early and late allocation mechanism even on per-process basis
> > > that can be controlled by *both* the programmer and the admin is
> > > completely ignored. Linux supports none of these...
>
> > If Linux provided both of those, then people would probably already be
> > happy.
>
> Probably.

Two things are wrong.
1. Linux has an OOM killer.
2. The OOM killer has a bad behaviour.

Presently, with the proper kind of load, one can see a process killed
by OOM almost daily. That is totally unacceptable.
People are working on refining the algorithm so that blatant idiocies
where processes are killed while there is plenty of resources
are avoided. Good. Suppose it done. Then one thing is wrong.

1. Linux has an OOM killer.

A system with an OOM killer is unreliable. Linux must have a reliable
mode of operation, and that must be the default mode.

Now you assume that adding SIGDANGER would make people happy.
But it would be a rather unimportant addition.
It might help in some cases, but it falls in the category
of improving the OOM killer a little.

People will be happy when Linux is reliable by default.

Andries

[Never use planes where the company's engineers spend their
time designing algorithms for selecting which passenger
must be thrown out when the plane is overloaded.]

2001-03-29 12:00:34

Just to throw my own observations into the war, I have to agree with David
K. here. This needs to be some sort of module and/or interface. Get the
policy into a replaceable user space module.

One of the hot areas for the kernel right now is for embedded systems.
They need an entirely different strategy than for a desk top. I'm working
on such a thing now were we don't even have an enabled swap space and the
OOM is causing us no end of trouble as we start dipping below 1MB "free"
system memory.

On 29 Mar 2001, Michael Peddemors wrote:

> Looking over the last few weeks of postings, there are just WAY to many
> conflicting ways that people want the OOM to work.. Although an
> incredible amount of good work has gone into this, people are definetely
> not happy about the benifits of OOM ... About 10 different approaches
> are being made to change the rule based systems pertaining to WHEN the
> OOM will fire, but in the end, still not everyone will be happy..

<SNIP>

> On 29 Mar 2001 07:41:44 -0800, David Konerding wrote:
>
> > Now, if you're going to implement OOM, when it is absolutely necessary, at the very
> > least, move the policy implementation out of the kernel. One of the general
> > philosophies of Linux has been to move policy out of the kernel. In this case, you'd
> > just have a root owned process with locked pages that can't be pre-empted, which
> > implemented the policy. You'll never come up with an OOM policy that will fit
> > everybody's needs unless it can be tuned for particular system's usage, and it's
> > going to be far easier to come up with that policy if it's not in the kernel.