by Tetsuo Handa

[permalink] [raw]

Subject: Re: [PATCH 0/9] oom reaper v6

David Rientjes wrote:
> On Tue, 22 Mar 2016, Michal Hocko wrote:
>
> > Hi,
> > I am reposting the whole patchset on top of the current Linus tree which should
> > already contain big pile of Andrew's mm patches. This should serve an easier
> > reviewability and I also hope that this core part of the work can go to 4.6.
> >
> > The previous version was posted here [1] Hugh and David have suggested to
> > drop [2] because the munlock path currently depends on the page lock and
> > it is better if the initial version was conservative and prevent from
> > any potential lockups even though it is not clear whether they are real
> > - nobody has seen oom_reaper stuck on the page lock AFAICK. Me or Hugh
> > will have a look and try to make the munlock path not depend on the page
> > lock as a follow up work.
> >
> > Apart from that the feedback revealed one bug for a very unusual
> > configuration (sysctl_oom_kill_allocating_task) and that has been fixed
> > by patch 8 and one potential mis interaction with the pm freezer fixed by
> > patch 7.
> >
> > I think the current code base is already very useful for many situations.
> > The rest of the feedback was mostly about potential enhancements of the
> > current code which I would really prefer to build on top of the current
> > series. I plan to finish my mmap_sem killable for write in the upcoming
> > release cycle and hopefully have it merged in the next merge window.
> > I believe more extensions will follow.
> >
> > This code has been sitting in the mmotm (thus linux-next) for a while.
> > Are there any fundamental objections to have this part merged in this
> > merge window?
> >
>
> Tetsuo, have you been able to run your previous test cases on top of this
> version and do you have any concerns about it or possible extensions that
> could be made?
>

I think [PATCH 3/9] [PATCH 4/9] [PATCH 8/9] will be mostly reverted.
My concerns and possible extensions are explained in

Re: [PATCH 6/5] oom, oom_reaper: disable oom_reaper for oom_kill_allocating_task
http://lkml.kernel.org/r/[email protected]

. Regarding "[PATCH 4/9] mm, oom_reaper: report success/failure",
debug_show_all_locks() may not be safe

commit 856848737bd944c1 "lockdep: fix debug_show_all_locks()"
commit 82a1fcb90287052a "softlockup: automatically detect hung TASK_UNINTERRUPTIBLE tasks"

and showing traces might be more useful.
(A discussion for making printk() completely async is in progress.)

But we don't have time to update this series before merge window for 4.6 closes.
We want to send current patchset as is for now, don't we? So, please go ahead.

My other concerns about OOM handling:

Change TIF_MEMDIE strategy from per a thread to per a signal_struct.

[PATCH] mm,oom: Set TIF_MEMDIE on all OOM-killed threads.
http://lkml.kernel.org/r/1458529634-5951-1-git-send-email-penguin-kernel@I-love.SAKURA.ne.jp

Found a bug in too_many_isolated() assumption.

How to handle infinite too_many_isolated() loop (for OOM detection rework v4) ?
http://lkml.kernel.org/r/[email protected]

Waiting for a patch to be merged.

[PATCH] mm,writeback: Don't use memory reserves for wb_start_writeback
http://lkml.kernel.org/r/[email protected]

And, kmallocwd, __GFP_KILLABLE, and timeout (or something finite one) for TIF_MEMDIE.

2016-03-23 12:07:27

by Michal Hocko

[permalink] [raw]

Subject: Re: [PATCH 0/9] oom reaper v6

On Wed 23-03-16 20:11:35, Tetsuo Handa wrote:
> David Rientjes wrote:
[...]
> > Tetsuo, have you been able to run your previous test cases on top of this
> > version and do you have any concerns about it or possible extensions that
> > could be made?
> >
>
> I think [PATCH 3/9] [PATCH 4/9] [PATCH 8/9] will be mostly reverted.
> My concerns and possible extensions are explained in
>
> Re: [PATCH 6/5] oom, oom_reaper: disable oom_reaper for oom_kill_allocating_task
> http://lkml.kernel.org/r/[email protected]

I believe issues you have raised there are a matter for further
discussion as they are potential improvements of the existing
functionality rather than fixing a regression of the current code.

> . Regarding "[PATCH 4/9] mm, oom_reaper: report success/failure",
> debug_show_all_locks() may not be safe
>
> commit 856848737bd944c1 "lockdep: fix debug_show_all_locks()"
> commit 82a1fcb90287052a "softlockup: automatically detect hung TASK_UNINTERRUPTIBLE tasks"

Let me ask again. What exactly is unsafe about calling
debug_show_all_locks here? It is true that 856848737bd944c1 has
changed debug_show_all_locks to ignore running tasks which limits
this functionality to some degree but I still think this might be
useful. Proposed alternatives were way too verbose and complex on its
own. This is something to be further discussed as well, though.

> and showing traces might be more useful.
> (A discussion for making printk() completely async is in progress.)
>
> But we don't have time to update this series before merge window for 4.6 closes.
> We want to send current patchset as is for now, don't we? So, please go ahead.

I am happy that we are on the same patch here.

> My other concerns about OOM handling:

Let's stick to oom reaper here, please.

Thanks!
--
Michal Hocko
SUSE Labs