Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1757543AbcCCU5d (ORCPT ); Thu, 3 Mar 2016 15:57:33 -0500 Received: from mail-pf0-f181.google.com ([209.85.192.181]:36420 "EHLO mail-pf0-f181.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752032AbcCCU5c (ORCPT ); Thu, 3 Mar 2016 15:57:32 -0500 Date: Thu, 3 Mar 2016 12:57:23 -0800 (PST) From: Hugh Dickins X-X-Sender: hugh@eggly.anvils To: Michal Hocko cc: Hugh Dickins , Vlastimil Babka , Joonsoo Kim , Andrew Morton , Linus Torvalds , Johannes Weiner , Mel Gorman , David Rientjes , Tetsuo Handa , Hillf Danton , KAMEZAWA Hiroyuki , linux-mm@kvack.org, LKML Subject: Re: [PATCH 0/3] OOM detection rework v4 In-Reply-To: <20160303123258.GE26202@dhcp22.suse.cz> Message-ID: References: <1450203586-10959-1-git-send-email-mhocko@kernel.org> <20160203132718.GI6757@dhcp22.suse.cz> <20160229203502.GW16930@dhcp22.suse.cz> <20160301133846.GF9461@dhcp22.suse.cz> <20160303123258.GE26202@dhcp22.suse.cz> User-Agent: Alpine 2.11 (LSU 23 2013-08-11) MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1406 Lines: 42 On Thu, 3 Mar 2016, Michal Hocko wrote: > On Thu 03-03-16 01:54:43, Hugh Dickins wrote: > > On Tue, 1 Mar 2016, Michal Hocko wrote: > [...] > > > So I have tried the following: > > > diff --git a/mm/compaction.c b/mm/compaction.c > > > index 4d99e1f5055c..7364e48cf69a 100644 > > > --- a/mm/compaction.c > > > +++ b/mm/compaction.c > > > @@ -1276,6 +1276,9 @@ static unsigned long __compaction_suitable(struct zone *zone, int order, > > > alloc_flags)) > > > return COMPACT_PARTIAL; > > > > > > + if (order <= PAGE_ALLOC_COSTLY_ORDER) > > > + return COMPACT_CONTINUE; > > > + > > > > I gave that a try just now, but it didn't help me: OOMed much sooner, > > after doing half as much work. I think I exaggerated: sooner, but not _much_ sooner; and I cannot see now what I based that estimate of "half as much work" on. > > I do not have an explanation why it would cause oom sooner but this > turned out to be incomplete. There is another wmaark check deeper in the > compaction path. Could you try the one from > http://lkml.kernel.org/r/20160302130022.GG26686@dhcp22.suse.cz I've now added that in: it corrects the "sooner", but does not make any difference to the fact of OOMing for me. Hugh > > I will try to find a machine with more CPUs and try to reproduce this in > the mean time. > > I will also have a look at the data you have collected. > -- > Michal Hocko > SUSE Labs