Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755582AbcCCMdF (ORCPT ); Thu, 3 Mar 2016 07:33:05 -0500 Received: from mail-wm0-f67.google.com ([74.125.82.67]:33677 "EHLO mail-wm0-f67.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751215AbcCCMdD (ORCPT ); Thu, 3 Mar 2016 07:33:03 -0500 Date: Thu, 3 Mar 2016 13:32:59 +0100 From: Michal Hocko To: Hugh Dickins Cc: Vlastimil Babka , Joonsoo Kim , Andrew Morton , Linus Torvalds , Johannes Weiner , Mel Gorman , David Rientjes , Tetsuo Handa , Hillf Danton , KAMEZAWA Hiroyuki , linux-mm@kvack.org, LKML Subject: Re: [PATCH 0/3] OOM detection rework v4 Message-ID: <20160303123258.GE26202@dhcp22.suse.cz> References: <1450203586-10959-1-git-send-email-mhocko@kernel.org> <20160203132718.GI6757@dhcp22.suse.cz> <20160229203502.GW16930@dhcp22.suse.cz> <20160301133846.GF9461@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1052 Lines: 31 On Thu 03-03-16 01:54:43, Hugh Dickins wrote: > On Tue, 1 Mar 2016, Michal Hocko wrote: [...] > > So I have tried the following: > > diff --git a/mm/compaction.c b/mm/compaction.c > > index 4d99e1f5055c..7364e48cf69a 100644 > > --- a/mm/compaction.c > > +++ b/mm/compaction.c > > @@ -1276,6 +1276,9 @@ static unsigned long __compaction_suitable(struct zone *zone, int order, > > alloc_flags)) > > return COMPACT_PARTIAL; > > > > + if (order <= PAGE_ALLOC_COSTLY_ORDER) > > + return COMPACT_CONTINUE; > > + > > I gave that a try just now, but it didn't help me: OOMed much sooner, > after doing half as much work. I do not have an explanation why it would cause oom sooner but this turned out to be incomplete. There is another wmaark check deeper in the compaction path. Could you try the one from http://lkml.kernel.org/r/20160302130022.GG26686@dhcp22.suse.cz I will try to find a machine with more CPUs and try to reproduce this in the mean time. I will also have a look at the data you have collected. -- Michal Hocko SUSE Labs