Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754777AbcK2QQE (ORCPT ); Tue, 29 Nov 2016 11:16:04 -0500 Received: from magic.merlins.org ([209.81.13.136]:50306 "EHLO mail1.merlins.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754676AbcK2QPt (ORCPT ); Tue, 29 Nov 2016 11:15:49 -0500 Date: Tue, 29 Nov 2016 08:15:29 -0800 From: Marc MERLIN To: Michal Hocko Cc: Vlastimil Babka , Linus Torvalds , linux-mm , LKML , Joonsoo Kim , Tejun Heo , Greg Kroah-Hartman Message-ID: <20161129161529.wyvuxd3fpsxitag7@merlins.org> References: <20161121154336.GD19750@merlins.org> <0d4939f3-869d-6fb8-0914-5f74172f8519@suse.cz> <20161121215639.GF13371@merlins.org> <20161122160629.uzt2u6m75ash4ved@merlins.org> <48061a22-0203-de54-5a44-89773bff1e63@suse.cz> <20161123063410.GB2864@dhcp22.suse.cz> <20161128072315.GC14788@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20161128072315.GC14788@dhcp22.suse.cz> X-Sysadmin: BOFH X-URL: http://marc.merlins.org/ User-Agent: NeoMutt/20160916 (1.7.0) X-SA-Exim-Connect-IP: 173.11.111.145 X-SA-Exim-Mail-From: marc@merlins.org X-Spam-Report: * -2.9 RP_MATCHES_RCVD Envelope sender domain matches handover relay domain * 0.7 SPF_SOFTFAIL SPF: sender does not match SPF record (softfail) * -1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1% * [score: 0.0000] * -1.5 GREYLIST_ISWHITE The incoming server has been whitelisted for this * receipient and sender Subject: Re: 4.8.8 kernel trigger OOM killer repeatedly when I have lots of RAM that should be free Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2817 Lines: 69 On Mon, Nov 28, 2016 at 08:23:15AM +0100, Michal Hocko wrote: > Marc, could you try this patch please? I think it should be pretty clear > it should help you but running it through your use case would be more > than welcome before I ask Greg to take this to the 4.8 stable tree. > > Thanks! > > On Wed 23-11-16 07:34:10, Michal Hocko wrote: > [...] > > commit b2ccdcb731b666aa28f86483656c39c5e53828c7 > > Author: Michal Hocko > > Date: Wed Nov 23 07:26:30 2016 +0100 > > > > mm, oom: stop pre-mature high-order OOM killer invocations > > > > 31e49bfda184 ("mm, oom: protect !costly allocations some more for > > !CONFIG_COMPACTION") was an attempt to reduce chances of pre-mature OOM > > killer invocation for high order requests. It seemed to work for most > > users just fine but it is far from bullet proof and obviously not > > sufficient for Marc who has reported pre-mature OOM killer invocations > > with 4.8 based kernels. 4.9 will all the compaction improvements seems > > to be behaving much better but that would be too intrusive to backport > > to 4.8 stable kernels. Instead this patch simply never declares OOM for > > !costly high order requests. We rely on order-0 requests to do that in > > case we are really out of memory. Order-0 requests are much more common > > and so a risk of a livelock without any way forward is highly unlikely. > > > > Reported-by: Marc MERLIN > > Signed-off-by: Michal Hocko Tested-by: Marc MERLIN Marc > > diff --git a/mm/page_alloc.c b/mm/page_alloc.c > > index a2214c64ed3c..7401e996009a 100644 > > --- a/mm/page_alloc.c > > +++ b/mm/page_alloc.c > > @@ -3161,6 +3161,16 @@ should_compact_retry(struct alloc_context *ac, unsigned int order, int alloc_fla > > if (!order || order > PAGE_ALLOC_COSTLY_ORDER) > > return false; > > > > +#ifdef CONFIG_COMPACTION > > + /* > > + * This is a gross workaround to compensate a lack of reliable compaction > > + * operation. We cannot simply go OOM with the current state of the compaction > > + * code because this can lead to pre mature OOM declaration. > > + */ > > + if (order <= PAGE_ALLOC_COSTLY_ORDER) > > + return true; > > +#endif > > + > > /* > > * There are setups with compaction disabled which would prefer to loop > > * inside the allocator rather than hit the oom killer prematurely. > > -- > > Michal Hocko > > SUSE Labs > > -- > Michal Hocko > SUSE Labs > -- "A mouse is a device used to point at the xterm you want to type in" - A.S.R. Microsoft is to operating systems .... .... what McDonalds is to gourmet cooking Home page: http://marc.merlins.org/ | PGP 1024R/763BE901