Date: Sat, 7 Apr 2018 21:27:09 -0700
From: Matthew Wilcox
To: Michal Hocko
Cc: LKML, linux-mm@kvack.org, Vlastimil Babka
Subject: Re: __GFP_LOW
Message-ID: <20180408042709.GC32632@bombadil.infradead.org>
In-Reply-To: <20180406060953.GA8286@dhcp22.suse.cz>
X-Mailing-List: linux-kernel@vger.kernel.org

On Fri, Apr 06, 2018 at 08:09:53AM +0200, Michal Hocko wrote:
> OK, we already split the documentation into these categories. So we got
> at least the structure right ;)

Yes, this part of the documentation makes sense to me :-)

> >  - What kind of memory to allocate (DMA, NORMAL, HIGHMEM)
> >  - Where to get the pages from
> >    - Local node only (THISNODE)
> >    - Only in compliance with cpuset policy (HARDWALL)
> >    - Spread the pages between zones (WRITE)
> >    - The movable zone (MOVABLE)
> >    - The reclaimable zone (RECLAIMABLE)
> >  - What you are willing to do if no free memory is available:
> >    - Nothing at all (NOWAIT)
> >    - Use my own time to free memory (DIRECT_RECLAIM)
> >      - But only try once (NORETRY)
> >      - Can call into filesystems (FS)
> >      - Can start I/O (IO)
> >      - Can sleep (!ATOMIC)
> >    - Steal time from other processes to free memory (KSWAPD_RECLAIM)
>
> What does that mean? If I drop the flag, do not steal? Well I do because
> they will hit direct reclaim sooner...

If they allocate memory, sure.  A process which stays in its working
set won't, unless it's preempted by kswapd.

> >    - Kill other processes to get their memory (!RETRY_MAYFAIL)
>
> Not really for costly orders.

Yes, I need to be more precise there.

> >    - All of the above, and wait forever (NOFAIL)
> >    - Take from emergency reserves (HIGH)
> >    - ... but not the last parts of the regular reserves (LOW)
>
> What does that mean and how it is different from NOWAIT? Is this about
> the low watermark and if yes do we want to teach users about this and
> make the whole thing even more complicated? Does it wake
> kswapd? What is the eagerness ordering? LOW, NOWAIT, NORETRY,
> RETRY_MAYFAIL, NOFAIL?

LOW doesn't quite fit into the eagerness scale with the other flags;
instead it's composable with them.  So you can specify NOWAIT | LOW,
NORETRY | LOW, NOFAIL | LOW, etc.  All I have in mind is something
like this:

        if (alloc_flags & ALLOC_HIGH)
                min -= min / 2;
+       if (alloc_flags & ALLOC_LOW)
+               min += min / 2;

The idea is that a GFP_KERNEL | __GFP_LOW allocation cannot force a
GFP_KERNEL allocation into an OOM situation because it cannot take
the last pages of memory before the watermark.  It can still make a
GFP_KERNEL allocation *more likely* to hit OOM (just like any other kind
of allocation can), but it can't do it by itself.

---

I've been wondering about combining the DIRECT_RECLAIM, NORETRY,
RETRY_MAYFAIL and NOFAIL flags together into a single field:

        0 => RECLAIM_NEVER,     /* !DIRECT_RECLAIM */
        1 => RECLAIM_ONCE,      /* NORETRY */
        2 => RECLAIM_PROGRESS,  /* RETRY_MAYFAIL */
        3 => RECLAIM_FOREVER,   /* NOFAIL */

The existence of __GFP_RECLAIM makes this a bit tricky.  I honestly don't
know what this code is asking for:

kernel/power/swap.c:    __get_free_page(__GFP_RECLAIM | __GFP_HIGH);

but I suspect I'll have to find out.  There are about 60 places to look at.

I also want to add __GFP_KILL (to be part of the GFP_KERNEL definition).
That way, each bit that you set in the GFP mask increases the things the
page allocator can do to get memory for you.
At the moment, RETRY_MAYFAIL subtracts the ability to kill other tasks,
which is unusual.  For example, this test in kvmalloc_node:

        WARN_ON_ONCE((flags & GFP_KERNEL) != GFP_KERNEL);

doesn't catch RETRY_MAYFAIL being set.
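
To make that last point concrete, here is a minimal userspace sketch of
the mask test.  It is only an illustration: the SK_* bit values are
stand-ins invented for the example (not the kernel's real __GFP_*
encodings), would_warn() and gfp_mask_demo() are hypothetical helpers,
and SK_GFP_KERNEL_KILL assumes one possible reading of the __GFP_KILL
idea, namely that GFP_KERNEL simply gains one more positive bit.

```c
#include <assert.h>
#include <stdbool.h>

/*
 * Stand-in bit values for this sketch only.  They are NOT the kernel's
 * real __GFP_* encodings, and SK_KILL models a bit that is only being
 * proposed in this mail.
 */
#define SK_IO             0x01u
#define SK_FS             0x02u
#define SK_DIRECT_RECLAIM 0x04u
#define SK_KSWAPD_RECLAIM 0x08u
#define SK_RETRY_MAYFAIL  0x10u
#define SK_KILL           0x20u

/* Current shape: both reclaim bits plus IO and FS. */
#define SK_GFP_KERNEL \
	(SK_DIRECT_RECLAIM | SK_KSWAPD_RECLAIM | SK_IO | SK_FS)
/* Assumed shape if the kill permission became an ordinary positive bit. */
#define SK_GFP_KERNEL_KILL (SK_GFP_KERNEL | SK_KILL)

/* Same form as the kvmalloc_node test: true when the warning would fire. */
static bool would_warn(unsigned int flags, unsigned int required)
{
	return (flags & required) != required;
}

/* Walks through the three cases discussed above. */
void gfp_mask_demo(void)
{
	/*
	 * Today: setting RETRY_MAYFAIL removes a capability, yet every
	 * GFP_KERNEL bit is still present, so the test stays silent.
	 */
	assert(!would_warn(SK_GFP_KERNEL | SK_RETRY_MAYFAIL, SK_GFP_KERNEL));

	/*
	 * With a positive KILL bit, a caller that gives up the ability to
	 * kill clears a required bit, and the same test fires.
	 */
	assert(would_warn(SK_GFP_KERNEL_KILL & ~SK_KILL, SK_GFP_KERNEL_KILL));

	/* A full mask under the assumed definition still passes. */
	assert(!would_warn(SK_GFP_KERNEL_KILL, SK_GFP_KERNEL_KILL));
}
```

Calling gfp_mask_demo() from a small main() runs all three checks.  The
design point is that a "has all required bits" test can only notice
missing bits, so it works best when every bit grants a capability
rather than taking one away.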