Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1759329AbZLOEbB (ORCPT ); Mon, 14 Dec 2009 23:31:01 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1759258AbZLOEbA (ORCPT ); Mon, 14 Dec 2009 23:31:00 -0500 Received: from smtp-out.google.com ([216.239.33.17]:18261 "EHLO smtp-out.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752766AbZLOEa6 (ORCPT ); Mon, 14 Dec 2009 23:30:58 -0500 DomainKey-Signature: a=rsa-sha1; s=beta; d=google.com; c=nofws; q=dns; h=date:from:x-x-sender:to:cc:subject:in-reply-to:message-id: references:user-agent:mime-version:content-type:x-system-of-record; b=SIiW8GS4Ga/HTPVFOs0D8QrEd+lEWopWGpy16l6jXKLclSz34awyQ0WJJq5gQnXyU KKPGaiyVb3eiMdKt1RXZA== Date: Mon, 14 Dec 2009 20:30:37 -0800 (PST) From: David Rientjes X-X-Sender: rientjes@chino.kir.corp.google.com To: KAMEZAWA Hiroyuki cc: Andrew Morton , Daisuke Nishimura , KOSAKI Motohiro , linux-mm@kvack.org, linux-kernel@vger.kernel.org, Christoph Lameter Subject: Re: [BUGFIX][PATCH] oom-kill: fix NUMA consraint check with nodemask v4.2 In-Reply-To: <20091215103202.eacfd64e.kamezawa.hiroyu@jp.fujitsu.com> Message-ID: References: <20091110162121.361B.A69D9226@jp.fujitsu.com> <20091110163419.361E.A69D9226@jp.fujitsu.com> <20091110164055.a1b44a4b.kamezawa.hiroyu@jp.fujitsu.com> <20091110170338.9f3bb417.nishimura@mxp.nes.nec.co.jp> <20091110171704.3800f081.kamezawa.hiroyu@jp.fujitsu.com> <20091111112404.0026e601.kamezawa.hiroyu@jp.fujitsu.com> <20091111134514.4edd3011.kamezawa.hiroyu@jp.fujitsu.com> <20091111142811.eb16f062.kamezawa.hiroyu@jp.fujitsu.com> <20091111152004.3d585cee.kamezawa.hiroyu@jp.fujitsu.com> <20091111153414.3c263842.kamezawa.hiroyu@jp.fujitsu.com> <20091118095824.076c211f.kamezawa.hiroyu@jp.fujitsu.com> <20091214171632.0b34d833.akpm@linux-foundation.org> <20091215103202.eacfd64e.kamezawa.hiroyu@jp.fujitsu.com> User-Agent: Alpine 2.00 (DEB 1167 2008-08-23) MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-System-Of-Record: true Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2188 Lines: 47 On Tue, 15 Dec 2009, KAMEZAWA Hiroyuki wrote: > I'm now preparing more counters for mm's statistics. It's better to > wait and to see what we can do more. And other patches for total > oom-killer improvement is under development. > > And, there is a compatibility problem. > As David says, this may break some crazy software which uses > fake_numa+cpuset+oom_killer+oom_adj for resource controlling. > (even if I recommend them to use memcg rather than crazy tricks...) > That's not at all what I said. I said using total_vm as a baseline allows users to define when a process is to be considered "rogue," that is, using more memory than expected. Using rss would be inappropriate since it is highly dynamic and depends on the state of the VM at the time of oom, which userspace cannot possibly keep updated. You consistently ignore that point: the power of /proc/pid/oom_adj to influence when a process, such as a memory leaker, is to be considered as a high priority for an oom kill. It has absolutely nothing to do with fake NUMA, cpusets, or memcg. > 2 ideas which I can think of now are.. > 1) add sysctl_oom_calc_on_committed_memory > If this is set, use vm-size instead of rss. > I would agree only if the oom killer used total_vm as a the default, it is long-standing and allows for the aforementioned capability that you lose with rss. I have no problem with the added sysctl to use rss as the baseline when enabled. > 2) add /proc//oom_guard_size > This allows users to specify "valid/expected size" of a task. > When > #echo 10M > /proc//oom_guard_size > At OOM calculation, 10Mbytes is subtracted from rss size. > (The best way is to estimate this automatically from vm_size..but...) Expected rss is almost impossible to tune for cpusets that have a highly dynamic set of mems, let alone without containment. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/