Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751156AbVKFSTm (ORCPT ); Sun, 6 Nov 2005 13:19:42 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S932247AbVKFSTm (ORCPT ); Sun, 6 Nov 2005 13:19:42 -0500 Received: from omx3-ext.sgi.com ([192.48.171.20]:47526 "EHLO omx3.sgi.com") by vger.kernel.org with ESMTP id S1750965AbVKFSTl (ORCPT ); Sun, 6 Nov 2005 13:19:41 -0500 Date: Sun, 6 Nov 2005 10:18:32 -0800 From: Paul Jackson To: Linus Torvalds Cc: mingo@elte.hu, andy@thermo.lanl.gov, mbligh@mbligh.org, akpm@osdl.org, arjan@infradead.org, arjanv@infradead.org, haveblue@us.ibm.com, kravetz@us.ibm.com, lhms-devel@lists.sourceforge.net, linux-kernel@vger.kernel.org, linux-mm@kvack.org, mel@csn.ul.ie, nickpiggin@yahoo.com.au Subject: Re: [Lhms-devel] [PATCH 0/7] Fragmentation Avoidance V19 Message-Id: <20051106101832.510b0245.pj@sgi.com> In-Reply-To: References: <20051104010021.4180A184531@thermo.lanl.gov> <20051103221037.33ae0f53.pj@sgi.com> <20051104063820.GA19505@elte.hu> <20051104155317.GA7281@elte.hu> <20051105233408.3037a6fe.pj@sgi.com> Organization: SGI X-Mailer: Sylpheed version 2.0.0beta5 (GTK+ 2.4.9; i686-pc-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2215 Lines: 53 Linus wrote: > The thing is, if 99.8% of memory is cleanable, the 0.2% is still enough to > make pretty much _every_ hugepage in the system pinned down. Agreed. I realized after writing this that I wasn't clear on something. I wasn't focused the subject of this thread, adding hugetlb pages after the system has been up a while. I was focusing on a related subject - freeing up most of the ordinary size pages on the dedicated application nodes between jobs on a large system using * a bootcpuset (for the classic Unix load) and * dedicated nodes (for the HPC apps). I am looking to provide the combination of: 1) specifying some hugetlb pages at system boot, plus 2) the ability to clean off most of the ordinary sized pages from the application nodes between jobs. Perhaps Andy or some of my HPC customers wish I was also looking to provide: 3) the ability to add lots of hugetlb pages on the application nodes after the system has run a while. But if they are, then they have some more educatin' to do on me. For now, I am sympathetic to your concerns with code and locking complexity. Freeing up great globs of hugetlb sized contiguous chunks of memory after a system has run a while would be hard. We have to be careful which hard problems we decide to take on. We can't take on too many, and we have to pick ones that will provide a major long term advantage to Linux, over the forseeable changes in system hardware and architecture. Even if most of the processors that Andy has tested against would benefit from dynamically added hugetlb pages, if we can anticipate that this will not be a substained opportunity for Linux (and looking at current x86 chips doesn't require much anticipating) then that might not be the place to invest our precious core complexity dollars. -- I won't rest till it's the best ... Programmer, Linux Scalability Paul Jackson 1.925.600.0401 - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/