Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932655AbVKBLlw (ORCPT ); Wed, 2 Nov 2005 06:41:52 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S932656AbVKBLlv (ORCPT ); Wed, 2 Nov 2005 06:41:51 -0500 Received: from holly.csn.ul.ie ([136.201.105.4]:37793 "EHLO holly.csn.ul.ie") by vger.kernel.org with ESMTP id S932655AbVKBLlv (ORCPT ); Wed, 2 Nov 2005 06:41:51 -0500 Date: Wed, 2 Nov 2005 11:41:47 +0000 (GMT) From: Mel Gorman X-X-Sender: mel@skynet To: Nick Piggin Cc: "Martin J. Bligh" , Joel Schopp , Andrew Morton , kravetz@us.ibm.com, linux-mm@kvack.org, linux-kernel@vger.kernel.org, lhms-devel@lists.sourceforge.net, Ingo Molnar Subject: Re: [Lhms-devel] [PATCH 0/7] Fragmentation Avoidance V19 In-Reply-To: <43681E89.8070905@yahoo.com.au> Message-ID: References: <20051030183354.22266.42795.sendpatchset@skynet.csn.ul.ie><20051031055725.GA3820@w-mikek2.ibm.com><4365BBC4.2090906@yahoo.com.au> <20051030235440.6938a0e9.akpm@osdl.org> <27700000.1130769270@[10.10.2.4]> <4366A8D1.7020507@yahoo.com.au> <4366C559.5090504@yahoo.com.au> <4366D469.2010202@yahoo.com.au> <4367D71A.1030208@austin.ibm.com> <43681100.1000603@yahoo.com.au> <214340000.1130895665@[10.10.2.4]> <43681E89.8070905@yahoo.com.au> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3366 Lines: 76 On Wed, 2 Nov 2005, Nick Piggin wrote: > Martin J. Bligh wrote: > > > The numbers I have seen show that performance is decreased. People > > > like Ken Chen spend months trying to find a 0.05% improvement in > > > performance. Not long ago I just spent days getting our cached > > > kbuild performance back to where 2.4 is on my build system. > > > > > > Ironically, we're currently trying to chase down a 'database benchmark' > > regression that seems to have been cause by the last round of "let's > > rewrite the scheduler again" (more details later). Nick, you've added an > > awful lot of complexity to some of these code paths yourself ... seems > > ironic that you're the one complaining about it ;-) > > > > Yeah that's unfortunate, but I think a large portion of the problem > (if they are anything the same) has been narrowed down to some over > eager wakeup balancing for which there are a number of proposed > patches. > > But in this case I was more worried about getting the groundwork done > for handling the multicore multicore systems that everyone will soon > be using rather than several % performance regression on TPC-C (not > to say that I don't care about that at all)... I don't see the irony. > > But let's move this to another thread if it is going to continue. I > would be happy to discuss scheduler problems. > > > > You have an extra zone. You size that zone at boot according to the > > > amount of memory you need to be able to free. Only easy-reclaim stuff > > > goes in that zone. > > > > > > It is less complex because zones are a complexity we already have to > > > live with. 99% of the infrastructure is already there to do this. > > > > > > If you want to hot unplug memory or guarantee hugepage allocation, > > > this is the way to do it. Nobody has told me why this *doesn't* work. > > > > > > Because the zone is statically sized, and you're back to the same crap > > we had with 32bit systems of splitting ZONE_NORMAL and ZONE_HIGHMEM, > > effectively. Define how much you need for system ram, and how much > > for easily reclaimable memory at boot time. You can't - it doesn't work. > > > > You can't what? What doesn't work? If you have no hard limits set, > then the frag patches can't guarantee anything either. > True, but the difference is Anti-defrag: Best effort at low cost (according to Aim9) without tunable Zones: Will work, but requires tunable. falls apart if tuned wrong > You can't have it both ways. Either you have limits for things or > you don't need any guarantees. Zones handle the former case nicely, > and we currently do the latter case just fine (along with the frag > patches). > Sure, so you compromise and do best effort for as long as possible. Always try to keep fragmentation low. If the system is configured to really need low fragmentation, then after a long period of time, a page-migration mechanism kicks in to move the kernel pages out of EasyRclm areas and we continue on. -- Mel Gorman Part-time Phd Student Java Applications Developer University of Limerick IBM Dublin Software Lab - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/