Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755059Ab2JIO47 (ORCPT ); Tue, 9 Oct 2012 10:56:59 -0400 Received: from g6t0184.atlanta.hp.com ([15.193.32.61]:28168 "EHLO g6t0184.atlanta.hp.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752114Ab2JIO4y (ORCPT ); Tue, 9 Oct 2012 10:56:54 -0400 Message-ID: <1349794597.29752.10.camel@MikesLinux.fc.hp.com> Subject: Re: [PATCH] mm: memmap_init_zone() performance improvement From: Mike Yoknis Reply-To: mike.yoknis@hp.com To: Mel Gorman Cc: mingo@redhat.com, akpm@linux-foundation.org, linux-arch@vger.kernel.org, mmarek@suse.cz, tglx@linutronix.de, hpa@zytor.com, arnd@arndb.de, sam@ravnborg.org, minchan@kernel.org, kamezawa.hiroyu@jp.fujitsu.com, mhocko@suse.cz, linux-kbuild@vger.kernel.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org Date: Tue, 09 Oct 2012 08:56:37 -0600 In-Reply-To: <20121008151656.GM29125@suse.de> References: <1349276174-8398-1-git-send-email-mike.yoknis@hp.com> <20121008151656.GM29125@suse.de> Organization: Hewlett-Packard Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.4.4 (3.4.4-2.fc17) Mime-Version: 1.0 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2704 Lines: 64 On Mon, 2012-10-08 at 16:16 +0100, Mel Gorman wrote: > On Wed, Oct 03, 2012 at 08:56:14AM -0600, Mike Yoknis wrote: > > memmap_init_zone() loops through every Page Frame Number (pfn), > > including pfn values that are within the gaps between existing > > memory sections. The unneeded looping will become a boot > > performance issue when machines configure larger memory ranges > > that will contain larger and more numerous gaps. > > > > The code will skip across invalid sections to reduce the > > number of loops executed. > > > > Signed-off-by: Mike Yoknis > > This only helps SPARSEMEM and changes more headers than should be > necessary. It would have been easier to do something simple like > > if (!early_pfn_valid(pfn)) { > pfn = ALIGN(pfn + MAX_ORDER_NR_PAGES, MAX_ORDER_NR_PAGES) - 1; > continue; > } > > because that would obey the expectation that pages within a > MAX_ORDER_NR_PAGES-aligned range are all valid or all invalid (ARM is the > exception that breaks this rule). It would be less efficient on > SPARSEMEM than what you're trying to merge but I do not see the need for > the additional complexity unless you can show it makes a big difference > to boot times. > Mel, I, too, was concerned that pfn_valid() was defined in so many header files. But, I did not feel that it was appropriate for me to try to restructure things to consolidate those definitions just to add this one new function. Being a kernel newbie I did not believe that I had a good enough understanding of what combinations and permutations of CONFIG and architecture may have made all of those different definitions necessary, so I left them in. Yes, indeed, this fix is targeted for systems that have holes in memory. That is where we see the problem. We are creating large computer systems and we would like for those machines to perform well, including boot times. Let me pass along the numbers I have. We have what we call an "architectural simulator". It is a computer program that pretends that it is a computer system. We use it to test the firmware before real hardware is available. We have booted Linux on our simulator. As you would expect it takes longer to boot on the simulator than it does on real hardware. With my patch - boot time 41 minutes Without patch - boot time 94 minutes These numbers do not scale linearly to real hardware. But indicate to me a place where Linux can be improved. Mike Yoknis -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/