Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1750714AbVKDU6E (ORCPT ); Fri, 4 Nov 2005 15:58:04 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1750873AbVKDU6E (ORCPT ); Fri, 4 Nov 2005 15:58:04 -0500 Received: from e4.ny.us.ibm.com ([32.97.182.144]:1246 "EHLO e4.ny.us.ibm.com") by vger.kernel.org with ESMTP id S1750714AbVKDU6C (ORCPT ); Fri, 4 Nov 2005 15:58:02 -0500 Date: Fri, 4 Nov 2005 12:57:58 -0800 From: Mike Kravetz To: Andy Whitcroft Cc: Arnd Bergmann , linuxppc64-dev@ozlabs.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org Subject: Re: [PATCH] powerpc: mem_init crash for sparsemem Message-ID: <20051104205758.GA5397@w-mikek2.ibm.com> References: <200511041631.17237.arnd@arndb.de> <436BC20B.9070704@shadowen.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <436BC20B.9070704@shadowen.org> User-Agent: Mutt/1.4.1i Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1450 Lines: 33 On Fri, Nov 04, 2005 at 08:18:19PM +0000, Andy Whitcroft wrote: > Arnd Bergmann wrote: > > I have a Cell blade with some broken memory in the middle of the > > physical address space and this is correctly detected by the > > firmware, but not relocated. When I enable CONFIG_SPARSEMEM, > > the memsections for the nonexistant address space do not > > get struct page entries allocated, as expected. > > > > However, mem_init for the non-NUMA configuration tries to > > access these pages without first looking if they are there. This earlier statement in mem_init (or at least the comment), num_physpages = max_pfn; /* RAM is assumed contiguous */ may be a cause for concern. I'm pretty sure max_pfn has previously been set based on the value of lmb_end_of_DRAM(). My guess is that we are going to report the system as having more memory that it actually does (will not account for the hole(s)). That being said, the pfn_valid() check is still needed here. But, it looks like that code was originally written under the assumption that there were no holes. Can someone 'more in the know' of ppc architecture comment on the ram is contiguous assumption? Is this no longer the case? -- Mike - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/