Date: Wed, 31 Dec 2008 22:17:06 -0800 (PST)
From: david@lang.hm
To: Andi Kleen
cc: Cyrill Gorcunov, linux-kernel
Subject: Re: early exception error
In-Reply-To: <20090101041727.GW496@one.firstfloor.org>
References: <87k59hur5f.fsf@basil.nowhere.org> <20081231093803.GA20882@localhost> <20081231183039.GE20882@localhost> <20081231195005.GT496@one.firstfloor.org> <20090101041727.GW496@one.firstfloor.org>
User-Agent: Alpine 1.10 (DEB 962 2008-03-14)

On Thu, 1 Jan 2009, Andi Kleen wrote:

> On Wed, Dec 31, 2008 at 12:59:08PM -0800, david@lang.hm wrote:
>> On Wed, 31 Dec 2008, Andi Kleen wrote:
>>
>>>> on the picture you sent me i noticed the message
>>>> "Your memory is not aligned you need to rebuild your
>>>> kernel with bigger NODEMAP SIZE shift=20" and then
>>>> srat code complains about "No NUMA code hash function found"
>>>> which looks a bit scary. Btw, could you post this picture
>>>> on some public resource so NUMA people could check it?
>>>
>>> This case used to be handled cleanly (NUMA disabled), but perhaps
>>> that has regressed. But still it sounds like something is going wrong,
>>> unless his machine really has a very weird memory map.
>>
>> it shouldn't, it was one of the high-volume servers 4-5 years ago and only
>> has 4G of ram in it
>
> From looking at the screenshot Cyrill sent you seem to have a funny
> SRAT with overlapping areas that is rejected in the end. I suspect the
> fallback code doesn't handle this properly.
>
> Does it work when you boot with numa=noacpi ?

it gets past the point where the bootmemory_debug messages flow by, but I
get another oops (snapshot of the screen is at
http://linux.lang.hm/linux/IMG00031.jpg )

David Lang