Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1762143AbXKHUGW (ORCPT ); Thu, 8 Nov 2007 15:06:22 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1761281AbXKHUGO (ORCPT ); Thu, 8 Nov 2007 15:06:14 -0500 Received: from gir.skynet.ie ([193.1.99.77]:48695 "EHLO gir.skynet.ie" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1761075AbXKHUGN (ORCPT ); Thu, 8 Nov 2007 15:06:13 -0500 Date: Thu, 8 Nov 2007 20:06:07 +0000 To: Christoph Lameter Cc: Lee Schermerhorn , akpm@linux-foundatin.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org Subject: Re: Plans for Onezonelist patch series ??? Message-ID: <20071108200607.GD23882@skynet.ie> References: <20071107011130.382244340@sgi.com> <1194535612.6214.9.camel@localhost> <1194537674.5295.8.camel@localhost> <20071108184009.GC23882@skynet.ie> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-15 Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.13 (2006-08-11) From: mel@skynet.ie (Mel Gorman) Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 4117 Lines: 85 On (08/11/07 10:43), Christoph Lameter didst pronounce: > On Thu, 8 Nov 2007, Mel Gorman wrote: > > > There was two bugs that were resolved but I didn't repost after that as > > mainline + -mm had gone to hell in a hand-basket and I didn't want to > > add to the mess. > > Hell? I must have missed it. > Some time after rc1, things appeared in a mess - at least I didn't have much luck figuring out what was going on when I looked. Admittedly, being very ill at the time I didn't spend much effort on it. Either way things were churning enough that there seemed to be enough going on without adding one-zonelist to the mix. I've rebased the patches to mm-broken-out-2007-11-06-02-32. However, the vanilla -mm and the one with onezonelist applied are locking up in the same manner. I'm way too behind at the moment to guess if it is a new bug or reported already. At best, I can say the patches are not making things any worse :) I'll go through the archives in the morning and do a bit more testing to see what happens. In case this is familiar to people, the lockup I see is; [ 115.548908] BUG: spinlock bad magic on CPU#0, sshd/2752 [ 115.611371] lock: c20029c8, .magic: ffffffff, .owner: /-1, .owner_cpu: -1066669496 [ 115.709027] [] show_trace_log_lvl+0x1a/0x30 [ 115.770560] [] show_trace+0x12/0x20 [ 115.823787] [] dump_stack+0x16/0x20 [ 115.877011] [] spin_bug+0x96/0xf0 [ 115.928172] [] _raw_spin_lock+0x69/0x140 [ 115.986580] [] _spin_lock+0x4f/0x60 [ 116.039809] [] kobject_add+0x4e/0x1a0 [ 116.095112] [] uids_user_create+0x54/0x80 [ 116.154555] [] alloc_uid+0xd2/0x150 [ 116.207784] [] set_user+0x2b/0xb0 [ 116.258951] [] sys_setreuid+0x141/0x150 [ 116.316305] [] syscall_call+0x7/0xb [ 116.369544] ======================= [ 127.680346] BUG: soft lockup - CPU#0 stuck for 11s! [sshd:2752] [ 127.750987] [ 127.768781] Pid: 2752, comm: sshd Not tainted (2.6.24-rc1-mm1 #1) [ 127.841498] EIP: 0060:[] EFLAGS: 00000246 CPU: 0 [ 127.906948] EIP is at delay_tsc+0x1/0x20 [ 127.953754] EAX: 00000001 EBX: c20029c8 ECX: b0953e83 EDX: 2b35cb6d [ 128.028533] ESI: 04b81d83 EDI: 00000000 EBP: c30f1ec4 ESP: c30f1ebc [ 128.103305] DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 [ 128.167717] CR0: 80050033 CR2: b7e45544 CR3: 02153000 CR4: 00000690 [ 128.242490] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000 [ 128.317253] DR6: ffff0ff0 DR7: 00000400 [ 128.363010] [] show_trace_log_lvl+0x1a/0x30 [ 128.424541] [] show_trace+0x12/0x20 [ 128.477767] [] show_regs+0x1c/0x20 [ 128.529948] [] softlockup_tick+0x11b/0x150 [ 128.590446] [] run_local_timers+0x12/0x20 [ 128.649888] [] update_process_times+0x42/0x90 [ 128.713461] [] tick_periodic+0x25/0x80 [ 128.769812] [] tick_handle_periodic+0x19/0x80 [ 128.833397] [] timer_interrupt+0x49/0x50 [ 128.891802] [] handle_IRQ_event+0x28/0x60 [ 128.951246] [] handle_level_irq+0x78/0xe0 [ 129.010693] [] do_IRQ+0x40/0x80 [ 129.059782] [] common_interrupt+0x23/0x28 [ 129.119229] [] _raw_spin_lock+0xb2/0x140 [ 129.177635] [] _spin_lock+0x4f/0x60 [ 129.230851] [] kobject_add+0x4e/0x1a0 [ 129.286175] [] uids_user_create+0x54/0x80 [ 129.345598] [] alloc_uid+0xd2/0x150 [ 129.398830] [] set_user+0x2b/0xb0 [ 129.450005] [] sys_setreuid+0x141/0x150 [ 129.507380] [] syscall_call+0x7/0xb [ 129.560605] ======================= -- -- Mel Gorman Part-time Phd Student Linux Technology Center University of Limerick IBM Dublin Software Lab - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/