Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1757739Ab0GNVg1 (ORCPT ); Wed, 14 Jul 2010 17:36:27 -0400 Received: from e3.ny.us.ibm.com ([32.97.182.143]:53639 "EHLO e3.ny.us.ibm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753462Ab0GNVgZ (ORCPT ); Wed, 14 Jul 2010 17:36:25 -0400 Subject: Re: 2.6.33.5 rt23: machine lockup (nfs/autofs related?) From: john stultz To: Fernando Lopez-Lezcano Cc: Thomas Gleixner , LKML , rt-users , Steven Rostedt , Nick Piggin In-Reply-To: <1279143129.29524.39.camel@localhost.localdomain> References: <1278609590.7527.11.camel@localhost.localdomain> <1278628386.3008.11.camel@localhost.localdomain> <1278629044.12059.6.camel@localhost.localdomain> <1278630050.3008.18.camel@localhost.localdomain> <1278702134.5102.9.camel@localhost.localdomain> <1278705254.2349.14.camel@localhost.localdomain> <1278713600.7122.22.camel@localhost.localdomain> <1278716222.2349.20.camel@localhost.localdomain> <1278977858.6489.52.camel@localhost.localdomain> <1278978834.2404.12.camel@localhost.localdomain> <1278983426.6489.57.camel@localhost.localdomain> <1278985231.2404.60.camel@localhost.localdomain> <1278990381.11165.3.camel@localhost.localdomain> <1279143129.29524.39.camel@localhost.localdomain> Content-Type: text/plain; charset="ISO-8859-1" Date: Wed, 14 Jul 2010 14:36:03 -0700 Message-ID: <1279143363.3372.2.camel@localhost> Mime-Version: 1.0 X-Mailer: Evolution 2.28.3 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2361 Lines: 60 On Wed, 2010-07-14 at 14:32 -0700, Fernando Lopez-Lezcano wrote: > On Mon, 2010-07-12 at 20:06 -0700, Fernando Lopez-Lezcano wrote: > > > > -------- > > > > BUG: unable to handle kernel NULL pointer dereference at 0000008c > > > > IP: [] rt_spin_lock_fastunlock.clone.2+0x6/0x3e > > > ... > > > > Pid: 2855, comm: nautilus Not tainted > > > > 2.6.33.6-147.rt23.3.fc12.ccrma.i686.rt #3 P5K/EPU/P5K/EPU > > > > EIP: 0060:[] EFLAGS: 00210246 CPU: 0 > > > > EIP is at rt_spin_lock_fastunlock.clone.2+0x6/0x3e > > > > EAX: 00000078 EBX: ef45393c ECX: 00000000 EDX: 00000078 > > > > ESI: ef716edc EDI: 00000000 EBP: f1977c8c ESP: f1977c88 > > > > DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068 preempt:00000000 > > > > Process nautilus (pid: 2855, ti=f1976000 task=f2347130 task.ti=f1976000) > > > > Stack: > > > > ef45393c f1977c94 c0781206 f1977cc8 c04d842e 00000000 ef703e54 faadd5bc > > > > <0> f1977cdc 126bc87a ef703ddc 00000000 ef45393c ef716edc ef6f5494 > > > > faafdc6c > > > > <0> f1977df8 faad9041 c3604b5c faafdc6c ef452bfc 00007e7f f5eb41e8 > > > > 00000007 > > > > Call Trace: > > > > [] ? rt_spin_unlock+0x8/0xa > > > > [] ? d_materialise_unique+0x210/0x2aa > > > > > > Can you gdb list *0xc04d842e ? > > > > (gdb) list *0xc04d842e > > 0xc04d842e is in d_materialise_unique (fs/dcache.c:2073). > > 2068 out_unalias: > > 2069 d_move_locked(alias, dentry); > > 2070 ret = alias; > > 2071 out_err: > > 2072 spin_unlock(&inode->i_lock); > > 2073 if (m2) > > 2074 mutex_unlock(m2); > > 2075 if (m1) > > 2076 mutex_unlock(m1); > > 2077 return ret; > > > > > Thanks again for all the testing here! Its really appreciated! > > I just tried building 2.6.33.6 rt26 (saw it this morning on the site) > and it does not exhibit this problem. Woohoo! Yea. The vfs-scalability patches were dropped. Sort of a bummer, but I'm glad that fixes it for you. We may run into the issue again when the patches can get beaten back into shape, but I'll try to hunt down the issue from your log output prior to that. Thanks again for the testing and great feedback! -john -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/