Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751090AbWHVBhj (ORCPT ); Mon, 21 Aug 2006 21:37:39 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751098AbWHVBhj (ORCPT ); Mon, 21 Aug 2006 21:37:39 -0400 Received: from adsl-69-232-92-238.dsl.sndg02.pacbell.net ([69.232.92.238]:56813 "EHLO gnuppy.monkey.org") by vger.kernel.org with ESMTP id S1751090AbWHVBhi (ORCPT ); Mon, 21 Aug 2006 21:37:38 -0400 Date: Mon, 21 Aug 2006 18:37:22 -0700 To: Robert Crocombe Cc: Esben Nielsen , Ingo Molnar , Thomas Gleixner , rostedt@goodmis.org, linux-kernel , "Bill Huey (hui)" Subject: rtmutex assert failure (was [Patch] restore the RCU callback...) Message-ID: <20060822013722.GA628@gnuppy.monkey.org> References: <20060810021835.GB12769@gnuppy.monkey.org> <20060811010646.GA24434@gnuppy.monkey.org> <20060811211857.GA32185@gnuppy.monkey.org> <20060811221054.GA32459@gnuppy.monkey.org> <20060814234423.GA31230@gnuppy.monkey.org> <20060818115934.GA29919@gnuppy.monkey.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.11+cvs20060403 From: Bill Huey (hui) Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1620 Lines: 37 On Mon, Aug 21, 2006 at 05:21:35PM -0700, Robert Crocombe wrote: > On 8/18/06, hui Bill Huey wrote: > >Patch attached: > > The problem still appears to happen, but now I get no trace at all, > just a single line reported to the machine's console (and not on the > serial console): > > pdflush/314[CPU#2]: BUG in debug_rt_mutex_unlock at > kernel/rt_mutex_debug.c:471 > > i.e., a standard statement, except at pdflush instead of swapper this time. Thanks for testing. I appreciate this and I'm more than just plain willing to help out. Any thing goes to get this patch solid. :) I'll come up with a patch tomorrow to try and get a clean stack trace. I've made some change to the bug dump output to make it more preemption aware, but, as you can, some tweeking is needed. However, I don't think this is going to be a simple atomic violation problem that I've seen numerous times before. That's what the previous patch tried to address. It's seriously pointing to a problem with the rtmutex and/or how it's used, maybe a corner case it's missing. I'll start to look at it either tonight or tomorrow and see what's going on. It's going to take me a bit so be patient. I'll definitely get back to you on this. I've got a number of weeks to blow exclusively on -rt development, so I'll be around for a little while longer, hopefully longer. bill - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/