Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932069AbVKNT4t (ORCPT ); Mon, 14 Nov 2005 14:56:49 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S932070AbVKNT4t (ORCPT ); Mon, 14 Nov 2005 14:56:49 -0500 Received: from mail.timesys.com ([65.117.135.102]:23215 "EHLO postfix.timesys.com") by vger.kernel.org with ESMTP id S932069AbVKNT4s (ORCPT ); Mon, 14 Nov 2005 14:56:48 -0500 Message-ID: <4378EA29.1000400@timesys.com> Date: Mon, 14 Nov 2005 14:48:57 -0500 From: john cooper User-Agent: Mozilla Thunderbird 0.8 (X11/20040913) X-Accept-Language: en-us, en MIME-Version: 1.0 To: Luca Falavigna Cc: mingo@elte.hu, linux-kernel@vger.kernel.org, john cooper Subject: Re: [BUG] Softlockup detected with linux-2.6.14-rt6 References: <4378B48E.6010006@gmail.com> In-Reply-To: <4378B48E.6010006@gmail.com> Content-Type: text/plain; charset=ISO-8859-15; format=flowed Content-Transfer-Encoding: 7bit X-OriginalArrivalTime: 14 Nov 2005 19:51:16.0078 (UTC) FILETIME=[C62538E0:01C5E954] Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1909 Lines: 49 Luca Falavigna wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > I found this softlockup bug involving arts daemon using a > linux-2.6.14-rt6 kernel (with "Complete Preemption" and "Detect Soft > Lockups" compiled in). > This bug does not happen everytime: I was able to reproduce it only > three times in a week. Here is the oops message (obtained from my > printer because system is frozen): > > BUG: artsd:4177, possible softlockup detected on CPU#0! > [] softlockup_detected+0x34/0x40 (8) > [] softlockup_tick+0xa9/axb0 (20) > [] timer_interrupt+0x21/0x40 (12) > [] handle_IRQ_event+0x63/0xf0 (12) > [] __do_IRQ+0xa3/0x150 (48) > [] do_IRQ+0x34/0x70 (40) > [] common_interrupt+0x1a/0x20 (8) > [] common_interrupt+0x1a/0x20 (20) > [] __delay+0x20/0x30 (44) > [] snd_timer_close+0x1b4/0x2b0 [snd_timer] (12) > [] fasync_helper+0x7a/0x100 (12) > [] snd_timer_user_release+0x4c/0x80 [snd_timer] (28) > [] __fput+0xad/0x1a0 (24) > [] filp_close+0x52/0x90 (40) > [] sys_close+0x70/0xc0 (24) > [] syscall_call+0x7/0xb (28) We are seeing this too, particularly [IIRC exclusively] on MIPS and PPC though there isn't any obvious target architecture correlation. It appears to be a false positive as the system seems otherwise responsive to general scheduling. It's on my list to address but not a priority just yet. Adding a sched pri/policy printk to show_task() and inserting a call to show_state() in the softlockup detect routine should shed some light. -john -- john.cooper@timesys.com - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/