Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755992AbYACXE6 (ORCPT ); Thu, 3 Jan 2008 18:04:58 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752694AbYACXEu (ORCPT ); Thu, 3 Jan 2008 18:04:50 -0500 Received: from ogre.sisk.pl ([217.79.144.158]:49672 "EHLO ogre.sisk.pl" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752920AbYACXEt (ORCPT ); Thu, 3 Jan 2008 18:04:49 -0500 From: "Rafael J. Wysocki" To: Christian Kujau Subject: Re: 2.6.24-rc6: possible recursive locking detected Date: Fri, 4 Jan 2008 00:06:47 +0100 User-Agent: KMail/1.9.6 (enterprise 20070904.708012) Cc: linux-kernel@vger.kernel.org, jfs-discussion@lists.sourceforge.net, Ingo Molnar , Peter Zijlstra References: In-Reply-To: MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200801040006.47979.rjw@sisk.pl> Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3567 Lines: 71 [Added some CCs] On Thursday, 3 of January 2008, Christian Kujau wrote: > hi, > > a few minutes after upgrading from -rc5 to -rc6 I got: > > [ 1310.670986] ============================================= > [ 1310.671690] [ INFO: possible recursive locking detected ] > [ 1310.672097] 2.6.24-rc6 #1 > [ 1310.672421] --------------------------------------------- > [ 1310.672828] FahCore_a0.exe/3692 is trying to acquire lock: > [ 1310.673238] (&q->lock){++..}, at: [] __wake_up+0x1b/0x50 > [ 1310.673869] > [ 1310.673870] but task is already holding lock: > [ 1310.674567] (&q->lock){++..}, at: [] __wake_up+0x1b/0x50 > [ 1310.675267] > [ 1310.675268] other info that might help us debug this: > [ 1310.675952] 5 locks held by FahCore_a0.exe/3692: > [ 1310.676334] #0: (rcu_read_lock){..--}, at: [] net_rx_action+0x60/0x1b0 > [ 1310.677251] #1: (rcu_read_lock){..--}, at: [] netif_receive_skb+0x100/0x470 > [ 1310.677924] #2: (rcu_read_lock){..--}, at: [] ip_local_deliver_finish+0x32/0x210 > [ 1310.678460] #3: (clock-AF_INET){-.-?}, at: [] sock_def_readable+0x1e/0x80 > [ 1310.679250] #4: (&q->lock){++..}, at: [] __wake_up+0x1b/0x50 > [ 1310.680151] > [ 1310.680152] stack backtrace: > [ 1310.680772] Pid: 3692, comm: FahCore_a0.exe Not tainted 2.6.24-rc6 #1 > [ 1310.681209] [] show_trace_log_lvl+0x1a/0x30 > [ 1310.681659] [] show_trace+0x12/0x20 > [ 1310.682085] [] dump_stack+0x6a/0x70 > [ 1310.682512] [] __lock_acquire+0x971/0x10c0 > [ 1310.682961] [] lock_acquire+0x5e/0x80 > [ 1310.683392] [] _spin_lock_irqsave+0x38/0x50 > [ 1310.683914] [] __wake_up+0x1b/0x50 > [ 1310.684337] [] ep_poll_safewake+0x9a/0xc0 > [ 1310.684822] [] ep_poll_callback+0x8b/0xe0 > [ 1310.685265] [] __wake_up_common+0x48/0x70 > [ 1310.685712] [] __wake_up+0x37/0x50 > [ 1310.686136] [] sock_def_readable+0x7a/0x80 > [ 1310.686579] [] sock_queue_rcv_skb+0xeb/0x150 > [ 1310.687028] [] udp_queue_rcv_skb+0x139/0x2a0 > [ 1310.687554] [] __udp4_lib_rcv+0x2f1/0x7e0 > [ 1310.687996] [] udp_rcv+0x12/0x20 > [ 1310.688415] [] ip_local_deliver_finish+0x125/0x210 > [ 1310.688881] [] ip_local_deliver+0x2d/0x90 > [ 1310.689323] [] ip_rcv_finish+0xeb/0x300 > [ 1310.689760] [] ip_rcv+0x195/0x230 > [ 1310.690182] [] netif_receive_skb+0x37c/0x470 > [ 1310.690632] [] process_backlog+0x69/0xc0 > [ 1310.691175] [] net_rx_action+0x137/0x1b0 > [ 1310.691681] [] __do_softirq+0x52/0xb0 > [ 1310.692006] [] do_softirq+0x94/0xe0 > [ 1310.692301] ======================= > > > This is a single CPU machine, and the box was quite busy due to disk I/O > (load 6-8). The machine continues to run and all is well now. Even the > application mentioned above (FahCore_a0.exe) is running fine > ("Folding@Home", cpu bound). The binary is located on an jfs filesystem, > which was also under heavy I/O. Can someone tell me why the backtrace > shows so much net* stuff? There was not much net I/O... > > more details and .config: http://nerdbynature.de/bits/2.6.24-rc6 > > Thanks, > Christian. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/