Date: Wed, 19 Sep 2007 17:43:48 +0100
From: Andy Whitcroft
To: Andrew Morton
Cc: linux-kernel@vger.kernel.org
Subject: 2.6.23-rc6-mm1 -- mkfs stuck in 'D'
Message-ID: <20070919164348.GC2519@shadowen.org>
References: <20070918011841.2381bd93.akpm@linux-foundation.org>
In-Reply-To: <20070918011841.2381bd93.akpm@linux-foundation.org>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.5.13 (2006-08-11)
Sender: linux-kernel-owner@vger.kernel.org
X-Mailing-List: linux-kernel@vger.kernel.org

Seems I have a case of a largish i386 NUMA (NUMA-Q) which has a mkfs
stuck in a 'D' wait:

 =======================
mkfs.ext2     D c10220f4     0  6233   6222
       c344fc80 00000082 00000286 c10220f4 c344fc90 002ed099 c2963340 c2b9f640
       c142bce0 c2b9f640 c344fc90 002ed099 c344fcfc c344fcc0 c1219563 c1109bf2
       c344fcc4 c186e4d4 c186e4d4 002ed099 c1022612 c2b9f640 c186e000 c104000c
Call Trace:
 [] lock_timer_base+0x19/0x35
 [] schedule_timeout+0x70/0x8d
 [] prop_fraction_single+0x37/0x5d
 [] process_timeout+0x0/0x5
 [] task_dirty_limit+0x3a/0xb5
 [] io_schedule_timeout+0x1e/0x28
 [] congestion_wait+0x62/0x7a
 [] autoremove_wake_function+0x0/0x33
 [] get_dirty_limits+0x16a/0x172
 [] autoremove_wake_function+0x0/0x33
 [] balance_dirty_pages+0x154/0x1be
 [] generic_perform_write+0x168/0x18a
 [] generic_file_buffered_write+0x73/0x107
 [] __generic_file_aio_write_nolock+0x47a/0x4a5
 [] do_sock_write+0x92/0x99
 [] sock_aio_write+0x52/0x5e
 [] generic_file_aio_write_nolock+0x48/0x9b
 [] do_sync_write+0xbf/0xfc
 [] autoremove_wake_function+0x0/0x33
 [] do_page_fault+0x2cc/0x739
 [] vfs_write+0x8d/0x108
 [] sys_write+0x41/0x67
 [] syscall_call+0x7/0xb
 =======================

This machine and others have run numerous test runs on this kernel and
this is the first time I've seen a hang like this.

I wonder if this is the ultimate cause of the couple of mainline hangs
which were seen, but not diagnosed.

-apw