Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751046AbWE1XuJ (ORCPT ); Sun, 28 May 2006 19:50:09 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751054AbWE1XuJ (ORCPT ); Sun, 28 May 2006 19:50:09 -0400 Received: from mx2.suse.de ([195.135.220.15]:48541 "EHLO mx2.suse.de") by vger.kernel.org with ESMTP id S1751046AbWE1XuH (ORCPT ); Sun, 28 May 2006 19:50:07 -0400 From: Neil Brown To: Arend Freije Date: Mon, 29 May 2006 09:49:55 +1000 MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-ID: <17530.14115.195147.46212@cse.unsw.edu.au> Cc: linux-kernel@vger.kernel.org Subject: Re: RAID-1 and Reiser4 issue: umount hangs In-Reply-To: message from Arend Freije on Sunday May 28 References: <4478CF33.80609@inn.nl> <17528.55008.287088.705263@cse.unsw.edu.au> <44797136.4050707@inn.nl> X-Mailer: VM 7.19 under Emacs 21.4.1 X-face: v[Gw_3E*Gng}4rRrKRYotwlE?.2|**#s9D Neil Brown wrote: > > # echo t > /proc/sysrq-trigger > > > > and look at the resulting kernel messages, particularly for the > > unmount process. If they don't make sense to you, post them. > > > > > Tnx for your response. After recompiling the kernel with magic_sysrq > enabled, rebooting, and repeating umount with the reiser4 md device, the > following trace contains a reiser4-entry: This looks very much like a reiser4 problem rather than a raid problem, or at least you will need someone very familiar with reiser4 to understand what is going on here. I recommend you post this information to reiserfs-dev@namesys.com (and cc: to linux-kernel too if you like) to make sure someone knowledgeable sees it. I suspect reiser4 has some kernel threads that it uses. They might have 'obvious' names like ent:.... ktxnmgrd from looking at the source. Including the stack trace of these and anything else with 'reiser4' in the trace would probably be helpful. NeilBrown > > > syslog-ng R running 0 7581 1 7197 (NOTLB) > > umount D C011B591 0 7588 7200 (NOTLB) > > f6643c98 00000000 c1808320 c011b591 f7db5ad0 f4d18c00 003d092a 00000000 > > 00000000 f7db5ad0 c1808320 00000000 f4d18c00 003d092a f6b33540 c1808320 > > 00000000 f4d18c00 003d092a f6b33540 f6b33668 c1808320 00000000 f6643cfc > > Call Trace: > > [] __wake_up_common+0x41/0x70 > > [] io_schedule+0x26/0x30 > > [] sync_page+0x4b/0x60 > > [] __wait_on_bit+0x45/0x70 > > [] sync_page+0x0/0x60 > > [] wait_on_page_bit+0xad/0xc0 > > [] wake_bit_function+0x0/0x60 > > [] wake_bit_function+0x0/0x60 > > [] jwait_io+0x59/0x80 > > [] update_journal_header+0x83/0xb0 > > [] commit_tx+0xca/0x110 > > [] reiser4_write_logs+0x141/0x1e0 > > [] commit_current_atom+0x171/0x2c0 > > [] try_commit_txnh+0x13f/0x1e0 > > [] commit_txnh+0x34/0xd0 > > [] txn_end+0x2c/0x30 > > [] txn_restart+0x10/0x30 > > [] txn_restart_current+0x1a/0x20 > > [] force_commit_atom+0x3f/0x70 > > [] txnmgr_force_commit_all+0xea/0x130 > > [] release_format40+0x7e/0x150 > > [] init_context+0x58/0x80 > > [] reiser4_put_super+0x89/0xf0 > > [] invalidate_inodes+0x5d/0x80 > > [] generic_shutdown_super+0x156/0x160 > > [] kill_block_super+0x2d/0x50 > > [] deactivate_super+0x60/0x80 > > [] sys_umount+0x3f/0x90 > > [] do_page_fault+0x1c0/0x5a8 > > [] sys_munmap+0x51/0x80 > > [] sys_oldumount+0x17/0x20 > > [] sysenter_past_esp+0x54/0x75 > > Freely inerpreting this trace, I'd say that the umount causes the > reiser4 filesystem to do an extra commit, and the sync_page seems to be > waiting for IO. > Are there other traces of interrest? > - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/