Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1422790AbWBNUUQ (ORCPT ); Tue, 14 Feb 2006 15:20:16 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1422783AbWBNUUP (ORCPT ); Tue, 14 Feb 2006 15:20:15 -0500 Received: from rgminet01.oracle.com ([148.87.113.118]:52885 "EHLO rgminet01.oracle.com") by vger.kernel.org with ESMTP id S1422790AbWBNUUO (ORCPT ); Tue, 14 Feb 2006 15:20:14 -0500 Date: Tue, 14 Feb 2006 12:19:46 -0800 From: Mark Fasheh To: Claudio Martins Cc: linux-kernel@vger.kernel.org, ocfs2-devel@oss.oracle.com, Andrew Morton , Linus Torvalds , Jan Kara , Nohez Subject: Re: OCFS2 Filesystem inconsistency across nodes Message-ID: <20060214201946.GD20175@ca-server1.us.oracle.com> Reply-To: Mark Fasheh References: <200602100536.02893.ctpm@rnl.ist.utl.pt> <200602110540.57573.ctpm@rnl.ist.utl.pt> <20060213222606.GC20175@ca-server1.us.oracle.com> <200602140616.11856.ctpm@rnl.ist.utl.pt> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <200602140616.11856.ctpm@rnl.ist.utl.pt> Organization: Oracle Corporation User-Agent: Mutt/1.5.11 X-Brightmail-Tracker: AAAAAQAAAAI= X-Whitelist: TRUE Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1804 Lines: 46 On Tue, Feb 14, 2006 at 06:16:11AM +0000, Claudio Martins wrote: > This patch does indeed seem to fix this particular problem. Now creating and > deleting files/directories gives expected results across nodes. Great! > The bad news is that it didn't last long. While doing some more tests I found > another problem, but judging from kernel messages I think this one is related > to the DLM code. Not so great :/ We have some dlm fixes in our git tree which haven't made their way to Linus yet (I wanted to run a few more tests). Would you be interested in patching with them so we can see which bugs are left? The easiest way to get this is to pull them out of 2.6.16-rc3-mm1: http://kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.16-rc3/2.6.16-rc3-mm1/broken-out/git-ocfs2.patch You'll also pull a performance fix in that patch. Of course, even easier might be to just run 2.6.16-rc3-mm1 for now as it also contains the checkpoint list split backout. > On node 0, tar exited with: > tar: > build-AMD-linux-2.6.16-rc2-git3-jbdfix1/drivers/media/video/cx25840/cx25840-core.c: > Cannot stat: Invalid argument > > On node 2, tar exited with a segmentation fault. Could you load up debugfs.ocfs2 against your file system and run the following command: debugfs: locate It will tell me the path to the file which that metadata lock refers to. The path may help us figure out what sort of access we're having problems on here. --Mark -- Mark Fasheh Senior Software Developer, Oracle mark.fasheh@oracle.com - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/