Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1030266AbWHOLm3 (ORCPT ); Tue, 15 Aug 2006 07:42:29 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1030267AbWHOLm3 (ORCPT ); Tue, 15 Aug 2006 07:42:29 -0400 Received: from nf-out-0910.google.com ([64.233.182.187]:33801 "EHLO nf-out-0910.google.com") by vger.kernel.org with ESMTP id S1030266AbWHOLm2 (ORCPT ); Tue, 15 Aug 2006 07:42:28 -0400 DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=KdjBohtkCs50dRtlHHkx4xnI2+Znc5UNRohDQfNFu7sQxol/ju2W4yX8fHhSDNDJXidNqRnYiaGQx0KFpcf3fALHaM+aNyArNM+/+GP2OvJVr63G8woUankKW5BH+1nRFmgKIxLbs4v98tTbvvjU1cWcTtwPWiDID6pvVIjxyjA= Message-ID: <9a8748490608150442q4ad7a835r53400e9880da3175@mail.gmail.com> Date: Tue, 15 Aug 2006 13:42:27 +0200 From: "Jesper Juhl" To: "Nathan Scott" Subject: Re: 2.6.18-rc3-git3 - XFS - BUG: unable to handle kernel NULL pointer dereference at virtual address 00000078 Cc: linux-kernel@vger.kernel.org, xfs@oss.sgi.com In-Reply-To: <20060815190343.A2743401@wobbly.melbourne.sgi.com> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Content-Disposition: inline References: <9a8748490608080137k596a6290r3567096668449a64@mail.gmail.com> <9a8748490608100431m244207b1v9c9c5087233fcf3a@mail.gmail.com> <20060811083546.B2596458@wobbly.melbourne.sgi.com> <9a8748490608101544n29f863e7o7584ac64f1d4c210@mail.gmail.com> <9a8748490608101552w12822fa6m415a5fb5537c744d@mail.gmail.com> <9a8748490608110133v5f973cf6w1af340f59bb229ec@mail.gmail.com> <9a8748490608110325k25c340e2yac925eb226d1fe4f@mail.gmail.com> <20060814120032.E2698880@wobbly.melbourne.sgi.com> <9a8748490608140049t492742cx7f826a9f40835d71@mail.gmail.com> <20060815190343.A2743401@wobbly.melbourne.sgi.com> Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2633 Lines: 64 On 15/08/06, Nathan Scott wrote: > On Mon, Aug 14, 2006 at 09:49:10AM +0200, Jesper Juhl wrote: > > On 14/08/06, Nathan Scott wrote: > > > > LEAFN node level is 1 inode 412035424 bno = 8388608 > > > > > > Ooh. Can you describe this test case you're using? > > ... > > These filesystems vary in size from 50G to 3.5T > > > > The XFS filesystems contain rsync copies of filesystems on as many servers. > > The workload that triggers the problem is when all the servers start > > updating their copy via rsync - Then within a few hours the problem > > triggers. > > > > So to recreate the same scenario you'll want 28 servers doing rsync of > > filesystems of various sizes between 50G & 3.5T to a central server > > running 2.6.18-rc4 with 28 XFS filesystems. > > ... > > There are millions of files. The data the server recievs is copies of > > websites. Each server that sends data to the server with the 28 XFS > > filesystems hosts between 1800 and 2600 websites, so there are lots of > > files and every concievable strange filename. > > Wow, a special kind of hell for a filesystem developer...! ;-) > > Its not clear to me where the rename operation happens in all of > this - does rsync create a local, temporary copy of the file and > then rename it? I'm not sure. I'll investigate and see if I can work out what exactely rsync does. > Is it that central server going down or one of > those 28 other server machines? (I assume it is, I can't see an > opportunity for renaming out there...). > It's the central server with all the xfs filesystems that dies. The one recieving all the rsync data. > When you hit it again, could you grab the contents of the inode > (you'll get that from xfs_repair -n, e.g. 412035424 above) with > xfs_db (see last entry in the XFS FAQ which describes how to do > that), then mail that to me please? Sure, I'll read up on that and make sure to grab that info next time. > If you can get the source > and target names in the rename that'll help alot too... I can > explain how to use KDB to get that, but maybe you have another > debugger handy already? > An explanation of how exactely to do that would be greatly appreciated. -- Jesper Juhl Don't top-post http://www.catb.org/~esr/jargon/html/T/top-post.html Plain text mails only, please http://www.expita.com/nomime.html - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/