Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755714AbWK3Fvp (ORCPT ); Thu, 30 Nov 2006 00:51:45 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1755996AbWK3Fvp (ORCPT ); Thu, 30 Nov 2006 00:51:45 -0500 Received: from wx-out-0506.google.com ([66.249.82.225]:43320 "EHLO wx-out-0506.google.com") by vger.kernel.org with ESMTP id S1755714AbWK3Fvo (ORCPT ); Thu, 30 Nov 2006 00:51:44 -0500 DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=Lhti7EuyVXqTgq+UefpF0sG4opl615T1mhuK9ints20Ud+Rh5PWEqUQkzo5/xr2QnXkduaWsN/jpJ31TiYHVNjh0FE5X/lW5fLZOmgZZdHsxZNW5gP9XiyJ1Zo5fCQkYvlp/t7602wn6L5Bur2+q5vgfBLKR7dqCrvX2FnB132Q= Message-ID: <9a8748490611292151m57cdbf4kacebb4dd20b95147@mail.gmail.com> Date: Thu, 30 Nov 2006 06:51:43 +0100 From: "Jesper Juhl" To: "David Chinner" Subject: Re: XFS internal error xfs_trans_cancel at line 1138 of file fs/xfs/xfs_trans.c (kernel 2.6.18.1) Cc: "Linux Kernel Mailing List" , xfs@oss.sgi.com, xfs-masters@oss.sgi.com, "Keith Owens" In-Reply-To: <20061130020734.GB37654165@melbourne.sgi.com> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Content-Disposition: inline References: <9a8748490611280749k5c97d21bx2e499d2209d27dfe@mail.gmail.com> <20061129013214.GH44411608@melbourne.sgi.com> <9a8748490611290117oc0ba880v1a6407bc4f41088f@mail.gmail.com> <20061130020734.GB37654165@melbourne.sgi.com> Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3062 Lines: 71 On 30/11/06, David Chinner wrote: > On Wed, Nov 29, 2006 at 10:17:25AM +0100, Jesper Juhl wrote: > > On 29/11/06, David Chinner wrote: > > >On Tue, Nov 28, 2006 at 04:49:00PM +0100, Jesper Juhl wrote: > > >> Filesystem "dm-1": XFS internal error xfs_trans_cancel at line 1138 of > > >> file fs/xfs/xfs_trans.c. Caller 0xffffffff8034b47e > > >> > > >> Call Trace: > > >> [] show_trace+0xb2/0x380 > > >> [] dump_stack+0x15/0x20 > > >> [] xfs_error_report+0x3c/0x50 > > >> [] xfs_trans_cancel+0x6e/0x130 > > >> [] xfs_create+0x5ee/0x6a0 > > >> [] xfs_vn_mknod+0x156/0x2e0 > > >> [] xfs_vn_create+0xb/0x10 > > >> [] vfs_create+0x8c/0xd0 > > >> [] nfsd_create_v3+0x31a/0x560 > > >> [] nfsd3_proc_create+0x148/0x170 > > >> [] nfsd_dispatch+0xf9/0x1e0 > > >> [] svc_process+0x437/0x6e0 > > >> [] nfsd+0x1cd/0x360 > > >> [] child_rip+0xa/0x12 > > >> xfs_force_shutdown(dm-1,0x8) called from line 1139 of file > > >> fs/xfs/xfs_trans.c. Return address = 0xffffffff80359daa > > > > > >We shut down the filesystem because we cancelled a dirty transaction. > > >Once we start to dirty the incore objects, we can't roll back to > > >an unchanged state if a subsequent fatal error occurs during the > > >transaction and we have to abort it. > > > > > So you are saying that there's nothing I can do to prevent this from > > happening in the future? > > Pretty much - we need to work out what is going wrong and > we can't from teh shutdown message above - the error has > occurred in a path that doesn't have error report traps > in it. > > Is this reproducable? > Not on demand, no. It has happened only this once as far as I know and for unknown reasons. > > >If I understand historic occurrences of this correctly, there is > > >a possibility that it can be triggered in ENOMEM situations. Was your > > >machine running out of memoy when this occurred? > > > > > Not really. I just checked my monitoring software and, at the time > > this happened, the box had ~5.9G RAM free (of 8G total) and no swap > > used (but 11G available). > > Ok. Sounds like we need more error reporting points inserted > into that code so we dump an error earlier and hence have some > hope of working out what went wrong next time..... > > OOC, there weren't any I/O errors reported before this shutdown? > No. I looked but found none. Let me know if there's anything I can do to help. -- Jesper Juhl Don't top-post http://www.catb.org/~esr/jargon/html/T/top-post.html Plain text mails only, please http://www.expita.com/nomime.html - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/