Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S966439AbWK2JR1 (ORCPT ); Wed, 29 Nov 2006 04:17:27 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S966464AbWK2JR1 (ORCPT ); Wed, 29 Nov 2006 04:17:27 -0500 Received: from wx-out-0506.google.com ([66.249.82.224]:21850 "EHLO wx-out-0506.google.com") by vger.kernel.org with ESMTP id S966439AbWK2JR0 (ORCPT ); Wed, 29 Nov 2006 04:17:26 -0500 DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=KKShFXk0eUtPY3N6rzS4FAe/PQHM0WcYvrIEM1NCk9HlHQ53Dbr672XRFnzuvMvoBQ+8SRsO21wIBzq7kQjuZIZ3dI0eXwDKwIPBFsOoRMDI8iPAzcW+uVS3fHi9yavR3CzEMMC1PFAf/kZqqNqdju2UQ1gxwceYNSuN5dA2PZQ= Message-ID: <9a8748490611290117oc0ba880v1a6407bc4f41088f@mail.gmail.com> Date: Wed, 29 Nov 2006 10:17:25 +0100 From: "Jesper Juhl" To: "David Chinner" Subject: Re: XFS internal error xfs_trans_cancel at line 1138 of file fs/xfs/xfs_trans.c (kernel 2.6.18.1) Cc: "Linux Kernel Mailing List" , xfs@oss.sgi.com, xfs-masters@oss.sgi.com, "Keith Owens" In-Reply-To: <20061129013214.GH44411608@melbourne.sgi.com> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Content-Disposition: inline References: <9a8748490611280749k5c97d21bx2e499d2209d27dfe@mail.gmail.com> <20061129013214.GH44411608@melbourne.sgi.com> Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3235 Lines: 86 On 29/11/06, David Chinner wrote: > On Tue, Nov 28, 2006 at 04:49:00PM +0100, Jesper Juhl wrote: > > Hi, > > > > One of my NFS servers just gave me a nasty surprise that I think it is > > relevant to tell you about: > > Thanks, Jesper. > > > Filesystem "dm-1": XFS internal error xfs_trans_cancel at line 1138 of > > file fs/xfs/xfs_trans.c. Caller 0xffffffff8034b47e > > > > Call Trace: > > [] show_trace+0xb2/0x380 > > [] dump_stack+0x15/0x20 > > [] xfs_error_report+0x3c/0x50 > > [] xfs_trans_cancel+0x6e/0x130 > > [] xfs_create+0x5ee/0x6a0 > > [] xfs_vn_mknod+0x156/0x2e0 > > [] xfs_vn_create+0xb/0x10 > > [] vfs_create+0x8c/0xd0 > > [] nfsd_create_v3+0x31a/0x560 > > [] nfsd3_proc_create+0x148/0x170 > > [] nfsd_dispatch+0xf9/0x1e0 > > [] svc_process+0x437/0x6e0 > > [] nfsd+0x1cd/0x360 > > [] child_rip+0xa/0x12 > > xfs_force_shutdown(dm-1,0x8) called from line 1139 of file > > fs/xfs/xfs_trans.c. Return address = 0xffffffff80359daa > > We shut down the filesystem because we cancelled a dirty transaction. > Once we start to dirty the incore objects, we can't roll back to > an unchanged state if a subsequent fatal error occurs during the > transaction and we have to abort it. > So you are saying that there's nothing I can do to prevent this from happening in the future? > If I understand historic occurrences of this correctly, there is > a possibility that it can be triggered in ENOMEM situations. Was your > machine running out of memoy when this occurred? > Not really. I just checked my monitoring software and, at the time this happened, the box had ~5.9G RAM free (of 8G total) and no swap used (but 11G available). > > Filesystem "dm-1": Corruption of in-memory data detected. Shutting > > down filesystem: dm-1 > > Please umount the filesystem, and rectify the problem(s) > > nfsd: non-standard errno: 5 > > EIO gets returned in certain locations once the filesystem has > been shutdown. > Makes sense. > > I unmounted the filesystem, ran xfs_repair which told me to try an > > mount it first to replay the log, so I did, unmounted it again, ran > > xfs_repair (which didn't find any problems) and finally mounted it and > > everything is good - the filesystem seems intact. > > Yeah, the above error report typically is due to an in-memory > problem, not an on disk issue. > Good to know. > > The server in question is running kernel 2.6.18.1 > > Can happen to XFS on any kernel version - got a report of this from > someone running a 2.4 kernel a couple of weeks ago.... > Ok. Thank you for your reply David. -- Jesper Juhl Don't top-post http://www.catb.org/~esr/jargon/html/T/top-post.html Plain text mails only, please http://www.expita.com/nomime.html - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/