Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id ; Fri, 8 Mar 2002 08:00:05 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id ; Fri, 8 Mar 2002 08:00:02 -0500 Received: from tolkor.sgi.com ([192.48.180.13]:12958 "EHLO tolkor.sgi.com") by vger.kernel.org with ESMTP id ; Fri, 8 Mar 2002 07:59:46 -0500 Message-ID: <3C88B612.1070206@sgi.com> Date: Fri, 08 Mar 2002 07:01:06 -0600 From: Stephen Lord User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:0.9.7) Gecko/20011226 X-Accept-Language: en-us MIME-Version: 1.0 To: Svetoslav Slavtchev CC: linux-kernel@vger.kernel.org Subject: Re: 2.4.18-rc4-aa1 XFS oopses caused by cpio In-Reply-To: <1015580766.20800.3.camel@svetljo.st-peter.stw.uni-erlangen.de> Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Svetoslav Slavtchev wrote: >that's what i got > >Unable to handle kernel NULL pointer dereference at virtual address >00000008 >801dba4e >*pde = 00000000 >Oops: 0000 >CPU: 0 >EIP: 0010:[<801dba4e>] Not tainted >Using defaults from ksymoops -t elf32-i386 -a i386 >EFLAGS: 00010046 >eax: 00000120 ebx: 00000000 ecx: 00000282 edx: 00000000 >esi: 8b745a00 edi: 00000008 ebp: 9aa92000 esp: 9ca31904 >ds: 0018 es: 0018 ss: 0018 >Process cpio (pid: 20435, stackpage=9ca31000) >Stack: 00000282 00000004 8d5f4200 88dc0248 8d65b8c0 801db30d 88dc0248 >00000008 > 00000004 00000001 8d65b980 8d65b8c0 00000004 88dc0248 801db081 >88dc0248 > 8d65b980 8d65b8c0 00000004 8d65ba40 00000000 00000004 00000004 >8b745b20 >Call Trace: [<801db30d>] [<801db081>] [<801db674>] [<8022db0d>] >[<80209989>] > [<8023bc4d>] [<8022dd94>] [<801db3a2>] [<8022e747>] [<80222538>] >[<8020a412>] > [<8021d4b9>] [<80212496>] [<8020f959>] [<802236ac>] [<802299f5>] >[<80222f71>] > [<80234f0e>] [<801d83bd>] [<80220981>] [<8020e103>] [<8022341a>] >[<8020e103>] > [<80227c17>] [<80235098>] [<80235348>] [<80145966>] [<80145a52>] >[<8010722b>] >Code: 8b 4a 08 85 c9 74 1b 8d 74 26 00 8d bc 27 00 00 00 00 43 83 > >>>EIP; 801dba4e <===== >>> >Trace; 801db30c >Trace; 801db080 >Trace; 801db674 >Trace; 8022db0c <_pagebuf_get_object+3c/140> >Trace; 80209988 >Trace; 8023bc4c >Trace; 8022dd94 <_pagebuf_free_object+f4/120> >Trace; 801db3a2 >Trace; 8022e746 >Trace; 80222538 >Trace; 8020a412 >Trace; 8021d4b8 >Trace; 80212496 >Trace; 8020f958 >Trace; 802236ac >Trace; 802299f4 >Trace; 80222f70 >Trace; 80234f0e >Trace; 801d83bc >Trace; 80220980 >Trace; 8020e102 >Trace; 8022341a >Trace; 8020e102 >Trace; 80227c16 >Trace; 80235098 >Trace; 80235348 >Trace; 80145966 >Trace; 80145a52 >Trace; 8010722a >Code; 801dba4e >00000000 <_EIP>: >Code; 801dba4e <===== > 0: 8b 4a 08 mov 0x8(%edx),%ecx <===== >Code; 801dba50 > 3: 85 c9 test %ecx,%ecx >Code; 801dba52 > 5: 74 1b je 22 <_EIP+0x22> 801dba70 > >Code; 801dba54 > 7: 8d 74 26 00 lea 0x0(%esi,1),%esi >Code; 801dba58 > b: 8d bc 27 00 00 00 00 lea 0x0(%edi,1),%edi >Code; 801dba60 > 12: 43 inc %ebx >Code; 801dba60 > 13: 83 00 00 addl $0x0,(%eax) > This is this chunk of code: for (bsy = mp->m_perag[agno].pagb_list, n = 0; c01970e2: 31 db xor %ebx,%ebx c01970e4: 8b b5 e4 01 00 00 mov 0x1e4(%ebp),%esi c01970ea: 8b 54 06 20 mov 0x20(%esi,%eax,1),%edx n < XFS_PAGB_NUM_SLOTS; bsy++, n++) { if (bsy->busy_tp == NULL) { c01970ee: 8b 4a 08 mov 0x8(%edx),%ecx It looks like bsy is NULL, which means either a corruption problem (agno too large), or a memory allocation problem during mount - did you get any error messages when you mounted the filesystem? Had you performed much I/O before this happened, or was the cpio the first thing you did. If this structure is missing then chances are you would not get far before dying. We have already referenced other fields in the m_perag[agno] structure, so I lean towards pagb_list itself being NULL. Can you send me the output of mkfs if you still have it, or failing that, the output of xfs_growfs -n /xxx where /xxx is the mounted filesystem. This chunk of code is new, you may have found a hole in it - although I have been running it for a couple of months here without problem. It could also be something specific to the aa kernel, I have not looked at this merge of XFS. Steve - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/