Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id ; Fri, 30 Nov 2001 06:53:57 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id ; Fri, 30 Nov 2001 06:53:48 -0500 Received: from elin.scali.no ([62.70.89.10]:14854 "EHLO elin.scali.no") by vger.kernel.org with ESMTP id ; Fri, 30 Nov 2001 06:53:39 -0500 Message-ID: <3C077FF8.AFBD8DB8@scali.no> Date: Fri, 30 Nov 2001 13:47:52 +0100 From: Steffen Persvold X-Mailer: Mozilla 4.76 [en] (X11; U; Linux 2.4.9-ac18 i686) X-Accept-Language: en MIME-Version: 1.0 To: lkml , nfs list Subject: 2.4.9 kernel crash Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Hi all, This just happened to my RedHat 7.2 box running the 2.4.9-13 update kernel from RedHat. The box is running as a NFS server, exporting two ext3 volumes (one 36GB and one 73GB) : VFS: Busy inodes after unmount. Self-destruct in 5 seconds. Have a nice day... Unable to handle kernel NULL pointer dereference at virtual address 0000000b printing eip: c01537aa *pde = 00000000 Oops: 0000 CPU: 0 EIP: 0010:[] Not tainted EFLAGS: 00010282 eax: fffffffb ebx: c0a69580 ecx: c0a69590 edx: c0a69590 esi: cf12bce0 edi: fffffffb ebp: fffffb4c esp: cff6bf60 ds: 0018 es: 0018 ss: 0018 Process kswapd (pid: 5, stackpage=cff6b000) Stack: c02dbdf8 ffffff55 cbf027c8 c181a8a0 cf12bcf8 cf12bce0 c0a69580 c0150eb6 c0a69580 00000419 00000000 c1022dc4 c1022dc4 c1022d9c 00000000 c01357a5 00000001 00000ed2 000000c0 000000c0 0008e000 c01512a1 00000000 c0135bab Call Trace: [] prune_dcache [kernel] 0xf6 [] page_launder [kernel] 0x8f5 [] shrink_dcache_memory [kernel] 0x21 [] do_try_to_free_pages [kernel] 0x1b [] kswapd [kernel] 0x55 [] stext [kernel] 0x0 [] kernel_thread [kernel] 0x26 [] kswapd [kernel] 0x0 Code: 8b 47 10 85 c0 74 04 53 ff d0 58 68 40 be 2d c0 8d 43 24 50 <1>Unable to handle kernel NULL pointer dereference at virtual address 0000000b printing eip: c01537aa *pde = 00000000 Oops: 0000 CPU: 1 EIP: 0010:[] Not tainted EFLAGS: 00010282 eax: fffffffb ebx: cc33a580 ecx: cc33a590 edx: cc33a590 esi: cf543aa0 edi: fffffffb ebp: 00000000 esp: cc62dd34 ds: 0018 es: 0018 ss: 0018 Process nfsd (pid: 979, stackpage=cc62d000) Stack: c65bb8a0 c02da480 000008b4 c02da460 cf543ab8 cf543aa0 cc33a580 c0150eb6 cc33a580 0000045a 00000000 c1025008 c1025008 c1024fe0 00000000 c01357a5 00000001 00001648 000000d2 00000000 000000d2 c01512a1 00000000 c0135bab Call Trace: [] prune_dcache [kernel] 0xf6 [] page_launder [kernel] 0x8f5 [] shrink_dcache_memory [kernel] 0x21 [] do_try_to_free_pages [kernel] 0x1b [] try_to_free_pages [kernel] 0x28 [] _wrapped_alloc_pages [kernel] 0x1c1 [] __alloc_pages [kernel] 0xf [] generic_file_readahead [kernel] 0x201 [] do_generic_file_read [kernel] 0x26d [] generic_file_read [kernel] 0x64 [] file_read_actor [kernel] 0x0 [] __insmod_nfsd_S.text_L52160 [nfsd] 0x4044 [] __insmod_ext3_S.data_L672 [ext3] 0xc0 [] __insmod_nfsd_S.text_L52160 [nfsd] 0x91ab [] __insmod_nfsd_S.data_L2208 [nfsd] 0x660 [] __insmod_nfsd_S.text_L52160 [nfsd] 0x551 [] __insmod_nfsd_S.data_L2208 [nfsd] 0x660 [] svc_process_Rsmp_64b56219 [sunrpc] 0x34a [] __insmod_nfsd_S.data_L2208 [nfsd] 0x18 [] __insmod_nfsd_S.data_L2208 [nfsd] 0x38 [] __insmod_nfsd_S.text_L52160 [nfsd] 0x33b [] kernel_thread [kernel] 0x26 [] __insmod_nfsd_S.text_L52160 [nfsd] 0x130 Code: 8b 47 10 85 c0 74 04 53 ff d0 58 68 40 be 2d c0 8d 43 24 50 <1>Unable to handle kernel NULL pointer dereference at virtual address 0000000b printing eip: c01537aa *pde = 00000000 Oops: 0000 CPU: 0 EIP: 0010:[] Not tainted EFLAGS: 00010282 eax: fffffffb ebx: c7245900 ecx: c7245910 edx: c7245910 esi: c56779e0 edi: fffffffb ebp: 00000000 esp: cc6ebd34 ds: 0018 es: 0018 ss: 0018 Process nfsd (pid: 980, stackpage=cc6eb000) Stack: c0933d20 c0933d20 00000003 c0141ea8 c56779f8 c56779e0 c7245900 c0150eb6 c7245900 00000000 c1015278 00000000 c12df20c c1015278 00000000 c01357a5 00000000 000020f2 000000d2 00000000 000000d2 c01512a1 00000000 c0135bab Call Trace: [] try_to_free_buffers [kernel] 0xf8 [] prune_dcache [kernel] 0xf6 [] page_launder [kernel] 0x8f5 [] shrink_dcache_memory [kernel] 0x21 [] do_try_to_free_pages [kernel] 0x1b [] try_to_free_pages [kernel] 0x28 [] _wrapped_alloc_pages [kernel] 0x1c1 [] __alloc_pages [kernel] 0xf [] generic_file_readahead [kernel] 0x201 [] do_generic_file_read [kernel] 0x3d8 [] generic_file_read [kernel] 0x64 [] file_read_actor [kernel] 0x0 [] __insmod_nfsd_S.text_L52160 [nfsd] 0x4044 [] __insmod_ext3_S.data_L672 [ext3] 0xc0 [] __insmod_nfsd_S.text_L52160 [nfsd] 0x91ab [] __insmod_nfsd_S.data_L2208 [nfsd] 0x660 [] __insmod_nfsd_S.text_L52160 [nfsd] 0x551 [] __insmod_nfsd_S.data_L2208 [nfsd] 0x660 [] svc_process_Rsmp_64b56219 [sunrpc] 0x34a [] __insmod_nfsd_S.data_L2208 [nfsd] 0x18 [] __insmod_nfsd_S.data_L2208 [nfsd] 0x38 [] __insmod_nfsd_S.text_L52160 [nfsd] 0x33b [] kernel_thread [kernel] 0x26 [] __insmod_nfsd_S.text_L52160 [nfsd] 0x130 Code: 8b 47 10 85 c0 74 04 53 ff d0 58 68 40 be 2d c0 8d 43 24 50 <7>eth0: 0 multicast blocks dropped. The machine did not crash completely and I was still able to access it from remote (through ssh). When I looked at top, kswapd was 'defunct' (zombie). Is this something that is fixed in newer 'vanilla' kernels (e.g 2.4.16) ? Regards, -- Steffen Persvold | Scalable Linux Systems | Try out the world's best mailto:sp@scali.no | http://www.scali.com | performing MPI implementation: Tel: (+47) 2262 8950 | Olaf Helsets vei 6 | - ScaMPI 1.12.2 - Fax: (+47) 2262 8951 | N0621 Oslo, NORWAY | >300MBytes/s and <4uS latency - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/