Return-Path: Received: from smtp-out-1.desy.de ([131.169.56.84]:49914 "EHLO smtp-out-1.desy.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753242Ab0KSL1R (ORCPT ); Fri, 19 Nov 2010 06:27:17 -0500 Received: from smtp-map-1.desy.de (smtp-map-1.desy.de [131.169.56.66]) by smtp-out-1.desy.de (DESY_OUT_1) with ESMTP id AFB301773 for ; Fri, 19 Nov 2010 12:27:15 +0100 (MET) Received: from adserv71.win.desy.de (adserv71.win.desy.de [131.169.97.57]) by smtp-map-1.desy.de (DESY_MAP_1) with ESMTP id A513613E9B for ; Fri, 19 Nov 2010 12:27:15 +0100 (MET) Message-ID: <4CE65F1A.1080600@desy.de> Date: Fri, 19 Nov 2010 12:27:22 +0100 From: Tigran Mkrtchyan To: NFS list CC: Benny Halevy Subject: Re: 2.6.37-rc1 krash References: <4CE54463.7070805@desy.de> <4CE55B5D.3030509@panasas.com> In-Reply-To: <4CE55B5D.3030509@panasas.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Sender: linux-nfs-owner@vger.kernel.org List-ID: MIME-Version: 1.0 Hi Benny, On 11/18/2010 05:59 PM, Benny Halevy wrote: > Hi Tigran, Can you please gdb your nfs.ko module to get the line number of the warning? > (gdb) list *(pnfs_destroy_layout+0xd6) > I'm not sure what arch you're using but these offsets don't make sense on > my x86_64 machine... here is the corresponding lines: if (lo) { pnfs_clear_lseg_list(lo, &tmp_list, &range); WARN_ON(!list_empty(&nfsi->layout->segs)); WARN_ON(!list_empty(&nfsi->layout->layouts)); WARN_ON(nfsi->layout->refcount != 1); /* Matched by refcount set to 1 in alloc_init_layout_hdr */ put_layout_hdr_locked(lo); } > By "crashed", you mean these warnings, or is there anything else? > after that machine was dead and I have to reset it. This messages I got from /var/log/messages and they was the last entries before new boot sequence. I will do the same exercise with -rc2 next week. Regards, Tigran. > Benny > > On 2010-11-18 17:21, Tigran Mkrtchyan wrote: >> During dead client recovery procedure you client crashed in did. >> >> The procedure was: during IO disconnect network cable. We was testing >> server cleanup sequence. Then we have connected client back. After some >> time client reconnected...and crashed. The last thing I have seen was >> BAD_SESSION >> from the server on READ call: >> >> >> >> Nov 18 16:04:29 slinux kernel: [ 555.385877] Got error -10052 from the >> server on DESTROY_SESSION. Session has been destroyed regardless... >> Nov 18 16:04:29 slinux kernel: [ 555.395300] ------------[ cut here >> ]------------ >> Nov 18 16:04:29 slinux kernel: [ 555.395321] WARNING: at >> fs/nfs/pnfs.c:477 pnfs_destroy_layout+0xd6/0x100 [nfs]() >> Nov 18 16:04:29 slinux kernel: [ 555.395323] Hardware name: VirtualBox >> Nov 18 16:04:29 slinux kernel: [ 555.395324] Modules linked in: >> nfs_layout_nfsv41_files nfs lockd fscache nfs_acl auth_rpcgss sunrpc >> ipv6 af_packet binfmt_misc dm_mirror dm_multipath scsi_dh video output >> thermal sbs sbshc pci_slot fan container battery lp sg ac option >> usb_wwan usbserial thermal_sys button parport_pc tpm_tis tpm serio_raw >> tpm_bios parport e1000 i2c_piix4 pata_mpiix dm_region_hash dm_log dm_mod >> [last unloaded: mperf] >> Nov 18 16:04:29 slinux kernel: [ 555.395425] Pid: 2259, comm: >> 131.169.40.35-m Not tainted 2.6.37-rc1.pnfs.1 #1 >> Nov 18 16:04:29 slinux kernel: [ 555.395427] Call Trace: >> Nov 18 16:04:29 slinux kernel: [ 555.395434] [] ? >> pnfs_destroy_layout+0xd6/0x100 [nfs] >> Nov 18 16:04:29 slinux kernel: [ 555.395440] [] ? >> pnfs_destroy_layout+0xd6/0x100 [nfs] >> Nov 18 16:04:29 slinux kernel: [ 555.395444] [] ? >> warn_slowpath_common+0x8c/0xc0 >> Nov 18 16:04:29 slinux kernel: [ 555.395450] [] ? >> pnfs_destroy_layout+0xd6/0x100 [nfs] >> Nov 18 16:04:29 slinux kernel: [ 555.395456] [] ? >> pnfs_destroy_all_layouts+0x82/0xc0 [nfs] >> Nov 18 16:04:29 slinux kernel: [ 555.395462] [] ? >> nfs4_run_state_manager+0x4e3/0x540 [nfs] >> Nov 18 16:04:29 slinux kernel: [ 555.395469] [] ? >> nfs4_run_state_manager+0x0/0x540 [nfs] >> Nov 18 16:04:29 slinux kernel: [ 555.395471] [] ? >> kthread+0x96/0xa0 >> Nov 18 16:04:29 slinux kernel: [ 555.395474] [] ? >> kernel_thread_helper+0x4/0x10 >> Nov 18 16:04:29 slinux kernel: [ 555.395476] [] ? >> kthread+0x0/0xa0 >> Nov 18 16:04:29 slinux kernel: [ 555.395478] [] ? >> kernel_thread_helper+0x0/0x10 >> Nov 18 16:04:29 slinux kernel: [ 555.395479] ---[ end trace >> 8f55223a1de06cc6 ]--- >> Nov 18 16:04:29 slinux kernel: [ 555.395480] ------------[ cut here >> ]------------ >> Nov 18 16:04:29 slinux kernel: [ 555.395486] WARNING: at >> fs/nfs/pnfs.c:478 pnfs_destroy_layout+0xfc/0x100 [nfs]() >> Nov 18 16:04:29 slinux kernel: [ 555.395487] Hardware name: VirtualBox >> Nov 18 16:04:29 slinux kernel: [ 555.395488] Modules linked in: >> nfs_layout_nfsv41_files nfs lockd fscache nfs_acl auth_rpcgss sunrpc >> ipv6 af_packet binfmt_misc dm_mirror dm_multipath scsi_dh video output >> thermal sbs sbshc pci_slot fan container battery lp sg ac option >> usb_wwan usbserial thermal_sys button parport_pc tpm_tis tpm serio_raw >> tpm_bios parport e1000 i2c_piix4 pata_mpiix dm_region_hash dm_log dm_mod >> [last unloaded: mperf] >> Nov 18 16:04:29 slinux kernel: [ 555.395503] Pid: 2259, comm: >> 131.169.40.35-m Tainted: G W 2.6.37-rc1.pnfs.1 #1 >> Nov 18 16:04:29 slinux kernel: [ 555.395504] Call Trace: >> Nov 18 16:04:29 slinux kernel: [ 555.395510] [] ? >> pnfs_destroy_layout+0xfc/0x100 [nfs] >> Nov 18 16:04:29 slinux kernel: [ 555.395516] [] ? >> pnfs_destroy_layout+0xfc/0x100 [nfs] >> Nov 18 16:04:29 slinux kernel: [ 555.395518] [] ? >> warn_slowpath_common+0x8c/0xc0 >> Nov 18 16:04:29 slinux kernel: [ 555.395523] [] ? >> pnfs_destroy_layout+0xfc/0x100 [nfs] >> Nov 18 16:04:29 slinux kernel: [ 555.395529] [] ? >> pnfs_destroy_all_layouts+0x82/0xc0 [nfs] >> Nov 18 16:04:29 slinux kernel: [ 555.395536] [] ? >> nfs4_run_state_manager+0x4e3/0x540 [nfs] >> Nov 18 16:04:29 slinux kernel: [ 555.395542] [] ? >> nfs4_run_state_manager+0x0/0x540 [nfs] >> Nov 18 16:04:29 slinux kernel: [ 555.395544] [] ? >> kthread+0x96/0xa0 >> Nov 18 16:04:29 slinux kernel: [ 555.395546] [] ? >> kernel_thread_helper+0x4/0x10 >> Nov 18 16:04:29 slinux kernel: [ 555.395548] [] ? >> kthread+0x0/0xa0 >> Nov 18 16:04:29 slinux kernel: [ 555.395550] [] ? >> kernel_thread_helper+0x0/0x10 >> Nov 18 16:04:29 slinux kernel: [ 555.395551] ---[ end trace >> 8f55223a1de06cc7 ]--- >> Nov 18 16:04:29 slinux kernel: [ 555.395552] ------------[ cut here >> ]------------ >> Nov 18 16:04:29 slinux kernel: [ 555.395558] WARNING: at >> fs/nfs/pnfs.c:479 pnfs_destroy_layout+0xe9/0x100 [nfs]() >> Nov 18 16:04:29 slinux kernel: [ 555.395559] Hardware name: VirtualBox >> Nov 18 16:04:29 slinux kernel: [ 555.395560] Modules linked in: >> nfs_layout_nfsv41_files nfs lockd fscache nfs_acl auth_rpcgss sunrpc >> ipv6 af_packet binfmt_misc dm_mirror dm_multipath scsi_dh video output >> thermal sbs sbshc pci_slot fan container battery lp sg ac option >> usb_wwan usbserial thermal_sys button parport_pc tpm_tis tpm serio_raw >> tpm_bios parport e1000 i2c_piix4 pata_mpiix dm_region_hash dm_log dm_mod >> [last unloaded: mperf] >> Nov 18 16:04:29 slinux kernel: [ 555.395575] Pid: 2259, comm: >> 131.169.40.35-m Tainted: G W 2.6.37-rc1.pnfs.1 #1 >> Nov 18 16:04:29 slinux kernel: [ 555.395576] Call Trace: >> >> this is >> >> pnfs-all-2.6.37-rc1-2010-11-03 from Banny's tree (git >> 6a1df873544d146fcdc493034b170879985909e8) + >> http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=blobdiff;f=mm/vmstat.c;h=42eac4d33216b81c307a87016e821051bc86146e;hp=cd2e42be7b68f73dc60f40631f2b9f87708d3b47;hb=ff8b16d7e15a8ba2a6086645614a483e048e3fbf;hpb=81a6cff678ecee7cdc0658285d3150660c07cfce >> >> >> >> Regards, >> Tigran. >> -- >> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in >> the body of a message to majordomo@vger.kernel.org >> More majordomo info at http://vger.kernel.org/majordomo-info.html