From: Garrick Staples
Subject: mountd segfault on itanium2
Date: Fri, 30 Apr 2004 14:24:14 -0700
Message-ID: <20040430212414.GF22498@polop.usc.edu>
To: nfs@lists.sourceforge.net

Hi all,

I'm having a terrible time with mountd segfaulting on two Itanium boxes. I
can't find a specific trigger, but I can generally trigger it within a few
minutes by just calling mount/umount a few hundred times. I'm using glibc
2.3.2 and nfs-utils 1.0.6 from RHE.

In the tests below, I have a single directory exported to 10.125.0.0/16.
Since I know name resolution was a recent problem, I've made sure all
clients are in /etc/hosts. I'm using NIS, but files is before dns and nis
in nsswitch.conf.
I've also tested with and without nscd running.

Thanks in advance for any help.

gdb isn't showing much in a backtrace, but I can supply a core if anyone
wants it.

# gdb --core=/var/lib/nfs/core.15841
...
This GDB was configured as "ia64-redhat-linux-gnu".
Core was generated by `./mountd -F -d all'.
Program terminated with signal 11, Segmentation fault.
#0  0x20000008002c19d0 in ?? ()
(gdb) bt
#0  0x20000008002c19d0 in ?? ()
#1  0x20000008002c1950 in ?? ()
Previous frame identical to this frame (corrupt stack?)

I have a few different straces that show the segfault happening in
different places in the code:

open("/proc/fs/nfsd/filehandle", O_RDWR) = 9
fstat(9, {st_mode=S_IFREG|0600, st_size=0, ...}) = 0
mmap(NULL, 65536, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x2000000800440000
write(9, "10.125.0.0/16 /export/usc-01 64 "..., 33) = 33
read(9, "\\x010000000008001102000000\n", 16384) = 27
close(9) = 0
munmap(0x2000000800440000, 65536) = 0
brk(0) = 0x2000000800038000
sendmsg(6, {msg_name(16)={sa_family=AF_INET, sin_port=htons(641), sin_addr=inet_addr("10.125.1.176")}, msg_iov(1)=[{"#\246?\315\0\0\0\1\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 56}], msg_controllen=32, msg_control=0x2000000800341dd8, , msg_flags=0}, 0) = 56
select(1024, [3 4 5 6 7], NULL, NULL, NULL) = 2 (in [5 6])
read(5, "", 0) = 0
--- SIGSEGV (Segmentation fault) @ 20000008002c19d0 (63742f3132353111) ---

open("/var/lib/nfs/rmtab", O_RDWR) = 10
fstat(10, {st_mode=S_IFREG|0644, st_size=6445, ...}) = 0
mmap(NULL, 65536, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x200000000037c000
lseek(10, 0, SEEK_CUR) = 0
read(10, "10.125.1.10:10.125.0.0/16:0x0000"..., 16384) = 6445
lseek(10, 6445, SEEK_SET) = 6445
lseek(10, -4678, SEEK_CUR) = 1767
write(10, "10.125.0.0/16:/export/usc-01:0x0"..., 40) = 40
fdatasync(10) = 0
close(10) = 0
munmap(0x200000000037c000, 65536) = 0
close(8) = 0
gettimeofday({1083357502, 998698}, NULL) = 0
write(5, "10.125.0.0/16 0 \\x00080011020000"..., 62) = 62
--- SIGSEGV (Segmentation fault) @ 20000000002899d0 (7064752f35343639) ---

-- 
Garrick Staples, Linux/HPCC Administrator
University of Southern California

_______________________________________________
NFS maillist - NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs
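
A note on the two SIGSEGV lines in the straces: the values in parentheses
(63742f3132353111 and 7064752f35343639) consist almost entirely of printable
ASCII byte values. The sketch below is not part of the original report. It
assumes each value is a raw 64-bit word and that memory is little-endian, as
it is for Linux on Itanium, and prints the bytes in memory order. The helper
name decode_word is hypothetical.

```python
def decode_word(value: int) -> str:
    """Show a 64-bit word as the 8 bytes it would occupy in little-endian
    memory, replacing non-printable bytes with '.'."""
    raw = value.to_bytes(8, byteorder="little")
    return "".join(chr(b) if 32 <= b < 127 else "." for b in raw)


# The two parenthesised values copied from the SIGSEGV lines in the straces.
for word in (0x63742F3132353111, 0x7064752F35343639):
    print(f"{word:016x} -> {decode_word(word)}")
```

Under those assumptions the words read as ".1521/tc" and "9645/udp", which
look like port/protocol text of the kind found in /etc/services. That might
point to a pointer being overwritten with string data, but it is only a
guess from two samples.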