From: "Gregory Baker" Subject: Re: bug in linux mount? (says NetApp) Date: Tue, 11 Jul 2006 20:07:40 -0500 Message-ID: <44B44B5C.7020603@amd.com> References: <44B3F547.9010507@amd.com> Reply-To: gregory.baker@amd.com Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Cc: autofs@linux.kernel.org, nfs@lists.sourceforge.net Return-path: Received: from sc8-sf-mx1-b.sourceforge.net ([10.3.1.91] helo=mail.sourceforge.net) by sc8-sf-list2-new.sourceforge.net with esmtp (Exim 4.43) id 1G0TCm-0002Vq-99 for nfs@lists.sourceforge.net; Tue, 11 Jul 2006 18:07:56 -0700 Received: from outbound-cpk.frontbridge.com ([207.46.163.16] helo=outbound2-cpk-R.bigfish.com) by mail.sourceforge.net with esmtps (TLSv1:DES-CBC3-SHA:168) (Exim 4.44) id 1G0TCm-0006IQ-5F for nfs@lists.sourceforge.net; Tue, 11 Jul 2006 18:07:56 -0700 To: "Blake Golliher" In-Reply-To: List-Id: "Discussion of NFS under Linux development, interoperability, and testing." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: nfs-bounces@lists.sourceforge.net Errors-To: nfs-bounces@lists.sourceforge.net We have various versions; 3500 and R series are running 7.x, the older 840/880 are running 6.5.x NetApp Release 7.0.2P4: Wed Nov 30 01:52:11 PST 2005 charity.amd.com NetApp Release 7.0.3P4: Sat Jan 28 06:03:18 PST 2006 chang.amd.com NetApp Release 6.5.5P7: Thu Jul 21 02:10:50 PDT 2005 diego.amd.com NetApp Release 7.0.3P4: Sat Jan 28 06:03:18 PST 2006 eng.amd.com NetApp Release 6.5.5P7: Thu Jul 21 02:10:50 PDT 2005 escher.amd.com NetApp Release 6.5.5P7: Thu Jul 21 02:10:50 PDT 2005 faith.amd.com NetApp Release 6.5.6P10D3:Thu Apr 13 22:13:21 PDT 2006 fortitud.amd.com NetApp Release 6.5.5P7: Thu Jul 21 02:10:50 PDT 2005 frida.amd.com NetApp Release 6.5.5P7: Thu Jul 21 02:10:50 PDT 2005 hope.amd.com NetApp Release 6.5.5P7: Thu Jul 21 02:10:50 PDT 2005 justice.amd.com NetApp Release 7.0.3P4: Sat Jan 28 06:03:18 PST 2006 kuching.amd.com NetApp Release 6.5.5P7: Thu Jul 21 02:10:50 PDT 2005 okeeffe.amd.com NetApp Release 7.0.2P4: Wed Nov 30 01:52:11 PST 2005 prudence.amd.com NetApp Release 6.5.5P7: Thu Jul 21 02:10:50 PDT 2005 raffael.amd.com NetApp Release 7.0.3P4: Sat Jan 28 06:03:18 PST 2006 siam.amd.com NetApp Release 6.5.5P7: Thu Jul 21 02:10:50 PDT 2005 sobriety.amd.com NetApp Release 6.5.5P7: Thu Jul 21 02:10:50 PDT 2005 vangogh.amd.com NetApp Release 7.2RC1: Wed Feb 15 00:46:09 PST 2006 jacen.amd.com NetApp Release 7.2RC1: Wed Feb 15 00:46:09 PST 2006 jaina.amd.com --Greg Blake Golliher wrote: > What version of OnTap are you running? > > -Blake > > On Jul 11, 2006, at 12:00 PM, Gregory Baker wrote: > >> >> We have thousands of linux clients hitting netapp file servers (many >> 3500 series, clustered) on a local gigabit LAN. From time to time, >> applications return "file not found" when attempting to automount a >> directory and access a file. An example of this is a long running >> process, which reads in data, processes it for hours (in which time the >> filesystem is unmounted) then tries to read more data from that mount >> point (which causes a "file not found" error in the application). This >> occurs about 1/100th of the time. >> >> Researching at Netapp turns up this bit by Chuck Lever (Linux NFS >> contributer) >> >> "Using the Linux NFS Client with Network Appliance Filers" >> http://www.netapp.com/libr ary/tr/3183.pdf (February 2006) >> >> page 10 says... >> >> "Due to a bug in the mount command, the default retransmission timeout >> value on Linux for NFS over TCP is quite small...To obtain standard >> behavior, we strongly recommend using "timeo=600, retrans=2" explicitly >> when mounting via TCP." >> >> Our defaults (assuming man pages are correct, RedHat Enterprise Linux 3) >> would be timeo=7, retrans=3, which translates to 7+14+28+56 = 105 tenths >> of a second (10 seconds). It appears netapp is suggesting waiting >> 600+600 = 1200 tenths (120 seconds) before giving up on the mount >> command... >> >> * What "bug" in the mount command do you believe NetApp is talking about? >> >> * What do you think proper options for NFS auto/mounts would be for >> extremely busy centralized NFS filers? >> >> * What is the reference standard behavior? >> >> Thanks, >> >> --Greg >> >> -- ---------------------------------------------------------------------- >> Greg Baker 512-602-3287 (work) >> gregory.baker@amd.com 512-602-6970 (fax) >> 5900 E. Ben White Blvd MS 626 512-555-1212 (info) >> Austin, TX 78741 >> >> >> >> >> >> ------------------------------------------------------------------------- >> Using Tomcat but need to do more? Need to support web services, security? >> Get stuff done quickly with pre-integrated technology to make your job >> easier >> Download IBM WebSphere Application Server v.1.0.1 based on Apache >> Geronimo >> http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 >> _______________________________________________ >> NFS maillist - NFS@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nfs >> >> > -- ---------------------------------------------------------------------- Greg Baker 512-602-3287 (work) gregory.baker@amd.com 512-602-6970 (fax) 5900 E. Ben White Blvd MS 626 512-555-1212 (info) Austin, TX 78741 ------------------------------------------------------------------------- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 _______________________________________________ NFS maillist - NFS@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nfs