Return-Path: Received: from rcsinet10.oracle.com ([148.87.113.121]:23981 "EHLO rcsinet10.oracle.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751605Ab0LHVEl convert rfc822-to-8bit (ORCPT ); Wed, 8 Dec 2010 16:04:41 -0500 Subject: Re: mount.nfs timeout of 9999ms (was Re: Listen backlog set to 64) Content-Type: text/plain; charset=us-ascii From: Chuck Lever In-Reply-To: <20101208183738.GA9996@fieldses.org> Date: Wed, 8 Dec 2010 16:04:14 -0500 Cc: Mark Hills , Neil Brown , linux-nfs@vger.kernel.org Message-Id: <13B622C7-5FC8-4297-BC1F-FD81E45D5D07@oracle.com> References: <20101117090826.4b2724da@notabene.brown> <20101129205935.GD9897@fieldses.org> <20101130200013.GA2108@fieldses.org> <45243C0D-2158-47DD-91AF-C91AB960F6B4@oracle.com> <20101208183738.GA9996@fieldses.org> To: "J. Bruce Fields" Sender: linux-nfs-owner@vger.kernel.org List-ID: MIME-Version: 1.0 On Dec 8, 2010, at 1:37 PM, J. Bruce Fields wrote: > On Wed, Dec 08, 2010 at 01:28:06PM -0500, Chuck Lever wrote: >> From what I have read, the server's configuration is the immediate problem here: namely that the disk that /var is on is slow, or there are bugs (or not-very-useful behavior) in the file system implementation used on /var, or that rpm should not cause such latency for other applications, or that the multi-threading architecture of mountd is incorrect. Or some combination of these. >> >> A ten second latency for file system operations is the real issue. >> >>>> IMO adding another mount option would be mistaken. It adds clutter to >>>> the mount user interface to work around what is clearly a problem on the >>>> server. >>> >>> I agree that new options can be cluttered. >>> >>> But I think the problem needs attention at both the client and sever side. >> >> It seems reasonable to relax mount.nfs's timeout settings used when performing mount and rpcbind requests. The question I have is what would be reasonable values to use instead of what we have? (That question is directed to everyone who is still reading). > > What's the behavior you want when the server is just off or unreachable? > > I'd've thought that there are cases where you just want the mount to > hang till it's interrupted or the problem is fixed. (Autofs possibly > being one of them.) Be it noted however that autofs usually wants a quick yes or no answer in these cases. Auto mounts that take a long time to succeed usually drive users crazy, especially if the automounter is single-threaded. That's why this timeout is kept short. -- Chuck Lever chuck[dot]lever[at]oracle[dot]com