From: Dennis Nezic Subject: Re: nfs: server not responding, timed out Date: Fri, 19 Mar 2010 00:27:20 -0400 Message-ID: <20100319002720.0e93411e.dennisn@dennisn.dyndns.org> References: <20100318170603.f6a7f188.dennisn@dennisn.dyndns.org> <4BA2DFC5.1010400@cn.fujitsu.com> Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII To: linux-nfs@vger.kernel.org Return-path: Received: from lo.gmane.org ([80.91.229.12]:42426 "EHLO lo.gmane.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750733Ab0CSE1c (ORCPT ); Fri, 19 Mar 2010 00:27:32 -0400 Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1NsTo3-0003Rx-Ci for linux-nfs@vger.kernel.org; Fri, 19 Mar 2010 05:27:31 +0100 Received: from 67.55.41.152 ([67.55.41.152]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Fri, 19 Mar 2010 05:27:31 +0100 Received: from dennisn by 67.55.41.152 with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Fri, 19 Mar 2010 05:27:31 +0100 Sender: linux-nfs-owner@vger.kernel.org List-ID: On Fri, 19 Mar 2010 10:21:57 +0800, Bian Naimeng wrote: > > After upgrading my server (kernel 2.6.19 to 2.6.33, nfs-utils 1.1.0 > > to 1.2.1/1.1.4/1.1.6), and probably other stuff too), and possibly > > my client laptop's kernel, I have suddenly started to get these > > "server X not responding, timed out" errors (on my client), > > especially (only?) when doing large file transfers. This would lead > > to input/output errors, and the transfers would fail. > > > > I never noticed any such problems for over two years, using the > > older versions. The networking (wifi link) hasn't changed. > > > > Usually the file transfer trips and falls over itself near the end > > -- Ie. it will do 600MB out of 800MB just fine, and then suddently > > start giving these "timed out" errors, and then crash and burn. At > > this point, I am forced to "umount -fl" the mount. If I then try to > > remount it, the server acnowledges my "authenticated mount > > requests" perfectly fine, but my client (laptop) still appears > > "hung". After a few minutes, I am able to remount it. > > > > I tried playing with the rsize/wsize/timeo/retrans variables, but > > none of it seemed to fix the problem. > > > > Any ideas about what has changed? Maybe this is/was a well-known > > problem? :P > > > > I do not know the what's the reason. And I am not sure the followed > discussion can fix this problem, but maybe it can help you. > http://marc.info/?l=linux-nfs&m=123478426412524&w=2 Both the patches mentioned in that thread already seem to have been applied to my kernels. So, although the problem seems related, it wasn't that bug in particular. The person in that thread was talking about mounts dying after 5-15minutes, which doesn't happen with me -- my problem only seems to occur under intense activity.