From: Trond Myklebust
Subject: Re: NFS regression? Odd delays and lockups accessing an NFS export.
Date: Sun, 24 Aug 2008 18:15:36 -0400
Message-ID: <1219616136.14389.12.camel@localhost>
References: <1219087258.7192.19.camel@localhost>
 <1219400624.18774.67.camel@zakaz.uk.xensource.com>
 <1219428489.6919.21.camel@localhost>
 <1219428818.27921.43.camel@localhost.localdomain>
 <56a8daef0808221233h68853587n6015ca7d809b17e1@mail.gmail.com>
 <1219435207.27921.51.camel@localhost.localdomain>
 <1219440202.9097.14.camel@localhost>
 <1219441041.27921.57.camel@localhost.localdomain>
 <1219442213.9097.25.camel@localhost>
 <1219603981.27921.145.camel@localhost.localdomain>
 <1219605422.14389.2.camel@localhost>
 <1219605596.14389.5.camel@localhost>
 <1219615789.27921.152.camel@localhost.localdomain>
Mime-Version: 1.0
Content-Type: text/plain
Cc: John Ronciak, Grant Coady, linux-kernel@vger.kernel.org, neilb@suse.de,
 bfields@fieldses.org, linux-nfs@vger.kernel.org, Jeff Kirsher,
 Jesse Brandeburg, Bruce Allan, PJ Waskiewicz, John Ronciak,
 e1000-devel@lists.sourceforge.net
To: Ian Campbell
Received: from mail-out1.uio.no ([129.240.10.57]:54248 "EHLO mail-out1.uio.no"
 rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP
 id S1750909AbYHXWPp (ORCPT ); Sun, 24 Aug 2008 18:15:45 -0400
In-Reply-To: <1219615789.27921.152.camel-bi+AKbBUZKY6gyzm1THtWbp2dZbC/Bob@public.gmane.org>
Sender: linux-nfs-owner@vger.kernel.org

On Sun, 2008-08-24 at 23:09 +0100, Ian Campbell wrote:
> (added some quoting from previous mail to save replying twice)
>
> On Sun, 2008-08-24 at 15:19 -0400, Trond Myklebust wrote:
> > On Sun, 2008-08-24 at 15:17 -0400, Trond Myklebust wrote:
> > > From the tcpdump, it looks as if the NFS server is failing to close the
> > > socket, when the client closes its side. You therefore end up getting
> > > stuck in the FIN_WAIT2 state (as netstat clearly shows above).
> > >
> > > Is the server keeping the client in this state for a very long
> > > period?
>
> Well, it had been around an hour and a half on this occasion. Next time
> it happens I can wait longer but I'm pretty sure I've come back from
> time away and it's been wedged for at least a day. How long would you
> expect it to remain in this state for?

The server should ideally start to close the socket as soon as it
receives the FIN from the client. I'll have a look at the code.

Trond
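
For readers following the thread, here is a minimal userspace sketch of the
close handshake described above. It is illustrative Python only, not the
kernel NFS server code referred to in the message, and the function names
(serve_once, client_sees_server_close) are hypothetical. It assumes a
well-behaved server: one that treats the end-of-stream seen after the
client's FIN as the cue to close its own side, so the client does not stay
in FIN_WAIT2.

```python
import socket
import threading


def serve_once(listener: socket.socket) -> None:
    """Accept one connection and close it once the peer's FIN is seen."""
    conn, _ = listener.accept()
    with conn:
        # recv() returning b"" means the peer has sent FIN (closed its side).
        while conn.recv(4096):
            pass
    # Leaving the `with` block closes the socket, which sends our own FIN.
    # A server that skips this close leaves the client in FIN_WAIT2.


def client_sees_server_close(timeout: float = 5.0) -> bool:
    """Return True if the server closes its side after the client's FIN."""
    listener = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    listener.bind(("127.0.0.1", 0))  # port 0: let the OS pick a free port
    listener.listen(1)
    worker = threading.Thread(target=serve_once, args=(listener,))
    worker.start()

    client = socket.create_connection(listener.getsockname(), timeout=timeout)
    client.sendall(b"ping")
    client.shutdown(socket.SHUT_WR)  # send FIN; the client side is now closing
    eof = client.recv(4096) == b""   # b"" here means the server's FIN arrived
    client.close()
    worker.join()
    listener.close()
    return eof
```

Calling client_sees_server_close() against this loopback server should
return True. The situation in the thread corresponds to a server that never
reaches the close step, in which case the client's final recv() would block
until the timeout instead.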