From: Ian Thurlbeck
Subject: Re: Strange delays on NFS server
Date: Thu, 12 Aug 2004 16:15:19 +0100
Message-ID: <411B8987.1030609@stams.strath.ac.uk>
References: <4119FB15.7010205@stams.strath.ac.uk> <411A17F2.2060203@RedHat.com> <411A448D.3080205@stams.strath.ac.uk> <20040811164135.GA11101@suse.de>
To: Olaf Kirch
Cc: Steve Dickson, nfs@lists.sourceforge.net
In-Reply-To: <20040811164135.GA11101@suse.de>
List-Id: Discussion of NFS under Linux development, interoperability, and testing.

Olaf Kirch wrote:
> On Wed, Aug 11, 2004 at 05:08:45PM +0100, Ian Thurlbeck wrote:
>
>> OK, I've been running "top -d 1 -i" and trying to see what comes up when
>> the server freezes. I caught one instance where a delay coincided
>> with about 15 nfsd + 1 kjournald processes appearing in the top
>> display. I'm simultaneously looking at a graphical network tool to try
>> and see the traffic going to the server - anyone got a better suggestion?
>
> This sounds exactly like the COMMIT stall problem for which I submitted
> the early-writeout patch to this list about a week ago.
>
> I've been thinking about this a little more. It may be that one reason
> the problem is more pronounced in 2.6 than in 2.4 is the new I/O
> barrier code. In 2.6 ext3 uses barriers by default; SUSE's 2.6 has
> reiserfs patches that add barriers (and enable them by default). We've
> had reports of this problem on both file systems.
>
> JFS does I/O barriers while XFS does not, and this also fits the
> pattern of what Ian reports. I dimly remember there's a kernel command
> line option to turn off barriers at the block I/O level. Can you try
> whether that helps, Ian?
>
> The more I think about this, the more I believe the early-writeout
> patch is the right way to address this problem (short of turning off
> barriers). When data hits the NFS server, it is supposed to go to disk
> fairly soon anyway. This also covers most of the rewrite case, at least
> as long as you have just one application writing to the file - all
> rewriting happens in the client cache.
>
> The crucial question is what a good heuristic is for choosing when to
> initiate a write-out. Sequential writes to the end of file are easy
> enough to detect.
>
> I have a somewhat updated version of my patch that covers just this
> case, and exports a sysctl to let you tune how often it initiates
> an early write-out.
>
> Olaf

Dear All

I ran top in batch mode this afternoon, turned off the imapd/smb
services for good measure, and noted the times of any delays.
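To make the heuristic Olaf describes concrete, here is a minimal
user-space model of it: detect sequential appends and trigger a
write-out once a tunable number of bytes has accumulated (standing in
for the proposed sysctl). All names and the interval value here are
hypothetical - this is a sketch of the idea, not the actual kernel
patch:

```python
# Hypothetical model of the early-writeout heuristic: track the offset
# where a sequential append would continue, and report when enough
# sequential data has piled up to be worth flushing early, so a later
# NFS COMMIT finds little left to write.

class EarlyWriteout:
    def __init__(self, writeout_interval=1024 * 1024):
        self.writeout_interval = writeout_interval  # tunable, like the sysctl
        self.expected_offset = 0  # where a sequential append would continue
        self.pending_bytes = 0    # dirty bytes seen since the last write-out
        self.writeouts = 0        # number of early write-outs triggered

    def write(self, offset, length):
        """Record one WRITE; return True if an early write-out should start."""
        if offset != self.expected_offset:
            # Non-sequential write: reset tracking, don't force I/O early.
            self.expected_offset = offset + length
            self.pending_bytes = 0
            return False
        self.expected_offset += length
        self.pending_bytes += length
        if self.pending_bytes >= self.writeout_interval:
            self.pending_bytes = 0
            self.writeouts += 1
            return True
        return False


# A client streaming 4 MB in 64 KB sequential WRITEs triggers an early
# write-out every 1 MB:
ew = EarlyWriteout()
flushes = sum(ew.write(i * 65536, 65536) for i in range(64))
print(flushes)  # 4
```

A random rewrite (offset jumping backwards) simply resets the tracker,
which matches Olaf's point that the rewrite case mostly stays in the
client cache anyway.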
I then checked the top log file and they all coincide with something
like this turning up in the process list:

  14:28:30  up 80 days,  5:25,  2 users,  load average: 0.42, 0.15, 0.12
  118 processes: 116 sleeping, 1 running, 1 zombie, 0 stopped
  CPU states:  cpu    user    nice  system    irq  softirq  iowait    idle
               total   0.0%    0.0%    0.9%   0.0%     0.0%    0.0%   99.0%
  Mem:   514664k av,  380296k used,  134368k free,    0k shrd,   37464k buff
           66096k active,            231056k inactive
  Swap: 1052216k av,   14712k used, 1037504k free                 239788k cached

    PID USER     PRI  NI  SIZE  RSS SHARE STAT %CPU %MEM   TIME CPU COMMAND
   6365 root      16   0  1244 1244   880 R     0.9  0.2   0:15   0 top
    156 root      15   0     0    0     0 DW    0.0  0.0  25:45   0 kjournald

The kjournald process is quickly joined by a bunch of nfsd's:

  14:28:34  up 80 days,  5:25,  2 users,  load average: 0.63, 0.20, 0.14
  118 processes: 115 sleeping, 2 running, 1 zombie, 0 stopped
  CPU states:  cpu    user    nice  system    irq  softirq  iowait    idle
               total  38.0%    0.0%   31.4%   0.0%     0.0%    0.0%   30.4%
  Mem:   514664k av,  509240k used,    5424k free,    0k shrd,   29328k buff
           64028k active,            361036k inactive
  Swap: 1052216k av,   14712k used, 1037504k free                 375832k cached

    PID USER     PRI  NI  SIZE  RSS SHARE STAT %CPU %MEM   TIME CPU COMMAND
   6365 root      16   0  1244 1244   880 R     0.9  0.2   0:15   0 top
    156 root      15   0     0    0     0 DW    0.0  0.0  25:45   0 kjournald
   2021 root      15   0     0    0     0 DW    0.0  0.0  17:38   0 nfsd
   2037 root      15   0     0    0     0 DW    0.0  0.0  17:51   0 nfsd
   2042 root      15   0     0    0     0 DW    0.0  0.0  17:07   0 nfsd
   2044 root      15   0     0    0     0 DW    0.0  0.0  16:44   0 nfsd

And then the bdflush process joins in and the number of nfsd processes
grows, then shrinks back, and finally they all disappear.

Full log is at:

  http://www.stams.strath.ac.uk/~ian/nfs/   (file top.log, 2.8MB)

Events are around 14:29:10, 14:49:00, 14:47:50, and a belter at 15:11:05.

Is there anything more useful I can do?
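Picking those events out of a 2.8MB batch-mode top log by hand is
tedious; a short script can do the same scan. This is just a sketch
under the assumption that the log matches the 2.4-era procps layout
shown above (uptime line starting each sample, "DW" in the STAT
column); the function name and patterns are my own, not from any
existing tool:

```python
# Scan a batch-mode top log and report each sample timestamp where one
# or more nfsd threads sit in uninterruptible sleep ("DW" state) - the
# signature of the stalls described above. Patterns assume the procps
# output format shown in the excerpts; adjust for other versions.
import re

def stalled_nfsd_times(log_text, min_nfsd=1):
    """Return (timestamp, stuck_nfsd_count) pairs for suspect samples."""
    results = []
    # Each sample starts with an uptime line like "14:28:34  up 80 days, ..."
    samples = re.split(r"\n(?=\s*\d{2}:\d{2}:\d{2}\s+up )", log_text)
    for sample in samples:
        m = re.match(r"\s*(\d{2}:\d{2}:\d{2})\s+up ", sample)
        if not m:
            continue
        nfsd_stuck = len(re.findall(r"\bDW\b.*\bnfsd\b", sample))
        if nfsd_stuck >= min_nfsd:
            results.append((m.group(1), nfsd_stuck))
    return results
```

Feeding it the contents of a log captured with "top -b -d 1 -i" should
print the same timestamps found by eyeballing the file.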
Many thanks

Ian

--
Ian Thurlbeck                            http://www.stams.strath.ac.uk/
Statistics and Modelling Science, University of Strathclyde
Livingstone Tower, 26 Richmond Street, Glasgow, UK, G1 1XH
Tel: +44 (0)141 548 3667            Fax: +44 (0)141 552 2079