2008-08-30 09:28:36

by Grant Coady

[permalink] [raw]
Subject: 2.6.27-rc5 nfs stall - more info

Hi there,

Had another stall, server running 2.6.24.7, clients running 2.6.27-rc5.

On the server the netstat -t gave during stall:

grant@deltree:~$ netstat -t
Active Internet connections (w/o servers)
Proto Recv-Q Send-Q Local Address Foreign Address State
tcp 0 0 localhost:50087 localhost:4743 ESTABLISHED
tcp 0 52 deltree.mire.mine.n:ssh sillywin.mire.mine:3052 ESTABLISHED
tcp 0 0 deltree.mire.mine.n:ssh sillywin.mire.mine:3051 ESTABLISHED
tcp 0 0 deltree.mire.mine.n:ssh sillywin.mire.mine:1056 ESTABLISHED
tcp 0 0 localhost:4743 localhost:50087 ESTABLISHED


after stall, shows the delay seems to be in making the NFS connection:
grant@deltree:~$ netstat -t
Active Internet connections (w/o servers)
Proto Recv-Q Send-Q Local Address Foreign Address State
tcp 0 0 localhost:50087 localhost:4743 ESTABLISHED
tcp 0 0 deltree.mire.mine.n:ssh sillywin.mire.mine:3052 ESTABLISHED
tcp 0 0 deltree.mire.mine.n:ssh sillywin.mire.mine:3051 ESTABLISHED
tcp 0 0 deltree.mire.mine.n:ssh sillywin.mire.mine:1056 ESTABLISHED
tcp 0 0 localhost:4743 localhost:50087 ESTABLISHED
tcp 0 0 deltree.mire.mine.:nfsd pooh.mire.mine.nu:rtip ESTABLISHED


On 'pooh', I opened a new root terminal and run a command from nfs
export from deltree, this is when the stall happened, pooh had been
up for about five hours, this from /var/log/messages:
Aug 30 19:10:21 pooh sshd[2615]: Accepted publickey for root from 192.168.1.36 port 1246 ssh2


Just before doing that I'd told 'tosh' to halt via ssh terminal
and the server recorded in /var/log/messages:
Aug 30 19:11:03 deltree mountd[573]: authenticated unmount request from tosh.mire.mine.nu:680 for /home/common (/home/common)

Just coincidence?
.config + dmesg for the clients (.27-rc5a):
http://bugsplatter.id.au/kernel/boxen/pooh/
http://bugsplatter.id.au/kernel/boxen/toshi/
server (.24.7a):
http://bugsplatter.id.au/kernel/boxen/deltree/

Grant.