2010-04-22 18:25:44

by J. Bruce Fields

[permalink] [raw]
Subject: Re: NFS and /dev/mdXpY

On Sat, Apr 17, 2010 at 07:57:47PM +0400, Vlad Glagolev wrote:
> Well, hello there,
>
> Posted it on linux-kernel ML also, and post it here, for more specific analysis.
>
> I faced this problem today while trying to mount some NFS share on OpenBSD box.
> I mounted it successfully without any visible errors, but I wasn't able to cd there, the printed error was:
>
> ksh: cd: /storage - Stale NFS file handle
>
> Apropos, the partition is 5.5 TB. I tried another one on my box and it was mounted successfully. It was possible to manage files there too. Its size is ~3GB.
> That's why the first time I thought about some size limitations of OpenBSD/Linux/NFS.
>
> While talking on #openbsd @ freenode, I discovered this via tcpdump on both sides:
>
> http://pastebin.ca/1864713
>
> Googling for 3 hours didn't help at all, some posts had similiar issue but either with no answer at all or without any full description.
>
> Then I started to experiment with another Linux box to kill the possible different variants.
>
> On another box I also have nfs-utils 1.1.6 and kernel 2.6.32. Mounting that big partition was unsuccessful, it got just stuck. On tcpdump I've seen this:

I'm a bit confused. What kernel and nfs-utils version is running on the
problematic Linux server?

Also, what are the contents of /proc/net/rpc/nfsd.export/content and
/proc/net/rpc/nfsd.content after you try to access the filesystem?

--b.

>
> --
> 172.17.2.5.884 > 172.17.2.2.2049: Flags [.], cksum 0x25e4 (correct), seq 1, ack 1, win 92, options [nop,nop,TS val 1808029984 ecr 1618999], length 0
> 172.17.2.5.3565791363 > 172.17.2.2.2049: 40 null
> 172.17.2.2.2049 > 172.17.2.5.884: Flags [.], cksum 0x25e6 (correct), seq 1, ack 45, win 46, options [nop,nop,TS val 1618999 ecr 1808029984], length 0
> 172.17.2.2.2049 > 172.17.2.5.3565791363: reply ok 24 null
> 172.17.2.5.884 > 172.17.2.2.2049: Flags [.], cksum 0x259b (correct), seq 45, ack 29, win 92, options [nop,nop,TS val 1808029985 ecr 1618999], length 0
> 172.17.2.5.3582568579 > 172.17.2.2.2049: 40 null
> 172.17.2.2.2049 > 172.17.2.5.3582568579: reply ok 24 null
> 172.17.2.5.3599345795 > 172.17.2.2.2049: 92 fsinfo fh Unknown/0100030005030100000800000000000000000000000000000000000000000000
> 172.17.2.2.2049 > 172.17.2.5.3599345795: reply ok 32 fsinfo ERROR: Stale NFS file handle POST:
> 172.17.2.5.3616123011 > 172.17.2.2.2049: 92 fsinfo fh Unknown/0100030005030100000800000000000000000000000000000000000000000000
> 172.17.2.2.2049 > 172.17.2.5.3616123011: reply ok 32 fsinfo ERROR: Stale NFS file handle POST:
> 172.17.2.5.884 > 172.17.2.2.2049: Flags [F.], cksum 0x2449 (correct), seq 281, ack 129, win 92, options [nop,nop,TS val 1808029986 ecr 1618999], length 0
> 172.17.2.2.2049 > 172.17.2.5.884: Flags [F.], cksum 0x2476 (correct), seq 129, ack 282, win 46, options [nop,nop,TS val 1618999 ecr 1808029986], length 0
> 172.17.2.5.884 > 172.17.2.2.2049: Flags [.], cksum 0x2448 (correct), seq 282, ack 130, win 92, options [nop,nop,TS val 1808029986 ecr 1618999], length 0
> --
>
> familiar messages, eh?
>
> Since that time I've solved that's not OpenBSD problem. So only NFS and Linux left as the reasons of this.
> It was possible to mount that small partition on Linux box too, the same as on OpenBSD.
>
> But afterthat I recongnized an interesting issue: I have different sw raid setups on my storage server.
> I tried to mount a small partition on the same md device where 5.5TB partition is located, and got the same
> error message! Now I'm sure it's about NFS <-> MDADM setup, that's why I called the topic like this.
>
> A bit about my setup:
>
> # cat /proc/mdstat
> Personalities : [linear] [raid0] [raid1] [raid6] [raid5] [raid4] [multipath]
> md3 : active raid1 sdc1[0] sdd1[1]
> 61376 blocks [2/2] [UU]
>
> md1 : active raid5 sdc2[2] sdd2[3] sdb2[1] sda2[0]
> 3153408 blocks level 5, 512k chunk, algorithm 2 [4/4] [UUUU]
>
> md2 : active raid5 sdc3[2] sdd3[3] sdb3[1] sda3[0]
> 5857199616 blocks level 5, 512k chunk, algorithm 2 [4/4] [UUUU]
>
> md0 : active raid1 sdb1[1] sda1[0]
> 61376 blocks [2/2] [UU]
>
> unused devices: <none>
>
> md0, md1, and md3 aren't so interesting, since fs is created directly on them, and that's a _problem device_:
>
> # parted /dev/md2
> GNU Parted 2.2
> Using /dev/md2
> Welcome to GNU Parted! Type 'help' to view a list of commands.
> (parted) p free
> p free
> Model: Unknown (unknown)
> Disk /dev/md2: 5998GB
> Sector size (logical/physical): 512B/512B
> Partition Table: gpt
>
> Number Start End Size File system Name Flags
> 17.4kB 1049kB 1031kB Free Space
> 1 1049kB 2147MB 2146MB linux-swap(v1) swap
> 2 2147MB 23.6GB 21.5GB xfs home
> 3 23.6GB 24.7GB 1074MB xfs temp
> 4 24.7GB 35.4GB 10.7GB xfs user
> 5 35.4GB 51.5GB 16.1GB xfs var
> 6 51.5GB 5998GB 5946GB xfs vault
> 5998GB 5998GB 507kB Free Space
>
> # ls /dev/md?*
> /dev/md0 /dev/md1 /dev/md2 /dev/md2p1 /dev/md2p2 /dev/md2p3 /dev/md2p4 /dev/md2p5 /dev/md2p6 /dev/md3
>
> It's very handy partitioning scheme where I can extend (grow 5th raid) with more hdds only /vault partition while "loosing" (a.k.a. not using for this partition) only ~1gb of space from every 2TB drive.
>
> System boots ok and xfs_check passes with no problems, etc.
> The only problem: it's not possible to use NFS shares on any partition of /dev/md2 device.
>
> Finally, my question to NFS and MDADM developers: any idea?
>
> --
> Dont wait to die to find paradise...
> --
> Cheerz,
> Vlad "Stealth" Glagolev




2010-04-22 20:08:07

by Vlad Glagolev

[permalink] [raw]
Subject: Re: NFS and /dev/mdXpY

On Thu, 22 Apr 2010 15:56:21 -0400
Trond Myklebust <[email protected]> wrote:

> On Thu, 2010-04-22 at 23:51 +0400, Vlad Glagolev wrote:
> > On Thu, 22 Apr 2010 15:47:30 -0400
> > Trond Myklebust <[email protected]> wrote:
> >
> > > On Thu, 2010-04-22 at 15:32 -0400, J. Bruce Fields wrote:
> > > > On Thu, Apr 22, 2010 at 10:53:10PM +0400, Vlad Glagolev wrote:
> > > > > On Thu, 22 Apr 2010 14:25:43 -0400
> > > > > "J. Bruce Fields" <[email protected]> wrote:
> > > > >
> > > > > > On Sat, Apr 17, 2010 at 07:57:47PM +0400, Vlad Glagolev wrote:
> > > > > > > Well, hello there,
> > > > > > >
> > > > > > > Posted it on linux-kernel ML also, and post it here, for more specific analysis.
> > > > > > >
> > > > > > > I faced this problem today while trying to mount some NFS share on OpenBSD box.
> > > > > > > I mounted it successfully without any visible errors, but I wasn't able to cd there, the printed error was:
> > > > > > >
> > > > > > > ksh: cd: /storage - Stale NFS file handle
> > > > > > >
> > > > > > > Apropos, the partition is 5.5 TB. I tried another one on my box and it was mounted successfully. It was possible to manage files there too. Its size is ~3GB.
> > > > > > > That's why the first time I thought about some size limitations of OpenBSD/Linux/NFS.
> > > > > > >
> > > > > > > While talking on #openbsd @ freenode, I discovered this via tcpdump on both sides:
> > > > > > >
> > > > > > > http://pastebin.ca/1864713
> > > > > > >
> > > > > > > Googling for 3 hours didn't help at all, some posts had similiar issue but either with no answer at all or without any full description.
> > > > > > >
> > > > > > > Then I started to experiment with another Linux box to kill the possible different variants.
> > > > > > >
> > > > > > > On another box I also have nfs-utils 1.1.6 and kernel 2.6.32. Mounting that big partition was unsuccessful, it got just stuck. On tcpdump I've seen this:
> > > > > >
> > > > > > I'm a bit confused. What kernel and nfs-utils version is running on the
> > > > > > problematic Linux server?
> > > > >
> > > > > same. nfs-utils 1.1.6 and kernel 2.6.32.
> > > >
> > > > Huh. That should be new enough for it to be using uuid's. I wonder why
> > > > it isn't?
> > >
> > > What are the contents of /dev/disk/by-uuid?
> > >
> >
> > $ ls -1 /dev/disk/by-uuid/
> > 0e9742f6-44e3-431c-911f-4c914e4f81d5
> > 31ae89c6-dba6-4351-b3a9-e8b08be07c3d
> > 4429ba7a-afd2-4c61-83a0-900dae1bccdc
> > 463bbc42-c19b-4b9e-bae7-838ac0e2e5c6
> > 473b9320-88a4-44eb-b592-2ac98619bc9b
> > 53d16b07-d496-4f8f-ad59-ea34aaf169f4
> > 6adf1c55-405c-43cf-a84d-be5d2746d300
> > b35a7bca-12ad-4738-a895-52f20b7cc5d9
> > dc892f1f-0b83-41dd-bde7-0761295f33a3
> > f7ac4165-320f-4235-a78a-5fe1bd0aac24
> >
>
> So, when you do 'ls -l' on the above, you do indeed see all the
> partitions that are being exported via NFS?

there's only one, and yes, you're right.. /dev/md2p6 isn't in the list.

According to blkid:

# blkid
/dev/sda1: UUID="9a9beac0-d5fa-94d1-86df-b710391e5a7c" TYPE="linux_raid_member"
/dev/sda2: UUID="8327bc1c-3d13-ee78-86df-b710391e5a7c" TYPE="linux_raid_member"
/dev/sda3: UUID="01821f8d-da8e-fd6a-86df-b710391e5a7c" TYPE="linux_raid_member"
/dev/sdb1: UUID="9a9beac0-d5fa-94d1-86df-b710391e5a7c" TYPE="linux_raid_member"
/dev/sdb2: UUID="8327bc1c-3d13-ee78-86df-b710391e5a7c" TYPE="linux_raid_member"
/dev/sdb3: UUID="01821f8d-da8e-fd6a-86df-b710391e5a7c" TYPE="linux_raid_member"
/dev/sdc1: UUID="ed9c6039-5faf-d0ef-86df-b710391e5a7c" TYPE="linux_raid_member"
/dev/sdc2: UUID="8327bc1c-3d13-ee78-86df-b710391e5a7c" TYPE="linux_raid_member"
/dev/sdc3: UUID="01821f8d-da8e-fd6a-86df-b710391e5a7c" TYPE="linux_raid_member"
/dev/sdd1: UUID="ed9c6039-5faf-d0ef-86df-b710391e5a7c" TYPE="linux_raid_member"
/dev/sdd2: UUID="8327bc1c-3d13-ee78-86df-b710391e5a7c" TYPE="linux_raid_member"
/dev/sdd3: UUID="01821f8d-da8e-fd6a-86df-b710391e5a7c" TYPE="linux_raid_member"
/dev/md0: UUID="53d16b07-d496-4f8f-ad59-ea34aaf169f4" TYPE="xfs"
/dev/md2p1: UUID="31ae89c6-dba6-4351-b3a9-e8b08be07c3d" TYPE="swap"
/dev/md2p2: UUID="dc892f1f-0b83-41dd-bde7-0761295f33a3" TYPE="xfs"
/dev/md2p3: UUID="f7ac4165-320f-4235-a78a-5fe1bd0aac24" TYPE="xfs"
/dev/md2p4: UUID="4429ba7a-afd2-4c61-83a0-900dae1bccdc" TYPE="xfs"
/dev/md2p5: UUID="b35a7bca-12ad-4738-a895-52f20b7cc5d9" TYPE="xfs"
/dev/md2p6: UUID="9f8d66e8-4b11-4ae7-8f90-b00ee9d204b1" TYPE="xfs"
/dev/md1: UUID="0e9742f6-44e3-431c-911f-4c914e4f81d5" TYPE="xfs"
/dev/md3: UUID="463bbc42-c19b-4b9e-bae7-838ac0e2e5c6" TYPE="xfs"
/dev/sde1: UUID="6adf1c55-405c-43cf-a84d-be5d2746d300" TYPE="ext2"

and not all of them exist at /dev/disk/by-uuid:

# ls -1 /dev/disk/by-uuid
0e9742f6-44e3-431c-911f-4c914e4f81d5 <- /dev/md1
31ae89c6-dba6-4351-b3a9-e8b08be07c3d <- /dev/md2p1
4429ba7a-afd2-4c61-83a0-900dae1bccdc <- /dev/md2p4
463bbc42-c19b-4b9e-bae7-838ac0e2e5c6 <- /dev/md3
473b9320-88a4-44eb-b592-2ac98619bc9b <- ??
53d16b07-d496-4f8f-ad59-ea34aaf169f4 <- /dev/md0
6adf1c55-405c-43cf-a84d-be5d2746d300 <- /dev/sde1
b35a7bca-12ad-4738-a895-52f20b7cc5d9 <- /dev/md2p5
dc892f1f-0b83-41dd-bde7-0761295f33a3 <- /dev/md2p2
f7ac4165-320f-4235-a78a-5fe1bd0aac24 <- /dev/md2p3

Interesting. so 9f8d66e8-4b11-4ae7-8f90-b00ee9d204b1 != 473b9320-88a4-44eb-b592-2ac98619bc9b.

Bug in udev?

--
Dont wait to die to find paradise...
--
Cheerz,
Vlad "Stealth" Glagolev


Attachments:
(No filename) (5.26 kB)
(No filename) (198.00 B)
Download all attachments

2010-04-22 18:53:22

by Vlad Glagolev

[permalink] [raw]
Subject: Re: NFS and /dev/mdXpY

On Thu, 22 Apr 2010 14:25:43 -0400
"J. Bruce Fields" <[email protected]> wrote:

> On Sat, Apr 17, 2010 at 07:57:47PM +0400, Vlad Glagolev wrote:
> > Well, hello there,
> >
> > Posted it on linux-kernel ML also, and post it here, for more specific analysis.
> >
> > I faced this problem today while trying to mount some NFS share on OpenBSD box.
> > I mounted it successfully without any visible errors, but I wasn't able to cd there, the printed error was:
> >
> > ksh: cd: /storage - Stale NFS file handle
> >
> > Apropos, the partition is 5.5 TB. I tried another one on my box and it was mounted successfully. It was possible to manage files there too. Its size is ~3GB.
> > That's why the first time I thought about some size limitations of OpenBSD/Linux/NFS.
> >
> > While talking on #openbsd @ freenode, I discovered this via tcpdump on both sides:
> >
> > http://pastebin.ca/1864713
> >
> > Googling for 3 hours didn't help at all, some posts had similiar issue but either with no answer at all or without any full description.
> >
> > Then I started to experiment with another Linux box to kill the possible different variants.
> >
> > On another box I also have nfs-utils 1.1.6 and kernel 2.6.32. Mounting that big partition was unsuccessful, it got just stuck. On tcpdump I've seen this:
>
> I'm a bit confused. What kernel and nfs-utils version is running on the
> problematic Linux server?

same. nfs-utils 1.1.6 and kernel 2.6.32.

>
> Also, what are the contents of /proc/net/rpc/nfsd.export/content and
> /proc/net/rpc/nfsd.content after you try to access the filesystem?

# cat /proc/net/rpc/nfsd.export/content
#path domain(flags)
/vault 172.17.2.5(ro,insecure,root_squash,sync,wdelay,no_subtree_check)

# cat /proc/net/rpc/nfsd.fh/content
#domain fsidtype fsid [path]
# 172.17.2.5 3 0x0001030500000803
172.17.2.5 0 0x0500030100000803 /vault

>
> --b.
>
> >
> > --
> > 172.17.2.5.884 > 172.17.2.2.2049: Flags [.], cksum 0x25e4 (correct), seq 1, ack 1, win 92, options [nop,nop,TS val 1808029984 ecr 1618999], length 0
> > 172.17.2.5.3565791363 > 172.17.2.2.2049: 40 null
> > 172.17.2.2.2049 > 172.17.2.5.884: Flags [.], cksum 0x25e6 (correct), seq 1, ack 45, win 46, options [nop,nop,TS val 1618999 ecr 1808029984], length 0
> > 172.17.2.2.2049 > 172.17.2.5.3565791363: reply ok 24 null
> > 172.17.2.5.884 > 172.17.2.2.2049: Flags [.], cksum 0x259b (correct), seq 45, ack 29, win 92, options [nop,nop,TS val 1808029985 ecr 1618999], length 0
> > 172.17.2.5.3582568579 > 172.17.2.2.2049: 40 null
> > 172.17.2.2.2049 > 172.17.2.5.3582568579: reply ok 24 null
> > 172.17.2.5.3599345795 > 172.17.2.2.2049: 92 fsinfo fh Unknown/0100030005030100000800000000000000000000000000000000000000000000
> > 172.17.2.2.2049 > 172.17.2.5.3599345795: reply ok 32 fsinfo ERROR: Stale NFS file handle POST:
> > 172.17.2.5.3616123011 > 172.17.2.2.2049: 92 fsinfo fh Unknown/0100030005030100000800000000000000000000000000000000000000000000
> > 172.17.2.2.2049 > 172.17.2.5.3616123011: reply ok 32 fsinfo ERROR: Stale NFS file handle POST:
> > 172.17.2.5.884 > 172.17.2.2.2049: Flags [F.], cksum 0x2449 (correct), seq 281, ack 129, win 92, options [nop,nop,TS val 1808029986 ecr 1618999], length 0
> > 172.17.2.2.2049 > 172.17.2.5.884: Flags [F.], cksum 0x2476 (correct), seq 129, ack 282, win 46, options [nop,nop,TS val 1618999 ecr 1808029986], length 0
> > 172.17.2.5.884 > 172.17.2.2.2049: Flags [.], cksum 0x2448 (correct), seq 282, ack 130, win 92, options [nop,nop,TS val 1808029986 ecr 1618999], length 0
> > --
> >
> > familiar messages, eh?
> >
> > Since that time I've solved that's not OpenBSD problem. So only NFS and Linux left as the reasons of this.
> > It was possible to mount that small partition on Linux box too, the same as on OpenBSD.
> >
> > But afterthat I recongnized an interesting issue: I have different sw raid setups on my storage server.
> > I tried to mount a small partition on the same md device where 5.5TB partition is located, and got the same
> > error message! Now I'm sure it's about NFS <-> MDADM setup, that's why I called the topic like this.
> >
> > A bit about my setup:
> >
> > # cat /proc/mdstat
> > Personalities : [linear] [raid0] [raid1] [raid6] [raid5] [raid4] [multipath]
> > md3 : active raid1 sdc1[0] sdd1[1]
> > 61376 blocks [2/2] [UU]
> >
> > md1 : active raid5 sdc2[2] sdd2[3] sdb2[1] sda2[0]
> > 3153408 blocks level 5, 512k chunk, algorithm 2 [4/4] [UUUU]
> >
> > md2 : active raid5 sdc3[2] sdd3[3] sdb3[1] sda3[0]
> > 5857199616 blocks level 5, 512k chunk, algorithm 2 [4/4] [UUUU]
> >
> > md0 : active raid1 sdb1[1] sda1[0]
> > 61376 blocks [2/2] [UU]
> >
> > unused devices: <none>
> >
> > md0, md1, and md3 aren't so interesting, since fs is created directly on them, and that's a _problem device_:
> >
> > # parted /dev/md2
> > GNU Parted 2.2
> > Using /dev/md2
> > Welcome to GNU Parted! Type 'help' to view a list of commands.
> > (parted) p free
> > p free
> > Model: Unknown (unknown)
> > Disk /dev/md2: 5998GB
> > Sector size (logical/physical): 512B/512B
> > Partition Table: gpt
> >
> > Number Start End Size File system Name Flags
> > 17.4kB 1049kB 1031kB Free Space
> > 1 1049kB 2147MB 2146MB linux-swap(v1) swap
> > 2 2147MB 23.6GB 21.5GB xfs home
> > 3 23.6GB 24.7GB 1074MB xfs temp
> > 4 24.7GB 35.4GB 10.7GB xfs user
> > 5 35.4GB 51.5GB 16.1GB xfs var
> > 6 51.5GB 5998GB 5946GB xfs vault
> > 5998GB 5998GB 507kB Free Space
> >
> > # ls /dev/md?*
> > /dev/md0 /dev/md1 /dev/md2 /dev/md2p1 /dev/md2p2 /dev/md2p3 /dev/md2p4 /dev/md2p5 /dev/md2p6 /dev/md3
> >
> > It's very handy partitioning scheme where I can extend (grow 5th raid) with more hdds only /vault partition while "loosing" (a.k.a. not using for this partition) only ~1gb of space from every 2TB drive.
> >
> > System boots ok and xfs_check passes with no problems, etc.
> > The only problem: it's not possible to use NFS shares on any partition of /dev/md2 device.
> >
> > Finally, my question to NFS and MDADM developers: any idea?
> >
> > --
> > Dont wait to die to find paradise...
> > --
> > Cheerz,
> > Vlad "Stealth" Glagolev
>
>


--
Dont wait to die to find paradise...
--
Cheerz,
Vlad "Stealth" Glagolev


Attachments:
(No filename) (6.36 kB)
(No filename) (198.00 B)
Download all attachments

2010-04-22 19:32:36

by J. Bruce Fields

[permalink] [raw]
Subject: Re: NFS and /dev/mdXpY

On Thu, Apr 22, 2010 at 10:53:10PM +0400, Vlad Glagolev wrote:
> On Thu, 22 Apr 2010 14:25:43 -0400
> "J. Bruce Fields" <[email protected]> wrote:
>
> > On Sat, Apr 17, 2010 at 07:57:47PM +0400, Vlad Glagolev wrote:
> > > Well, hello there,
> > >
> > > Posted it on linux-kernel ML also, and post it here, for more specific analysis.
> > >
> > > I faced this problem today while trying to mount some NFS share on OpenBSD box.
> > > I mounted it successfully without any visible errors, but I wasn't able to cd there, the printed error was:
> > >
> > > ksh: cd: /storage - Stale NFS file handle
> > >
> > > Apropos, the partition is 5.5 TB. I tried another one on my box and it was mounted successfully. It was possible to manage files there too. Its size is ~3GB.
> > > That's why the first time I thought about some size limitations of OpenBSD/Linux/NFS.
> > >
> > > While talking on #openbsd @ freenode, I discovered this via tcpdump on both sides:
> > >
> > > http://pastebin.ca/1864713
> > >
> > > Googling for 3 hours didn't help at all, some posts had similiar issue but either with no answer at all or without any full description.
> > >
> > > Then I started to experiment with another Linux box to kill the possible different variants.
> > >
> > > On another box I also have nfs-utils 1.1.6 and kernel 2.6.32. Mounting that big partition was unsuccessful, it got just stuck. On tcpdump I've seen this:
> >
> > I'm a bit confused. What kernel and nfs-utils version is running on the
> > problematic Linux server?
>
> same. nfs-utils 1.1.6 and kernel 2.6.32.

Huh. That should be new enough for it to be using uuid's. I wonder why
it isn't?

--b.

>
> >
> > Also, what are the contents of /proc/net/rpc/nfsd.export/content and
> > /proc/net/rpc/nfsd.content after you try to access the filesystem?
>
> # cat /proc/net/rpc/nfsd.export/content
> #path domain(flags)
> /vault 172.17.2.5(ro,insecure,root_squash,sync,wdelay,no_subtree_check)
>
> # cat /proc/net/rpc/nfsd.fh/content
> #domain fsidtype fsid [path]
> # 172.17.2.5 3 0x0001030500000803
> 172.17.2.5 0 0x0500030100000803 /vault
>
> >
> > --b.
> >
> > >
> > > --
> > > 172.17.2.5.884 > 172.17.2.2.2049: Flags [.], cksum 0x25e4 (correct), seq 1, ack 1, win 92, options [nop,nop,TS val 1808029984 ecr 1618999], length 0
> > > 172.17.2.5.3565791363 > 172.17.2.2.2049: 40 null
> > > 172.17.2.2.2049 > 172.17.2.5.884: Flags [.], cksum 0x25e6 (correct), seq 1, ack 45, win 46, options [nop,nop,TS val 1618999 ecr 1808029984], length 0
> > > 172.17.2.2.2049 > 172.17.2.5.3565791363: reply ok 24 null
> > > 172.17.2.5.884 > 172.17.2.2.2049: Flags [.], cksum 0x259b (correct), seq 45, ack 29, win 92, options [nop,nop,TS val 1808029985 ecr 1618999], length 0
> > > 172.17.2.5.3582568579 > 172.17.2.2.2049: 40 null
> > > 172.17.2.2.2049 > 172.17.2.5.3582568579: reply ok 24 null
> > > 172.17.2.5.3599345795 > 172.17.2.2.2049: 92 fsinfo fh Unknown/0100030005030100000800000000000000000000000000000000000000000000
> > > 172.17.2.2.2049 > 172.17.2.5.3599345795: reply ok 32 fsinfo ERROR: Stale NFS file handle POST:
> > > 172.17.2.5.3616123011 > 172.17.2.2.2049: 92 fsinfo fh Unknown/0100030005030100000800000000000000000000000000000000000000000000
> > > 172.17.2.2.2049 > 172.17.2.5.3616123011: reply ok 32 fsinfo ERROR: Stale NFS file handle POST:
> > > 172.17.2.5.884 > 172.17.2.2.2049: Flags [F.], cksum 0x2449 (correct), seq 281, ack 129, win 92, options [nop,nop,TS val 1808029986 ecr 1618999], length 0
> > > 172.17.2.2.2049 > 172.17.2.5.884: Flags [F.], cksum 0x2476 (correct), seq 129, ack 282, win 46, options [nop,nop,TS val 1618999 ecr 1808029986], length 0
> > > 172.17.2.5.884 > 172.17.2.2.2049: Flags [.], cksum 0x2448 (correct), seq 282, ack 130, win 92, options [nop,nop,TS val 1808029986 ecr 1618999], length 0
> > > --
> > >
> > > familiar messages, eh?
> > >
> > > Since that time I've solved that's not OpenBSD problem. So only NFS and Linux left as the reasons of this.
> > > It was possible to mount that small partition on Linux box too, the same as on OpenBSD.
> > >
> > > But afterthat I recongnized an interesting issue: I have different sw raid setups on my storage server.
> > > I tried to mount a small partition on the same md device where 5.5TB partition is located, and got the same
> > > error message! Now I'm sure it's about NFS <-> MDADM setup, that's why I called the topic like this.
> > >
> > > A bit about my setup:
> > >
> > > # cat /proc/mdstat
> > > Personalities : [linear] [raid0] [raid1] [raid6] [raid5] [raid4] [multipath]
> > > md3 : active raid1 sdc1[0] sdd1[1]
> > > 61376 blocks [2/2] [UU]
> > >
> > > md1 : active raid5 sdc2[2] sdd2[3] sdb2[1] sda2[0]
> > > 3153408 blocks level 5, 512k chunk, algorithm 2 [4/4] [UUUU]
> > >
> > > md2 : active raid5 sdc3[2] sdd3[3] sdb3[1] sda3[0]
> > > 5857199616 blocks level 5, 512k chunk, algorithm 2 [4/4] [UUUU]
> > >
> > > md0 : active raid1 sdb1[1] sda1[0]
> > > 61376 blocks [2/2] [UU]
> > >
> > > unused devices: <none>
> > >
> > > md0, md1, and md3 aren't so interesting, since fs is created directly on them, and that's a _problem device_:
> > >
> > > # parted /dev/md2
> > > GNU Parted 2.2
> > > Using /dev/md2
> > > Welcome to GNU Parted! Type 'help' to view a list of commands.
> > > (parted) p free
> > > p free
> > > Model: Unknown (unknown)
> > > Disk /dev/md2: 5998GB
> > > Sector size (logical/physical): 512B/512B
> > > Partition Table: gpt
> > >
> > > Number Start End Size File system Name Flags
> > > 17.4kB 1049kB 1031kB Free Space
> > > 1 1049kB 2147MB 2146MB linux-swap(v1) swap
> > > 2 2147MB 23.6GB 21.5GB xfs home
> > > 3 23.6GB 24.7GB 1074MB xfs temp
> > > 4 24.7GB 35.4GB 10.7GB xfs user
> > > 5 35.4GB 51.5GB 16.1GB xfs var
> > > 6 51.5GB 5998GB 5946GB xfs vault
> > > 5998GB 5998GB 507kB Free Space
> > >
> > > # ls /dev/md?*
> > > /dev/md0 /dev/md1 /dev/md2 /dev/md2p1 /dev/md2p2 /dev/md2p3 /dev/md2p4 /dev/md2p5 /dev/md2p6 /dev/md3
> > >
> > > It's very handy partitioning scheme where I can extend (grow 5th raid) with more hdds only /vault partition while "loosing" (a.k.a. not using for this partition) only ~1gb of space from every 2TB drive.
> > >
> > > System boots ok and xfs_check passes with no problems, etc.
> > > The only problem: it's not possible to use NFS shares on any partition of /dev/md2 device.
> > >
> > > Finally, my question to NFS and MDADM developers: any idea?
> > >
> > > --
> > > Dont wait to die to find paradise...
> > > --
> > > Cheerz,
> > > Vlad "Stealth" Glagolev
> >
> >
>
>
> --
> Dont wait to die to find paradise...
> --
> Cheerz,
> Vlad "Stealth" Glagolev



2010-04-22 19:47:37

by Trond Myklebust

[permalink] [raw]
Subject: Re: NFS and /dev/mdXpY

On Thu, 2010-04-22 at 15:32 -0400, J. Bruce Fields wrote:
> On Thu, Apr 22, 2010 at 10:53:10PM +0400, Vlad Glagolev wrote:
> > On Thu, 22 Apr 2010 14:25:43 -0400
> > "J. Bruce Fields" <[email protected]> wrote:
> >
> > > On Sat, Apr 17, 2010 at 07:57:47PM +0400, Vlad Glagolev wrote:
> > > > Well, hello there,
> > > >
> > > > Posted it on linux-kernel ML also, and post it here, for more specific analysis.
> > > >
> > > > I faced this problem today while trying to mount some NFS share on OpenBSD box.
> > > > I mounted it successfully without any visible errors, but I wasn't able to cd there, the printed error was:
> > > >
> > > > ksh: cd: /storage - Stale NFS file handle
> > > >
> > > > Apropos, the partition is 5.5 TB. I tried another one on my box and it was mounted successfully. It was possible to manage files there too. Its size is ~3GB.
> > > > That's why the first time I thought about some size limitations of OpenBSD/Linux/NFS.
> > > >
> > > > While talking on #openbsd @ freenode, I discovered this via tcpdump on both sides:
> > > >
> > > > http://pastebin.ca/1864713
> > > >
> > > > Googling for 3 hours didn't help at all, some posts had similiar issue but either with no answer at all or without any full description.
> > > >
> > > > Then I started to experiment with another Linux box to kill the possible different variants.
> > > >
> > > > On another box I also have nfs-utils 1.1.6 and kernel 2.6.32. Mounting that big partition was unsuccessful, it got just stuck. On tcpdump I've seen this:
> > >
> > > I'm a bit confused. What kernel and nfs-utils version is running on the
> > > problematic Linux server?
> >
> > same. nfs-utils 1.1.6 and kernel 2.6.32.
>
> Huh. That should be new enough for it to be using uuid's. I wonder why
> it isn't?

What are the contents of /dev/disk/by-uuid?


2010-04-22 19:51:00

by Vlad Glagolev

[permalink] [raw]
Subject: Re: NFS and /dev/mdXpY

On Thu, 22 Apr 2010 15:47:30 -0400
Trond Myklebust <[email protected]> wrote:

> On Thu, 2010-04-22 at 15:32 -0400, J. Bruce Fields wrote:
> > On Thu, Apr 22, 2010 at 10:53:10PM +0400, Vlad Glagolev wrote:
> > > On Thu, 22 Apr 2010 14:25:43 -0400
> > > "J. Bruce Fields" <[email protected]> wrote:
> > >
> > > > On Sat, Apr 17, 2010 at 07:57:47PM +0400, Vlad Glagolev wrote:
> > > > > Well, hello there,
> > > > >
> > > > > Posted it on linux-kernel ML also, and post it here, for more specific analysis.
> > > > >
> > > > > I faced this problem today while trying to mount some NFS share on OpenBSD box.
> > > > > I mounted it successfully without any visible errors, but I wasn't able to cd there, the printed error was:
> > > > >
> > > > > ksh: cd: /storage - Stale NFS file handle
> > > > >
> > > > > Apropos, the partition is 5.5 TB. I tried another one on my box and it was mounted successfully. It was possible to manage files there too. Its size is ~3GB.
> > > > > That's why the first time I thought about some size limitations of OpenBSD/Linux/NFS.
> > > > >
> > > > > While talking on #openbsd @ freenode, I discovered this via tcpdump on both sides:
> > > > >
> > > > > http://pastebin.ca/1864713
> > > > >
> > > > > Googling for 3 hours didn't help at all, some posts had similiar issue but either with no answer at all or without any full description.
> > > > >
> > > > > Then I started to experiment with another Linux box to kill the possible different variants.
> > > > >
> > > > > On another box I also have nfs-utils 1.1.6 and kernel 2.6.32. Mounting that big partition was unsuccessful, it got just stuck. On tcpdump I've seen this:
> > > >
> > > > I'm a bit confused. What kernel and nfs-utils version is running on the
> > > > problematic Linux server?
> > >
> > > same. nfs-utils 1.1.6 and kernel 2.6.32.
> >
> > Huh. That should be new enough for it to be using uuid's. I wonder why
> > it isn't?
>
> What are the contents of /dev/disk/by-uuid?
>

$ ls -1 /dev/disk/by-uuid/
0e9742f6-44e3-431c-911f-4c914e4f81d5
31ae89c6-dba6-4351-b3a9-e8b08be07c3d
4429ba7a-afd2-4c61-83a0-900dae1bccdc
463bbc42-c19b-4b9e-bae7-838ac0e2e5c6
473b9320-88a4-44eb-b592-2ac98619bc9b
53d16b07-d496-4f8f-ad59-ea34aaf169f4
6adf1c55-405c-43cf-a84d-be5d2746d300
b35a7bca-12ad-4738-a895-52f20b7cc5d9
dc892f1f-0b83-41dd-bde7-0761295f33a3
f7ac4165-320f-4235-a78a-5fe1bd0aac24

--
Dont wait to die to find paradise...
--
Cheerz,
Vlad "Stealth" Glagolev


Attachments:
(No filename) (2.42 kB)
(No filename) (198.00 B)
Download all attachments

2010-04-22 19:56:21

by Trond Myklebust

[permalink] [raw]
Subject: Re: NFS and /dev/mdXpY

On Thu, 2010-04-22 at 23:51 +0400, Vlad Glagolev wrote:
> On Thu, 22 Apr 2010 15:47:30 -0400
> Trond Myklebust <[email protected]> wrote:
>
> > On Thu, 2010-04-22 at 15:32 -0400, J. Bruce Fields wrote:
> > > On Thu, Apr 22, 2010 at 10:53:10PM +0400, Vlad Glagolev wrote:
> > > > On Thu, 22 Apr 2010 14:25:43 -0400
> > > > "J. Bruce Fields" <[email protected]> wrote:
> > > >
> > > > > On Sat, Apr 17, 2010 at 07:57:47PM +0400, Vlad Glagolev wrote:
> > > > > > Well, hello there,
> > > > > >
> > > > > > Posted it on linux-kernel ML also, and post it here, for more specific analysis.
> > > > > >
> > > > > > I faced this problem today while trying to mount some NFS share on OpenBSD box.
> > > > > > I mounted it successfully without any visible errors, but I wasn't able to cd there, the printed error was:
> > > > > >
> > > > > > ksh: cd: /storage - Stale NFS file handle
> > > > > >
> > > > > > Apropos, the partition is 5.5 TB. I tried another one on my box and it was mounted successfully. It was possible to manage files there too. Its size is ~3GB.
> > > > > > That's why the first time I thought about some size limitations of OpenBSD/Linux/NFS.
> > > > > >
> > > > > > While talking on #openbsd @ freenode, I discovered this via tcpdump on both sides:
> > > > > >
> > > > > > http://pastebin.ca/1864713
> > > > > >
> > > > > > Googling for 3 hours didn't help at all, some posts had similiar issue but either with no answer at all or without any full description.
> > > > > >
> > > > > > Then I started to experiment with another Linux box to kill the possible different variants.
> > > > > >
> > > > > > On another box I also have nfs-utils 1.1.6 and kernel 2.6.32. Mounting that big partition was unsuccessful, it got just stuck. On tcpdump I've seen this:
> > > > >
> > > > > I'm a bit confused. What kernel and nfs-utils version is running on the
> > > > > problematic Linux server?
> > > >
> > > > same. nfs-utils 1.1.6 and kernel 2.6.32.
> > >
> > > Huh. That should be new enough for it to be using uuid's. I wonder why
> > > it isn't?
> >
> > What are the contents of /dev/disk/by-uuid?
> >
>
> $ ls -1 /dev/disk/by-uuid/
> 0e9742f6-44e3-431c-911f-4c914e4f81d5
> 31ae89c6-dba6-4351-b3a9-e8b08be07c3d
> 4429ba7a-afd2-4c61-83a0-900dae1bccdc
> 463bbc42-c19b-4b9e-bae7-838ac0e2e5c6
> 473b9320-88a4-44eb-b592-2ac98619bc9b
> 53d16b07-d496-4f8f-ad59-ea34aaf169f4
> 6adf1c55-405c-43cf-a84d-be5d2746d300
> b35a7bca-12ad-4738-a895-52f20b7cc5d9
> dc892f1f-0b83-41dd-bde7-0761295f33a3
> f7ac4165-320f-4235-a78a-5fe1bd0aac24
>

So, when you do 'ls -l' on the above, you do indeed see all the
partitions that are being exported via NFS?