LinuxLists.cc - [PATCH] Reinstantiating stale inodes

2004-04-23 14:15:26

Subject: [PATCH] Reinstantiating stale inodes

--- linux-2.4.21/fs/nfs/inode.c.org 2004-04-17 18:26:32.000000000 -0400
+++ linux-2.4.21/fs/nfs/inode.c 2004-04-23 03:19:51.000000000 -0400
@@ -953,13 +953,57 @@ nfs_wait_on_inode(struct inode *inode, i
}

/*
+ * Reinstantiate an inode that has gone stale
+ */
+static int
+nfs_reinstantiate(
+ struct inode *dir,
+ struct dentry *dentry,
+ struct nfs_fattr *fattr)
+{
+ int error;
+ struct nfs_fh fhandle;
+ struct inode *inode;
+
+ error = NFS_PROTO(dir)->lookup(dir, &dentry->d_name, &fhandle, fattr);
+ if (!error) {
+ error = -ENOMEM;
+ inode = nfs_fhget(dentry, &fhandle, fattr);
+ if (inode) {
+ d_drop(dentry);
+ dput(dentry);
+ d_instantiate(dentry, inode);
+ dentry->d_time = jiffies;
+ error = 0;
+ }
+ }
+ return error;
+}
+
+/*
* Externally visible revalidation function
*/
int
nfs_revalidate(struct dentry *dentry)
{
struct inode *inode = dentry->d_inode;
- return nfs_revalidate_inode(NFS_SERVER(inode), inode);
+ struct inode *pinode;
+ struct nfs_fattr fattr;
+ int error;
+
+ error = nfs_revalidate_inode(NFS_SERVER(inode), inode);
+ if (!error || error != -ESTALE)
+ return error;
+ /*
+ * We have a stale fh so ask the server for another one
+ */
+ pinode = dentry->d_parent->d_inode;
+ if (nfs_reinstantiate(pinode, dentry, &fattr) == 0) {
+ inode = dentry->d_inode;
+ if (nfs_refresh_inode(inode, &fattr) == 0)
+ error = 0;
+ }
+ return error;
}

/*

Attachments:

linux-2.4.21-nfs-estale.patch (1.38 kB)

2004-04-23 14:33:31

by Olaf Kirch

Subject: Re: [PATCH] Reinstantiating stale inodes

On Fri, Apr 23, 2004 at 10:15:35AM -0400, Steve Dickson wrote:
> Here is a 2.4 patch that will reinstantiate an inode
> when an ESTALE error is returned on a getattr. When
> the error occurs, a lookup is immediately issued
> to get a new fh.

Brrr. Are you sure this is such a good idea? It will have all sorts of
bad side effects. For instance, you may be writing a file named "foo".
Someone else replaces foo with their copy (mv foo-new foo) and your file
handle becomes stale.

If you issue a lookup immediately, you will continue writing, but
now your writes go to the new file and produce garbage.

At a minimum, the lookup should occur at file open, and only if
there are no other users of the inode.

Olaf
--
Olaf Kirch | The Hardware Gods hate me.
[email protected] |
---------------+
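
A minimal user-space sketch of the hazard described above, assuming a local POSIX filesystem rather than NFS. It is not part of the patch; the helper names and the /tmp path are made up for illustration. A writer opens "foo", someone else runs the equivalent of "mv foo-new foo", and a fresh lookup of the name then returns a different inode than the one the writer holds. A client that silently re-looked up the name on ESTALE would redirect the writer to that second file.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <fcntl.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <sys/stat.h>
#include <unistd.h>

/* Create (or truncate) a file and fill it with text. Returns 0 on success. */
static int write_file(const char *path, const char *text)
{
    int fd = open(path, O_WRONLY | O_CREAT | O_TRUNC, 0600);
    ssize_t n;

    if (fd < 0)
        return -1;
    n = write(fd, text, strlen(text));
    close(fd);
    return n == (ssize_t)strlen(text) ? 0 : -1;
}

/*
 * Returns 1 if, after "mv foo-new foo", the name "foo" refers to a
 * different inode than the descriptor opened before the rename,
 * 0 if it is still the same inode, and -1 on error.
 */
static int name_rebinds_after_rename(void)
{
    char dir[] = "/tmp/stale-demo-XXXXXX";   /* hypothetical scratch location */
    char foo[64], foo_new[64];
    struct stat by_fd, by_name;
    int fd, result = -1;

    if (!mkdtemp(dir))
        return -1;
    snprintf(foo, sizeof(foo), "%s/foo", dir);
    snprintf(foo_new, sizeof(foo_new), "%s/foo-new", dir);

    if (write_file(foo, "old contents\n") || write_file(foo_new, "new contents\n"))
        goto out;

    fd = open(foo, O_WRONLY | O_APPEND);     /* the writer's handle on the old file */
    if (fd < 0)
        goto out;

    if (rename(foo_new, foo) == 0 &&         /* someone else: mv foo-new foo */
        fstat(fd, &by_fd) == 0 &&            /* what the writer still holds */
        stat(foo, &by_name) == 0)            /* what a fresh lookup returns */
        result = by_fd.st_ino != by_name.st_ino;
    close(fd);
out:
    unlink(foo);
    unlink(foo_new);
    rmdir(dir);
    return result;
}
```

On a local filesystem the old inode stays alive for as long as the descriptor is open, so the writer is unaffected. Over NFSv2/3 the server may drop the old file, which is where the stale handle comes from.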

-------------------------------------------------------
This SF.net email is sponsored by: The Robotic Monkeys at ThinkGeek
For a limited time only, get FREE Ground shipping on all orders of $35
or more. Hurry up and shop folks, this offer expires April 30th!
http://www.thinkgeek.com/freeshipping/?cpg=12297
_______________________________________________
NFS maillist - [email protected]
https://lists.sourceforge.net/lists/listinfo/nfs

2004-04-23 14:36:18

by Trond Myklebust

Subject: Re: [PATCH] Reinstantiating stale inodes

On Fri, 2004-04-23 at 10:15, Steve Dickson wrote:
> Here is a 2.4 patch that will reinstantiate an inode
> when an ESTALE error is returned on a getattr. When
> the error occurs, a lookup is immediately issued
> to get a new fh.
>
> This fixes the problem of a server running rsync -a on a directory
> that a client has mounted. The key is the -a flag,
> since it causes the server not to update the mtime on
> the directory.
>
> My initial effort was to make nfs_lookup_revalidate()
> a bit smarter with the use of ctimes, but it turns out
> that even when nfs_lookup_revalidate() does no caching
> (i.e. an otw lookup is issued on every call) the
> ESTALEs still occurred.
>
> Then I turned my attention to __nfs_refresh_inode() and
> had it use ctime in its calculations of what is
> and is not valid... This did work, but it caused a significant
> amount of extra otw traffic (using the cthon test suite) for
> the non-error cases. The one good thing about this patch (imho)
> is that the extra lookups only occur after an error that generally
> does not happen...
>
> Comments would be appreciated, especially about how
> I'm reinstantiating the dentry....

There are several problems here, but the main one is that you have no
guarantees that you are the exclusive user of that dentry.

This again means that people who think they have open files on the "old"
inode may suddenly find their program Oopsing or corrupting the new
file. Ditto for all those shrink_dcache_*() which are likely to Oops.

Cheers,
Trond
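
The exclusivity concern can be modelled in a few lines of user-space C. This is a sketch under loud assumptions: `model_inode`, `model_dentry`, and `rebind_if_exclusive` are hypothetical stand-ins, not the kernel's structures or any function in fs/nfs. It only shows the shape of the check that the patch lacks, namely refusing to swap the inode while anyone else still holds the old one.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>

/* Hypothetical stand-ins, not the kernel's struct inode / struct dentry. */
struct model_inode {
    int users;   /* holders of this inode, including the caller */
    int stale;   /* nonzero once the server rejects the file handle */
};

struct model_dentry {
    struct model_inode *inode;
};

/*
 * Rebind the name to a freshly looked-up inode only when the caller is
 * the sole user of the stale one. Returns 0 on success (or when there is
 * nothing to do) and -ESTALE when the swap is refused, so the error is
 * reported to the caller as before.
 */
static int rebind_if_exclusive(struct model_dentry *d, struct model_inode *fresh)
{
    assert(d != NULL && d->inode != NULL && fresh != NULL);

    if (!d->inode->stale)
        return 0;               /* handle is still good */
    if (d->inode->users > 1)
        return -ESTALE;         /* someone else still holds the old inode */

    d->inode->users--;          /* drop the caller's hold on the old inode */
    fresh->users++;             /* and take one on the replacement */
    d->inode = fresh;
    return 0;
}
```

In real kernel code the check and the swap would also have to happen under a lock, since another user could take a reference between the two steps. The model leaves that out.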

2004-04-23 14:48:26

by Lever, Charles

Subject: RE: [PATCH] Reinstantiating stale inodes

> On Fri, 2004-04-23 at 10:15, Steve Dickson wrote:
> > Here is a 2.4 patch that will reinstantiate an inode
> > when an ESTALE error is returned on a getattr. When
> > the error occurs, a lookup is immediately issued
> > to get a new fh.
> >
> > This fixes the problem of a server running rsync -a on a directory
> > that a client has mounted. The key is the -a flag,
> > since it causes the server not to update the mtime on
> > the directory.
> >
> > My initial effort was to make nfs_lookup_revalidate()
> > a bit smarter with the use of ctimes, but it turns out
> > that even when nfs_lookup_revalidate() does no caching
> > (i.e. an otw lookup is issued on every call) the
> > ESTALEs still occurred.
> >
> > Then I turned my attention to __nfs_refresh_inode() and
> > had it use ctime in its calculations of what is
> > and is not valid... This did work, but it caused a significant
> > amount of extra otw traffic (using the cthon test suite) for
> > the non-error cases. The one good thing about this patch (imho)
> > is that the extra lookups only occur after an error that generally
> > does not happen...
> >
> > Comments would be appreciated, especially about how
> > I'm reinstantiating the dentry....
>
> There are several problems here, but the main one is that you have no
> guarantees that you are the exclusive user of that dentry.
>
> This again means that people who think they have open files on the "old"
> inode may suddenly find their program Oopsing or corrupting the new
> file. Ditto for all those shrink_dcache_*() which are likely to Oops.

yes, i was wondering about that when i saw the patch.

there appears to be a real problem when restoring from a backup,
or using rsync. the file size and the mtime stay precisely the
same, but the file handle changes. i'm not sure anything can be
done about this in NFSv2/3?

-------------------------------------------------------
This SF.net email is sponsored by: The Robotic Monkeys at ThinkGeek
For a limited time only, get FREE Ground shipping on all orders of $35
or more. Hurry up and shop folks, this offer expires April 30th!
http://www.thinkgeek.com/freeshipping/?cpg=12297
_______________________________________________
NFS maillist - [email protected]
https://lists.sourceforge.net/lists/listinfo/nfs

2004-04-23 15:00:31

by Trond Myklebust

Subject: RE: [PATCH] Reinstantiating stale inodes

On Fri, 2004-04-23 at 10:48, Lever, Charles wrote:
> there appears to be a real problem when restoring from a backup,
> or using rsync. the file size and the mtime stay precisely the
> same, but the file handle changes. i'm not sure anything can be
> done about this in NFSv2/3?

The only way to distinguish the two is to replace the use of the mtime
with the ctime in nfs_check_verifier() and friends.

Under ordinary circumstances, the mtime and ctime should be more or less
identical, so I'm not sure why Steve was seeing extra revalidations when
he was running the Connectathon suite. Were you perhaps changing the
algorithm in nfs_refresh_inode() instead, Steve?

Cheers,
Trond
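
A small model of why ctime separates the two cases while mtime does not. It is a sketch only: `attr_snapshot` and the two comparison helpers are hypothetical and are not the logic of nfs_check_verifier(). The assumption is the restore/rsync case described above, where size and mtime are put back exactly but the server's filesystem stamps a new ctime on the recreated file.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical snapshot of the attributes a client has cached for a file. */
struct attr_snapshot {
    int64_t  mtime;  /* data modification time; tools such as rsync -a restore it */
    int64_t  ctime;  /* inode change time; set by the server's filesystem */
    uint64_t size;
};

/* Cache check keyed on mtime and size. Returns nonzero if a change is seen. */
static int changed_by_mtime(const struct attr_snapshot *cached,
                            const struct attr_snapshot *fresh)
{
    assert(cached != NULL && fresh != NULL);
    return cached->mtime != fresh->mtime || cached->size != fresh->size;
}

/* The same check keyed on ctime and size. */
static int changed_by_ctime(const struct attr_snapshot *cached,
                            const struct attr_snapshot *fresh)
{
    assert(cached != NULL && fresh != NULL);
    return cached->ctime != fresh->ctime || cached->size != fresh->size;
}
```

With a cached snapshot and a fresh one that differ only in ctime, `changed_by_mtime()` reports no change and `changed_by_ctime()` reports one, which is the distinction drawn above.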

2004-04-23 15:08:55

by Olaf Kirch

Subject: Re: [PATCH] Reinstantiating stale inodes

On Fri, Apr 23, 2004 at 07:48:11AM -0700, Lever, Charles wrote:
> there appears to be a real problem when restoring from a backup,
> or using rsync. the file size and the mtime stay precisely the
> same, but the file handle changes. i'm not sure anything can be
> done about this in NFSv2/3?

Well, namei_open could interpret an ESTALE error as "retry the
lookup and open, ignoring the cache".
I think this would solve 99% of all problems.

Olaf
--
Olaf Kirch | The Hardware Gods hate me.
[email protected] |
---------------+
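
The "retry the lookup and open, ignoring the cache" idea can be sketched as a wrapper that retries exactly once on ESTALE. Everything here is hypothetical: `open_fn`, `open_retry_on_estale`, and the fake backend are illustration only and do not correspond to any VFS interface named in this thread.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>

/*
 * Hypothetical open callback: returns a descriptor (>= 0) or a negative
 * errno. When ignore_cache is nonzero it must redo the lookup against
 * the server instead of trusting cached name-to-handle mappings.
 */
typedef int (*open_fn)(const char *path, int ignore_cache, void *ctx);

/* Try the cached path first; on ESTALE, retry once with the cache bypassed. */
static int open_retry_on_estale(open_fn do_open, const char *path, void *ctx)
{
    int ret;

    assert(do_open != NULL);
    ret = do_open(path, 0, ctx);
    if (ret == -ESTALE)
        ret = do_open(path, 1, ctx);
    return ret;
}

/* Fake backend used only to exercise the wrapper. */
struct fake_backend {
    int cache_is_stale;  /* cached handle no longer valid on the server */
    int calls;           /* number of open attempts made */
};

static int fake_open(const char *path, int ignore_cache, void *ctx)
{
    struct fake_backend *b = ctx;

    (void)path;
    b->calls++;
    if (b->cache_is_stale && !ignore_cache)
        return -ESTALE;
    return 3;            /* pretend descriptor */
}
```

Because the retry happens at open time, before the caller holds a descriptor, it avoids redirecting an already-open file to a different inode.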

2004-04-23 15:17:37

by Lever, Charles

Subject: RE: [PATCH] Reinstantiating stale inodes

> On Fri, Apr 23, 2004 at 07:48:11AM -0700, Lever, Charles wrote:
> > there appears to be a real problem when restoring from a backup,
> > or using rsync. the file size and the mtime stay precisely the
> > same, but the file handle changes. i'm not sure anything can be
> > done about this in NFSv2/3?
>
> Well, namei_open could interpret an ESTALE error as "retry the
> lookup and open, ignoring the cache".
> I think this would solve 99% of all problems.

my impression was that viro would retch at the idea of adding
file-system specific logic to the VFS layer. %^)

2004-04-23 15:50:27