2015-06-01 13:52:32

by David Howells

[permalink] [raw]
Subject: Can ovl_drop_write() be called earlier in ovl_dentry_open()

In ovl_dentry_open(), ovl_drop_write() is called after vfs_open() - but is
this actually necessary? Can't we just drop it post-copyup? After all,
that's all we wanted the write lock for, right?

David
---
--- a/fs/overlayfs/inode.c
+++ b/fs/overlayfs/inode.c
@@ -356,16 +356,14 @@ static int ovl_dentry_open(struct dentry *dentry, struct inode *inode,
err = ovl_copy_up_last(dentry, NULL, true);
else
err = ovl_copy_up(dentry);
+ ovl_drop_write(dentry);
if (err)
- goto out_drop_write;
+ goto out;

ovl_path_upper(dentry, &realpath);
}

err = vfs_open(&realpath, d_backing_inode(realpath.dentry), file, cred);
-out_drop_write:
- if (want_write)
- ovl_drop_write(dentry);
out:
return err;
}


2015-06-01 14:21:46

by David Howells

[permalink] [raw]
Subject: [PATCH] overlay: Call ovl_drop_write() earlier in ovl_dentry_open()

Call ovl_drop_write() earlier in ovl_dentry_open() before we call vfs_open()
as we've done the copy up for which we needed the freeze-write lock by that
point.

Signed-off-by: David Howells <[email protected]>
---

fs/overlayfs/inode.c | 14 ++++----------
1 file changed, 4 insertions(+), 10 deletions(-)

diff --git a/fs/overlayfs/inode.c b/fs/overlayfs/inode.c
index ce1f349642ab..8a9c32f8f4f9 100644
--- a/fs/overlayfs/inode.c
+++ b/fs/overlayfs/inode.c
@@ -343,31 +343,25 @@ static int ovl_dentry_open(struct dentry *dentry, struct inode *inode,
int err;
struct path realpath;
enum ovl_path_type type;
- bool want_write = false;

type = ovl_path_real(dentry, &realpath);
if (ovl_open_need_copy_up(file->f_flags, type, realpath.dentry)) {
- want_write = true;
err = ovl_want_write(dentry);
if (err)
- goto out;
+ return err;

if (file->f_flags & O_TRUNC)
err = ovl_copy_up_last(dentry, NULL, true);
else
err = ovl_copy_up(dentry);
+ ovl_drop_write(dentry);
if (err)
- goto out_drop_write;
+ return err;

ovl_path_upper(dentry, &realpath);
}

- err = vfs_open(&realpath, d_backing_inode(realpath.dentry), file, cred);
-out_drop_write:
- if (want_write)
- ovl_drop_write(dentry);
-out:
- return err;
+ return vfs_open(&realpath, d_backing_inode(realpath.dentry), file, cred);
}

static const struct inode_operations ovl_file_inode_operations = {

2015-06-01 14:22:50

by Miklos Szeredi

[permalink] [raw]
Subject: Re: Can ovl_drop_write() be called earlier in ovl_dentry_open()

On Mon, Jun 1, 2015 at 3:52 PM, David Howells <[email protected]> wrote:
> In ovl_dentry_open(), ovl_drop_write() is called after vfs_open() - but is
> this actually necessary? Can't we just drop it post-copyup? After all,
> that's all we wanted the write lock for, right?

Hmm, that could result in a race where remount r/o of upper fs comes
in between copy-up and vfs_open() so copy-up succeeds but the actual
open fails. It's harmless, though, and not very likely. So I guess
your patch is OK.

Thanks,
Miklos




>
> David
> ---
> --- a/fs/overlayfs/inode.c
> +++ b/fs/overlayfs/inode.c
> @@ -356,16 +356,14 @@ static int ovl_dentry_open(struct dentry *dentry, struct inode *inode,
> err = ovl_copy_up_last(dentry, NULL, true);
> else
> err = ovl_copy_up(dentry);
> + ovl_drop_write(dentry);
> if (err)
> - goto out_drop_write;
> + goto out;
>
> ovl_path_upper(dentry, &realpath);
> }
>
> err = vfs_open(&realpath, d_backing_inode(realpath.dentry), file, cred);
> -out_drop_write:
> - if (want_write)
> - ovl_drop_write(dentry);
> out:
> return err;
> }

2015-06-01 15:45:17

by David Howells

[permalink] [raw]
Subject: Re: Can ovl_drop_write() be called earlier in ovl_dentry_open()

Miklos Szeredi <[email protected]> wrote:

> > In ovl_dentry_open(), ovl_drop_write() is called after vfs_open() - but is
> > this actually necessary? Can't we just drop it post-copyup? After all,
> > that's all we wanted the write lock for, right?
>
> Hmm, that could result in a race where remount r/o of upper fs comes
> in between copy-up and vfs_open() so copy-up succeeds but the actual
> open fails. It's harmless, though, and not very likely. So I guess
> your patch is OK.

That race is there anyway if there's no copy up, right?

David

2015-06-01 15:51:41

by Miklos Szeredi

[permalink] [raw]
Subject: Re: Can ovl_drop_write() be called earlier in ovl_dentry_open()

On Mon, Jun 1, 2015 at 5:45 PM, David Howells <[email protected]> wrote:
> Miklos Szeredi <[email protected]> wrote:
>
>> > In ovl_dentry_open(), ovl_drop_write() is called after vfs_open() - but is
>> > this actually necessary? Can't we just drop it post-copyup? After all,
>> > that's all we wanted the write lock for, right?
>>
>> Hmm, that could result in a race where remount r/o of upper fs comes
>> in between copy-up and vfs_open() so copy-up succeeds but the actual
>> open fails. It's harmless, though, and not very likely. So I guess
>> your patch is OK.
>
> That race is there anyway if there's no copy up, right?

No. The race I'm talking about is that with your patch it's possible
that the file will be copied up, but open will return -EROFS.

Without your patch, that is not possible since holding write counter
for the mnt over both the copy-up and the open ensures that the
filesystem cannot become read-only in the middle.

So your patch changes behavior, but the new behavior is acceptable,
because there's no major change in semantics (it should only be
detectable by the increased disk usage in the rare case of the failed
open).

Thanks,
Miklos

2015-06-01 15:54:03

by David Howells

[permalink] [raw]
Subject: Re: Can ovl_drop_write() be called earlier in ovl_dentry_open()

Miklos Szeredi <[email protected]> wrote:

> >> Hmm, that could result in a race where remount r/o of upper fs comes
> >> in between copy-up and vfs_open() so copy-up succeeds but the actual
> >> open fails. It's harmless, though, and not very likely. So I guess
> >> your patch is OK.
> >
> > That race is there anyway if there's no copy up, right?
>
> No. The race I'm talking about is that with your patch it's possible
> that the file will be copied up, but open will return -EROFS.

Ah, I see what you're getting at.

> Without your patch, that is not possible since holding write counter
> for the mnt over both the copy-up and the open ensures that the
> filesystem cannot become read-only in the middle.
>
> So your patch changes behavior, but the new behavior is acceptable,
> because there's no major change in semantics (it should only be
> detectable by the increased disk usage in the rare case of the failed
> open).

Okay.

David