2007-10-07 19:20:39

by Erez Zadok

Subject: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

According to vfs.txt, ->writepage() may return AOP_WRITEPAGE_ACTIVATE back
to the VFS/VM. Indeed some filesystems such as tmpfs can return
AOP_WRITEPAGE_ACTIVATE; and stackable file systems (e.g., Unionfs) also
return AOP_WRITEPAGE_ACTIVATE if the lower f/s returned it.

Anyway, some Ubuntu users of Unionfs reported that msync(2) sometimes
returns AOP_WRITEPAGE_ACTIVATE (decimal 524288) back to userland.
Therefore, some user programs fail, especially if they're written like this:

err = msync(...);
if (err != 0)
// fail

They temporarily fixed the specific program in question (apt-get) to check

if (err < 0)
// fail

Is this a bug indeed, or are user programs supposed to handle
AOP_WRITEPAGE_ACTIVATE (I hope not the latter)? If it's a kernel bug, what
should the kernel return: a zero, or an -errno (and which one)?

Thanks,
Erez.


2007-10-07 19:58:23

by Pekka Enberg

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

Hi Erez,

On 10/7/07, Erez Zadok <[email protected]> wrote:
> Anyway, some Ubuntu users of Unionfs reported that msync(2) sometimes
> returns AOP_WRITEPAGE_ACTIVATE (decimal 524288) back to userland.
> Therefore, some user programs fail, esp. if they're written such as
> this:

[snip]

On 10/7/07, Erez Zadok <[email protected]> wrote:
> Is this a bug indeed, or are user programs supposed to handle
> AOP_WRITEPAGE_ACTIVATE (I hope not the latter)? If it's a kernel bug,
> what should the kernel return: a zero, or an -errno (and which one)?

It's a kernel bug. AOP_WRITEPAGE_ACTIVATE is a hint to the VM to avoid
writeback of the page in the near future. I wonder if it's enough that we
change the return value to zero from
mm/page-writeback.c:write_cache_pages() in case we hit AOP_WRITEPAGE_ACTIVATE...

Pekka

diff --git a/mm/page-writeback.c b/mm/page-writeback.c
index 63512a9..717f341 100644
--- a/mm/page-writeback.c
+++ b/mm/page-writeback.c
@@ -672,8 +672,10 @@ retry:

ret = (*writepage)(page, wbc, data);

- if (unlikely(ret == AOP_WRITEPAGE_ACTIVATE))
+ if (unlikely(ret == AOP_WRITEPAGE_ACTIVATE)) {
unlock_page(page);
+ ret = 0;
+ }
if (ret || (--(wbc->nr_to_write) <= 0))
done = 1;
if (wbc->nonblocking && bdi_write_congested(bdi)) {

2007-10-08 01:59:13

by Ryan Finnie

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On 10/7/07, Pekka J Enberg <[email protected]> wrote:
> On 10/7/07, Erez Zadok <[email protected]> wrote:
> > Anyway, some Ubuntu users of Unionfs reported that msync(2) sometimes
> > returns AOP_WRITEPAGE_ACTIVATE (decimal 524288) back to userland.
> > Therefore, some user programs fail, esp. if they're written such as
> > this:
>
...
> It's a kernel bug. AOP_WRITEPAGE_ACTIVATE is a hint to the VM to avoid
> writeback of the page in the near future. I wonder if it's enough that we
> change the return value to zero from
> mm/page-writeback.c:write_cache_pages() in case we hit AOP_WRITEPAGE_ACTIVATE...

Doesn't appear to be enough. I can't figure out why (since it appears
write_cache_pages bubbles up directly to sys_msync), but with that
patch applied, in my test case[1], msync returns -1 EIO. However,
with the exact same kernel without that patch applied, msync returns
524288 (AOP_WRITEPAGE_ACTIVATE). But as your patch specifically flips
524288 to 0, I can't figure out how it eventually returns -1 EIO.

Ryan

[1] "apt-get check" on a unionfs2 mount backed by tmpfs over cdrom,
standard livecd setup

2007-10-08 11:18:52

by Pekka Enberg

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

Hi Ryan,

On 10/8/07, Ryan Finnie <[email protected]> wrote:
> Doesn't appear to be enough. I can't figure out why (since it appears
> write_cache_pages bubbles up directly to sys_msync), but with that
> patch applied, in my test case[1], msync returns -1 EIO. However,
> with the exact same kernel without that patch applied, msync returns
> 524288 (AOP_WRITEPAGE_ACTIVATE). But as your patch specifically flips
> 524288 to 0, I can't figure out how it eventually returns -1 EIO.
>
> [1] "apt-get check" on a unionfs2 mount backed by tmpfs over cdrom,
> standard livecd setup

You have swap device disabled, right? If so, I can't see any reason
why msync(2) on tmpfs would return -EIO. Can you please send a strace
log for your test case?

Pekka

2007-10-11 21:48:18

by Andrew Morton

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On Sun, 7 Oct 2007 15:20:19 -0400
Erez Zadok <[email protected]> wrote:

> According to vfs.txt, ->writepage() may return AOP_WRITEPAGE_ACTIVATE back
> to the VFS/VM. Indeed some filesystems such as tmpfs can return
> AOP_WRITEPAGE_ACTIVATE; and stackable file systems (e.g., Unionfs) also
> return AOP_WRITEPAGE_ACTIVATE if the lower f/s returned it.
>
> Anyway, some Ubuntu users of Unionfs reported that msync(2) sometimes
> returns AOP_WRITEPAGE_ACTIVATE (decimal 524288) back to userland.
> Therefore, some user programs fail, esp. if they're written such as this:
>
> err = msync(...);
> if (err != 0)
> // fail
>
> They temporarily fixed the specific program in question (apt-get) to check
>
> if (err < 0)
> // fail
>
> Is this a bug indeed, or are user programs supposed to handle
> AOP_WRITEPAGE_ACTIVATE (I hope not the latter)? If it's a kernel bug, what
> should the kernel return: a zero, or an -errno (and which one)?
>

shit. That's a nasty bug. Really userspace should be testing for -1, but
the msync() library function should only ever return 0 or -1.

Does this fix it?

--- a/mm/page-writeback.c~a
+++ a/mm/page-writeback.c
@@ -850,8 +850,10 @@ retry:

ret = (*writepage)(page, wbc, data);

- if (unlikely(ret == AOP_WRITEPAGE_ACTIVATE))
+ if (unlikely(ret == AOP_WRITEPAGE_ACTIVATE)) {
unlock_page(page);
+ ret = 0;
+ }
if (ret || (--(wbc->nr_to_write) <= 0))
done = 1;
if (wbc->nonblocking && bdi_write_congested(bdi)) {
_

2007-10-11 22:12:20

by Ryan Finnie

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On 10/11/07, Andrew Morton <[email protected]> wrote:
> shit. That's a nasty bug. Really userspace should be testing for -1, but
> the msync() library function should only ever return 0 or -1.
>
> Does this fix it?
>
> --- a/mm/page-writeback.c~a
> +++ a/mm/page-writeback.c
> @@ -850,8 +850,10 @@ retry:
>
> ret = (*writepage)(page, wbc, data);
>
> - if (unlikely(ret == AOP_WRITEPAGE_ACTIVATE))
> + if (unlikely(ret == AOP_WRITEPAGE_ACTIVATE)) {
> unlock_page(page);
> + ret = 0;
> + }
> if (ret || (--(wbc->nr_to_write) <= 0))
> done = 1;
> if (wbc->nonblocking && bdi_write_congested(bdi)) {
> _
>

Pekka Enberg replied with an identical patch a few days ago, but for
some reason the same condition flows up to msync as -1 EIO instead of
AOP_WRITEPAGE_ACTIVATE with that patch applied. The last part of the
thread is below. Thanks.

Ryan

On 10/7/07, Ryan Finnie <[email protected]> wrote:
> On 10/7/07, Pekka J Enberg <[email protected]> wrote:
> > On 10/7/07, Erez Zadok <[email protected]> wrote:
> > > Anyway, some Ubuntu users of Unionfs reported that msync(2) sometimes
> > > returns AOP_WRITEPAGE_ACTIVATE (decimal 524288) back to userland.
> > > Therefore, some user programs fail, esp. if they're written such as
> > > this:
> >
> ...
> > It's a kernel bug. AOP_WRITEPAGE_ACTIVATE is a hint to the VM to avoid
> > writeback of the page in the near future. I wonder if it's enough that we
> > change the return value to zero from
> > mm/page-writeback.c:write_cache_pages() in case we hit AOP_WRITEPAGE_ACTIVATE...
>
> Doesn't appear to be enough. I can't figure out why (since it appears
> write_cache_pages bubbles up directly to sys_msync), but with that
> patch applied, in my test case[1], msync returns -1 EIO. However,
> with the exact same kernel without that patch applied, msync returns
> 524288 (AOP_WRITEPAGE_ACTIVATE). But as your patch specifically flips
> 524288 to 0, I can't figure out how it eventually returns -1 EIO.
>
> Ryan
>
> [1] "apt-get check" on a unionfs2 mount backed by tmpfs over cdrom,
> standard livecd setup
>

2007-10-12 00:39:15

by Hugh Dickins

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On Thu, 11 Oct 2007, Ryan Finnie wrote:
> On 10/11/07, Andrew Morton <[email protected]> wrote:
> > shit. That's a nasty bug. Really userspace should be testing for -1, but
> > the msync() library function should only ever return 0 or -1.
> >
> > Does this fix it?
> >
> > --- a/mm/page-writeback.c~a
> > +++ a/mm/page-writeback.c
> > @@ -850,8 +850,10 @@ retry:
> >
> > ret = (*writepage)(page, wbc, data);
> >
> > - if (unlikely(ret == AOP_WRITEPAGE_ACTIVATE))
> > + if (unlikely(ret == AOP_WRITEPAGE_ACTIVATE)) {
> > unlock_page(page);
> > + ret = 0;
> > + }
> > if (ret || (--(wbc->nr_to_write) <= 0))
> > done = 1;
> > if (wbc->nonblocking && bdi_write_congested(bdi)) {
> > _
> >
>
> Pekka Enberg replied with an identical patch a few days ago, but for
> some reason the same condition flows up to msync as -1 EIO instead of
> AOP_WRITEPAGE_ACTIVATE with that patch applied. The last part of the
> thread is below. Thanks.

Each time I sit down to follow what's going on with writepage and
unionfs and msync, I get distracted: I really haven't researched
this properly.

But I keep suspecting that the answer might be the patch below (which
rather follows what drivers/block/rd.c is doing). I'm especially
worried that, rather than just AOP_WRITEPAGE_ACTIVATE being returned
to userspace, bad enough in itself, you might be liable to hit that
BUG_ON(page_mapped(page)). shmem_writepage does not expect to be
called by anyone outside mm/vmscan.c, but unionfs can now get to it?

Please let us know if this patch does fix it:
then I'll try harder to work out what goes on.

Thanks,
Hugh

--- 2.6.23/mm/shmem.c 2007-10-09 21:31:38.000000000 +0100
+++ linux/mm/shmem.c 2007-10-12 01:25:46.000000000 +0100
@@ -916,6 +916,11 @@ static int shmem_writepage(struct page *
struct inode *inode;

BUG_ON(!PageLocked(page));
+ if (!wbc->for_reclaim) {
+ set_page_dirty(page);
+ unlock_page(page);
+ return 0;
+ }
BUG_ON(page_mapped(page));

mapping = page->mapping;

2007-10-12 21:49:47

by Pekka Enberg

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

Hi Hugh,

On 10/12/07, Hugh Dickins <[email protected]> wrote:
> But I keep suspecting that the answer might be the patch below (which
> rather follows what drivers/block/rd.c is doing). I'm especially
> worried that, rather than just AOP_WRITEPAGE_ACTIVATE being returned
> to userspace, bad enough in itself, you might be liable to hit that
> BUG_ON(page_mapped(page)). shmem_writepage does not expect to be
> called by anyone outside mm/vmscan.c, but unionfs can now get to it?

Doesn't msync(2) get to it via mm/page-writeback.c:write_cache_pages()
without unionfs even?

Pekka

2007-10-14 08:45:09

by Hugh Dickins

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On Sat, 13 Oct 2007, Pekka Enberg wrote:
> On 10/12/07, Hugh Dickins <[email protected]> wrote:
> > But I keep suspecting that the answer might be the patch below (which
> > rather follows what drivers/block/rd.c is doing). I'm especially
> > worried that, rather than just AOP_WRITEPAGE_ACTIVATE being returned
> > to userspace, bad enough in itself, you might be liable to hit that
> > BUG_ON(page_mapped(page)). shmem_writepage does not expect to be
> > called by anyone outside mm/vmscan.c, but unionfs can now get to it?
>
> Doesn't msync(2) get to it via mm/page-writeback.c:write_cache_pages()
> without unionfs even?

I believe not. Please do double-check my assertions, I've always found
the _writepages paths rather twisty; but my belief (supported by the
fact that we've not hit shmem_writepage's BUG_ON(page_mapped(page))
in five years) is that tmpfs/shmem opts out of all of that with its
.capabilities = BDI_CAP_NO_ACCT_DIRTY | BDI_CAP_NO_WRITEBACK,
in shmem_backing_dev_info, which avoids all those _writepages avenues
(e.g. through bdi_cap_writeback_dirty tests), and write_cache_pages is
just a subfunction of the _writepages.

So, while I don't disagree with your patch to write_cache_pages (though
it wasn't clear to me whether it should break from or continue the loop
if it ever does meet an AOP_WRITEPAGE_ACTIVATE), I don't think that's
really the root of the problem.

Hugh

2007-10-14 17:10:27

by Pekka Enberg

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

Hi Hugh,

On Sat, 13 Oct 2007, Pekka Enberg wrote:
> Doesn't msync(2) get to it via mm/page-writeback.c:write_cache_pages()
> without unionfs even?

On 10/14/07, Hugh Dickins <[email protected]> wrote:
> I believe not. Please do double-check my assertions, I've always found
> the _writepages paths rather twisty; but my belief (supported by the
> fact that we've not hit shmem_writepage's BUG_ON(page_mapped(page))
> in five years) is that tmpfs/shmem opts out of all of that with its
> .capabilities = BDI_CAP_NO_ACCT_DIRTY | BDI_CAP_NO_WRITEBACK,
> in shmem_backing_dev_info, which avoids all those _writepages avenues
> (e.g. through bdi_cap_writeback_dirty tests), and write_cache_pages is
> just a subfunction of the _writepages.

Thanks for the explanation, you're obviously correct.

However, I don't think the mapping_cap_writeback_dirty() check in
__filemap_fdatawrite_range() works as expected when tmpfs is a lower
mount for a unionfs mount. There's no BDI_CAP_NO_WRITEBACK capability
for unionfs mappings so do_fsync() will call write_cache_pages() that
unconditionally invokes shmem_writepage() via unionfs_writepage().
Unless, of course, there's some other unionfs magic I am missing.

Pekka

2007-10-14 17:24:29

by Erez Zadok

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

In message <[email protected]>, "Pekka Enberg" writes:
> Hi Hugh,
>
> On Sat, 13 Oct 2007, Pekka Enberg wrote:
> > Doesn't msync(2) get to it via mm/page-writeback.c:write_cache_pages()
> > without unionfs even?
>
> On 10/14/07, Hugh Dickins <[email protected]> wrote:
> > I believe not. Please do double-check my assertions, I've always found
> > the _writepages paths rather twisty; but my belief (supported by the
> > fact that we've not hit shmem_writepage's BUG_ON(page_mapped(page))
> > in five years) is that tmpfs/shmem opts out of all of that with its
> > .capabilities = BDI_CAP_NO_ACCT_DIRTY | BDI_CAP_NO_WRITEBACK,
> > in shmem_backing_dev_info, which avoids all those _writepages avenues
> > (e.g. through bdi_cap_writeback_dirty tests), and write_cache_pages is
> > just a subfunction of the _writepages.
>
> Thanks for the explanation, you're obviously correct.
>
> However, I don't think the mapping_cap_writeback_dirty() check in
> __filemap_fdatawrite_range() works as expected when tmpfs is a lower
> mount for an unionfs mount. There's no BDI_CAP_NO_WRITEBACK capability
> for unionfs mappings so do_fsync() will call write_cache_pages() that
> unconditionally invokes shmem_writepage() via unionfs_writepage().
> Unless, of course, there's some other unionfs magic I am missing.
>
> Pekka

In unionfs_writepage() I tried to emulate as best possible what the lower
f/s will have returned to the VFS. Since tmpfs's ->writepage can return
AOP_WRITEPAGE_ACTIVATE and re-mark its page as dirty, I did the same in
unionfs: mark again my page as dirty, and return AOP_WRITEPAGE_ACTIVATE.

Should I be doing something different when unionfs stacks on top of tmpfs?
(BTW, this is probably also relevant to ecryptfs.)

Thanks,
Erez.

2007-10-14 17:52:37

by Pekka Enberg

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

Hi Erez,

On Sun, 14 Oct 2007, Erez Zadok wrote:
> In unionfs_writepage() I tried to emulate as best possible what the lower
> f/s will have returned to the VFS. Since tmpfs's ->writepage can return
> AOP_WRITEPAGE_ACTIVATE and re-mark its page as dirty, I did the same in
> unionfs: mark again my page as dirty, and return AOP_WRITEPAGE_ACTIVATE.
>
> Should I be doing something different when unionfs stacks on top of tmpfs?
> (BTW, this is probably also relevant to ecryptfs.)

Look at mm/filemap.c:__filemap_fdatawrite_range(). You shouldn't be
calling unionfs_writepage() _at all_ if the lower mapping has
BDI_CAP_NO_WRITEBACK capability set. Perhaps something like the totally
untested patch below?

Pekka

---
fs/unionfs/mmap.c | 17 +++++++++++++++++
1 file changed, 17 insertions(+)

Index: linux-2.6.23-rc8/fs/unionfs/mmap.c
===================================================================
--- linux-2.6.23-rc8.orig/fs/unionfs/mmap.c
+++ linux-2.6.23-rc8/fs/unionfs/mmap.c
@@ -17,6 +17,7 @@
* published by the Free Software Foundation.
*/

+#include <linux/backing-dev.h>
#include "union.h"

/*
@@ -144,6 +145,21 @@ out:
return err;
}

+static int unionfs_writepages(struct address_space *mapping,
+ struct writeback_control *wbc)
+{
+ struct inode *lower_inode;
+ struct inode *inode;
+
+ inode = mapping->host;
+ lower_inode = unionfs_lower_inode(inode);
+
+ if (!mapping_cap_writeback_dirty(lower_inode->i_mapping))
+ return 0;
+
+ return generic_writepages(mapping, wbc);
+}
+
/*
* readpage is called from generic_page_read and the fault handler.
* If your file system uses generic_page_read for the read op, it
@@ -371,6 +387,7 @@ out:

struct address_space_operations unionfs_aops = {
.writepage = unionfs_writepage,
+ .writepages = unionfs_writepages,
.readpage = unionfs_readpage,
.prepare_write = unionfs_prepare_write,
.commit_write = unionfs_commit_write,

2007-10-14 22:32:37

by Erez Zadok

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

In message <[email protected]>, Pekka J Enberg writes:
> Hi Erez,
>
> On Sun, 14 Oct 2007, Erez Zadok wrote:
> > In unionfs_writepage() I tried to emulate as best possible what the lower
> > f/s will have returned to the VFS. Since tmpfs's ->writepage can return
> > AOP_WRITEPAGE_ACTIVATE and re-mark its page as dirty, I did the same in
> > unionfs: mark again my page as dirty, and return AOP_WRITEPAGE_ACTIVATE.
> >
> > Should I be doing something different when unionfs stacks on top of tmpfs?
> > (BTW, this is probably also relevant to ecryptfs.)
>
> Look at mm/filemap.c:__filemap_fdatawrite_range(). You shouldn't be
> calling unionfs_writepage() _at all_ if the lower mapping has
> BDI_CAP_NO_WRITEBACK capability set. Perhaps something like the totally
> untested patch below?
>
> Pekka
[...]

Pekka, with a small change to your patch (to handle time-based cache
coherency), your patch worked well and passed all my tests. Thanks.

So now I wonder if we still need the patch to prevent AOP_WRITEPAGE_ACTIVATE
from being returned to userland. I guess we still need it, b/c even with
your patch, generic_writepages() can return AOP_WRITEPAGE_ACTIVATE back to
the VFS and we need to ensure that doesn't "leak" outside the kernel.

Erez.

2007-10-15 11:48:07

by Pekka Enberg

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

Hi,

On 10/15/07, Erez Zadok <[email protected]> wrote:
> Pekka, with a small change to your patch (to handle time-based cache
> coherency), your patch worked well and passed all my tests. Thanks.
>
> So now I wonder if we still need the patch to prevent AOP_WRITEPAGE_ACTIVATE
> from being returned to userland. I guess we still need it, b/c even with
> your patch, generic_writepages() can return AOP_WRITEPAGE_ACTIVATE back to
> the VFS and we need to ensure that doesn't "leak" outside the kernel.

I wonder whether _not setting_ BDI_CAP_NO_WRITEBACK implies that
->writepage() will never return AOP_WRITEPAGE_ACTIVATE for
!wbc->for_reclaim case which would explain why we haven't hit this bug
before. Hugh, Andrew?

And btw, I think we need to fix ecryptfs too.

Pekka

2007-10-16 18:04:41

by Erez Zadok

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

In message <[email protected]>, "Pekka Enberg" writes:
> Hi,
>
> On 10/15/07, Erez Zadok <[email protected]> wrote:
> > Pekka, with a small change to your patch (to handle time-based cache
> > coherency), your patch worked well and passed all my tests. Thanks.
> >
> > So now I wonder if we still need the patch to prevent AOP_WRITEPAGE_ACTIVATE
> > from being returned to userland. I guess we still need it, b/c even with
> > your patch, generic_writepages() can return AOP_WRITEPAGE_ACTIVATE back to
> > the VFS and we need to ensure that doesn't "leak" outside the kernel.
>
> I wonder whether _not setting_ BDI_CAP_NO_WRITEBACK implies that
> ->writepage() will never return AOP_WRITEPAGE_ACTIVATE for
> !wbc->for_reclaim case which would explain why we haven't hit this bug
> before. Hugh, Andrew?
>
> And btw, I think we need to fix ecryptfs too.

Yes, ecryptfs needs this fix too (and probably a couple of other mmap fixes
I've made to unionfs recently -- Mike Halcrow already knows :-))

Of course, running ecryptfs on top of tmpfs is somewhat odd and uncommon;
but with unionfs, users use tmpfs as the copyup branch very often.

> Pekka

Erez.

2007-10-22 19:43:36

by Hugh Dickins

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

Sorry for my delay, here are a few replies.

On Sun, 14 Oct 2007, Erez Zadok wrote:
> In message <[email protected]>, "Pekka Enberg" writes:
> >
> > However, I don't think the mapping_cap_writeback_dirty() check in
> > __filemap_fdatawrite_range() works as expected when tmpfs is a lower
> > mount for an unionfs mount. There's no BDI_CAP_NO_WRITEBACK capability
> > for unionfs mappings so do_fsync() will call write_cache_pages() that
> > unconditionally invokes shmem_writepage() via unionfs_writepage().
> > Unless, of course, there's some other unionfs magic I am missing.

Thanks, Pekka, yes that made a lot of sense.

>
> In unionfs_writepage() I tried to emulate as best possible what the lower
> f/s will have returned to the VFS. Since tmpfs's ->writepage can return
> AOP_WRITEPAGE_ACTIVATE and re-mark its page as dirty, I did the same in
> unionfs: mark again my page as dirty, and return AOP_WRITEPAGE_ACTIVATE.

I think that's inappropriate. Why should unionfs_writepage re-mark its
page as dirty when the lower level does so? Unionfs has successfully
done its write to the lower level, what the lower level then gets up to
(writing then or not) is its own business: needn't be propagated upwards.

The fewer places that supply AOP_WRITEPAGE_ACTIVATE the better.
What I'd like most of all is to eliminate it, in favour of vmscan.c
working out the condition for itself: but I've given that no thought,
it may not be reasonable.

unionfs_writepage also sets AOP_WRITEPAGE_ACTIVATE when it cannot
find_lock_page: that case may be appropriate. Though I don't really
understand it: seems dangerous to be relying upon the lower level page
just happening to be there already. Isn't memory pressure then likely
to clog up with lots of upper level dirty pages which cannot get
written out to the lower level?

>
> Should I be doing something different when unionfs stacks on top of tmpfs?

I think not.

> (BTW, this is probably also relevant to ecryptfs.)

You're both agreed on that, but I don't see how: ecryptfs writes the
lower level via vfs_write, it's not using the lower level's writepage,
is it?

Hugh

2007-10-22 20:02:24

by Hugh Dickins

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On Sun, 14 Oct 2007, Erez Zadok wrote:
> In message <[email protected]>, Pekka J Enberg writes:
> >
> > Look at mm/filemap.c:__filemap_fdatawrite_range(). You shouldn't be
> > calling unionfs_writepage() _at all_ if the lower mapping has
> > BDI_CAP_NO_WRITEBACK capability set. Perhaps something like the totally
> > untested patch below?
...

I don't disagree with your unionfs_writepages patch, Pekka, but I think
it should be viewed as an optimization (don't waste time trying to write
a group of pages when we know that nothing will be done) rather than as
essential.

Prior to unionfs's own use of AOP_WRITEPAGE_ACTIVATE, there have only
been ramdisk and shmem generating it. ramdisk is careful only to
return it in the wbc->for_reclaim case: I think (as in the patch
I sent out before) shmem now ought to do so too for safety.

Back in 2.4 days it was reasonable to assume that ->writepage would
only get called from certain places, but things move faster nowadays,
and the unionfs example shows others are liable to start ab/using it.
I'll send Andrew that patch tomorrow (it's simple enough, but I'd
like at least to try to reproduce the page_mapped bug first).

>
> Pekka, with a small change to your patch (to handle time-based cache
> coherency), your patch worked well and passed all my tests. Thanks.
>
> So now I wonder if we still need the patch to prevent AOP_WRITEPAGE_ACTIVATE
> from being returned to userland. I guess we still need it, b/c even with
> your patch, generic_writepages() can return AOP_WRITEPAGE_ACTIVATE back to
> the VFS and we need to ensure that doesn't "leak" outside the kernel.

Can it now? Current git has a patch from Andrew which bears a striking
resemblance to that from Pekka, stopping the leak from write_cache_pages.

Hugh

2007-10-22 20:17:28

by Hugh Dickins

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On Mon, 15 Oct 2007, Pekka Enberg wrote:
>
> I wonder whether _not setting_ BDI_CAP_NO_WRITEBACK implies that
> ->writepage() will never return AOP_WRITEPAGE_ACTIVATE for
> !wbc->for_reclaim case which would explain why we haven't hit this bug
> before. Hugh, Andrew?

Only ramdisk and shmem have been returning AOP_WRITEPAGE_ACTIVATE.
Both of those set BDI_CAP_NO_WRITEBACK. ramdisk never returned it
if !wbc->for_reclaim. I contend that shmem shouldn't either: it's
a special code to get the LRU rotation right, not useful elsewhere.
Though Documentation/filesystems/vfs.txt does imply wider use.

I think this is where people use the phrase "go figure" ;)

Hugh

2007-10-22 20:40:35

by Pekka Enberg

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

Hi Hugh,

On 10/22/07, Hugh Dickins <[email protected]> wrote:
> I don't disagree with your unionfs_writepages patch, Pekka, but I think
> it should be viewed as an optimization (don't waste time trying to write
> a group of pages when we know that nothing will be done) rather than as
> essential.

Ok, so tmpfs needs your fix still.

On 10/22/07, Hugh Dickins <[email protected]> wrote:
> > So now I wonder if we still need the patch to prevent AOP_WRITEPAGE_ACTIVATE
> > from being returned to userland. I guess we still need it, b/c even with
> > your patch, generic_writepages() can return AOP_WRITEPAGE_ACTIVATE back to
> > the VFS and we need to ensure that doesn't "leak" outside the kernel.
>
> Can it now? Current git has a patch from Andrew which bears a striking
> resemblance to that from Pekka, stopping the leak from write_cache_pages.

I don't think it can, it looks ok now.

Pekka

2007-10-22 20:48:48

by Pekka Enberg

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

Hi Hugh,

On Mon, 15 Oct 2007, Pekka Enberg wrote:
> > I wonder whether _not setting_ BDI_CAP_NO_WRITEBACK implies that
> > ->writepage() will never return AOP_WRITEPAGE_ACTIVATE for
> > !wbc->for_reclaim case which would explain why we haven't hit this bug
> > before. Hugh, Andrew?

On 10/22/07, Hugh Dickins <[email protected]> wrote:
> Only ramdisk and shmem have been returning AOP_WRITEPAGE_ACTIVATE.
> Both of those set BDI_CAP_NO_WRITEBACK. ramdisk never returned it
> if !wbc->for_reclaim. I contend that shmem shouldn't either: it's
> a special code to get the LRU rotation right, not useful elsewhere.
> Though Documentation/filesystems/vfs.txt does imply wider use.
>
> I think this is where people use the phrase "go figure" ;)

Heh. As far as I can tell, the implication of "wider use" was added by
Neil in commit "341546f5ad6fce584531f744853a5807a140f2a9 Update some
VFS documentation", so perhaps he might know? Neil?

Pekka

2007-10-22 21:05:51

by Erez Zadok

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

In message <[email protected]>, Hugh Dickins writes:
> On Mon, 15 Oct 2007, Pekka Enberg wrote:
> >
> > I wonder whether _not setting_ BDI_CAP_NO_WRITEBACK implies that
> > ->writepage() will never return AOP_WRITEPAGE_ACTIVATE for
> > !wbc->for_reclaim case which would explain why we haven't hit this bug
> > before. Hugh, Andrew?
>
> Only ramdisk and shmem have been returning AOP_WRITEPAGE_ACTIVATE.
> Both of those set BDI_CAP_NO_WRITEBACK. ramdisk never returned it
> if !wbc->for_reclaim. I contend that shmem shouldn't either: it's
> a special code to get the LRU rotation right, not useful elsewhere.
> Though Documentation/filesystems/vfs.txt does imply wider use.

Yes, based on vfs.txt I figured unionfs should return
AOP_WRITEPAGE_ACTIVATE. But now that unionfs has ->writepages, which won't
even call the lower writepage if BDI_CAP_NO_WRITEBACK is on, perhaps I
no longer need unionfs_writepage to bother checking for
AOP_WRITEPAGE_ACTIVATE, or even return it up?

But, a future file system _could_ return AOP_WRITEPAGE_ACTIVATE w/o setting
BDI_CAP_NO_WRITEBACK, right? In that case, unionfs will still need to
handle AOP_WRITEPAGE_ACTIVATE in ->writepage, right?

> I think this is where people use the phrase "go figure" ;)
>
> Hugh

Erez.

2007-10-22 21:38:55

by Erez Zadok

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

In message <[email protected]>, Hugh Dickins writes:
> Sorry for my delay, here are a few replies.
>

> > In unionfs_writepage() I tried to emulate as best possible what the lower
> > f/s will have returned to the VFS. Since tmpfs's ->writepage can return
> > AOP_WRITEPAGE_ACTIVATE and re-mark its page as dirty, I did the same in
> > unionfs: mark again my page as dirty, and return AOP_WRITEPAGE_ACTIVATE.
>
> I think that's inappropriate. Why should unionfs_writepage re-mark its
> page as dirty when the lower level does so? Unionfs has successfully
> done its write to the lower level, what the lower level then gets up to
> (writing then or not) is its own business: needn't be propagated upwards.

What's the precise semantics of AOP_WRITEPAGE_ACTIVATE? Is it considered an
error or not? If it's an error, then I usually feel that it's important for
a stacked f/s to return that error indication upwards.

The unionfs page and the lower page are somewhat tied together, at least
logically. For unionfs's page to be considered to have been written
successfully, the lower page has to be written successfully. So again, if
the lower f/s returns AOP_WRITEPAGE_ACTIVATE, should I consider my unionfs
page to have been written successfully or not? If I don't return
AOP_WRITEPAGE_ACTIVATE up, can there be any chance that some vital data may
never get flushed out?

Anyway, now that unionfs has ->writepages that won't bother calling the lower
->writepage for file systems with BDI_CAP_NO_WRITEBACK, the issue of
AOP_WRITEPAGE_ACTIVATE in ->writepage may be less important.

> unionfs_writepage also sets AOP_WRITEPAGE_ACTIVATE when it cannot
> find_lock_page: that case may be appropriate. Though I don't really
> understand it: seems dangerous to be relying upon the lower level page
> just happening to be there already. Isn't memory pressure then likely
> to clog up with lots of upper level dirty pages which cannot get
> written out to the lower level?

Based on vfs.txt (which perhaps should be revised :-), I was trying to do
the best I can to ensure that no data is lost if the current page cannot be
written out to the lower f/s.

I used to do grab_cache_page() before, but that caused problems: writepage
is not the right place to _increase_ memory pressure by allocating a new
page...

One solution I thought of is do what ecryptfs does: keep an open struct file
in my inode and call vfs_write(), but I don't see that as a significantly
cleaner/better solution. (BTW, ecryptfs kinda had to go for vfs_write b/c
it changes the data size and content of what it writes below; unionfs is
simpler in that manner b/c it needs to write the same data to the lower file
at the same offset.)

Another idea we've experimented with before is "page pointer flipping." In
writepage, we temporarily set the page->mapping->host to the lower_inode;
then we call the lower writepage with OUR page; then fix back the
page->mapping->host to the upper inode. This had two benefits: first we can
guarantee that we always have a page to write below, and second we don't
need to keep both upper and lower pages (reduces memory pressure). Before
we did this page pointer flipping, we verified that the page is locked so no
other user could observe page->mapping->host in this transient state,
and we ensured that no lower f/s was somehow caching the temporarily changed
value of page->mapping->host for later use. But, mucking with the pointers
in this manner is kinda ugly, to say the least. Still, I'd love to find a
clean and simple way that two layers can share the same struct page and
cleanly pass the upper page to a lower f/s.

If you've got suggestions how I can handle unionfs_writepage more cleanly, or
comments on the above possibilities, I'd love to hear them.

> > Should I be doing something different when unionfs stacks on top of tmpfs?
>
> I think not.
>
> > (BTW, this is probably also relevant to ecryptfs.)
>
> You're both agreed on that, but I don't see how: ecryptfs writes the
> lower level via vfs_write, it's not using the lower level's writepage,
> is it?

Yup. ecryptfs no longer does that: it recently changed things and now it
stores an open struct file in its inode, so it can always pass the file to
vfs_write. This nicely avoids calling the lower writepage, but one has to
keep an open file for every inode. Neither of the solutions currently
employed by unionfs and ecryptfs seems really satisfactory (clean and
efficient).

> Hugh

Thanks,
Erez.

2007-10-24 21:08:57

by Hugh Dickins

[permalink] [raw]
Subject: [PATCH] fix tmpfs BUG and AOP_WRITEPAGE_ACTIVATE

It's possible to provoke unionfs (not yet in mainline, though in mm
and some distros) to hit shmem_writepage's BUG_ON(page_mapped(page)).
I expect it's possible to provoke the 2.6.23 ecryptfs in the same way
(but the 2.6.24 ecryptfs no longer calls lower level's ->writepage).

This came to light with the recent find that AOP_WRITEPAGE_ACTIVATE
could leak from tmpfs via write_cache_pages and unionfs to userspace.
There's already a fix (e423003028183df54f039dfda8b58c49e78c89d7 -
writeback: don't propagate AOP_WRITEPAGE_ACTIVATE) in the tree for
that, and it's okay so far as it goes; but insufficient because it
doesn't address the underlying issue, that shmem_writepage expects
to be called only by vmscan (relying on backing_dev_info capabilities
to prevent the normal writeback path from ever approaching it).

That's an increasingly fragile expectation, and ramdisk_writepage
(the other source of AOP_WRITEPAGE_ACTIVATEs) is already careful
to check wbc->for_reclaim before returning it. Make the same check
in shmem_writepage, thereby sidestepping the page_mapped BUG also.

Signed-off-by: Hugh Dickins <[email protected]>
---
Unionfs intends its own, third fix to these issues, checking
backing_dev_info capabilities as the normal writeback path does.
And I intend a fourth fix, getting rid of AOP_WRITEPAGE_ACTIVATE
entirely (mainly to put a stop to everybody asking what it means
and when it happens and how to handle it) - but that's a slightly
bigger patch, needing a little more testing, probably for 2.6.25.

I've CC'ed this to stable as you did for the write_cache_pages
fix: it's probably required for ecryptfs (but unionfs was much
easier to set up and test), and helpful to distros using unionfs
and checking stable for fixes. Does this make the write_cache_pages
fix redundant? Probably, but let's have both in for safety.

mm/shmem.c | 5 +++++
1 file changed, 5 insertions(+)

--- 2.6.24-rc1/mm/shmem.c 2007-10-24 07:16:04.000000000 +0100
+++ linux/mm/shmem.c 2007-10-24 20:24:31.000000000 +0100
@@ -915,6 +915,11 @@ static int shmem_writepage(struct page *
struct inode *inode;

BUG_ON(!PageLocked(page));
+ if (!wbc->for_reclaim) {
+ set_page_dirty(page);
+ unlock_page(page);
+ return 0;
+ }
BUG_ON(page_mapped(page));

mapping = page->mapping;

2007-10-24 21:09:33

by Andrew Morton

[permalink] [raw]
Subject: Re: [PATCH] fix tmpfs BUG and AOP_WRITEPAGE_ACTIVATE

On Wed, 24 Oct 2007 22:02:15 +0100 (BST)
Hugh Dickins <[email protected]> wrote:

> --- 2.6.24-rc1/mm/shmem.c 2007-10-24 07:16:04.000000000 +0100
> +++ linux/mm/shmem.c 2007-10-24 20:24:31.000000000 +0100
> @@ -915,6 +915,11 @@ static int shmem_writepage(struct page *
> struct inode *inode;
>
> BUG_ON(!PageLocked(page));
> + if (!wbc->for_reclaim) {
> + set_page_dirty(page);
> + unlock_page(page);
> + return 0;
> + }
> BUG_ON(page_mapped(page));

Needs a comment, methinks.

2007-10-24 21:39:16

by Hugh Dickins

[permalink] [raw]
Subject: [PATCH+comment] fix tmpfs BUG and AOP_WRITEPAGE_ACTIVATE

It's possible to provoke unionfs (not yet in mainline, though in mm
and some distros) to hit shmem_writepage's BUG_ON(page_mapped(page)).
I expect it's possible to provoke the 2.6.23 ecryptfs in the same way
(but the 2.6.24 ecryptfs no longer calls lower level's ->writepage).

This came to light with the recent find that AOP_WRITEPAGE_ACTIVATE
could leak from tmpfs via write_cache_pages and unionfs to userspace.
There's already a fix (e423003028183df54f039dfda8b58c49e78c89d7 -
writeback: don't propagate AOP_WRITEPAGE_ACTIVATE) in the tree for
that, and it's okay so far as it goes; but insufficient because it
doesn't address the underlying issue, that shmem_writepage expects
to be called only by vmscan (relying on backing_dev_info capabilities
to prevent the normal writeback path from ever approaching it).

That's an increasingly fragile assumption, and ramdisk_writepage
(the other source of AOP_WRITEPAGE_ACTIVATEs) is already careful
to check wbc->for_reclaim before returning it. Make the same check
in shmem_writepage, thereby sidestepping the page_mapped BUG also.

Signed-off-by: Hugh Dickins <[email protected]>
---
Unionfs intends its own, third fix to these issues, checking
backing_dev_info capabilities as the normal writeback path does.
And I intend a fourth fix, getting rid of AOP_WRITEPAGE_ACTIVATE
entirely (mainly to put a stop to everybody asking what it means
and when it happens and how to handle it) - but that's a slightly
bigger patch, needing a little more testing, probably for 2.6.25.

I've CC'ed this to stable as you did for the write_cache_pages
fix: it's probably required for ecryptfs (but unionfs was much
easier to set up and test), and helpful to distros using unionfs
and checking stable for fixes. Does this make the write_cache_pages
fix redundant? Probably, but let's have both in for safety.

mm/shmem.c | 15 +++++++++++++++
1 file changed, 15 insertions(+)

--- 2.6.24-rc1/mm/shmem.c 2007-10-24 07:16:04.000000000 +0100
+++ linux/mm/shmem.c 2007-10-24 22:31:09.000000000 +0100
@@ -915,6 +915,21 @@ static int shmem_writepage(struct page *
struct inode *inode;

BUG_ON(!PageLocked(page));
+ /*
+ * shmem_backing_dev_info's capabilities prevent regular writeback or
+ * sync from ever calling shmem_writepage; but a stacking filesystem
+ * may use the ->writepage of its underlying filesystem, in which case
+ * we want to do nothing when that underlying filesystem is tmpfs
+ * (writing out to swap is useful as a response to memory pressure, but
+ * of no use to stabilize the data) - just redirty the page, unlock it
+ * and claim success in this case. AOP_WRITEPAGE_ACTIVATE, and the
+ * page_mapped check below, must be avoided unless we're in reclaim.
+ */
+ if (!wbc->for_reclaim) {
+ set_page_dirty(page);
+ unlock_page(page);
+ return 0;
+ }
BUG_ON(page_mapped(page));

mapping = page->mapping;

2007-10-25 05:37:38

by Pekka Enberg

[permalink] [raw]
Subject: Re: [PATCH+comment] fix tmpfs BUG and AOP_WRITEPAGE_ACTIVATE

Hi Hugh,

On 10/25/07, Hugh Dickins <[email protected]> wrote:
> --- 2.6.24-rc1/mm/shmem.c 2007-10-24 07:16:04.000000000 +0100
> +++ linux/mm/shmem.c 2007-10-24 22:31:09.000000000 +0100
> @@ -915,6 +915,21 @@ static int shmem_writepage(struct page *
> struct inode *inode;
>
> BUG_ON(!PageLocked(page));
> + /*
> + * shmem_backing_dev_info's capabilities prevent regular writeback or
> + * sync from ever calling shmem_writepage; but a stacking filesystem
> + * may use the ->writepage of its underlying filesystem, in which case

I find the above bit somewhat misleading, as it implies that the
!wbc->for_reclaim case can be removed once ecryptfs has a fix similar
to unionfs's. Can we just say that while BDI_CAP_NO_WRITEBACK does
prevent some callers from entering ->writepage(), it's just an
optimization, and ->writepage() must deal with the !wbc->for_reclaim
case properly?

Pekka

2007-10-25 06:31:35

by Hugh Dickins

[permalink] [raw]
Subject: Re: [PATCH+comment] fix tmpfs BUG and AOP_WRITEPAGE_ACTIVATE

On Thu, 25 Oct 2007, Pekka Enberg wrote:
> On 10/25/07, Hugh Dickins <[email protected]> wrote:
> > --- 2.6.24-rc1/mm/shmem.c 2007-10-24 07:16:04.000000000 +0100
> > +++ linux/mm/shmem.c 2007-10-24 22:31:09.000000000 +0100
> > @@ -915,6 +915,21 @@ static int shmem_writepage(struct page *
> > struct inode *inode;
> >
> > BUG_ON(!PageLocked(page));
> > + /*
> > + * shmem_backing_dev_info's capabilities prevent regular writeback or
> > + * sync from ever calling shmem_writepage; but a stacking filesystem
> > + * may use the ->writepage of its underlying filesystem, in which case
>
> I find the above bit somewhat misleading as it implies that the
> !wbc->for_reclaim case can be removed after ecryptfs has similar fix
> as unionfs. Can we just say that while BDI_CAP_NO_WRITEBACK does
> prevent some callers from entering ->writepage(), it's just an
> optimization and ->writepage() must deal with !wbc->for_reclaim case
> properly?

Sorry for being obtuse, but I don't see how that's misleading at all.

ecryptfs already has a (dissimilar) fix in 2.6.24-rc1, not using the
writepage route at all. But it remains the case that some stacking
filesystem may (would you prefer "might" to "may"? "may" has a nice
double meaning of "might" and "we'll allow it", but this patch does
indeed allow it) use the ->writepage of its underlying filesystem.

With unionfs also fixed, we don't know of an absolute need for this
patch (and so, on that basis, the !wbc->for_reclaim case could indeed
be removed very soon); but as I see it, the unionfs case has shown
that it's time to future-proof this code against whatever stacking
filesystems come along. Hence I didn't mention the names of such
filesystems in the source comment.

The !page_mapped assumption has been built in there since earliest
2.4, but it took a while for us to get a way to express it in a BUG.

Hugh

2007-10-25 07:24:33

by Pekka Enberg

[permalink] [raw]
Subject: Re: [PATCH+comment] fix tmpfs BUG and AOP_WRITEPAGE_ACTIVATE

Hi Hugh,

On 10/25/07, Hugh Dickins <[email protected]> wrote:
> With unionfs also fixed, we don't know of an absolute need for this
> patch (and so, on that basis, the !wbc->for_reclaim case could indeed
> be removed very soon); but as I see it, the unionfs case has shown
> that it's time to future-proof this code against whatever stacking
> filesystems come along.

Heh, what can I say, after several readings, I still find your above
explanation (which I totally agree with) more to the point than the
actual comment :-).

In any case, the patch looks good to me.

Reviewed-by: Pekka Enberg <[email protected]>

Pekka

2007-10-25 15:52:43

by Hugh Dickins

[permalink] [raw]
Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On Mon, 22 Oct 2007, Pekka Enberg wrote:
> On 10/22/07, Hugh Dickins <[email protected]> wrote:
> > Only ramdisk and shmem have been returning AOP_WRITEPAGE_ACTIVATE.
> > Both of those set BDI_CAP_NO_WRITEBACK. ramdisk never returned it
> > if !wbc->for_reclaim. I contend that shmem shouldn't either: it's
> > a special code to get the LRU rotation right, not useful elsewhere.
> > Though Documentation/filesystems/vfs.txt does imply wider use.
> >
> > I think this is where people use the phrase "go figure" ;)
>
> Heh. As far as I can tell, the implication of "wider use" was added by
> Neil in commit "341546f5ad6fce584531f744853a5807a140f2a9 Update some
> VFS documentation", so perhaps he might know? Neil?

I take as gospel this extract from Andrew's original 2.5.52 comment:

So the locking rules for writepage() are unchanged. They are:

- Called with the page locked
- Returns with the page unlocked
- Must redirty the page itself if it wasn't all written.

But there is a new, special, hidden, undocumented, secret hack for
tmpfs: writepage may return WRITEPAGE_ACTIVATE to tell the VM to move
the page to the active list. The page must be kept locked in this one
case.

Special, hidden, undocumented, secret hack! Then in 2.6.7 Andrew
stole his own secret and used it when concocting ramdisk_writepage.
Oh, and NFS made some kind of use of it in 2.6.6 only. Then Neil
revealed the secret to the uninitiated in 2.6.17: now, what's the
appropriate punishment for that?

In the full 2.5.52 comment, Andrew explains how prior to this secret
code, we used fail_writepage, which in the memory pressure case did
an activate_page, with the intention of moving the page to the active
list - but that didn't actually work, because the page is off the
LRUs at this point, being passed around between pagevecs.

I've always preferred the way it was originally trying to do it, which
seems clearer and less error-prone than having a special code which
people then have to worry about. Here's the patch I'd like to present
in due course (against 2.6.24-rc1, so unionfs absent): tmpfs and ramdisk
simply SetPageActive for this case (and go back to obeying the usual
unlocking rule for writepage), vmscan.c observe and act accordingly.

But I've not tested it at all (well, I've run with it in, but not
actually going down the paths in question): it may suffer from
something silly like the original fail_writepage. Plus I might be
persuaded into making inc_zone_page_state(page, NR_VMSCAN_WRITE)
conditional on !PageActive(page), just to produce the same stats
as before (though they don't make a lot of sense, counting other
non-writes as writes). And would it need a deprecation phase?

Hugh

Documentation/filesystems/Locking | 6 +-----
Documentation/filesystems/vfs.txt | 4 +---
drivers/block/rd.c | 5 ++---
include/linux/fs.h | 10 ----------
mm/migrate.c | 5 ++---
mm/page-writeback.c | 4 ----
mm/shmem.c | 11 ++++++++---
mm/vmscan.c | 17 ++++++-----------
8 files changed, 20 insertions(+), 42 deletions(-)

--- 2.6.24-rc1/Documentation/filesystems/Locking 2007-10-24 07:15:11.000000000 +0100
+++ linux/Documentation/filesystems/Locking 2007-10-24 08:42:07.000000000 +0100
@@ -228,11 +228,7 @@ If the filesystem is called for sync the
in-progress I/O and then start new I/O.

The filesystem should unlock the page synchronously, before returning to the
-caller, unless ->writepage() returns special WRITEPAGE_ACTIVATE
-value. WRITEPAGE_ACTIVATE means that page cannot really be written out
-currently, and VM should stop calling ->writepage() on this page for some
-time. VM does this by moving page to the head of the active list, hence the
-name.
+caller.

Unless the filesystem is going to redirty_page_for_writepage(), unlock the page
and return zero, writepage *must* run set_page_writeback() against the page,
--- 2.6.24-rc1/Documentation/filesystems/vfs.txt 2007-10-24 07:15:11.000000000 +0100
+++ linux/Documentation/filesystems/vfs.txt 2007-10-24 08:42:07.000000000 +0100
@@ -567,9 +567,7 @@ struct address_space_operations {
If wbc->sync_mode is WB_SYNC_NONE, ->writepage doesn't have to
try too hard if there are problems, and may choose to write out
other pages from the mapping if that is easier (e.g. due to
- internal dependencies). If it chooses not to start writeout, it
- should return AOP_WRITEPAGE_ACTIVATE so that the VM will not keep
- calling ->writepage on that page.
+ internal dependencies).

See the file "Locking" for more details.

--- 2.6.24-rc1/drivers/block/rd.c 2007-10-24 07:15:23.000000000 +0100
+++ linux/drivers/block/rd.c 2007-10-24 08:42:07.000000000 +0100
@@ -152,8 +152,7 @@ static int ramdisk_commit_write(struct f

/*
* ->writepage to the blockdev's mapping has to redirty the page so that the
- * VM doesn't go and steal it. We return AOP_WRITEPAGE_ACTIVATE so that the VM
- * won't try to (pointlessly) write the page again for a while.
+ * VM doesn't go and steal it.
*
* Really, these pages should not be on the LRU at all.
*/
@@ -163,7 +162,7 @@ static int ramdisk_writepage(struct page
make_page_uptodate(page);
SetPageDirty(page);
if (wbc->for_reclaim)
- return AOP_WRITEPAGE_ACTIVATE;
+ SetPageActive(page);
unlock_page(page);
return 0;
}
--- 2.6.24-rc1/include/linux/fs.h 2007-10-24 07:16:01.000000000 +0100
+++ linux/include/linux/fs.h 2007-10-24 08:42:07.000000000 +0100
@@ -368,15 +368,6 @@ struct iattr {
/**
* enum positive_aop_returns - aop return codes with specific semantics
*
- * @AOP_WRITEPAGE_ACTIVATE: Informs the caller that page writeback has
- * completed, that the page is still locked, and
- * should be considered active. The VM uses this hint
- * to return the page to the active list -- it won't
- * be a candidate for writeback again in the near
- * future. Other callers must be careful to unlock
- * the page if they get this return. Returned by
- * writepage();
- *
* @AOP_TRUNCATED_PAGE: The AOP method that was handed a locked page has
* unlocked it and the page might have been truncated.
* The caller should back up to acquiring a new page and
@@ -392,7 +383,6 @@ struct iattr {
*/

enum positive_aop_returns {
- AOP_WRITEPAGE_ACTIVATE = 0x80000,
AOP_TRUNCATED_PAGE = 0x80001,
};

--- 2.6.24-rc1/mm/migrate.c 2007-10-24 07:16:04.000000000 +0100
+++ linux/mm/migrate.c 2007-10-24 08:42:07.000000000 +0100
@@ -525,9 +525,8 @@ static int writeout(struct address_space
/* I/O Error writing */
return -EIO;

- if (rc != AOP_WRITEPAGE_ACTIVATE)
- /* unlocked. Relock */
- lock_page(page);
+ /* Unlocked: relock */
+ lock_page(page);

return -EAGAIN;
}
--- 2.6.24-rc1/mm/page-writeback.c 2007-10-24 07:16:04.000000000 +0100
+++ linux/mm/page-writeback.c 2007-10-24 08:42:07.000000000 +0100
@@ -850,10 +850,6 @@ retry:

ret = (*writepage)(page, wbc, data);

- if (unlikely(ret == AOP_WRITEPAGE_ACTIVATE)) {
- unlock_page(page);
- ret = 0;
- }
if (ret || (--(wbc->nr_to_write) <= 0))
done = 1;
if (wbc->nonblocking && bdi_write_congested(bdi)) {
--- 2.6.24-rc1/mm/shmem.c 2007-10-24 07:16:04.000000000 +0100
+++ linux/mm/shmem.c 2007-10-24 08:42:07.000000000 +0100
@@ -915,6 +915,8 @@ static int shmem_writepage(struct page *
struct inode *inode;

BUG_ON(!PageLocked(page));
+ if (!wbc->for_reclaim)
+ goto redirty;
BUG_ON(page_mapped(page));

mapping = page->mapping;
@@ -922,10 +924,10 @@ static int shmem_writepage(struct page *
inode = mapping->host;
info = SHMEM_I(inode);
if (info->flags & VM_LOCKED)
- goto redirty;
+ goto reactivate;
swap = get_swap_page();
if (!swap.val)
- goto redirty;
+ goto reactivate;

spin_lock(&info->lock);
shmem_recalc_inode(inode);
@@ -955,9 +957,12 @@ static int shmem_writepage(struct page *
unlock:
spin_unlock(&info->lock);
swap_free(swap);
+reactivate:
+ SetPageActive(page);
redirty:
set_page_dirty(page);
- return AOP_WRITEPAGE_ACTIVATE; /* Return with the page locked */
+ unlock_page(page);
+ return 0;
}

#ifdef CONFIG_NUMA
--- 2.6.24-rc1/mm/vmscan.c 2007-10-24 07:16:04.000000000 +0100
+++ linux/mm/vmscan.c 2007-10-24 08:42:07.000000000 +0100
@@ -281,8 +281,6 @@ enum pageout_io {
typedef enum {
/* failed to write page out, page is locked */
PAGE_KEEP,
- /* move page to the active list, page is locked */
- PAGE_ACTIVATE,
/* page has been sent to the disk successfully, page is unlocked */
PAGE_SUCCESS,
/* page is clean and locked */
@@ -329,8 +327,10 @@ static pageout_t pageout(struct page *pa
}
return PAGE_KEEP;
}
- if (mapping->a_ops->writepage == NULL)
- return PAGE_ACTIVATE;
+ if (mapping->a_ops->writepage == NULL) {
+ SetPageActive(page);
+ return PAGE_KEEP;
+ }
if (!may_write_to_queue(mapping->backing_dev_info))
return PAGE_KEEP;

@@ -349,10 +349,6 @@ static pageout_t pageout(struct page *pa
res = mapping->a_ops->writepage(page, &wbc);
if (res < 0)
handle_write_error(mapping, page, res);
- if (res == AOP_WRITEPAGE_ACTIVATE) {
- ClearPageReclaim(page);
- return PAGE_ACTIVATE;
- }

/*
* Wait on writeback if requested to. This happens when
@@ -538,8 +534,6 @@ static unsigned long shrink_page_list(st
switch (pageout(page, mapping, sync_writeback)) {
case PAGE_KEEP:
goto keep_locked;
- case PAGE_ACTIVATE:
- goto activate_locked;
case PAGE_SUCCESS:
if (PageWriteback(page) || PageDirty(page))
goto keep;
@@ -597,10 +591,11 @@ free_it:

activate_locked:
SetPageActive(page);
- pgactivate++;
keep_locked:
unlock_page(page);
keep:
+ if (PageActive(page))
+ pgactivate++;
list_add(&page->lru, &ret_pages);
VM_BUG_ON(PageLRU(page));
}

2007-10-25 16:03:00

by Erez Zadok

[permalink] [raw]
Subject: Re: [PATCH+comment] fix tmpfs BUG and AOP_WRITEPAGE_ACTIVATE

In message <[email protected]>, Hugh Dickins writes:
> On Thu, 25 Oct 2007, Pekka Enberg wrote:

> With unionfs also fixed, we don't know of an absolute need for this
> patch (and so, on that basis, the !wbc->for_reclaim case could indeed
> be removed very soon); but as I see it, the unionfs case has shown
> that it's time to future-proof this code against whatever stacking
> filesystems come along. Hence I didn't mention the names of such
> filesystems in the source comment.

I think "future proof" for other stackable f/s is a good idea, esp. since
many of the stackable f/s we've developed and distributed over the past 10
years are in some use in various places: gzipfs, avfs, tracefs, replayfs,
ncryptfs, versionfs, wrapfs, i3fs, and more (see http://www.filesystems.org).

Cheers,
Erez.

2007-10-25 16:48:24

by Hugh Dickins

[permalink] [raw]
Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On Mon, 22 Oct 2007, Erez Zadok wrote:
> In message <[email protected]>, Hugh Dickins writes:
> >
> > Only ramdisk and shmem have been returning AOP_WRITEPAGE_ACTIVATE.
> > Both of those set BDI_CAP_NO_WRITEBACK. ramdisk never returned it
> > if !wbc->for_reclaim. I contend that shmem shouldn't either: it's
> > a special code to get the LRU rotation right, not useful elsewhere.
> > Though Documentation/filesystems/vfs.txt does imply wider use.
>
> Yes, based on vfs.txt I figured unionfs should return
> AOP_WRITEPAGE_ACTIVATE.

unionfs_writepage returns it in two different cases: when it can't
find the underlying page; and when the underlying writepage returns
it. I'd say it's wrong to return it in both cases.

In the first case, you don't really want your page put back to the head
of the active list, you want to come back to try it again quite soon
(I think): so you should just redirty and unlock and pretend success.

ramdisk uses A_W_A because none of its pages will ever become freeable
(and its comment points out it'd be better if they weren't even on the
LRUs - I think several people have recently been putting forward
patches to keep such timewasters off the LRUs).

shmem uses A_W_A when there's no swap (left), or when the underlying
shm is marked as locked in memory: in each case, best to move on to
look for other pages to swap out. (But I'm not quite convincing myself
that the temporarily out-of-swap case is different from yours above.)
It's about fixing some horrid busy loops where vmscan kept going
over the same hopeless pages repeatedly, instead of moving on to
better candidates. Oh, there's a third case, when move_to_swap_cache
fails: that's rare, and I think I was just too lazy to separate them.

In your second case, I fail to see why the unionfs level should
mimic the lower level: you've successfully copied data and marked
the lower level pages as dirty, vmscan will come back to those in
due course, but it's just a waste of time for it to come back to
the unionfs pages again - isn't it?

> But, now that unionfs has ->writepages which won't
> even call the lower writepage if BDI_CAP_NO_WRITEBACK is on, then perhaps I
> no longer need unionfs_writepage to bother checking for
> AOP_WRITEPAGE_ACTIVATE, or even return it up?

unionfs_writepages handles the sync/msync/fsync leaking of A_W_A to
userspace issue, as does Pekka & Andrew's patch to write_cache_pages,
as does my patch to shmem_writepage. And I'm contending that
unionfs_writepage should in no case return A_W_A up.

But so long as A_W_A is still defined, unionfs_writepage does
still need to check for it after calling the lower level ->writepage
(because it needs to do the missing unlock_page): unionfs_writepages
prevents unionfs_writepage being called on the normal writeout path,
but it's still getting called by vmscan under memory pressure.

(I'm in the habit of saying "vmscan" rather than naming the functions
in question, because every few months someone restructures that file
and changes their names. I exaggerate, but it's happened often enough.)

> But, a future file system _could_ return AOP_WRITEPAGE_ACTIVATE w/o setting
> BDI_CAP_NO_WRITEBACK, right? In that case, unionfs will still need to
> handle AOP_WRITEPAGE_ACTIVATE in ->writepage, right?

For so long as AOP_WRITEPAGE_ACTIVATE exists, unionfs_writepage needs to
check for it coming from the lower level ->writepage, as I said above.

But your/Pekka's unionfs_writepages doesn't need to worry about it
at all, because Andrew/Pekka's write_cache_pages fix prevents it
leaking up in the !reclaim case (as does my shmem_writepage fix):
please remove that AOP_WRITEPAGE_ACTIVATE comment from unionfs_writepages.

Hugh

2007-10-25 16:49:01

by Erez Zadok

[permalink] [raw]
Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

That's a nice historical review, Hugh, of how we got into this mess we're in
now -- it all starts with good intentions. :-)

On a related note, I would just love to get rid of calling the lower
->writepage in unionfs b/c I can't even tell if I have a lower page to use
all the time. I'd prefer to call vfs_write() if I can, but I'll need a
struct file, or at least a dentry.

What ecryptfs does is store a struct file inside its inode, so it can use
it later in ->writepage to call vfs_write on the lower f/s. And Unionfs may
have to go in that direction too, but this trick is not terribly clean --
storing a file inside an inode.

I realize that the calling path to ->writepage doesn't have a file/dentry
any more, but if we're considering larger changes to the writepage-related
code, can we perhaps consider passing a file or dentry to ->writepage (same
as commit_write)?

Thanks,
Erez.

2007-10-25 18:04:17

by Hugh Dickins

[permalink] [raw]
Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On Mon, 22 Oct 2007, Erez Zadok wrote:
>
> What's the precise semantics of AOP_WRITEPAGE_ACTIVATE?

Sigh - not at you, at it! It's a secret that couldn't be kept secret,
a hack for tmpfs reclaim, let's just look forward to it going away.

> Is it considered an error or not?

No, it's definitely not an error. It's a private note from tmpfs
(or ramdisk) to vmscan, saying "don't waste your time coming back
to me with this page until you have to, please move on to another
more likely to be freeable".

> If it's an error, then I usually feel that it's important for
> a stacked f/s to return that error indication upwards.

Indeed, but this is not an error. Remember, neither ramdisk nor
tmpfs is stable storage: okay, tmpfs can go out to disk by using
swap, but that's not stable storage - it's not reconstituted after
reboot. (If there's an error in writing to swap, well, that's a
different issue; and there's few filesystems where such an I/O
error would be reported from ->writepage.)

>
> The unionfs page and the lower page are somewhat tied together, at least
> logically. For unionfs's page to be considered to have been written
> successfully, the lower page has to be written successfully. So again, if
> the lower f/s returns AOP_WRITEPAGE_ACTIVATE, should I consider my unionfs
> page to have been written successfully or not?

Consider it written successfully. (What does written mean with tmpfs?
it means a page can be freed, it doesn't mean the data is forever safe.)

> If I don't return
> AOP_WRITEPAGE_ACTIVATE up, can there be any chance that some vital data may
> never get flushed out?

Things should work better if you don't return AOP_WRITEPAGE_ACTIVATE.
If you mark your page as clean and successfully written, vmscan will
be able to free it. If needed, we can get the data back from the
lower page on demand, but meanwhile a page has been freed, which
is what vmscan reclaim is all about. (But of course, in the case
where you couldn't get hold of a page for the lower, you must redirty
yours before returning.)

> > unionfs_writepage also sets AOP_WRITEPAGE_ACTIVATE when it cannot
> > find_lock_page: that case may be appropriate. Though I don't really
> > understand it: seems dangerous to be relying upon the lower level page
> > just happening to be there already. Isn't memory pressure then likely
> > to clog up with lots of upper level dirty pages which cannot get
> > written out to the lower level?
>
> Based on vfs.txt (which perhaps should be revised :-), I was trying to do
> the best I can to ensure that no data is lost if the current page cannot be
> written out to the lower f/s.
>
> I used to do grab_cache_page() before, but that caused problems: writepage
> is not the right place to _increase_ memory pressure by allocating a new
> page...

Yes, but just hoping the lower page will be there, and doing nothing
to encourage it to become there, sounds an even poorer strategy to me.

It's not easy, I know. Your position reminds me of the loop driver
(drivers/block/loop.c), which has long handled this situation (with
great success, though I doubt an absolute guarantee) by taking
__GFP_IO|__GFP_FS off the mapping_gfp_mask of the underlying file:
look for gfp_mask in loop_set_fd() (and I think ignore do_loop_switch(),
that's new to me and seems to be for a very special case).

I grepped for gfp in unionfs, and there seems to be nothing: I doubt
you can be robust under memory pressure without doing something about
that. If you can take __GFP_IO|__GFP_FS off the lower mapping (just
while in unionfs_writepage, or longer term? what locking needed?),
then you should be able to go back to using grab_cache_page().

>
> One solution I thought of is do what ecryptfs does: keep an open struct file
> in my inode and call vfs_write(), but I don't see that as a significantly
> cleaner/better solution.

I agree with you.

> (BTW, ecryptfs kinda had to go for vfs_write b/c
> it changes the data size and content of what it writes below; unionfs is
> simpler in that manner b/c it needs to write the same data to the lower file
> at the same offset.)

Ah, yes.

>
> Another idea we've experimented with before is "page pointer flipping." In
> writepage, we temporarily set the page->mapping->host to the lower_inode;
> then we call the lower writepage with OUR page; then fix back the
> page->mapping->host to the upper inode. This had two benefits: first we can
> guarantee that we always have a page to write below, and second we don't
> need to keep both upper and lower pages (reduces memory pressure). Before
> we did this page pointer flipping, we verified that the page is locked so no
> other user could see page->mapping->host in this transient state,
> and we ensured that no lower f/s was somehow caching the temporarily changed
> value of page->mapping->host for later use. But, mucking with the pointers
> in this manner is kinda ugly, to say the least. Still, I'd love to find a
> clean and simple way that two layers can share the same struct page and
> cleanly pass the upper page to a lower f/s.

I wouldn't call it ugly, but it is exceptional and dangerous and cannot
be sanctioned without a great deal of thought; would very probably need
subtle or wide changes in core vfs/mm. shmem/tmpfs has given enough
trouble in the past with the way it switches page between filecache
and swapcache, and that imposes interesting limitations. We'd need
strong reasons (not for unionfs alone) to go down your page pointer
flipping route, but I wouldn't say it's forever out of the question.

My guess is it shouldn't flip, but that page->mapping should indicate
a list of different struct address_spaces.

The coherency benefit seems very appealing.

But more thought might prove it a nonsense.

>
> If you've got suggestions how I can handle unionfs_write more cleanly, or
> comments on the above possibilities, I'd love to hear them.

For now I think you should pursue the ~(__GFP_FS|__GFP_IO) idea somehow.

Hugh

2007-10-25 18:32:37

by Hugh Dickins

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On Thu, 25 Oct 2007, Erez Zadok wrote:
>
> On a related note, I would just love to get rid of calling the lower
> ->writepage in unionfs b/c I can't even tell if I have a lower page to use
> all the time. I'd prefer to call vfs_write() if I can, but I'll need a
> struct file, or at least a dentry.

Why do you want to do that? You gave a good reason why it's easier
for ecryptfs, but I doubt it's robust. The higher the level you
choose to use, the harder to guarantee it won't deadlock.

Or that's my gut feeling anyway. It's several years since I've
thought about such issues: just because I came into this knowing
about shmem_writepage, is perhaps not a good reason to choose me
as advisor!

Hugh

2007-10-25 20:53:36

by H. Peter Anvin

Subject: Re: [PATCH+comment] fix tmpfs BUG and AOP_WRITEPAGE_ACTIVATE

Erez Zadok wrote:
> In message <[email protected]>, Hugh Dickins writes:
>> On Thu, 25 Oct 2007, Pekka Enberg wrote:
>
>> With unionfs also fixed, we don't know of an absolute need for this
>> patch (and so, on that basis, the !wbc->for_reclaim case could indeed
>> be removed very soon); but as I see it, the unionfs case has shown
>> that it's time to future-proof this code against whatever stacking
>> filesystems come along. Hence I didn't mention the names of such
>> filesystems in the source comment.
>
> I think "future proof" for other stackable f/s is a good idea, esp. since
> many of the stackable f/s we've developed and distributed over the past 10
> years are in some use in various places: gzipfs, avfs, tracefs, replayfs,
> ncryptfs, versionfs, wrapfs, i3fs, and more (see http://www.filesystems.org).
>

A number of filesystems want partial or full stackability, so getting
rid of lack-of-stackability wherever it may be is highly valuable.

-hpa

2007-10-26 02:01:19

by NeilBrown

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On Thursday October 25, [email protected] wrote:
> On Mon, 22 Oct 2007, Pekka Enberg wrote:
> > On 10/22/07, Hugh Dickins <[email protected]> wrote:
> > > Only ramdisk and shmem have been returning AOP_WRITEPAGE_ACTIVATE.
> > > Both of those set BDI_CAP_NO_WRITEBACK. ramdisk never returned it
> > > if !wbc->for_reclaim. I contend that shmem shouldn't either: it's
> > > a special code to get the LRU rotation right, not useful elsewhere.
> > > Though Documentation/filesystems/vfs.txt does imply wider use.
> > >
> > > I think this is where people use the phrase "go figure" ;)
> >
> > Heh. As far as I can tell, the implication of "wider use" was added by
> > Neil in commit "341546f5ad6fce584531f744853a5807a140f2a9 Update some
> > VFS documentation", so perhaps he might know? Neil?

I just read the code, tried to understand it, translated that
understanding into English, and put that in vfs.txt.
It is very possible that what I wrote didn't match the intention of
the author, but it seemed to match the behaviour of the code.

The patch looks like it makes perfect sense to me.
Before the change, ->writepage could return AOP_WRITEPAGE_ACTIVATE
without unlocking the page, and this has precisely the effect of:
ClearPageReclaim(); (if the call path was through pageout)
SetPageActive(); (if the call was through shrink_page_list)
unlock_page();

With the patch, the ->writepage method does the SetPageActive and the
unlock_page, which on the whole seems cleaner.

We seem to have lost a call to ClearPageReclaim - I don't know if that
is significant.

>
> Special, hidden, undocumented, secret hack! Then in 2.6.7 Andrew
> stole his own secret and used it when concocting ramdisk_writepage.
> Oh, and NFS made some kind of use of it in 2.6.6 only. Then Neil
> revealed the secret to the uninitiated in 2.6.17: now, what's the
> appropriate punishment for that?

Surely the punishment should be for writing hidden undocumented hacks
in the first place! I vote we just make him maintainer for the whole
kernel - that will keep him so busy that he will never have a chance
to do it again :-)

> --- 2.6.24-rc1/Documentation/filesystems/vfs.txt 2007-10-24 07:15:11.000000000 +0100
> +++ linux/Documentation/filesystems/vfs.txt 2007-10-24 08:42:07.000000000 +0100
> @@ -567,9 +567,7 @@ struct address_space_operations {
> If wbc->sync_mode is WB_SYNC_NONE, ->writepage doesn't have to
> try too hard if there are problems, and may choose to write out
> other pages from the mapping if that is easier (e.g. due to
> - internal dependencies). If it chooses not to start writeout, it
> - should return AOP_WRITEPAGE_ACTIVATE so that the VM will not keep
> - calling ->writepage on that page.
> + internal dependencies).
>

It seems that the new requirement is that if the address_space
chooses not to write out the page, it should now call SetPageActive().
If that is the case, I think it should be explicit in the
documentation - please?

NeilBrown

2007-10-26 08:05:44

by Pekka Enberg

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

Hi Hugh,

On 10/25/07, Hugh Dickins <[email protected]> wrote:
> @@ -349,10 +349,6 @@ static pageout_t pageout(struct page *pa
> res = mapping->a_ops->writepage(page, &wbc);
> if (res < 0)
> handle_write_error(mapping, page, res);
> - if (res == AOP_WRITEPAGE_ACTIVATE) {
> - ClearPageReclaim(page);
> - return PAGE_ACTIVATE;
> - }

I don't see ClearPageReclaim added anywhere. Is that done on purpose?
Other than that, the patch looks good to me and I think we should
stick it into -mm to punish Andrew for his secret hack ;-).

Pekka

2007-10-26 08:09:34

by Pekka Enberg

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

Hi,

On 10/26/07, Neil Brown <[email protected]> wrote:
> It seems that the new requirement is that if the address_space
> chooses not to write out the page, it should now call SetPageActive().
> If that is the case, I think it should be explicit in the
> documentation - please?

Agreed.

2007-10-26 11:27:20

by Hugh Dickins

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On Fri, 26 Oct 2007, Neil Brown wrote:
> On Thursday October 25, [email protected] wrote:
>
> The patch looks like it makes perfect sense to me.

Great, thanks a lot for looking at it, Neil and Pekka.

> Before the change, ->writepage could return AOP_WRITEPAGE_ACTIVATE
> without unlocking the page, and this has precisely the effect of:
> ClearPageReclaim(); (if the call path was through pageout)
> SetPageActive(); (if the call was through shrink_page_list)
> unlock_page();
>
> With the patch, the ->writepage method does the SetPageActive and the
> unlock_page, which on the whole seems cleaner.
>
> We seem to have lost a call to ClearPageReclaim - I don't know if that
> is significant.

It doesn't show up in the diff at all, but pageout() already has
        if (!PageWriteback(page)) {
                /* synchronous write or broken a_ops? */
                ClearPageReclaim(page);
        }
which will clear it since we've never set PageWriteback.

I think no harm would come from leaving it set there, since it only
takes effect in end_page_writeback (its effect being to let the just
written page be moved to the end of the LRU, so that it will then
be soon reclaimed), and clear_page_dirty_for_io clears it before
coming down this way. But I'd never argue for that: I hate having
leftover flags hanging around outside the scope of their relevance.

> > Special, hidden, undocumented, secret hack! Then in 2.6.7 Andrew
> > stole his own secret and used it when concocting ramdisk_writepage.
> > Oh, and NFS made some kind of use of it in 2.6.6 only. Then Neil
> > revealed the secret to the uninitiated in 2.6.17: now, what's the
> > appropriate punishment for that?
>
> Surely the punishment should be for writing hidden undocumented hacks
> in the first place! I vote we just make him maintainer for the whole
> kernel - that will keep him so busy that he will never have a chance
> to do it again :-)

That is a splendid retort, which has won you absolution.
But it makes me a little sad: that smiley should be a weepy.

> > --- 2.6.24-rc1/Documentation/filesystems/vfs.txt 2007-10-24 07:15:11.000000000 +0100
> > +++ linux/Documentation/filesystems/vfs.txt 2007-10-24 08:42:07.000000000 +0100
> > @@ -567,9 +567,7 @@ struct address_space_operations {
> > If wbc->sync_mode is WB_SYNC_NONE, ->writepage doesn't have to
> > try too hard if there are problems, and may choose to write out
> > other pages from the mapping if that is easier (e.g. due to
> > - internal dependencies). If it chooses not to start writeout, it
> > - should return AOP_WRITEPAGE_ACTIVATE so that the VM will not keep
> > - calling ->writepage on that page.
> > + internal dependencies).
> >
>
> It seems that the new requirement is that if the address_space
> chooses not to write out the page, it should now call SetPageActive().
> If that is the case, I think it should be explicit in the
> documentation - please?

No, it's not the case; but you're right that I should add something
there, to put an end to the idea. It'll be something along the lines
of "You may notice shmem setting PageActive there, but please don't do
that; or if you insist, be sure never to do so in the !wbc->for_reclaim
case".

The PageActive thing is for when a filesystem regrets that it even
had a ->writepage (it replicates the behaviour of the writepage == NULL
case or the VM_LOCKED SWAP_FAIL case or the !add_to_swap case, delaying
the return of this page to writepage for as long as it can). It's done
in shmem_writepage because shm_lock (equivalent to VM_LOCKED) is only
discovered within that writepage, and no-swap is discovered there too.

ramdisk does it too: I've not tried to understand ramdisk as Nick and
Eric have, but it used to have no writepage, and would prefer to have
no writepage, but appears to need one for some PageUptodate reasons.

It's fairly normal for a filesystem to find that for some reason it
cannot carry out a writepage on this page right now (in the reclaim
case: the sync case demands action, IIRC); so it then simply does
set_page_dirty and unlock_page and returns "success".
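[Editor's sketch: as code, the pattern Hugh describes might look like this. Illustrative only; cannot_write_now() and do_real_writeout() are made-up names standing in for whatever condition and writeout path the filesystem actually has.]

```c
/* Sketch: a ->writepage that finds it cannot act on the page right
 * now (in the reclaim case) re-dirties it, unlocks it, and reports
 * success, so the VM simply comes back to it later. */
static int example_writepage(struct page *page, struct writeback_control *wbc)
{
        if (wbc->for_reclaim && cannot_write_now(page)) {
                set_page_dirty(page);   /* data preserved for a later pass */
                unlock_page(page);
                return 0;               /* "success": nothing was lost */
        }
        /* ... otherwise perform the real writeout ... */
        return do_real_writeout(page, wbc);
}
```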

I'll try to condense this down for the Doc when finalizing the patch;
which I've still not yet tested properly - thanks for the eyes, but
I can't submit it until I've checked in detail that it really gets
to do what we think it does.

Hugh

2007-10-27 20:48:25

by Erez Zadok

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

In message <[email protected]>, Hugh Dickins writes:
> On Mon, 22 Oct 2007, Erez Zadok wrote:
[...]
> > If you've got suggestions how I can handle unionfs_write more cleanly, or
> > comments on the above possibilities, I'd love to hear them.
>
> For now I think you should pursue the ~(__GFP_FS|__GFP_IO) idea somehow.
>
> Hugh

Hugh, thanks for the great explanations and suggestions (in multiple
emails). I'm going to test all of those soon.

Erez.

2007-10-28 20:23:51

by Erez Zadok

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

Hugh,

I took your advice regarding ~(__GFP_FS|__GFP_IO), AOP_WRITEPAGE_ACTIVATE,
and such. I revised my unionfs_writepage and unionfs_sync_page, and tested
it under memory pressure: I have a couple of live CDs that use tmpfs and can
deterministically reproduce the conditions resulting in A_W_A. I also went
back to using grab_cache_page but with the gfp_mask suggestions you made.

I'm happy to report that it all works great now! Below is the entirety of
the new unionfs_mmap and unionfs_sync_page code. I'd appreciate if you and
others can look it over and see if you find any problems.

Thanks,
Erez.


static int unionfs_writepage(struct page *page, struct writeback_control *wbc)
{
        int err = -EIO;
        struct inode *inode;
        struct inode *lower_inode;
        struct page *lower_page;
        char *kaddr, *lower_kaddr;
        struct address_space *mapping; /* lower inode mapping */
        gfp_t old_gfp_mask;

        inode = page->mapping->host;
        lower_inode = unionfs_lower_inode(inode);
        mapping = lower_inode->i_mapping;

        /*
         * find lower page (returns a locked page)
         *
         * We turn off __GFP_IO|__GFP_FS so as to prevent a deadlock under
         * memory pressure conditions.  This is similar to how the loop
         * driver behaves (see loop_set_fd in drivers/block/loop.c).
         * If we can't find the lower page, we redirty our page and return
         * "success" so that the VM will call us again in the (hopefully
         * near) future.
         */
        old_gfp_mask = mapping_gfp_mask(mapping);
        mapping_set_gfp_mask(mapping, old_gfp_mask & ~(__GFP_IO|__GFP_FS));

        lower_page = grab_cache_page(mapping, page->index);
        mapping_set_gfp_mask(mapping, old_gfp_mask);

        if (!lower_page) {
                err = 0;
                set_page_dirty(page);
                goto out;
        }

        /* get page address, and encode it */
        kaddr = kmap(page);
        lower_kaddr = kmap(lower_page);

        memcpy(lower_kaddr, kaddr, PAGE_CACHE_SIZE);

        kunmap(page);
        kunmap(lower_page);

        BUG_ON(!mapping->a_ops->writepage);

        /* call lower writepage (expects locked page) */
        clear_page_dirty_for_io(lower_page); /* emulate VFS behavior */
        err = mapping->a_ops->writepage(lower_page, wbc);

        /* b/c grab_cache_page locked it and ->writepage unlocks on success */
        if (err)
                unlock_page(lower_page);
        /* b/c grab_cache_page increased refcnt */
        page_cache_release(lower_page);

        if (err < 0) {
                ClearPageUptodate(page);
                goto out;
        }
        /*
         * Lower file systems, such as ramfs and tmpfs, may return
         * AOP_WRITEPAGE_ACTIVATE so that the VM won't try to (pointlessly)
         * write the page again for a while.  But those lower file systems
         * also set the page dirty bit back again.  Since we successfully
         * copied our page data to the lower page, then the VM will come
         * back to the lower page (directly) and try to flush it.  So we can
         * save the VM the hassle of coming back to our page and trying to
         * flush too.  Therefore, we don't re-dirty our own page, and we
         * don't return AOP_WRITEPAGE_ACTIVATE back to the VM (we consider
         * this a success).
         */
        if (err == AOP_WRITEPAGE_ACTIVATE)
                err = 0;

        /* all is well */
        SetPageUptodate(page);
        /* lower mtimes have changed: update ours */
        unionfs_copy_attr_times(inode);

        unlock_page(page);

out:
        return err;
}


static void unionfs_sync_page(struct page *page)
{
        struct inode *inode;
        struct inode *lower_inode;
        struct page *lower_page;
        struct address_space *mapping; /* lower inode mapping */
        gfp_t old_gfp_mask;

        inode = page->mapping->host;
        lower_inode = unionfs_lower_inode(inode);
        mapping = lower_inode->i_mapping;

        /*
         * Find lower page (returns a locked page).
         *
         * We turn off __GFP_IO|__GFP_FS so as to prevent a deadlock under
         * memory pressure conditions.  This is similar to how the loop
         * driver behaves (see loop_set_fd in drivers/block/loop.c).
         * If we can't find the lower page, we redirty our page and return
         * "success" so that the VM will call us again in the (hopefully
         * near) future.
         */
        old_gfp_mask = mapping_gfp_mask(mapping);
        mapping_set_gfp_mask(mapping, old_gfp_mask & ~(__GFP_IO|__GFP_FS));

        lower_page = grab_cache_page(mapping, page->index);
        mapping_set_gfp_mask(mapping, old_gfp_mask);

        if (!lower_page) {
                printk(KERN_ERR "unionfs: grab_cache_page failed\n");
                goto out;
        }

        /* do the actual sync */

        /*
         * XXX: can we optimize ala RAIF and set the lower page to be
         * discarded after a successful sync_page?
         */
        if (mapping && mapping->a_ops && mapping->a_ops->sync_page)
                mapping->a_ops->sync_page(lower_page);

        /* b/c grab_cache_page locked it */
        unlock_page(lower_page);
        /* b/c grab_cache_page increased refcnt */
        page_cache_release(lower_page);

out:
        return;
}

2007-10-29 20:34:59

by Hugh Dickins

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On Sun, 28 Oct 2007, Erez Zadok wrote:
>
> I took your advice regarding ~(__GFP_FS|__GFP_IO), AOP_WRITEPAGE_ACTIVATE,
> and such. I revised my unionfs_writepage and unionfs_sync_page, and tested
> it under memory pressure: I have a couple of live CDs that use tmpfs and can
> deterministically reproduce the conditions resulting in A_W_A. I also went
> back to using grab_cache_page but with the gfp_mask suggestions you made.
>
> I'm happy to report that it all works great now!

That's very encouraging...

> Below is the entirety of
> the new unionfs_mmap and unionfs_sync_page code. I'd appreciate if you and
> others can look it over and see if you find any problems.

... but still a few problems, I'm afraid.

The greatest problem is a tmpfs one, that would be for me to solve.
But first...

> static int unionfs_writepage(struct page *page, struct writeback_control *wbc)
> {
> int err = -EIO;
> struct inode *inode;
> struct inode *lower_inode;
> struct page *lower_page;
> char *kaddr, *lower_kaddr;
> struct address_space *mapping; /* lower inode mapping */
> gfp_t old_gfp_mask;
>
> inode = page->mapping->host;
> lower_inode = unionfs_lower_inode(inode);
> mapping = lower_inode->i_mapping;
>
> /*
> * find lower page (returns a locked page)
> *
> * We turn off __GFP_IO|__GFP_FS so as to prevent a deadlock under

On reflection, I think I went too far in asking you to mask off __GFP_IO.
Loop has to do so because it's a block device, down towards the IO layer;
but unionfs is a filesystem, so masking off __GFP_FS is enough to prevent
recursion into the FS layer with danger of deadlock, and leaving __GFP_IO
on gives a better chance of success.

> * memory pressure conditions. This is similar to how the loop
> * driver behaves (see loop_set_fd in drivers/block/loop.c).
> * If we can't find the lower page, we redirty our page and return
> * "success" so that the VM will call us again in the (hopefully
> * near) future.
> */
> old_gfp_mask = mapping_gfp_mask(mapping);
> mapping_set_gfp_mask(mapping, old_gfp_mask & ~(__GFP_IO|__GFP_FS));
>
> lower_page = grab_cache_page(mapping, page->index);
> mapping_set_gfp_mask(mapping, old_gfp_mask);

Hmm, several points on that.

When suggesting something like this, I did remark "what locking needed?".
You've got none: which is problematic if two stacked mounts are playing
with the same underlying file concurrently (yes, userspace would have
a data coherency problem in such a case, but the kernel still needs to
worry about its own internal integrity) - you'd be in danger of masking
(__GFP_IO|__GFP_FS) permanently off the underlying file; and furthermore,
losing error flags (AS_EIO, AS_ENOSPC) which share the same unsigned long.
Neither likely but both wrong.

See the comment on mapping_set_gfp_mask() in include/linux/pagemap.h:
* This is non-atomic. Only to be used before the mapping is activated.
Strictly speaking, I guess loop was a little bit guilty even when just
loop_set_fd() did it: the underlying mapping might already be active.
It appears to be just as guilty as you in its do_loop_switch() case
(done at BIO completion time), but that's for a LOOP_CHANGE_FD ioctl
which would only be expected to be called once, during installation;
whereas you're using mapping_set_gfp_mask here with great frequency.

Another point on this is: loop masks __GFP_IO|__GFP_FS off the file
for the whole duration while it is looped, whereas you're flipping it
just in this preliminary section of unionfs_writepage. I think you're
probably okay to be doing it only here within ->writepage: I think
loop covered every operation because it's at the block device level,
perhaps both reads and writes needed to serve reclaim at the higher
FS level; and also easier to do it once for all.

Are you wrong to be doing it only around the grab_cache_page,
leaving the lower level ->writepage further down unprotected?
Certainly doing it around the grab_cache_page is likely to be way
more important than around the ->writepage (but rather depends on
filesystem). And on reflection, I think that the lower filesystem's
writepage should already be using GFP_NOFS to avoid deadlocks in
any of its allocations when wbc->for_reclaim, so you should be
okay just masking off around the grab_cache_page.

(Actually, in the wbc->for_reclaim case, I think you don't really
need to call the lower level writepage at all. Just set_page_dirty
on the lower page, unlock it and return. In due course that memory
pressure which has called unionfs_writepage, will come around to the
lower level page and do writepage upon it. Whether that's a better
strategy or not, I do not know.)

There's an attractively simple answer to the mapping_set_gfp_mask
locking problem, if we're confident that it's only needed around
the grab_cache_page. Look at the declaration of grab_cache_page
in linux/pagemap.h: it immediately extracts the gfp_mask from the
mapping and passes that down to find_or_create_page, which doesn't
use the mapping's gfp_mask at all.

So, stop flipping and use find_or_create_page directly yourself.
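[Editor's sketch of the suggestion, mirroring what grab_cache_page() would have done internally: the caller computes the restricted mask itself and passes it down, rather than performing a non-atomic read-modify-write on the mapping's own mask.]

```c
/* Sketch: pass an explicitly masked gfp straight to
 * find_or_create_page() instead of flipping the mapping's mask.
 * find_or_create_page() returns the page locked, like
 * grab_cache_page(), but takes the gfp mask as an argument. */
gfp_t mask = mapping_gfp_mask(lower_mapping) & ~__GFP_FS;
struct page *lower_page =
        find_or_create_page(lower_mapping, page->index, mask);
```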

>
> if (!lower_page) {
> err = 0;
> set_page_dirty(page);

You need to unlock_page, don't you? Or move the "out" label up
before the unlock_page. There seems to have been confusion about
this even in the current 2.6.23-mm1 unionfs_writepage: the only
case in which a writepage returns with its page still locked is
that AOP_WRITEPAGE_ACTIVATE case we're going to get rid of.

> goto out;
> }
>
> /* get page address, and encode it */
> kaddr = kmap(page);
> lower_kaddr = kmap(lower_page);
>
> memcpy(lower_kaddr, kaddr, PAGE_CACHE_SIZE);
>
> kunmap(page);
> kunmap(lower_page);

Better to use kmap_atomic. unionfs_writepage cannot get called
at interrupt time, I see no reason to avoid KM_USER0 and KM_USER1:
therefore simply use copy_highpage(lower_page, page) and let it do
all the kmapping and copying.
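[Editor's sketch of the suggested replacement for the kmap/memcpy/kunmap sequence:]

```c
/* Sketch: copy_highpage() (from linux/highmem.h) does the
 * kmap_atomic/copy/kunmap_atomic dance internally, so the four
 * explicit kmap/kunmap calls and the memcpy collapse to one line. */
copy_highpage(lower_page, page);        /* args: destination, source */
```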

If PAGE_CACHE_SIZE ever diverges from PAGE_SIZE (e.g. Christoph
Lameter's variable page_cache_size patches), then yes, this
would need updating to a loop over several pages (or better,
linux/highmem.h should then provide a function to do it).

>
> BUG_ON(!mapping->a_ops->writepage);
>
> /* call lower writepage (expects locked page) */
> clear_page_dirty_for_io(lower_page); /* emulate VFS behavior */
> err = mapping->a_ops->writepage(lower_page, wbc);
>
> /* b/c grab_cache_page locked it and ->writepage unlocks on success */
> if (err)
> unlock_page(lower_page);

Another instance of that confusion: lower_page is already unlocked,
on success or failure; it's only the anomalous AOP_WRITEPAGE_ACTIVATE
case that leaves it locked.

> /* b/c grab_cache_page increased refcnt */
> page_cache_release(lower_page);
>
> if (err < 0) {
> ClearPageUptodate(page);

Page needs to be unlocked, whether here or at out.

> goto out;
> }
> /*
> * Lower file systems such as ramfs and tmpfs, may return
> * AOP_WRITEPAGE_ACTIVATE so that the VM won't try to (pointlessly)
> * write the page again for a while. But those lower file systems
> * also set the page dirty bit back again. Since we successfully
> * copied our page data to the lower page, then the VM will come
> * back to the lower page (directly) and try to flush it. So we can
> * save the VM the hassle of coming back to our page and trying to
> * flush too. Therefore, we don't re-dirty our own page, and we
> * don't return AOP_WRITEPAGE_ACTIVATE back to the VM (we consider
> * this a success).
> */
> if (err == AOP_WRITEPAGE_ACTIVATE)
> err = 0;

Right (once you've got the locking right).

>
> /* all is well */
> SetPageUptodate(page);
> /* lower mtimes has changed: update ours */
> unionfs_copy_attr_times(inode);
>
> unlock_page(page);
>
> out:
> return err;
> }
>
>
> static void unionfs_sync_page(struct page *page)
> {

I'm not going to comment much on your unionfs_sync_page: it looks
like a total misunderstanding of what sync_page does, assuming from
the name that it syncs the page in a fsync/msync/sync manner.

No, it would much better be named "unplug_page_io": please take a
look at sync_page() in mm/filemap.c, observe how it gets called
(via wait_on_page_bit) and what it ends up doing. (Don't pay much
attention to what Documentation/filesystems says about it, either!)

It's an odd business; I think Nick did have a patch to get rid of
it completely, which would be nice; but changes to unplugging I/O
(kicking off the I/O after saving up several requests to do all
together) can be a hang-prone business.

Do you need a unionfs_sync_page at all? I think not, since the
I/O, plugged or unplugged, is below your lower level filesystem.

But I started by mentioning a serious tmpfs problem. Now I've
persuaded you to go back to grab_cache_page/find_or_create_page,
I realize a nasty problem for tmpfs. Under memory pressure, you're
liable to be putting tmpfs file pages into the page cache at the
same time as they're already present but in disguise as swap cache
pages. Perhaps the solution will be quite simple (since you're
overwriting the whole page), but I do need to think about it.

Hugh

2007-10-31 23:53:42

by Erez Zadok

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland


Hi Hugh, I've addressed all of your concerns and am happy to report that the
newly revised unionfs_writepage works even better, including under my
memory-pressure conditions. To summarize my changes since the last time:

- I'm only masking __GFP_FS, not __GFP_IO
- using find_or_create_page to avoid locking issues around mapping mask
- handle for_reclaim case more efficiently
- using copy_highpage so we handle KM_USER*
- un/locking upper/lower page as/when needed
- updated comments to clarify what/why
- unionfs_sync_page: gone (yes, vfs.txt did confuse me, plus ecryptfs used
to have it)

Below is the newest version of unionfs_writepage. Let me know what you
think.

I have to say that with these changes, unionfs appears visibly faster under
memory pressure. I suspect the for_reclaim handling is probably the largest
contributor to this speedup.

Many thanks,
Erez.

//////////////////////////////////////////////////////////////////////////////

static int unionfs_writepage(struct page *page, struct writeback_control *wbc)
{
        int err = -EIO;
        struct inode *inode;
        struct inode *lower_inode;
        struct page *lower_page;
        struct address_space *lower_mapping; /* lower inode mapping */
        gfp_t mask;

        inode = page->mapping->host;
        lower_inode = unionfs_lower_inode(inode);
        lower_mapping = lower_inode->i_mapping;

        /*
         * find lower page (returns a locked page)
         *
         * We turn off __GFP_FS while we look for or create a new lower
         * page.  This prevents a recursion into the file system code, which
         * under memory pressure conditions could lead to a deadlock.  This
         * is similar to how the loop driver behaves (see loop_set_fd in
         * drivers/block/loop.c).  If we can't find the lower page, we
         * redirty our page and return "success" so that the VM will call us
         * again in the (hopefully near) future.
         */
        mask = mapping_gfp_mask(lower_mapping) & ~(__GFP_FS);
        lower_page = find_or_create_page(lower_mapping, page->index, mask);
        if (!lower_page) {
                err = 0;
                set_page_dirty(page);
                goto out;
        }

        /* copy page data from our upper page to the lower page */
        copy_highpage(lower_page, page);

        /*
         * Call lower writepage (expects locked page).  However, if we are
         * called with wbc->for_reclaim, then the VFS/VM just wants to
         * reclaim our page.  Therefore, we don't need to call the lower
         * ->writepage: just copy our data to the lower page (already done
         * above), then mark the lower page dirty and unlock it, and return
         * success.
         */
        if (wbc->for_reclaim) {
                set_page_dirty(lower_page);
                unlock_page(lower_page);
                goto out_release;
        }
        BUG_ON(!lower_mapping->a_ops->writepage);
        clear_page_dirty_for_io(lower_page); /* emulate VFS behavior */
        err = lower_mapping->a_ops->writepage(lower_page, wbc);
        if (err < 0) {
                ClearPageUptodate(page);
                goto out_release;
        }

        /*
         * Lower file systems, such as ramfs and tmpfs, may return
         * AOP_WRITEPAGE_ACTIVATE so that the VM won't try to (pointlessly)
         * write the page again for a while.  But those lower file systems
         * also set the page dirty bit back again.  Since we successfully
         * copied our page data to the lower page, then the VM will come
         * back to the lower page (directly) and try to flush it.  So we can
         * save the VM the hassle of coming back to our page and trying to
         * flush too.  Therefore, we don't re-dirty our own page, and we
         * never return AOP_WRITEPAGE_ACTIVATE back to the VM (we consider
         * this a success).
         *
         * We also unlock the lower page if the lower ->writepage returned
         * AOP_WRITEPAGE_ACTIVATE.  (This "anomalous" behaviour may be
         * addressed in future shmem/VM code.)
         */
        if (err == AOP_WRITEPAGE_ACTIVATE) {
                err = 0;
                unlock_page(lower_page);
        }

        /* all is well */
        SetPageUptodate(page);
        /* lower mtimes have changed: update ours */
        unionfs_copy_attr_times(inode);

out_release:
        /* b/c find_or_create_page increased refcnt */
        page_cache_release(lower_page);
out:
        /*
         * We unlock our page unconditionally, because we never return
         * AOP_WRITEPAGE_ACTIVATE.
         */
        unlock_page(page);
        return err;
}

//////////////////////////////////////////////////////////////////////////////

2007-11-05 15:42:59

by Hugh Dickins

Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

[Dave, I've Cc'ed you re handle_write_count_underflow, see below.]

On Wed, 31 Oct 2007, Erez Zadok wrote:
>
> Hi Hugh, I've addressed all of your concerns and am happy to report that the
> newly revised unionfs_writepage works even better, including under my
> memory-pressure conditions. To summarize my changes since the last time:
>
> - I'm only masking __GFP_FS, not __GFP_IO
> - using find_or_create_page to avoid locking issues around mapping mask
> - handle for_reclaim case more efficiently
> - using copy_highpage so we handle KM_USER*
> - un/locking upper/lower page as/when needed
> - updated comments to clarify what/why
> - unionfs_sync_page: gone (yes, vfs.txt did confuse me, plus ecryptfs used
> to have it)
>
> Below is the newest version of unionfs_writepage. Let me know what you
> think.
>
> I have to say that with these changes, unionfs appears visibly faster under
> memory pressure. I suspect the for_reclaim handling is probably the largest
> contributor to this speedup.

That's good news, and that unionfs_writepage looks good to me -
with three reservations I've not observed before.

One, I think you would be safer to do a set_page_dirty(lower_page)
before your clear_page_dirty_for_io(lower_page). I know that sounds
silly, but see Linus' "Yes, Virginia" comment in clear_page_dirty_for_io:
there's a lot of subtlety hereabouts, and I think you'd be mimicking the
usual path closer if you set_page_dirty first - there's nothing else
doing it on that lower_page, is there? I'm not certain that you need
to, but I think you'd do well to look into it and make up your own mind.

Two, I'm unsure of the way you're clearing or setting PageUptodate on
the upper page there. The rules for PageUptodate are fairly obvious
when reading, but when a write fails, it's not so obvious. Again, I'm
not saying what you've got is wrong (it may be unavoidable, to keep
synch between lower and upper), but it deserves a second thought.

Three, I believe you need to add a flush_dcache_page(lower_page)
after the copy_highpage(lower_page): some architectures will need
that to see the new data if they have lower_page mapped (though I
expect it's anyway shaky ground to be accessing through the lower
mount at the same time as modifying through the upper).

I've been trying this out on 2.6.23-mm1 with your 21 Oct 1-9/9
and your 2 Nov 1-8/8 patches applied (rejects being patches which
were already in 2.6.23-mm1). I was hoping to reproduce the
BUG_ON(entry->val) that I fear from shmem_writepage(), before
fixing it; but not seen that at all yet - that might be good
news, but it's more likely I just haven't tried hard enough yet.

For now I'm doing repeated make -j20 kernel builds, pushing into
swap, in a unionfs mount of just a single dir on tmpfs. This has
shown up several problems, two of which I've had to hack around to
get further.

The first: I very quickly hit "BUG: atomic counter underflow"
from -mm's i386 atomic_dec_and_test: from filp_close calling
unionfs_flush. I did a little test fork()ing while holding a file
open on unionfs, and indeed it appears that your totalopens code is
broken, being unaware of how fork() bumps up a file count without
an open. That's rather basic, I'm puzzled that this has remained
undiscovered until now - or perhaps it's just a recent addition.

It looked to me as if the totalopens count was about avoiding some
d_deleted processing in unionfs_flush, which actually should be left
until unionfs_release (and that your unionfs_flush ought to be calling
the lower level flush in all cases). To get going, I've been running
with the quick hack patch below: but I've spent very little time
thinking about it, plus it's a long time since I've done VFS stuff;
so that patch may be nothing but an embarrassment that reflects
neither your intentions nor the VFS rules! And it may itself be
responsible for the further problems I've seen.

The second problem was a hang: all cpus in handle_write_count_underflow
doing lock_and_coalesce_cpu_mnt_writer_counts: new -mm stuff from Dave
Hansen. At first I thought that was a locking problem in Dave's code,
but I now suspect it's that your unionfs reference counting is wrong
somewhere, and the error accumulates until __mnt_writers drops below
MNT_WRITER_UNDERFLOW_LIMIT, but the coalescence does nothing to help
and we're stuck in that loop. My even greater hack to solve that one
was to change Dave's "while" to "if"! Then indeed tests can run for
some while. As I say, my suspicion is that the actual error is within
unionfs (perhaps introduced by my first hack); but I hope Dave can
also make handle_write_count_underflow more robust, it's unfortunate
if refcount errors elsewhere first show up as a hang there.

I've had CONFIG_UNION_FS_DEBUG=y but will probably turn it off when
I come back to this, since it's rather noisy at present. I've not
checked whether its reports are peculiar to having tmpfs below or not.
I get lots of "unionfs: new lower inode mtime" reports on directories
(if there have been any on regular files, I've missed them in the
noise on directories); "unionfs: unhashed dentry being revalidated"s
(mostly or all directories again); "unionfs: saving rdstate with cookie"s.
After five hours hit "kernel BUG at fs/unionfs/fanout.h:128!".

But the first two problems probably make the rest uninteresting:
I'm hoping you can look at those and provide patches for them,
for now I'll switch away to other work.

Hugh

Dubious patch, Not-Signed-off-by: Nobody <[email protected]>

--- 2.6.23-mm1++/fs/unionfs/commonfops.c 2007-11-04 13:14:42.000000000 +0000
+++ linux/fs/unionfs/commonfops.c 2007-11-04 14:21:12.000000000 +0000
@@ -551,9 +551,6 @@ int unionfs_open(struct inode *inode, st
bstart = fbstart(file) = dbstart(dentry);
bend = fbend(file) = dbend(dentry);

- /* increment, so that we can flush appropriately */
- atomic_inc(&UNIONFS_I(dentry->d_inode)->totalopens);
-
/*
* open all directories and make the unionfs file struct point to
* these lower file structs
@@ -565,7 +562,6 @@ int unionfs_open(struct inode *inode, st

/* freeing the allocated resources, and fput the opened files */
if (err) {
- atomic_dec(&UNIONFS_I(dentry->d_inode)->totalopens);
for (bindex = bstart; bindex <= bend; bindex++) {
lower_file = unionfs_lower_file_idx(file, bindex);
if (!lower_file)
@@ -606,6 +602,7 @@ int unionfs_file_release(struct inode *i
struct unionfs_file_info *fileinfo;
struct unionfs_inode_info *inodeinfo;
struct super_block *sb = inode->i_sb;
+ struct dentry *dentry = file->f_path.dentry;
int bindex, bstart, bend;
int fgen, err = 0;

@@ -628,6 +625,7 @@ int unionfs_file_release(struct inode *i
bstart = fbstart(file);
bend = fbend(file);

+ unionfs_lock_dentry(dentry);
for (bindex = bstart; bindex <= bend; bindex++) {
lower_file = unionfs_lower_file_idx(file, bindex);

@@ -635,7 +633,15 @@ int unionfs_file_release(struct inode *i
fput(lower_file);
branchput(sb, bindex);
}
+
+ /* if there are no more refs to the dentry, dput it */
+ if (d_deleted(dentry)) {
+ dput(unionfs_lower_dentry_idx(dentry, bindex));
+ unionfs_set_lower_dentry_idx(dentry, bindex, NULL);
+ }
}
+ unionfs_unlock_dentry(dentry);
+
kfree(fileinfo->lower_files);
kfree(fileinfo->saved_branch_ids);

@@ -799,11 +805,6 @@ int unionfs_flush(struct file *file, fl_
goto out;
unionfs_check_file(file);

- if (!atomic_dec_and_test(&UNIONFS_I(dentry->d_inode)->totalopens))
- goto out;
-
- unionfs_lock_dentry(dentry);
-
bstart = fbstart(file);
bend = fbend(file);
for (bindex = bstart; bindex <= bend; bindex++) {
@@ -813,14 +814,7 @@ int unionfs_flush(struct file *file, fl_
lower_file->f_op->flush) {
err = lower_file->f_op->flush(lower_file, id);
if (err)
- goto out_lock;
-
- /* if there are no more refs to the dentry, dput it */
- if (d_deleted(dentry)) {
- dput(unionfs_lower_dentry_idx(dentry, bindex));
- unionfs_set_lower_dentry_idx(dentry, bindex,
- NULL);
- }
+ goto out;
}

}
@@ -830,8 +824,6 @@ int unionfs_flush(struct file *file, fl_
/* parent time could have changed too (async) */
unionfs_copy_attr_times(dentry->d_parent->d_inode);

-out_lock:
- unionfs_unlock_dentry(dentry);
out:
unionfs_read_unlock(dentry->d_sb);
unionfs_check_file(file);

2007-11-05 16:58:15

by Dave Hansen

[permalink] [raw]
Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On Mon, 2007-11-05 at 15:40 +0000, Hugh Dickins wrote:
> The second problem was a hang: all cpus in
> handle_write_count_underflow
> doing lock_and_coalesce_cpu_mnt_writer_counts: new -mm stuff from Dave
> Hansen. At first I thought that was a locking problem in Dave's code,
> but I now suspect it's that your unionfs reference counting is wrong
> somewhere, and the error accumulates until __mnt_writers drops below
> MNT_WRITER_UNDERFLOW_LIMIT, but the coalescence does nothing to help
> and we're stuck in that loop.

I've never actually seen this happen in practice, but I do know exactly
what you're talking about.

> but I hope Dave can
> also make handle_write_count_underflow more robust, it's unfortunate
> if refcount errors elsewhere first show up as a hang there.

Actually, I think your s/while/if/ change is probably a decent fix.
Barring any other races, that loop should always have made progress on
mnt->__mnt_writers the way it is written. If we get to:

> lock_and_coalesce_cpu_mnt_writer_counts();
----------------->HERE
> mnt_unlock_cpus();

and don't have a positive mnt->__mnt_writers, we know something is going
badly. We WARN_ON() there, which should at least give an earlier
warning that the system is not doing well. But it doesn't fix the
inevitable. Could you try the attached patch and see if it at least
warns you earlier?

I have a decent guess what the bug is, too. In the unionfs code:

> int init_lower_nd(struct nameidata *nd, unsigned int flags)
> {
> ...
> #ifdef ALLOC_LOWER_ND_FILE
> 	file = kzalloc(sizeof(struct file), GFP_KERNEL);
> 	if (unlikely(!file)) {
> 		err = -ENOMEM;
> 		break;	/* exit switch statement and thus return */
> 	}
> 	nd->intent.open.file = file;
> #endif /* ALLOC_LOWER_ND_FILE */

The r/o bind mount code will mnt_drop_write() on that file's f_vfsmnt at
__fput() time. Since that code never got a write on the mount, we'll
see an imbalance if the file was opened for a write. I don't see this
file's mnt set anywhere, so I'm not completely sure that this is it. In
any case, rolling your own 'struct file' without using alloc_file() and
friends is a no-no.

BTW, I have some "debugging" code in my latest set of patches that I
think should fix this kind of imbalance with the mnt->__mnt_writers().
It ensures that before we do that mnt_drop_write() at __fput() that we
absolutely did a mnt_want_write() at some point in the 'struct file's
life.

-- Dave

linux-2.6.git-dave/fs/namespace.c | 31 ++++++++++++++++++++++---------
linux-2.6.git-dave/include/linux/mount.h | 1 +
2 files changed, 23 insertions(+), 9 deletions(-)

diff -puN fs/namei.c~fix-naughty-loop fs/namei.c
diff -puN fs/namespace.c~fix-naughty-loop fs/namespace.c
--- linux-2.6.git/fs/namespace.c~fix-naughty-loop 2007-11-05 08:03:59.000000000 -0800
+++ linux-2.6.git-dave/fs/namespace.c 2007-11-05 08:35:06.000000000 -0800
@@ -225,16 +225,29 @@ static void lock_and_coalesce_cpu_mnt_wr
*/
static void handle_write_count_underflow(struct vfsmount *mnt)
{
- while (atomic_read(&mnt->__mnt_writers) <
- MNT_WRITER_UNDERFLOW_LIMIT) {
- /*
- * It isn't necessary to hold all of the locks
- * at the same time, but doing it this way makes
- * us share a lot more code.
- */
- lock_and_coalesce_cpu_mnt_writer_counts();
- mnt_unlock_cpus();
+ if (atomic_read(&mnt->__mnt_writers) >=
+ MNT_WRITER_UNDERFLOW_LIMIT)
+ return;
+ /*
+ * It isn't necessary to hold all of the locks
+ * at the same time, but doing it this way makes
+ * us share a lot more code.
+ */
+ lock_and_coalesce_cpu_mnt_writer_counts();
+ /*
+ * If coalescing the per-cpu writer counts did not
+ * get us back to a positive writer count, we have
+ * a bug.
+ */
+ if ((atomic_read(&mnt->__mnt_writers) < 0) &&
+ !(mnt->mnt_flags & MNT_IMBALANCED_WRITE_COUNT)) {
+ printk("leak detected on mount(%p) writers count: %d\n",
+ mnt, atomic_read(&mnt->__mnt_writers));
+ WARN_ON(1);
+ /* use the flag to keep the dmesg spam down */
+ mnt->mnt_flags |= MNT_IMBALANCED_WRITE_COUNT;
}
+ mnt_unlock_cpus();
}

/**
diff -puN include/linux/mount.h~fix-naughty-loop include/linux/mount.h
--- linux-2.6.git/include/linux/mount.h~fix-naughty-loop 2007-11-05 08:22:21.000000000 -0800
+++ linux-2.6.git-dave/include/linux/mount.h 2007-11-05 08:28:20.000000000 -0800
@@ -32,6 +32,7 @@ struct mnt_namespace;
#define MNT_READONLY 0x40 /* does the user want this to be r/o? */

#define MNT_SHRINKABLE 0x100
+#define MNT_IMBALANCED_WRITE_COUNT 0x200 /* just for debugging */

#define MNT_SHARED 0x1000 /* if the vfsmount is a shared mount */
#define MNT_UNBINDABLE 0x2000 /* if the vfsmount is a unbindable mount */
_


2007-11-05 19:04:24

by Hugh Dickins

[permalink] [raw]
Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On Mon, 5 Nov 2007, Dave Hansen wrote:
>
> Actually, I think your s/while/if/ change is probably a decent fix.

Any resemblance to a decent fix is purely coincidental.

> Barring any other races, that loop should always have made progress on
> mnt->__mnt_writers the way it is written. If we get to:
>
> > lock_and_coalesce_cpu_mnt_writer_counts();
> ----------------->HERE
> > mnt_unlock_cpus();
>
> and don't have a positive mnt->__mnt_writers, we know something is going
> badly. We WARN_ON() there, which should at least give an earlier
> warning that the system is not doing well. But it doesn't fix the
> inevitable. Could you try the attached patch and see if it at least
> warns you earlier?

Thanks, Dave, yes, that gives me a nice warning:

leak detected on mount(c25ebd80) writers count: -65537
WARNING: at fs/namespace.c:249 handle_write_count_underflow()
[<c0103486>] show_trace_log_lvl+0x1b/0x2e
[<c01034b6>] show_trace+0x16/0x1b
[<c0103589>] dump_stack+0x19/0x1e
[<c0171906>] handle_write_count_underflow+0x4c/0x60
[<c0171983>] mnt_drop_write+0x69/0x8e
[<c0160211>] __fput+0xff/0x162
[<c016010d>] fput+0x2e/0x33
[<c01b8f63>] unionfs_file_release+0xc2/0x1c5
[<c01601a1>] __fput+0x8f/0x162
[<c016010d>] fput+0x2e/0x33
[<c015ec9d>] filp_close+0x50/0x5d
[<c015ed1e>] sys_close+0x74/0xb4
[<c01026ce>] sysenter_past_esp+0x5f/0x85

and the test then goes quietly on its way instead of hanging. Though
I imagine, with your patch or mine, that it's then making an unfortunate
frequency of calls to lock_and_coalesce_longer_name_than_I_care_to_type
thereafter. But it's hardly your responsibility to optimize for bugs
elsewhere.

The 2.6.23-mm1 tree has MNT_USER at 0x200, so I adjusted your flag to
#define MNT_IMBALANCED_WRITE_COUNT 0x400 /* just for debugging */

>
> I have a decent guess what the bug is, too. In the unionfs code:

I'll let Erez take it from there...

Hugh

2007-11-09 02:52:52

by Erez Zadok

[permalink] [raw]
Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

In message <1194280730.6271.145.camel@localhost>, Dave Hansen writes:
> On Mon, 2007-11-05 at 15:40 +0000, Hugh Dickins wrote:
[...]
> I have a decent guess what the bug is, too. In the unionfs code:
>
> > int init_lower_nd(struct nameidata *nd, unsigned int flags)
> > {
> > ...
> > #ifdef ALLOC_LOWER_ND_FILE
> > 	file = kzalloc(sizeof(struct file), GFP_KERNEL);
> > 	if (unlikely(!file)) {
> > 		err = -ENOMEM;
> > 		break;	/* exit switch statement and thus return */
> > 	}
> > 	nd->intent.open.file = file;
> > #endif /* ALLOC_LOWER_ND_FILE */
>
> The r/o bind mount code will mnt_drop_write() on that file's f_vfsmnt at
> __fput() time. Since that code never got a write on the mount, we'll
> see an imbalance if the file was opened for a write. I don't see this
> file's mnt set anywhere, so I'm not completely sure that this is it. In
> any case, rolling your own 'struct file' without using alloc_file() and
> friends is a no-no.
[...]

This #ifdef'd code in unionfs is actually not enabled. I left it there as a
reminder of possible future things to come (esp. if nameidata gets split).
There's a related comment earlier in fs/unionfs/lookup.c:init_lower_nd()
that says:

#ifdef ALLOC_LOWER_ND_FILE
	/*
	 * XXX: one day we may need to have the lower return an open file
	 * for us.  It is not needed in 2.6.23-rc1 for nfs2/nfs3, but may
	 * very well be needed for nfs4.
	 */
	struct file *file;
#endif /* ALLOC_LOWER_ND_FILE */

In the interest of keeping unionfs as simple as I can, when I implemented
the whole "pass a lower nd" stuff, I left those bits of semi-experimental
#ifdef code for this lower file upon open-intent. It's not enabled and up
until now, it didn't seem to be needed.

Do you think unionfs has to start using this nd->intent.open.file stuff?

Thanks,
Erez.

2007-11-09 06:06:37

by Erez Zadok

[permalink] [raw]
Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

In message <[email protected]>, Hugh Dickins writes:
> [Dave, I've Cc'ed you re handle_write_count_underflow, see below.]
>
> On Wed, 31 Oct 2007, Erez Zadok wrote:
> >
> > Hi Hugh, I've addressed all of your concerns and am happy to report that the
> > newly revised unionfs_writepage works even better, including under my
> > memory-pressure conditions. To summarize my changes since the last time:
> >
> > - I'm only masking __GFP_FS, not __GFP_IO
> > - using find_or_create_page to avoid locking issues around mapping mask
> > - handle for_reclaim case more efficiently
> > - using copy_highpage so we handle KM_USER*
> > - un/locking upper/lower page as/when needed
> > - updated comments to clarify what/why
> > - unionfs_sync_page: gone (yes, vfs.txt did confuse me, plus ecryptfs used
> > to have it)
> >
> > Below is the newest version of unionfs_writepage. Let me know what you
> > think.
> >
> > I have to say that with these changes, unionfs appears visibly faster under
> > memory pressure. I suspect the for_reclaim handling is probably the largest
> > contributor to this speedup.
>
> That's good news, and that unionfs_writepage looks good to me -
> with three reservations I've not observed before.
>
> One, I think you would be safer to do a set_page_dirty(lower_page)
> before your clear_page_dirty_for_io(lower_page). I know that sounds
> silly, but see Linus' "Yes, Virginia" comment in clear_page_dirty_for_io:
> there's a lot of subtlety hereabouts, and I think you'd be mimicking the
> usual path closer if you set_page_dirty first - there's nothing else
> doing it on that lower_page, is there? I'm not certain that you need
> to, but I think you'd do well to look into it and make up your own mind.

Hugh, my code looks like:

	if (wbc->for_reclaim) {
		set_page_dirty(lower_page);
		unlock_page(lower_page);
		goto out_release;
	}
	BUG_ON(!lower_mapping->a_ops->writepage);
	clear_page_dirty_for_io(lower_page);	/* emulate VFS behavior */
	err = lower_mapping->a_ops->writepage(lower_page, wbc);

Do you mean I should set_page_dirty(lower_page) unconditionally before
clear_page_dirty_for_io? (I already do that in the 'if' statement above it.)

> Two, I'm unsure of the way you're clearing or setting PageUptodate on
> the upper page there. The rules for PageUptodate are fairly obvious
> when reading, but when a write fails, it's not so obvious. Again, I'm
> not saying what you've got is wrong (it may be unavoidable, to keep
> synch between lower and upper), but it deserves a second thought.

I looked at all mainline filesystems' ->writepage to see what, if any, they
do with their page's uptodate flag. Most f/s don't touch the flag one way
or another.

cifs_writepage sets the uptodate flag unconditionally: why?

ecryptfs_writepage has a legit reason: if encrypting the page failed, it
doesn't want anyone to use it, so it clears its page's uptodate flag (else
it sets it as uptodate).

hostfs_writepage clears pageuptodate if it failed to write_file(), which I'm
not sure if it makes sense or not.

ntfs_writepage goes as far as doing BUG_ON(!PageUptodate(page)) which
indicates to me that the page passed to ->writepage should always be
uptodate. Is that a fair statement?

smb_writepage pretty much unconditionally calls SetPageUptodate(page). Why?

Is there a reason smbfs and cifs both do this unconditionally? If so, then
why is ntfs calling BUG_ON if the page isn't uptodate? Either that BUG_ON
in ntfs is redundant, or cifs/smbfs's SetPageUptodate is redundant, but they
can't both be right.

And finally, unionfs clears the uptodate flag on error from the lower
->writepage, and otherwise sets the flag on success from the lower
->writepage. My gut feeling is that unionfs shouldn't change the page
uptodate flag at all: if the VFS passes unionfs_writepage a page which isn't
uptodate, then the VFS has a serious problem b/c it'd be asking a f/s to
write out a page which isn't up-to-date, right? Otherwise, whether
unionfs_writepage manages to write the lower page or not, why should that
invalidate the state of the unionfs page itself? Come to think of it, I
think clearing pageuptodate on error from ->writepage(lower_page) may be
bad. Imagine if after such a failed unionfs_writepage, I get a
unionfs_readpage: that ->readpage will get data from the lower f/s page and
copy it *over* the unionfs page, even if the upper page's data was more
recent prior to the failed call to unionfs_writepage. IOW, we could be
reverting a user-visible mmap'ed page to a previous on-disk version. What
do you think: could this happen? Anyway, I'll run some exhaustive testing
next and see what happens if I don't set/clear the uptodate flag in
unionfs_writepage.

> Three, I believe you need to add a flush_dcache_page(lower_page)
> after the copy_highpage(lower_page): some architectures will need
> that to see the new data if they have lower_page mapped (though I
> expect it's anyway shaky ground to be accessing through the lower
> mount at the same time as modifying through the upper).

OK.

> For now I'm doing repeated make -j20 kernel builds, pushing into
> swap, in a unionfs mount of just a single dir on tmpfs. This has
> shown up several problems, two of which I've had to hack around to
> get further.
[...]

Thanks. I'll look more closely into these issues and your patches, and post
my findings.

Erez.

2007-11-12 05:57:05

by Hugh Dickins

[permalink] [raw]
Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On Fri, 9 Nov 2007, Erez Zadok wrote:
> In message <[email protected]>, Hugh Dickins writes:
> >
> > One, I think you would be safer to do a set_page_dirty(lower_page)
> > before your clear_page_dirty_for_io(lower_page). I know that sounds
> > silly, but see Linus' "Yes, Virginia" comment in clear_page_dirty_for_io:
> > there's a lot of subtlety hereabouts, and I think you'd be mimicking the
> > usual path closer if you set_page_dirty first - there's nothing else
> > doing it on that lower_page, is there? I'm not certain that you need
> > to, but I think you'd do well to look into it and make up your own mind.
>
> Hugh, my code looks like:
>
> 	if (wbc->for_reclaim) {
> 		set_page_dirty(lower_page);
> 		unlock_page(lower_page);
> 		goto out_release;
> 	}
> 	BUG_ON(!lower_mapping->a_ops->writepage);
> 	clear_page_dirty_for_io(lower_page);	/* emulate VFS behavior */
> 	err = lower_mapping->a_ops->writepage(lower_page, wbc);
>
> Do you mean I should set_page_dirty(lower_page) unconditionally before
> clear_page_dirty_for_io? (I already do that in the 'if' statement above it.)

Yes. Whether you're wrong not to be doing that already, I've not checked;
but I think doing so will make unionfs safer against any future changes
in the relationship between set_page_dirty and clear_page_dirty_for_io.

For example, look at clear_page_dirty_for_io: it's decrementing some
statistics which __set_page_dirty_nobuffers increments. Does use of
unionfs (over some filesystems) make those numbers wrap increasingly
negative? Does adding this set_page_dirty(lower_page) correct that?
I suspect so, but may be wrong.

> > Two, I'm unsure of the way you're clearing or setting PageUptodate on
> > the upper page there. The rules for PageUptodate are fairly obvious
> > when reading, but when a write fails, it's not so obvious. Again, I'm
> > not saying what you've got is wrong (it may be unavoidable, to keep
> > synch between lower and upper), but it deserves a second thought.
>
> I looked at all mainline filesystems' ->writepage to see what, if any, they
> do with their page's uptodate flag. Most f/s don't touch the flag one way
> or another.

I'm not going to try and guess what assorted filesystems are up to,
and not all of them will be bugfree. The crucial point of PageUptodate
is that we insert a filesystem page into the page cache before it's had
any data read in: it needs to be !PageUptodate until the data is there,
and then marked PageUptodate to say the data is good and others can
start using it. See mm/filemap.c.

> And finally, unionfs clears the uptodate flag on error from the lower
> ->writepage, and otherwise sets the flag on success from the lower
> ->writepage. My gut feeling is that unionfs shouldn't change the page
> uptodate flag at all: if the VFS passes unionfs_writepage a page which isn't
> uptodate, then the VFS has a serious problem b/c it'd be asking a f/s to
> write out a page which isn't up-to-date, right? Otherwise, whether
> unionfs_writepage manages to write the lower page or not, why should that
> invalidate the state of the unionfs page itself? Come to think of it, I
> think clearing pageuptodate on error from ->writepage(lower_page) may be
> bad. Imagine if after such a failed unionfs_writepage, I get a
> unionfs_readpage: that ->readpage will get data from the lower f/s page and
> copy it *over* the unionfs page, even if the upper page's data was more
> recent prior to the failed call to unionfs_writepage. IOW, we could be
> reverting a user-visible mmap'ed page to a previous on-disk version. What
> do you think: could this happen? Anyway, I'll run some exhaustive testing
> next and see what happens if I don't set/clear the uptodate flag in
> unionfs_writepage.

That was my point, and I don't really have more to add. It's unusual
to do anything with PageUptodate when writing. By clearing it when the
lower level has an error, you're throwing away the changes already made
at the upper level. You might have some good reason for that, but it's
worth questioning. If you don't know why you're Set/Clear'ing it there,
better to just take that out.

Hugh

2007-11-12 17:03:14

by Hugh Dickins

[permalink] [raw]
Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On Fri, 9 Nov 2007, Erez Zadok wrote:
> In message <[email protected]>, Hugh Dickins writes:
>
> > Three, I believe you need to add a flush_dcache_page(lower_page)
> > after the copy_highpage(lower_page): some architectures will need
> > that to see the new data if they have lower_page mapped (though I
> > expect it's anyway shaky ground to be accessing through the lower
> > mount at the same time as modifying through the upper).
>
> OK.

While looking into something else entirely, I realize that _here_
you are missing a SetPageUptodate(lower_page): should go in after
the flush_dcache_page(lower_page) I'm suggesting. (Nick would argue
for some kind of barrier there too, but I don't think unionfs has a
special need to be ahead of the pack on that issue.)

Think about it:
when find_or_create_page has created a fresh page in the cache,
and you've just done copy_highpage to put the data into it, you
now need to mark it as Uptodate: otherwise a subsequent vfs_read
or whatever on the lower level will find that page !Uptodate and
read stale data back from disk instead of what you just copied in,
unless its dirtiness has got it written back to disk meanwhile.

Odd that that hasn't been noticed at all: I guess it may be hard
to get testing to reclaim lower/upper pages in such a way as to
test out such paths thoroughly.

Hugh

2007-11-13 10:20:14

by Erez Zadok

[permalink] [raw]
Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

In message <[email protected]>, Hugh Dickins writes:
> On Fri, 9 Nov 2007, Erez Zadok wrote:
> > In message <[email protected]>, Hugh Dickins writes:
> >
> > > Three, I believe you need to add a flush_dcache_page(lower_page)
> > > after the copy_highpage(lower_page): some architectures will need
> > > that to see the new data if they have lower_page mapped (though I
> > > expect it's anyway shaky ground to be accessing through the lower
> > > mount at the same time as modifying through the upper).
> >
> > OK.
>
> While looking into something else entirely, I realize that _here_
> you are missing a SetPageUptodate(lower_page): should go in after
> the flush_dcache_page(lower_page) I'm suggesting. (Nick would argue
> for some kind of barrier there too, but I don't think unionfs has a
> special need to be ahead of the pack on that issue.)
>
> Think about it:
> when find_or_create_page has created a fresh page in the cache,
> and you've just done copy_highpage to put the data into it, you
> now need to mark it as Uptodate: otherwise a subsequent vfs_read
> or whatever on the lower level will find that page !Uptodate and
> read stale data back from disk instead of what you just copied in,
> unless its dirtiness has got it written back to disk meanwhile.

Heh, funny you should mention this... A few days ago, while I was doing your other
recommended pageuptodate cleanups, I also added the same call to
SetPageUptodate(lower_page) as you suggested. I tested that change along w/
the other changes you suggested, and they all seem to work great all the way
from my 2.6.9 backport to 2.6.24-rc2 and -mm (modulo the fact that I had to
work around or fix more non-unionfs bugs in -mm than unionfs ones to get it
to work :-)

I posted all of these patches just now. You're CC'ed. Hopefully Andrew can
pull from my unionfs.git branch soon.

You also reported in your previous emails some hangs/oopses while doing make
-j 20 in unionfs on top of a single tmpfs, using -mm. After several days,
I've not been able to reproduce this w/ my latest set of patches. If you
can send me your .config and the specs on the h/w you're using (cpus, mem,
etc.), I'll see if I can find something similar to it on my end and run the
same tests.

Cheers,
Erez.

2007-11-17 21:30:56

by Hugh Dickins

[permalink] [raw]
Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

On Tue, 13 Nov 2007, Erez Zadok wrote:
>
> I posted all of these patches just now. You're CC'ed. Hopefully Andrew can
> pull from my unionfs.git branch soon.
>
> You also reported in your previous emails some hangs/oopses while doing make
> -j 20 in unionfs on top of a single tmpfs, using -mm. After several days,
> I've not been able to reproduce this w/ my latest set of patches. If you
> can send me your .config and the specs on the h/w you're using (cpus, mem,
> etc.), I'll see if I can find something similar to it on my end and run the
> same tests.

I'm glad to report that this unionfs, not the one in 2.6.24-rc2-mm1
but the one including those 9 patches you posted, now gets through
my testing with tmpfs without a problem. I do still get occasional
"unionfs: new lower inode mtime (bindex=0, name=<directory>)"
messages, but nothing worse seen yet: a big improvement.

I deceived myself for a while that the danger of shmem_writepage
hitting its BUG_ON(entry->val) was dealt with too; but that's wrong,
I must go back to working out an escape from that one (despite never
seeing it).

I did think you could clean up the doubled set_page_dirtys,
but it's of no consequence.

Hugh

--- 2.6.24-rc2-mm1+9/fs/unionfs/mmap.c 2007-11-17 12:23:30.000000000 +0000
+++ linux/fs/unionfs/mmap.c 2007-11-17 20:22:29.000000000 +0000
@@ -56,6 +56,7 @@ static int unionfs_writepage(struct page
copy_highpage(lower_page, page);
flush_dcache_page(lower_page);
SetPageUptodate(lower_page);
+ set_page_dirty(lower_page);

/*
* Call lower writepage (expects locked page). However, if we are
@@ -66,12 +67,11 @@ static int unionfs_writepage(struct page
* success.
*/
if (wbc->for_reclaim) {
- set_page_dirty(lower_page);
unlock_page(lower_page);
goto out_release;
}
+
BUG_ON(!lower_mapping->a_ops->writepage);
- set_page_dirty(lower_page);
clear_page_dirty_for_io(lower_page); /* emulate VFS behavior */
err = lower_mapping->a_ops->writepage(lower_page, wbc);
if (err < 0)

2007-11-20 01:36:49

by Erez Zadok

[permalink] [raw]
Subject: Re: msync(2) bug(?), returns AOP_WRITEPAGE_ACTIVATE to userland

In message <[email protected]>, Hugh Dickins writes:
> On Tue, 13 Nov 2007, Erez Zadok wrote:
[...]
> I'm glad to report that this unionfs, not the one in 2.6.24-rc2-mm1
> but the one including those 9 patches you posted, now gets through
> my testing with tmpfs without a problem. I do still get occasional
> "unionfs: new lower inode mtime (bindex=0, name=<directory>)"
> messages, but nothing worse seen yet: a big improvement.

Excellent.

> I did think you could clean up the doubled set_page_dirtys,
> but it's of no consequence.

Yes, looks good. I'll send that as a patch. Thanks.

> Hugh
>
> --- 2.6.24-rc2-mm1+9/fs/unionfs/mmap.c 2007-11-17 12:23:30.000000000 +0000
> +++ linux/fs/unionfs/mmap.c 2007-11-17 20:22:29.000000000 +0000
> @@ -56,6 +56,7 @@ static int unionfs_writepage(struct page
> copy_highpage(lower_page, page);
> flush_dcache_page(lower_page);
> SetPageUptodate(lower_page);
> + set_page_dirty(lower_page);
>
> /*
> * Call lower writepage (expects locked page). However, if we are
> @@ -66,12 +67,11 @@ static int unionfs_writepage(struct page
> * success.
> */
> if (wbc->for_reclaim) {
> - set_page_dirty(lower_page);
> unlock_page(lower_page);
> goto out_release;
> }
> +
> BUG_ON(!lower_mapping->a_ops->writepage);
> - set_page_dirty(lower_page);
> clear_page_dirty_for_io(lower_page); /* emulate VFS behavior */
> err = lower_mapping->a_ops->writepage(lower_page, wbc);
> if (err < 0)

Erez.