2019-10-06 22:23:23

by Guenter Roeck

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sat, May 21, 2016 at 09:59:07PM -0700, Linus Torvalds wrote:
> We really should avoid the "__{get,put}_user()" functions entirely,
> because they can easily be mis-used and the original intent of being
> used for simple direct user accesses no longer holds in a post-SMAP/PAN
> world.
>
> Manually optimizing away the user access range check makes no sense any
> more, when the range check is generally much cheaper than the "enable
> user accesses" code that the __{get,put}_user() functions still need.
>
> So instead of __put_user(), use the unsafe_put_user() interface with
> user_access_{begin,end}() that really does generate better code these
> days, and which is generally a nicer interface. Under some loads, the
> multiple user writes that filldir() does are actually quite noticeable.
>
> This also makes the dirent name copy use unsafe_put_user() with a couple
> of macros. We do not want to make function calls with SMAP/PAN
> disabled, and the code this generates is quite good when the
> architecture uses "asm goto" for unsafe_put_user() like x86 does.
>
> Note that this doesn't bother with the legacy cases. Nobody should use
> them anyway, so performance doesn't really matter there.
>
> Signed-off-by: Linus Torvalds <[email protected]>

Linus,

this patch causes all my sparc64 emulations to stall during boot. It causes
all alpha emulations to crash with [1a] and [1b] when booting from a virtual
disk, and one of the xtensa emulations to crash with [2].

Reverting this patch fixes the problem.

Guenter

---
[1a]

Unable to handle kernel paging request at virtual address 0000000000000004
rcS(47): Oops -1
pc = [<0000000000000004>] ra = [<fffffc00004512e4>] ps = 0000 Not tainted
pc is at 0x4
ra is at filldir64+0x64/0x320
v0 = 0000000000000000 t0 = 0000000000000000 t1 = 0000000120117e8b
t2 = 646e617275303253 t3 = 646e617275300000 t4 = 0000000000007fe8
t5 = 0000000120117e78 t6 = 0000000000000000 t7 = fffffc0007ec8000
s0 = fffffc0007dbca56 s1 = 000000000000000a s2 = 0000000000000020
s3 = fffffc0007ecbec8 s4 = 0000000000000008 s5 = 0000000000000021
s6 = 1cd2631fe897bf5a
a0 = fffffc0007dbca56 a1 = 2f2f2f2f2f2f2f2f a2 = 000000000000000a
a3 = 1cd2631fe897bf5a a4 = 0000000000000021 a5 = 0000000000000008
t8 = 0000000000000020 t9 = 0000000000000000 t10= fffffc0007dbca60
t11= 0000000000000001 pv = fffffc0000b9a810 at = 0000000000000001
gp = fffffc0000f03930 sp = (____ptrval____)
Disabling lock debugging due to kernel taint
Trace:
[<fffffc00004e7a08>] call_filldir+0xe8/0x1b0
[<fffffc00004e8684>] ext4_readdir+0x924/0xa70
[<fffffc0000ba3088>] _raw_spin_unlock+0x18/0x30
[<fffffc00003f751c>] __handle_mm_fault+0x9fc/0xc30
[<fffffc0000450c68>] iterate_dir+0x198/0x240
[<fffffc0000450b2c>] iterate_dir+0x5c/0x240
[<fffffc00004518b8>] ksys_getdents64+0xa8/0x160
[<fffffc0000451990>] sys_getdents64+0x20/0x40
[<fffffc0000451280>] filldir64+0x0/0x320
[<fffffc0000311634>] entSys+0xa4/0xc0

---
[1b]

Unable to handle kernel paging request at virtual address 0000000000000004
reboot(50): Oops -1
pc = [<0000000000000004>] ra = [<fffffc00004512e4>] ps = 0000 Tainted: G D
pc is at 0x4
ra is at filldir64+0x64/0x320
v0 = 0000000000000000 t0 = 0000000067736d6b t1 = 000000012011445b
t2 = 0000000000000000 t3 = 0000000000000000 t4 = 0000000000007ef8
t5 = 0000000120114448 t6 = 0000000000000000 t7 = fffffc0007eec000
s0 = fffffc000792b5c3 s1 = 0000000000000004 s2 = 0000000000000018
s3 = fffffc0007eefec8 s4 = 0000000000000008 s5 = 00000000f00000a3
s6 = 000000000000000b
a0 = fffffc000792b5c3 a1 = 2f2f2f2f2f2f2f2f a2 = 0000000000000004
a3 = 000000000000000b a4 = 00000000f00000a3 a5 = 0000000000000008
t8 = 0000000000000018 t9 = 0000000000000000 t10= 0000000022e1d02a
t11= 000000011f8fd3b8 pv = fffffc0000b9a810 at = 0000000022e1ccf8
gp = fffffc0000f03930 sp = (____ptrval____)
Trace:
[<fffffc00004ccba0>] proc_readdir_de+0x170/0x300
[<fffffc0000451280>] filldir64+0x0/0x320
[<fffffc00004c565c>] proc_root_readdir+0x3c/0x80
[<fffffc0000450c68>] iterate_dir+0x198/0x240
[<fffffc00004518b8>] ksys_getdents64+0xa8/0x160
[<fffffc0000451990>] sys_getdents64+0x20/0x40
[<fffffc0000451280>] filldir64+0x0/0x320
[<fffffc0000311634>] entSys+0xa4/0xc0

---
[2]

Unable to handle kernel paging request at virtual address 0000000000000004
reboot(50): Oops -1
pc = [<0000000000000004>] ra = [<fffffc00004512e4>] ps = 0000 Tainted: G D
pc is at 0x4
ra is at filldir64+0x64/0x320
v0 = 0000000000000000 t0 = 0000000067736d6b t1 = 000000012011445b
t2 = 0000000000000000 t3 = 0000000000000000 t4 = 0000000000007ef8
t5 = 0000000120114448 t6 = 0000000000000000 t7 = fffffc0007eec000
s0 = fffffc000792b5c3 s1 = 0000000000000004 s2 = 0000000000000018
s3 = fffffc0007eefec8 s4 = 0000000000000008 s5 = 00000000f00000a3
s6 = 000000000000000b
a0 = fffffc000792b5c3 a1 = 2f2f2f2f2f2f2f2f a2 = 0000000000000004
a3 = 000000000000000b a4 = 00000000f00000a3 a5 = 0000000000000008
t8 = 0000000000000018 t9 = 0000000000000000 t10= 0000000022e1d02a
t11= 000000011fd6f3b8 pv = fffffc0000b9a810 at = 0000000022e1ccf8
gp = fffffc0000f03930 sp = (____ptrval____)
Trace:
[<fffffc00004ccba0>] proc_readdir_de+0x170/0x300
[<fffffc0000451280>] filldir64+0x0/0x320
[<fffffc00004c565c>] proc_root_readdir+0x3c/0x80
[<fffffc0000450c68>] iterate_dir+0x198/0x240
[<fffffc00004518b8>] ksys_getdents64+0xa8/0x160
[<fffffc0000451990>] sys_getdents64+0x20/0x40
[<fffffc0000451280>] filldir64+0x0/0x320
[<fffffc0000311634>] entSys+0xa4/0xc0

Code:
00000000
00063301
000007a3
00001111
00003f64

Segmentation fault


2019-10-06 23:07:31

by Linus Torvalds

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 6, 2019 at 3:20 PM Guenter Roeck <[email protected]> wrote:
>
> this patch causes all my sparc64 emulations to stall during boot. It causes
> all alpha emulations to crash with [1a] and [1b] when booting from a virtual
> disk, and one of the xtensa emulations to crash with [2].

Ho humm. I've run variations of that patch over a few years on x86,
but obviously not on alpha/sparc.

At least I should still be able to read alpha assembly, even after all
these years. Would you mind sending me the result of

make fs/readdir.s

on alpha with the broken config? I'd hope that the sparc issue is the same.

Actually, could you also do

make fs/readdir.o

and then send me the "objdump --disassemble" of that? That way I get
the instruction offsets without having to count by hand.

> Unable to handle kernel paging request at virtual address 0000000000000004
> rcS(47): Oops -1
> pc = [<0000000000000004>] ra = [<fffffc00004512e4>] ps = 0000 Not tainted
> pc is at 0x4

That is _funky_. I'm not seeing how it could possibly jump to 0x4, but
it clearly does.

That said, are you sure it's _that_ commit? Because this pattern:

> a0 = fffffc0007dbca56 a1 = 2f2f2f2f2f2f2f2f a2 = 000000000000000a

implicates the memchr('/') call in the next one. That's a word full of
'/' characters.

Of course, it could just be left-over register contents from that
memchr(), but it makes me wonder. Particularly since it seems to
happen early in filldir64():

> ra is at filldir64+0x64/0x320

which is just a fairly small handful of instructions in, and I
wouldn't be shocked if that's the return address for the call to
memchr.

Linus

2019-10-06 23:37:28

by Linus Torvalds

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 6, 2019 at 4:06 PM Linus Torvalds
<[email protected]> wrote:
>
> Ho humm. I've run variations of that patch over a few years on x86,
> but obviously not on alpha/sparc.

Oooh.

I wonder... This may be the name string copy loop. And it's special in
that the result may not be aligned.

Now, a "__put_user()" with an unaligned address _should_ work - it's
very easy to trigger that from user space by just giving an unaligned
address to any system call that then writes a single word.

But alpha does

#define __put_user_32(x, addr)					\
	__asm__ __volatile__("1:	stl %r2,%1\n"		\
			     "2:\n"				\
			     EXC(1b,2b,$31,%0)			\
			     : "=r"(__pu_err)		\
			     : "m"(__m(addr)), "rJ"(x), "0"(__pu_err))

iow it implements that 32-bit __put_user() as a 'stl'.

Which will trap if it's not aligned.

And I wonder how much testing that has ever gotten. Nobody really does
unaligned accesses on alpha.

We need to do that memcpy unrolling on x86, because x86 actually uses
"user_access_begin()" and we have magic rules about what is inside
that region.

But on alpha (and sparc) it might be better to just do "__copy_to_user()".

Anyway, this does look like a possible latent bug where the alpha
unaligned trap doesn't then handle the case of exceptions. I know it
_tries_, but I doubt it's gotten a whole lot of testing.

Anyway, let me think about this, but just for testing, does the
attached patch make any difference? It's not the right thing in
general (and most definitely not on x86), but for testing whether this
is about unaligned accesses it might work.

It's entirely untested, and in fact on x86 it should cause objtool to
complain about a function call with AC set. But I think that on alpha
and sparc, using __copy_to_user() for the name copy should work, and
would work around the unaligned issue.

That said, if it *is* the unaligned issue, then that just means that
we have a serious bug elsewhere in the alpha port. Maybe nobody cares.

Linus


Attachments:
patch.diff (658.00 B)

2019-10-07 00:05:06

by Guenter Roeck

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On 10/6/19 4:35 PM, Linus Torvalds wrote:
[ ... ]

> Anyway, let me think about this, but just for testing, does the
> attached patch make any difference? It's not the right thing in
> general (and most definitely not on x86), but for testing whether this
> is about unaligned accesses it might work.
>

All my alpha, sparc64, and xtensa tests pass with the attached patch
applied on top of v5.4-rc2. I didn't test any others.

I'll (try to) send you some disassembly next.

Guenter

2019-10-07 00:24:01

by Guenter Roeck

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 06, 2019 at 04:06:16PM -0700, Linus Torvalds wrote:
> On Sun, Oct 6, 2019 at 3:20 PM Guenter Roeck <[email protected]> wrote:
> >
> > this patch causes all my sparc64 emulations to stall during boot. It causes
> > all alpha emulations to crash with [1a] and [1b] when booting from a virtual
> > disk, and one of the xtensa emulations to crash with [2].
>
> Ho humm. I've run variations of that patch over a few years on x86,
> but obviously not on alpha/sparc.
>
> At least I should still be able to read alpha assembly, even after all
> these years. Would you mind sending me the result of
>
> make fs/readdir.s
>
> on alpha with the broken config? I'd hope that the sparc issue is the same.
>
> Actually, could you also do
>
> make fs/readdir.o
>
> and then send me the "objdump --disassemble" of that? That way I get
> the instruction offsets without having to count by hand.
>

Both attached for alpha.

> > Unable to handle kernel paging request at virtual address 0000000000000004
> > rcS(47): Oops -1
> > pc = [<0000000000000004>] ra = [<fffffc00004512e4>] ps = 0000 Not tainted
> > pc is at 0x4
>
> That is _funky_. I'm not seeing how it could possibly jump to 0x4, but
> it clearly does.
>
> That said, are you sure it's _that_ commit? Because this pattern:
>
Bisect on sparc pointed to this commit, and re-running the tests with
the commit reverted passed for all architectures. I didn't check any
further.

Please let me know if you need anything else at this point.

Thanks,
Guenter


Attachments:
(No filename) (1.54 kB)
readdir.s (65.86 kB)
readdir.s.objdump (32.19 kB)

2019-10-07 01:18:18

by Linus Torvalds

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 6, 2019 at 5:04 PM Guenter Roeck <[email protected]> wrote:
>
> All my alpha, sparc64, and xtensa tests pass with the attached patch
> applied on top of v5.4-rc2. I didn't test any others.

Okay... I really wish my guess had been wrong.

Because fixing filldir64 isn't the problem. I can come up with
multiple ways to avoid the unaligned issues if that was the problem.

But it does look to me like the fundamental problem is that unaligned
__put_user() calls might just be broken on alpha (and likely sparc
too). Because that looks to be the only difference between the
__copy_to_user() approach and using unsafe_put_user() in a loop.

Now, I should have handled unaligned things differently in the first
place, and in that sense I think commit 9f79b78ef744 ("Convert
filldir[64]() from __put_user() to unsafe_put_user()") really is
non-optimal on architectures with alignment issues.

And I'll fix it.

But at the same time, the fact that "non-optimal" turns into "doesn't
work" is a fairly nasty issue.

> I'll (try to) send you some disassembly next.

Thanks, verified.

The "ra is at filldir64+0x64/0x320" is indeed right at the return
point of the "jsr verify_dirent_name".

But the problem isn't there - that's just left-over state. I'm pretty
sure that function worked fine, and returned.

The problem is that "pc is at 0x4" and the page fault that then
happens at that address as a result.

And that seems to be due to this:

8c0: 00 00 29 2c ldq_u t0,0(s0)
8c4: 07 00 89 2c ldq_u t3,7(s0)
8c8: 03 04 e7 47 mov t6,t2
8cc: c1 06 29 48 extql t0,s0,t0
8d0: 44 0f 89 48 extqh t3,s0,t3
8d4: 01 04 24 44 or t0,t3,t0
8d8: 00 00 22 b4 stq t0,0(t1)

that's the "get_unaligned((type *)src)" (the first six instructions)
followed by the "unsafe_put_user()" done with a single "stq". That's
the guts of the unsafe_copy_loop() as part of
unsafe_copy_dirent_name()

And what I think happens is that it is writing to user memory that is

(a) unaligned

(b) not currently mapped in user space

so then the do_entUna() function tries to handle the unaligned trap,
but then it takes an exception while doing that (due to the unmapped
page), and then something in that nested exception mess causes it to
mess up badly and corrupt the register contents on stack, and it
returns with garbage in 'pc', and then you finally die with that

Unable to handle kernel paging request at virtual address 0000000000000004
pc is at 0x4

thing.

And yes, I'll fix that name copy loop in filldir to align the
destination first, *but* if I'm right, it means that something like
this should also likely cause issues:

#define _GNU_SOURCE
#include <unistd.h>
#include <sys/mman.h>

int main(int argc, char **argv)
{
	void *mymap;
	uid_t *bad_ptr = (void *) 0x01;

	/* Create unpopulated memory area */
	mymap = mmap(NULL, 16384, PROT_READ | PROT_WRITE,
		     MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);

	/* Unaligned uid pointer in that memory area */
	bad_ptr = mymap + 1;

	/* Make the kernel do put_user() on it */
	return getresuid(bad_ptr, bad_ptr + 1, bad_ptr + 2);
}

because that simple user mode program should cause that same "page
fault on unaligned put_user()" behavior as far as I can tell.

Mind humoring me and trying that on your alpha machine (or emulator,
or whatever)?

Linus

2019-10-07 01:26:11

by Al Viro

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 06, 2019 at 06:17:02PM -0700, Linus Torvalds wrote:
> On Sun, Oct 6, 2019 at 5:04 PM Guenter Roeck <[email protected]> wrote:
> >
> > All my alpha, sparc64, and xtensa tests pass with the attached patch
> > applied on top of v5.4-rc2. I didn't test any others.
>
> Okay... I really wish my guess had been wrong.
>
> Because fixing filldir64 isn't the problem. I can come up with
> multiple ways to avoid the unaligned issues if that was the problem.
>
> But it does look to me like the fundamental problem is that unaligned
> __put_user() calls might just be broken on alpha (and likely sparc
> too). Because that looks to be the only difference between the
> __copy_to_user() approach and using unsafe_put_user() in a loop.
>
> Now, I should have handled unaligned things differently in the first
> place, and in that sense I think commit 9f79b78ef744 ("Convert
> filldir[64]() from __put_user() to unsafe_put_user()") really is
> non-optimal on architectures with alignment issues.
>
> And I'll fix it.

Ugh... I wonder if it would be better to lift STAC/CLAC out of
raw_copy_to_user(), rather than trying to reinvent its guts
in readdir.c...

2019-10-07 02:07:09

by Linus Torvalds

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 6, 2019 at 6:24 PM Al Viro <[email protected]> wrote:
>
> Ugh... I wonder if it would be better to lift STAC/CLAC out of
> raw_copy_to_user(), rather than trying to reinvent its guts
> in readdir.c...

Yeah, I suspect that's the best option.

Do something like

- lift STAC/CLAC out of raw_copy_to_user

- rename it to unsafe_copy_to_user

- create a new raw_copy_to_user that is just unsafe_copy_to_user()
with the STAC/CLAC around it.

and the end result would actually be cleaner than what we have now
(which duplicates that STAC/CLAC for each size case etc).

And then for the "architecture doesn't have user_access_begin/end()"
fallback case, we just do

#define unsafe_copy_to_user raw_copy_to_user

and the only slight pain point is that we need to deal with that
copy_user_generic() case too.

We'd have to mark it uaccess_safe in objtool (but we already have that
for __memcpy_mcsafe and csum_partial_copy_generic, so it all makes sense),
and we'd have to make all the other copy_user_generic() cases then do
the CLAC/STAC dance too or something.

ANYWAY. As mentioned, I'm not actually all that worried about this all.

I could easily also just see the filldir() copy do an extra

#ifndef CONFIG_HAVE_EFFICIENT_UNALIGNED_ACCESS
	if (len && (1 & (uintptr_t)dst)) .. copy byte ..
	if (len > 1 && (2 & (uintptr_t)dst)) .. copy word ..
	if (len > 3 && (4 & (uintptr_t)dst) && sizeof(unsigned long) > 4)
		.. copy dword ..
#endif

at the start to align the destination.

The filldir code is actually somewhat unusual in that it deals with
pretty small strings on average, so just doing this might be more
efficient anyway.

So that doesn't worry me. Multiple ways to solve that part.

The "uhhuh, unaligned accesses cause more than performance problems" -
that's what worries me.

Linus

2019-10-07 02:33:54

by Guenter Roeck

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On 10/6/19 6:17 PM, Linus Torvalds wrote:
> On Sun, Oct 6, 2019 at 5:04 PM Guenter Roeck <[email protected]> wrote:
[ ... ]
> And yes, I'll fix that name copy loop in filldir to align the
> destination first, *but* if I'm right, it means that something like
> this should also likely cause issues:
>
> #define _GNU_SOURCE
> #include <unistd.h>
> #include <sys/mman.h>
>
> int main(int argc, char **argv)
> {
> 	void *mymap;
> 	uid_t *bad_ptr = (void *) 0x01;
>
> 	/* Create unpopulated memory area */
> 	mymap = mmap(NULL, 16384, PROT_READ | PROT_WRITE,
> 		     MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
>
> 	/* Unaligned uid pointer in that memory area */
> 	bad_ptr = mymap + 1;
>
> 	/* Make the kernel do put_user() on it */
> 	return getresuid(bad_ptr, bad_ptr + 1, bad_ptr + 2);
> }
>
> because that simple user mode program should cause that same "page
> fault on unaligned put_user()" behavior as far as I can tell.
>
> Mind humoring me and trying that on your alpha machine (or emulator,
> or whatever)?
>

Here you are. This is with v5.4-rc2 and your previous patch applied
on top.

/ # ./mmtest
Unable to handle kernel paging request at virtual address 0000000000000004
mmtest(75): Oops -1
pc = [<0000000000000004>] ra = [<fffffc0000311584>] ps = 0000 Not tainted
pc is at 0x4
ra is at entSys+0xa4/0xc0
v0 = fffffffffffffff2 t0 = 0000000000000000 t1 = 0000000000000000
t2 = 0000000000000000 t3 = 0000000000000000 t4 = 0000000000000000
t5 = 000000000000fffe t6 = 0000000000000000 t7 = fffffc0007edc000
s0 = 0000000000000000 s1 = 00000001200006f0 s2 = 00000001200df19f
s3 = 00000001200ea0b9 s4 = 0000000120114630 s5 = 00000001201145d8
s6 = 000000011f955c50
a0 = 000002000002c001 a1 = 000002000002c005 a2 = 000002000002c009
a3 = 0000000000000000 a4 = ffffffffffffffff a5 = 0000000000000000
t8 = 0000000000000000 t9 = fffffc0000000000 t10= 0000000000000000
t11= 000000011f955788 pv = fffffc0000349450 at = 00000000f8db54d3
gp = fffffc0000f2a160 sp = 00000000ab237c72
Disabling lock debugging due to kernel taint
Trace:

Code:
00000000
00063301
000007b6
00001111
00003f8d

Segmentation fault

Guenter

2019-10-07 02:52:27

by Al Viro

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 06, 2019 at 07:06:19PM -0700, Linus Torvalds wrote:
> On Sun, Oct 6, 2019 at 6:24 PM Al Viro <[email protected]> wrote:
> >
> > Ugh... I wonder if it would be better to lift STAC/CLAC out of
> > raw_copy_to_user(), rather than trying to reinvent its guts
> > in readdir.c...
>
> Yeah, I suspect that's the best option.
>
> Do something like
>
> - lift STAC/CLAC out of raw_copy_to_user
>
> - rename it to unsafe_copy_to_user
>
> - create a new raw_copy_to_user that is just unsafe_copy_to_user()
> with the STAC/CLAC around it.
>
> and the end result would actually be cleanert than what we have now
> (which duplicates that STAC/CLAC for each size case etc).
>
> And then for the "architecture doesn't have user_access_begin/end()"
> fallback case, we just do
>
> #define unsafe_copy_to_user raw_copy_to_user

Callers of raw_copy_to_user():
arch/hexagon/mm/uaccess.c:27: uncleared = raw_copy_to_user(dest, &empty_zero_page, PAGE_SIZE);
arch/hexagon/mm/uaccess.c:34: count = raw_copy_to_user(dest, &empty_zero_page, count);
arch/powerpc/kvm/book3s_64_mmu_radix.c:68: ret = raw_copy_to_user(to, from, n);
arch/s390/include/asm/uaccess.h:150: size = raw_copy_to_user(ptr, x, size);
include/asm-generic/uaccess.h:145: return unlikely(raw_copy_to_user(ptr, x, size)) ? -EFAULT : 0;
include/linux/uaccess.h:93: return raw_copy_to_user(to, from, n);
include/linux/uaccess.h:102: return raw_copy_to_user(to, from, n);
include/linux/uaccess.h:131: n = raw_copy_to_user(to, from, n);
lib/iov_iter.c:142: n = raw_copy_to_user(to, from, n);
lib/usercopy.c:28: n = raw_copy_to_user(to, from, n);


Out of those, only __copy_to_user_inatomic(), __copy_to_user(),
_copy_to_user() and iov_iter.c:copyout() can be called on
any architecture.

The last two should just do user_access_begin()/user_access_end()
instead of access_ok(). __copy_to_user_inatomic() has very few callers as well:

arch/mips/kernel/unaligned.c:1307: res = __copy_to_user_inatomic(addr, fpr, sizeof(*fpr));
drivers/gpu/drm/i915/i915_gem.c:345: unwritten = __copy_to_user_inatomic(user_data,
lib/test_kasan.c:471: unused = __copy_to_user_inatomic(usermem, kmem, size + 1);
mm/maccess.c:98: ret = __copy_to_user_inatomic((__force void __user *)dst, src, size);

So few, in fact, that I wonder if we want to keep it at all; the only
thing stopping me from "let's remove it" is that I don't understand
the i915 side of things. Where does it do an equivalent of access_ok()?

And mm/maccess.c one is __probe_kernel_write(), so presumably we don't
want stac/clac there at all...

So do we want to bother with separation between raw_copy_to_user() and
unsafe_copy_to_user()? After all, __copy_to_user() also has only few
callers, most of them in arch/*

I'll take a look into that tomorrow - half-asleep right now...

2019-10-07 03:13:33

by Linus Torvalds

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 6, 2019 at 7:50 PM Al Viro <[email protected]> wrote:
>
> Out of those, only __copy_to_user_inatomic(), __copy_to_user(),
> _copy_to_user() and iov_iter.c:copyout() can be called on
> any architecture.
>
> The last two should just do user_access_begin()/user_access_end()
> instead of access_ok(). __copy_to_user_inatomic() has very few callers as well:

Yeah, good points.

It looks like it would be better to just change over semantics
entirely to the unsafe_copy_user() model.

> So few, in fact, that I wonder if we want to keep it at all; the only
> thing stopping me from "let's remove it" is that I don't understand
> the i915 side of things. Where does it do an equivalent of access_ok()?

Honestly, if you have to ask, I think the answer is: just add one.

Every single time we've had people who optimized things to try to
avoid the access_ok(), they just caused bugs and problems.

In this case, I think it's done a few callers up in i915_gem_pread_ioctl():

if (!access_ok(u64_to_user_ptr(args->data_ptr),
args->size))
return -EFAULT;

but honestly, trying to optimize away another "access_ok()" is just
not worth it. I'd rather have an extra one than miss one.

> And mm/maccess.c one is __probe_kernel_write(), so presumably we don't
> want stac/clac there at all...

Yup.

> So do we want to bother with separation between raw_copy_to_user() and
> unsafe_copy_to_user()? After all, __copy_to_user() also has only few
> callers, most of them in arch/*

No, you're right. Just switch over.

> I'll take a look into that tomorrow - half-asleep right now...

Thanks. No huge hurry.

Linus

2019-10-07 03:15:28

by Linus Torvalds

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 6, 2019 at 7:30 PM Guenter Roeck <[email protected]> wrote:
>
> > Mind humoring me and trying that on your alpha machine (or emulator,
> > or whatever)?
>
> Here you are. This is with v5.4-rc2 and your previous patch applied
> on top.
>
> / # ./mmtest
> Unable to handle kernel paging request at virtual address 0000000000000004

Oookay.

Well, that's what I expected, but it's good to just have it confirmed.

Well, not "good" in this case. Bad bad bad.

The fs/readdir.c changes clearly exposed a pre-existing bug on alpha.
Not making excuses for it, but at least it explains why code that
_looks_ correct ends up causing that kind of issue.

I guess the other 'strict alignment' architectures should be checking
that test program too. I'll post my test program to the arch
maintainers list.

Linus

2019-10-07 04:13:01

by Max Filippov

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 6, 2019 at 3:25 PM Guenter Roeck <[email protected]> wrote:
> this patch causes all my sparc64 emulations to stall during boot. It causes
> all alpha emulations to crash with [1a] and [1b] when booting from a virtual
> disk, and one of the xtensa emulations to crash with [2].

[...]

> [2]
>
> Unable to handle kernel paging request at virtual address 0000000000000004
> reboot(50): Oops -1
> pc = [<0000000000000004>] ra = [<fffffc00004512e4>] ps = 0000 Tainted: G D
> pc is at 0x4
> ra is at filldir64+0x64/0x320
> v0 = 0000000000000000 t0 = 0000000067736d6b t1 = 000000012011445b
> t2 = 0000000000000000 t3 = 0000000000000000 t4 = 0000000000007ef8
> t5 = 0000000120114448 t6 = 0000000000000000 t7 = fffffc0007eec000
> s0 = fffffc000792b5c3 s1 = 0000000000000004 s2 = 0000000000000018
> s3 = fffffc0007eefec8 s4 = 0000000000000008 s5 = 00000000f00000a3
> s6 = 000000000000000b
> a0 = fffffc000792b5c3 a1 = 2f2f2f2f2f2f2f2f a2 = 0000000000000004
> a3 = 000000000000000b a4 = 00000000f00000a3 a5 = 0000000000000008
> t8 = 0000000000000018 t9 = 0000000000000000 t10= 0000000022e1d02a
> t11= 000000011fd6f3b8 pv = fffffc0000b9a810 at = 0000000022e1ccf8
> gp = fffffc0000f03930 sp = (____ptrval____)
> Trace:
> [<fffffc00004ccba0>] proc_readdir_de+0x170/0x300
> [<fffffc0000451280>] filldir64+0x0/0x320
> [<fffffc00004c565c>] proc_root_readdir+0x3c/0x80
> [<fffffc0000450c68>] iterate_dir+0x198/0x240
> [<fffffc00004518b8>] ksys_getdents64+0xa8/0x160
> [<fffffc0000451990>] sys_getdents64+0x20/0x40
> [<fffffc0000451280>] filldir64+0x0/0x320
> [<fffffc0000311634>] entSys+0xa4/0xc0

This doesn't look like a dump from an xtensa core.
v5.4-rc2 kernel doesn't crash for me on xtensa, but the userspace
doesn't work well, because all directories appear to be empty.

__put_user/__get_user don't do unaligned access on xtensa,
they check address alignment and return -EFAULT if it's bad.

--
Thanks.
-- Max

2019-10-07 12:20:42

by Guenter Roeck

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

Hi Max,

On 10/6/19 9:04 PM, Max Filippov wrote:
> On Sun, Oct 6, 2019 at 3:25 PM Guenter Roeck <[email protected]> wrote:
>> this patch causes all my sparc64 emulations to stall during boot. It causes
>> all alpha emulations to crash with [1a] and [1b] when booting from a virtual
>> disk, and one of the xtensa emulations to crash with [2].
>
> [...]
>
>> [2]
>>
>> Unable to handle kernel paging request at virtual address 0000000000000004
>> reboot(50): Oops -1
>> pc = [<0000000000000004>] ra = [<fffffc00004512e4>] ps = 0000 Tainted: G D
>> pc is at 0x4
>> ra is at filldir64+0x64/0x320
>> v0 = 0000000000000000 t0 = 0000000067736d6b t1 = 000000012011445b
>> t2 = 0000000000000000 t3 = 0000000000000000 t4 = 0000000000007ef8
>> t5 = 0000000120114448 t6 = 0000000000000000 t7 = fffffc0007eec000
>> s0 = fffffc000792b5c3 s1 = 0000000000000004 s2 = 0000000000000018
>> s3 = fffffc0007eefec8 s4 = 0000000000000008 s5 = 00000000f00000a3
>> s6 = 000000000000000b
>> a0 = fffffc000792b5c3 a1 = 2f2f2f2f2f2f2f2f a2 = 0000000000000004
>> a3 = 000000000000000b a4 = 00000000f00000a3 a5 = 0000000000000008
>> t8 = 0000000000000018 t9 = 0000000000000000 t10= 0000000022e1d02a
>> t11= 000000011fd6f3b8 pv = fffffc0000b9a810 at = 0000000022e1ccf8
>> gp = fffffc0000f03930 sp = (____ptrval____)
>> Trace:
>> [<fffffc00004ccba0>] proc_readdir_de+0x170/0x300
>> [<fffffc0000451280>] filldir64+0x0/0x320
>> [<fffffc00004c565c>] proc_root_readdir+0x3c/0x80
>> [<fffffc0000450c68>] iterate_dir+0x198/0x240
>> [<fffffc00004518b8>] ksys_getdents64+0xa8/0x160
>> [<fffffc0000451990>] sys_getdents64+0x20/0x40
>> [<fffffc0000451280>] filldir64+0x0/0x320
>> [<fffffc0000311634>] entSys+0xa4/0xc0
>
> This doesn't look like a dump from xtensa core.
> v5.4-rc2 kernel doesn't crash for me on xtensa, but the userspace
> doesn't work well, because all directories appear to be empty.
>
> __put_user/__get_user don't do unaligned access on xtensa,
> they check address alignment and return -EFAULT if it's bad.
>
You are right, sorry; I must have mixed that up. xtensa doesn't crash.
The boot stalls, similar to sparc64. This is only seen with my nommu
test (de212:kc705-nommu:nommu_kc705_defconfig). xtensa mmu tests are fine,
at least for me, but then I only run tests with initrd (which for some
reason doesn't crash on alpha either).

Guenter

2019-10-07 15:43:57

by David Laight

Subject: RE: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

> From: Linus Torvalds
> Sent: 07 October 2019 04:12
...
> In this case, I think it's done a few callers up in i915_gem_pread_ioctl():
>
> if (!access_ok(u64_to_user_ptr(args->data_ptr),
> args->size))
> return -EFAULT;
>
> but honestly, trying to optimize away another "access_ok()" is just
> not worth it. I'd rather have an extra one than miss one.

You don't really want an extra access_ok() for every 'word' of a copy.
Some copies have to be done a word at a time.

And the checks someone added to copy_to/from_user() to detect kernel
buffer overruns must kill performance when the buffers are way down the stack
or in kmalloc()ed space.

David

-
Registered Address Lakeside, Bramley Road, Mount Farm, Milton Keynes, MK1 1PT, UK
Registration No: 1397386 (Wales)

2019-10-07 17:35:08

by Al Viro

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 06, 2019 at 08:11:42PM -0700, Linus Torvalds wrote:

> > So do we want to bother with separation between raw_copy_to_user() and
> > unsafe_copy_to_user()? After all, __copy_to_user() also has only a few
> > callers, most of them in arch/*
>
> No, you're right. Just switch over.
>
> > I'll take a look into that tomorrow - half-asleep right now...
>
> Thanks. No huge hurry.

Tangentially related: copy_regset_to_user() and copy_regset_from_user().
That's where we do access_ok(), followed by calls of ->get() and
->set() resp. Those tend to either use user_regset_copy{out,in}(),
or open-code those. The former variant tends to lead to few calls
of __copy_{to,from}_user(); the latter... On x86 it ends up doing
this:
static int genregs_get(struct task_struct *target,
                       const struct user_regset *regset,
                       unsigned int pos, unsigned int count,
                       void *kbuf, void __user *ubuf)
{
        if (kbuf) {
                unsigned long *k = kbuf;
                while (count >= sizeof(*k)) {
                        *k++ = getreg(target, pos);
                        count -= sizeof(*k);
                        pos += sizeof(*k);
                }
        } else {
                unsigned long __user *u = ubuf;
                while (count >= sizeof(*u)) {
                        if (__put_user(getreg(target, pos), u++))
                                return -EFAULT;
                        count -= sizeof(*u);
                        pos += sizeof(*u);
                }
        }

        return 0;
}

Potentially doing arseloads of stac/clac as it goes. OTOH, getreg()
(and setreg()) in there are not entirely trivial, so blanket
user_access_begin()/user_access_end() over the entire loop might be
a bad idea...

How hot is that codepath? I know that arch/um used to rely on it
(== PTRACE_[GS]ETREGS) quite a bit...

2019-10-07 18:14:20

by Linus Torvalds

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Mon, Oct 7, 2019 at 8:40 AM David Laight <[email protected]> wrote:
>
> You don't really want an extra access_ok() for every 'word' of a copy.

Yes you do.

> Some copies have to be done a word at a time.

Completely immaterial. If you can't see the access_ok() close to the
__get/put_user(), you have a bug.

Plus the access_ok() is cheap. The real cost is the STAC/CLAC.

So stop with the access_ok() "optimizations". They are broken garbage.

Really.

I've been very close to just removing __get_user/__put_user several
times, exactly because people do completely the wrong thing with them
- not speeding code up, but making it unsafe and buggy.

The new "user_access_begin/end()" model is much better, but it also
has actual STATIC checking that there are no function calls etc inside
the region, so it forces you to do the loop properly and tightly, and
not the incorrect "I checked the range somewhere else, now I'm doing
an unsafe copy".

And it actually speeds things up, unlike the access_ok() games.

Linus
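
A minimal userspace sketch of the pattern described above: one range check and one STAC/CLAC pair around a tight loop, instead of one pair per word. The macros here are illustrative stand-ins, not the real x86 implementation (the real user_access_begin() does the range check plus STAC, and unsafe_put_user() branches to the error label on a fault via "asm goto").

```c
#include <stddef.h>

static int stac_count, clac_count;      /* count the expensive transitions */

#define user_access_begin(ptr, len)     (stac_count++, 1)
#define user_access_end()               (clac_count++)
#define unsafe_put_user(val, ptr, label) \
        do { if (0) goto label; *(ptr) = (val); } while (0)

/* One STAC/CLAC pair covers the whole loop, not one pair per word. */
static int copy_words_to_user(unsigned long *dst, const unsigned long *src,
                              size_t n)
{
        size_t i;

        if (!user_access_begin(dst, n * sizeof(*dst)))
                return -1;              /* -EFAULT in the kernel */
        for (i = 0; i < n; i++)
                unsafe_put_user(src[i], &dst[i], efault);
        user_access_end();
        return 0;
efault:
        user_access_end();
        return -1;
}
```

The point of the shape is that stac_count/clac_count stay at 1 regardless of how many words are copied, which is where the speedup over per-word __put_user() comes from.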

2019-10-07 18:14:25

by Linus Torvalds

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Mon, Oct 7, 2019 at 10:34 AM Al Viro <[email protected]> wrote:
>
> Tangentially related: copy_regset_to_user() and copy_regset_from_user().

Not a worry. It's not performance-critical code, and if it ever is, it
needs to be rewritten anyway.


> The former variant tends to lead to few calls
> of __copy_{to,from}_user(); the latter... On x86 it ends up doing
> this:

Just replace the __put_user() with a put_user() and be done with it.
That code isn't acceptable, and if somebody ever complains about
performance it's not the lack of __put_user that is the problem.

Linus

2019-10-07 18:23:03

by Al Viro

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Mon, Oct 07, 2019 at 11:13:27AM -0700, Linus Torvalds wrote:
> On Mon, Oct 7, 2019 at 10:34 AM Al Viro <[email protected]> wrote:
> >
> > Tangentially related: copy_regset_to_user() and copy_regset_from_user().
>
> Not a worry. It's not performance-critical code, and if it ever is, it
> needs to be rewritten anyway.
>
> > The former variant tends to lead to few calls
> > of __copy_{to,from}_user(); the latter... On x86 it ends up doing
> > this:
>
> Just replace the __put_user() with a put_user() and be done with it.
> That code isn't acceptable, and if somebody ever complains about
> performance it's not the lack of __put_user that is the problem.

I wonder if it would be better off switching to several "copy in bulk" helpers
like e.g. ppc does. That boilerplate with parallel "to/from kernel"
and "to/from userland" loops is asking for bugs - the calling
conventions like "pass kbuf and ubuf; exactly one must be NULL"
tend to be trouble, IME; I'm not saying we should just pass
struct iov_iter * instead of count+pos+kbuf+ubuf to ->get() and
->set(), but it might clean the things up nicely.

Let me look into that zoo a bit more...

2019-10-07 18:27:29

by Linus Torvalds

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 6, 2019 at 8:11 PM Linus Torvalds
<[email protected]> wrote:
>
> >
> > The last two should just do user_access_begin()/user_access_end()
> > instead of access_ok(). __copy_to_user_inatomic() has very few callers as well:
>
> Yeah, good points.

Looking at it some more this morning, I think it's actually pretty painful.

The good news is that right now x86 is the only architecture that does
that user_access_begin(), so we don't need to worry about anything
else. Apparently the ARM people haven't had enough performance
problems with the PAN bit for them to care.

We can have a fallback wrapper for unsafe_copy_to_user() for other
architectures that just does the __copy_to_user().

But on x86, if we move the STAC/CLAC out of the low-level copy
routines and into the callers, we'll have a _lot_ of churn. I thought
it would be mostly a "teach objtool" thing, but we have lots of
different versions of it. Not just the 32-bit vs 64-bit, it's embedded
in all the low-level asm implementations.

And we don't want the regular "copy_to/from_user()" to then have to
add the STAC/CLAC at the call-site. So then we'd want to un-inline
copy_to_user() entirely.

Which all sounds like a really good idea, don't get me wrong. I think
we inline it way too aggressively now. But it's a _big_ job.

So we probably _should_

- remove INLINE_COPY_TO/FROM_USER

- remove all the "small constant size special cases".

- make "raw_copy_to/from_user()" have the "unsafe" semantics and make
the out-of-line copy in lib/usercopy.c be the only real interface

- get rid of a _lot_ of oddities

but looking at just how much churn this is, I suspect that for 5.4
it's a bit late to do quite that much cleanup.

I hope you prove me wrong. But I'll look at a smaller change to just
make x86 use the current special copy loop (as
"unsafe_copy_to_user()") and have everybody else do the trivial
wrapper.

Because we definitely should do that cleanup (it also fixes the whole
"atomic copy in kernel space" issue that you pointed to that doesn't
actually want STAC/CLAC at all), but it just looks fairly massive to
me.

Linus

2019-10-07 18:37:28

by Tony Luck

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Mon, Oct 7, 2019 at 11:28 AM Linus Torvalds
<[email protected]> wrote:
>
> On Sun, Oct 6, 2019 at 8:11 PM Linus Torvalds
> <[email protected]> wrote:
> >
> > >
> > > The last two should just do user_access_begin()/user_access_end()
> > > instead of access_ok(). __copy_to_user_inatomic() has very few callers as well:
> >
> > Yeah, good points.
>
> Looking at it some more this morning, I think it's actually pretty painful.

Late to this party ... but my ia64 console today is full of:

irqbalance(5244): unaligned access to 0x2000000800042f9b, ip=0xa0000001002fef90
irqbalance(5244): unaligned access to 0x2000000800042fbb, ip=0xa0000001002fef90
irqbalance(5244): unaligned access to 0x2000000800042fdb, ip=0xa0000001002fef90
irqbalance(5244): unaligned access to 0x2000000800042ffb, ip=0xa0000001002fef90
irqbalance(5244): unaligned access to 0x200000080004301b, ip=0xa0000001002fef90
ia64_handle_unaligned: 95 callbacks suppressed
irqbalance(5244): unaligned access to 0x2000000800042f9b, ip=0xa0000001002fef90
irqbalance(5244): unaligned access to 0x2000000800042fbb, ip=0xa0000001002fef90
irqbalance(5244): unaligned access to 0x2000000800042fdb, ip=0xa0000001002fef90
irqbalance(5244): unaligned access to 0x2000000800042ffb, ip=0xa0000001002fef90
irqbalance(5244): unaligned access to 0x200000080004301b, ip=0xa0000001002fef90
ia64_handle_unaligned: 95 callbacks suppressed

Those ip's point into filldir64()

-Tony

2019-10-07 19:10:46

by Linus Torvalds

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Mon, Oct 7, 2019 at 11:36 AM Tony Luck <[email protected]> wrote:
>
> Late to this party ... but my ia64 console today is full of:

Hmm? I thought ia64 did unaligneds ok.

But regardless, this is my current (as yet untested) patch. This is
not the big user access cleanup that I hope Al will do, this is just a
"ok, x86 is the only one who wants a special unsafe_copy_to_user()
right now" patch.

Linus


Attachments:
patch.diff (4.50 kB)

2019-10-07 19:22:29

by Linus Torvalds

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 6, 2019 at 3:20 PM Guenter Roeck <[email protected]> wrote:
>
> this patch causes all my sparc64 emulations to stall during boot. It causes
> all alpha emulations to crash with [1a] and [1b] when booting from a virtual
> disk, and one of the xtensa emulations to crash with [2].

So I think your alpha emulation environment may be broken, because
Michael Cree reports that it works for him on real hardware, but he
does see the kernel unaligned count being high.

But regardless, this is my current fairly minimal patch that I think
should fix the unaligned issue, while still giving the behavior we
want on x86. I hope Al can do something nicer, but I think this is
"acceptable".

I'm running this now on x86, and I verified that x86-32 code
generation looks sane too, but it would be good to verify that this
makes the alignment issue go away on other architectures.

Linus


Attachments:
patch.diff (4.50 kB)

2019-10-07 19:50:05

by Tony Luck

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Mon, Oct 7, 2019 at 12:09 PM Linus Torvalds
<[email protected]> wrote:
> Hmm? I thought ia64 did unaligneds ok.

If PSR.ac is set, we trap. If it isn't set, behavior is model specific
(though all implementations will trap for an unaligned access that
crosses a 4K boundary).

Linux sets PSR.ac. Applications can use prctl(PR_SET_UNALIGN) to choose whether
they want the kernel to silently fix things or to send SIGBUS.

Kernel always noisily (rate limited) fixes up unaligned access.

Your patch does make all the messages go away.

Tested-by: Tony Luck <[email protected]>

2019-10-07 20:05:38

by Linus Torvalds

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Mon, Oct 7, 2019 at 12:49 PM Tony Luck <[email protected]> wrote:
>
> If PSR.ac is set, we trap. If it isn't set, then model specific
> (though all implementations will
> trap for an unaligned access that crosses a 4K boundary).

Ok. At that point, setting AC unconditionally is the better model just
to get test coverage for "it will trap occasionally anyway".

Odd "almost-but-not-quite x86" both in naming and in behavior (AC was
a no-op in kernel-mode until SMAP).

> Your patch does make all the messages go away.
>
> Tested-by: Tony Luck <[email protected]>

Ok, I'll commit it, and we'll see what Al can come up with that might
be a bigger cleanup.

Linus

2019-10-07 20:29:42

by Guenter Roeck

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Mon, Oct 07, 2019 at 12:21:25PM -0700, Linus Torvalds wrote:
> On Sun, Oct 6, 2019 at 3:20 PM Guenter Roeck <[email protected]> wrote:
> >
> > this patch causes all my sparc64 emulations to stall during boot. It causes
> > all alpha emulations to crash with [1a] and [1b] when booting from a virtual
> > disk, and one of the xtensa emulations to crash with [2].
>
> So I think your alpha emulation environment may be broken, because
> Michael Cree reports that it works for him on real hardware, but he
> does see the kernel unaligned count being high.
>
Yes, that possibility always exists, unfortunately.

> But regardless, this is my current fairly minimal patch that I think
> should fix the unaligned issue, while still giving the behavior we
> want on x86. I hope Al can do something nicer, but I think this is
> "acceptable".
>
> I'm running this now on x86, and I verified that x86-32 code
> generation looks sane too, but it would be good to verify that this
> makes the alignment issue go away on other architectures.
>
> Linus

I started a complete test run with the patch applied. I'll let you know
how it went after it is complete - it should be done in a couple of hours.

Guenter

2019-10-07 23:27:45

by Guenter Roeck

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On 10/7/19 12:21 PM, Linus Torvalds wrote:
> On Sun, Oct 6, 2019 at 3:20 PM Guenter Roeck <[email protected]> wrote:
>>
>> this patch causes all my sparc64 emulations to stall during boot. It causes
>> all alpha emulations to crash with [1a] and [1b] when booting from a virtual
>> disk, and one of the xtensa emulations to crash with [2].
>
> So I think your alpha emulation environment may be broken, because
> Michael Cree reports that it works for him on real hardware, but he
> does see the kernel unaligned count being high.
>
> But regardless, this is my current fairly minimal patch that I think
> should fix the unaligned issue, while still giving the behavior we
> want on x86. I hope Al can do something nicer, but I think this is
> "acceptable".
>
> I'm running this now on x86, and I verified that x86-32 code
> generation looks sane too, but it would be good to verify that this
> makes the alignment issue go away on other architectures.
>
> Linus
>

Test results look good. Feel free to add
Tested-by: Guenter Roeck <[email protected]>
to your patch.

Build results:
total: 158 pass: 154 fail: 4
Failed builds:
arm:allmodconfig
m68k:defconfig
mips:allmodconfig
sparc64:allmodconfig
Qemu test results:
total: 391 pass: 390 fail: 1
Failed tests:
ppc64:mpc8544ds:ppc64_e5500_defconfig:nosmp:initrd

This is with "regulator: fixed: Prevent NULL pointer dereference when !CONFIG_OF"
applied as well. The other failures are unrelated.

arm:

arch/arm/crypto/aes-ce-core.S:299: Error:
selected processor does not support `movw ip,:lower16:.Lcts_permute_table' in ARM mode

Fix is pending in crypto tree.

m68k:

c2p_iplan2.c:(.text+0x98): undefined reference to `c2p_unsupported'

I don't know the status.

mips:

drivers/staging/octeon/ethernet-defines.h:30:38: error:
'CONFIG_CAVIUM_OCTEON_CVMSEG_SIZE' undeclared
and other similar errors. I don't know the status.

ppc64:

powerpc64-linux-ld: mm/page_alloc.o:(.toc+0x18): undefined reference to `node_reclaim_distance'

Reported against offending patch earlier today.

sparc64:

drivers/watchdog/cpwd.c:500:19: error: 'compat_ptr_ioctl' undeclared here

Oops. I'll need to look into that. Looks like the patch to use a new
infrastructure made it into the kernel but the infrastructure itself
didn't make it after all.

Guenter

2019-10-08 03:29:45

by Al Viro

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Mon, Oct 07, 2019 at 11:26:35AM -0700, Linus Torvalds wrote:

> But on x86, if we move the STAC/CLAC out of the low-level copy
> routines and into the callers, we'll have a _lot_ of churn. I thought
> it would be mostly a "teach objtool" thing, but we have lots of
> different versions of it. Not just the 32-bit vs 64-bit, it's embedded
> in all the low-level asm implementations.
>
> And we don't want the regular "copy_to/from_user()" to then have to
> add the STAC/CLAC at the call-site. So then we'd want to un-inline
> copy_to_user() entirely.

For x86? Sure, why not... Note, BTW, that for short constant-sized
copies we *do* STAC/CLAC at the call site - see those
__uaccess_begin_nospec();
in raw_copy_{from,to}_user() in the switches...

> Which all sounds like a really good idea, don't get me wrong. I think
> we inline it way too aggressively now. But it'sa _big_ job.
>
> So we probably _should_
>
> - remove INLINE_COPY_TO/FROM_USER
>
> - remove all the "small constant size special cases".
>
> - make "raw_copy_to/from_user()" have the "unsafe" semantics and make
> the out-of-line copy in lib/usercopy.c be the only real interface
>
> - get rid of a _lot_ of oddities

Not that many, really. All we need is a temporary cross-architecture
__uaccess_begin_nospec(), so that __copy_{to,from}_user() could have
that used, instead of having it done in (x86) raw_copy_..._...().

Other callers of raw_copy_...() would simply wrap it into user_access_begin()/
user_access_end() pairs; this kludge is needed only in __copy_from_user()
and __copy_to_user(), and only until we kill their callers outside of
arch/*. Which we can do, in a cycle or two. _ANY_ use of
that temporary kludge outside of those two functions will be grepped
for and LARTed into the ground.

> I hope you prove me wrong. But I'll look at a smaller change to just
> make x86 use the current special copy loop (as
> "unsafe_copy_to_user()") and have everybody else do the trivial
> wrapper.
>
> Because we definitely should do that cleanup (it also fixes the whole
> "atomic copy in kernel space" issue that you pointed to that doesn't
> actually want STAC/CLAC at all), but it just looks fairly massive to
> me.

AFAICS, it splits nicely.

1) cross-architecture user_access_begin_dont_use(): on everything
except x86 it's empty, on x86 - __uaccess_begin_nospec().

2) stac/clac lifted into x86 raw_copy_..._user() out of
copy_user_generic_unrolled(), copy_user_generic_string() and
copy_user_enhanced_fast_string(). Similar lift out of
__copy_user_nocache().

3) lifting that thing as user_access_begin_dont_use() into
__copy_..._user...() and as user_access_begin() into other
generic callers, consuming access_ok() in the latter.
__copy_to_user_inatomic() can die at the same stage.

4) possibly uninlining on x86 (and yes, killing the special
size handling off). We don't need to touch the inlining
decisions for any other architectures.

At that point raw_copy_to_user() is available for e.g.
readdir.c to play with.

And up to that point only x86 sees any kind of code changes,
so we don't have to worry about other architectures.

5) kill the __copy_...() users outside of arch/*, along with
quite a few other weird shits in there. A cycle or two,
with the final goal being to kill the damn things off.

6) arch/* users get arch-by-arch treatment - mostly
it's sigframe handling. Won't affect the generic code
and would be independent for different architectures.
Can happen in parallel with (5), actually.

7) ... at that point user_access_begin_dont_use() gets
removed and thrown into the pile of mangled fingers of
those who'd ignored all warnings and used it somewhere
else.

I don't believe that (5) would be doable entirely in
this cycle, but quite a few bits might be.

On a somewhat related note, do you see any problems with

void *copy_mount_options(const void __user *data)
{
        unsigned offs, size;
        char *copy;

        if (!data)
                return NULL;

        copy = kmalloc(PAGE_SIZE, GFP_KERNEL);
        if (!copy)
                return ERR_PTR(-ENOMEM);

        offs = (unsigned long)untagged_addr(data) & (PAGE_SIZE - 1);

        if (copy_from_user(copy, data, PAGE_SIZE - offs)) {
                kfree(copy);
                return ERR_PTR(-EFAULT);
        }
        if (offs) {
                if (copy_from_user(copy, data + PAGE_SIZE - offs, offs))
                        memset(copy + PAGE_SIZE - offs, 0, offs);
        }
        return copy;
}

on the theory that any fault halfway through a page means a race with
munmap/mprotect/etc. and we can just pretend we'd lost the race entirely.
And to hell with exact_copy_from_user(), byte-by-byte copying, etc.
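
The split-copy arithmetic in the function above can be checked in isolation. This small userspace sketch (PAGE_SIZE and the helper names are illustrative, not kernel API) shows that the two copy_from_user() lengths always sum to exactly PAGE_SIZE, with the first copy running to the end of the page containing data:

```c
#define PAGE_SIZE 4096UL

/* Length of the first copy: up to the end of the page containing data. */
static unsigned long first_copy_len(unsigned long data)
{
        return PAGE_SIZE - (data & (PAGE_SIZE - 1));
}

/* Length of the optional second copy: the remainder in the next page. */
static unsigned long second_copy_len(unsigned long data)
{
        return data & (PAGE_SIZE - 1);
}
```

So a fault in the second copy is only possible past a page boundary, which is what justifies treating it as a munmap/mprotect race and zero-filling instead of failing.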

2019-10-08 04:10:39

by Linus Torvalds

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Mon, Oct 7, 2019 at 8:29 PM Al Viro <[email protected]> wrote:
>
> For x86? Sure, why not... Note, BTW, that for short constant-sized
> copies we *do* STAC/CLAC at the call site - see those
> __uaccess_begin_nospec();
> in raw_copy_{from,to}_user() in the switches...

Yeah, and that code almost never actually triggers in practice. The
code is pointless and dead.

The thing is, it's only ever used for the double-underscore versions,
and the ones that do have it are almost never constant sizes in
the first place.

And yes, there's like a couple of cases in the whole kernel.

Just remove those constant size cases. They are pointless and just
complicate our headers and slow down the compile for no good reason.

Try the attached patch, and then count the number of "rorx"
instructions in the kernel. Hint: not many. On my personal config,
this triggers 15 times in the whole kernel build (not counting
modules).

It's not worth it. The "speedup" from using __copy_{to,from}_user()
with the fancy inlining is negligible. All the cost is in the
STAC/CLAC anyway, the code might as well be deleted.

> 1) cross-architecture user_access_begin_dont_use(): on everything
> except x86 it's empty, on x86 - __uaccess_begin_nospec().

No, just do a proper range check, and use user_access_begin()

Stop trying to optimize that range check away. It's a couple of fast
instructions.

The only ones who don't want the range check are the actual kernel
copy ones, but they don't want the user_access_begin() either.

> void *copy_mount_options(const void __user *data)
> {
>         unsigned offs, size;
>         char *copy;
>
>         if (!data)
>                 return NULL;
>
>         copy = kmalloc(PAGE_SIZE, GFP_KERNEL);
>         if (!copy)
>                 return ERR_PTR(-ENOMEM);
>
>         offs = (unsigned long)untagged_addr(data) & (PAGE_SIZE - 1);
>
>         if (copy_from_user(copy, data, PAGE_SIZE - offs)) {
>                 kfree(copy);
>                 return ERR_PTR(-EFAULT);
>         }
>         if (offs) {
>                 if (copy_from_user(copy, data + PAGE_SIZE - offs, offs))
>                         memset(copy + PAGE_SIZE - offs, 0, offs);
>         }
>         return copy;
> }
>
> on the theory that any fault halfway through a page means a race with
> munmap/mprotect/etc. and we can just pretend we'd lost the race entirely.
> And to hell with exact_copy_from_user(), byte-by-byte copying, etc.

Looks reasonable.

Linus


Attachments:
patch.diff (2.90 kB)

2019-10-08 04:15:52

by Linus Torvalds

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Mon, Oct 7, 2019 at 9:09 PM Linus Torvalds
<[email protected]> wrote:
>
> Try the attached patch, and then count the number of "rorx"
> instructions in the kernel. Hint: not many. On my personal config,
> this triggers 15 times in the whole kernel build (not counting
> modules).

So here's a serious patch that doesn't just mark things for counting -
it just removes the cases entirely.

Doesn't this look nice:

2 files changed, 2 insertions(+), 133 deletions(-)

and it is one less thing to worry about when doing further cleanup.

Seriously, if any of those __copy_{to,from}_user() constant cases were
a big deal, we can turn them into get_user/put_user calls. But only
after they show up as an actual performance issue.

Linus


Attachments:
patch.diff (4.79 kB)

2019-10-08 04:25:06

by Linus Torvalds

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Mon, Oct 7, 2019 at 9:09 PM Linus Torvalds
<[email protected]> wrote:
>
> Try the attached patch, and then count the number of "rorx"
> instructions in the kernel. Hint: not many. On my personal config,
> this triggers 15 times in the whole kernel build (not counting
> modules).

.. and four of them are in perf_callchain_user(), and are due to those
"__copy_from_user_nmi()" with either 4-byte or 8-byte copies.

It might as well just use __get_user() instead.

The point being that the silly code in the header files is just
pointless. We shouldn't do it.

Linus

2019-10-08 05:00:32

by Al Viro

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Mon, Oct 07, 2019 at 09:09:14PM -0700, Linus Torvalds wrote:

> > 1) cross-architecture user_access_begin_dont_use(): on everything
> > except x86 it's empty, on x86 - __uaccess_begin_nospec().
>
> No, just do a proper range check, and use user_access_begin()
>
> Stop trying to optimize that range check away. It's a couple of fast
> instructions.
>
> The only ones who don't want the range check are the actual kernel
> copy ones, but they don't want the user_access_begin() either.

Not at the first step. Sure, in the end we want exactly that, and
we want it ASAP. However, the main reason it grows into a tangled
mess that would be over the top for this cycle is the impacts in
arseload of places all over arch/*.

That way we can untangle those. The initial segment that would
allow to use raw_copy_to_user() cleanly in readdir.c et.al.
could be done with provably zero impact on anything in arch/*
outside of arch/x86 usercopy-related code.

Moreover, it will be fairly small. And after that the rest can
be done in any order, independent from each other. I want to
kill __copy_... completely, and I believe we'll be able to do
just that in a cycle or two.

Once that is done, the helper disappears along with __copy_...().
And it will be documented as a temporary kludge, don't use
anywhere else, no matter what from the very beginning. For
all the couple of cycles it'll take.

I'm serious about getting rid of __copy_...() in that timeframe.
There's not that much left.

The reason I don't want to do a blanket search-and-replace turning
them all into copy_...() is simply that their use is a good indicator
of code in need of serious beating^Wamount of careful review.

And hell, we might end up doing just that on case-by-case basis.
Often enough we will, by what I'd seen there...

Again, this kludge is only a splitup aid - by the end of the series
it's gone. All it allows is to keep it easier to review.

Note, BTW, that bits and pieces converting a given pointless use
of __copy_...() to copy_...() can be reordered freely at any point
of the sequence - I've already got several. _Some_ of (5) will
be conversions a-la readdir.c one and that has to follow (4), but
most of it won't be like that.

> > void *copy_mount_options(const void __user *data)
> > {
> >         unsigned offs, size;
> >         char *copy;
> >
> >         if (!data)
> >                 return NULL;
> >
> >         copy = kmalloc(PAGE_SIZE, GFP_KERNEL);
> >         if (!copy)
> >                 return ERR_PTR(-ENOMEM);
> >
> >         offs = (unsigned long)untagged_addr(data) & (PAGE_SIZE - 1);
> >
> >         if (copy_from_user(copy, data, PAGE_SIZE - offs)) {
> >                 kfree(copy);
> >                 return ERR_PTR(-EFAULT);
> >         }
> >         if (offs) {
> >                 if (copy_from_user(copy, data + PAGE_SIZE - offs, offs))
> >                         memset(copy + PAGE_SIZE - offs, 0, offs);
> >         }
> >         return copy;
> > }
> >
> > on the theory that any fault halfway through a page means a race with
> > munmap/mprotect/etc. and we can just pretend we'd lost the race entirely.
> > And to hell with exact_copy_from_user(), byte-by-byte copying, etc.
>
> Looks reasonable.

OK... BTW, do you agree that the use of access_ok() in
drivers/tty/n_hdlc.c:n_hdlc_tty_read() is wrong? It's used as an early
cutoff, so we don't bother waiting if user has passed an obviously bogus
address. copy_to_user() is used for actual copying there...

There are other places following that pattern and IMO they are all
wrong. Another variety is a half-arsed filter trying to prevent warnings
from too large (and user-controllable) kmalloc() of buffer we'll be
copying to. Which is worth very little, since kmalloc() will scream and
fail well before access_ok() limits will trip. Those need explicit capping
of the size, IMO...

2019-10-08 05:06:44

by Al Viro

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Mon, Oct 07, 2019 at 09:14:51PM -0700, Linus Torvalds wrote:
> On Mon, Oct 7, 2019 at 9:09 PM Linus Torvalds
> <[email protected]> wrote:
> >
> > Try the attached patch, and then count the number of "rorx"
> > instructions in the kernel. Hint: not many. On my personal config,
> > this triggers 15 times in the whole kernel build (not counting
> > modules).
>
> So here's a serious patch that doesn't just mark things for counting -
> it just removes the cases entirely.
>
> Doesn't this look nice:
>
> 2 files changed, 2 insertions(+), 133 deletions(-)
>
> and it is one less thing to worry about when doing further cleanup.
>
> Seriously, if any of those __copy_{to,from}_user() constant cases were
> a big deal, we can turn them into get_user/put_user calls. But only
> after they show up as an actual performance issue.

Makes sense. I'm not arguing against doing that. Moreover, I suspect
that other architectures will be similar, at least once the
sigframe-related code for given architecture is dealt with. But that's
more of a "let's look at that later" thing (hopefully with maintainers
of architectures getting involved).

2019-10-08 06:29:17

by Geert Uytterhoeven

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Tue, Oct 8, 2019 at 1:30 AM Guenter Roeck <[email protected]> wrote:
> m68k:
>
> c2p_iplan2.c:(.text+0x98): undefined reference to `c2p_unsupported'
>
> I don't know the status.

Fall-out from the (non)inline optimization. Patch available:
https://lore.kernel.org/lkml/[email protected]/

Gr{oetje,eeting}s,

Geert

--
Geert Uytterhoeven -- There's lots of Linux beyond ia32 -- [email protected]

In personal conversations with technical people, I call myself a hacker. But
when I'm talking to journalists I just say "programmer" or something like that.
-- Linus Torvalds

2019-10-08 10:00:44

by David Laight

[permalink] [raw]
Subject: RE: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

From: Linus Torvalds <[email protected]>
> Sent: 07 October 2019 19:11
...
> I've been very close to just removing __get_user/__put_user several
> times, exactly because people do completely the wrong thing with them
> - not speeding code up, but making it unsafe and buggy.

They could do the very simple check that 'user_ptr+size < kernel_base'
rather than the full window check under the assumption that access_ok()
has been called and that the likely errors are just overruns.

> The new "user_access_begin/end()" model is much better, but it also
> has actual STATIC checking that there are no function calls etc inside
> the region, so it forces you to do the loop properly and tightly, and
> not the incorrect "I checked the range somewhere else, now I'm doing
> an unsafe copy".
>
> And it actually speeds things up, unlike the access_ok() games.

I've code that does:
        if (!access_ok(...))
                return -EFAULT;
        ...
        for (...) {
                if (__get_user(tmp_u64, user_ptr++))
                        return -EFAULT;
                writeq(tmp_u64, io_ptr++);
        }
(Although the code is more complex because not all transfers are multiples of 8 bytes.)

With user_access_begin/end() I'd probably want to put the copy loop
inside a function (which will probably get inlined) to avoid convoluted
error processing.
So you end up with:
        if (!user_access_ok())
                return -EFAULT;
        user_access_begin();
        rval = do_copy_code(...);
        user_access_end();
        return rval;
Which, at the source level (at least) breaks your 'no function calls' rule.
The writeq() might also break it as well.
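
One way to keep the writeq() out of the user-access region is to bounce a bounded chunk through a stack buffer inside the region, end the region, then do the MMIO writes. The sketch below uses illustrative userspace stubs for access_ok()/user_access_begin()/unsafe_get_user()/writeq() (not the real kernel API), so only the control-flow shape is being shown:

```c
#include <stddef.h>
#include <stdint.h>

/* Illustrative stand-ins for the kernel primitives used below. */
#define access_ok(ptr, len)             (1)
#define user_access_begin(ptr, len)     (1)
#define user_access_end()               ((void)0)
#define unsafe_get_user(x, ptr, label) \
        do { if (0) goto label; (x) = *(ptr); } while (0)

static void writeq(uint64_t val, volatile uint64_t *addr)
{
        *addr = val;                    /* real writeq targets MMIO space */
}

#define CHUNK 8                         /* words bounced per user-access region */

static int copy_user_to_io(volatile uint64_t *io, const uint64_t *uptr,
                           size_t nwords)
{
        uint64_t buf[CHUNK];
        size_t i, n;

        if (!access_ok(uptr, nwords * sizeof(*uptr)))
                return -1;              /* -EFAULT in the kernel */

        while (nwords) {
                n = nwords < CHUNK ? nwords : CHUNK;

                if (!user_access_begin(uptr, n * sizeof(*uptr)))
                        return -1;
                for (i = 0; i < n; i++) /* tight loop, no function calls */
                        unsafe_get_user(buf[i], &uptr[i], efault);
                user_access_end();

                for (i = 0; i < n; i++) /* function calls are fine out here */
                        writeq(buf[i], io++);
                uptr += n;
                nwords -= n;
        }
        return 0;
efault:
        user_access_end();
        return -1;
}
```

The bounce buffer trades one extra copy for a region that contains only unsafe accesses, which is what the static checking wants.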

David


2019-10-08 13:17:22

by Greg KH

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Tue, Oct 08, 2019 at 05:57:12AM +0100, Al Viro wrote:
>
> OK... BTW, do you agree that the use of access_ok() in
> drivers/tty/n_hdlc.c:n_hdlc_tty_read() is wrong? It's used as an early
> cutoff, so we don't bother waiting if user has passed an obviously bogus
> address. copy_to_user() is used for actual copying there...

Yes, it's wrong, and not needed. I'll go rip it out unless you want to?

thanks,

greg k-h

2019-10-08 15:31:51

by Al Viro

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Tue, Oct 08, 2019 at 03:14:16PM +0200, Greg KH wrote:
> On Tue, Oct 08, 2019 at 05:57:12AM +0100, Al Viro wrote:
> >
> > OK... BTW, do you agree that the use of access_ok() in
> > drivers/tty/n_hdlc.c:n_hdlc_tty_read() is wrong? It's used as an early
> > cutoff, so we don't bother waiting if user has passed an obviously bogus
> > address. copy_to_user() is used for actual copying there...
>
> Yes, it's wrong, and not needed. I'll go rip it out unless you want to?

I'll throw it into misc queue for now; it has no prereqs and nothing is going
to depend upon it.

While looking for more of the same pattern: usb_device_read(). Frankly,
usb_device_dump() calling conventions look ugly - it smells like it
would be much happier as seq_file. Iterator would take some massage,
but that seems to be doable. Anyway, that's a separate story...

2019-10-08 15:39:06

by Greg KH

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Tue, Oct 08, 2019 at 04:29:00PM +0100, Al Viro wrote:
> On Tue, Oct 08, 2019 at 03:14:16PM +0200, Greg KH wrote:
> > On Tue, Oct 08, 2019 at 05:57:12AM +0100, Al Viro wrote:
> > >
> > > OK... BTW, do you agree that the use of access_ok() in
> > > drivers/tty/n_hdlc.c:n_hdlc_tty_read() is wrong? It's used as an early
> > > cutoff, so we don't bother waiting if user has passed an obviously bogus
> > > address. copy_to_user() is used for actual copying there...
> >
> > Yes, it's wrong, and not needed. I'll go rip it out unless you want to?
>
> I'll throw it into misc queue for now; it has no prereqs and nothing is going
> to depend upon it.

Great, thanks.

> While looking for more of the same pattern: usb_device_read(). Frankly,
> usb_device_dump() calling conventions look ugly - it smells like it
> would be much happier as seq_file. Iterator would take some massage,
> but that seems to be doable. Anyway, that's a separate story...

That's just a debugfs file, and yes, it should be moved to seq_file. I
think I tried it a long time ago, but given it's just a debugging thing,
I gave up as it wasn't worth it.

But yes, the access_ok() there also seems odd, and should be dropped.

thanks,

greg k-h

2019-10-08 17:07:44

by Al Viro

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Tue, Oct 08, 2019 at 05:38:31PM +0200, Greg KH wrote:
> On Tue, Oct 08, 2019 at 04:29:00PM +0100, Al Viro wrote:
> > On Tue, Oct 08, 2019 at 03:14:16PM +0200, Greg KH wrote:
> > > On Tue, Oct 08, 2019 at 05:57:12AM +0100, Al Viro wrote:
> > > >
> > > > OK... BTW, do you agree that the use of access_ok() in
> > > > drivers/tty/n_hdlc.c:n_hdlc_tty_read() is wrong? It's used as an early
> > > > cutoff, so we don't bother waiting if user has passed an obviously bogus
> > > > address. copy_to_user() is used for actual copying there...
> > >
> > > Yes, it's wrong, and not needed. I'll go rip it out unless you want to?
> >
> > I'll throw it into misc queue for now; it has no prereqs and nothing is going
> > to depend upon it.
>
> Great, thanks.
>
> > While looking for more of the same pattern: usb_device_read(). Frankly,
> > usb_device_dump() calling conventions look ugly - it smells like it
> > would be much happier as seq_file. Iterator would take some massage,
> > but that seems to be doable. Anyway, that's a separate story...
>
> That's just a debugfs file, and yes, it should be moved to seq_file. I
> think I tried it a long time ago, but given it's just a debugging thing,
> I gave up as it wasn't worth it.
>
> But yes, the access_ok() there also seems odd, and should be dropped.

I'm almost tempted to keep it there as a reminder/grep fodder ;-)

Seriously, though, it might be useful to have a way of marking the places
in need of gentle repair of retrocranial inversions _without_ attracting
the "checkpatch warning of the week" crowd...

2019-10-08 19:59:39

by Al Viro

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Mon, Oct 07, 2019 at 11:26:35AM -0700, Linus Torvalds wrote:

> The good news is that right now x86 is the only architecture that does
> that user_access_begin(), so we don't need to worry about anything
> else. Apparently the ARM people haven't had enough performance
> problems with the PAN bit for them to care.

Take a look at this:
static inline unsigned long raw_copy_from_user(void *to,
		const void __user *from, unsigned long n)
{
	unsigned long ret;

	if (__builtin_constant_p(n) && (n <= 8)) {
		ret = 1;

		switch (n) {
		case 1:
			barrier_nospec();
			__get_user_size(*(u8 *)to, from, 1, ret);
			break;
		case 2:
			barrier_nospec();
			__get_user_size(*(u16 *)to, from, 2, ret);
			break;
		case 4:
			barrier_nospec();
			__get_user_size(*(u32 *)to, from, 4, ret);
			break;
		case 8:
			barrier_nospec();
			__get_user_size(*(u64 *)to, from, 8, ret);
			break;
		}
		if (ret == 0)
			return 0;
	}

	barrier_nospec();
	allow_read_from_user(from, n);
	ret = __copy_tofrom_user((__force void __user *)to, from, n);
	prevent_read_from_user(from, n);
	return ret;
}

That's powerpc. And while the constant-sized bits are probably pretty
useless there as well, note the allow_read_from_user()/prevent_read_from_user()
part. Looks suspiciously similar to user_access_begin()/user_access_end()...

The difference is, they have separate "for read" and "for write" primitives
and they want the range in their user_access_end() analogue. Separating
the read and write isn't a problem for callers (we want them close to
the actual memory accesses). Passing the range to user_access_end() just
might be tolerable, unless it makes you throw up...

2019-10-08 20:17:16

by Al Viro

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Tue, Oct 08, 2019 at 08:58:58PM +0100, Al Viro wrote:

> That's powerpc. And while the constant-sized bits are probably pretty
> useless there as well, note the allow_read_from_user()/prevent_read_from_user()
> part. Looks suspiciously similar to user_access_begin()/user_access_end()...
>
> The difference is, they have separate "for read" and "for write" primitives
> and they want the range in their user_access_end() analogue. Separating
> the read and write isn't a problem for callers (we want them close to
> the actual memory accesses). Passing the range to user_access_end() just
> might be tolerable, unless it makes you throw up...

BTW, another related cleanup is futex_atomic_op_inuser() and
arch_futex_atomic_op_inuser(). In the former we have
	if (!access_ok(uaddr, sizeof(u32)))
		return -EFAULT;

	ret = arch_futex_atomic_op_inuser(op, oparg, &oldval, uaddr);
	if (ret)
		return ret;
and in the latter we've got STAC/CLAC pairs stuck into inlined bits
on x86. As well as allow_write_to_user(uaddr, sizeof(*uaddr)) on
ppc...

I don't see anything in x86 one objtool would've barfed if we pulled
STAC/CLAC out and turned access_ok() into user_access_begin(),
with matching user_access_end() right after the call of
arch_futex_atomic_op_inuser(). Everything is inlined there and
no scary memory accesses would get into the scope (well, we do
have
	if (!ret)
		*oval = oldval;
in the very end of arch_futex_atomic_op_inuser() there, but oval
is the address of a local variable in the sole caller; if we run
with kernel stack on ring 3 page, we are deeply fucked *and*
wouldn't have survived that far into futex_atomic_op_inuser() anyway ;-)

2019-10-08 20:36:45

by Al Viro

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Tue, Oct 08, 2019 at 08:58:58PM +0100, Al Viro wrote:

> The difference is, they have separate "for read" and "for write" primitives
> and they want the range in their user_access_end() analogue. Separating
> the read and write isn't a problem for callers (we want them close to
> the actual memory accesses). Passing the range to user_access_end() just
> might be tolerable, unless it makes you throw up...

NOTE: I'm *NOT* suggesting to bring back the VERIFY_READ/VERIFY_WRITE
argument to access_ok(). We'd gotten rid of it, and for a very good
reason (and decades overdue).

The main difference between access_ok() and user_access_begin() is that
the latter is right next to actual memory access, with user_access_end()
on the other side, also very close. And most of those guys would be
concentrated in a few functions, where we bloody well know which
direction we are copying.

Even if we try and map ppc allow_..._to_user() on user_access_begin(),
access_ok() remains as it is (and I hope we'll get rid of the majority
of its callers in the process).

2019-10-10 20:00:04

by Al Viro

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Mon, Oct 07, 2019 at 09:24:17PM -0700, Linus Torvalds wrote:
> On Mon, Oct 7, 2019 at 9:09 PM Linus Torvalds
> <[email protected]> wrote:
> >
> > Try the attached patch, and then count the number of "rorx"
> > instructions in the kernel. Hint: not many. On my personal config,
> > this triggers 15 times in the whole kernel build (not counting
> > modules).
>
> .. and four of them are in perf_callchain_user(), and are due to those
> "__copy_from_user_nmi()" with either 4-byte or 8-byte copies.
>
> It might as well just use __get_user() instead.
>
> The point being that the silly code in the header files is just
> pointless. We shouldn't do it.

FWIW, the one that looks the most potentially sensitive in that bunch is
arch/x86/kvm/paging_tmpl.h:388: if (unlikely(__copy_from_user(&pte, ptep_user, sizeof(pte))))
in the bowels of KVM page fault handling. I would be very surprised if
the rest would be detectable...

Anyway, another question your way: what do you think of try/catch approaches
to __get_user() blocks, like e.g. restore_sigcontext() is doing?

Should that be available outside of arch/*? For that matter, would
it be a good idea to convert get_user_ex() users in arch/x86 to
unsafe_get_user()?

2019-10-10 22:13:45

by Linus Torvalds

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Thu, Oct 10, 2019 at 12:55 PM Al Viro <[email protected]> wrote:
>
> Anyway, another question your way: what do you think of try/catch approaches
> to __get_user() blocks, like e.g. restore_sigcontext() is doing?

I'd rather have them converted to our unsafe_get/put_user() instead.

We don't generate great code for the "get" case (because of how gcc
doesn't allow us to mix "asm goto" and outputs), but I really despise
the x86-specific "{get,put}_user_ex()" machinery. It's not actually
doing a real try/catch at all, and will just keep taking faults if one
happens.

But I've not gotten around to rewriting those disgusting sequences to
the unsafe_get/put_user() model. I did look at it, and it requires
some changes exactly *because* the _ex() functions are broken and
continue, but also because the current code ends up also doing other
things inside the try/catch region that you're not supposed to do in a
user_access_begin/end() region.

> Should that be available outside of arch/*? For that matter, would
> it be a good idea to convert get_user_ex() users in arch/x86 to
> unsafe_get_user()?

See above: yes, it would be a good idea to convert to
unsafe_get/put_user(), and no, we don't want to expose the horrid
*_ex() model to other architectures.

Linus

2019-10-11 00:11:48

by Al Viro

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Thu, Oct 10, 2019 at 03:12:49PM -0700, Linus Torvalds wrote:
> On Thu, Oct 10, 2019 at 12:55 PM Al Viro <[email protected]> wrote:
> >
> > Anyway, another question your way: what do you think of try/catch approaches
> > to __get_user() blocks, like e.g. restore_sigcontext() is doing?
>
> I'd rather have them converted to our unsafe_get/put_user() instead.
>
> We don't generate great code for the "get" case (because of how gcc
> doesn't allow us to mix "asm goto" and outputs), but I really despise
> the x86-specific "{get,put}_user_ex()" machinery. It's not actually
> doing a real try/catch at all, and will just keep taking faults if one
> happens.
>
> But I've not gotten around to rewriting those disgusting sequences to
> the unsafe_get/put_user() model. I did look at it, and it requires
> some changes exactly *because* the _ex() functions are broken and
> continue, but also because the current code ends up also doing other
> things inside the try/catch region that you're not supposed to do in a
> user_access_begin/end() region.

Hmm... Which one was that? AFAICS, we have
	do_sys_vm86: only get_user_ex()
	restore_sigcontext(): get_user_ex(), set_user_gs()
	ia32_restore_sigcontext(): get_user_ex()

So at least get_user_try/get_user_ex/get_user_catch should be killable.
The other side...
	save_v86_state(): put_user_ex()
	setup_sigcontext(): put_user_ex()
	__setup_rt_frame(): put_user_ex(), static_cpu_has()
	another one in __setup_rt_frame(): put_user_ex()
	x32_setup_rt_frame(): put_user_ex()
	ia32_setup_sigcontext(): put_user_ex()
	ia32_setup_frame(): put_user_ex()
	another one in ia32_setup_frame(): put_user_ex(), static_cpu_has()

IDGI... Is static_cpu_has() not allowed in there? Looks like it's all inlines
and doesn't do any potentially risky memory accesses... What am I missing?

As for the try/catch model... How about
	if (!user_access_begin())
		sod off
	...
	unsafe_get_user(..., l);
	...
	unsafe_get_user_nojump();
	...
	unsafe_get_user_nojump();
	...
	if (user_access_did_fail())
		goto l;

	user_access_end()
	...
	return 0;
l:
	...
	user_access_end()
	return -EFAULT;

making it clear that we are delaying the check for failures until it's
more convenient. And *not* trying to trick C parser into enforcing
anything - let objtool do it and to hell with do { and } while (0) in
magic macros. Could be mixed with the normal unsafe_..._user() without
any problems, AFAICS...

2019-10-11 00:32:12

by Linus Torvalds

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Thu, Oct 10, 2019 at 5:11 PM Al Viro <[email protected]> wrote:
>
> On Thu, Oct 10, 2019 at 03:12:49PM -0700, Linus Torvalds wrote:
>
> > But I've not gotten around to rewriting those disgusting sequences to
> > the unsafe_get/put_user() model. I did look at it, and it requires
> > some changes exactly *because* the _ex() functions are broken and
> > continue, but also because the current code ends up also doing other
> > things inside the try/catch region that you're not supposed to do in a
> > user_access_begin/end() region.
>
> Hmm... Which one was that? AFAICS, we have
> do_sys_vm86: only get_user_ex()
> restore_sigcontext(): get_user_ex(), set_user_gs()
> ia32_restore_sigcontext(): get_user_ex()

Try this patch.

It works fine (well, it worked fine the last time I tried this, I
might have screwed something up just now: I re-created the patch since
I hadn't saved it).

It's nice and clean, and does

1 file changed, 9 insertions(+), 91 deletions(-)

by just deleting all the nasty *_ex() macros entirely, replacing them
with unsafe_get/put_user() calls.

And now those try/catch regions actually work like try/catch regions,
and a fault branches to the catch.

BUT.

It does change semantics, and you get warnings like

arch/x86/ia32/ia32_signal.c: In function ‘ia32_restore_sigcontext’:
arch/x86/ia32/ia32_signal.c:114:9: warning: ‘buf’ may be used
uninitialized in this function [-Wmaybe-uninitialized]
114 | err |= fpu__restore_sig(buf, 1);
| ^~~~~~~~~~~~~~~~~~~~~~~~
arch/x86/ia32/ia32_signal.c:64:27: warning: ‘ds’ may be used
uninitialized in this function [-Wmaybe-uninitialized]
64 | unsigned int pre = (seg) | 3; \
| ^
arch/x86/ia32/ia32_signal.c:74:18: note: ‘ds’ was declared here
...
arch/x86/kernel/signal.c: In function ‘restore_sigcontext’:
arch/x86/kernel/signal.c:152:9: warning: ‘buf’ may be used
uninitialized in this function [-Wmaybe-uninitialized]
152 | err |= fpu__restore_sig(buf, IS_ENABLED(CONFIG_X86_32));
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

because it's true: those things really may not be initialized, because
the catch thing could have jumped out.

So the code actually needs to properly return the error early, or
initialize the segments that didn't get loaded to 0, or something.

And when I posted that, Luto said "just get rid of the get_user_ex()
entirely, instead of changing semantics of the existing ones to be
sane."

Which is probably right. There aren't that many.

I *thought* there were also cases of us doing some questionable things
inside the get_user_try sections, but those seem to have gotten fixed
already independently, so it's really just the "make try/catch really
try/catch" change that needs some editing of our current broken stuff
that depends on it not actually *catching* exceptions, but on just
continuing on to the next one.

Linus


Attachments:
patch.diff (5.34 kB)

2019-10-13 18:15:17

by Al Viro

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Thu, Oct 10, 2019 at 05:31:13PM -0700, Linus Torvalds wrote:

> So the code actually needs to properly return the error early, or
> initialize the segments that didn't get loaded to 0, or something.
>
> And when I posted that, Luto said "just get rid of the get_user_ex()
> entirely, instead of changing semantics of the existing ones to be
> sane."
>
> Which is probably right. There aren't that many.
>
> I *thought* there were also cases of us doing some questionable things
> inside the get_user_try sections, but those seem to have gotten fixed
> already independently, so it's really just the "make try/catch really
> try/catch" change that needs some editing of our current broken stuff
> that depends on it not actually *catching* exceptions, but on just
> continuing on to the next one.

Umm... TBH, I wonder if we would be better off if restore_sigcontext()
(i.e. sigreturn()/rt_sigreturn()) would flat-out copy_from_user() the
entire[*] struct sigcontext into a local variable and then copied fields
to pt_regs... The thing is small enough for not blowing the stack (256
bytes max. and it's on a shallow stack) and big enough to make "fancy
memcpy + let the compiler think how to combine in-kernel copies"
potentially better than hardwired sequence of 64bit loads/stores...

[*] OK, sans ->reserved part in the very end on 64bit. 192 bytes to
copy.

Same for do_sys_vm86(), perhaps - we want regs/flags/cpu_type and
screen_bitmap there, i.e. the beginning of struct vm86plus_struct
and of struct vm86_struct... 24*32bit. IOW, 96-byte memcpy +
gcc-visible field-by-field copying vs. hardwired sequence of
32bit loads (with some 16bit ones thrown in, for extra fun) and
compiler told not to reorder anything.

And these (32bit and 64bit restore_sigcontext() and do_sys_vm86())
are the only get_user_ex() users anywhere...

2019-10-13 18:45:48

by Linus Torvalds

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 13, 2019 at 11:13 AM Al Viro <[email protected]> wrote:
>
> Umm... TBH, I wonder if we would be better off if restore_sigcontext()
> (i.e. sigreturn()/rt_sigreturn()) would flat-out copy_from_user() the
> entire[*] struct sigcontext into a local variable and then copied fields
> to pt_regs...

Probably ok. We've generally tried to avoid state that big on the
stack, but you're right that it's shallow.

> Same for do_sys_vm86(), perhaps.
>
> And these (32bit and 64bit restore_sigcontext() and do_sys_vm86())
> are the only get_user_ex() users anywhere...

Yeah, that sounds like a solid strategy for getting rid of them.

Particularly since we can't really make get_user_ex() generate
particularly good code (at least for now).

Now, put_user_ex() is a different thing - converting it to
unsafe_put_user() actually does make it generate very good code - much
better than copying data twice.

Linus

2019-10-13 19:14:27

by Al Viro

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 13, 2019 at 11:43:57AM -0700, Linus Torvalds wrote:
> On Sun, Oct 13, 2019 at 11:13 AM Al Viro <[email protected]> wrote:
> >
> > Umm... TBH, I wonder if we would be better off if restore_sigcontext()
> > (i.e. sigreturn()/rt_sigreturn()) would flat-out copy_from_user() the
> > entire[*] struct sigcontext into a local variable and then copied fields
> > to pt_regs...
>
> Probably ok. We've generally tried to avoid state that big on the
> stack, but you're right that it's shallow.
>
> > Same for do_sys_vm86(), perhaps.
> >
> > And these (32bit and 64bit restore_sigcontext() and do_sys_vm86())
> > are the only get_user_ex() users anywhere...
>
> Yeah, that sounds like a solid strategy for getting rid of them.
>
> Particularly since we can't really make get_user_ex() generate
> particularly good code (at least for now).
>
> Now, put_user_ex() is a different thing - converting it to
> unsafe_put_user() actually does make it generate very good code - much
> better than copying data twice.

No arguments re put_user_ex side of things... Below is a completely
untested patch for get_user_ex elimination (it seems to build, but that's
it); in any case, I would really like to see comments from x86 folks
before it goes anywhere.

diff --git a/arch/x86/ia32/ia32_signal.c b/arch/x86/ia32/ia32_signal.c
index 1cee10091b9f..28a32ccc32de 100644
--- a/arch/x86/ia32/ia32_signal.c
+++ b/arch/x86/ia32/ia32_signal.c
@@ -46,59 +46,38 @@
#define get_user_seg(seg) ({ unsigned int v; savesegment(seg, v); v; })
#define set_user_seg(seg, v) loadsegment_##seg(v)

-#define COPY(x) { \
- get_user_ex(regs->x, &sc->x); \
-}
-
-#define GET_SEG(seg) ({ \
- unsigned short tmp; \
- get_user_ex(tmp, &sc->seg); \
- tmp; \
-})
+#define COPY(x) regs->x = sc.x

-#define COPY_SEG_CPL3(seg) do { \
- regs->seg = GET_SEG(seg) | 3; \
-} while (0)
+#define COPY_SEG_CPL3(seg) regs->seg = sc.seg | 3

#define RELOAD_SEG(seg) { \
- unsigned int pre = (seg) | 3; \
+ unsigned int pre = sc.seg | 3; \
unsigned int cur = get_user_seg(seg); \
if (pre != cur) \
set_user_seg(seg, pre); \
}

static int ia32_restore_sigcontext(struct pt_regs *regs,
- struct sigcontext_32 __user *sc)
+ struct sigcontext_32 __user *usc)
{
- unsigned int tmpflags, err = 0;
- u16 gs, fs, es, ds;
- void __user *buf;
- u32 tmp;
+ struct sigcontext_32 sc;

/* Always make any pending restarted system calls return -EINTR */
current->restart_block.fn = do_no_restart_syscall;

- get_user_try {
- gs = GET_SEG(gs);
- fs = GET_SEG(fs);
- ds = GET_SEG(ds);
- es = GET_SEG(es);
-
- COPY(di); COPY(si); COPY(bp); COPY(sp); COPY(bx);
- COPY(dx); COPY(cx); COPY(ip); COPY(ax);
- /* Don't touch extended registers */
+ if (unlikely(__copy_from_user(&sc, usc, sizeof(sc))))
+ goto Efault;

- COPY_SEG_CPL3(cs);
- COPY_SEG_CPL3(ss);
+ COPY(di); COPY(si); COPY(bp); COPY(sp); COPY(bx);
+ COPY(dx); COPY(cx); COPY(ip); COPY(ax);
+ /* Don't touch extended registers */

- get_user_ex(tmpflags, &sc->flags);
- regs->flags = (regs->flags & ~FIX_EFLAGS) | (tmpflags & FIX_EFLAGS);
- /* disable syscall checks */
- regs->orig_ax = -1;
+ COPY_SEG_CPL3(cs);
+ COPY_SEG_CPL3(ss);

- get_user_ex(tmp, &sc->fpstate);
- buf = compat_ptr(tmp);
- } get_user_catch(err);
+ regs->flags = (regs->flags & ~FIX_EFLAGS) | (sc.flags & FIX_EFLAGS);
+ /* disable syscall checks */
+ regs->orig_ax = -1;

/*
* Reload fs and gs if they have changed in the signal
@@ -111,11 +90,15 @@ static int ia32_restore_sigcontext(struct pt_regs *regs,
RELOAD_SEG(ds);
RELOAD_SEG(es);

- err |= fpu__restore_sig(buf, 1);
+ if (unlikely(fpu__restore_sig(compat_ptr(sc.fpstate), 1)))
+ goto Efault;

force_iret();
+ return 0;

- return err;
+Efault:
+ force_iret();
+ return -EFAULT;
}

asmlinkage long sys32_sigreturn(void)
diff --git a/arch/x86/include/asm/uaccess.h b/arch/x86/include/asm/uaccess.h
index 61d93f062a36..ac81f06f8358 100644
--- a/arch/x86/include/asm/uaccess.h
+++ b/arch/x86/include/asm/uaccess.h
@@ -335,12 +335,9 @@ do { \
"i" (errret), "0" (retval)); \
})

-#define __get_user_asm_ex_u64(x, ptr) (x) = __get_user_bad()
#else
#define __get_user_asm_u64(x, ptr, retval, errret) \
__get_user_asm(x, ptr, retval, "q", "", "=r", errret)
-#define __get_user_asm_ex_u64(x, ptr) \
- __get_user_asm_ex(x, ptr, "q", "", "=r")
#endif

#define __get_user_size(x, ptr, size, retval, errret) \
@@ -390,41 +387,6 @@ do { \
: "=r" (err), ltype(x) \
: "m" (__m(addr)), "i" (errret), "0" (err))

-/*
- * This doesn't do __uaccess_begin/end - the exception handling
- * around it must do that.
- */
-#define __get_user_size_ex(x, ptr, size) \
-do { \
- __chk_user_ptr(ptr); \
- switch (size) { \
- case 1: \
- __get_user_asm_ex(x, ptr, "b", "b", "=q"); \
- break; \
- case 2: \
- __get_user_asm_ex(x, ptr, "w", "w", "=r"); \
- break; \
- case 4: \
- __get_user_asm_ex(x, ptr, "l", "k", "=r"); \
- break; \
- case 8: \
- __get_user_asm_ex_u64(x, ptr); \
- break; \
- default: \
- (x) = __get_user_bad(); \
- } \
-} while (0)
-
-#define __get_user_asm_ex(x, addr, itype, rtype, ltype) \
- asm volatile("1: mov"itype" %1,%"rtype"0\n" \
- "2:\n" \
- ".section .fixup,\"ax\"\n" \
- "3:xor"itype" %"rtype"0,%"rtype"0\n" \
- " jmp 2b\n" \
- ".previous\n" \
- _ASM_EXTABLE_EX(1b, 3b) \
- : ltype(x) : "m" (__m(addr)))
-
#define __put_user_nocheck(x, ptr, size) \
({ \
__label__ __pu_label; \
@@ -552,22 +514,6 @@ struct __large_struct { unsigned long buf[100]; };
#define __put_user(x, ptr) \
__put_user_nocheck((__typeof__(*(ptr)))(x), (ptr), sizeof(*(ptr)))

-/*
- * {get|put}_user_try and catch
- *
- * get_user_try {
- * get_user_ex(...);
- * } get_user_catch(err)
- */
-#define get_user_try uaccess_try_nospec
-#define get_user_catch(err) uaccess_catch(err)
-
-#define get_user_ex(x, ptr) do { \
- unsigned long __gue_val; \
- __get_user_size_ex((__gue_val), (ptr), (sizeof(*(ptr)))); \
- (x) = (__force __typeof__(*(ptr)))__gue_val; \
-} while (0)
-
#define put_user_try uaccess_try
#define put_user_catch(err) uaccess_catch(err)

diff --git a/arch/x86/kernel/signal.c b/arch/x86/kernel/signal.c
index 8eb7193e158d..301d34b256c6 100644
--- a/arch/x86/kernel/signal.c
+++ b/arch/x86/kernel/signal.c
@@ -47,23 +47,9 @@
#include <asm/sigframe.h>
#include <asm/signal.h>

-#define COPY(x) do { \
- get_user_ex(regs->x, &sc->x); \
-} while (0)
-
-#define GET_SEG(seg) ({ \
- unsigned short tmp; \
- get_user_ex(tmp, &sc->seg); \
- tmp; \
-})
-
-#define COPY_SEG(seg) do { \
- regs->seg = GET_SEG(seg); \
-} while (0)
-
-#define COPY_SEG_CPL3(seg) do { \
- regs->seg = GET_SEG(seg) | 3; \
-} while (0)
+#define COPY(x) regs->x = sc.x
+#define COPY_SEG(seg) regs->seg = sc.seg
+#define COPY_SEG_CPL3(seg) regs->seg = sc.seg | 3

#ifdef CONFIG_X86_64
/*
@@ -95,50 +81,53 @@ static void force_valid_ss(struct pt_regs *regs)
#endif

static int restore_sigcontext(struct pt_regs *regs,
- struct sigcontext __user *sc,
+ struct sigcontext __user *usc,
unsigned long uc_flags)
{
- unsigned long buf_val;
void __user *buf;
- unsigned int tmpflags;
- unsigned int err = 0;
+ struct sigcontext sc;
+ enum {
+#ifdef CONFIG_X86_32
+ To_copy = sizeof(struct sigcontext),
+#else
+ To_copy = offsetof(struct sigcontext, reserved1),
+#endif
+ };

/* Always make any pending restarted system calls return -EINTR */
current->restart_block.fn = do_no_restart_syscall;

- get_user_try {
+ if (unlikely(__copy_from_user(&sc, usc, To_copy)))
+ goto Efault;

#ifdef CONFIG_X86_32
- set_user_gs(regs, GET_SEG(gs));
- COPY_SEG(fs);
- COPY_SEG(es);
- COPY_SEG(ds);
+ set_user_gs(regs, sc.gs);
+ COPY_SEG(fs);
+ COPY_SEG(es);
+ COPY_SEG(ds);
#endif /* CONFIG_X86_32 */

- COPY(di); COPY(si); COPY(bp); COPY(sp); COPY(bx);
- COPY(dx); COPY(cx); COPY(ip); COPY(ax);
+ COPY(di); COPY(si); COPY(bp); COPY(sp); COPY(bx);
+ COPY(dx); COPY(cx); COPY(ip); COPY(ax);

#ifdef CONFIG_X86_64
- COPY(r8);
- COPY(r9);
- COPY(r10);
- COPY(r11);
- COPY(r12);
- COPY(r13);
- COPY(r14);
- COPY(r15);
+ COPY(r8);
+ COPY(r9);
+ COPY(r10);
+ COPY(r11);
+ COPY(r12);
+ COPY(r13);
+ COPY(r14);
+ COPY(r15);
#endif /* CONFIG_X86_64 */

- COPY_SEG_CPL3(cs);
- COPY_SEG_CPL3(ss);
+ COPY_SEG_CPL3(cs);
+ COPY_SEG_CPL3(ss);

- get_user_ex(tmpflags, &sc->flags);
- regs->flags = (regs->flags & ~FIX_EFLAGS) | (tmpflags & FIX_EFLAGS);
- regs->orig_ax = -1; /* disable syscall checks */
+ regs->flags = (regs->flags & ~FIX_EFLAGS) | (sc.flags & FIX_EFLAGS);
+ regs->orig_ax = -1; /* disable syscall checks */

- get_user_ex(buf_val, &sc->fpstate);
- buf = (void __user *)buf_val;
- } get_user_catch(err);
+ buf = (void __user *)sc.fpstate;

#ifdef CONFIG_X86_64
/*
@@ -149,11 +138,14 @@ static int restore_sigcontext(struct pt_regs *regs,
force_valid_ss(regs);
#endif

- err |= fpu__restore_sig(buf, IS_ENABLED(CONFIG_X86_32));
-
+ if (unlikely(fpu__restore_sig(buf, IS_ENABLED(CONFIG_X86_32))))
+ goto Efault;
force_iret();
+ return 0;

- return err;
+Efault:
+ force_iret();
+ return -EFAULT;
}

int setup_sigcontext(struct sigcontext __user *sc, void __user *fpstate,
diff --git a/arch/x86/kernel/vm86_32.c b/arch/x86/kernel/vm86_32.c
index a76c12b38e92..2b5183f8eb48 100644
--- a/arch/x86/kernel/vm86_32.c
+++ b/arch/x86/kernel/vm86_32.c
@@ -242,6 +242,7 @@ static long do_sys_vm86(struct vm86plus_struct __user *user_vm86, bool plus)
struct vm86 *vm86 = tsk->thread.vm86;
struct kernel_vm86_regs vm86regs;
struct pt_regs *regs = current_pt_regs();
+ struct vm86_struct v;
unsigned long err = 0;

err = security_mmap_addr(0);
@@ -283,34 +284,32 @@ static long do_sys_vm86(struct vm86plus_struct __user *user_vm86, bool plus)
sizeof(struct vm86plus_struct)))
return -EFAULT;

+ if (unlikely(__copy_from_user(&v, user_vm86,
+ offsetof(struct vm86_struct, int_revectored))))
+ return -EFAULT;
+
memset(&vm86regs, 0, sizeof(vm86regs));
- get_user_try {
- unsigned short seg;
- get_user_ex(vm86regs.pt.bx, &user_vm86->regs.ebx);
- get_user_ex(vm86regs.pt.cx, &user_vm86->regs.ecx);
- get_user_ex(vm86regs.pt.dx, &user_vm86->regs.edx);
- get_user_ex(vm86regs.pt.si, &user_vm86->regs.esi);
- get_user_ex(vm86regs.pt.di, &user_vm86->regs.edi);
- get_user_ex(vm86regs.pt.bp, &user_vm86->regs.ebp);
- get_user_ex(vm86regs.pt.ax, &user_vm86->regs.eax);
- get_user_ex(vm86regs.pt.ip, &user_vm86->regs.eip);
- get_user_ex(seg, &user_vm86->regs.cs);
- vm86regs.pt.cs = seg;
- get_user_ex(vm86regs.pt.flags, &user_vm86->regs.eflags);
- get_user_ex(vm86regs.pt.sp, &user_vm86->regs.esp);
- get_user_ex(seg, &user_vm86->regs.ss);
- vm86regs.pt.ss = seg;
- get_user_ex(vm86regs.es, &user_vm86->regs.es);
- get_user_ex(vm86regs.ds, &user_vm86->regs.ds);
- get_user_ex(vm86regs.fs, &user_vm86->regs.fs);
- get_user_ex(vm86regs.gs, &user_vm86->regs.gs);
-
- get_user_ex(vm86->flags, &user_vm86->flags);
- get_user_ex(vm86->screen_bitmap, &user_vm86->screen_bitmap);
- get_user_ex(vm86->cpu_type, &user_vm86->cpu_type);
- } get_user_catch(err);
- if (err)
- return err;
+
+ vm86regs.pt.bx = v.regs.ebx;
+ vm86regs.pt.cx = v.regs.ecx;
+ vm86regs.pt.dx = v.regs.edx;
+ vm86regs.pt.si = v.regs.esi;
+ vm86regs.pt.di = v.regs.edi;
+ vm86regs.pt.bp = v.regs.ebp;
+ vm86regs.pt.ax = v.regs.eax;
+ vm86regs.pt.ip = v.regs.eip;
+ vm86regs.pt.cs = v.regs.cs;
+ vm86regs.pt.flags = v.regs.eflags;
+ vm86regs.pt.sp = v.regs.esp;
+ vm86regs.pt.ss = v.regs.ss;
+ vm86regs.es = v.regs.es;
+ vm86regs.ds = v.regs.ds;
+ vm86regs.fs = v.regs.fs;
+ vm86regs.gs = v.regs.gs;
+
+ vm86->flags = v.flags;
+ vm86->screen_bitmap = v.screen_bitmap;
+ vm86->cpu_type = v.cpu_type;

if (copy_from_user(&vm86->int_revectored,
&user_vm86->int_revectored,

2019-10-13 19:25:54

by Linus Torvalds

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 13, 2019 at 12:10 PM Al Viro <[email protected]> wrote:
>
> No arguments re put_user_ex side of things... Below is a completely
> untested patch for get_user_ex elimination (it seems to build, but that's
> it); in any case, I would really like to see comments from x86 folks
> before it goes anywhere.

Please don't do this:

> + if (unlikely(__copy_from_user(&sc, usc, sizeof(sc))))
> + goto Efault;

Why would you use __copy_from_user()? Just don't.

> + if (unlikely(__copy_from_user(&v, user_vm86,
> + offsetof(struct vm86_struct, int_revectored))))

Same here.

There's no excuse for __copy_from_user().

Linus

2019-10-13 20:02:28

by Al Viro

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 13, 2019 at 12:22:38PM -0700, Linus Torvalds wrote:
> On Sun, Oct 13, 2019 at 12:10 PM Al Viro <[email protected]> wrote:
> >
> > No arguments re put_user_ex side of things... Below is a completely
> > untested patch for get_user_ex elimination (it seems to build, but that's
> > it); in any case, I would really like to see comments from x86 folks
> > before it goes anywhere.
>
> Please don't do this:
>
> > + if (unlikely(__copy_from_user(&sc, usc, sizeof(sc))))
> > + goto Efault;
>
> Why would you use __copy_from_user()? Just don't.
>
> > + if (unlikely(__copy_from_user(&v, user_vm86,
> > + offsetof(struct vm86_struct, int_revectored))))
>
> Same here.
>
> There's no excuse for __copy_from_user().

Probably... Said that, vm86 one is preceded by
	if (!access_ok(user_vm86, plus ?
		       sizeof(struct vm86_struct) :
		       sizeof(struct vm86plus_struct)))
		return -EFAULT;
so I didn't want to bother. We'll need to eliminate most of
access_ok() anyway, and I figured that conversion to plain copy_from_user()
would go there as well.

Again, this is not a patch submission - just an illustration of what I meant
re getting rid of get_user_ex(). IOW, the whole thing is still in the
plotting stage.

Re plotting: how strongly would you object against passing the range to
user_access_end()? Powerpc folks have a very close analogue of stac/clac,
currently buried inside their __get_user()/__put_user()/etc. - the same
places where x86 does, including futex.h and friends.

And there it's even costlier than on x86. It would obviously be nice
to lift it at least out of unsafe_get_user()/unsafe_put_user() and
move into user_access_begin()/user_access_end(); unfortunately, in
one subarchitecture they really want the range on the user_access_end()
side as well. That's obviously not fatal (they can bloody well save those
into thread_info at user_access_begin()), but right now we have relatively
few user_access_end() callers, so the interface changes are still possible.

Other architectures with similar stuff are riscv (no arguments, same
as for stac/clac), arm (uaccess_save_and_enable() on the way in,
return value passed to uaccess_restore() on the way out) and s390
(similar to arm, but there it's needed only to deal with nesting,
and I'm not sure it actually can happen).

It would be nice to settle the API while there are not too many users
outside of arch/x86; changing it later will be a PITA and we definitely
have architectures that do potentially costly things around the userland
memory access; user_access_begin()/user_access_end() is in the right
place to try and see if they fit there...

2019-10-13 20:27:34

by Linus Torvalds

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 13, 2019 at 12:59 PM Al Viro <[email protected]> wrote:
>
> Re plotting: how strongly would you object against passing the range to
> user_access_end()? Powerpc folks have a very close analogue of stac/clac,
> currently buried inside their __get_user()/__put_user()/etc. - the same
> places where x86 does, including futex.h and friends.
>
> And there it's even costlier than on x86. It would obviously be nice
> to lift it at least out of unsafe_get_user()/unsafe_put_user() and
> move into user_access_begin()/user_access_end(); unfortunately, in
> one subarchitecture they really want the range on the user_access_end()
> side as well.

Hmm. I'm ok with that.

Do they want the actual range, or would they prefer some kind of opaque
cookie that user_access_begin() returns (where 0 would mean "failure"
of course)?

I'm thinking like a local_irq_save/restore thing, which might be the
case on yet other architectures.

Linus

2019-10-15 05:37:15

by Michael Ellerman

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

Linus Torvalds <[email protected]> writes:
> On Sun, Oct 13, 2019 at 12:59 PM Al Viro <[email protected]> wrote:
>> Re plotting: how strongly would you object against passing the range to
>> user_access_end()? Powerpc folks have a very close analogue of stac/clac,
>> currently buried inside their __get_user()/__put_user()/etc. - the same
>> places where x86 does, including futex.h and friends.
>>
>> And there it's even costlier than on x86. It would obviously be nice
>> to lift it at least out of unsafe_get_user()/unsafe_put_user() and
>> move into user_access_begin()/user_access_end(); unfortunately, in
>> one subarchitecture they really want the range on the user_access_end()
>> side as well.
>
> Hmm. I'm ok with that.
>
> Do they want the actual range, or would they prefer some kind of opaque
> cookie that user_access_begin() returns (where 0 would mean "failure"
> of course)?

The code does want the actual range, or at least the range rounded to a
segment boundary (256MB).

But it can get that already from a value it stashes in current->thread,
it was just more natural to pass the addr/size with the way the code is
currently structured.

It seems to generate slightly better code to pass addr/size vs loading
it from current->thread, but it's probably in the noise vs everything
else that's going on.

So a cookie would work fine, we could return the encoded addr/size in
the cookie and that might generate better code than loading it back from
current->thread. Equally we could just use the value in current->thread
and not have any cookie at all.

cheers

2019-10-15 21:23:28

by Al Viro

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

[futex folks and linux-arch Cc'd]

On Sun, Oct 13, 2019 at 08:59:49PM +0100, Al Viro wrote:

> Re plotting: how strongly would you object against passing the range to
> user_access_end()? Powerpc folks have a very close analogue of stac/clac,
> currently buried inside their __get_user()/__put_user()/etc. - the same
> places where x86 does, including futex.h and friends.
>
> And there it's even costlier than on x86. It would obviously be nice
> to lift it at least out of unsafe_get_user()/unsafe_put_user() and
> move into user_access_begin()/user_access_end(); unfortunately, in
> one subarchitecture they really want the range on the user_access_end()
> side as well. That's obviously not fatal (they can bloody well save those
> into thread_info at user_access_begin()), but right now we have relatively
> few user_access_end() callers, so the interface changes are still possible.
>
> Other architectures with similar stuff are riscv (no arguments, same
> as for stac/clac), arm (uaccess_save_and_enable() on the way in,
> return value passed to uaccess_restore() on the way out) and s390
> (similar to arm, but there it's needed only to deal with nesting,
> and I'm not sure it actually can happen).
>
> It would be nice to settle the API while there are not too many users
> outside of arch/x86; changing it later will be a PITA and we definitely
> have architectures that do potentially costly things around the userland
> memory access; user_access_begin()/user_access_end() is in the right
> place to try and see if they fit there...

Another question: right now we have
	if (!access_ok(uaddr, sizeof(u32)))
		return -EFAULT;

	ret = arch_futex_atomic_op_inuser(op, oparg, &oldval, uaddr);
	if (ret)
		return ret;
in kernel/futex.c. Would there be any objections to moving access_ok()
inside the instances and moving pagefault_disable()/pagefault_enable() outside?

Reasons:
* on x86 that would allow folding access_ok() with STAC into
user_access_begin(). The same would be doable on other usual suspects
(arm, arm64, ppc, riscv, s390), bringing access_ok() next to their
STAC counterparts.
* pagefault_disable()/pagefault_enable() pair is universal on
all architectures, really meant to be by the nature of the beast, and
lifting it into kernel/futex.c would get the same situation as with
futex_atomic_cmpxchg_inatomic(). Which also does access_ok() inside
the primitive (also foldable into user_access_begin(), at that).
* access_ok() would be closer to actual memory access (and
out of the generic code).

Comments?

2019-10-15 21:31:57

by Linus Torvalds

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Tue, Oct 15, 2019 at 11:08 AM Al Viro <[email protected]> wrote:
>
> Another question: right now we have
> if (!access_ok(uaddr, sizeof(u32)))
> return -EFAULT;
>
> ret = arch_futex_atomic_op_inuser(op, oparg, &oldval, uaddr);
> if (ret)
> return ret;
> in kernel/futex.c. Would there be any objections to moving access_ok()
> inside the instances and moving pagefault_disable()/pagefault_enable() outside?

I think we should remove all the "atomic" versions, and just make the
rule be that if you want atomic, you surround it with
pagefault_disable()/pagefault_enable().

That covers not just the futex ops (where "atomic" is actually
somewhat ambiguous - the ops themselves are atomic too, so the naming
might stay, although arguably the "futex" part makes that pointless
too), but also copy_to_user_inatomic() and the powerpc version of
__get_user_inatomic().

So we'd aim to get rid of all the "inatomic" ones entirely.

Same ultimately probably goes for the NMI versions. We should just
make it be a rule that we can use all of the user access functions
with pagefault_{dis,en}able() around them, and they'll be "safe" to
use in atomic context.

One issue with the NMI versions is that they actually want to avoid
the current value of set_fs(). So copy_from_user_nmi() (at least on
x86) is special in that it does

	if (__range_not_ok(from, n, TASK_SIZE))
		return n;

instead of access_ok() because of that issue.

NMI also has some other issues (nmi_uaccess_okay() on x86, at least),
but those *probably* could be handled at page fault time instead.

Anyway, NMI is so special that I'd suggest leaving it for later, but
the non-NMI atomic accesses I would suggest you clean up at the same
time.

I think the *only* reason we have the "inatomic()" versions is that
the regular ones do that "might_fault()" testing unconditionally, and
might_fault() _used_ to be just a might_sleep() - so it's not about
functionality per se, it's about "we have this sanity check that we
need to undo".

We've already made "might_fault()" look at pagefault_disabled(), so I
think a lot of the reasons for inatomic are entirely historical.

Linus

2019-10-16 02:25:38

by Al Viro

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Tue, Oct 15, 2019 at 12:00:34PM -0700, Linus Torvalds wrote:
> On Tue, Oct 15, 2019 at 11:08 AM Al Viro <[email protected]> wrote:
> >
> > Another question: right now we have
> > if (!access_ok(uaddr, sizeof(u32)))
> > return -EFAULT;
> >
> > ret = arch_futex_atomic_op_inuser(op, oparg, &oldval, uaddr);
> > if (ret)
> > return ret;
> > in kernel/futex.c. Would there be any objections to moving access_ok()
> > inside the instances and moving pagefault_disable()/pagefault_enable() outside?
>
> I think we should remove all the "atomic" versions, and just make the
> rule be that if you want atomic, you surround it with
> pagefault_disable()/pagefault_enable().

Umm... I thought about that, but ended up with "it documents the intent" -
pagefault_disable() might be implicit (e.g. done by kmap_atomic()) or
several levels up the call chain. Not sure.

> That covers not just the futex ops (where "atomic" is actually
> somewhat ambiguous - the ops themselves are atomic too, so the naming
> might stay, although arguably the "futex" part makes that pointless
> too), but also copy_to_user_inatomic() and the powerpc version of
> __get_user_inatomic().

Eh? copy_to_user_inatomic() doesn't exist; __copy_to_user_inatomic()
does, but...

arch/mips/kernel/unaligned.c:1307: res = __copy_to_user_inatomic(addr, fpr, sizeof(*fpr));
drivers/gpu/drm/i915/i915_gem.c:313: unwritten = __copy_to_user_inatomic(user_data,
lib/test_kasan.c:510: unused = __copy_to_user_inatomic(usermem, kmem, size + 1);
mm/maccess.c:98: ret = __copy_to_user_inatomic((__force void __user *)dst, src, size);

these are all callers it has left anywhere and I'm certainly going to kill it.
Now, __copy_from_user_inatomic() has a lot more callers left... Frankly,
the messier part of the API is the nocache side of things. Consider e.g. this:

	/* platform specific: cacheless copy */
	static void cacheless_memcpy(void *dst, void *src, size_t n)
	{
		/*
		 * Use the only available X64 cacheless copy.  Add a __user cast
		 * to quiet sparse.  The src argument is already in the kernel so
		 * there are no security issues.  The extra fault recovery machinery
		 * is not invoked.
		 */
		__copy_user_nocache(dst, (void __user *)src, n, 0);
	}
or this
	static void ntb_memcpy_tx(struct ntb_queue_entry *entry, void __iomem *offset)
	{
	#ifdef ARCH_HAS_NOCACHE_UACCESS
		/*
		 * Using non-temporal mov to improve performance on non-cached
		 * writes, even though we aren't actually copying from user space.
		 */
		__copy_from_user_inatomic_nocache(offset, entry->buf, entry->len);
	#else
		memcpy_toio(offset, entry->buf, entry->len);
	#endif

		/* Ensure that the data is fully copied out before setting the flags */
		wmb();

		ntb_tx_copy_callback(entry, NULL);
	}
"user" part is bollocks in both cases; moreover, I really wonder about that
ifdef in ntb one - ARCH_HAS_NOCACHE_UACCESS is x86-only *at* *the* *moment*
and it just so happens that ..._toio() doesn't require anything special on
x86. Have e.g. arm grow nocache stuff and the things will suddenly break,
won't they?

2019-10-16 03:38:13

by Al Viro

Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Tue, Oct 15, 2019 at 08:40:12PM +0100, Al Viro wrote:
> or this
> static void ntb_memcpy_tx(struct ntb_queue_entry *entry, void __iomem *offset)
> {
> #ifdef ARCH_HAS_NOCACHE_UACCESS
> /*
> * Using non-temporal mov to improve performance on non-cached
> * writes, even though we aren't actually copying from user space.
> */
> __copy_from_user_inatomic_nocache(offset, entry->buf, entry->len);
> #else
> memcpy_toio(offset, entry->buf, entry->len);
> #endif
>
> /* Ensure that the data is fully copied out before setting the flags */
> wmb();
>
> ntb_tx_copy_callback(entry, NULL);
> }
> "user" part is bollocks in both cases; moreover, I really wonder about that
> ifdef in ntb one - ARCH_HAS_NOCACHE_UACCESS is x86-only *at* *the* *moment*
> and it just so happens that ..._toio() doesn't require anything special on
> x86. Have e.g. arm grow nocache stuff and the things will suddenly break,
> won't they?

Incidentally, there are two callers of __copy_from_user_inatomic_nocache() in
generic code:
lib/iov_iter.c:792: __copy_from_user_inatomic_nocache((to += v.iov_len) - v.iov_len,
lib/iov_iter.c:849: if (__copy_from_user_inatomic_nocache((to += v.iov_len) - v.iov_len,

Neither is done under pagefault_disable(), AFAICS. This one
drivers/gpu/drm/qxl/qxl_ioctl.c:189: unwritten = __copy_from_user_inatomic_nocache
probably is - it has something called qxl_bo_kmap_atomic_page() called just prior,
which would seem to imply kmap_atomic() somewhere in it.
The same goes for
drivers/gpu/drm/i915/i915_gem.c:500: unwritten = __copy_from_user_inatomic_nocache((void __force *)vaddr + offset,

So we have 5 callers anywhere. Two are not "inatomic" in any sense; source is
in userspace and we want nocache behaviour. Two _are_ done into a page that
had been fed through kmap_atomic(); the source is, again, in userland. And
the last one is complete BS - it wants memcpy_toio_nocache() and abuses this
thing.

Incidentally, in case of fault i915 caller ends up unmapping the page,
mapping it non-atomic (with kmap?) and doing plain copy_from_user(),
nocache be damned. qxl, OTOH, whines and fails all the way to userland...

2019-10-16 15:19:29

by Al Viro

Subject: [RFC] change of calling conventions for arch_futex_atomic_op_inuser()

On Tue, Oct 15, 2019 at 07:08:46PM +0100, Al Viro wrote:
> [futex folks and linux-arch Cc'd]

> Another question: right now we have
> if (!access_ok(uaddr, sizeof(u32)))
> return -EFAULT;
>
> ret = arch_futex_atomic_op_inuser(op, oparg, &oldval, uaddr);
> if (ret)
> return ret;
> in kernel/futex.c. Would there be any objections to moving access_ok()
> inside the instances and moving pagefault_disable()/pagefault_enable() outside?
>
> Reasons:
> * on x86 that would allow folding access_ok() with STAC into
> user_access_begin(). The same would be doable on other usual suspects
> (arm, arm64, ppc, riscv, s390), bringing access_ok() next to their
> STAC counterparts.
> * pagefault_disable()/pagefault_enable() pair is universal on
> all architectures, really meant to be by the nature of the beast, and
> lifting it into kernel/futex.c would get the same situation as with
> futex_atomic_cmpxchg_inatomic(). Which also does access_ok() inside
> the primitive (also foldable into user_access_begin(), at that).
> * access_ok() would be closer to actual memory access (and
> out of the generic code).
>
> Comments?

FWIW, completely untested patch follows; just the (semimechanical) conversion
of calling conventions, no per-architecture followups included. Could futex
folks ACK/NAK that in principle?

commit 7babb6ad28cb3e80977fb6bd0405e3f81a943161
Author: Al Viro <[email protected]>
Date: Tue Oct 15 16:54:41 2019 -0400

arch_futex_atomic_op_inuser(): move access_ok() in and pagefault_disable() - out

Signed-off-by: Al Viro <[email protected]>

diff --git a/arch/alpha/include/asm/futex.h b/arch/alpha/include/asm/futex.h
index bfd3c01038f8..da67afd578fd 100644
--- a/arch/alpha/include/asm/futex.h
+++ b/arch/alpha/include/asm/futex.h
@@ -31,7 +31,8 @@ static inline int arch_futex_atomic_op_inuser(int op, int oparg, int *oval,
{
int oldval = 0, ret;

- pagefault_disable();
+ if (!access_ok(uaddr, sizeof(u32)))
+ return -EFAULT;

switch (op) {
case FUTEX_OP_SET:
@@ -53,8 +54,6 @@ static inline int arch_futex_atomic_op_inuser(int op, int oparg, int *oval,
ret = -ENOSYS;
}

- pagefault_enable();
-
if (!ret)
*oval = oldval;

diff --git a/arch/arc/include/asm/futex.h b/arch/arc/include/asm/futex.h
index 9d0d070e6c22..607d1c16d4dd 100644
--- a/arch/arc/include/asm/futex.h
+++ b/arch/arc/include/asm/futex.h
@@ -75,10 +75,12 @@ static inline int arch_futex_atomic_op_inuser(int op, int oparg, int *oval,
{
int oldval = 0, ret;

+ if (!access_ok(uaddr, sizeof(u32)))
+ return -EFAULT;
+
#ifndef CONFIG_ARC_HAS_LLSC
preempt_disable(); /* to guarantee atomic r-m-w of futex op */
#endif
- pagefault_disable();

switch (op) {
case FUTEX_OP_SET:
@@ -101,7 +103,6 @@ static inline int arch_futex_atomic_op_inuser(int op, int oparg, int *oval,
ret = -ENOSYS;
}

- pagefault_enable();
#ifndef CONFIG_ARC_HAS_LLSC
preempt_enable();
#endif
diff --git a/arch/arm/include/asm/futex.h b/arch/arm/include/asm/futex.h
index 83c391b597d4..e133da303a98 100644
--- a/arch/arm/include/asm/futex.h
+++ b/arch/arm/include/asm/futex.h
@@ -134,10 +134,12 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
{
int oldval = 0, ret, tmp;

+ if (!access_ok(uaddr, sizeof(u32)))
+ return -EFAULT;
+
#ifndef CONFIG_SMP
preempt_disable();
#endif
- pagefault_disable();

switch (op) {
case FUTEX_OP_SET:
@@ -159,7 +161,6 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
ret = -ENOSYS;
}

- pagefault_enable();
#ifndef CONFIG_SMP
preempt_enable();
#endif
diff --git a/arch/arm64/include/asm/futex.h b/arch/arm64/include/asm/futex.h
index 6cc26a127819..97f6a63810ec 100644
--- a/arch/arm64/include/asm/futex.h
+++ b/arch/arm64/include/asm/futex.h
@@ -48,7 +48,8 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *_uaddr)
int oldval = 0, ret, tmp;
u32 __user *uaddr = __uaccess_mask_ptr(_uaddr);

- pagefault_disable();
+ if (!access_ok(_uaddr, sizeof(u32)))
+ return -EFAULT;

switch (op) {
case FUTEX_OP_SET:
@@ -75,8 +76,6 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *_uaddr)
ret = -ENOSYS;
}

- pagefault_enable();
-
if (!ret)
*oval = oldval;

diff --git a/arch/hexagon/include/asm/futex.h b/arch/hexagon/include/asm/futex.h
index cb635216a732..8693dc5ae9ec 100644
--- a/arch/hexagon/include/asm/futex.h
+++ b/arch/hexagon/include/asm/futex.h
@@ -36,7 +36,8 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
{
int oldval = 0, ret;

- pagefault_disable();
+ if (!access_ok(uaddr, sizeof(u32)))
+ return -EFAULT;

switch (op) {
case FUTEX_OP_SET:
@@ -62,8 +63,6 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
ret = -ENOSYS;
}

- pagefault_enable();
-
if (!ret)
*oval = oldval;

diff --git a/arch/ia64/include/asm/futex.h b/arch/ia64/include/asm/futex.h
index 2e106d462196..1db26b432d8c 100644
--- a/arch/ia64/include/asm/futex.h
+++ b/arch/ia64/include/asm/futex.h
@@ -50,7 +50,8 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
{
int oldval = 0, ret;

- pagefault_disable();
+ if (!access_ok(uaddr, sizeof(u32)))
+ return -EFAULT;

switch (op) {
case FUTEX_OP_SET:
@@ -74,8 +75,6 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
ret = -ENOSYS;
}

- pagefault_enable();
-
if (!ret)
*oval = oldval;

diff --git a/arch/microblaze/include/asm/futex.h b/arch/microblaze/include/asm/futex.h
index 8c90357e5983..86131ed84c9a 100644
--- a/arch/microblaze/include/asm/futex.h
+++ b/arch/microblaze/include/asm/futex.h
@@ -34,7 +34,8 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
{
int oldval = 0, ret;

- pagefault_disable();
+ if (!access_ok(uaddr, sizeof(u32)))
+ return -EFAULT;

switch (op) {
case FUTEX_OP_SET:
@@ -56,8 +57,6 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
ret = -ENOSYS;
}

- pagefault_enable();
-
if (!ret)
*oval = oldval;

diff --git a/arch/mips/include/asm/futex.h b/arch/mips/include/asm/futex.h
index b83b0397462d..86f224548651 100644
--- a/arch/mips/include/asm/futex.h
+++ b/arch/mips/include/asm/futex.h
@@ -88,7 +88,8 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
{
int oldval = 0, ret;

- pagefault_disable();
+ if (!access_ok(uaddr, sizeof(u32)))
+ return -EFAULT;

switch (op) {
case FUTEX_OP_SET:
@@ -115,8 +116,6 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
ret = -ENOSYS;
}

- pagefault_enable();
-
if (!ret)
*oval = oldval;

diff --git a/arch/nds32/include/asm/futex.h b/arch/nds32/include/asm/futex.h
index 5213c65c2e0b..60b7ab74ed92 100644
--- a/arch/nds32/include/asm/futex.h
+++ b/arch/nds32/include/asm/futex.h
@@ -66,8 +66,9 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
{
int oldval = 0, ret;

+ if (!access_ok(uaddr, sizeof(u32)))
+ return -EFAULT;

- pagefault_disable();
switch (op) {
case FUTEX_OP_SET:
__futex_atomic_op("move %0, %3", ret, oldval, tmp, uaddr,
@@ -93,8 +94,6 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
ret = -ENOSYS;
}

- pagefault_enable();
-
if (!ret)
*oval = oldval;

diff --git a/arch/openrisc/include/asm/futex.h b/arch/openrisc/include/asm/futex.h
index fe894e6331ae..865e9cd0d97b 100644
--- a/arch/openrisc/include/asm/futex.h
+++ b/arch/openrisc/include/asm/futex.h
@@ -35,7 +35,8 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
{
int oldval = 0, ret;

- pagefault_disable();
+ if (!access_ok(uaddr, sizeof(u32)))
+ return -EFAULT;

switch (op) {
case FUTEX_OP_SET:
@@ -57,8 +58,6 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
ret = -ENOSYS;
}

- pagefault_enable();
-
if (!ret)
*oval = oldval;

diff --git a/arch/parisc/include/asm/futex.h b/arch/parisc/include/asm/futex.h
index 50662b6cb605..6e2e4d10e3c8 100644
--- a/arch/parisc/include/asm/futex.h
+++ b/arch/parisc/include/asm/futex.h
@@ -40,11 +40,10 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
u32 tmp;

_futex_spin_lock_irqsave(uaddr, &flags);
- pagefault_disable();

ret = -EFAULT;
if (unlikely(get_user(oldval, uaddr) != 0))
- goto out_pagefault_enable;
+ goto out_unlock;

ret = 0;
tmp = oldval;
@@ -72,8 +71,7 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
if (ret == 0 && unlikely(put_user(tmp, uaddr) != 0))
ret = -EFAULT;

-out_pagefault_enable:
- pagefault_enable();
+out_unlock:
_futex_spin_unlock_irqrestore(uaddr, &flags);

if (!ret)
diff --git a/arch/powerpc/include/asm/futex.h b/arch/powerpc/include/asm/futex.h
index eea28ca679db..d6e32b32f452 100644
--- a/arch/powerpc/include/asm/futex.h
+++ b/arch/powerpc/include/asm/futex.h
@@ -35,8 +35,9 @@ static inline int arch_futex_atomic_op_inuser(int op, int oparg, int *oval,
{
int oldval = 0, ret;

+ if (!access_ok(uaddr, sizeof(u32)))
+ return -EFAULT;
allow_write_to_user(uaddr, sizeof(*uaddr));
- pagefault_disable();

switch (op) {
case FUTEX_OP_SET:
@@ -58,8 +59,6 @@ static inline int arch_futex_atomic_op_inuser(int op, int oparg, int *oval,
ret = -ENOSYS;
}

- pagefault_enable();
-
*oval = oldval;

prevent_write_to_user(uaddr, sizeof(*uaddr));
diff --git a/arch/riscv/include/asm/futex.h b/arch/riscv/include/asm/futex.h
index 4ad6409c4647..84574acfb927 100644
--- a/arch/riscv/include/asm/futex.h
+++ b/arch/riscv/include/asm/futex.h
@@ -40,7 +40,8 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
{
int oldval = 0, ret = 0;

- pagefault_disable();
+ if (!access_ok(uaddr, sizeof(u32)))
+ return -EFAULT;

switch (op) {
case FUTEX_OP_SET:
@@ -67,8 +68,6 @@ arch_futex_atomic_op_inuser(int op, int oparg, int *oval, u32 __user *uaddr)
ret = -ENOSYS;
}

- pagefault_enable();
-
if (!ret)
*oval = oldval;

diff --git a/arch/s390/include/asm/futex.h b/arch/s390/include/asm/futex.h
index 5e97a4353147..3c18a48baf44 100644
--- a/arch/s390/include/asm/futex.h
+++ b/arch/s390/include/asm/futex.h
@@ -28,8 +28,10 @@ static inline int arch_futex_atomic_op_inuser(int op, int oparg, int *oval,
int oldval = 0, newval, ret;
mm_segment_t old_fs;

+ if (!access_ok(uaddr, sizeof(u32)))
+ return -EFAULT;
+
old_fs = enable_sacf_uaccess();
- pagefault_disable();
switch (op) {
case FUTEX_OP_SET:
__futex_atomic_op("lr %2,%5\n",
@@ -54,7 +56,6 @@ static inline int arch_futex_atomic_op_inuser(int op, int oparg, int *oval,
default:
ret = -ENOSYS;
}
- pagefault_enable();
disable_sacf_uaccess(old_fs);

if (!ret)
diff --git a/arch/sh/include/asm/futex.h b/arch/sh/include/asm/futex.h
index 3190ec89df81..b39cda09fb95 100644
--- a/arch/sh/include/asm/futex.h
+++ b/arch/sh/include/asm/futex.h
@@ -34,8 +34,6 @@ static inline int arch_futex_atomic_op_inuser(int op, u32 oparg, int *oval,
u32 oldval, newval, prev;
int ret;

- pagefault_disable();
-
do {
ret = get_user(oldval, uaddr);

@@ -67,8 +65,6 @@ static inline int arch_futex_atomic_op_inuser(int op, u32 oparg, int *oval,
ret = futex_atomic_cmpxchg_inatomic(&prev, uaddr, oldval, newval);
} while (!ret && prev != oldval);

- pagefault_enable();
-
if (!ret)
*oval = oldval;

diff --git a/arch/sparc/include/asm/futex_64.h b/arch/sparc/include/asm/futex_64.h
index 0865ce77ec00..72de967318d7 100644
--- a/arch/sparc/include/asm/futex_64.h
+++ b/arch/sparc/include/asm/futex_64.h
@@ -38,8 +38,6 @@ static inline int arch_futex_atomic_op_inuser(int op, int oparg, int *oval,
if (unlikely((((unsigned long) uaddr) & 0x3UL)))
return -EINVAL;

- pagefault_disable();
-
switch (op) {
case FUTEX_OP_SET:
__futex_cas_op("mov\t%4, %1", ret, oldval, uaddr, oparg);
@@ -60,8 +58,6 @@ static inline int arch_futex_atomic_op_inuser(int op, int oparg, int *oval,
ret = -ENOSYS;
}

- pagefault_enable();
-
if (!ret)
*oval = oldval;

diff --git a/arch/x86/include/asm/futex.h b/arch/x86/include/asm/futex.h
index 13c83fe97988..6bcd1c1486d9 100644
--- a/arch/x86/include/asm/futex.h
+++ b/arch/x86/include/asm/futex.h
@@ -47,7 +47,8 @@ static inline int arch_futex_atomic_op_inuser(int op, int oparg, int *oval,
{
int oldval = 0, ret, tem;

- pagefault_disable();
+ if (!access_ok(uaddr, sizeof(u32)))
+ return -EFAULT;

switch (op) {
case FUTEX_OP_SET:
@@ -70,8 +71,6 @@ static inline int arch_futex_atomic_op_inuser(int op, int oparg, int *oval,
ret = -ENOSYS;
}

- pagefault_enable();
-
if (!ret)
*oval = oldval;

diff --git a/arch/xtensa/include/asm/futex.h b/arch/xtensa/include/asm/futex.h
index 0c4457ca0a85..271cfcf8a841 100644
--- a/arch/xtensa/include/asm/futex.h
+++ b/arch/xtensa/include/asm/futex.h
@@ -72,7 +72,8 @@ static inline int arch_futex_atomic_op_inuser(int op, int oparg, int *oval,
#if XCHAL_HAVE_S32C1I || XCHAL_HAVE_EXCLUSIVE
int oldval = 0, ret;

- pagefault_disable();
+ if (!access_ok(uaddr, sizeof(u32)))
+ return -EFAULT;

switch (op) {
case FUTEX_OP_SET:
@@ -99,8 +100,6 @@ static inline int arch_futex_atomic_op_inuser(int op, int oparg, int *oval,
ret = -ENOSYS;
}

- pagefault_enable();
-
if (!ret)
*oval = oldval;

diff --git a/include/asm-generic/futex.h b/include/asm-generic/futex.h
index 02970b11f71f..f4c3470480c7 100644
--- a/include/asm-generic/futex.h
+++ b/include/asm-generic/futex.h
@@ -34,7 +34,6 @@ arch_futex_atomic_op_inuser(int op, u32 oparg, int *oval, u32 __user *uaddr)
u32 tmp;

preempt_disable();
- pagefault_disable();

ret = -EFAULT;
if (unlikely(get_user(oldval, uaddr) != 0))
@@ -67,7 +66,6 @@ arch_futex_atomic_op_inuser(int op, u32 oparg, int *oval, u32 __user *uaddr)
ret = -EFAULT;

out_pagefault_enable:
- pagefault_enable();
preempt_enable();

if (ret == 0)
diff --git a/kernel/futex.c b/kernel/futex.c
index bd18f60e4c6c..2cc8a35109da 100644
--- a/kernel/futex.c
+++ b/kernel/futex.c
@@ -1662,10 +1662,9 @@ static int futex_atomic_op_inuser(unsigned int encoded_op, u32 __user *uaddr)
oparg = 1 << oparg;
}

- if (!access_ok(uaddr, sizeof(u32)))
- return -EFAULT;
-
+ pagefault_disable();
ret = arch_futex_atomic_op_inuser(op, oparg, &oldval, uaddr);
+ pagefault_enable();
if (ret)
return ret;

2019-10-16 15:29:44

by Thomas Gleixner

Subject: Re: [RFC] change of calling conventions for arch_futex_atomic_op_inuser()

On Wed, 16 Oct 2019, Al Viro wrote:
> On Tue, Oct 15, 2019 at 07:08:46PM +0100, Al Viro wrote:
> > [futex folks and linux-arch Cc'd]
>
> > Another question: right now we have
> > if (!access_ok(uaddr, sizeof(u32)))
> > return -EFAULT;
> >
> > ret = arch_futex_atomic_op_inuser(op, oparg, &oldval, uaddr);
> > if (ret)
> > return ret;
> > in kernel/futex.c. Would there be any objections to moving access_ok()
> > inside the instances and moving pagefault_disable()/pagefault_enable() outside?
> >
> > Reasons:
> > * on x86 that would allow folding access_ok() with STAC into
> > user_access_begin(). The same would be doable on other usual suspects
> > (arm, arm64, ppc, riscv, s390), bringing access_ok() next to their
> > STAC counterparts.
> > * pagefault_disable()/pagefault_enable() pair is universal on
> > all architectures, really meant to be by the nature of the beast, and
> > lifting it into kernel/futex.c would get the same situation as with
> > futex_atomic_cmpxchg_inatomic(). Which also does access_ok() inside
> > the primitive (also foldable into user_access_begin(), at that).
> > * access_ok() would be closer to actual memory access (and
> > out of the generic code).
> >
> > Comments?
>
> FWIW, completely untested patch follows; just the (semimechanical) conversion
> of calling conventions, no per-architecture followups included. Could futex
> folks ACK/NAK that in principle?

Makes sense and does not change any of the futex semantics. Go wild.

Thanks,

tglx
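The calling-convention change Al proposes above can be sketched in plain C. This is a userspace model under stated assumptions, not the kernel code: `range_ok()` is a hypothetical stand-in for access_ok(), the pagefault_disable()/pagefault_enable() pair is modeled as comments, and only a FUTEX_OP_ADD-style operation is shown.

```c
#include <errno.h>
#include <stdint.h>

/* Hypothetical stand-in for access_ok(): NULL models an invalid
 * user range. */
static int range_ok(const uint32_t *uaddr) { return uaddr != 0; }

/* New convention: the arch primitive does the range check itself,
 * so an arch can fold it into its user_access_begin()/STAC sequence. */
static int arch_futex_atomic_op_inuser(int op, uint32_t oparg,
                                       uint32_t *oldval, uint32_t *uaddr)
{
    (void)op;                       /* model FUTEX_OP_ADD only */
    if (!range_ok(uaddr))
        return -EFAULT;
    *oldval = *uaddr;
    *uaddr += oparg;
    return 0;
}

/* Generic code: the pagefault_disable()/enable() pair lifted out of
 * the arch code (modeled as comments), no access_ok() before the call. */
static int futex_atomic_op_inuser(int op, uint32_t oparg,
                                  uint32_t *oldval, uint32_t *uaddr)
{
    int ret;
    /* pagefault_disable(); */
    ret = arch_futex_atomic_op_inuser(op, oparg, oldval, uaddr);
    /* pagefault_enable(); */
    return ret;
}
```

The point of the shape: the generic caller no longer performs a separate range check, so on x86 and similar architectures the check and the STAC can be emitted as one sequence inside the primitive.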

2019-10-17 13:36:38

by Al Viro

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

On Sun, Oct 13, 2019 at 12:22:38PM -0700, Linus Torvalds wrote:
> On Sun, Oct 13, 2019 at 12:10 PM Al Viro <[email protected]> wrote:
> >
> > No arguments re put_user_ex side of things... Below is a completely
> > untested patch for get_user_ex elimination (it seems to build, but that's
> > it); in any case, I would really like to see comments from x86 folks
> > before it goes anywhere.
>
> Please don't do this:
>
> > + if (unlikely(__copy_from_user(&sc, usc, sizeof(sc))))
> > + goto Efault;
>
> Why would you use __copy_from_user()? Just don't.
>
> > + if (unlikely(__copy_from_user(&v, user_vm86,
> > + offsetof(struct vm86_struct, int_revectored))))
>
> Same here.
>
> There's no excuse for __copy_from_user().

FWIW, callers of __copy_from_user() remaining in the generic code:

1) regset.h:user_regset_copyin(). Switch to copy_from_user(); the calling
conventions of regset ->set() (as well as the method name) are atrocious,
but there are too many instances to mix any work in that direction into
this series. Yes, nominally it's an inline, but IRL it's too large and
has many callers in the same file(s), so any optimizations of inlining
__copy_from_user() will be lost and there's more than enough work done
there to make access_ok() a noise. And in this case it doesn't pay
to try and lift user_access_begin() into the callers - the work done
between the calls is often too non-trivial to be done in such area.
The same goes for other regset.h stuff; eventually we might want to
try and come up with saner API, but that's a separate story.

2) default csum_partial_copy_from_user(). What we need to do is
turn it into default csum_and_copy_from_user(). This
#ifndef _HAVE_ARCH_COPY_AND_CSUM_FROM_USER
static inline
__wsum csum_and_copy_from_user (const void __user *src, void *dst,
int len, __wsum sum, int *err_ptr)
{
if (access_ok(src, len))
return csum_partial_copy_from_user(src, dst, len, sum, err_ptr);

if (len)
*err_ptr = -EFAULT;

return sum;
}
#endif
in checksum.h is the only thing that calls that sucker and we can bloody
well combine them and make the users of lib/checksum.h define
_HAVE_ARCH_COPY_AND_CSUM_FROM_USER. That puts us reasonably close
to having _HAVE_ARCH_COPY_AND_CSUM_FROM_USER unconditional and in any
case, __copy_from_user() in lib/checksum.h turns into copy_from_user().

3) firewire ioctl_queue_iso(). Convert to copy_from_user(), lose the
access_ok() before the loop. Definitely not an unsafe_... situation
(we call fw_iso_context_queue() after each chunk; _not_ something
we want under user_access_begin()/user_access_end()) and it's really
not worth trying to save on access_ok() checks there.

4) pstore persistent_ram_update_user(). Obvious copy_from_user(); definitely
lose access_ok() in the caller (persistent_ram_write_user()), along with
the one in write_pmsg() (several calls back by the callchain).

5) test_kasan: lose the function, lose the tests...

6) drivers/scsi/sg.c nest: sg_read() ones are memdup_user() in disguise
(i.e. fold with immediately preceding kmalloc()s). sg_new_write() -
fold with access_ok() into copy_from_user() (for both call sites).
sg_write() - lose access_ok(), use copy_from_user() (both call sites)
and get_user() (instead of the solitary __get_user() there).

7) i915 ones are, frankly, terrifying. Consider e.g. this one:
relocs = kvmalloc_array(size, 1, GFP_KERNEL);
if (!relocs) {
err = -ENOMEM;
goto err;
}

/* copy_from_user is limited to < 4GiB */
copied = 0;
do {
unsigned int len =
min_t(u64, BIT_ULL(31), size - copied);

if (__copy_from_user((char *)relocs + copied,
(char __user *)urelocs + copied,
len))
goto end;

copied += len;
} while (copied < size);
Is that for real? Are they *really* trying to allocate and copy >2Gb of
userland data? That's eb_copy_relocations() and that crap is itself in
a loop. Sizes come from user-supplied data. WTF? It's some weird
kmemdup lookalike and I'd rather heard from maintainers of that thing
before doing anything with it.

8) vhost_copy_from_user(). Need comments from mst - it's been a while since I crawled
through that code and I'd need his ACK anyway. The logic behind the positioning
of access_ok() in there is non-trivial and I'm not sure how much of it serves
as early input validation and how much can be taken out and replaced by use of
plain copy_from_user() and friends.

9) KVM. There I'm not sure that access_ok() would be the right thing to
do. kvm_is_error_hva() tends to serve as the range check in that and similar
places; it's not the same situation as with NMI, but...

And that's it - everything else is in arch/*. Looking at arch/x86, we have
* insanity in math_emu (unchecked return value, for example)
* a bunch of sigframe-related code. Some want to use unsafe_...
(or raw_...) variant, some should probably go for copy_from_user().
FPU-related stuff is particularly interesting in that respect - there
we have several inline functions nearby that contain nothing but
stac + instruction + clac + exception handling. And in quite a few
cases it would've been cleaner to lift stac/clac into the callers, since
they combine nicely.
* regset_tls_set(): use copy_from_user().
* one in kvm walk_addr_generic stuff. If nothing else,
that one smells like __get_user() - we seem to be copying a single
PTE. And again, it's using kvm_is_error_hva().
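The i915 chunking pattern quoted in item 7 can be modeled in userspace C. This is a sketch under assumptions: `fake_copy_from_user()` is a hypothetical memcpy-based stand-in for copy_from_user() (returning bytes not copied, 0 on success), and the 2 GiB cap mirrors the BIT_ULL(31) limit in the quoted loop.

```c
#include <stdint.h>
#include <string.h>

/* Stand-in for copy_from_user(): returns the number of bytes NOT
 * copied (0 on success), like the kernel helper. */
static unsigned long fake_copy_from_user(void *dst, const void *src,
                                         size_t len)
{
    if (!src)
        return len;
    memcpy(dst, src, len);
    return 0;
}

#define CHUNK (1ULL << 31)      /* BIT_ULL(31): at most 2 GiB per call */

/* Copy `size` bytes in sub-2GiB chunks, mirroring the quoted
 * eb_copy_relocations() loop. */
static int chunked_copy(char *dst, const char *src, uint64_t size)
{
    uint64_t copied = 0;

    do {
        uint64_t len = size - copied;

        if (len > CHUNK)
            len = CHUNK;
        if (fake_copy_from_user(dst + copied, src + copied, len))
            return -1;
        copied += len;
    } while (copied < size);
    return 0;
}
```

For any `size` below 2 GiB the loop runs exactly once, which is why Al's question is really about whether multi-gigabyte user-supplied sizes should ever reach this path at all.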

2019-10-18 22:16:52

by Al Viro

[permalink] [raw]
Subject: [RFC][PATCHES] drivers/scsi/sg.c uaccess cleanups/fixes

On Wed, Oct 16, 2019 at 09:25:40PM +0100, Al Viro wrote:

> FWIW, callers of __copy_from_user() remaining in the generic code:

> 6) drivers/scsi/sg.c nest: sg_read() ones are memdup_user() in disguise
> (i.e. fold with immediately preceding kmalloc()s). sg_new_write() -
> fold with access_ok() into copy_from_user() (for both call sites).
> sg_write() - lose access_ok(), use copy_from_user() (both call sites)
> and get_user() (instead of the solitary __get_user() there).

Turns out that there'd been outright redundant access_ok() calls (not
even warranted by __copy_...) *and* several __put_user()/__get_user()
with no checking of return value (access_ok() was there, handling of
unmapped addresses wasn't). The latter go back at least to 2.1.early...

I've got a series that presumably fixes and cleans the things up
in that area; it didn't get any serious testing (the kernel builds
and boots, smartctl works as well as it used to, but that's not
worth much - all it says is that SG_IO doesn't fail terribly;
I don't have any test setup for really working with /dev/sg*).

IOW, it needs more review and testing - this is _not_ a pull request.
It's in vfs.git#work.sg; individual patches are in followups.
Shortlog/diffstat:
Al Viro (8):
sg_ioctl(): fix copyout handling
sg_new_write(): replace access_ok() + __copy_from_user() with copy_from_user()
sg_write(): __get_user() can fail...
sg_read(): simplify reading ->pack_id of userland sg_io_hdr_t
sg_new_write(): don't bother with access_ok
sg_read(): get rid of access_ok()/__copy_..._user()
sg_write(): get rid of access_ok()/__copy_from_user()/__get_user()
SG_IO: get rid of access_ok()

drivers/scsi/sg.c | 98 ++++++++++++++++++++++++++++++++----------------------------------------------------------------
1 file changed, 32 insertions(+), 66 deletions(-)

2019-10-18 22:17:04

by Al Viro

[permalink] [raw]
Subject: [RFC PATCH 1/8] sg_ioctl(): fix copyout handling

From: Al Viro <[email protected]>

First of all, __put_user() can fail with access_ok() succeeding.
And access_ok() + __copy_to_user() is spelled copy_to_user()...

Signed-off-by: Al Viro <[email protected]>
---
drivers/scsi/sg.c | 43 ++++++++++++++++---------------------------
1 file changed, 16 insertions(+), 27 deletions(-)

diff --git a/drivers/scsi/sg.c b/drivers/scsi/sg.c
index cce757506383..634460421ce4 100644
--- a/drivers/scsi/sg.c
+++ b/drivers/scsi/sg.c
@@ -963,26 +963,21 @@ sg_ioctl(struct file *filp, unsigned int cmd_in, unsigned long arg)
case SG_GET_LOW_DMA:
return put_user((int) sdp->device->host->unchecked_isa_dma, ip);
case SG_GET_SCSI_ID:
- if (!access_ok(p, sizeof (sg_scsi_id_t)))
- return -EFAULT;
- else {
- sg_scsi_id_t __user *sg_idp = p;
+ {
+ sg_scsi_id_t v;

if (atomic_read(&sdp->detaching))
return -ENODEV;
- __put_user((int) sdp->device->host->host_no,
- &sg_idp->host_no);
- __put_user((int) sdp->device->channel,
- &sg_idp->channel);
- __put_user((int) sdp->device->id, &sg_idp->scsi_id);
- __put_user((int) sdp->device->lun, &sg_idp->lun);
- __put_user((int) sdp->device->type, &sg_idp->scsi_type);
- __put_user((short) sdp->device->host->cmd_per_lun,
- &sg_idp->h_cmd_per_lun);
- __put_user((short) sdp->device->queue_depth,
- &sg_idp->d_queue_depth);
- __put_user(0, &sg_idp->unused[0]);
- __put_user(0, &sg_idp->unused[1]);
+ memset(&v, 0, sizeof(v));
+ v.host_no = sdp->device->host->host_no;
+ v.channel = sdp->device->channel;
+ v.scsi_id = sdp->device->id;
+ v.lun = sdp->device->lun;
+ v.scsi_type = sdp->device->type;
+ v.h_cmd_per_lun = sdp->device->host->cmd_per_lun;
+ v.d_queue_depth = sdp->device->queue_depth;
+ if (copy_to_user(p, &v, sizeof(sg_scsi_id_t)))
+ return -EFAULT;
return 0;
}
case SG_SET_FORCE_PACK_ID:
@@ -992,20 +987,16 @@ sg_ioctl(struct file *filp, unsigned int cmd_in, unsigned long arg)
sfp->force_packid = val ? 1 : 0;
return 0;
case SG_GET_PACK_ID:
- if (!access_ok(ip, sizeof (int)))
- return -EFAULT;
read_lock_irqsave(&sfp->rq_list_lock, iflags);
list_for_each_entry(srp, &sfp->rq_list, entry) {
if ((1 == srp->done) && (!srp->sg_io_owned)) {
read_unlock_irqrestore(&sfp->rq_list_lock,
iflags);
- __put_user(srp->header.pack_id, ip);
- return 0;
+ return put_user(srp->header.pack_id, ip);
}
}
read_unlock_irqrestore(&sfp->rq_list_lock, iflags);
- __put_user(-1, ip);
- return 0;
+ return put_user(-1, ip);
case SG_GET_NUM_WAITING:
read_lock_irqsave(&sfp->rq_list_lock, iflags);
val = 0;
@@ -1073,9 +1064,7 @@ sg_ioctl(struct file *filp, unsigned int cmd_in, unsigned long arg)
val = (sdp->device ? 1 : 0);
return put_user(val, ip);
case SG_GET_REQUEST_TABLE:
- if (!access_ok(p, SZ_SG_REQ_INFO * SG_MAX_QUEUE))
- return -EFAULT;
- else {
+ {
sg_req_info_t *rinfo;

rinfo = kcalloc(SG_MAX_QUEUE, SZ_SG_REQ_INFO,
@@ -1085,7 +1074,7 @@ sg_ioctl(struct file *filp, unsigned int cmd_in, unsigned long arg)
read_lock_irqsave(&sfp->rq_list_lock, iflags);
sg_fill_request_table(sfp, rinfo);
read_unlock_irqrestore(&sfp->rq_list_lock, iflags);
- result = __copy_to_user(p, rinfo,
+ result = copy_to_user(p, rinfo,
SZ_SG_REQ_INFO * SG_MAX_QUEUE);
result = result ? -EFAULT : 0;
kfree(rinfo);
--
2.11.0
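The transformation in this patch - build the reply in a zeroed local struct, then push it to userspace with one checked copy - can be sketched in userspace C. The struct layout and field values here are illustrative only (the real sg_scsi_id_t lives in scsi/sg.h), and `fake_copy_to_user()` is a hypothetical memcpy-based stand-in for copy_to_user().

```c
#include <string.h>

/* Illustrative layout only - the real sg_scsi_id_t lives in scsi/sg.h. */
typedef struct {
    int host_no, channel, scsi_id, lun, scsi_type;
    short h_cmd_per_lun, d_queue_depth;
    int unused[2];
} sg_scsi_id_demo_t;

/* Stand-in for copy_to_user(): returns bytes NOT copied (0 == success). */
static unsigned long fake_copy_to_user(void *dst, const void *src, size_t n)
{
    if (!dst)
        return n;
    memcpy(dst, src, n);
    return 0;
}

/* Build the reply in a zeroed kernel-side struct, then push it out with
 * one checked copy - replacing nine unchecked __put_user() calls. */
static int fill_scsi_id(sg_scsi_id_demo_t *p)
{
    sg_scsi_id_demo_t v;

    memset(&v, 0, sizeof(v));   /* also zeroes unused[] and any padding */
    v.host_no = 1;
    v.channel = 0;
    v.scsi_id = 3;
    if (fake_copy_to_user(p, &v, sizeof(v)))
        return -14;             /* -EFAULT */
    return 0;
}
```

Besides being checked, the memset also avoids leaking uninitialized padding bytes to userspace, which the per-field __put_user() version never wrote at all.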

2019-10-18 22:17:07

by Al Viro

[permalink] [raw]
Subject: [RFC PATCH 3/8] sg_write(): __get_user() can fail...

From: Al Viro <[email protected]>

Signed-off-by: Al Viro <[email protected]>
---
drivers/scsi/sg.c | 6 ++++--
1 file changed, 4 insertions(+), 2 deletions(-)

diff --git a/drivers/scsi/sg.c b/drivers/scsi/sg.c
index 026628aa556d..4c62237cdf37 100644
--- a/drivers/scsi/sg.c
+++ b/drivers/scsi/sg.c
@@ -640,13 +640,15 @@ sg_write(struct file *filp, const char __user *buf, size_t count, loff_t * ppos)
if (count < (SZ_SG_HEADER + 6))
return -EIO; /* The minimum scsi command length is 6 bytes. */

+ buf += SZ_SG_HEADER;
+ if (__get_user(opcode, buf))
+ return -EFAULT;
+
if (!(srp = sg_add_request(sfp))) {
SCSI_LOG_TIMEOUT(1, sg_printk(KERN_INFO, sdp,
"sg_write: queue full\n"));
return -EDOM;
}
- buf += SZ_SG_HEADER;
- __get_user(opcode, buf);
mutex_lock(&sfp->f_mutex);
if (sfp->next_cmd_len > 0) {
cmd_size = sfp->next_cmd_len;
--
2.11.0

2019-10-18 22:17:08

by Al Viro

[permalink] [raw]
Subject: [RFC PATCH 6/8] sg_read(): get rid of access_ok()/__copy_..._user()

From: Al Viro <[email protected]>

Use copy_..._user() instead, both in sg_read() and in sg_read_oxfer().
And don't open-code memdup_user()...

Signed-off-by: Al Viro <[email protected]>
---
drivers/scsi/sg.c | 18 ++++++------------
1 file changed, 6 insertions(+), 12 deletions(-)

diff --git a/drivers/scsi/sg.c b/drivers/scsi/sg.c
index 3702f66493f7..9f6534a025cd 100644
--- a/drivers/scsi/sg.c
+++ b/drivers/scsi/sg.c
@@ -429,16 +429,10 @@ sg_read(struct file *filp, char __user *buf, size_t count, loff_t * ppos)
SCSI_LOG_TIMEOUT(3, sg_printk(KERN_INFO, sdp,
"sg_read: count=%d\n", (int) count));

- if (!access_ok(buf, count))
- return -EFAULT;
if (sfp->force_packid && (count >= SZ_SG_HEADER)) {
- old_hdr = kmalloc(SZ_SG_HEADER, GFP_KERNEL);
- if (!old_hdr)
- return -ENOMEM;
- if (__copy_from_user(old_hdr, buf, SZ_SG_HEADER)) {
- retval = -EFAULT;
- goto free_old_hdr;
- }
+ old_hdr = memdup_user(buf, SZ_SG_HEADER);
+ if (IS_ERR(old_hdr))
+ return PTR_ERR(old_hdr);
if (old_hdr->reply_len < 0) {
if (count >= SZ_SG_IO_HDR) {
sg_io_hdr_t __user *p = (void __user *)buf;
@@ -529,7 +523,7 @@ sg_read(struct file *filp, char __user *buf, size_t count, loff_t * ppos)

/* Now copy the result back to the user buffer. */
if (count >= SZ_SG_HEADER) {
- if (__copy_to_user(buf, old_hdr, SZ_SG_HEADER)) {
+ if (copy_to_user(buf, old_hdr, SZ_SG_HEADER)) {
retval = -EFAULT;
goto free_old_hdr;
}
@@ -1960,12 +1954,12 @@ sg_read_oxfer(Sg_request * srp, char __user *outp, int num_read_xfer)
num = 1 << (PAGE_SHIFT + schp->page_order);
for (k = 0; k < schp->k_use_sg && schp->pages[k]; k++) {
if (num > num_read_xfer) {
- if (__copy_to_user(outp, page_address(schp->pages[k]),
+ if (copy_to_user(outp, page_address(schp->pages[k]),
num_read_xfer))
return -EFAULT;
break;
} else {
- if (__copy_to_user(outp, page_address(schp->pages[k]),
+ if (copy_to_user(outp, page_address(schp->pages[k]),
num))
return -EFAULT;
num_read_xfer -= num;
--
2.11.0

2019-10-18 22:17:10

by Al Viro

[permalink] [raw]
Subject: [RFC PATCH 8/8] SG_IO: get rid of access_ok()

From: Al Viro <[email protected]>

simply not needed there - neither sg_new_read() nor sg_new_write() need
it.

Signed-off-by: Al Viro <[email protected]>
---
drivers/scsi/sg.c | 2 --
1 file changed, 2 deletions(-)

diff --git a/drivers/scsi/sg.c b/drivers/scsi/sg.c
index f3d090b93cdf..0940abd91d3c 100644
--- a/drivers/scsi/sg.c
+++ b/drivers/scsi/sg.c
@@ -896,8 +896,6 @@ sg_ioctl(struct file *filp, unsigned int cmd_in, unsigned long arg)
return -ENODEV;
if (!scsi_block_when_processing_errors(sdp->device))
return -ENXIO;
- if (!access_ok(p, SZ_SG_IO_HDR))
- return -EFAULT;
result = sg_new_write(sfp, filp, p, SZ_SG_IO_HDR,
1, read_only, 1, &srp);
if (result < 0)
--
2.11.0

2019-10-18 22:17:16

by Al Viro

[permalink] [raw]
Subject: [RFC PATCH 4/8] sg_read(): simplify reading ->pack_id of userland sg_io_hdr_t

From: Al Viro <[email protected]>

We don't need to allocate a temporary buffer and read the entire
structure in it, only to fetch a single field and free what we'd
allocated. Just use get_user() and be done with it...

Signed-off-by: Al Viro <[email protected]>
---
drivers/scsi/sg.c | 13 ++-----------
1 file changed, 2 insertions(+), 11 deletions(-)

diff --git a/drivers/scsi/sg.c b/drivers/scsi/sg.c
index 4c62237cdf37..2d30e89075e9 100644
--- a/drivers/scsi/sg.c
+++ b/drivers/scsi/sg.c
@@ -441,17 +441,8 @@ sg_read(struct file *filp, char __user *buf, size_t count, loff_t * ppos)
}
if (old_hdr->reply_len < 0) {
if (count >= SZ_SG_IO_HDR) {
- sg_io_hdr_t *new_hdr;
- new_hdr = kmalloc(SZ_SG_IO_HDR, GFP_KERNEL);
- if (!new_hdr) {
- retval = -ENOMEM;
- goto free_old_hdr;
- }
- retval =__copy_from_user
- (new_hdr, buf, SZ_SG_IO_HDR);
- req_pack_id = new_hdr->pack_id;
- kfree(new_hdr);
- if (retval) {
+ sg_io_hdr_t __user *p = (void __user *)buf;
+ if (get_user(req_pack_id, &p->pack_id)) {
retval = -EFAULT;
goto free_old_hdr;
}
--
2.11.0
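The simplification above - fetch one field through its offset instead of copying the whole struct - looks like this as a userspace sketch. The struct is a minimal stand-in for sg_io_hdr_t (only pack_id matters here), and `fake_get_user()` is a hypothetical model of get_user() that treats NULL as a fault.

```c
#include <stddef.h>

/* Minimal model of the userland header: only pack_id matters here. */
typedef struct {
    int pack_id;
    char rest[60];
} sg_io_hdr_demo_t;

/* Stand-in for get_user(): reads one word from a "user" pointer. */
static int fake_get_user(int *dst, const int *uptr)
{
    if (!uptr)
        return -14;             /* -EFAULT */
    *dst = *uptr;
    return 0;
}

/* Fetch just ->pack_id, instead of kmalloc() + copying the whole
 * struct and then throwing everything but one field away. */
static int read_pack_id(const sg_io_hdr_demo_t *ubuf, int *req_pack_id)
{
    return fake_get_user(req_pack_id, &ubuf->pack_id);
}
```

This removes an allocation, a failure path (-ENOMEM), and a full-struct copy, and leaves a single checked word-sized access.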

2019-10-18 22:17:18

by Al Viro

[permalink] [raw]
Subject: [RFC PATCH 5/8] sg_new_write(): don't bother with access_ok

From: Al Viro <[email protected]>

... just use copy_from_user(). We copy only SZ_SG_IO_HDR bytes,
so that would, strictly speaking, loosen the check. However,
for call chains via ->write() the caller has actually checked
the entire range and SG_IO passes exactly SZ_SG_IO_HDR for count.
So no visible behaviour changes happen if we check only what
we really need for copyin.

Signed-off-by: Al Viro <[email protected]>
---
drivers/scsi/sg.c | 4 +---
1 file changed, 1 insertion(+), 3 deletions(-)

diff --git a/drivers/scsi/sg.c b/drivers/scsi/sg.c
index 2d30e89075e9..3702f66493f7 100644
--- a/drivers/scsi/sg.c
+++ b/drivers/scsi/sg.c
@@ -717,8 +717,6 @@ sg_new_write(Sg_fd *sfp, struct file *file, const char __user *buf,

if (count < SZ_SG_IO_HDR)
return -EINVAL;
- if (!access_ok(buf, count))
- return -EFAULT; /* protects following copy_from_user()s + get_user()s */

sfp->cmd_q = 1; /* when sg_io_hdr seen, set command queuing on */
if (!(srp = sg_add_request(sfp))) {
@@ -728,7 +726,7 @@ sg_new_write(Sg_fd *sfp, struct file *file, const char __user *buf,
}
srp->sg_io_owned = sg_io_owned;
hp = &srp->header;
- if (__copy_from_user(hp, buf, SZ_SG_IO_HDR)) {
+ if (copy_from_user(hp, buf, SZ_SG_IO_HDR)) {
sg_remove_request(sfp, srp);
return -EFAULT;
}
--
2.11.0

2019-10-18 22:19:15

by Al Viro

[permalink] [raw]
Subject: [RFC PATCH 7/8] sg_write(): get rid of access_ok()/__copy_from_user()/__get_user()

From: Al Viro <[email protected]>

Just use plain copy_from_user() and get_user(). Note that while
a buf-derived pointer gets stored into ->dxferp, all places that
actually use the resulting value feed it either to import_iovec()
or to import_single_range(), and both will do validation.

Signed-off-by: Al Viro <[email protected]>
---
drivers/scsi/sg.c | 8 +++-----
1 file changed, 3 insertions(+), 5 deletions(-)

diff --git a/drivers/scsi/sg.c b/drivers/scsi/sg.c
index 9f6534a025cd..f3d090b93cdf 100644
--- a/drivers/scsi/sg.c
+++ b/drivers/scsi/sg.c
@@ -612,11 +612,9 @@ sg_write(struct file *filp, const char __user *buf, size_t count, loff_t * ppos)
scsi_block_when_processing_errors(sdp->device)))
return -ENXIO;

- if (!access_ok(buf, count))
- return -EFAULT; /* protects following copy_from_user()s + get_user()s */
if (count < SZ_SG_HEADER)
return -EIO;
- if (__copy_from_user(&old_hdr, buf, SZ_SG_HEADER))
+ if (copy_from_user(&old_hdr, buf, SZ_SG_HEADER))
return -EFAULT;
blocking = !(filp->f_flags & O_NONBLOCK);
if (old_hdr.reply_len < 0)
@@ -626,7 +624,7 @@ sg_write(struct file *filp, const char __user *buf, size_t count, loff_t * ppos)
return -EIO; /* The minimum scsi command length is 6 bytes. */

buf += SZ_SG_HEADER;
- if (__get_user(opcode, buf))
+ if (get_user(opcode, buf))
return -EFAULT;

if (!(srp = sg_add_request(sfp))) {
@@ -676,7 +674,7 @@ sg_write(struct file *filp, const char __user *buf, size_t count, loff_t * ppos)
hp->flags = input_size; /* structure abuse ... */
hp->pack_id = old_hdr.pack_id;
hp->usr_ptr = NULL;
- if (__copy_from_user(cmnd, buf, cmd_size))
+ if (copy_from_user(cmnd, buf, cmd_size))
return -EFAULT;
/*
* SG_DXFER_TO_FROM_DEV is functionally equivalent to SG_DXFER_FROM_DEV,
--
2.11.0

2019-10-18 22:21:05

by Douglas Gilbert

[permalink] [raw]
Subject: Re: [RFC][PATCHES] drivers/scsi/sg.c uaccess cleanups/fixes

On 2019-10-17 9:36 p.m., Al Viro wrote:
> On Wed, Oct 16, 2019 at 09:25:40PM +0100, Al Viro wrote:
>
>> FWIW, callers of __copy_from_user() remaining in the generic code:
>
>> 6) drivers/scsi/sg.c nest: sg_read() ones are memdup_user() in disguise
>> (i.e. fold with immediately preceding kmalloc()s). sg_new_write() -
>> fold with access_ok() into copy_from_user() (for both call sites).
>> sg_write() - lose access_ok(), use copy_from_user() (both call sites)
>> and get_user() (instead of the solitary __get_user() there).
>
> Turns out that there'd been outright redundant access_ok() calls (not
> even warranted by __copy_...) *and* several __put_user()/__get_user()
> with no checking of return value (access_ok() was there, handling of
> unmapped addresses wasn't). The latter go back at least to 2.1.early...
>
> I've got a series that presumably fixes and cleans the things up
> in that area; it didn't get any serious testing (the kernel builds
> and boots, smartctl works as well as it used to, but that's not
> worth much - all it says is that SG_IO doesn't fail terribly;
> I don't have any test setup for really working with /dev/sg*).
>
> IOW, it needs more review and testing - this is _not_ a pull request.
> It's in vfs.git#work.sg; individual patches are in followups.
> Shortlog/diffstat:
> Al Viro (8):
> sg_ioctl(): fix copyout handling
> sg_new_write(): replace access_ok() + __copy_from_user() with copy_from_user()
> sg_write(): __get_user() can fail...
> sg_read(): simplify reading ->pack_id of userland sg_io_hdr_t
> sg_new_write(): don't bother with access_ok
> sg_read(): get rid of access_ok()/__copy_..._user()
> sg_write(): get rid of access_ok()/__copy_from_user()/__get_user()
> SG_IO: get rid of access_ok()
>
> drivers/scsi/sg.c | 98 ++++++++++++++++++++++++++++++++----------------------------------------------------------------
> 1 file changed, 32 insertions(+), 66 deletions(-)

Al,
I am aware of these and have a 23-part patchset on the linux-scsi list
for review (see https://marc.info/?l=linux-scsi&m=157052102631490&w=2 )
that amongst other things fixes all of these. It also re-adds the
functionality removed from the bsg driver last year. Unfortunately that
review process is going very slowly, so I have no objections if you
apply these now.

It is unlikely that these changes will introduce any bugs (they didn't in
my testing). If you want to do more testing you may find the sg3_utils
package helpful, especially in the testing directory:
https://github.com/hreinecke/sg3_utils

Doug Gilbert


2019-10-18 22:24:16

by Al Viro

[permalink] [raw]
Subject: [RFC] csum_and_copy_from_user() semantics

On Wed, Oct 16, 2019 at 09:25:40PM +0100, Al Viro wrote:

> 2) default csum_partial_copy_from_user(). What we need to do is
> turn it into default csum_and_copy_from_user(). This
> #ifndef _HAVE_ARCH_COPY_AND_CSUM_FROM_USER
> static inline
> __wsum csum_and_copy_from_user (const void __user *src, void *dst,
> int len, __wsum sum, int *err_ptr)
> {
> if (access_ok(src, len))
> return csum_partial_copy_from_user(src, dst, len, sum, err_ptr);
>
> if (len)
> *err_ptr = -EFAULT;
>
> return sum;
> }
> #endif
> in checksum.h is the only thing that calls that sucker and we can bloody
> well combine them and make the users of lib/checksum.h define
> _HAVE_ARCH_COPY_AND_CSUM_FROM_USER. That puts us reasonably close
> to having _HAVE_ARCH_COPY_AND_CSUM_FROM_USER unconditional and in any
> case, __copy_from_user() in lib/checksum.h turns into copy_from_user().

Actually, that gets interesting. First of all, csum_partial_copy_from_user()
has almost no callers other than csum_and_copy_from_user() - the only
exceptions are alpha and itanic, where csum_partial_copy_nocheck() instances
are using it.

Everything else goes through csum_and_copy_from_user(). And _that_ has
only two callers - csum_and_copy_from_iter() and csum_and_copy_from_iter_full().
Both treat any failures as "discard the thing", for a good reason. Namely,
neither csum_and_copy_from_user() nor csum_partial_copy_from_user() have any
means to tell the caller *where* the fault happened. So anything
that calls them has to treat a fault as "nothing copied". That, of course,
goes both for data and csum.

Moreover, behaviour of instances on different architectures differs -
some zero the uncopied-over part of destination, some do not, some
just keep going treating every failed fetch as "got zero" (and returning
the error in the end).

We could, in theory, teach that thing to report the exact amount
copied, so that new users (when and if such appear) could make use
of that. However, it means a lot of unpleasant work on e.g. sparc.
For raw_copy_from_user() we had to do that, but here I don't see
the point.

As it is, it's only suitable for "discard if anything fails, treat
the entire destination area as garbage in such case" uses. Which is
all we have for it at the moment.

IOW, it might make sense to get rid of all the "memset the tail to
zero on failure" logics in there - it's not consistently done and
the callers have no way to make use of it anyway.

In any case, there's no point keeping csum_and_copy_from_user()
separate from csum_partial_copy_from_user(). As it is, the
only real difference is that the former does access_ok(), while
the latter might not (some instances do, in which case there's
no difference at all).

Questions from reviewing the instances:
* mips csum_and_partial_copy_from_user() tries to check
if we are under KERNEL_DS, in which case it goes for kernel-to-kernel
copy. That's pointless - the callers are reading from an
iovec-backed iov_iter, which can't be created under KERNEL_DS.
So we would have to have set iovec-backed iov_iter while under
USER_DS, then do set_fs(KERNEL_DS), then pass that iov_iter to
->sendmsg(). Which doesn't happen. IOW, the calls of
__csum_partial_copy_kernel() never happen - neither for
csum_and_copy_from_kernel() nor for csum_and_copy_to_kernel().

* ppc does something odd:
csum = csum_partial_copy_generic((void __force *)src, dst,
len, sum, err_ptr, NULL);

if (unlikely(*err_ptr)) {
int missing = __copy_from_user(dst, src, len);

if (missing) {
memset(dst + len - missing, 0, missing);
*err_ptr = -EFAULT;
} else {
*err_ptr = 0;
}

csum = csum_partial(dst, len, sum);
}
and since that happens under their stac equivalent, we get it nested -
__copy_from_user() takes and drops it. I would've said "don't bother
trying to be smart on failures", if I'd been certain that it's not
a fallback for e.g. csum_and_partial_copy_from_user() in misaligned
case. Could ppc folks clarify that?
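The one-pass "checksum while copying" idea being discussed can be modeled in userspace C. This is a deliberately simplified sketch: real implementations use 32/64-bit loads, handle odd lengths and alignment, and are usually per-arch assembly; here even `len` is assumed, big-endian 16-bit words are summed, and a NULL source models the "fault, discard everything" behavior (the error code carries no fault location, which is exactly the limitation Al describes).

```c
#include <stddef.h>
#include <stdint.h>

/* Fold a 32-bit accumulator down to 16 bits, one's-complement style. */
static uint16_t csum_fold16(uint32_t sum)
{
    while (sum >> 16)
        sum = (sum & 0xffff) + (sum >> 16);
    return (uint16_t)sum;
}

/* Checksum the data while copying it, in one pass over the source -
 * the point of csum_and_copy_from_user(). Even `len` assumed for
 * brevity; on a (modeled) fault *err is set but no location is
 * reported, so callers must discard the whole destination. */
static uint32_t csum_and_copy(const uint8_t *src, uint8_t *dst,
                              size_t len, uint32_t sum, int *err)
{
    size_t i;

    if (!src) {                 /* the "discard everything" case */
        if (len)
            *err = -14;         /* -EFAULT */
        return sum;
    }
    for (i = 0; i + 1 < len; i += 2) {
        dst[i] = src[i];
        dst[i + 1] = src[i + 1];
        sum += (uint32_t)((src[i] << 8) | src[i + 1]);
    }
    return sum;
}
```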

2019-10-25 20:31:06

by Thomas Gleixner

[permalink] [raw]
Subject: Re: [PATCH] Convert filldir[64]() from __put_user() to unsafe_put_user()

Al,

On Sun, 13 Oct 2019, Al Viro wrote:
> On Sun, Oct 13, 2019 at 11:43:57AM -0700, Linus Torvalds wrote:
> > > And these (32bit and 64bit restore_sigcontext() and do_sys_vm86())
> > > are the only get_user_ex() users anywhere...
> >
> > Yeah, that sounds like a solid strategy for getting rid of them.
> >
> > Particularly since we can't really make get_user_ex() generate
> > particularly good code (at least for now).
> >
> > Now, put_user_ex() is a different thing - converting it to
> > unsafe_put_user() actually does make it generate very good code - much
> > better than copying data twice.
>
> No arguments re put_user_ex side of things... Below is a completely
> untested patch for get_user_ex elimination (it seems to build, but that's
> it); in any case, I would really like to see comments from x86 folks
> before it goes anywhere.

I'm fine with the approach, but I'd like to see the macro mess gone as
well. Reworked patch below.

Can you please split that up into several patches (signal, ia32/signal,
vm86 and removal) ?

Thanks,

tglx

8<------------
diff --git a/arch/x86/ia32/ia32_signal.c b/arch/x86/ia32/ia32_signal.c
index 1cee10091b9f..00bf8ac1d42a 100644
--- a/arch/x86/ia32/ia32_signal.c
+++ b/arch/x86/ia32/ia32_signal.c
@@ -35,70 +35,57 @@
#include <asm/sighandling.h>
#include <asm/smap.h>

+static inline void reload_segments(struct sigcontext_32 *sc)
+{
+ unsigned int cur;
+
+ savesegment(gs, cur);
+ if ((sc->gs | 0x03) != cur)
+ load_gs_index(sc->gs | 0x03);
+ savesegment(fs, cur);
+ if ((sc->fs | 0x03) != cur)
+ loadsegment(fs, sc->fs | 0x03);
+ savesegment(ds, cur);
+ if ((sc->ds | 0x03) != cur)
+ loadsegment(ds, sc->ds | 0x03);
+ savesegment(es, cur);
+ if ((sc->es | 0x03) != cur)
+ loadsegment(es, sc->es | 0x03);
+}
+
/*
* Do a signal return; undo the signal stack.
*/
-#define loadsegment_gs(v) load_gs_index(v)
-#define loadsegment_fs(v) loadsegment(fs, v)
-#define loadsegment_ds(v) loadsegment(ds, v)
-#define loadsegment_es(v) loadsegment(es, v)
-
-#define get_user_seg(seg) ({ unsigned int v; savesegment(seg, v); v; })
-#define set_user_seg(seg, v) loadsegment_##seg(v)
-
-#define COPY(x) { \
- get_user_ex(regs->x, &sc->x); \
-}
-
-#define GET_SEG(seg) ({ \
- unsigned short tmp; \
- get_user_ex(tmp, &sc->seg); \
- tmp; \
-})
-
-#define COPY_SEG_CPL3(seg) do { \
- regs->seg = GET_SEG(seg) | 3; \
-} while (0)
-
-#define RELOAD_SEG(seg) { \
- unsigned int pre = (seg) | 3; \
- unsigned int cur = get_user_seg(seg); \
- if (pre != cur) \
- set_user_seg(seg, pre); \
-}
-
static int ia32_restore_sigcontext(struct pt_regs *regs,
- struct sigcontext_32 __user *sc)
+ struct sigcontext_32 __user *usc)
{
- unsigned int tmpflags, err = 0;
- u16 gs, fs, es, ds;
- void __user *buf;
- u32 tmp;
+ struct sigcontext_32 sc;
+ int ret = -EFAULT;

/* Always make any pending restarted system calls return -EINTR */
current->restart_block.fn = do_no_restart_syscall;

- get_user_try {
- gs = GET_SEG(gs);
- fs = GET_SEG(fs);
- ds = GET_SEG(ds);
- es = GET_SEG(es);
+ if (unlikely(__copy_from_user(&sc, usc, sizeof(sc))))
+ goto out;

- COPY(di); COPY(si); COPY(bp); COPY(sp); COPY(bx);
- COPY(dx); COPY(cx); COPY(ip); COPY(ax);
- /* Don't touch extended registers */
+ /* Get only the ia32 registers. */
+ regs->bx = sc.bx;
+ regs->cx = sc.cx;
+ regs->dx = sc.dx;
+ regs->si = sc.si;
+ regs->di = sc.di;
+ regs->bp = sc.bp;
+ regs->ax = sc.ax;
+ regs->sp = sc.sp;
+ regs->ip = sc.ip;

- COPY_SEG_CPL3(cs);
- COPY_SEG_CPL3(ss);
+ /* Get CS/SS and force CPL3 */
+ regs->cs = sc.cs | 0x03;
+ regs->ss = sc.ss | 0x03;

- get_user_ex(tmpflags, &sc->flags);
- regs->flags = (regs->flags & ~FIX_EFLAGS) | (tmpflags & FIX_EFLAGS);
- /* disable syscall checks */
- regs->orig_ax = -1;
-
- get_user_ex(tmp, &sc->fpstate);
- buf = compat_ptr(tmp);
- } get_user_catch(err);
+ regs->flags = (regs->flags & ~FIX_EFLAGS) | (sc.flags & FIX_EFLAGS);
+ /* disable syscall checks */
+ regs->orig_ax = -1;

/*
* Reload fs and gs if they have changed in the signal
@@ -106,16 +93,12 @@ static int ia32_restore_sigcontext(struct pt_regs *regs,
* the handler, but does not clobber them at least in the
* normal case.
*/
- RELOAD_SEG(gs);
- RELOAD_SEG(fs);
- RELOAD_SEG(ds);
- RELOAD_SEG(es);
-
- err |= fpu__restore_sig(buf, 1);
+ reload_segments(&sc);

+ ret = fpu__restore_sig(compat_ptr(sc.fpstate), 1);
+out:
force_iret();
-
- return err;
+ return ret;
}

asmlinkage long sys32_sigreturn(void)
@@ -176,6 +159,8 @@ asmlinkage long sys32_rt_sigreturn(void)
* Set up a signal frame.
*/

+#define get_user_seg(seg) ({ unsigned int v; savesegment(seg, v); v; })
+
static int ia32_setup_sigcontext(struct sigcontext_32 __user *sc,
void __user *fpstate,
struct pt_regs *regs, unsigned int mask)
diff --git a/arch/x86/include/asm/uaccess.h b/arch/x86/include/asm/uaccess.h
index 61d93f062a36..ac81f06f8358 100644
--- a/arch/x86/include/asm/uaccess.h
+++ b/arch/x86/include/asm/uaccess.h
@@ -335,12 +335,9 @@ do { \
"i" (errret), "0" (retval)); \
})

-#define __get_user_asm_ex_u64(x, ptr) (x) = __get_user_bad()
#else
#define __get_user_asm_u64(x, ptr, retval, errret) \
__get_user_asm(x, ptr, retval, "q", "", "=r", errret)
-#define __get_user_asm_ex_u64(x, ptr) \
- __get_user_asm_ex(x, ptr, "q", "", "=r")
#endif

#define __get_user_size(x, ptr, size, retval, errret) \
@@ -390,41 +387,6 @@ do { \
: "=r" (err), ltype(x) \
: "m" (__m(addr)), "i" (errret), "0" (err))

-/*
- * This doesn't do __uaccess_begin/end - the exception handling
- * around it must do that.
- */
-#define __get_user_size_ex(x, ptr, size) \
-do { \
- __chk_user_ptr(ptr); \
- switch (size) { \
- case 1: \
- __get_user_asm_ex(x, ptr, "b", "b", "=q"); \
- break; \
- case 2: \
- __get_user_asm_ex(x, ptr, "w", "w", "=r"); \
- break; \
- case 4: \
- __get_user_asm_ex(x, ptr, "l", "k", "=r"); \
- break; \
- case 8: \
- __get_user_asm_ex_u64(x, ptr); \
- break; \
- default: \
- (x) = __get_user_bad(); \
- } \
-} while (0)
-
-#define __get_user_asm_ex(x, addr, itype, rtype, ltype) \
- asm volatile("1: mov"itype" %1,%"rtype"0\n" \
- "2:\n" \
- ".section .fixup,\"ax\"\n" \
- "3:xor"itype" %"rtype"0,%"rtype"0\n" \
- " jmp 2b\n" \
- ".previous\n" \
- _ASM_EXTABLE_EX(1b, 3b) \
- : ltype(x) : "m" (__m(addr)))
-
#define __put_user_nocheck(x, ptr, size) \
({ \
__label__ __pu_label; \
@@ -552,22 +514,6 @@ struct __large_struct { unsigned long buf[100]; };
#define __put_user(x, ptr) \
__put_user_nocheck((__typeof__(*(ptr)))(x), (ptr), sizeof(*(ptr)))

-/*
- * {get|put}_user_try and catch
- *
- * get_user_try {
- * get_user_ex(...);
- * } get_user_catch(err)
- */
-#define get_user_try uaccess_try_nospec
-#define get_user_catch(err) uaccess_catch(err)
-
-#define get_user_ex(x, ptr) do { \
- unsigned long __gue_val; \
- __get_user_size_ex((__gue_val), (ptr), (sizeof(*(ptr)))); \
- (x) = (__force __typeof__(*(ptr)))__gue_val; \
-} while (0)
-
#define put_user_try uaccess_try
#define put_user_catch(err) uaccess_catch(err)

diff --git a/arch/x86/kernel/signal.c b/arch/x86/kernel/signal.c
index 8eb7193e158d..c5c24cee3868 100644
--- a/arch/x86/kernel/signal.c
+++ b/arch/x86/kernel/signal.c
@@ -47,24 +47,6 @@
#include <asm/sigframe.h>
#include <asm/signal.h>

-#define COPY(x) do { \
- get_user_ex(regs->x, &sc->x); \
-} while (0)
-
-#define GET_SEG(seg) ({ \
- unsigned short tmp; \
- get_user_ex(tmp, &sc->seg); \
- tmp; \
-})
-
-#define COPY_SEG(seg) do { \
- regs->seg = GET_SEG(seg); \
-} while (0)
-
-#define COPY_SEG_CPL3(seg) do { \
- regs->seg = GET_SEG(seg) | 3; \
-} while (0)
-
#ifdef CONFIG_X86_64
/*
* If regs->ss will cause an IRET fault, change it. Otherwise leave it
@@ -92,53 +74,59 @@ static void force_valid_ss(struct pt_regs *regs)
ar != (AR_DPL3 | AR_S | AR_P | AR_TYPE_RWDATA_EXPDOWN))
regs->ss = __USER_DS;
}
+# define CONTEXT_COPY_SIZE offsetof(struct sigcontext, reserved1)
+#else
+# define CONTEXT_COPY_SIZE sizeof(struct sigcontext)
#endif

static int restore_sigcontext(struct pt_regs *regs,
- struct sigcontext __user *sc,
+ struct sigcontext __user *usc,
unsigned long uc_flags)
{
- unsigned long buf_val;
- void __user *buf;
- unsigned int tmpflags;
- unsigned int err = 0;
+ struct sigcontext sc;
+ int ret = -EFAULT;

/* Always make any pending restarted system calls return -EINTR */
current->restart_block.fn = do_no_restart_syscall;

- get_user_try {
+ if (unlikely(__copy_from_user(&sc, usc, CONTEXT_COPY_SIZE)))
+ goto out;

#ifdef CONFIG_X86_32
- set_user_gs(regs, GET_SEG(gs));
- COPY_SEG(fs);
- COPY_SEG(es);
- COPY_SEG(ds);
+ set_user_gs(regs, sc.gs);
+ regs->fs = sc.fs;
+ regs->es = sc.es;
+ regs->ds = sc.ds;
#endif /* CONFIG_X86_32 */

- COPY(di); COPY(si); COPY(bp); COPY(sp); COPY(bx);
- COPY(dx); COPY(cx); COPY(ip); COPY(ax);
+ regs->bx = sc.bx;
+ regs->cx = sc.cx;
+ regs->dx = sc.dx;
+ regs->si = sc.si;
+ regs->di = sc.di;
+ regs->bp = sc.bp;
+ regs->ax = sc.ax;
+ regs->sp = sc.sp;
+ regs->ip = sc.ip;

#ifdef CONFIG_X86_64
- COPY(r8);
- COPY(r9);
- COPY(r10);
- COPY(r11);
- COPY(r12);
- COPY(r13);
- COPY(r14);
- COPY(r15);
+ regs->r8 = sc.r8;
+ regs->r9 = sc.r9;
+ regs->r10 = sc.r10;
+ regs->r11 = sc.r11;
+ regs->r12 = sc.r12;
+ regs->r13 = sc.r13;
+ regs->r14 = sc.r14;
+ regs->r15 = sc.r15;
#endif /* CONFIG_X86_64 */

- COPY_SEG_CPL3(cs);
- COPY_SEG_CPL3(ss);
+ /* Get CS/SS and force CPL3 */
+ regs->cs = sc.cs | 0x03;
+ regs->ss = sc.ss | 0x03;

- get_user_ex(tmpflags, &sc->flags);
- regs->flags = (regs->flags & ~FIX_EFLAGS) | (tmpflags & FIX_EFLAGS);
- regs->orig_ax = -1; /* disable syscall checks */
-
- get_user_ex(buf_val, &sc->fpstate);
- buf = (void __user *)buf_val;
- } get_user_catch(err);
+ regs->flags = (regs->flags & ~FIX_EFLAGS) | (sc.flags & FIX_EFLAGS);
+ /* disable syscall checks */
+ regs->orig_ax = -1;

#ifdef CONFIG_X86_64
/*
@@ -149,11 +137,11 @@ static int restore_sigcontext(struct pt_regs *regs,
force_valid_ss(regs);
#endif

- err |= fpu__restore_sig(buf, IS_ENABLED(CONFIG_X86_32));
-
+ ret = fpu__restore_sig((void __user *)sc.fpstate,
+ IS_ENABLED(CONFIG_X86_32));
+out:
force_iret();
-
- return err;
+ return ret;
}

int setup_sigcontext(struct sigcontext __user *sc, void __user *fpstate,
diff --git a/arch/x86/kernel/vm86_32.c b/arch/x86/kernel/vm86_32.c
index a76c12b38e92..11ef9d3c5387 100644
--- a/arch/x86/kernel/vm86_32.c
+++ b/arch/x86/kernel/vm86_32.c
@@ -243,6 +243,7 @@ static long do_sys_vm86(struct vm86plus_struct __user *user_vm86, bool plus)
struct kernel_vm86_regs vm86regs;
struct pt_regs *regs = current_pt_regs();
unsigned long err = 0;
+ struct vm86_struct v;

err = security_mmap_addr(0);
if (err) {
@@ -283,34 +284,32 @@ static long do_sys_vm86(struct vm86plus_struct __user *user_vm86, bool plus)
sizeof(struct vm86plus_struct)))
return -EFAULT;

+ if (unlikely(__copy_from_user(&v, user_vm86,
+ offsetof(struct vm86_struct, int_revectored))))
+ return -EFAULT;
+
memset(&vm86regs, 0, sizeof(vm86regs));
- get_user_try {
- unsigned short seg;
- get_user_ex(vm86regs.pt.bx, &user_vm86->regs.ebx);
- get_user_ex(vm86regs.pt.cx, &user_vm86->regs.ecx);
- get_user_ex(vm86regs.pt.dx, &user_vm86->regs.edx);
- get_user_ex(vm86regs.pt.si, &user_vm86->regs.esi);
- get_user_ex(vm86regs.pt.di, &user_vm86->regs.edi);
- get_user_ex(vm86regs.pt.bp, &user_vm86->regs.ebp);
- get_user_ex(vm86regs.pt.ax, &user_vm86->regs.eax);
- get_user_ex(vm86regs.pt.ip, &user_vm86->regs.eip);
- get_user_ex(seg, &user_vm86->regs.cs);
- vm86regs.pt.cs = seg;
- get_user_ex(vm86regs.pt.flags, &user_vm86->regs.eflags);
- get_user_ex(vm86regs.pt.sp, &user_vm86->regs.esp);
- get_user_ex(seg, &user_vm86->regs.ss);
- vm86regs.pt.ss = seg;
- get_user_ex(vm86regs.es, &user_vm86->regs.es);
- get_user_ex(vm86regs.ds, &user_vm86->regs.ds);
- get_user_ex(vm86regs.fs, &user_vm86->regs.fs);
- get_user_ex(vm86regs.gs, &user_vm86->regs.gs);
-
- get_user_ex(vm86->flags, &user_vm86->flags);
- get_user_ex(vm86->screen_bitmap, &user_vm86->screen_bitmap);
- get_user_ex(vm86->cpu_type, &user_vm86->cpu_type);
- } get_user_catch(err);
- if (err)
- return err;
+
+ vm86regs.pt.bx = v.regs.ebx;
+ vm86regs.pt.cx = v.regs.ecx;
+ vm86regs.pt.dx = v.regs.edx;
+ vm86regs.pt.si = v.regs.esi;
+ vm86regs.pt.di = v.regs.edi;
+ vm86regs.pt.bp = v.regs.ebp;
+ vm86regs.pt.ax = v.regs.eax;
+ vm86regs.pt.ip = v.regs.eip;
+ vm86regs.pt.cs = v.regs.cs;
+ vm86regs.pt.flags = v.regs.eflags;
+ vm86regs.pt.sp = v.regs.esp;
+ vm86regs.pt.ss = v.regs.ss;
+ vm86regs.es = v.regs.es;
+ vm86regs.ds = v.regs.ds;
+ vm86regs.fs = v.regs.fs;
+ vm86regs.gs = v.regs.gs;
+
+ vm86->flags = v.flags;
+ vm86->screen_bitmap = v.screen_bitmap;
+ vm86->cpu_type = v.cpu_type;

if (copy_from_user(&vm86->int_revectored,
&user_vm86->int_revectored,
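The pattern this patch adopts throughout — one bulk `__copy_from_user()` into a kernel-local struct, then plain assignments, instead of many per-field `get_user_ex()` calls inside a try/catch region — can be sketched in plain userspace C. Everything here is a stand-in (`mock_copy_from_user`, the simplified struct layouts, and the `FIX_EFLAGS_SIM` mask are illustrative, not the kernel's actual definitions); the point is the shape of the code, not the exact fields:

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Simplified stand-ins for the kernel structures involved. */
struct sigcontext_sim { unsigned long cs, ss, ip, sp, flags; };
struct pt_regs_sim    { unsigned long cs, ss, ip, sp, flags; };

/* Placeholder flags mask, not the kernel's FIX_EFLAGS value. */
#define FIX_EFLAGS_SIM 0x0dd1UL

/* Userspace stand-in for __copy_from_user(): returns 0 on success. */
static unsigned long mock_copy_from_user(void *dst, const void *src, size_t n)
{
	memcpy(dst, src, n);
	return 0;
}

/*
 * One bulk copy into a local struct, then ordinary assignments.
 * User memory is touched exactly once, so there is a single fault
 * point and no need for per-field exception handling (and no long
 * stretch of code running with SMAP/PAN protections disabled).
 */
static int restore_sigcontext_sim(struct pt_regs_sim *regs,
				  const struct sigcontext_sim *usc)
{
	struct sigcontext_sim sc;

	if (mock_copy_from_user(&sc, usc, sizeof(sc)))
		return -14; /* -EFAULT */

	regs->ip = sc.ip;
	regs->sp = sc.sp;
	/* Force CPL3 so a forged context cannot claim kernel privilege. */
	regs->cs = sc.cs | 0x03;
	regs->ss = sc.ss | 0x03;
	/* Only let user-controllable flag bits through the mask. */
	regs->flags = (regs->flags & ~FIX_EFLAGS_SIM)
		    | (sc.flags & FIX_EFLAGS_SIM);
	return 0;
}
```

The validation (`| 0x03` on the segment selectors, the flags mask) happens on the kernel-side copy after the one user access, which is what lets the per-field `get_user_try`/`get_user_catch` machinery go away.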

2019-11-05 04:57:57

by Martin K. Petersen

Subject: Re: [RFC][PATCHES] drivers/scsi/sg.c uaccess cleanups/fixes


Hi Al!

> I've got a series that presumably fixes and cleans the things up
> in that area; it didn't get any serious testing (the kernel builds
> and boots, smartctl works as well as it used to, but that's not
> worth much - all it says is that SG_IO doesn't fail terribly;
> I don't have any test setup for really working with /dev/sg*).

I tested this last week without noticing any problems.

What's your plan for this series? Want me to queue it up for 5.5?

--
Martin K. Petersen Oracle Linux Engineering

2019-11-05 05:27:04

by Al Viro

Subject: Re: [RFC][PATCHES] drivers/scsi/sg.c uaccess cleanups/fixes

On Mon, Nov 04, 2019 at 11:54:20PM -0500, Martin K. Petersen wrote:
>
> Hi Al!
>
> > I've got a series that presumably fixes and cleans the things up
> > in that area; it didn't get any serious testing (the kernel builds
> > and boots, smartctl works as well as it used to, but that's not
> > worth much - all it says is that SG_IO doesn't fail terribly;
> > I don't have any test setup for really working with /dev/sg*).
>
> I tested this last week without noticing any problems.
>
> What's your plan for this series? Want me to queue it up for 5.5?

I can put it into vfs.git into a never-rebased branch or you could put it
into scsi tree - up to you...

2019-11-06 04:30:54

by Martin K. Petersen

Subject: Re: [RFC][PATCHES] drivers/scsi/sg.c uaccess cleanups/fixes


Al,

>> What's your plan for this series? Want me to queue it up for 5.5?
>
> I can put it into vfs.git into a never-rebased branch or you could put
> it into scsi tree - up to you...

Applied to 5.5/scsi-queue with Doug's Acked-by. Thanks!

--
Martin K. Petersen Oracle Linux Engineering