LinuxLists.cc - perf_fuzzer causes reboot

2014-02-21 20:23:41

Subject: perf_fuzzer causes reboot

So I'm not sure who exactly to report this to. Some perf people CC'd as
I trigger it while using the perf_fuzzer.

This is with 3.14-rc3 on a core2 machine, although I've had the reboots
happen throughout at least 3.14-rc*

I'm having a hard time coming up with a reproducible test case. Using the
random seed that caused the below will cause the perf_fuzzer to segfault
but not reboot.

The log isn't very helpful, it reboots so fast that the oops doesn't
finish printing and the serial log just moves to the bootloader...

[ 4466.804123] BUG: unable to handle kernel NULL pointer dereference at 0000000000000050
[ 4466.808014] IP: [<ffffffff81111783>] cache_reap+0x5e/0x1c5
[ 4466.808014] PGD 0
[ 4466.808014] Oops: 0000 [#1] GNU GRUB version 2.00-17

Vince

2014-02-21 22:11:50

by Vince Weaver

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

cc'ing x32 people

On Fri, 21 Feb 2014, Vince Weaver wrote:

> So I'm not sure who exactly to report this to. Some perf people CC'd as
> I trigger it while using the perf_fuzzer.
>
> This is with 3.14-rc3 on a core2 machine, although I've had the reboots
> happen throughout at least 3.14-rc*
>
> I'm having a hard time coming up with a reproducible test case. Using the
> random seed that caused the below will cause the perf_fuzzer to segfault
> but not reboot.
>
> The log isn't very helpful, it reboots so fast that the oops doesn't
> finish printing and the serial log just moves to the bootloader...
>
> [ 4466.804123] BUG: unable to handle kernel NULL pointer dereference at 0000000000000050
> [ 4466.808014] IP: [<ffffffff81111783>] cache_reap+0x5e/0x1c5
> [ 4466.808014] PGD 0
> [ 4466.808014] Oops: 0000 [#1] GNU GRUB version 2.00-17

Maybe related, this is on an x32-compiled binary.

When trying to reproduce the perf_fuzzer myseriously segfaults on what
appears to be perfectly valid mmap'd perf ring-buffers.

(running under gdb)

Program received signal SIGSEGV, Segmentation fault.
0x0041efbb in __memset_sse2 ()

=> 0x0041efbb <+2203>: movdqa %xmm0,(%rdi)

rdi 0xf7f61000 4160098304

f7f61000-f7f72000 rw-s 00000000 00:08 4475 anon_inode:[perf_event]

So I'm not sure if somehow something is wrong with the page mapping, that
makes a valid write fail and sometimes (possibly due to address space
randomization) reboot the system?

Vince

2014-02-21 22:32:13

by Vince Weaver

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On Fri, 21 Feb 2014, Vince Weaver wrote:

> On Fri, 21 Feb 2014, Vince Weaver wrote:
>
> > So I'm not sure who exactly to report this to. Some perf people CC'd as
> > I trigger it while using the perf_fuzzer.
> >
> > This is with 3.14-rc3 on a core2 machine, although I've had the reboots
> > happen throughout at least 3.14-rc*
> >
> > I'm having a hard time coming up with a reproducible test case. Using the
> > random seed that caused the below will cause the perf_fuzzer to segfault
> > but not reboot.
> >
> > The log isn't very helpful, it reboots so fast that the oops doesn't
> > finish printing and the serial log just moves to the bootloader...
> >
> > [ 4466.804123] BUG: unable to handle kernel NULL pointer dereference at 0000000000000050
> > [ 4466.808014] IP: [<ffffffff81111783>] cache_reap+0x5e/0x1c5
> > [ 4466.808014] PGD 0
> > [ 4466.808014] Oops: 0000 [#1] GNU GRUB version 2.00-17
>
> Maybe related, this is on an x32-compiled binary.
>
> When trying to reproduce the perf_fuzzer myseriously segfaults on what
> appears to be perfectly valid mmap'd perf ring-buffers.
>
> (running under gdb)
>
> Program received signal SIGSEGV, Segmentation fault.
> 0x0041efbb in __memset_sse2 ()
>
> => 0x0041efbb <+2203>: movdqa %xmm0,(%rdi)
>
> rdi 0xf7f61000 4160098304
>
> f7f61000-f7f72000 rw-s 00000000 00:08 4475 anon_inode:[perf_event]
>
> So I'm not sure if somehow something is wrong with the page mapping, that
> makes a valid write fail and sometimes (possibly due to address space
> randomization) reboot the system?

also strange, when I look at the core dumps it always shows the bad memory
address happening at the beginning of an mmap page as expected for the
ip listed, but the segfault listed in the kernel happens at some
completely unrelated address that isn't even page aligned and shouldn't be
possible based on the gdb/coredump results?

[ 1560.313863] perf_fuzzer[2826]: segfault at 503283ff ip 000000000041efbb
sp 00000000ffd367d8 error 6 in perf_fuzzer[400000+d1000]
[ 1704.673245] perf_fuzzer[2835]: segfault at 503283ff ip 000000000041efbb
sp 00000000ff972be8 error 6 in perf_fuzzer[400000+d1000]
[ 2978.101276] perf_fuzzer[2841]: segfault at 503283ff ip 000000000041efbb
sp 00000000ff92ba68 error 6 in perf_fuzzer[400000+d1000]
[ 4907.185366] perf_fuzzer[2868]: segfault at 503283ff ip 000000000041efbb
sp 00000000ffadcd28 error 6 in perf_fuzzer[400000+d1000]
[ 9570.793746] perf_fuzzer[6183]: segfault at 4d0bf28e ip 000000000041efbb
sp 00000000ff83f688 error 6 in perf_fuzzer[400000+d1000]
[ 9743.888431] perf_fuzzer[6187]: segfault at 91734d5 ip 000000000041efbb
sp 00000000ffb4e288 error 6 in perf_fuzzer[400000+d1000]

Vince

2014-02-22 04:48:42

by Vince Weaver

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

So I changed the perf_fuzzer so when it randomly stomps all over the
perf_event_mmap_page, it uses a constant value of 0xdeadbeef rather
than a random value.

The result is below. The segfaults make a bit more sense now, it
almost looks like what is happening is we are corrupting an address
value somehow (head? tail?) and the kernel then uses the corrupt address
and writes to memory outside of the mmap ring buffer.

I still haven't figured out how to trigger this exactly, but you can
see when over-written with 0xdeadbeef the memory address written to is
consistently some small multiple of 0x120.

I imagine it would be a bad thing if it turned out to be possible to
select what memory address got written to. Although since I've
only reproduced this on x32 maybe it won't be possible to over-write
the kernel; but I have seen this bug cause a reboot when the
wrong thing got over-written.

[28002.850192] perf_fuzzer[7083]: segfault at 2be0 ip 000000000041efab sp 00000000ff826748 error 6 in perf_fuzzer[400000+d1000]
[28639.769869] perf_fuzzer[7100]: segfault at 1320 ip 000000000041efab sp 00000000ffa65038 error 6 in perf_fuzzer[400000+d1000]
[29396.986242] perf_fuzzer[7120]: segfault at 10e0 ip 000000000041efab sp 00000000ffd48e68 error 6 in perf_fuzzer[400000+d1000]
[29738.892931] perf_fuzzer[7128]: segfault at 18c0 ip 000000000041efab sp 00000000ffcdcd88 error 6 in perf_fuzzer[400000+d1000]
[29815.550210] perf_fuzzer[7132]: segfault at 120 ip 000000000041efab sp 00000000ffe673b8 error 6 in perf_fuzzer[400000+d1000]
[30173.455348] perf_fuzzer[7141]: segfault at 120 ip 000000000041efab sp 00000000ffda1948 error 6 in perf_fuzzer[400000+d1000]
[30570.625642] perf_fuzzer[7156]: segfault at 1680 ip 000000000041efab sp 00000000ffaad028 error 6 in perf_fuzzer[400000+d1000]
[31047.887784] perf_fuzzer[7169]: segfault at 60c0 ip 000000000041efab sp 00000000ffaa86e8 error 6 in perf_fuzzer[400000+d1000]
[31300.168714] perf_fuzzer[7175]: segfault at 3a80 ip 000000000041efab sp 00000000ffd83228 error 6 in perf_fuzzer[400000+d1000]
[31984.727278] perf_fuzzer[7193]: segfault at 7e0 ip 000000000041efab sp 00000000ff9db1f8 error 6 in perf_fuzzer[400000+d1000]

Vince

2014-02-22 05:05:02

by H. Peter Anvin

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Those are segfaults in user space, though?

On February 21, 2014 8:50:38 PM PST, Vince Weaver <[email protected]> wrote:
>
>So I changed the perf_fuzzer so when it randomly stomps all over the
>perf_event_mmap_page, it uses a constant value of 0xdeadbeef rather
>than a random value.
>
>The result is below. The segfaults make a bit more sense now, it
>almost looks like what is happening is we are corrupting an address
>value somehow (head? tail?) and the kernel then uses the corrupt
>address
>and writes to memory outside of the mmap ring buffer.
>
>I still haven't figured out how to trigger this exactly, but you can
>see when over-written with 0xdeadbeef the memory address written to is
>consistently some small multiple of 0x120.
>
>I imagine it would be a bad thing if it turned out to be possible to
>select what memory address got written to. Although since I've
>only reproduced this on x32 maybe it won't be possible to over-write
>the kernel; but I have seen this bug cause a reboot when the
>wrong thing got over-written.
>
>[28002.850192] perf_fuzzer[7083]: segfault at 2be0 ip 000000000041efab
>sp 00000000ff826748 error 6 in perf_fuzzer[400000+d1000]
>[28639.769869] perf_fuzzer[7100]: segfault at 1320 ip 000000000041efab
>sp 00000000ffa65038 error 6 in perf_fuzzer[400000+d1000]
>[29396.986242] perf_fuzzer[7120]: segfault at 10e0 ip 000000000041efab
>sp 00000000ffd48e68 error 6 in perf_fuzzer[400000+d1000]
>[29738.892931] perf_fuzzer[7128]: segfault at 18c0 ip 000000000041efab
>sp 00000000ffcdcd88 error 6 in perf_fuzzer[400000+d1000]
>[29815.550210] perf_fuzzer[7132]: segfault at 120 ip 000000000041efab
>sp 00000000ffe673b8 error 6 in perf_fuzzer[400000+d1000]
>[30173.455348] perf_fuzzer[7141]: segfault at 120 ip 000000000041efab
>sp 00000000ffda1948 error 6 in perf_fuzzer[400000+d1000]
>[30570.625642] perf_fuzzer[7156]: segfault at 1680 ip 000000000041efab
>sp 00000000ffaad028 error 6 in perf_fuzzer[400000+d1000]
>[31047.887784] perf_fuzzer[7169]: segfault at 60c0 ip 000000000041efab
>sp 00000000ffaa86e8 error 6 in perf_fuzzer[400000+d1000]
>[31300.168714] perf_fuzzer[7175]: segfault at 3a80 ip 000000000041efab
>sp 00000000ffd83228 error 6 in perf_fuzzer[400000+d1000]
>[31984.727278] perf_fuzzer[7193]: segfault at 7e0 ip 000000000041efab
>sp 00000000ff9db1f8 error 6 in perf_fuzzer[400000+d1000]
>
>Vince

--
Sent from my mobile phone. Please pardon brevity and lack of formatting.

2014-02-22 06:27:17

by H. Peter Anvin

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On 02/21/2014 08:50 PM, Vince Weaver wrote:
>
> So I changed the perf_fuzzer so when it randomly stomps all over the
> perf_event_mmap_page, it uses a constant value of 0xdeadbeef rather
> than a random value.
>
> The result is below. The segfaults make a bit more sense now, it
> almost looks like what is happening is we are corrupting an address
> value somehow (head? tail?) and the kernel then uses the corrupt address
> and writes to memory outside of the mmap ring buffer.
>

That seems unlikely:

handle->page = (offset >> page_shift) & (rb->nr_pages - 1);
offset &= (1UL << page_shift) - 1;

The masking to the number of pages should make that not possible, even
if a completely bogus value is written.

> I still haven't figured out how to trigger this exactly, but you can
> see when over-written with 0xdeadbeef the memory address written to is
> consistently some small multiple of 0x120.
>
> I imagine it would be a bad thing if it turned out to be possible to
> select what memory address got written to. Although since I've
> only reproduced this on x32 maybe it won't be possible to over-write
> the kernel; but I have seen this bug cause a reboot when the
> wrong thing got over-written.
>
> [28002.850192] perf_fuzzer[7083]: segfault at 2be0 ip 000000000041efab sp 00000000ff826748 error 6 in perf_fuzzer[400000+d1000]
> [28639.769869] perf_fuzzer[7100]: segfault at 1320 ip 000000000041efab sp 00000000ffa65038 error 6 in perf_fuzzer[400000+d1000]
> [29396.986242] perf_fuzzer[7120]: segfault at 10e0 ip 000000000041efab sp 00000000ffd48e68 error 6 in perf_fuzzer[400000+d1000]
> [29738.892931] perf_fuzzer[7128]: segfault at 18c0 ip 000000000041efab sp 00000000ffcdcd88 error 6 in perf_fuzzer[400000+d1000]
> [29815.550210] perf_fuzzer[7132]: segfault at 120 ip 000000000041efab sp 00000000ffe673b8 error 6 in perf_fuzzer[400000+d1000]
> [30173.455348] perf_fuzzer[7141]: segfault at 120 ip 000000000041efab sp 00000000ffda1948 error 6 in perf_fuzzer[400000+d1000]
> [30570.625642] perf_fuzzer[7156]: segfault at 1680 ip 000000000041efab sp 00000000ffaad028 error 6 in perf_fuzzer[400000+d1000]
> [31047.887784] perf_fuzzer[7169]: segfault at 60c0 ip 000000000041efab sp 00000000ffaa86e8 error 6 in perf_fuzzer[400000+d1000]
> [31300.168714] perf_fuzzer[7175]: segfault at 3a80 ip 000000000041efab sp 00000000ffd83228 error 6 in perf_fuzzer[400000+d1000]
> [31984.727278] perf_fuzzer[7193]: segfault at 7e0 ip 000000000041efab sp 00000000ff9db1f8 error 6 in perf_fuzzer[400000+d1000]

Error 6 reflects a write in userspace to a not-present page.

Since your previous trace indicates that the value of the register in
question is a different one, I'm guessing that what we have here is PEBS
getting activated. 0x120 is 2*0x90, and 0x90 is the size of a 64-bit
PEBS record.

-hpa

2014-02-23 05:16:20

by Vince Weaver

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On Fri, 21 Feb 2014, H. Peter Anvin wrote:

> Error 6 reflects a write in userspace to a not-present page.
>
> Since your previous trace indicates that the value of the register in question
> is a different one, I'm guessing that what we have here is PEBS getting
> activated. 0x120 is 2*0x90, and 0x90 is the size of a 64-bit PEBS record.

I'm having problems generating a replayable syscall trace that exhibits
the problem.

It turns out that the segfault address listed (the multiple of 0x120)
happens to be the value in the RBP register at the time of the segfault.

That's odd, as the instruction is
movdqa %xmm0,(%rdi)
and rdi is the valid mmap address of the perf ring buffer
rdi 0xf7768000 4151738368

so I'm not sure why RBP is involved at all.

In all of the cases I've investigated the precise_ip value has been set
for the problem event... but none of the events have been hardware events
(software and breakpoint so far). So probably not PEBS related?

Vince

2014-02-23 05:25:09

by H. Peter Anvin

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

What is the instructions around it, by any chance?

On February 22, 2014 9:18:17 PM PST, Vince Weaver <[email protected]> wrote:
>On Fri, 21 Feb 2014, H. Peter Anvin wrote:
>
>> Error 6 reflects a write in userspace to a not-present page.
>>
>> Since your previous trace indicates that the value of the register in
>question
>> is a different one, I'm guessing that what we have here is PEBS
>getting
>> activated. 0x120 is 2*0x90, and 0x90 is the size of a 64-bit PEBS
>record.
>
>I'm having problems generating a replayable syscall trace that exhibits
>
>the problem.
>
>It turns out that the segfault address listed (the multiple of 0x120)
>happens to be the value in the RBP register at the time of the
>segfault.
>
>That's odd, as the instruction is
> movdqa %xmm0,(%rdi)
>and rdi is the valid mmap address of the perf ring buffer
> rdi 0xf7768000 4151738368
>
>so I'm not sure why RBP is involved at all.
>
>In all of the cases I've investigated the precise_ip value has been set
>
>for the problem event... but none of the events have been hardware
>events
>(software and breakpoint so far). So probably not PEBS related?
>
>Vince

--
Sent from my mobile phone. Please pardon brevity and lack of formatting.

2014-02-23 06:08:48

by H. Peter Anvin

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

I'd be interested in how rbp gets set, too. It might just be a coincidence and the value in rbp has some other meaning here.

On February 22, 2014 9:18:17 PM PST, Vince Weaver <[email protected]> wrote:
>On Fri, 21 Feb 2014, H. Peter Anvin wrote:
>
>> Error 6 reflects a write in userspace to a not-present page.
>>
>> Since your previous trace indicates that the value of the register in
>question
>> is a different one, I'm guessing that what we have here is PEBS
>getting
>> activated. 0x120 is 2*0x90, and 0x90 is the size of a 64-bit PEBS
>record.
>
>I'm having problems generating a replayable syscall trace that exhibits
>
>the problem.
>
>It turns out that the segfault address listed (the multiple of 0x120)
>happens to be the value in the RBP register at the time of the
>segfault.
>
>That's odd, as the instruction is
> movdqa %xmm0,(%rdi)
>and rdi is the valid mmap address of the perf ring buffer
> rdi 0xf7768000 4151738368
>
>so I'm not sure why RBP is involved at all.
>
>In all of the cases I've investigated the precise_ip value has been set
>
>for the problem event... but none of the events have been hardware
>events
>(software and breakpoint so far). So probably not PEBS related?
>
>Vince

--
Sent from my mobile phone. Please pardon brevity and lack of formatting.

2014-02-23 14:03:02

by Vince Weaver

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On Sat, 22 Feb 2014, H. Peter Anvin wrote:

> I'd be interested in how rbp gets set, too. It might just be a
> coincidence and the value in rbp has some other meaning here.

The code in question does this:

i=find_random_active_event();
if (i<0) return;
if ((event_data[i].mmap)) {
value=0xdeadbeef;
memset(event_data[i].mmap,value,getpagesize());

[New LWP 10526]
Core was generated by `./perf_fuzzer -t OCIRMQWPpAi -r 1392938876'.
Program terminated with signal 11, Segmentation fault.
#0 0x0041efab in __memset_sse2 ()
(gdb) bt
#0 0x0041efab in __memset_sse2 ()
#1 0x004017ec in trash_random_mmap () at perf_fuzzer.c:808
#2 main (argc=<optimized out>, argv=<optimized out>) at perf_fuzzer.c:1604

So rbp is set by the imul below, it is the offset into the
event_data[i] array where the elements have size of 0x120

0x004017bd <+3085>: callq 0x402ee0 <find_random_active_event>
0x004017c2 <+3090>: test %eax,%eax
0x004017c4 <+3092>: js 0x4011e8 <main+1592>
0x004017ca <+3098>: imul $0x120,%eax,%ebp
0x004017d0 <+3104>: mov 0x756b2c(%ebp),%eax

0x004017d7 <+3111>: test %eax,%eax
0x004017d9 <+3113>: je 0x40183b <main+3211>

0x004017db <+3115>: mov 0xc(%esp),%edx
0x004017e0 <+3120>: mov %eax,%edi
0x004017e2 <+3122>: mov $0xdeadbeef,%esi
0x004017e7 <+3127>: callq 0x400260
0x004017ec <+3132>: testb $0x20,0x353e76(%rip) # 0x755669 <logging+$

400260: ff 25 ce 0e 2d 00 jmpq *0x2d0ece(%rip) # 6d1134 $

0x6d1134: 0x0041e710

Dump of assembler code for function __memset_sse2:

0x0041e710 <+0>: cmp $0x1,%rdx
0x0041e714 <+4>: mov %rdi,%rax
0x0041e717 <+7>: jne 0x41e71d <__memset_sse2+13>
0x0041e719 <+9>: mov %sil,(%rdi)

and as far as I can tell nothing touches rbp again until the segfault.
Nothing in _memset_sse2 does as far as I can tell.

Vince

2014-02-24 03:00:17

by Vince Weaver

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On Sun, 23 Feb 2014, Vince Weaver wrote:
>
> and as far as I can tell nothing touches > Nothing in _memset_sse2 does as far as
I only know enough about ftrace to be dangerous, is the trace of the problem:

perf_fuzzer-11492 [000] 197077.488363: function: perf_fuzzer-11492 [000] 197077.488363: function: perf_fuzzer-11492 [000] 197077.488365: function: perf_fuzzer-11492 [000] 197077.488365: function: perf_fuzzer-11492 [000] 197077.488366: function: perf_fuzzer-11492 [000] 197077.488367: function: perf_fuzzer-11492 [000] 197077.488366: function: perf_fuzzer-11492 [000] 197077.488367: function: perf_fuzzer-11492 [000] 197077.488368: function: perf_fuzzer-11492 [000] 197077.488368: function: perf_fuzzer-11492 [000] 197077.488369: function: perf_fuzzer-11492 [000] 197077.488370: function: perf_fuzzer-11492 [000] 197077.488370: function: perf_fuzzer-11492 [000] 197077.488371: function: perf_fuzzer-11492 [000] 197077.488371: function: perf_fuzzer-11492 [000] 197077.488372: function: perf_fuzzer-11492 [000] 197077.488373: function: perf_fuzzer-11492 [000] 197077.488373: function: perf_fuzzer-11492 [000] 197077.488374: function: perf_fuzzer-11492 [000] 197077.488374: function: perf_fuzzer-11492 [000] 197077.488375: function: perf_fuzzer-11492 [000] 197077.488376: function: perf_fuzzer-11492 [000] 197077.488377: function: perf_fuzzer-11492 [000] 197077.488378: function: perf_fuzzer-11492 [000] 197077.488378: function: perf_fuzzer-11492 [000] 197077.488379: function: perf_fuzzer-11492 [000] 197077.488380: function: perf_fuzzer-11492 [000] 197077.488380: function: perf_fuzzer-11492 [000] 197077.488381: function: perf_fuzzer-11492 [000] 197077.488382: function: perf_fuzzer-11492 [000] 197077.488383: function: perf_fuzzer-11492 [000] 197077.488383: sys_exit: perf_fuzzer-11492 [000] 197077.488387: function: perf_fuzzer-11492 [000] 197077.488387: function: perf_fuzzer-11492 [000] 197077.488390: function: perf_fuzzer-11492 [000] 197077.488391: page_fault_user: perf_fuzzer-11492 [000] 197077.488395: function: perf_fuzzer-11492 [000] 197077.488396: function: perf_fuzzer-11492 [000] 197077.488397: function: perf_fuzzer-11492 [000] 197077.488398: page_fault_kernel: perf_fuzzer-11492 [000] 197077.488399: function: perf_fuzzer-11492 [000] 197077.488400: function: perf_fuzzer-11492 [000] 197077.488401: function: perf_fuzzer-11492 [000] 197077.488401: function: perf_fuzzer-11492 [000] 197077.488402: function: perf_fuzzer-11492 [000] 197077.488403: function: perf_fuzzer-11492 [000] 197077.488403: function: perf_fuzzer-11492 [000] 197077.488405: function: perf_fuzzer-11492 [000] 197077.488406: function: perf_fuzzer-11492 [000] 197077.488406: page_fault_kernel: perf_fuzzer-11492 [000] 197077.488407: function: perf_fuzzer-11492 [000] 197077.488408: function: perf_fuzzer-11492 [000] 197077.488409: function: perf_fuzzer-11492 [000] 197077.488409: function: perf_fuzzer-11492 [000] 197077.488410: function: perf_fuzzer-11492 [000] 197077.488410: function: perf_fuzzer-11492 [000] 197077.488411: function: perf_fuzzer-11492 [000] 197077.488413: function: perf_fuzzer-11492 [000] 197077.488414: function: perf_fuzzer-11492 [000] 197077.488415: function: perf_fuzzer-11492 [000] 197077.488415: function: perf_fuzzer-11492 [000] 197077.488416: function: perf_fuzzer-11492 [000] 197077.488418: function: perf_fuzzer-11492 [000] 197077.488419: function: perf_fuzzer-11492 [000] 197077.488419: function: perf_fuzzer-11492 [000] 197077.488420: function: perf_fuzzer-11492 [000] 197077.488421: function: perf_fuzzer-11492 [000] 197077.488422: function: perf_fuzzer-11492 [000] 197077.488423: function: perf_fuzzer-11492 [000] 197077.488423: function: perf_fuzzer-11492 [000] 197077.488424: function: perf_fuzzer-11492 [000] 197077.488425: function: perf_fuzzer-11492 [000] 197077.488426: function: perf_fuzzer-11492 [000] 197077.488426: function: perf_fuzzer-11492 [000] 197077.488427: function: perf_fuzzer-11492 [000] 197077.488428: function: perf_fuzzer-11492 [000] 197077.488429: function: perf_fuzzer-11492 [000] 197077.488430: function: perf_fuzzer-11492 [000] 197077.488430: function: perf_fuzzer-11492 [000] 197077.488431: function: perf_fuzzer-11492 [000] 197077.488432: function: perf_fuzzer-11492 [000] 197077.488434: function: perf_fuzzer-11492 [000] 197077.488443: function: perf_fuzzer-11492 [000] 197077.488444: function: perf_fuzzer-11492 [000] 197077.488445: function: perf_fuzzer-11492 [000] 197077.488445: function: perf_fuzzer-11492 [000] 197077.488446: function: perf_fuzzer-11492 [000] 197077.488447: function: perf_fuzzer-11492 [000] 197077.488447: function: perf_fuzzer-11492 [000] 197077.488449: function: perf_fuzzer-11492 [000] 197077.488452: function: perf_fuzzer-11492 [000] 197077.488453: console: rbp again until the segfault.
I can tell.
but here is what I think
intel_get_event_constraints
intel_pebs_constraints
intel_put_event_constraints
intel_pmu_enable_all
intel_pmu_pebs_enable_all
intel_pmu_lbr_enable_all
intel_pmu_pebs_enable_all
intel_pmu_lbr_enable_all
mutex_unlock
mutex_lock
_cond_resched
_raw_spin_lock_irq
mutex_unlock
mutex_lock
_cond_resched
_raw_spin_lock_irq
mutex_unlock
mutex_lock
_cond_resched
_raw_spin_lock_irq
smp_call_function_single
_raw_spin_lock
mutex_unlock
mutex_lock
_cond_resched
_raw_spin_lock_irq
smp_call_function_single
_raw_spin_lock
mutex_unlock
mutex_unlock
syscall_trace_leave
NR 1073741981 = 0
do_device_not_available
math_state_restore
trace_do_page_fault
address=__per_cpu_end ip=__per_cpu_end error_code=0x6
perf_callchain
copy_from_user_nmi
trace_do_page_fault
address=irq_stack_union ip=copy_user_generic_string error_code=0x0
__do_page_fault
bad_area_nosemaphore
__bad_area_nosemaphore
no_context
fixup_exception
search_exception_tables
search_extable
copy_user_handle_tail
trace_do_page_fault
address=irq_stack_union ip=copy_user_handle_tail error_code=0x0
__do_page_fault
bad_area_nosemaphore
__bad_area_nosemaphore
no_context
fixup_exception
search_exception_tables
search_extable
perf_output_begin
perf_output_copy
perf_output_copy
perf_output_copy
perf_output_copy
perf_output_copy
perf_output_copy
perf_output_end
perf_output_put_handle
__do_page_fault
down_read_trylock
_cond_resched
find_vma
bad_area
up_read
__bad_area_nosemaphore
is_prefetch
convert_ip_to_linear
unhandled_signal
__printk_ratelimit
_raw_spin_trylock
_raw_spin_unlock_irqrestore
printk
vprintk_emit
_raw_spin_lock
cont_add
console_trylock
down_trylock
_raw_spin_lock_irqsave
_raw_spin_unlock_irqrestore
console_unlock
_raw_spin_lock_irqsave
print_time
T.950
[197179.420735] perf_fuzzer[11492]: segfault at 22e0 ip 000000000041efab sp 00000000ffda0938 error 6

2014-02-24 05:23:23

by H. Peter Anvin

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On 02/23/2014 07:02 PM, Vince Weaver wrote:
> On Sun, 23 Feb 2014, Vince Weaver wrote:
>>
>> and as far as I can tell nothing touches rbp again until the segfault.
>> Nothing in _memset_sse2 does as far as I can tell.
>
> I only know enough about ftrace to be dangerous, but here is what I think
> is the trace of the problem:
>
> perf_fuzzer-11492 [000] 197077.488420: function: perf_output_put_handle
> perf_fuzzer-11492 [000] 197077.488421: function: __do_page_fault

So we do a write to the buffer rather immediately before this happens,
and in particular that will update the head:

rb->user_page->data_head = head;

However, that doesn't explain what is going on and in particular the
write to whatever address was in %rbp. The rest pretty much seems to be
the page fault logic.

Incidentally, I doubt that this is x32-related in any way; there seems
to be absolutely no difference between x86-64 perf and x32 perf; more
likely it just makes the error more reproducible because the address
space is so much smaller.

-hpa

2014-02-24 15:33:37

by Vince Weaver

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On Sun, 23 Feb 2014, H. Peter Anvin wrote:

> So we do a write to the buffer rather immediately before this happens,
> and in particular that will update the head:
>
> rb->user_page->data_head = head;
>
> However, that doesn't explain what is going on and in particular the
> write to whatever address was in %rbp. The rest pretty much seems to be
> the page fault logic.

It turns out you don't even have to over-write rb->user_page->data_head.
Just touching the mmap page with a write of a single byte (it doesn't
matter where) is enough to trigger the bug.

This is a pain to track down, it would be easier if I could get a
replayable syscall trace, but even though the segfault is very
reproducible with my fuzzer, it's very sensitive to extra syscalls in the
trace path and the fuzzer logger/replayer path has a different number of
write syscalls and won't trigger the problem.

> Incidentally, I doubt that this is x32-related in any way; there seems
> to be absolutely no difference between x86-64 perf and x32 perf; more
> likely it just makes the error more reproducible because the address
> space is so much smaller.

quite possibly. I only began chasing the problem because when compiled
for x32 this bug apparently will reboot the machine now and then (not just
segfault the program). I never saw that failure mode with x86_64, but
again maybe it's just easier to hit with the reduced address space as you
say.

Vince

2014-02-24 16:32:32

by Vince Weaver

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On Mon, 24 Feb 2014, Vince Weaver wrote:

> Just touching the mmap page with a write of a single byte (it doesn't
> matter where) is enough to trigger the bug.

OK, investigating this more.

perf_fuzzer-2971 [000] 154.944114: page_fault_user: address=0xf7729000 ip=0x41efab error_code=0x6
perf_fuzzer-2971 [000] 154.944118: function: ip=0xffffffff810d40e7 parent_ip=0xffffffff810d0840
perf_fuzzer-2971 [000] 154.944119: function: ip=0xffffffff812a91a5 parent_ip=0xffffffff81013ff5
perf_fuzzer-2971 [000] 154.944120: function: ip=0xffffffff8153837c parent_ip=0xffffffff81535432
perf_fuzzer-2971 [000] 154.944121: page_fault_kernel: address=0x22e0 ip=0xffffffff812a7d5c error_code=0x0

It looks like there are two page faults. The first is caused by the user
code accessing the mmap'd page. It looks sort of normal and what you'd
expect if the perf_event mmap ring buffer is being accessed for the first
time.

What follows is a kernel page fault, and this is the one where for
whatever reason CR2 has obtained the value of the userspace RBP register.

Vince

2014-02-24 16:47:59

by H. Peter Anvin

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Ok, so the obvious question is what is at that kernel address?

On February 24, 2014 8:34:30 AM PST, Vince Weaver <[email protected]> wrote:
>On Mon, 24 Feb 2014, Vince Weaver wrote:
>
>> Just touching the mmap page with a write of a single byte (it doesn't
>
>> matter where) is enough to trigger the bug.
>
>OK, investigating this more.
>
>perf_fuzzer-2971 [000] 154.944114: page_fault_user:
>address=0xf7729000 ip=0x41efab error_code=0x6
>perf_fuzzer-2971 [000] 154.944118: function:
>ip=0xffffffff810d40e7 parent_ip=0xffffffff810d0840
>perf_fuzzer-2971 [000] 154.944119: function:
>ip=0xffffffff812a91a5 parent_ip=0xffffffff81013ff5
>perf_fuzzer-2971 [000] 154.944120: function:
>ip=0xffffffff8153837c parent_ip=0xffffffff81535432
>perf_fuzzer-2971 [000] 154.944121: page_fault_kernel:
>address=0x22e0 ip=0xffffffff812a7d5c error_code=0x0
>
>It looks like there are two page faults. The first is caused by the
>user
>code accessing the mmap'd page. It looks sort of normal and what you'd
>expect if the perf_event mmap ring buffer is being accessed for the
>first
>time.
>
>What follows is a kernel page fault, and this is the one where for
>whatever reason CR2 has obtained the value of the userspace RBP
>register.
>
>Vince

--
Sent from my mobile phone. Please pardon brevity and lack of formatting.

2014-02-24 17:08:51

by Vince Weaver

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On Mon, 24 Feb 2014, H. Peter Anvin wrote:

> On February 24, 2014 8:34:30 AM PST, Vince Weaver <[email protected]> wrote:
> >On Mon, 24 Feb 2014, Vince Weaver wrote:
> >
> >> Just touching the mmap page with a write of a single byte (it doesn't
> >
> >> matter where) is enough to trigger the bug.
> >
> >OK, investigating this more.
> >
> >perf_fuzzer-2971 [000] 154.944114: page_fault_user:
> >address=0xf7729000 ip=0x41efab error_code=0x6
> >perf_fuzzer-2971 [000] 154.944118: function:
> >ip=0xffffffff810d40e7 parent_ip=0xffffffff810d0840
> >perf_fuzzer-2971 [000] 154.944119: function:
> >ip=0xffffffff812a91a5 parent_ip=0xffffffff81013ff5
> >perf_fuzzer-2971 [000] 154.944120: function:
> >ip=0xffffffff8153837c parent_ip=0xffffffff81535432
> >perf_fuzzer-2971 [000] 154.944121: page_fault_kernel:
> >address=0x22e0 ip=0xffffffff812a7d5c error_code=0x0

> Ok, so the obvious question is what is at that kernel address?
>

It's in copy_user_generic_string()
rep movsq %ds:(%rsi),%es:(%rdi)

And looking at the ftrace:
perf_fuzzer-2979 [000] 161.475920: page_fault_user: address=__per_cpu_end ip=__per_cpu_end error_code=0x6
perf_fuzzer-2979 [000] 161.475922: function: perf_callchain
perf_fuzzer-2979 [000] 161.475922: function: copy_from_user_nmi
perf_fuzzer-2979 [000] 161.475923: function: trace_do_page_fault
perf_fuzzer-2979 [000] 161.475924: page_fault_kernel: address=irq_stack_union ip=copy_user_generic_string error_code=0x0

What is likely happening is the user page fault is triggering
code to do a "perf_callchain" dump, which is calling copy_from_user_nmi()
which calls copy_user_generic_string() which is somehow getting the user
RBP in the RDI register somehow?

Vince

2014-02-24 17:25:48

by Peter Zijlstra

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On Mon, Feb 24, 2014 at 12:10:44PM -0500, Vince Weaver wrote:
> On Mon, 24 Feb 2014, H. Peter Anvin wrote:
>
> > On February 24, 2014 8:34:30 AM PST, Vince Weaver <[email protected]> wrote:
> > >On Mon, 24 Feb 2014, Vince Weaver wrote:
> > >
> > >> Just touching the mmap page with a write of a single byte (it doesn't
> > >
> > >> matter where) is enough to trigger the bug.
> > >
> > >OK, investigating this more.
> > >
> > >perf_fuzzer-2971 [000] 154.944114: page_fault_user:
> > >address=0xf7729000 ip=0x41efab error_code=0x6
> > >perf_fuzzer-2971 [000] 154.944118: function:
> > >ip=0xffffffff810d40e7 parent_ip=0xffffffff810d0840
> > >perf_fuzzer-2971 [000] 154.944119: function:
> > >ip=0xffffffff812a91a5 parent_ip=0xffffffff81013ff5
> > >perf_fuzzer-2971 [000] 154.944120: function:
> > >ip=0xffffffff8153837c parent_ip=0xffffffff81535432
> > >perf_fuzzer-2971 [000] 154.944121: page_fault_kernel:
> > >address=0x22e0 ip=0xffffffff812a7d5c error_code=0x0
>
> > Ok, so the obvious question is what is at that kernel address?
> >
>
> It's in copy_user_generic_string()
> rep movsq %ds:(%rsi),%es:(%rdi)
>
> And looking at the ftrace:
> perf_fuzzer-2979 [000] 161.475920: page_fault_user: address=__per_cpu_end ip=__per_cpu_end error_code=0x6
> perf_fuzzer-2979 [000] 161.475922: function: perf_callchain
> perf_fuzzer-2979 [000] 161.475922: function: copy_from_user_nmi
> perf_fuzzer-2979 [000] 161.475923: function: trace_do_page_fault
> perf_fuzzer-2979 [000] 161.475924: page_fault_kernel: address=irq_stack_union ip=copy_user_generic_string error_code=0x0
>
> What is likely happening is the user page fault is triggering
> code to do a "perf_callchain" dump, which is calling copy_from_user_nmi()
> which calls copy_user_generic_string() which is somehow getting the user
> RBP in the RDI register somehow?

So that code very much relies on the 'recursive' NMI/iret magic from
Steve, patch 3f3c8b8c4b2a3 (and assorted fixes later).

If CR2 is getting corrupted; 7fbb98c5cb075 seems relevant.

Peter, does x32 have a slightly different ABI/calling convention that
would make any of these patches just slightly 'off'?

2014-02-24 17:30:54

by Vince Weaver

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On Mon, 24 Feb 2014, Peter Zijlstra wrote:

> On Mon, Feb 24, 2014 at 12:10:44PM -0500, Vince Weaver wrote:
> > On Mon, 24 Feb 2014, H. Peter Anvin wrote:
> >
> > > On February 24, 2014 8:34:30 AM PST, Vince Weaver <[email protected]> wrote:
> > > >On Mon, 24 Feb 2014, Vince Weaver wrote:
> > > >
> > > >> Just touching the mmap page with a write of a single byte (it doesn't
> > > >
> > > >> matter where) is enough to trigger the bug.
> > > >
> > > >OK, investigating this more.
> > > >
> > > >perf_fuzzer-2971 [000] 154.944114: page_fault_user:
> > > >address=0xf7729000 ip=0x41efab error_code=0x6
> > > >perf_fuzzer-2971 [000] 154.944118: function:
> > > >ip=0xffffffff810d40e7 parent_ip=0xffffffff810d0840
> > > >perf_fuzzer-2971 [000] 154.944119: function:
> > > >ip=0xffffffff812a91a5 parent_ip=0xffffffff81013ff5
> > > >perf_fuzzer-2971 [000] 154.944120: function:
> > > >ip=0xffffffff8153837c parent_ip=0xffffffff81535432
> > > >perf_fuzzer-2971 [000] 154.944121: page_fault_kernel:
> > > >address=0x22e0 ip=0xffffffff812a7d5c error_code=0x0
> >
> > > Ok, so the obvious question is what is at that kernel address?
> > >
> >
> > It's in copy_user_generic_string()
> > rep movsq %ds:(%rsi),%es:(%rdi)
> >
> > And looking at the ftrace:
> > perf_fuzzer-2979 [000] 161.475920: page_fault_user: address=__per_cpu_end ip=__per_cpu_end error_code=0x6
> > perf_fuzzer-2979 [000] 161.475922: function: perf_callchain
> > perf_fuzzer-2979 [000] 161.475922: function: copy_from_user_nmi
> > perf_fuzzer-2979 [000] 161.475923: function: trace_do_page_fault
> > perf_fuzzer-2979 [000] 161.475924: page_fault_kernel: address=irq_stack_union ip=copy_user_generic_string error_code=0x0
> >
> > What is likely happening is the user page fault is triggering
> > code to do a "perf_callchain" dump, which is calling copy_from_user_nmi()
> > which calls copy_user_generic_string() which is somehow getting the user
> > RBP in the RDI register somehow?
>
> So that code very much relies on the 'recursive' NMI/iret magic from
> Steve, patch 3f3c8b8c4b2a3 (and assorted fixes later).
>
> If CR2 is getting corrupted; 7fbb98c5cb075 seems relevant.
>
> Peter, does x32 have a slightly different ABI/calling convention that
> would make any of these patches just slightly 'off'?

I do note that
perf_callchain_user();

Does
fp = (void __user *)regs->bp;

...

bytes = copy_from_user_nmi(&frame, fp, sizeof(frame));

And in my particular executable RBP has nothing to do with a frame
pointer, but is instead being used as a general purpose register.

Am I missing something here? Though in that case I'm not sure why this
wouldn't be easier to trigger.

Vince

2014-02-24 17:39:11

by Vince Weaver

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On Mon, 24 Feb 2014, Vince Weaver wrote:

> I do note that
> perf_callchain_user();
>
> Does
> fp = (void __user *)regs->bp;
>
> ...
>
> bytes = copy_from_user_nmi(&frame, fp, sizeof(frame));
>
>
> And in my particular executable RBP has nothing to do with a frame
> pointer, but is instead being used as a general purpose register.

and as a reminder, I'm seeing this on an x32 executable, so
perf_callchain_user32() is probably coming into play.

So maybe it is an x32 issue after all.

Vince

2014-02-24 17:41:05

by Peter Zijlstra

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On Mon, Feb 24, 2014 at 12:32:39PM -0500, Vince Weaver wrote:
> I do note that
> perf_callchain_user();
>
> Does
> fp = (void __user *)regs->bp;
>
> ...
>
> bytes = copy_from_user_nmi(&frame, fp, sizeof(frame));
>
>
> And in my particular executable RBP has nothing to do with a frame
> pointer, but is instead being used as a general purpose register.
>
> Am I missing something here? Though in that case I'm not sure why this
> wouldn't be easier to trigger.

Ah, in case the frame doesn't actually exist we would expect to fault
and get the fixup treatment, returning a short copy (the return value
being bytes _NOT_ copied).

When that happens;

if (bytes != 0)
break;

At which point we'll terminate the stack frame iteration.

This is where we rely on being able to take a fault from NMI context,
the fault iret will re-enable NMIs, necessitating all the magic Steve
did.

2014-02-24 17:41:46

by H. Peter Anvin

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On 02/24/2014 09:32 AM, Vince Weaver wrote:
>>
>> Peter, does x32 have a slightly different ABI/calling convention that
>> would make any of these patches just slightly 'off'?
>
> I do note that
> perf_callchain_user();
>
> Does
> fp = (void __user *)regs->bp;
>
> ...
>
> bytes = copy_from_user_nmi(&frame, fp, sizeof(frame));
>
>
> And in my particular executable RBP has nothing to do with a frame
> pointer, but is instead being used as a general purpose register.
>
> Am I missing something here? Though in that case I'm not sure why this
> wouldn't be easier to trigger.
>

Neither x86-64 nor x32 are typically compiled with fixed frame pointers
(which would be %rbp if they are). So I'm guessing the perf_callchain
logic is only applicable to a user-space binary explicitly compiled with
frame pointers turned on.

So copy_from_user_nmi() stumbles onto a nonexistent page and takes a
page fault. This isn't a big deal, because perf_callchain_user() is set
up to handle that (and just terminates the trace), *except* now CR2 is
corrupt, and we took this event while handling a page fault already...
and apparently before we even did read_cr2() in __do_page_fault.

The description of copy_from_user_nmi() states:

/*
* We rely on the nested NMI work to allow atomic faults from the NMI
path; the
* nested NMI paths are careful to preserve CR2.
*/

... but that doesn't seem to happen here for whatever reason.

There is no hint in your trace what happens after the kernel page fault
so that makes it hard to know.

-hpa

2014-02-24 17:43:30

by H. Peter Anvin

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On 02/24/2014 09:41 AM, Vince Weaver wrote:
> On Mon, 24 Feb 2014, Vince Weaver wrote:
>
>> I do note that
>> perf_callchain_user();
>>
>> Does
>> fp = (void __user *)regs->bp;
>>
>> ...
>>
>> bytes = copy_from_user_nmi(&frame, fp, sizeof(frame));
>>
>>
>> And in my particular executable RBP has nothing to do with a frame
>> pointer, but is instead being used as a general purpose register.
>
> and as a reminder, I'm seeing this on an x32 executable, so
> perf_callchain_user32() is probably coming into play.
>
> So maybe it is an x32 issue after all.
>

No.

if (!test_thread_flag(TIF_IA32))
return 0;

TIF_IA32 is clear for an x32 process.

-hpa

2014-02-24 17:52:55

by H. Peter Anvin

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On 02/24/2014 09:25 AM, Peter Zijlstra wrote:
>>
>> What is likely happening is the user page fault is triggering
>> code to do a "perf_callchain" dump, which is calling copy_from_user_nmi()
>> which calls copy_user_generic_string() which is somehow getting the user
>> RBP in the RDI register somehow?
>
> So that code very much relies on the 'recursive' NMI/iret magic from
> Steve, patch 3f3c8b8c4b2a3 (and assorted fixes later).
>
> If CR2 is getting corrupted; 7fbb98c5cb075 seems relevant.
>
> Peter, does x32 have a slightly different ABI/calling convention that
> would make any of these patches just slightly 'off'?
>

As long as we're talking kernel code, x32 isn't even involved (we do not
support compiling the kernel as x32 and most likely never will.)

-hpa

2014-02-24 17:58:19

by Vince Weaver

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On Mon, 24 Feb 2014, H. Peter Anvin wrote:

> On 02/24/2014 09:32 AM, Vince Weaver wrote:
> >>
> >> Peter, does x32 have a slightly different ABI/calling convention that
> >> would make any of these patches just slightly 'off'?
> >
> > I do note that
> > perf_callchain_user();
> >
> > Does
> > fp = (void __user *)regs->bp;
> >
> > ...
> >
> > bytes = copy_from_user_nmi(&frame, fp, sizeof(frame));
> >
> >
> > And in my particular executable RBP has nothing to do with a frame
> > pointer, but is instead being used as a general purpose register.
> >
> > Am I missing something here? Though in that case I'm not sure why this
> > wouldn't be easier to trigger.
> >
>
> Neither x86-64 nor x32 are typically compiled with fixed frame pointers
> (which would be %rbp if they are). So I'm guessing the perf_callchain
> logic is only applicable to a user-space binary explicitly compiled with
> frame pointers turned on.
>
> So copy_from_user_nmi() stumbles onto a nonexistent page and takes a
> page fault. This isn't a big deal, because perf_callchain_user() is set
> up to handle that (and just terminates the trace), *except* now CR2 is
> corrupt, and we took this event while handling a page fault already...
> and apparently before we even did read_cr2() in __do_page_fault.
>
> The description of copy_from_user_nmi() states:
>
> /*
> * We rely on the nested NMI work to allow atomic faults from the NMI
> path; the
> * nested NMI paths are careful to preserve CR2.
> */
>
> ... but that doesn't seem to happen here for whatever reason.
>
> There is no hint in your trace what happens after the kernel page fault
> so that makes it hard to know.

Ahh, ftrace, the cause of and solution to all my perf_fuzzing problems.

Anyway I've attached the full tail end of the trace if you want to see
everything that happens.

Vince

Attachments:

little.trace.bz2 (59.99 kB)

2014-02-24 18:05:16

by Vince Weaver

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On Mon, 24 Feb 2014, Vince Weaver wrote:

> On Mon, 24 Feb 2014, H. Peter Anvin wrote:
>
> > On 02/24/2014 09:32 AM, Vince Weaver wrote:
> > >>
> > >> Peter, does x32 have a slightly different ABI/calling convention that
> > >> would make any of these patches just slightly 'off'?
> > >
> > > I do note that
> > > perf_callchain_user();
> > >
> > > Does
> > > fp = (void __user *)regs->bp;
> > >
> > > ...
> > >
> > > bytes = copy_from_user_nmi(&frame, fp, sizeof(frame));
> > >
> > >
> > > And in my particular executable RBP has nothing to do with a frame
> > > pointer, but is instead being used as a general purpose register.
> > >
> > > Am I missing something here? Though in that case I'm not sure why this
> > > wouldn't be easier to trigger.
> > >
> >
> > Neither x86-64 nor x32 are typically compiled with fixed frame pointers
> > (which would be %rbp if they are). So I'm guessing the perf_callchain
> > logic is only applicable to a user-space binary explicitly compiled with
> > frame pointers turned on.
> >
> > So copy_from_user_nmi() stumbles onto a nonexistent page and takes a
> > page fault. This isn't a big deal, because perf_callchain_user() is set
> > up to handle that (and just terminates the trace), *except* now CR2 is
> > corrupt, and we took this event while handling a page fault already...
> > and apparently before we even did read_cr2() in __do_page_fault.
> >
> > The description of copy_from_user_nmi() states:
> >
> > /*
> > * We rely on the nested NMI work to allow atomic faults from the NMI
> > path; the
> > * nested NMI paths are careful to preserve CR2.
> > */
> >
> > ... but that doesn't seem to happen here for whatever reason.
> >
> > There is no hint in your trace what happens after the kernel page fault
> > so that makes it hard to know.
>
> Ahh, ftrace, the cause of and solution to all my perf_fuzzing problems.
>
> Anyway I've attached the full tail end of the trace if you want to see
> everything that happens.

and then I note there are *two* kernel page faults.

perf_fuzzer-2979 [000] 161.475924: page_fault_kernel: address=irq_stack_union ip=copy_user_generic_string error_code=0x0
address=0x1 ip=0xffffffff812a7d9c error_code=0x0
perf_fuzzer-2979 [000] 161.475924: function: __do_page_fault
perf_fuzzer-2979 [000] 161.475924: function: bad_area_nosemaphore
perf_fuzzer-2979 [000] 161.475925: function: __bad_area_nosemaphore
perf_fuzzer-2979 [000] 161.475925: function: no_context
perf_fuzzer-2979 [000] 161.475925: function: fixup_exception
perf_fuzzer-2979 [000] 161.475926: function: search_exception_tables
perf_fuzzer-2979 [000] 161.475926: function: search_extable
perf_fuzzer-2979 [000] 161.475927: function: copy_user_handle_tail
perf_fuzzer-2979 [000] 161.475927: function: trace_do_page_fault
perf_fuzzer-2979 [000] 161.475928: page_fault_kernel: address=irq_stack_union ip=copy_user_handle_tail error_code=0x0
address=0x1 ip=0xffffffff812a92bb error_code=0x0
perf_fuzzer-2979 [000] 161.475928: function: __do_page_fault
perf_fuzzer-2979 [000] 161.475928: function: bad_area_nosemaphore
perf_fuzzer-2979 [000] 161.475929: function: __bad_area_nosemaphore
perf_fuzzer-2979 [000] 161.475929: function: no_context
perf_fuzzer-2979 [000] 161.475929: function: fixup_exception
perf_fuzzer-2979 [000] 161.475929: function: search_exception_tables
perf_fuzzer-2979 [000] 161.475930: function: search_extable
perf_fuzzer-2979 [000] 161.475931: function: perf_output_begin
perf_fuzzer-2979 [000] 161.475931: function: perf_output_copy

That second one is in copy_user_handle_tail()

Sorry for the sloppy analysis here, I did most of the initial tracing last
night at 1am typing one-handed with a sick crying baby draped over one
shoulder, so not really operating at my best.

Vince

2014-02-24 18:34:58

by H. Peter Anvin

[permalink] [raw]

Subject: Re: perf_fuzzer compiled for x32 causes reboot

On 02/24/2014 10:07 AM, Vince Weaver wrote:
>>
>> Anyway I've attached the full tail end of the trace if you want to see
>> everything that happens.
>
> and then I note there are *two* kernel page faults.
>
> perf_fuzzer-2979 [000] 161.475924: page_fault_kernel: address=irq_stack_union ip=copy_user_generic_string error_code=0x0
> address=0x1 ip=0xffffffff812a7d9c error_code=0x0
> perf_fuzzer-2979 [000] 161.475924: function: __do_page_fault
> perf_fuzzer-2979 [000] 161.475924: function: bad_area_nosemaphore
> perf_fuzzer-2979 [000] 161.475925: function: __bad_area_nosemaphore
> perf_fuzzer-2979 [000] 161.475925: function: no_context
> perf_fuzzer-2979 [000] 161.475925: function: fixup_exception
> perf_fuzzer-2979 [000] 161.475926: function: search_exception_tables
> perf_fuzzer-2979 [000] 161.475926: function: search_extable
> perf_fuzzer-2979 [000] 161.475927: function: copy_user_handle_tail
> perf_fuzzer-2979 [000] 161.475927: function: trace_do_page_fault
> perf_fuzzer-2979 [000] 161.475928: page_fault_kernel: address=irq_stack_union ip=copy_user_handle_tail error_code=0x0
> address=0x1 ip=0xffffffff812a92bb error_code=0x0
> perf_fuzzer-2979 [000] 161.475928: function: __do_page_fault
> perf_fuzzer-2979 [000] 161.475928: function: bad_area_nosemaphore
> perf_fuzzer-2979 [000] 161.475929: function: __bad_area_nosemaphore
> perf_fuzzer-2979 [000] 161.475929: function: no_context
> perf_fuzzer-2979 [000] 161.475929: function: fixup_exception
> perf_fuzzer-2979 [000] 161.475929: function: search_exception_tables
> perf_fuzzer-2979 [000] 161.475930: function: search_extable
> perf_fuzzer-2979 [000] 161.475931: function: perf_output_begin
> perf_fuzzer-2979 [000] 161.475931: function: perf_output_copy
>
> That second one is in copy_user_handle_tail()
>

Either way, it really seems like we have a case of CR2 leakage out of
the NMI context.

-hpa

2014-02-24 19:13:32

Subject: perf_fuzzer causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Attachments:

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Attachments:

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: [PATCH] x86: Rename copy_from_user_nmi() to copy_from_user_trace()

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: [PATCH] x86: Rename copy_from_user_nmi() to copy_from_user_trace()

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: [PATCH] x86: Rename copy_from_user_nmi() to copy_from_user_trace()

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: [PATCH] x86: Rename copy_from_user_nmi() to copy_from_user_trace()

Subject: Re: [PATCH] x86: Rename copy_from_user_nmi() to copy_from_user_trace()

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: [PATCH] x86: Rename copy_from_user_nmi() to copy_from_user_trace()

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot

Subject: Re: perf_fuzzer compiled for x32 causes reboot