2017-03-07 16:30:41

by Nikolay Borisov

[permalink] [raw]
Subject: kasan behavior when built with unsupported compiler

Hello,

I've been chasing a particular UAF as reported by kasan
(https://www.spinics.net/lists/kernel/msg2458136.html). However, one
thing which I took notice of rather lately is that I was building my
kernel with gcc 4.7.4 which is not supported by kasan as indicated by
the following string:

scripts/Makefile.kasan:19: Cannot use CONFIG_KASAN:
-fsanitize=kernel-address is not supported by compiler


Nevertheless, the kernel compiles and when I boot it I see the kasan
splats as per the referenced thread. If, however, I build the kernel
with a newer compiler version 5.4.0 kasan no longer complains.


At this point I'm wondering whether the splats can be due to old
compiler being used e.g. false positives or are they genuine splats and
gcc 5 somehow obfuscates them ? Clearly despite the warning about not
being able to use CONFIG_KASAN it is still working since I'm seeing the
splats. Is this valid behavior ?


Regards,
Nikolay


2017-03-07 16:05:57

by Dmitry Vyukov

[permalink] [raw]
Subject: Re: kasan behavior when built with unsupported compiler

On Tue, Mar 7, 2017 at 4:35 PM, Nikolay Borisov
<[email protected]> wrote:
> Hello,
>
> I've been chasing a particular UAF as reported by kasan
> (https://www.spinics.net/lists/kernel/msg2458136.html). However, one
> thing which I took notice of rather lately is that I was building my
> kernel with gcc 4.7.4 which is not supported by kasan as indicated by
> the following string:
>
> scripts/Makefile.kasan:19: Cannot use CONFIG_KASAN:
> -fsanitize=kernel-address is not supported by compiler
>
>
> Nevertheless, the kernel compiles and when I boot it I see the kasan
> splats as per the referenced thread. If, however, I build the kernel
> with a newer compiler version 5.4.0 kasan no longer complains.
>
>
> At this point I'm wondering whether the splats can be due to old
> compiler being used e.g. false positives or are they genuine splats and
> gcc 5 somehow obfuscates them ? Clearly despite the warning about not
> being able to use CONFIG_KASAN it is still working since I'm seeing the
> splats. Is this valid behavior ?


Hi,

Re the message that kasan is not supported while it's still enabled in the end.
I think it's an issue related to gcc plugins. Originally kasan was
supported with 5.0+ thus the message. However, later we extended this
support to 4.5+ with gcc plugins. However, that confusing message from
build system was not fixed. So yes, it's confusing and it's something
to fix, but mostly you can just ignore it.

Re false positives with 4.7. By default I would assume that it is true
positive. Should be easy to check with manual printfs.

Re why 5.4 does not detect it. Difficult to say.
If you confirm that it's a real bug and provide repro instructions,
then I can recheck it with latest gcc. If it's a real bug and the
latest gcc does not detect it, then we need to look more closely at
it. I afraid 5.4 won't be fixed.
It's also possible that it's a false positive in the old compiler (I
think there were some bugs). If so, I would recommend switching to a
newer compiler.

2017-03-07 16:25:44

by Andrey Ryabinin

[permalink] [raw]
Subject: Re: kasan behavior when built with unsupported compiler



On 03/07/2017 06:54 PM, Dmitry Vyukov wrote:

>
> Hi,
>
> Re the message that kasan is not supported while it's still enabled in the end.
> I think it's an issue related to gcc plugins. Originally kasan was
> supported with 5.0+ thus the message. However, later we extended this
> support to 4.5+ with gcc plugins.

You made this up, we don't have such plugin.

2017-03-07 16:39:27

by Alexander Potapenko

[permalink] [raw]
Subject: Re: kasan behavior when built with unsupported compiler

On Tue, Mar 7, 2017 at 4:54 PM, Dmitry Vyukov <[email protected]> wrote:
> On Tue, Mar 7, 2017 at 4:35 PM, Nikolay Borisov
> <[email protected]> wrote:
>> Hello,
>>
>> I've been chasing a particular UAF as reported by kasan
>> (https://www.spinics.net/lists/kernel/msg2458136.html). However, one
>> thing which I took notice of rather lately is that I was building my
>> kernel with gcc 4.7.4 which is not supported by kasan as indicated by
>> the following string:
>>
>> scripts/Makefile.kasan:19: Cannot use CONFIG_KASAN:
>> -fsanitize=kernel-address is not supported by compiler
>>
>>
>> Nevertheless, the kernel compiles and when I boot it I see the kasan
>> splats as per the referenced thread. If, however, I build the kernel
>> with a newer compiler version 5.4.0 kasan no longer complains.
>>
>>
>> At this point I'm wondering whether the splats can be due to old
>> compiler being used e.g. false positives or are they genuine splats and
>> gcc 5 somehow obfuscates them ? Clearly despite the warning about not
>> being able to use CONFIG_KASAN it is still working since I'm seeing the
>> splats. Is this valid behavior ?
>
>
> Hi,
>
> Re the message that kasan is not supported while it's still enabled in the end.
> I think it's an issue related to gcc plugins. Originally kasan was
> supported with 5.0+ thus the message. However, later we extended this
> support to 4.5+ with gcc plugins. However, that confusing message from
> build system was not fixed. So yes, it's confusing and it's something
> to fix, but mostly you can just ignore it.
>
> Re false positives with 4.7. By default I would assume that it is true
> positive. Should be easy to check with manual printfs.
>
> Re why 5.4 does not detect it. Difficult to say.
> If you confirm that it's a real bug and provide repro instructions,
> then I can recheck it with latest gcc. If it's a real bug and the
> latest gcc does not detect it, then we need to look more closely at
> it. I afraid 5.4 won't be fixed.
> It's also possible that it's a false positive in the old compiler (I
> think there were some bugs). If so, I would recommend switching to a
> newer compiler.

I wonder if there's actual KASAN instrumentation in the kernel in
question built with GCC 4.7.
If there's none, there's little point in investigating this further,
as the tool is anyway barely usable.
Note that the report originates from something like copy_to_user() (or
hard to tell the exact place - Nikolay, could you please symbolize the
report?), i.e. it could be triggered even without KASAN
instrumentation.

--
Alexander Potapenko
Software Engineer

Google Germany GmbH
Erika-Mann-Straße, 33
80636 München

Geschäftsführer: Matthew Scott Sucherman, Paul Terence Manicle
Registergericht und -nummer: Hamburg, HRB 86891
Sitz der Gesellschaft: Hamburg

2017-03-07 16:40:09

by Andrey Ryabinin

[permalink] [raw]
Subject: Re: kasan behavior when built with unsupported compiler

On 03/07/2017 06:35 PM, Nikolay Borisov wrote:
> Hello,
>
> I've been chasing a particular UAF as reported by kasan
> (https://www.spinics.net/lists/kernel/msg2458136.html).

AFAICS it's not use after free, it's out-of-bounds access.

> However, one
> thing which I took notice of rather lately is that I was building my
> kernel with gcc 4.7.4 which is not supported by kasan as indicated by
> the following string:
>
> scripts/Makefile.kasan:19: Cannot use CONFIG_KASAN:
> -fsanitize=kernel-address is not supported by compiler
>
>
> Nevertheless, the kernel compiles and when I boot it I see the kasan
> splats as per the referenced thread.

With gcc like 4.7.4 compiler will not instrument memory accesses in code, which means
that kasan will not detect bugs. However, in your case, access is done via copy_to_user().

Compiler doesn't see accesses in copy_to_user() because it's written in assembly, thus
we added manual check in commit 1771c6e1a567ea0ba2cccc0a4ffe68a1419fd8ef.
Of course this manual check will work with any compiler version.


> If, however, I build the kernel
> with a newer compiler version 5.4.0 kasan no longer complains.
>

That's odd. Perhaps we have some tricky bug triggered by gcc 4.7.4 code generation
or it might be some bug in gcc. But it's just a blind guess, it's hard to say anything
for sure without further analysis.

>
> At this point I'm wondering whether the splats can be due to old
> compiler being used e.g. false positives or are they genuine splats and
> gcc 5 somehow obfuscates them ? Clearly despite the warning about not
> being able to use CONFIG_KASAN it is still working since I'm seeing the
> splats. Is this valid behavior ?
>

So with GCCs that doesn't support kasan we shouldn't have any false-positives,
but it's expected to have a lot of false-negatives (missed bugs).



>
> Regards,
> Nikolay
>

2017-03-07 17:48:14

by Dmitry Vyukov

[permalink] [raw]
Subject: Re: kasan behavior when built with unsupported compiler

On Tue, Mar 7, 2017 at 5:26 PM, Andrey Ryabinin <[email protected]> wrote:
>
>
> On 03/07/2017 06:54 PM, Dmitry Vyukov wrote:
>
>>
>> Hi,
>>
>> Re the message that kasan is not supported while it's still enabled in the end.
>> I think it's an issue related to gcc plugins. Originally kasan was
>> supported with 5.0+ thus the message. However, later we extended this
>> support to 4.5+ with gcc plugins.
>
> You made this up, we don't have such plugin.

Right. It was KCOV.

2017-03-07 18:01:40

by Nikolay Borisov

[permalink] [raw]
Subject: Re: kasan behavior when built with unsupported compiler



On 7.03.2017 18:05, Alexander Potapenko wrote:
> On Tue, Mar 7, 2017 at 4:54 PM, Dmitry Vyukov <[email protected]> wrote:
>> On Tue, Mar 7, 2017 at 4:35 PM, Nikolay Borisov
>> <[email protected]> wrote:
>>> Hello,
>>>
>>> I've been chasing a particular UAF as reported by kasan
>>> (https://www.spinics.net/lists/kernel/msg2458136.html). However, one
>>> thing which I took notice of rather lately is that I was building my
>>> kernel with gcc 4.7.4 which is not supported by kasan as indicated by
>>> the following string:
>>>
>>> scripts/Makefile.kasan:19: Cannot use CONFIG_KASAN:
>>> -fsanitize=kernel-address is not supported by compiler
>>>
>>>
>>> Nevertheless, the kernel compiles and when I boot it I see the kasan
>>> splats as per the referenced thread. If, however, I build the kernel
>>> with a newer compiler version 5.4.0 kasan no longer complains.
>>>
>>>
>>> At this point I'm wondering whether the splats can be due to old
>>> compiler being used e.g. false positives or are they genuine splats and
>>> gcc 5 somehow obfuscates them ? Clearly despite the warning about not
>>> being able to use CONFIG_KASAN it is still working since I'm seeing the
>>> splats. Is this valid behavior ?
>>
>>
>> Hi,
>>
>> Re the message that kasan is not supported while it's still enabled in the end.
>> I think it's an issue related to gcc plugins. Originally kasan was
>> supported with 5.0+ thus the message. However, later we extended this
>> support to 4.5+ with gcc plugins. However, that confusing message from
>> build system was not fixed. So yes, it's confusing and it's something
>> to fix, but mostly you can just ignore it.
>>
>> Re false positives with 4.7. By default I would assume that it is true
>> positive. Should be easy to check with manual printfs.
>>
>> Re why 5.4 does not detect it. Difficult to say.
>> If you confirm that it's a real bug and provide repro instructions,
>> then I can recheck it with latest gcc. If it's a real bug and the
>> latest gcc does not detect it, then we need to look more closely at
>> it. I afraid 5.4 won't be fixed.
>> It's also possible that it's a false positive in the old compiler (I
>> think there were some bugs). If so, I would recommend switching to a
>> newer compiler.
>
> I wonder if there's actual KASAN instrumentation in the kernel in
> question built with GCC 4.7.
> If there's none, there's little point in investigating this further,
> as the tool is anyway barely usable.
> Note that the report originates from something like copy_to_user() (or
> hard to tell the exact place - Nikolay, could you please symbolize the
> report?), i.e. it could be triggered even without KASAN
> instrumentation.

Of course there is kasan instrumentation, otherwise I won't see kasan reports, no ?
I bisected this to commit 1771c6e1a567ea0ba2cc which adds user memory access API
instrumentation. Here is a symbolized report:

==================================================================
BUG: KASAN: slab-out-of-bounds in filldir+0xc8/0x170 at addr ffff88006a22560e
Read of size 20 by task systemd/1
=============================================================================
BUG kmalloc-96 (Not tainted): kasan: bad access detected
-----------------------------------------------------------------------------

Disabling lock debugging due to kernel taint
INFO: Allocated in ext4_htree_store_dirent+0x3e/0x120 age=0 cpu=2 pid=1
[< none >] ___slab_alloc+0x636/0x6a0 mm/slub.c:2446
[< none >] __slab_alloc+0x4f/0x86 mm/slub.c:2475
[< inline >] slab_alloc_node mm/slub.c:2538
[< inline >] slab_alloc mm/slub.c:2580
[< none >] __kmalloc+0x27a/0x340 mm/slub.c:3561
[< inline >] kmalloc include/linux/slab.h:483
[< inline >] kzalloc include/linux/slab.h:622
[< none >] ext4_htree_store_dirent+0x3e/0x120 fs/ext4/dir.c:447
[< none >] htree_dirblock_to_tree+0x16a/0x190 fs/ext4/namei.c:1001
[< none >] ext4_htree_fill_tree+0xaa/0x310 fs/ext4/namei.c:1075
[< inline >] ext4_dx_readdir fs/ext4/dir.c:571
[< none >] ext4_readdir+0x698/0x950 fs/ext4/dir.c:121
[< none >] iterate_dir+0x7d/0x190 fs/readdir.c:50
[< inline >] SYSC_getdents fs/readdir.c:230
[< none >] SyS_getdents+0x91/0x120 fs/readdir.c:212
[< none >] entry_SYSCALL_64_fastpath+0x23/0xc1 arch/x86/entry/entry_64.S:207

INFO: Freed in ext4_ext_map_blocks+0x7f9/0x23e0 age=1 cpu=2 pid=1
[< none >] __slab_free+0x31b/0x440 mm/slub.c:2657
[< inline >] slab_free mm/slub.c:2810
[< none >] kfree+0x27f/0x2d0 mm/slub.c:3662
[< none >] ext4_ext_map_blocks+0x7f9/0x23e0 fs/ext4/extents.c:4619
[< none >] ext4_map_blocks+0x3b4/0x5b0 fs/ext4/inode.c:529
[< none >] ext4_getblk+0x54/0x1a0 fs/ext4/inode.c:929
[< none >] ext4_bread+0x13/0x90 fs/ext4/inode.c:979
[< none >] __ext4_read_dirblock+0x3f/0x380 fs/ext4/namei.c:99
[< none >] htree_dirblock_to_tree+0x48/0x190 fs/ext4/namei.c:959
[< none >] ext4_htree_fill_tree+0xaa/0x310 fs/ext4/namei.c:1075
[< inline >] ext4_dx_readdir fs/ext4/dir.c:571
[< none >] ext4_readdir+0x698/0x950 fs/ext4/dir.c:121
[< none >] iterate_dir+0x7d/0x190 fs/readdir.c:50
[< inline >] SYSC_getdents fs/readdir.c:230
[< none >] SyS_getdents+0x91/0x120 fs/readdir.c:212
[< none >] entry_SYSCALL_64_fastpath+0x23/0xc1 arch/x86/entry/entry_64.S:207
INFO: Slab 0xffffea0001a88900 objects=20 used=17 fp=0xffff88006a224e10 flags=0x4080
INFO: Object 0xffff88006a2255e0 @offset=5600 fp=0x45b282a2484c60d4

Bytes b4 ffff88006a2255d0: 02 00 00 00 01 00 00 00 c9 ac fb ff 00 00 00 00 ................
Object ffff88006a2255e0: d4 60 4c 48 a2 82 b2 45 18 8a 82 6a 00 88 ff ff .`LH...E...j....
Object ffff88006a2255f0: 38 51 22 6a 00 88 ff ff 88 8b 82 6a 00 88 ff ff 8Q"j.......j....
Object ffff88006a225600: 00 00 00 00 00 00 00 00 28 03 08 00 14 01 66 62 ........(.....fb
Object ffff88006a225610: 64 65 76 2d 62 6c 61 63 6b 6c 69 73 74 2e 63 6f dev-blacklist.co
Object ffff88006a225620: 6e 66 00 00 00 00 00 00 00 00 00 00 00 00 00 00 nf..............
Object ffff88006a225630: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
CPU: 2 PID: 1 Comm: systemd Tainted: G B 4.7.0-nbor #171
Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS Ubuntu-1.8.2-1ubuntu1 04/01/2014
0000000000000000 ffff88006cd97c58 ffffffff8146bd4c ffff8800000946c0
ffff88006a2255e0 ffff88006cd97c88 ffffffff81198d96 ffff8800000946c0
ffffea0001a88900 ffff88006a2255e0 0000000000000000 ffff88006cd97cb0
Call Trace:
[< inline >] __dump_stack lib/dump_stack.c:15
[<ffffffff8146bd4c>] dump_stack+0x85/0xc9 lib/dump_stack.c:51
[<ffffffff81198d96>] print_trailer+0x116/0x190 mm/slub.c:667
[<ffffffff811992c1>] object_err+0x41/0x50 mm/slub.c:674
[< inline >] print_address_description mm/kasan/report.c:180
[< inline >] kasan_report_error mm/kasan/report.c:276
[<ffffffff811a0a42>] kasan_report+0x282/0x530 mm/kasan/report.c:298
[< inline >] check_memory_region_inline mm/kasan/kasan.c:292
[<ffffffff8119ffa7>] check_memory_region+0x137/0x160 mm/kasan/kasan.c:299
[<ffffffff811a0041>] kasan_check_read+0x11/0x20 mm/kasan/kasan.c:304
[< inline >] copy_to_user ./arch/x86/include/asm/uaccess.h:760
[<ffffffff811ccc08>] filldir+0xc8/0x170 fs/readdir.c:195
[< inline >] dir_emit include/linux/fs.h:3134
[<ffffffff8124af38>] call_filldir+0x88/0x140 fs/ext4/dir.c:510
[< inline >] ext4_dx_readdir fs/ext4/dir.c:586
[<ffffffff8124b934>] ext4_readdir+0x714/0x950 fs/ext4/dir.c:121
[<ffffffff811ccd2d>] iterate_dir+0x7d/0x190 fs/readdir.c:50
[< inline >] SYSC_getdents fs/readdir.c:230
[<ffffffff811ccf71>] SyS_getdents+0x91/0x120 fs/readdir.c:212
[<ffffffff816d7d80>] entry_SYSCALL_64_fastpath+0x23/0xc1 arch/x86/entry/entry_64.S:207
Memory state around the buggy address:
ffff88006a225500: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
ffff88006a225580: fc fc fc fc fc fc fc fc fc fc fc fc 00 00 00 00
>ffff88006a225600: 00 00 00 00 05 fc fc fc fc fc fc fc fc fc fc fc
^
ffff88006a225680: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
ffff88006a225700: fc fc fc fc fc fc fc fc fc fc fc fc fc fc 00 00

2017-03-07 18:54:44

by Alexander Potapenko

[permalink] [raw]
Subject: Re: kasan behavior when built with unsupported compiler

On Tue, Mar 7, 2017 at 6:33 PM, Nikolay Borisov
<[email protected]> wrote:
>
>
> On 7.03.2017 18:05, Alexander Potapenko wrote:
>> On Tue, Mar 7, 2017 at 4:54 PM, Dmitry Vyukov <[email protected]> wrote:
>>> On Tue, Mar 7, 2017 at 4:35 PM, Nikolay Borisov
>>> <[email protected]> wrote:
>>>> Hello,
>>>>
>>>> I've been chasing a particular UAF as reported by kasan
>>>> (https://www.spinics.net/lists/kernel/msg2458136.html). However, one
>>>> thing which I took notice of rather lately is that I was building my
>>>> kernel with gcc 4.7.4 which is not supported by kasan as indicated by
>>>> the following string:
>>>>
>>>> scripts/Makefile.kasan:19: Cannot use CONFIG_KASAN:
>>>> -fsanitize=kernel-address is not supported by compiler
>>>>
>>>>
>>>> Nevertheless, the kernel compiles and when I boot it I see the kasan
>>>> splats as per the referenced thread. If, however, I build the kernel
>>>> with a newer compiler version 5.4.0 kasan no longer complains.
>>>>
>>>>
>>>> At this point I'm wondering whether the splats can be due to old
>>>> compiler being used e.g. false positives or are they genuine splats and
>>>> gcc 5 somehow obfuscates them ? Clearly despite the warning about not
>>>> being able to use CONFIG_KASAN it is still working since I'm seeing the
>>>> splats. Is this valid behavior ?
>>>
>>>
>>> Hi,
>>>
>>> Re the message that kasan is not supported while it's still enabled in the end.
>>> I think it's an issue related to gcc plugins. Originally kasan was
>>> supported with 5.0+ thus the message. However, later we extended this
>>> support to 4.5+ with gcc plugins. However, that confusing message from
>>> build system was not fixed. So yes, it's confusing and it's something
>>> to fix, but mostly you can just ignore it.
>>>
>>> Re false positives with 4.7. By default I would assume that it is true
>>> positive. Should be easy to check with manual printfs.
>>>
>>> Re why 5.4 does not detect it. Difficult to say.
>>> If you confirm that it's a real bug and provide repro instructions,
>>> then I can recheck it with latest gcc. If it's a real bug and the
>>> latest gcc does not detect it, then we need to look more closely at
>>> it. I afraid 5.4 won't be fixed.
>>> It's also possible that it's a false positive in the old compiler (I
>>> think there were some bugs). If so, I would recommend switching to a
>>> newer compiler.
>>
>> I wonder if there's actual KASAN instrumentation in the kernel in
>> question built with GCC 4.7.
>> If there's none, there's little point in investigating this further,
>> as the tool is anyway barely usable.
>> Note that the report originates from something like copy_to_user() (or
>> hard to tell the exact place - Nikolay, could you please symbolize the
>> report?), i.e. it could be triggered even without KASAN
>> instrumentation.
>
> Of course there is kasan instrumentation, otherwise I won't see kasan reports, no ?
Not necessarily.
There's KASAN instrumentation inserted by the compiler, and KASAN
instrumentation added manually to the places the compiler can't
instrument.
> I bisected this to commit 1771c6e1a567ea0ba2cc which adds user memory access API
Commit 1771c6e1a567ea0ba2cc added exactly these calls to
check_memory_region() you are seeing.
If there is any other instrumentation inserted by the compiler, it's
possible that it may catch accesses to the same object (if something
else except copy_to_user() is being done).
Otherwise the only thing you can do to investigate this bug with GCC
4.7 is to bisect further by rolling to earlier revisions and applying
1771c6e1a567ea0ba2cc on top of them.
I won't be surprised though if at some point the bisection may stop
for a different reason.
> instrumentation. Here is a symbolized report:
>
> ==================================================================
> BUG: KASAN: slab-out-of-bounds in filldir+0xc8/0x170 at addr ffff88006a22560e
> Read of size 20 by task systemd/1
> =============================================================================
> BUG kmalloc-96 (Not tainted): kasan: bad access detected
> -----------------------------------------------------------------------------
>
> Disabling lock debugging due to kernel taint
> INFO: Allocated in ext4_htree_store_dirent+0x3e/0x120 age=0 cpu=2 pid=1
> [< none >] ___slab_alloc+0x636/0x6a0 mm/slub.c:2446
> [< none >] __slab_alloc+0x4f/0x86 mm/slub.c:2475
> [< inline >] slab_alloc_node mm/slub.c:2538
> [< inline >] slab_alloc mm/slub.c:2580
> [< none >] __kmalloc+0x27a/0x340 mm/slub.c:3561
> [< inline >] kmalloc include/linux/slab.h:483
> [< inline >] kzalloc include/linux/slab.h:622
> [< none >] ext4_htree_store_dirent+0x3e/0x120 fs/ext4/dir.c:447
> [< none >] htree_dirblock_to_tree+0x16a/0x190 fs/ext4/namei.c:1001
> [< none >] ext4_htree_fill_tree+0xaa/0x310 fs/ext4/namei.c:1075
> [< inline >] ext4_dx_readdir fs/ext4/dir.c:571
> [< none >] ext4_readdir+0x698/0x950 fs/ext4/dir.c:121
> [< none >] iterate_dir+0x7d/0x190 fs/readdir.c:50
> [< inline >] SYSC_getdents fs/readdir.c:230
> [< none >] SyS_getdents+0x91/0x120 fs/readdir.c:212
> [< none >] entry_SYSCALL_64_fastpath+0x23/0xc1 arch/x86/entry/entry_64.S:207
>
> INFO: Freed in ext4_ext_map_blocks+0x7f9/0x23e0 age=1 cpu=2 pid=1
> [< none >] __slab_free+0x31b/0x440 mm/slub.c:2657
> [< inline >] slab_free mm/slub.c:2810
> [< none >] kfree+0x27f/0x2d0 mm/slub.c:3662
> [< none >] ext4_ext_map_blocks+0x7f9/0x23e0 fs/ext4/extents.c:4619
> [< none >] ext4_map_blocks+0x3b4/0x5b0 fs/ext4/inode.c:529
> [< none >] ext4_getblk+0x54/0x1a0 fs/ext4/inode.c:929
> [< none >] ext4_bread+0x13/0x90 fs/ext4/inode.c:979
> [< none >] __ext4_read_dirblock+0x3f/0x380 fs/ext4/namei.c:99
> [< none >] htree_dirblock_to_tree+0x48/0x190 fs/ext4/namei.c:959
> [< none >] ext4_htree_fill_tree+0xaa/0x310 fs/ext4/namei.c:1075
> [< inline >] ext4_dx_readdir fs/ext4/dir.c:571
> [< none >] ext4_readdir+0x698/0x950 fs/ext4/dir.c:121
> [< none >] iterate_dir+0x7d/0x190 fs/readdir.c:50
> [< inline >] SYSC_getdents fs/readdir.c:230
> [< none >] SyS_getdents+0x91/0x120 fs/readdir.c:212
> [< none >] entry_SYSCALL_64_fastpath+0x23/0xc1 arch/x86/entry/entry_64.S:207
> INFO: Slab 0xffffea0001a88900 objects=20 used=17 fp=0xffff88006a224e10 flags=0x4080
> INFO: Object 0xffff88006a2255e0 @offset=5600 fp=0x45b282a2484c60d4
>
> Bytes b4 ffff88006a2255d0: 02 00 00 00 01 00 00 00 c9 ac fb ff 00 00 00 00 ................
> Object ffff88006a2255e0: d4 60 4c 48 a2 82 b2 45 18 8a 82 6a 00 88 ff ff .`LH...E...j....
> Object ffff88006a2255f0: 38 51 22 6a 00 88 ff ff 88 8b 82 6a 00 88 ff ff 8Q"j.......j....
> Object ffff88006a225600: 00 00 00 00 00 00 00 00 28 03 08 00 14 01 66 62 ........(.....fb
> Object ffff88006a225610: 64 65 76 2d 62 6c 61 63 6b 6c 69 73 74 2e 63 6f dev-blacklist.co
> Object ffff88006a225620: 6e 66 00 00 00 00 00 00 00 00 00 00 00 00 00 00 nf..............
> Object ffff88006a225630: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
> CPU: 2 PID: 1 Comm: systemd Tainted: G B 4.7.0-nbor #171
> Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS Ubuntu-1.8.2-1ubuntu1 04/01/2014
> 0000000000000000 ffff88006cd97c58 ffffffff8146bd4c ffff8800000946c0
> ffff88006a2255e0 ffff88006cd97c88 ffffffff81198d96 ffff8800000946c0
> ffffea0001a88900 ffff88006a2255e0 0000000000000000 ffff88006cd97cb0
> Call Trace:
> [< inline >] __dump_stack lib/dump_stack.c:15
> [<ffffffff8146bd4c>] dump_stack+0x85/0xc9 lib/dump_stack.c:51
> [<ffffffff81198d96>] print_trailer+0x116/0x190 mm/slub.c:667
> [<ffffffff811992c1>] object_err+0x41/0x50 mm/slub.c:674
> [< inline >] print_address_description mm/kasan/report.c:180
> [< inline >] kasan_report_error mm/kasan/report.c:276
> [<ffffffff811a0a42>] kasan_report+0x282/0x530 mm/kasan/report.c:298
> [< inline >] check_memory_region_inline mm/kasan/kasan.c:292
> [<ffffffff8119ffa7>] check_memory_region+0x137/0x160 mm/kasan/kasan.c:299
> [<ffffffff811a0041>] kasan_check_read+0x11/0x20 mm/kasan/kasan.c:304
> [< inline >] copy_to_user ./arch/x86/include/asm/uaccess.h:760
> [<ffffffff811ccc08>] filldir+0xc8/0x170 fs/readdir.c:195
> [< inline >] dir_emit include/linux/fs.h:3134
> [<ffffffff8124af38>] call_filldir+0x88/0x140 fs/ext4/dir.c:510
> [< inline >] ext4_dx_readdir fs/ext4/dir.c:586
> [<ffffffff8124b934>] ext4_readdir+0x714/0x950 fs/ext4/dir.c:121
> [<ffffffff811ccd2d>] iterate_dir+0x7d/0x190 fs/readdir.c:50
> [< inline >] SYSC_getdents fs/readdir.c:230
> [<ffffffff811ccf71>] SyS_getdents+0x91/0x120 fs/readdir.c:212
> [<ffffffff816d7d80>] entry_SYSCALL_64_fastpath+0x23/0xc1 arch/x86/entry/entry_64.S:207
> Memory state around the buggy address:
> ffff88006a225500: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
> ffff88006a225580: fc fc fc fc fc fc fc fc fc fc fc fc 00 00 00 00
>>ffff88006a225600: 00 00 00 00 05 fc fc fc fc fc fc fc fc fc fc fc
> ^
> ffff88006a225680: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
> ffff88006a225700: fc fc fc fc fc fc fc fc fc fc fc fc fc fc 00 00

I think I've seen relevant reports on the Chrome OS 3.14/3.18 kernel
with Clang and GCC 6.0.0.
I never managed to reproduce those reliably, they would always go away
every time I tried to catch them.
You can try enabling some of the ext4 debug checks, but I still think
it won't help much as long as you're using an unsupported compiler.


--
Alexander Potapenko
Software Engineer

Google Germany GmbH
Erika-Mann-Straße, 33
80636 München

Geschäftsführer: Matthew Scott Sucherman, Paul Terence Manicle
Registergericht und -nummer: Hamburg, HRB 86891
Sitz der Gesellschaft: Hamburg

2017-03-07 20:24:49

by Nikolay Borisov

[permalink] [raw]
Subject: Re: kasan behavior when built with unsupported compiler



On 7.03.2017 19:51, Alexander Potapenko wrote:
> On Tue, Mar 7, 2017 at 6:33 PM, Nikolay Borisov
> <[email protected]> wrote:
>>
>>
>> On 7.03.2017 18:05, Alexander Potapenko wrote:
>>> On Tue, Mar 7, 2017 at 4:54 PM, Dmitry Vyukov <[email protected]> wrote:
>>>> On Tue, Mar 7, 2017 at 4:35 PM, Nikolay Borisov
>>>> <[email protected]> wrote:
>>>>> Hello,
>>>>>
>>>>> I've been chasing a particular UAF as reported by kasan
>>>>> (https://www.spinics.net/lists/kernel/msg2458136.html). However, one
>>>>> thing which I took notice of rather lately is that I was building my
>>>>> kernel with gcc 4.7.4 which is not supported by kasan as indicated by
>>>>> the following string:
>>>>>
>>>>> scripts/Makefile.kasan:19: Cannot use CONFIG_KASAN:
>>>>> -fsanitize=kernel-address is not supported by compiler
>>>>>
>>>>>
>>>>> Nevertheless, the kernel compiles and when I boot it I see the kasan
>>>>> splats as per the referenced thread. If, however, I build the kernel
>>>>> with a newer compiler version 5.4.0 kasan no longer complains.
>>>>>
>>>>>
>>>>> At this point I'm wondering whether the splats can be due to old
>>>>> compiler being used e.g. false positives or are they genuine splats and
>>>>> gcc 5 somehow obfuscates them ? Clearly despite the warning about not
>>>>> being able to use CONFIG_KASAN it is still working since I'm seeing the
>>>>> splats. Is this valid behavior ?
>>>>
>>>>
>>>> Hi,
>>>>
>>>> Re the message that kasan is not supported while it's still enabled in the end.
>>>> I think it's an issue related to gcc plugins. Originally kasan was
>>>> supported with 5.0+ thus the message. However, later we extended this
>>>> support to 4.5+ with gcc plugins. However, that confusing message from
>>>> build system was not fixed. So yes, it's confusing and it's something
>>>> to fix, but mostly you can just ignore it.
>>>>
>>>> Re false positives with 4.7. By default I would assume that it is true
>>>> positive. Should be easy to check with manual printfs.
>>>>
>>>> Re why 5.4 does not detect it. Difficult to say.
>>>> If you confirm that it's a real bug and provide repro instructions,
>>>> then I can recheck it with latest gcc. If it's a real bug and the
>>>> latest gcc does not detect it, then we need to look more closely at
>>>> it. I afraid 5.4 won't be fixed.
>>>> It's also possible that it's a false positive in the old compiler (I
>>>> think there were some bugs). If so, I would recommend switching to a
>>>> newer compiler.
>>>
>>> I wonder if there's actual KASAN instrumentation in the kernel in
>>> question built with GCC 4.7.
>>> If there's none, there's little point in investigating this further,
>>> as the tool is anyway barely usable.
>>> Note that the report originates from something like copy_to_user() (or
>>> hard to tell the exact place - Nikolay, could you please symbolize the
>>> report?), i.e. it could be triggered even without KASAN
>>> instrumentation.
>>
>> Of course there is kasan instrumentation, otherwise I won't see kasan reports, no ?
> Not necessarily.
> There's KASAN instrumentation inserted by the compiler, and KASAN
> instrumentation added manually to the places the compiler can't
> instrument.
>> I bisected this to commit 1771c6e1a567ea0ba2cc which adds user memory access API
> Commit 1771c6e1a567ea0ba2cc added exactly these calls to
> check_memory_region() you are seeing.
> If there is any other instrumentation inserted by the compiler, it's
> possible that it may catch accesses to the same object (if something
> else except copy_to_user() is being done).
> Otherwise the only thing you can do to investigate this bug with GCC
> 4.7 is to bisect further by rolling to earlier revisions and applying
> 1771c6e1a567ea0ba2cc on top of them.
> I won't be surprised though if at some point the bisection may stop
> for a different reason.
>> instrumentation. Here is a symbolized report:
>>
>> ==================================================================
>> BUG: KASAN: slab-out-of-bounds in filldir+0xc8/0x170 at addr ffff88006a22560e
>> Read of size 20 by task systemd/1
>> =============================================================================
>> BUG kmalloc-96 (Not tainted): kasan: bad access detected
>> -----------------------------------------------------------------------------
>>
>> Disabling lock debugging due to kernel taint
>> INFO: Allocated in ext4_htree_store_dirent+0x3e/0x120 age=0 cpu=2 pid=1
>> [< none >] ___slab_alloc+0x636/0x6a0 mm/slub.c:2446
>> [< none >] __slab_alloc+0x4f/0x86 mm/slub.c:2475
>> [< inline >] slab_alloc_node mm/slub.c:2538
>> [< inline >] slab_alloc mm/slub.c:2580
>> [< none >] __kmalloc+0x27a/0x340 mm/slub.c:3561
>> [< inline >] kmalloc include/linux/slab.h:483
>> [< inline >] kzalloc include/linux/slab.h:622
>> [< none >] ext4_htree_store_dirent+0x3e/0x120 fs/ext4/dir.c:447
>> [< none >] htree_dirblock_to_tree+0x16a/0x190 fs/ext4/namei.c:1001
>> [< none >] ext4_htree_fill_tree+0xaa/0x310 fs/ext4/namei.c:1075
>> [< inline >] ext4_dx_readdir fs/ext4/dir.c:571
>> [< none >] ext4_readdir+0x698/0x950 fs/ext4/dir.c:121
>> [< none >] iterate_dir+0x7d/0x190 fs/readdir.c:50
>> [< inline >] SYSC_getdents fs/readdir.c:230
>> [< none >] SyS_getdents+0x91/0x120 fs/readdir.c:212
>> [< none >] entry_SYSCALL_64_fastpath+0x23/0xc1 arch/x86/entry/entry_64.S:207
>>
>> INFO: Freed in ext4_ext_map_blocks+0x7f9/0x23e0 age=1 cpu=2 pid=1
>> [< none >] __slab_free+0x31b/0x440 mm/slub.c:2657
>> [< inline >] slab_free mm/slub.c:2810
>> [< none >] kfree+0x27f/0x2d0 mm/slub.c:3662
>> [< none >] ext4_ext_map_blocks+0x7f9/0x23e0 fs/ext4/extents.c:4619
>> [< none >] ext4_map_blocks+0x3b4/0x5b0 fs/ext4/inode.c:529
>> [< none >] ext4_getblk+0x54/0x1a0 fs/ext4/inode.c:929
>> [< none >] ext4_bread+0x13/0x90 fs/ext4/inode.c:979
>> [< none >] __ext4_read_dirblock+0x3f/0x380 fs/ext4/namei.c:99
>> [< none >] htree_dirblock_to_tree+0x48/0x190 fs/ext4/namei.c:959
>> [< none >] ext4_htree_fill_tree+0xaa/0x310 fs/ext4/namei.c:1075
>> [< inline >] ext4_dx_readdir fs/ext4/dir.c:571
>> [< none >] ext4_readdir+0x698/0x950 fs/ext4/dir.c:121
>> [< none >] iterate_dir+0x7d/0x190 fs/readdir.c:50
>> [< inline >] SYSC_getdents fs/readdir.c:230
>> [< none >] SyS_getdents+0x91/0x120 fs/readdir.c:212
>> [< none >] entry_SYSCALL_64_fastpath+0x23/0xc1 arch/x86/entry/entry_64.S:207
>> INFO: Slab 0xffffea0001a88900 objects=20 used=17 fp=0xffff88006a224e10 flags=0x4080
>> INFO: Object 0xffff88006a2255e0 @offset=5600 fp=0x45b282a2484c60d4
>>
>> Bytes b4 ffff88006a2255d0: 02 00 00 00 01 00 00 00 c9 ac fb ff 00 00 00 00 ................
>> Object ffff88006a2255e0: d4 60 4c 48 a2 82 b2 45 18 8a 82 6a 00 88 ff ff .`LH...E...j....
>> Object ffff88006a2255f0: 38 51 22 6a 00 88 ff ff 88 8b 82 6a 00 88 ff ff 8Q"j.......j....
>> Object ffff88006a225600: 00 00 00 00 00 00 00 00 28 03 08 00 14 01 66 62 ........(.....fb
>> Object ffff88006a225610: 64 65 76 2d 62 6c 61 63 6b 6c 69 73 74 2e 63 6f dev-blacklist.co
>> Object ffff88006a225620: 6e 66 00 00 00 00 00 00 00 00 00 00 00 00 00 00 nf..............
>> Object ffff88006a225630: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
>> CPU: 2 PID: 1 Comm: systemd Tainted: G B 4.7.0-nbor #171
>> Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS Ubuntu-1.8.2-1ubuntu1 04/01/2014
>> 0000000000000000 ffff88006cd97c58 ffffffff8146bd4c ffff8800000946c0
>> ffff88006a2255e0 ffff88006cd97c88 ffffffff81198d96 ffff8800000946c0
>> ffffea0001a88900 ffff88006a2255e0 0000000000000000 ffff88006cd97cb0
>> Call Trace:
>> [< inline >] __dump_stack lib/dump_stack.c:15
>> [<ffffffff8146bd4c>] dump_stack+0x85/0xc9 lib/dump_stack.c:51
>> [<ffffffff81198d96>] print_trailer+0x116/0x190 mm/slub.c:667
>> [<ffffffff811992c1>] object_err+0x41/0x50 mm/slub.c:674
>> [< inline >] print_address_description mm/kasan/report.c:180
>> [< inline >] kasan_report_error mm/kasan/report.c:276
>> [<ffffffff811a0a42>] kasan_report+0x282/0x530 mm/kasan/report.c:298
>> [< inline >] check_memory_region_inline mm/kasan/kasan.c:292
>> [<ffffffff8119ffa7>] check_memory_region+0x137/0x160 mm/kasan/kasan.c:299
>> [<ffffffff811a0041>] kasan_check_read+0x11/0x20 mm/kasan/kasan.c:304
>> [< inline >] copy_to_user ./arch/x86/include/asm/uaccess.h:760
>> [<ffffffff811ccc08>] filldir+0xc8/0x170 fs/readdir.c:195
>> [< inline >] dir_emit include/linux/fs.h:3134
>> [<ffffffff8124af38>] call_filldir+0x88/0x140 fs/ext4/dir.c:510
>> [< inline >] ext4_dx_readdir fs/ext4/dir.c:586
>> [<ffffffff8124b934>] ext4_readdir+0x714/0x950 fs/ext4/dir.c:121
>> [<ffffffff811ccd2d>] iterate_dir+0x7d/0x190 fs/readdir.c:50
>> [< inline >] SYSC_getdents fs/readdir.c:230
>> [<ffffffff811ccf71>] SyS_getdents+0x91/0x120 fs/readdir.c:212
>> [<ffffffff816d7d80>] entry_SYSCALL_64_fastpath+0x23/0xc1 arch/x86/entry/entry_64.S:207
>> Memory state around the buggy address:
>> ffff88006a225500: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
>> ffff88006a225580: fc fc fc fc fc fc fc fc fc fc fc fc 00 00 00 00
>>> ffff88006a225600: 00 00 00 00 05 fc fc fc fc fc fc fc fc fc fc fc
>> ^
>> ffff88006a225680: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
>> ffff88006a225700: fc fc fc fc fc fc fc fc fc fc fc fc fc fc 00 00
>
> I think I've seen relevant reports on the Chrome OS 3.14/3.18 kernel
> with Clang and GCC 6.0.0.
> I never managed to reproduce those reliably, they would always go away
> every time I tried to catch them.
> You can try enabling some of the ext4 debug checks, but I still think
> it won't help much as long as you're using an unsupported compiler.
>
>


I'm also attaching other kasan reports that follow the first one from
the same boot. It's very strange since there are buffer allocated in
ext4 readdir and freed even in the virtio buffer management code. That's
also rather odd. I also backported 64f8ebaf115bcddc4 and
1771c6e1a567ea0 to 4.6 and still see similar splats. Tried applying
those patches to 4.5 however, it seems there are other pieces of code
missing to make kasan work for that version.


Attachments:
kasan_symbolized.txt (37.83 kB)

2017-03-08 08:18:22

by Nikolay Borisov

[permalink] [raw]
Subject: Re: kasan behavior when built with unsupported compiler



On 7.03.2017 17:54, Dmitry Vyukov wrote:
> On Tue, Mar 7, 2017 at 4:35 PM, Nikolay Borisov
> <[email protected]> wrote:
>> Hello,
>>
>> I've been chasing a particular UAF as reported by kasan
>> (https://www.spinics.net/lists/kernel/msg2458136.html). However, one
>> thing which I took notice of rather lately is that I was building my
>> kernel with gcc 4.7.4 which is not supported by kasan as indicated by
>> the following string:
>>
>> scripts/Makefile.kasan:19: Cannot use CONFIG_KASAN:
>> -fsanitize=kernel-address is not supported by compiler
>>
>>
>> Nevertheless, the kernel compiles and when I boot it I see the kasan
>> splats as per the referenced thread. If, however, I build the kernel
>> with a newer compiler version 5.4.0 kasan no longer complains.
>>
>>
>> At this point I'm wondering whether the splats can be due to old
>> compiler being used e.g. false positives or are they genuine splats and
>> gcc 5 somehow obfuscates them ? Clearly despite the warning about not
>> being able to use CONFIG_KASAN it is still working since I'm seeing the
>> splats. Is this valid behavior ?
>
>
> Hi,
>
> Re the message that kasan is not supported while it's still enabled in the end.
> I think it's an issue related to gcc plugins. Originally kasan was
> supported with 5.0+ thus the message. However, later we extended this
> support to 4.5+ with gcc plugins. However, that confusing message from
> build system was not fixed. So yes, it's confusing and it's something
> to fix, but mostly you can just ignore it.
>
> Re false positives with 4.7. By default I would assume that it is true
> positive. Should be easy to check with manual printfs.

So apparently this is indeed a false positive, resulting from using the old
compiler. I used the attached patch to verify it.

And what it prints is :
[ 17.184288] Assigned fbdev-blacklist.conff(ffff880001ea8020)20 whole object: ffff88006ae8fdb0 inode:ffff88006bff60d0
[ 17.185808] Calling filldir with ffff88006ae8fdb0

So the first line essentially happens when the object ffff88006ae8fdb0 is
being allocated and the second when it's used in filldir. The warning in
ext4_ext_map_blocks doesn't trigger. However, if I remove the check for
the value of ext4_global_pointer then I see multiple lines such as:
[ 17.386283] ext4_ext_map_blocks:freeing pointer used in ext4_htree_store_dirent: ffff88006ae8fdb0 inode: ffff88006bff60d0
[ 17.387601] Assigned fbdev-blacklist.conff(ffff880001eb3020)20 whole object: ffff88006ae8fdb0 inode:ffff88006bff60d0
[ 17.388740] Calling filldir with ffff88006ae8fdb0

so that same object was used right before it is allocated again in
ext4_htree_store_dirent. And when you think about it it is logical since
before filling in the dentry names in ext4_htree_store_dirent ext4 has to fetch the
contents of the directory from disk.

This leads me to believe that kasan is getting confused thinking that
the object is being freed AFTER being allocated in
ext4_htree_store_dirent but testing shows it's being freed BEFORE. So
I deem this a false positive, triggered by the compiler.



>
> Re why 5.4 does not detect it. Difficult to say.
> If you confirm that it's a real bug and provide repro instructions,
> then I can recheck it with latest gcc. If it's a real bug and the
> latest gcc does not detect it, then we need to look more closely at
> it. I afraid 5.4 won't be fixed.
> It's also possible that it's a false positive in the old compiler (I
> think there were some bugs). If so, I would recommend switching to a
> newer compiler.
>


Attachments:
ext4-debug.patch (2.04 kB)

2017-03-08 12:35:05

by Dmitry Vyukov

[permalink] [raw]
Subject: Re: kasan behavior when built with unsupported compiler

On Wed, Mar 8, 2017 at 9:10 AM, Nikolay Borisov
<[email protected]> wrote:
>
>
> On 7.03.2017 17:54, Dmitry Vyukov wrote:
>> On Tue, Mar 7, 2017 at 4:35 PM, Nikolay Borisov
>> <[email protected]> wrote:
>>> Hello,
>>>
>>> I've been chasing a particular UAF as reported by kasan
>>> (https://www.spinics.net/lists/kernel/msg2458136.html). However, one
>>> thing which I took notice of rather lately is that I was building my
>>> kernel with gcc 4.7.4 which is not supported by kasan as indicated by
>>> the following string:
>>>
>>> scripts/Makefile.kasan:19: Cannot use CONFIG_KASAN:
>>> -fsanitize=kernel-address is not supported by compiler
>>>
>>>
>>> Nevertheless, the kernel compiles and when I boot it I see the kasan
>>> splats as per the referenced thread. If, however, I build the kernel
>>> with a newer compiler version 5.4.0 kasan no longer complains.
>>>
>>>
>>> At this point I'm wondering whether the splats can be due to old
>>> compiler being used e.g. false positives or are they genuine splats and
>>> gcc 5 somehow obfuscates them ? Clearly despite the warning about not
>>> being able to use CONFIG_KASAN it is still working since I'm seeing the
>>> splats. Is this valid behavior ?
>>
>>
>> Hi,
>>
>> Re the message that kasan is not supported while it's still enabled in the end.
>> I think it's an issue related to gcc plugins. Originally kasan was
>> supported with 5.0+ thus the message. However, later we extended this
>> support to 4.5+ with gcc plugins. However, that confusing message from
>> build system was not fixed. So yes, it's confusing and it's something
>> to fix, but mostly you can just ignore it.
>>
>> Re false positives with 4.7. By default I would assume that it is true
>> positive. Should be easy to check with manual printfs.
>
> So apparently this is indeed a false positive, resulting from using the old
> compiler. I used the attached patch to verify it.
>
> And what it prints is :
> [ 17.184288] Assigned fbdev-blacklist.conff(ffff880001ea8020)20 whole object: ffff88006ae8fdb0 inode:ffff88006bff60d0
> [ 17.185808] Calling filldir with ffff88006ae8fdb0
>
> So the first line essentially happens when the object ffff88006ae8fdb0 is
> being allocated and the second when it's used in filldir. The warning in
> ext4_ext_map_blocks doesn't trigger. However, if I remove the check for
> the value of ext4_global_pointer then I see multiple lines such as:
> [ 17.386283] ext4_ext_map_blocks:freeing pointer used in ext4_htree_store_dirent: ffff88006ae8fdb0 inode: ffff88006bff60d0
> [ 17.387601] Assigned fbdev-blacklist.conff(ffff880001eb3020)20 whole object: ffff88006ae8fdb0 inode:ffff88006bff60d0
> [ 17.388740] Calling filldir with ffff88006ae8fdb0
>
> so that same object was used right before it is allocated again in
> ext4_htree_store_dirent. And when you think about it it is logical since
> before filling in the dentry names in ext4_htree_store_dirent ext4 has to fetch the
> contents of the directory from disk.
>
> This leads me to believe that kasan is getting confused thinking that
> the object is being freed AFTER being allocated in
> ext4_htree_store_dirent but testing shows it's being freed BEFORE. So
> I deem this a false positive, triggered by the compiler.


I am not following how compiler version can affect any of this. But if
you say this is a false positive, let it be so. gcc 4.7 is old.

2017-03-09 09:47:44

by Dmitry Vyukov

[permalink] [raw]
Subject: Re: kasan behavior when built with unsupported compiler

On Thu, Mar 9, 2017 at 10:46 AM, Andrey Ryabinin
<[email protected]> wrote:
> On 03/08/2017 11:10 AM, Nikolay Borisov wrote:
>
>>
>> So apparently this is indeed a false positive, resulting from using the old
>> compiler. I used the attached patch to verify it.
>>
>> And what it prints is :
>> [ 17.184288] Assigned fbdev-blacklist.conff(ffff880001ea8020)20 whole object: ffff88006ae8fdb0 inode:ffff88006bff60d0
>> [ 17.185808] Calling filldir with ffff88006ae8fdb0
>>
>> So the first line essentially happens when the object ffff88006ae8fdb0 is
>> being allocated and the second when it's used in filldir. The warning in
>> ext4_ext_map_blocks doesn't trigger. However, if I remove the check for
>> the value of ext4_global_pointer then I see multiple lines such as:
>> [ 17.386283] ext4_ext_map_blocks:freeing pointer used in ext4_htree_store_dirent: ffff88006ae8fdb0 inode: ffff88006bff60d0
>> [ 17.387601] Assigned fbdev-blacklist.conff(ffff880001eb3020)20 whole object: ffff88006ae8fdb0 inode:ffff88006bff60d0
>> [ 17.388740] Calling filldir with ffff88006ae8fdb0
>>
>> so that same object was used right before it is allocated again in
>> ext4_htree_store_dirent. And when you think about it it is logical since
>> before filling in the dentry names in ext4_htree_store_dirent ext4 has to fetch the
>> contents of the directory from disk.
>>
>> This leads me to believe that kasan is getting confused thinking that
>> the object is being freed
>
> As I said before, this is *not* use-after-free. It's out-of-bounds access.
> No, kasan is not confused, it doesn't think that object is freed.
> Object is allocated and kasan see it as allocated object.
> The problem is that filldir reads past the end of that allocated object.
>
> I don't see any sign that it's a false-positive.

Then the question is: why is not it detected with newer compiler?
It's equally strange: all of kmalloc/free/copy_to_user have manual
instrumentation.

2017-03-09 09:58:46

by Nikolay Borisov

[permalink] [raw]
Subject: Re: kasan behavior when built with unsupported compiler



On 9.03.2017 11:46, Andrey Ryabinin wrote:
> On 03/08/2017 11:10 AM, Nikolay Borisov wrote:
>
>>
>> So apparently this is indeed a false positive, resulting from using the old
>> compiler. I used the attached patch to verify it.
>>
>> And what it prints is :
>> [ 17.184288] Assigned fbdev-blacklist.conff(ffff880001ea8020)20 whole object: ffff88006ae8fdb0 inode:ffff88006bff60d0
>> [ 17.185808] Calling filldir with ffff88006ae8fdb0
>>
>> So the first line essentially happens when the object ffff88006ae8fdb0 is
>> being allocated and the second when it's used in filldir. The warning in
>> ext4_ext_map_blocks doesn't trigger. However, if I remove the check for
>> the value of ext4_global_pointer then I see multiple lines such as:
>> [ 17.386283] ext4_ext_map_blocks:freeing pointer used in ext4_htree_store_dirent: ffff88006ae8fdb0 inode: ffff88006bff60d0
>> [ 17.387601] Assigned fbdev-blacklist.conff(ffff880001eb3020)20 whole object: ffff88006ae8fdb0 inode:ffff88006bff60d0
>> [ 17.388740] Calling filldir with ffff88006ae8fdb0
>>
>> so that same object was used right before it is allocated again in
>> ext4_htree_store_dirent. And when you think about it it is logical since
>> before filling in the dentry names in ext4_htree_store_dirent ext4 has to fetch the
>> contents of the directory from disk.
>>
>> This leads me to believe that kasan is getting confused thinking that
>> the object is being freed
>
> As I said before, this is *not* use-after-free. It's out-of-bounds access.
> No, kasan is not confused, it doesn't think that object is freed.
> Object is allocated and kasan see it as allocated object.
> The problem is that filldir reads past the end of that allocated object.
>
> I don't see any sign that it's a false-positive.

Okay in that case I will continue digging. So the name is indeed housed
at the end of the struct fname. In ext4_htree_store_dirent this
structure seems allocated correctly sizeof(struct fname) + ent_name->len
+ 1;

Also the read should indeed be 20 bytes since the filename in question
fbdev-blacklist.conf is indeed 20 bytes. This implies that the 'namlen'
passed to copy_to_user is also correct. I guess I will have to look at
the generated assembly between the 2 gcc versions and see if anything
pops out.

>
>
>> AFTER being allocated in
>> ext4_htree_store_dirent but testing shows it's being freed BEFORE. So
>> I deem this a false positive, triggered by the compiler.
>>
>>
>>

2017-03-09 10:00:24

by Andrey Ryabinin

[permalink] [raw]
Subject: Re: kasan behavior when built with unsupported compiler

On 03/08/2017 11:10 AM, Nikolay Borisov wrote:

>
> So apparently this is indeed a false positive, resulting from using the old
> compiler. I used the attached patch to verify it.
>
> And what it prints is :
> [ 17.184288] Assigned fbdev-blacklist.conff(ffff880001ea8020)20 whole object: ffff88006ae8fdb0 inode:ffff88006bff60d0
> [ 17.185808] Calling filldir with ffff88006ae8fdb0
>
> So the first line essentially happens when the object ffff88006ae8fdb0 is
> being allocated and the second when it's used in filldir. The warning in
> ext4_ext_map_blocks doesn't trigger. However, if I remove the check for
> the value of ext4_global_pointer then I see multiple lines such as:
> [ 17.386283] ext4_ext_map_blocks:freeing pointer used in ext4_htree_store_dirent: ffff88006ae8fdb0 inode: ffff88006bff60d0
> [ 17.387601] Assigned fbdev-blacklist.conff(ffff880001eb3020)20 whole object: ffff88006ae8fdb0 inode:ffff88006bff60d0
> [ 17.388740] Calling filldir with ffff88006ae8fdb0
>
> so that same object was used right before it is allocated again in
> ext4_htree_store_dirent. And when you think about it it is logical since
> before filling in the dentry names in ext4_htree_store_dirent ext4 has to fetch the
> contents of the directory from disk.
>
> This leads me to believe that kasan is getting confused thinking that
> the object is being freed

As I said before, this is *not* use-after-free. It's out-of-bounds access.
No, kasan is not confused, it doesn't think that object is freed.
Object is allocated and kasan see it as allocated object.
The problem is that filldir reads past the end of that allocated object.

I don't see any sign that it's a false-positive.


> AFTER being allocated in
> ext4_htree_store_dirent but testing shows it's being freed BEFORE. So
> I deem this a false positive, triggered by the compiler.
>
>
>

2017-03-09 10:16:42

by Dmitry Vyukov

[permalink] [raw]
Subject: Re: kasan behavior when built with unsupported compiler

On Thu, Mar 9, 2017 at 10:58 AM, Nikolay Borisov
<[email protected]> wrote:
>
>
> On 9.03.2017 11:46, Andrey Ryabinin wrote:
>> On 03/08/2017 11:10 AM, Nikolay Borisov wrote:
>>
>>>
>>> So apparently this is indeed a false positive, resulting from using the old
>>> compiler. I used the attached patch to verify it.
>>>
>>> And what it prints is :
>>> [ 17.184288] Assigned fbdev-blacklist.conff(ffff880001ea8020)20 whole object: ffff88006ae8fdb0 inode:ffff88006bff60d0
>>> [ 17.185808] Calling filldir with ffff88006ae8fdb0
>>>
>>> So the first line essentially happens when the object ffff88006ae8fdb0 is
>>> being allocated and the second when it's used in filldir. The warning in
>>> ext4_ext_map_blocks doesn't trigger. However, if I remove the check for
>>> the value of ext4_global_pointer then I see multiple lines such as:
>>> [ 17.386283] ext4_ext_map_blocks:freeing pointer used in ext4_htree_store_dirent: ffff88006ae8fdb0 inode: ffff88006bff60d0
>>> [ 17.387601] Assigned fbdev-blacklist.conff(ffff880001eb3020)20 whole object: ffff88006ae8fdb0 inode:ffff88006bff60d0
>>> [ 17.388740] Calling filldir with ffff88006ae8fdb0
>>>
>>> so that same object was used right before it is allocated again in
>>> ext4_htree_store_dirent. And when you think about it it is logical since
>>> before filling in the dentry names in ext4_htree_store_dirent ext4 has to fetch the
>>> contents of the directory from disk.
>>>
>>> This leads me to believe that kasan is getting confused thinking that
>>> the object is being freed
>>
>> As I said before, this is *not* use-after-free. It's out-of-bounds access.
>> No, kasan is not confused, it doesn't think that object is freed.
>> Object is allocated and kasan see it as allocated object.
>> The problem is that filldir reads past the end of that allocated object.
>>
>> I don't see any sign that it's a false-positive.
>
> Okay in that case I will continue digging. So the name is indeed housed
> at the end of the struct fname. In ext4_htree_store_dirent this
> structure seems allocated correctly sizeof(struct fname) + ent_name->len
> + 1;
>
> Also the read should indeed be 20 bytes since the filename in question
> fbdev-blacklist.conf is indeed 20 bytes. This implies that the 'namlen'
> passed to copy_to_user is also correct. I guess I will have to look at
> the generated assembly between the 2 gcc versions and see if anything
> pops out.
>
>>
>>
>>> AFTER being allocated in
>>> ext4_htree_store_dirent but testing shows it's being freed BEFORE. So
>>> I deem this a false positive, triggered by the compiler.


Looking at this report I don't see how this is OOB.
Object starts at 0xffff88006a2255e0, object size is 69 bytes (from shadow).
ffff88006a22560e + 20 bytes is not OOB.


BUG: KASAN: slab-out-of-bounds in filldir+0xc8/0x170 at addr ffff88006a22560e
Read of size 20 by task systemd/1
=============================================================================
BUG kmalloc-96 (Not tainted): kasan: bad access detected
-----------------------------------------------------------------------------

Disabling lock debugging due to kernel taint
INFO: Allocated in ext4_htree_store_dirent+0x3e/0x120 age=0 cpu=2 pid=1
[< none >] ___slab_alloc+0x636/0x6a0 mm/slub.c:2446
[< none >] __slab_alloc+0x4f/0x86 mm/slub.c:2475
[< inline >] slab_alloc_node mm/slub.c:2538
[< inline >] slab_alloc mm/slub.c:2580
[< none >] __kmalloc+0x27a/0x340 mm/slub.c:3561
[< inline >] kmalloc include/linux/slab.h:483
[< inline >] kzalloc include/linux/slab.h:622
[< none >] ext4_htree_store_dirent+0x3e/0x120 fs/ext4/dir.c:447
[< none >] htree_dirblock_to_tree+0x16a/0x190 fs/ext4/namei.c:1001
[< none >] ext4_htree_fill_tree+0xaa/0x310 fs/ext4/namei.c:1075
[< inline >] ext4_dx_readdir fs/ext4/dir.c:571
[< none >] ext4_readdir+0x698/0x950 fs/ext4/dir.c:121
[< none >] iterate_dir+0x7d/0x190 fs/readdir.c:50
[< inline >] SYSC_getdents fs/readdir.c:230
[< none >] SyS_getdents+0x91/0x120 fs/readdir.c:212
[< none >] entry_SYSCALL_64_fastpath+0x23/0xc1
arch/x86/entry/entry_64.S:207

INFO: Freed in ext4_ext_map_blocks+0x7f9/0x23e0 age=1 cpu=2 pid=1
[< none >] __slab_free+0x31b/0x440 mm/slub.c:2657
[< inline >] slab_free mm/slub.c:2810
[< none >] kfree+0x27f/0x2d0 mm/slub.c:3662
[< none >] ext4_ext_map_blocks+0x7f9/0x23e0 fs/ext4/extents.c:4619
[< none >] ext4_map_blocks+0x3b4/0x5b0 fs/ext4/inode.c:529
[< none >] ext4_getblk+0x54/0x1a0 fs/ext4/inode.c:929
[< none >] ext4_bread+0x13/0x90 fs/ext4/inode.c:979
[< none >] __ext4_read_dirblock+0x3f/0x380 fs/ext4/namei.c:99
[< none >] htree_dirblock_to_tree+0x48/0x190 fs/ext4/namei.c:959
[< none >] ext4_htree_fill_tree+0xaa/0x310 fs/ext4/namei.c:1075
[< inline >] ext4_dx_readdir fs/ext4/dir.c:571
[< none >] ext4_readdir+0x698/0x950 fs/ext4/dir.c:121
[< none >] iterate_dir+0x7d/0x190 fs/readdir.c:50
[< inline >] SYSC_getdents fs/readdir.c:230
[< none >] SyS_getdents+0x91/0x120 fs/readdir.c:212
[< none >] entry_SYSCALL_64_fastpath+0x23/0xc1
arch/x86/entry/entry_64.S:207
INFO: Slab 0xffffea0001a88900 objects=20 used=17 fp=0xffff88006a224e10
flags=0x4080
INFO: Object 0xffff88006a2255e0 @offset=5600 fp=0x45b282a2484c60d4

Bytes b4 ffff88006a2255d0: 02 00 00 00 01 00 00 00 c9 ac fb ff 00 00
00 00 ................
Object ffff88006a2255e0: d4 60 4c 48 a2 82 b2 45 18 8a 82 6a 00 88 ff
ff .`LH...E...j....
Object ffff88006a2255f0: 38 51 22 6a 00 88 ff ff 88 8b 82 6a 00 88 ff
ff 8Q"j.......j....
Object ffff88006a225600: 00 00 00 00 00 00 00 00 28 03 08 00 14 01 66
62 ........(.....fb
Object ffff88006a225610: 64 65 76 2d 62 6c 61 63 6b 6c 69 73 74 2e 63
6f dev-blacklist.co
Object ffff88006a225620: 6e 66 00 00 00 00 00 00 00 00 00 00 00 00 00
00 nf..............
Object ffff88006a225630: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 ................
CPU: 2 PID: 1 Comm: systemd Tainted: G B 4.7.0-nbor #171
Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS
Ubuntu-1.8.2-1ubuntu1 04/01/2014
0000000000000000 ffff88006cd97c58 ffffffff8146bd4c ffff8800000946c0
ffff88006a2255e0 ffff88006cd97c88 ffffffff81198d96 ffff8800000946c0
ffffea0001a88900 ffff88006a2255e0 0000000000000000 ffff88006cd97cb0
Call Trace:
[< inline >] __dump_stack lib/dump_stack.c:15
[<ffffffff8146bd4c>] dump_stack+0x85/0xc9 lib/dump_stack.c:51
[<ffffffff81198d96>] print_trailer+0x116/0x190 mm/slub.c:667
[<ffffffff811992c1>] object_err+0x41/0x50 mm/slub.c:674
[< inline >] print_address_description mm/kasan/report.c:180
[< inline >] kasan_report_error mm/kasan/report.c:276
[<ffffffff811a0a42>] kasan_report+0x282/0x530 mm/kasan/report.c:298
[< inline >] check_memory_region_inline mm/kasan/kasan.c:292
[<ffffffff8119ffa7>] check_memory_region+0x137/0x160 mm/kasan/kasan.c:299
[<ffffffff811a0041>] kasan_check_read+0x11/0x20 mm/kasan/kasan.c:304
[< inline >] copy_to_user ./arch/x86/include/asm/uaccess.h:760
[<ffffffff811ccc08>] filldir+0xc8/0x170 fs/readdir.c:195
[< inline >] dir_emit include/linux/fs.h:3134
[<ffffffff8124af38>] call_filldir+0x88/0x140 fs/ext4/dir.c:510
[< inline >] ext4_dx_readdir fs/ext4/dir.c:586
[<ffffffff8124b934>] ext4_readdir+0x714/0x950 fs/ext4/dir.c:121
[<ffffffff811ccd2d>] iterate_dir+0x7d/0x190 fs/readdir.c:50
[< inline >] SYSC_getdents fs/readdir.c:230
[<ffffffff811ccf71>] SyS_getdents+0x91/0x120 fs/readdir.c:212
[<ffffffff816d7d80>] entry_SYSCALL_64_fastpath+0x23/0xc1
arch/x86/entry/entry_64.S:207
Memory state around the buggy address:
ffff88006a225500: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
ffff88006a225580: fc fc fc fc fc fc fc fc fc fc fc fc 00 00 00 00
>ffff88006a225600: 00 00 00 00 05 fc fc fc fc fc fc fc fc fc fc fc
^
ffff88006a225680: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
ffff88006a225700: fc fc fc fc fc fc fc fc fc fc fc fc fc fc 00 00

2017-03-09 11:38:26

by Andrey Ryabinin

[permalink] [raw]
Subject: Re: kasan behavior when built with unsupported compiler

On 03/09/2017 01:16 PM, Dmitry Vyukov wrote:
>
> Looking at this report I don't see how this is OOB.
> Object starts at 0xffff88006a2255e0, object size is 69 bytes (from shadow).
> ffff88006a22560e + 20 bytes is not OOB.
>

Indeed, perhaps gcc 4.7.4 miscompiles something check_memory_region ?
That's the only reasonable explanation I have.