2023-03-30 22:48:17

by Jim Cromie

[permalink] [raw]
Subject: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

hi Luis, etal

kmemleak is reporting 19 leaks during boot

because the hexdumps appeared to have module-names,
and Ive been hacking nearby, and see the same names
every time I boot my test-vm, I needed a clearer picture
Jason corroborated and bisected.

the 19 leaks split into 2 groups,
9 with names of builtin modules in the hexdump,
all with the same backtrace
9 without module-names (with a shared backtrace)
+1 wo name-ish and a separate backtrace


bash-5.2# ./grok_kmemleak -n
this prints not-module-name set 1st, then the "module"-ish

not: bless( {
'backtraces' => {
'[<0000000003a4e200>] __kmalloc_node_track_caller+0x44/0xb0
[<0000000072a38f0a>] memdup_user+0x26/0x90
[<000000005669249f>] strndup_user+0x47/0x70
[<00000000a8de6ea1>] load_module+0x9d5/0x10b0
[<000000000132739b>] __do_sys_finit_module+0xe4/0x160
[<000000005ac0591a>] do_syscall_64+0x34/0x80
[<0000000078e9b61a>] entry_SYSCALL_64_after_hwframe+0x46/0xb0' => 9,
'[<00000000f9318e0d>] kmalloc_trace+0x26/0x60
[<0000000056fc149d>] ref_module+0xd6/0x200
[<0000000059adcd74>] resolve_symbol+0x2ae/0x320
[<000000006a75da8e>] simplify_symbols+0x1ae/0x5a0
[<0000000061d9061a>] load_module+0x529/0x10b0
[<000000000132739b>] __do_sys_finit_module+0xe4/0x160
[<000000005ac0591a>] do_syscall_64+0x34/0x80
[<0000000078e9b61a>] entry_SYSCALL_64_after_hwframe+0x46/0xb0' => 1
},
'hexdumps' => {
'00 19 b2 fc 00 c8 52 8a ......R.' => 1,
'00 42 7d fc 00 07 09 a2 .B}.....' => 1,
'00 42 ba fc 00 c0 0f 6a .B.....j' => 1,
'00 43 48 fc 00 32 0c 8a .CH..2..' => 1,
'00 44 48 fc 00 32 05 92 .DH..2..' => 1,
'00 4f 7d fc 00 07 0c 42 .O}....B' => 1,
'00 61 a7 fc 00 dd 2d 52 .a....-R' => 1,
'00 62 a7 fc 00 dd 29 f2 .b....).' => 1,
'00 69 a7 fc 00 dd 2a 62 .i....*b' => 1,
'38 86 25 c0 ff ff ff ff 38 86 25 c0 ff ff ff ff 8.%.....8.%.....' => 1
},
'users' => {
'comm "(udev-worker)", pid 218,' => 1,
'comm "(udev-worker)", pid 219,' => 2,
'comm "(udev-worker)", pid 225,' => 1,
'comm "(udev-worker)", pid 226,' => 1,
'comm "(udev-worker)", pid 229,' => 5
}
}, 'LeakSet' )
mods: bless( {
'backtraces' => {
'[<0000000003a4e200>] __kmalloc_node_track_caller+0x44/0xb0
[<00000000a39d2d44>] kstrdup+0x30/0x60
[<000000003584675a>] kobject_set_name_vargs+0x2d/0xb0
[<00000000fd79ba60>] kobject_init_and_add+0x9d/0x100
[<00000000b107c417>] mod_sysfs_setup+0xf8/0x3c0
[<0000000082c80748>] load_module+0xf16/0x10b0
[<000000000132739b>] __do_sys_finit_module+0xe4/0x160
[<000000005ac0591a>] do_syscall_64+0x34/0x80
[<0000000078e9b61a>] entry_SYSCALL_64_after_hwframe+0x46/0xb0' => 9
},
'hexdumps' => {
'63 72 63 33 32 5f 70 63 6c 6d 75 6c 00 ca 32 3f crc32_pclmul..2?' => 1,
'63 72 63 33 32 63 5f 69 6e 74 65 6c 00 dd 01 bf crc32c_intel....' => 1,
'63 72 63 74 31 30 64 69 66 5f 70 63 6c 6d 75 6c crct10dif_pclmul' => 1,
'67 68 61 73 68 5f 63 6c 6d 75 6c 6e 69 5f 69 6e ghash_clmulni_in' => 1,
'69 32 63 5f 70 69 69 78 34 00 77 85 96 39 f4 3f i2c_piix4.w..9.?' => 1,
'69 6e 74 65 6c 5f 72 61 70 6c 5f 63 6f 6d 6d 6f intel_rapl_commo' => 1,
'69 6e 74 65 6c 5f 72 61 70 6c 5f 6d 73 72 00 9f intel_rapl_msr..' => 1,
'70 63 73 70 6b 72 00 72 pcspkr.r' => 1,
'73 65 72 69 6f 5f 72 61 77 00 c9 84 97 87 cb 3f serio_raw......?' => 1
},
'users' => {
'comm "(udev-worker)", pid 218,' => 1,
'comm "(udev-worker)", pid 219,' => 1,
'comm "(udev-worker)", pid 225,' => 1,
'comm "(udev-worker)", pid 226,' => 1,
'comm "(udev-worker)", pid 229,' => 5
}
}, 'LeakSet' )


After a batch of modprobe, rmmod cycles on drm,
I got more leaks.
They have same backtrace as the original set
only the user has changed, to "modprobe"

:#> [98442.905272] kmemleak: 14 new suspected memory leaks (see
/sys/kernel/debug/kmemleak)
./grok_kmemleak -n
not: bless( {
'backtraces' => {
'[<0000000003a4e200>] __kmalloc_node_track_caller+0x44/0xb0
[<0000000072a38f0a>] memdup_user+0x26/0x90
[<000000005669249f>] strndup_user+0x47/0x70
[<00000000a8de6ea1>] load_module+0x9d5/0x10b0
[<000000000132739b>] __do_sys_finit_module+0xe4/0x160
[<000000005ac0591a>] do_syscall_64+0x34/0x80
[<0000000078e9b61a>] entry_SYSCALL_64_after_hwframe+0x46/0xb0' => 15,
'[<00000000f9318e0d>] kmalloc_trace+0x26/0x60
[<0000000056fc149d>] ref_module+0xd6/0x200
[<0000000059adcd74>] resolve_symbol+0x2ae/0x320
[<000000006a75da8e>] simplify_symbols+0x1ae/0x5a0
[<0000000061d9061a>] load_module+0x529/0x10b0
[<000000000132739b>] __do_sys_finit_module+0xe4/0x160
[<000000005ac0591a>] do_syscall_64+0x34/0x80
[<0000000078e9b61a>] entry_SYSCALL_64_after_hwframe+0x46/0xb0' => 2
},
'hexdumps' => {
'00 19 b2 fc 00 c8 52 8a ......R.' => 1,
'00 2d 0d f8 04 77 62 a2 .-...wb.' => 1,
'00 3f 4e fd 01 34 7a aa .?N..4z.' => 1,
'00 41 48 fc 00 32 03 9a .AH..2..' => 1,
'00 42 7d fc 00 07 09 a2 .B}.....' => 1,
'00 42 ba fc 00 c0 0f 6a .B.....j' => 1,
'00 43 48 fc 00 32 0c 8a .CH..2..' => 1,
'00 44 48 fc 00 32 05 92 .DH..2..' => 1,
'00 4e ba fc 00 c0 06 b2 .N......' => 1,
'00 4f 7d fc 00 07 0c 42 .O}....B' => 1,
'00 61 a7 fc 00 dd 2d 52 .a....-R' => 1,
'00 62 a7 fc 00 dd 29 f2 .b....).' => 1,
'00 69 a7 fc 00 dd 2a 62 .i....*b' => 1,
'00 e8 f4 fc 00 8e aa 4a .......J' => 1,
'00 ee f4 fc 00 8e aa 62 .......b' => 1,
'38 86 25 c0 ff ff ff ff 38 86 25 c0 ff ff ff ff 8.%.....8.%.....' => 1,
'80 70 0c 08 80 88 ff ff 38 7a 42 c0 ff ff ff ff .p......8zB.....' => 1
},
'users' => {
'comm "(udev-worker)", pid 218,' => 1,
'comm "(udev-worker)", pid 219,' => 2,
'comm "(udev-worker)", pid 225,' => 1,
'comm "(udev-worker)", pid 226,' => 1,
'comm "(udev-worker)", pid 229,' => 5,
'comm "modprobe", pid 673,' => 1,
'comm "modprobe", pid 683,' => 3,
'comm "modprobe", pid 690,' => 1,
'comm "modprobe", pid 693,' => 2
}
}, 'LeakSet' )
mods: bless( {
'backtraces' => {
'[<0000000003a4e200>] __kmalloc_node_track_caller+0x44/0xb0
[<00000000a39d2d44>] kstrdup+0x30/0x60
[<000000003584675a>] kobject_set_name_vargs+0x2d/0xb0
[<00000000fd79ba60>] kobject_init_and_add+0x9d/0x100
[<00000000b107c417>] mod_sysfs_setup+0xf8/0x3c0
[<0000000082c80748>] load_module+0xf16/0x10b0
[<000000000132739b>] __do_sys_finit_module+0xe4/0x160
[<000000005ac0591a>] do_syscall_64+0x34/0x80
[<0000000078e9b61a>] entry_SYSCALL_64_after_hwframe+0x46/0xb0' => 15,
'[<00000000f9318e0d>] kmalloc_trace+0x26/0x60
[<0000000056fc149d>] ref_module+0xd6/0x200
[<0000000059adcd74>] resolve_symbol+0x2ae/0x320
[<000000006a75da8e>] simplify_symbols+0x1ae/0x5a0
[<0000000061d9061a>] load_module+0x529/0x10b0
[<000000000132739b>] __do_sys_finit_module+0xe4/0x160
[<000000005ac0591a>] do_syscall_64+0x34/0x80
[<0000000078e9b61a>] entry_SYSCALL_64_after_hwframe+0x46/0xb0' => 1
},
'hexdumps' => {
'38 7a 42 c0 ff ff ff ff 80 97 bf 09 80 88 ff ff 8zB.............' => 1,
'63 65 63 00 04 77 6a c2 cec..wj.' => 1,
'63 72 63 33 32 5f 70 63 6c 6d 75 6c 00 ca 32 3f crc32_pclmul..2?' => 1,
'63 72 63 33 32 63 5f 69 6e 74 65 6c 00 dd 01 bf crc32c_intel....' => 1,
'63 72 63 74 31 30 64 69 66 5f 70 63 6c 6d 75 6c crct10dif_pclmul' => 1,
'67 68 61 73 68 5f 63 6c 6d 75 6c 6e 69 5f 69 6e ghash_clmulni_in' => 1,
'69 32 63 5f 61 6c 67 6f 5f 62 69 74 00 f2 db 3f i2c_algo_bit...?' => 1,
'69 32 63 5f 70 69 69 78 34 00 77 85 96 39 f4 3f i2c_piix4.w..9.?' => 1,
'69 6e 74 65 6c 5f 72 61 70 6c 5f 63 6f 6d 6d 6f intel_rapl_commo' => 1,
'69 6e 74 65 6c 5f 72 61 70 6c 5f 6d 73 72 00 9f intel_rapl_msr..' => 1,
'69 6f 6d 6d 75 5f 76 32 00 ae da 84 97 94 b1 5f iommu_v2......._' => 1,
'6d 78 6d 5f 77 6d 69 00 mxm_wmi.' => 1,
'70 63 73 70 6b 72 00 72 pcspkr.r' => 1,
'73 65 72 69 6f 5f 72 61 77 00 c9 84 97 87 cb 3f serio_raw......?' => 1,
'76 69 64 65 6f 00 ae 82 video...' => 1,
'77 6d 69 00 00 c0 06 52 wmi....R' => 1
},
'users' => {
'comm "(udev-worker)", pid 218,' => 1,
'comm "(udev-worker)", pid 219,' => 1,
'comm "(udev-worker)", pid 225,' => 1,
'comm "(udev-worker)", pid 226,' => 1,
'comm "(udev-worker)", pid 229,' => 5,
'comm "modprobe", pid 673,' => 1,
'comm "modprobe", pid 683,' => 4,
'comm "modprobe", pid 690,' => 1,
'comm "modprobe", pid 693,' => 1
}
}, 'LeakSet' )
:#>

I hope all this helps to sort out what the kmemleak is,
let me know how I can help further

Jim


2023-03-31 07:04:45

by Luis Chamberlain

[permalink] [raw]
Subject: Re: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

On Thu, Mar 30, 2023 at 04:45:43PM -0600, [email protected] wrote:
> hi Luis, etal
>
> kmemleak is reporting 19 leaks during boot
>
> because the hexdumps appeared to have module-names,
> and Ive been hacking nearby, and see the same names
> every time I boot my test-vm, I needed a clearer picture
> Jason corroborated and bisected.
>
> the 19 leaks split into 2 groups,
> 9 with names of builtin modules in the hexdump,
> all with the same backtrace
> 9 without module-names (with a shared backtrace)
> +1 wo name-ish and a separate backtrace

Song, please take a look.

Thanks for the report Jim, what kernel are you on exactly?

Luis

2023-03-31 17:17:17

by Jim Cromie

[permalink] [raw]
Subject: Re: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

On Fri, Mar 31, 2023 at 1:00 AM Luis Chamberlain <[email protected]> wrote:
>
> On Thu, Mar 30, 2023 at 04:45:43PM -0600, [email protected] wrote:
> > hi Luis, etal
> >
> > kmemleak is reporting 19 leaks during boot
> >
> > because the hexdumps appeared to have module-names,
> > and Ive been hacking nearby, and see the same names
> > every time I boot my test-vm, I needed a clearer picture
> > Jason corroborated and bisected.
> >
> > the 19 leaks split into 2 groups,
> > 9 with names of builtin modules in the hexdump,
> > all with the same backtrace
> > 9 without module-names (with a shared backtrace)
> > +1 wo name-ish and a separate backtrace
>
> Song, please take a look.
>
> Thanks for the report Jim, what kernel are you on exactly?
>
> Luis

:#> uptime
09:45:32 up 1 day, 23:07, 0 users, load average: 0.07, 0.04, 0.01
:#> uname -a
Linux (none) 6.3.0-rc1-f2-00001-gac3b43283923 #359 SMP PREEMPT_DYNAMIC
Wed Mar 29 09:33:11 MDT 2023 x86_64 x86_64 x86_64 GNU/Linux

the leaks I sent previously might be from/on a different commit,
heres the relevant one

fwiw, the config is unremarkable. it started with
CONFIG_BUILD_SALT="5.16.8-200.fc35.x86_64"
then `make localmodconfig` to drop anything I dont have hw for
then `virtme-configkernel --update` to pick up the 9p,etc config options
And some extra DEBUG_* options
If you'd like to see runs with others, or see the config itself, please ask.

:#> uname -a
Linux (none) 6.3.0-rc1-f2-00001-gac3b43283923 #359 SMP PREEMPT_DYNAMIC
Wed Mar 29 09:33:11 MDT 2023 x86_64 x86_64 x86_64 GNU/Linux
:#> ./grok_kmemleak -n
not: bless( {
'backtraces' => {
'[<0000000058fb276d>] __kmalloc_node_track_caller+0x4a/0x140
[<00000000a2f80203>] memdup_user+0x26/0x90
[<00000000f7cd3624>] strndup_user+0x3f/0x60
[<0000000098fd26c5>] load_module+0x188b/0x20e0
[<0000000074361279>] __do_sys_finit_module+0x93/0xf0
[<000000004caeb948>] do_syscall_64+0x34/0x80
[<000000009f5d036c>] entry_SYSCALL_64_after_hwframe+0x46/0xb0' => 16,
'[<0000000094c136c3>] kmalloc_trace+0x26/0x90
[<00000000700fd414>] resolve_symbol+0x2a5/0x3a0
[<000000001dd9228b>] load_module+0x1465/0x20e0
[<0000000074361279>] __do_sys_finit_module+0x93/0xf0
[<000000004caeb948>] do_syscall_64+0x34/0x80
[<000000009f5d036c>] entry_SYSCALL_64_after_hwframe+0x46/0xb0' => 3
},
'hexdumps' => {
'00 b3 af 5a de 87 df 17 ...Z....' => 1,
'00 b6 af 5a de 87 d8 cf ...Z....' => 1,
'00 b7 af 5a de 87 d5 77 ...Z...w' => 1,
'00 c8 b9 5a de 91 a2 2f ...Z.../' => 1,
'00 ca b9 5a de 91 a6 3f ...Z...?' => 1,
'00 cf b9 5a de 91 a2 07 ...Z....' => 1,
'00 e0 4c 56 d2 64 8b 77 ..LV.d.w' => 1,
'00 e0 f2 59 dd da 85 7f ...Y....' => 1,
'00 e3 4c 56 d2 64 89 1f ..LV.d..' => 1,
'00 e4 f2 59 dd da 8f 4f ...Y...O' => 1,
'00 e5 4c 56 d2 64 87 bf ..LV.d..' => 1,
'00 e6 f2 59 dd da 89 57 ...Y...W' => 1,
'00 e8 f2 59 dd da 83 17 ...Y....' => 1,
'00 e9 4c 56 d2 64 8e df ..LV.d..' => 1,
'00 eb 4c 56 d2 64 80 67 ..LV.d.g' => 1,
'00 ec 4c 56 d2 64 8f 7f ..LV.d..' => 1,
'40 d4 1c 08 80 88 ff ff 88 99 37 c0 ff ff ff ff @.........7.....' => 1,
'88 99 37 c0 ff ff ff ff 40 09 e8 13 80 88 ff ff ..7.....@.......' => 1,
'c8 8a 23 c0 ff ff ff ff c8 8a 23 c0 ff ff ff ff ..#.......#.....' => 1
},
'users' => {
'comm "(udev-worker)", pid 219,' => 1,
'comm "(udev-worker)", pid 221,' => 4,
'comm "(udev-worker)", pid 224,' => 3,
'comm "(udev-worker)", pid 229,' => 1,
'comm "(udev-worker)", pid 230,' => 1,
'comm "modprobe", pid 728,' => 1,
'comm "modprobe", pid 814,' => 1,
'comm "modprobe", pid 825,' => 4,
'comm "modprobe", pid 832,' => 1,
'comm "modprobe", pid 835,' => 2
}
}, 'LeakSet' )
mods: bless( {
'backtraces' => {
'[<0000000058fb276d>] __kmalloc_node_track_caller+0x4a/0x140
[<00000000ab7b01fd>] kstrdup+0x32/0x60
[<000000005ed25b98>] kobject_set_name_vargs+0x1c/0x90
[<0000000090fe19ca>] kobject_init_and_add+0x4d/0x90
[<0000000045666935>] mod_sysfs_setup+0xa9/0x6e0
[<00000000d6f7187b>] load_module+0x1de3/0x20e0
[<0000000074361279>] __do_sys_finit_module+0x93/0xf0
[<000000004caeb948>] do_syscall_64+0x34/0x80
[<000000009f5d036c>] entry_SYSCALL_64_after_hwframe+0x46/0xb0' => 16
},
'hexdumps' => {
'63 65 63 00 d2 64 80 7f cec..d..' => 1,
'63 72 63 33 32 5f 70 63 6c 6d 75 6c 00 24 14 48 crc32_pclmul.$.H' => 1,
'63 72 63 33 32 63 5f 69 6e 74 65 6c 00 a7 e0 f8 crc32c_intel....' => 1,
'63 72 63 74 31 30 64 69 66 5f 70 63 6c 6d 75 6c crct10dif_pclmul' => 1,
'67 68 61 73 68 5f 63 6c 6d 75 6c 6e 69 5f 69 6e ghash_clmulni_in' => 1,
'69 32 63 5f 61 6c 67 6f 5f 62 69 74 00 c4 b6 08 i2c_algo_bit....' => 1,
'69 32 63 5f 70 69 69 78 34 00 cb 8a 66 a7 e2 48 i2c_piix4...f..H' => 1,
'69 6e 74 65 6c 5f 72 61 70 6c 5f 63 6f 6d 6d 6f intel_rapl_commo' => 1,
'69 6e 74 65 6c 5f 72 61 70 6c 5f 6d 73 72 00 98 intel_rapl_msr..' => 1,
'69 6f 6d 6d 75 5f 76 32 00 70 a8 80 6c c4 bd 08 iommu_v2.p..l...' => 1,
'6d 78 6d 5f 77 6d 69 00 mxm_wmi.' => 1,
'70 63 73 70 6b 72 00 8f pcspkr..' => 1,
'73 65 72 69 6f 5f 72 61 77 00 cb 8a 66 a7 ed b8 serio_raw...f...' => 1,
'74 65 73 74 5f 64 79 6e 61 6d 69 63 5f 64 65 62 test_dynamic_deb' => 1,
'76 69 64 65 6f 00 d9 bf video...' => 1,
'77 6d 69 00 dd da 80 df wmi.....' => 1
},
'users' => {
'comm "(udev-worker)", pid 219,' => 1,
'comm "(udev-worker)", pid 221,' => 4,
'comm "(udev-worker)", pid 224,' => 2,
'comm "(udev-worker)", pid 229,' => 1,
'comm "(udev-worker)", pid 230,' => 1,
'comm "modprobe", pid 728,' => 1,
'comm "modprobe", pid 814,' => 1,
'comm "modprobe", pid 825,' => 3,
'comm "modprobe", pid 832,' => 1,
'comm "modprobe", pid 835,' => 1
}
}, 'LeakSet' )
:#> lsmod
Module Size Used by
mxm_wmi 12288 0
iommu_v2 20480 0
video 65536 0
i2c_algo_bit 12288 0
wmi 32768 2 video,mxm_wmi
cec 57344 0
test_dynamic_debug 20480 0
intel_rapl_msr 16384 0
crc32_pclmul 12288 0
intel_rapl_common 28672 1 intel_rapl_msr
ghash_clmulni_intel 12288 0
crct10dif_pclmul 12288 1
crc32c_intel 20480 0
serio_raw 16384 0
pcspkr 12288 0
i2c_piix4 28672 0
:#>

2023-03-31 19:15:00

by Luis Chamberlain

[permalink] [raw]
Subject: Re: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

On Fri, Mar 31, 2023 at 11:08:23AM -0600, [email protected] wrote:
> :#> uptime
> 09:45:32 up 1 day, 23:07, 0 users, load average: 0.07, 0.04, 0.01
> :#> uname -a
> Linux (none) 6.3.0-rc1-f2-00001-gac3b43283923 #359 SMP PREEMPT_DYNAMIC
> Wed Mar 29 09:33:11 MDT 2023 x86_64 x86_64 x86_64 GNU/Linux
>
> the leaks I sent previously might be from/on a different commit,
> heres the relevant one
>
> fwiw, the config is unremarkable. it started with
> CONFIG_BUILD_SALT="5.16.8-200.fc35.x86_64"
> then `make localmodconfig` to drop anything I dont have hw for
> then `virtme-configkernel --update` to pick up the 9p,etc config options
> And some extra DEBUG_* options
> If you'd like to see runs with others, or see the config itself, please ask.

If you wanna see things explode

echo 0 > /proc/sys/vm/oom_dump_tasks
./stress-ng --module 20 --module-name xfs

This assumes xfs is not already loaded, and has all dependencies already
loaded. What would test the load_module() path.

If you wanna see if the test is earlier, you can try a module which
is already loaded on your system.

> :#> uname -a
> Linux (none) 6.3.0-rc1-f2-00001-gac3b43283923 #359 SMP PREEMPT_DYNAMIC
> Wed Mar 29 09:33:11 MDT 2023 x86_64 x86_64 x86_64 GNU/Linux
> :#> ./grok_kmemleak -n
> not: bless( {
> 'backtraces' => {
> '[<0000000058fb276d>] __kmalloc_node_track_caller+0x4a/0x140
> [<00000000a2f80203>] memdup_user+0x26/0x90
> [<00000000f7cd3624>] strndup_user+0x3f/0x60
> [<0000000098fd26c5>] load_module+0x188b/0x20e0

Can you do:

gdb vmlinux
l *(load_module+0x188b)

And provide the output?

> }, 'LeakSet' )
> mods: bless( {
> 'backtraces' => {
> '[<0000000058fb276d>] __kmalloc_node_track_caller+0x4a/0x140
> [<00000000ab7b01fd>] kstrdup+0x32/0x60
> [<000000005ed25b98>] kobject_set_name_vargs+0x1c/0x90
> [<0000000090fe19ca>] kobject_init_and_add+0x4d/0x90
> [<0000000045666935>] mod_sysfs_setup+0xa9/0x6e0

Ok that is a specific enough hint. I'll take a review of this sysfs
path see what changed that could break.

> [<00000000d6f7187b>] load_module+0x1de3/0x20e0
> [<0000000074361279>] __do_sys_finit_module+0x93/0xf0
> [<000000004caeb948>] do_syscall_64+0x34/0x80
> [<000000009f5d036c>] entry_SYSCALL_64_after_hwframe+0x46/0xb0' => 16
> },

Luis

2023-03-31 21:48:28

by Jim Cromie

[permalink] [raw]
Subject: Re: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

On Fri, Mar 31, 2023 at 1:12 PM Luis Chamberlain <[email protected]> wrote:
>
> On Fri, Mar 31, 2023 at 11:08:23AM -0600, [email protected] wrote:
> > :#> uptime
> > 09:45:32 up 1 day, 23:07, 0 users, load average: 0.07, 0.04, 0.01
> > :#> uname -a
> > Linux (none) 6.3.0-rc1-f2-00001-gac3b43283923 #359 SMP PREEMPT_DYNAMIC
> > Wed Mar 29 09:33:11 MDT 2023 x86_64 x86_64 x86_64 GNU/Linux
> >
> > the leaks I sent previously might be from/on a different commit,
> > heres the relevant one
> >
> > fwiw, the config is unremarkable. it started with
> > CONFIG_BUILD_SALT="5.16.8-200.fc35.x86_64"
> > then `make localmodconfig` to drop anything I dont have hw for
> > then `virtme-configkernel --update` to pick up the 9p,etc config options
> > And some extra DEBUG_* options
> > If you'd like to see runs with others, or see the config itself, please ask.
>
> If you wanna see things explode
>
> echo 0 > /proc/sys/vm/oom_dump_tasks
> ./stress-ng --module 20 --module-name xfs
>
> This assumes xfs is not already loaded, and has all dependencies already
> loaded. What would test the load_module() path.
>
> If you wanna see if the test is earlier, you can try a module which
> is already loaded on your system.
>
> > :#> uname -a
> > Linux (none) 6.3.0-rc1-f2-00001-gac3b43283923 #359 SMP PREEMPT_DYNAMIC
> > Wed Mar 29 09:33:11 MDT 2023 x86_64 x86_64 x86_64 GNU/Linux
> > :#> ./grok_kmemleak -n
> > not: bless( {
> > 'backtraces' => {
> > '[<0000000058fb276d>] __kmalloc_node_track_caller+0x4a/0x140
> > [<00000000a2f80203>] memdup_user+0x26/0x90
> > [<00000000f7cd3624>] strndup_user+0x3f/0x60
> > [<0000000098fd26c5>] load_module+0x188b/0x20e0
>
> Can you do:
>
> gdb vmlinux
> l *(load_module+0x188b)
>
> And provide the output?

(gdb) l *(load_module+0x188b)
0xffffffff8122a4bb is in load_module
(/home/jimc/projects/lx/wk-next/kernel/module/main.c:2820).
2815 goto free_modinfo;
2816
2817 flush_module_icache(mod);
2818
2819 /* Now copy in args */
2820 mod->args = strndup_user(uargs, ~0UL >> 1);
2821 if (IS_ERR(mod->args)) {
2822 err = PTR_ERR(mod->args);
2823 goto free_arch_cleanup;
2824 }

>
> > }, 'LeakSet' )
> > mods: bless( {
> > 'backtraces' => {
> > '[<0000000058fb276d>] __kmalloc_node_track_caller+0x4a/0x140
> > [<00000000ab7b01fd>] kstrdup+0x32/0x60
> > [<000000005ed25b98>] kobject_set_name_vargs+0x1c/0x90
> > [<0000000090fe19ca>] kobject_init_and_add+0x4d/0x90
> > [<0000000045666935>] mod_sysfs_setup+0xa9/0x6e0
>
> Ok that is a specific enough hint. I'll take a review of this sysfs
> path see what changed that could break.

(gdb) l *(mod_sysfs_setup+0xa9)
0xffffffff8122d2d9 is in mod_sysfs_setup
(/home/jimc/projects/lx/wk-next/kernel/module/sysfs.c:361).
356
357 mod->mkobj.mod = mod;
358
359 memset(&mod->mkobj.kobj, 0, sizeof(mod->mkobj.kobj));
360 mod->mkobj.kobj.kset = module_kset;
361 err = kobject_init_and_add(&mod->mkobj.kobj, &module_ktype, NULL,
362 "%s", mod->name);
363 if (err)
364 mod_kobject_put(mod);
365
(gdb)

>
> > [<00000000d6f7187b>] load_module+0x1de3/0x20e0

(gdb) l *(load_module+0x1de3)
0xffffffff8122aa13 is in load_module
(/home/jimc/projects/lx/wk-next/kernel/module/main.c:2856).
2851 pr_warn("%s: parameters '%s' after `--' ignored\n",
2852 mod->name, after_dashes);
2853 }
2854
2855 /* Link in to sysfs. */
2856 err = mod_sysfs_setup(mod, info, mod->kp, mod->num_kp);
2857 if (err < 0)
2858 goto coming_cleanup;
2859
2860 if (is_livepatch_module(mod)) {



> > [<0000000074361279>] __do_sys_finit_module+0x93/0xf0
> > [<000000004caeb948>] do_syscall_64+0x34/0x80
> > [<000000009f5d036c>] entry_SYSCALL_64_after_hwframe+0x46/0xb0' => 16
> > },
>> Luis

2023-04-01 00:28:31

by Song Liu

[permalink] [raw]
Subject: Re: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

On Fri, Mar 31, 2023 at 12:00 AM Luis Chamberlain <[email protected]> wrote:
>
> On Thu, Mar 30, 2023 at 04:45:43PM -0600, [email protected] wrote:
> > hi Luis, etal
> >
> > kmemleak is reporting 19 leaks during boot
> >
> > because the hexdumps appeared to have module-names,
> > and Ive been hacking nearby, and see the same names
> > every time I boot my test-vm, I needed a clearer picture
> > Jason corroborated and bisected.
> >
> > the 19 leaks split into 2 groups,
> > 9 with names of builtin modules in the hexdump,
> > all with the same backtrace
> > 9 without module-names (with a shared backtrace)
> > +1 wo name-ish and a separate backtrace
>
> Song, please take a look.

I will look into this next week.

Thanks,
Song

2023-04-03 21:02:49

by Luis Chamberlain

[permalink] [raw]
Subject: Re: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

On Fri, Mar 31, 2023 at 05:27:04PM -0700, Song Liu wrote:
> On Fri, Mar 31, 2023 at 12:00 AM Luis Chamberlain <[email protected]> wrote:
> >
> > On Thu, Mar 30, 2023 at 04:45:43PM -0600, [email protected] wrote:
> > > hi Luis, etal
> > >
> > > kmemleak is reporting 19 leaks during boot
> > >
> > > because the hexdumps appeared to have module-names,
> > > and Ive been hacking nearby, and see the same names
> > > every time I boot my test-vm, I needed a clearer picture
> > > Jason corroborated and bisected.
> > >
> > > the 19 leaks split into 2 groups,
> > > 9 with names of builtin modules in the hexdump,
> > > all with the same backtrace
> > > 9 without module-names (with a shared backtrace)
> > > +1 wo name-ish and a separate backtrace
> >
> > Song, please take a look.
>
> I will look into this next week.

I'm thinking this may be it, at least this gets us to what we used to do
as per original Catalinas' 4f2294b6dc88d ("kmemleak: Add modules
support") and right before Song's patch.

diff --git a/kernel/module/main.c b/kernel/module/main.c
index 6b6da80f363f..3b9c71fa6096 100644
--- a/kernel/module/main.c
+++ b/kernel/module/main.c
@@ -2240,7 +2240,10 @@ static int move_module(struct module *mod, struct load_info *info)
* which is inside the block. Just mark it as not being a
* leak.
*/
- kmemleak_ignore(ptr);
+ if (type == MOD_INIT_TEXT)
+ kmemleak_ignore(ptr);
+ else
+ kmemleak_not_leak(ptr);
if (!ptr) {
t = type;
goto out_enomem;

We used to use the grey area for the TEXT but the original commit
doesn't explain too well why we grey out init but not the others. Ie
why kmemleak_ignore() on init and kmemleak_not_leak() on the others.

Catalinas, any thoughts / suggestions? Should we just stick to
kmemleak_not_leak() for both now?

Luis

2023-04-03 21:12:29

by Konstantin Ryabitsev

[permalink] [raw]
Subject: Re: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

On Thu, Mar 30, 2023 at 04:45:43PM -0600, [email protected] wrote:
> hi Luis, etal
>
> kmemleak is reporting 19 leaks during boot

Hi, all:

I'm going to use this thread to test out bugbot. You can just ignore it and
let it do its thing behind the scenes -- if it explodes, I'll take care of it.

It should just send a single follow-up telling us that it's tracking the
thread, but otherwise stay entirely out of everyone's hair.

bugbot on

Sorry for interrupting and thank you for your patience -- this is for a good
cause, I promise. :)

-K

2023-04-03 21:24:15

by Kernel.org Bugbot

[permalink] [raw]
Subject: Re: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

Hello:

This conversation is now tracked by Kernel.org Bugzilla:
https://bugzilla.kernel.org/show_bug.cgi?id=217296

There is no need to do anything else, just keep talking.
--
Deet-doot-dot, I am a bot.
Kernel.org Bugzilla (peebz 0.1)

2023-04-05 01:40:55

by Jim Cromie

[permalink] [raw]
Subject: Re: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

On Mon, Apr 3, 2023 at 2:44 PM Luis Chamberlain <[email protected]> wrote:
>
> On Fri, Mar 31, 2023 at 05:27:04PM -0700, Song Liu wrote:
> > On Fri, Mar 31, 2023 at 12:00 AM Luis Chamberlain <[email protected]> wrote:
> > >
> > > On Thu, Mar 30, 2023 at 04:45:43PM -0600, [email protected] wrote:
> > > > hi Luis, etal
> > > >
> > > > kmemleak is reporting 19 leaks during boot
> > > >
> > > > because the hexdumps appeared to have module-names,
> > > > and Ive been hacking nearby, and see the same names
> > > > every time I boot my test-vm, I needed a clearer picture
> > > > Jason corroborated and bisected.
> > > >
> > > > the 19 leaks split into 2 groups,
> > > > 9 with names of builtin modules in the hexdump,
> > > > all with the same backtrace
> > > > 9 without module-names (with a shared backtrace)
> > > > +1 wo name-ish and a separate backtrace
> > >
> > > Song, please take a look.
> >
> > I will look into this next week.
>
> I'm thinking this may be it, at least this gets us to what we used to do
> as per original Catalinas' 4f2294b6dc88d ("kmemleak: Add modules
> support") and right before Song's patch.
>
> diff --git a/kernel/module/main.c b/kernel/module/main.c
> index 6b6da80f363f..3b9c71fa6096 100644
> --- a/kernel/module/main.c
> +++ b/kernel/module/main.c
> @@ -2240,7 +2240,10 @@ static int move_module(struct module *mod, struct load_info *info)
> * which is inside the block. Just mark it as not being a
> * leak.
> */
> - kmemleak_ignore(ptr);
> + if (type == MOD_INIT_TEXT)
> + kmemleak_ignore(ptr);
> + else
> + kmemleak_not_leak(ptr);
> if (!ptr) {
> t = type;
> goto out_enomem;
>
> We used to use the grey area for the TEXT but the original commit
> doesn't explain too well why we grey out init but not the others. Ie
> why kmemleak_ignore() on init and kmemleak_not_leak() on the others.
>
> Catalinas, any thoughts / suggestions? Should we just stick to
> kmemleak_not_leak() for both now?
>
> Luis

So I have mixed results.

your patch fixed the 19 leaks on my worktree / branch where I found them.

on top of
ac3b43283923 module: replace module_layout with module_memory

it fixed the (same) 19, but gets a few new ones.
whats weird is that once they report, they disappear from
/sys/kernel/debug/kmemleak

heres that kmemleak report, with a little preceding / setup,
performed by this bash scripting

drms_unload() {
for m in i915 amdgpu nouveau \
iommu_v2 video i2c_algo_bit mxm_wmi wmi intel_rapl_msr \
drm_display_helper cec drm_kms_helper drm_ttm_helper ttm gpu_sched
drm_buddy drm;
do
rmmod $m ;
done
}
drms_load() {
uname -a
for m in i915 amdgpu nouveau;
do
modprobe $m $*
done
}
cycle_drms() {
for i in 1 $*; do # loop 1+argc times
time drms_load
drms_unload
done
}

leak_drive () {

[[ -f /sys/kernel/debug/kmemleak ]] || {
echo "need KMEMLEAK"
return
}
let count=0
#echo ok testing, each dot is 10 secs
while true; do

var=`cat /sys/kernel/debug/kmemleak`
if [[ -z $var ]] ; then
cycle_drms
echo scan >/sys/kernel/debug/kmemleak
else
break
fi
((count=$count+1))
echo finished pass $count
done
cat /sys/kernel/debug/kmemleak
dmesg | grep /sys/kernel/debug/kmemleak
uname -a
}


finished pass 6
Linux (none) 6.3.0-rc1-f2-00002-g30504a44c558 #360 SMP PREEMPT_DYNAMIC
Tue Apr 4 15:25:05 MDT 2023 x86_64 x86_64 x86_64 GNU/Linux
[ 51.768797] ACPI: bus type drm_connector registered
[ 52.039443] AMD-Vi: AMD IOMMUv2 functionality not available on this
system - This is not a bug.
[ 52.795766] [drm] amdgpu kernel modesetting enabled.
[ 52.796288] amdgpu: CRAT table not found
[ 52.796502] amdgpu: Virtual CRAT table created for CPU
[ 52.796964] amdgpu: Topology: Add CPU node

real 0m1.354s
user 0m0.002s
sys 0m0.919s
rmmod: ERROR: Module intel_rapl_msr is not currently loaded
[ 53.401823] ACPI: bus type drm_connector unregistered
[ 53.595705] kmemleak: 2 new suspected memory leaks (see
/sys/kernel/debug/kmemleak)
finished pass 7
unreferenced object 0xffff8880059b0240 (size 192):
comm "modprobe", pid 716, jiffies 4294714739 (age 6.065s)
hex dump (first 32 bytes):
00 db 50 c0 ff ff ff ff 00 00 00 00 00 00 00 00 ..P.............
00 00 00 00 00 00 00 00 ea ff ff ff ff ff ff ff ................
backtrace:
[<00000000406104d4>] __kmalloc+0x49/0x150
[<00000000fe00c883>] __register_sysctl_table+0x51/0x7f0
[<00000000438011af>] 0xffffffffc04faa78
[<000000009a44098c>] 0xffffffffc037f01b
[<00000000de0b0c0b>] do_one_initcall+0x43/0x210
[<0000000016200549>] do_init_module+0x60/0x240
[<00000000e5f75cca>] __do_sys_finit_module+0x93/0xf0
[<0000000014ed2961>] do_syscall_64+0x34/0x80
[<00000000d14e8c97>] entry_SYSCALL_64_after_hwframe+0x46/0xb0
unreferenced object 0xffff88800ce25800 (size 256):
comm "modprobe", pid 716, jiffies 4294714739 (age 6.065s)
hex dump (first 32 bytes):
78 58 e2 0c 80 88 ff ff 00 00 00 00 00 00 00 00 xX..............
00 00 00 00 00 00 00 00 ea ff ff ff ff ff ff ff ................
backtrace:
[<00000000406104d4>] __kmalloc+0x49/0x150
[<00000000ec6658c8>] __register_sysctl_table+0x569/0x7f0
[<00000000438011af>] 0xffffffffc04faa78
[<000000009a44098c>] 0xffffffffc037f01b
[<00000000de0b0c0b>] do_one_initcall+0x43/0x210
[<0000000016200549>] do_init_module+0x60/0x240
[<00000000e5f75cca>] __do_sys_finit_module+0x93/0xf0
[<0000000014ed2961>] do_syscall_64+0x34/0x80
[<00000000d14e8c97>] entry_SYSCALL_64_after_hwframe+0x46/0xb0
[ 53.595705] kmemleak: 2 new suspected memory leaks (see
/sys/kernel/debug/kmemleak)
Linux (none) 6.3.0-rc1-f2-00002-g30504a44c558 #360 SMP PREEMPT_DYNAMIC
Tue Apr 4 15:25:05 MDT 2023 x86_64 x86_64 x86_64 GNU/Linux

at this point, kmemleak is empty.
Im guessing thats because the leak was in / under do_init_module,
and __init mem is recycled.

Maybe its also why the leak-trace has 2 entries without symbol info
Heres the levels above & below those mystery levels

(gdb) l *(do_one_initcall+0x43)
0xffffffff81001093 is in do_one_initcall
(/home/jimc/projects/lx/wk-next/init/main.c:1306).
1301
1302 if (initcall_blacklisted(fn))
1303 return -EPERM;
1304
1305 do_trace_initcall_start(fn);
1306 ret = fn();
1307 do_trace_initcall_finish(fn, ret);
1308
1309 msgbuf[0] = 0;
1310
(gdb) l *(__register_sysctl_table+0x569)
0xffffffff814e5e99 is in __register_sysctl_table
(/home/jimc/projects/lx/wk-next/fs/proc/proc_sysctl.c:974).
969 char *new_name;
970
971 new = kzalloc(sizeof(*new) + sizeof(struct ctl_node) +
972 sizeof(struct ctl_table)*2 + namelen + 1,
973 GFP_KERNEL);
974 if (!new)
975 return NULL;
976
977 node = (struct ctl_node *)(new + 1);
978 table = (struct ctl_table *)(node + 1);
(gdb)

2023-04-05 02:08:09

by Luis Chamberlain

[permalink] [raw]
Subject: Re: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

On Tue, Apr 04, 2023 at 07:38:41PM -0600, [email protected] wrote:
> On Mon, Apr 3, 2023 at 2:44 PM Luis Chamberlain <[email protected]> wrote:
> >
> > On Fri, Mar 31, 2023 at 05:27:04PM -0700, Song Liu wrote:
> > > On Fri, Mar 31, 2023 at 12:00 AM Luis Chamberlain <[email protected]> wrote:
> > > >
> > > > On Thu, Mar 30, 2023 at 04:45:43PM -0600, [email protected] wrote:
> > > > > hi Luis, etal
> > > > >
> > > > > kmemleak is reporting 19 leaks during boot
> > > > >
> > > > > because the hexdumps appeared to have module-names,
> > > > > and Ive been hacking nearby, and see the same names
> > > > > every time I boot my test-vm, I needed a clearer picture
> > > > > Jason corroborated and bisected.
> > > > >
> > > > > the 19 leaks split into 2 groups,
> > > > > 9 with names of builtin modules in the hexdump,
> > > > > all with the same backtrace
> > > > > 9 without module-names (with a shared backtrace)
> > > > > +1 wo name-ish and a separate backtrace
> > > >
> > > > Song, please take a look.
> > >
> > > I will look into this next week.
> >
> > I'm thinking this may be it, at least this gets us to what we used to do
> > as per original Catalinas' 4f2294b6dc88d ("kmemleak: Add modules
> > support") and right before Song's patch.
> >
> > diff --git a/kernel/module/main.c b/kernel/module/main.c
> > index 6b6da80f363f..3b9c71fa6096 100644
> > --- a/kernel/module/main.c
> > +++ b/kernel/module/main.c
> > @@ -2240,7 +2240,10 @@ static int move_module(struct module *mod, struct load_info *info)
> > * which is inside the block. Just mark it as not being a
> > * leak.
> > */
> > - kmemleak_ignore(ptr);
> > + if (type == MOD_INIT_TEXT)
> > + kmemleak_ignore(ptr);
> > + else
> > + kmemleak_not_leak(ptr);
> > if (!ptr) {
> > t = type;
> > goto out_enomem;
> >
> > We used to use the grey area for the TEXT but the original commit
> > doesn't explain too well why we grey out init but not the others. Ie
> > why kmemleak_ignore() on init and kmemleak_not_leak() on the others.
> >
> > Catalinas, any thoughts / suggestions? Should we just stick to
> > kmemleak_not_leak() for both now?
> >
> > Luis
>
> So I have mixed results.
>
> your patch fixed the 19 leaks on my worktree / branch where I found them.
>
> on top of
> ac3b43283923 module: replace module_layout with module_memory
>
> it fixed the (same) 19, but gets a few new ones.
> whats weird is that once they report, they disappear from
> /sys/kernel/debug/kmemleak

I think I missed the MOD_INIT_DATA and MOD_INIT_RODATA. Can you try the
patch below instead:

From 6890bd43866c40e1b58a832361812cdc5d965e4c Mon Sep 17 00:00:00 2001
From: Luis Chamberlain <[email protected]>
Date: Tue, 4 Apr 2023 18:52:47 -0700
Subject: [PATCH] module: fix kmemleak annotations for non init ELF sections

Commit ac3b43283923 ("module: replace module_layout with module_memory")
reworked the way to handle memory allocations to make it clearer. But it
lost in translation how we handle kmemleak_ignore() or kmemleak_not_leak()
for these sections.

Fix this and clarify the comments a bit more.

Fixes: ac3b43283923 ("module: replace module_layout with module_memory")
Reported-by: Jim Cromie <[email protected]>
Signed-off-by: Luis Chamberlain <[email protected]>
---
kernel/module/main.c | 20 ++++++++++++++++----
1 file changed, 16 insertions(+), 4 deletions(-)

diff --git a/kernel/module/main.c b/kernel/module/main.c
index 5cc21083af04..fe0f3b8fd3a8 100644
--- a/kernel/module/main.c
+++ b/kernel/module/main.c
@@ -2233,11 +2233,23 @@ static int move_module(struct module *mod, struct load_info *info)
ptr = module_memory_alloc(mod->mem[type].size, type);

/*
- * The pointer to this block is stored in the module structure
- * which is inside the block. Just mark it as not being a
- * leak.
+ * The pointer to these blocks of memory are stored on the module
+ * structure and we keep that around so long as the module is
+ * around. We only free that memory when we unload the module.
+ * Just mark them as not being a leak then. The .init* ELF
+ * sections *do* get freed after boot so we treat them slightly
+ * differently and only grey them out as they work as typical
+ * memory allocations which *do* get eventually get freed.
*/
- kmemleak_ignore(ptr);
+ switch (type) {
+ case MOD_INIT_TEXT: /* fallthrough */
+ case MOD_INIT_DATA: /* fallthrough */
+ case MOD_INIT_RODATA: /* fallthrough */
+ kmemleak_ignore(ptr);
+ break;
+ default:
+ kmemleak_not_leak(ptr);
+ }
if (!ptr) {
t = type;
goto out_enomem;
--
2.39.2

2023-04-06 03:28:31

by Jim Cromie

[permalink] [raw]
Subject: Re: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

On Tue, Apr 4, 2023 at 8:01 PM Luis Chamberlain <[email protected]> wrote:
>
> On Tue, Apr 04, 2023 at 07:38:41PM -0600, [email protected] wrote:
> > On Mon, Apr 3, 2023 at 2:44 PM Luis Chamberlain <[email protected]> wrote:
> > >
> > > On Fri, Mar 31, 2023 at 05:27:04PM -0700, Song Liu wrote:
> > > > On Fri, Mar 31, 2023 at 12:00 AM Luis Chamberlain <[email protected]> wrote:
> > > > >
> > > > > On Thu, Mar 30, 2023 at 04:45:43PM -0600, [email protected] wrote:
> > > > > > hi Luis, etal
> > > > > >
> > > > > > kmemleak is reporting 19 leaks during boot
> > > > > >
> > > > > > because the hexdumps appeared to have module-names,
> > > > > > and Ive been hacking nearby, and see the same names
> > > > > > every time I boot my test-vm, I needed a clearer picture
> > > > > > Jason corroborated and bisected.
> > > > > >
> > > > > > the 19 leaks split into 2 groups,
> > > > > > 9 with names of builtin modules in the hexdump,
> > > > > > all with the same backtrace
> > > > > > 9 without module-names (with a shared backtrace)
> > > > > > +1 wo name-ish and a separate backtrace
> > > > >
> > > > > Song, please take a look.
> > > >
> > > > I will look into this next week.
> > >
> > > I'm thinking this may be it, at least this gets us to what we used to do
> > > as per original Catalinas' 4f2294b6dc88d ("kmemleak: Add modules
> > > support") and right before Song's patch.
> > >
> > > diff --git a/kernel/module/main.c b/kernel/module/main.c
> > > index 6b6da80f363f..3b9c71fa6096 100644
> > > --- a/kernel/module/main.c
> > > +++ b/kernel/module/main.c
> > > @@ -2240,7 +2240,10 @@ static int move_module(struct module *mod, struct load_info *info)
> > > * which is inside the block. Just mark it as not being a
> > > * leak.
> > > */
> > > - kmemleak_ignore(ptr);
> > > + if (type == MOD_INIT_TEXT)
> > > + kmemleak_ignore(ptr);
> > > + else
> > > + kmemleak_not_leak(ptr);
> > > if (!ptr) {
> > > t = type;
> > > goto out_enomem;
> > >
> > > We used to use the grey area for the TEXT but the original commit
> > > doesn't explain too well why we grey out init but not the others. Ie
> > > why kmemleak_ignore() on init and kmemleak_not_leak() on the others.
> > >
> > > Catalinas, any thoughts / suggestions? Should we just stick to
> > > kmemleak_not_leak() for both now?
> > >
> > > Luis
> >
> > So I have mixed results.
> >
> > your patch fixed the 19 leaks on my worktree / branch where I found them.
> >
> > on top of
> > ac3b43283923 module: replace module_layout with module_memory
> >
> > it fixed the (same) 19, but gets a few new ones.
> > whats weird is that once they report, they disappear from
> > /sys/kernel/debug/kmemleak

this disappearing act is still going on.
my script issues no echo clear > kmemleak


>
> I think I missed the MOD_INIT_DATA and MOD_INIT_RODATA. Can you try the
> patch below instead:
>
> From 6890bd43866c40e1b58a832361812cdc5d965e4c Mon Sep 17 00:00:00 2001
> From: Luis Chamberlain <[email protected]>
> Date: Tue, 4 Apr 2023 18:52:47 -0700
> Subject: [PATCH] module: fix kmemleak annotations for non init ELF sections
>
> Commit ac3b43283923 ("module: replace module_layout with module_memory")
> reworked the way to handle memory allocations to make it clearer. But it
> lost in translation how we handle kmemleak_ignore() or kmemleak_not_leak()
> for these sections.
>
> Fix this and clarify the comments a bit more.
>
> Fixes: ac3b43283923 ("module: replace module_layout with module_memory")
> Reported-by: Jim Cromie <[email protected]>
> Signed-off-by: Luis Chamberlain <[email protected]>
> ---
> kernel/module/main.c | 20 ++++++++++++++++----
> 1 file changed, 16 insertions(+), 4 deletions(-)
>
> diff --git a/kernel/module/main.c b/kernel/module/main.c
> index 5cc21083af04..fe0f3b8fd3a8 100644
> --- a/kernel/module/main.c
> +++ b/kernel/module/main.c
> @@ -2233,11 +2233,23 @@ static int move_module(struct module *mod, struct load_info *info)
> ptr = module_memory_alloc(mod->mem[type].size, type);
>
> /*
> - * The pointer to this block is stored in the module structure
> - * which is inside the block. Just mark it as not being a
> - * leak.
> + * The pointer to these blocks of memory are stored on the module
> + * structure and we keep that around so long as the module is
> + * around. We only free that memory when we unload the module.
> + * Just mark them as not being a leak then. The .init* ELF
> + * sections *do* get freed after boot so we treat them slightly
> + * differently and only grey them out as they work as typical
> + * memory allocations which *do* get eventually get freed.
> */
> - kmemleak_ignore(ptr);
> + switch (type) {
> + case MOD_INIT_TEXT: /* fallthrough */
> + case MOD_INIT_DATA: /* fallthrough */
> + case MOD_INIT_RODATA: /* fallthrough */
> + kmemleak_ignore(ptr);
> + break;
> + default:
> + kmemleak_not_leak(ptr);
> + }
> if (!ptr) {
> t = type;
> goto out_enomem;
> --
> 2.39.2
>

sorry for the delay, I was seeing heisen-responses, and several BUGs.
a make clean seems to have settled things mostly.
But in case theres any clues in there, Ive kept the paste-in of 2 BUGs

with
f23cd1ffca4b (HEAD) kmemleaks on ac3b43283923 ("module: replace
module_layout with module_memory")
ac3b43283923 module: replace module_layout with module_memory

heres the 1st run. cuz it leaked, I reran in another vm, which got
different results.
I left it overnight doing nothing (laptop slept, vm with it),
and it BUG'd on a soft lockup
(much later, but the leaktrace does have a timerfd in it)
R11 looks poisoned.

[ 49.994743] kmemleak: 1 new suspected memory leaks (see
/sys/kernel/debug/kmemleak)
finished pass 6
unreferenced object 0xffff888006fe8600 (size 256):
comm "(udev-worker)", pid 422, jiffies 4294711998 (age 5.223s)
hex dump (first 32 bytes):
00 86 fe 06 80 88 ff ff 00 00 00 00 00 00 00 00 ................
00 00 00 00 00 00 00 00 70 92 a0 80 18 00 00 00 ........p.......
backtrace:
[<00000000da294bc2>] kmalloc_trace+0x26/0x90
[<00000000af593495>] __do_sys_timerfd_create+0x58/0x190
[<0000000044a7da2f>] do_syscall_64+0x34/0x80
[<0000000013b2114c>] entry_SYSCALL_64_after_hwframe+0x46/0xb0
[ 49.994743] kmemleak: 1 new suspected memory leaks (see
/sys/kernel/debug/kmemleak)
Linux (none) 6.3.0-rc1-f2-00002-gf23cd1ffca4b #361 SMP PREEMPT_DYNAMIC
Tue Apr 4 20:05:45 MDT 2023 x86_64 x86_64 x86_64 GNU/Linux
:#> cat /sys/kernel/debug/kmemleak
:#>:#> [16686.317671] watchdog: BUG: soft lockup - CPU#1 stuck for
85s! [kworker/1:0:707]
[16686.578795] Modules linked in: crc32_pclmul(E) intel_rapl_common(E)
ghash_clmulni_intel(E) crct10dif_pclmul(E) crc32c_intel(E) pcspkr(E)
serio_raw(E) i2c_piix4(E) [last unloaded: drm(E)]
[16686.579866] CPU: 1 PID: 707 Comm: kworker/1:0 Tainted: G
E 6.3.0-rc1-f2-00002-gf23cd1ffca4b #361
[16686.580479] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996),
BIOS 1.16.2-1.fc37 04/01/2014
[16686.580988] Workqueue: ata_sff ata_sff_pio_task
[16686.581217] RIP: 0010:_raw_spin_unlock_irq+0x11/0x30
[16686.581531] Code: 05 fc b3 23 7e 85 c0 74 01 c3 0f 1f 44 00 00 c3
66 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 e8 46 00 00 00 90 fb 0f 1f
44 00 00 <bf> 01 00 00 00 e8 55 c1 3b ff 65 8b 05 c6 b3 23 7e 85 c0 74
01 c3
[16686.582594] RSP: 0018:ffffc9000042fe90 EFLAGS: 00000246
[16686.582899] RAX: 0000000000000000 RBX: ffff888005dcc0b8 RCX: 0000000000000000
[16686.583424] RDX: 0000000000010376 RSI: ffff888005dcc17c RDI: ffff888007b8fe40
[16686.583900] RBP: ffff88807dcb2100 R08: ff6565725e607360 R09: 0000000000000001
[16686.584241] R10: 0000000000000058 R11: fefefefefefefeff R12: ffff88807dcbb500
[16686.584570] R13: 0000000000000000 R14: ffff888008bb20c0 R15: ffff888005dcc0c0
[16686.584891] FS: 0000000000000000(0000) GS:ffff88807dc80000(0000)
knlGS:0000000000000000
[16686.585252] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[16686.585596] CR2: 000055bbcb66c440 CR3: 000000000938d000 CR4: 0000000000750ee0
[16686.585979] PKRU: 55555554
[16686.586156] Call Trace:
[16686.586335] <TASK>
[16686.586490] process_one_work+0x1c3/0x3c0
[16686.586824] worker_thread+0x4d/0x380
[16686.587073] ? _raw_spin_lock_irqsave+0x23/0x50
[16686.587380] ? rescuer_thread+0x370/0x370
[16686.587661] kthread+0xe6/0x110
[16686.587881] ? kthread_complete_and_exit+0x20/0x20
[16686.588221] ret_from_fork+0x1f/0x30
[16686.588460] </TASK>

using sh-script posted previously,

2nd run went like this:
the kmemleak modprobe traces look unchanged from last report,
but new rmmod leaks are seen here.
And kmemleak file is empty after the report.
It also BUG'd later on a soft lockup

finished pass 15
Linux (none) 6.3.0-rc1-f2-00002-gf23cd1ffca4b #361 SMP PREEMPT_DYNAMIC
Tue Apr 4 20:05:45 MDT 2023 x86_64 x86_64 x86_64 GNU/Linux
[ 2030.329359] ACPI: bus type drm_connector registered
[ 2030.686297] AMD-Vi: AMD IOMMUv2 functionality not available on this
system - This is not a bug.
[ 2031.600726] [drm] amdgpu kernel modesetting enabled.
[ 2031.601205] amdgpu: CRAT table not found
[ 2031.601403] amdgpu: Virtual CRAT table created for CPU
[ 2031.601797] amdgpu: Topology: Add CPU node

real 0m1.725s
user 0m0.000s
sys 0m0.956s
rmmod: ERROR: Module intel_rapl_msr is not currently loaded
[ 2032.328701] ACPI: bus type drm_connector unregistered
[ 2032.504633] kmemleak: 8 new suspected memory leaks (see
/sys/kernel/debug/kmemleak)
finished pass 16
unreferenced object 0xffff88800536eb40 (size 192):
comm "modprobe", pid 927, jiffies 4296693079 (age 6.651s)
hex dump (first 32 bytes):
00 5b 50 c0 ff ff ff ff 00 00 00 00 00 00 00 00 .[P.............
00 00 00 00 00 00 00 00 ea ff ff ff ff ff ff ff ................
backtrace:
[<0000000097c7da82>] __kmalloc+0x49/0x150
[<000000003bcf1708>] __register_sysctl_table+0x51/0x7f0
[<00000000c0b7f00a>] 0xffffffffc04f2a78
[<000000009ea66960>] 0xffffffffc066001b
[<000000001412bcff>] do_one_initcall+0x43/0x210
[<00000000c116532d>] do_init_module+0x60/0x240
[<000000001f641d01>] __do_sys_finit_module+0x93/0xf0
[<0000000034507d8b>] do_syscall_64+0x34/0x80
[<0000000087a8ea8c>] entry_SYSCALL_64_after_hwframe+0x46/0xb0
unreferenced object 0xffff888009e3b800 (size 256):
comm "modprobe", pid 927, jiffies 4296693079 (age 6.651s)
hex dump (first 32 bytes):
78 b8 e3 09 80 88 ff ff 00 00 00 00 00 00 00 00 x...............
00 00 00 00 00 00 00 00 ea ff ff ff ff ff ff ff ................
backtrace:
[<0000000097c7da82>] __kmalloc+0x49/0x150
[<00000000a17c20b9>] __register_sysctl_table+0x569/0x7f0
[<00000000c0b7f00a>] 0xffffffffc04f2a78
[<000000009ea66960>] 0xffffffffc066001b
[<000000001412bcff>] do_one_initcall+0x43/0x210
[<00000000c116532d>] do_init_module+0x60/0x240
[<000000001f641d01>] __do_sys_finit_module+0x93/0xf0
[<0000000034507d8b>] do_syscall_64+0x34/0x80
[<0000000087a8ea8c>] entry_SYSCALL_64_after_hwframe+0x46/0xb0
unreferenced object 0xffff888021c363c0 (size 96):
comm "rmmod", pid 947, jiffies 4296694434 (age 5.296s)
hex dump (first 32 bytes):
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
backtrace:
[<0000000085923eeb>] kmalloc_trace+0x26/0x90
[<00000000920b2848>] kernfs_fop_open+0x30c/0x390
[<0000000078b60e3b>] do_dentry_open+0x1de/0x410
[<00000000140ea377>] path_openat+0xaa0/0x10a0
[<000000008bcf35a2>] do_filp_open+0xa1/0x130
[<00000000404bfb4b>] do_sys_openat2+0x74/0x130
[<000000009fd3d965>] __x64_sys_openat+0x5c/0x70
[<0000000034507d8b>] do_syscall_64+0x34/0x80
[<0000000087a8ea8c>] entry_SYSCALL_64_after_hwframe+0x46/0xb0
unreferenced object 0xffff888021c36540 (size 96):
comm "rmmod", pid 947, jiffies 4296694434 (age 5.296s)
hex dump (first 32 bytes):
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
backtrace:
[<0000000085923eeb>] kmalloc_trace+0x26/0x90
[<00000000920b2848>] kernfs_fop_open+0x30c/0x390
[<0000000078b60e3b>] do_dentry_open+0x1de/0x410
[<00000000140ea377>] path_openat+0xaa0/0x10a0
[<000000008bcf35a2>] do_filp_open+0xa1/0x130
[<00000000404bfb4b>] do_sys_openat2+0x74/0x130
[<000000009fd3d965>] __x64_sys_openat+0x5c/0x70
[<0000000034507d8b>] do_syscall_64+0x34/0x80
[<0000000087a8ea8c>] entry_SYSCALL_64_after_hwframe+0x46/0xb0
unreferenced object 0xffff888021c36420 (size 96):
comm "rmmod", pid 948, jiffies 4296694456 (age 5.274s)
hex dump (first 32 bytes):
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
backtrace:
[<0000000085923eeb>] kmalloc_trace+0x26/0x90
[<00000000920b2848>] kernfs_fop_open+0x30c/0x390
[<0000000078b60e3b>] do_dentry_open+0x1de/0x410
[<00000000140ea377>] path_openat+0xaa0/0x10a0
[<000000008bcf35a2>] do_filp_open+0xa1/0x130
[<00000000404bfb4b>] do_sys_openat2+0x74/0x130
[<000000009fd3d965>] __x64_sys_openat+0x5c/0x70
[<0000000034507d8b>] do_syscall_64+0x34/0x80
[<0000000087a8ea8c>] entry_SYSCALL_64_after_hwframe+0x46/0xb0
unreferenced object 0xffff888021c368a0 (size 96):
comm "rmmod", pid 950, jiffies 4296694518 (age 5.240s)
hex dump (first 32 bytes):
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
backtrace:
[<0000000085923eeb>] kmalloc_trace+0x26/0x90
[<00000000920b2848>] kernfs_fop_open+0x30c/0x390
[<0000000078b60e3b>] do_dentry_open+0x1de/0x410
[<00000000140ea377>] path_openat+0xaa0/0x10a0
[<000000008bcf35a2>] do_filp_open+0xa1/0x130
[<00000000404bfb4b>] do_sys_openat2+0x74/0x130
[<000000009fd3d965>] __x64_sys_openat+0x5c/0x70
[<0000000034507d8b>] do_syscall_64+0x34/0x80
[<0000000087a8ea8c>] entry_SYSCALL_64_after_hwframe+0x46/0xb0
unreferenced object 0xffff888021c36ae0 (size 96):
comm "rmmod", pid 950, jiffies 4296694519 (age 5.239s)
hex dump (first 32 bytes):
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
backtrace:
[<0000000085923eeb>] kmalloc_trace+0x26/0x90
[<00000000920b2848>] kernfs_fop_open+0x30c/0x390
[<0000000078b60e3b>] do_dentry_open+0x1de/0x410
[<00000000140ea377>] path_openat+0xaa0/0x10a0
[<000000008bcf35a2>] do_filp_open+0xa1/0x130
[<00000000404bfb4b>] do_sys_openat2+0x74/0x130
[<000000009fd3d965>] __x64_sys_openat+0x5c/0x70
[<0000000034507d8b>] do_syscall_64+0x34/0x80
[<0000000087a8ea8c>] entry_SYSCALL_64_after_hwframe+0x46/0xb0
[ 2032.504633] kmemleak: 8 new suspected memory leaks (see
/sys/kernel/debug/kmemleak)
Linux (none) 6.3.0-rc1-f2-00002-gf23cd1ffca4b #361 SMP PREEMPT_DYNAMIC
Tue Apr 4 20:05:45 MDT 2023 x86_64 x86_64 x86_64 GNU/Linux

/sys/kernel/debug/kmemleak is empty, so my reader script gets nothing

:#> ./grok_kmemleak -ag
all: bless( {
'backtraces' => {},
'hexdumps' => {},
'users' => {}
}, 'LeakSet' )
mods: bless( {
'backtraces' => {},
'hexdumps' => {},
'users' => {}
}, 'LeakSet' )
:#> [17772.014943] hrtimer: interrupt took 4628209 ns
[17807.172581] rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
[18017.793102] watchdog: BUG: soft lockup - CPU#2 stuck for 253s!
[kworker/2:1:420]
[18017.793110] Modules linked in: crc32_pclmul(E) intel_rapl_common(E)
ghash_clmulni_intel(E) crct10dif_pclmul(E) crc32c_intel(E) pcspkr(E)
serio_raw(E) i2c_piix4(E) [last unloaded: drm(E)]
[18017.793173] CPU: 2 PID: 420 Comm: kworker/2:1 Tainted: G
E 6.3.0-rc1-f2-00002-gf23cd1ffca4b #361
[18017.793176] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996),
BIOS 1.16.2-1.fc37 04/01/2014
[18017.793179] Workqueue: events_freezable_power_ disk_events_workfn
[18017.793240] RIP: 0010:_raw_spin_unlock_irqrestore+0x19/0x40
[18017.793266] Code: 04 31 c0 5b c3 0f 1f 44 00 00 31 c0 eb f5 0f 1f
00 0f 1f 44 00 00 e8 f6 05 00 00 90 f7 c6 00 02 00 00 74 06 fb 0f 1f
44 00 00 <bf> 01 00 00 00 e8 fd c6 3b ff 65 8b 05 6e b9 23 7e 85 c0 74
01 c3
[18017.793268] RSP: 0018:ffffc9000058bb90 EFLAGS: 00000206
[18017.793269] RAX: 0000000000000001 RBX: 0000000000000000 RCX: 0000000000000000
[18017.793270] RDX: 0000000000000000 RSI: 0000000000000293 RDI: ffff888007f99900
[18017.793270] RBP: 0000000000000293 R08: 0000000000000001 R09: ffffffff82c6c380
[18017.793271] R10: 0000000000000000 R11: 0000000000000000 R12: ffff888006b88000
[18017.793271] R13: ffff888006933000 R14: ffff888006914000 R15: 0000000000000000
[18017.793274] FS: 0000000000000000(0000) GS:ffff88807dd00000(0000)
knlGS:0000000000000000
[18017.793275] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[18017.793276] CR2: 000055fa79d48440 CR3: 0000000009f97000 CR4: 0000000000750ee0
[18017.793282] PKRU: 55555554
[18017.793294] Call Trace:
[18017.793312] <TASK>
[18017.793313] ata_scsi_queuecmd+0x4f/0x70
[18017.793339] scsi_queue_rq+0x36d/0xc20
[18017.793345] blk_mq_dispatch_rq_list+0x2ab/0x830
[18017.793355] ? _raw_spin_lock_irqsave+0x23/0x50
[18017.793358] __blk_mq_sched_dispatch_requests+0x9d/0x120
[18017.793363] blk_mq_sched_dispatch_requests+0x30/0x60
[18017.793365] __blk_mq_run_hw_queue+0x85/0xa0
[18017.793366] blk_execute_rq+0x9e/0x190
[18017.793368] scsi_execute_cmd+0xfd/0x2d0
[18017.793370] sr_check_events+0xc1/0x2b0
[18017.793374] ? finish_task_switch.isra.0+0x9b/0x2f0
[18017.793394] cdrom_check_events+0x14/0x30
[18017.793403] disk_check_events+0x34/0xf0
[18017.793405] process_one_work+0x1c3/0x3c0
[18017.793408] worker_thread+0x4d/0x380
[18017.793409] ? _raw_spin_lock_irqsave+0x23/0x50
[18017.793411] ? rescuer_thread+0x370/0x370
[18017.793412] kthread+0xe6/0x110
[18017.793415] ? kthread_complete_and_exit+0x20/0x20
[18017.793417] ret_from_fork+0x1f/0x30
[18017.793428] </TASK>
[18020.504528] rcu: 0-...!: (1 GPs behind) idle=753c/0/0x1
softirq=100444/100444 fqs=470
[18020.504962] rcu: (detected by 0, t=273814 jiffies, g=67733, q=11 ncpus=3)
[18020.505425] CPU: 0 PID: 0 Comm: swapper/0 Tainted: G EL
6.3.0-rc1-f2-00002-gf23cd1ffca4b #361
[18020.506002] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996),
BIOS 1.16.2-1.fc37 04/01/2014
[18020.506499] RIP: 0010:pv_native_safe_halt+0xb/0x10
[18020.506848] Code: c3 0f 23 f6 c3 0f 0b 0f 1f 84 00 00 00 00 00 0f
0b 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 eb 07 0f 00 2d 6f 69 24
00 fb f4 <c3> cc cc cc cc 8b 17 48 89 fe 89 d7 83 e7 fe 0f 01 f9 66 90
48 c1
[18020.508055] RSP: 0018:ffffffff82c03e98 EFLAGS: 00000282
[18020.508376] RAX: 0000000000000000 RBX: ffffffff82c0f2c0 RCX: 0000000000000001
[18020.508824] RDX: 4000000000000000 RSI: ffffffff824c92ca RDI: ffffffff82486ff1
[18020.509199] RBP: 0000000000000000 R08: 0000000000000001 R09: 0000000000000000
[18020.509532] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[18020.509897] R13: 0000000000000000 R14: ffffffff82c0ea10 R15: 0000000000000000
[18020.510375] FS: 0000000000000000(0000) GS:ffff88807dc00000(0000)
knlGS:0000000000000000
[18020.510901] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[18020.511251] CR2: 00007f1773c1e738 CR3: 0000000002c32000 CR4: 0000000000750ef0
[18020.511705] PKRU: 55555554
[18020.511884] Call Trace:
[18020.512076] <TASK>
[18020.512183] default_idle+0x5/0x10
[18020.512353] default_idle_call+0x26/0xd0
[18020.512608] do_idle+0x1cb/0x220
[18020.512832] cpu_startup_entry+0x19/0x20
[18020.513098] rest_init+0xcb/0xd0
[18020.513261] arch_call_rest_init+0xa/0x20
[18020.513477] start_kernel+0x734/0xb40
[18020.513653] ? load_ucode_bsp+0x68/0x180
[18020.513847] secondary_startup_64_no_verify+0xe5/0xeb
[18020.514089] </TASK>

:#>


Im not sure when I did the make clean, maybe here.
it'd be a 'clean' explanation of the BUG struff.
I havent seen any today

so with some minor tweaks to my mod-load-unload leak_drive,
(to use different modules as workload, in case anything tickles different)

cycling on pcspkr, I get

doing cycle_ pcspkr
[ 65.163494] input: PC Speaker as /devices/platform/pcspkr/input/input24
[ 65.335487] kmemleak: 1 new suspected memory leaks (see
/sys/kernel/debug/kmemleak)
finished pass 21
unreferenced object 0xffff8880097a1120 (size 96):
comm "bash", pid 412, jiffies 4294727350 (age 5.240s)
hex dump (first 32 bytes):
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
backtrace:
[<000000006a558d73>] kmalloc_trace+0x26/0x90
[<0000000010830fb6>] kernfs_fop_open+0x30c/0x390
[<000000002c6acb11>] do_dentry_open+0x1de/0x410
[<00000000debed23e>] path_openat+0xaa0/0x10a0
[<00000000f619d9cf>] do_filp_open+0xa1/0x130
[<00000000b6a4c64d>] do_sys_openat2+0x74/0x130
[<0000000007cd46d3>] __x64_sys_openat+0x5c/0x70
[<0000000062074f8d>] do_syscall_64+0x34/0x80
[<0000000044b0764d>] entry_SYSCALL_64_after_hwframe+0x46/0xb0
[ 65.335487] kmemleak: 1 new suspected memory leaks (see
/sys/kernel/debug/kmemleak)
Linux (none) 6.3.0-rc1-f2-00002-gf23cd1ffca4b #362 SMP PREEMPT_DYNAMIC
Wed Apr 5 09:26:03 MDT 2023 x86_64 x86_64 x86_64 GNU/Linux
:#>

that leaktrace repeated verbatim when cycling on test_dynamic_debug

the leaktrace cycling drm modules is like I posted previously,
but repeated here, since its a new patch

finished pass 4
Linux (none) 6.3.0-rc1-f2-00002-gf23cd1ffca4b #362 SMP PREEMPT_DYNAMIC
Wed Apr 5 09:26:03 MDT 2023 x86_64 x86_64 x86_64 GNU/Linux
[ 305.519903] ACPI: bus type drm_connector registered
[ 305.768467] AMD-Vi: AMD IOMMUv2 functionality not available on this
system - This is not a bug.
[ 306.487182] [drm] amdgpu kernel modesetting enabled.
[ 306.487838] amdgpu: CRAT table not found
[ 306.488124] amdgpu: Virtual CRAT table created for CPU
[ 306.488585] amdgpu: Topology: Add CPU node

real 0m1.258s
user 0m0.002s
sys 0m0.876s
rmmod: ERROR: Module intel_rapl_msr is not currently loaded
[ 307.066093] ACPI: bus type drm_connector unregistered
[ 307.215619] kmemleak: 2 new suspected memory leaks (see
/sys/kernel/debug/kmemleak)
finished pass 5
unreferenced object 0xffff888007f87180 (size 192):
comm "modprobe", pid 504, jiffies 4294969152 (age 5.317s)
hex dump (first 32 bytes):
00 fb 50 c0 ff ff ff ff 00 00 00 00 00 00 00 00 ..P.............
00 00 00 00 00 00 00 00 ea ff ff ff ff ff ff ff ................
backtrace:
[<00000000f63d45f6>] __kmalloc+0x49/0x150
[<00000000f918d102>] __register_sysctl_table+0x51/0x7f0
[<000000002fbfef58>] 0xffffffffc04fca78
[<000000001dcc6780>] 0xffffffffc038601b
[<00000000d82d83ab>] do_one_initcall+0x43/0x210
[<000000002ef9c020>] do_init_module+0x60/0x240
[<00000000859f64f2>] __do_sys_finit_module+0x93/0xf0
[<000000006b72a46f>] do_syscall_64+0x34/0x80
[<000000009dda0f8e>] entry_SYSCALL_64_after_hwframe+0x46/0xb0
unreferenced object 0xffff888008a3c800 (size 256):
comm "modprobe", pid 504, jiffies 4294969152 (age 5.317s)
hex dump (first 32 bytes):
78 c8 a3 08 80 88 ff ff 00 00 00 00 00 00 00 00 x...............
00 00 00 00 00 00 00 00 ea ff ff ff ff ff ff ff ................
backtrace:
[<00000000f63d45f6>] __kmalloc+0x49/0x150
[<000000001a756730>] __register_sysctl_table+0x569/0x7f0
[<000000002fbfef58>] 0xffffffffc04fca78
[<000000001dcc6780>] 0xffffffffc038601b
[<00000000d82d83ab>] do_one_initcall+0x43/0x210
[<000000002ef9c020>] do_init_module+0x60/0x240
[<00000000859f64f2>] __do_sys_finit_module+0x93/0xf0
[<000000006b72a46f>] do_syscall_64+0x34/0x80
[<000000009dda0f8e>] entry_SYSCALL_64_after_hwframe+0x46/0xb0
[ 307.215619] kmemleak: 2 new suspected memory leaks (see
/sys/kernel/debug/kmemleak)
Linux (none) 6.3.0-rc1-f2-00002-gf23cd1ffca4b #362 SMP PREEMPT_DYNAMIC
Wed Apr 5 09:26:03 MDT 2023 x86_64 x86_64 x86_64 GNU/Linux

expanding those traces (cuz I scripted it)

:#> ./grok_kmemleak -agp
paste in your selected input:
- ie pasted from above
unreferenced object 0xffff888007f87180 (size 192):
comm "modprobe", pid 504, jiffies 4294969152 (age 5.317s)
hex dump (first 32 bytes):
00 fb 50 c0 ff ff ff ff 00 00 00 00 00 00 00 00 ..P.............
00 00 00 00 00 00 00 00 ea ff ff ff ff ff ff ff ................
backtrace:
[<00000000f63d45f6>] __kmalloc+0x49/0x150
[<00000000f918d102>] __register_sysctl_table+0x51/0x7f0
[<000000002fbfef58>] 0xffffffffc04fca78
[<000000001dcc6780>] 0xffffffffc038601b
[<00000000d82d83ab>] do_one_initcall+0x43/0x210
[<000000002ef9c020>] do_init_module+0x60/0x240
[<00000000859f64f2>] __do_sys_finit_module+0x93/0xf0
[<000000006b72a46f>] do_syscall_64+0x34/0x80
[<000000009dda0f8e>] entry_SYSCALL_64_after_hwframe+0x46/0xb0
unreferenced object 0xffff888008a3c800 (size 256):
comm "modprobe", pid 504, jiffies 4294969152 (age 5.317s)
hex dump (first 32 bytes):
78 c8 a3 08 80 88 ff ff 00 00 00 00 00 00 00 00 x...............
00 00 00 00 00 00 00 00 ea ff ff ff ff ff ff ff ................
backtrace:
[<00000000f63d45f6>] __kmalloc+0x49/0x150
[<000000001a756730>] __register_sysctl_table+0x569/0x7f0
[<000000002fbfef58>] 0xffffffffc04fca78
[<000000001dcc6780>] 0xffffffffc038601b
[<00000000d82d83ab>] do_one_initcall+0x43/0x210
[<000000002ef9c020>] do_init_module+0x60/0x240
[<00000000859f64f2>] __do_sys_finit_module+0x93/0xf0
[<000000006b72a46f>] do_syscall_64+0x34/0x80
[<000000009dda0f8e>] entry_SYSCALL_64_after_hwframe+0x46/0xb0
all: bless( {
'backtraces' => {
'[<00000000f63d45f6>] __kmalloc+0x49/0x150
[<000000001a756730>] __register_sysctl_table+0x569/0x7f0
[<000000002fbfef58>] 0xffffffffc04fca78
[<000000001dcc6780>] 0xffffffffc038601b
[<00000000d82d83ab>] do_one_initcall+0x43/0x210
[<000000002ef9c020>] do_init_module+0x60/0x240
[<00000000859f64f2>] __do_sys_finit_module+0x93/0xf0
[<000000006b72a46f>] do_syscall_64+0x34/0x80
[<000000009dda0f8e>] entry_SYSCALL_64_after_hwframe+0x46/0xb0' => 1,
'[<00000000f63d45f6>] __kmalloc+0x49/0x150
[<00000000f918d102>] __register_sysctl_table+0x51/0x7f0
[<000000002fbfef58>] 0xffffffffc04fca78
[<000000001dcc6780>] 0xffffffffc038601b
[<00000000d82d83ab>] do_one_initcall+0x43/0x210
[<000000002ef9c020>] do_init_module+0x60/0x240
[<00000000859f64f2>] __do_sys_finit_module+0x93/0xf0
[<000000006b72a46f>] do_syscall_64+0x34/0x80
[<000000009dda0f8e>] entry_SYSCALL_64_after_hwframe+0x46/0xb0' => 1
},
'hexdumps' => {
'00 fb 50 c0 ff ff ff ff 00 00 00 00 00 00 00 00 ..P.............' => 1,
'78 c8 a3 08 80 88 ff ff 00 00 00 00 00 00 00 00 x...............' => 1
},
'users' => {
'comm "modprobe", pid 504,' => 2
}
}, 'LeakSet' )
# this leak backtrace has 1 occurrences
[<00000000f63d45f6>] __kmalloc+0x49/0x150
[<00000000f918d102>] __register_sysctl_table+0x51/0x7f0
[<000000002fbfef58>] 0xffffffffc04fca78
[<000000001dcc6780>] 0xffffffffc038601b
[<00000000d82d83ab>] do_one_initcall+0x43/0x210
[<000000002ef9c020>] do_init_module+0x60/0x240
[<00000000859f64f2>] __do_sys_finit_module+0x93/0xf0
[<000000006b72a46f>] do_syscall_64+0x34/0x80
[<000000009dda0f8e>] entry_SYSCALL_64_after_hwframe+0x46/0xb0
doing: gdb -q -ex 'l *(__kmalloc+0x49)' -ex quit wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
0xffffffff81386319 is in __kmalloc
(/home/jimc/projects/lx/wk-next/mm/slab_common.c:966).
961 s = kmalloc_slab(size, flags);
962
963 if (unlikely(ZERO_OR_NULL_PTR(s)))
964 return s;
965
966 ret = __kmem_cache_alloc_node(s, flags, node, size, caller);
967 ret = kasan_kmalloc(s, ret, size, flags);
968 trace_kmalloc(caller, ret, size, s->size, flags, node);
969 return ret;
970 }
doing: gdb -q -ex 'l *(__register_sysctl_table+0x51)' -ex quit
wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
0xffffffff814e5991 is in __register_sysctl_table
(/home/jimc/projects/lx/wk-next/include/linux/slab.h:584).
579 index = kmalloc_index(size);
580 return kmalloc_trace(
581 kmalloc_caches[kmalloc_type(flags)][index],
582 flags, size);
583 }
584 return __kmalloc(size, flags);
585 }
586 #else
587 static __always_inline __alloc_size(1) void *kmalloc(size_t size,
gfp_t flags)
588 {
doing: gdb -q -ex 'l *(0xffffffffc04fca78)' -ex quit wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
doing: gdb -q -ex 'l *(0xffffffffc038601b)' -ex quit wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
doing: gdb -q -ex 'l *(do_one_initcall+0x43)' -ex quit
wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
0xffffffff81001093 is in do_one_initcall
(/home/jimc/projects/lx/wk-next/init/main.c:1306).
1301
1302 if (initcall_blacklisted(fn))
1303 return -EPERM;
1304
1305 do_trace_initcall_start(fn);
1306 ret = fn();
1307 do_trace_initcall_finish(fn, ret);
1308
1309 msgbuf[0] = 0;
1310
doing: gdb -q -ex 'l *(do_init_module+0x60)' -ex quit wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
0xffffffff812287c0 is in do_init_module
(/home/jimc/projects/lx/wk-next/kernel/module/main.c:2481).
2476 freeinit->init_rodata = mod->mem[MOD_INIT_RODATA].base;
2477
2478 do_mod_ctors(mod);
2479 /* Start the module */
2480 if (mod->init != NULL)
2481 ret = do_one_initcall(mod->init);
2482 if (ret < 0) {
2483 goto fail_free_freeinit;
2484 }
2485 if (ret > 0) {
doing: gdb -q -ex 'l *(__do_sys_finit_module+0x93)' -ex quit
wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
0xffffffff8122af73 is in __do_sys_finit_module
(/home/jimc/projects/lx/wk-next/kernel/module/main.c:2987).
2982 } else {
2983 info.hdr = buf;
2984 info.len = len;
2985 }
2986
2987 return load_module(&info, uargs, flags);
2988 }
2989
2990 static inline int within(unsigned long addr, void *start,
unsigned long size)
2991 {
doing: gdb -q -ex 'l *(do_syscall_64+0x34)' -ex quit wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
0xffffffff81ddad64 is in do_syscall_64
(/home/jimc/projects/lx/wk-next/arch/x86/entry/common.c:50).
45 */
46 unsigned int unr = nr;
47
48 if (likely(unr < NR_syscalls)) {
49 unr = array_index_nospec(unr, NR_syscalls);
50 regs->ax = sys_call_table[unr](regs);
51 return true;
52 }
53 return false;
54 }
doing: gdb -q -ex 'l *(entry_SYSCALL_64_after_hwframe+0x46)' -ex quit
wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
0xffffffff81e0006a is at
/home/jimc/projects/lx/wk-next/arch/x86/entry/entry_64.S:120.
115
116 /* clobbers %rax, make sure it is after saving the syscall nr */
117 IBRS_ENTER
118 UNTRAIN_RET
119
120 call do_syscall_64 /* returns with IRQs disabled */
121
122 /*
123 * Try to use SYSRET instead of IRET if we're returning to
124 * a completely clean 64-bit userspace context. If we're not,
# this leak backtrace has 1 occurrences
[<00000000f63d45f6>] __kmalloc+0x49/0x150
[<000000001a756730>] __register_sysctl_table+0x569/0x7f0
[<000000002fbfef58>] 0xffffffffc04fca78
[<000000001dcc6780>] 0xffffffffc038601b
[<00000000d82d83ab>] do_one_initcall+0x43/0x210
[<000000002ef9c020>] do_init_module+0x60/0x240
[<00000000859f64f2>] __do_sys_finit_module+0x93/0xf0
[<000000006b72a46f>] do_syscall_64+0x34/0x80
[<000000009dda0f8e>] entry_SYSCALL_64_after_hwframe+0x46/0xb0
doing: gdb -q -ex 'l *(__kmalloc+0x49)' -ex quit wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
0xffffffff81386319 is in __kmalloc
(/home/jimc/projects/lx/wk-next/mm/slab_common.c:966).
961 s = kmalloc_slab(size, flags);
962
963 if (unlikely(ZERO_OR_NULL_PTR(s)))
964 return s;
965
966 ret = __kmem_cache_alloc_node(s, flags, node, size, caller);
967 ret = kasan_kmalloc(s, ret, size, flags);
968 trace_kmalloc(caller, ret, size, s->size, flags, node);
969 return ret;
970 }
doing: gdb -q -ex 'l *(__register_sysctl_table+0x569)' -ex quit
wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
0xffffffff814e5ea9 is in __register_sysctl_table
(/home/jimc/projects/lx/wk-next/fs/proc/proc_sysctl.c:974).
969 char *new_name;
970
971 new = kzalloc(sizeof(*new) + sizeof(struct ctl_node) +
972 sizeof(struct ctl_table)*2 + namelen + 1,
973 GFP_KERNEL);
974 if (!new)
975 return NULL;
976
977 node = (struct ctl_node *)(new + 1);
978 table = (struct ctl_table *)(node + 1);
doing: gdb -q -ex 'l *(0xffffffffc04fca78)' -ex quit wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
doing: gdb -q -ex 'l *(0xffffffffc038601b)' -ex quit wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
doing: gdb -q -ex 'l *(do_one_initcall+0x43)' -ex quit
wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
0xffffffff81001093 is in do_one_initcall
(/home/jimc/projects/lx/wk-next/init/main.c:1306).
1301
1302 if (initcall_blacklisted(fn))
1303 return -EPERM;
1304
1305 do_trace_initcall_start(fn);
1306 ret = fn();
1307 do_trace_initcall_finish(fn, ret);
1308
1309 msgbuf[0] = 0;
1310
doing: gdb -q -ex 'l *(do_init_module+0x60)' -ex quit wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
0xffffffff812287c0 is in do_init_module
(/home/jimc/projects/lx/wk-next/kernel/module/main.c:2481).
2476 freeinit->init_rodata = mod->mem[MOD_INIT_RODATA].base;
2477
2478 do_mod_ctors(mod);
2479 /* Start the module */
2480 if (mod->init != NULL)
2481 ret = do_one_initcall(mod->init);
2482 if (ret < 0) {
2483 goto fail_free_freeinit;
2484 }
2485 if (ret > 0) {
doing: gdb -q -ex 'l *(__do_sys_finit_module+0x93)' -ex quit
wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
0xffffffff8122af73 is in __do_sys_finit_module
(/home/jimc/projects/lx/wk-next/kernel/module/main.c:2987).
2982 } else {
2983 info.hdr = buf;
2984 info.len = len;
2985 }
2986
2987 return load_module(&info, uargs, flags);
2988 }
2989
2990 static inline int within(unsigned long addr, void *start,
unsigned long size)
2991 {
doing: gdb -q -ex 'l *(do_syscall_64+0x34)' -ex quit wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
0xffffffff81ddad64 is in do_syscall_64
(/home/jimc/projects/lx/wk-next/arch/x86/entry/common.c:50).
45 */
46 unsigned int unr = nr;
47
48 if (likely(unr < NR_syscalls)) {
49 unr = array_index_nospec(unr, NR_syscalls);
50 regs->ax = sys_call_table[unr](regs);
51 return true;
52 }
53 return false;
54 }
doing: gdb -q -ex 'l *(entry_SYSCALL_64_after_hwframe+0x46)' -ex quit
wk-next/builds/f2/vmlinux
Reading symbols from wk-next/builds/f2/vmlinux...
0xffffffff81e0006a is at
/home/jimc/projects/lx/wk-next/arch/x86/entry/entry_64.S:120.
115
116 /* clobbers %rax, make sure it is after saving the syscall nr */
117 IBRS_ENTER
118 UNTRAIN_RET
119
120 call do_syscall_64 /* returns with IRQs disabled */
121
122 /*
123 * Try to use SYSRET instead of IRET if we're returning to
124 * a completely clean 64-bit userspace context. If we're not,
mods: bless( {
'backtraces' => {},
'hexdumps' => {},
'users' => {}
}, 'LeakSet' )
:#>

2023-04-06 20:10:24

by Luis Chamberlain

[permalink] [raw]
Subject: Re: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

On Wed, Apr 05, 2023 at 09:14:12PM -0600, [email protected] wrote:
> On Tue, Apr 4, 2023 at 8:01 PM Luis Chamberlain <[email protected]> wrote:
> >
> > On Tue, Apr 04, 2023 at 07:38:41PM -0600, [email protected] wrote:
> > > On Mon, Apr 3, 2023 at 2:44 PM Luis Chamberlain <[email protected]> wrote:

<-- my old patch -->

> this disappearing act is still going on.
> my script issues no echo clear > kmemleak

So this email is ab it confusing. Because you comment here before the
new patch.

<-- my new switch statement kmemleak patch to fix the reported leak -->

> sorry for the delay, I was seeing heisen-responses, and several BUGs.
> a make clean seems to have settled things mostly.

And now here you comment after thew new suggested patch and say it
seemst to mostly fixed things.

> But in case theres any clues in there,

In where?

> Ive kept the paste-in of 2 BUGs
>
> with
> f23cd1ffca4b (HEAD) kmemleaks on ac3b43283923 ("module: replace
> module_layout with module_memory")
> ac3b43283923 module: replace module_layout with module_memory

The only commit here that makes sense to me is

ac3b43283923 ("module: replace module_layout with module_memory"

Commit f23cd1ffca4b means absolutely nothing to me. I can only guess
you mean that you've applied my suggested changes with the new switch
statement?

> heres the 1st run. cuz it leaked, I reran in another vm, which got
> different results.
> I left it overnight doing nothing (laptop slept, vm with it),
> and it BUG'd on a soft lockup
> (much later, but the leaktrace does have a timerfd in it)
> R11 looks poisoned.

<-- some unrelated leak I think -->

> using sh-script posted previously,

I don't recall what that sh-script was.

<-- snip some leaks -->

> Im not sure when I did the make clean, maybe here.
> it'd be a 'clean' explanation of the BUG struff.
> I havent seen any today

OK great.

<-- snip some long traces-->

I don't get these long traces if you didn't see any then.

Luis

2023-04-11 15:12:00

by Catalin Marinas

[permalink] [raw]
Subject: Re: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

On Mon, Apr 03, 2023 at 01:43:58PM -0700, Luis Chamberlain wrote:
> On Fri, Mar 31, 2023 at 05:27:04PM -0700, Song Liu wrote:
> > On Fri, Mar 31, 2023 at 12:00 AM Luis Chamberlain <[email protected]> wrote:
> > > On Thu, Mar 30, 2023 at 04:45:43PM -0600, [email protected] wrote:
> > > > kmemleak is reporting 19 leaks during boot
> > > >
> > > > because the hexdumps appeared to have module-names,
> > > > and Ive been hacking nearby, and see the same names
> > > > every time I boot my test-vm, I needed a clearer picture
> > > > Jason corroborated and bisected.
> > > >
> > > > the 19 leaks split into 2 groups,
> > > > 9 with names of builtin modules in the hexdump,
> > > > all with the same backtrace
> > > > 9 without module-names (with a shared backtrace)
> > > > +1 wo name-ish and a separate backtrace
> > >
> > > Song, please take a look.
> >
> > I will look into this next week.
>
> I'm thinking this may be it, at least this gets us to what we used to do
> as per original Catalinas' 4f2294b6dc88d ("kmemleak: Add modules
> support") and right before Song's patch.
>
> diff --git a/kernel/module/main.c b/kernel/module/main.c
> index 6b6da80f363f..3b9c71fa6096 100644
> --- a/kernel/module/main.c
> +++ b/kernel/module/main.c
> @@ -2240,7 +2240,10 @@ static int move_module(struct module *mod, struct load_info *info)
> * which is inside the block. Just mark it as not being a
> * leak.
> */
> - kmemleak_ignore(ptr);
> + if (type == MOD_INIT_TEXT)
> + kmemleak_ignore(ptr);
> + else
> + kmemleak_not_leak(ptr);
> if (!ptr) {
> t = type;
> goto out_enomem;
>
> We used to use the grey area for the TEXT but the original commit
> doesn't explain too well why we grey out init but not the others. Ie
> why kmemleak_ignore() on init and kmemleak_not_leak() on the others.

It's safe to use the 'grey' colour in all cases. For text sections that
don't need scanning, there's a slight chance of increasing the false
negatives, so marking it 'black' ignores the scanning. For the init
section, if it gets discarded anyway, just going with
kmemleak_not_leak() is fine. It simplifies the logic above.

--
Catalin

2023-04-11 17:17:05

by Luis Chamberlain

[permalink] [raw]
Subject: Re: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

On Tue, Apr 11, 2023 at 04:10:24PM +0100, Catalin Marinas wrote:
> On Mon, Apr 03, 2023 at 01:43:58PM -0700, Luis Chamberlain wrote:
> > On Fri, Mar 31, 2023 at 05:27:04PM -0700, Song Liu wrote:
> > > On Fri, Mar 31, 2023 at 12:00 AM Luis Chamberlain <[email protected]> wrote:
> > > > On Thu, Mar 30, 2023 at 04:45:43PM -0600, [email protected] wrote:
> > > > > kmemleak is reporting 19 leaks during boot
> > > > >
> > > > > because the hexdumps appeared to have module-names,
> > > > > and Ive been hacking nearby, and see the same names
> > > > > every time I boot my test-vm, I needed a clearer picture
> > > > > Jason corroborated and bisected.
> > > > >
> > > > > the 19 leaks split into 2 groups,
> > > > > 9 with names of builtin modules in the hexdump,
> > > > > all with the same backtrace
> > > > > 9 without module-names (with a shared backtrace)
> > > > > +1 wo name-ish and a separate backtrace
> > > >
> > > > Song, please take a look.
> > >
> > > I will look into this next week.
> >
> > I'm thinking this may be it, at least this gets us to what we used to do
> > as per original Catalinas' 4f2294b6dc88d ("kmemleak: Add modules
> > support") and right before Song's patch.
> >
> > diff --git a/kernel/module/main.c b/kernel/module/main.c
> > index 6b6da80f363f..3b9c71fa6096 100644
> > --- a/kernel/module/main.c
> > +++ b/kernel/module/main.c
> > @@ -2240,7 +2240,10 @@ static int move_module(struct module *mod, struct load_info *info)
> > * which is inside the block. Just mark it as not being a
> > * leak.
> > */
> > - kmemleak_ignore(ptr);
> > + if (type == MOD_INIT_TEXT)
> > + kmemleak_ignore(ptr);
> > + else
> > + kmemleak_not_leak(ptr);
> > if (!ptr) {
> > t = type;
> > goto out_enomem;
> >
> > We used to use the grey area for the TEXT but the original commit
> > doesn't explain too well why we grey out init but not the others. Ie
> > why kmemleak_ignore() on init and kmemleak_not_leak() on the others.
>
> It's safe to use the 'grey' colour in all cases. For text sections that
> don't need scanning, there's a slight chance of increasing the false
> negatives,

It turns out that there are *tons* of false positives today, unless
these are real leaks.

> so marking it 'black' ignores the scanning. For the init
> section, if it gets discarded anyway, just going with
> kmemleak_not_leak() is fine. It simplifies the logic above.

Luis

2023-04-11 23:02:48

by Luis Chamberlain

[permalink] [raw]
Subject: Re: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

On Tue, Apr 11, 2023 at 10:07:53AM -0700, Luis Chamberlain wrote:
> On Tue, Apr 11, 2023 at 04:10:24PM +0100, Catalin Marinas wrote:
> > On Mon, Apr 03, 2023 at 01:43:58PM -0700, Luis Chamberlain wrote:
> > > On Fri, Mar 31, 2023 at 05:27:04PM -0700, Song Liu wrote:
> > > > On Fri, Mar 31, 2023 at 12:00 AM Luis Chamberlain <[email protected]> wrote:
> > > > > On Thu, Mar 30, 2023 at 04:45:43PM -0600, [email protected] wrote:
> > > > > > kmemleak is reporting 19 leaks during boot
> > > > > >
> > > > > > because the hexdumps appeared to have module-names,
> > > > > > and Ive been hacking nearby, and see the same names
> > > > > > every time I boot my test-vm, I needed a clearer picture
> > > > > > Jason corroborated and bisected.
> > > > > >
> > > > > > the 19 leaks split into 2 groups,
> > > > > > 9 with names of builtin modules in the hexdump,
> > > > > > all with the same backtrace
> > > > > > 9 without module-names (with a shared backtrace)
> > > > > > +1 wo name-ish and a separate backtrace
> > > > >
> > > > > Song, please take a look.
> > > >
> > > > I will look into this next week.
> > >
> > > I'm thinking this may be it, at least this gets us to what we used to do
> > > as per original Catalinas' 4f2294b6dc88d ("kmemleak: Add modules
> > > support") and right before Song's patch.
> > >
> > > diff --git a/kernel/module/main.c b/kernel/module/main.c
> > > index 6b6da80f363f..3b9c71fa6096 100644
> > > --- a/kernel/module/main.c
> > > +++ b/kernel/module/main.c
> > > @@ -2240,7 +2240,10 @@ static int move_module(struct module *mod, struct load_info *info)
> > > * which is inside the block. Just mark it as not being a
> > > * leak.
> > > */
> > > - kmemleak_ignore(ptr);
> > > + if (type == MOD_INIT_TEXT)
> > > + kmemleak_ignore(ptr);
> > > + else
> > > + kmemleak_not_leak(ptr);
> > > if (!ptr) {
> > > t = type;
> > > goto out_enomem;
> > >
> > > We used to use the grey area for the TEXT but the original commit
> > > doesn't explain too well why we grey out init but not the others. Ie
> > > why kmemleak_ignore() on init and kmemleak_not_leak() on the others.
> >
> > It's safe to use the 'grey' colour in all cases. For text sections that
> > don't need scanning, there's a slight chance of increasing the false
> > negatives,
>
> It turns out that there are *tons* of false positives today, unless
> these are real leaks.

I should clarify: *if* we leave things as-is, we seem to get tons of
false positives.

Luis

2023-04-12 10:04:47

by Catalin Marinas

[permalink] [raw]
Subject: Re: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

On Tue, Apr 11, 2023 at 04:00:59PM -0700, Luis Chamberlain wrote:
> On Tue, Apr 11, 2023 at 10:07:53AM -0700, Luis Chamberlain wrote:
> > On Tue, Apr 11, 2023 at 04:10:24PM +0100, Catalin Marinas wrote:
> > > On Mon, Apr 03, 2023 at 01:43:58PM -0700, Luis Chamberlain wrote:
> > > > On Fri, Mar 31, 2023 at 05:27:04PM -0700, Song Liu wrote:
> > > > > On Fri, Mar 31, 2023 at 12:00 AM Luis Chamberlain <[email protected]> wrote:
> > > > > > On Thu, Mar 30, 2023 at 04:45:43PM -0600, [email protected] wrote:
> > > > > > > kmemleak is reporting 19 leaks during boot
> > > > > > >
> > > > > > > because the hexdumps appeared to have module-names,
> > > > > > > and Ive been hacking nearby, and see the same names
> > > > > > > every time I boot my test-vm, I needed a clearer picture
> > > > > > > Jason corroborated and bisected.
> > > > > > >
> > > > > > > the 19 leaks split into 2 groups,
> > > > > > > 9 with names of builtin modules in the hexdump,
> > > > > > > all with the same backtrace
> > > > > > > 9 without module-names (with a shared backtrace)
> > > > > > > +1 wo name-ish and a separate backtrace
> > > > > >
> > > > > > Song, please take a look.
> > > > >
> > > > > I will look into this next week.
> > > >
> > > > I'm thinking this may be it, at least this gets us to what we used to do
> > > > as per original Catalinas' 4f2294b6dc88d ("kmemleak: Add modules
> > > > support") and right before Song's patch.
> > > >
> > > > diff --git a/kernel/module/main.c b/kernel/module/main.c
> > > > index 6b6da80f363f..3b9c71fa6096 100644
> > > > --- a/kernel/module/main.c
> > > > +++ b/kernel/module/main.c
> > > > @@ -2240,7 +2240,10 @@ static int move_module(struct module *mod, struct load_info *info)
> > > > * which is inside the block. Just mark it as not being a
> > > > * leak.
> > > > */
> > > > - kmemleak_ignore(ptr);
> > > > + if (type == MOD_INIT_TEXT)
> > > > + kmemleak_ignore(ptr);
> > > > + else
> > > > + kmemleak_not_leak(ptr);
> > > > if (!ptr) {
> > > > t = type;
> > > > goto out_enomem;
> > > >
> > > > We used to use the grey area for the TEXT but the original commit
> > > > doesn't explain too well why we grey out init but not the others. Ie
> > > > why kmemleak_ignore() on init and kmemleak_not_leak() on the others.
> > >
> > > It's safe to use the 'grey' colour in all cases. For text sections that
> > > don't need scanning, there's a slight chance of increasing the false
> > > negatives,
> >
> > It turns out that there are *tons* of false positives today, unless
> > these are real leaks.
>
> I should clarify: *if* we leave things as-is, we seem to get tons of
> false positives.

Which makes sense if kmemleak_ignore() is used, such objects would not
be scanned. I'd just replace it with kmemleak_not_leak() irrespective of
the type.

--
Catalin

2023-04-12 17:35:42

by Luis Chamberlain

[permalink] [raw]
Subject: Re: kmemleaks on ac3b43283923 ("module: replace module_layout with module_memory")

On Wed, Apr 12, 2023 at 10:53:36AM +0100, Catalin Marinas wrote:
> On Tue, Apr 11, 2023 at 04:00:59PM -0700, Luis Chamberlain wrote:
> > On Tue, Apr 11, 2023 at 10:07:53AM -0700, Luis Chamberlain wrote:
> > > On Tue, Apr 11, 2023 at 04:10:24PM +0100, Catalin Marinas wrote:
> > > > On Mon, Apr 03, 2023 at 01:43:58PM -0700, Luis Chamberlain wrote:
> > > > > On Fri, Mar 31, 2023 at 05:27:04PM -0700, Song Liu wrote:
> > > > > > On Fri, Mar 31, 2023 at 12:00 AM Luis Chamberlain <[email protected]> wrote:
> > > > > > > On Thu, Mar 30, 2023 at 04:45:43PM -0600, [email protected] wrote:
> > > > > > > > kmemleak is reporting 19 leaks during boot
> > > > > > > >
> > > > > > > > because the hexdumps appeared to have module-names,
> > > > > > > > and Ive been hacking nearby, and see the same names
> > > > > > > > every time I boot my test-vm, I needed a clearer picture
> > > > > > > > Jason corroborated and bisected.
> > > > > > > >
> > > > > > > > the 19 leaks split into 2 groups,
> > > > > > > > 9 with names of builtin modules in the hexdump,
> > > > > > > > all with the same backtrace
> > > > > > > > 9 without module-names (with a shared backtrace)
> > > > > > > > +1 wo name-ish and a separate backtrace
> > > > > > >
> > > > > > > Song, please take a look.
> > > > > >
> > > > > > I will look into this next week.
> > > > >
> > > > > I'm thinking this may be it, at least this gets us to what we used to do
> > > > > as per original Catalinas' 4f2294b6dc88d ("kmemleak: Add modules
> > > > > support") and right before Song's patch.
> > > > >
> > > > > diff --git a/kernel/module/main.c b/kernel/module/main.c
> > > > > index 6b6da80f363f..3b9c71fa6096 100644
> > > > > --- a/kernel/module/main.c
> > > > > +++ b/kernel/module/main.c
> > > > > @@ -2240,7 +2240,10 @@ static int move_module(struct module *mod, struct load_info *info)
> > > > > * which is inside the block. Just mark it as not being a
> > > > > * leak.
> > > > > */
> > > > > - kmemleak_ignore(ptr);
> > > > > + if (type == MOD_INIT_TEXT)
> > > > > + kmemleak_ignore(ptr);
> > > > > + else
> > > > > + kmemleak_not_leak(ptr);
> > > > > if (!ptr) {
> > > > > t = type;
> > > > > goto out_enomem;
> > > > >
> > > > > We used to use the grey area for the TEXT but the original commit
> > > > > doesn't explain too well why we grey out init but not the others. Ie
> > > > > why kmemleak_ignore() on init and kmemleak_not_leak() on the others.
> > > >
> > > > It's safe to use the 'grey' colour in all cases. For text sections that
> > > > don't need scanning, there's a slight chance of increasing the false
> > > > negatives,
> > >
> > > It turns out that there are *tons* of false positives today, unless
> > > these are real leaks.
> >
> > I should clarify: *if* we leave things as-is, we seem to get tons of
> > false positives.
>
> Which makes sense if kmemleak_ignore() is used, such objects would not
> be scanned. I'd just replace it with kmemleak_not_leak() irrespective of
> the type.

OK I'll do that and add a Suggested-by you :)

Luis