LinuxLists.cc - [linus:master] [mm, slub] 0af8489b02: kernel_BUG_at

2022-12-31 15:34:42

Subject: [linus:master] [mm, slub] 0af8489b02: kernel_BUG_at_include/linux/mm.h

Greeting,

FYI, we noticed kernel_BUG_at_include/linux/mm.h due to commit (built with gcc-11):

commit: 0af8489b0216fa1dd83e264bef8063f2632633d7 ("mm, slub: remove percpu slabs with CONFIG_SLUB_TINY")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

[test failed on linux-next/master c76083fac3bae1a87ae3d005b5cb1cbc761e31d5]

in testcase: rcutorture
version:
with following parameters:

runtime: 300s
test: default
torture_type: tasks-tracing

test-description: rcutorture is rcutorture kernel module load/unload test.
test-url: https://www.kernel.org/doc/Documentation/RCU/torture.txt

on test machine: qemu-system-x86_64 -enable-kvm -cpu SandyBridge -smp 2 -m 16G

caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):

If you fix the issue, kindly add following tag
| Reported-by: kernel test robot <[email protected]>
| Link: https://lore.kernel.org/oe-lkp/[email protected]

[ 25.804432][ T214] ------------[ cut here ]------------
[ 25.804917][ T214] kernel BUG at include/linux/mm.h:825!
[ 25.805402][ T214] invalid opcode: 0000 [#1] SMP
[ 25.805820][ T214] CPU: 0 PID: 214 Comm: udevadm Tainted: G S 6.1.0-rc2-00014-g0af8489b0216 #2 1c4d7707ec0ce574ed62a77e82a8580202758048
[ 25.806944][ T214] EIP: __dump_page.cold (include/linux/mm.h:825 mm/debug.c:97)
[ 25.807376][ T214] Code: ff ff 83 05 e8 5d bb c5 01 ba 4c c4 2f c4 89 f8 83 15 ec 5d bb c5 00 e8 f2 92 ed fd 83 05 f8 5d bb c5 01 83 15 fc 5d bb c5 00 <0f> 0b 83 05 00 5e bb c5 01 b8 ac 85 a3 c4 83 15 04 5e bb c5 00 e8
All code
========
0: ff (bad)
1: ff 83 05 e8 5d bb incl -0x44a217fb(%rbx)
7: c5 01 ba (bad)
a: 4c c4 rex.WR (bad)
c: 2f (bad)
d: c4 (bad)
e: 89 f8 mov %edi,%eax
10: 83 15 ec 5d bb c5 00 adcl $0x0,-0x3a44a214(%rip) # 0xffffffffc5bb5e03
17: e8 f2 92 ed fd callq 0xfffffffffded930e
1c: 83 05 f8 5d bb c5 01 addl $0x1,-0x3a44a208(%rip) # 0xffffffffc5bb5e1b
23: 83 15 fc 5d bb c5 00 adcl $0x0,-0x3a44a204(%rip) # 0xffffffffc5bb5e26
2a:* 0f 0b ud2 <-- trapping instruction
2c: 83 05 00 5e bb c5 01 addl $0x1,-0x3a44a200(%rip) # 0xffffffffc5bb5e33
33: b8 ac 85 a3 c4 mov $0xc4a385ac,%eax
38: 83 15 04 5e bb c5 00 adcl $0x0,-0x3a44a1fc(%rip) # 0xffffffffc5bb5e43
3f: e8 .byte 0xe8

Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: 83 05 00 5e bb c5 01 addl $0x1,-0x3a44a200(%rip) # 0xffffffffc5bb5e09
9: b8 ac 85 a3 c4 mov $0xc4a385ac,%eax
e: 83 15 04 5e bb c5 00 adcl $0x0,-0x3a44a1fc(%rip) # 0xffffffffc5bb5e19
15: e8 .byte 0xe8
[ 25.808960][ T214] EAX: 00000000 EBX: e764d530 ECX: 00000003 EDX: 4108888f
[ 25.809578][ T214] ESI: e764d4e0 EDI: e764d4e0 EBP: ed89db3c ESP: ed89db00
[ 25.810168][ T214] DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 EFLAGS: 00210046
[ 25.810803][ T214] CR0: 80050033 CR2: 00616abc CR3: 2d878000 CR4: 000406d0
[ 25.811407][ T214] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
[ 25.811999][ T214] DR6: fffe0ff0 DR7: 00000400
[ 25.812390][ T214] Call Trace:
[ 25.812675][ T214] dump_page (mm/debug.c:131)
[ 25.813025][ T214] ? _raw_spin_lock_irqsave (kernel/locking/spinlock.c:162)
[ 25.813492][ T214] folio_flags+0x23/0x70
[ 25.813945][ T214] get_partial_node (include/linux/page-flags.h:483 mm/slab.h:140 mm/slub.c:2967 mm/slub.c:2225)
[ 25.814357][ T214] __slab_alloc_node+0xbb/0x270
[ 25.814860][ T214] kmem_cache_alloc_lru (mm/slub.c:3404 mm/slub.c:3418 mm/slub.c:3425 mm/slub.c:3441)
[ 25.815289][ T214] ? __lock_release (kernel/locking/lockdep.c:355 kernel/locking/lockdep.c:5350)
[ 25.815697][ T214] ? iget_locked (fs/inode.c:1275)
[ 25.816096][ T214] alloc_inode (include/linux/fs.h:3117 fs/inode.c:261)
[ 25.816469][ T214] iget_locked (fs/inode.c:1286)
[ 25.816829][ T214] ? lock_is_held_type (kernel/locking/lockdep.c:5409 kernel/locking/lockdep.c:5711)
[ 25.817264][ T214] kernfs_get_inode (fs/kernfs/inode.c:255)
[ 25.817670][ T214] kernfs_iop_lookup (fs/kernfs/dir.c:1154)
[ 25.818087][ T214] __lookup_slow (fs/namei.c:1685)
[ 25.818479][ T214] lookup_slow (fs/namei.c:1702)
[ 25.818847][ T214] walk_component (fs/namei.c:1993)
[ 25.819244][ T214] path_lookupat (fs/namei.c:2450 fs/namei.c:2474)
[ 25.819627][ T214] path_openat (fs/namei.c:3684 fs/namei.c:3706)
[ 25.820007][ T214] do_filp_open (fs/namei.c:3740)
[ 25.820409][ T214] do_sys_openat2 (fs/open.c:1311)
[ 25.820807][ T214] do_sys_open (fs/open.c:1326)
[ 25.821211][ T214] __ia32_sys_openat (fs/open.c:1337)
[ 25.821622][ T214] __do_fast_syscall_32 (arch/x86/entry/common.c:112 arch/x86/entry/common.c:178)
[ 25.822057][ T214] ? trace_hardirqs_on (kernel/trace/trace_preemptirq.c:50 (discriminator 19))
[ 25.822480][ T214] ? __fput (fs/file_table.c:59 fs/file_table.c:333)
[ 25.822842][ T214] ? lockdep_hardirqs_on_prepare (kernel/locking/lockdep.c:4262 kernel/locking/lockdep.c:4321)
[ 25.823346][ T214] ? syscall_exit_to_user_mode (kernel/entry/common.c:299)
[ 25.823823][ T214] ? __do_fast_syscall_32 (arch/x86/entry/common.c:183)
[ 25.824259][ T214] ? lockdep_hardirqs_on_prepare (kernel/locking/lockdep.c:4262 kernel/locking/lockdep.c:4321)
[ 25.824767][ T214] ? syscall_exit_to_user_mode (kernel/entry/common.c:299)
[ 25.825254][ T214] ? __do_fast_syscall_32 (arch/x86/entry/common.c:183)
[ 25.825696][ T214] ? __do_fast_syscall_32 (arch/x86/entry/common.c:183)
[ 25.826155][ T214] ? syscall_exit_to_user_mode (kernel/entry/common.c:299)
[ 25.826627][ T214] ? __do_fast_syscall_32 (arch/x86/entry/common.c:183)
[ 25.827056][ T214] ? __do_fast_syscall_32 (arch/x86/entry/common.c:183)
[ 25.827486][ T214] ? __do_fast_syscall_32 (arch/x86/entry/common.c:183)
[ 25.827929][ T214] ? irqentry_exit_to_user_mode (kernel/entry/common.c:312)
[ 25.828423][ T214] ? irqentry_exit (kernel/entry/common.c:445)
[ 25.828812][ T214] do_fast_syscall_32 (arch/x86/entry/common.c:203)
[ 25.829223][ T214] do_SYSENTER_32 (arch/x86/entry/common.c:247)
[ 25.829589][ T214] entry_SYSENTER_32 (arch/x86/entry/entry_32.S:867)
[ 25.830003][ T214] EIP: 0xb7f8c549
[ 25.830330][ T214] Code: 03 74 c0 01 10 05 03 74 b8 01 10 06 03 74 b4 01 10 07 03 74 b0 01 10 08 03 74 d8 01 00 00 00 00 00 51 52 55 89 e5 0f 34 cd 80 <5d> 5a 59 c3 90 90 90 90 8d 76 00 58 b8 77 00 00 00 cd 80 90 8d 76
All code
========
0: 03 74 c0 01 add 0x1(%rax,%rax,8),%esi
4: 10 05 03 74 b8 01 adc %al,0x1b87403(%rip) # 0x1b8740d
a: 10 06 adc %al,(%rsi)
c: 03 74 b4 01 add 0x1(%rsp,%rsi,4),%esi
10: 10 07 adc %al,(%rdi)
12: 03 74 b0 01 add 0x1(%rax,%rsi,4),%esi
16: 10 08 adc %cl,(%rax)
18: 03 74 d8 01 add 0x1(%rax,%rbx,8),%esi
1c: 00 00 add %al,(%rax)
1e: 00 00 add %al,(%rax)
20: 00 51 52 add %dl,0x52(%rcx)
23: 55 push %rbp
24: 89 e5 mov %esp,%ebp
26: 0f 34 sysenter
28: cd 80 int $0x80
2a:* 5d pop %rbp <-- trapping instruction
2b: 5a pop %rdx
2c: 59 pop %rcx
2d: c3 retq
2e: 90 nop
2f: 90 nop
30: 90 nop
31: 90 nop
32: 8d 76 00 lea 0x0(%rsi),%esi
35: 58 pop %rax
36: b8 77 00 00 00 mov $0x77,%eax
3b: cd 80 int $0x80
3d: 90 nop
3e: 8d .byte 0x8d
3f: 76 .byte 0x76

Code starting with the faulting instruction
===========================================
0: 5d pop %rbp
1: 5a pop %rdx
2: 59 pop %rcx
3: c3 retq
4: 90 nop
5: 90 nop
6: 90 nop
7: 90 nop
8: 8d 76 00 lea 0x0(%rsi),%esi
b: 58 pop %rax
c: b8 77 00 00 00 mov $0x77,%eax
11: cd 80 int $0x80
13: 90 nop
14: 8d .byte 0x8d
15: 76 .byte 0x76

To reproduce:

# build kernel
cd linux
cp config-6.1.0-rc2-00014-g0af8489b0216 .config
make HOSTCC=gcc-11 CC=gcc-11 ARCH=i386 olddefconfig prepare modules_prepare bzImage modules
make HOSTCC=gcc-11 CC=gcc-11 ARCH=i386 INSTALL_MOD_PATH=<mod-install-dir> modules_install
cd <mod-install-dir>
find lib/ | cpio -o -H newc --quiet | gzip > modules.cgz

git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp qemu -k <bzImage> -m modules.cgz job-script # job-script is attached in this email

# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.

--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests

Attachments:

(No filename) (9.38 kB)
config-6.1.0-rc2-00014-g0af8489b0216 (149.80 kB)
job-script (5.63 kB)
dmesg.xz (55.53 kB)
rcutorture (272.20 kB)
Download all attachments

2023-01-01 06:16:43

by Hyeonggon Yoo

[permalink] [raw]

Subject: Re: [linus:master] [mm, slub] 0af8489b02: kernel_BUG_at_include/linux/mm.h

On Sat, Dec 31, 2022 at 11:26:25PM +0800, kernel test robot wrote:
>
> Greeting,
>
> FYI, we noticed kernel_BUG_at_include/linux/mm.h due to commit (built with gcc-11):
>
> commit: 0af8489b0216fa1dd83e264bef8063f2632633d7 ("mm, slub: remove percpu slabs with CONFIG_SLUB_TINY")
> https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
>
> [test failed on linux-next/master c76083fac3bae1a87ae3d005b5cb1cbc761e31d5]
>
> in testcase: rcutorture
> version:
> with following parameters:
>
> runtime: 300s
> test: default
> torture_type: tasks-tracing
>
> test-description: rcutorture is rcutorture kernel module load/unload test.
> test-url: https://www.kernel.org/doc/Documentation/RCU/torture.txt
>
>
> on test machine: qemu-system-x86_64 -enable-kvm -cpu SandyBridge -smp 2 -m 16G
>
> caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):
>
>
> If you fix the issue, kindly add following tag
> | Reported-by: kernel test robot <[email protected]>
> | Link: https://lore.kernel.org/oe-lkp/[email protected]
>
>

<snip>

> Failed to start Update UTMP about System Boot/Shutdown.
> See 'systemctl status systemd-update-utmp.service' for details.
> page:e660911a refcount:0 mapcount:0 mapping:00000000 index:0xedaeef00 pfn:0x2daee
> page:0946d53a refcount:0 mapcount:0 mapping:00000000 index:0x0 pfn:0x2daec
> flags: 0x0(zone=0)
> raw: 00000000 e764d494 e6f205b4 00000000 00000000 00020000 ffffffff 00000000
> raw: 00000000 00000000
> page dumped because: VM_BUG_ON_FOLIO(!folio_test_large(folio))
> page_owner tracks the page as freed

the page is freed state.

> page last allocated via order 1, migratetype Unmovable, gfp_mask 0xd20c0(__GFP_IO|__GFP_FS|__GFP_NOWARN|__GFP_NORETRY|__GFP_COMP|__GFP_NOMEMALLOC), pid 208, tgid 208 (systemd-udevd), ts 25780391126, free_ts 25780421356
> post_alloc_hook+0x1fa/0x280
> get_page_from_freelist+0x226/0x310
> __alloc_pages+0xdd/0x360
> alloc_slab_page+0x12d/0x200
> allocate_slab+0x6a/0x350
> new_slab+0x48/0xc0
> __slab_alloc_node+0xfb/0x270
> kmem_cache_alloc+0x8f/0x4e0
> getname_flags+0x33/0x2f0
> getname+0x1a/0x30
> do_sys_openat2+0xa5/0x1f0
> do_sys_open+0x8e/0xe0
> __ia32_sys_openat+0x2b/0x40
> __do_fast_syscall_32+0x72/0xd0
> do_fast_syscall_32+0x32/0x70
> do_SYSENTER_32+0x15/0x20

allocated by slab

> page last free stack trace:
> free_pcp_prepare+0x34f/0x940
> free_unref_page_prepare+0x29/0x210
> free_unref_page+0x3a/0x3b0
> __free_pages+0x187/0x1f0
> __free_slab+0x1fd/0x350
> free_slab+0x22/0x70
> free_to_partial_list+0x125/0x260
> do_slab_free+0x30/0x70
> kmem_cache_free+0x171/0x1e0
> putname+0x9f/0xf0
> do_sys_openat2+0xe2/0x1f0
> do_sys_open+0x8e/0xe0
> __ia32_sys_openat+0x2b/0x40
> __do_fast_syscall_32+0x72/0xd0
> do_fast_syscall_32+0x32/0x70
> do_SYSENTER_32+0x15/0x20

freed by slab

> ------------[ cut here ]------------
> kernel BUG at include/linux/mm.h:825!
> invalid opcode: 0000 [#1] SMP
> CPU: 0 PID: 214 Comm: udevadm Tainted: G S 6.1.0-rc2-00014-g0af8489b0216 #2 1c4d7707ec0ce574ed62a77e82a8580202758048
> EIP: __dump_page.cold+0x282/0x369
> Code: ff ff 83 05 e8 5d bb c5 01 ba 4c c4 2f c4 89 f8 83 15 ec 5d bb c5 00 e8 f2 92 ed fd 83 05 f8 5d bb c5 01 83 15 fc 5d bb c5 00 <0f> 0b 83 05 00 5e bb c5 01 b8 ac 85 a3 c4 83 15 04 5e bb c5 00 e8
> EAX: 00000000 EBX: e764d530 ECX: 00000003 EDX: 4108888f
> ESI: e764d4e0 EDI: e764d4e0 EBP: ed89db3c ESP: ed89db00
> DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 EFLAGS: 00210046
> CR0: 80050033 CR2: 00616abc CR3: 2d878000 CR4: 000406d0
> DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
> DR6: fffe0ff0 DR7: 00000400
> Call Trace:
> dump_page+0x2a/0xc0
> ? _raw_spin_lock_irqsave+0x16/0x30
> folio_flags+0x23/0x70
> get_partial_node+0x89/0x290

a page freed by slab is in the partial list?
Sounds like use-after-free from SLUB_TINY but not sure yet how
that could happen :/

> __slab_alloc_node+0xbb/0x270
> kmem_cache_alloc_lru+0x8d/0x4e0
> ? __lock_release+0x3ec/0x410
> ? iget_locked+0x78/0x310
> alloc_inode+0x93/0x150
> iget_locked+0xdd/0x310
> ? lock_is_held_type+0x80/0xf0
> kernfs_get_inode+0x24/0xb0
> kernfs_iop_lookup+0xb5/0x1a0
> __lookup_slow+0xd9/0x2a0
> lookup_slow+0x50/0x90
> walk_component+0x19c/0x2c0
> path_lookupat+0xa3/0x270
> path_openat+0x307/0x3e0
> do_filp_open+0x7c/0x130
> do_sys_openat2+0x113/0x1f0
> do_sys_open+0x8e/0xe0
> __ia32_sys_openat+0x2b/0x40
> __do_fast_syscall_32+0x72/0xd0
> ? trace_hardirqs_on+0xa2/0x110
> ? __fput+0x19f/0x390
> ? lockdep_hardirqs_on_prepare+0x242/0x400
> ? syscall_exit_to_user_mode+0x5f/0x90
> ? __do_fast_syscall_32+0x7c/0xd0
> ? lockdep_hardirqs_on_prepare+0x242/0x400
> ? syscall_exit_to_user_mode+0x5f/0x90
> ? __do_fast_syscall_32+0x7c/0xd0
> ? __do_fast_syscall_32+0x7c/0xd0
> ? syscall_exit_to_user_mode+0x5f/0x90
> ? __do_fast_syscall_32+0x7c/0xd0
> ? __do_fast_syscall_32+0x7c/0xd0
> ? __do_fast_syscall_32+0x7c/0xd0
> ? irqentry_exit_to_user_mode+0x23/0x30
> ? irqentry_exit+0x7f/0xc0
> do_fast_syscall_32+0x32/0x70
> do_SYSENTER_32+0x15/0x20
> entry_SYSENTER_32+0xa2/0xfb
> EIP: 0xb7f8c549
> Code: 03 74 c0 01 10 05 03 74 b8 01 10 06 03 74 b4 01 10 07 03 74 b0 01 10 08 03 74 d8 01 00 00 00 00 00 51 52 55 89 e5 0f 34 cd 80 <5d> 5a 59 c3 90 90 90 90 8d 76 00 58 b8 77 00 00 00 cd 80 90 8d 76
> EAX: ffffffda EBX: 00000006 ECX: 006142a1 EDX: 002a8000
> ESI: 00000000 EDI: 00000001 EBP: 00614024 ESP: bff3c4a0
> DS: 007b ES: 007b FS: 0000 GS: 0033 SS: 007b EFLAGS: 00200246
> Modules linked in:
> ---[ end trace 0000000000000000 ]---
> EIP: __dump_page.cold+0x282/0x369
> Code: ff ff 83 05 e8 5d bb c5 01 ba 4c c4 2f c4 89 f8 83 15 ec 5d bb c5 00 e8 f2 92 ed fd 83 05 f8 5d bb c5 01 83 15 fc 5d bb c5 00 <0f> 0b 83 05 00 5e bb c5 01 b8 ac 85 a3 c4 83 15 04 5e bb c5 00 e8
> EAX: 00000000 EBX: e764d530 ECX: 00000003 EDX: 4108888f
> ESI: e764d4e0 EDI: e764d4e0 EBP: ed89db3c ESP: ed89db00
> DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 EFLAGS: 00210046
> CR0: 80050033 CR2: 00616abc CR3: 2d878000 CR4: 000406d0
> DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
> DR6: fffe0ff0 DR7: 00000400
> Kernel panic - not syncing: Fatal exception
> Kernel Offset: disabled

--
Thanks,
Hyeonggon

2023-01-01 08:26:32

On Tue, Jan 03, 2023 at 09:46:33PM +0800, Oliver Sang wrote:
> On Tue, Jan 03, 2023 at 11:42:11AM +0100, Vlastimil Babka wrote:
> > So the events leading up to this could be something like:
> >
> > - 0x2daee is order-1 slab folio of the inode cache, sitting on the partial list
> > - despite being on partial list, it's freed ???
> > - somebody else allocates order-2 page 0x2daec and uses it for whatever,
> > then frees it
> > - 0x2daec is reallocated as order-1 slab from names_cache, then freed
> > - we try to allocate from the slab page 0x2daee and trip on the PageTail
> >
> > Except, the freeing of order-2 page would have reset the PageTail and
> > compound_head in 0x2daec, so this is even more complicated or involves some
> > extra race?
>
> FYI, we ran tests more up to 500 times, then saw different issues but rate is
> actually low
>
> 56d5a2b9ba85a390 0af8489b0216fa1dd83e264bef8
> ---------------- ---------------------------
> fail:runs %reproduction fail:runs
> | | |
> :500 12% 61:500 dmesg.invalid_opcode:#[##]
> :500 3% 14:500 dmesg.kernel_BUG_at_include/linux/mm.h
> :500 3% 17:500 dmesg.kernel_BUG_at_include/linux/page-flags.h
> :500 5% 26:500 dmesg.kernel_BUG_at_lib/list_debug.c
> :500 0% 2:500 dmesg.kernel_BUG_at_mm/page_alloc.c
> :500 0% 2:500 dmesg.kernel_BUG_at_mm/usercopy.c
>
> >
> > In any case, this is something a debug_pagealloc kernel could have a chance
> > of catching earlier. Would it be possible to enable CONFIG_DEBUG_PAGEALLOC
> > and DEBUG_PAGEALLOC_ENABLE_DEFAULT additionally to the rest of the
> > configuration, and repeat the test?
>
> ok, we are starting to test by these 2 additional configs now.

BTW it seems to be totally unrelated to rcutorture tests.
Are there similar reports in boot tests with the same config?

> >
> > Separately we should also make the __dump_page() more resilient.
> >
> > Thanks,
> > Vlastimil
> >
> > > [ 25.804432][ T214] ------------[ cut here ]------------
> > > [ 25.804917][ T214] kernel BUG at include/linux/mm.h:825!
> > > [ 25.805402][ T214] invalid opcode: 0000 [#1] SMP
> > > [ 25.805820][ T214] CPU: 0 PID: 214 Comm: udevadm Tainted: G S 6.1.0-rc2-00014-g0af8489b0216 #2 1c4d7707ec0ce574ed62a77e82a8580202758048
> > > [ 25.806944][ T214] EIP: __dump_page.cold+0x282/0x369
> > > [ 25.807376][ T214] Code: ff ff 83 05 e8 5d bb c5 01 ba 4c c4 2f c4 89 f8 83 15 ec 5d bb c5 00 e8 f2 92 ed fd 83 05 f8 5d bb c5 01 83 15 fc 5d bb c5 00 <0f> 0b 83 05 00 5e bb c5 01 b8 ac 85 a3 c4 83 15 04 5e bb c5 00 e8
> > > [ 25.808960][ T214] EAX: 00000000 EBX: e764d530 ECX: 00000003 EDX: 4108888f
> > > [ 25.809578][ T214] ESI: e764d4e0 EDI: e764d4e0 EBP: ed89db3c ESP: ed89db00
> > > [ 25.810168][ T214] DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 EFLAGS: 00210046
> > > [ 25.810803][ T214] CR0: 80050033 CR2: 00616abc CR3: 2d878000 CR4: 000406d0
> > > [ 25.811407][ T214] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
> > > [ 25.811999][ T214] DR6: fffe0ff0 DR7: 00000400
> > > [ 25.812390][ T214] Call Trace:
> > > [ 25.812675][ T214] dump_page+0x2a/0xc0
> > > [ 25.813025][ T214] ? _raw_spin_lock_irqsave+0x16/0x30
> > > [ 25.813492][ T214] folio_flags+0x23/0x70
> > > [ 25.813945][ T214] get_partial_node+0x89/0x290
> > > [ 25.814357][ T214] __slab_alloc_node+0xbb/0x270
> > > [ 25.814860][ T214] kmem_cache_alloc_lru+0x8d/0x4e0
> > > [ 25.815289][ T214] ? __lock_release+0x3ec/0x410
> > > [ 25.815697][ T214] ? iget_locked+0x78/0x310
> > > [ 25.816096][ T214] alloc_inode+0x93/0x150
> > > [ 25.816469][ T214] iget_locked+0xdd/0x310
> > > [ 25.816829][ T214] ? lock_is_held_type+0x80/0xf0
> > > [ 25.817264][ T214] kernfs_get_inode+0x24/0xb0
> > > [ 25.817670][ T214] kernfs_iop_lookup+0xb5/0x1a0
> > > [ 25.818087][ T214] __lookup_slow+0xd9/0x2a0
> > > [ 25.818479][ T214] lookup_slow+0x50/0x90
> > > [ 25.818847][ T214] walk_component+0x19c/0x2c0
> > > [ 25.819244][ T214] path_lookupat+0xa3/0x270
> > > [ 25.819627][ T214] path_openat+0x307/0x3e0
> > > [ 25.820007][ T214] do_filp_open+0x7c/0x130
> > > [ 25.820409][ T214] do_sys_openat2+0x113/0x1f0
> > > [ 25.820807][ T214] do_sys_open+0x8e/0xe0
> > > [ 25.821211][ T214] __ia32_sys_openat+0x2b/0x40
> > > [ 25.821622][ T214] __do_fast_syscall_32+0x72/0xd0
> > > [ 25.822057][ T214] ? trace_hardirqs_on+0xa2/0x110
> > > [ 25.822480][ T214] ? __fput+0x19f/0x390
> > > [ 25.822842][ T214] ? lockdep_hardirqs_on_prepare+0x242/0x400
> > > [ 25.823346][ T214] ? syscall_exit_to_user_mode+0x5f/0x90
> > > [ 25.823823][ T214] ? __do_fast_syscall_32+0x7c/0xd0
> > > [ 25.824259][ T214] ? lockdep_hardirqs_on_prepare+0x242/0x400
> > > [ 25.824767][ T214] ? syscall_exit_to_user_mode+0x5f/0x90
> > > [ 25.825254][ T214] ? __do_fast_syscall_32+0x7c/0xd0
> > > [ 25.825696][ T214] ? __do_fast_syscall_32+0x7c/0xd0
> > > [ 25.826155][ T214] ? syscall_exit_to_user_mode+0x5f/0x90
> > > [ 25.826627][ T214] ? __do_fast_syscall_32+0x7c/0xd0
> > > [ 25.827056][ T214] ? __do_fast_syscall_32+0x7c/0xd0
> > > [ 25.827486][ T214] ? __do_fast_syscall_32+0x7c/0xd0
> > > [ 25.827929][ T214] ? irqentry_exit_to_user_mode+0x23/0x30
> > > [ 25.828423][ T214] ? irqentry_exit+0x7f/0xc0
> > > [ 25.828812][ T214] do_fast_syscall_32+0x32/0x70
> > > [ 25.829223][ T214] do_SYSENTER_32+0x15/0x20
> > > [ 25.829589][ T214] entry_SYSENTER_32+0xa2/0xfb
> > > [ 25.830003][ T214] EIP: 0xb7f8c549
> > > [ 25.830330][ T214] Code: 03 74 c0 01 10 05 03 74 b8 01 10 06 03 74 b4 01 10 07 03 74 b0 01 10 08 03 74 d8 01 00 00 00 00 00 51 52 55 89 e5 0f 34 cd 80 <5d> 5a 59 c3 90 90 90 90 8d 76 00 58 b8 77 00 00 00 cd 80 90 8d 76
> > > [ 25.831929][ T214] EAX: ffffffda EBX: 00000006 ECX: 006142a1 EDX: 002a8000
> > > [ 25.832522][ T214] ESI: 00000000 EDI: 00000001 EBP: 00614024 ESP: bff3c4a0
> > > [ 25.833123][ T214] DS: 007b ES: 007b FS: 0000 GS: 0033 SS: 007b EFLAGS: 00200246
> > > [ 25.833738][ T214] Modules linked in:
> > > [ 25.834062][ T214] ---[ end trace 0000000000000000 ]---
> > > [ 25.834522][ T214] EIP: __dump_page.cold+0x282/0x369
> > > [ 25.834960][ T214] Code: ff ff 83 05 e8 5d bb c5 01 ba 4c c4 2f c4 89 f8 83 15 ec 5d bb c5 00 e8 f2 92 ed fd 83 05 f8 5d bb c5 01 83 15 fc 5d bb c5 00 <0f> 0b 83 05 00 5e bb c5 01 b8 ac 85 a3 c4 83 15 04 5e bb c5 00 e8
> > > [ 25.836574][ T214] EAX: 00000000 EBX: e764d530 ECX: 00000003 EDX: 4108888f
> > > [ 25.837183][ T214] ESI: e764d4e0 EDI: e764d4e0 EBP: ed89db3c ESP: ed89db00
> > > [ 25.837772][ T214] DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 EFLAGS: 00210046
> > > [ 25.838414][ T214] CR0: 80050033 CR2: 00616abc CR3: 2d878000 CR4: 000406d0
> > > [ 25.839011][ T214] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
> > > [ 25.839597][ T214] DR6: fffe0ff0 DR7: 00000400
> > > [ 25.839995][ T214] Kernel panic - not syncing: Fatal exception
> > > [ 25.840554][ T214] Kernel Offset: disabled
> >
> >

--
Thanks,
Hyeonggon

2023-01-05 02:54:52

Hi, Vlastimil,

On Thu, Jan 12, 2023 at 08:56:59AM +0100, Vlastimil Babka wrote:
>
> Actually no, by "obscure" means with CONFIG_SLUB_DEBUG it wouldn't happen
> anymore. But this is the opposite, it seems to happen a lot. I would have
> preferred that slub debugging catches some slab misuse, but this seems
> useful too. With such fail rates you can perhaps try ealier kernels than 6.0
> and eventually find the truly clean and first bad release and bisect?

Thanks a lot for guidance!

yeah, we reached back to until v5.14-rc1 which still has similar issue,
and v5.13 is clean. new bisection was triggered then we got '7118fc2906'

this was already reported as
"[linus:master] [hugetlb] 7118fc2906: kernel_BUG_at_lib/list_debug.c"
at https://lore.kernel.org/all/[email protected]/
and I add you, Hyeonggon, Feng and Fengwei there.

hope that would be helpful.