2022-07-24 13:44:31

by kernel test robot

[permalink] [raw]
Subject: [cpumask] 7ac97b2db8: kernel_BUG_at_lib/cpumask.c



Greeting,

FYI, we noticed the following commit (built with gcc-11):

commit: 7ac97b2db80a5f6edd333f8b4a6b79610d164f7f ("cpumask: add cpumask_nth_{,and,andnot}")
https://github.com/norov/linux __bitmap-for-next

in testcase: fio-basic
version: fio-x86_64-3.15-1_20220723
with following parameters:

runtime: 300s
disk: 1HDD
fs: btrfs
nr_task: 100%
test_size: 128G
rw: write
bs: 4k
ioengine: filecreate
cpufreq_governor: performance
ucode: 0xb000280

test-description: Fio is a tool that will spawn a number of threads or processes doing a particular type of I/O action as specified by the user.
test-url: https://github.com/axboe/fio


on test machine: 96 threads 2 sockets Ice Lake with 256G memory

caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):



If you fix the issue, kindly add following tag
Reported-by: kernel test robot <[email protected]>


[ 6.504630][ T19] ------------[ cut here ]------------
[ 6.504631][ T19] kernel BUG at lib/cpumask.c:145!
[ 6.504637][ T19] invalid opcode: 0000 [#1] SMP NOPTI
[ 6.504640][ T19] CPU: 0 PID: 19 Comm: kworker/0:1 Not tainted 5.19.0-rc3-00028-g7ac97b2db80a #1
[ 6.504643][ T19] Workqueue: events work_for_cpu_fn
[ 6.504650][ T19] RIP: 0010:cpumask_local_spread (kbuild/src/x86_64/lib/cpumask.c:145)
[ 6.504655][ T19] Code: 5b 41 5c c3 89 d1 4c 39 c9 73 1f 45 31 c0 31 f6 4c 89 ca 48 c7 c7 a0 46 d8 82 e8 e0 99 fb ff 3b 05 2a 45 71 01 41 89 c0 72 d3 <0f> 0b cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc 55
All code
========
0: 5b pop %rbx
1: 41 5c pop %r12
3: c3 retq
4: 89 d1 mov %edx,%ecx
6: 4c 39 c9 cmp %r9,%rcx
9: 73 1f jae 0x2a
b: 45 31 c0 xor %r8d,%r8d
e: 31 f6 xor %esi,%esi
10: 4c 89 ca mov %r9,%rdx
13: 48 c7 c7 a0 46 d8 82 mov $0xffffffff82d846a0,%rdi
1a: e8 e0 99 fb ff callq 0xfffffffffffb99ff
1f: 3b 05 2a 45 71 01 cmp 0x171452a(%rip),%eax # 0x171454f
25: 41 89 c0 mov %eax,%r8d
28: 72 d3 jb 0xfffffffffffffffd
2a:* 0f 0b ud2 <-- trapping instruction
2c: cc int3
2d: cc int3
2e: cc int3
2f: cc int3
30: cc int3
31: cc int3
32: cc int3
33: cc int3
34: cc int3
35: cc int3
36: cc int3
37: cc int3
38: cc int3
39: cc int3
3a: cc int3
3b: cc int3
3c: cc int3
3d: cc int3
3e: cc int3
3f: 55 push %rbp

Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: cc int3
3: cc int3
4: cc int3
5: cc int3
6: cc int3
7: cc int3
8: cc int3
9: cc int3
a: cc int3
b: cc int3
c: cc int3
d: cc int3
e: cc int3
f: cc int3
10: cc int3
11: cc int3
12: cc int3
13: cc int3
14: cc int3
15: 55 push %rbp
[ 6.504656][ T19] RSP: 0000:ffa000000032bd00 EFLAGS: 00010246
[ 6.504658][ T19] RAX: 0000000000000060 RBX: 0000000000000000 RCX: 0000000000000000
[ 6.504658][ T19] RDX: 0000000000000002 RSI: 0000000000000080 RDI: 00000000ffffff00
[ 6.504659][ T19] RBP: ff110001093c0980 R08: 0000000000000060 R09: 0000000000000060
[ 6.504660][ T19] R10: ff1100407ffff040 R11: ffffffff82d846a0 R12: 0000000000000030
[ 6.504661][ T19] R13: 0000000000000000 R14: 0000000000000001 R15: 000000000000003f
[ 6.504662][ T19] FS: 0000000000000000(0000) GS:ff11001fff600000(0000) knlGS:0000000000000000
[ 6.504663][ T19] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 6.504663][ T19] CR2: ff1100407fc00000 CR3: 000000407ec0a001 CR4: 0000000000771ef0
[ 6.504664][ T19] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 6.504665][ T19] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 6.504665][ T19] PKRU: 55555554
[ 6.504666][ T19] Call Trace:
[ 6.504668][ T19] <TASK>
[ 6.504670][ T19] ixgbe_alloc_q_vector (kbuild/src/x86_64/include/linux/topology.h:96 kbuild/src/x86_64/drivers/net/ethernet/intel/ixgbe/ixgbe_lib.c:853)
[ 6.504674][ T19] ixgbe_alloc_q_vectors (kbuild/src/x86_64/drivers/net/ethernet/intel/ixgbe/ixgbe_lib.c:1094)
[ 6.504675][ T19] ixgbe_init_interrupt_scheme (kbuild/src/x86_64/drivers/net/ethernet/intel/ixgbe/ixgbe_lib.c:1248)
[ 6.504676][ T19] ixgbe_probe (kbuild/src/x86_64/drivers/net/ethernet/intel/ixgbe/ixgbe_main.c:11004)
[ 6.504680][ T19] local_pci_probe (kbuild/src/x86_64/drivers/pci/pci-driver.c:324)
[ 6.504685][ T19] ? __schedule (kbuild/src/x86_64/kernel/sched/core.c:6466)
[ 6.504688][ T19] work_for_cpu_fn (kbuild/src/x86_64/kernel/workqueue.c:5165)
[ 6.504690][ T19] process_one_work (kbuild/src/x86_64/kernel/workqueue.c:2289)
[ 6.504692][ T19] ? process_one_work (kbuild/src/x86_64/kernel/workqueue.c:2379)
[ 6.504693][ T19] worker_thread (kbuild/src/x86_64/include/linux/list.h:292 kbuild/src/x86_64/kernel/workqueue.c:2349 kbuild/src/x86_64/kernel/workqueue.c:2441)
[ 6.504695][ T19] ? process_one_work (kbuild/src/x86_64/kernel/workqueue.c:2379)
[ 6.504695][ T19] kthread (kbuild/src/x86_64/kernel/kthread.c:376)
[ 6.504698][ T19] ? kthread_complete_and_exit (kbuild/src/x86_64/kernel/kthread.c:331)
[ 6.504700][ T19] ret_from_fork (kbuild/src/x86_64/arch/x86/entry/entry_64.S:302)
[ 6.504705][ T19] </TASK>
[ 6.504705][ T19] Modules linked in:
[ 6.504708][ T19] ---[ end trace 0000000000000000 ]---
[ 6.509032][ T19] RIP: 0010:cpumask_local_spread (kbuild/src/x86_64/lib/cpumask.c:145)
[ 6.509035][ T19] Code: 5b 41 5c c3 89 d1 4c 39 c9 73 1f 45 31 c0 31 f6 4c 89 ca 48 c7 c7 a0 46 d8 82 e8 e0 99 fb ff 3b 05 2a 45 71 01 41 89 c0 72 d3 <0f> 0b cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc 55
All code
========
0: 5b pop %rbx
1: 41 5c pop %r12
3: c3 retq
4: 89 d1 mov %edx,%ecx
6: 4c 39 c9 cmp %r9,%rcx
9: 73 1f jae 0x2a
b: 45 31 c0 xor %r8d,%r8d
e: 31 f6 xor %esi,%esi
10: 4c 89 ca mov %r9,%rdx
13: 48 c7 c7 a0 46 d8 82 mov $0xffffffff82d846a0,%rdi
1a: e8 e0 99 fb ff callq 0xfffffffffffb99ff
1f: 3b 05 2a 45 71 01 cmp 0x171452a(%rip),%eax # 0x171454f
25: 41 89 c0 mov %eax,%r8d
28: 72 d3 jb 0xfffffffffffffffd
2a:* 0f 0b ud2 <-- trapping instruction
2c: cc int3
2d: cc int3
2e: cc int3
2f: cc int3
30: cc int3
31: cc int3
32: cc int3
33: cc int3
34: cc int3
35: cc int3
36: cc int3
37: cc int3
38: cc int3
39: cc int3
3a: cc int3
3b: cc int3
3c: cc int3
3d: cc int3
3e: cc int3
3f: 55 push %rbp

Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: cc int3
3: cc int3
4: cc int3
5: cc int3
6: cc int3
7: cc int3
8: cc int3
9: cc int3
a: cc int3
b: cc int3
c: cc int3
d: cc int3
e: cc int3
f: cc int3
10: cc int3
11: cc int3
12: cc int3
13: cc int3
14: cc int3
15: 55 push %rbp


To reproduce:

git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file

# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.



--
0-DAY CI Kernel Test Service
https://01.org/lkp



Attachments:
(No filename) (9.54 kB)
config-5.19.0-rc3-00028-g7ac97b2db80a (166.43 kB)
job-script (8.60 kB)
dmesg.xz (16.54 kB)
job.yaml (5.62 kB)
Download all attachments