Greeting,
FYI, we noticed the following commit (built with gcc-11):
commit: abedd49a7ab4f3f13d09e34a3ddbc236d330b1da ("cpumask: add cpumask_nth_{,and,andnot}")
https://github.com/norov/linux and_empty
in testcase: netperf
version: netperf-x86_64-2.7-0_20220627
with following parameters:
ip: ipv4
runtime: 300s
nr_threads: 1
cluster: cs-localhost
test: UDP_STREAM
cpufreq_governor: performance
ucode: 0x7002402
test-description: Netperf is a benchmark that can be use to measure various aspect of networking performance.
test-url: http://www.netperf.org/netperf/
on test machine: 144 threads 4 sockets Intel(R) Xeon(R) Gold 5318H CPU @ 2.50GHz with 128G memory
caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):
If you fix the issue, kindly add following tag
Reported-by: kernel test robot <[email protected]>
[ 6.399301][ T917] ------------[ cut here ]------------
[ 6.399304][ T917] kernel BUG at lib/cpumask.c:147!
[ 6.399314][ T917] invalid opcode: 0000 [#1] SMP NOPTI
[ 6.399321][ T917] CPU: 18 PID: 917 Comm: kworker/18:1 Not tainted 5.19.0-rc3-00030-gabedd49a7ab4 #6
[ 6.399327][ T917] Hardware name: Intel Corporation CooperCity/CooperCity, BIOS WLYDCRB1.SYS.0016.P29.2006080250 06/08/2020
[ 6.399331][ T917] Workqueue: events work_for_cpu_fn
[ 6.399345][ T917] RIP: 0010:cpumask_local_spread (kbuild/src/x86_64/lib/cpumask.c:147)
[ 6.399360][ T917] Code: 39 d1 73 26 48 8b 34 dd 00 ba 52 83 49 c7 c0 ff ff ff ff 48 c7 c7 a0 46 d8 82 e8 bc 99 fb ff 39 05 c6 44 71 01 41 89 c0 77 b8 <0f> 0b 48 39 d1 73 f9 45 31 c0 31 f6 eb db cc cc cc cc cc cc cc cc
All code
========
0: 39 d1 cmp %edx,%ecx
2: 73 26 jae 0x2a
4: 48 8b 34 dd 00 ba 52 mov -0x7cad4600(,%rbx,8),%rsi
b: 83
c: 49 c7 c0 ff ff ff ff mov $0xffffffffffffffff,%r8
13: 48 c7 c7 a0 46 d8 82 mov $0xffffffff82d846a0,%rdi
1a: e8 bc 99 fb ff callq 0xfffffffffffb99db
1f: 39 05 c6 44 71 01 cmp %eax,0x17144c6(%rip) # 0x17144eb
25: 41 89 c0 mov %eax,%r8d
28: 77 b8 ja 0xffffffffffffffe2
2a:* 0f 0b ud2 <-- trapping instruction
2c: 48 39 d1 cmp %rdx,%rcx
2f: 73 f9 jae 0x2a
31: 45 31 c0 xor %r8d,%r8d
34: 31 f6 xor %esi,%esi
36: eb db jmp 0x13
38: cc int3
39: cc int3
3a: cc int3
3b: cc int3
3c: cc int3
3d: cc int3
3e: cc int3
3f: cc int3
Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: 48 39 d1 cmp %rdx,%rcx
5: 73 f9 jae 0x0
7: 45 31 c0 xor %r8d,%r8d
a: 31 f6 xor %esi,%esi
c: eb db jmp 0xffffffffffffffe9
e: cc int3
f: cc int3
10: cc int3
11: cc int3
12: cc int3
13: cc int3
14: cc int3
15: cc int3
[ 6.399364][ T917] RSP: 0000:ffffc900091ebd00 EFLAGS: 00010202
[ 6.399368][ T917] RAX: 0000000000000090 RBX: 0000000000000001 RCX: 00000000ffffff94
[ 6.399372][ T917] RDX: 0000000000000090 RSI: ffffffff82d846b0 RDI: 000000000000ffff
[ 6.399376][ T917] RBP: 0000000000000024 R08: 0000000000000090 R09: ffffffff82d846b0
[ 6.399379][ T917] R10: ffffffff82d846a0 R11: 0000000000000002 R12: 0000000000000024
[ 6.399382][ T917] R13: 0000000000000000 R14: 0000000000000001 R15: 000000000000003f
[ 6.399385][ T917] FS: 0000000000000000(0000) GS:ffff88905f400000(0000) knlGS:0000000000000000
[ 6.399389][ T917] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 6.399392][ T917] CR2: 0000000000000000 CR3: 000000207ec0a001 CR4: 00000000007706e0
[ 6.399396][ T917] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 6.399398][ T917] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 6.399400][ T917] PKRU: 55555554
[ 6.399402][ T917] Call Trace:
[ 6.399406][ T917] <TASK>
[ 6.399409][ T917] ixgbe_alloc_q_vector (kbuild/src/x86_64/include/linux/topology.h:96 kbuild/src/x86_64/drivers/net/ethernet/intel/ixgbe/ixgbe_lib.c:853)
[ 6.399420][ T917] ixgbe_alloc_q_vectors (kbuild/src/x86_64/drivers/net/ethernet/intel/ixgbe/ixgbe_lib.c:1094)
[ 6.399420][ T917] ixgbe_init_interrupt_scheme (kbuild/src/x86_64/drivers/net/ethernet/intel/ixgbe/ixgbe_lib.c:1248)
[ 6.399420][ T917] ixgbe_probe (kbuild/src/x86_64/drivers/net/ethernet/intel/ixgbe/ixgbe_main.c:11004)
[ 6.399420][ T917] local_pci_probe (kbuild/src/x86_64/drivers/pci/pci-driver.c:324)
[ 6.399420][ T917] ? __schedule (kbuild/src/x86_64/kernel/sched/core.c:6466)
[ 6.399420][ T917] work_for_cpu_fn (kbuild/src/x86_64/kernel/workqueue.c:5165)
[ 6.399420][ T917] process_one_work (kbuild/src/x86_64/kernel/workqueue.c:2289)
[ 6.399420][ T917] ? process_one_work (kbuild/src/x86_64/kernel/workqueue.c:2379)
[ 6.399420][ T917] worker_thread (kbuild/src/x86_64/include/linux/list.h:292 kbuild/src/x86_64/kernel/workqueue.c:2349 kbuild/src/x86_64/kernel/workqueue.c:2441)
[ 6.399420][ T917] ? process_one_work (kbuild/src/x86_64/kernel/workqueue.c:2379)
[ 6.399420][ T917] kthread (kbuild/src/x86_64/kernel/kthread.c:376)
[ 6.399420][ T917] ? kthread_complete_and_exit (kbuild/src/x86_64/kernel/kthread.c:331)
[ 6.399420][ T917] ret_from_fork (kbuild/src/x86_64/arch/x86/entry/entry_64.S:302)
[ 6.399420][ T917] </TASK>
[ 6.399420][ T917] Modules linked in:
[ 6.399525][ T917] ---[ end trace 0000000000000000 ]---
[ 6.434314][ T917] RIP: 0010:cpumask_local_spread (kbuild/src/x86_64/lib/cpumask.c:147)
[ 6.434318][ T917] Code: 39 d1 73 26 48 8b 34 dd 00 ba 52 83 49 c7 c0 ff ff ff ff 48 c7 c7 a0 46 d8 82 e8 bc 99 fb ff 39 05 c6 44 71 01 41 89 c0 77 b8 <0f> 0b 48 39 d1 73 f9 45 31 c0 31 f6 eb db cc cc cc cc cc cc cc cc
All code
========
0: 39 d1 cmp %edx,%ecx
2: 73 26 jae 0x2a
4: 48 8b 34 dd 00 ba 52 mov -0x7cad4600(,%rbx,8),%rsi
b: 83
c: 49 c7 c0 ff ff ff ff mov $0xffffffffffffffff,%r8
13: 48 c7 c7 a0 46 d8 82 mov $0xffffffff82d846a0,%rdi
1a: e8 bc 99 fb ff callq 0xfffffffffffb99db
1f: 39 05 c6 44 71 01 cmp %eax,0x17144c6(%rip) # 0x17144eb
25: 41 89 c0 mov %eax,%r8d
28: 77 b8 ja 0xffffffffffffffe2
2a:* 0f 0b ud2 <-- trapping instruction
2c: 48 39 d1 cmp %rdx,%rcx
2f: 73 f9 jae 0x2a
31: 45 31 c0 xor %r8d,%r8d
34: 31 f6 xor %esi,%esi
36: eb db jmp 0x13
38: cc int3
39: cc int3
3a: cc int3
3b: cc int3
3c: cc int3
3d: cc int3
3e: cc int3
3f: cc int3
Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: 48 39 d1 cmp %rdx,%rcx
5: 73 f9 jae 0x0
7: 45 31 c0 xor %r8d,%r8d
a: 31 f6 xor %esi,%esi
c: eb db jmp 0xffffffffffffffe9
e: cc int3
f: cc int3
10: cc int3
11: cc int3
12: cc int3
13: cc int3
14: cc int3
15: cc int3
To reproduce:
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
--
0-DAY CI Kernel Test Service
https://01.org/lkp