Greeting,
FYI, we noticed the following commit (built with gcc-11):
commit: b8d7b1582e80076c6274bb971818daddfe701a0a ("[PATCH v2] workqueue: Use active mask for new worker when pool is DISASSOCIATED")
url: https://github.com/intel-lab-lkp/linux/commits/Schspa-Shi/workqueue-Use-active-mask-for-new-worker-when-pool-is-DISASSOCIATED/20220713-205410
base: https://git.kernel.org/cgit/linux/kernel/git/tj/wq.git for-next
patch link: https://lore.kernel.org/lkml/[email protected]
in testcase: boot
on test machine: qemu-system-x86_64 -enable-kvm -cpu SandyBridge -smp 2 -m 16G
caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):
If you fix the issue, kindly add following tag
Reported-by: kernel test robot <[email protected]>
[ 0.393607][ T1] ------------[ cut here ]------------
[ 0.394247][ T1] WARNING: CPU: 0 PID: 1 at kernel/kthread.c:593 kthread_set_per_cpu (kernel/kthread.c:593 (discriminator 1))
[ 0.395431][ T1] Modules linked in:
[ 0.396197][ T1] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 5.19.0-rc1-00007-gb8d7b1582e80 #1
[ 0.397394][ T1] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.16.0-debian-1.16.0-4 04/01/2014
[ 0.398450][ T1] RIP: 0010:kthread_set_per_cpu (kernel/kthread.c:593 (discriminator 1))
[ 0.399239][ T1] Code: 25 48 8b 87 20 0a 00 00 48 85 c0 74 13 81 e2 00 00 00 04 74 15 85 f6 78 08 89 70 08 f0 80 08 01 c3 f0 80 20 fe c3 0f 0b eb d7 <0f> 0b eb e7 0f 1f 00 0f 1f 44 00 00 48 8b 97 20 0a 00 00 31 c0 48
All code
========
0: 25 48 8b 87 20 and $0x20878b48,%eax
5: 0a 00 or (%rax),%al
7: 00 48 85 add %cl,-0x7b(%rax)
a: c0 74 13 81 e2 shlb $0xe2,-0x7f(%rbx,%rdx,1)
f: 00 00 add %al,(%rax)
11: 00 04 74 add %al,(%rsp,%rsi,2)
14: 15 85 f6 78 08 adc $0x878f685,%eax
19: 89 70 08 mov %esi,0x8(%rax)
1c: f0 80 08 01 lock orb $0x1,(%rax)
20: c3 retq
21: f0 80 20 fe lock andb $0xfe,(%rax)
25: c3 retq
26: 0f 0b ud2
28: eb d7 jmp 0x1
2a:* 0f 0b ud2 <-- trapping instruction
2c: eb e7 jmp 0x15
2e: 0f 1f 00 nopl (%rax)
31: 0f 1f 44 00 00 nopl 0x0(%rax,%rax,1)
36: 48 8b 97 20 0a 00 00 mov 0xa20(%rdi),%rdx
3d: 31 c0 xor %eax,%eax
3f: 48 rex.W
Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: eb e7 jmp 0xffffffffffffffeb
4: 0f 1f 00 nopl (%rax)
7: 0f 1f 44 00 00 nopl 0x0(%rax,%rax,1)
c: 48 8b 97 20 0a 00 00 mov 0xa20(%rdi),%rdx
13: 31 c0 xor %eax,%eax
15: 48 rex.W
[ 0.400782][ T1] RSP: 0000:ffffb90880013e10 EFLAGS: 00010246
[ 0.401261][ T1] RAX: ffff930f80135480 RBX: ffff930f801dc180 RCX: 00000000000000a9
[ 0.402421][ T1] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff930f802bd000
[ 0.403375][ T1] RBP: ffff9312afc2af00 R08: ffffb90880013d98 R09: ffff930f802bdb54
[ 0.404378][ T1] R10: 0000000000000010 R11: 0000000000000010 R12: ffff930f801dc180
[ 0.405390][ T1] R13: 0000000000000000 R14: ffff9312afc2b1c0 R15: 0000000000000000
[ 0.406385][ T1] FS: 0000000000000000(0000) GS:ffff9312afc00000(0000) knlGS:0000000000000000
[ 0.407427][ T1] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 0.408258][ T1] CR2: ffff9312bffff000 CR3: 0000000061e0a000 CR4: 00000000000406f0
[ 0.409389][ T1] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 0.410398][ T1] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 0.411390][ T1] Call Trace:
[ 0.411941][ T1] <TASK>
[ 0.412534][ T1] worker_attach_to_pool (kernel/workqueue.c:1875)
[ 0.413943][ T1] create_worker (kernel/workqueue.c:1960)
[ 0.414826][ T1] workqueue_init (kernel/workqueue.c:6103)
[ 0.415216][ T1] kernel_init_freeable (init/main.c:1596)
[ 0.415942][ T1] ? rest_init (init/main.c:1491)
[ 0.416785][ T1] kernel_init (init/main.c:1501)
[ 0.417225][ T1] ret_from_fork (arch/x86/entry/entry_64.S:308)
[ 0.417941][ T1] </TASK>
[ 0.418544][ T1] ---[ end trace 0000000000000000 ]---
[ 0.419471][ T1] cblist_init_generic: Setting adjustable number of callback queues.
[ 0.419938][ T1] cblist_init_generic: Setting shift to 1 and lim to 1.
[ 0.420989][ T1] cblist_init_generic: Setting shift to 1 and lim to 1.
[ 0.421979][ T1] Performance Events: unsupported p6 CPU model 42 no PMU driver, software events only.
[ 0.423563][ T14] printk: console [ttyS0] printing thread started
[ 0.423571][ T1] rcu: Hierarchical SRCU implementation.
[ 0.423602][ T15] printk: console [tty0] printing thread started
[ 0.426677][ T1] NMI watchdog: Perf NMI watchdog permanently disabled
[ 0.426810][ T1] smp: Bringing up secondary CPUs ...
[ 0.428673][ T1] x86: Booting SMP configuration:
[ 0.428674][ T1] .... node #0, CPUs: #1
[ 0.115857][ T0] masked ExtINT on CPU#1
[ 0.431534][ T1] smp: Brought up 1 node, 2 CPUs
[ 0.431534][ T1] smpboot: Max logical packages: 1
[ 0.431534][ T1] smpboot: Total of 2 processors activated (8779.66 BogoMIPS)
[ 0.491379][ T25] node 0 deferred pages initialised in 60ms
[ 0.493264][ T1] devtmpfs: initialized
[ 0.493264][ T1] x86/mm: Memory block size: 128MB
[ 0.496268][ T1] clocksource: jiffies: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 1911260446275000 ns
[ 0.496275][ T1] futex hash table entries: 512 (order: 3, 32768 bytes, linear)
[ 0.496367][ T1] pinctrl core: initialized pinctrl subsystem
[ 0.500514][ T1] NET: Registered PF_NETLINK/PF_ROUTE protocol family
[ 0.500914][ T1] audit: initializing netlink subsys (disabled)
[ 0.502010][ T1] thermal_sys: Registered thermal governor 'fair_share'
[ 0.502013][ T1] thermal_sys: Registered thermal governor 'bang_bang'
[ 0.502014][ T1] thermal_sys: Registered thermal governor 'step_wise'
[ 0.502015][ T1] thermal_sys: Registered thermal governor 'user_space'
[ 0.502041][ T29] audit: type=2000 audit(1657863732.997:1): state=initialized audit_enabled=0 res=1
[ 0.502048][ T1] cpuidle: using governor menu
[ 0.507066][ T1] acpiphp: ACPI Hot Plug PCI Controller Driver version: 0.5
[ 0.507441][ T1] PCI: Using configuration type 1 for base access
[ 0.511839][ T1] kprobes: kprobe jump-optimization is enabled. All kprobes are optimized if possible.
[ 0.550170][ T1] HugeTLB: can optimize 7 vmemmap pages for hugepages-2048kB
[ 0.550176][ T1] HugeTLB registered 2.00 MiB page size, pre-allocated 0 pages
[ 0.550414][ T1] cryptd: max_cpu_qlen set to 1000
[ 0.550937][ T1] ACPI: Added _OSI(Module Device)
[ 0.550937][ T1] ACPI: Added _OSI(Processor Device)
[ 0.550937][ T1] ACPI: Added _OSI(3.0 _SCP Extensions)
[ 0.550937][ T1] ACPI: Added _OSI(Processor Aggregator Device)
[ 0.550937][ T1] ACPI: Added _OSI(Linux-Dell-Video)
[ 0.550937][ T1] ACPI: Added _OSI(Linux-Lenovo-NV-HDMI-Audio)
[ 0.550937][ T1] ACPI: Added _OSI(Linux-HPI-Hybrid-Graphics)
[ 0.551826][ T1] ACPI: 1 ACPI AML tables successfully acquired and loaded
[ 0.579200][ T1] ACPI: Interpreter enabled
[ 0.579223][ T1] ACPI: PM: (supports S0 S3 S4 S5)
[ 0.579225][ T1] ACPI: Using IOAPIC for interrupt routing
[ 0.579249][ T1] PCI: Using host bridge windows from ACPI; if necessary, use "pci=nocrs" and report a bug
[ 0.579251][ T1] PCI: Using E820 reservations for host bridge windows
[ 0.579424][ T1] ACPI: Enabled 2 GPEs in block 00 to 0F
[ 0.583063][ T1] ACPI: PCI Root Bridge [PCI0] (domain 0000 [bus 00-ff])
[ 0.583073][ T1] acpi PNP0A03:00: _OSC: OS supports [ASPM ClockPM Segments MSI HPX-Type3]
[ 0.583076][ T1] acpi PNP0A03:00: _OSC: not requesting OS control; OS requires [ExtendedConfig ASPM ClockPM MSI]
[ 0.583089][ T1] acpi PNP0A03:00: fail to add MMCONFIG information, can't access extended PCI configuration space under this bridge.
[ 0.583494][ T1] acpiphp: Slot [3] registered
[ 0.583537][ T1] acpiphp: Slot [4] registered
[ 0.583573][ T1] acpiphp: Slot [5] registered
[ 0.583594][ T1] acpiphp: Slot [6] registered
[ 0.583624][ T1] acpiphp: Slot [7] registered
[ 0.583652][ T1] acpiphp: Slot [8] registered
[ 0.583674][ T1] acpiphp: Slot [9] registered
[ 0.583694][ T1] acpiphp: Slot [10] registered
[ 0.583713][ T1] acpiphp: Slot [11] registered
[ 0.583733][ T1] acpiphp: Slot [12] registered
[ 0.583753][ T1] acpiphp: Slot [13] registered
[ 0.583779][ T1] acpiphp: Slot [14] registered
[ 0.583801][ T1] acpiphp: Slot [15] registered
[ 0.583821][ T1] acpiphp: Slot [16] registered
[ 0.583848][ T1] acpiphp: Slot [17] registered
[ 0.583868][ T1] acpiphp: Slot [18] registered
[ 0.583888][ T1] acpiphp: Slot [19] registered
[ 0.583915][ T1] acpiphp: Slot [20] registered
[ 0.583941][ T1] acpiphp: Slot [21] registered
[ 0.583964][ T1] acpiphp: Slot [22] registered
[ 0.583984][ T1] acpiphp: Slot [23] registered
[ 0.584005][ T1] acpiphp: Slot [24] registered
[ 0.584026][ T1] acpiphp: Slot [25] registered
[ 0.584046][ T1] acpiphp: Slot [26] registered
[ 0.584073][ T1] acpiphp: Slot [27] registered
[ 0.584095][ T1] acpiphp: Slot [28] registered
[ 0.584116][ T1] acpiphp: Slot [29] registered
[ 0.584138][ T1] acpiphp: Slot [30] registered
[ 0.584160][ T1] acpiphp: Slot [31] registered
[ 0.584174][ T1] PCI host bridge to bus 0000:00
[ 0.584176][ T1] pci_bus 0000:00: root bus resource [io 0x0000-0x0cf7 window]
[ 0.584180][ T1] pci_bus 0000:00: root bus resource [io 0x0d00-0xffff window]
[ 0.584183][ T1] pci_bus 0000:00: root bus resource [mem 0x000a0000-0x000bffff window]
[ 0.584194][ T1] pci_bus 0000:00: root bus resource [mem 0xc0000000-0xfebfffff window]
[ 0.584197][ T1] pci_bus 0000:00: root bus resource [mem 0x440000000-0x4bfffffff window]
[ 0.584199][ T1] pci_bus 0000:00: root bus resource [bus 00-ff]
[ 0.584940][ T1] pci 0000:00:00.0: [8086:1237] type 00 class 0x060000
[ 0.585260][ T1] pci 0000:00:01.0: [8086:7000] type 00 class 0x060100
To reproduce:
# build kernel
cd linux
cp config-5.19.0-rc1-00007-gb8d7b1582e80 .config
make HOSTCC=gcc-11 CC=gcc-11 ARCH=x86_64 olddefconfig prepare modules_prepare bzImage modules
make HOSTCC=gcc-11 CC=gcc-11 ARCH=x86_64 INSTALL_MOD_PATH=<mod-install-dir> modules_install
cd <mod-install-dir>
find lib/ | cpio -o -H newc --quiet | gzip > modules.cgz
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp qemu -k <bzImage> -m modules.cgz job-script # job-script is attached in this email
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
--
0-DAY CI Kernel Test Service
https://01.org/lkp