2024-04-16 01:51:29

by Oliver Sang

Subject: [bristot:dl_server_v6_try11] [sched/fair] 1abba9e7f4: RIP:__dl_server_attach_root



Hello,

kernel test robot noticed "RIP:__dl_server_attach_root" on:

commit: 1abba9e7f47ad4a5dfd8b2dfb59aa607983cdce4 ("sched/fair: Fair server interface")
git://git.kernel.org/cgit/linux/kernel/git/bristot/linux dl_server_v6_try11

in testcase: kernel-selftests
version: kernel-selftests-x86_64-7a6d30c9-1_20240318
with the following parameters:

group: cgroup



compiler: gcc-13
test machine: 36 threads, 1 socket, Intel(R) Core(TM) i9-10980XE CPU @ 3.00GHz (Cascade Lake) with 32G memory

(please refer to attached dmesg/kmsg for entire log/backtrace)



If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add the following tags
| Reported-by: kernel test robot <[email protected]>
| Closes: https://lore.kernel.org/oe-lkp/[email protected]


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20240416/[email protected]


[ 387.680270][ T212] divide error: 0000 [#1] PREEMPT SMP KASAN NOPTI
[ 387.680905][ T212] CPU: 0 PID: 212 Comm: kworker/2:1 Not tainted 6.9.0-rc2-00003-g1abba9e7f47a #1
[ 387.681752][ T212] Hardware name: Gigabyte Technology Co., Ltd. X299 UD4 Pro/X299 UD4 Pro-CF, BIOS F8a 04/27/2021
[ 387.682702][ T212] Workqueue: events cpuset_hotplug_workfn
[ 387.683256][ T212] RIP: 0010:__dl_server_attach_root+0x143/0x430
[ 387.683848][ T212] Code: ea 03 80 3c 02 00 0f 85 d9 02 00 00 4c 01 a5 88 00 00 00 44 89 e0 41 81 fc 00 00 00 80 75 0a 41 83 fd ff 0f 84 1c 0e a7 02 99 <41> f7 fd f7 d8 89 04 24 e8 90 cb aa 02 85 c0 0f 85 6e 01 00 00 49
[ 387.685550][ T212] RSP: 0018:ffffc9000110f988 EFLAGS: 00010887
[ 387.686124][ T212] RAX: 0000000000000000 RBX: 0000000000047100 RCX: dffffc0000000000
[ 387.686861][ T212] RDX: 0000000000000000 RSI: 1ffffffff0bc1724 RDI: ffffffff86ef2948
[ 387.687602][ T212] RBP: ffffffff86ef28c0 R08: ffff888100058638 R09: fffff52000221f23
[ 387.688345][ T212] R10: 0000000000000003 R11: 0000000000000001 R12: 0000000000000000
[ 387.689083][ T212] R13: 0000000000000000 R14: 0000000000000002 R15: ffff888805d47100
[ 387.689824][ T212] FS: 0000000000000000(0000) GS:ffff888805c00000(0000) knlGS:0000000000000000
[ 387.690646][ T212] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 387.691264][ T212] CR2: 0000000000451c00 CR3: 000000089ba7a002 CR4: 00000000003706f0
[ 387.692005][ T212] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 387.692744][ T212] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 387.693490][ T212] Call Trace:
[ 387.693842][ T212] <TASK>
[ 387.694161][ T212] ? die+0x36/0x90
[ 387.694551][ T212] ? do_trap+0x19e/0x240
[ 387.694977][ T212] ? __dl_server_attach_root+0x143/0x430
[ 387.695522][ T212] ? __dl_server_attach_root+0x143/0x430
[ 387.696065][ T212] ? do_error_trap+0xa3/0x170
[ 387.696526][ T212] ? __dl_server_attach_root+0x143/0x430
[ 387.697066][ T212] ? exc_divide_error+0x38/0x50
[ 387.697542][ T212] ? __dl_server_attach_root+0x143/0x430
[ 387.698081][ T212] ? asm_exc_divide_error+0x1a/0x20
[ 387.698592][ T212] ? __dl_server_attach_root+0x143/0x430
[ 387.699133][ T212] ? __dl_server_attach_root+0x100/0x430
[ 387.699675][ T212] ? sched_clock+0x10/0x30
[ 387.700119][ T212] ? lock_pin_lock+0x162/0x240
[ 387.700590][ T212] rq_attach_root+0x3c1/0x490
[ 387.701055][ T212] cpu_attach_domain+0x4c5/0x7e0
[ 387.701536][ T212] ? mark_held_locks+0x96/0xe0
[ 387.702009][ T212] partition_sched_domains_locked+0x356/0xa70
[ 387.702588][ T212] rebuild_sched_domains_locked+0x2e1/0x480
[ 387.703149][ T212] ? __pfx_rebuild_sched_domains_locked+0x10/0x10
[ 387.703754][ T212] ? trace_contention_end+0xf0/0x140
[ 387.704353][ T212] cpuset_hotplug_workfn+0x49b/0xe40
[ 387.704865][ T212] ? __pfx_lock_acquire+0x10/0x10
[ 387.705354][ T212] ? __pfx_cpuset_hotplug_workfn+0x10/0x10
[ 387.705908][ T212] ? lock_is_held_type+0x8f/0x100
[ 387.706404][ T212] process_one_work+0x804/0x1720
[ 387.706891][ T212] ? __pfx_lock_acquire+0x10/0x10
[ 387.707385][ T212] ? __pfx_process_one_work+0x10/0x10
[ 387.707903][ T212] ? assign_work+0x16c/0x240
[ 387.708364][ T212] worker_thread+0x724/0x1300
[ 387.708830][ T212] ? __pfx_worker_thread+0x10/0x10
[ 387.710258][ T212] kthread+0x2de/0x3c0
[ 387.710670][ T212] ? __pfx_kthread+0x10/0x10
[ 387.711122][ T212] ret_from_fork+0x31/0x70
[ 387.711565][ T212] ? __pfx_kthread+0x10/0x10
[ 387.712023][ T212] ret_from_fork_asm+0x1a/0x30
[ 387.712501][ T212] </TASK>
[ 387.712828][ T212] Modules linked in: netconsole openvswitch nf_conncount nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 intel_rapl_msr intel_rapl_common nfit libnvdimm x86_pkg_temp_thermal intel_powerclamp btrfs coretemp blake2b_generic xor zstd_compress kvm_intel raid6_pq libcrc32c kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel sha512_ssse3 nvme rapl nvme_core ahci intel_cstate t10_pi libahci ipmi_devintf mei_me ipmi_msghandler wmi_bmof mxm_wmi intel_wmi_thunderbolt intel_uncore crc64_rocksoft_generic wdat_wdt libata crc64_rocksoft i2c_i801 crc64 ioatdma mei i2c_smbus dca wmi binfmt_misc loop fuse drm dm_mod ip_tables
[ 387.717730][ T212] ---[ end trace 0000000000000000 ]---
[ 387.718259][ T212] RIP: 0010:__dl_server_attach_root+0x143/0x430
[ 387.718852][ T212] Code: ea 03 80 3c 02 00 0f 85 d9 02 00 00 4c 01 a5 88 00 00 00 44 89 e0 41 81 fc 00 00 00 80 75 0a 41 83 fd ff 0f 84 1c 0e a7 02 99 <41> f7 fd f7 d8 89 04 24 e8 90 cb aa 02 85 c0 0f 85 6e 01 00 00 49
[ 387.720553][ T212] RSP: 0018:ffffc9000110f988 EFLAGS: 00010887
[ 387.721130][ T212] RAX: 0000000000000000 RBX: 0000000000047100 RCX: dffffc0000000000
[ 387.721871][ T212] RDX: 0000000000000000 RSI: 1ffffffff0bc1724 RDI: ffffffff86ef2948
[ 387.722616][ T212] RBP: ffffffff86ef28c0 R08: ffff888100058638 R09: fffff52000221f23
[ 387.723359][ T212] R10: 0000000000000003 R11: 0000000000000001 R12: 0000000000000000
[ 387.724101][ T212] R13: 0000000000000000 R14: 0000000000000002 R15: ffff888805d47100
[ 387.724839][ T212] FS: 0000000000000000(0000) GS:ffff888805c00000(0000) knlGS:0000000000000000
[ 387.725661][ T212] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 387.726277][ T212] CR2: 0000000000451c00 CR3: 000000089ba7a002 CR4: 00000000003706f0
[ 387.727017][ T212] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 387.727755][ T212] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 387.728499][ T212] Kernel panic - not syncing: Fatal exception
[ 387.729153][ T212] Kernel Offset: disabled
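
For readers triaging the oops above, here is a minimal sketch of how the faulting instruction can be read off the "Code:" line. It is not part of the robot's tooling. The helper names (faulting_bytes, decode_f7_group, parse_registers) are hypothetical, and the decoder only handles the single register-direct F7 /6 and /7 encoding with a REX prefix. The kernel tree's scripts/decodecode is the proper tool for a full disassembly.

```python
import re

# "Code:" bytes copied from the oops; <41> marks the byte at the faulting RIP.
CODE_LINE = (
    "ea 03 80 3c 02 00 0f 85 d9 02 00 00 4c 01 a5 88 00 00 00 44 89 e0 "
    "41 81 fc 00 00 00 80 75 0a 41 83 fd ff 0f 84 1c 0e a7 02 99 "
    "<41> f7 fd f7 d8 89 04 24 e8 90 cb aa 02 85 c0 0f 85 6e 01 00 00 49"
)
# Register line copied from the same oops.
REG_LINE = "R13: 0000000000000000 R14: 0000000000000002 R15: ffff888805d47100"


def faulting_bytes(code_line, count=3):
    """Return `count` byte values starting at the <..> marked byte."""
    tokens = code_line.split()
    start = next(i for i, t in enumerate(tokens) if t.startswith("<"))
    return [int(t.strip("<>"), 16) for t in tokens[start:start + count]]


def decode_f7_group(insn):
    """Decode 'REX F7 /6|/7' with a register operand (mod == 3) only.

    Returns (mnemonic, register_number). Anything else raises ValueError,
    since this is an illustration and not a general x86 decoder.
    """
    rex, opcode, modrm = insn
    if not (0x40 <= rex <= 0x4F) or opcode != 0xF7 or (modrm >> 6) != 3:
        raise ValueError("not a REX-prefixed, register-direct F7 instruction")
    mnemonics = {6: "div", 7: "idiv"}
    ext = (modrm >> 3) & 7                  # ModRM.reg selects the F7 sub-opcode
    if ext not in mnemonics:
        raise ValueError("F7 sub-opcode is not a divide")
    reg = (modrm & 7) | ((rex & 1) << 3)    # REX.B extends ModRM.rm to r8..r15
    return mnemonics[ext], reg


def parse_registers(line):
    """Map register names to integer values from one oops register line."""
    return {name: int(value, 16)
            for name, value in re.findall(r"(\w+): ([0-9a-f]{16})", line)}


mnemonic, reg = decode_f7_group(faulting_bytes(CODE_LINE))
divisor = parse_registers(REG_LINE)["R%d" % reg]
print(mnemonic, "r%dd" % reg, "divisor =", divisor)
```

With the bytes and register values from this report, the sketch identifies a signed divide whose divisor register is R13, and R13 reads zero in the dump, which is consistent with the "divide error" trap. The log alone does not say which source-level variable R13 holds.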



--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki