2022-07-13 02:27:33

by kernel test robot

[permalink] [raw]
Subject: [cpu/hotplug] c7dfb2591b: WARNING:at_kernel/rcu/rcutorture.c:#rcu_torture_fwd_prog_cr[rcutorture]


(please be noted from message in parent dmesg attached, we even doubt this is
a fix, instead of a regression. so just FYI and hope to get some guidance from
you. Thanks a lot!)


Greeting,

FYI, we noticed the following commit (built with gcc-11):

commit: c7dfb2591b23856182e2c9da6b1d526ef7fd6b2e ("cpu/hotplug: Allow the CPU in CPU_UP_PREPARE state to be brought up again.")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

in testcase: rcutorture
version:
with following parameters:

runtime: 300s
test: cpuhotplug
torture_type: tasks-tracing

test-description: rcutorture is rcutorture kernel module load/unload test.
test-url: https://www.kernel.org/doc/Documentation/RCU/torture.txt


on test machine: qemu-system-x86_64 -enable-kvm -cpu SandyBridge -smp 2 -m 16G

caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):


e8a69f12f01f487c c7dfb2591b23856182e2c9da6b1
---------------- ---------------------------
fail:runs %reproduction fail:runs
| | |
:20 100% 20:20 dmesg.RIP:rcu_torture_fwd_prog_cr[rcutorture]
:20 100% 20:20 dmesg.WARNING:at_kernel/rcu/rcutorture.c:#rcu_torture_fwd_prog_cr[rcutorture]



If you fix the issue, kindly add following tag
Reported-by: kernel test robot <[email protected]>


however, in parent dmesg (attached), we noticed below:
(parent kernel is built with same config as attached)

739 [ 63.916107][ T456] smpboot: CPU 0 is now offline
740 [ 64.001999][ T456] smpboot: Booting Node 0 Processor 0 APIC 0x0
741 [ 70.096845][ T466] tasks-tracing-torture: rcu_torture_read_exit: Start of episode
742 [ 70.280275][ T466] tasks-tracing-torture: rcu_torture_read_exit: End of episode
743 [ 74.001241][ T456] smpboot: do_boot_cpu failed(-1) to wakeup CPU#0
744 [ 74.010849][ T456] tasks-tracing-torture:torture_onoff task: online 0 failed: errno -5
745 [ 74.028550][ T456] tasks-tracing-torture:torture_onoff task: online 0 failed: errno -5
746 [ 74.055381][ T456] tasks-tracing-torture:torture_onoff task: online 0 failed: errno -5
...
15842 [ 345.202204][ T293]
15843 [ 135.197493] tasks-tracing-torture:torture_onoff task: online 0 failed: errno -5
15844 [ 345.202210][ T293]
15845 [ 345.206162][ T293]
15846 [ 135.266699] tasks-tracing-torture:torture_onoff task: online 0 failed: errno -5
15847 [ 345.206168][ T293]


not sure if this is an issue which fixed by this commit?

and seems below in dmesg from this commit should not be caused by this commit?

[ 99.712428][ T460] ------------[ cut here ]------------
[ 99.712942][ T460] WARNING: CPU: 1 PID: 460 at kernel/rcu/rcutorture.c:2427 rcu_torture_fwd_prog_cr+0x27d/0x30a [rcutorture]
[ 99.713912][ T460] Modules linked in: rcutorture torture mousedev ghash_clmulni_intel bochs drm_vram_helper drm_ttm_helper ttm input_leds psmouse drm_kms_helper parport_pc i2c_piix4 parport evbug fuse drm drm_panel_orientation_quirks
[ 99.715729][ T460] CPU: 1 PID: 460 Comm: rcu_torture_fwd Not tainted 5.18.0-rc2-00002-gc7dfb2591b23 #1
[ 99.716539][ T460] RIP: 0010:rcu_torture_fwd_prog_cr+0x27d/0x30a [rcutorture]
[ 99.717158][ T460] Code: 7c cb ff ff e8 77 6f f9 ff 84 c0 0f 85 86 00 00 00 8a 05 23 55 00 00 84 c0 75 7c e8 ad e9 ff ff 84 c0 75 73 49 83 fd 63 7f 02 <0f> 0b ff 75 d0 48 8b 55 c8 4d 8d 04 1e 4d 89 f1 49 2b 54 24 60 ff
[ 99.718819][ T460] RSP: 0018:ffff888130107e78 EFLAGS: 00010293
[ 99.719332][ T460] RAX: 0000000000000000 RBX: 000000000000c350 RCX: 0000000000001bb3
[ 99.719999][ T460] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffffff81e1b44d
[ 99.720666][ T460] RBP: ffff888130107ec0 R08: 0000000000000001 R09: 0000000000000000
[ 99.721346][ T460] R10: 0000000000000000 R11: ffff888132822018 R12: ffff888132822000
[ 99.722017][ T460] R13: 0000000000000000 R14: 0000000000000ed1 R15: 0000000000000000
[ 99.722689][ T460] FS: 0000000000000000(0000) GS:ffff88843fc00000(0000) knlGS:0000000000000000
[ 99.723442][ T460] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 99.723998][ T460] CR2: 00007f6c71d68000 CR3: 0000000100376000 CR4: 00000000000406a0
[ 99.724698][ T460] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 99.725374][ T460] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 99.726046][ T460] Call Trace:
[ 99.726325][ T460] <TASK>
[ 99.726576][ T460] rcu_torture_fwd_prog.cold+0x42/0x10b [rcutorture]
[ 99.727143][ T460] ? rcu_torture_fwd_prog_cbfree+0xc0/0xc0 [rcutorture]
[ 99.727741][ T460] kthread+0xe3/0x100
[ 99.728092][ T460] ? kthread_complete_and_exit+0x40/0x40
[ 99.728571][ T460] ret_from_fork+0x1f/0x30
[ 99.728943][ T460] </TASK>
[ 99.729198][ T460] irq event stamp: 409161
[ 99.729562][ T460] hardirqs last enabled at (409169): [<ffffffff8118c52f>] __up_console_sem+0x6f/0x80
[ 99.730367][ T460] hardirqs last disabled at (409176): [<ffffffff8118c514>] __up_console_sem+0x54/0x80
[ 99.731188][ T460] softirqs last enabled at (409190): [<ffffffff82200308>] __do_softirq+0x308/0x4b6
[ 99.731980][ T460] softirqs last disabled at (409185): [<ffffffff810f7408>] __irq_exit_rcu+0x88/0xc0
[ 99.732774][ T460] ---[ end trace 0000000000000000 ]---



To reproduce:

# build kernel
cd linux
cp config-5.18.0-rc2-00002-gc7dfb2591b23 .config
make HOSTCC=gcc-11 CC=gcc-11 ARCH=x86_64 olddefconfig prepare modules_prepare bzImage modules
make HOSTCC=gcc-11 CC=gcc-11 ARCH=x86_64 INSTALL_MOD_PATH=<mod-install-dir> modules_install
cd <mod-install-dir>
find lib/ | cpio -o -H newc --quiet | gzip > modules.cgz


git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp qemu -k <bzImage> -m modules.cgz job-script # job-script is attached in this email

# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.



--
0-DAY CI Kernel Test Service
https://01.org/lkp



Attachments:
(No filename) (6.06 kB)
config-5.18.0-rc2-00002-gc7dfb2591b23 (150.17 kB)
job-script (4.97 kB)
dmesg.xz (19.30 kB)
rcutorture (13.18 kB)
dmesg-parent.xz (67.15 kB)
Download all attachments