2022-06-13 07:08:08

by kernel test robot

[permalink] [raw]
Subject: [md] ee3841b9fb: WARNING:at_drivers/md/md.c:#mddev_suspend



Greeting,

FYI, we noticed the following commit (built with gcc-11):

commit: ee3841b9fb6be02ae0bbbcf4d686af664ef47fee ("[PATCH 2/2] md: unlock mddev before reap sync_thread in action_store")
url: https://github.com/intel-lab-lkp/linux/commits/Guoqing-Jiang/md-regression-fix/20220607-100550
base: git://git.kernel.org/cgit/linux/kernel/git/song/md.git md-next
patch link: https://lore.kernel.org/linux-raid/[email protected]

in testcase: mdadm-selftests
version: mdadm-selftests-x86_64-5f41845-1_20220518
with following parameters:

disk: 1HDD
test_prefix: 07revert
ucode: 0x28



on test machine: 8 threads 1 sockets Intel(R) Core(TM) i7-4790T CPU @ 2.70GHz with 16G memory

caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):



If you fix the issue, kindly add following tag
Reported-by: kernel test robot <[email protected]>


kern :warn : [ 81.993106] WARNING: CPU: 4 PID: 955 at drivers/md/md.c:490 mddev_suspend (drivers/md/md.c:490)
kern :warn : [ 82.001513] Modules linked in: multipath loop raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx raid10 raid1 raid0 netconsole btrfs blake2b_generic xor ipmi_devintf ipmi_msghandler raid6_pq zstd_compress libcrc32c sd_mod t10_pi intel_rapl_msr intel_rapl_common crc64_rocksoft_generic crc64_rocksoft crc64 sg x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel i915 kvm irqbypass crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel intel_gtt drm_buddy ahci drm_dp_helper rapl libahci intel_cstate ttm mei_me intel_uncore mxm_wmi libata drm_kms_helper mei syscopyarea sysfillrect sysimgblt fb_sys_fops wmi video acpi_pad drm fuse ip_tables
kern :warn : [ 82.061054] CPU: 4 PID: 955 Comm: md0_raid5 Not tainted 5.18.0-rc3-00142-gee3841b9fb6b #1
kern :warn : [ 82.069997] Hardware name: Gigabyte Technology Co., Ltd. Z97X-UD5H/Z97X-UD5H, BIOS F9 04/21/2015
kern :warn : [ 82.079509] RIP: 0010:mddev_suspend (drivers/md/md.c:490)
kern :warn : [ 82.084596] Code: 0f b6 04 02 84 c0 74 04 3c 03 7e 22 8b 45 38 8d 50 01 89 55 38 85 c0 74 05 5b 5d 41 5c c3 5b 48 89 ef 5d 41 5c e9 1d fb ff ff <0f> 0b eb bd e8 94 ac ce fe eb d7 e8 cd ac ce fe e9 76 ff ff ff e8
All code
========
0: 0f b6 04 02 movzbl (%rdx,%rax,1),%eax
4: 84 c0 test %al,%al
6: 74 04 je 0xc
8: 3c 03 cmp $0x3,%al
a: 7e 22 jle 0x2e
c: 8b 45 38 mov 0x38(%rbp),%eax
f: 8d 50 01 lea 0x1(%rax),%edx
12: 89 55 38 mov %edx,0x38(%rbp)
15: 85 c0 test %eax,%eax
17: 74 05 je 0x1e
19: 5b pop %rbx
1a: 5d pop %rbp
1b: 41 5c pop %r12
1d: c3 retq
1e: 5b pop %rbx
1f: 48 89 ef mov %rbp,%rdi
22: 5d pop %rbp
23: 41 5c pop %r12
25: e9 1d fb ff ff jmpq 0xfffffffffffffb47
2a:* 0f 0b ud2 <-- trapping instruction
2c: eb bd jmp 0xffffffffffffffeb
2e: e8 94 ac ce fe callq 0xfffffffffeceacc7
33: eb d7 jmp 0xc
35: e8 cd ac ce fe callq 0xfffffffffecead07
3a: e9 76 ff ff ff jmpq 0xffffffffffffffb5
3f: e8 .byte 0xe8

Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: eb bd jmp 0xffffffffffffffc1
4: e8 94 ac ce fe callq 0xfffffffffeceac9d
9: eb d7 jmp 0xffffffffffffffe2
b: e8 cd ac ce fe callq 0xfffffffffeceacdd
10: e9 76 ff ff ff jmpq 0xffffffffffffff8b
15: e8 .byte 0xe8
kern :warn : [ 82.104122] RSP: 0018:ffffc9000158fb90 EFLAGS: 00010246
kern :warn : [ 82.110084] RAX: dffffc0000000000 RBX: ffff88812b8e1680 RCX: 1ffff11082e88c0c
kern :warn : [ 82.117984] RDX: 1ffff1102571c2d6 RSI: 0000000000000001 RDI: ffff88812b8e16b0
kern :warn : [ 82.125857] RBP: ffff888416476000 R08: 0000000000000000 R09: ffff888410091cbf
kern :warn : [ 82.133727] R10: ffffed1082012397 R11: 0000000000000001 R12: ffff888101218000
kern :warn : [ 82.141596] R13: 0000000000000400 R14: 0000000000000400 R15: ffff888416476000
kern :warn : [ 82.149471] FS: 0000000000000000(0000) GS:ffff8883a8a00000(0000) knlGS:0000000000000000
kern :warn : [ 82.158297] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
kern :warn : [ 82.164804] CR2: 00007fbeb561e017 CR3: 000000041de0e004 CR4: 00000000001706e0
kern :warn : [ 82.172702] Call Trace:
kern :warn : [ 82.175897] <TASK>
kern :warn : [ 82.178723] check_reshape (drivers/md/raid5.c:2439 drivers/md/raid5.c:8190) raid456
kern :warn : [ 82.184122] ? _raw_spin_lock (arch/x86/include/asm/atomic.h:202 include/linux/atomic/atomic-instrumented.h:543 include/asm-generic/qspinlock.h:82 include/linux/spinlock.h:185 include/linux/spinlock_api_smp.h:134 kernel/locking/spinlock.c:154)
kern :warn : [ 82.188803] md_check_recovery (drivers/md/md.c:9377 (discriminator 1))
kern :warn : [ 82.193733] raid5d (drivers/md/raid5.c:6512) raid456
kern :warn : [ 82.198426] ? raid5_do_work (drivers/md/raid5.c:6502) raid456
kern :warn : [ 82.203951] ? _raw_spin_lock_irqsave (arch/x86/include/asm/atomic.h:202 include/linux/atomic/atomic-instrumented.h:543 include/asm-generic/qspinlock.h:82 include/linux/spinlock.h:185 include/linux/spinlock_api_smp.h:111 kernel/locking/spinlock.c:162)
kern :warn : [ 82.209333] ? _raw_read_unlock_irqrestore (kernel/locking/spinlock.c:161)
kern :warn : [ 82.215036] ? finish_wait (include/linux/list.h:134 include/linux/list.h:206 kernel/sched/wait.c:407)
kern :warn : [ 82.219447] md_thread (drivers/md/md.c:7886)
kern :warn : [ 82.223590] ? super_90_load (drivers/md/md.c:1745)
kern :warn : [ 82.228859] ? _raw_read_unlock_irqrestore (kernel/locking/spinlock.c:161)
kern :warn : [ 82.234561] ? prepare_to_swait_exclusive (kernel/sched/wait.c:414)
kern :warn : [ 82.240360] ? __kthread_parkme (arch/x86/include/asm/bitops.h:207 (discriminator 4) include/asm-generic/bitops/instrumented-non-atomic.h:135 (discriminator 4) kernel/kthread.c:270 (discriminator 4))
kern :warn : [ 82.245207] ? schedule (arch/x86/include/asm/bitops.h:207 (discriminator 1) include/asm-generic/bitops/instrumented-non-atomic.h:135 (discriminator 1) include/linux/thread_info.h:118 (discriminator 1) include/linux/sched.h:2153 (discriminator 1) kernel/sched/core.c:6462 (discriminator 1))
kern :warn : [ 82.249348] ? super_90_load (drivers/md/md.c:1745)
kern :warn : [ 82.254644] kthread (kernel/kthread.c:376)
kern :warn : [ 82.258605] ? kthread_complete_and_exit (kernel/kthread.c:331)
kern :warn : [ 82.264127] ret_from_fork (arch/x86/entry/entry_64.S:304)
kern :warn : [ 82.268450] </TASK>
kern :warn : [ 82.271376] ---[ end trace 0000000000000000 ]---
user :notice: [ 90.067442] /lkp/benchmarks/mdadm-selftests/tests/07revert-grow... IPMI BMC is not supported on this machine, skip bmc-watchdog setup!

user :err : [ 968.304470] Terminated



To reproduce:

git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file

# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.



--
0-DAY CI Kernel Test Service
https://01.org/lkp



Attachments:
(No filename) (7.72 kB)
config-5.18.0-rc3-00142-gee3841b9fb6b (168.49 kB)
job-script (5.81 kB)
kmsg.xz (22.71 kB)
mdadm-selftests (1.11 kB)
job.yaml (4.71 kB)
reproduce (100.00 B)
Download all attachments