2022-06-01 19:14:22

by kernel test robot

[permalink] [raw]
Subject: [dm] cfc97abcbe: canonical_address#:#[##]


(please be noted we reported
"[dm] 9196856f66: canonical_address#:#[##]"
on
https://lore.kernel.org/all/20220422131209.GB20828@xsang-OptiPlex-9020/
while this commit is on linux-next/master,
now we noticed similar issue still exists on mainline, FYI)


Greeting,

FYI, we noticed the following commit (built with gcc-11):

commit: cfc97abcbe0b1ffac441202fe330e697370ecbcc ("dm: conditionally enable BIOSET_PERCPU_CACHE for dm_io bioset")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

in testcase: xfstests
version: xfstests-x86_64-48c5dbb-1_20220523
with following parameters:

disk: 4HDD
fs: xfs
test: xfs-group-00
ucode: 0x21

test-description: xfstests is a regression test suite for xfs and other files ystems.
test-url: git://git.kernel.org/pub/scm/fs/xfs/xfstests-dev.git


on test machine: 4 threads 1 sockets Intel(R) Core(TM) i3-3220 CPU @ 3.30GHz with 8G memory

caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):



If you fix the issue, kindly add following tag
Reported-by: kernel test robot <[email protected]>


[ 474.513126][T25342] XFS (dm-0): Mounting V5 Filesystem
[ 474.634559][T25342] XFS (dm-0): Starting recovery (logdev: internal)
[ 474.688736][T25342] XFS (dm-0): Ending recovery (logdev: internal)
[ 474.697144][T25342] xfs filesystem being mounted at /fs/scratch supports timestamps until 2038 (0x7fffffff)
[ 474.713352][T25351] XFS (dm-0): Unmounting Filesystem
[ 474.816617][T25356] general protection fault, probably for non-canonical address 0xfbd59c0000000024: 0000 [#1] SMP KASAN PTI
[ 474.828029][T25356] KASAN: maybe wild-memory-access in range [0xdead000000000120-0xdead000000000127]
[ 474.837308][T25356] CPU: 3 PID: 25356 Comm: dmsetup Not tainted 5.18.0-rc3-00069-gcfc97abcbe0b #1
[ 474.846321][T25356] Hardware name: Hewlett-Packard HP Pro 3340 MT/17A1, BIOS 8.07 01/24/2013
[474.854908][T25356] RIP: 0010:__cpuhp_state_remove_instance (kbuild/src/consumer/include/linux/list.h:848 kbuild/src/consumer/include/linux/list.h:862 kbuild/src/consumer/kernel/cpu.c:2140)
[ 474.861760][T25356] Code: 00 fc ff df 48 89 fa 48 c1 ea 03 80 3c 02 00 0f 85 7b 01 00 00 48 b8 00 00 00 00 00 fc ff df 4c 8b 65 08 4c 89 e2 48 c1 ea 03 <80> 3c 02 00 0f 85 4f 01 00 00 49 89 1c 24 48 85 db 74 23 48 b8 00
All code
========
0: 00 fc add %bh,%ah
2: ff (bad)
3: df 48 89 fisttps -0x77(%rax)
6: fa cli
7: 48 c1 ea 03 shr $0x3,%rdx
b: 80 3c 02 00 cmpb $0x0,(%rdx,%rax,1)
f: 0f 85 7b 01 00 00 jne 0x190
15: 48 b8 00 00 00 00 00 movabs $0xdffffc0000000000,%rax
1c: fc ff df
1f: 4c 8b 65 08 mov 0x8(%rbp),%r12
23: 4c 89 e2 mov %r12,%rdx
26: 48 c1 ea 03 shr $0x3,%rdx
2a:* 80 3c 02 00 cmpb $0x0,(%rdx,%rax,1) <-- trapping instruction
2e: 0f 85 4f 01 00 00 jne 0x183
34: 49 89 1c 24 mov %rbx,(%r12)
38: 48 85 db test %rbx,%rbx
3b: 74 23 je 0x60
3d: 48 rex.W
3e: b8 .byte 0xb8
...

Code starting with the faulting instruction
===========================================
0: 80 3c 02 00 cmpb $0x0,(%rdx,%rax,1)
4: 0f 85 4f 01 00 00 jne 0x159
a: 49 89 1c 24 mov %rbx,(%r12)
e: 48 85 db test %rbx,%rbx
11: 74 23 je 0x36
13: 48 rex.W
14: b8 .byte 0xb8
...
[ 474.881445][T25356] RSP: 0018:ffffc9000c35fb40 EFLAGS: 00010a06
[ 474.887484][T25356] RAX: dffffc0000000000 RBX: dead000000000100 RCX: ffffffff835c411f
[ 474.895415][T25356] RDX: 1bd5a00000000024 RSI: 0000000000000008 RDI: ffff888206f40380
[ 474.903389][T25356] RBP: ffff888206f40378 R08: 0000000000000001 R09: ffffc9000c35fae7
[ 474.911334][T25356] R10: fffff5200186bf5c R11: 0000000000000001 R12: dead000000000122
[ 474.919273][T25356] R13: 0000000000000000 R14: ffff888206f40200 R15: dffffc0000000000
[ 474.927206][T25356] FS: 00007fc9b9a38880(0000) GS:ffff8881aa980000(0000) knlGS:0000000000000000
[ 474.936100][T25356] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 474.942824][T25356] CR2: 00007fc9b9ce1649 CR3: 000000017c194005 CR4: 00000000001706e0
[ 474.950794][T25356] Call Trace:
[ 474.953969][T25356] <TASK>
[474.956789][T25356] ? kfree (kbuild/src/consumer/mm/slub.c:1754 kbuild/src/consumer/mm/slub.c:3510 kbuild/src/consumer/mm/slub.c:4552)
[474.960691][T25356] bioset_exit (kbuild/src/consumer/block/bio.c:718 kbuild/src/consumer/block/bio.c:1665)
[474.964938][T25356] cleanup_mapped_device (kbuild/src/consumer/drivers/md/dm.c:1776) dm_mod
[474.970865][T25356] __dm_destroy (kbuild/src/consumer/drivers/md/dm.c:809 kbuild/src/consumer/drivers/md/dm.c:1938 kbuild/src/consumer/drivers/md/dm.c:2291) dm_mod


To reproduce:

git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file

# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.



--
0-DAY CI Kernel Test Service
https://01.org/lkp



Attachments:
(No filename) (5.40 kB)
config-5.18.0-rc3-00069-gcfc97abcbe0b (168.55 kB)
job-script (5.89 kB)
dmesg.xz (5.70 kB)
xfstests (19.01 kB)
job.yaml (4.84 kB)
Download all attachments