by Chen, Rong A

[permalink] [raw]

Subject: [LKP] [async] 06f4f5bfb3: BUG:sleeping_function_called_from_invalid_context_at_include/linux/percpu-rwsem.h

FYI, we noticed the following commit (built with gcc-7):

commit: 06f4f5bfb3404db7b4c45b0e4757b1e9a76cdd9a ("[PATCH v4 4/5] async: Add support for queueing on specific node")
url: https://github.com/0day-ci/linux/commits/Alexander-Duyck/Address-issues-slowing-persistent-memory-initialization/20180921-225440

in testcase: fio-basic
with following parameters:

runtime: 300s
disk: 1HDD
fs: xfs
nr_task: 1
test_size: 128G
rw: write
bs: 4k
ioengine: sync
ucode: 0x42d
cpufreq_governor: performance
fs2: nfsv4

test-description: Fio is a tool that will spawn a number of threads or processes doing a particular type of I/O action as specified by the user.
test-url: https://github.com/axboe/fio

on test machine: 48 threads Intel(R) Xeon(R) CPU E5-2697 v2 @ 2.70GHz with 64G memory

caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):

+-----------------------------------------------------------------------------------+------------+------------+
| | 0f537b5505 | 06f4f5bfb3 |
+-----------------------------------------------------------------------------------+------------+------------+
| boot_successes | 3 | 4 |
| boot_failures | 11 | 9 |
| WARNING:at#for_ip_interrupt_entry/0x | 8 | 5 |
| WARNING:stack_recursion | 7 | 5 |
| WARNING:at#for_ip_swapgs_restore_regs_and_return_to_usermode/0x | 3 | 3 |
| BUG:sleeping_function_called_from_invalid_context_at_include/linux/percpu-rwsem.h | 0 | 9 |
+-----------------------------------------------------------------------------------+------------+------------+

[ 16.233052] BUG: sleeping function called from invalid context at include/linux/percpu-rwsem.h:34
[ 16.245303] in_atomic(): 1, irqs_disabled(): 1, pid: 555, name: scsi_eh_0
[ 16.245306] CPU: 1 PID: 555 Comm: scsi_eh_0 Not tainted 4.19.0-rc4-00184-g06f4f5b #1
Startin
[ 16.245309] Hardware name: Intel Corporation S2600WP/S2600WP, BIOS SE5C600.86B.02.02.0002.122320131210 12/23/2013
g OpenBSD Secure
[ 16.275455] Call Trace:
Shell server...
[ 16.279747] dump_stack+0x5c/0x7b

[ 16.284992] ___might_sleep+0xf1/0x110
[ 16.289548] cpus_read_lock+0x18/0x50
[ 16.289552] __async_schedule+0x163/0x210
0m] Reached targ
[ 16.289566] ? scsi_try_target_reset+0x90/0x90
et System Time S
[ 16.289569] ? sas_scsi_recover_host+0x2b9/0x390 [libsas]
ynchronized.
[ 16.289571] sas_scsi_recover_host+0x2b9/0x390 [libsas]
[ 16.289576] ? scsi_error_handler+0x3b/0x620
[ 16.333060] ? scsi_error_handler+0x9a/0x620
[ 16.333064] ? scsi_try_target_reset+0x90/0x90
0m] Started Dail
[ 16.349819] ? __wake_up_common+0x76/0x170
y apt download a
[ 16.355945] ? scsi_eh_get_sense+0x240/0x240
ctivities.
[ 16.362271] kthread+0x11e/0x140
[ 16.367041] ? kthread_associate_blkcg+0xb0/0xb0
[ 16.367048] ret_from_fork+0x35/0x40
0m] Started Daily apt upgrade and clean activities.
Starting LSB: Load kernel image with kexec...
[ 16.547582] ata7.00: ATA-8: WDC WD1003FBYZ-010FB0, 01.01V03, max UDMA/133
[ 16.555799] ata7.00: 1953525168 sectors, multi 0: LBA48 NCQ (depth 32)
[ 16.566308] ata7.00: configured for UDMA/133
[ 16.571765] sas: --- Exit sas_scsi_recover_host: busy: 0 failed: 0 tries: 1
[ 16.582543] scsi 0:0:0:0: Direct-Access ATA WDC WD1003FBYZ-0 1V03 PQ: 0 ANSI: 5
[ 16.592293] sas: DONE DISCOVERY on port 0, pid:698, result:0
[ 16.602838] scsi 0:0:0:0: Attached scsi generic sg0 type 0
[ 16.611425] sd 0:0:0:0: [sda] 1953525168 512-byte logical blocks: (1.00 TB/932 GiB)
[ 16.620677] sd 0:0:0:0: [sda] Write Protect is off
[ 16.626659] sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
[ 16.632977] sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[ 16.692308] sda: sda1 sda2 sda3
[ 16.697284] sd 0:0:0:0: [sda] Attached SCSI disk
[ 17.073992] raid6: sse2x1 gen() 7636 MB/s
[ 17.095989] raid6: sse2x1 xor() 5824 MB/s
[ 17.117991] raid6: sse2x2 gen() 9554 MB/s
[ 17.139990] raid6: sse2x2 xor() 6521 MB/s
[ 17.161987] raid6: sse2x4 gen() 11550 MB/s
[ 17.183991] raid6: sse2x4 xor() 7769 MB/s
[ 17.189383] raid6: using algorithm sse2x4 gen() 11550 MB/s
[ 17.196124] raid6: .... xor() 7769 MB/s, rmw enabled
[ 17.202273] raid6: using ssse3x2 recovery algorithm
[ 17.212489] xor: automatically using best checksumming function avx
[ 17.247296] Btrfs loaded, crc32c=crc32c-generic
[ 17.254019] BTRFS: device fsid 83e57bc1-35de-4d61-8929-dc8aa3d711c2 devid 1 transid 4454 /dev/sda3
[ 20.961795] Kernel tests: Boot OK!
[ 20.961799]
[ 22.781499] BTRFS info (device sda3): disk space caching is enabled
[ 22.788537] BTRFS info (device sda3): has skinny extents
[ 22.878137] netpoll: netconsole: local port 6665
[ 22.883329] netpoll: netconsole: local IPv4 address 0.0.0.0
[ 22.889559] netpoll: netconsole: interface 'eth0'
[ 22.894835] netpoll: netconsole: remote port 6644
[ 22.900116] netpoll: netconsole: remote IPv4 address 192.168.2.1
[ 22.906837] netpoll: netconsole: remote ethernet address ff:ff:ff:ff:ff:ff
[ 22.914544] netpoll: netconsole: local IP 192.168.2.17
[ 22.920389] console [netcon0] enabled
[ 22.924517] netconsole: network logging started
[ 24.381587] install debs round one: dpkg -i --force-confdef --force-depends /opt/deb/sysstat_11.4.3-2_amd64.deb
[ 24.381593]
[ 24.395562] /opt/deb/gawk_1%3a4.1.4+dfsg-1_amd64.deb
[ 24.395563]
[ 24.403952] Selecting previously unselected package sysstat.
[ 24.403954]
[ 24.413629] (Reading database ... 16106 files and directories currently installed.)
[ 24.413630]
[ 24.425242] Preparing to unpack .../deb/sysstat_11.4.3-2_amd64.deb ...
[ 24.425244]
[ 24.434957] Unpacking sysstat (11.4.3-2) ...
[ 24.434959]
[ 24.442519] Selecting previously unselected package gawk.
[ 24.442520]
[ 24.451624] Preparing to unpack .../gawk_1%3a4.1.4+dfsg-1_amd64.deb ...
[ 24.451625]
[ 24.461553] Unpacking gawk (1:4.1.4+dfsg-1) ...
[ 24.461554]
[ 24.469073] Setting up sysstat (11.4.3-2) ...
[ 24.469075]
[ 24.476496] Setting up gawk (1:4.1.4+dfsg-1) ...
[ 24.476497]
[ 24.484539] Processing triggers for systemd (232-25+deb9u2) ...
[ 24.484540]
[ 24.494683] 23 Sep 03:19:18 ntpdate[865]: step time server 192.168.1.1 offset 18.120275 sec
[ 24.494684]
[ 24.506357] /lkp/lkp/src/bin/run-lkp
[ 24.506358]
[ 24.597525] device-mapper: uevent: version 1.0.3
[ 24.602963] device-mapper: ioctl: 4.39.0-ioctl (2018-04-03) initialised: [email protected]
[ 24.615614] random: crng init done

To reproduce:

git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp install job.yaml # job file is attached in this email
bin/lkp run job.yaml

Thanks,
Rong Chen

Attachments:

(No filename) (7.20 kB)
config-4.19.0-rc4-00184-g06f4f5b (170.48 kB)
job-script (7.74 kB)
dmesg.xz (24.58 kB)
job.yaml (5.32 kB)
reproduce (1.02 kB)
Download all attachments