2022-05-23 09:04:36

by Naresh Kamboju

[permalink] [raw]
Subject: WARNING: CPU: 2 PID: 3331 at drivers/firmware/raspberrypi.c:63 rpi_firmware_property_list

The following kernel warning was noticed on raspberrypi4 device while
running LTP controllers with Linus mainline kernel tree.

Reported-by: Linux Kernel Functional Testing <[email protected]>

metadata:
git_ref: master
git_repo: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git
git_sha: 4b0986a3613c92f4ec1bdc7f60ec66fea135991f
git_describe: v5.18
kernel_version: 5.18.0
kernel-config: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu/config
artifact-location: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu
toolchain: gcc-11
System.map: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu/System.map
vmlinux.xz: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu/vmlinux.xz

Steps to reproduce:
# cd /opt/ltp
# ./runltp -p -q -f controllers

cgroup_fj_stress_blkio_1_200_one 1 TINFO: Attaching task 7221 to
/sys/fs/cgroup/blkio/ltp/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1
cgroup_fj_stress_blkio_1_200_one 1 TINFO: Attaching task 7221 to
/sys/fs/cgroup/blkio/ltp/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1
cgroup_fj_stress_blkio_1_200_one 1 TINFO: Attaching task 7221 to
/sys/fs/cgroup/blkio/ltp/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1
[ 2442.255887] ------------[ cut here ]------------
[ 2442.260615] Firmware transaction timeout
[ 2442.260704] WARNING: CPU: 2 PID: 3331 at
drivers/firmware/raspberrypi.c:63
rpi_firmware_property_list+0x1e8/0x270
[ 2442.275114] Modules linked in: btrfs blake2b_generic libcrc32c
raid6_pq zstd_compress brcmfmac xhci_pci brcmutil snd_soc_hdmi_codec
xhci_pci_renesas raspberrypi_cpufreq hci_uart btqca btbcm vc4 cfg80211
bluetooth cec drm_cma_helper rfkill drm_kms_helper reset_raspberrypi
clk_raspberrypi crct10dif_ce raspberrypi_hwmon drm pwm_bcm2835
iproc_rng200 pcie_brcmstb i2c_bcm2835 rng_core bcm2711_thermal fuse
[ 2442.311148] CPU: 2 PID: 3331 Comm: kworker/2:2 Not tainted 5.18.0 #1
[ 2442.317599] Hardware name: Raspberry Pi 4 Model B (DT)
[ 2442.322810] Workqueue: events get_values_poll [raspberrypi_hwmon]
[ 2442.329007] pstate: 60000005 (nZCv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
[ 2442.336074] pc : rpi_firmware_property_list+0x1e8/0x270
[ 2442.341381] lr : rpi_firmware_property_list+0x1e8/0x270
[ 2442.346685] sp : ffff800010d63cc0
[ 2442.350043] x29: ffff800010d63cc0 x28: 0000000000000000 x27: ffff0000419d0540
[ 2442.357297] x26: 0000000000000011 x25: ffff80000afdd008 x24: 0000000000001000
[ 2442.364547] x23: ffff000061ce9300 x22: ffff80000ac330b0 x21: ffff0000419d0500
[ 2442.371797] x20: 0000000000000010 x19: ffff80000afdd000 x18: ffffffffffffffff
[ 2442.379046] x17: 0000000000000000 x16: 0000000000000000 x15: ffff800090d63937
[ 2442.386295] x14: ffff80000a912dd0 x13: 74756f656d697420 x12: 6e6f69746361736e
[ 2442.393544] x11: ffff80000a8b1ce8 x10: ffff80000a91c178 x9 : ffff80000811d524
[ 2442.400794] x8 : 00000000ffffefff x7 : ffff80000a909cf8 x6 : 0000000000000001
[ 2442.408044] x5 : ffff80000a88f000 x4 : ffff80000a88f2c0 x3 : 0000000000000000
[ 2442.415293] x2 : 0000000000000000 x1 : 0000000000000000 x0 : ffff000062921040
[ 2442.422543] Call trace:
[ 2442.425020] rpi_firmware_property_list+0x1e8/0x270
[ 2442.429973] rpi_firmware_property+0x7c/0xc0
[ 2442.434308] get_values_poll+0x54/0x120 [raspberrypi_hwmon]
[ 2442.439967] process_one_work+0x1dc/0x450
[ 2442.444045] worker_thread+0x154/0x450
[ 2442.447850] kthread+0x100/0x110
[ 2442.451126] ret_from_fork+0x10/0x20
[ 2442.454760] ---[ end trace 0000000000000000 ]---
[ 2442.471909] hwmon hwmon1: Failed to get throttled (-110)
[ 2443.499865] raspberrypi-clk soc:firmware:clocks: Failed to change
fw-clk-arm frequency: -110
[ 2446.575893] raspberrypi-clk soc:firmware:clocks: Failed to change
fw-clk-arm frequency: -110
[ 2450.915855] mmc1: Timeout waiting for hardware cmd interrupt.
[ 2450.921693] mmc1: sdhci: ============ SDHCI REGISTER DUMP ===========
[ 2450.928227] mmc1: sdhci: Sys addr: 0x00000008 | Version: 0x00001002
[ 2450.934763] mmc1: sdhci: Blk size: 0x00007200 | Blk cnt: 0x00000000
[ 2450.941297] mmc1: sdhci: Argument: 0xaaaa0000 | Trn mode: 0x00000033
[ 2450.947830] mmc1: sdhci: Present: 0x1fff0000 | Host ctl: 0x00000017
[ 2450.954362] mmc1: sdhci: Power: 0x0000000f | Blk gap: 0x00000080
[ 2450.960896] mmc1: sdhci: Wake-up: 0x00000000 | Clock: 0x00000107
[ 2450.967429] mmc1: sdhci: Timeout: 0x00000000 | Int stat: 0x00000001
[ 2450.973960] mmc1: sdhci: Int enab: 0x03ff100b | Sig enab: 0x03ff100b
[ 2450.980493] mmc1: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000001
[ 2450.987025] mmc1: sdhci: Caps: 0x45ee6432 | Caps_1: 0x0000a525
[ 2450.993558] mmc1: sdhci: Cmd: 0x00000d1a | Max curr: 0x00080008
[ 2451.000091] mmc1: sdhci: Resp[0]: 0x00000900 | Resp[1]: 0x0076b27f
[ 2451.006624] mmc1: sdhci: Resp[2]: 0x325b5900 | Resp[3]: 0x00000900
[ 2451.013155] mmc1: sdhci: Host ctl2: 0x0000808c
[ 2451.017659] mmc1: sdhci: ADMA Err: 0x00000000 | ADMA Ptr: 0xc3400208
[ 2451.024191] mmc1: sdhci: ============================================
[ 2452.679889] raspberrypi-clk soc:firmware:clocks: Failed to change
fw-clk-arm frequency: -110
[ 2455.763879] raspberrypi-clk soc:firmware:clocks: Failed to change
fw-clk-arm frequency: -110
[ 2460.759844] rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
[ 2460.766034] rcu: 1-...!: (1 GPs behind)
idle=add/1/0x4000000000000000 softirq=481373/481374 fqs=23
[ 2460.775311] (detected by 3, t=5253 jiffies, g=915057, q=822)
[ 2460.781141] Task dump for CPU 1:
[ 2460.784411] task:cgroup_fj_stres state:R running task stack:
0 pid: 6806 ppid: 501 flags:0x00000200
[ 2460.794483] Call trace:
[ 2460.796958] __switch_to+0x104/0x160
[ 2460.800595] 0xffff0000403cff00
[ 2460.803782] rcu: rcu_preempt kthread timer wakeup didn't happen for
5216 jiffies! g915057 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402
[ 2460.815430] rcu: Possible timer handling issue on cpu=0 timer-softirq=161506
[ 2460.822667] rcu: rcu_preempt kthread starved for 5222 jiffies!
g915057 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402 ->cpu=0
[ 2460.833345] rcu: Unless rcu_preempt kthread gets sufficient CPU
time, OOM is now expected behavior.
[ 2460.842606] rcu: RCU grace-period kthread stack dump:
[ 2460.847726] task:rcu_preempt state:I stack: 0 pid: 15
ppid: 2 flags:0x00000008
[ 2460.856203] Call trace:
[ 2460.858678] __switch_to+0x104/0x160
[ 2460.862305] __schedule+0x314/0x8ac
[ 2460.865843] schedule+0x5c/0xd4
[ 2460.869028] schedule_timeout+0xa4/0x1c4
[ 2460.873010] rcu_gp_fqs_loop+0x140/0x520
[ 2460.876998] rcu_gp_kthread+0x1a8/0x25c
[ 2460.880891] kthread+0x100/0x110
[ 2460.884167] ret_from_fork+0x10/0x20
[ 2460.887796] rcu: Stack dump where RCU GP kthread last ran:
[ 2460.893357] Task dump for CPU 0:
[ 2460.896624] task:swapper/0 state:R running task stack:
0 pid: 0 ppid: 0 flags:0x0000000a
[ 2460.906690] Call trace:
[ 2460.909165] __switch_to+0x104/0x160
[ 2460.912791] 0x0
[ 2461.159863] mmc1: Timeout waiting for hardware cmd interrupt.
[ 2461.165699] mmc1: sdhci: ============ SDHCI REGISTER DUMP ===========
[ 2461.172231] mmc1: sdhci: Sys addr: 0x00000008 | Version: 0x00001002
[ 2461.178763] mmc1: sdhci: Blk size: 0x00007200 | Blk cnt: 0x00000000
[ 2461.185294] mmc1: sdhci: Argument: 0xaaaa0000 | Trn mode: 0x00000033
[ 2461.191826] mmc1: sdhci: Present: 0x1fff0000 | Host ctl: 0x00000017
[ 2461.198358] mmc1: sdhci: Power: 0x0000000f | Blk gap: 0x00000080
[ 2461.204887] mmc1: sdhci: Wake-up: 0x00000000 | Clock: 0x00000107
[ 2461.211419] mmc1: sdhci: Timeout: 0x00000000 | Int stat: 0x00000001
[ 2461.217951] mmc1: sdhci: Int enab: 0x03ff100b | Sig enab: 0x03ff100b
[ 2461.224483] mmc1: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000001
[ 2461.231013] mmc1: sdhci: Caps: 0x45ee6432 | Caps_1: 0x0000a525
[ 2461.237543] mmc1: sdhci: Cmd: 0x00000d1a | Max curr: 0x00080008
[ 2461.244073] mmc1: sdhci: Resp[0]: 0x00000900 | Resp[1]: 0x0076b27f
[ 2461.250604] mmc1: sdhci: Resp[2]: 0x325b5900 | Resp[3]: 0x00000900
[ 2461.257135] mmc1: sdhci: Host ctl2: 0x0000808c
[ 2461.261639] mmc1: sdhci: ADMA Err: 0x00000000 | ADMA Ptr: 0xc3400208
[ 2461.268169] mmc1: sdhci: ============================================
[ 2462.019897] raspberrypi-clk soc:firmware:clocks: Failed to change
fw-clk-arm frequency: -110
[ 2471.403852] mmc1: Timeout waiting for hardware cmd interrupt.
[ 2471.409689] mmc1: sdhci: ============ SDHCI REGISTER DUMP ===========
[ 2471.416219] mmc1: sdhci: Sys addr: 0x00000008 | Version: 0x00001002
[ 2471.422751] mmc1: sdhci: Blk size: 0x00007200 | Blk cnt: 0x00000000
[ 2471.429282] mmc1: sdhci: Argument: 0xaaaa0000 | Trn mode: 0x00000033
[ 2471.435814] mmc1: sdhci: Present: 0x1fff0000 | Host ctl: 0x00000017
[ 2471.442345] mmc1: sdhci: Power: 0x0000000f | Blk gap: 0x00000080
[ 2471.448876] mmc1: sdhci: Wake-up: 0x00000000 | Clock: 0x00000107
[ 2471.455407] mmc1: sdhci: Timeout: 0x00000000 | Int stat: 0x00000001
[ 2471.461938] mmc1: sdhci: Int enab: 0x03ff100b | Sig enab: 0x03ff100b
[ 2471.468469] mmc1: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000001
[ 2471.474998] mmc1: sdhci: Caps: 0x45ee6432 | Caps_1: 0x0000a525
[ 2471.481529] mmc1: sdhci: Cmd: 0x00000d1a | Max curr: 0x00080008
[ 2471.488061] mmc1: sdhci: Resp[0]: 0x00000900 | Resp[1]: 0x0076b27f
[ 2471.494592] mmc1: sdhci: Resp[2]: 0x325b5900 | Resp[3]: 0x00000900
[ 2471.501122] mmc1: sdhci: Host ctl2: 0x0000808c
[ 2471.505624] mmc1: sdhci: ADMA Err: 0x00000000 | ADMA Ptr: 0xc3400208
[ 2471.512155] mmc1: sdhci: ============================================
[ 2481.647850] mmc1: Timeout waiting for hardware cmd interrupt.
[ 2481.653680] mmc1: sdhci: ============ SDHCI REGISTER DUMP ===========
[ 2481.660211] mmc1: sdhci: Sys addr: 0x00000008 | Version: 0x00001002
[ 2481.666742] mmc1: sdhci: Blk size: 0x00007200 | Blk cnt: 0x00000000
[ 2481.673273] mmc1: sdhci: Argument: 0xaaaa0000 | Trn mode: 0x00000033
[ 2481.679804] mmc1: sdhci: Present: 0x1fff0000 | Host ctl: 0x00000017
[ 2481.686335] mmc1: sdhci: Power: 0x0000000f | Blk gap: 0x00000080
[ 2481.692866] mmc1: sdhci: Wake-up: 0x00000000 | Clock: 0x00000107
[ 2481.699397] mmc1: sdhci: Timeout: 0x00000000 | Int stat: 0x00000001
[ 2481.705928] mmc1: sdhci: Int enab: 0x03ff100b | Sig enab: 0x03ff100b
[ 2481.712459] mmc1: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000001
[ 2481.718988] mmc1: sdhci: Caps: 0x45ee6432 | Caps_1: 0x0000a525
[ 2481.725519] mmc1: sdhci: Cmd: 0x00000d1a | Max curr: 0x00080008
[ 2481.732050] mmc1: sdhci: Resp[0]: 0x00000900 | Resp[1]: 0x0076b27f
[ 2481.738581] mmc1: sdhci: Resp[2]: 0x325b5900 | Resp[3]: 0x00000900
[ 2481.745111] mmc1: sdhci: Host ctl2: 0x0000808c
[ 2481.749613] mmc1: sdhci: ADMA Err: 0x00000000 | ADMA Ptr: 0xc3400208
[ 2481.756143] mmc1: sdhci: ============================================
[ 2481.763349] mmc1: card aaaa removed
[ 2523.923843] rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
[ 2523.930032] rcu: 1-...0: (1 GPs behind)
idle=add/1/0x4000000000000000 softirq=481373/481374 fqs=3169
[ 2523.939484] (detected by 3, t=21044 jiffies, g=915057, q=1474)
[ 2523.945491] Task dump for CPU 1:
[ 2523.948761] task:cgroup_fj_stres state:R running task stack:
0 pid: 6806 ppid: 501 flags:0x00000202
[ 2523.958831] Call trace:
[ 2523.961307] __switch_to+0x104/0x160
[ 2523.964942] 0xffff0000403cff00

Full test log,
https://lkft.validation.linaro.org/scheduler/job/5075973

--
Linaro LKFT
https://lkft.linaro.org


2022-05-23 11:14:18

by Stefan Wahren

[permalink] [raw]
Subject: Re: WARNING: CPU: 2 PID: 3331 at drivers/firmware/raspberrypi.c:63 rpi_firmware_property_list

Hi Naresh,

Am 23.05.22 um 11:04 schrieb Naresh Kamboju:
> The following kernel warning was noticed on raspberrypi4 device while
> running LTP controllers with Linus mainline kernel tree.
>
> Reported-by: Linux Kernel Functional Testing <[email protected]>
>
> metadata:
> git_ref: master
> git_repo: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git
> git_sha: 4b0986a3613c92f4ec1bdc7f60ec66fea135991f
> git_describe: v5.18
> kernel_version: 5.18.0
> kernel-config: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu/config
> artifact-location: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu
> toolchain: gcc-11
> System.map: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu/System.map
> vmlinux.xz: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu/vmlinux.xz

thanks for your report.

Do you know this worked before (e.g. 5.17)?

Am i correct that you are using GPU firmware 2021-02-25T12:10:40?

>
> Steps to reproduce:
> # cd /opt/ltp
> # ./runltp -p -q -f controllers
>
> cgroup_fj_stress_blkio_1_200_one 1 TINFO: Attaching task 7221 to
> /sys/fs/cgroup/blkio/ltp/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1
> cgroup_fj_stress_blkio_1_200_one 1 TINFO: Attaching task 7221 to
> /sys/fs/cgroup/blkio/ltp/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1
> cgroup_fj_stress_blkio_1_200_one 1 TINFO: Attaching task 7221 to
> /sys/fs/cgroup/blkio/ltp/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1
> [ 2442.255887] ------------[ cut here ]------------
> [ 2442.260615] Firmware transaction timeout
> [ 2442.260704] WARNING: CPU: 2 PID: 3331 at
> drivers/firmware/raspberrypi.c:63
> rpi_firmware_property_list+0x1e8/0x270
> [ 2442.275114] Modules linked in: btrfs blake2b_generic libcrc32c
> raid6_pq zstd_compress brcmfmac xhci_pci brcmutil snd_soc_hdmi_codec
> xhci_pci_renesas raspberrypi_cpufreq hci_uart btqca btbcm vc4 cfg80211
> bluetooth cec drm_cma_helper rfkill drm_kms_helper reset_raspberrypi
> clk_raspberrypi crct10dif_ce raspberrypi_hwmon drm pwm_bcm2835
> iproc_rng200 pcie_brcmstb i2c_bcm2835 rng_core bcm2711_thermal fuse
> [ 2442.311148] CPU: 2 PID: 3331 Comm: kworker/2:2 Not tainted 5.18.0 #1
> [ 2442.317599] Hardware name: Raspberry Pi 4 Model B (DT)
> [ 2442.322810] Workqueue: events get_values_poll [raspberrypi_hwmon]
> [ 2442.329007] pstate: 60000005 (nZCv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
> [ 2442.336074] pc : rpi_firmware_property_list+0x1e8/0x270
> [ 2442.341381] lr : rpi_firmware_property_list+0x1e8/0x270
> [ 2442.346685] sp : ffff800010d63cc0
> [ 2442.350043] x29: ffff800010d63cc0 x28: 0000000000000000 x27: ffff0000419d0540
> [ 2442.357297] x26: 0000000000000011 x25: ffff80000afdd008 x24: 0000000000001000
> [ 2442.364547] x23: ffff000061ce9300 x22: ffff80000ac330b0 x21: ffff0000419d0500
> [ 2442.371797] x20: 0000000000000010 x19: ffff80000afdd000 x18: ffffffffffffffff
> [ 2442.379046] x17: 0000000000000000 x16: 0000000000000000 x15: ffff800090d63937
> [ 2442.386295] x14: ffff80000a912dd0 x13: 74756f656d697420 x12: 6e6f69746361736e
> [ 2442.393544] x11: ffff80000a8b1ce8 x10: ffff80000a91c178 x9 : ffff80000811d524
> [ 2442.400794] x8 : 00000000ffffefff x7 : ffff80000a909cf8 x6 : 0000000000000001
> [ 2442.408044] x5 : ffff80000a88f000 x4 : ffff80000a88f2c0 x3 : 0000000000000000
> [ 2442.415293] x2 : 0000000000000000 x1 : 0000000000000000 x0 : ffff000062921040
> [ 2442.422543] Call trace:
> [ 2442.425020] rpi_firmware_property_list+0x1e8/0x270
> [ 2442.429973] rpi_firmware_property+0x7c/0xc0
> [ 2442.434308] get_values_poll+0x54/0x120 [raspberrypi_hwmon]
> [ 2442.439967] process_one_work+0x1dc/0x450
> [ 2442.444045] worker_thread+0x154/0x450
> [ 2442.447850] kthread+0x100/0x110
> [ 2442.451126] ret_from_fork+0x10/0x20
> [ 2442.454760] ---[ end trace 0000000000000000 ]---
> [ 2442.471909] hwmon hwmon1: Failed to get throttled (-110)
> [ 2443.499865] raspberrypi-clk soc:firmware:clocks: Failed to change
> fw-clk-arm frequency: -110
> [ 2446.575893] raspberrypi-clk soc:firmware:clocks: Failed to change
> fw-clk-arm frequency: -110
> [ 2450.915855] mmc1: Timeout waiting for hardware cmd interrupt.
> [ 2450.921693] mmc1: sdhci: ============ SDHCI REGISTER DUMP ===========
> [ 2450.928227] mmc1: sdhci: Sys addr: 0x00000008 | Version: 0x00001002
> [ 2450.934763] mmc1: sdhci: Blk size: 0x00007200 | Blk cnt: 0x00000000
> [ 2450.941297] mmc1: sdhci: Argument: 0xaaaa0000 | Trn mode: 0x00000033
> [ 2450.947830] mmc1: sdhci: Present: 0x1fff0000 | Host ctl: 0x00000017
> [ 2450.954362] mmc1: sdhci: Power: 0x0000000f | Blk gap: 0x00000080
> [ 2450.960896] mmc1: sdhci: Wake-up: 0x00000000 | Clock: 0x00000107
> [ 2450.967429] mmc1: sdhci: Timeout: 0x00000000 | Int stat: 0x00000001
> [ 2450.973960] mmc1: sdhci: Int enab: 0x03ff100b | Sig enab: 0x03ff100b
> [ 2450.980493] mmc1: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000001
> [ 2450.987025] mmc1: sdhci: Caps: 0x45ee6432 | Caps_1: 0x0000a525
> [ 2450.993558] mmc1: sdhci: Cmd: 0x00000d1a | Max curr: 0x00080008
> [ 2451.000091] mmc1: sdhci: Resp[0]: 0x00000900 | Resp[1]: 0x0076b27f
> [ 2451.006624] mmc1: sdhci: Resp[2]: 0x325b5900 | Resp[3]: 0x00000900
> [ 2451.013155] mmc1: sdhci: Host ctl2: 0x0000808c
> [ 2451.017659] mmc1: sdhci: ADMA Err: 0x00000000 | ADMA Ptr: 0xc3400208
> [ 2451.024191] mmc1: sdhci: ============================================
> [ 2452.679889] raspberrypi-clk soc:firmware:clocks: Failed to change
> fw-clk-arm frequency: -110
> [ 2455.763879] raspberrypi-clk soc:firmware:clocks: Failed to change
> fw-clk-arm frequency: -110
> [ 2460.759844] rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
> [ 2460.766034] rcu: 1-...!: (1 GPs behind)
> idle=add/1/0x4000000000000000 softirq=481373/481374 fqs=23
> [ 2460.775311] (detected by 3, t=5253 jiffies, g=915057, q=822)
> [ 2460.781141] Task dump for CPU 1:
> [ 2460.784411] task:cgroup_fj_stres state:R running task stack:
> 0 pid: 6806 ppid: 501 flags:0x00000200
> [ 2460.794483] Call trace:
> [ 2460.796958] __switch_to+0x104/0x160
> [ 2460.800595] 0xffff0000403cff00
> [ 2460.803782] rcu: rcu_preempt kthread timer wakeup didn't happen for
> 5216 jiffies! g915057 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402
> [ 2460.815430] rcu: Possible timer handling issue on cpu=0 timer-softirq=161506
> [ 2460.822667] rcu: rcu_preempt kthread starved for 5222 jiffies!
> g915057 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402 ->cpu=0
> [ 2460.833345] rcu: Unless rcu_preempt kthread gets sufficient CPU
> time, OOM is now expected behavior.
> [ 2460.842606] rcu: RCU grace-period kthread stack dump:
> [ 2460.847726] task:rcu_preempt state:I stack: 0 pid: 15
> ppid: 2 flags:0x00000008
> [ 2460.856203] Call trace:
> [ 2460.858678] __switch_to+0x104/0x160
> [ 2460.862305] __schedule+0x314/0x8ac
> [ 2460.865843] schedule+0x5c/0xd4
> [ 2460.869028] schedule_timeout+0xa4/0x1c4
> [ 2460.873010] rcu_gp_fqs_loop+0x140/0x520
> [ 2460.876998] rcu_gp_kthread+0x1a8/0x25c
> [ 2460.880891] kthread+0x100/0x110
> [ 2460.884167] ret_from_fork+0x10/0x20
> [ 2460.887796] rcu: Stack dump where RCU GP kthread last ran:
> [ 2460.893357] Task dump for CPU 0:
> [ 2460.896624] task:swapper/0 state:R running task stack:
> 0 pid: 0 ppid: 0 flags:0x0000000a
> [ 2460.906690] Call trace:
> [ 2460.909165] __switch_to+0x104/0x160
> [ 2460.912791] 0x0
> [ 2461.159863] mmc1: Timeout waiting for hardware cmd interrupt.
> [ 2461.165699] mmc1: sdhci: ============ SDHCI REGISTER DUMP ===========
> [ 2461.172231] mmc1: sdhci: Sys addr: 0x00000008 | Version: 0x00001002
> [ 2461.178763] mmc1: sdhci: Blk size: 0x00007200 | Blk cnt: 0x00000000
> [ 2461.185294] mmc1: sdhci: Argument: 0xaaaa0000 | Trn mode: 0x00000033
> [ 2461.191826] mmc1: sdhci: Present: 0x1fff0000 | Host ctl: 0x00000017
> [ 2461.198358] mmc1: sdhci: Power: 0x0000000f | Blk gap: 0x00000080
> [ 2461.204887] mmc1: sdhci: Wake-up: 0x00000000 | Clock: 0x00000107
> [ 2461.211419] mmc1: sdhci: Timeout: 0x00000000 | Int stat: 0x00000001
> [ 2461.217951] mmc1: sdhci: Int enab: 0x03ff100b | Sig enab: 0x03ff100b
> [ 2461.224483] mmc1: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000001
> [ 2461.231013] mmc1: sdhci: Caps: 0x45ee6432 | Caps_1: 0x0000a525
> [ 2461.237543] mmc1: sdhci: Cmd: 0x00000d1a | Max curr: 0x00080008
> [ 2461.244073] mmc1: sdhci: Resp[0]: 0x00000900 | Resp[1]: 0x0076b27f
> [ 2461.250604] mmc1: sdhci: Resp[2]: 0x325b5900 | Resp[3]: 0x00000900
> [ 2461.257135] mmc1: sdhci: Host ctl2: 0x0000808c
> [ 2461.261639] mmc1: sdhci: ADMA Err: 0x00000000 | ADMA Ptr: 0xc3400208
> [ 2461.268169] mmc1: sdhci: ============================================
> [ 2462.019897] raspberrypi-clk soc:firmware:clocks: Failed to change
> fw-clk-arm frequency: -110
> [ 2471.403852] mmc1: Timeout waiting for hardware cmd interrupt.
> [ 2471.409689] mmc1: sdhci: ============ SDHCI REGISTER DUMP ===========
> [ 2471.416219] mmc1: sdhci: Sys addr: 0x00000008 | Version: 0x00001002
> [ 2471.422751] mmc1: sdhci: Blk size: 0x00007200 | Blk cnt: 0x00000000
> [ 2471.429282] mmc1: sdhci: Argument: 0xaaaa0000 | Trn mode: 0x00000033
> [ 2471.435814] mmc1: sdhci: Present: 0x1fff0000 | Host ctl: 0x00000017
> [ 2471.442345] mmc1: sdhci: Power: 0x0000000f | Blk gap: 0x00000080
> [ 2471.448876] mmc1: sdhci: Wake-up: 0x00000000 | Clock: 0x00000107
> [ 2471.455407] mmc1: sdhci: Timeout: 0x00000000 | Int stat: 0x00000001
> [ 2471.461938] mmc1: sdhci: Int enab: 0x03ff100b | Sig enab: 0x03ff100b
> [ 2471.468469] mmc1: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000001
> [ 2471.474998] mmc1: sdhci: Caps: 0x45ee6432 | Caps_1: 0x0000a525
> [ 2471.481529] mmc1: sdhci: Cmd: 0x00000d1a | Max curr: 0x00080008
> [ 2471.488061] mmc1: sdhci: Resp[0]: 0x00000900 | Resp[1]: 0x0076b27f
> [ 2471.494592] mmc1: sdhci: Resp[2]: 0x325b5900 | Resp[3]: 0x00000900
> [ 2471.501122] mmc1: sdhci: Host ctl2: 0x0000808c
> [ 2471.505624] mmc1: sdhci: ADMA Err: 0x00000000 | ADMA Ptr: 0xc3400208
> [ 2471.512155] mmc1: sdhci: ============================================
> [ 2481.647850] mmc1: Timeout waiting for hardware cmd interrupt.
> [ 2481.653680] mmc1: sdhci: ============ SDHCI REGISTER DUMP ===========
> [ 2481.660211] mmc1: sdhci: Sys addr: 0x00000008 | Version: 0x00001002
> [ 2481.666742] mmc1: sdhci: Blk size: 0x00007200 | Blk cnt: 0x00000000
> [ 2481.673273] mmc1: sdhci: Argument: 0xaaaa0000 | Trn mode: 0x00000033
> [ 2481.679804] mmc1: sdhci: Present: 0x1fff0000 | Host ctl: 0x00000017
> [ 2481.686335] mmc1: sdhci: Power: 0x0000000f | Blk gap: 0x00000080
> [ 2481.692866] mmc1: sdhci: Wake-up: 0x00000000 | Clock: 0x00000107
> [ 2481.699397] mmc1: sdhci: Timeout: 0x00000000 | Int stat: 0x00000001
> [ 2481.705928] mmc1: sdhci: Int enab: 0x03ff100b | Sig enab: 0x03ff100b
> [ 2481.712459] mmc1: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000001
> [ 2481.718988] mmc1: sdhci: Caps: 0x45ee6432 | Caps_1: 0x0000a525
> [ 2481.725519] mmc1: sdhci: Cmd: 0x00000d1a | Max curr: 0x00080008
> [ 2481.732050] mmc1: sdhci: Resp[0]: 0x00000900 | Resp[1]: 0x0076b27f
> [ 2481.738581] mmc1: sdhci: Resp[2]: 0x325b5900 | Resp[3]: 0x00000900
> [ 2481.745111] mmc1: sdhci: Host ctl2: 0x0000808c
> [ 2481.749613] mmc1: sdhci: ADMA Err: 0x00000000 | ADMA Ptr: 0xc3400208
> [ 2481.756143] mmc1: sdhci: ============================================
> [ 2481.763349] mmc1: card aaaa removed
> [ 2523.923843] rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
> [ 2523.930032] rcu: 1-...0: (1 GPs behind)
> idle=add/1/0x4000000000000000 softirq=481373/481374 fqs=3169
> [ 2523.939484] (detected by 3, t=21044 jiffies, g=915057, q=1474)
> [ 2523.945491] Task dump for CPU 1:
> [ 2523.948761] task:cgroup_fj_stres state:R running task stack:
> 0 pid: 6806 ppid: 501 flags:0x00000202
> [ 2523.958831] Call trace:
> [ 2523.961307] __switch_to+0x104/0x160
> [ 2523.964942] 0xffff0000403cff00
>
> Full test log,
> https://lkft.validation.linaro.org/scheduler/job/5075973
>
> --
> Linaro LKFT
> https://lkft.linaro.org
>
> _______________________________________________
> linux-arm-kernel mailing list
> [email protected]
> http://lists.infradead.org/mailman/listinfo/linux-arm-kernel

2022-05-23 12:37:09

by Naresh Kamboju

[permalink] [raw]
Subject: Re: WARNING: CPU: 2 PID: 3331 at drivers/firmware/raspberrypi.c:63 rpi_firmware_property_list

On Mon, 23 May 2022 at 16:43, Stefan Wahren <[email protected]> wrote:
>
> Hi Naresh,
>
> Am 23.05.22 um 11:04 schrieb Naresh Kamboju:
> > The following kernel warning was noticed on raspberrypi4 device while
> > running LTP controllers with Linus mainline kernel tree.
> >
> > Reported-by: Linux Kernel Functional Testing <[email protected]>
> >
> > metadata:
> > git_ref: master
> > git_repo: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git
> > git_sha: 4b0986a3613c92f4ec1bdc7f60ec66fea135991f
> > git_describe: v5.18
> > kernel_version: 5.18.0
> > kernel-config: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu/config
> > artifact-location: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu
> > toolchain: gcc-11
> > System.map: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu/System.map
> > vmlinux.xz: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu/vmlinux.xz
>
> thanks for your report.
>
> Do you know this worked before (e.g. 5.17)?

Yes. it worked before.

> Am I correct that you are using GPU firmware 2021-02-25T12:10:40?

Correct.

- Naresh

2022-05-23 13:47:21

by Paul E. McKenney

[permalink] [raw]
Subject: Re: WARNING: CPU: 2 PID: 3331 at drivers/firmware/raspberrypi.c:63 rpi_firmware_property_list

On Mon, May 23, 2022 at 02:34:04PM +0530, Naresh Kamboju wrote:
> The following kernel warning was noticed on raspberrypi4 device while
> running LTP controllers with Linus mainline kernel tree.
>
> Reported-by: Linux Kernel Functional Testing <[email protected]>
>
> metadata:
> git_ref: master
> git_repo: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git
> git_sha: 4b0986a3613c92f4ec1bdc7f60ec66fea135991f
> git_describe: v5.18
> kernel_version: 5.18.0
> kernel-config: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu/config
> artifact-location: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu
> toolchain: gcc-11
> System.map: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu/System.map
> vmlinux.xz: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu/vmlinux.xz
>
> Steps to reproduce:
> # cd /opt/ltp
> # ./runltp -p -q -f controllers
>
> cgroup_fj_stress_blkio_1_200_one 1 TINFO: Attaching task 7221 to
> /sys/fs/cgroup/blkio/ltp/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1
> cgroup_fj_stress_blkio_1_200_one 1 TINFO: Attaching task 7221 to
> /sys/fs/cgroup/blkio/ltp/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1
> cgroup_fj_stress_blkio_1_200_one 1 TINFO: Attaching task 7221 to
> /sys/fs/cgroup/blkio/ltp/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1/1
> [ 2442.255887] ------------[ cut here ]------------
> [ 2442.260615] Firmware transaction timeout
> [ 2442.260704] WARNING: CPU: 2 PID: 3331 at
> drivers/firmware/raspberrypi.c:63
> rpi_firmware_property_list+0x1e8/0x270
> [ 2442.275114] Modules linked in: btrfs blake2b_generic libcrc32c
> raid6_pq zstd_compress brcmfmac xhci_pci brcmutil snd_soc_hdmi_codec
> xhci_pci_renesas raspberrypi_cpufreq hci_uart btqca btbcm vc4 cfg80211
> bluetooth cec drm_cma_helper rfkill drm_kms_helper reset_raspberrypi
> clk_raspberrypi crct10dif_ce raspberrypi_hwmon drm pwm_bcm2835
> iproc_rng200 pcie_brcmstb i2c_bcm2835 rng_core bcm2711_thermal fuse
> [ 2442.311148] CPU: 2 PID: 3331 Comm: kworker/2:2 Not tainted 5.18.0 #1
> [ 2442.317599] Hardware name: Raspberry Pi 4 Model B (DT)
> [ 2442.322810] Workqueue: events get_values_poll [raspberrypi_hwmon]
> [ 2442.329007] pstate: 60000005 (nZCv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
> [ 2442.336074] pc : rpi_firmware_property_list+0x1e8/0x270
> [ 2442.341381] lr : rpi_firmware_property_list+0x1e8/0x270
> [ 2442.346685] sp : ffff800010d63cc0
> [ 2442.350043] x29: ffff800010d63cc0 x28: 0000000000000000 x27: ffff0000419d0540
> [ 2442.357297] x26: 0000000000000011 x25: ffff80000afdd008 x24: 0000000000001000
> [ 2442.364547] x23: ffff000061ce9300 x22: ffff80000ac330b0 x21: ffff0000419d0500
> [ 2442.371797] x20: 0000000000000010 x19: ffff80000afdd000 x18: ffffffffffffffff
> [ 2442.379046] x17: 0000000000000000 x16: 0000000000000000 x15: ffff800090d63937
> [ 2442.386295] x14: ffff80000a912dd0 x13: 74756f656d697420 x12: 6e6f69746361736e
> [ 2442.393544] x11: ffff80000a8b1ce8 x10: ffff80000a91c178 x9 : ffff80000811d524
> [ 2442.400794] x8 : 00000000ffffefff x7 : ffff80000a909cf8 x6 : 0000000000000001
> [ 2442.408044] x5 : ffff80000a88f000 x4 : ffff80000a88f2c0 x3 : 0000000000000000
> [ 2442.415293] x2 : 0000000000000000 x1 : 0000000000000000 x0 : ffff000062921040
> [ 2442.422543] Call trace:
> [ 2442.425020] rpi_firmware_property_list+0x1e8/0x270
> [ 2442.429973] rpi_firmware_property+0x7c/0xc0
> [ 2442.434308] get_values_poll+0x54/0x120 [raspberrypi_hwmon]
> [ 2442.439967] process_one_work+0x1dc/0x450
> [ 2442.444045] worker_thread+0x154/0x450
> [ 2442.447850] kthread+0x100/0x110
> [ 2442.451126] ret_from_fork+0x10/0x20
> [ 2442.454760] ---[ end trace 0000000000000000 ]---
> [ 2442.471909] hwmon hwmon1: Failed to get throttled (-110)
> [ 2443.499865] raspberrypi-clk soc:firmware:clocks: Failed to change
> fw-clk-arm frequency: -110
> [ 2446.575893] raspberrypi-clk soc:firmware:clocks: Failed to change
> fw-clk-arm frequency: -110
> [ 2450.915855] mmc1: Timeout waiting for hardware cmd interrupt.
> [ 2450.921693] mmc1: sdhci: ============ SDHCI REGISTER DUMP ===========
> [ 2450.928227] mmc1: sdhci: Sys addr: 0x00000008 | Version: 0x00001002
> [ 2450.934763] mmc1: sdhci: Blk size: 0x00007200 | Blk cnt: 0x00000000
> [ 2450.941297] mmc1: sdhci: Argument: 0xaaaa0000 | Trn mode: 0x00000033
> [ 2450.947830] mmc1: sdhci: Present: 0x1fff0000 | Host ctl: 0x00000017
> [ 2450.954362] mmc1: sdhci: Power: 0x0000000f | Blk gap: 0x00000080
> [ 2450.960896] mmc1: sdhci: Wake-up: 0x00000000 | Clock: 0x00000107
> [ 2450.967429] mmc1: sdhci: Timeout: 0x00000000 | Int stat: 0x00000001
> [ 2450.973960] mmc1: sdhci: Int enab: 0x03ff100b | Sig enab: 0x03ff100b
> [ 2450.980493] mmc1: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000001
> [ 2450.987025] mmc1: sdhci: Caps: 0x45ee6432 | Caps_1: 0x0000a525
> [ 2450.993558] mmc1: sdhci: Cmd: 0x00000d1a | Max curr: 0x00080008
> [ 2451.000091] mmc1: sdhci: Resp[0]: 0x00000900 | Resp[1]: 0x0076b27f
> [ 2451.006624] mmc1: sdhci: Resp[2]: 0x325b5900 | Resp[3]: 0x00000900
> [ 2451.013155] mmc1: sdhci: Host ctl2: 0x0000808c
> [ 2451.017659] mmc1: sdhci: ADMA Err: 0x00000000 | ADMA Ptr: 0xc3400208
> [ 2451.024191] mmc1: sdhci: ============================================
> [ 2452.679889] raspberrypi-clk soc:firmware:clocks: Failed to change
> fw-clk-arm frequency: -110
> [ 2455.763879] raspberrypi-clk soc:firmware:clocks: Failed to change
> fw-clk-arm frequency: -110
> [ 2460.759844] rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
> [ 2460.766034] rcu: 1-...!: (1 GPs behind)
> idle=add/1/0x4000000000000000 softirq=481373/481374 fqs=23
> [ 2460.775311] (detected by 3, t=5253 jiffies, g=915057, q=822)
> [ 2460.781141] Task dump for CPU 1:
> [ 2460.784411] task:cgroup_fj_stres state:R running task stack:
> 0 pid: 6806 ppid: 501 flags:0x00000200
> [ 2460.794483] Call trace:
> [ 2460.796958] __switch_to+0x104/0x160
> [ 2460.800595] 0xffff0000403cff00
> [ 2460.803782] rcu: rcu_preempt kthread timer wakeup didn't happen for
> 5216 jiffies! g915057 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402
> [ 2460.815430] rcu: Possible timer handling issue on cpu=0 timer-softirq=161506

This looks to be a follow-on error due to the earlier firmware splat
above. What happened here is that RCU's grace-period kthread attempted
to sleep for a few jiffies, but the timer's wakeup still hadn't happened
more than 5,000 jiffies later.

And is the "Timeout waiting for hardware cmd interrupt" below
additional evidence in favor of this diagnosis?

Thanx, Paul

> [ 2460.822667] rcu: rcu_preempt kthread starved for 5222 jiffies!
> g915057 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402 ->cpu=0
> [ 2460.833345] rcu: Unless rcu_preempt kthread gets sufficient CPU
> time, OOM is now expected behavior.
> [ 2460.842606] rcu: RCU grace-period kthread stack dump:
> [ 2460.847726] task:rcu_preempt state:I stack: 0 pid: 15
> ppid: 2 flags:0x00000008
> [ 2460.856203] Call trace:
> [ 2460.858678] __switch_to+0x104/0x160
> [ 2460.862305] __schedule+0x314/0x8ac
> [ 2460.865843] schedule+0x5c/0xd4
> [ 2460.869028] schedule_timeout+0xa4/0x1c4
> [ 2460.873010] rcu_gp_fqs_loop+0x140/0x520
> [ 2460.876998] rcu_gp_kthread+0x1a8/0x25c
> [ 2460.880891] kthread+0x100/0x110
> [ 2460.884167] ret_from_fork+0x10/0x20
> [ 2460.887796] rcu: Stack dump where RCU GP kthread last ran:
> [ 2460.893357] Task dump for CPU 0:
> [ 2460.896624] task:swapper/0 state:R running task stack:
> 0 pid: 0 ppid: 0 flags:0x0000000a
> [ 2460.906690] Call trace:
> [ 2460.909165] __switch_to+0x104/0x160
> [ 2460.912791] 0x0
> [ 2461.159863] mmc1: Timeout waiting for hardware cmd interrupt.
> [ 2461.165699] mmc1: sdhci: ============ SDHCI REGISTER DUMP ===========
> [ 2461.172231] mmc1: sdhci: Sys addr: 0x00000008 | Version: 0x00001002
> [ 2461.178763] mmc1: sdhci: Blk size: 0x00007200 | Blk cnt: 0x00000000
> [ 2461.185294] mmc1: sdhci: Argument: 0xaaaa0000 | Trn mode: 0x00000033
> [ 2461.191826] mmc1: sdhci: Present: 0x1fff0000 | Host ctl: 0x00000017
> [ 2461.198358] mmc1: sdhci: Power: 0x0000000f | Blk gap: 0x00000080
> [ 2461.204887] mmc1: sdhci: Wake-up: 0x00000000 | Clock: 0x00000107
> [ 2461.211419] mmc1: sdhci: Timeout: 0x00000000 | Int stat: 0x00000001
> [ 2461.217951] mmc1: sdhci: Int enab: 0x03ff100b | Sig enab: 0x03ff100b
> [ 2461.224483] mmc1: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000001
> [ 2461.231013] mmc1: sdhci: Caps: 0x45ee6432 | Caps_1: 0x0000a525
> [ 2461.237543] mmc1: sdhci: Cmd: 0x00000d1a | Max curr: 0x00080008
> [ 2461.244073] mmc1: sdhci: Resp[0]: 0x00000900 | Resp[1]: 0x0076b27f
> [ 2461.250604] mmc1: sdhci: Resp[2]: 0x325b5900 | Resp[3]: 0x00000900
> [ 2461.257135] mmc1: sdhci: Host ctl2: 0x0000808c
> [ 2461.261639] mmc1: sdhci: ADMA Err: 0x00000000 | ADMA Ptr: 0xc3400208
> [ 2461.268169] mmc1: sdhci: ============================================
> [ 2462.019897] raspberrypi-clk soc:firmware:clocks: Failed to change
> fw-clk-arm frequency: -110
> [ 2471.403852] mmc1: Timeout waiting for hardware cmd interrupt.
> [ 2471.409689] mmc1: sdhci: ============ SDHCI REGISTER DUMP ===========
> [ 2471.416219] mmc1: sdhci: Sys addr: 0x00000008 | Version: 0x00001002
> [ 2471.422751] mmc1: sdhci: Blk size: 0x00007200 | Blk cnt: 0x00000000
> [ 2471.429282] mmc1: sdhci: Argument: 0xaaaa0000 | Trn mode: 0x00000033
> [ 2471.435814] mmc1: sdhci: Present: 0x1fff0000 | Host ctl: 0x00000017
> [ 2471.442345] mmc1: sdhci: Power: 0x0000000f | Blk gap: 0x00000080
> [ 2471.448876] mmc1: sdhci: Wake-up: 0x00000000 | Clock: 0x00000107
> [ 2471.455407] mmc1: sdhci: Timeout: 0x00000000 | Int stat: 0x00000001
> [ 2471.461938] mmc1: sdhci: Int enab: 0x03ff100b | Sig enab: 0x03ff100b
> [ 2471.468469] mmc1: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000001
> [ 2471.474998] mmc1: sdhci: Caps: 0x45ee6432 | Caps_1: 0x0000a525
> [ 2471.481529] mmc1: sdhci: Cmd: 0x00000d1a | Max curr: 0x00080008
> [ 2471.488061] mmc1: sdhci: Resp[0]: 0x00000900 | Resp[1]: 0x0076b27f
> [ 2471.494592] mmc1: sdhci: Resp[2]: 0x325b5900 | Resp[3]: 0x00000900
> [ 2471.501122] mmc1: sdhci: Host ctl2: 0x0000808c
> [ 2471.505624] mmc1: sdhci: ADMA Err: 0x00000000 | ADMA Ptr: 0xc3400208
> [ 2471.512155] mmc1: sdhci: ============================================
> [ 2481.647850] mmc1: Timeout waiting for hardware cmd interrupt.
> [ 2481.653680] mmc1: sdhci: ============ SDHCI REGISTER DUMP ===========
> [ 2481.660211] mmc1: sdhci: Sys addr: 0x00000008 | Version: 0x00001002
> [ 2481.666742] mmc1: sdhci: Blk size: 0x00007200 | Blk cnt: 0x00000000
> [ 2481.673273] mmc1: sdhci: Argument: 0xaaaa0000 | Trn mode: 0x00000033
> [ 2481.679804] mmc1: sdhci: Present: 0x1fff0000 | Host ctl: 0x00000017
> [ 2481.686335] mmc1: sdhci: Power: 0x0000000f | Blk gap: 0x00000080
> [ 2481.692866] mmc1: sdhci: Wake-up: 0x00000000 | Clock: 0x00000107
> [ 2481.699397] mmc1: sdhci: Timeout: 0x00000000 | Int stat: 0x00000001
> [ 2481.705928] mmc1: sdhci: Int enab: 0x03ff100b | Sig enab: 0x03ff100b
> [ 2481.712459] mmc1: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000001
> [ 2481.718988] mmc1: sdhci: Caps: 0x45ee6432 | Caps_1: 0x0000a525
> [ 2481.725519] mmc1: sdhci: Cmd: 0x00000d1a | Max curr: 0x00080008
> [ 2481.732050] mmc1: sdhci: Resp[0]: 0x00000900 | Resp[1]: 0x0076b27f
> [ 2481.738581] mmc1: sdhci: Resp[2]: 0x325b5900 | Resp[3]: 0x00000900
> [ 2481.745111] mmc1: sdhci: Host ctl2: 0x0000808c
> [ 2481.749613] mmc1: sdhci: ADMA Err: 0x00000000 | ADMA Ptr: 0xc3400208
> [ 2481.756143] mmc1: sdhci: ============================================
> [ 2481.763349] mmc1: card aaaa removed
> [ 2523.923843] rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
> [ 2523.930032] rcu: 1-...0: (1 GPs behind)
> idle=add/1/0x4000000000000000 softirq=481373/481374 fqs=3169
> [ 2523.939484] (detected by 3, t=21044 jiffies, g=915057, q=1474)
> [ 2523.945491] Task dump for CPU 1:
> [ 2523.948761] task:cgroup_fj_stres state:R running task stack:
> 0 pid: 6806 ppid: 501 flags:0x00000202
> [ 2523.958831] Call trace:
> [ 2523.961307] __switch_to+0x104/0x160
> [ 2523.964942] 0xffff0000403cff00
>
> Full test log,
> https://lkft.validation.linaro.org/scheduler/job/5075973
>
> --
> Linaro LKFT
> https://lkft.linaro.org

2022-05-23 20:42:52

by Stefan Wahren

[permalink] [raw]
Subject: Re: WARNING: CPU: 2 PID: 3331 at drivers/firmware/raspberrypi.c:63 rpi_firmware_property_list

Hi Naresh,

Am 23.05.22 um 14:35 schrieb Naresh Kamboju:
> On Mon, 23 May 2022 at 16:43, Stefan Wahren <[email protected]> wrote:
>> Hi Naresh,
>>
>> Am 23.05.22 um 11:04 schrieb Naresh Kamboju:
>>> The following kernel warning was noticed on raspberrypi4 device while
>>> running LTP controllers with Linus mainline kernel tree.
>>>
>>> Reported-by: Linux Kernel Functional Testing <[email protected]>
>>>
>>> metadata:
>>> git_ref: master
>>> git_repo: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git
>>> git_sha: 4b0986a3613c92f4ec1bdc7f60ec66fea135991f
>>> git_describe: v5.18
>>> kernel_version: 5.18.0
>>> kernel-config: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu/config
>>> artifact-location: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu
>>> toolchain: gcc-11
>>> System.map: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu/System.map
>>> vmlinux.xz: https://builds.tuxbuild.com/29XDpgnCMxtWJ6lVJ9M0x3hTNWu/vmlinux.xz
>> thanks for your report.
>>
>> Do you know this worked before (e.g. 5.17)?
> Yes. it worked before.
>
>> Am I correct that you are using GPU firmware 2021-02-25T12:10:40?
> Correct.

Since i also experienced firmware crashes [1] with 5.18, it possible for
you to update to a more recent firmware?

Best regards

[1] -
https://lore.kernel.org/all/[email protected]/

>
> - Naresh