2021-10-21 06:50:35

by Abdul Haleem

[permalink] [raw]
Subject: [next-20211019][PPC] kernel panics with lspci -vvnn command

Greeting's

Today's next kernel panics when lspci -vvnn commands is executed on my
powerpc machine

# lspci -vvnn
0012:01:00.0 Fibre Channel [0c04]: QLogic Corp. ISP2722-based 16/32Gb
Fibre Channel to PCIe Adapter [1077:2261] (rev 01)
    Subsystem: IBM Device [1014:0650]
    Physical Slot: U78D8.ND0.FGD004S-P0-C2-C0
    Device tree node:
/sys/firmware/devicetree/base/pci@800000020000012/fibre-channel@0
    Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+
Stepping- SERR+ FastB2B- DisINTx-
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort-
<TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0, Cache Line Size: 128 bytes
    Interrupt: pin A routed to IRQ 48
    NUMA node: 2
    IOMMU group: 0
    Region 0: Memory at 40000285000 (64-bit, prefetchable) [size=4K]
    Region 2: Memory at 40000282000 (64-bit, prefetchable) [size=8K]
    Region 4: Memory at 40000100000 (64-bit, prefetchable) [size=1M]
    Expansion ROM at 40000240000 [disabled] [size=256K]
    Capabilities: [44] Power Management version 3
        Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA
PME(D0-,D1-,D2-,D3hot-,D3cold-)
        Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
    Capabilities: [4c] Express (v2) Endpoint, MSI 00
        DevCap:    MaxPayload 2048 bytes, PhantFunc 0, Latency L0s
<4us, L1 <1us
            ExtTag- AttnBtn- AttnInd- PwrInd- RBE+ FLReset+
SlotPowerLimit 0.000W
        DevCtl:    CorrErr+ NonFatalErr+ FatalErr+ UnsupReq+
            RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+ FLReset-
            MaxPayload 512 bytes, MaxReadReq 4096 bytes
        DevSta:    CorrErr- NonFatalErr- FatalErr- UnsupReq- AuxPwr-
TransPend-
        LnkCap:    Port #0, Speed 8GT/s, Width x8, ASPM L0s L1, Exit
Latency L0s <2us, L1 <2us
            ClockPM- Surprise- LLActRep- BwNot- ASPMOptComp+
        LnkCtl:    ASPM Disabled; RCB 64 bytes, Disabled- CommClk-
            ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
        LnkSta:    Speed 8GT/s (ok), Width x8 (ok)
            TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
        DevCap2: Completion Timeout: Range B, TimeoutDis+ NROPrPrP- LTR-
             10BitTagComp- 10BitTagReq- OBFF Not Supported, ExtFmt-
EETLPPrefix-
             EmergencyPowerReduction Not Supported,
EmergencyPowerReductionInit-
             FRS- TPHComp- ExtTPHComp-
             AtomicOpsCap: 32bit- 64bit- 128bitCAS-
        DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis+ LTR-
OBFF Disabled,
             AtomicOpsCtl: ReqEn-
        LnkCap2: Supported Link Speeds: 2.5-8GT/s, Crosslink- Retimer-
2Retimers- DRS-
        LnkCtl2: Target Link Speed: 8GT/s, EnterCompliance- SpeedDis-
             Transmit Margin: Normal Operating Range,
EnterModifiedCompliance- ComplianceSOS-
             Compliance De-emphasis: -6dB
        LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete+
EqualizationPhase1+
             EqualizationPhase2+ EqualizationPhase3+
LinkEqualizationRequest-
             Retimer- 2Retimers- CrosslinkRes: unsupported
    Capabilities: [88] Vital Product Data
BUG: Kernel NULL pointer dereference on read at 0x000080a0
BUG: Unable to handle kernel data access on read at 0x3949ffff40920078
BUG: Unable to handle kernel data access on read at 0x694a0002e94d00f0
Faulting instruction address: 0xc0000000006f4498
Faulting instruction address: 0xc0000000001d3680
Oops: Kernel access of bad area, sig: 11 [#1]
LE PAGE_SIZE=64K MMU=Radix SMP NR_CPUS=2048 NUMA pSeries
Faulting instruction address: 0xc0000000001abcf0
Modules linked in:
Thread overran stack, or stack corrupted
 rpadlpar_io rpaphp nfnetlink tcp_diag udp_diag inet_diag unix_diag
af_packet_diag netlink_diag bonding rfkill sunrpc raid456 async_raid6_recov
async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c
pseries_rng xts vmx_crypto gf128mul binfmt_misc sch_fq_codel ip_tables ext4
mbcache jbd2 dm_service_time sd_mod sg qla2xxx ibmvfc ibmveth nvme_fc
nvme_fabrics nvme_core t10_pi scsi_transport_fc dm_multipath dm_mirror
dm_region_hash dm_log dm_mod fuse
CPU: 24 PID: 0 Comm: swapper/24 Kdump: loaded Not tainted
5.15.0-rc5-next-20211012-autotest #1
NIP:  c0000000006f4498 LR: c0000000006f9c18 CTR: c000000000026e60
REGS: c000000006797560 TRAP: 0380   Not tainted
(5.15.0-rc5-next-20211012-autotest)
MSR:  8000000000009033 <SF,EE,ME,IR,DR,RI,LE>  CR: 42000824  XER: 00000000
CFAR: c0000000006f440c IRQMASK: 1
GPR00: c00000000022434c c000000006797800 c0000000019b2500 c00000117db0ac28
GPR04: c00000117db0a520 0000000000000000 3949ffff40920078 0000000000000001
GPR08: c000000063bd3cf0 c00000000073a7a8 892100602e3f0000 7265677368657265
GPR12: c000000000026e60 c00000117fb4be80 0000000000000000 000000001eef2b00
GPR16: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
GPR20: 0000000000000000 0000000000000000 0000000000000003 0000000000000001
GPR24: 0000638695346493 0000000000000002 0000000000000003 c00000117db0a480
GPR28: c00000117db0a480 0000000000000000 c00000117db0a520 c00000117db0ac28
NIP [c0000000006f4498] rb_erase+0x158/0x440
LR [c0000000006f9c18] timerqueue_del+0x58/0xa0
Call Trace:
[c000000006797800] [0000000000000003] 0x3 (unreliable)
[c000000006797830] [c00000000022434c] __remove_hrtimer+0x5c/0xf0
[c000000006797870] [c000000000224b84] __hrtimer_run_queues+0x144/0x360
[c0000000067978f0] [c000000000225b90] hrtimer_interrupt+0x120/0x2f0
[c0000000067979a0] [c0000000000272d4] timer_interrupt+0x144/0x3e0
[c000000006797a00] [c000000000009a60] decrementer_common_virt+0x210/0x220
--- interrupt: 900 at plpar_hcall_norets_notrace+0x18/0x2c
NIP:  c0000000000ef100 LR: c0000000009bca88 CTR: 0000000000000000
REGS: c000000006797a70 TRAP: 0900   Not tainted
(5.15.0-rc5-next-20211012-autotest)
MSR:  800000000280b033 <SF,VEC,VSX,EE,FP,ME,IR,DR,RI,LE>  CR: 24000224 
XER: 00000000
CFAR: 0000000000000c00 IRQMASK: 0
GPR00: 0000000000000000 c000000006797d10 c0000000019b2500 0000000000000000
GPR04: 8004001f54ca7c10 0000000000000000 0000000000000004 0000000000000018
GPR08: 0000000000000000 0000000000000090 0000000000000001 8004001f54ca7c00
GPR12: 0000000000000000 c00000117fb4be80 0000000000000000 000000001eef2b00
GPR16: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
GPR20: 0000000000000000 0000000000000000 0000000000000000 c0000000018f1e28
GPR24: 0000000000000001 0000638691f200fe 0000000000000000 0000000000000000
GPR28: 0000000000000001 0000000000000000 c000000001192090 c000000001192098
NIP [c0000000000ef100] plpar_hcall_norets_notrace+0x18/0x2c
LR [c0000000009bca88] check_and_cede_processor+0x48/0x60
--- interrupt: 900
[c000000006797d10] [c00000117db0ac28] 0xc00000117db0ac28 (unreliable)
[c000000006797d70] [c0000000009bcec4] dedicated_cede_loop+0x94/0x1a0
[c000000006797dc0] [c0000000009b97f4] cpuidle_enter_state+0x2d4/0x4e0
[c000000006797e20] [c0000000009b9a98] cpuidle_enter+0x48/0x70
[c000000006797e60] [c0000000001a0264] call_cpuidle+0x44/0x80
[c000000006797e80] [c0000000001a0840] do_idle+0x340/0x390
[c000000006797f00] [c0000000001a0ac4] cpu_startup_entry+0x34/0x50
[c000000006797f30] [c00000000006144c] start_secondary+0x27c/0x290
[c000000006797f90] [c00000000000d254] start_secondary_prolog+0x10/0x14
Instruction dump:
7d284b78 e9280008 7fa95040 409eff9c e9280010 e8e90000 e9490008 70e70001
41820168 e8c90010 2fa60000 419e0010 <e8e60000> 70e70001 41820044 2faa0000
---[ end trace 3024a0b7ce3c6f83 ]---

# lspci -nn
0012:01:00.0 Fibre Channel [0c04]: QLogic Corp. ISP2722-based 16/32Gb
Fibre Channel to PCIe Adapter [1077:2261] (rev 01)
0012:01:00.1 Fibre Channel [0c04]: QLogic Corp. ISP2722-based 16/32Gb
Fibre Channel to PCIe Adapter [1077:2261] (rev 01)

Problem is not seen with 5.15.0-rc4-next-20211005

--
Regard's

Abdul Haleem
IBM Linux Technology Center