2019-11-04 13:58:39

by Justin Piszcz

Subject: 5.4-rc6 on Supermicro X9SRL-F - Kernel panic - not syncing: Timeout: Not all CPUs entered broadcast exception handler

Hello,

Kernel: 5.4-rc6
Arch: x86_64
Distro: Debian Testing

Problem: On occasion, every 4-5 reboots or so I get the following error
and the Linux kernel fails to boot (see attached screenshot of the
Linux console) when using a USB-3 PCI-e card.

Error when booting:
Kernel panic - not syncing: Timeout: Not all CPUs entered broadcast
exception handler

Question:
What are typical causes of this error? I've been using this X9SRL-F
board for years (since around 2011/2012) with no major issues until I
installed this USB-3 PCI-e card to use with 2 x 8TB USB-3 WD drives.

When I remove the card, the system _seems_ to boot properly every
time. Is there some sort of IRQ/interrupt issue when using this USB-3
PCI-e card with this motherboard, or is there a bug in the latest BIOS
that would cause this error? I also went back to the latest stable
5.3.x kernel and experienced the same issue there as well.

Thanks,

Justin.


Attachments:
kernel_error.jpg (139.64 kB)

2019-11-08 08:44:07

by Justin Piszcz

Subject: RE: 5.4-rc6 on Supermicro X9SRL-F - Kernel panic - not syncing: Timeout: Not all CPUs entered broadcast exception handler



-----Original Message-----
From: Justin Piszcz [mailto:[email protected]]
Sent: Monday, November 4, 2019 8:57 AM
To: LKML
Subject: 5.4-rc6 on Supermicro X9SRL-F - Kernel panic - not syncing: Timeout: Not all CPUs entered broadcast exception handler

Hello,

Kernel: 5.4-rc6
Arch: x86_64
Distro: Debian Testing

Problem: On occasion, every 4-5 reboots or so I get the following error
and the Linux kernel fails to boot (see attached screenshot of the
Linux console) when using a USB-3 PCI-e card.

[ .. ]

In case anyone reads this in the future: the root cause was a High Point USB 3.0 card (PCI-e 2.0, x4 width) in the machine, marked
RocketU 1144A/1144AM/1144AR on the PCB.

Issue - every 1-5 reboots, the kernel would fail to boot with: Kernel panic - not syncing: Timeout: Not all CPUs entered broadcast exception handler

After removing the card, the issue has not recurred (tested with 88 reboots via script).

Regards,

Justin.
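
The reboot test script mentioned above is not shown in the thread. For readers who want to repeat that kind of test, the following is only a minimal sketch of one possible approach, not the author's script. It assumes a systemd-based system where `systemctl reboot` is available, that the script is started as root on every boot (for example from a boot-time service), and that a hypothetical state file under /var/tmp may be used to count boots. If the kernel panics during boot, the script never runs, so the counter stops at the last successful boot.

```python
#!/usr/bin/env python3
"""Hypothetical reboot-loop counter; NOT the script used in the thread."""
import subprocess
import sys
from pathlib import Path

STATE_FILE = Path("/var/tmp/reboot_test.count")  # assumed location
TARGET_REBOOTS = 88  # figure taken from the message above


def read_count(path: Path) -> int:
    """Return the number of successful boots recorded so far (0 if none)."""
    try:
        return int(path.read_text().strip())
    except (FileNotFoundError, ValueError):
        return 0


def record_boot(path: Path) -> int:
    """Increment and persist the boot counter, returning the new value."""
    count = read_count(path) + 1
    path.write_text(f"{count}\n")
    return count


def should_reboot(count: int, target: int = TARGET_REBOOTS) -> bool:
    """Keep rebooting until the target number of clean boots is reached."""
    return count < target


def main() -> None:
    count = record_boot(STATE_FILE)
    print(f"boot {count}/{TARGET_REBOOTS} completed")
    if should_reboot(count):
        # Requires root; 'systemctl reboot' is assumed to be available.
        subprocess.run(["systemctl", "reboot"], check=True)


# Only act when explicitly asked to, so importing the file is harmless.
if __name__ == "__main__" and "--run" in sys.argv:
    main()
```

Run with `--run` at each boot; once the counter reaches the target, the loop ends on its own. A counter that stalls below the target points to a boot that never completed.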