2024-04-04 07:44:42

by Paul Menzel

Subject: NOHZ tick-stop error: local softirq work is pending, handler #08!!! on Dell XPS 13 9360

Dear Linux folks,


On Dell XPS 13 9360/0596KF, BIOS 2.21.0 06/02/2022, with Linux 6.9-rc2+
built from commit b1e6ec0a0fd0 (Merge tag 'docs-6.9-fixes' of
git://git.lwn.net/linux) the external USB-C adapter Dell DA300 stopped
working (only the Ethernet port was used). Linux logged:

[ 0.000000] DMI: Dell Inc. XPS 13 9360/0596KF, BIOS 2.21.0 06/02/2022
[…]
[36790.763879] r8152 4-1.2:1.0 enx18dbf22dccf3: carrier on
[37100.727070] NOHZ tick-stop error: local softirq work is pending, handler #08!!!
[37115.053932] NOHZ tick-stop error: local softirq work is pending, handler #08!!!
[37595.306954] NOHZ tick-stop error: local softirq work is pending, handler #08!!!
[37635.242523] NOHZ tick-stop error: local softirq work is pending, handler #08!!!
[38056.236277] xhci_hcd 0000:39:00.0: WARN: xHC restore state timeout
[38056.236298] xhci_hcd 0000:39:00.0: PCI post-resume error -110!
[38056.236302] xhci_hcd 0000:39:00.0: HC died; cleaning up
[38056.236327] usb 3-1: USB disconnect, device number 2
[38056.236331] usb 3-1.1: USB disconnect, device number 3
[38056.236464] usb 4-1: USB disconnect, device number 2
[38056.236467] r8152-cfgselector 4-1.2: USB disconnect, device number 3
[38183.298119] INFO: task kworker/1:2:36399 blocked for more than 120 seconds.
[38183.298148] Not tainted 6.9.0-rc2+ #88
[38183.298159] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[38183.298166] task:kworker/1:2 state:D stack:0 pid:36399 tgid:36399 ppid:2 flags:0x00004000
[38183.298191] Workqueue: usb_hub_wq hub_event [usbcore]
[38183.298332] Call Trace:
[38183.298339] <TASK>
[38183.298353] __schedule+0x3eb/0xb40
[38183.298386] schedule+0x27/0xf0
[38183.298402] schedule_preempt_disabled+0x15/0x30
[38183.298419] __mutex_lock.constprop.0+0x339/0x630
[38183.298451] usb_unlocked_enable_lpm+0x23/0x40 [usbcore]
[38183.298561] usb_unbind_interface+0x1c3/0x280 [usbcore]
[38183.298696] device_release_driver_internal+0x19f/0x200
[38183.298723] bus_remove_device+0xc6/0x130
[38183.298742] device_del+0x163/0x3f0
[38183.298760] ? kobject_put+0xa0/0x1d0
[38183.298782] usb_disable_device+0xcd/0x1e0 [usbcore]
[38183.298914] usb_disconnect+0xe6/0x2e0 [usbcore]
[38183.299027] usb_disconnect+0xca/0x2e0 [usbcore]
[38183.299139] hub_quiesce+0x5e/0xc0 [usbcore]
[38183.299248] hub_event+0x5bc/0x1bf0 [usbcore]
[38183.299365] ? psi_task_switch+0xd6/0x230
[38183.299378] ? finish_task_switch.isra.0+0x97/0x2c0
[38183.299399] ? __schedule+0x3f3/0xb40
[38183.299416] process_one_work+0x189/0x3b0
[38183.299441] worker_thread+0x277/0x390
[38183.299462] ? __pfx_worker_thread+0x10/0x10
[38183.299479] kthread+0xcf/0x100
[38183.299496] ? __pfx_kthread+0x10/0x10
[38183.299512] ret_from_fork+0x31/0x50
[38183.299527] ? __pfx_kthread+0x10/0x10
[38183.299540] ret_from_fork_asm+0x1a/0x30
[38183.299566] </TASK>
[38183.299574] INFO: task kworker/1:1:36539 blocked for more than 120 seconds.
[38183.299586] Not tainted 6.9.0-rc2+ #88
[38183.299594] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[38183.299601] task:kworker/1:1 state:D stack:0 pid:36539 tgid:36539 ppid:2 flags:0x00004000
[38183.299620] Workqueue: usb_hub_wq hub_event [usbcore]
[…]


After I re-plugged the adapter, it worked again.


Kind regards,

Paul
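
(Editorial aside on reading the message above.) As far as I understand
kernel/time/tick-sched.c, the number after "handler #" is the pending
softirq bitmask printed in hex, not a vector index. Assuming the vector
order from include/linux/interrupt.h (the same order as the rows of
/proc/softirqs), #08 would be bit 3, i.e. NET_RX, which would fit the
Ethernet traffic through the r8152 adapter. A minimal Python sketch under
those assumptions; the helper names are made up for illustration:

```python
import re

# Softirq vector names in bit order. Assumed to match the enum in
# include/linux/interrupt.h (and the row order of /proc/softirqs).
SOFTIRQ_NAMES = ["HI", "TIMER", "NET_TX", "NET_RX", "BLOCK",
                 "IRQ_POLL", "TASKLET", "SCHED", "HRTIMER", "RCU"]

# Matches the hex mask in the NOHZ warning; \s* tolerates wrapped log lines.
_HANDLER_RE = re.compile(r"softirq work is pending,\s*handler #([0-9a-fA-F]+)")


def decode_softirq_mask(mask):
    """Return the names of all softirq vectors whose bit is set in mask."""
    return [name for bit, name in enumerate(SOFTIRQ_NAMES) if mask & (1 << bit)]


def decode_log_line(line):
    """Extract the hex mask from a NOHZ tick-stop message and decode it.

    Returns None if the line is not such a message.
    """
    match = _HANDLER_RE.search(line)
    if match is None:
        return None
    return decode_softirq_mask(int(match.group(1), 16))


if __name__ == "__main__":
    line = ("[37100.727070] NOHZ tick-stop error: local softirq work is "
            "pending, handler #08!!!")
    print(decode_log_line(line))  # ['NET_RX']
```

A mask with several bits set decodes to several names, so the same helper
works for other values of this warning.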


Attachments:
20240404--dell-xps-13-9360--linux-6.9-rc2+--messages.txt (134.67 kB)
config-6.9.0-rc2+ (196.81 kB)

2024-04-08 10:11:47

by Anna-Maria Behnsen

Subject: Re: NOHZ tick-stop error: local softirq work is pending, handler #08!!! on Dell XPS 13 9360

Hi Paul,

Paul Menzel writes:

> Dear Linux folks,
>
>
> On Dell XPS 13 9360/0596KF, BIOS 2.21.0 06/02/2022, with Linux 6.9-rc2+
> built from commit b1e6ec0a0fd0 (Merge tag 'docs-6.9-fixes' of
> git://git.lwn.net/linux) the external USB-C adapter Dell DA300 stopped
> working (only the Ethernet port was used). Linux logged:

Thanks for the report. Can you please provide a trace in addition to the dmesg
output? The following trace events should be enabled (via kernel command
line):

trace_event=timer:*,timer_migration:*,sched:sched_switch,sched:sched_wakeup,sched:sched_process_hang,irq:softirq_entry,irq:softirq_raise,irq:softirq_exit

Thanks,

Anna-Maria
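
(Editorial aside.) Before rebooting with the trace_event= list above, it
can help to check that the running kernel actually provides each requested
event. The sketch below assumes tracefs is mounted at /sys/kernel/tracing
(older setups may use /sys/kernel/debug/tracing instead) and that each
event appears as events/<system>/<event>; the function name
missing_events is hypothetical:

```python
import os

# The event list requested in this thread, as passed on the kernel command line.
REQUESTED = ("timer:*,timer_migration:*,sched:sched_switch,sched:sched_wakeup,"
             "sched:sched_process_hang,irq:softirq_entry,irq:softirq_raise,"
             "irq:softirq_exit")


def missing_events(spec, events_dir="/sys/kernel/tracing/events"):
    """Return the entries of a trace_event= list with no matching directory."""
    missing = []
    for entry in spec.split(","):
        system, _, event = entry.partition(":")
        path = os.path.join(events_dir, system)
        # "system:*" only needs the subsystem directory to exist.
        if event != "*":
            path = os.path.join(path, event)
        if not os.path.isdir(path):
            missing.append(entry)
    return missing


if __name__ == "__main__":
    print("trace_event=" + REQUESTED)
    print("not found:", missing_events(REQUESTED))
```

Reading tracefs usually requires root; without sufficient permissions
every entry is reported as not found.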

2024-04-08 11:20:25

by Paul Menzel

Subject: Re: NOHZ tick-stop error: local softirq work is pending, handler #08!!! on Dell XPS 13 9360

Dear Anna-Maria,


Thank you for your response.


Am 08.04.24 um 12:10 schrieb Anna-Maria Behnsen:

> Paul Menzel writes:

>> On Dell XPS 13 9360/0596KF, BIOS 2.21.0 06/02/2022, with Linux 6.9-rc2+
>> built from commit b1e6ec0a0fd0 (Merge tag 'docs-6.9-fixes' of
>> git://git.lwn.net/linux) the external USB-C adapter Dell DA300 stopped
>> working (only the Ethernet port was used). Linux logged:
>
> Thanks for the report. Can you please provide a trace in addition to the dmesg
> output? The following trace events should be enabled (via kernel command
> line):
>
> trace_event=timer:*,timer_migration:*,sched:sched_switch,sched:sched_wakeup,sched:sched_process_hang,irq:softirq_entry,irq:softirq_raise,irq:softirq_exit

Unfortunately, I have not been able to reproduce it so far. Should it
happen again, I will try your suggestion.


Kind regards,

Paul

2024-04-09 07:57:53

by Anna-Maria Behnsen

Subject: Re: NOHZ tick-stop error: local softirq work is pending, handler #08!!! on Dell XPS 13 9360

Paul Menzel writes:

> Dear Anna-Maria,
>
>
> Thank you for your response.
>
>
> Am 08.04.24 um 12:10 schrieb Anna-Maria Behnsen:
>
>> Paul Menzel writes:
>
>>> On Dell XPS 13 9360/0596KF, BIOS 2.21.0 06/02/2022, with Linux 6.9-rc2+
>>> built from commit b1e6ec0a0fd0 (Merge tag 'docs-6.9-fixes' of
>>> git://git.lwn.net/linux) the external USB-C adapter Dell DA300 stopped
>>> working (only the Ethernet port was used). Linux logged:
>>
>> Thanks for the report. Can you please provide a trace in addition to the dmesg
>> output? The following trace events should be enabled (via kernel command
>> line):
>>
>> trace_event=timer:*,timer_migration:*,sched:sched_switch,sched:sched_wakeup,sched:sched_process_hang,irq:softirq_entry,irq:softirq_raise,irq:softirq_exit
>
> Unfortunately, I have not been able to reproduce it so far. Should it
> happen again, I will try your suggestion.
>

Thanks for letting me know.

Thanks,

Anna-Maria