Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
	id S933976AbbGUWI3 (ORCPT ); Tue, 21 Jul 2015 18:08:29 -0400
Received: from catern.com ([104.131.201.120]:58813 "EHLO mail.catern.com"
	rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP
	id S932148AbbGUWI0 (ORCPT ); Tue, 21 Jul 2015 18:08:26 -0400
From: Spencer Baugh
To: Don Zickus, Andrew Morton, Ulrich Obergfell, Ingo Molnar,
	Andrew Jones, chai wen, Chris Metcalf, Stephane Eranian,
	linux-kernel@vger.kernel.org (open list)
Cc: Joern Engel, Spencer Baugh, Joern Engel, Spencer Baugh
Subject: [PATCH] soft lockup: kill realtime threads before panic
Date: Tue, 21 Jul 2015 15:07:57 -0700
Message-Id: <1437516477-30554-5-git-send-email-sbaugh@catern.com>
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org
Content-Length: 1234
Lines: 37

From: Joern Engel

We have observed cases where the soft lockup detector triggered, but no
kernel bug existed.  Instead we had a buggy realtime thread that
monopolized a cpu.  So let's kill the responsible party and not panic
the entire system.

Signed-off-by: Joern Engel
Signed-off-by: Spencer Baugh
---
 kernel/watchdog.c | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)

diff --git a/kernel/watchdog.c b/kernel/watchdog.c
index a6ffa43..2355bd5 100644
--- a/kernel/watchdog.c
+++ b/kernel/watchdog.c
@@ -428,7 +428,10 @@ static enum hrtimer_restart watchdog_timer_fn(struct hrtimer *hrtimer)
 		}
 
 		add_taint(TAINT_SOFTLOCKUP, LOCKDEP_STILL_OK);
-		if (softlockup_panic)
+		if (rt_prio(current->prio)) {
+			pr_emerg("killing realtime thread\n");
+			send_sig(SIGILL, current, 0);
+		} else if (softlockup_panic)
 			panic("softlockup: hung tasks");
 		__this_cpu_write(soft_watchdog_warn, true);
 	} else
-- 
2.4.3

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/
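
For readers who want to reason about the new branch outside the kernel, the
decision the hunk above makes can be modelled in plain userspace C. This is
only a sketch under stated assumptions, not kernel code: the enum and
softlockup_action() are hypothetical names introduced here, and rt_prio() is
re-implemented on the assumption that it reduces to "prio < MAX_RT_PRIO"
with MAX_RT_PRIO equal to 100. In the patch itself the "kill" outcome is
send_sig(SIGILL, current, 0) preceded by a pr_emerg() message.

```c
#include <stdbool.h>

/* Assumption: mirrors the kernel convention that priority values below
 * MAX_RT_PRIO belong to realtime tasks. */
#define MAX_RT_PRIO 100

/* Hypothetical names for the three outcomes of the patched branch. */
enum lockup_action {
	ACTION_KILL_TASK,	/* realtime task: signal it, do not panic */
	ACTION_PANIC,		/* non-realtime task and softlockup_panic set */
	ACTION_WARN_ONLY,	/* non-realtime task, softlockup_panic clear */
};

/* Userspace stand-in for the kernel's rt_prio() helper (assumed form). */
static bool rt_prio(int prio)
{
	return prio < MAX_RT_PRIO;
}

/* Same ordering as the patch: the realtime check comes first, so a
 * realtime task never reaches the panic branch. */
static enum lockup_action softlockup_action(int prio, bool softlockup_panic)
{
	if (rt_prio(prio))
		return ACTION_KILL_TASK;
	if (softlockup_panic)
		return ACTION_PANIC;
	return ACTION_WARN_ONLY;
}
```

Note the consequence of the ordering: with softlockup_panic set, a realtime
task hogging the cpu is signalled rather than panicking the machine, while a
non-realtime task still takes the panic path exactly as before the patch.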