Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751804AbdF1LlG (ORCPT ); Wed, 28 Jun 2017 07:41:06 -0400 Received: from mx2.suse.de ([195.135.220.15]:46060 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1751506AbdF1LlE (ORCPT ); Wed, 28 Jun 2017 07:41:04 -0400 Date: Wed, 28 Jun 2017 13:40:59 +0200 From: Michal Hocko To: kan.liang@intel.com Cc: linux-kernel@vger.kernel.org, dzickus@redhat.com, mingo@kernel.org, akpm@linux-foundation.org, babu.moger@oracle.com, atomlin@redhat.com, prarit@redhat.com, torvalds@linux-foundation.org, peterz@infradead.org, tglx@linutronix.de, eranian@google.com, acme@redhat.com, ak@linux.intel.com, stable@vger.kernel.org Subject: Re: [PATCH] kernel/watchdog: fix spurious hard lockups Message-ID: <20170628114058.GB5234@dhcp22.suse.cz> References: <20170620213309.30051-1-kan.liang@intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20170620213309.30051-1-kan.liang@intel.com> User-Agent: Mutt/1.5.23 (2014-03-12) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 641 Lines: 16 On Tue 20-06-17 14:33:09, kan.liang@intel.com wrote: > From: Kan Liang > > Some users reported spurious NMI watchdog timeouts. > > We now have more and more systems where the Turbo range is wide enough > that the NMI watchdog expires faster than the soft watchdog timer that > updates the interrupt tick the NMI watchdog relies on. AFAIR the watchdog doesn't rely on deferred timers so this would suggest that a standard hrtimer can expire much later than programmed, right? If that is the case how come other parts of the system do not break. We do rely on hrtimers on many other places? -- Michal Hocko SUSE Labs