Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754704AbaKSE7c (ORCPT ); Tue, 18 Nov 2014 23:59:32 -0500 Received: from mx1.redhat.com ([209.132.183.28]:43122 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754233AbaKSE7b (ORCPT ); Tue, 18 Nov 2014 23:59:31 -0500 Date: Tue, 18 Nov 2014 23:59:17 -0500 From: Dave Jones To: Linus Torvalds Cc: Don Zickus , Thomas Gleixner , Linux Kernel , the arch/x86 maintainers Subject: Re: frequent lockups in 3.18rc4 Message-ID: <20141119045917.GA17741@redhat.com> Mail-Followup-To: Dave Jones , Linus Torvalds , Don Zickus , Thomas Gleixner , Linux Kernel , the arch/x86 maintainers References: <20141118020959.GA2091@redhat.com> <20141118023930.GA2871@redhat.com> <20141118145234.GA7487@redhat.com> <20141118215540.GD35311@redhat.com> <20141119021902.GA14216@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.23 (2014-03-12) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Nov 18, 2014 at 08:40:55PM -0800, Linus Torvalds wrote: > On Tue, Nov 18, 2014 at 6:19 PM, Dave Jones wrote: > > > > NMI watchdog: BUG: soft lockup - CPU#2 stuck for 21s! [trinity-c42:31480] > > CPU: 2 PID: 31480 Comm: trinity-c42 Not tainted 3.18.0-rc5+ #91 [loadavg: 174.61 150.35 148.64 9/411 32140] > > RIP: 0010:[] [] context_tracking_user_enter+0xa4/0x190 > > Call Trace: > > [] syscall_trace_leave+0xa5/0x160 > > [] int_check_syscall_exit_work+0x34/0x3d > > Hmm, if we are getting soft-lockups here, maybe it suggest too much exit-work. > > Some TIF_NOHZ loop, perhaps? You have nohz on, don't you? I do. > That makes me wonder: does the problem go away if you disable NOHZ? I'll give it a try, and see what falls out overnight. Dave -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/