Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751861AbaLSEDs (ORCPT ); Thu, 18 Dec 2014 23:03:48 -0500 Received: from mx1.redhat.com ([209.132.183.28]:48868 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751387AbaLSEDq (ORCPT ); Thu, 18 Dec 2014 23:03:46 -0500 Date: Thu, 18 Dec 2014 23:03:08 -0500 From: Dave Jones To: Linus Torvalds , Chris Mason , Mike Galbraith , Ingo Molnar , Peter Zijlstra , =?iso-8859-1?Q?D=E2niel?= Fraga , Sasha Levin , "Paul E. McKenney" , Linux Kernel Mailing List , Suresh Siddha , Oleg Nesterov , Peter Anvin Subject: Re: frequent lockups in 3.18rc4 Message-ID: <20141219040308.GB20022@redhat.com> Mail-Followup-To: Dave Jones , Linus Torvalds , Chris Mason , Mike Galbraith , Ingo Molnar , Peter Zijlstra , =?iso-8859-1?Q?D=E2niel?= Fraga , Sasha Levin , "Paul E. McKenney" , Linux Kernel Mailing List , Suresh Siddha , Oleg Nesterov , Peter Anvin References: <20141215055707.GA26225@redhat.com> <20141218051327.GA31988@redhat.com> <1418918059.17358.6@mail.thefacebook.com> <20141218161230.GA6042@redhat.com> <20141219024549.GB1671@redhat.com> <20141219035859.GA20022@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20141219035859.GA20022@redhat.com> User-Agent: Mutt/1.5.23 (2014-03-12) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Dec 18, 2014 at 10:58:59PM -0500, Dave Jones wrote: > > lock debugging and other overheads (does this still have > > DEBUG_PAGEALLOC?) you really are getting into a "real" softlockup > > because things are scaling so horribly badly. > > > > If you now disable spinlock debugging and lockdep, hopefully that page > > table lock now doesn't always get hung up on the lockdep locking, so > > it starts scaling much better, and maybe you'd not see this... > > I can give it a shot. Hopefully there's some further mitigation that > could be done to allow a workload like this to survive under a debug > build though, as we've caught *so many* bugs with this stuff in the past. Turns out also that this build didn't have PROVE_LOCKING enabled. CONFIG_LOCKDEP was, but that just bloats the structures a little, and afaik doesn't incur the same runtime overhead. I also forgot to answer the question above, PAGEALLOC is also off. So the only thing that was on that could cause spinlock overhead was DEBUG_SPINLOCK (and LOCK_STAT, though iirc that's not huge either) Dave -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/