Date: Mon, 3 Aug 2015 17:55:35 -0400
From: Dave Jones
To: "Paul E. McKenney"
Cc: Linux Kernel, Josh Triplett, sasha.levin@oracle.com, fweisbec@gmail.com
Subject: Re: 4.2-rc5 rcu stalls.
Message-ID: <20150803215535.GA13717@codemonkey.org.uk>
References: <20150803210835.GA4467@codemonkey.org.uk> <20150803213723.GN27280@linux.vnet.ibm.com>
In-Reply-To: <20150803213723.GN27280@linux.vnet.ibm.com>

On Mon, Aug 03, 2015 at 02:37:23PM -0700, Paul E. McKenney wrote:
 > On Mon, Aug 03, 2015 at 05:08:35PM -0400, Dave Jones wrote:
 > > [ 2120.855128] Tasks blocked on level-0 rcu_node (CPUs 0-3): P1497
 > > [ 2120.855263] (detected by 2, t=65002 jiffies, g=78835, c=78834, q=0)
 > > [ 2120.855403] trinity-watchdo R running task 14336 1497 1496 0x00080000
 > > [ 2120.855563] ffff8804b94e3c88 ffffffffa17fa0b0 ffff8805010a1b40 ffff8804f58b51c0
 > > [ 2120.855728] ffff8805010a1b40 ffff8804b94e3c78 0000000000000000 ffff8804b94e4000
 > > [ 2120.855893] 0000000000000001 0000000000000001 0000000000000002 ffff8804b94e3ca8
 > > [ 2120.856062] Call Trace:
 > > [ 2120.856116] [] ? preempt_schedule_irq+0x40/0xa0
 > > [ 2120.856252] [] preempt_schedule_irq+0x46/0xa0
 >
 > Sasha Levin (CCed) was having roughly similar stalls, and found that
 > reverting b30f0e3ffedf (sched/preempt: Optimize preemption operations
 > on __schedule() callers) made the stalls go away. Does that help in
 > your case?

Ugh, that doesn't revert cleanly. Got something handy?

The curious thing is why I've only just started seeing these.
I've had that machine for a month now, so it's been fuzzing since ~4.1,
and until yesterday I'd not seen this at all.

	Dave