Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932187AbaJXQRc (ORCPT ); Fri, 24 Oct 2014 12:17:32 -0400 Received: from e9.ny.us.ibm.com ([32.97.182.139]:56107 "EHLO e9.ny.us.ibm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1756141AbaJXQRb (ORCPT ); Fri, 24 Oct 2014 12:17:31 -0400 Date: Fri, 24 Oct 2014 09:13:37 -0700 From: "Paul E. McKenney" To: Sasha Levin Cc: Dave Jones , Linux Kernel , htejun@gmail.com Subject: Re: rcu_preempt detected stalls. Message-ID: <20141024161337.GQ4977@linux.vnet.ibm.com> Reply-To: paulmck@linux.vnet.ibm.com References: <20141013173504.GA27955@redhat.com> <543DDD5E.9080602@oracle.com> <20141023183917.GX4977@linux.vnet.ibm.com> <54494F2F.6020005@oracle.com> <20141023195808.GB4977@linux.vnet.ibm.com> <544A45F8.2030207@oracle.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <544A45F8.2030207@oracle.com> User-Agent: Mutt/1.5.21 (2010-09-15) X-TM-AS-MML: disable X-Content-Scanned: Fidelis XPS MAILER x-cbid: 14102416-0033-0000-0000-000000D507F6 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Oct 24, 2014 at 08:28:40AM -0400, Sasha Levin wrote: > On 10/23/2014 03:58 PM, Paul E. McKenney wrote: > > On Thu, Oct 23, 2014 at 02:55:43PM -0400, Sasha Levin wrote: > >> > On 10/23/2014 02:39 PM, Paul E. McKenney wrote: > >>> > > On Tue, Oct 14, 2014 at 10:35:10PM -0400, Sasha Levin wrote: > >>>> > >> On 10/13/2014 01:35 PM, Dave Jones wrote: > >>>>> > >>> oday in "rcu stall while fuzzing" news: > >>>>> > >>> > >>>>> > >>> INFO: rcu_preempt detected stalls on CPUs/tasks: > >>>>> > >>> Tasks blocked on level-0 rcu_node (CPUs 0-3): P766 P646 > >>>>> > >>> Tasks blocked on level-0 rcu_node (CPUs 0-3): P766 P646 > >>>>> > >>> (detected by 0, t=6502 jiffies, g=75434, c=75433, q=0) > >>>> > >> > >>>> > >> I've complained about RCU stalls couple days ago (in a different context) > >>>> > >> on -next. I guess whatever causing them made it into Linus's tree? > >>>> > >> > >>>> > >> https://lkml.org/lkml/2014/10/11/64 > >>> > > > >>> > > And on that one, I must confess that I don't see where the RCU read-side > >>> > > critical section might be. > >>> > > > >>> > > Hmmm... Maybe someone forgot to put an rcu_read_unlock() somewhere. > >>> > > Can you reproduce this with CONFIG_PROVE_RCU=y? > >> > > >> > Paul, if that was directed to me - Yes, I see stalls with CONFIG_PROVE_RCU > >> > set and nothing else is showing up before/after that. > > Indeed it was directed to you. ;-) > > > > Does the following crude diagnostic patch turn up anything? > > Nope, seeing stalls but not seeing that pr_err() you added. OK, color me confused. Could you please send me the full dmesg or a pointer to it? Thanx, Paul > [ 5107.395916] INFO: rcu_preempt detected stalls on CPUs/tasks: > [ 5107.395916] 0: (776 ticks this GP) idle=a8d/140000000000002/0 softirq=16356/16356 last_accelerate: f5b7/55e5, nonlazy_posted: 24252, .. > [ 5107.395916] (detected by 1, t=20502 jiffies, g=13949, c=13948, q=0) > [ 5107.395916] Task dump for CPU 0: > [ 5107.395916] trinity-c0 R running task 12848 20357 9041 0x0008000e > [ 5107.395916] 0000000000000000 ffff88006bfd76c0 ffff88065722b988 ffffffffa10af964 > [ 5107.395916] ffff88065722b998 ffffffffa106ad23 ffff88065722b9c8 ffffffffa119dce5 > [ 5107.395916] 00000000001d76c0 ffff88006bfd76c0 00000000001d76c0 ffff8806473cbd10 > [ 5107.395916] Call Trace: > [ 5107.395916] [] ? kvm_clock_read+0x24/0x40 > [ 5107.395916] [] ? sched_clock+0x13/0x30 > [ 5107.395916] [] ? sched_clock_local+0x25/0x90 > [ 5107.395916] [] ? __slab_free+0xbb/0x3a0 > [ 5107.395916] [] ? debug_smp_processor_id+0x17/0x20 > [ 5107.395916] [] ? _raw_spin_unlock_irqrestore+0x64/0xa0 > [ 5107.395916] [] ? __slab_free+0xbb/0x3a0 > [ 5107.395916] [] ? __debug_check_no_obj_freed+0x10e/0x210 > [ 5107.395916] [] ? kmem_cache_free+0xb1/0x4f0 > [ 5107.395916] [] ? kmem_cache_free+0xc3/0x4f0 > [ 5107.395916] [] ? kmem_cache_free+0x3f2/0x4f0 > [ 5107.395916] [] ? unlink_anon_vmas+0x10e/0x180 > [ 5107.395916] [] ? unlink_anon_vmas+0x10e/0x180 > [ 5107.395916] [] ? free_pgtables+0x3f/0x130 > [ 5107.395916] [] ? exit_mmap+0xc4/0x180 > [ 5107.395916] [] ? __khugepaged_exit+0xbe/0x120 > [ 5107.395916] [] ? mmput+0x73/0x110 > [ 5107.395916] [] ? do_exit+0x2c7/0xd30 > [ 5107.395916] [] ? get_signal+0x3c9/0xaf0 > [ 5107.395916] [] ? debug_smp_processor_id+0x17/0x20 > [ 5107.395916] [] ? put_lock_stats.isra.13+0xe/0x30 > [ 5107.395916] [] ? _raw_spin_unlock_irq+0x30/0x70 > [ 5107.395916] [] ? do_group_exit+0x52/0xe0 > [ 5107.395916] [] ? get_signal+0x306/0xaf0 > [ 5107.395916] [] ? sched_clock_local+0x25/0x90 > [ 5107.395916] [] ? do_signal+0x20/0x130 > [ 5107.395916] [] ? context_tracking_user_exit+0x78/0x2d0 > [ 5107.395916] [] ? __this_cpu_preempt_check+0x13/0x20 > [ 5107.395916] [] ? trace_hardirqs_on_caller+0xfb/0x280 > [ 5107.395916] [] ? trace_hardirqs_on+0xd/0x10 > [ 5107.395916] [] ? do_notify_resume+0x69/0xb0 > [ 5107.395916] [] ? int_signal+0x12/0x17 > > > Thanks, > Sasha > -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/