Date: Fri, 19 Jul 2013 10:53:23 -0400
From: Dave Jones
To: Linux Kernel
Cc: linux-mm@kvack.org, paulmck@linux.vnet.ibm.com
Subject: mlockall triggered rcu_preempt stall.
Message-ID: <20130719145323.GA1903@redhat.com>

My fuzz tester keeps hitting this. Every instance shows the non-irq stack
came in from mlockall. I'm only seeing this on one box, but that has more
RAM (8GB) than my other machines, which might explain it.

	Dave

INFO: rcu_preempt self-detected stall on CPU { 3}  (t=6500 jiffies g=470344 c=470343 q=0)
sending NMI to all CPUs:
NMI backtrace for cpu 3
CPU: 3 PID: 29664 Comm: trinity-child2 Not tainted 3.11.0-rc1+ #32
task: ffff88023e743fc0 ti: ffff88022f6f2000 task.ti: ffff88022f6f2000
RIP: 0010:[]  [] trace_hardirqs_off_caller+0x21/0xb0
RSP: 0018:ffff880244e03c30  EFLAGS: 00000046
RAX: ffff88023e743fc0 RBX: 0000000000000001 RCX: 000000000000003c
RDX: 000000000000000f RSI: 0000000000000004 RDI: ffffffff81033cab
RBP: ffff880244e03c38 R08: ffff880243288a80 R09: 0000000000000001
R10: 0000000000000000 R11: 0000000000000001 R12: ffff880243288a80
R13: ffff8802437eda40 R14: 0000000000080000 R15: 000000000000d010
FS:  00007f50ae33b740(0000) GS:ffff880244e00000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 000000000097f000 CR3: 0000000240fa0000 CR4: 00000000001407e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000600
Stack:
 ffffffff810bf86d ffff880244e03c98 ffffffff81033cab 0000000000000096
 000000000000d008 0000000300000002 0000000000000004 0000000000000003
 0000000000002710 ffffffff81c50d00 ffffffff81c50d00 ffff880244fcde00
Call Trace:
 [] ? trace_hardirqs_off+0xd/0x10
 [] __x2apic_send_IPI_mask+0x1ab/0x1c0
 [] x2apic_send_IPI_all+0x1c/0x20
 [] arch_trigger_all_cpu_backtrace+0x65/0xa0
 [] rcu_check_callbacks+0x331/0x8e0
 [] ? hrtimer_run_queues+0x20/0x180
 [] ? sched_clock_cpu+0xb5/0x100
 [] update_process_times+0x47/0x80
 [] tick_sched_handle.isra.16+0x25/0x60
 [] tick_sched_timer+0x41/0x60
 [] __run_hrtimer+0x81/0x4e0
 [] ? tick_sched_do_timer+0x60/0x60
 [] hrtimer_interrupt+0xff/0x240
 [] local_apic_timer_interrupt+0x34/0x60
 [] smp_apic_timer_interrupt+0x3f/0x60
 [] apic_timer_interrupt+0x6f/0x80
 [] ? retint_restore_args+0xe/0xe
 [] ? __do_softirq+0xb1/0x440
 [] irq_exit+0xcd/0xe0
 [] smp_apic_timer_interrupt+0x45/0x60
 [] apic_timer_interrupt+0x6f/0x80
 [] ? retint_restore_args+0xe/0xe
 [] ? wait_for_completion_killable+0x170/0x170
 [] ? preempt_schedule_irq+0x53/0x90
 [] retint_kernel+0x26/0x30
 [] ? queue_work_on+0x43/0x90
 [] schedule_on_each_cpu+0xc9/0x1a0
 [] ? lru_add_drain+0x50/0x50
 [] lru_add_drain_all+0x15/0x20
 [] SyS_mlockall+0xa5/0x1a0
 [] tracesys+0xdd/0xe2
Code: 5d c3 0f 1f 84 00 00 00 00 00 44 8b 1d 29 73 bd 00 65 48 8b 04 25 00 ba 00 00 45 85 db 74 69 44 8b 90 a4 06 00 00 45 85 d2 75 5d <44> 8b 0d a0 47 00 01 45 85 c9 74 33 44 8b 80 70 06 00 00 45 85
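
For anyone checking the claim that the non-irq stack came in from mlockall: in x86 call traces a "?" in front of a symbol marks an address that was found on the stack but not verified as part of the call chain, so those entries can be skipped when reading the path. The following is only a rough Python sketch of that filtering, not part of any kernel tooling; the helper name reliable_frames and the TAIL sample (copied from the non-irq tail of the trace above) are illustrative assumptions.

```python
import re

# Matches one call-trace entry such as "[] ? queue_work_on+0x43/0x90".
# Group 1 is the optional "?" (unreliable) marker, group 2 the symbol name.
FRAME_RE = re.compile(
    r"^\s*(?:\[[^\]]*\]\s*)?(\?\s*)?([A-Za-z_][\w.]*)\+0x[0-9a-fA-F]+/0x[0-9a-fA-F]+\s*$"
)


def reliable_frames(trace_lines):
    """Return symbol names of entries that are NOT marked with '?'.

    Hypothetical helper for illustration; lines that do not look like
    call-trace entries are ignored.
    """
    frames = []
    for line in trace_lines:
        match = FRAME_RE.match(line)
        if match and not match.group(1):
            frames.append(match.group(2))
    return frames


# Non-irq tail of the trace from this report, innermost entry first.
TAIL = [
    "[] retint_kernel+0x26/0x30",
    "[] ? queue_work_on+0x43/0x90",
    "[] schedule_on_each_cpu+0xc9/0x1a0",
    "[] ? lru_add_drain+0x50/0x50",
    "[] lru_add_drain_all+0x15/0x20",
    "[] SyS_mlockall+0xa5/0x1a0",
    "[] tracesys+0xdd/0xe2",
]

if __name__ == "__main__":
    # Print the verified entries, innermost first.
    print(" <- ".join(reliable_frames(TAIL)))
```

With the "?" entries dropped, the remaining path reads from tracesys through SyS_mlockall, lru_add_drain_all and schedule_on_each_cpu up to the point where the timer interrupt arrived.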