From: Vince Weaver
Date: Thu, 16 Oct 2014 17:25:54 -0400 (EDT)
To: Vince Weaver
cc: "linux-kernel@vger.kernel.org", Peter Zijlstra, Paul Mackerras, Ingo Molnar, Arnaldo Carvalho de Melo
Subject: Re: perf: 3.17 another perf_fuzzer lockup

On Wed, 15 Oct 2014, Vince Weaver wrote:

> I can use sysrq to get the stack trace, the one CPU is stuck in a call
> to find_get_context().
>
> An example backtrace:
>
> [88200.300003]
> [88200.300003] [] ? ____cache_alloc+0x130/0x25b
> [88200.300003] [] ? __call_rcu.constprop.63+0x1bf/0x1cb
> [88200.300003] [] kfree_call_rcu+0x1a/0x1c
> [88200.300003] [] put_ctx+0x51/0x55
> [88200.300003] [] find_get_context+0x166/0x195
> [88200.300003] [] SYSC_perf_event_open+0x47b/0x7f5
> [88200.300003] [] SyS_perf_event_open+0xe/0x10
> [88200.300003] [] system_call_fastpath+0x16/0x1b
>
> It looks like the
>     else if (task->perf_event_ctxp[ctxn])
>         err = -EAGAIN;

It is indeed stuck there, waiting for task->perf_event_ctxp[1] to get set
to zero, which never happens.

As far as I can tell it's when a Software event is being opened. Still
struggling through the code trying to figure out what's going on.

[ 7071.252607] VMW: task->perf_event_ctxp[1]=ffff8800cb12ec00, EAGAIN, ref=1
[ 7071.259439] VMW: type=1 config=8
[ 7071.262713] VMW: task->perf_event_ctxp[1]=ffff8800cb12ec00, EAGAIN, ref=1
[ 7071.269506] VMW: task->perf_event_ctxp[1]=ffff8800cb12ec00, EAGAIN, ref=1
[ 7071.276299] VMW: task->perf_event_ctxp[1]=ffff8800cb12ec00, EAGAIN, ref=1
[ 7071.283087] VMW: task->perf_event_ctxp[1]=ffff8800cb12ec00, EAGAIN, ref=1
[ 7071.289879] VMW: task->perf_event_ctxp[1]=ffff8800cb12ec00, EAGAIN, ref=1
[ 7071.296671] VMW: task->perf_event_ctxp[1]=ffff8800cb12ec00, EAGAIN, ref=1
[ 7071.303457] VMW: task->perf_event_ctxp[1]=ffff8800cb12ec00, EAGAIN, ref=1
[ 7071.310248] VMW: task->perf_event_ctxp[1]=ffff8800cb12ec00, EAGAIN, ref=1
[ 7071.317035] VMW: task->perf_event_ctxp[1]=ffff8800cb12ec00, EAGAIN, ref=1
[ 7076.256032] find_get_context: 7246310 callbacks suppressed

Vince
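
For readers following along, here is a rough user-space model of the retry
pattern described in the message: the lookup of the existing context fails,
the slot is still non-NULL, so the code returns -EAGAIN and tries again. This
is only a sketch, not the kernel's find_get_context(). The names model_task,
lookup_succeeds, and model_find_get_context are hypothetical, and the
max_retries bound exists only so the model terminates; the loop in the report
has no such bound. Why the lookup fails in the real kernel is not established
in the message, so the model takes it as an input.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>

/* Hypothetical stand-in for a task's per-type context slots. */
struct model_task {
    void *ctxp[2];        /* slot 1 is the one named in the log lines */
    int lookup_succeeds;  /* assumption: can the existing context be taken? */
};

/*
 * Model of the retry loop. Returns 0 once a context is obtained or
 * installed, or -EAGAIN when the (artificial) retry budget runs out.
 * *retries reports how many times the loop went around.
 */
static int model_find_get_context(struct model_task *task, int ctxn,
                                  long max_retries, long *retries)
{
    static int fresh_ctx; /* stands in for a newly allocated context */

    assert(ctxn >= 0 && ctxn < 2);
    *retries = 0;

    for (;;) {
        /* Step 1: try to take the context that is already installed. */
        if (task->ctxp[ctxn] != NULL && task->lookup_succeeds)
            return 0;

        /* Step 2: the slot is empty, so install a fresh context. */
        if (task->ctxp[ctxn] == NULL) {
            task->ctxp[ctxn] = &fresh_ctx;
            return 0;
        }

        /*
         * Step 3: the slot is occupied but the lookup did not yield it.
         * This is the -EAGAIN case; nothing here changes the slot, so
         * every later pass takes the same path.
         */
        if (++*retries >= max_retries)
            return -EAGAIN; /* model-only escape hatch */
    }
}
```

Calling model_find_get_context() on a task whose slot 1 is non-NULL and whose
lookup_succeeds is 0 burns through the whole retry budget and returns -EAGAIN,
which matches the repeated "EAGAIN" lines in the log. With an empty slot it
returns 0 on the first pass.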