Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752089AbdHBNHs (ORCPT ); Wed, 2 Aug 2017 09:07:48 -0400 Received: from mail.kernel.org ([198.145.29.99]:57594 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751600AbdHBNHr (ORCPT ); Wed, 2 Aug 2017 09:07:47 -0400 DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 6842A22BF3 Authentication-Results: mail.kernel.org; dmarc=none (p=none dis=none) header.from=goodmis.org Authentication-Results: mail.kernel.org; spf=none smtp.mailfrom=rostedt@goodmis.org Date: Wed, 2 Aug 2017 09:07:44 -0400 From: Steven Rostedt To: Daniel Lezcano Cc: paulmck@linux.vnet.ibm.com, john.stultz@linaro.org, linux-kernel@vger.kernel.org, Pratyush Anand Subject: Re: RCU stall when using function_graph Message-ID: <20170802090744.6922e9e9@gandalf.local.home> In-Reply-To: <20170802124239.GD1919@mai> References: <20170801220405.GL3730@linux.vnet.ibm.com> <20170801201214.1e9c7d8e@gandalf.local.home> <20170802124239.GD1919@mai> X-Mailer: Claws Mail 3.14.0 (GTK+ 2.24.31; x86_64-pc-linux-gnu) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1585 Lines: 46 On Wed, 2 Aug 2017 14:42:39 +0200 Daniel Lezcano wrote: > On Tue, Aug 01, 2017 at 08:12:14PM -0400, Steven Rostedt wrote: > > On Wed, 2 Aug 2017 00:15:44 +0200 > > Daniel Lezcano wrote: > > > > > On 02/08/2017 00:04, Paul E. McKenney wrote: > > > >> Hi Paul, > > > >> > > > >> I have been trying to set the function_graph tracer for ftrace and each time I > > > >> get a CPU stall. > > > >> > > > >> How to reproduce: > > > >> ----------------- > > > >> > > > >> echo function_graph > /sys/kernel/debug/tracing/current_tracer > > > >> > > > >> This error appears with v4.13-rc3 and v4.12-rc6. > > > > Can you bisect this? It may be due to this commit: > > > > 0598e4f08 ("ftrace: Add use of synchronize_rcu_tasks() with dynamic trampolines") > > Hi Steve, > > I git bisected but each time the issue occured. I went through the different > version down to v4.4 where the board was not fully supported and it ended up to > have the same issue. > > Finally, I had the intuition it could be related to the wall time (there is no > RTC clock with battery on the board and the wall time is Jan 1st, 1970). > > Setting up the with ntpdate solved the problem. > > Even if it is rarely the case to have the time not set, is it normal to have a > RCU cpu stall ? > > BTW, function_graph tracer is the most invasive of the tracers. It's 4x slower than function tracer. I'm wondering if the tracer isn't the cause, but just slows things down enough to cause a some other race condition that triggers the bug. -- Steve