Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752967AbcD0QJL (ORCPT ); Wed, 27 Apr 2016 12:09:11 -0400 Received: from mail-wm0-f50.google.com ([74.125.82.50]:35256 "EHLO mail-wm0-f50.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751103AbcD0QJJ (ORCPT ); Wed, 27 Apr 2016 12:09:09 -0400 Date: Wed, 27 Apr 2016 18:09:05 +0200 From: Frederic Weisbecker To: Arnaldo Carvalho de Melo Cc: Alexei Starovoitov , David Ahern , Peter Zijlstra , Ingo Molnar , Adrian Hunter , Brendan Gregg , Alexander Shishkin , Alexei Starovoitov , He Kuang , Jiri Olsa , Masami Hiramatsu , Milian Wolff , Namhyung Kim , Stephane Eranian , Thomas Gleixner , Vince Weaver , Wang Nan , Zefan Li , Linux Kernel Mailing List Subject: Re: [PATCH/RFC v3] perf core: Allow setting up max frame stack depth via sysctl Message-ID: <20160427160903.GA31971@lerouge> References: <20160425162706.GB25218@kernel.org> <20160425192229.GC25218@kernel.org> <20160425200646.GA23875@ast-mbp.thefacebook.com> <20160425201750.GD25218@kernel.org> <20160425215947.GA25915@ast-mbp.thefacebook.com> <20160425234138.GA16708@kernel.org> <20160426000724.GA28705@ast-mbp.thefacebook.com> <20160426002928.GB16708@kernel.org> <20160426215806.GA15680@lerouge> <20160427125358.GM11033@kernel.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20160427125358.GM11033@kernel.org> User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2127 Lines: 49 On Wed, Apr 27, 2016 at 09:53:58AM -0300, Arnaldo Carvalho de Melo wrote: > Em Tue, Apr 26, 2016 at 11:58:10PM +0200, Frederic Weisbecker escreveu: > > On Mon, Apr 25, 2016 at 09:29:28PM -0300, Arnaldo Carvalho de Melo wrote: > > > commit cd544af4f7fede01cb512d52bb3efe62aa19271d > > > Author: Arnaldo Carvalho de Melo > > > Date: Thu Apr 21 12:28:50 2016 -0300 > > > > > > perf core: Allow setting up max frame stack depth via sysctl > > > > > > The default remains 127, which is good for most cases, and not even hit > > > most of the time, but then for some cases, as reported by Brendan, 1024+ > > > deep frames are appearing on the radar for things like groovy, ruby. > > > > > > And in some workloads putting a _lower_ cap on this may make sense. One > > > that is per event still needs to be put in place tho. > > > > > > The new file is: > > > > > > # cat /proc/sys/kernel/perf_event_max_stack > > > 127 > > > > > > Chaging it: > > > > > > # echo 256 > /proc/sys/kernel/perf_event_max_stack > > > # cat /proc/sys/kernel/perf_event_max_stack > > > 256 > > > > > > But as soon as there is some event using callchains we get: > > > > > > # echo 512 > /proc/sys/kernel/perf_event_max_stack > > > -bash: echo: write error: Device or resource busy > > > # > > > > > > Because we only allocate the callchain percpu data structures when there > > > is a user, which allows for changing the max easily, its just a matter > > > of having no callchain users at that point. > > > > > > Reported-and-Tested-by: Brendan Gregg > > > Acked-by: Alexei Starovoitov > > > Acked-by: David Ahern > > > > Reviewed-by: Frederic Weisbecker > > > > I first thought that this should be a tunable per event instead of a global sysctl > > Yeah, I'll work on that too. There is no rush though. The sysfs limit will probably be enough for most users. Unless someone requested it?