Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp576867imu; Tue, 27 Nov 2018 17:39:52 -0800 (PST) X-Google-Smtp-Source: AJdET5fPSCo5pc2tT5bBrB+Hl7D9UnUspMchx4QpE6L4qLbfrrPGDA3s3RwBjhNZCNPKAlvBv1aT X-Received: by 2002:a62:4e49:: with SMTP id c70mr34942135pfb.167.1543369192214; Tue, 27 Nov 2018 17:39:52 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1543369192; cv=none; d=google.com; s=arc-20160816; b=mmHyIvmcr8ipgXS9SihukQL52NnqH4FT4CYoS4gbVUzyRN36YKOy4MMXtJza81TEjn YlhyY01s+xKHt1CBn/go4ZESG4/0uq5kIvCH53adVllhowtTTNNn1xKA3SzRVJ/LknpP s1afvAGsvSd6XvqZcrf4Cl9jaDP55DE1R4a/LF4W3WpnwzJDjlgOL6ZgubpdytfM9fXP b8JcdDPoK50Wm4ciHBHwDw+ARmsvMP4URINXXxPRq2Hu4UsUorEDLQBzrsDJ23N9n2wb jYhg94ABcANIwUTLYHLokyfo/HSFH8b6fuXqmDhzL7sQ5CVcHsDke2kbb+4I4gVvEzLd HH3w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature; bh=JNdihKxWNgXN1sI5WjZ4i4dNcNiAZif9zCWX2xjiSu8=; b=s5lDueKT35LdpQ5DsnbKYYk89iJbe+j8M8+0lg1IGQh56CvAyWUEdUzbLdWYDRc7EQ JlaQ+x5oe4zkgQvGyzWU7fS2I5SU1LHUQ2b6MjNJwwVlTpa7IZ4kOXU+3lBVGLv01CEd Ll2ju2JUO/0Uwu+M+/D8PZu7SB513QSQ8zjFAhQ4Qw5ZdvyHkmXCxDKtLPax0hIwDrjn /OLMWUXSnqIKjZ9nZH2lm4zb7zqEMVBXBzhnqYA0NHZs33+u036CtGQCNV+XwE4Bldvb +wim0Vgtets7B6JC/l45uoK5ncNtNfpjQSFrCDPi/XpQQNYMJyVbic7dQvXRcOb+pwrv h52w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@joelfernandes.org header.s=google header.b=DUA6ulg6; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id w8si5713734pgm.467.2018.11.27.17.39.34; Tue, 27 Nov 2018 17:39:52 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@joelfernandes.org header.s=google header.b=DUA6ulg6; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726957AbeK1Mim (ORCPT + 99 others); Wed, 28 Nov 2018 07:38:42 -0500 Received: from mail-pf1-f195.google.com ([209.85.210.195]:36183 "EHLO mail-pf1-f195.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726539AbeK1Mil (ORCPT ); Wed, 28 Nov 2018 07:38:41 -0500 Received: by mail-pf1-f195.google.com with SMTP id b85so9329942pfc.3 for ; Tue, 27 Nov 2018 17:38:53 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=joelfernandes.org; s=google; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=JNdihKxWNgXN1sI5WjZ4i4dNcNiAZif9zCWX2xjiSu8=; b=DUA6ulg6Suoju66XnirkdvGc1/8jaruCcaQTcjd2zLW+ZEMwJaC/Y3MzmH/gME07/L XAJ0TRBU1SyNm2EcqmGrKA+eYFMeUshjM+ZpFzo5LGo4HyTVPekHDx3cCjpn7BCmNF+V whWO14eUSg87OMT5Q8ON/CtRT856seK7igp7E= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to:user-agent; bh=JNdihKxWNgXN1sI5WjZ4i4dNcNiAZif9zCWX2xjiSu8=; b=L/E/aVsL7a+XWIzFOEP4ymycE2P19OM7ViY3MHnkIrBjAxNvb5kUohYHM3t6fuOP8A odXxWFKptSWwNkxEnczTMVTzuroL57SYJmgONfVrt1TnptExbLdUwUPNLfab+1XHEWdl KvQXAGTqsQOcDET3IvzZ2H7JKjPglpQiiy2j3G7H1hb3sEL0ZBbjt+NAnGGLTLtSQVp+ ncFt1cbLYTpMt/fUBpye/jRN2RN5hBtzO9wv9by2GvxkQxU15sZU0luQ8sv52GmLosND SBCjfD6YfeUeT+/xXS5XLrAWa5Z0ehIR2wKOqhS3xmWg8TFqrTqIxyV6Bre64eA0Izwt TYjw== X-Gm-Message-State: AA+aEWaJtifH4n6cG/4CW5o5nGfwKUf+FsJCZ1Np/ic7wQkOikCD6HLi l5uSCCYZWIkMqjt+kiDXtM213iL1/+Q= X-Received: by 2002:a65:57cb:: with SMTP id q11mr31463299pgr.60.1543369132480; Tue, 27 Nov 2018 17:38:52 -0800 (PST) Received: from localhost ([2620:0:1000:1601:3aef:314f:b9ea:889f]) by smtp.gmail.com with ESMTPSA id 128sm8618566pfu.129.2018.11.27.17.38.50 (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Tue, 27 Nov 2018 17:38:51 -0800 (PST) Date: Tue, 27 Nov 2018 17:38:50 -0800 From: Joel Fernandes To: Steven Rostedt Cc: Masami Hiramatsu , linux-kernel@vger.kernel.org, Ingo Molnar , Andrew Morton , Thomas Gleixner , Peter Zijlstra , Josh Poimboeuf , Frederic Weisbecker , Andy Lutomirski , Mark Rutland Subject: Re: [RFC][PATCH 11/14] function_graph: Convert ret_stack to a series of longs Message-ID: <20181128013850.GA19606@google.com> References: <20181122012708.491151844@goodmis.org> <20181122012804.122411256@goodmis.org> <20181124053138.GA242510@google.com> <20181127010755.0f897c13a57315a3859d225b@kernel.org> <20181126112603.6c5519dd@gandalf.local.home> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20181126112603.6c5519dd@gandalf.local.home> User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Nov 26, 2018 at 11:26:03AM -0500, Steven Rostedt wrote: > On Tue, 27 Nov 2018 01:07:55 +0900 > Masami Hiramatsu wrote: > > > > > --- a/include/linux/sched.h > > > > +++ b/include/linux/sched.h > > > > @@ -1119,7 +1119,7 @@ struct task_struct { > > > > int curr_ret_depth; > > > > > > > > /* Stack of return addresses for return function tracing: */ > > > > - struct ftrace_ret_stack *ret_stack; > > > > + unsigned long *ret_stack; > > > > > > > > /* Timestamp for last schedule: */ > > > > unsigned long long ftrace_timestamp; > > > > diff --git a/kernel/trace/fgraph.c b/kernel/trace/fgraph.c > > > > index 9b85638ecded..1389fe39f64c 100644 > > > > --- a/kernel/trace/fgraph.c > > > > +++ b/kernel/trace/fgraph.c > > > > @@ -23,6 +23,17 @@ > > > > #define ASSIGN_OPS_HASH(opsname, val) > > > > #endif > > > > > > > > +#define FGRAPH_RET_SIZE (sizeof(struct ftrace_ret_stack)) > > > > +#define FGRAPH_RET_INDEX (ALIGN(FGRAPH_RET_SIZE, sizeof(long)) / sizeof(long)) > > > > +#define SHADOW_STACK_SIZE (FTRACE_RETFUNC_DEPTH * FGRAPH_RET_SIZE) > > > > +#define SHADOW_STACK_INDEX \ > > > > + (ALIGN(SHADOW_STACK_SIZE, sizeof(long)) / sizeof(long)) > > > > +#define SHADOW_STACK_MAX_INDEX (SHADOW_STACK_INDEX - FGRAPH_RET_INDEX) > > > > + > > > > +#define RET_STACK(t, index) ((struct ftrace_ret_stack *)(&(t)->ret_stack[index])) > > > > +#define RET_STACK_INC(c) ({ c += FGRAPH_RET_INDEX; }) > > > > +#define RET_STACK_DEC(c) ({ c -= FGRAPH_RET_INDEX; }) > > > > + > > > [...] > > > > @@ -514,7 +531,7 @@ void ftrace_graph_init_task(struct task_struct *t) > > > > > > > > void ftrace_graph_exit_task(struct task_struct *t) > > > > { > > > > - struct ftrace_ret_stack *ret_stack = t->ret_stack; > > > > + unsigned long *ret_stack = t->ret_stack; > > > > > > > > t->ret_stack = NULL; > > > > /* NULL must become visible to IRQs before we free it: */ > > > > @@ -526,12 +543,10 @@ void ftrace_graph_exit_task(struct task_struct *t) > > > > /* Allocate a return stack for each task */ > > > > static int start_graph_tracing(void) > > > > { > > > > - struct ftrace_ret_stack **ret_stack_list; > > > > + unsigned long **ret_stack_list; > > > > int ret, cpu; > > > > > > > > - ret_stack_list = kmalloc_array(FTRACE_RETSTACK_ALLOC_SIZE, > > > > - sizeof(struct ftrace_ret_stack *), > > > > - GFP_KERNEL); > > > > + ret_stack_list = kmalloc(SHADOW_STACK_SIZE, GFP_KERNEL); > > > > > > > > > > I had dumped the fgraph size related macros to understand the patch better, I > > > got: > > > [ 0.909528] val of FGRAPH_RET_SIZE is 40 > > > [ 0.910250] val of FGRAPH_RET_INDEX is 5 > > > [ 0.910866] val of FGRAPH_ARRAY_SIZE is 16 > > > [ 0.911488] val of FGRAPH_ARRAY_MASK is 255 > > > [ 0.912134] val of FGRAPH_MAX_INDEX is 16 > > > [ 0.912751] val of FGRAPH_INDEX_SHIFT is 8 > > > [ 0.913382] val of FGRAPH_FRAME_SIZE is 168 > > > [ 0.914033] val of FGRAPH_FRAME_INDEX is 21 > > > FTRACE_RETFUNC_DEPTH is 50 > > > [ 0.914686] val of SHADOW_STACK_SIZE is 8400 > > > > > > I had a concern about memory overhead per-task. It seems the total memory > > > needed per task for the stack is 8400 bytes (with my configuration with > > > FUNCTION_PROFILE > > > turned off). > > > > > > Where as before it would be 32 * 40 = 1280 bytes. That looks like ~7 times > > > more than before. > > > > Hmm, this seems too big... I thought the shadow-stack size should be > > smaller than 1 page (4kB). Steve, can we give a 4k page for shadow stack > > and define FTRACE_RETFUNC_DEPTH = 4096 / FGRAPH_RET_SIZE ? > > For the first pass, I decided not to worry about the size. It made the > code less complex :-) > > Yes, I plan on working on making the size of the stack smaller, but > that will probably be added on patches to do so. Cool, sounds good. > > > On my system with ~4000 threads, that becomes ~32MB which seems a bit > > > wasteful especially if there was only one or 2 function graph callbacks > > > registered and most of the callback array in the stack isn't used. > > Note, all 4000 threads could be doing those trace backs, and if you are > doing full function graph tracing, it will use a lot. But I think each of the threads will only use N entries in the callback array where N is the number of function graph callback users who registered, right? So the remaining total-N allocated callback array entries per thread will not be used. > > > Could we make the array size configurable at compile time and start it with a > > > small number like 4 or 6? > > > > Or, we can introduce online setting :) > > Yes, that can easily be added. I didn't try to make this into the > perfect solution, I wanted a solid one first, and then massage it into > something that is more efficient, both with memory consumption and > performance. > > Joel and Masami, thanks for the feedback. I agree the first step is good so far. Looking forward to the patches, thanks a lot, - Joel