Received: by 10.192.165.148 with SMTP id m20csp3571011imm; Mon, 7 May 2018 14:47:00 -0700 (PDT) X-Google-Smtp-Source: AB8JxZrEDYvQ78lDwiePXA0euGVQYMm6A7Hyi2CqttkAANxhA22/BOSgxWsE9YLjLk3hF70Nj+A5 X-Received: by 2002:a17:902:5a88:: with SMTP id r8-v6mr39078815pli.78.1525729620224; Mon, 07 May 2018 14:47:00 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1525729620; cv=none; d=google.com; s=arc-20160816; b=wJXSm5trIiR++LJD+/WQP611e9xN+c88C1cCs4in07uMZwxIlCOc2NUMUsWFhO+lVl oHUOIWHclBFCs5yRM9Hiwd/kqc4VVJ/Mnx1cjDKaHH/lO08WxPug3sU6agCI8uIPu3im vxHAUhEE9s8M7zaM7n4u42XmdDrK1TPb6Z4gRZyvtH8poYBiuZR/gsX1J3UzGH361YON TOtXxycf6ZkErcQrgs4dPtfVcP/30WV07oFB0/G50mKdVkUIa+MrV+Ud+QdJuEn2V4jh 3O/VUERGyPDcljCzlUYLasIokveuQwbSHE8XeR9t8avwACKPgP+jmPGqxCOJvfbo5dEJ TFng== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature:arc-authentication-results; bh=s54xtGLXdPJoW9SgwrZoGU0Rj1jO/RMwvYQWKV3kZi0=; b=fniorIFv3mM4dO8nJw5jDr8NCP96a0BI/3vfYqeeEOwmUtA4JUkmaV/Qd4YR2JcBw2 8Gipl5gh0GcIH47u2wEuswuY7Xm0wmbvZ2OpJD7Brt5oU94gWE0Z+Grr4uepuVG3lMg2 oOO1+JWnBNVAAodZyDZFgZxn/6pqN7hTeuMn9LXC6wSnx0MhkxN/Tdwr1OnIQP1g+a1A 7mSVJS3WOO6hsO+q/wTD0Og4qCdtLiDjxqiK2VNve94KT6E8riHRLSZqLGDjp8E1fFSD Rk6H3Bwq7V/k2iFSgur/Aer5kYjHPwJAsGEPyuQiORVsMIAiXHaPjUaVIe0Hwn9zD8L1 1e8g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@joelfernandes-org.20150623.gappssmtp.com header.s=20150623 header.b=NreySvMI; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id t5-v6si1123015ply.598.2018.05.07.14.46.46; Mon, 07 May 2018 14:47:00 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@joelfernandes-org.20150623.gappssmtp.com header.s=20150623 header.b=NreySvMI; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753090AbeEGVpR (ORCPT + 99 others); Mon, 7 May 2018 17:45:17 -0400 Received: from mail-pf0-f194.google.com ([209.85.192.194]:41152 "EHLO mail-pf0-f194.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753051AbeEGVpQ (ORCPT ); Mon, 7 May 2018 17:45:16 -0400 Received: by mail-pf0-f194.google.com with SMTP id v63so22907769pfk.8 for ; Mon, 07 May 2018 14:45:16 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=joelfernandes-org.20150623.gappssmtp.com; s=20150623; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=s54xtGLXdPJoW9SgwrZoGU0Rj1jO/RMwvYQWKV3kZi0=; b=NreySvMIPhwIOEEVSw9eeFCGmGzWZD1o8TRJ8KIwhs4QuJ96uF0Ky0p2XZw+ZP93AQ f3T6zjMfPC0g9Iaa21ig5rdhRNtCssZAL+KTKMUlsXKlyfN000JS8tGpB9b+OgM+vvXu Ig4RBv7POSZW+VH9s4Umu/XjvgkkH71yGUbHEaDj4acaSvhD72JFPT3NpswVjXlPT8Wo ab/wiLkfaZOlcpZUEwJQ3pDJ0xb9e740wrDNm/NRvWbrUudLwr7Ilb0lGcro4YdZUcAq 0rzXXFFM/4P144+lLjvTj4N/AW9DeofdCHufJACUfDxwKlm6wq5SYrLiOD/uB7XyZCkR R1og== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to:user-agent; bh=s54xtGLXdPJoW9SgwrZoGU0Rj1jO/RMwvYQWKV3kZi0=; b=m8xKzHHkkSsOPGrL8kuDIX1yJ8hID7Q21c1hFBBU48/4r/MAXM6g/73iJRPJkgkX7x J3OUVUwYw5wnz6lLGChaOic4UaadRNfhWD6pWTz/keUlpeJsM2H+xSVZ80GYUPdQQarg GWm5QmgarnT4TMCwLofO2Ga2g7x4mChaXw/YF2mtk1yEMyktHgvEc+R4twSWBrSYUDtj zMih9bxoCCCwDgr2rq9t/EzOeWv3koTkPoSQbR6pW+8CSUszh7wkiFF3RyEYw1RPd9j9 JCFJuJK0YjYtC+Mp3td7hBdlRKPKf5OVwACucXzHcuFWyjNvUGhNVx3XAyCmZjtizkCx LJ9Q== X-Gm-Message-State: ALQs6tAdU6mtJUE79kd+jrKDS/p3jl6NJPDjXGeFGAN3t1l6OtMgsPm0 PKNwpqG4/iMKFuQM7Xv9Wcn7pg== X-Received: by 10.98.141.201 with SMTP id p70mr37443504pfk.72.1525729515743; Mon, 07 May 2018 14:45:15 -0700 (PDT) Received: from localhost ([2620:0:1000:1600:3122:ea9c:d178:eb]) by smtp.gmail.com with ESMTPSA id d72sm33570775pfe.150.2018.05.07.14.45.15 (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Mon, 07 May 2018 14:45:15 -0700 (PDT) Date: Mon, 7 May 2018 14:45:14 -0700 From: Joel Fernandes To: "Paul E. McKenney" Cc: Joel Fernandes , linux-kernel@vger.kernel.org, Steven Rostedt , Peter Zilstra , Ingo Molnar , Mathieu Desnoyers , Tom Zanussi , Namhyung Kim , Thomas Glexiner , Boqun Feng , Frederic Weisbecker , Randy Dunlap , Masami Hiramatsu , Fenguang Wu , Baohong Liu , Vedang Patel , kernel-team@android.com Subject: Re: [PATCH RFC v6 4/5] tracepoint: Make rcuidle tracepoint callers use SRCU Message-ID: <20180507214514.GA13787@joelaf.mtv.corp.google.com> References: <20180507204143.13061-1-joelaf@google.com> <20180507204143.13061-5-joelaf@google.com> <20180507210801.GZ26088@linux.vnet.ibm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180507210801.GZ26088@linux.vnet.ibm.com> User-Agent: Mutt/1.9.2 (2017-12-15) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, May 07, 2018 at 02:08:01PM -0700, Paul E. McKenney wrote: > On Mon, May 07, 2018 at 01:41:42PM -0700, Joel Fernandes wrote: > > From: "Joel Fernandes (Google)" > > > > In recent tests with IRQ on/off tracepoints, a large performance > > overhead ~10% is noticed when running hackbench. This is root caused to > > calls to rcu_irq_enter_irqson and rcu_irq_exit_irqson from the > > tracepoint code. Following a long discussion on the list [1] about this, > > we concluded that srcu is a better alternative for use during rcu idle. > > Although it does involve extra barriers, its lighter than the sched-rcu > > version which has to do additional RCU calls to notify RCU idle about > > entry into RCU sections. > > > > In this patch, we change the underlying implementation of the > > trace_*_rcuidle API to use SRCU. This has shown to improve performance > > alot for the high frequency irq enable/disable tracepoints. [...] >> Signed-off-by: Joel Fernandes (Google) > > --- > > include/linux/tracepoint.h | 46 +++++++++++++++++++++++++++++++------- > > kernel/tracepoint.c | 15 ++++++++++++- > > 2 files changed, 52 insertions(+), 9 deletions(-) > > > > diff --git a/include/linux/tracepoint.h b/include/linux/tracepoint.h > > index c94f466d57ef..f56f290cf8eb 100644 > > --- a/include/linux/tracepoint.h > > +++ b/include/linux/tracepoint.h > > @@ -15,6 +15,7 @@ > > */ > > > > #include > > +#include > > #include > > #include > > #include > > @@ -33,6 +34,8 @@ struct trace_eval_map { > > > > #define TRACEPOINT_DEFAULT_PRIO 10 > > > > +extern struct srcu_struct tracepoint_srcu; > > + > > extern int > > tracepoint_probe_register(struct tracepoint *tp, void *probe, void *data); > > extern int > > @@ -77,6 +80,9 @@ int unregister_tracepoint_module_notifier(struct notifier_block *nb) > > */ > > static inline void tracepoint_synchronize_unregister(void) > > { > > +#ifdef CONFIG_TRACEPOINTS > > + synchronize_srcu(&tracepoint_srcu); > > +#endif > > synchronize_sched(); > > } > > > > @@ -129,18 +135,38 @@ extern void syscall_unregfunc(void); > > * as "(void *, void)". The DECLARE_TRACE_NOARGS() will pass in just > > * "void *data", where as the DECLARE_TRACE() will pass in "void *data, proto". > > */ > > -#define __DO_TRACE(tp, proto, args, cond, rcucheck) \ > > +#define __DO_TRACE(tp, proto, args, cond, rcuidle) \ > > do { \ > > struct tracepoint_func *it_func_ptr; \ > > void *it_func; \ > > void *__data; \ > > + int __maybe_unused idx = 0; \ > > \ > > if (!(cond)) \ > > return; \ > > - if (rcucheck) \ > > - rcu_irq_enter_irqson(); \ > > - rcu_read_lock_sched_notrace(); \ > > - it_func_ptr = rcu_dereference_sched((tp)->funcs); \ > > + \ > > + /* \ > > + * For rcuidle callers, use srcu since sched-rcu \ > > + * doesn't work from the idle path. \ > > + */ \ > > + if (rcuidle) { \ > > + if (in_nmi()) { \ > > + WARN_ON_ONCE(1); \ > > + return; /* no srcu from nmi */ \ > > + } \ > > + \ > > + idx = srcu_read_lock_notrace(&tracepoint_srcu); \ > > + it_func_ptr = \ > > + srcu_dereference_notrace((tp)->funcs, \ > > + &tracepoint_srcu); \ > > + /* To keep it consistent with !rcuidle path */ \ > > + preempt_disable_notrace(); \ > > + } else { \ > > + rcu_read_lock_sched_notrace(); \ > > + it_func_ptr = \ > > + rcu_dereference_sched((tp)->funcs); \ > > + } \ > > + \ > > if (it_func_ptr) { \ > > do { \ > > it_func = (it_func_ptr)->func; \ > > @@ -148,9 +174,13 @@ extern void syscall_unregfunc(void); > > ((void(*)(proto))(it_func))(args); \ > > } while ((++it_func_ptr)->func); \ > > } \ > > - rcu_read_unlock_sched_notrace(); \ > > - if (rcucheck) \ > > - rcu_irq_exit_irqson(); \ > > + \ > > + if (rcuidle) { \ > > Don't we also need an in_nmi() check here in order to avoid unbalanced > srcu_read_unlock_notrace() calls? The in_nmi() in the lock path should take care of making sure its balanced. The diff the way its formatted appears confusing as Mathieu pointed. thanks, - Joel