From: "Joel Fernandes (Google)"
To: linux-kernel@vger.kernel.org
Cc: "Joel Fernandes (Google)", Davidlohr Bueso, Ingo Molnar, Josh Triplett,
    Lai Jiangshan, Marco Elver, Mathieu Desnoyers, "Paul E. McKenney",
    rcu@vger.kernel.org, Steven Rostedt, "Uladzislau Rezki (Sony)"
Subject: [PATCH 2/7] rcu/trace: Add tracing for how segcb list changes
Date: Thu, 18 Jun 2020 16:29:50 -0400
Message-Id: <20200618202955.4024-2-joel@joelfernandes.org>
In-Reply-To: <20200618202955.4024-1-joel@joelfernandes.org>
References: <20200618202955.4024-1-joel@joelfernandes.org>

Track how the segcb list changes before/after acceleration, during
queuing and during dequeuing. This has proved useful to discover an
optimization to avoid unwanted GP requests when there are no callbacks
accelerated.

Signed-off-by: Joel Fernandes (Google)
---
 include/trace/events/rcu.h | 25 +++++++++++++++++++++++++
 kernel/rcu/rcu_segcblist.c | 37 +++++++++++++++++++++++++++++++++++++
 kernel/rcu/rcu_segcblist.h |  7 +++++++
 kernel/rcu/tree.c          | 24 ++++++++++++++++++++++++
 4 files changed, 93 insertions(+)

diff --git a/include/trace/events/rcu.h b/include/trace/events/rcu.h
index 02dcd119f3263..a6d49864dcc27 100644
--- a/include/trace/events/rcu.h
+++ b/include/trace/events/rcu.h
@@ -507,6 +507,31 @@ TRACE_EVENT_RCU(rcu_callback,
 		  __entry->qlen)
 );
 
+TRACE_EVENT_RCU(rcu_segcb,
+
+	TP_PROTO(const char *ctx, int *cb_count, unsigned long *gp_seq),
+
+	TP_ARGS(ctx, cb_count, gp_seq),
+
+	TP_STRUCT__entry(
+		__field(const char *, ctx)
+		__array(int, cb_count, 4)
+		__array(unsigned long, gp_seq, 4)
+	),
+
+	TP_fast_assign(
+		__entry->ctx = ctx;
+		memcpy(__entry->cb_count, cb_count, 4 * sizeof(int));
+		memcpy(__entry->gp_seq, gp_seq, 4 * sizeof(unsigned long));
+	),
+
+	TP_printk("%s cb_count: (DONE=%d, WAIT=%d, NEXT_READY=%d, NEXT=%d) "
+		  "gp_seq: (DONE=%lu, WAIT=%lu, NEXT_READY=%lu, NEXT=%lu)", __entry->ctx,
+		  __entry->cb_count[0], __entry->cb_count[1], __entry->cb_count[2], __entry->cb_count[3],
+		  __entry->gp_seq[0], __entry->gp_seq[1], __entry->gp_seq[2], __entry->gp_seq[3])
+
+);
+
 /*
  * Tracepoint for the registration of a single RCU callback of the special
  * kfree() form.  The first argument is the RCU type, the second argument
diff --git a/kernel/rcu/rcu_segcblist.c b/kernel/rcu/rcu_segcblist.c
index 4782cf17bf4f9..036d4abac7c5a 100644
--- a/kernel/rcu/rcu_segcblist.c
+++ b/kernel/rcu/rcu_segcblist.c
@@ -316,6 +316,43 @@ void rcu_segcblist_extract_done_cbs(struct rcu_segcblist *rsclp,
 		WRITE_ONCE(rsclp->tails[i], &rsclp->head);
 }
 
+/*
+ * Return how many CBs are in each segment, along with their gp_seq values.
+ *
+ * This function is O(N) where N is the number of callbacks. Only used from
+ * tracing code which is usually disabled in production.
+ */
+#ifdef CONFIG_RCU_TRACE
+void rcu_segcblist_countseq(struct rcu_segcblist *rsclp,
+			    int cbcount[RCU_CBLIST_NSEGS],
+			    unsigned long gpseq[RCU_CBLIST_NSEGS])
+{
+	struct rcu_head **cur_tail, *h;
+	int i, c;
+
+	for (i = 0; i < RCU_CBLIST_NSEGS; i++)
+		cbcount[i] = 0;
+
+	cur_tail = &(rsclp->head);
+
+	for (i = 0; i < RCU_CBLIST_NSEGS; i++) {
+		c = 0;
+		// List empty?
+		if (rsclp->tails[i] != cur_tail) {
+			// The loop skips the last node
+			c = 1;
+			for (h = *cur_tail; h->next != *(rsclp->tails[i]); h = h->next) {
+				c++;
+			}
+		}
+
+		cbcount[i] = c;
+		gpseq[i] = rsclp->gp_seq[i];
+		cur_tail = rsclp->tails[i];
+	}
+}
+#endif
+
 /*
  * Extract only those callbacks still pending (not yet ready to be
  * invoked) from the specified rcu_segcblist structure and place them in
diff --git a/kernel/rcu/rcu_segcblist.h b/kernel/rcu/rcu_segcblist.h
index 5c293afc07b8e..0a8dbac438529 100644
--- a/kernel/rcu/rcu_segcblist.h
+++ b/kernel/rcu/rcu_segcblist.h
@@ -104,3 +104,10 @@ void rcu_segcblist_advance(struct rcu_segcblist *rsclp, unsigned long seq);
 bool rcu_segcblist_accelerate(struct rcu_segcblist *rsclp, unsigned long seq);
 void rcu_segcblist_merge(struct rcu_segcblist *dst_rsclp,
 			 struct rcu_segcblist *src_rsclp);
+#ifdef CONFIG_RCU_TRACE
+void rcu_segcblist_countseq(struct rcu_segcblist *rsclp,
+			    int cbcount[RCU_CBLIST_NSEGS],
+			    unsigned long gpseq[RCU_CBLIST_NSEGS]);
+#else
+#define rcu_segcblist_countseq(...)
+#endif
diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c
index ebce14e470d0e..c61af6a33fbfd 100644
--- a/kernel/rcu/tree.c
+++ b/kernel/rcu/tree.c
@@ -1407,6 +1407,8 @@ static bool rcu_accelerate_cbs(struct rcu_node *rnp, struct rcu_data *rdp)
 {
 	unsigned long gp_seq_req;
 	bool ret = false;
+	int cbs[RCU_CBLIST_NSEGS];
+	unsigned long gps[RCU_CBLIST_NSEGS];
 
 	rcu_lockdep_assert_cblist_protected(rdp);
 	raw_lockdep_assert_held_rcu_node(rnp);
@@ -1415,6 +1417,10 @@ static bool rcu_accelerate_cbs(struct rcu_node *rnp, struct rcu_data *rdp)
 	if (!rcu_segcblist_pend_cbs(&rdp->cblist))
 		return false;
 
+	/* Count CBs for tracing. */
+	rcu_segcblist_countseq(&rdp->cblist, cbs, gps);
+	trace_rcu_segcb("SegCbPreAcc", cbs, gps);
+
 	/*
 	 * Callbacks are often registered with incomplete grace-period
 	 * information.  Something about the fact that getting exact
@@ -1434,6 +1440,11 @@ static bool rcu_accelerate_cbs(struct rcu_node *rnp, struct rcu_data *rdp)
 		trace_rcu_grace_period(rcu_state.name, rdp->gp_seq, TPS("AccWaitCB"));
 	else
 		trace_rcu_grace_period(rcu_state.name, rdp->gp_seq, TPS("AccReadyCB"));
+
+	/* Count CBs for tracing. */
+	rcu_segcblist_countseq(&rdp->cblist, cbs, gps);
+	trace_rcu_segcb("SegCbPostAcc", cbs, gps);
+
 	return ret;
 }
 
@@ -2316,6 +2327,8 @@ static void rcu_do_batch(struct rcu_data *rdp)
 	struct rcu_cblist rcl = RCU_CBLIST_INITIALIZER(rcl);
 	long bl, count;
 	long pending, tlimit = 0;
+	int cbs[RCU_CBLIST_NSEGS];
+	unsigned long gps[RCU_CBLIST_NSEGS];
 
 	/* If no callbacks are ready, just return. */
 	if (!rcu_segcblist_ready_cbs(&rdp->cblist)) {
@@ -2350,6 +2363,11 @@ static void rcu_do_batch(struct rcu_data *rdp)
 	/* Invoke callbacks. */
 	tick_dep_set_task(current, TICK_DEP_BIT_RCU);
 	rhp = rcu_cblist_dequeue(&rcl);
+
+	/* Count CBs for tracing. */
+	rcu_segcblist_countseq(&rdp->cblist, cbs, gps);
+	trace_rcu_segcb("SegCbDequeued", cbs, gps);
+
 	for (; rhp; rhp = rcu_cblist_dequeue(&rcl)) {
 		rcu_callback_t f;
 
@@ -2808,6 +2826,8 @@ __call_rcu(struct rcu_head *head, rcu_callback_t func)
 	unsigned long flags;
 	struct rcu_data *rdp;
 	bool was_alldone;
+	int cbs[RCU_CBLIST_NSEGS];
+	unsigned long gps[RCU_CBLIST_NSEGS];
 
 	/* Misaligned rcu_head! */
 	WARN_ON_ONCE((unsigned long)head & (sizeof(void *) - 1));
@@ -2852,6 +2872,10 @@ __call_rcu(struct rcu_head *head, rcu_callback_t func)
 		trace_rcu_callback(rcu_state.name, head,
 				   rcu_segcblist_n_cbs(&rdp->cblist));
 
+	/* Count CBs for tracing. */
+	rcu_segcblist_countseq(&rdp->cblist, cbs, gps);
+	trace_rcu_segcb("SegCBQueued", cbs, gps);
+
 	/* Go handle any RCU core processing required. */
 	if (IS_ENABLED(CONFIG_RCU_NOCB_CPU) &&
 	    unlikely(rcu_segcblist_is_offloaded(&rdp->cblist))) {
-- 
2.27.0.111.gc72c7da667-goog
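
For readers who want to try the per-segment walk outside the kernel, here
is a minimal userspace sketch. It is not the kernel code: struct cb,
struct segcblist, segcb_init(), segcb_enqueue(), segcb_count() and
segcb_demo() are hypothetical stand-ins invented for illustration, there
is no locking and no gp_seq tracking, and the counting loop is an
equivalent variant that compares h itself against the first node of the
next segment instead of starting the count at 1 and testing h->next as
the patch does.

```c
#include <assert.h>
#include <stddef.h>

#define NSEGS 4	/* DONE, WAIT, NEXT_READY, NEXT (mirrors RCU_CBLIST_NSEGS) */

/* Hypothetical stand-in for struct rcu_head. */
struct cb {
	struct cb *next;
};

/* Hypothetical stand-in for struct rcu_segcblist: one list, four tails. */
struct segcblist {
	struct cb *head;
	struct cb **tails[NSEGS];	/* tails[i] points at the ->next slot ending segment i */
};

static void segcb_init(struct segcblist *l)
{
	int i;

	l->head = NULL;
	for (i = 0; i < NSEGS; i++)
		l->tails[i] = &l->head;
}

/* Append cb to segment seg, moving the tails of any empty segments after it. */
static void segcb_enqueue(struct segcblist *l, int seg, struct cb *cb)
{
	struct cb **old_tail;
	int i;

	assert(seg >= 0 && seg < NSEGS);
	old_tail = l->tails[seg];
	cb->next = *old_tail;
	*old_tail = cb;
	for (i = seg; i < NSEGS && l->tails[i] == old_tail; i++)
		l->tails[i] = &cb->next;
}

/* O(N) walk: count the callbacks between consecutive tail pointers. */
static void segcb_count(struct segcblist *l, int count[NSEGS])
{
	struct cb **cur_tail = &l->head;
	struct cb *h;
	int i;

	for (i = 0; i < NSEGS; i++) {
		count[i] = 0;
		/* Stop at the first node of the next segment (or NULL). */
		for (h = *cur_tail; h != *l->tails[i]; h = h->next)
			count[i]++;
		cur_tail = l->tails[i];
	}
}

/* Build DONE=2, WAIT=0, NEXT_READY=1, NEXT=3 and count it back. */
static void segcb_demo(int count[NSEGS])
{
	static struct cb cbs[6];
	struct segcblist l;

	segcb_init(&l);
	segcb_enqueue(&l, 0, &cbs[0]);
	segcb_enqueue(&l, 0, &cbs[1]);
	segcb_enqueue(&l, 2, &cbs[2]);
	segcb_enqueue(&l, 3, &cbs[3]);
	segcb_enqueue(&l, 3, &cbs[4]);
	segcb_enqueue(&l, 3, &cbs[5]);
	segcb_count(&l, count);
}
```

Calling segcb_demo() fills count with {2, 0, 1, 3}. An empty segment
shares its tail pointer with the segment before it, so the inner loop
runs zero times for it. That is the same condition the patch tests with
rsclp->tails[i] != cur_tail.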