Received: by 2002:a05:6a10:16a7:0:0:0:0 with SMTP id gp39csp1369551pxb; Wed, 4 Nov 2020 07:13:49 -0800 (PST) X-Google-Smtp-Source: ABdhPJxl9/FSly5z9VBdLv09MOiuxEulFxYF7OxBq5iDGQy6yb8N4FR1D3rZdf/+Fm+8E4qhVgB2 X-Received: by 2002:a05:6402:10c7:: with SMTP id p7mr27432734edu.34.1604502829264; Wed, 04 Nov 2020 07:13:49 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1604502829; cv=none; d=google.com; s=arc-20160816; b=ILPWUJf/FSIQVmO2EeHJzWwKnMrya2KB6vwXlyE+uR0T4rWbnfsHF5bSBHpShC2N/B PYrYXkxk4mlscWQzh0i++lnKhs66VOnR4GleBPIGkYE/bZvp7yVFTsEZy0Z8RctGi9jk rdDupMSLUg7KFL43lj1CodYM1H9cQ0f5tKgqCLm+mddKZnmNaG7HLOncVFbtS+fxdcFp N79z0EtJ9vBsPF5S4Xbff4J45z8ZAYJgQzyIyUIOWnUrj4byO+IiRLZ9CCrRHH4tfPfG d9Re9Mt3tgVjInV9drnhAoc4sE7a/ETEtWqgzoGpqVpk4wOdZkP8PdDinqQcHOguh2jn M9XA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=xHKBvoGz6qQ1RRpTvT7gKL/8ddg9DSuEs8nci0Mxe1U=; b=t6bgRUHvv9ih8sXlrMkXD7KIo35Pv1xZxM+z5Ygs89sZIc7bv7g5TMQ6BH8Lck64fk v78Kbtx/zbz6Zm2JCwDzOwhwEra/F0ZXaRDXB2/TnUBHYkQnXnUtMiGf5JJu5NC3fGtP L7Y6ddVoPbb8VSgstZRubojrhi9uZLYNHESugCKRHvJlsOQrnzS2B5qBLDSo7DdomQYl JnqMpJeK6ShJGcjelNQI0AOcRarZT9OZ2TdO8RoFFk73G+v3wGxo++kmhC+FgMnJ3oE4 tlUTLsqxQTXa8WMnJmCr3Dsv8gA9KJBklAhHPTiG3Afo/bOqbjF70sZpKGv+CiqiiX7v Sf4g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@joelfernandes.org header.s=google header.b=elI06k2n; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id q25si1473277ejd.426.2020.11.04.07.13.24; Wed, 04 Nov 2020 07:13:49 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@joelfernandes.org header.s=google header.b=elI06k2n; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730154AbgKDPJt (ORCPT + 99 others); Wed, 4 Nov 2020 10:09:49 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:51844 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726801AbgKDPJt (ORCPT ); Wed, 4 Nov 2020 10:09:49 -0500 Received: from mail-qv1-xf44.google.com (mail-qv1-xf44.google.com [IPv6:2607:f8b0:4864:20::f44]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 287B7C0613D3 for ; Wed, 4 Nov 2020 07:09:49 -0800 (PST) Received: by mail-qv1-xf44.google.com with SMTP id dj6so6050982qvb.3 for ; Wed, 04 Nov 2020 07:09:49 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=joelfernandes.org; s=google; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to; bh=xHKBvoGz6qQ1RRpTvT7gKL/8ddg9DSuEs8nci0Mxe1U=; b=elI06k2nqQQbhPSzLNxtPRCF6lsNJabSf7epMpsvZazgJXWjvVUXdfc1sTpnx++ojJ lN2+c2Mx1u6/+Oa5RN/yox7PUkD5P234R+at5xD41Wf4HB+i9U5okLiGPnw+9lkrvbpZ tI5tMD35/SYD1PeZLKp7DGmCWAFLL37vDP9eI= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=xHKBvoGz6qQ1RRpTvT7gKL/8ddg9DSuEs8nci0Mxe1U=; b=YWsWAnKtCi9rpO86crN2DazC5K+P55zcXvoR1bvHXquvO+QF9kcY4R+abtZzbK5dxM rhnv21a1qp20aZXZzrnkW+6/S9HFEaGc/O5RRRZEGZ2PMQssxX6EmDVCwDom3f8ebUqU QbRuzsmyfU3R1W0WHNh9+ctForb2qeK5J/13TQlONCLrivN4EIp4Yp3HGZEOsIq8YV/O Kr8Sjkvwyws1QGjIaAnmLimzelJ8pAtlNkfx/S8VRhsESwW15P0xARnPUqvZOPzTvi2z vsw0VRKqKVGTWBewy4BssXar5uWnVMMEq+aDuVqgWN2Ac4UcourVUTALt0oo+pTfG4JN 8fxw== X-Gm-Message-State: AOAM531/eHHn62wTXX2LSVgDYZwQSrdl98V/dEtR+8uWPm/vFyf6ioeO EtRSoL4eIE+4PthNzsMZdmTYEQ== X-Received: by 2002:a05:6214:c2:: with SMTP id f2mr30417864qvs.2.1604502588226; Wed, 04 Nov 2020 07:09:48 -0800 (PST) Received: from localhost ([2620:15c:6:411:cad3:ffff:feb3:bd59]) by smtp.gmail.com with ESMTPSA id j50sm329748qtc.5.2020.11.04.07.09.47 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 04 Nov 2020 07:09:47 -0800 (PST) Date: Wed, 4 Nov 2020 10:09:47 -0500 From: Joel Fernandes To: "Paul E. McKenney" Cc: linux-kernel@vger.kernel.org, Frederic Weisbecker , Neeraj Upadhyay , Josh Triplett , Lai Jiangshan , Marco Elver , Mathieu Desnoyers , rcu@vger.kernel.org, Steven Rostedt , "Uladzislau Rezki (Sony)" , fweisbec@gmail.com, neeraj.iitr10@gmail.com Subject: Re: [PATCH v9 1/7] rcu/tree: Make rcu_do_batch count how many callbacks were executed Message-ID: <20201104150947.GD2313912@google.com> References: <20201103142603.1302207-1-joel@joelfernandes.org> <20201103142603.1302207-2-joel@joelfernandes.org> <20201104000554.GF3249@paulmck-ThinkPad-P72> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20201104000554.GF3249@paulmck-ThinkPad-P72> Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Nov 03, 2020 at 04:05:54PM -0800, Paul E. McKenney wrote: > On Tue, Nov 03, 2020 at 09:25:57AM -0500, Joel Fernandes (Google) wrote: > > Currently, rcu_do_batch() depends on the unsegmented callback list's len field > > to know how many CBs are executed. This fields counts down from 0 as CBs are > > dequeued. It is possible that all CBs could not be run because of reaching > > limits in which case the remaining unexecuted callbacks are requeued in the > > CPU's segcblist. > > > > The number of callbacks that were not requeued are then the negative count (how > > many CBs were run) stored in the rcl->len which has been counting down on every > > dequeue. This negative count is then added to the per-cpu segmented callback > > list's to correct its count. > > > > Such a design works against future efforts to track the length of each segment > > of the segmented callback list. The reason is because > > rcu_segcblist_extract_done_cbs() will be populating the unsegmented callback > > list's length field (rcl->len) during extraction. > > > > Also, the design of counting down from 0 is confusing and error-prone IMHO. > > > > This commit therefore explicitly counts how many callbacks were executed in > > rcu_do_batch() itself, and uses that to update the per-CPU segcb list's ->len > > field, without relying on the negativity of rcl->len. > > > > Reviewed-by: Frederic Weisbecker > > Reviewed-by: Neeraj Upadhyay > > Signed-off-by: Joel Fernandes (Google) > > Queued for testing and further review, thank you all! > > I will be pulling these in as I look them over, and running tests > in between times, so this will take some time. > > As usual, I could not resist editing the commit log, so please > see below to check whether I messed something up. Looks good to me, thanks. - Joel > > Thanx, Paul > > ------------------------------------------------------------------------ > > commit d77ef2684cbff728c14b3c84a356139c52ca3a5e > Author: Joel Fernandes (Google) > Date: Tue Nov 3 09:25:57 2020 -0500 > > rcu/tree: Make rcu_do_batch count how many callbacks were executed > > The rcu_do_batch() function extracts the ready-to-invoke callbacks > from the rcu_segcblist located in the ->cblist field of the current > CPU's rcu_data structure. These callbacks are first moved to a local > (unsegmented) rcu_cblist. The rcu_do_batch() function then uses this > rcu_cblist's ->len field to count how many CBs it has invoked, but it > does so by counting that field down from zero. Finally, this function > negates the value in this ->len field (resulting in a positive number) > and subtracts the result from the ->len field of the current CPU's > ->cblist field. > > Except that it is sometimes necessary for rcu_do_batch() to stop invoking > callbacks mid-stream, despite there being more ready to invoke, for > example, if a high-priority task wakes up. In this case the remaining > not-yet-invoked callbacks are requeued back onto the CPU's ->cblist, > but remain in the ready-to-invoke segment of that list. As above, the > negative of the local rcu_cblist's ->len field is still subtracted from > the ->len field of the current CPU's ->cblist field. > > The design of counting down from 0 is confusing and error-prone, plus > use of a positive count will make it easier to provide a uniform and > consistent API to deal with the per-segment counts that are added > later in this series. For example, rcu_segcblist_extract_done_cbs() > can unconditionally populate the resulting unsegmented list's ->len > field during extraction. > > This commit therefore explicitly counts how many callbacks were executed > in rcu_do_batch() itself, counting up from zero, and then uses that > to update the per-CPU segcb list's ->len field, without relying on the > downcounting of rcl->len from zero. > > Reviewed-by: Frederic Weisbecker > Reviewed-by: Neeraj Upadhyay > Signed-off-by: Joel Fernandes (Google) > Signed-off-by: Paul E. McKenney > > diff --git a/kernel/rcu/rcu_segcblist.c b/kernel/rcu/rcu_segcblist.c > index 2d2a6b6b9..bb246d8 100644 > --- a/kernel/rcu/rcu_segcblist.c > +++ b/kernel/rcu/rcu_segcblist.c > @@ -95,7 +95,7 @@ static void rcu_segcblist_set_len(struct rcu_segcblist *rsclp, long v) > * This increase is fully ordered with respect to the callers accesses > * both before and after. > */ > -static void rcu_segcblist_add_len(struct rcu_segcblist *rsclp, long v) > +void rcu_segcblist_add_len(struct rcu_segcblist *rsclp, long v) > { > #ifdef CONFIG_RCU_NOCB_CPU > smp_mb__before_atomic(); /* Up to the caller! */ > diff --git a/kernel/rcu/rcu_segcblist.h b/kernel/rcu/rcu_segcblist.h > index 492262b..1d2d614 100644 > --- a/kernel/rcu/rcu_segcblist.h > +++ b/kernel/rcu/rcu_segcblist.h > @@ -76,6 +76,7 @@ static inline bool rcu_segcblist_restempty(struct rcu_segcblist *rsclp, int seg) > } > > void rcu_segcblist_inc_len(struct rcu_segcblist *rsclp); > +void rcu_segcblist_add_len(struct rcu_segcblist *rsclp, long v); > void rcu_segcblist_init(struct rcu_segcblist *rsclp); > void rcu_segcblist_disable(struct rcu_segcblist *rsclp); > void rcu_segcblist_offload(struct rcu_segcblist *rsclp); > diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c > index b6ec565..c4035e7 100644 > --- a/kernel/rcu/tree.c > +++ b/kernel/rcu/tree.c > @@ -2436,7 +2436,7 @@ static void rcu_do_batch(struct rcu_data *rdp) > const bool offloaded = rcu_segcblist_is_offloaded(&rdp->cblist); > struct rcu_head *rhp; > struct rcu_cblist rcl = RCU_CBLIST_INITIALIZER(rcl); > - long bl, count; > + long bl, count = 0; > long pending, tlimit = 0; > > /* If no callbacks are ready, just return. */ > @@ -2481,6 +2481,7 @@ static void rcu_do_batch(struct rcu_data *rdp) > for (; rhp; rhp = rcu_cblist_dequeue(&rcl)) { > rcu_callback_t f; > > + count++; > debug_rcu_head_unqueue(rhp); > > rcu_lock_acquire(&rcu_callback_map); > @@ -2494,15 +2495,14 @@ static void rcu_do_batch(struct rcu_data *rdp) > > /* > * Stop only if limit reached and CPU has something to do. > - * Note: The rcl structure counts down from zero. > */ > - if (-rcl.len >= bl && !offloaded && > + if (count >= bl && !offloaded && > (need_resched() || > (!is_idle_task(current) && !rcu_is_callbacks_kthread()))) > break; > if (unlikely(tlimit)) { > /* only call local_clock() every 32 callbacks */ > - if (likely((-rcl.len & 31) || local_clock() < tlimit)) > + if (likely((count & 31) || local_clock() < tlimit)) > continue; > /* Exceeded the time limit, so leave. */ > break; > @@ -2519,7 +2519,6 @@ static void rcu_do_batch(struct rcu_data *rdp) > > local_irq_save(flags); > rcu_nocb_lock(rdp); > - count = -rcl.len; > rdp->n_cbs_invoked += count; > trace_rcu_batch_end(rcu_state.name, count, !!rcl.head, need_resched(), > is_idle_task(current), rcu_is_callbacks_kthread()); > @@ -2527,7 +2526,7 @@ static void rcu_do_batch(struct rcu_data *rdp) > /* Update counts and requeue any remaining callbacks. */ > rcu_segcblist_insert_done_cbs(&rdp->cblist, &rcl); > smp_mb(); /* List handling before counting for rcu_barrier(). */ > - rcu_segcblist_insert_count(&rdp->cblist, &rcl); > + rcu_segcblist_add_len(&rdp->cblist, -count); > > /* Reinstate batch limit if we have worked down the excess. */ > count = rcu_segcblist_n_cbs(&rdp->cblist);