Received: by 2002:a05:6a10:16a7:0:0:0:0 with SMTP id gp39csp902988pxb; Tue, 3 Nov 2020 16:08:48 -0800 (PST) X-Google-Smtp-Source: ABdhPJyn9pT74+nGbHC1pvD4spIVNjdT8q+B8wshYLHsl/Eqh6oRJdluuLxx55ROC0/SUasDkoqL X-Received: by 2002:a05:6402:28e:: with SMTP id l14mr24125202edv.157.1604448528522; Tue, 03 Nov 2020 16:08:48 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1604448528; cv=none; d=google.com; s=arc-20160816; b=Z8N1Zct+bSlriiVqjdmbkH+HjGj0Bs4KvvMqnf/XoJN3UXZB70WlwTEemgJ6xx0kGd YXuslVvQ+7uFPzR8WOaAfPS1eK7A/4Q8ItGqWMlmSGdCthwYTMqbS2y1bBpOC70HdoQq uBBvn4jkBBws6CsW7EzVixJhnH5No239B2CTIJ6N2eJjr7zbBAA+LjlZYz1TNmwrQgos y/0QD8uO2p2y8vjHwnfS5GJ/jabTeCfizsVgX5LVfZNX4j8Hq+9nx26nXSa0VDcQFovY LTkb+CAowaHr1ljD7A8Y5ek0Y3uW0/XDIbeb+sNRkNpmgYPI4WKxI+YsUKUR57L2o/MG URJQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:reply-to:message-id:subject:cc:to:from:date :dkim-signature; bh=N7iHqImfriusvIPoJeXN6gsES7AUNXsUmIzq4f6YazE=; b=ubybJ5e6tSRUa0UybVq6jKoBFNTM9iBz2kz6XfYrvR4grVJfWLe9AS2SnB37WN5G/S wVVHlAcR9G+GOvecnzpF3ClbfCOVoqlkxbADSC5EyJ5oJES7rCt07u8fX8aIgWf8w0Wb p8m2n+Oto2CA5QbOk85HsFuIpTuLzgqFP/tMQtKsZEhBXa8A6EEtNCkFySihDTXmp+x5 fGnUWWwH4ZITO48bbLCUgr6nkNA36CQNCEzEFtfjgNvpL2GVlvGQHIEkrfQ6u9/+xCNN cX2zjceAWAP1TaZ8pSGhQYb6NDw0PRmavMb7b1MGPS1IG9Cek0RcNe/b2lBKk8bzfjVR qnEw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=Pl1XoOMk; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id i9si287671ejr.467.2020.11.03.16.08.25; Tue, 03 Nov 2020 16:08:48 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=Pl1XoOMk; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729189AbgKDAF5 (ORCPT + 99 others); Tue, 3 Nov 2020 19:05:57 -0500 Received: from mail.kernel.org ([198.145.29.99]:34032 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1729836AbgKDAFz (ORCPT ); Tue, 3 Nov 2020 19:05:55 -0500 Received: from paulmck-ThinkPad-P72.home (50-39-104-11.bvtn.or.frontiernet.net [50.39.104.11]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id D483D223BD; Wed, 4 Nov 2020 00:05:54 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1604448354; bh=KZ6By4TUfVUgQ/D4nmd6klO+Ho7DgbM92Y0TRb1lyGw=; h=Date:From:To:Cc:Subject:Reply-To:References:In-Reply-To:From; b=Pl1XoOMkN8m5IwRwgIRc9JBtjNijN8KaAgcOw6gblZkJg6ARjKvVVNOrtxO+gqpLE 3GMaabVmFV5qQhJIfS/iM9/+SYUw9Ev9c3b7uIh4dAQfl38vpLL2BqwQSkxWpCw/Yp ynPkxWU5PmOVNgFADfOBUnsO3YXMF2aT7Kqw8F2k= Received: by paulmck-ThinkPad-P72.home (Postfix, from userid 1000) id 9B3CF3522ABD; Tue, 3 Nov 2020 16:05:54 -0800 (PST) Date: Tue, 3 Nov 2020 16:05:54 -0800 From: "Paul E. McKenney" To: "Joel Fernandes (Google)" Cc: linux-kernel@vger.kernel.org, Frederic Weisbecker , Neeraj Upadhyay , Josh Triplett , Lai Jiangshan , Marco Elver , Mathieu Desnoyers , rcu@vger.kernel.org, Steven Rostedt , "Uladzislau Rezki (Sony)" , fweisbec@gmail.com, neeraj.iitr10@gmail.com Subject: Re: [PATCH v9 1/7] rcu/tree: Make rcu_do_batch count how many callbacks were executed Message-ID: <20201104000554.GF3249@paulmck-ThinkPad-P72> Reply-To: paulmck@kernel.org References: <20201103142603.1302207-1-joel@joelfernandes.org> <20201103142603.1302207-2-joel@joelfernandes.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20201103142603.1302207-2-joel@joelfernandes.org> User-Agent: Mutt/1.9.4 (2018-02-28) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Nov 03, 2020 at 09:25:57AM -0500, Joel Fernandes (Google) wrote: > Currently, rcu_do_batch() depends on the unsegmented callback list's len field > to know how many CBs are executed. This fields counts down from 0 as CBs are > dequeued. It is possible that all CBs could not be run because of reaching > limits in which case the remaining unexecuted callbacks are requeued in the > CPU's segcblist. > > The number of callbacks that were not requeued are then the negative count (how > many CBs were run) stored in the rcl->len which has been counting down on every > dequeue. This negative count is then added to the per-cpu segmented callback > list's to correct its count. > > Such a design works against future efforts to track the length of each segment > of the segmented callback list. The reason is because > rcu_segcblist_extract_done_cbs() will be populating the unsegmented callback > list's length field (rcl->len) during extraction. > > Also, the design of counting down from 0 is confusing and error-prone IMHO. > > This commit therefore explicitly counts how many callbacks were executed in > rcu_do_batch() itself, and uses that to update the per-CPU segcb list's ->len > field, without relying on the negativity of rcl->len. > > Reviewed-by: Frederic Weisbecker > Reviewed-by: Neeraj Upadhyay > Signed-off-by: Joel Fernandes (Google) Queued for testing and further review, thank you all! I will be pulling these in as I look them over, and running tests in between times, so this will take some time. As usual, I could not resist editing the commit log, so please see below to check whether I messed something up. Thanx, Paul ------------------------------------------------------------------------ commit d77ef2684cbff728c14b3c84a356139c52ca3a5e Author: Joel Fernandes (Google) Date: Tue Nov 3 09:25:57 2020 -0500 rcu/tree: Make rcu_do_batch count how many callbacks were executed The rcu_do_batch() function extracts the ready-to-invoke callbacks from the rcu_segcblist located in the ->cblist field of the current CPU's rcu_data structure. These callbacks are first moved to a local (unsegmented) rcu_cblist. The rcu_do_batch() function then uses this rcu_cblist's ->len field to count how many CBs it has invoked, but it does so by counting that field down from zero. Finally, this function negates the value in this ->len field (resulting in a positive number) and subtracts the result from the ->len field of the current CPU's ->cblist field. Except that it is sometimes necessary for rcu_do_batch() to stop invoking callbacks mid-stream, despite there being more ready to invoke, for example, if a high-priority task wakes up. In this case the remaining not-yet-invoked callbacks are requeued back onto the CPU's ->cblist, but remain in the ready-to-invoke segment of that list. As above, the negative of the local rcu_cblist's ->len field is still subtracted from the ->len field of the current CPU's ->cblist field. The design of counting down from 0 is confusing and error-prone, plus use of a positive count will make it easier to provide a uniform and consistent API to deal with the per-segment counts that are added later in this series. For example, rcu_segcblist_extract_done_cbs() can unconditionally populate the resulting unsegmented list's ->len field during extraction. This commit therefore explicitly counts how many callbacks were executed in rcu_do_batch() itself, counting up from zero, and then uses that to update the per-CPU segcb list's ->len field, without relying on the downcounting of rcl->len from zero. Reviewed-by: Frederic Weisbecker Reviewed-by: Neeraj Upadhyay Signed-off-by: Joel Fernandes (Google) Signed-off-by: Paul E. McKenney diff --git a/kernel/rcu/rcu_segcblist.c b/kernel/rcu/rcu_segcblist.c index 2d2a6b6b9..bb246d8 100644 --- a/kernel/rcu/rcu_segcblist.c +++ b/kernel/rcu/rcu_segcblist.c @@ -95,7 +95,7 @@ static void rcu_segcblist_set_len(struct rcu_segcblist *rsclp, long v) * This increase is fully ordered with respect to the callers accesses * both before and after. */ -static void rcu_segcblist_add_len(struct rcu_segcblist *rsclp, long v) +void rcu_segcblist_add_len(struct rcu_segcblist *rsclp, long v) { #ifdef CONFIG_RCU_NOCB_CPU smp_mb__before_atomic(); /* Up to the caller! */ diff --git a/kernel/rcu/rcu_segcblist.h b/kernel/rcu/rcu_segcblist.h index 492262b..1d2d614 100644 --- a/kernel/rcu/rcu_segcblist.h +++ b/kernel/rcu/rcu_segcblist.h @@ -76,6 +76,7 @@ static inline bool rcu_segcblist_restempty(struct rcu_segcblist *rsclp, int seg) } void rcu_segcblist_inc_len(struct rcu_segcblist *rsclp); +void rcu_segcblist_add_len(struct rcu_segcblist *rsclp, long v); void rcu_segcblist_init(struct rcu_segcblist *rsclp); void rcu_segcblist_disable(struct rcu_segcblist *rsclp); void rcu_segcblist_offload(struct rcu_segcblist *rsclp); diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c index b6ec565..c4035e7 100644 --- a/kernel/rcu/tree.c +++ b/kernel/rcu/tree.c @@ -2436,7 +2436,7 @@ static void rcu_do_batch(struct rcu_data *rdp) const bool offloaded = rcu_segcblist_is_offloaded(&rdp->cblist); struct rcu_head *rhp; struct rcu_cblist rcl = RCU_CBLIST_INITIALIZER(rcl); - long bl, count; + long bl, count = 0; long pending, tlimit = 0; /* If no callbacks are ready, just return. */ @@ -2481,6 +2481,7 @@ static void rcu_do_batch(struct rcu_data *rdp) for (; rhp; rhp = rcu_cblist_dequeue(&rcl)) { rcu_callback_t f; + count++; debug_rcu_head_unqueue(rhp); rcu_lock_acquire(&rcu_callback_map); @@ -2494,15 +2495,14 @@ static void rcu_do_batch(struct rcu_data *rdp) /* * Stop only if limit reached and CPU has something to do. - * Note: The rcl structure counts down from zero. */ - if (-rcl.len >= bl && !offloaded && + if (count >= bl && !offloaded && (need_resched() || (!is_idle_task(current) && !rcu_is_callbacks_kthread()))) break; if (unlikely(tlimit)) { /* only call local_clock() every 32 callbacks */ - if (likely((-rcl.len & 31) || local_clock() < tlimit)) + if (likely((count & 31) || local_clock() < tlimit)) continue; /* Exceeded the time limit, so leave. */ break; @@ -2519,7 +2519,6 @@ static void rcu_do_batch(struct rcu_data *rdp) local_irq_save(flags); rcu_nocb_lock(rdp); - count = -rcl.len; rdp->n_cbs_invoked += count; trace_rcu_batch_end(rcu_state.name, count, !!rcl.head, need_resched(), is_idle_task(current), rcu_is_callbacks_kthread()); @@ -2527,7 +2526,7 @@ static void rcu_do_batch(struct rcu_data *rdp) /* Update counts and requeue any remaining callbacks. */ rcu_segcblist_insert_done_cbs(&rdp->cblist, &rcl); smp_mb(); /* List handling before counting for rcu_barrier(). */ - rcu_segcblist_insert_count(&rdp->cblist, &rcl); + rcu_segcblist_add_len(&rdp->cblist, -count); /* Reinstate batch limit if we have worked down the excess. */ count = rcu_segcblist_n_cbs(&rdp->cblist);