Date: Sun, 19 Apr 2020 08:24:45 -0700
From: "Paul E. McKenney"
To: Wei Yang
Cc: josh@joshtriplett.org, rcu@vger.kernel.org, Linux Kernel Mailing List
Subject: Re: [PATCH] rcu: simplify the calculation of rcu_state.ncpus
Message-ID: <20200419152445.GU17661@paulmck-ThinkPad-P72>
Reply-To: paulmck@kernel.org
References: <20200417213951.29837-1-richard.weiyang@gmail.com>
 <20200418031922.GR17661@paulmck-ThinkPad-P72>

On Sun, Apr 19, 2020 at 06:02:12AM +0800, Wei Yang wrote:
> On Sat, Apr 18, 2020 at 11:19 AM Paul E. McKenney wrote:
> >
> > On Fri, Apr 17, 2020 at 09:39:51PM +0000, Wei Yang wrote:
> > > There is only 1 bit set in mask, which means the difference between
> > > oldmask and the new one would be at the position where the bit is set in
> > > mask.
> > >
> > > Based on this knowledge, rcu_state.ncpus could be calculated by checking
> > > whether mask is already set in oldmask.
> >
> > Nice!!!  Good eyes!
> >
> > > BTW, the comment at the last of this line is mysterious. Not sure it
> > > could be removed or not.
> >
> > The "^^^" in that comment says to look at the comment on the preceding
> > line.  Memory-ordering functions like smp_store_release() are supposed
> > to have comments indicating what they are ordering.  ;-)
> >
> > Could you please do the following things and resubmit?
> >
> > 1.	Forward-port to -rcu branch dev?  This tree lives here:
> >	git://git.kernel.org/pub/scm/linux/kernel/git/paulmck/linux-rcu.git
> >
> > 2.	Given that oldmask is used only to test to see if a new bit
> >	was set, why not just replace oldmask with a bool variable
> >	that is set to "!(rnp->expmaskinitnext & mask)" before the
> >	bit is ORed into rnp->expmaskinitnext?
> >
> > 3.	Put the comment inside the "if" statement with the
> >	smp_store_release().
> >
> > 4.	In -rcu, you will find a ASSERT_EXCLUSIVE_WRITER() statement
> >	that should also be placed inside the "if" statement with
> >	the smp_store_release().
> >
>
> Oops, my email client EAT this mail. Hope this mail will not be banned.

This one made it.  ;-)

> I adjust the code a little according to your suggestion like below. Is this
> what you expected?

Much better, thank you!

> diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c
> index f288477ee1c2..f01367a80b70 100644
> --- a/kernel/rcu/tree.c
> +++ b/kernel/rcu/tree.c
> @@ -3732,10 +3732,9 @@ void rcu_cpu_starting(unsigned int cpu)
>  {
>  	unsigned long flags;
>  	unsigned long mask;
> -	int nbits;
> -	unsigned long oldmask;
>  	struct rcu_data *rdp;
>  	struct rcu_node *rnp;
> +	bool has_seen;

Could you please use a more descriptive name, perhaps something like
"newcpu"?  Yes, "has_seen" does make sense, but it leaves the reader
with the question "has seen what?".

>  	if (per_cpu(rcu_cpu_started, cpu))
>  		return;
> @@ -3747,13 +3746,13 @@ void rcu_cpu_starting(unsigned int cpu)
>  	mask = rdp->grpmask;
>  	raw_spin_lock_irqsave_rcu_node(rnp, flags);
>  	WRITE_ONCE(rnp->qsmaskinitnext, rnp->qsmaskinitnext | mask);
> -	oldmask = rnp->expmaskinitnext;
> +	has_seen = rnp->expmaskinitnext & mask;
>  	rnp->expmaskinitnext |= mask;
> -	oldmask ^= rnp->expmaskinitnext;
> -	nbits = bitmap_weight(&oldmask, BITS_PER_LONG);
> -	/* Allow lockless access for expedited grace periods. */
> -	smp_store_release(&rcu_state.ncpus, rcu_state.ncpus + nbits); /* ^^^ */
> -	ASSERT_EXCLUSIVE_WRITER(rcu_state.ncpus);
> +	if (!has_seen) {
> +		/* Allow lockless access for expedited grace periods. */
> +		smp_store_release(&rcu_state.ncpus, rcu_state.ncpus + 1); /* ^^^ */
> +		ASSERT_EXCLUSIVE_WRITER(rcu_state.ncpus);

One advantage of flipping the conditional is that it allows dropping the
"if" statement, which saves a few lines of code and gets closer to your
original suggestion:

	newcpu = !(rnp->expmaskinitnext & mask);
	rnp->expmaskinitnext |= mask;
	/* Allow lockless access for expedited grace periods. */
	smp_store_release(&rcu_state.ncpus, rcu_state.ncpus + newcpu); /* ^^^ */
	ASSERT_EXCLUSIVE_WRITER(rcu_state.ncpus);

Yes, this does execute a store-release that doesn't change anything
when the CPU has been seen before, but this is OK because (1) CPU
hotplug is infrequent and (2) CPU hotplug has very high overhead.
The overhead of the store is thus negligible.  (But please do not try
this on heavily contended cachelines on fastpaths!)

And yes, I should have thought of this in response to your initial patch,
but then again, I should have thought of this when writing this code in
the first place, shouldn't I have?  ;-)

							Thanx, Paul

> +	}
>  	rcu_gpnum_ovf(rnp, rdp); /* Offline-induced counter wrap? */
>  	rdp->rcu_onl_gp_seq = READ_ONCE(rcu_state.gp_seq);
>  	rdp->rcu_onl_gp_flags = READ_ONCE(rcu_state.gp_flags);
>
> >							Thanx, Paul
> >
> > > Signed-off-by: Wei Yang
> > > ---
> > >  kernel/rcu/tree.c | 6 ++----
> > >  1 file changed, 2 insertions(+), 4 deletions(-)
> > >
> > > diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c
> > > index d91c9156fab2..f0d9251fa663 100644
> > > --- a/kernel/rcu/tree.c
> > > +++ b/kernel/rcu/tree.c
> > > @@ -3364,7 +3364,6 @@ void rcu_cpu_starting(unsigned int cpu)
> > >  {
> > >  	unsigned long flags;
> > >  	unsigned long mask;
> > > -	int nbits;
> > >  	unsigned long oldmask;
> > >  	struct rcu_data *rdp;
> > >  	struct rcu_node *rnp;
> > > @@ -3381,10 +3380,9 @@ void rcu_cpu_starting(unsigned int cpu)
> > >  	rnp->qsmaskinitnext |= mask;
> > >  	oldmask = rnp->expmaskinitnext;
> > >  	rnp->expmaskinitnext |= mask;
> > > -	oldmask ^= rnp->expmaskinitnext;
> > > -	nbits = bitmap_weight(&oldmask, BITS_PER_LONG);
> > >  	/* Allow lockless access for expedited grace periods. */
> > > -	smp_store_release(&rcu_state.ncpus, rcu_state.ncpus + nbits); /* ^^^ */
> > > +	if (!(oldmask & mask))
> > > +		smp_store_release(&rcu_state.ncpus, rcu_state.ncpus + 1); /* ^^^ */
> > >  	rcu_gpnum_ovf(rnp, rdp); /* Offline-induced counter wrap? */
> > >  	rdp->rcu_onl_gp_seq = READ_ONCE(rcu_state.gp_seq);
> > >  	rdp->rcu_onl_gp_flags = READ_ONCE(rcu_state.gp_flags);
> > > --
> > > 2.23.0
> > >
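
The thread rests on one claim: when mask has exactly one bit set, counting the bits that change when mask is ORed into expmaskinitnext gives the same result as testing whether that bit was already set. A minimal userspace sketch of that equivalence follows. It is not kernel code. The function names are hypothetical, and __builtin_popcountl() (a GCC/Clang builtin) is assumed here as a stand-in for the kernel's bitmap_weight().

```c
#include <assert.h>
#include <limits.h>
#include <stdbool.h>
#include <stddef.h>

/* Old approach: count the bits that changed when mask was ORed in.
 * __builtin_popcountl() is an assumed stand-in for bitmap_weight(). */
int delta_by_popcount(unsigned long expmask, unsigned long mask)
{
	unsigned long changed = expmask ^ (expmask | mask);

	return __builtin_popcountl(changed);
}

/* New approach: with exactly one bit set in mask, the delta is 0 or 1. */
int delta_by_test(unsigned long expmask, unsigned long mask)
{
	bool newcpu;

	/* Precondition from the thread: mask has exactly one bit set. */
	assert(mask != 0 && (mask & (mask - 1)) == 0);

	newcpu = !(expmask & mask);
	return newcpu;
}

/* Compare both approaches for every single-bit mask against a few
 * sample values of the existing mask; returns the number of mismatches. */
int count_mismatches(void)
{
	static const unsigned long samples[] = { 0UL, 1UL, 0x5UL, 0xf0UL, ~0UL };
	int mismatches = 0;

	for (size_t i = 0; i < sizeof(samples) / sizeof(samples[0]); i++) {
		for (unsigned int bit = 0; bit < sizeof(unsigned long) * CHAR_BIT; bit++) {
			unsigned long mask = 1UL << bit;

			if (delta_by_popcount(samples[i], mask) !=
			    delta_by_test(samples[i], mask))
				mismatches++;
		}
	}
	return mismatches;
}
```

Calling count_mismatches() from a small main() and checking that it returns zero exercises every single-bit mask. The sketch covers only the arithmetic. It says nothing about the memory ordering that smp_store_release() provides in the real function.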