Received: by 2002:a05:6358:f14:b0:e5:3b68:ec04 with SMTP id b20csp1712806rwj; Sun, 18 Dec 2022 14:03:32 -0800 (PST) X-Google-Smtp-Source: AA0mqf6SaqmetLOK45bEgPCRqSZDQovnmUy+TV3yeWv9nFZ1YMoJlC7pbdiEuLTVPfVwe4xQWjt2 X-Received: by 2002:a17:906:39d7:b0:7ad:c8d7:4fd9 with SMTP id i23-20020a17090639d700b007adc8d74fd9mr31772258eje.36.1671401011992; Sun, 18 Dec 2022 14:03:31 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1671401011; cv=none; d=google.com; s=arc-20160816; b=ICQwpC4+0IR02j6zecm79GrmCY1nfNqXI+zG0FcnO/4NpwcC8NeDp+p7/Xuihf7L0h Lf2bwmFeVlSApropN+zaEOdoEq8rFWfsIZNCQJQphP3Ays2UWOtmFHKa61CDM9a6w1hq 6IRhOeQOjHPRZvoeP8yMIplzv9Eg8pJ78PZP1AeqpKJbAfxRxZLv+oTQE+J/1mloLR2p /bWIa+LgKlrcm9+MDL5PIlqBhusKklA9hVBK6OBOQfzUVLg8OnB0I90c8Ji5wUTXyBNj ttQXk4I4S5EUJqeQXN1NlFnjszP40++bWX115/gxwuFFtshFtimbTnTIgKQVOwSExs/n v1hQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id:dkim-signature; bh=872LnoBCESelenHSzac+Txz57fKSWodE2QCDbdsM524=; b=iWf3e/HhyDvJaP4asFMKCb6W2R37QLfovTURH1hFWfuxfs6XHdCm/Xh95pX+beCDS6 aDRtzoHpfCb1nEq9DNljC27I55CBVRRrjBJCSUroQMqql1+FGNPB1Pw8FLPaiieMsizR hh8C8iTI9ON9Kf3AvORb/WwgROQ+RnmSX1KyBnO4l0DmzQeXSQZcEeWamwdQ1v1SE1Cn nSR9mGOrMoxFmW/9SC+uQ7ni8OH5ao0JWtAA1LiN2TIzxmajQ1mmg9jfFAREL7v9jnWw h8iCXA89fzzXJxC+D6/sVrjvsBx6kmGrPYLBvmEOS3z3Ul8HNf5AbJW/RLMXS/UOby/C k2tg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@efficios.com header.s=smtpout1 header.b=BHXMdpIm; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=efficios.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id hu9-20020a170907a08900b0077d854aa10dsi8628859ejc.57.2022.12.18.14.03.14; Sun, 18 Dec 2022 14:03:31 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@efficios.com header.s=smtpout1 header.b=BHXMdpIm; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=efficios.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230445AbiLRVHn (ORCPT + 71 others); Sun, 18 Dec 2022 16:07:43 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:59842 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229507AbiLRVHl (ORCPT ); Sun, 18 Dec 2022 16:07:41 -0500 Received: from smtpout.efficios.com (unknown [IPv6:2607:5300:203:b2ee::31e5]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 2330760E1 for ; Sun, 18 Dec 2022 13:07:40 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=efficios.com; s=smtpout1; t=1671397659; bh=bzzA9VFUm6NqJzBX2Bls/aaFsWETWnz+FBiGd0+U0XU=; h=Date:Subject:To:Cc:References:From:In-Reply-To:From; b=BHXMdpImVf4HLS+Ql8/RN8Lluozm+JA+JFkfgRHd0SqPzaqpgJEAf2mceIkAJPMcV yerbMgxa8D66izgvwomHPAi6HzS6ajQKfdp4dbjzfkotdf7ehyCdvopNUdq5T2ufXm dtDkVtcGTaRCrgE3I55VDDCf/oNo/KdZaetAZyPObjuC1RAxV6BQIOXbYzBdc5bE6p JwlU4c7/pUVTegZge2SUoD+xVWzsUIPiTKgL+cX8CTGpv7nA4fTy/2ZiaWBeYbKd6U 0Tx/BbwbqZGzQ2EGozbCSPYOacUcCu4wJ/pPA4jH34RHONjarsijz5WP6q4Q+8Bc++ LUNEVTx3Mv3Bg== Received: from [10.1.0.203] (192-222-188-69.qc.cable.ebox.net [192.222.188.69]) by smtpout.efficios.com (Postfix) with ESMTPSA id 4NZwQf6jcCzbfL; Sun, 18 Dec 2022 16:07:38 -0500 (EST) Message-ID: Date: Sun, 18 Dec 2022 16:08:02 -0500 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.5.0 Subject: Re: [RFC 1/2] srcu: Remove comment about prior read lock counts Content-Language: en-US To: "Joel Fernandes (Google)" , linux-kernel@vger.kernel.org Cc: Josh Triplett , Lai Jiangshan , "Paul E. McKenney" , rcu@vger.kernel.org, Steven Rostedt References: <20221218191310.130904-1-joel@joelfernandes.org> <20221218191310.130904-2-joel@joelfernandes.org> From: Mathieu Desnoyers In-Reply-To: <20221218191310.130904-2-joel@joelfernandes.org> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-2.5 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,NICE_REPLY_A,RDNS_NONE, SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2022-12-18 14:13, Joel Fernandes (Google) wrote: > The comment says that if an updater saw lock count updates, then > ensure the reader does not see the new srcu_idx. > > However, there is no memory barrier between a READER reading srcu_idx > with respect to incrementing the lock count for that srcu_idx. > > So what is really happening is, both "B" and "C" will order the current > reader's unlock count update, and the _next_ readers lock count update, with > respect to the write to the currently active index. > > Consider first the case of the unlock count update being seen by the UPDATER: > > (for brevity, the pseudocode shortens "srcu_idx" to "idx") > > READER UPDATER > > rcu_read_lock() { > idx = READ(idx); > lock_count[idx]++; > > smp_mb(); // B > } > srcu_flip() { > smp_mb(); //E > idx++; > smp_mb(); > } > rcu_read_unlock() { > smp_mb(); // C > unlock_count[idx]++; > } > > Consider that the updater saw the unlock count update, and due to this, we > expect "E" to make sure that the reader only used the old srcu_idx. > > However, say the reader used the new srcu_idx because we dropped "E". That is > totally OK because both unlock and lock counts of this reader will negate each > other during the next scan of the srcu_idx. So we don't have to guarantee at > all that the reader used the old srcu_idx, that does not buy us anything > because if it used the new one, we would just ignore it during the next scan > anyway (the reader is "done"). > > Now lets look at the following case: > > READER UPDATER > > rcu_read_lock() { > idx = READ(idx); > lock_count[idx]++; > > smp_mb(); // B > } > > rcu_read_unlock() { > smp_mb(); // C > unlock_count[idx]++; > } > srcu_flip() { > smp_mb(); //E > idx++; > rcu_read_lock() { > idx = READ(idx); > lock_count[idx]++; > > smp_mb(); // B > smp_mb(); > } > } > > Consider that the updater saw the lock count update of the second > rcu_read_lock(). It does not matter that we guarantee that the reader sees only > the old srcu_idx. This is because, a reader could totally just sample > srcu_idx, and stay preempted for long periods of time. So, during any scan, we > already have the issue of a preempted-reader randomly springing up with a copy > of the index which we consider the "new index". So guaranteeing that the reader > saw the old srcu_idx instead of the new one if we saw its lock count updates, > also does not buy us anything. > > Due to these reasons, drop the argument that the reader has to see a certain > srcu_idx since we have no control over that anyway, and guaranteeing that does not > buy us anything. I don't understand why this first patch only removes a comment about the need to order things, when in fact it's the entire memory barrier /* E */ that is useless. I suspect we should just remove the comment along with the barrier without this added step. What SRCU fundamentally does is detect quiescence of all SRCU readers between the beginning of the grace period and its completion. In order to ensure forward progress, it does so in a two-phase algorithm. What the grace period does to detect quiescence is to observe that each of the periods (0/1) have no active reader at a given point in the grace period. Then the fact that the period is flipped to send new-coming readers into a different period is just to ensure forward progress, and is basically irrelevant (order-wise) with the fact that the grace period scans both periods and validates that both periods are observed to have no active readers. I'd favor just clarifying the SRCU model in this way, and then remove the useless barrier, and also implement improvements to the SRCU algorithm and skip the "flip" entirely when we notice (early) that no active readers are present in both periods. This is actually similar to what I implemented in side-rcu. Thoughts ? Thanks, Mathieu > > Signed-off-by: Joel Fernandes (Google) > --- > kernel/rcu/srcutree.c | 10 ++++------ > 1 file changed, 4 insertions(+), 6 deletions(-) > > diff --git a/kernel/rcu/srcutree.c b/kernel/rcu/srcutree.c > index 1c304fec89c0..d6a4c2439ca6 100644 > --- a/kernel/rcu/srcutree.c > +++ b/kernel/rcu/srcutree.c > @@ -983,12 +983,10 @@ static bool try_check_zero(struct srcu_struct *ssp, int idx, int trycount) > static void srcu_flip(struct srcu_struct *ssp) > { > /* > - * Ensure that if this updater saw a given reader's increment > - * from __srcu_read_lock(), that reader was using an old value > - * of ->srcu_idx. Also ensure that if a given reader sees the > - * new value of ->srcu_idx, this updater's earlier scans cannot > - * have seen that reader's increments (which is OK, because this > - * grace period need not wait on that reader). > + * Ensure that if a given reader sees the new value of ->srcu_idx, this > + * updater's earlier scans cannot have seen that reader's increments > + * (which is OK, because this grace period need not wait on that > + * reader). > */ > smp_mb(); /* E */ /* Pairs with B and C. */ > -- Mathieu Desnoyers EfficiOS Inc. https://www.efficios.com