Date: Tue, 18 Apr 2017 12:44:53 -0700
From: Josh Triplett
To: "Paul E. McKenney"
Cc: linux-kernel@vger.kernel.org, mingo@kernel.org, jiangshanlai@gmail.com,
    dipankar@in.ibm.com, akpm@linux-foundation.org,
    mathieu.desnoyers@efficios.com, tglx@linutronix.de, peterz@infradead.org,
    rostedt@goodmis.org, dhowells@redhat.com, edumazet@google.com,
    fweisbec@gmail.com, oleg@redhat.com, bobby.prani@gmail.com
Subject: Re: [PATCH v2 tip/core/rcu 04/39] srcu: Check for tardy grace-period activity in cleanup_srcu_struct()
Message-ID: <20170418194453.GA12976@cloud>
References: <20170417234452.GB19013@linux.vnet.ibm.com>
    <1492472726-3841-4-git-send-email-paulmck@linux.vnet.ibm.com>
    <20170418003332.7ocpffvkr3okbhr7@x>
    <20170418003429.wa7qvhiy3vvccbxw@x>
    <20170418183453.GZ3956@linux.vnet.ibm.com>
In-Reply-To: <20170418183453.GZ3956@linux.vnet.ibm.com>
User-Agent: Mutt/1.5.23 (2014-03-12)

On Tue, Apr 18, 2017 at 11:34:53AM -0700, Paul E. McKenney wrote:
> On Mon, Apr 17, 2017 at 05:34:30PM -0700, Josh Triplett wrote:
> > On Mon, Apr 17, 2017 at 05:33:32PM -0700, Josh Triplett wrote:
> > > On Mon, Apr 17, 2017 at 04:44:51PM -0700, Paul E. McKenney wrote:
> > > > Users of SRCU are obliged to complete all grace-period activity before
> > > > invoking cleanup_srcu_struct(). This means that all calls to either
> > > > synchronize_srcu() or synchronize_srcu_expedited() must have returned,
> > > > and all calls to call_srcu() must have returned, and the last call to
> > > > call_srcu() must have been followed by a call to srcu_barrier().
> > > > Furthermore, the caller must have done something to prevent any
> > > > further calls to synchronize_srcu(), synchronize_srcu_expedited(),
> > > > and call_srcu().
> > > >
> > > > Therefore, if there has ever been an invocation of call_srcu() on
> > > > the srcu_struct in question, the sequence of events must be as
> > > > follows:
> > > >
> > > >     1. Prevent any further calls to call_srcu().
> > > >     2. Wait for any pre-existing call_srcu() invocations to return.
> > > >     3. Invoke srcu_barrier().
> > > >     4. It is now safe to invoke cleanup_srcu_struct().
> > > >
> > > > On the other hand, if there has ever been a call to synchronize_srcu()
> > > > or synchronize_srcu_expedited(), the sequence of events must be as
> > > > follows:
> > > >
> > > >     1. Prevent any further calls to synchronize_srcu() or
> > > >        synchronize_srcu_expedited().
> > > >     2. Wait for any pre-existing synchronize_srcu() or
> > > >        synchronize_srcu_expedited() invocations to return.
> > > >     3. It is now safe to invoke cleanup_srcu_struct().
> > > >
> > > > If there have been calls to both types of functions (call_srcu()
> > > > and either of synchronize_srcu() and synchronize_srcu_expedited()), then
> > > > the caller must do the first three steps of the call_srcu() procedure
> > > > above and the first two steps of the synchronize_s*() procedure above,
> > > > and only then invoke cleanup_srcu_struct().
> > >
> > > This commit message clearly explains the correct sequence for the
> > > client, but not which aspects of this the change now enforces. Some of
> > > the steps above remain the responsibility of the caller, while the
> > > callee now checks more of them. Could you add something at the end
> > > explaining the change and what it enforces?
> >
> > More importantly, perhaps this explanation could find its way into the
> > documentation of cleanup_srcu_struct?
>
> Like this?
>
> /**
>  * cleanup_srcu_struct - deconstruct a sleep-RCU structure
>  * @sp: structure to clean up.
>  *
>  * Must invoke this only after you are finished using a given srcu_struct
>  * that was initialized via init_srcu_struct(). This code does some
>  * probabilistic checking, spotting late uses of srcu_read_lock(),
>  * synchronize_srcu(), synchronize_srcu_expedited(), and call_srcu().
>  * If any such late uses are detected, the per-CPU memory associated with
>  * the srcu_struct is simply leaked and WARN_ON() is invoked. If the
>  * caller frees the srcu_struct itself, a use-after-free crash will likely
>  * ensue, but at least there will be a warning printed.
>  */
>
> I added the following paragraph to the commit log:
>
>     Note that cleanup_srcu_struct() does some probabilistic checks
>     for the caller failing to follow these procedures, in which
>     case cleanup_srcu_struct() does WARN_ON() and avoids freeing
>     the per-CPU structures associated with the specified srcu_struct
>     structure.
>
> And added your Reviewed-by, but please let me know if more is needed.

Looks good to me, thanks for improving the documentation.
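
For readers following along, the combined teardown ordering from the commit message can be sketched roughly as below. This is a userspace illustration only, not kernel code: srcu_barrier() and cleanup_srcu_struct() here are logging stand-ins that borrow the kernel names, and struct my_subsys, my_subsys_block_new_updates(), my_subsys_wait_for_updaters(), and my_subsys_teardown() are hypothetical names invented for the example. How a real caller blocks new updaters and waits for in-flight ones is entirely up to that caller.

```c
/*
 * Userspace sketch of the teardown ordering described in the commit
 * message.  The SRCU functions are logging stand-ins, NOT the kernel
 * implementations; everything named my_subsys_* is hypothetical.
 */
#include <assert.h>
#include <stdio.h>
#include <string.h>

struct srcu_struct { int cleaned_up; };	/* stand-in for the kernel type */

static char trace[256];			/* records the order of teardown steps */

static void log_step(const char *name)
{
	strncat(trace, name, sizeof(trace) - strlen(trace) - 1);
	strncat(trace, ";", sizeof(trace) - strlen(trace) - 1);
}

/* Stand-in: the real function waits for all queued call_srcu() callbacks. */
static void srcu_barrier(struct srcu_struct *sp)
{
	(void)sp;
	log_step("srcu_barrier");
}

/* Stand-in: the real function frees the per-CPU state (or warns and leaks). */
static void cleanup_srcu_struct(struct srcu_struct *sp)
{
	sp->cleaned_up = 1;
	log_step("cleanup_srcu_struct");
}

/* Hypothetical caller-side state wrapping an srcu_struct. */
struct my_subsys {
	struct srcu_struct srcu;
	int shutting_down;	/* set to refuse new updaters */
	int used_call_srcu;	/* nonzero if call_srcu() was ever used */
};

/* Step 1 of both procedures: prevent any further update-side calls. */
static void my_subsys_block_new_updates(struct my_subsys *s)
{
	s->shutting_down = 1;
	log_step("block_new_updates");
}

/* Step 2 of both procedures: wait for in-flight invocations to return. */
static void my_subsys_wait_for_updaters(struct my_subsys *s)
{
	(void)s;
	log_step("wait_for_updaters");
}

static void my_subsys_teardown(struct my_subsys *s)
{
	my_subsys_block_new_updates(s);
	my_subsys_wait_for_updaters(s);
	if (s->used_call_srcu)
		srcu_barrier(&s->srcu);	/* step 3 of the call_srcu() procedure */
	cleanup_srcu_struct(&s->srcu);	/* only now is cleanup safe */
}
```

Calling my_subsys_teardown() on a structure with used_call_srcu set runs the four steps in the order listed in the commit message; with it clear, the srcu_barrier() step is skipped, matching the synchronize_srcu()-only procedure.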