Received: by 2002:ac0:a582:0:0:0:0:0 with SMTP id m2-v6csp5646922imm; Tue, 16 Oct 2018 13:42:06 -0700 (PDT) X-Google-Smtp-Source: ACcGV61sKAnRAz4dxZNu/YVUKU2XIav9SkzPmP9VG73gE9K/W8k85ucQsbJxqjVfn4lOzjBxH+g+ X-Received: by 2002:a63:1520:: with SMTP id v32-v6mr21980467pgl.150.1539722526503; Tue, 16 Oct 2018 13:42:06 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1539722526; cv=none; d=google.com; s=arc-20160816; b=iPPRdkglPEaOY/Nc1SunpgJRWeBKJsfc14OTSUK1PReAN4LW5CZdQGA2vm6of4iqcE goJQ6+7pe9uiSigwTCmFXpJMCTHGf4tQlhikJw4kuF7eIdpnbedaLZC5doUbOoD59M3B 0hfiJPmPB662D9iToD+drt5+1ZMHtW1fMGtSiIemtzZTDYE7l/yzg5cTTGTYek7pcN3f suQKWjFJD0hTCdlRHNHMBTGvhXG+JIbfNQbRsO5zFJ/sAwIeFczDZbmY2h/1pV2aX0ex aQ2Ocaw6Ow0m1I8/UfH+drcMZgZLbmerQ9u9SxNuVENLuckd5Mndn+xGgdlnnPeLpyAl kLyA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature; bh=cDAz38rqSWZEMZJsggvonncLFKvAtEhJAuUaJpMxy74=; b=KDSeW9riUPnWfz9FK0hErUr5IVFnIWa0U/R18fK4mYFJm+GhyK1b5OlpP8p7pXRIwv h4PHeTMFrYuVWJO2+UBhYPUDLagJ5hn8vnFwA3Rio9fyueyXZkdNSoSh0kxcXgmpSrAw ln5gnJGyZbyfbqmhOMw7PwPcgU3uFoGjgMS59wE054l9seOk4o8HRH5VgaepM7DVI8UC Vftxwiv4AesGDqFWgeZFdbqU8DEYsI/3ixFVlxZcc1zh+wwovmnR49kk3ERJKo8Pkzhy 7XRQDpBLiN1ybX8O1pnKkzhlQ+XOHGzWpTTsJQaFvRkDE9IVPxpUImQSyARqrPkD3y7w W6Gg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@joelfernandes.org header.s=google header.b=f8UEHrGy; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id h128-v6si2206321pfc.165.2018.10.16.13.41.50; Tue, 16 Oct 2018 13:42:06 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@joelfernandes.org header.s=google header.b=f8UEHrGy; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727062AbeJQEdf (ORCPT + 99 others); Wed, 17 Oct 2018 00:33:35 -0400 Received: from mail-pg1-f193.google.com ([209.85.215.193]:39306 "EHLO mail-pg1-f193.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726168AbeJQEde (ORCPT ); Wed, 17 Oct 2018 00:33:34 -0400 Received: by mail-pg1-f193.google.com with SMTP id r9-v6so11405141pgv.6 for ; Tue, 16 Oct 2018 13:41:25 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=joelfernandes.org; s=google; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=cDAz38rqSWZEMZJsggvonncLFKvAtEhJAuUaJpMxy74=; b=f8UEHrGyJpVqEfjGd92DNzTWyngG0PsMqmHuTU33P0HMwSw5+8gVwocR0AOynlbcE6 zuu0q8nGDnBBtCR1fSFUc6Z39L0eVwZBl6oAewbzQljt4lPz1VgwbAv6MzO/AG+Y9oLV XlYt38KAKwPscpzb4yVsmKn/S8mo4ExBtbxdo= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to:user-agent; bh=cDAz38rqSWZEMZJsggvonncLFKvAtEhJAuUaJpMxy74=; b=Mv6X67VLUQXCoTrokTGG6FLtX+UBD3HjvL+l4Ak71bzQ9fEENwDxyjH4v7a+l4UgUD b3e/PMkhGvAVD6HcJgg9Lrpo6Rm+9FcOrR3ZXcxkuC/Jj+2wrwcHfDfaB2xvnB8ja+NE wmTjgGh9YQ++MT3iZEBgTjK4tuOnnGe3B+4sQQht6Xxfiet1UUIJu+//sIrOPvGfCEOi WVXznN82IDFwYhsK7pVC80UUoKrdxZCJvJT9SwXLtpg1FeWOKQErqjFS1vlN15xtr8zO DWbP9J4PAafQrh/hi9yRgqeuIThIoiUYrxZNXh5ftxryqX9Vaiuwll/AjEdx5Ji9L6Tp 882A== X-Gm-Message-State: ABuFfogeG+CpNBXFSpoMSbemt6NQpon3qGMkWT08+iGVjGdiUCIKQQHP 08FjlWhv7e8owiwTvzTzgxu3449+940= X-Received: by 2002:a63:d917:: with SMTP id r23-v6mr22018718pgg.0.1539722484850; Tue, 16 Oct 2018 13:41:24 -0700 (PDT) Received: from localhost ([2620:0:1000:1601:3aef:314f:b9ea:889f]) by smtp.gmail.com with ESMTPSA id c21-v6sm29034309pfe.93.2018.10.16.13.41.23 (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Tue, 16 Oct 2018 13:41:23 -0700 (PDT) Date: Tue, 16 Oct 2018 13:41:22 -0700 From: Joel Fernandes To: "Paul E. McKenney" Cc: Nikolay Borisov , linux-kernel@vger.kernel.org, Jonathan Corbet , Josh Triplett , Lai Jiangshan , linux-doc@vger.kernel.org, Mathieu Desnoyers , Steven Rostedt Subject: Re: [PATCH RFC] doc: rcu: remove obsolete (non-)requirement about disabling preemption Message-ID: <20181016204122.GA8176@joelaf.mtv.corp.google.com> References: <20181015021349.GB217384@joelaf.mtv.corp.google.com> <20181015023328.GP2674@linux.ibm.com> <20181015024758.GA227989@joelaf.mtv.corp.google.com> <5151da01-343b-bb37-353e-b6652ae530f5@suse.com> <20181015112112.GT2674@linux.ibm.com> <20181015193951.GA33528@joelaf.mtv.corp.google.com> <20181015195426.GD2674@linux.ibm.com> <20181015201556.GA43575@joelaf.mtv.corp.google.com> <20181015210856.GE2674@linux.ibm.com> <20181016112611.GA27405@linux.ibm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20181016112611.GA27405@linux.ibm.com> User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Oct 16, 2018 at 04:26:11AM -0700, Paul E. McKenney wrote: > On Mon, Oct 15, 2018 at 02:08:56PM -0700, Paul E. McKenney wrote: > > On Mon, Oct 15, 2018 at 01:15:56PM -0700, Joel Fernandes wrote: > > > On Mon, Oct 15, 2018 at 12:54:26PM -0700, Paul E. McKenney wrote: > > > [...] > > > > > > In any case, please don't spin for milliseconds with preemption disabled. > > > > > > The real-time guys are unlikely to be happy with you if you do this! > > > > > > > > > > Well just to clarify, I was just running Oleg's test which did this. This > > > > > test was mentioned in the original documentation that I deleted. Ofcourse I > > > > > would not dare do such a thing in production code :-D. I guess to Oleg's > > > > > defense, he did it to very that synchronize_rcu() was not blocked on > > > > > preempt-disable sections which was a different test. > > > > > > > > Understood! Just pointing out that RCU's tolerating a given action does > > > > not necessarily mean that it is a good idea to take that action. ;-) > > > > > > Makes sense :-) thanks. > > > > Don't worry, that won't happen again. ;-) > > > > > > > > > > + pr_crit("SPIN done!\n"); > > > > > > > > + preempt_enable(); > > > > > > > > + break; > > > > > > > > + case 777: > > > > > > > > + pr_crit("SYNC start\n"); > > > > > > > > + synchronize_rcu(); > > > > > > > > + pr_crit("SYNC done!\n"); > > > > > > > > > > > > > > But you are using the console printing infrastructure which is rather > > > > > > > heavyweight. Try replacing pr_* calls with trace_printk so that you > > > > > > > write to the lock-free ring buffer, this will reduce the noise from the > > > > > > > heavy console printing infrastructure. > > > > > > > > > > > > And this might be a problem as well. > > > > > > > > > > This was not the issue (or atleast not fully the issue) since I saw the same > > > > > thing with trace_printk. It was exactly what you said - which is the > > > > > excessively long preempt disabled times. > > > > > > > > One approach would be to apply this patch against (say) v4.18, which > > > > does not have consolidated grace periods. You might then be able to > > > > tell if the pr_crit() calls make any difference. > > > > > > I could do that, yeah. But since the original problem went away due to > > > disabling preempts for a short while, I will move on and continue to focus on > > > updating other parts of the documenation. Just to mention I > > > brought this up because I thought its better to do that than not to, just > > > incase there is any lurking issue with the consolidation. Sorry if that ended > > > up with me being noisy. > > > > Not a problem, no need to apologize! > > Besides, digging through the code did point out a reasonable optimization. > In the common case, this would buy 100s of microseconds rather than > milliseconds, but it seems simple enough to be worthwhile. Thoughts? Cool, thanks. One comment below: > ------------------------------------------------------------------------ > > commit 07921e8720907f58f82b142f2027fc56d5abdbfd > Author: Paul E. McKenney > Date: Tue Oct 16 04:12:58 2018 -0700 > > rcu: Speed up expedited GPs when interrupting RCU reader > > In PREEMPT kernels, an expedited grace period might send an IPI to a > CPU that is executing an RCU read-side critical section. In that case, > it would be nice if the rcu_read_unlock() directly interacted with the > RCU core code to immediately report the quiescent state. And this does > happen in the case where the reader has been preempted. But it would > also be a nice performance optimization if immediate reporting also > happened in the preemption-free case. > > This commit therefore adds an ->exp_hint field to the task_struct structure's > ->rcu_read_unlock_special field. The IPI handler sets this hint when > it has interrupted an RCU read-side critical section, and this causes > the outermost rcu_read_unlock() call to invoke rcu_read_unlock_special(), > which, if preemption is enabled, reports the quiescent state immediately. > If preemption is disabled, then the report is required to be deferred > until preemption (or bottom halves or interrupts or whatever) is re-enabled. > > Because this is a hint, it does nothing for more complicated cases. For > example, if the IPI interrupts an RCU reader, but interrupts are disabled > across the rcu_read_unlock(), but another rcu_read_lock() is executed > before interrupts are re-enabled, the hint will already have been cleared. > If you do crazy things like this, reporting will be deferred until some > later RCU_SOFTIRQ handler, context switch, cond_resched(), or similar. > > Reported-by: Joel Fernandes > Signed-off-by: Paul E. McKenney > > diff --git a/include/linux/sched.h b/include/linux/sched.h > index 004ca21f7e80..64ce751b5fe9 100644 > --- a/include/linux/sched.h > +++ b/include/linux/sched.h > @@ -571,8 +571,10 @@ union rcu_special { > struct { > u8 blocked; > u8 need_qs; > + u8 exp_hint; /* Hint for performance. */ > + u8 pad; /* No garbage from compiler! */ > } b; /* Bits. */ > - u16 s; /* Set of bits. */ > + u32 s; /* Set of bits. */ > }; > > enum perf_event_task_context { > diff --git a/kernel/rcu/tree_exp.h b/kernel/rcu/tree_exp.h > index e669ccf3751b..928fe5893a57 100644 > --- a/kernel/rcu/tree_exp.h > +++ b/kernel/rcu/tree_exp.h > @@ -692,8 +692,10 @@ static void sync_rcu_exp_handler(void *unused) > */ > if (t->rcu_read_lock_nesting > 0) { > raw_spin_lock_irqsave_rcu_node(rnp, flags); > - if (rnp->expmask & rdp->grpmask) > + if (rnp->expmask & rdp->grpmask) { > rdp->deferred_qs = true; > + WRITE_ONCE(t->rcu_read_unlock_special.b.exp_hint, true); > + } > raw_spin_unlock_irqrestore_rcu_node(rnp, flags); > } > > diff --git a/kernel/rcu/tree_plugin.h b/kernel/rcu/tree_plugin.h > index 8b48bb7c224c..d6286eb6e77e 100644 > --- a/kernel/rcu/tree_plugin.h > +++ b/kernel/rcu/tree_plugin.h > @@ -643,8 +643,9 @@ static void rcu_read_unlock_special(struct task_struct *t) > local_irq_save(flags); > irqs_were_disabled = irqs_disabled_flags(flags); > if ((preempt_bh_were_disabled || irqs_were_disabled) && > - t->rcu_read_unlock_special.b.blocked) { > + t->rcu_read_unlock_special.s) { > /* Need to defer quiescent state until everything is enabled. */ > + WRITE_ONCE(t->rcu_read_unlock_special.b.exp_hint, false); > raise_softirq_irqoff(RCU_SOFTIRQ); Still going through this patch, but it seems to me like the fact that rcu_read_unlock_special is called means someone has requested for a grace period. Then in that case, does it not make sense to raise the softirq for processing anyway? thanks, - Joel