Received: by 2002:ac0:a582:0:0:0:0:0 with SMTP id m2-v6csp3792209imm; Mon, 15 Oct 2018 04:23:09 -0700 (PDT) X-Google-Smtp-Source: ACcGV628XbkreQWlrAcXzqxmMjNSwsJD9S8uR8Ff3lFnxmVsejaq89KuFV38TwyY7rxeCGD2MTb8 X-Received: by 2002:a63:d70c:: with SMTP id d12-v6mr15445358pgg.110.1539602589735; Mon, 15 Oct 2018 04:23:09 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1539602589; cv=none; d=google.com; s=arc-20160816; b=fJN2XPr+HeXpPuT4tCONE36Lm8rZrtk6piLOAvqbrPAiWeEb5CFt7rOTL/yYYYHZwt qzQsHZaS1abGwbIgMxIAV3QdVOr2vCVzWh0vaiWe9ArP8UkDU0Cxyf3yg1SAYupu0jPG BhjgStcxKT864tcg2c/iWxU/a0E/vFaepbaqA8EeARIqJfMRmpIw0lmn+f1Zi/A5uv63 mK/VrzzKX1wezErj9Dt2zd/8tj46ZDCsQZuTqx+H49CBUl1yQRYY478vu2Md9Zsu6CDd I4STPpFM5uDacl8YwhFqAJNfirX+Iw3/FgvPQGxdCGXN8MQNc1Fz+N+M1IgVstZEhDui egIQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:user-agent:in-reply-to :content-disposition:mime-version:references:reply-to:subject:cc:to :from:date; bh=nAhQ5mJZJI2oYTlpR9DAr4e39EJ2WW33GvuRqQdxICY=; b=V5DmUv/OqB0NKPhLOzGd07fa7irg+J1fGckEo1xiPvY9rE/svaEavMB02Xmg22Uu12 IvbavOIgWbZWWl0Dafkl4hsXtMoHCEjVNFnId4JDVwAYl47c4mUpcWH/viwstCsZpgXA rA7zQmKC7rvJ4dF5VU0R5uM4QNh5h87/hOkPYrbX45Sb73S3/EyIgaSFAiAlwtDdjd30 6wJTcPKmGayxrF/PaTirQnukRL27PuNre4qaOQBgLHAaHnu3AtFvfxwpL0vQY4VAK+Kt 9HXjNzfnL1HEQaTMd2GvnGksZ3bw1DH1+FmlJI6iBdKpiY+3vhWGZ7TH+sPyVGxGBxLd 059w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id m23-v6si10276812pgk.424.2018.10.15.04.22.55; Mon, 15 Oct 2018 04:23:09 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726557AbeJOTGF (ORCPT + 99 others); Mon, 15 Oct 2018 15:06:05 -0400 Received: from mx0a-001b2d01.pphosted.com ([148.163.156.1]:40744 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726469AbeJOTGE (ORCPT ); Mon, 15 Oct 2018 15:06:04 -0400 Received: from pps.filterd (m0098409.ppops.net [127.0.0.1]) by mx0a-001b2d01.pphosted.com (8.16.0.22/8.16.0.22) with SMTP id w9FBK54I110963 for ; Mon, 15 Oct 2018 07:21:14 -0400 Received: from e15.ny.us.ibm.com (e15.ny.us.ibm.com [129.33.205.205]) by mx0a-001b2d01.pphosted.com with ESMTP id 2n4ss30cq1-1 (version=TLSv1.2 cipher=AES256-GCM-SHA384 bits=256 verify=NOT) for ; Mon, 15 Oct 2018 07:21:14 -0400 Received: from localhost by e15.ny.us.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for from ; Mon, 15 Oct 2018 07:21:13 -0400 Received: from b01cxnp23034.gho.pok.ibm.com (9.57.198.29) by e15.ny.us.ibm.com (146.89.104.202) with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted; (version=TLSv1/SSLv3 cipher=AES256-GCM-SHA384 bits=256/256) Mon, 15 Oct 2018 07:21:10 -0400 Received: from b01ledav003.gho.pok.ibm.com (b01ledav003.gho.pok.ibm.com [9.57.199.108]) by b01cxnp23034.gho.pok.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id w9FBL9ek29032548 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=FAIL); Mon, 15 Oct 2018 11:21:09 GMT Received: from b01ledav003.gho.pok.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id CE53CB2064; Mon, 15 Oct 2018 07:19:04 -0400 (EDT) Received: from b01ledav003.gho.pok.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 933DAB2065; Mon, 15 Oct 2018 07:19:04 -0400 (EDT) Received: from paulmck-ThinkPad-W541 (unknown [9.85.131.128]) by b01ledav003.gho.pok.ibm.com (Postfix) with ESMTP; Mon, 15 Oct 2018 07:19:04 -0400 (EDT) Received: by paulmck-ThinkPad-W541 (Postfix, from userid 1000) id ABB2716C0B33; Mon, 15 Oct 2018 04:21:12 -0700 (PDT) Date: Mon, 15 Oct 2018 04:21:12 -0700 From: "Paul E. McKenney" To: Nikolay Borisov Cc: Joel Fernandes , linux-kernel@vger.kernel.org, Jonathan Corbet , Josh Triplett , Lai Jiangshan , linux-doc@vger.kernel.org, Mathieu Desnoyers , Steven Rostedt Subject: Re: [PATCH RFC] doc: rcu: remove obsolete (non-)requirement about disabling preemption Reply-To: paulmck@linux.ibm.com References: <20181014212955.95267-1-joel@joelfernandes.org> <20181014231731.GN2674@linux.ibm.com> <20181015020827.GA217384@joelaf.mtv.corp.google.com> <20181015021349.GB217384@joelaf.mtv.corp.google.com> <20181015023328.GP2674@linux.ibm.com> <20181015024758.GA227989@joelaf.mtv.corp.google.com> <5151da01-343b-bb37-353e-b6652ae530f5@suse.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <5151da01-343b-bb37-353e-b6652ae530f5@suse.com> User-Agent: Mutt/1.5.21 (2010-09-15) X-TM-AS-GCONF: 00 x-cbid: 18101511-0068-0000-0000-0000034D4692 X-IBM-SpamModules-Scores: X-IBM-SpamModules-Versions: BY=3.00009879; HX=3.00000242; KW=3.00000007; PH=3.00000004; SC=3.00000268; SDB=6.01102982; UDB=6.00570882; IPR=6.00883033; MB=3.00023760; MTD=3.00000008; XFM=3.00000015; UTC=2018-10-15 11:21:12 X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 18101511-0069-0000-0000-00004613B833 Message-Id: <20181015112112.GT2674@linux.ibm.com> X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:,, definitions=2018-10-15_08:,, signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 priorityscore=1501 malwarescore=0 suspectscore=0 phishscore=0 bulkscore=0 spamscore=0 clxscore=1015 lowpriorityscore=0 mlxscore=0 impostorscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1807170000 definitions=main-1810150105 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Oct 15, 2018 at 09:05:22AM +0300, Nikolay Borisov wrote: > On 15.10.2018 05:47, Joel Fernandes wrote: > > On Sun, Oct 14, 2018 at 07:33:28PM -0700, Paul E. McKenney wrote: > >> On Sun, Oct 14, 2018 at 07:13:49PM -0700, Joel Fernandes wrote: > >>> On Sun, Oct 14, 2018 at 07:08:27PM -0700, Joel Fernandes wrote: > >>>> On Sun, Oct 14, 2018 at 04:17:31PM -0700, Paul E. McKenney wrote: > >>>>> On Sun, Oct 14, 2018 at 02:29:55PM -0700, Joel Fernandes (Google) wrote: > >>>>>> The Requirements.html document says "Disabling Preemption Does Not Block > >>>>>> Grace Periods". However this is no longer true with the RCU > >>>>>> consolidation. Lets remove the obsolete (non-)requirement entirely. > >>>>>> > >>>>>> Signed-off-by: Joel Fernandes (Google) > >>>>> > >>>>> Good catch, queued, thank you! > >>>> > >>>> Thanks! By the way after I sent the patch, I also tried Oleg's experiment to > >>>> confirm that this is indeed obsolete. :) > >>>> > >>>> One thing interesting came up when I tried synchronize_rcu_expedited() > >>>> instead of synchronize_rcu() in Oleg's experiment, I still saw a multiple > >>>> millisecond delay between when the rcu read section completely and the > >>>> synchronize_rcu_expedited returns: > >>>> > >>>> For example, with synchronize_rcu_expedited, the 'SPIN done' and the 'SYNC > >>>> done' are about 3 millisecond apart: > >>>> [ 77.599142] SPIN start > >>>> [ 77.601595] SYNC start > >>>> [ 82.604950] SPIN done! > >>>> [ 82.607836] SYNC done! > >>>> I saw anywhere from 2-6 milliseconds. > >>>> > >>>> The reason I bring this up is according to Requirements.html: In some cases, > >>>> the multi-millisecond synchronize_rcu() latencies are unacceptable. In these > >>>> cases, synchronize_rcu_expedited() may be used instead,.. so either I messed > >>>> something up in the experiment, or I need to update this part of the document ;-) > >> > >> In normal testing, 2-6 milliseconds is indeed excessive. Could you please > >> point me at Oleg's experiment? Also, what CONFIG_PREEMPT setting were > >> you using? (My guess is CONFIG_PREEMPT=y.) > > > > The CONFIG_PREEMPT config I am using is CONFIG_PREEMPT=y. > > > >>> So I realized I'm running in Qemu so it could also be a scheduling delay of > >>> the vcpu thread. So apologies about the noise if the experiment works fine > >>> for you. > >> > >> I used rcuperf, which might not be doing the same thing as Oleg's > >> experiment. > > > > The experiment is mentioned at: > > https://www.mail-archive.com/linux-kernel@vger.kernel.org/msg912055.html > > > > If you apply the below diff, it applies cleanly on rcu/dev. And then run: > > taskset 2 perl -e 'syscall 157, 666, 5000' & > > taskset 1 perl -e 'syscall 157, 777' > > > > diff --git a/kernel/sys.c b/kernel/sys.c > > index cf5c67533ff1..b654b7566ca3 100644 > > --- a/kernel/sys.c > > +++ b/kernel/sys.c > > @@ -2261,6 +2261,9 @@ int __weak arch_prctl_spec_ctrl_set(struct task_struct *t, unsigned long which, > > return -EINVAL; > > } > > > > +#include > > + > > + > > SYSCALL_DEFINE5(prctl, int, option, unsigned long, arg2, unsigned long, arg3, > > unsigned long, arg4, unsigned long, arg5) > > { > > @@ -2274,6 +2277,19 @@ SYSCALL_DEFINE5(prctl, int, option, unsigned long, arg2, unsigned long, arg3, > > > > error = 0; > > switch (option) { > > + case 666: > > + preempt_disable(); > > + pr_crit("SPIN start\n"); > > + while (arg2--) > > + mdelay(1); OK, this is the problem. When you spin in the kernel for several milliseconds with preemption disabled, the consolidated grace period is -required- to wait for this preemption-disabled reader to complete, whether expedited or not. So, expected behavior. ;-) In any case, please don't spin for milliseconds with preemption disabled. The real-time guys are unlikely to be happy with you if you do this! > > + pr_crit("SPIN done!\n"); > > + preempt_enable(); > > + break; > > + case 777: > > + pr_crit("SYNC start\n"); > > + synchronize_rcu(); > > + pr_crit("SYNC done!\n"); > > But you are using the console printing infrastructure which is rather > heavyweight. Try replacing pr_* calls with trace_printk so that you > write to the lock-free ring buffer, this will reduce the noise from the > heavy console printing infrastructure. And this might be a problem as well. Thanx, Paul > > + break; > > case PR_SET_PDEATHSIG: > > if (!valid_signal(arg2)) { > > error = -EINVAL; > > >