Received: by 2002:a25:4158:0:0:0:0:0 with SMTP id o85csp2916615yba; Mon, 8 Apr 2019 07:24:18 -0700 (PDT) X-Google-Smtp-Source: APXvYqxNPUsRej4St6zBBj3ebhZj5C+IfC88eHc8Pq3T27ZoO8IerqxDIpa0B66qAksr+EwjbXV5 X-Received: by 2002:a17:902:d70f:: with SMTP id w15mr30796424ply.134.1554733458903; Mon, 08 Apr 2019 07:24:18 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1554733458; cv=none; d=google.com; s=arc-20160816; b=ejrzhxFmDMUJ5kaici1Pa6VvqQgN1SK1aY8qVD5ND5Uoj6abMQXOyThBoaFk72YHIQ FekHbBa16cGar2enNe145+qfAOe+5u631aGtb0voUwx23+o+l07DgcLuwslMIi598lnp Trc0ORMuKhVJbHFWoblVUo81bWnw3aIqy14oF8p+SLq+nqAgodcVHZBCWOvHZD++g9M4 +kn08ifTbJqJl1UqY0m1qRLUTejOttFyoOO2wgagcQ83GCL2Qe47Vg0WHWnmlkyTOER2 0P3QXasmSGqDw/Cb83+AqVEPWZEC9/0eS2oPRasQw+FhZX0O4VDOFIIWHeorlTXD+ovq QUqQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:user-agent:in-reply-to :content-disposition:mime-version:references:reply-to:subject:cc:to :from:date; bh=YVwycvModfgkg5Yo68dsa2+YtHy6RugIbTyvFHW3a/k=; b=Gazz+KwRmGUEmn7gEkEKYyU9bVDLT/VZrNSCJcwjtxo7ZtptiAoHhSShtx9s4Cw/wj CIJH4MvAqIvCDzxZbKq+e3q7FBkH1kWtcORTKicrEe+NDISGOujEB8Rtg1SE90cIe0E5 no9gDFhFmH+cXX+XuD5XzpnoDPLJ8Nj6HtKalXzwURGK1AfeLTmQxv1i4q1iFviLOiiU ak9U2y3ioTinXy38yeXjBNuouztbhvClZU9kIHz9269rxJmMNTuzMQMtfyjOhKiCvPqq jTtPRQ+PVzied/f4hJYdmF0fYPSeJR1qB5ARJM1Ed2Fl/Pb9Le2fSZgAST2XUuuNOPSO YoYQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 3si11882720plo.300.2019.04.08.07.24.03; Mon, 08 Apr 2019 07:24:18 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727341AbfDHOWi (ORCPT + 99 others); Mon, 8 Apr 2019 10:22:38 -0400 Received: from mx0b-001b2d01.pphosted.com ([148.163.158.5]:42742 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726806AbfDHOWh (ORCPT ); Mon, 8 Apr 2019 10:22:37 -0400 Received: from pps.filterd (m0098419.ppops.net [127.0.0.1]) by mx0b-001b2d01.pphosted.com (8.16.0.27/8.16.0.27) with SMTP id x38EERrs002796 for ; Mon, 8 Apr 2019 10:22:36 -0400 Received: from e13.ny.us.ibm.com (e13.ny.us.ibm.com [129.33.205.203]) by mx0b-001b2d01.pphosted.com with ESMTP id 2rr72m3kf6-1 (version=TLSv1.2 cipher=AES256-GCM-SHA384 bits=256 verify=NOT) for ; Mon, 08 Apr 2019 10:22:35 -0400 Received: from localhost by e13.ny.us.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for from ; Mon, 8 Apr 2019 15:22:35 +0100 Received: from b01cxnp22036.gho.pok.ibm.com (9.57.198.26) by e13.ny.us.ibm.com (146.89.104.200) with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted; (version=TLSv1/SSLv3 cipher=AES256-GCM-SHA384 bits=256/256) Mon, 8 Apr 2019 15:22:29 +0100 Received: from b01ledav003.gho.pok.ibm.com (b01ledav003.gho.pok.ibm.com [9.57.199.108]) by b01cxnp22036.gho.pok.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id x38EMSrF33095754 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Mon, 8 Apr 2019 14:22:28 GMT Received: from b01ledav003.gho.pok.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 08215B2067; Mon, 8 Apr 2019 14:22:28 +0000 (GMT) Received: from b01ledav003.gho.pok.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id D5EAAB2064; Mon, 8 Apr 2019 14:22:27 +0000 (GMT) Received: from paulmck-ThinkPad-W541 (unknown [9.70.82.188]) by b01ledav003.gho.pok.ibm.com (Postfix) with ESMTP; Mon, 8 Apr 2019 14:22:27 +0000 (GMT) Received: by paulmck-ThinkPad-W541 (Postfix, from userid 1000) id CD5BC16C1171; Mon, 8 Apr 2019 07:22:30 -0700 (PDT) Date: Mon, 8 Apr 2019 07:22:30 -0700 From: "Paul E. McKenney" To: Mathieu Desnoyers Cc: "Joel Fernandes, Google" , rcu , linux-kernel , Ingo Molnar , Lai Jiangshan , dipankar , Andrew Morton , Josh Triplett , Thomas Gleixner , Peter Zijlstra , rostedt , David Howells , Eric Dumazet , fweisbec , Oleg Nesterov , linux-nvdimm , dri-devel , amd-gfx Subject: Re: [PATCH RFC tip/core/rcu 0/4] Forbid static SRCU use in modules Reply-To: paulmck@linux.ibm.com References: <20190402142816.GA13084@linux.ibm.com> <20190407133941.GC14111@linux.ibm.com> <20190407135937.GA30053@linux.ibm.com> <134026717.535.1554665176677.JavaMail.zimbra@efficios.com> <20190407193202.GA30934@localhost> <1632568795.549.1554669696728.JavaMail.zimbra@efficios.com> <20190407210718.GA6656@localhost> <20190408022728.GF14111@linux.ibm.com> <1504296005.857.1554728734661.JavaMail.zimbra@efficios.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1504296005.857.1554728734661.JavaMail.zimbra@efficios.com> User-Agent: Mutt/1.5.21 (2010-09-15) X-TM-AS-GCONF: 00 x-cbid: 19040814-0064-0000-0000-000003C7FEF1 X-IBM-SpamModules-Scores: X-IBM-SpamModules-Versions: BY=3.00010889; HX=3.00000242; KW=3.00000007; PH=3.00000004; SC=3.00000284; SDB=6.01186137; UDB=6.00621202; IPR=6.00966894; MB=3.00026343; MTD=3.00000008; XFM=3.00000015; UTC=2019-04-08 14:22:33 X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 19040814-0065-0000-0000-00003CFD615A Message-Id: <20190408142230.GJ14111@linux.ibm.com> X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:,, definitions=2019-04-08_05:,, signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 priorityscore=1501 malwarescore=0 suspectscore=2 phishscore=0 bulkscore=0 spamscore=0 clxscore=1015 lowpriorityscore=0 mlxscore=0 impostorscore=0 mlxlogscore=971 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1810050000 definitions=main-1904080120 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Apr 08, 2019 at 09:05:34AM -0400, Mathieu Desnoyers wrote: > ----- On Apr 7, 2019, at 10:27 PM, paulmck paulmck@linux.ibm.com wrote: > > > On Sun, Apr 07, 2019 at 09:07:18PM +0000, Joel Fernandes wrote: > >> On Sun, Apr 07, 2019 at 04:41:36PM -0400, Mathieu Desnoyers wrote: > >> > > >> > ----- On Apr 7, 2019, at 3:32 PM, Joel Fernandes, Google joel@joelfernandes.org > >> > wrote: > >> > > >> > > On Sun, Apr 07, 2019 at 03:26:16PM -0400, Mathieu Desnoyers wrote: > >> > >> ----- On Apr 7, 2019, at 9:59 AM, paulmck paulmck@linux.ibm.com wrote: > >> > >> > >> > >> > On Sun, Apr 07, 2019 at 06:39:41AM -0700, Paul E. McKenney wrote: > >> > >> >> On Sat, Apr 06, 2019 at 07:06:13PM -0400, Joel Fernandes wrote: > >> > >> > > >> > >> > [ . . . ] > >> > >> > > >> > >> >> > > diff --git a/include/asm-generic/vmlinux.lds.h > >> > >> >> > > b/include/asm-generic/vmlinux.lds.h > >> > >> >> > > index f8f6f04c4453..c2d919a1566e 100644 > >> > >> >> > > --- a/include/asm-generic/vmlinux.lds.h > >> > >> >> > > +++ b/include/asm-generic/vmlinux.lds.h > >> > >> >> > > @@ -338,6 +338,10 @@ > >> > >> >> > > KEEP(*(__tracepoints_ptrs)) /* Tracepoints: pointer array */ \ > >> > >> >> > > __stop___tracepoints_ptrs = .; \ > >> > >> >> > > *(__tracepoints_strings)/* Tracepoints: strings */ \ > >> > >> >> > > + . = ALIGN(8); \ > >> > >> >> > > + __start___srcu_struct = .; \ > >> > >> >> > > + *(___srcu_struct_ptrs) \ > >> > >> >> > > + __end___srcu_struct = .; \ > >> > >> >> > > } \ > >> > >> >> > > >> > >> >> > This vmlinux linker modification is not needed. I tested without it and srcu > >> > >> >> > torture works fine with rcutorture built as a module. Putting further prints > >> > >> >> > in kernel/module.c verified that the kernel is able to find the srcu structs > >> > >> >> > just fine. You could squash the below patch into this one or apply it on top > >> > >> >> > of the dev branch. > >> > >> >> > >> > >> >> Good point, given that otherwise FORTRAN named common blocks would not > >> > >> >> work. > >> > >> >> > >> > >> >> But isn't one advantage of leaving that stuff in the RO_DATA_SECTION() > >> > >> >> macro that it can be mapped read-only? Or am I suffering from excessive > >> > >> >> optimism? > >> > >> > > >> > >> > And to answer the other question, in the case where I am suffering from > >> > >> > excessive optimism, it should be a separate commit. Please see below > >> > >> > for the updated original commit thus far. > >> > >> > > >> > >> > And may I have your Tested-by? > >> > >> > >> > >> Just to confirm: does the cleanup performed in the modules going > >> > >> notifier end up acting as a barrier first before freeing the memory ? > >> > >> If not, is it explicitly stated that a barrier must be issued before > >> > >> module unload ? > >> > >> > >> > > > >> > > You mean rcu_barrier? It is mentioned in the documentation that this is the > >> > > responsibility of the module writer to prevent delays for all modules. > >> > > >> > It's a srcu barrier yes. Considering it would be a barrier specific to the > >> > srcu domain within that module, I don't see how it would cause delays for > >> > "all" modules if we implicitly issue the barrier on module unload. What > >> > am I missing ? > >> > >> Yes you are right. I thought of this after I just sent my email. I think it > >> makes sense for srcu case to do and could avoid a class of bugs. > > > > If there are call_srcu() callbacks outstanding, the module writer still > > needs the srcu_barrier() because otherwise callbacks arrive after > > the module text has gone, which will be disappoint the CPU when it > > tries fetching instructions that are no longer mapped. If there are > > no call_srcu() callbacks from that module, then there is no need for > > srcu_barrier() either way. > > > > So if an srcu_barrier() is needed, the module developer needs to > > supply it. > > When you say "callbacks arrive after the module text has gone", > I think you assume that free_module() is invoked before the > MODULE_STATE_GOING notifiers are called. But it's done in the > opposite order: going notifiers are called first, and then > free_module() is invoked. > > So AFAIU it would be safe to issue the srcu_barrier() from the module > going notifier. > > Or am I missing something ? We do seem to be talking past each other. ;-) This has nothing to do with the order of events at module-unload time. So please let me try again. If a given srcu_struct in a module never has call_srcu() invoked, there is no need to invoke rcu_barrier() at any time, whether at module-unload time or not. Adding rcu_barrier() in this case adds overhead and latency for no good reason. If a given srcu_struct in a module does have at least one call_srcu() invoked, it is already that module's responsibility to make sure that the code sticks around long enough for the callback to be invoked. This means that correct SRCU users that invoke call_srcu() already have srcu_barrier() at module-unload time. Incorrect SRCU users, with reasonable probability, now get a WARN_ON() at module-unload time, with the per-CPU state getting leaked. Before this change, they would (also with reasonable probability) instead get an instruction-fetch fault when the SRCU callback was invoked after the completion of the module unload. Furthermore, in all cases where they would previously have gotten the instruction-fetch fault, they now get the WARN_ON(), like this: if (WARN_ON(rcu_segcblist_n_cbs(&sdp->srcu_cblist))) return; /* Forgot srcu_barrier(), so just leak it! */ So this change already represents an improvement in usability. Thanx, Paul