Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755386AbdGSV4H (ORCPT ); Wed, 19 Jul 2017 17:56:07 -0400 Received: from mx0b-001b2d01.pphosted.com ([148.163.158.5]:32778 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1754034AbdGSV4F (ORCPT ); Wed, 19 Jul 2017 17:56:05 -0400 Date: Wed, 19 Jul 2017 14:56:02 -0700 From: "Paul E. McKenney" To: Akira Yokosawa Cc: linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH] documentation: Fix two-CPU control-dependency example Reply-To: paulmck@linux.vnet.ibm.com References: <0d60072f-3848-a2b7-ce18-e1ed317b5c09@gmail.com> <20170719174334.GH3730@linux.vnet.ibm.com> <101f5108-663e-7fa4-ac2b-e790320e4e6f@gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <101f5108-663e-7fa4-ac2b-e790320e4e6f@gmail.com> User-Agent: Mutt/1.5.21 (2010-09-15) X-TM-AS-GCONF: 00 x-cbid: 17071921-0008-0000-0000-00000261B830 X-IBM-SpamModules-Scores: X-IBM-SpamModules-Versions: BY=3.00007389; HX=3.00000241; KW=3.00000007; PH=3.00000004; SC=3.00000214; SDB=6.00889986; UDB=6.00444605; IPR=6.00670157; BA=6.00005479; NDR=6.00000001; ZLA=6.00000005; ZF=6.00000009; ZB=6.00000000; ZP=6.00000000; ZH=6.00000000; ZU=6.00000002; MB=3.00016282; XFM=3.00000015; UTC=2017-07-19 21:56:03 X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 17071921-0009-0000-0000-000036163ADB Message-Id: <20170719215602.GK3730@linux.vnet.ibm.com> X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:,, definitions=2017-07-19_15:,, signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 spamscore=0 suspectscore=0 malwarescore=0 phishscore=0 adultscore=0 bulkscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1706020000 definitions=main-1707190342 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 4939 Lines: 133 On Thu, Jul 20, 2017 at 06:33:26AM +0900, Akira Yokosawa wrote: > On 2017/07/20 2:43, Paul E. McKenney wrote: > > On Mon, Jul 17, 2017 at 05:24:42PM +0900, Akira Yokosawa wrote: > >> >From b798b9b631e237d285aa8699da00bfb8ced33bea Mon Sep 17 00:00:00 2001 > >> From: Akira Yokosawa > >> Date: Mon, 17 Jul 2017 16:25:33 +0900 > >> Subject: [PATCH] documentation: Fix two-CPU control-dependency example > >> > >> In commit 5646f7acc95f ("memory-barriers: Fix control-ordering > >> no-transitivity example"), the operator in "if" statement of > >> the two-CPU example was modified from ">=" to ">". > >> Now the example misses the point because there is no party > >> who will modify "x" nor "y". So each CPU performs only the > >> READ_ONCE(). > >> > >> The point of this example is to use control dependency for ordering, > >> and the WRITE_ONCE() should always be executed. > >> > >> So it was correct prior to the above mentioned commit. Partial > >> revert of the commit (with context adjustments regarding other > >> changes thereafter) restores the point. > >> > >> Note that the three-CPU example demonstrating the lack of > >> transitivity stands regardless of this partial revert. > >> > >> Signed-off-by: Akira Yokosawa > > > > Hello, Akira, > > > > You are quite right that many compilers would generate straightforward > > code for the code fragment below, and in that case, the assertion could > > never trigger due to either TSO or control dependencies, depending on > > the architecture this was running on. > > > > However, if the compiler was too smart for our good, it could figure > > out that "x" and "y" only take on the values zero and one, so that > > the "if" would always be taken. At that point, the compiler could > > simply ignore the "if" with the result shown below. > > > >> --- > >> Documentation/memory-barriers.txt | 2 +- > >> 1 file changed, 1 insertion(+), 1 deletion(-) > >> > >> diff --git a/Documentation/memory-barriers.txt b/Documentation/memory-barriers.txt > >> index c4ddfcd..c1ebe99 100644 > >> --- a/Documentation/memory-barriers.txt > >> +++ b/Documentation/memory-barriers.txt > >> @@ -851,7 +851,7 @@ demonstrated by two related examples, with the initial values of > >> CPU 0 CPU 1 > >> ======================= ======================= > >> r1 = READ_ONCE(x); r2 = READ_ONCE(y); > >> - if (r1 > 0) if (r2 > 0) > >> + if (r1 >= 0) if (r2 >= 0) > >> WRITE_ONCE(y, 1); WRITE_ONCE(x, 1); > >> > >> assert(!(r1 == 1 && r2 == 1)); > > > > Original program: > > > > CPU 0 CPU 1 > > ======================= ======================= > > r1 = READ_ONCE(x); r2 = READ_ONCE(y); > > if (r1 >= 0) if (r2 >= 0) > > WRITE_ONCE(y, 1); WRITE_ONCE(x, 1); > > > > assert(!(r1 == 1 && r2 == 1)); > > > > Enthusiastically optimized program: > > > > CPU 0 CPU 1 > > ======================= ======================= > > r1 = READ_ONCE(x); r2 = READ_ONCE(y); > > WRITE_ONCE(y, 1); WRITE_ONCE(x, 1); > > > > assert(!(r1 == 1 && r2 == 1)); > > > > This optimized version could trigger the assertion. > > > > So we do need to stick with the ">" comparison. > > Well but, > > Current example: > > CPU 0 CPU 1 > ======================= ======================= > r1 = READ_ONCE(x); r2 = READ_ONCE(y); > if (r1 > 0) if (r2 > 0) > WRITE_ONCE(y, 1); WRITE_ONCE(x, 1); > > assert(!(r1 == 1 && r2 == 1)); > > Such a clever compiler might be able to prove that "x" and "y" > are never modified and end up in the following: > > CPU 0 CPU 1 > ======================= ======================= > r1 = READ_ONCE(x); r2 = READ_ONCE(y); > > assert(!(r1 == 1 && r2 == 1)); > > This means it is impossible to describe this example in C, > doesn't it? That is a matter of some debate, but it has gotten increasingly more difficult to get C to do one's bidding over the decades. > What am I missing here? The compiler has to work harder in your example case, so it is probably just a matter of time. In the original with ">=", all the compiler needed to do was look at all the assignments and the initial value. In the original, it also had to do reasoning based on control dependencies (which are not yet part of the C standard). But yes, the degree to which a compiler can optimize atomics is a matter of some debate. Here is a proposal to let the developer choose: http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2016/p0062r1.html What are your thoughts on this? Thanx, Paul > Thanks, Akira > > > That said, I very much welcome critical reviews of memory-barriers.txt, > > so please do feel free to continue doing that! > > > > Thanx, Paul > > > > >