Received: by 2002:a25:868d:0:0:0:0:0 with SMTP id z13csp2049724ybk; Mon, 11 May 2020 10:38:31 -0700 (PDT) X-Google-Smtp-Source: APiQypJeOkQSeoFBQ4M2zHwCqUnJRKaIyYd5Rw0sKWGc4gVltVaz1FGaxrdsXifb1hPHQSY1FXCG X-Received: by 2002:aa7:ce17:: with SMTP id d23mr15014422edv.89.1589218710986; Mon, 11 May 2020 10:38:30 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1589218710; cv=none; d=google.com; s=arc-20160816; b=gkyG4lHZphgirGX8CKFQne5b7NbZuN3CZfDdGCYccM714FjV1mS9Dc6ga9VzWPxk5M Ok/PuDX6g5BIR0ffUuCht6SU6Zjj+JGzS/lgrp2V8C+cqwKz+dUSc3s2zFHQ+t/civ0m NSpc/yYB99zu/8u/pjzlTiG35awgJNHsxOglTlQLxrvynSQGnDUNPrg7FW+h3D+2omzk fZleU4KD5Y4/y1IhhNr6ba3rmkQMDdJzGuHMTqnaeU9JCbB4aJW6E0EQQ+VAF0Zk+ajt BEUonDwcBY/Sn4GiEuUej8jzCztDKQEgVuqbHRl9FajDFCqDLG/Xl+PmzA0f1hSOjbK5 cO/Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature; bh=p539FHoZ90RTUtOujWNx2OV34RGSmrhBjMXEcsohfeg=; b=cXQvYD4Qmqiwe5aZny+8bShejHN/NRxea0Qr1tBRfunslfxFMxvsx1ZQA8BWCnfE7C zjaaz6mdkrmuB+bI0G8kmk8FewyS/5t1iQs8HFOYZCb3CJu8W6iH3tm3R9fEM2HKm3n/ scTOqUMCEXVS3P9tsOpGW3EsdNh8R1wUOg5iTWIaad8dvktSAg0pCNZ3M/A/HHvyE7Ud bdN7MC4CINIecXwI3hxiHKgdYRhGNom+Yx0CtHlzplwJOepW9Fbcl+W318M/JHErI0xI Iez32X37Ac+YRJNi/WMxijT0JDcW431rOegKDHdbDE3U1gT8eGakbg8GsYjpfc+pnQnT GJ7Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=EdQthZ7e; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id d13si1161250eds.27.2020.05.11.10.38.08; Mon, 11 May 2020 10:38:30 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=EdQthZ7e; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730953AbgEKReS (ORCPT + 99 others); Mon, 11 May 2020 13:34:18 -0400 Received: from mail.kernel.org ([198.145.29.99]:53820 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1730215AbgEKReS (ORCPT ); Mon, 11 May 2020 13:34:18 -0400 Received: from willie-the-truck (236.31.169.217.in-addr.arpa [217.169.31.236]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id DDD55206D6; Mon, 11 May 2020 17:34:16 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1589218458; bh=5Z7Qml4RQ3MPjBIIlhjG06U4sRXkHJ7HmXF/gF2FyYA=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=EdQthZ7eA7y0JgM/tZ+KCl24Ng8Bqo+RH6ZUw0/pUa6bsjFROzPLsAdUyLLKhucWD ooStrur1zZCuMCMttUQJo0AlyECjIT+5s8oKGQgXXQRXbmKMR29POVgfK9F3z50oLo YLpV7pljTGX1/UEr6FwPgktdYJ9dM+PjNekvXls4= Date: Mon, 11 May 2020 18:34:13 +0100 From: Will Deacon To: "Paul E. McKenney" Cc: Qian Cai , Elver Marco , LKML , Ingo Molnar , "Peter Zijlstra (Intel)" Subject: Re: [PATCH -next v2] locking/osq_lock: annotate a data race in osq_lock Message-ID: <20200511173412.GC23081@willie-the-truck> References: <20200509161217.GN2869@paulmck-ThinkPad-P72> <45D9EEEB-D887-485D-9045-417A7F2C6A1A@lca.pw> <20200509213654.GO2869@paulmck-ThinkPad-P72> <20200511155812.GB22270@willie-the-truck> <20200511164319.GV2869@paulmck-ThinkPad-P72> <20200511165216.GA23081@willie-the-truck> <20200511172918.GW2869@paulmck-ThinkPad-P72> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20200511172918.GW2869@paulmck-ThinkPad-P72> User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, May 11, 2020 at 10:29:18AM -0700, Paul E. McKenney wrote: > On Mon, May 11, 2020 at 05:52:17PM +0100, Will Deacon wrote: > > On Mon, May 11, 2020 at 09:43:19AM -0700, Paul E. McKenney wrote: > > > On Mon, May 11, 2020 at 04:58:13PM +0100, Will Deacon wrote: > > > > On Sat, May 09, 2020 at 02:36:54PM -0700, Paul E. McKenney wrote: > > > > > diff --git a/kernel/locking/osq_lock.c b/kernel/locking/osq_lock.c > > > > > index 1f77349..1de006e 100644 > > > > > --- a/kernel/locking/osq_lock.c > > > > > +++ b/kernel/locking/osq_lock.c > > > > > @@ -154,7 +154,11 @@ bool osq_lock(struct optimistic_spin_queue *lock) > > > > > */ > > > > > > > > > > for (;;) { > > > > > - if (prev->next == node && > > > > > + /* > > > > > + * cpu_relax() below implies a compiler barrier which would > > > > > + * prevent this comparison being optimized away. > > > > > + */ > > > > > + if (data_race(prev->next) == node && > > > > > cmpxchg(&prev->next, node, NULL) == node) > > > > > break; > > > > > > > > I'm fine with the data_race() placement, but I don't find the comment > > > > very helpful. We assign the result of a READ_ONCE() to 'prev' in the > > > > loop, so I don't think that the cpu_relax() is really relevant. > > > > > > Suppose that the compiler loaded a value that was not equal to "node". > > > In that case, the cmpxchg() won't happen, so something else must force > > > the compiler to do the reload in order to avoid an infinite loop, right? > > > Or am I missing something here? > > > > Then we just go round the loop and reload prev: > > > > prev = READ_ONCE(node->prev); > > > > which should be enough to stop the compiler, no? > > Yes, that would also work. Either have the cpu_relax() or a barrier() > or whatever on the one hand, or, as you say, turn the data_race() into > a READ_ONCE(). I personally prefer the READ_ONCE() myself, unless that > would undesirably suppress other KCSAN warnings. No, I mean here is the code after this patch is applied: for (;;) { if (data_race(prev->next) == node && cmpxchg(&prev->next, node, NULL) == node) break; /* * We can only fail the cmpxchg() racing against an unlock(), * in which case we should observe @node->locked becomming * true. */ if (smp_load_acquire(&node->locked)) return true; cpu_relax(); /* * Or we race against a concurrent unqueue()'s step-B, in which * case its step-C will write us a new @node->prev pointer. */ prev = READ_ONCE(node->prev); } I'm saying that this READ_ONCE at the end of the loop should be sufficient to stop the compiler making value assumptions about prev->next. Do you agree? Will