Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752261AbdGaNeA (ORCPT ); Mon, 31 Jul 2017 09:34:00 -0400 Received: from bombadil.infradead.org ([65.50.211.133]:54077 "EHLO bombadil.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750954AbdGaNd6 (ORCPT ); Mon, 31 Jul 2017 09:33:58 -0400 Date: Mon, 31 Jul 2017 15:33:49 +0200 From: Peter Zijlstra To: Paolo Bonzini Cc: torvalds@linux-foundation.org, linux-kernel@vger.kernel.org, tglx@linutronix.de, mingo@kernel.org, dvyukov@google.com, hpa@zytor.com, linux-tip-commits@vger.kernel.org, Steven Rostedt , Dima Zavin Subject: Re: [tip:locking/urgent] locking/static_key: Fix concurrent static_key_slow_inc() Message-ID: <20170731133349.gphnv5pqkn2xibwf@hirez.programming.kicks-ass.net> References: <1466527937-69798-1-git-send-email-pbonzini@redhat.com> <20170731093612.7v23kxkk47i56io6@hirez.programming.kicks-ass.net> <671056c7-5fc7-60c0-4035-b11d43d95bf1@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <671056c7-5fc7-60c0-4035-b11d43d95bf1@redhat.com> User-Agent: NeoMutt/20170609 (1.8.3) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3087 Lines: 74 On Mon, Jul 31, 2017 at 03:04:06PM +0200, Paolo Bonzini wrote: > - key->enabled cannot go from 0 to nonzero outside jump_label_mutex. > For this reason the atomic_try_cmpxchg is unnecessary. Agreed, the only reason was the implied barrier. > - the (implied) smp_mb before jump_label_update is not interesting, but > I don't think it is useful because: 1) during the jump_label_update > there is no correspondence between what static_key_enabled returns and > what the text look like; 2) what would it even be pairing with? Ah, indeed. So I was worried about the text changes escaping upwards, but you're right in that there's no harm in that because there's nothing that cares. Another inc would see 0 and still serialize on the mutex. > - the smp_mb (though it could be a smp_wmb or atomic_set_release) > initially triggered my paranoia indeed. But even then, I can't see why > you would need it because there's nothing it pairs with. So this one would pair with the mb implied by the cmpxchg loop for inc-if-positive. The ordering being that if we see a positive v, we must then also see all the text modifications. So if jump_label_update() were to not fully serialize things, it would be possible for the v=1 store to appear before the last text changes. And in that case we'd allow the fast path to complete static_key_slow_inc() before it was in fact done with changing all text. Now, I suspect (but did not audit) that anything that changes text must in fact serialize world, but I wasn't sure. > Rather, it's *any use of key->enabled outside jump_label_lock* > (meaning: any use of static_key_enabled and static_key_count outside > the core jump_label.c code) that should be handled with care. > And indeed, while there aren't many, I think two of them are wrong (and > not fixed by your patch): > > - include/linux/cpuset.h defines nr_cpusets which uses static_key_count. > It makes no sense to call it outside cpuset_mutex, and indeed that's > how kernel/cgroup/cpuset.c uses it (nr_cpusets <- generate_sched_domains > <- rebuild_sched_domains_locked). The function should be moved inside > kernel/cgroup/cpuset.c since the mutex is static. Dima was poking at that code. > - net/ipv4/udp.c and net/ipv6/udp.c want to implement a "once-only" > increment of the static key. It's racy and maybe we should provide a > new API static_key_enable_forever: > > void static_key_enable_forever(struct static_key *key) > { > STATIC_KEY_CHECK_USE(); > if (atomic_read(&key->enabled) > 0) > return; > > cpus_read_lock(); > jump_label_lock(); > if (atomic_read(&key->enabled) == 0) { > atomic_set(&key->enabled, -1); > jump_label_update(key); > atomic_set(&key->enabled, 1); > } > jump_label_unlock(); > cpus_read_unlock(); > } > EXPORT_SYMBOL_GPL(static_key_enable_forever); > > I can prepare a patch if you agree. Isn't that what we have static_key_enable() for? Which btw also uses static_key_count() outside of the mutex.