Received: by 2002:a25:6193:0:0:0:0:0 with SMTP id v141csp770552ybb; Fri, 20 Mar 2020 07:50:33 -0700 (PDT) X-Google-Smtp-Source: ADFU+vtqJfQVsHDdhdPPrb9cANw4lYkCYk4TIzdKmjuuK76Ga124drMedu4Ihc9le1zplc+odAbt X-Received: by 2002:a9d:6b1a:: with SMTP id g26mr6782551otp.2.1584715833365; Fri, 20 Mar 2020 07:50:33 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1584715833; cv=none; d=google.com; s=arc-20160816; b=gH91z5tl1T4TASpTv6MqQNqTo2ijoeJIGtvCzFDX4cFgWfc3l8D/jyM/q9+cm7ANNV 5C6pirxU0aQ8jYdWsUo1S6Mc9fEWcJkkK+Kd6xLpOJo0zKJN8zTnW1DPhr3aDcJuxUl3 LrP1nMuz2Bgy8C7ULDKihZjfN+HfZlPimsHA4L6whr7hDSkebyWWsywGghbR7R5pGGM9 LMVJfvma01E0vpph4HMwGBTSWXmutM5GhP+qp4wZEYsQ35Vbn+9A33L3dm4DaDViFEie PQSnSo9AfAvYZX5+dxPDHRDhLKDBdALGIGf06KvpWEiooKgSyU8GplieViWdx8veFLpF np4Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=sYXSjYRj4e0el0aohJdLv/kFhC+73J+gTXrl9WkNGlo=; b=CinMef1AHmvfv8cUsS+uZmZrsclijcKAshuFyQM+O+kSHz1Och/l7Te3oMZAKR6AMv T91sfUVSGOb/S7MROtyrrbA8p0KyOn8KbW3EzhHqdkVt2YIN/tE5TU8Md0HC2zwkMEM/ 1BykzgKZYgxkY2Li7yyqBTVh4hrDABZp5gSH04IxeQTlPS46QdmwjZ3Svq0vocyTy25i Ata1Z88sCFYI4wAiLwzdgfEtbWUZGU7g65qYKh6ty9rVjmHl5UbRpQaSTTFVr2ED8Tm7 W3vRy1nHSDQEzBWOL0mG/+K9aq18fOZTOp5CL2TxtXzrBXElYxIGZfg/Lz4oRnJTYQQN Ep4Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=Q1KfnjtV; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id x14si3217650oto.47.2020.03.20.07.50.19; Fri, 20 Mar 2020 07:50:33 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=Q1KfnjtV; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727025AbgCTOtp (ORCPT + 99 others); Fri, 20 Mar 2020 10:49:45 -0400 Received: from mail-oi1-f195.google.com ([209.85.167.195]:39430 "EHLO mail-oi1-f195.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726847AbgCTOtp (ORCPT ); Fri, 20 Mar 2020 10:49:45 -0400 Received: by mail-oi1-f195.google.com with SMTP id d63so6711265oig.6 for ; Fri, 20 Mar 2020 07:49:43 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=sYXSjYRj4e0el0aohJdLv/kFhC+73J+gTXrl9WkNGlo=; b=Q1KfnjtVII6PYpugCOKxtrLL0L4mjq+hGVg0rgiOafOitBfys4rpAwof4WUh2lRyoV NVqVgcgifsYKikjjER+ma7tz+13R5TwoY42l2FNwabCHKjW6NwAeW9NdpML1NiY1npbI g2BqxoWj2xgC1lbjhYsb6x+lOeA4McwvirWkXHDORmoWHVlbNCOgA/PPUpwYWfN+ETQs NPGeVUbreN09mxOs/rkQ3zC3jPgIeOvyAWnmgRkLJw41K79FyBWFkJ1elj5Ow2EQDFAG mEVuI8PuApd/pU4CGRX2dbzuex3orKqSO/zinAh9gnjZ7+HfwLjYr0XpjkCRomGydcPZ B+Og== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=sYXSjYRj4e0el0aohJdLv/kFhC+73J+gTXrl9WkNGlo=; b=W/NVu4z54uS7VtnLce2v6/7su5KaktlsI6cemVUDn1iNhqdfSoaEEj88H0UJ0feSGe LR7B/lr7j64SdcLzsTXd9m8prcVeQOXZBXyCNj/rf7b0KHbcG7W6aoY7wEma0GQenKMv hxVErrr7KuLqytHz6nD5SwVSxYl+7AEtsMdFVU6TcVTjVUWDzorfrKgGPAXLCPfzdYT2 JaJ2uzwe4kqefGGq6GHqfJBiA61iisFxFxGPMgUtKN/qwqHCnCoG8MGGrwQDTf5wPLNH p4s19Kq5wWzG9sa3AsnpqWnA9gLY4KaNa0X7tyG5rWAGpE48TWjU7+oZfqc95tYjAII/ kb6Q== X-Gm-Message-State: ANhLgQ0jCZYcgFAFpmXUepal1QaozyKvnr+gaw4b2o9qyTIydtogFZxP U1xsuTG47wYdFHIn5dMRhijUkPobxX+YKBmTN46GPA== X-Received: by 2002:aca:3089:: with SMTP id w131mr6911074oiw.121.1584715783013; Fri, 20 Mar 2020 07:49:43 -0700 (PDT) MIME-Version: 1.0 References: <20200309190359.GA5822@paulmck-ThinkPad-P72> <20200309190420.6100-17-paulmck@kernel.org> <20200313085220.GC105953@debian-boqun.qqnc3lrjykvubdpftowmye0fmh.lx.internal.cloudapp.net> <20200314022210.GD105953@debian-boqun.qqnc3lrjykvubdpftowmye0fmh.lx.internal.cloudapp.net> <20200319032326.GE105953@debian-boqun.qqnc3lrjykvubdpftowmye0fmh.lx.internal.cloudapp.net> In-Reply-To: <20200319032326.GE105953@debian-boqun.qqnc3lrjykvubdpftowmye0fmh.lx.internal.cloudapp.net> From: Marco Elver Date: Fri, 20 Mar 2020 15:49:31 +0100 Message-ID: Subject: Re: [PATCH kcsan 17/32] kcsan: Introduce ASSERT_EXCLUSIVE_* macros To: Boqun Feng Cc: "Paul E. McKenney" , LKML , kasan-dev , kernel-team@fb.com, Ingo Molnar , Andrey Konovalov , Alexander Potapenko , Dmitry Vyukov , Qian Cai Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, 19 Mar 2020 at 04:23, Boqun Feng wrote: > > On Tue, Mar 17, 2020 at 12:12:36PM +0100, Marco Elver wrote: > > On Sat, 14 Mar 2020 at 03:22, Boqun Feng wrote: > > > > > > On Fri, Mar 13, 2020 at 05:15:32PM +0100, Marco Elver wrote: > > > > On Fri, 13 Mar 2020 at 09:52, Boqun Feng wrote: > > > > > > > > > > Hi Marco, > > > > > > > > > > On Mon, Mar 09, 2020 at 12:04:05PM -0700, paulmck@kernel.org wrote: > > > > > > From: Marco Elver > > > > > > > > > > > > Introduces ASSERT_EXCLUSIVE_WRITER and ASSERT_EXCLUSIVE_ACCESS, which > > > > > > may be used to assert properties of synchronization logic, where > > > > > > violation cannot be detected as a normal data race. > > > > > > > > > > > > Examples of the reports that may be generated: > > > > > > > > > > > > ================================================================== > > > > > > BUG: KCSAN: assert: race in test_thread / test_thread > > > > > > > > > > > > write to 0xffffffffab3d1540 of 8 bytes by task 466 on cpu 2: > > > > > > test_thread+0x8d/0x111 > > > > > > debugfs_write.cold+0x32/0x44 > > > > > > ... > > > > > > > > > > > > assert no writes to 0xffffffffab3d1540 of 8 bytes by task 464 on cpu 0: > > > > > > test_thread+0xa3/0x111 > > > > > > debugfs_write.cold+0x32/0x44 > > > > > > ... > > > > > > ================================================================== > > > > > > > > > > > > ================================================================== > > > > > > BUG: KCSAN: assert: race in test_thread / test_thread > > > > > > > > > > > > assert no accesses to 0xffffffffab3d1540 of 8 bytes by task 465 on cpu 1: > > > > > > test_thread+0xb9/0x111 > > > > > > debugfs_write.cold+0x32/0x44 > > > > > > ... > > > > > > > > > > > > read to 0xffffffffab3d1540 of 8 bytes by task 464 on cpu 0: > > > > > > test_thread+0x77/0x111 > > > > > > debugfs_write.cold+0x32/0x44 > > > > > > ... > > > > > > ================================================================== > > > > > > > > > > > > Signed-off-by: Marco Elver > > > > > > Suggested-by: Paul E. McKenney > > > > > > Signed-off-by: Paul E. McKenney > > > > > > --- > > > > > > include/linux/kcsan-checks.h | 40 ++++++++++++++++++++++++++++++++++++++++ > > > > > > 1 file changed, 40 insertions(+) > > > > > > > > > > > > diff --git a/include/linux/kcsan-checks.h b/include/linux/kcsan-checks.h > > > > > > index 5dcadc2..cf69617 100644 > > > > > > --- a/include/linux/kcsan-checks.h > > > > > > +++ b/include/linux/kcsan-checks.h > > > > > > @@ -96,4 +96,44 @@ static inline void kcsan_check_access(const volatile void *ptr, size_t size, > > > > > > kcsan_check_access(ptr, size, KCSAN_ACCESS_ATOMIC | KCSAN_ACCESS_WRITE) > > > > > > #endif > > > > > > > > > > > > +/** > > > > > > + * ASSERT_EXCLUSIVE_WRITER - assert no other threads are writing @var > > > > > > + * > > > > > > + * Assert that there are no other threads writing @var; other readers are > > > > > > + * allowed. This assertion can be used to specify properties of concurrent code, > > > > > > + * where violation cannot be detected as a normal data race. > > > > > > + * > > > > > > > > > > I like the idea that we can assert no other writers, however I think > > > > > assertions like ASSERT_EXCLUSIVE_WRITER() are a little limited. For > > > > > example, if we have the following code: > > > > > > > > > > preempt_disable(); > > > > > do_sth(); > > > > > raw_cpu_write(var, 1); > > > > > do_sth_else(); > > > > > preempt_enable(); > > > > > > > > > > we can add the assert to detect another potential writer like: > > > > > > > > > > preempt_disable(); > > > > > do_sth(); > > > > > ASSERT_EXCLUSIVE_WRITER(var); > > > > > raw_cpu_write(var, 1); > > > > > do_sth_else(); > > > > > preempt_enable(); > > > > > > > > > > , but, if I understand how KCSAN works correctly, it only works if the > > > > > another writer happens when the ASSERT_EXCLUSIVE_WRITER(var) is called, > > > > > IOW, it can only detect another writer between do_sth() and > > > > > raw_cpu_write(). But our intent is to prevent other writers for the > > > > > whole preemption-off section. With this assertion introduced, people may > > > > > end up with code like: > > > > > > > > To confirm: KCSAN will detect a race if it sets up a watchpoint on > > > > ASSERT_EXCLUSIVE_WRITER(var), and a concurrent write happens. Note > > > > that the watchpoints aren't always set up, but only periodically > > > > (discussed more below). For every watchpoint, we also inject an > > > > artificial delay. Pseudo-code: > > > > > > > > if watchpoint for access already set up { > > > > consume watchpoint; > > > > else if should set up watchpoint { > > > > setup watchpoint; > > > > udelay(...); > > > > check watchpoint consumed; > > > > release watchpoint; > > > > } > > > > > > > > > > Yes, I get this part. > > > > > > > > preempt_disable(); > > > > > ASSERT_EXCLUSIVE_WRITER(var); > > > > > do_sth(); > > > > > ASSERT_EXCLUSIVE_WRITER(var); > > > > > raw_cpu_write(var, 1); > > > > > ASSERT_EXCLUSIVE_WRITER(var); > > > > > do_sth_else(); > > > > > ASSERT_EXCLUSIVE_WRITER(var); > > > > > preempt_enable(); > > > > > > > > > > and that is horrible... > > > > > > > > It is, and I would strongly discourage any such use, because it's not > > > > necessary. See below. > > > > > > > > > So how about making a pair of annotations > > > > > ASSERT_EXCLUSIVE_WRITER_BEGIN() and ASSERT_EXCLUSIVE_WRITER_END(), so > > > > > that we can write code like: > > > > > > > > > > preempt_disable(); > > > > > ASSERT_EXCLUSIVE_WRITER_BEGIN(var); > > > > > do_sth(); > > > > > raw_cpu_write(var, 1); > > > > > do_sth_else(); > > > > > ASSERT_EXCLUSIVE_WRITER_END(var); > > > > > preempt_enable(); > > > > > > > > > > ASSERT_EXCLUSIVE_WRITER_BEGIN() could be a rough version of watchpoint > > > > > setting up and ASSERT_EXCLUSIVE_WRITER_END() could be watchpoint > > > > > removing. So I think it's feasible. > > > > > > > > Keep in mind that the time from ASSERT_EXCLUSIVE_WRITER_BEGIN to END > > > > might be on the order of a few nanosec, whereas KCSAN's default > > > > watchpoint delay is 10s of microsec (default ~80 for tasks). That > > > > means we would still have to set up a delay somewhere, and the few > > > > nanosec between BEGIN and END are insignificant and don't buy us > > > > anything. > > > > > > > > > > Yeah, the delay doesn't buy us anything given the default watchpoint > > > delay, and I agree even with *_{BEGIN/END}, we still need to set up a > > > delay somewhere. Adding a delay makes the watchpoint live longer so that > > > a problem will more likely happen, but sometimes the delay won't be > > > enough, considering another writer like: > > > > > > if (per_cpu(var, cpu) == 1) > > > per_cpu(var, cpu) = 0; > > > > > > in this user case, percpu variable "var" is used for maintaining some > > > state machine, and a CPU set a state with its own variable so that other > > > CPUs can consume it. And this another writer cannot be catched by: > > > > > > preempt_disable(); > > > do_sth(); > > > ASSERT_EXCLUSIVE_WRITER(var); > > > raw_cpu_write(var, 1); > > > do_sth_else(); > > > preempt_enable(); > > > > > > > Right, the example makes sense. > > > > That is assuming there are various other expected racy reads that are > > fine. If that's not true, ASSERT_EXCLUSIVE_ACCESS should be > > considered. > > > > > , no matter how long the delay is set. Another example: let's say the > > > do_sth_else() above is actually an operation that queues a callback > > > which writes to "var". In one version, do_sth_else() uses call_rcu(), > > > which works, because preemption-off is treated as RCU read-side critical > > > section, so we are fine. But if someone else changes it to queue_work() > > > for some reason, the code is just broken, and KCSAN cannot detect it, no > > > matter how long the delay is. > > > > > > To summarize, a delay is helpful to trigger a problem because it allows > > > _other_ CPU/threads to run more code and do more memory accesses, > > > however it's not helpful if a particular problem happens due to some > > > memory effects of the current/watched CPU/thread. While *_{BEGIN/END} > > > can be helpful in this case. > > > > Makes sense. > > > > > > Re feasibility: Right now setting up and removing watchpoints is not > > > > exposed, and doing something like this would be an extremely intrusive > > > > change. Because of that, without being able to quantify the actual > > > > usefulness of this, and having evaluated better options (see below), > > > > I'd recommend not pursuing this. > > > > > > > > > Thoughts? > > > > > > > > Firstly, what is your objective? From what I gather you want to > > > > increase the probability of detecting a race with 'var'. > > > > > > > > > > Right, I want to increase the probablity. > > > > > > > I agree, and have been thinking about it, but there are other options > > > > that haven't been exhausted, before we go and make the interface more > > > > complicated. > > > > > > > > == Interface design == > > > > The interface as it is right now, is intuitive and using it is hard to > > > > get wrong. Demanding begin/end markers introduces complexity that will > > > > > > Yeah, the interface is intuitive, however it's still an extra effort to > > > put those assertions, right? Which means it doesn't come for free, > > > compared to other detection KCSAN can do, the developers don't need to > > > put extra lines of code. Given the extra effort for developers to use > > > the detect, I think we should dicuss the design thoroughly. > > > > > > Besides the semantics of assertions is usually "do some checking right > > > now to see if things go wrong", and I don't think it quite matches the > > > semantics of an exclusive writer: "in this piece of code, I'm the only > > > one who can do the write". > > > > > > > undoubtedly result in incorrect usage, because as soon as you somehow > > > > forget to end the region, you'll get tons of false positives. This may > > > > be due to control-flow that was missed etc. We had a similar problem > > > > with seqlocks, and getting them to work correctly with KCSAN was > > > > extremely difficult, because clear begin and end markers weren't > > > > always given. I imagine introducing an interface like this will > > > > ultimately result in similar problems, as much as we'd like to believe > > > > this won't ever happen. > > > > > > > > > > Well, if we use *_{BEGIN,END} approach, one solution is combining them > > > with sections introducing primitives (such as preemp_disable() and > > > preempt_enable()), for example, we can add > > > > > > #define preempt_disable_for(var) \ > > > do { \ > > > preempt_disable(); \ > > > ASSERT_EXCLUSIVE_WRITER_BEGIN(var); \ > > > } > > > > > > #define preempt_enable_for(var) \ > > > do { \ > > > ASSERT_EXCLUSIVE_WRITER_END(var); \ > > > preempt_enable(); \ > > > } > > > > > > (similar for spin lock) > > > > > > #define spin_lock_for(lock, var) \ > > > do { \ > > > spin_lock(lock); \ > > > ASSERT_EXCLUSIVE_WRITER_BEGIN(var); \ > > > } > > > > > > #define spin_unlock_for(lock, var) \ > > > do { \ > > > ASSERT_EXCLUSIVE_WRITER_END(var); \ > > > spin_unlock(lock); \ > > > } > > > > > > I admit that I haven't thought this thoroughly, but I think this works, > > > and besides primitives like above can help the reader to understand the > > > questions like: what this lock/preemption-off critical sections are > > > protecting? > > > > I can't say anything about introducing even more macros. I'd say we > > need at least a dozen use-cases or more and understand them, otherwise > > we may end up with the wrong API that we can never take back. > > > > Agreed, real use-cases are needed for the justification of introducing > those APIs. > > > > Thoughts? > > > > Makes sense for the cases you described. > > > > Changing KCSAN to do this is a major change. On surface, it seems like > > a refactor and exporting some existing functionality, but there are > > various new corner cases, because now 2 accesses don't really have to > > be concurrent anymore to detect a race (and simple properties like a > > thread can't race with itself need to be taken care of). The existing > > ASSERT_EXCLUSIVE macros were able to leverage existing functionality > > mostly as-is. So, to motivate something like this, we need at least a > > dozen or so good use-cases, where careful placement of an existing > > ASSERT_EXCLUSIVE would not catch what you describe. > > > > Right, I think at this point, I'm not object to merging this into > kernel, using ASSERT_EXCLUSIVE_*() does provide more chances for us to > catch bugs. That said, I think it's better if we have some comments > describing the semantics (or the limitation) of the annotations to avoid > "mis-use"s (for example, using multiple ASSERT_EXCLUSIVE_WRITER()s for > one variables in a function). But that doesn't necessarily block the > merge of this feature, we can always do that later. > > Also, I think it's worthwhile to do some experiments on the > *_{BEGIN,END} interfaces. If you're interested and have cycles to work > on this, please let me, othwerwise, I can have a look at it. Because I really think the BEGIN/END interface is error-prone, I was curious if I could make this work: preempt_disable(); { ASSERT_EXCLUSIVE_WRITER_SCOPED(var); do_sth(); raw_cpu_write(var, 1); do_sth_else(); } preempt_enable(); Since all compilers that support TSAN instrumentation support attribute "cleanup", this works for KCSAN. Half-baked patch: https://github.com/melver/linux/commit/d5a510a80b9755909d8b2ccc7bcfdeac99fc0080 Since it's not too intrusive, I'd be fine including this. But I think you have to come up with the use-cases. :-) I can send the patch once I'm done with it. Thanks, -- Marco > Thanks! > > Regards, > Boqun > > > > Thanks, > > -- Marco > > > > > Regards, > > > Boqun > > > > [...]