Received: by 2002:a25:31c3:0:0:0:0:0 with SMTP id x186csp3402417ybx; Fri, 8 Nov 2019 19:44:01 -0800 (PST) X-Google-Smtp-Source: APXvYqxUk4hkmOrqAN9xaLLAKgL29CNVeng/kM1sU9U6tlwJMMK8SOr87/U7Xvs6t6l+uf01Nc4I X-Received: by 2002:aa7:d652:: with SMTP id v18mr14108266edr.184.1573271041241; Fri, 08 Nov 2019 19:44:01 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1573271041; cv=none; d=google.com; s=arc-20160816; b=DsVBxC7HffZM3kfZEvtlr62eLlsDiTz3keYoqYDBAX1nNYSuIVwYgJ6GsB7/SoP0kj vZfrgdzF3pWmnij1evPuUvGF/p3XNYANR7Sh0/6hIxw0gDhfEk33meTRnLYtqT8GDQNI zZJBAZHR/r3xmwp4v2GEahmdsn3e0S9ntKpd6rLAwPCgNHrBLsfxhRrsefwTorac76sd bIbKuaFvLvRvsjJFXuzDH6XgOEm/XsO5XBfuLSPUuVTEesPERzYi2iu1ZZeSc6g+bSP3 U8C1wSPc3eX1/NF0568DdtyL7h/o+DKC5wAPeoFgFjIcLn7cBzd4AG7fFkz0KZqsF3Wm KBZQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=LEQk7sFAoN8CWP8jvTuW+bl0ja9oHBLlsIXGwrkQBgw=; b=LuKkcSOoGBynn+g7UaBdEJI4jT7L3d4sQHbooVgYsYs+PIcYZ5lMmo0YJfb7w0Iw6Q 9kQqYycaHPUWvBGYiuuvNH++/SAep25QJlKi2HXRbq7zr7WXLIVcR1uwS7IC2oqz3BPk B5ZZ7+0ksnt7fDeDDBIzXexN7f264QXNLOeKSSpbWIT95wjNLjTs3TtgxdrVFkZUmtvX xcby/c0Zg9Ct1tsX7t+j0MsmjTDkBayKZwpu1AfiLPs3nFZItlVrbxdd7vbVHdMreNj+ J5ZmNetQZsAqqL+2IBLbDg792vK+3zn3NeiTEzejlSWveAWKrHAXez+ZCItyXCnAgs7k yMSg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=pcaAEX9T; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id g17si5448898edh.379.2019.11.08.19.43.37; Fri, 08 Nov 2019 19:44:01 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=pcaAEX9T; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726135AbfKIDmw (ORCPT + 99 others); Fri, 8 Nov 2019 22:42:52 -0500 Received: from mail-yw1-f68.google.com ([209.85.161.68]:38787 "EHLO mail-yw1-f68.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725895AbfKIDmw (ORCPT ); Fri, 8 Nov 2019 22:42:52 -0500 Received: by mail-yw1-f68.google.com with SMTP id s6so3034809ywe.5 for ; Fri, 08 Nov 2019 19:42:51 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=LEQk7sFAoN8CWP8jvTuW+bl0ja9oHBLlsIXGwrkQBgw=; b=pcaAEX9THRPG51rpy1I+pgw0cWw+N8IrYBja/ek9sUwGb+zm7wMEoheVy1nBpfraFe kZsq5tNPZaXWJ4lBkGt36OiNdQUGyngdmEwuFWlQW9Ujr08CScd696vQNGGdmXBXLonP 43W6dDloj+a/N+u+QQ59R32kZlvA2BucQtP9wwUdvuB8qAAOR0AffLaktMhLFEkzRmiJ K7FeUS8BgQ2S4wp5wN+0/27s9GqWc/rVTaBdTgVVrcyxv/HZCzhQpoPIVCLshKDoBCh0 geeLon2l6kMQt2PijHYEUBxT553RDPqRncquLtmwbAloQ9TPOBS2S95wKpKQakre2c9f HdoA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=LEQk7sFAoN8CWP8jvTuW+bl0ja9oHBLlsIXGwrkQBgw=; b=WtAPXukRVAnRQ2YLu7nCYaw6ZigPhINfc6OvgkgAQ+LuPhiSK3hwsgugAtCyNWyvBY 3qPq65qzMP5AljmMGbtSpE4c6pka9x2AemiUWcdin0A5MFcvl/ZhxST2SRzbzvS8X7wY jlRlMabjn5G5vTzY+q3XGuFrKKANXR+ANBwmEF/hKCuWSnZwAfrtcZAENrS/8LNKDari gDPUHQUp5IL8b+Yq10TvKYwNGmirOCUT/zbKPLGfg6xMsxJlluwUtxsXwtxuaW78430g bmx5VHjARLd8N2ooAOfwiGgeeWpGPoemZLv/sVRAgguHik7QGLmrxbK+XnN7aWisklpk Bzaw== X-Gm-Message-State: APjAAAVXjNyovxUPH/slrHRulJ28dcxdp0SbTxrntrhrOkt80GjnLQwh b88Cy28Jhc1a3nLhIYsJTFuA2S1g3/FtbgXDEKc= X-Received: by 2002:a81:1881:: with SMTP id 123mr9470290ywy.287.1573270970767; Fri, 08 Nov 2019 19:42:50 -0800 (PST) MIME-Version: 1.0 References: <0000000000009b403005942237bf@google.com> <27b57b11aadf1bd41ad8326101713ca0be7b8edf.camel@gmail.com> In-Reply-To: From: Balbir Singh Date: Sat, 9 Nov 2019 14:42:39 +1100 Message-ID: Subject: Re: KCSAN: data-race in taskstats_exit / taskstats_exit To: Dmitry Vyukov Cc: Marco Elver , syzbot , LKML , syzkaller-bugs Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Nov 8, 2019 at 7:55 PM Dmitry Vyukov wrote: > > On Fri, Nov 8, 2019 at 1:54 AM Balbir Singh wrote: > > > > On Wed, 2019-11-06 at 11:23 +0100, Marco Elver wrote: > > > On Wed, 6 Nov 2019 at 01:10, Balbir Singh wrote: > > > > > > > > On Fri, 2019-10-04 at 21:26 -0700, syzbot wrote: > > > > > Hello, > > > > > > > > > > syzbot found the following crash on: > > > > > > > > > > HEAD commit: b4bd9343 x86, kcsan: Enable KCSAN for x86 > > > > > git tree: https://github.com/google/ktsan.git kcsan > > > > > console output: https://syzkaller.appspot.com/x/log.txt?x=125329db600000 > > > > > kernel config: > > > > > https://syzkaller.appspot.com/x/.config?x=c0906aa620713d80 > > > > > dashboard link: > > > > > https://syzkaller.appspot.com/bug?extid=c5d03165a1bd1dead0c1 > > > > > compiler: gcc (GCC) 9.0.0 20181231 (experimental) > > > > > > > > > > Unfortunately, I don't have any reproducer for this crash yet. > > > > > > > > > > IMPORTANT: if you fix the bug, please add the following tag to the > > > > > commit: > > > > > Reported-by: syzbot+c5d03165a1bd1dead0c1@syzkaller.appspotmail.com > > > > > > > > > > ================================================================== > > > > > BUG: KCSAN: data-race in taskstats_exit / taskstats_exit > > > > > > > > > > write to 0xffff8881157bbe10 of 8 bytes by task 7951 on cpu 0: > > > > > taskstats_tgid_alloc kernel/taskstats.c:567 [inline] > > > > > taskstats_exit+0x6b7/0x717 kernel/taskstats.c:596 > > > > > do_exit+0x2c2/0x18e0 kernel/exit.c:864 > > > > > do_group_exit+0xb4/0x1c0 kernel/exit.c:983 > > > > > get_signal+0x2a2/0x1320 kernel/signal.c:2734 > > > > > do_signal+0x3b/0xc00 arch/x86/kernel/signal.c:815 > > > > > exit_to_usermode_loop+0x250/0x2c0 arch/x86/entry/common.c:159 > > > > > prepare_exit_to_usermode arch/x86/entry/common.c:194 [inline] > > > > > syscall_return_slowpath arch/x86/entry/common.c:274 [inline] > > > > > do_syscall_64+0x2d7/0x2f0 arch/x86/entry/common.c:299 > > > > > entry_SYSCALL_64_after_hwframe+0x44/0xa9 > > > > > > > > > > read to 0xffff8881157bbe10 of 8 bytes by task 7949 on cpu 1: > > > > > taskstats_tgid_alloc kernel/taskstats.c:559 [inline] > > > > > taskstats_exit+0xb2/0x717 kernel/taskstats.c:596 > > > > > do_exit+0x2c2/0x18e0 kernel/exit.c:864 > > > > > do_group_exit+0xb4/0x1c0 kernel/exit.c:983 > > > > > __do_sys_exit_group kernel/exit.c:994 [inline] > > > > > __se_sys_exit_group kernel/exit.c:992 [inline] > > > > > __x64_sys_exit_group+0x2e/0x30 kernel/exit.c:992 > > > > > do_syscall_64+0xcf/0x2f0 arch/x86/entry/common.c:296 > > > > > entry_SYSCALL_64_after_hwframe+0x44/0xa9 > > > > > > > > > > > > > Sorry I've been away and just catching up with email > > > > > > > > I don't think this is a bug, if I interpret the report correctly it shows > > > > a > > > > race > > > > > > > > static struct taskstats *taskstats_tgid_alloc(struct task_struct *tsk) > > > > { > > > > struct signal_struct *sig = tsk->signal; > > > > struct taskstats *stats; > > > > > > > > #1 if (sig->stats || thread_group_empty(tsk)) <- the check of sig- > > > > >stats > > > > goto ret; > > > > > > > > /* No problem if kmem_cache_zalloc() fails */ > > > > stats = kmem_cache_zalloc(taskstats_cache, GFP_KERNEL); > > > > > > > > spin_lock_irq(&tsk->sighand->siglock); > > > > if (!sig->stats) { > > > > #2 sig->stats = stats; <- here in setting sig->stats > > > > stats = NULL; > > > > } > > > > spin_unlock_irq(&tsk->sighand->siglock); > > > > > > > > if (stats) > > > > kmem_cache_free(taskstats_cache, stats); > > > > ret: > > > > return sig->stats; > > > > } > > > > > > > > The worst case scenario is that we might see sig->stats as being NULL when > > > > two > > > > threads belonging to the same tgid. We do free up stats if we got that > > > > wrong > > > > > > > > Am I misinterpreting the report? > > > > > > > > Balbir Singh. > > > > > > The plain concurrent reads/writes are a data race, which may manifest > > > in various undefined behaviour due to compiler optimizations [1, 2]. > > > Note that, "data race" does not necessarily imply "race condition"; > > > some data races are race conditions (usually the more interesting > > > bugs) -- but not *all* data races are race conditions (sometimes > > > referred to as "benign races"). KCSAN reports data races according to > > > the LKMM. > > > [1] https://lwn.net/Articles/793253/ > > > [2] https://lwn.net/Articles/799218/ > > > > > > If there is no race condition here that warrants heavier > > > synchronization (locking etc.), we still have the data race which > > > needs fixing by using marked atomic operations (READ_ONCE, WRITE_ONCE, > > > atomic_t, etc.). We also need to consider memory ordering requirements > > > (do we need smp_*mb(), smp_load_acquire/smp_store_release, ..)? > > > > > > In the case here, the pattern is double-checked locking, which is > > > incorrect without atomic operations and the correct memory ordering. > > > There is a lengthy discussion here: > > > > > https://lore.kernel.org/lkml/20191021113327.22365-1-christian.brauner@ubuntu.com/ > > > > > > > > > > I am still not convinced unless someone can prove that unsigned long reads are > > non-atomic, > > None of the relevant standards guarantee this. C standard very > explicitly states the opposite - any data race renders behavior of the > program as undefined. LKMM does not give any defined behavior to data > races either. QED ;) > Fair enough! I am going to have a read the specification, in the meanwhile, I agree changes are needed based on what you've just stated above Balbir > > acquire/release and barriers semantics don't matter because the > > code deals with the race inside of a lock if the read was spurious, The > > assumption is based on the face that sig->stats can be NULL or the kzalloc'ed > > value in all cases. > > > > Balbir Singh. > > > > -- > > You received this message because you are subscribed to the Google Groups "syzkaller-bugs" group. > > To unsubscribe from this group and stop receiving emails from it, send an email to syzkaller-bugs+unsubscribe@googlegroups.com. > > To view this discussion on the web visit https://groups.google.com/d/msgid/syzkaller-bugs/acd6b0d98a7ebcb4ead9b263ec5c568c5a747166.camel%40gmail.com.