Received: by 2002:a25:1985:0:0:0:0:0 with SMTP id 127csp130213ybz; Wed, 15 Apr 2020 05:55:24 -0700 (PDT) X-Google-Smtp-Source: APiQypIfwezg59WmIhTv3qgHC6cwrUo3VIAUq1vdSMdE9c9SJmKAPyGFkPvnhZxx2MJ0TzT+/6Ua X-Received: by 2002:a17:906:3b11:: with SMTP id g17mr4855530ejf.118.1586955324139; Wed, 15 Apr 2020 05:55:24 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1586955324; cv=none; d=google.com; s=arc-20160816; b=MIxZpGSQPQYWukbiXnN4OI2sW0eGVtQ4hhqC0Ys5ulOjxw/v62TxshX87r4AquDVHR Yy7jbIkTVes9S6nnXi8Z5mdNQMitsDXxHt+vis/yD729N4FQsPgGz/kwIpVS7kPDIXr2 a57dWcdbyyY6zwp2pF0g/HnrYXnAatgS49luThRSKsT/vtXF3oV9+nvJmsZEMIjfQrrj cNT5vRuEVRedoZS3b5SiK+IToo4lfa6K6UtbcT4+A+ZLZ61zYV5KcIvoxwxbtu5dffKT ba+I9caeqqx12DqYsNWNGb3icYnoqHDz1zw0HH5IdPZJR9JeR6kw7ieMl2IhOlx//plS p/Vg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=TEcf9WnegdoS4TK74rJ3jgH/4y0s3Rie/mbgUIU1/FE=; b=R6oEJ2/A/BWvWmOCSoieCev+qvgKvsyi0rmIKoVetsPYykXfGDKSO8hFHTM2P04wpx gnKQrZo17D2aypvFRAmGtt2kvB6h4GT3kZ+1s870OlRK0RxFpGOOd6Fk7OAUMAw7slHS /focpyLJV3K2ZFOMFVvhhSP0CV2C1CQCdhfuaQFq+sCc3H0kVWJKEywaiih+5/blA3GN s+FJxN8TNCPTrp3HmZ1jGYTdCHefQ3uH08fGncSZ669FDi8guq4xJlxjnJFbF7UuzYzM 5AkkmPObBvuTsYikKdEPWrLzwgCBa6VMcmJeoCRwGXlg37U5DTcs8MmLxdfZDqc0Dqhi owgg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=UAMvKl7x; spf=pass (google.com: best guess record for domain of linux-crypto-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-crypto-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id dk9si4278918edb.403.2020.04.15.05.54.50; Wed, 15 Apr 2020 05:55:24 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-crypto-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=UAMvKl7x; spf=pass (google.com: best guess record for domain of linux-crypto-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-crypto-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2502768AbgDNRuO (ORCPT + 99 others); Tue, 14 Apr 2020 13:50:14 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:41660 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-FAIL-OK-FAIL) by vger.kernel.org with ESMTP id S2502881AbgDNRuK (ORCPT ); Tue, 14 Apr 2020 13:50:10 -0400 Received: from mail-oi1-x243.google.com (mail-oi1-x243.google.com [IPv6:2607:f8b0:4864:20::243]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id DD001C061A0C for ; Tue, 14 Apr 2020 10:50:09 -0700 (PDT) Received: by mail-oi1-x243.google.com with SMTP id 8so944615oiy.6 for ; Tue, 14 Apr 2020 10:50:09 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=TEcf9WnegdoS4TK74rJ3jgH/4y0s3Rie/mbgUIU1/FE=; b=UAMvKl7xtSUqKv4GH9GoENT8bjniUZLfgA1J6ygO0hUfMB3Cq7HU3VRLr7MF3tNSBc 6acm2t0Th/b86RF+1RJx/BWkJWMT19MCvJUjMPxA6WoQDJ4po8xwBBNiKXkq5oAT6yE7 RW500W0rIeHDSvvoMWQxe/VoG1JMqhBEZ/hfisGAcN8iIRwXRvbUeC8pbC6sXSnC6eWl F7DwK+3hUK3Ea3Z9HSZw8mzXP81JyPKqkZueHq7U94LiXRGo9H3tfwgZV4P6918kZEHR o3p4ueeWK0Y9jozDlZEN/+/mux5LBnZqKHPo3C74Kxa/VN5G2idQ3qy6k2zJ2+SIEP6v oTKA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=TEcf9WnegdoS4TK74rJ3jgH/4y0s3Rie/mbgUIU1/FE=; b=C7tUli3YWGCzebbdfLhmbDNCx1cBXtFmf4V8Ro+zUYHoea5OMSIjLbTJf+qOPlDHO6 L1QPO7WpZgg/RzHYTNphBhZbzYKE0J64BwpWwxEN72gzJq8y7bR1TE+s9fv6EhEtOOBf HctyBefF7HTfibgz/MNPmVksC5UKAM/cIwDDxziqp8Jq+DqOwedpSeGgLI/g8Ww1CoB2 FrPq41ZYCu0ibDFC4g4FbJYB5Vk/WOD3Wb9WM6IaV4tNysFJrizPfriVs4bn40Ut5K4C AcvPjCbCqiPf3+jPwhiYww0bkYaHJJXr+sEW4GHpTqOfdY5cZBPcI9hvR4l4tniI/pHg 3GlA== X-Gm-Message-State: AGi0PubWnNTsApeWZXXTxDVqB3lVYtYUUDxgQNKXqg7W23wtVr4dw9/t z29XumhKr86h6FSP/Vwstht15EMjMSmPWCnxbOff2b5v X-Received: by 2002:a54:481a:: with SMTP id j26mr16403284oij.172.1586886608876; Tue, 14 Apr 2020 10:50:08 -0700 (PDT) MIME-Version: 1.0 References: <0000000000009d5cef05a22baa95@google.com> <20200331202706.GA127606@gmail.com> <20200401162028.GA201933@gmail.com> In-Reply-To: <20200401162028.GA201933@gmail.com> From: Marco Elver Date: Tue, 14 Apr 2020 19:49:57 +0200 Message-ID: Subject: Re: KCSAN: data-race in glue_cbc_decrypt_req_128bit / glue_cbc_decrypt_req_128bit To: Eric Biggers Cc: Dmitry Vyukov , syzbot , Borislav Petkov , David Miller , Herbert Xu , "H. Peter Anvin" , "open list:HARDWARE RANDOM NUMBER GENERATOR CORE" , LKML , Ingo Molnar , syzkaller-bugs , Thomas Gleixner , "the arch/x86 maintainers" , kasan-dev , "Paul E. McKenney" Content-Type: text/plain; charset="UTF-8" Sender: linux-crypto-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-crypto@vger.kernel.org On Wed, 1 Apr 2020 at 18:20, Eric Biggers wrote: > > On Wed, Apr 01, 2020 at 12:24:01PM +0200, Marco Elver wrote: > > On Wed, 1 Apr 2020 at 09:04, Dmitry Vyukov wrote: > > > > > > On Tue, Mar 31, 2020 at 10:27 PM Eric Biggers wrote: > > > > > > > > On Tue, Mar 31, 2020 at 12:35:13PM -0700, syzbot wrote: > > > > > Hello, > > > > > > > > > > syzbot found the following crash on: > > > > > > > > > > HEAD commit: b12d66a6 mm, kcsan: Instrument SLAB free with ASSERT_EXCLU.. > > > > > git tree: https://github.com/google/ktsan.git kcsan > > > > > console output: https://syzkaller.appspot.com/x/log.txt?x=111f0865e00000 > > > > > kernel config: https://syzkaller.appspot.com/x/.config?x=10bc0131c4924ba9 > > > > > dashboard link: https://syzkaller.appspot.com/bug?extid=6a6bca8169ffda8ce77b > > > > > compiler: gcc (GCC) 9.0.0 20181231 (experimental) > > > > > > > > > > Unfortunately, I don't have any reproducer for this crash yet. > > > > > > > > > > IMPORTANT: if you fix the bug, please add the following tag to the commit: > > > > > Reported-by: syzbot+6a6bca8169ffda8ce77b@syzkaller.appspotmail.com > > > > > > > > > > ================================================================== > > > > > BUG: KCSAN: data-race in glue_cbc_decrypt_req_128bit / glue_cbc_decrypt_req_128bit > > > > > > > > > > write to 0xffff88809966e128 of 8 bytes by task 24119 on cpu 0: > > > > > u128_xor include/crypto/b128ops.h:67 [inline] > > > > > glue_cbc_decrypt_req_128bit+0x396/0x460 arch/x86/crypto/glue_helper.c:144 > > > > > cbc_decrypt+0x26/0x40 arch/x86/crypto/serpent_avx2_glue.c:152 > > > > > crypto_skcipher_decrypt+0x65/0x90 crypto/skcipher.c:652 > > > > > _skcipher_recvmsg crypto/algif_skcipher.c:142 [inline] > > > > > skcipher_recvmsg+0x7fa/0x8c0 crypto/algif_skcipher.c:161 > > > > > skcipher_recvmsg_nokey+0x5e/0x80 crypto/algif_skcipher.c:279 > > > > > sock_recvmsg_nosec net/socket.c:886 [inline] > > > > > sock_recvmsg net/socket.c:904 [inline] > > > > > sock_recvmsg+0x92/0xb0 net/socket.c:900 > > > > > ____sys_recvmsg+0x167/0x3a0 net/socket.c:2566 > > > > > ___sys_recvmsg+0xb2/0x100 net/socket.c:2608 > > > > > __sys_recvmsg+0x9d/0x160 net/socket.c:2642 > > > > > __do_sys_recvmsg net/socket.c:2652 [inline] > > > > > __se_sys_recvmsg net/socket.c:2649 [inline] > > > > > __x64_sys_recvmsg+0x51/0x70 net/socket.c:2649 > > > > > do_syscall_64+0xcc/0x3a0 arch/x86/entry/common.c:294 > > > > > entry_SYSCALL_64_after_hwframe+0x44/0xa9 > > > > > > > > > > read to 0xffff88809966e128 of 8 bytes by task 24118 on cpu 1: > > > > > u128_xor include/crypto/b128ops.h:67 [inline] > > > > > glue_cbc_decrypt_req_128bit+0x37c/0x460 arch/x86/crypto/glue_helper.c:144 > > > > > cbc_decrypt+0x26/0x40 arch/x86/crypto/serpent_avx2_glue.c:152 > > > > > crypto_skcipher_decrypt+0x65/0x90 crypto/skcipher.c:652 > > > > > _skcipher_recvmsg crypto/algif_skcipher.c:142 [inline] > > > > > skcipher_recvmsg+0x7fa/0x8c0 crypto/algif_skcipher.c:161 > > > > > skcipher_recvmsg_nokey+0x5e/0x80 crypto/algif_skcipher.c:279 > > > > > sock_recvmsg_nosec net/socket.c:886 [inline] > > > > > sock_recvmsg net/socket.c:904 [inline] > > > > > sock_recvmsg+0x92/0xb0 net/socket.c:900 > > > > > ____sys_recvmsg+0x167/0x3a0 net/socket.c:2566 > > > > > ___sys_recvmsg+0xb2/0x100 net/socket.c:2608 > > > > > __sys_recvmsg+0x9d/0x160 net/socket.c:2642 > > > > > __do_sys_recvmsg net/socket.c:2652 [inline] > > > > > __se_sys_recvmsg net/socket.c:2649 [inline] > > > > > __x64_sys_recvmsg+0x51/0x70 net/socket.c:2649 > > > > > do_syscall_64+0xcc/0x3a0 arch/x86/entry/common.c:294 > > > > > entry_SYSCALL_64_after_hwframe+0x44/0xa9 > > > > > > > > > > Reported by Kernel Concurrency Sanitizer on: > > > > > CPU: 1 PID: 24118 Comm: syz-executor.1 Not tainted 5.6.0-rc1-syzkaller #0 > > > > > Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011 > > > > > ================================================================== > > > > > > > > > > > > > I think this is a problem for almost all the crypto code. Due to AF_ALG, both > > > > the source and destination buffers can be userspace pages that were gotten with > > > > get_user_pages(). Such pages can be concurrently modified, not just by the > > > > kernel but also by userspace. > > > > > > > > I'm not sure what can be done about this. > > > > > > Oh, I thought it's something more serious like a shared crypto object. > > > Thanks for debugging. [...] > > > > > > Marco, I think we need to ignore all memory that comes from > > > get_user_pages() somehow. Either not set watchpoints at all, or > > > perhaps filter them out later if the check is not totally free. > > > > Makes sense. We already have similar checks, and they're in the > > slow-path, so it shouldn't be a problem. Let me investigate. > > I'm wondering whether you really should move so soon to ignoring these races? > They are still races; the crypto code is doing standard unannotated reads/writes > of memory that can be concurrently modified. > [...] Wanted to follow up on this, just to clarify: The issue here essentially boils down to a user-space race involving an API that isn't designed to be thread-safe with the provided arguments (pointer to same user-space memory). The data race here merely manifests in kernel code, but otherwise the kernel is unaffected (if it were affected, a real fix would be needed). I.e. if we observe this data race, KCSAN is helpfully pointing out that user space has a bug. There are some options to deal with cases like this: 1. Do nothing, and just let KCSAN report the data race. 2. Somehow make KCSAN distinguish in-kernel data races that are due to user space misusing the API. KCSAN can still show the race, but clearly denote the nature of it by e.g. saying "KCSAN: user data-race in ..." (instead of "KCSAN: data-race in ..."). This will require one of 2 things: a. Distinguish the access by memory range. This doesn't seem great, because I don't know if we can apply a general rule like "all races involving this memory are user-space's fault". What if we have data races in the memory range that aren't user-space's fault? b. Mark the accesses somehow, either by providing a region in which all races are deemed user-space's fault. This is likely more problematic than (a), because saying something like "all races in this section of code are user-space's fault" may also hide real issues. Because none of (2.a) or (2.b) seem great, at present I would opt for (1). Anything better we can do here? Thanks, -- Marco