Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp5060571imm; Tue, 21 Aug 2018 05:45:20 -0700 (PDT) X-Google-Smtp-Source: AA+uWPwg9BLfHWMZ3OiZiMUM5vsFwM0HauEO4/W/nyw+PbkQYr1bRKTudyXPODHCE//nzMGjDGT3 X-Received: by 2002:a65:6398:: with SMTP id h24-v6mr47400866pgv.245.1534855520724; Tue, 21 Aug 2018 05:45:20 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1534855520; cv=none; d=google.com; s=arc-20160816; b=O9wQiRBAr8dhbLELFDtffs5R2IIT2/DcTHFFBUPU1Xd0xwfk1dkN4wLKjf0C/38bvL SXzAT38Rm079tDCsSrMrASVio+yk0SaOsRW+FkXCLhkOJLLVYlMJvyY0qczeiFC7+kab annLdR8EFcVYvQqTwgdxHJqcpZ4inT3eULCI/Ud3zIaBahm1vy+IfUbwOBPc3YsA/8lw rCgQvqMSmpzqOz4+r9Q2/Vv9f3x+g8fWgBJE8gIwGCqyZqv5F2eDFR5rByKaOexlMiDb T3TvY61CSGlqfVj5cPjkeMJVLuP/saomEShh7rWiGw9CPx7rnu9ELcj2bqt6/aTmdHlF iIfA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :references:in-reply-to:mime-version:dkim-signature :arc-authentication-results; bh=Jhi3RTp8eUnFr4q17mOUY0ZqDqUHb47YI3CjfMXx2c8=; b=CmVVFKzO/YgPZQvpembhiD0Z9JOX/nryb/RHW5uXX8JbHB0Rf9QqnJZ5wLinfnzAXQ 0W6JHhfRBsWfiWM3sJxArSYZhlOf70CZZx9inxzvqO+OOHs7ScCqVgVohJmau4+GuIdK RczshnaVgKWMUJd2o2ifq+i3AhyPBD3Wr8Ff7pG2CESLqeA0nIa8ITe/6sf0vP3Ae2ty wQ9qdWOJ8oTCGN7aDNUX9JIlq7hmwr9MQzVediD914ccRYGM3VtuyGV/bPiesn0RFi27 yGtvmN2xYsU+bZIl4Q6lgju5/3BSvfTYBbs/fe5DWgOigWJL6VE9QK47eXMePL0wrwet AzUA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=YS5gCHhp; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id j5-v6si12719026plk.406.2018.08.21.05.45.05; Tue, 21 Aug 2018 05:45:20 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=YS5gCHhp; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727376AbeHUQD5 (ORCPT + 99 others); Tue, 21 Aug 2018 12:03:57 -0400 Received: from mail-it0-f66.google.com ([209.85.214.66]:33251 "EHLO mail-it0-f66.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727306AbeHUQD5 (ORCPT ); Tue, 21 Aug 2018 12:03:57 -0400 Received: by mail-it0-f66.google.com with SMTP id d16-v6so10907628itj.0 for ; Tue, 21 Aug 2018 05:43:57 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc; bh=Jhi3RTp8eUnFr4q17mOUY0ZqDqUHb47YI3CjfMXx2c8=; b=YS5gCHhpFxto72Unr0XDP1W0GL+CGcHoPEfLK4MmT7DRZUEpJuaUmPkYcX2qk8iDWw BRdiYGHEIXQGM0PZzOrVDL/7M3U3pJ9fAanbrvQZqjNyGvUbRYbsKOjiTimm3PbsOhOo vY8a6kgS4tZAYLVL5wzPpU/W0puEbZfLVFB9k= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=Jhi3RTp8eUnFr4q17mOUY0ZqDqUHb47YI3CjfMXx2c8=; b=b//4l5SDcpU/+t6WW2rYuPMKouvHvP4QUPbYj55tnAZuC25q/unznQ1d9Z/pp8LrcB D7rZQ1xkcbS2o4b0CvIZJAjteDkVlI41KYf23ZBYp3mUJWJjw/7gVYY8RqWJxyq6xZnS CjA5hh2O3H1DQye2Xpg10QDQa4FtDcpr8djz9/4uMWVEtVcw9uD6NP4vh9cf7sPmMQOY 3qZWfU8yrWtGJ8McLwPjTdYobLqakpCdM0vqA4H2A0oVcBqC89Va/j/sywUhLEoEK8kA /9ucFG+5FU07xzx7gA1pEZzXuF5job0G2Kf+NNKhfAs+VOKYgx6zIrWOSc8xV8ILK1PL 62tA== X-Gm-Message-State: AOUpUlEVc7xAee/fA8cIv0fa2Z6206x6lDg3C9Zl98uCqStFW8waCJSi EM+m6GG9u4rEQrPV5DxXSotAc/hewFTR7yXcfllQgg== X-Received: by 2002:a24:9686:: with SMTP id z128-v6mr16049867itd.58.1534855436688; Tue, 21 Aug 2018 05:43:56 -0700 (PDT) MIME-Version: 1.0 Received: by 2002:a6b:ac05:0:0:0:0:0 with HTTP; Tue, 21 Aug 2018 05:43:56 -0700 (PDT) In-Reply-To: <20180820073119.GA14931@sol.localdomain> References: <00000000000072d64d05737b6b8c@google.com> <20180820073119.GA14931@sol.localdomain> From: Ard Biesheuvel Date: Tue, 21 Aug 2018 14:43:56 +0200 Message-ID: Subject: Re: KASAN: use-after-free Read in sha512_ctx_mgr_resubmit To: Eric Biggers Cc: Megha Dey , Tim Chen , "David S. Miller" , Herbert Xu , syzbot , "open list:HARDWARE RANDOM NUMBER GENERATOR CORE" , Linux Kernel Mailing List , syzkaller-bugs , "the arch/x86 maintainers" Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 20 August 2018 at 09:31, Eric Biggers wrote: > [+sha512-mb maintainers...] > > On Wed, Aug 15, 2018 at 09:00:04AM -0700, syzbot wrote: >> Hello, >> >> syzbot found the following crash on: >> >> HEAD commit: 7796916146b8 Merge branch 'x86-cpu-for-linus' of git://git.. >> git tree: upstream >> console output: https://syzkaller.appspot.com/x/log.txt?x=164b1922400000 >> kernel config: https://syzkaller.appspot.com/x/.config?x=265bef9882cce8d7 >> dashboard link: https://syzkaller.appspot.com/bug?extid=d5455bac3ba1ee9114e5 >> compiler: gcc (GCC) 8.0.1 20180413 (experimental) >> syzkaller repro:https://syzkaller.appspot.com/x/repro.syz?x=1013478c400000 >> C reproducer: https://syzkaller.appspot.com/x/repro.c?x=1349c8aa400000 >> >> IMPORTANT: if you fix the bug, please add the following tag to the commit: >> Reported-by: syzbot+d5455bac3ba1ee9114e5@syzkaller.appspotmail.com >> >> ================================================================== >> BUG: KASAN: use-after-free in sha512_ctx_mgr_resubmit.part.3+0x3b1/0x4a0 >> arch/x86/crypto/sha512-mb/sha512_mb.c:136 >> Read of size 4 at addr ffff8801b0b9e838 by task kworker/0:1/13 >> >> CPU: 0 PID: 13 Comm: kworker/0:1 Not tainted 4.18.0+ #187 >> Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS >> Google 01/01/2011 >> Workqueue: crypto mcryptd_flusher >> Call Trace: >> __dump_stack lib/dump_stack.c:77 [inline] >> dump_stack+0x1c9/0x2b4 lib/dump_stack.c:113 >> print_address_description+0x6c/0x20b mm/kasan/report.c:256 >> kasan_report_error mm/kasan/report.c:354 [inline] >> kasan_report.cold.7+0x242/0x2fe mm/kasan/report.c:412 >> __asan_report_load4_noabort+0x14/0x20 mm/kasan/report.c:432 >> sha512_ctx_mgr_resubmit.part.3+0x3b1/0x4a0 >> arch/x86/crypto/sha512-mb/sha512_mb.c:136 >> sha512_ctx_mgr_resubmit arch/x86/crypto/sha512-mb/sha512_mb.c:135 [inline] >> sha512_ctx_mgr_flush+0x5c/0xb0 arch/x86/crypto/sha512-mb/sha512_mb.c:367 >> sha512_mb_flusher+0x27b/0x610 arch/x86/crypto/sha512-mb/sha512_mb.c:939 >> mcryptd_flusher+0x342/0x4b0 crypto/mcryptd.c:208 >> process_one_work+0xc73/0x1ba0 kernel/workqueue.c:2153 >> worker_thread+0x189/0x13c0 kernel/workqueue.c:2296 >> kthread+0x35a/0x420 kernel/kthread.c:246 >> ret_from_fork+0x3a/0x50 arch/x86/entry/entry_64.S:413 >> >> Allocated by task 23902: >> save_stack+0x43/0xd0 mm/kasan/kasan.c:448 >> set_track mm/kasan/kasan.c:460 [inline] >> kasan_kmalloc+0xc4/0xe0 mm/kasan/kasan.c:553 >> __do_kmalloc mm/slab.c:3718 [inline] >> __kmalloc+0x14e/0x760 mm/slab.c:3727 >> kmalloc include/linux/slab.h:518 [inline] >> sock_kmalloc+0x156/0x1f0 net/core/sock.c:1996 >> hash_accept_parent_nokey+0x58/0x2e0 crypto/algif_hash.c:438 >> hash_accept_parent+0x5b/0x80 crypto/algif_hash.c:465 >> af_alg_accept+0x127/0x7d0 crypto/af_alg.c:296 >> alg_accept+0x46/0x60 crypto/af_alg.c:332 >> __sys_accept4+0x3b2/0x8a0 net/socket.c:1600 >> __do_sys_accept4 net/socket.c:1635 [inline] >> __se_sys_accept4 net/socket.c:1632 [inline] >> __x64_sys_accept4+0x97/0xf0 net/socket.c:1632 >> do_syscall_64+0x1b9/0x820 arch/x86/entry/common.c:290 >> entry_SYSCALL_64_after_hwframe+0x49/0xbe >> >> Freed by task 23902: >> save_stack+0x43/0xd0 mm/kasan/kasan.c:448 >> set_track mm/kasan/kasan.c:460 [inline] >> __kasan_slab_free+0x11a/0x170 mm/kasan/kasan.c:521 >> kasan_slab_free+0xe/0x10 mm/kasan/kasan.c:528 >> __cache_free mm/slab.c:3498 [inline] >> kfree+0xd9/0x260 mm/slab.c:3813 >> __sock_kfree_s net/core/sock.c:2017 [inline] >> sock_kfree_s+0x29/0x60 net/core/sock.c:2023 >> hash_sock_destruct+0x157/0x1c0 crypto/algif_hash.c:427 >> __sk_destruct+0x107/0xa60 net/core/sock.c:1573 >> sk_destruct+0x78/0x90 net/core/sock.c:1608 >> __sk_free+0xcf/0x300 net/core/sock.c:1619 >> sk_free+0x42/0x50 net/core/sock.c:1630 >> sock_put include/net/sock.h:1667 [inline] >> af_alg_release+0x6e/0x90 crypto/af_alg.c:126 >> __sock_release+0xd7/0x260 net/socket.c:600 >> sock_close+0x19/0x20 net/socket.c:1151 >> __fput+0x355/0x8b0 fs/file_table.c:209 >> ____fput+0x15/0x20 fs/file_table.c:243 >> task_work_run+0x1e8/0x2a0 kernel/task_work.c:113 >> tracehook_notify_resume include/linux/tracehook.h:192 [inline] >> exit_to_usermode_loop+0x313/0x370 arch/x86/entry/common.c:166 >> prepare_exit_to_usermode arch/x86/entry/common.c:197 [inline] >> syscall_return_slowpath arch/x86/entry/common.c:268 [inline] >> do_syscall_64+0x6be/0x820 arch/x86/entry/common.c:293 >> entry_SYSCALL_64_after_hwframe+0x49/0xbe >> >> The buggy address belongs to the object at ffff8801b0b9e340 >> which belongs to the cache kmalloc-2048 of size 2048 >> The buggy address is located 1272 bytes inside of >> 2048-byte region [ffff8801b0b9e340, ffff8801b0b9eb40) >> The buggy address belongs to the page: >> page:ffffea0006c2e780 count:1 mapcount:0 mapping:ffff8801dac00c40 index:0x0 >> compound_mapcount: 0 >> flags: 0x2fffc0000008100(slab|head) >> raw: 02fffc0000008100 ffffea0007543a88 ffffea000760b188 ffff8801dac00c40 >> raw: 0000000000000000 ffff8801b0b9e340 0000000100000003 0000000000000000 >> page dumped because: kasan: bad access detected >> >> Memory state around the buggy address: >> ffff8801b0b9e700: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb >> ffff8801b0b9e780: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb >> > ffff8801b0b9e800: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb >> ^ >> ffff8801b0b9e880: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb >> ffff8801b0b9e900: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb >> ================================================================== > > Apparently, the SHA multibuffer algorithms sometimes return success while the > hash request is still being asynchronously processed, which allows an in-use > request to be freed or reused. It also allows the wrong digest value to be > computed. [...] > > Personally, I'm wondering why the SHA multibuffer code is in the kernel at all > given all the severe issues it has that its authors/maintainers don't seem to be > working very hard to fix. The code is very difficult to understand due to the > weird 3-layer design with "mcryptd" and other issues, making debugging it very > time consuming; most of the code is duplicated in 3 places (sha1-mb, sha256-mb, > and sha512-mb), making maintenance even more difficult; and most importantly > there are severe bugs, including edge cases where it computes the wrong hash, as > shown not only by this bug but also the sha256_mb bug I recently ran into. It > seems the algorithms were never tested under load to cover these edge cases. > > That's *not* acceptable for crypto code. Security and correctness come first. > > Also as I've shown previously, in most cases the multibuffer SHA algorithms are > ~1000x slower than the regular ones due to the flush delay. So the performance > argument for them actually seems pretty tenuous... And, isn't AVX2 multibuffer > useless on new processors, which have SHA instructions? > > I'd also be very interested to hear an explanation for why systemwide sharing of > hash jobs doesn't enable side-channel attacks and isn't just the latest example > of prioritizing "performance" over security? > > We need to have higher standards for crypto and not accept buggy spaghetti code > just because it's slightly faster in some artificial microbenchmark. > > So unless major improvements are made, I personally think we'd be much better > off without the SHA multibuffer algorithms in the kernel. > I agree. The code is obviously broken in a way that would have been noticed if it were in wide use, and it is too complicated for mere mortals to fix or maintain. I suggest we simply remove it for now, and if anyone wants to reintroduce it, we can review the code *and* the justification for the approach from scratch (in which case we should consider factoring out the algo agnostics plumbing in a way that allows it to be reused by other architectures as well)