Date: Mon, 5 Sep 2022 14:24:26 +0200
In-Reply-To: <20220905122452.2258262-1-glider@google.com>
Mime-Version: 1.0
References: <20220905122452.2258262-1-glider@google.com>
X-Mailer: git-send-email 2.37.2.789.g6183377224-goog
Message-ID: <20220905122452.2258262-19-glider@google.com>
Subject: [PATCH v6 18/44] instrumented.h: add KMSAN support
From: Alexander Potapenko
To: glider@google.com
Cc: Alexander Viro, Alexei Starovoitov, Andrew Morton, Andrey Konovalov,
	Andy Lutomirski, Arnd Bergmann, Borislav Petkov, Christoph Hellwig,
	Christoph Lameter, David Rientjes, Dmitry Vyukov, Eric Dumazet,
	Greg Kroah-Hartman, Herbert Xu, Ilya Leoshkevich, Ingo Molnar,
	Jens Axboe, Joonsoo Kim, Kees Cook, Marco Elver, Mark Rutland,
	Matthew Wilcox, "Michael S. Tsirkin", Pekka Enberg, Peter Zijlstra,
	Petr Mladek, Steven Rostedt, Thomas Gleixner, Vasily Gorbik,
	Vegard Nossum, Vlastimil Babka, kasan-dev@googlegroups.com,
	linux-mm@kvack.org, linux-arch@vger.kernel.org,
	linux-kernel@vger.kernel.org
Content-Type: text/plain; charset="UTF-8"
X-Mailing-List: linux-kernel@vger.kernel.org

To avoid false positives, KMSAN needs to unpoison the data copied from
the userspace. To detect infoleaks - check the memory buffer passed to
copy_to_user().

Signed-off-by: Alexander Potapenko
Reviewed-by: Marco Elver

---
v2:
 -- move implementation of kmsan_copy_to_user() here

v5:
 -- simplify kmsan_copy_to_user()
 -- provide instrument_get_user() and instrument_put_user()

v6:
 -- rebase after changing "x86: asm: instrument usercopy in get_user()
    and put_user()"

Link: https://linux-review.googlesource.com/id/I43e93b9c02709e6be8d222342f1b044ac8bdbaaf
---
 include/linux/instrumented.h | 18 ++++++++++++-----
 include/linux/kmsan-checks.h | 19 ++++++++++++++++++
 mm/kmsan/hooks.c             | 38 ++++++++++++++++++++++++++++++++++++
 3 files changed, 70 insertions(+), 5 deletions(-)

diff --git a/include/linux/instrumented.h b/include/linux/instrumented.h
index 9f1dba8f717b0..501fa84867494 100644
--- a/include/linux/instrumented.h
+++ b/include/linux/instrumented.h
@@ -2,7 +2,7 @@
 
 /*
  * This header provides generic wrappers for memory access instrumentation that
- * the compiler cannot emit for: KASAN, KCSAN.
+ * the compiler cannot emit for: KASAN, KCSAN, KMSAN.
  */
 #ifndef _LINUX_INSTRUMENTED_H
 #define _LINUX_INSTRUMENTED_H
@@ -10,6 +10,7 @@
 #include <linux/compiler.h>
 #include <linux/kasan-checks.h>
 #include <linux/kcsan-checks.h>
+#include <linux/kmsan-checks.h>
 #include <linux/types.h>
 
 /**
@@ -117,6 +118,7 @@ instrument_copy_to_user(void __user *to, const void *from, unsigned long n)
 {
 	kasan_check_read(from, n);
 	kcsan_check_read(from, n);
+	kmsan_copy_to_user(to, from, n, 0);
 }
 
 /**
@@ -151,6 +153,7 @@ static __always_inline void
 instrument_copy_from_user_after(const void *to, const void __user *from,
 				unsigned long n, unsigned long left)
 {
+	kmsan_unpoison_memory(to, n - left);
 }
 
 /**
@@ -162,10 +165,14 @@ instrument_copy_from_user_after(const void *to, const void __user *from,
  *
  * @to destination variable, may not be address-taken
  */
-#define instrument_get_user(to)                         \
-({                                                      \
+#define instrument_get_user(to)				\
+({							\
+	u64 __tmp = (u64)(to);				\
+	kmsan_unpoison_memory(&__tmp, sizeof(__tmp));	\
+	to = __tmp;					\
 })
 
+
 /**
  * instrument_put_user() - add instrumentation to put_user()-like macros
  *
@@ -177,8 +184,9 @@ instrument_copy_from_user_after(const void *to, const void __user *from,
  * @ptr userspace pointer to copy to
  * @size number of bytes to copy
  */
-#define instrument_put_user(from, ptr, size)                    \
-({                                                              \
+#define instrument_put_user(from, ptr, size)			\
+({								\
+	kmsan_copy_to_user(ptr, &from, sizeof(from), 0);	\
 })
 
 #endif /* _LINUX_INSTRUMENTED_H */
diff --git a/include/linux/kmsan-checks.h b/include/linux/kmsan-checks.h
index a6522a0c28df9..c4cae333deec5 100644
--- a/include/linux/kmsan-checks.h
+++ b/include/linux/kmsan-checks.h
@@ -46,6 +46,21 @@ void kmsan_unpoison_memory(const void *address, size_t size);
  */
 void kmsan_check_memory(const void *address, size_t size);
 
+/**
+ * kmsan_copy_to_user() - Notify KMSAN about a data transfer to userspace.
+ * @to: destination address in the userspace.
+ * @from: source address in the kernel.
+ * @to_copy: number of bytes to copy.
+ * @left: number of bytes not copied.
+ *
+ * If this is a real userspace data transfer, KMSAN checks the bytes that were
+ * actually copied to ensure there was no information leak. If @to belongs to
+ * the kernel space (which is possible for compat syscalls), KMSAN just copies
+ * the metadata.
+ */
+void kmsan_copy_to_user(void __user *to, const void *from, size_t to_copy,
+			size_t left);
+
 #else
 
 static inline void kmsan_poison_memory(const void *address, size_t size,
@@ -58,6 +73,10 @@ static inline void kmsan_unpoison_memory(const void *address, size_t size)
 static inline void kmsan_check_memory(const void *address, size_t size)
 {
 }
+static inline void kmsan_copy_to_user(void __user *to, const void *from,
+				      size_t to_copy, size_t left)
+{
+}
 
 #endif
 
diff --git a/mm/kmsan/hooks.c b/mm/kmsan/hooks.c
index 6f3e64b0b61f8..5c0eb25d984d7 100644
--- a/mm/kmsan/hooks.c
+++ b/mm/kmsan/hooks.c
@@ -205,6 +205,44 @@ void kmsan_iounmap_page_range(unsigned long start, unsigned long end)
 	kmsan_leave_runtime();
 }
 
+void kmsan_copy_to_user(void __user *to, const void *from, size_t to_copy,
+			size_t left)
+{
+	unsigned long ua_flags;
+
+	if (!kmsan_enabled || kmsan_in_runtime())
+		return;
+	/*
+	 * At this point we've copied the memory already. It's hard to check it
+	 * before copying, as the size of actually copied buffer is unknown.
+	 */
+
+	/* copy_to_user() may copy zero bytes. No need to check. */
+	if (!to_copy)
+		return;
+	/* Or maybe copy_to_user() failed to copy anything. */
+	if (to_copy <= left)
+		return;
+
+	ua_flags = user_access_save();
+	if ((u64)to < TASK_SIZE) {
+		/* This is a user memory access, check it. */
+		kmsan_internal_check_memory((void *)from, to_copy - left, to,
+					    REASON_COPY_TO_USER);
+	} else {
+		/* Otherwise this is a kernel memory access. This happens when a
+		 * compat syscall passes an argument allocated on the kernel
+		 * stack to a real syscall.
+		 * Don't check anything, just copy the shadow of the copied
+		 * bytes.
+		 */
+		kmsan_internal_memmove_metadata((void *)to, (void *)from,
+						to_copy - left);
+	}
+	user_access_restore(ua_flags);
+}
+EXPORT_SYMBOL(kmsan_copy_to_user);
+
 /* Functions from kmsan-checks.h follow. */
 void kmsan_poison_memory(const void *address, size_t size, gfp_t flags)
 {
-- 
2.37.2.789.g6183377224-goog
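
The to_copy/left accounting used by the hooks in this patch can be
illustrated with a small userspace toy model. This is only a sketch under
stated assumptions: struct toy_buf and the toy_*() helpers are hypothetical
names invented for illustration, and a one-byte-per-byte shadow array stands
in for KMSAN's real metadata, which it does not attempt to reproduce.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

#define TOY_SIZE 16

/* Toy model: one shadow byte per data byte; nonzero means "uninitialized". */
struct toy_buf {
	unsigned char data[TOY_SIZE];
	unsigned char shadow[TOY_SIZE];
};

/* Models a fresh kernel allocation whose contents are all uninitialized. */
static void toy_poison(struct toy_buf *b)
{
	memset(b->shadow, 0xff, sizeof(b->shadow));
}

/*
 * Models kmsan_unpoison_memory(to, n - left) as called after
 * copy_from_user(): only the bytes that actually arrived become initialized.
 */
static void toy_copy_from_user_after(struct toy_buf *to, size_t n, size_t left)
{
	if (left > n || n > TOY_SIZE)
		return;	/* toy-only guard against out-of-range arguments */
	memset(to->shadow, 0, n - left);
}

/*
 * Models the user-memory branch of kmsan_copy_to_user(): returns true if any
 * of the (to_copy - left) bytes that were actually copied is still poisoned,
 * i.e. an infoleak would be reported.
 */
static bool toy_copy_to_user_leaks(const struct toy_buf *from, size_t to_copy,
				   size_t left)
{
	size_t i;

	/* Nothing was copied, so there is nothing to check. */
	if (!to_copy || to_copy <= left || to_copy > TOY_SIZE)
		return false;
	for (i = 0; i < to_copy - left; i++)
		if (from->shadow[i])
			return true;
	return false;
}
```

In this model, a buffer that received 6 of 8 requested bytes from userspace
can be copied back safely only up to those 6 bytes; copying all 8 would be
flagged, unless the copy itself stops 2 bytes short.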