Received: by 2002:a05:6a10:6744:0:0:0:0 with SMTP id w4csp3476205pxu; Sun, 11 Oct 2020 11:03:28 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyEUXu6OWIFkG37tOMSmZ5U+QBpjJ1a28Je5oriZHoF7xhPTXKY8WI2rV6bq/4Xwr171QXK X-Received: by 2002:a05:6402:289:: with SMTP id l9mr10396765edv.294.1602439407839; Sun, 11 Oct 2020 11:03:27 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1602439407; cv=none; d=google.com; s=arc-20160816; b=FnLOVLF6fwPtTUjNO6QRkooJZm/lmwBqA5JcVr9wK2+EQCD7v1rov1CiJoU0l5FuQH vOWn047ZZPkC7AuIAaVYRwCrT/UBPFNnvoe9kq8bxOBh6zuEcuLA1S6pBpmOz5zEeguG sBnEgfFIyEs+uQAAXzz85uO16or91BKbpewLRE1XQS3d+uSOS7Eri+33dcdoIOEx83C9 xNXFNMbU/2v5sGMHB7e3Pn1wA2/pdWXHBKhNqGL6qIPKYTqrPQu26kOkED3vhuql25k6 goHBd0dD0mySUzMYW5owkDbQzYGOpMAHRPrKPk+ExOyCxZwvZ8Q4HSBYa3xbrp+jNthL 5ayw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:from:subject:references:mime-version :message-id:in-reply-to:date:sender:dkim-signature; bh=EOovoB63mTNgnDeBGShd0QgMbi9PshapUqeSaojca1c=; b=HrPLH95b/8j0xqbv6IdC7/eU3Ul16hEU8A7Q99yGI6PJdRpqtTpgihhy+SNmUAehz8 p5jPrjBUe3LlvZdeqJfzHUz+M6PsnDhqoQCwp638SiocdQr+FNRVhgLps+il3s5HBKyw 5nK2KiE1Qe6h+kiz5tuoWj1p1ahpJMMkXHQPTCFT1H4LZ70MrADe3evMCkN1WkO0TRFI b+WGy0gtfftjPJedggQG7t2hV7oPBIfhzFQfCNyFldLRYntTQqFYQ+fWLs98TyhQnAQ6 RameiXjUMrnJjHOm6j8C0X3UlpWrfFWyTrs0xYjvFfpMzkiBc1VzvF/UjwvDWa2YVH3Y YKwg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=LVXko3Hc; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id m22si10518362eds.570.2020.10.11.11.03.04; Sun, 11 Oct 2020 11:03:27 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=LVXko3Hc; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726898AbgJKGZE (ORCPT + 99 others); Sun, 11 Oct 2020 02:25:04 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:54030 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726719AbgJKGZD (ORCPT ); Sun, 11 Oct 2020 02:25:03 -0400 Received: from mail-qv1-xf4a.google.com (mail-qv1-xf4a.google.com [IPv6:2607:f8b0:4864:20::f4a]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id B3F77C0613CE for ; Sat, 10 Oct 2020 23:25:03 -0700 (PDT) Received: by mail-qv1-xf4a.google.com with SMTP id c3so8920461qvj.4 for ; Sat, 10 Oct 2020 23:25:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=sender:date:in-reply-to:message-id:mime-version:references:subject :from:to:cc; bh=EOovoB63mTNgnDeBGShd0QgMbi9PshapUqeSaojca1c=; b=LVXko3Hcrxqpkqjw4qp3l2lqDGjuJbCxZwlPQHlN3ABbKQE5MnJQJGYvrPrzp0+XQJ xe35ME2rFCaVv6FCK8RBf96ywpGvNTtsbOgMFmcsifDgIDsJYxLffmrAJLgPe04ITLhP 7q7V3YPlWvaRT8l/h21YGG8ykaEMDOJrVf7fa4Ogu7ZBflNc0n3VGKRQUato1UY9JDCz 9awfqjwCrbO7YZWSEu12mT6XSBJNXvsdCK2YM2TRriD/nL1oskRfAYOVUiLoz1mXdluR R0BzQdOraoC1ywdVMUmzqmwmIKhxYEJMbfUIrEp1IjVUgpvuHVfJlp28ySs9B/AhhGcv uF7A== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:sender:date:in-reply-to:message-id:mime-version :references:subject:from:to:cc; bh=EOovoB63mTNgnDeBGShd0QgMbi9PshapUqeSaojca1c=; b=YS0oiYSRThEXoBKmeaKsySCdl2tblqDbVmjVQzV2X0QbtWVlKLI15wDsya3l9xlFV2 jDqHjIUYivWtAwVEkZGcQ7jtw0CCndPM9AweKiOhs2US63scavWd9kAcBn4b2h22fzxR kuGmrx+Kq4h3f2RnXFB5MJ6fm814QkSyjEAugEUNCq/Fy4q1gs+zvxIC3zwmXoTCbiwH abnpRSjmfFCqIvLm4QW4nEM7X/9AP//fSKmingrHVopq1LibVRszO+c7YCt/yl9kRUON pFslbeCK741c8wjGwcuqPJc56TMEZe9I9piDDMDZjKG0jeFf4aUZQzEI+GWQLFM1dugT kZ3Q== X-Gm-Message-State: AOAM530GRD6VqMK8692EFEcarAdyfaw+9Qm4OuhGDG2Pf5CrLimbAS9j zDbv+dLkTTSoanzMbhGyybk493lsKlRaLhrL4A== Sender: "lokeshgidra via sendgmr" X-Received: from lg.mtv.corp.google.com ([2620:15c:211:202:f693:9fff:fef4:29dd]) (user=lokeshgidra job=sendgmr) by 2002:ad4:544a:: with SMTP id h10mr20428528qvt.35.1602397502814; Sat, 10 Oct 2020 23:25:02 -0700 (PDT) Date: Sat, 10 Oct 2020 23:24:55 -0700 In-Reply-To: <20201011062456.4065576-1-lokeshgidra@google.com> Message-Id: <20201011062456.4065576-2-lokeshgidra@google.com> Mime-Version: 1.0 References: <20201011062456.4065576-1-lokeshgidra@google.com> X-Mailer: git-send-email 2.28.0.1011.ga647a8990f-goog Subject: [PATCH v5 1/2] Add UFFD_USER_MODE_ONLY From: Lokesh Gidra To: Kees Cook , Jonathan Corbet , Peter Xu , Andrea Arcangeli , Sebastian Andrzej Siewior , Andrew Morton Cc: Alexander Viro , Stephen Smalley , Eric Biggers , Lokesh Gidra , Daniel Colascione , "Joel Fernandes (Google)" , linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, linux-doc@vger.kernel.org, kaleshsingh@google.com, calin@google.com, surenb@google.com, nnk@google.com, jeffv@google.com, kernel-team@android.com, Mike Rapoport , Shaohua Li , Jerome Glisse , Mauro Carvalho Chehab , Johannes Weiner , Mel Gorman , Nitin Gupta , Vlastimil Babka , Iurii Zaikin , Luis Chamberlain , Daniel Colascione Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org userfaultfd handles page faults from both user and kernel code. Add a new UFFD_USER_MODE_ONLY flag for userfaultfd(2) that makes the resulting userfaultfd object refuse to handle faults from kernel mode, treating these faults as if SIGBUS were always raised, causing the kernel code to fail with EFAULT. A future patch adds a knob allowing administrators to give some processes the ability to create userfaultfd file objects only if they pass UFFD_USER_MODE_ONLY, reducing the likelihood that these processes will exploit userfaultfd's ability to delay kernel page faults to open timing windows for future exploits. Signed-off-by: Daniel Colascione Signed-off-by: Lokesh Gidra --- fs/userfaultfd.c | 10 +++++++++- include/uapi/linux/userfaultfd.h | 9 +++++++++ 2 files changed, 18 insertions(+), 1 deletion(-) diff --git a/fs/userfaultfd.c b/fs/userfaultfd.c index 0e4a3837da52..bd229f06d4e9 100644 --- a/fs/userfaultfd.c +++ b/fs/userfaultfd.c @@ -405,6 +405,13 @@ vm_fault_t handle_userfault(struct vm_fault *vmf, unsigned long reason) if (ctx->features & UFFD_FEATURE_SIGBUS) goto out; + if ((vmf->flags & FAULT_FLAG_USER) == 0 && + ctx->flags & UFFD_USER_MODE_ONLY) { + printk_once(KERN_WARNING "uffd: Set unprivileged_userfaultfd " + "sysctl knob to 1 if kernel faults must be handled " + "without obtaining CAP_SYS_PTRACE capability\n"); + goto out; + } /* * If it's already released don't get it. This avoids to loop @@ -1975,10 +1982,11 @@ SYSCALL_DEFINE1(userfaultfd, int, flags) BUG_ON(!current->mm); /* Check the UFFD_* constants for consistency. */ + BUILD_BUG_ON(UFFD_USER_MODE_ONLY & UFFD_SHARED_FCNTL_FLAGS); BUILD_BUG_ON(UFFD_CLOEXEC != O_CLOEXEC); BUILD_BUG_ON(UFFD_NONBLOCK != O_NONBLOCK); - if (flags & ~UFFD_SHARED_FCNTL_FLAGS) + if (flags & ~(UFFD_SHARED_FCNTL_FLAGS | UFFD_USER_MODE_ONLY)) return -EINVAL; ctx = kmem_cache_alloc(userfaultfd_ctx_cachep, GFP_KERNEL); diff --git a/include/uapi/linux/userfaultfd.h b/include/uapi/linux/userfaultfd.h index e7e98bde221f..5f2d88212f7c 100644 --- a/include/uapi/linux/userfaultfd.h +++ b/include/uapi/linux/userfaultfd.h @@ -257,4 +257,13 @@ struct uffdio_writeprotect { __u64 mode; }; +/* + * Flags for the userfaultfd(2) system call itself. + */ + +/* + * Create a userfaultfd that can handle page faults only in user mode. + */ +#define UFFD_USER_MODE_ONLY 1 + #endif /* _LINUX_USERFAULTFD_H */ -- 2.28.0.1011.ga647a8990f-goog