Received: by 2002:a25:1506:0:0:0:0:0 with SMTP id 6csp5390048ybv; Tue, 11 Feb 2020 14:56:51 -0800 (PST) X-Google-Smtp-Source: APXvYqw+N3utOBOgrOir1pVGCKkR1ewIP84q93c9sN437dyXnsGqiwKYtDhgQl5flRVk/LxJoYRo X-Received: by 2002:a9d:63d6:: with SMTP id e22mr360492otl.185.1581461811627; Tue, 11 Feb 2020 14:56:51 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1581461811; cv=none; d=google.com; s=arc-20160816; b=B4PkIkGfCCODPHEeuQnYnrrxinxANn1LfNrcJx5L6ieHm8p+BHLJEXlrYyQguZR+DG gSeVXpaE1Q7JyoaDUr+KYNVVBX6XR4qTNVUXe8Ti1uahooR5MPKfZ5odu5nO3V2D3kCV IQPO8wrSlweAbox8fiy25BfXsIQHgkYAgR5vyo1aVXShExD+H6qVk5rm1k2bT1kA0+Xd /sYvcJ5+lRnkv9tD1LGdeur25TYfbGPJ/vZSz1A5LxdEYudMA6AN+OR2v4SEGyfL3A1r yvxLnSomoVFMclJvylUf5DxK2NWpEascJO3bTcYd9/L2ux0h7Ybi8yF7ivRFfN39ADl9 ER4g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:to:from:subject:references:mime-version :message-id:in-reply-to:date:dkim-signature; bh=zuajClpY4n8pjVJyU4VcQsX18SP0Y8Sji2fDd6chARE=; b=LUFJybkBlefZI9pL+lBk6TGFnuor7Fqpz5y69z2bxkWjw0OYstVC5o/MwYHpXniXV4 xDsgGHFsVmPl2nwpPKLyW/Mf/VWkxZWgPVApEZrPG/WZQfrw4USklHBg2TLOlHpRAbD1 PRC2kTKbF5eCajAYpnoW+dfRIj1+H84/GDR8ZMMGHF6snQPVY8r7slL26BA7wysmHH3W pNSFzgd2UkOxyyKispN4RRHBFus26oyj16AuGHh68H/TA+cykqXGLg36QG35ixYKwjIH h/ywoVOanYLxJAU3kmZQQiuYnEQytUAmHKDClF4DnRvSFdL4S5IpFgV6/y4NZzxRXNhT h71Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=WeWDMgIw; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r66si2358801oie.255.2020.02.11.14.56.39; Tue, 11 Feb 2020 14:56:51 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=WeWDMgIw; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727911AbgBKW4Z (ORCPT + 99 others); Tue, 11 Feb 2020 17:56:25 -0500 Received: from mail-pf1-f202.google.com ([209.85.210.202]:49292 "EHLO mail-pf1-f202.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727858AbgBKW4W (ORCPT ); Tue, 11 Feb 2020 17:56:22 -0500 Received: by mail-pf1-f202.google.com with SMTP id c9so138281pfo.16 for ; Tue, 11 Feb 2020 14:56:22 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=date:in-reply-to:message-id:mime-version:references:subject:from:to; bh=zuajClpY4n8pjVJyU4VcQsX18SP0Y8Sji2fDd6chARE=; b=WeWDMgIwDwbj7/AXWD1nXsQn6GX/nkkwQPg4D8SaZzdpux/IoFOYqW+cPEdoZXKA+z KeyUCv3fQ3BRHWHWuuiFxseX9APuVA3h8PGUbRZUh0l3He+njmrEg5cSm/IGa4tv+6Tm I50x1iyZueTqfd07v6Uq2TOxUr049ueKUeUSlpfmn1QnCWByi3jR0lnGyH60rSEo5Ri9 +r05f2myW4dtsdTS2k9fi5i5grbiS2UUw7IMdRJU2i1ST+/XyzYGRpXwXW2Y5w0D9TbP fsD2J9d+UsZnC4Cc7LQtwcEX9aAg8yO14OPMXom6LPmXosZmilC9gWbzfXSkRwxCCAPx 85QQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:in-reply-to:message-id:mime-version :references:subject:from:to; bh=zuajClpY4n8pjVJyU4VcQsX18SP0Y8Sji2fDd6chARE=; b=YxMUzliqRyvmpnaJbVoNc1NaziOkch+J3brtm+ydvPIxjNF0d3BcIK+0eFzn1KQCmV ggX7qn88jiBd2OsU8X4DIdWaE1oxUV4gNe7379eGsLilNbUYjzGEdTwW1OmJ0AWpRTsW ASVbJNGpysryT9SrtUwohjoBQIjM5SFZLyXiWaFdPW+y84Lduyy0TyH1Cs7ubgvF9bsP PGf9ikav7ycVDGxaOH4lgrI5Gwn8945T9kVbldFGhR4WZln2DIr8LWln2P8WsCGYn7SI JE1cWzzusYR/HM0Ouxi7DP6Cblw88g7zvmmiuQZvtixsN9l61Wnh97syek2g0AU7izoy hNDg== X-Gm-Message-State: APjAAAVVWIuJTopN1FiZQRQYObz9Z54Xowu58EW5lfgEgceaqrrQlOsd L1/EMmec0sgaUjy7n6Tg7Z3fPzq+2uM= X-Received: by 2002:a63:4525:: with SMTP id s37mr9086010pga.418.1581461781964; Tue, 11 Feb 2020 14:56:21 -0800 (PST) Date: Tue, 11 Feb 2020 14:55:46 -0800 In-Reply-To: <20200211225547.235083-1-dancol@google.com> Message-Id: <20200211225547.235083-6-dancol@google.com> Mime-Version: 1.0 References: <20200211225547.235083-1-dancol@google.com> X-Mailer: git-send-email 2.25.0.225.g125e21ebc7-goog Subject: [PATCH v2 5/6] Let userfaultfd opt out of handling kernel-mode faults From: Daniel Colascione To: dancol@google.com, timmurray@google.com, nosh@google.com, nnk@google.com, lokeshgidra@google.com, linux-kernel@vger.kernel.org, linux-api@vger.kernel.org, selinux@vger.kernel.org Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org userfaultfd handles page faults from both user and kernel code. Add a new UFFD_USER_MODE_ONLY flag for userfaultfd(2) that makes the resulting userfaultfd object refuse to handle faults from kernel mode, treating these faults as if SIGBUS were always raised, causing the kernel code to fail with EFAULT. A future patch adds a knob allowing administrators to give some processes the ability to create userfaultfd file objects only if they pass UFFD_USER_MODE_ONLY, reducing the likelihood that these processes will exploit userfaultfd's ability to delay kernel page faults to open timing windows for future exploits. Signed-off-by: Daniel Colascione --- fs/userfaultfd.c | 7 ++++++- include/uapi/linux/userfaultfd.h | 9 +++++++++ 2 files changed, 15 insertions(+), 1 deletion(-) diff --git a/fs/userfaultfd.c b/fs/userfaultfd.c index 11227b94a5a7..47825a804d63 100644 --- a/fs/userfaultfd.c +++ b/fs/userfaultfd.c @@ -389,6 +389,9 @@ vm_fault_t handle_userfault(struct vm_fault *vmf, unsigned long reason) if (ctx->features & UFFD_FEATURE_SIGBUS) goto out; + if ((vmf->flags & FAULT_FLAG_USER) == 0 && + ctx->flags & UFFD_USER_MODE_ONLY) + goto out; /* * If it's already released don't get it. This avoids to loop @@ -1945,6 +1948,7 @@ static void init_once_userfaultfd_ctx(void *mem) SYSCALL_DEFINE1(userfaultfd, int, flags) { + static const int uffd_flags = UFFD_USER_MODE_ONLY; struct userfaultfd_ctx *ctx; int fd; @@ -1954,10 +1958,11 @@ SYSCALL_DEFINE1(userfaultfd, int, flags) BUG_ON(!current->mm); /* Check the UFFD_* constants for consistency. */ + BUILD_BUG_ON(uffd_flags & UFFD_SHARED_FCNTL_FLAGS); BUILD_BUG_ON(UFFD_CLOEXEC != O_CLOEXEC); BUILD_BUG_ON(UFFD_NONBLOCK != O_NONBLOCK); - if (flags & ~UFFD_SHARED_FCNTL_FLAGS) + if (flags & ~(UFFD_SHARED_FCNTL_FLAGS | uffd_flags)) return -EINVAL; ctx = kmem_cache_alloc(userfaultfd_ctx_cachep, GFP_KERNEL); diff --git a/include/uapi/linux/userfaultfd.h b/include/uapi/linux/userfaultfd.h index 48f1a7c2f1f0..fe3565eae617 100644 --- a/include/uapi/linux/userfaultfd.h +++ b/include/uapi/linux/userfaultfd.h @@ -231,4 +231,13 @@ struct uffdio_zeropage { __s64 zeropage; }; +/* + * Flags for the userfaultfd(2) system call itself. + */ + +/* + * Create a userfaultfd that can handle page faults only in user mode. + */ +#define UFFD_USER_MODE_ONLY 1 + #endif /* _LINUX_USERFAULTFD_H */ -- 2.25.0.225.g125e21ebc7-goog