Received: by 2002:ac0:8845:0:0:0:0:0 with SMTP id g63csp931343img; Thu, 28 Feb 2019 10:06:40 -0800 (PST) X-Google-Smtp-Source: APXvYqxmpvMvvdM3d+NFhP62oQZz5e/yXzZ4cb9xznQbgTrhF06NOSQJVxXSFrw4s8rXqOMoMuKD X-Received: by 2002:a63:eb56:: with SMTP id b22mr396166pgk.287.1551377199948; Thu, 28 Feb 2019 10:06:39 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1551377199; cv=none; d=google.com; s=arc-20160816; b=IFC36Izh5WZ9dFAugSK1n3JKbQRIaQNx/mkl2URDX9sl4tdS+g+Q7zIFijlmRfP6ar u2bUZW8W0Wz0Mdc44rGSL1D1HLBAPoUdkXaRud+prDlyEdwzgcZFUGTdDR1gSh2BsGZH pvqfoXLa2QWRLHJe78GuSvlMLDaBKX57VagHy+AE6Q4p4K7KmsSo/RPD0r9ysNzq74xF Jg2nA3Tfwbu4fkWPyWm5VqaGJzoUuy6/jMDOsfg45QzT7uGjv0Ah+woopQge0Pzjc4c7 DxktmLyvle0aDlILTlsbMCro8ruexGGJwKP2UzLs9xrvTIC6sCZeydFkKE4RcRE6Vlaz vs+Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature; bh=OGoBH7lh5O4MSgKbKkrrDqzPz++VQGYatdpahoqm3E0=; b=oauqPX8VAlMOiy8IejfoLBTMCi+fCnDcBNSxLro78tVDKEE+bD8BvK7El1Bn99v40h D3lBRpZcSiGcTN/hRVrNMyPnpTqA945iRoQKi65naWFbgVzQaxmbTR/RzvYv7jz/UhXF JHqH+xZ146sc4iWjBiw0JMLmueNoqjYT/EZxrpLyBAsb9j8uOXoeyZArPVG1PzAgOOcs o/JLSEiABP4yzFo/LubbiGGLVBd60m4cvl+nRvs8UULLGa7MVOf74zYvKRrsO/VF21L3 YrFiF8660R6w5WDc9mB0rGm8f1y6jWU76bEDyP1P6ymrJuEtAQokyHAYG9sKAr/R7xAG Coxg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=IG7sVzLF; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r192si940153pgr.331.2019.02.28.10.06.23; Thu, 28 Feb 2019 10:06:39 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=IG7sVzLF; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1732505AbfB1QEU (ORCPT + 99 others); Thu, 28 Feb 2019 11:04:20 -0500 Received: from mail.kernel.org ([198.145.29.99]:50112 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1731142AbfB1QEU (ORCPT ); Thu, 28 Feb 2019 11:04:20 -0500 Received: from localhost.localdomain (NE2965lan1.rev.em-net.ne.jp [210.141.244.193]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id B0BD62184A; Thu, 28 Feb 2019 16:04:16 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1551369859; bh=Z5ga+B2KEl8AyZcpOnfmbw6TPyGKVbXH8zrvkH6poNw=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=IG7sVzLF2g242gZ8uhUgEfzEGNo8gwOofd0G/dG78hYU1J/tcWS/4bQX70H0FZaKX 7ci6XDPUrJIQCxdbnCVfbsVQjqeAPOSxkg8dd+w4JMNotAezpdZ88BfXOayI9MUZnO Y/3E4A6lNsxj8HiBYu+3kcyjvy9m2VFCOy534l3A= From: Masami Hiramatsu To: Steven Rostedt , Linus Torvalds , Shuah Khan Cc: mhiramat@kernel.org, linux-kernel@vger.kernel.org, Andy Lutomirski , Ingo Molnar , Andrew Morton , Changbin Du , Jann Horn , Kees Cook , Andy Lutomirski , Alexei Starovoitov , Nadav Amit , Peter Zijlstra , Joel Fernandes , yhs@fb.com Subject: [PATCH v5 3/6] uaccess: Add non-pagefault user-space read functions Date: Fri, 1 Mar 2019 01:03:54 +0900 Message-Id: <155136983467.2968.13980231890937828195.stgit@devbox> X-Mailer: git-send-email 2.13.6 In-Reply-To: <155136974478.2968.3105123100519786079.stgit@devbox> References: <155136974478.2968.3105123100519786079.stgit@devbox> User-Agent: StGit/0.17.1-dirty MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Add probe_user_read(), strncpy_from_unsafe_user() and strnlen_unsafe_user() which allows caller to access user-space in IRQ context. Current probe_kernel_read() and strncpy_from_unsafe() are not available for user-space memory, because it sets KERNEL_DS while accessing data. On some arch, user address space and kernel address space can be co-exist, but others can not. In that case, setting KERNEL_DS means given address is treated as a kernel address space. Also strnlen_user() is only available from user context since it can sleep if pagefault is enabled. To access user-space memory without pagefault, we need these new functions which sets USER_DS while accessing the data. Signed-off-by: Masami Hiramatsu --- Changes in v5: - Simplify probe_user_read() (Thanks, Peter!) - Add strnlen_unsafe_user() Changes in v3: - Use user_access_ok() for probe_user_read(). Changes in v2: - Simplify strncpy_from_unsafe_user() using strncpy_from_user() according to Linus's suggestion. - Simplify probe_user_read() not using intermediate function. --- include/linux/uaccess.h | 14 +++++ mm/maccess.c | 122 +++++++++++++++++++++++++++++++++++++++++++++-- 2 files changed, 130 insertions(+), 6 deletions(-) diff --git a/include/linux/uaccess.h b/include/linux/uaccess.h index 1afd9dfabe67..5be7f9adb418 100644 --- a/include/linux/uaccess.h +++ b/include/linux/uaccess.h @@ -258,6 +258,17 @@ extern long probe_kernel_read(void *dst, const void *src, size_t size); extern long __probe_kernel_read(void *dst, const void *src, size_t size); /* + * probe_user_read(): safely attempt to read from a location in user space + * @dst: pointer to the buffer that shall take the data + * @src: address to read from + * @size: size of the data chunk + * + * Safely read from address @src to the buffer at @dst. If a kernel fault + * happens, handle that and return -EFAULT. + */ +extern long probe_user_read(void *dst, const void __user *src, size_t size); + +/* * probe_kernel_write(): safely attempt to write to a location * @dst: address to write to * @src: pointer to the data that shall be written @@ -270,6 +281,9 @@ extern long notrace probe_kernel_write(void *dst, const void *src, size_t size); extern long notrace __probe_kernel_write(void *dst, const void *src, size_t size); extern long strncpy_from_unsafe(char *dst, const void *unsafe_addr, long count); +extern long strncpy_from_unsafe_user(char *dst, const void __user *unsafe_addr, + long count); +extern long strnlen_unsafe_user(const void __user *unsafe_addr, long count); /** * probe_kernel_address(): safely attempt to read from a location diff --git a/mm/maccess.c b/mm/maccess.c index ec00be51a24f..d1b2ec78d9ef 100644 --- a/mm/maccess.c +++ b/mm/maccess.c @@ -5,8 +5,20 @@ #include #include +static __always_inline long +probe_read_common(void *dst, const void __user *src, size_t size) +{ + long ret; + + pagefault_disable(); + ret = __copy_from_user_inatomic(dst, src, size); + pagefault_enable(); + + return ret ? -EFAULT : 0; +} + /** - * probe_kernel_read(): safely attempt to read from a location + * probe_kernel_read(): safely attempt to read from a kernel-space location * @dst: pointer to the buffer that shall take the data * @src: address to read from * @size: size of the data chunk @@ -29,17 +41,45 @@ long __probe_kernel_read(void *dst, const void *src, size_t size) mm_segment_t old_fs = get_fs(); set_fs(KERNEL_DS); - pagefault_disable(); - ret = __copy_from_user_inatomic(dst, - (__force const void __user *)src, size); - pagefault_enable(); + ret = probe_read_common(dst, (__force const void __user *)src, size); set_fs(old_fs); - return ret ? -EFAULT : 0; + return ret; } EXPORT_SYMBOL_GPL(probe_kernel_read); /** + * probe_user_read(): safely attempt to read from a user-space location + * @dst: pointer to the buffer that shall take the data + * @src: address to read from. This must be a user address. + * @size: size of the data chunk + * + * Safely read from user address @src to the buffer at @dst. If a kernel fault + * happens, handle that and return -EFAULT. + */ + +long __weak probe_user_read(void *dst, const void __user *src, size_t size) + __attribute__((alias("__probe_user_read"))); + +long __probe_user_read(void *dst, const void __user *src, size_t size) +{ + long ret = -EFAULT; + mm_segment_t old_fs = get_fs(); + + /* + * Since this can be called in IRQ context, we carefully set the + * USER_DS and use user_access_ok() which checks segment setting + * instead of task context. + */ + set_fs(USER_DS); + if (user_access_ok(src, size)) + ret = probe_read_common(dst, src, size); + set_fs(old_fs); + return ret; +} +EXPORT_SYMBOL_GPL(probe_user_read); + +/** * probe_kernel_write(): safely attempt to write to a location * @dst: address to write to * @src: pointer to the data that shall be written @@ -66,6 +106,7 @@ long __probe_kernel_write(void *dst, const void *src, size_t size) } EXPORT_SYMBOL_GPL(probe_kernel_write); + /** * strncpy_from_unsafe: - Copy a NUL terminated string from unsafe address. * @dst: Destination address, in kernel space. This buffer must be at @@ -105,3 +146,72 @@ long strncpy_from_unsafe(char *dst, const void *unsafe_addr, long count) return ret ? -EFAULT : src - unsafe_addr; } + +/** + * strncpy_from_unsafe_user: - Copy a NUL terminated string from unsafe user + * address. + * @dst: Destination address, in kernel space. This buffer must be at + * least @count bytes long. + * @unsafe_addr: Unsafe user address. + * @count: Maximum number of bytes to copy, including the trailing NUL. + * + * Copies a NUL-terminated string from unsafe user address to kernel buffer. + * + * On success, returns the length of the string INCLUDING the trailing NUL. + * + * If access fails, returns -EFAULT (some data may have been copied + * and the trailing NUL added). + * + * If @count is smaller than the length of the string, copies @count-1 bytes, + * sets the last byte of @dst buffer to NUL and returns @count. + */ +long strncpy_from_unsafe_user(char *dst, const void __user *unsafe_addr, + long count) +{ + mm_segment_t old_fs = get_fs(); + long ret; + + if (unlikely(count <= 0)) + return 0; + + set_fs(USER_DS); + pagefault_disable(); + ret = strncpy_from_user(dst, unsafe_addr, count); + pagefault_enable(); + set_fs(old_fs); + if (ret >= count) { + ret = count; + dst[ret - 1] = '\0'; + } else if (ret > 0) + ret++; + return ret; +} + +/** + * strnlen_unsafe_user: - Get the size of a user string INCLUDING final NUL. + * @unsafe_addr: The string to measure. + * @count: Maximum count (including NUL character) + * + * Get the size of a NUL-terminated string in user space without pagefault. + * + * Returns the size of the string INCLUDING the terminating NUL. + * + * If the string is too long, returns a number larger than @count. User + * has to check the return value against "> count". + * On exception (or invalid count), returns 0. + * + * Unlike strnlen_user, this can be used from IRQ handler etc. because + * it disables pagefaults. + */ +long strnlen_unsafe_user(const void __user *unsafe_addr, long count) +{ + mm_segment_t old_fs = get_fs(); + int ret; + + set_fs(USER_DS); + pagefault_disable(); + ret = strnlen_user(unsafe_addr, count); + pagefault_enable(); + set_fs(old_fs); + return ret; +}