Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp9226308imu; Wed, 5 Dec 2018 00:53:02 -0800 (PST) X-Google-Smtp-Source: AFSGD/Weib6rVoLMaGcroT6QDyrls3x8rtPjpcPklTBDmvgIv11ZhdAHrpxjrCuSWxqafv/lepko X-Received: by 2002:a17:902:33c1:: with SMTP id b59mr23088694plc.220.1543999982318; Wed, 05 Dec 2018 00:53:02 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1543999982; cv=none; d=google.com; s=arc-20160816; b=A5ZZ+KLwyNu55RGf5Nz9V4U0sfXDdfuuwZOX3Np/maFtfE9Mdj38XoRYtUXTCW13Nr v5yUdn7AOlCAXWN6iKwqAR3H6GFHOcLRviiehVt7R8MQOCikHagpo1+dTFS1deok3HfB IxBIGN6X6XKY3ska+rGbRpb3znbfslr11K46/+U9T2y7Yyi/i6xHtDkjsFVf4hr855st 4Kqsoltl2N3GqNxbHeH9Aw2Q1fatr0rSwZDc5zXrqjkfP7f38g70SXWhy6fhcF6L3kxY 6x3UlLc3Yy4lBFFMoTLJ5O/48uazNXAtZDiJwfpZWJdS0CNaBf9jviDHxUU1XyMV3I/e yotw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from; bh=3aVXnKaWYjDlGDG7H/EuOoimtpx+hdhDUTheo3rb678=; b=duUrpQVCSEKfgNoX1oFZ0oetb8qIkDoxdI7EheelpF9Y6wC5KABW8vVKOgmN9Jmc70 wdNRQ/z14GKyhQWWxlJq5VdSYKqUcTQ9FJPYVR5tVMb7jIAQm1KFOt+eZS8qAVQbu1E9 D29HnFKhg8tyqXwqilJ8OSL7AHTaJ53mGMAqH6RVWM0WTlYEL5LFMrypFOZll+61mJhL zTvNLUOWh7ajsnOHmwCN5jbkuB6MKyjCVRwi68z4AZv7xiTzsLxo2zCKJdEIY/UmHBAi 9rAdlpHK9FiYT58WDNqLfqoAATz4px6/L5tEKOEEYGKkRXEGjZwynoNOzN++j7ZoIFSe ZDNA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=vmware.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id y8si2349243pfn.26.2018.12.05.00.52.44; Wed, 05 Dec 2018 00:53:02 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=vmware.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727330AbeLEIwH (ORCPT + 99 others); Wed, 5 Dec 2018 03:52:07 -0500 Received: from ex13-edg-ou-001.vmware.com ([208.91.0.189]:33316 "EHLO EX13-EDG-OU-001.vmware.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726351AbeLEIwG (ORCPT ); Wed, 5 Dec 2018 03:52:06 -0500 Received: from sc9-mailhost3.vmware.com (10.113.161.73) by EX13-EDG-OU-001.vmware.com (10.113.208.155) with Microsoft SMTP Server id 15.0.1156.6; Wed, 5 Dec 2018 00:52:02 -0800 Received: from sc2-haas01-esx0118.eng.vmware.com (sc2-haas01-esx0118.eng.vmware.com [10.172.44.118]) by sc9-mailhost3.vmware.com (Postfix) with ESMTP id D7E9C41396; Wed, 5 Dec 2018 00:52:04 -0800 (PST) From: Nadav Amit To: Ingo Molnar CC: , , "H. Peter Anvin" , Thomas Gleixner , Borislav Petkov , Andy Lutomirski , Nadav Amit , Dave Hansen , Peter Zijlstra , , , , Kees Cook , Dave Hansen , Nadav Amit Subject: [PATCH v7 03/14] x86/mm: temporary mm struct Date: Tue, 4 Dec 2018 17:33:57 -0800 Message-ID: <20181205013408.47725-4-namit@vmware.com> X-Mailer: git-send-email 2.17.1 In-Reply-To: <20181205013408.47725-1-namit@vmware.com> References: <20181205013408.47725-1-namit@vmware.com> MIME-Version: 1.0 Content-Type: text/plain Received-SPF: None (EX13-EDG-OU-001.vmware.com: namit@vmware.com does not designate permitted sender hosts) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Andy Lutomirski Sometimes we want to set a temporary page-table entries (PTEs) in one of the cores, without allowing other cores to use - even speculatively - these mappings. There are two benefits for doing so: (1) Security: if sensitive PTEs are set, temporary mm prevents their use in other cores. This hardens the security as it prevents exploding a dangling pointer to overwrite sensitive data using the sensitive PTE. (2) Avoiding TLB shootdowns: the PTEs do not need to be flushed in remote page-tables. To do so a temporary mm_struct can be used. Mappings which are private for this mm can be set in the userspace part of the address-space. During the whole time in which the temporary mm is loaded, interrupts must be disabled. The first use-case for temporary PTEs, which will follow, is for poking the kernel text. [ Commit message was written by Nadav ] Cc: Kees Cook Cc: Peter Zijlstra Cc: Dave Hansen Reviewed-by: Masami Hiramatsu Tested-by: Masami Hiramatsu Signed-off-by: Andy Lutomirski Signed-off-by: Nadav Amit --- arch/x86/include/asm/mmu_context.h | 32 ++++++++++++++++++++++++++++++ 1 file changed, 32 insertions(+) diff --git a/arch/x86/include/asm/mmu_context.h b/arch/x86/include/asm/mmu_context.h index 0ca50611e8ce..0141b7fa6d01 100644 --- a/arch/x86/include/asm/mmu_context.h +++ b/arch/x86/include/asm/mmu_context.h @@ -338,4 +338,36 @@ static inline unsigned long __get_current_cr3_fast(void) return cr3; } +typedef struct { + struct mm_struct *prev; +} temporary_mm_state_t; + +/* + * Using a temporary mm allows to set temporary mappings that are not accessible + * by other cores. Such mappings are needed to perform sensitive memory writes + * that override the kernel memory protections (e.g., W^X), without exposing the + * temporary page-table mappings that are required for these write operations to + * other cores. + * + * Context: The temporary mm needs to be used exclusively by a single core. To + * harden security IRQs must be disabled while the temporary mm is + * loaded, thereby preventing interrupt handler bugs from override the + * kernel memory protection. + */ +static inline temporary_mm_state_t use_temporary_mm(struct mm_struct *mm) +{ + temporary_mm_state_t state; + + lockdep_assert_irqs_disabled(); + state.prev = this_cpu_read(cpu_tlbstate.loaded_mm); + switch_mm_irqs_off(NULL, mm, current); + return state; +} + +static inline void unuse_temporary_mm(temporary_mm_state_t prev) +{ + lockdep_assert_irqs_disabled(); + switch_mm_irqs_off(NULL, prev.prev, current); +} + #endif /* _ASM_X86_MMU_CONTEXT_H */ -- 2.17.1