From: Yu-cheng Yu
To: x86@kernel.org, "H. Peter Anvin", Thomas Gleixner, Ingo Molnar,
	linux-kernel@vger.kernel.org, linux-doc@vger.kernel.org,
	linux-mm@kvack.org, linux-arch@vger.kernel.org,
	linux-api@vger.kernel.org, Arnd Bergmann, Andy Lutomirski,
	Balbir Singh, Borislav Petkov, Cyrill Gorcunov, Dave Hansen,
	Eugene Syromiatnikov, Florian Weimer, "H.J. Lu", Jann Horn,
	Jonathan Corbet, Kees Cook, Mike Kravetz, Nadav Amit,
	Oleg Nesterov, Pavel Machek, Peter Zijlstra, Randy Dunlap,
	"Ravi V. Shankar", Vedvyas Shanbhogue, Dave Martin, Weijiang Yang,
	Pengfei Xu, Haitao Huang
Cc: Yu-cheng Yu, "Kirill A. Shutemov"
Subject: [PATCH v26 30/30] mm: Introduce PROT_SHADOW_STACK for shadow stack
Date: Tue, 27 Apr 2021 13:43:15 -0700
Message-Id: <20210427204315.24153-31-yu-cheng.yu@intel.com>
In-Reply-To: <20210427204315.24153-1-yu-cheng.yu@intel.com>
References: <20210427204315.24153-1-yu-cheng.yu@intel.com>
X-Mailing-List: linux-kernel@vger.kernel.org

There are three possible options to create a shadow stack allocation API:
an arch_prctl(), a new syscall, or adding PROT_SHADOW_STACK to mmap() and
mprotect().  Each has its advantages and compromises.

An arch_prctl() is the least intrusive.  However, the existing x86
arch_prctl() takes only two parameters.  Multiple parameters must be
passed in a memory buffer.  There is a proposal to pass more parameters in
registers [1], but no active discussion on that.

A new syscall minimizes compatibility issues and offers an extensible
framework to other architectures, but this will likely result in some
overlap with mmap()/mprotect().

The introduction of PROT_SHADOW_STACK to mmap()/mprotect() takes advantage
of existing APIs.  The x86-specific PROT_SHADOW_STACK is translated to
VM_SHADOW_STACK and a shadow stack mapping is created without reinventing
the wheel.  There are potential pitfalls though.  The most obvious one
would be using this as a bypass to shadow stack protection.  However, the
attacker would have to get to the syscall first.

[1] https://lore.kernel.org/lkml/20200828121624.108243-1-hjl.tools@gmail.com/

Signed-off-by: Yu-cheng Yu
Cc: Kees Cook
Cc: Kirill A. Shutemov
---
v26:
- Change PROT_SHSTK to PROT_SHADOW_STACK.
- Remove (vm_flags & VM_SHARED) check, since it is covered by
  !vma_is_anonymous().

v24:
- Update arch_calc_vm_prot_bits(), leave PROT* checking to
  arch_validate_prot().
- Update arch_validate_prot(), leave vma flags checking to
  arch_validate_flags().
- Add arch_validate_flags().

 arch/x86/include/asm/mman.h      | 60 +++++++++++++++++++++++++++++++-
 arch/x86/include/uapi/asm/mman.h |  2 ++
 include/linux/mm.h               |  1 +
 3 files changed, 62 insertions(+), 1 deletion(-)

diff --git a/arch/x86/include/asm/mman.h b/arch/x86/include/asm/mman.h
index 629f6c81263a..fbb90f1b02c0 100644
--- a/arch/x86/include/asm/mman.h
+++ b/arch/x86/include/asm/mman.h
@@ -20,11 +20,69 @@
 		((vm_flags) & VM_PKEY_BIT2 ? _PAGE_PKEY_BIT2 : 0) |	\
 		((vm_flags) & VM_PKEY_BIT3 ? _PAGE_PKEY_BIT3 : 0))
 
-#define arch_calc_vm_prot_bits(prot, key) (		\
+#define pkey_vm_prot_bits(prot, key) (			\
 		((key) & 0x1 ? VM_PKEY_BIT0 : 0) |	\
 		((key) & 0x2 ? VM_PKEY_BIT1 : 0) |	\
 		((key) & 0x4 ? VM_PKEY_BIT2 : 0) |	\
 		((key) & 0x8 ? VM_PKEY_BIT3 : 0))
+#else
+#define pkey_vm_prot_bits(prot, key) (0)
 #endif
 
+static inline unsigned long arch_calc_vm_prot_bits(unsigned long prot,
+						   unsigned long pkey)
+{
+	unsigned long vm_prot_bits = pkey_vm_prot_bits(prot, pkey);
+
+	if (prot & PROT_SHADOW_STACK)
+		vm_prot_bits |= VM_SHADOW_STACK;
+
+	return vm_prot_bits;
+}
+
+#define arch_calc_vm_prot_bits(prot, pkey) arch_calc_vm_prot_bits(prot, pkey)
+
+#ifdef CONFIG_X86_SHADOW_STACK
+static inline bool arch_validate_prot(unsigned long prot, unsigned long addr)
+{
+	unsigned long valid = PROT_READ | PROT_WRITE | PROT_EXEC | PROT_SEM |
+			      PROT_SHADOW_STACK;
+
+	if (prot & ~valid)
+		return false;
+
+	if (prot & PROT_SHADOW_STACK) {
+		if (!current->thread.cet.shstk_size)
+			return false;
+
+		/*
+		 * A shadow stack mapping is indirectly writable by only
+		 * the CALL and WRUSS instructions, but not by other write
+		 * instructions.  PROT_SHADOW_STACK and PROT_WRITE are
+		 * mutually exclusive.
+		 */
+		if (prot & PROT_WRITE)
+			return false;
+	}
+
+	return true;
+}
+
+#define arch_validate_prot arch_validate_prot
+
+static inline bool arch_validate_flags(struct vm_area_struct *vma, unsigned long vm_flags)
+{
+	/*
+	 * Shadow stack must be anonymous and not shared.
+	 */
+	if ((vm_flags & VM_SHADOW_STACK) && !vma_is_anonymous(vma))
+		return false;
+
+	return true;
+}
+
+#define arch_validate_flags(vma, vm_flags) arch_validate_flags(vma, vm_flags)
+
+#endif /* CONFIG_X86_SHADOW_STACK */
+
 #endif /* _ASM_X86_MMAN_H */
diff --git a/arch/x86/include/uapi/asm/mman.h b/arch/x86/include/uapi/asm/mman.h
index f28fa4acaeaf..4c36b263cf0a 100644
--- a/arch/x86/include/uapi/asm/mman.h
+++ b/arch/x86/include/uapi/asm/mman.h
@@ -4,6 +4,8 @@
 
 #define MAP_32BIT	0x40		/* only give out 32bit addresses */
 
+#define PROT_SHADOW_STACK	0x10	/* shadow stack pages */
+
 #include
 
 #endif /* _UAPI_ASM_X86_MMAN_H */
diff --git a/include/linux/mm.h b/include/linux/mm.h
index 1ccec5cc399b..9a7652eea207 100644
--- a/include/linux/mm.h
+++ b/include/linux/mm.h
@@ -342,6 +342,7 @@ extern unsigned int kobjsize(const void *objp);
 
 #if defined(CONFIG_X86)
 # define VM_PAT		VM_ARCH_1	/* PAT reserves whole VMA at once (x86) */
+# define VM_ARCH_CLEAR	VM_SHADOW_STACK
 #elif defined(CONFIG_PPC)
 # define VM_SAO		VM_ARCH_1	/* Strong Access Ordering (powerpc) */
 #elif defined(CONFIG_PARISC)
-- 
2.21.0
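
For readers who want to experiment with the protection-bit rules outside
the kernel, the following is a minimal userspace model of the three hooks
in this patch.  It is a sketch, not kernel code.  The model_* names, the
M_* constants, the contiguous pkey bit layout and the position chosen for
M_VM_SHADOW_STACK are illustrative assumptions.  The
current->thread.cet.shstk_size test and vma_is_anonymous() are replaced by
plain bool parameters, and protection keys are assumed to be enabled.

```c
/* Userspace model of the checks added by this patch; not kernel code. */
#include <stdbool.h>
#include <stdint.h>

/* PROT_* values as used on Linux, plus the new bit from this patch. */
#define M_PROT_READ		UINT64_C(0x1)
#define M_PROT_WRITE		UINT64_C(0x2)
#define M_PROT_EXEC		UINT64_C(0x4)
#define M_PROT_SEM		UINT64_C(0x8)
#define M_PROT_SHADOW_STACK	UINT64_C(0x10)

/* Placeholder vm_flags layout: the bit positions are assumptions. */
#define M_VM_PKEY_SHIFT		32
#define M_VM_SHADOW_STACK	(UINT64_C(1) << 37)

/* Models arch_calc_vm_prot_bits(): translate prot and pkey to vm_flags. */
static inline uint64_t model_calc_vm_prot_bits(uint64_t prot, uint64_t pkey)
{
	/* Stand-in for pkey_vm_prot_bits(): four key bits, four flag bits. */
	uint64_t vm_prot_bits = (pkey & 0xf) << M_VM_PKEY_SHIFT;

	if (prot & M_PROT_SHADOW_STACK)
		vm_prot_bits |= M_VM_SHADOW_STACK;

	return vm_prot_bits;
}

/*
 * Models arch_validate_prot().  shstk_enabled stands in for a non-zero
 * current->thread.cet.shstk_size.
 */
static inline bool model_validate_prot(uint64_t prot, bool shstk_enabled)
{
	const uint64_t valid = M_PROT_READ | M_PROT_WRITE | M_PROT_EXEC |
			       M_PROT_SEM | M_PROT_SHADOW_STACK;

	if (prot & ~valid)
		return false;

	if (prot & M_PROT_SHADOW_STACK) {
		if (!shstk_enabled)
			return false;

		/* PROT_SHADOW_STACK and PROT_WRITE are mutually exclusive. */
		if (prot & M_PROT_WRITE)
			return false;
	}

	return true;
}

/* Models arch_validate_flags(): shadow stack VMAs must be anonymous. */
static inline bool model_validate_flags(uint64_t vm_flags, bool vma_is_anon)
{
	if ((vm_flags & M_VM_SHADOW_STACK) && !vma_is_anon)
		return false;

	return true;
}
```

In this model, a request such as M_PROT_READ | M_PROT_SHADOW_STACK passes
validation only when shstk_enabled is true, and adding M_PROT_WRITE makes it
fail.  A file-backed mapping (vma_is_anon == false) that carries
M_VM_SHADOW_STACK is rejected by model_validate_flags().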