Received: by 2002:a05:6a10:206:0:0:0:0 with SMTP id 6csp3445301pxj; Tue, 11 May 2021 04:50:08 -0700 (PDT) X-Google-Smtp-Source: ABdhPJx2SxBsxFwqb4ADadJuJ3YuAQe8cwjefPifSfYVnpWcslQWPSCmb1oqJTSaHAdUHO4JWQpS X-Received: by 2002:a05:6638:13cc:: with SMTP id i12mr26582007jaj.20.1620733808050; Tue, 11 May 2021 04:50:08 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1620733808; cv=none; d=google.com; s=arc-20160816; b=niGaJYSlD1sW1pMugV/kYKvS+BbuxMgJK7yylGv/BxRofQFcH+Suicnmtwd+q0NMqT ibn6LrWy+QEOhVmfvfliZ56oVS7BKlKaAygLy/72PrizsLZQvB745qyVHntHDutI1sbN XhIXb+zYHbcDDQMJAyF5FutK7ZMKzzc3ypeJnBj0BC1h/QomSBzmtgaTSXvLIOtfwDA/ xnI3S8h5LR67IrTgjmNKDA2lKNesTZgrm1Jw6ebPaZpWCK3aHmKbmMMSQVn/QP8uV1yl cEnxV/gq1VEpGh7IaPgsXuFHY+PMoMNJJQQNWA0TyzKyH3tLeX17zcAhXQTr2ImSHfE5 TXXw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=OxdoaKPOQuUdKzkVcLxQZyyXBdgyHiRS3tUGaZINUts=; b=NLo69Bf7yifwpbMfJwsyK7nMY5WO+3FF36kxDMguDAdokIm6dkeUaQSMiInWFrk4nL 0/nuDj4YcGU0cgWXab+W5Gz/u1ZusqeruzfaMPDZg7pXO95s0BVfEeQPSf6yH9AyDEIe E9Dz011WhvRbNXGuVn1H7EC6vZqOmvmyHQkdXHs0wSpclr3f7J2DZrcDv0E0AHxRL5Ho HFpCN8wtmDtPg3YnM476F6BbApDfqsKSO8z+yf4yMpM4S2mY0Vwwx9rhIPdYr+x04cuQ LEDCxWKYzdYBsbAp3cnncLeHEKY4PMJ0fXIW6Ce6SF3z9ieDqyfUByEEWHM7gQdL2UeM d0cQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@shutemov-name.20150623.gappssmtp.com header.s=20150623 header.b=pu5+cwzE; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id l1si20790429iln.7.2021.05.11.04.49.54; Tue, 11 May 2021 04:50:08 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@shutemov-name.20150623.gappssmtp.com header.s=20150623 header.b=pu5+cwzE; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231533AbhEKLuB (ORCPT + 99 others); Tue, 11 May 2021 07:50:01 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:40496 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231518AbhEKLuA (ORCPT ); Tue, 11 May 2021 07:50:00 -0400 Received: from mail-lf1-x130.google.com (mail-lf1-x130.google.com [IPv6:2a00:1450:4864:20::130]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id A7C55C061574 for ; Tue, 11 May 2021 04:48:53 -0700 (PDT) Received: by mail-lf1-x130.google.com with SMTP id j10so28150974lfb.12 for ; Tue, 11 May 2021 04:48:53 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=shutemov-name.20150623.gappssmtp.com; s=20150623; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to; bh=OxdoaKPOQuUdKzkVcLxQZyyXBdgyHiRS3tUGaZINUts=; b=pu5+cwzEKuzc6EKXsqJ7EaWSkxB7rCn6gNQ30VMCpZCLAPn9ir6RnzaGipRTyxVm+K pSnBAdAG4NNgwEkwdQTxjhsKwtxFqJ99SznjFTuhezxvdIHgaI6BBv3p+8p0XsHszG+V RQaHASxm9Et0cxJy0qUKzFBFViubxluBaNVVg45aQybB/7wIbjaw1IMMvZVohB06z+O9 3s8uL1JLKzVYvB1RZkPBju2l4gzFHKeK2tL7LkbfaYQ8lrlvI566AMHvwhvFVbRBso9Z tDguC2zPMdW7HGa4bDpl5xh98LNl9ydjJwxQSDPvWhWqmX6vFAPsI2/5z5P21MbGsP3t 13iA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=OxdoaKPOQuUdKzkVcLxQZyyXBdgyHiRS3tUGaZINUts=; b=XAwn/BUHzD4T/+zgiXLLKW2wg1olk7BoCXOgyPZWq89hfdysnqi9AmzdoQ4Y277beQ lvyC+Wtu/3aFQhTkY4CzMkN7Y/6qj2giaOfS1CfuS5YGVGdQCE6+5kccMkXJklopIFb0 0fOSpras+lqKdC3/0AP8htHU4ob29FEDzor7u7/Tc1NcAPtClQdBp9PZoQ/GHKN3ghTZ 1RYncUXRlaOxFZPeAt4jjIvke4ixdZxi+oA+RUeEbl5mnTtWGXFALbfhet0g733iLhaW ASamkLKs+B+BdSuGdE/0uhNQ1zEydJWWQcICvYFbkyhtJAtp5ismOXAAALB5FtW0kXfi PIIQ== X-Gm-Message-State: AOAM533URk/suo0R8GvG6tAZScl++/Int4ER3loOD5x39Ve64WWHlP6R 6yNG9SpHsDNOvjqi066vMS0W6w== X-Received: by 2002:a05:6512:b8e:: with SMTP id b14mr20299532lfv.404.1620733732151; Tue, 11 May 2021 04:48:52 -0700 (PDT) Received: from box.localdomain ([86.57.175.117]) by smtp.gmail.com with ESMTPSA id x19sm2604242lfa.22.2021.05.11.04.48.51 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 11 May 2021 04:48:51 -0700 (PDT) Received: by box.localdomain (Postfix, from userid 1000) id 40BD0102615; Tue, 11 May 2021 14:48:52 +0300 (+03) Date: Tue, 11 May 2021 14:48:52 +0300 From: "Kirill A. Shutemov" To: Yu-cheng Yu Cc: x86@kernel.org, "H. Peter Anvin" , Thomas Gleixner , Ingo Molnar , linux-kernel@vger.kernel.org, linux-doc@vger.kernel.org, linux-mm@kvack.org, linux-arch@vger.kernel.org, linux-api@vger.kernel.org, Arnd Bergmann , Andy Lutomirski , Balbir Singh , Borislav Petkov , Cyrill Gorcunov , Dave Hansen , Eugene Syromiatnikov , Florian Weimer , "H.J. Lu" , Jann Horn , Jonathan Corbet , Kees Cook , Mike Kravetz , Nadav Amit , Oleg Nesterov , Pavel Machek , Peter Zijlstra , Randy Dunlap , "Ravi V. Shankar" , Vedvyas Shanbhogue , Dave Martin , Weijiang Yang , Pengfei Xu , Haitao Huang , "Kirill A . Shutemov" Subject: Re: [PATCH v26 30/30] mm: Introduce PROT_SHADOW_STACK for shadow stack Message-ID: <20210511114852.5wm6a5z72xjlqc4c@box> References: <20210427204315.24153-1-yu-cheng.yu@intel.com> <20210427204315.24153-31-yu-cheng.yu@intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20210427204315.24153-31-yu-cheng.yu@intel.com> Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Apr 27, 2021 at 01:43:15PM -0700, Yu-cheng Yu wrote: > There are three possible options to create a shadow stack allocation API: > an arch_prctl, a new syscall, or adding PROT_SHADOW_STACK to mmap() and > mprotect(). Each has its advantages and compromises. > > An arch_prctl() is the least intrusive. However, the existing x86 > arch_prctl() takes only two parameters. Multiple parameters must be > passed in a memory buffer. There is a proposal to pass more parameters in > registers [1], but no active discussion on that. > > A new syscall minimizes compatibility issues and offers an extensible frame > work to other architectures, but this will likely result in some overlap of > mmap()/mprotect(). > > The introduction of PROT_SHADOW_STACK to mmap()/mprotect() takes advantage > of existing APIs. The x86-specific PROT_SHADOW_STACK is translated to > VM_SHADOW_STACK and a shadow stack mapping is created without reinventing > the wheel. There are potential pitfalls though. The most obvious one > would be using this as a bypass to shadow stack protection. However, the > attacker would have to get to the syscall first. > > [1] https://lore.kernel.org/lkml/20200828121624.108243-1-hjl.tools@gmail.com/ > > Signed-off-by: Yu-cheng Yu > Cc: Kees Cook > Cc: Kirill A. Shutemov > --- > v26: > - Change PROT_SHSTK to PROT_SHADOW_STACK. > - Remove (vm_flags & VM_SHARED) check, since it is covered by > !vma_is_anonymous(). > > v24: > - Update arch_calc_vm_prot_bits(), leave PROT* checking to > arch_validate_prot(). > - Update arch_validate_prot(), leave vma flags checking to > arch_validate_flags(). > - Add arch_validate_flags(). > > arch/x86/include/asm/mman.h | 60 +++++++++++++++++++++++++++++++- > arch/x86/include/uapi/asm/mman.h | 2 ++ > include/linux/mm.h | 1 + > 3 files changed, 62 insertions(+), 1 deletion(-) > > diff --git a/arch/x86/include/asm/mman.h b/arch/x86/include/asm/mman.h > index 629f6c81263a..fbb90f1b02c0 100644 > --- a/arch/x86/include/asm/mman.h > +++ b/arch/x86/include/asm/mman.h > @@ -20,11 +20,69 @@ > ((vm_flags) & VM_PKEY_BIT2 ? _PAGE_PKEY_BIT2 : 0) | \ > ((vm_flags) & VM_PKEY_BIT3 ? _PAGE_PKEY_BIT3 : 0)) > > -#define arch_calc_vm_prot_bits(prot, key) ( \ > +#define pkey_vm_prot_bits(prot, key) ( \ > ((key) & 0x1 ? VM_PKEY_BIT0 : 0) | \ > ((key) & 0x2 ? VM_PKEY_BIT1 : 0) | \ > ((key) & 0x4 ? VM_PKEY_BIT2 : 0) | \ > ((key) & 0x8 ? VM_PKEY_BIT3 : 0)) > +#else > +#define pkey_vm_prot_bits(prot, key) (0) > #endif > > +static inline unsigned long arch_calc_vm_prot_bits(unsigned long prot, > + unsigned long pkey) > +{ > + unsigned long vm_prot_bits = pkey_vm_prot_bits(prot, pkey); > + > + if (prot & PROT_SHADOW_STACK) > + vm_prot_bits |= VM_SHADOW_STACK; > + > + return vm_prot_bits; > +} > + > +#define arch_calc_vm_prot_bits(prot, pkey) arch_calc_vm_prot_bits(prot, pkey) > + > +#ifdef CONFIG_X86_SHADOW_STACK > +static inline bool arch_validate_prot(unsigned long prot, unsigned long addr) > +{ > + unsigned long valid = PROT_READ | PROT_WRITE | PROT_EXEC | PROT_SEM | > + PROT_SHADOW_STACK; > + > + if (prot & ~valid) > + return false; > + > + if (prot & PROT_SHADOW_STACK) { > + if (!current->thread.cet.shstk_size) > + return false; > + > + /* > + * A shadow stack mapping is indirectly writable by only > + * the CALL and WRUSS instructions, but not other write > + * instructions). PROT_SHADOW_STACK and PROT_WRITE are > + * mutually exclusive. > + */ > + if (prot & PROT_WRITE) > + return false; > + } > + > + return true; > +} > + > +#define arch_validate_prot arch_validate_prot > + > +static inline bool arch_validate_flags(struct vm_area_struct *vma, unsigned long vm_flags) > +{ > + /* > + * Shadow stack must be anonymous and not shared. > + */ > + if ((vm_flags & VM_SHADOW_STACK) && !vma_is_anonymous(vma)) > + return false; > + > + return true; > +} > + > +#define arch_validate_flags(vma, vm_flags) arch_validate_flags(vma, vm_flags) > + > +#endif /* CONFIG_X86_SHADOW_STACK */ > + > #endif /* _ASM_X86_MMAN_H */ > diff --git a/arch/x86/include/uapi/asm/mman.h b/arch/x86/include/uapi/asm/mman.h > index f28fa4acaeaf..4c36b263cf0a 100644 > --- a/arch/x86/include/uapi/asm/mman.h > +++ b/arch/x86/include/uapi/asm/mman.h > @@ -4,6 +4,8 @@ > > #define MAP_32BIT 0x40 /* only give out 32bit addresses */ > > +#define PROT_SHADOW_STACK 0x10 /* shadow stack pages */ > + > #include > > #endif /* _UAPI_ASM_X86_MMAN_H */ > diff --git a/include/linux/mm.h b/include/linux/mm.h > index 1ccec5cc399b..9a7652eea207 100644 > --- a/include/linux/mm.h > +++ b/include/linux/mm.h > @@ -342,6 +342,7 @@ extern unsigned int kobjsize(const void *objp); > > #if defined(CONFIG_X86) > # define VM_PAT VM_ARCH_1 /* PAT reserves whole VMA at once (x86) */ > +# define VM_ARCH_CLEAR VM_SHADOW_STACK Nit: you can put VM_SHADOW_STACK directly into VM_FLAGS_CLEAR. It's already conditinal on the feature enabled and VM_NONE otherwise. Up to you. Reviewed-by: Kirill A. Shutemov -- Kirill A. Shutemov