Received: by 2002:a05:6a10:2726:0:0:0:0 with SMTP id ib38csp82555pxb; Tue, 5 Apr 2022 00:47:06 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyh1B5UYWqmToR4FpGQHb8J5PnxnV6R27lF+9fQDD3uv2R9TanLEkS58foJGCuasf+vdqNf X-Received: by 2002:a17:906:6a11:b0:6e4:976b:e94 with SMTP id qw17-20020a1709066a1100b006e4976b0e94mr2274491ejc.142.1649144826548; Tue, 05 Apr 2022 00:47:06 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1649144826; cv=none; d=google.com; s=arc-20160816; b=e99DY4YsYo75NBrT46OgSTjtsoPFge+OKXEg20EfEPYAVvQ2gw/PmqipRA2vcDq9BF hjBzaCatRjBSEeYW7GxfCmg4AbGGyDwFlj1octJI+mVSGABnOJ6U7/zMAqDYVtFe+5Mw LuhXuRyccCC+nFWG1IplZJwizwNrJo2BJdUYHPfJWXk/sxw63APW9gGSY6OZYP6tepZC PHRxFxUwRftAT13OF6IiZkZyw7JrHUXQCxhiRBZ4S2wqGOvEKVSv+wi3FL+35iTNfkZq 75cTntRlaDZcy5Jlqhbi8nOMdRW1RqwdfexzxvP2mzKvIEDYlhoBJJk0umtW4t9vq04m Aw5Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=WwUvwWQ7Wvoj94+6fvvLzImrTms05/oSq14VqjEv36Y=; b=ukj1BiuUwtLSRtik7665RzPQJj/o7096EJFBDXiJ5XsqKxYMN5fA30xcGsftSdYrbC rCJaUlClrjvijqTFnGvjAFejKReV6VSjr0SB2z1s3gm+thlhnJ69Tdr/4l7oPShlJgNi g3FYYNF08UMD6lvaXT7eaD3FGnEHf/H7JBm00H5AF2ncZpKGf9cGTlR+HkcNj3Z4Ov/8 VZXfd4V1y/fHiRdoznPP2rLv2ZiIIWuNiRKCfGawRnA4AuAOiIcSsgfRZ37Qf08Dl7Lp Y/O/sBilNSRI+iVqDo8Rz70yUQcLRQeyZz6pJutgJWxPmbwAvO0m4CsLgw3G+yMtLWqL ZAMA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=Tl4MIlTa; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id a3-20020a50e703000000b0041cd9ea80acsi3123195edn.507.2022.04.05.00.46.41; Tue, 05 Apr 2022 00:47:06 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=Tl4MIlTa; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229876AbiDEHHd (ORCPT + 99 others); Tue, 5 Apr 2022 03:07:33 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:53256 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229761AbiDEHH2 (ORCPT ); Tue, 5 Apr 2022 03:07:28 -0400 Received: from dfw.source.kernel.org (dfw.source.kernel.org [139.178.84.217]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id D606AB76; Tue, 5 Apr 2022 00:05:29 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id 57BF2615DE; Tue, 5 Apr 2022 07:05:29 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 3C088C340F3; Tue, 5 Apr 2022 07:05:28 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1649142328; bh=iDZXYN3nW83IxRmh0QuxTnMGV7/i5Ew/58Ad7WkP1bI=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=Tl4MIlTatuPxvY7n/S0s5DgJQb8ueHoVbiHJY83y5zEEW2jY7NYmCOT29bMTzxJNf omJpd/zeSpkOIc1vWYcO9auMaGBZQhYDrASrSq5oaw15GerPbrT046Gwtk7qBe3pU2 hPf1idOeIt03vTF3x3gr3VcyrAdSHbRc1EDSB3gESDQrJlxPPGXO7aniMcGRurNAEd mMRC5zVBfhCg9qoIlS655bmIbc7CsjC/l5tZCKiB+ZsmYeClqyJ5CI7rrAsBFkMCyW /nUpSNsQ1BE+DWoqdMHF9oKWiXXVmRR+vxnT8tIUtMMnPb3diSXl/N70dzxbJlu9LO 4BK2bkfN0lYpA== Date: Tue, 5 Apr 2022 10:06:39 +0300 From: Jarkko Sakkinen To: Reinette Chatre Cc: dave.hansen@linux.intel.com, tglx@linutronix.de, bp@alien8.de, luto@kernel.org, mingo@redhat.com, linux-sgx@vger.kernel.org, x86@kernel.org, seanjc@google.com, kai.huang@intel.com, cathy.zhang@intel.com, cedric.xing@intel.com, haitao.huang@intel.com, mark.shanahan@intel.com, hpa@zytor.com, linux-kernel@vger.kernel.org Subject: Re: [PATCH V3 17/30] x86/sgx: Support modifying SGX page type Message-ID: References: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-7.1 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_HI, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Apr 04, 2022 at 09:49:25AM -0700, Reinette Chatre wrote: > Every enclave contains one or more Thread Control Structures (TCS). The > TCS contains meta-data used by the hardware to save and restore thread > specific information when entering/exiting the enclave. With SGX1 an > enclave needs to be created with enough TCSs to support the largest > number of threads expecting to use the enclave and enough enclave pages > to meet all its anticipated memory demands. In SGX1 all pages remain in > the enclave until the enclave is unloaded. > > SGX2 introduces a new function, ENCLS[EMODT], that is used to change > the type of an enclave page from a regular (SGX_PAGE_TYPE_REG) enclave > page to a TCS (SGX_PAGE_TYPE_TCS) page or change the type from a > regular (SGX_PAGE_TYPE_REG) or TCS (SGX_PAGE_TYPE_TCS) > page to a trimmed (SGX_PAGE_TYPE_TRIM) page (setting it up for later > removal). > > With the existing support of dynamically adding regular enclave pages > to an initialized enclave and changing the page type to TCS it is > possible to dynamically increase the number of threads supported by an > enclave. > > Changing the enclave page type to SGX_PAGE_TYPE_TRIM is the first step > of dynamically removing pages from an initialized enclave. The complete > page removal flow is: > 1) Change the type of the pages to be removed to SGX_PAGE_TYPE_TRIM > using the SGX_IOC_ENCLAVE_MODIFY_TYPE ioctl() introduced here. > 2) Approve the page removal by running ENCLU[EACCEPT] from within > the enclave. > 3) Initiate actual page removal using the ioctl() introduced in the > following patch. > > Add ioctl() SGX_IOC_ENCLAVE_MODIFY_TYPE to support changing SGX > enclave page types within an initialized enclave. With > SGX_IOC_ENCLAVE_MODIFY_TYPE the user specifies a page range and the > enclave page type to be applied to all pages in the provided range. > The ioctl() itself can return an error code based on failures > encountered by the kernel. It is also possible for SGX specific > failures to be encountered. Add a result output parameter to > communicate the SGX return code. It is possible for the enclave page > type change request to fail on any page within the provided range. > Support partial success by returning the number of pages that were > successfully changed. > > After the page type is changed the page continues to be accessible > from the kernel perspective with page table entries and internal > state. The page may be moved to swap. Any access until ENCLU[EACCEPT] > will encounter a page fault with SGX flag set in error code. > > Signed-off-by: Reinette Chatre > --- > Changes since V2: > - Adjust ioctl number after removal of SGX_IOC_ENCLAVE_RELAX_PERMISSIONS. > - Remove attempt at runtime tracking of EPCM permissions > (sgx_encl_page->vm_run_prot_bits). (Jarkko) > - Change names to follow guidance of using detailed names (Jarkko): > struct sgx_enclave_modt -> struct sgx_enclave_modify_type > sgx_enclave_modt() -> sgx_enclave_modify_type() > sgx_ioc_enclave_modt() -> sgx_ioc_enclave_modify_type() > > Changes since V1: > - Remove the "Earlier changes ..." paragraph (Jarkko). > - Change "new ioctl" text to "Add SGX_IOC_ENCLAVE_MOD_TYPE" (Jarkko). > - Discussion about EPCM interaction and the EPCM MODIFIED bit is moved > to new patch that introduces the ENCLS[EMODT] wrapper while keeping > the higher level discussion on page accessibility in > this commit log (Jarkko). > - Rename SGX_IOC_PAGE_MODT ioctl() to SGX_IOC_ENCLAVE_MODIFY_TYPE > (Jarkko). > - Rename struct sgx_page_modt to struct sgx_enclave_modt in support > of ioctl() rename. > - Rename sgx_page_modt() to sgx_enclave_modt() and sgx_ioc_page_modt() > to sgx_ioc_enclave_modt() in support of ioctl() rename. > - Provide secinfo as parameter to ioctl() instead of just > page type (Jarkko). > - Update comments to refer to new ioctl() names. > - Use new SGX2 checking helper(). > - Use ETRACK flow utility. > - Move kernel-doc to function that provides documentation for > Documentation/x86/sgx.rst. > - Remove redundant comment. > - Use offset/length validation utility. > - Make explicit which members of struct sgx_enclave_modt are for > output (Dave). > > arch/x86/include/uapi/asm/sgx.h | 20 +++ > arch/x86/kernel/cpu/sgx/ioctl.c | 209 ++++++++++++++++++++++++++++++++ > 2 files changed, 229 insertions(+) > > diff --git a/arch/x86/include/uapi/asm/sgx.h b/arch/x86/include/uapi/asm/sgx.h > index a0a24e94fb27..529f4ab28410 100644 > --- a/arch/x86/include/uapi/asm/sgx.h > +++ b/arch/x86/include/uapi/asm/sgx.h > @@ -31,6 +31,8 @@ enum sgx_page_flags { > _IO(SGX_MAGIC, 0x04) > #define SGX_IOC_ENCLAVE_RESTRICT_PERMISSIONS \ > _IOWR(SGX_MAGIC, 0x05, struct sgx_enclave_restrict_permissions) > +#define SGX_IOC_ENCLAVE_MODIFY_TYPE \ > + _IOWR(SGX_MAGIC, 0x06, struct sgx_enclave_modify_type) > > /** > * struct sgx_enclave_create - parameter structure for the > @@ -97,6 +99,24 @@ struct sgx_enclave_restrict_permissions { > __u64 count; > }; > > +/** > + * struct sgx_enclave_modify_type - parameters for %SGX_IOC_ENCLAVE_MODIFY_TYPE > + * @offset: starting page offset (page aligned relative to enclave base > + * address defined in SECS) > + * @length: length of memory (multiple of the page size) > + * @secinfo: address for the SECINFO data containing the new type > + * for pages in range described by @offset and @length > + * @result: (output) SGX result code of ENCLS[EMODT] function > + * @count: (output) bytes successfully changed (multiple of page size) > + */ > +struct sgx_enclave_modify_type { > + __u64 offset; > + __u64 length; > + __u64 secinfo; > + __u64 result; > + __u64 count; > +}; > + > struct sgx_enclave_run; > > /** > diff --git a/arch/x86/kernel/cpu/sgx/ioctl.c b/arch/x86/kernel/cpu/sgx/ioctl.c > index 4d88bfd163e7..6f769e67ec2d 100644 > --- a/arch/x86/kernel/cpu/sgx/ioctl.c > +++ b/arch/x86/kernel/cpu/sgx/ioctl.c > @@ -898,6 +898,212 @@ static long sgx_ioc_enclave_restrict_permissions(struct sgx_encl *encl, > return ret; > } > > +/** > + * sgx_enclave_modify_type() - Modify type of SGX enclave pages > + * @encl: Enclave to which the pages belong. > + * @modt: Checked parameters from user about which pages need modifying. > + * @page_type: New page type. > + * > + * Return: > + * - 0: Success > + * - -errno: Otherwise > + */ > +static long sgx_enclave_modify_type(struct sgx_encl *encl, > + struct sgx_enclave_modify_type *modt, > + enum sgx_page_type page_type) > +{ > + unsigned long max_prot_restore; > + struct sgx_encl_page *entry; > + struct sgx_secinfo secinfo; > + unsigned long prot; > + unsigned long addr; > + unsigned long c; > + void *epc_virt; > + int ret; > + > + /* > + * The only new page types allowed by hardware are PT_TCS and PT_TRIM. > + */ > + if (page_type != SGX_PAGE_TYPE_TCS && page_type != SGX_PAGE_TYPE_TRIM) > + return -EINVAL; > + > + memset(&secinfo, 0, sizeof(secinfo)); > + > + secinfo.flags = page_type << 8; > + > + for (c = 0 ; c < modt->length; c += PAGE_SIZE) { > + addr = encl->base + modt->offset + c; > + > + mutex_lock(&encl->lock); > + > + entry = sgx_encl_load_page(encl, addr); > + if (IS_ERR(entry)) { > + ret = PTR_ERR(entry) == -EBUSY ? -EAGAIN : -EFAULT; > + goto out_unlock; > + } > + > + /* > + * Borrow the logic from the Intel SDM. Regular pages > + * (SGX_PAGE_TYPE_REG) can change type to SGX_PAGE_TYPE_TCS > + * or SGX_PAGE_TYPE_TRIM but TCS pages can only be trimmed. > + * CET pages not supported yet. > + */ > + if (!(entry->type == SGX_PAGE_TYPE_REG || > + (entry->type == SGX_PAGE_TYPE_TCS && > + page_type == SGX_PAGE_TYPE_TRIM))) { > + ret = -EINVAL; > + goto out_unlock; > + } > + > + max_prot_restore = entry->vm_max_prot_bits; > + > + /* > + * Once a regular page becomes a TCS page it cannot be > + * changed back. So the maximum allowed protection reflects > + * the TCS page that is always RW from kernel perspective but > + * will be inaccessible from within enclave. Before doing > + * so, do make sure that the new page type continues to > + * respect the originally vetted page permissions. > + */ > + if (entry->type == SGX_PAGE_TYPE_REG && > + page_type == SGX_PAGE_TYPE_TCS) { > + if (~entry->vm_max_prot_bits & (VM_READ | VM_WRITE)) { > + ret = -EPERM; > + goto out_unlock; > + } > + prot = PROT_READ | PROT_WRITE; > + entry->vm_max_prot_bits = calc_vm_prot_bits(prot, 0); > + > + /* > + * Prevent page from being reclaimed while mutex > + * is released. > + */ > + if (sgx_unmark_page_reclaimable(entry->epc_page)) { > + ret = -EAGAIN; > + goto out_entry_changed; > + } > + > + /* > + * Do not keep encl->lock because of dependency on > + * mmap_lock acquired in sgx_zap_enclave_ptes(). > + */ > + mutex_unlock(&encl->lock); > + > + sgx_zap_enclave_ptes(encl, addr); > + > + mutex_lock(&encl->lock); > + > + sgx_mark_page_reclaimable(entry->epc_page); > + } > + > + /* Change EPC type */ > + epc_virt = sgx_get_epc_virt_addr(entry->epc_page); > + ret = __emodt(&secinfo, epc_virt); > + if (encls_faulted(ret)) { > + /* > + * All possible faults should be avoidable: > + * parameters have been checked, will only change > + * valid page types, and no concurrent > + * SGX1/SGX2 ENCLS instructions since these are > + * protected with mutex. > + */ > + pr_err_once("EMODT encountered exception %d\n", > + ENCLS_TRAPNR(ret)); > + ret = -EFAULT; > + goto out_entry_changed; > + } > + if (encls_failed(ret)) { > + modt->result = ret; > + ret = -EFAULT; > + goto out_entry_changed; > + } > + > + ret = sgx_enclave_etrack(encl); > + if (ret) { > + ret = -EFAULT; > + goto out_unlock; > + } > + > + entry->type = page_type; > + > + mutex_unlock(&encl->lock); > + } > + > + ret = 0; > + goto out; > + > +out_entry_changed: > + entry->vm_max_prot_bits = max_prot_restore; > +out_unlock: > + mutex_unlock(&encl->lock); > +out: > + modt->count = c; > + > + return ret; > +} > + > +/** > + * sgx_ioc_enclave_modify_type() - handler for %SGX_IOC_ENCLAVE_MODIFY_TYPE > + * @encl: an enclave pointer > + * @arg: userspace pointer to a &struct sgx_enclave_modify_type instance > + * > + * Ability to change the enclave page type supports the following use cases: > + * > + * * It is possible to add TCS pages to an enclave by changing the type of > + * regular pages (%SGX_PAGE_TYPE_REG) to TCS (%SGX_PAGE_TYPE_TCS) pages. > + * With this support the number of threads supported by an initialized > + * enclave can be increased dynamically. > + * > + * * Regular or TCS pages can dynamically be removed from an initialized > + * enclave by changing the page type to %SGX_PAGE_TYPE_TRIM. Changing the > + * page type to %SGX_PAGE_TYPE_TRIM marks the page for removal with actual > + * removal done by handler of %SGX_IOC_ENCLAVE_REMOVE_PAGES ioctl() called > + * after ENCLU[EACCEPT] is run on %SGX_PAGE_TYPE_TRIM page from within the > + * enclave. > + * > + * Return: > + * - 0: Success > + * - -errno: Otherwise > + */ > +static long sgx_ioc_enclave_modify_type(struct sgx_encl *encl, void __user *arg) > +{ > + struct sgx_enclave_modify_type params; > + enum sgx_page_type page_type; > + struct sgx_secinfo secinfo; > + long ret; > + > + ret = sgx_ioc_sgx2_ready(encl); > + if (ret) > + return ret; > + > + if (copy_from_user(¶ms, arg, sizeof(params))) > + return -EFAULT; > + > + if (sgx_validate_offset_length(encl, params.offset, params.length)) > + return -EINVAL; > + > + if (copy_from_user(&secinfo, (void __user *)params.secinfo, > + sizeof(secinfo))) > + return -EFAULT; > + > + if (secinfo.flags & ~SGX_SECINFO_PAGE_TYPE_MASK) > + return -EINVAL; > + > + if (memchr_inv(secinfo.reserved, 0, sizeof(secinfo.reserved))) > + return -EINVAL; > + > + if (params.result || params.count) > + return -EINVAL; > + > + page_type = (secinfo.flags & SGX_SECINFO_PAGE_TYPE_MASK) >> 8; > + ret = sgx_enclave_modify_type(encl, ¶ms, page_type); > + > + if (copy_to_user(arg, ¶ms, sizeof(params))) > + return -EFAULT; > + > + return ret; > +} > + > long sgx_ioctl(struct file *filep, unsigned int cmd, unsigned long arg) > { > struct sgx_encl *encl = filep->private_data; > @@ -923,6 +1129,9 @@ long sgx_ioctl(struct file *filep, unsigned int cmd, unsigned long arg) > ret = sgx_ioc_enclave_restrict_permissions(encl, > (void __user *)arg); > break; > + case SGX_IOC_ENCLAVE_MODIFY_TYPE: > + ret = sgx_ioc_enclave_modify_type(encl, (void __user *)arg); > + break; > default: > ret = -ENOIOCTLCMD; > break; > -- > 2.25.1 > To be coherent with other names, this should be SGX_IOC_ENCLAVE_MODIFY_TYPES. BR, Jarkko