Received: by 2002:a05:6a10:9afc:0:0:0:0 with SMTP id t28csp1866324pxm; Fri, 4 Mar 2022 05:13:18 -0800 (PST) X-Google-Smtp-Source: ABdhPJw0U9M7/xO8HOsjbqI6OE2AbhFieQDQWsG6HTMJBugDa24qwOoOiZ5iCVO63lv/3z9CD3Mi X-Received: by 2002:a17:907:b590:b0:6c1:c061:d945 with SMTP id qx16-20020a170907b59000b006c1c061d945mr30369138ejc.768.1646399597769; Fri, 04 Mar 2022 05:13:17 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1646399597; cv=none; d=google.com; s=arc-20160816; b=Ixi70i0LixOCQjlwc65cJT5xEbhDCnwGry+vijFxCunvp2j5pd5at6FbqTP6Q8lOUX FSfRZ3cmp83rgPBPmZ9dTGD828dWpH6jB0Cy5XVmaF7LUUOfsg4zdcWM1pYWL3E4A1Ey 3H9aQ4V5sQtsQw23rPvh3bm1G84t8BakXqmIOEDyqyygaD+RW8wlDQX5y/aOukD59Wku +ycnEfUjr32wPYgnctVHd/eSTsylmmS/S+p7dlT7HViuueZxb6fQzePwgEnL98bhiQ9L 9NIodGGK5oVANMATCPI1RmtPJFP0USQLIryRmtcw5Fjx5VjpGD9w3/j244AOdFOMk5vp jcKg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=MXfZUBBnMRX9zkt8nbB7Boeo4xSSYpmKpVlhUeqwrS4=; b=QaK0V5fqhmj9Idpc5N+xQzr6sE4leYqZHqHGYNToPPjlWUNZbX4qi1AyEZJesZNmuk lVeGZSjBOA+/n/fwsPGzkR5TcfCOO5VnVod3sHHTN7vW5fNetfD3ZVf2VM333QFbJ3ye MfqKsitNTWFihnPXY3be4fWG1Fyf6JJ51nIRKzebKMbNLFfeyIaj3C1s1qt7nifwTXnN zWbzpg4iTkXEsBPvMAx4FtRzqJOWoTGLZhM/56DoYLxA9vTxpntKZA/I9k79iAk9QHCJ UZqEmgWFWVh0K6P8mUWgjzHF2enzwWRWVhmFuc2xS/Xr4+A13Apy53WfG9BtYp3r2HWl wp+Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=iQPMNgIO; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id sc16-20020a1709078a1000b006da7d677f1dsi3659898ejc.811.2022.03.04.05.12.54; Fri, 04 Mar 2022 05:13:17 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=iQPMNgIO; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233555AbiCDIb6 (ORCPT + 99 others); Fri, 4 Mar 2022 03:31:58 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:35938 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233522AbiCDIb4 (ORCPT ); Fri, 4 Mar 2022 03:31:56 -0500 Received: from dfw.source.kernel.org (dfw.source.kernel.org [IPv6:2604:1380:4641:c500::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 8939D19BE4D; Fri, 4 Mar 2022 00:31:04 -0800 (PST) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id 1F25061015; Fri, 4 Mar 2022 08:31:04 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id C8064C340E9; Fri, 4 Mar 2022 08:31:02 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1646382663; bh=fwzhEw5YqiuNQASpOM0rdBn+ZueUJTulpkPCJl4FRD4=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=iQPMNgIOVMbPo2z4bssQovKPfoWBWK9E4of+Mn9kMAf1shu8/6b9LBKxtJl0lsq0+ P8E8Il3euNLHVqYMVSy1NOWODd+HZVGCUxG6L9E/ZMPLHEyxXtH2zzEFszN0/hxt61 GaC0BEwmkwtmfw9vthxmHO8YJE2f8eWxSl3Zm5b4kZeXgAHAIJMynYz4GVVgk6wdVi lUTKic+is8MrxhP80ovlP5iU7dwNr/wE7l985cQd1rDPwav45W6ggETNR0r/EQ9PqG r6s93dHaDr+9UEO3sefQv2XZeP+y8uKyAS5nrt/kPps34IGQ7g834LVTsa0t+PplMB TtxXxJZ3W/r4g== Date: Fri, 4 Mar 2022 10:30:22 +0200 From: Jarkko Sakkinen To: Haitao Huang , Reinette Chatre Cc: Dave Hansen , "Dhanraj, Vijay" , "dave.hansen@linux.intel.com" , "tglx@linutronix.de" , "bp@alien8.de" , "Lutomirski, Andy" , "mingo@redhat.com" , "linux-sgx@vger.kernel.org" , "x86@kernel.org" , "Christopherson,, Sean" , "Huang, Kai" , "Zhang, Cathy" , "Xing, Cedric" , "Huang, Haitao" , "Shanahan, Mark" , "hpa@zytor.com" , "linux-kernel@vger.kernel.org" Subject: Re: [PATCH V2 16/32] x86/sgx: Support restricting of enclave page permissions Message-ID: References: <2d2d3471-78ce-9faa-daf6-138078f5ffaa@intel.com> <6f65287a-f69c-971e-be2c-929e327e7ff9@intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-7.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_HI,SPF_HELO_NONE, SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Mar 03, 2022 at 10:03:30PM -0600, Haitao Huang wrote: > > On Thu, 03 Mar 2022 17:18:33 -0600, Jarkko Sakkinen > wrote: > > > On Thu, Mar 03, 2022 at 10:08:14AM -0600, Haitao Huang wrote: > > > Hi all, > > > > > > On Wed, 02 Mar 2022 16:57:45 -0600, Reinette Chatre > > > wrote: > > > > > > > Hi Jarkko, > > > > > > > > On 3/1/2022 6:05 PM, Jarkko Sakkinen wrote: > > > > > On Tue, Mar 01, 2022 at 09:48:48AM -0800, Reinette Chatre wrote: > > > > > > Hi Jarkko, > > > > > > > > > > > > On 3/1/2022 5:42 AM, Jarkko Sakkinen wrote: > > > > > > > > With EACCEPTCOPY (kudos to Mark S. for reminding me of > > > > > > > > this version of > > > > > > > > EACCEPT @ chat.enarx.dev) it is possible to make R and RX > > > pages but > > > > > > > > obviously new RX pages are now out of the picture: > > > > > > > > > > > > > > > > > > > > > > > > /* > > > > > > > > * Adding a regular page that is architecturally allowed > > > to only > > > > > > > > * be created with RW permissions. > > > > > > > > * TBD: Interface with user space policy to support max > > > permissions > > > > > > > > * of RWX. > > > > > > > > */ > > > > > > > > prot = PROT_READ | PROT_WRITE; > > > > > > > > encl_page->vm_run_prot_bits = calc_vm_prot_bits(prot, 0); > > > > > > > > encl_page->vm_max_prot_bits = encl_page->vm_run_prot_bits; > > > > > > > > > > > > > > > > If that TBD is left out to the final version the page > > > > > > > > augmentation has a > > > > > > > > risk of a API bottleneck, and that risk can realize then > > > > > > > > also in the page > > > > > > > > permission ioctls. > > > > > > > > > > > > > > > > I.e. now any review comment is based on not fully known > > > > > > > > territory, we have > > > > > > > > one known unknown, and some unknown unknowns from > > > > > > > > unpredictable effect to > > > > > > > > future API changes. > > > > > > > > > > > > The plan to complete the "TBD" in the above snippet was to > > > > > > follow this work > > > > > > with user policy integration at this location. On a high level > > > > > > the plan was > > > > > > for this to look something like: > > > > > > > > > > > > > > > > > > /* > > > > > > * Adding a regular page that is architecturally allowed to only > > > > > > * be created with RW permissions. > > > > > > * Interface with user space policy to support max permissions > > > > > > * of RWX. > > > > > > */ > > > > > > prot = PROT_READ | PROT_WRITE; > > > > > > encl_page->vm_run_prot_bits = calc_vm_prot_bits(prot, 0); > > > > > > > > > > > > if (user space policy allows RWX on dynamically added > > > pages) > > > > > > encl_page->vm_max_prot_bits = calc_vm_prot_bits(PROT_READ | > > > > > > PROT_WRITE | PROT_EXEC, 0); > > > > > > else > > > > > > encl_page->vm_max_prot_bits = calc_vm_prot_bits(PROT_READ | > > > > > > PROT_WRITE, 0); > > > > > > > > > > > > The work that follows this series aimed to do the integration > > > with user > > > > > > space policy. > > > > > > > > > > What do you mean by "user space policy" anyway exactly? I'm > > > sorry but I > > > > > just don't fully understand this. > > > > > > > > My apologies - I just assumed that you would need no reminder > > > about this > > > > contentious > > > > part of SGX history. Essentially it means that, yes, the kernel could > > > > theoretically > > > > permit any kind of access to any file/page, but some accesses are > > > known > > > > to generally > > > > be a bad idea - like making memory executable as well as writable > > > - and > > > > thus there > > > > are additional checks based on what user space permits before the > > > kernel > > > > allows > > > > such accesses. > > > > > > > > For example, > > > > mm/mprotect.c:do_mprotect_pkey()->security_file_mprotect() > > > > > > > > User policy and SGX has seen significant discussion. Some notable > > > > threads: > > > > https://lore.kernel.org/linux-security-module/CALCETrXf8mSK45h7sTK5Wf+pXLVn=Bjsc_RLpgO-h-qdzBRo5Q@mail.gmail.com/ > > > > https://lore.kernel.org/linux-security-module/20190619222401.14942-1-sean.j.christopherson@intel.com/ > > > > > > > > > It's too big of a risk to accept this series without X taken care > > > > > of. Patch > > > > > series should neither have TODO nor TBD comments IMHO. I don't want > > > > > to ack > > > > > a series based on speculation what might happen in the future. > > > > > > > > ok > > > > > > > > > > > > > > > > I think the best way to move forward would be to do EAUG's > > > > > > > explicitly with > > > > > > > an ioctl that could also include secinfo for permissions. > > > Then you can > > > > > > > easily do the rest with EACCEPTCOPY inside the enclave. > > > > > > > > > > > > SGX_IOC_ENCLAVE_ADD_PAGES already exists and could possibly be > > > used for > > > > > > this purpose. It already includes SECINFO which may also be > > > useful if > > > > > > needing to later support EAUG of PT_SS* pages. > > > > > > > > > > You could also simply add SGX_IOC_ENCLAVE_AUGMENT_PAGES and call it > > > > > a day. > > > > > > > > I could, yes. > > > > > > > > > And if there is plan to extend SGX_IOC_ENCLAVE_ADD_PAGES what is > > > > > this weird > > > > > thing added to the #PF handler? Why is it added at all then? > > > > > > > > I was just speculating in my response, there is no plan to extend > > > > SGX_IOC_ENCLAVE_ADD_PAGES (that I am aware of). > > > > > > > > > > How this could work is user space calls SGX_IOC_ENCLAVE_ADD_PAGES > > > > > > after enclave initialization on any memory region within the > > > > > > enclave where > > > > > > pages are planned to be added dynamically. This ioctl() calls > > > > > > EAUG to add the > > > > > > new pages with RW permissions and their vm_max_prot_bits can be > > > > > > set to the > > > > > > permissions found in the included SECINFO. This will support > > > > > > later EACCEPTCOPY > > > > > > as well as SGX_IOC_ENCLAVE_RELAX_PERMISSIONS > > > > > > > > > > I don't like this type of re-use of the existing API. > > > > > > > > I could proceed with SGX_IOC_ENCLAVE_AUGMENT_PAGES if there is > > > consensus > > > > after > > > > considering the user policy question (above) and performance trade-off > > > > (more below). > > > > > > > > > > > > > > > The big question is whether communicating user policy after > > > > > > enclave initialization > > > > > > via the SECINFO within SGX_IOC_ENCLAVE_ADD_PAGES is acceptable > > > > > > to all? I would > > > > > > appreciate a confirmation on this direction considering the > > > > > > significant history > > > > > > behind this topic. > > > > > > > > > > I have no idea because I don't know what is user space policy. > > > > > > > > This discussion is about some enclave usages needing RWX permissions > > > > on dynamically added enclave pages. RWX permissions on dynamically > > > added > > > > pages is > > > > not something that should blindly be allowed for all SGX enclaves but > > > > instead the user > > > > needs to explicitly allow specific enclaves to have such ability. This > > > > is equivalent > > > > to (but not the same as) what exists in Linux today with LSM. As > > > seen in > > > > mm/mprotect.c:do_mprotect_pkey()->security_file_mprotect() Linux > > > is able > > > > to make > > > > files and memory be both writable and executable, but it would only do > > > > so for those > > > > files and memory that the LSM (which is how user policy is > > > communicated, > > > > like SELinux) > > > > indicates it is allowed, not blindly do so for all files and all > > > memory. > > > > > > > > > > > Putting EAUG to the #PF handler and implicitly call it just > > > > > > > too flakky and > > > > > > > hard to make deterministic for e.g. JIT compiler in our use > > > > > > > case (not to > > > > > > > mention that JIT is not possible at all because inability to > > > > > > > do RX pages). > > > > > > > > I understand how SGX_IOC_ENCLAVE_AUGMENT_PAGES can be more > > > deterministic > > > > but from > > > > what I understand it would have a performance impact since it would > > > > require all memory > > > > that may be needed by the enclave be pre-allocated from outside the > > > > enclave and not > > > > just dynamically allocated from within the enclave at the time it is > > > > needed. > > > > > > > > Would such a performance impact be acceptable? > > > > > > > > > > User space won't always have enough info to decide whether the pages > > > to be > > > EAUG'd immediately. In some cases (shared libraries, JVM for > > > example) lots > > > of code/data pages can be mapped but never actually touched. One > > > enclave/process does not know if any other more important > > > enclave/process > > > would need the EPC. > > > > > > It should be for kernel to make the final decision as it has overall > > > picture > > > of the system EPC usage and availability. > > > > EAUG ioctl does not give better capabilities for user space to waste > > EPC given that EADD ioctl already exists, i.e. your argument is logically > > incorrect. > > The point of adding EAUG is to allow more efficient use of EPC pages. > Without EAUG, enclaves have to EADD everything upfront into EPC, consuming > predetermined number of EPC pages, some of which may not be used at all. > With EAUG, enclaves should be able to load minimal pages to get started, > pages added on #PF as they are actually accessed. > > Obviously as you pointed out, some usages make more sense to pre-EAUG (EAUG > before #PF). But your proposal of supporting only pre-EAUG here essentially > makes EAUG behave almost the same as EADD. If the current implementation > with EAUG on #PF can also use MAP_POPULATE for pre-EAUG (seems possible > based on Dave's comments), then it is flxible to cover all cases and allow > kernel to optimize allocation of EPC pages. There is no even a working #PF based implementation in existance, and your argument has too many if's for my taste. Reinette, can you squash this fixup to your patch set and send v3 so that we get to a working implementation that can be benchmarked against e.g. ioctl based version: https://lore.kernel.org/linux-sgx/20220304033918.361495-1-jarkko@kernel.org/T/#u This also objectively fixes some performance issues, e.g. EMODPE can be just used without any round-trips (v2 requires relax ioctl). BR, Jark