Subject: Re: [PATCH v13 07/13] x86/sgx: Add data structures for tracking the EPC pages
From: Dave Hansen
To: Jarkko Sakkinen, x86@kernel.org, platform-driver-x86@vger.kernel.org
Cc: sean.j.christopherson@intel.com, nhorman@redhat.com, npmccallum@redhat.com, linux-sgx@vger.kernel.org, Serge Ayoun, Thomas Gleixner, Ingo Molnar, "H. Peter Anvin", Suresh Siddha, "open list:X86 ARCHITECTURE (32-BIT AND 64-BIT)"
Date: Mon, 27 Aug 2018 14:07:53 -0700
Message-ID: <4666cae8-c711-8dd5-cbce-3d97cc19a9e5@intel.com>
In-Reply-To: <20180827185507.17087-8-jarkko.sakkinen@linux.intel.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On 08/27/2018 11:53 AM, Jarkko Sakkinen wrote:
> Add data structures to track Enclave Page Cache (EPC) pages. EPC is
> divided into multiple banks (1-N) of which addresses and sizes can be
> enumerated with CPUID by the OS.
>
> On NUMA systems a node can have at most bank. A bank can be at most part of
> two nodes. SGX supports both nodes with a single memory controller and also
> sub-cluster nodes with severals memory controllers on a single die.
>
> Signed-off-by: Jarkko Sakkinen
> Co-developed-by: Serge Ayoun
> Co-developed-by: Sean Christopherson
> Signed-off-by: Serge Ayoun
> Signed-off-by: Sean Christopherson
> ---
>  arch/x86/include/asm/sgx.h      |  60 ++++++++++++++++++
>  arch/x86/kernel/cpu/intel_sgx.c | 106 +++++++++++++++++++++++++++++++-
>  2 files changed, 164 insertions(+), 2 deletions(-)
>
> diff --git a/arch/x86/include/asm/sgx.h b/arch/x86/include/asm/sgx.h
> index 2130e639ab49..17b7b3aa66bf 100644
> --- a/arch/x86/include/asm/sgx.h
> +++ b/arch/x86/include/asm/sgx.h
> @@ -4,9 +4,69 @@
>  #ifndef _ASM_X86_SGX_H
>  #define _ASM_X86_SGX_H
>
> +#include
> +#include
> +#include
>  #include
> +#include
> +#include
> +
> +#define SGX_MAX_EPC_BANKS 8

This is _still_ missing a meaningful description of what a bank is and
whether it is a hardware or software structure.

It would also help us to determine whether your bit packing below is
really required.

> +struct sgx_epc_page {
> +	unsigned long desc;
> +	struct list_head list;
> +};
> +
> +struct sgx_epc_bank {
> +	unsigned long pa;
> +	void *va;
> +	unsigned long size;

Please add units.  size could be bytes or pages, or who knows what.  I
can't tell you how many bugs I've tripped over in the past from simple
unit conversions.

> +	struct sgx_epc_page *pages_data;
> +	struct sgx_epc_page **pages;
> +	unsigned long free_cnt;
> +	spinlock_t lock;
> +};
>
>  extern bool sgx_enabled;
>  extern bool sgx_lc_enabled;
> +extern struct sgx_epc_bank sgx_epc_banks[SGX_MAX_EPC_BANKS];
> +
> +/*
> + * enum sgx_epc_page_desc - defines bits and masks for an EPC page's desc

Why are you bothering to pack these bits?  This seems a rather convoluted
way to store two integers.
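
To make the packing question concrete, here is a minimal user-space
sketch (not code from the patch) of the two options.  It assumes 4 KiB
pages, so a page-aligned physical address has 12 free low bits that can
carry the bank index, which is what the "(addr + (i << PAGE_SHIFT)) | index"
assignment in sgx_init_epc_bank() relies on.  All names below
(EPC_PAGE_SHIFT, epc_desc_pack(), struct epc_page_plain, ...) are
hypothetical and chosen only for illustration.

```c
#include <assert.h>
#include <stdint.h>

/* Assumption: 4 KiB pages, so the low 12 bits of a page address are zero. */
#define EPC_PAGE_SHIFT 12
#define EPC_BANK_MASK  ((UINT64_C(1) << EPC_PAGE_SHIFT) - 1)
#define EPC_ADDR_MASK  (~EPC_BANK_MASK)

/* Packed form: page-aligned physical address ORed with the bank index. */
static inline uint64_t epc_desc_pack(uint64_t pa, unsigned int bank)
{
	assert((pa & EPC_BANK_MASK) == 0);	/* address must be page aligned */
	assert(bank <= EPC_BANK_MASK);		/* index must fit in the low bits */
	return pa | bank;
}

/* Recover the physical address from a packed descriptor. */
static inline uint64_t epc_desc_pa(uint64_t desc)
{
	return desc & EPC_ADDR_MASK;
}

/* Recover the bank index from a packed descriptor. */
static inline unsigned int epc_desc_bank(uint64_t desc)
{
	return (unsigned int)(desc & EPC_BANK_MASK);
}

/* Unpacked alternative: two plain fields, no masks, more memory per page. */
struct epc_page_plain {
	uint64_t pa;		/* physical address of the EPC page, in bytes */
	unsigned int bank;	/* index into the bank array */
};
```

The packed form saves one field per tracked page at the cost of accessor
helpers and an alignment invariant; the plain struct needs neither.
Whether the saving matters depends on how many pages a bank holds, which
is why a description of what a bank is would help here.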
> +static __init int sgx_init_epc_bank(u64 addr, u64 size, unsigned long index,
> +				    struct sgx_epc_bank *bank)
> +{
> +	unsigned long nr_pages = size >> PAGE_SHIFT;
> +	struct sgx_epc_page *pages_data;
> +	unsigned long i;
> +	void *va;
> +
> +	va = ioremap_cache(addr, size);
> +	if (!va)
> +		return -ENOMEM;
> +
> +	pages_data = kcalloc(nr_pages, sizeof(struct sgx_epc_page), GFP_KERNEL);
> +	if (!pages_data)
> +		goto out_iomap;

This looks like you're roughly limited by the page allocator to a bank
size of ~1.4GB which seems kinda small.  Is this really OK?

> +	bank->pages = kcalloc(nr_pages, sizeof(struct sgx_epc_page *),
> +			      GFP_KERNEL);
> +	if (!bank->pages)
> +		goto out_pdata;
> +
> +	for (i = 0; i < nr_pages; i++) {
> +		bank->pages[i] = &pages_data[i];
> +		bank->pages[i]->desc = (addr + (i << PAGE_SHIFT)) | index;
> +	}
> +
> +	bank->pa = addr;
> +	bank->size = size;
> +	bank->va = va;
> +	bank->free_cnt = nr_pages;
> +	bank->pages_data = pages_data;
> +	spin_lock_init(&bank->lock);
> +	return 0;
> +out_pdata:
> +	kfree(pages_data);
> +out_iomap:
> +	iounmap(va);
> +	return -ENOMEM;
> +}
> +
> +static __init void sgx_page_cache_teardown(void)
> +{
> +	struct sgx_epc_bank *bank;
> +	int i;
> +
> +	for (i = 0; i < sgx_nr_epc_banks; i++) {
> +		bank = &sgx_epc_banks[i];
> +		iounmap((void *)bank->va);
> +		kfree(bank->pages);
> +		kfree(bank->pages_data);
> +	}
> +}
> +
> +static inline u64 sgx_combine_bank_regs(u64 low, u64 high)
> +{
> +	return (low & 0xFFFFF000) + ((high & 0xFFFFF) << 32);
> +}

-ENOCOMMENT for a rather weird looking calculation

> +static __init int sgx_page_cache_init(void)
> +{
> +	u32 eax, ebx, ecx, edx;
> +	u64 pa, size;
> +	int ret;
> +	int i;
> +
> +	for (i = 0; i < SGX_MAX_EPC_BANKS; i++) {
> +		cpuid_count(SGX_CPUID, 2 + i, &eax, &ebx, &ecx, &edx);
> +		if (!(eax & 0xF))
> +			break;

So, we have random data coming out of a random CPUID leaf being called
'eax' and then being tested against a random hard-coded mask.
This seems rather unfortunate for someone trying to understand the code.
Can we do better?

> +		pa = sgx_combine_bank_regs(eax, ebx);
> +		size = sgx_combine_bank_regs(ecx, edx);
> +		pr_info("EPC bank 0x%llx-0x%llx\n", pa, pa + size - 1);
> +		ret = sgx_init_epc_bank(pa, size, i, &sgx_epc_banks[i]);
> +		if (ret) {
> +			sgx_page_cache_teardown();
> +			return ret;
> +		}

So if one bank fails, we tear down all banks, yet leave sgx_nr_epc_banks
incremented?  That sounds troublesome.

> +		sgx_nr_epc_banks++;
> +	}
> +
> +	if (!sgx_nr_epc_banks) {
> +		pr_err("There are zero EPC banks.\n");
> +		return -ENODEV;
> +	}
> +
> +	return 0;
> +}

Does this support hot-addition of a bank?  If not, why not?
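
Going back to the CPUID decoding and the uncommented
sgx_combine_bank_regs(): a rough user-space sketch of what "better" might
look like follows.  It is not taken from the patch, and the macro and
function names are hypothetical.  It assumes only what the quoted code
itself implies: the low four bits of EAX act as a type field where zero
means there are no more banks, bits 31:12 of the low register hold bits
31:12 of the value, and bits 19:0 of the high register hold bits 51:32.
It uses OR where the patch uses "+", which gives the same result because
the two halves never overlap.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical names for the bare constants used in the patch. */
#define EPC_SUBLEAF_TYPE_MASK	0x0000000Fu	/* EAX[3:0]: 0 = no more banks */
#define EPC_SUBLEAF_LOW_MASK	0xFFFFF000u	/* EAX/ECX[31:12]: value bits 31:12 */
#define EPC_SUBLEAF_HIGH_MASK	0x000FFFFFu	/* EBX/EDX[19:0]: value bits 51:32 */

/* True if this CPUID sub-leaf describes a bank, false if enumeration ends. */
static inline bool epc_subleaf_valid(uint32_t eax)
{
	return (eax & EPC_SUBLEAF_TYPE_MASK) != 0;
}

/*
 * Build a 52-bit, page-aligned value (a base address or a size in bytes)
 * from a register pair: the low register supplies bits 31:12 and the
 * high register supplies bits 51:32.  Bits 11:0 are always zero.
 */
static inline uint64_t epc_combine_regs(uint32_t low, uint32_t high)
{
	return (uint64_t)(low & EPC_SUBLEAF_LOW_MASK) |
	       ((uint64_t)(high & EPC_SUBLEAF_HIGH_MASK) << 32);
}
```

With helpers along these lines, the loop body would read roughly as
"if (!epc_subleaf_valid(eax)) break;" followed by two calls to
epc_combine_regs(), and the comment on the helper documents where each
bit range comes from.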