Received: by 2002:ac0:a5a6:0:0:0:0:0 with SMTP id m35-v6csp717481imm; Fri, 31 Aug 2018 11:16:58 -0700 (PDT) X-Google-Smtp-Source: ANB0VdbXf0PaNos5t73GSzMETfknlbFktf1sJdhSHpixwVZfEJiXli7b6lFrVge/c8JOgnfcIdLq X-Received: by 2002:a62:858c:: with SMTP id m12-v6mr17185111pfk.173.1535739418837; Fri, 31 Aug 2018 11:16:58 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1535739418; cv=none; d=google.com; s=arc-20160816; b=FtfUWH3ZXdoEVWfwrkyUmpeI2uv28qbA7UTqOZy8TliAuUX8/KTz9TMqshvymx4AY7 c8WWPt/RUU5FGh1VBfgK0Q6fFO9/Z9sX0f9F1FHlRRlhRBPm4EPRMeTpZYaZEZSHQWMf bFOrV/yrGdUlj1RB31pQ/9GRQxPkZPB63w74GhiM4Vzocbr/otkbQmDJJVmGkhHcUnn+ pEO4CBh1IPufjfSrchJ5ub/b1WQ52D7CX3qeumiJxDcjKiH53cbNCWEbKrNSSwQvlrZv H0Y8W5/l5Iseddi95D3eAyvEQd7PymXOg88GolQRVr0MigHBXsw/fvDjoTo2aHmAKk6O 6Uaw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=tafVIeY8aYmXqzTbfAUtsjISUhza7kQW2ZEkGT6zYD4=; b=Z5IkE6m2M4R712A5TfHwDUdEncYX+wOaJ9QALcwwNTs2+Q9b3SzeEyysJ7HWc47g2z EyjmSKn01RafJcNLahLcLln8rFG0Wu0Q7KNy6qxLDH3NZzv7e6jKTVs9MANoxfRQ2xgS BgTxx+k1OaqPqNubbwRlxAzTswZQzFU4NAskbI9HfqxZuhO07QSlFndNCzy0SQa+IWk0 5jgAAiR9cz4MjZ7XNukFmPCZod201y0E9LoV5j6M5DBojflevOF16HOsxAkSV9q9MRX9 2d0nfBI8COL8VMx/T2U+cspWVzAh/eJHlXc/gLVjHVa5c1UBOovDHScit2hZgXc619OX S2AA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id z89-v6si10066757pfd.357.2018.08.31.11.16.43; Fri, 31 Aug 2018 11:16:58 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727429AbeHaWXt (ORCPT + 99 others); Fri, 31 Aug 2018 18:23:49 -0400 Received: from mga18.intel.com ([134.134.136.126]:3133 "EHLO mga18.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727201AbeHaWXt (ORCPT ); Fri, 31 Aug 2018 18:23:49 -0400 X-Amp-Result: UNSCANNABLE X-Amp-File-Uploaded: False Received: from orsmga007.jf.intel.com ([10.7.209.58]) by orsmga106.jf.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 31 Aug 2018 11:15:09 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.53,312,1531810800"; d="scan'208";a="69420777" Received: from sjchrist-coffee.jf.intel.com (HELO linux.intel.com) ([10.54.74.20]) by orsmga007.jf.intel.com with ESMTP; 31 Aug 2018 11:15:09 -0700 Date: Fri, 31 Aug 2018 11:15:09 -0700 From: Sean Christopherson To: Jarkko Sakkinen Cc: "Huang, Kai" , "platform-driver-x86@vger.kernel.org" , "x86@kernel.org" , "nhorman@redhat.com" , "linux-kernel@vger.kernel.org" , "tglx@linutronix.de" , "suresh.b.siddha@intel.com" , "Ayoun, Serge" , "hpa@zytor.com" , "npmccallum@redhat.com" , "mingo@redhat.com" , "linux-sgx@vger.kernel.org" , "Hansen, Dave" Subject: Re: [PATCH v13 10/13] x86/sgx: Add sgx_einit() for initializing enclaves Message-ID: <20180831181509.GB21555@linux.intel.com> References: <20180827185507.17087-1-jarkko.sakkinen@linux.intel.com> <20180827185507.17087-11-jarkko.sakkinen@linux.intel.com> <1535406078.3416.9.camel@intel.com> <20180828070129.GA5301@linux.intel.com> <105F7BF4D0229846AF094488D65A09893541037C@PGSMSX112.gar.corp.intel.com> <20180831121645.GA18075@linux.intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180831121645.GA18075@linux.intel.com> User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Aug 31, 2018 at 03:17:03PM +0300, Jarkko Sakkinen wrote: > On Wed, Aug 29, 2018 at 07:33:54AM +0000, Huang, Kai wrote: > > [snip..] > > > > > > > > > > > > @@ -38,6 +39,18 @@ static LIST_HEAD(sgx_active_page_list); static > > > > > DEFINE_SPINLOCK(sgx_active_page_list_lock); > > > > > static struct task_struct *ksgxswapd_tsk; static > > > > > DECLARE_WAIT_QUEUE_HEAD(ksgxswapd_waitq); > > > > > +static struct notifier_block sgx_pm_notifier; static u64 > > > > > +sgx_pm_cnt; > > > > > + > > > > > +/* The cache for the last known values of IA32_SGXLEPUBKEYHASHx > > > > > +MSRs > > > > > for each > > > > > + * CPU. The entries are initialized when they are first used by > > > > > sgx_einit(). > > > > > + */ > > > > > +struct sgx_lepubkeyhash { > > > > > + u64 msrs[4]; > > > > > + u64 pm_cnt; > > > > > > > > May I ask why do we need pm_cnt here? In fact why do we need suspend > > > > staff (namely, sgx_pm_cnt above, and related code in this patch) here > > > > in this patch? From the patch commit message I don't see why we need > > > > PM staff here. Please give comment why you need PM staff, or you may > > > > consider to split the PM staff to another patch. > > > > > > Refining the commit message probably makes more sense because without PM > > > code sgx_einit() would be broken. The MSRs have been reset after waking up. > > > > > > Some kind of counter is required to keep track of the power cycle. When going > > > to sleep the sgx_pm_cnt is increased. sgx_einit() compares the current value of > > > the global count to the value in the cache entry to see whether we are in a new > > > power cycle. > > > > You mean reset to Intel default? I think we can also just reset the > > cached MSR values on each power cycle, which would be simpler, IMHO? > > I don't really see that much difference in the complexity. Tracking the validity of the cache means we're hosed if we miss any condition that causes the MSRs to be reset. I think we're better off assuming the cache can be stale at any time, i.e. don't track power cyles and instead handle EINIT failure due to INVALID_TOKEN by writing the cache+MSRs with the desired hash and retrying EINIT. EINIT is interruptible and its latency is extremely variable in any case, e.g. tens of thousands of cycles, so this rarely-hit "slow path" probably wouldn't affect the worst case latency of EINIT. > > I think we definitely need some code to handle S3-S5, but should be in > > separate patches, since I think the major impact of S3-S5 is entire > > EPC being destroyed. I think keeping pm_cnt is not sufficient enough > > to handle such case? > > The driver has SGX_POWER_LOST_ENCLAVE for ioctls and it deletes the TCS > entries. > > > > This brings up one question though: how do we deal with VM host going to sleep? > > > VM guest would not be aware of this. > > > > IMO VM just gets "sudden loss of EPC" after suspend & resume in host. > > SGX driver and SDK should be able to handle "sudden loss of EPC", ie, > > co-working together to re-establish the missing enclaves. > > This is not about EPC. It is already dealt by the driver. I'm concerned > about the MSR cache as it would mess up. > > But I guess this logic is part of the KVM code anyway now that I think > more of it. Ya, I expect that VMMs will preserve VM's virtual pubkey MSRs, though there might be scenarios where a VM has direct access to the hardware MSRs. This is probably a moot point since I don't think we want to assume the kernel's cache is 100% accurate, regardless of environment.