Received: by 2002:ac0:aa62:0:0:0:0:0 with SMTP id w31-v6csp3816074ima; Tue, 23 Oct 2018 11:44:51 -0700 (PDT) X-Google-Smtp-Source: AJdET5cWny+JY4pdleDgSTGdFn/0e1O4R1ey6ePKG0Aunm8CYVB67DSe0T/eYMLrDsN3nTLpiC9T X-Received: by 2002:a62:640c:: with SMTP id y12-v6mr2978441pfb.249.1540320291405; Tue, 23 Oct 2018 11:44:51 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1540320291; cv=none; d=google.com; s=arc-20160816; b=LG5gDfELpNeXDRVQEDmoqVEFWChppjr6E95iTIBVZpwUvyAfzl4BqUFxA1Q6csupbi Mzrt1yzufQbK+1lnzne2wRs7C4DPjOBFhu5OcaD8P6YyzvuO78bpbaqztI+HuaBJ/RZg 1xc19xL+Necl9iX34aD9cfGQmmNrviXibEOHca7014NS5nVVJXzCBM1sHQuEjUZq4Jp7 9LxBL/sd7+qxIVt+8zb6dQQMIvy/xXWNIPM8djEy5LkMbJHLb9y4tJynHMmyI5UxpZiA A1nrfhva5cYSFUvHIQ9+SEtq3zjNTkI8vYERw3D2Bq6Don9RAypLQR/tRT3j2yCrmfUL hBRw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from; bh=U0AHJsvC9B85R3+fw6Pcb9iHEc0NtYvAT3DjRkHD4vQ=; b=qMri6ZLA0NtxhkbXIEjpEgIEdXHS/TdjO5DZHAQo3HCU6xnmLC8ZbpPqXOgikKgnHA mRhJAYjMBXcbkDKwDMe+qRiyeu89YQm0nQNiiQJ5nSbqDJthQ9SmvjEZ3C2wbUL+qxkm YM8s/vsku3z7bV52ppc20xB1Bv5b8sGOQ53+fW8r2hJ2CFRhll0YE94zUerJ69EpWhV+ 9Nc1SgXFFDnCjHfE2vR1qFQ4AqfvQO9YLL7Xm+i9iqMbjStdj2LNFNmtTacSDx1+TPD0 ByH+P8PIFCUA2TxuM+B/KxMjw8flO5Je+i0cJb/vqo66DaFq0E+qWrILSN9KcB52dzuq Oo9Q== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id k1-v6si2048879pld.395.2018.10.23.11.44.35; Tue, 23 Oct 2018 11:44:51 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728894AbeJXDIC (ORCPT + 99 others); Tue, 23 Oct 2018 23:08:02 -0400 Received: from mga05.intel.com ([192.55.52.43]:55080 "EHLO mga05.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727758AbeJXDIB (ORCPT ); Tue, 23 Oct 2018 23:08:01 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga004.jf.intel.com ([10.7.209.38]) by fmsmga105.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 23 Oct 2018 11:43:26 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.54,417,1534834800"; d="scan'208";a="243726577" Received: from chang-linux-2.sc.intel.com ([10.3.52.139]) by orsmga004.jf.intel.com with ESMTP; 23 Oct 2018 11:43:25 -0700 From: "Chang S. Bae" To: Ingo Molnar , Thomas Gleixner , Andy Lutomirski , "H . Peter Anvin" Cc: Andi Kleen , Dave Hansen , Markus T Metzger , Ravi Shankar , "Chang S . Bae" , LKML Subject: [v3 04/12] x86/fsgsbase/64: Enable FSGSBASE instructions in the helper functions Date: Tue, 23 Oct 2018 11:42:26 -0700 Message-Id: <20181023184234.14025-5-chang.seok.bae@intel.com> X-Mailer: git-send-email 2.19.1 In-Reply-To: <20181023184234.14025-1-chang.seok.bae@intel.com> References: <20181023184234.14025-1-chang.seok.bae@intel.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org The helper functions will switch on faster accesses to FSBASE and GSBASE when the FSGSBASE feature is enabled. Accessing user GSBASE needs a couple of SWAPGS operations. It is avoidable if the user GSBASE is saved at kernel entry, being updated as changes, and restored back at kernel exit. However, it seems to spend more cycles for savings and restorations. Little or no benefit was measured from experiments. Signed-off-by: Chang S. Bae Reviewed-by: Andi Kleen Cc: Any Lutomirski Cc: H. Peter Anvin Cc: Thomas Gleixner Cc: Ingo Molnar Cc: Dave Hansen --- arch/x86/include/asm/fsgsbase.h | 17 +++---- arch/x86/kernel/process_64.c | 82 +++++++++++++++++++++++++++------ 2 files changed, 75 insertions(+), 24 deletions(-) diff --git a/arch/x86/include/asm/fsgsbase.h b/arch/x86/include/asm/fsgsbase.h index b4d4509b786c..e500d771155f 100644 --- a/arch/x86/include/asm/fsgsbase.h +++ b/arch/x86/include/asm/fsgsbase.h @@ -57,26 +57,23 @@ static __always_inline void wrgsbase(unsigned long gsbase) : "memory"); } +#include + /* Helper functions for reading/writing FS/GS base */ static inline unsigned long x86_fsbase_read_cpu(void) { unsigned long fsbase; - rdmsrl(MSR_FS_BASE, fsbase); + if (static_cpu_has(X86_FEATURE_FSGSBASE)) + fsbase = rdfsbase(); + else + rdmsrl(MSR_FS_BASE, fsbase); return fsbase; } -static inline unsigned long x86_gsbase_read_cpu_inactive(void) -{ - unsigned long gsbase; - - rdmsrl(MSR_KERNEL_GS_BASE, gsbase); - - return gsbase; -} - +extern unsigned long x86_gsbase_read_cpu_inactive(void); extern void x86_fsbase_write_cpu(unsigned long fsbase); extern void x86_gsbase_write_cpu_inactive(unsigned long gsbase); diff --git a/arch/x86/kernel/process_64.c b/arch/x86/kernel/process_64.c index 31b4755369f0..fcf18046c3d6 100644 --- a/arch/x86/kernel/process_64.c +++ b/arch/x86/kernel/process_64.c @@ -159,6 +159,36 @@ enum which_selector { GS }; +/* + * Interrupts are disabled here. Out of line to be protected from kprobes. + */ +static noinline __kprobes unsigned long rd_inactive_gsbase(void) +{ + unsigned long gsbase, flags; + + local_irq_save(flags); + native_swapgs(); + gsbase = rdgsbase(); + native_swapgs(); + local_irq_restore(flags); + + return gsbase; +} + +/* + * Interrupts are disabled here. Out of line to be protected from kprobes. + */ +static noinline __kprobes void wr_inactive_gsbase(unsigned long gsbase) +{ + unsigned long flags; + + local_irq_save(flags); + native_swapgs(); + wrgsbase(gsbase); + native_swapgs(); + local_irq_restore(flags); +} + /* * Saves the FS or GS base for an outgoing thread if FSGSBASE extensions are * not available. The goal is to be reasonably fast on non-FSGSBASE systems. @@ -337,22 +367,42 @@ static unsigned long x86_fsgsbase_read_task(struct task_struct *task, return base; } +unsigned long x86_gsbase_read_cpu_inactive(void) +{ + unsigned long gsbase; + + if (static_cpu_has(X86_FEATURE_FSGSBASE)) + gsbase = rd_inactive_gsbase(); + else + rdmsrl(MSR_KERNEL_GS_BASE, gsbase); + + return gsbase; +} + void x86_fsbase_write_cpu(unsigned long fsbase) { - /* - * Set the selector to 0 as a notion, that the segment base is - * overwritten, which will be checked for skipping the segment load - * during context switch. - */ - loadseg(FS, 0); - wrmsrl(MSR_FS_BASE, fsbase); + if (static_cpu_has(X86_FEATURE_FSGSBASE)) { + wrfsbase(fsbase); + } else { + /* + * Set the selector to 0 as a notion, that the segment base is + * overwritten, which will be checked for skipping the segment load + * during context switch. + */ + loadseg(FS, 0); + wrmsrl(MSR_FS_BASE, fsbase); + } } void x86_gsbase_write_cpu_inactive(unsigned long gsbase) { - /* Set the selector to 0 for the same reason as %fs above. */ - loadseg(GS, 0); - wrmsrl(MSR_KERNEL_GS_BASE, gsbase); + if (static_cpu_has(X86_FEATURE_FSGSBASE)) { + wr_inactive_gsbase(gsbase); + } else { + /* Set the selector to 0 for the same reason as %fs above. */ + loadseg(GS, 0); + wrmsrl(MSR_KERNEL_GS_BASE, gsbase); + } } unsigned long x86_fsbase_read_task(struct task_struct *task) @@ -361,7 +411,8 @@ unsigned long x86_fsbase_read_task(struct task_struct *task) if (task == current) fsbase = x86_fsbase_read_cpu(); - else if (task->thread.fsindex == 0) + else if (static_cpu_has(X86_FEATURE_FSGSBASE) || + (task->thread.fsindex == 0)) fsbase = task->thread.fsbase; else fsbase = x86_fsgsbase_read_task(task, task->thread.fsindex); @@ -375,7 +426,8 @@ unsigned long x86_gsbase_read_task(struct task_struct *task) if (task == current) gsbase = x86_gsbase_read_cpu_inactive(); - else if (task->thread.gsindex == 0) + else if (static_cpu_has(X86_FEATURE_FSGSBASE) || + (task->thread.gsindex == 0)) gsbase = task->thread.gsbase; else gsbase = x86_fsgsbase_read_task(task, task->thread.gsindex); @@ -396,7 +448,8 @@ int x86_fsbase_write_task(struct task_struct *task, unsigned long fsbase) task->thread.fsbase = fsbase; if (task == current) x86_fsbase_write_cpu(fsbase); - task->thread.fsindex = 0; + if (!static_cpu_has(X86_FEATURE_FSGSBASE)) + task->thread.fsindex = 0; preempt_enable(); return 0; @@ -411,7 +464,8 @@ int x86_gsbase_write_task(struct task_struct *task, unsigned long gsbase) task->thread.gsbase = gsbase; if (task == current) x86_gsbase_write_cpu_inactive(gsbase); - task->thread.gsindex = 0; + if (!static_cpu_has(X86_FEATURE_FSGSBASE)) + task->thread.gsindex = 0; preempt_enable(); return 0; -- 2.19.1