Received: by 10.213.65.68 with SMTP id h4csp653826imn; Tue, 13 Mar 2018 16:44:30 -0700 (PDT) X-Google-Smtp-Source: AG47ELv0BOFLghX/OL3RwYLJcuQek8TtiVaDzRE9yKW+82/CWVO4xkDRGgheVQJvn0pG4L0mneMt X-Received: by 10.99.167.2 with SMTP id d2mr1907568pgf.408.1520984670310; Tue, 13 Mar 2018 16:44:30 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1520984670; cv=none; d=google.com; s=arc-20160816; b=mlvlfEC0tR/sAXsU5TsxsanQPBj8vjiU8wJEzw+7R2dhA1ZFrn518IKM1jzjsxwovn xgKzWT7KmbbYEYIMpeAnjpOFzC3N5GffFZ49roo+zJtp6gNt6OZ/MTb7N9SjZgHMmdup yOWU7IAov2b+zuT6iO/IZcjfgAsViusOwsP1fd+6opIJ797c52Ljsuzf7jsZH2torg3H 0vKq5I/ezTxcM3uSEN6yiPR69gabVagO791ZX6D/N3KlgZg/isIsa2ErWQUnMKvP1THy 4YcC3xofI18Cw6FfuskgBkB2gdKgsklz7YXs1E00G18HSyQ1gLn76mKt1pMINMIMiriU Fzsg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:cc:to:subject :message-id:date:from:references:in-reply-to:mime-version :dkim-signature:arc-authentication-results; bh=zW9RxqGfDDbavXLTga1JY2hP2KntWpoi6iEtHuC8ocE=; b=AD0Z2UsxTDjsBF0IO/bsFqS4xgDMgIOG+whvvMhQdYWX5JdJlvWOrpb6flakZl9fzK TquM6Zi9th5Qh2WA1lWm42/I8Dqg0H2ltbTjl2LsrtxJJM00jhOYOqiNcjen+T0kFax+ 3rxXWIBxFxk2ymb1c8K7MHE71GXjUvKZNU5Gp6lDNcXzEAcTPCjsUTh/kfFtoFvmxJkD pD7aa/s5++J46XVxHhQrSjA5SOYQdQMKDoHJ3J6TkQf6MO5MUCir1Aw2R2SBpcsp2Irh IvySit8I7cB0kuWMG94cPTEbZqZPkPtRyyyGHFxpt8naIttAls8cToFjGDo6oKvNlOqj C5gw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=tUtZ2AUe; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id l63-v6si937312plb.7.2018.03.13.16.44.16; Tue, 13 Mar 2018 16:44:30 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=tUtZ2AUe; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932864AbeCMXmC (ORCPT + 99 others); Tue, 13 Mar 2018 19:42:02 -0400 Received: from mail-ot0-f196.google.com ([74.125.82.196]:44762 "EHLO mail-ot0-f196.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932349AbeCMXmA (ORCPT ); Tue, 13 Mar 2018 19:42:00 -0400 Received: by mail-ot0-f196.google.com with SMTP id 79-v6so1436382oth.11; Tue, 13 Mar 2018 16:42:00 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc:content-transfer-encoding; bh=zW9RxqGfDDbavXLTga1JY2hP2KntWpoi6iEtHuC8ocE=; b=tUtZ2AUeH7SXvP00MT/TPgQXb331wQeRQVW5jKVe5MpyEhRLFH5z5jJymnKrhHK2NZ ej7GEDMunHWXSGCBlCIgknm1jWB7BUqQkOudM2v1K0leIfJY1G03REXJuwCWanbTGdN7 u4g3JjI6QnixByvOwmtL/p7v8M0RyP5H/IUmcWpLuZnp86dg0OWcDQKocVO6nKxsZolO JIhOWNGlmwaMpxqQjyCYkARyyT9Tgryq39VadCkbrLVlPgdldATGdsFfl2aqCUXBWcHC sY8+EiJ3WTT9Q6kh2cELaMJVQGba5cXLphuYk82KJxPrU1NLiNSTsIKnWpLBNM7XEcXI RXQw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc:content-transfer-encoding; bh=zW9RxqGfDDbavXLTga1JY2hP2KntWpoi6iEtHuC8ocE=; b=VOKvXhNr/puNyJOXtHsC/IEoGqixQbPArTLnLGZtR1W1tD5qW4FH3hJkgGQA76sXPh Gx7wLLFSo0bNX9DGldrfPYO6ETliRt0v81PkiOC6b1rfi/5HVPD37YLLJwu+lbFPR/4w 9+mPX3lx9izJmX/OubjgE+YhIgp2nAfLkfV8WYzqGlK/o5m1LFjXdxa3IG5lp/TTVguS 3aQhfqefgQyGEVodS+KzRyi9e4UKNbe7BZS6BurtTUDxsZ2VhuZPXJR5UXdIT22/np3q tEpcvp1YojpM10e+QMS+62O7IZjka0J+m2ikt2FtKZYif9wTHDR8Yl35NKfQbgsEo8D1 SM4g== X-Gm-Message-State: AElRT7F5j1AwlM2GCvSQW2bpz6woLJA7xh6t7foi03dXEpSLSTahKe8n gwX8ukhoXFcbbv3XMvVBihySPSxnx9D/T+DG+CI= X-Received: by 10.157.58.33 with SMTP id j30mr1844007otc.250.1520984517091; Tue, 13 Mar 2018 16:41:57 -0700 (PDT) MIME-Version: 1.0 Received: by 10.74.208.10 with HTTP; Tue, 13 Mar 2018 16:41:56 -0700 (PDT) In-Reply-To: References: <1520855584-10079-1-git-send-email-wanpengli@tencent.com> <1520855584-10079-2-git-send-email-wanpengli@tencent.com> From: Wanpeng Li Date: Wed, 14 Mar 2018 07:41:56 +0800 Message-ID: Subject: Re: [PATCH v2 1/3] KVM: X86: Provides userspace with a capability to not intercept MWAIT To: Jim Mattson Cc: LKML , kvm list , Paolo Bonzini , =?UTF-8?B?UmFkaW0gS3LEjW3DocWZ?= , =?UTF-8?B?SmFuIEggLiBTY2jDtm5oZXJy?= Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Jim, 2018-03-14 2:21 GMT+08:00 Jim Mattson : > Is there a need for a new API for yielding MONITOR/MWAIT to the guest? > Why not just tie this to the guest CPUID.01H:ECX[MWAIT] being set? The API also will be used by HLT/PAUSE. Please refer to Paolo's original proposal though I didn't find a link which is replied by Paolo direclty. https://marc.info/?l=3Dkvm&m=3D151182818103804&w=3D2 Regards, Wanpeng Li > > On Mon, Mar 12, 2018 at 4:53 AM, Wanpeng Li wrote: >> From: Wanpeng Li >> >> Allowing a guest to execute MWAIT without interception enables a guest >> to put a (physical) CPU into a power saving state, where it takes >> longer to return from than what may be desired by the host. >> >> Don't give a guest that power over a host by default. (Especially, >> since nothing prevents a guest from using MWAIT even when it is not >> advertised via CPUID.) >> >> Cc: Paolo Bonzini >> Cc: Radim Kr=C4=8Dm=C3=A1=C5=99 >> Cc: Jan H. Sch=C3=B6nherr >> Signed-off-by: Wanpeng Li >> --- >> Documentation/virtual/kvm/api.txt | 23 ++++++++++++++--------- >> arch/x86/include/asm/kvm_host.h | 2 ++ >> arch/x86/kvm/svm.c | 2 +- >> arch/x86/kvm/vmx.c | 9 +++++---- >> arch/x86/kvm/x86.c | 24 ++++++++++++++++++++---- >> arch/x86/kvm/x86.h | 10 +++++----- >> include/uapi/linux/kvm.h | 2 +- >> tools/include/uapi/linux/kvm.h | 2 +- >> 8 files changed, 49 insertions(+), 25 deletions(-) >> >> diff --git a/Documentation/virtual/kvm/api.txt b/Documentation/virtual/k= vm/api.txt >> index 98de506..76e5a15 100644 >> --- a/Documentation/virtual/kvm/api.txt >> +++ b/Documentation/virtual/kvm/api.txt >> @@ -4358,6 +4358,20 @@ enables QEMU to build error log and branch to gue= st kernel registered >> machine check handling routine. Without this capability KVM will >> branch to guests' 0x200 interrupt vector. >> >> +7.13 KVM_CAP_X86_DISABLE_EXITS >> + >> +Architectures: x86 >> +Parameters: args[0] defines which exits are disabled >> +Returns: 0 on success, -EINVAL when args[0] contains invalid exits >> + >> +Valid exits in args[0] are >> + >> +#define KVM_X86_DISABLE_EXITS_MWAIT (1 << 0) >> + >> +Enabling this capability on a VM provides userspace with a way to no >> +longer intercepts some instructions for improved latency in some >> +workloads. >> + >> 8. Other capabilities. >> ---------------------- >> >> @@ -4470,15 +4484,6 @@ reserved. >> Both registers and addresses are 64-bits wide. >> It will be possible to run 64-bit or 32-bit guest code. >> >> -8.8 KVM_CAP_X86_GUEST_MWAIT >> - >> -Architectures: x86 >> - >> -This capability indicates that guest using memory monotoring instructio= ns >> -(MWAIT/MWAITX) to stop the virtual CPU will not cause a VM exit. As su= ch time >> -spent while virtual CPU is halted in this way will then be accounted fo= r as >> -guest running time on the host (as opposed to e.g. HLT). >> - >> 8.9 KVM_CAP_ARM_USER_IRQ >> >> Architectures: arm, arm64 >> diff --git a/arch/x86/include/asm/kvm_host.h b/arch/x86/include/asm/kvm_= host.h >> index 0395c35..e107171 100644 >> --- a/arch/x86/include/asm/kvm_host.h >> +++ b/arch/x86/include/asm/kvm_host.h >> @@ -811,6 +811,8 @@ struct kvm_arch { >> >> gpa_t wall_clock; >> >> + bool mwait_in_guest; >> + >> bool ept_identity_pagetable_done; >> gpa_t ept_identity_map_addr; >> >> diff --git a/arch/x86/kvm/svm.c b/arch/x86/kvm/svm.c >> index be9c839..321b3fd 100644 >> --- a/arch/x86/kvm/svm.c >> +++ b/arch/x86/kvm/svm.c >> @@ -1390,7 +1390,7 @@ static void init_vmcb(struct vcpu_svm *svm) >> set_intercept(svm, INTERCEPT_XSETBV); >> set_intercept(svm, INTERCEPT_RSM); >> >> - if (!kvm_mwait_in_guest()) { >> + if (!kvm_mwait_in_guest(svm->vcpu.kvm)) { >> set_intercept(svm, INTERCEPT_MONITOR); >> set_intercept(svm, INTERCEPT_MWAIT); >> } >> diff --git a/arch/x86/kvm/vmx.c b/arch/x86/kvm/vmx.c >> index 6cefd7b..2302ae2 100644 >> --- a/arch/x86/kvm/vmx.c >> +++ b/arch/x86/kvm/vmx.c >> @@ -3733,13 +3733,11 @@ static __init int setup_vmcs_config(struct vmcs_= config *vmcs_conf) >> CPU_BASED_UNCOND_IO_EXITING | >> CPU_BASED_MOV_DR_EXITING | >> CPU_BASED_USE_TSC_OFFSETING | >> + CPU_BASED_MWAIT_EXITING | >> + CPU_BASED_MONITOR_EXITING | >> CPU_BASED_INVLPG_EXITING | >> CPU_BASED_RDPMC_EXITING; >> >> - if (!kvm_mwait_in_guest()) >> - min |=3D CPU_BASED_MWAIT_EXITING | >> - CPU_BASED_MONITOR_EXITING; >> - >> opt =3D CPU_BASED_TPR_SHADOW | >> CPU_BASED_USE_MSR_BITMAPS | >> CPU_BASED_ACTIVATE_SECONDARY_CONTROLS; >> @@ -5531,6 +5529,9 @@ static u32 vmx_exec_control(struct vcpu_vmx *vmx) >> exec_control |=3D CPU_BASED_CR3_STORE_EXITING | >> CPU_BASED_CR3_LOAD_EXITING | >> CPU_BASED_INVLPG_EXITING; >> + if (kvm_mwait_in_guest(vmx->vcpu.kvm)) >> + exec_control &=3D ~(CPU_BASED_MWAIT_EXITING | >> + CPU_BASED_MONITOR_EXITING); >> return exec_control; >> } >> >> diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c >> index 36ef3d8..5fae476 100644 >> --- a/arch/x86/kvm/x86.c >> +++ b/arch/x86/kvm/x86.c >> @@ -2809,9 +2809,15 @@ static int msr_io(struct kvm_vcpu *vcpu, struct k= vm_msrs __user *user_msrs, >> return r; >> } >> >> +static inline bool kvm_can_mwait_in_guest(void) >> +{ >> + return boot_cpu_has(X86_FEATURE_MWAIT) && >> + !boot_cpu_has_bug(X86_BUG_MONITOR); >> +} >> + >> int kvm_vm_ioctl_check_extension(struct kvm *kvm, long ext) >> { >> - int r; >> + int r =3D 0; >> >> switch (ext) { >> case KVM_CAP_IRQCHIP: >> @@ -2867,8 +2873,9 @@ int kvm_vm_ioctl_check_extension(struct kvm *kvm, = long ext) >> case KVM_CAP_ADJUST_CLOCK: >> r =3D KVM_CLOCK_TSC_STABLE; >> break; >> - case KVM_CAP_X86_GUEST_MWAIT: >> - r =3D kvm_mwait_in_guest(); >> + case KVM_CAP_X86_DISABLE_EXITS: >> + if(kvm_can_mwait_in_guest()) >> + r |=3D KVM_X86_DISABLE_EXITS_MWAIT; >> break; >> case KVM_CAP_X86_SMM: >> /* SMBASE is usually relocated above 1M on modern chipse= ts, >> @@ -2909,7 +2916,6 @@ int kvm_vm_ioctl_check_extension(struct kvm *kvm, = long ext) >> r =3D KVM_X2APIC_API_VALID_FLAGS; >> break; >> default: >> - r =3D 0; >> break; >> } >> return r; >> @@ -4214,6 +4220,16 @@ static int kvm_vm_ioctl_enable_cap(struct kvm *kv= m, >> >> r =3D 0; >> break; >> + case KVM_CAP_X86_DISABLE_EXITS: >> + r =3D -EINVAL; >> + if (cap->args[0] & ~KVM_X86_DISABLE_VALID_EXITS) >> + break; >> + >> + if ((cap->args[0] & KVM_X86_DISABLE_EXITS_MWAIT) && >> + kvm_can_mwait_in_guest()) >> + kvm->arch.mwait_in_guest =3D true; >> + r =3D 0; >> + break; >> default: >> r =3D -EINVAL; >> break; >> diff --git a/arch/x86/kvm/x86.h b/arch/x86/kvm/x86.h >> index b91215d..cd1215e 100644 >> --- a/arch/x86/kvm/x86.h >> +++ b/arch/x86/kvm/x86.h >> @@ -2,8 +2,6 @@ >> #ifndef ARCH_X86_KVM_X86_H >> #define ARCH_X86_KVM_X86_H >> >> -#include >> -#include >> #include >> #include >> #include "kvm_cache_regs.h" >> @@ -264,10 +262,12 @@ static inline u64 nsec_to_cycles(struct kvm_vcpu *= vcpu, u64 nsec) >> __rem; \ >> }) >> >> -static inline bool kvm_mwait_in_guest(void) >> +#define KVM_X86_DISABLE_EXITS_MWAIT (1 << 0) >> +#define KVM_X86_DISABLE_VALID_EXITS (KVM_X86_DISABLE_EXITS_MWA= IT) >> + >> +static inline bool kvm_mwait_in_guest(struct kvm *kvm) >> { >> - return boot_cpu_has(X86_FEATURE_MWAIT) && >> - !boot_cpu_has_bug(X86_BUG_MONITOR); >> + return kvm->arch.mwait_in_guest; >> } >> >> #endif >> diff --git a/include/uapi/linux/kvm.h b/include/uapi/linux/kvm.h >> index 088c2c9..1065006 100644 >> --- a/include/uapi/linux/kvm.h >> +++ b/include/uapi/linux/kvm.h >> @@ -929,7 +929,7 @@ struct kvm_ppc_resize_hpt { >> #define KVM_CAP_S390_GS 140 >> #define KVM_CAP_S390_AIS 141 >> #define KVM_CAP_SPAPR_TCE_VFIO 142 >> -#define KVM_CAP_X86_GUEST_MWAIT 143 >> +#define KVM_CAP_X86_DISABLE_EXITS 143 >> #define KVM_CAP_ARM_USER_IRQ 144 >> #define KVM_CAP_S390_CMMA_MIGRATION 145 >> #define KVM_CAP_PPC_FWNMI 146 >> diff --git a/tools/include/uapi/linux/kvm.h b/tools/include/uapi/linux/k= vm.h >> index 0fb5ef9..b13c257 100644 >> --- a/tools/include/uapi/linux/kvm.h >> +++ b/tools/include/uapi/linux/kvm.h >> @@ -924,7 +924,7 @@ struct kvm_ppc_resize_hpt { >> #define KVM_CAP_S390_GS 140 >> #define KVM_CAP_S390_AIS 141 >> #define KVM_CAP_SPAPR_TCE_VFIO 142 >> -#define KVM_CAP_X86_GUEST_MWAIT 143 >> +#define KVM_CAP_X86_DISABLE_EXITS 143 >> #define KVM_CAP_ARM_USER_IRQ 144 >> #define KVM_CAP_S390_CMMA_MIGRATION 145 >> #define KVM_CAP_PPC_FWNMI 146 >> -- >> 2.7.4 >>