Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id C6D22C433F5 for ; Tue, 14 Dec 2021 18:56:18 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S237209AbhLNS4R (ORCPT ); Tue, 14 Dec 2021 13:56:17 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:39550 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231987AbhLNS4Q (ORCPT ); Tue, 14 Dec 2021 13:56:16 -0500 Received: from mail-pf1-x430.google.com (mail-pf1-x430.google.com [IPv6:2607:f8b0:4864:20::430]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 4F8C4C061574 for ; Tue, 14 Dec 2021 10:56:16 -0800 (PST) Received: by mail-pf1-x430.google.com with SMTP id u80so18604008pfc.9 for ; Tue, 14 Dec 2021 10:56:16 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to; bh=fO41zWDQQqtUAZWhF4qywuUJLzeFNaam3H6vOEt/LkQ=; b=R14JMzhYTBttWXYr1U4EJVkcEQKPpXVtZKH2b5ohQOgtXm5ft0IfbnkC8QjsRP1mTa uAhLpb3TIXsRAA8ftrKoz9GdJrpSLOSPvsDoNcrRWyQ6hsdF/p6Kvh1GStCyXrU4Nli/ Qpua7wnBmZpf4uPuYkXr2qy92WMxjRw2DeUyTrTAv00hDCXIzNsxGsGiLAtrpORirR0q jqXHs94+tCDhgJpKJ9/QvBd1Z17+Bmiaz/1YhogplE2HGhiv3OgeYdsrt4RQs4FRoO9f 1Mpt5ZP4VzV3LKbbPOupisL5quwCK8+933NtyAKXUSNz4eqTfLIEB10O1BK+KeucEMSZ 9lpw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=fO41zWDQQqtUAZWhF4qywuUJLzeFNaam3H6vOEt/LkQ=; b=E5yKoO1OOWCnAYJX2JUL+ertrvYrqXaHj4+q/Ub7vusuJd+ITeLX3Fy3pvbiDSvlOB Y3jkBmXy3CD4JDWmA4H4gnYfDKNr+TSuWgZSysLGHp1fhcqTdvN24l3WxuVH1jn+2002 tj9Z08MeazJ448LDgZ53RV7Nug1Fxwg643OnjigBzVDjQJFqdaah0+vbJpRZfMvbrm6+ x9JZAfgwEI5Nn+p+4PNh1uiwI972oyov/mGjYE3KpvzEfpdsV/6qwYa3zLg17wL6pxu1 +f1Oa+O02HMQV7RxEVr5PZF+CmPfjLIzSTLdQ/Nf3ygw4qki8H+of2cfQFojPfp/2Atp JCbw== X-Gm-Message-State: AOAM531medoSazsgRzO+fQUBbCQoVDe0sh+00tYTJb9KqPBDSbeUPnow nqnMi0vibgrEiKb9aE0lUf2csg== X-Google-Smtp-Source: ABdhPJxD4FVAIHptbkERfJnHHMLsFKYvx2nE1poXME33PIV0WBGnwDM6XtYE24tRDZW+kciy4qTEwA== X-Received: by 2002:a63:3285:: with SMTP id y127mr4846666pgy.479.1639508175593; Tue, 14 Dec 2021 10:56:15 -0800 (PST) Received: from google.com (157.214.185.35.bc.googleusercontent.com. [35.185.214.157]) by smtp.gmail.com with ESMTPSA id m6sm408934pgb.31.2021.12.14.10.56.14 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 14 Dec 2021 10:56:15 -0800 (PST) Date: Tue, 14 Dec 2021 18:56:11 +0000 From: Sean Christopherson To: Kechen Lu Cc: kvm@vger.kernel.org, pbonzini@redhat.com, wanpengli@tencent.com, rkrcmar@redhat.com, vkuznets@redhat.com, somduttar@nvidia.com, linux-kernel@vger.kernel.org Subject: Re: [RFC PATCH] KVM: x86: add kvm per-vCPU exits disable capability Message-ID: References: <20211214033227.264714-1-kechenl@nvidia.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20211214033227.264714-1-kechenl@nvidia.com> Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Dec 13, 2021, Kechen Lu wrote: > --- > Documentation/virt/kvm/api.rst | 8 +++++++- > arch/x86/include/asm/kvm_host.h | 1 + > arch/x86/kvm/cpuid.c | 2 +- > arch/x86/kvm/svm/svm.c | 2 +- > arch/x86/kvm/vmx/vmx.c | 4 ++-- > arch/x86/kvm/x86.c | 5 ++++- > arch/x86/kvm/x86.h | 5 +++-- > include/uapi/linux/kvm.h | 4 +++- > 8 files changed, 22 insertions(+), 9 deletions(-) > > diff --git a/Documentation/virt/kvm/api.rst b/Documentation/virt/kvm/api.rst > index aeeb071c7688..9a44896dc950 100644 > --- a/Documentation/virt/kvm/api.rst > +++ b/Documentation/virt/kvm/api.rst > @@ -6580,6 +6580,9 @@ branch to guests' 0x200 interrupt vector. > > :Architectures: x86 > :Parameters: args[0] defines which exits are disabled > + args[1] defines vCPU bitmask based on vCPU ID, 1 on > + corresponding vCPU ID bit would enable exists > + on that vCPU > :Returns: 0 on success, -EINVAL when args[0] contains invalid exits > > Valid bits in args[0] are:: > @@ -6588,13 +6591,16 @@ Valid bits in args[0] are:: > #define KVM_X86_DISABLE_EXITS_HLT (1 << 1) > #define KVM_X86_DISABLE_EXITS_PAUSE (1 << 2) > #define KVM_X86_DISABLE_EXITS_CSTATE (1 << 3) > + #define KVM_X86_DISABLE_EXITS_PER_VCPU (1UL << 63) This doesn't scale, there are already plenty of use cases for VMs with 65+ vCPUs. At a glance, I don't see anything fundamentally wrong with simply supporting a vCPU-scoped ioctl(). The VM-scoped version already has an undocumented requirement that it be called before vCPUs are created, because neither VMX nor SVM will update the controls if exits are disabled after vCPUs are created. That means the flags checked at runtime can be purely vCPU, with the per-VM flag picked up at vCPU creation. Probably worth formalizing that requirement too, e.g. diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c index 85127b3e3690..6c9bc022a522 100644 --- a/arch/x86/kvm/x86.c +++ b/arch/x86/kvm/x86.c @@ -5775,6 +5775,10 @@ int kvm_vm_ioctl_enable_cap(struct kvm *kvm, if (cap->args[0] & ~KVM_X86_DISABLE_VALID_EXITS) break; + mutex_lock(&kvm->lock); + if (kvm->created_vcpus) + goto disable_exits_unlock; + if ((cap->args[0] & KVM_X86_DISABLE_EXITS_MWAIT) && kvm_can_mwait_in_guest()) kvm->arch.mwait_in_guest = true; @@ -5785,6 +5789,8 @@ int kvm_vm_ioctl_enable_cap(struct kvm *kvm, if (cap->args[0] & KVM_X86_DISABLE_EXITS_CSTATE) kvm->arch.cstate_in_guest = true; r = 0; +disable_exits_unlock: + mutex_unlock(&kvm->lock); break; case KVM_CAP_MSR_PLATFORM_INFO: kvm->arch.guest_can_read_msr_platform_info = cap->args[0];