Received: by 2002:a05:6358:bb9e:b0:b9:5105:a5b4 with SMTP id df30csp4312731rwb; Tue, 6 Sep 2022 05:58:26 -0700 (PDT) X-Google-Smtp-Source: AA6agR6Ink3LVAOP0daWKvymGPZrI4KOoKlQ6gfb750ARrOXKvf5WWtgIDHZa/OLZyPOkJDad/o+ X-Received: by 2002:a17:907:c28:b0:73d:beb7:b9c2 with SMTP id ga40-20020a1709070c2800b0073dbeb7b9c2mr39202914ejc.336.1662469106334; Tue, 06 Sep 2022 05:58:26 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1662469106; cv=none; d=google.com; s=arc-20160816; b=WRg5SCht3yK1uMgr/xqwmPbqcop/yskvRXcUIfiBKeKDJ3wzE4QcoWRVXAYkI4lHYT eRkX6GixGxQGdxp5jb3AvP6xj/RZ6VY3eO8HxxWALI5N/bDrH6EPskqpMmLe5Db8vrZH f4tQkEvOpMAnCjyQ+V4pwIVG6UQbpA6LR0xlamP9LHgXMh/Wvrj2HREhhCYaIK1C7cjE norcioIYuU3565HrdQQaaA6/CjbMa9Lb7sXvkDv7h9csdBSQm94J/gviN9A+jCbaLo6g ket3V/zl5K3rUFL4J8TLgkLI4r2nUGk5JM83tnaDaYZ2VSzDU1QJASu9/KvNqDgl/tSi 8QTw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id:dkim-signature; bh=4JGTiDJcgL+m7f4YRbRBhULIgiYlDFX66WuxAILPmWA=; b=QRk2mevEui1jlizjlz85g460B4kOTtubue4dS2q24kfeTsF5PSH0Y+fAyWIA/ehpad 3XSxuNNAM553a13u6QQD54DTPaLGDPBFdeSSBNYcUH+FwwCDzt3tHIIJ0b+0MS06Avgr l4u8X6RMxqLrXTJsk7i5fG+DIizYxX4+UUgUIZ/Hki0vNun/Nn7UGbvdFjCBfwMBKN7b hsZAtA0InVVcMmSw6FKFWC9XynvwRzsd3EpFEx576ZC0AlMK9Osmk+kJycjr69H3+56B lnxhfyiKIgciJnl+ZZP+kiz/G/L48+6szTgksPCndfcPgdYeE2/CYZi49g05JJXOipYf n8nA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=iZUwtOJf; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id hu8-20020a170907a08800b0076f3512cb3fsi1037727ejc.200.2022.09.06.05.58.00; Tue, 06 Sep 2022 05:58:26 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=iZUwtOJf; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S240062AbiIFMpQ (ORCPT + 99 others); Tue, 6 Sep 2022 08:45:16 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:59198 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S240045AbiIFMpM (ORCPT ); Tue, 6 Sep 2022 08:45:12 -0400 Received: from mail-pl1-x62b.google.com (mail-pl1-x62b.google.com [IPv6:2607:f8b0:4864:20::62b]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id E48C53CBE9; Tue, 6 Sep 2022 05:45:10 -0700 (PDT) Received: by mail-pl1-x62b.google.com with SMTP id v5so11175433plo.9; Tue, 06 Sep 2022 05:45:10 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=content-transfer-encoding:in-reply-to:from:references:cc:to :content-language:subject:user-agent:mime-version:date:message-id :from:to:cc:subject:date; bh=4JGTiDJcgL+m7f4YRbRBhULIgiYlDFX66WuxAILPmWA=; b=iZUwtOJfsM1j7MbG17D7Q1RqfQ+KHHvhEAQUl8beLSej/Bi5zelwoEVkEstB2FqLeA z5BZHnGgRwuacozG2yTzl8QzMRXPTxT/Oelu6YuBwqV7zeZqdhnwCHzMR4JJiRi4kFtr SDD0uF/8aHL7ZuaLZiK3cvTm4oWcrqSc17gJ66QPPBpqRXlcRonid267w+lGFHWaKlff F02Fgh32dLBPti56AIwzkHX+M/q+1Y7RVIXy1od9r1Cx0XbsOaWsT23+peEuC55cDbRE xAbrAQvPdl1CJrBJZr+SP4y7Etbd5exQsx76n1wuMRzEczbGiW2MJHDNW9+XoBEIGgM5 E2tA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-transfer-encoding:in-reply-to:from:references:cc:to :content-language:subject:user-agent:mime-version:date:message-id :x-gm-message-state:from:to:cc:subject:date; bh=4JGTiDJcgL+m7f4YRbRBhULIgiYlDFX66WuxAILPmWA=; b=BH/2MymNL9nQxhS0w97cXqYiaB098ACXNPypsuvIpR/QmvrSQF68j2/wIcv7PfNHNF VD6vPRmembzkc/Db3u3iVPs52u/wrhLBZpHIVAwW+mjNhseKIYkVsyTDVRDQEkUtP4rA f+s62dH1lPe9AG/117osh0PCMLckAXmr2y5F6XYa9Nuok0MRRoMw3LzJ9uZngMhPDf0J UaIItpLKT1TkwdcUzVBgcAfCV3XALanQLzbzV20G/fvr5Ap1Ex9qioFbZbthth6BnXqE w9Aa6tp5iVW8pLmqxHuAKTvybfiHhWBYn6JYoGMI88kS+PGme6zqwUAjUL2gUHyGrv+e f7iw== X-Gm-Message-State: ACgBeo3dsNa07a51WX6fdRqzjUidmsEETDVDFyb9UvEcd87PlRZkdA+6 KYFWEgzAfy8/sCWXevWJln4= X-Received: by 2002:a17:902:76c6:b0:175:395c:b650 with SMTP id j6-20020a17090276c600b00175395cb650mr33705142plt.98.1662468310345; Tue, 06 Sep 2022 05:45:10 -0700 (PDT) Received: from [192.168.255.10] ([103.7.29.32]) by smtp.gmail.com with ESMTPSA id z4-20020a170902ccc400b00172c7a2c662sm9676801ple.48.2022.09.06.05.45.08 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Tue, 06 Sep 2022 05:45:09 -0700 (PDT) Message-ID: Date: Tue, 6 Sep 2022 20:45:00 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:91.0) Gecko/20100101 Thunderbird/91.13.0 Subject: Re: [PATCH 3/4] KVM: x86/svm/pmu: Add AMD PerfMonV2 support Content-Language: en-US To: Jim Mattson Cc: Sean Christopherson , Paolo Bonzini , Sandipan Das , kvm@vger.kernel.org, linux-kernel@vger.kernel.org References: <20220905123946.95223-1-likexu@tencent.com> <20220905123946.95223-4-likexu@tencent.com> From: Like Xu In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-3.9 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,FREEMAIL_FROM,NICE_REPLY_A, RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 6/9/2022 2:00 am, Jim Mattson wrote: > On Mon, Sep 5, 2022 at 5:44 AM Like Xu wrote: >> >> From: Like Xu >> >> If AMD Performance Monitoring Version 2 (PerfMonV2) is detected >> by the guest, it can use a new scheme to manage the Core PMCs using >> the new global control and status registers. >> >> In addition to benefiting from the PerfMonV2 functionality in the same >> way as the host (higher precision), the guest also can reduce the number >> of vm-exits by lowering the total number of MSRs accesses. >> >> In terms of implementation details, amd_is_valid_msr() is resurrected >> since three newly added MSRs could not be mapped to one vPMC. >> The possibility of emulating PerfMonV2 on the mainframe has also >> been eliminated for reasons of precision. >> >> Co-developed-by: Sandipan Das >> Signed-off-by: Sandipan Das >> Signed-off-by: Like Xu >> --- >> arch/x86/kvm/pmu.c | 6 +++++ >> arch/x86/kvm/svm/pmu.c | 50 +++++++++++++++++++++++++++++++++--------- >> arch/x86/kvm/x86.c | 11 ++++++++++ >> 3 files changed, 57 insertions(+), 10 deletions(-) >> >> diff --git a/arch/x86/kvm/pmu.c b/arch/x86/kvm/pmu.c >> index 7002e1b74108..56b4f898a246 100644 >> --- a/arch/x86/kvm/pmu.c >> +++ b/arch/x86/kvm/pmu.c >> @@ -455,12 +455,15 @@ int kvm_pmu_get_msr(struct kvm_vcpu *vcpu, struct msr_data *msr_info) >> >> switch (msr) { >> case MSR_CORE_PERF_GLOBAL_STATUS: >> + case MSR_AMD64_PERF_CNTR_GLOBAL_STATUS: >> msr_info->data = pmu->global_status; >> return 0; >> case MSR_CORE_PERF_GLOBAL_CTRL: >> + case MSR_AMD64_PERF_CNTR_GLOBAL_CTL: >> msr_info->data = pmu->global_ctrl; >> return 0; >> case MSR_CORE_PERF_GLOBAL_OVF_CTRL: >> + case MSR_AMD64_PERF_CNTR_GLOBAL_STATUS_CLR: >> msr_info->data = 0; >> return 0; >> default: >> @@ -479,12 +482,14 @@ int kvm_pmu_set_msr(struct kvm_vcpu *vcpu, struct msr_data *msr_info) >> >> switch (msr) { >> case MSR_CORE_PERF_GLOBAL_STATUS: >> + case MSR_AMD64_PERF_CNTR_GLOBAL_STATUS: >> if (msr_info->host_initiated) { >> pmu->global_status = data; >> return 0; >> } >> break; /* RO MSR */ >> case MSR_CORE_PERF_GLOBAL_CTRL: >> + case MSR_AMD64_PERF_CNTR_GLOBAL_CTL: >> if (pmu->global_ctrl == data) >> return 0; >> if (kvm_valid_perf_global_ctrl(pmu, data)) { >> @@ -495,6 +500,7 @@ int kvm_pmu_set_msr(struct kvm_vcpu *vcpu, struct msr_data *msr_info) >> } >> break; >> case MSR_CORE_PERF_GLOBAL_OVF_CTRL: >> + case MSR_AMD64_PERF_CNTR_GLOBAL_STATUS_CLR: >> if (!(data & pmu->global_ovf_ctrl_mask)) { >> if (!msr_info->host_initiated) >> pmu->global_status &= ~data; >> diff --git a/arch/x86/kvm/svm/pmu.c b/arch/x86/kvm/svm/pmu.c >> index 3a20972e9f1a..4c7d408e3caa 100644 >> --- a/arch/x86/kvm/svm/pmu.c >> +++ b/arch/x86/kvm/svm/pmu.c >> @@ -92,12 +92,6 @@ static struct kvm_pmc *amd_rdpmc_ecx_to_pmc(struct kvm_vcpu *vcpu, >> return amd_pmc_idx_to_pmc(vcpu_to_pmu(vcpu), idx & ~(3u << 30)); >> } >> >> -static bool amd_is_valid_msr(struct kvm_vcpu *vcpu, u32 msr) >> -{ >> - /* All MSRs refer to exactly one PMC, so msr_idx_to_pmc is enough. */ >> - return false; >> -} >> - >> static struct kvm_pmc *amd_msr_idx_to_pmc(struct kvm_vcpu *vcpu, u32 msr) >> { >> struct kvm_pmu *pmu = vcpu_to_pmu(vcpu); >> @@ -109,6 +103,29 @@ static struct kvm_pmc *amd_msr_idx_to_pmc(struct kvm_vcpu *vcpu, u32 msr) >> return pmc; >> } >> >> +static bool amd_is_valid_msr(struct kvm_vcpu *vcpu, u32 msr) >> +{ >> + struct kvm_pmu *pmu = vcpu_to_pmu(vcpu); >> + >> + switch (msr) { >> + case MSR_K7_EVNTSEL0 ... MSR_K7_PERFCTR3: >> + return pmu->version > 0; >> + case MSR_F15H_PERF_CTL0 ... MSR_F15H_PERF_CTR5: >> + return guest_cpuid_has(vcpu, X86_FEATURE_PERFCTR_CORE); >> + case MSR_AMD64_PERF_CNTR_GLOBAL_STATUS: >> + case MSR_AMD64_PERF_CNTR_GLOBAL_CTL: >> + case MSR_AMD64_PERF_CNTR_GLOBAL_STATUS_CLR: >> + return pmu->version > 1; >> + default: >> + if (msr > MSR_F15H_PERF_CTR5 && >> + msr < MSR_F15H_PERF_CTL0 + 2 * KVM_AMD_PMC_MAX_GENERIC) >> + return pmu->version > 1; > > Should this be bounded by guest CPUID.80000022H:EBX[NumCorePmc] > (unless host-initiated)? Indeed, how about: default: if (msr > MSR_F15H_PERF_CTR5 && msr < MSR_F15H_PERF_CTL0 + 2 * pmu->nr_arch_gp_counters) return pmu->version > 1; and for host-initiated: #define MSR_F15H_PERF_MSR_MAX \ (MSR_F15H_PERF_CTR0 + 2 * (KVM_AMD_PMC_MAX_GENERIC - 1)) kvm_{set|get}_msr_common() case MSR_F15H_PERF_CTL0 ... MSR_F15H_PERF_MSR_MAX: if (kvm_pmu_is_valid_msr(vcpu, msr)) return kvm_pmu_set_msr(vcpu, msr_info); ? > >> + break; >> + } >> + >> + return amd_msr_idx_to_pmc(vcpu, msr); >> +} >> + >> static int amd_pmu_get_msr(struct kvm_vcpu *vcpu, struct msr_data *msr_info) >> { >> struct kvm_pmu *pmu = vcpu_to_pmu(vcpu); >> @@ -162,20 +179,31 @@ static int amd_pmu_set_msr(struct kvm_vcpu *vcpu, struct msr_data *msr_info) >> static void amd_pmu_refresh(struct kvm_vcpu *vcpu) >> { >> struct kvm_pmu *pmu = vcpu_to_pmu(vcpu); >> + struct kvm_cpuid_entry2 *entry; >> + union cpuid_0x80000022_ebx ebx; >> >> - if (guest_cpuid_has(vcpu, X86_FEATURE_PERFCTR_CORE)) >> + pmu->version = 1; >> + entry = kvm_find_cpuid_entry_index(vcpu, 0x80000022, 0); >> + if (kvm_pmu_cap.version > 1 && entry && (entry->eax & BIT(0))) { >> + pmu->version = 2; >> + ebx.full = entry->ebx; >> + pmu->nr_arch_gp_counters = min3((unsigned int)ebx.split.num_core_pmc, >> + (unsigned int)kvm_pmu_cap.num_counters_gp, >> + (unsigned int)KVM_AMD_PMC_MAX_GENERIC); >> + pmu->global_ctrl_mask = ~((1ull << pmu->nr_arch_gp_counters) - 1); >> + pmu->global_ovf_ctrl_mask = pmu->global_ctrl_mask; >> + } else if (guest_cpuid_has(vcpu, X86_FEATURE_PERFCTR_CORE)) { >> pmu->nr_arch_gp_counters = AMD64_NUM_COUNTERS_CORE; > > The logic above doesn't seem quite right, since guest_cpuid_has(vcpu, > X86_FEATURE_PERFCTR_CORE) promises 6 PMCs, regardless of what > CPUID.80000022 says. I would have expected the appearance of CPUID.80000022 to override PERFCTR_CORE, now I don't think it's a good idea as you do, so how about: amd_pmu_refresh(): bool perfctr_core = guest_cpuid_has(vcpu, X86_FEATURE_PERFCTR_CORE); pmu->version = 1; if (kvm_pmu_cap.version > 1) entry = kvm_find_cpuid_entry_index(vcpu, 0x80000022, 0); if (!perfctr_core) pmu->nr_arch_gp_counters = AMD64_NUM_COUNTERS; if (entry && (entry->eax & BIT(0))) { pmu->version = 2; ebx.full = entry->ebx; pmu->nr_arch_gp_counters = min3((unsigned int)ebx.split.num_core_pmc, (unsigned int)kvm_pmu_cap.num_counters_gp, (unsigned int)KVM_AMD_PMC_MAX_GENERIC); } /* PERFCTR_CORE promises 6 PMCs, regardless of CPUID.80000022 */ if (perfctr_core) { pmu->nr_arch_gp_counters = max(pmu->nr_arch_gp_counters, AMD64_NUM_COUNTERS_CORE); } if (pmu->version > 1) { pmu->global_ctrl_mask = ~((1ull << pmu->nr_arch_gp_counters) - 1); pmu->global_ovf_ctrl_mask = pmu->global_ctrl_mask; } ?