Received: by 2002:a05:6358:11c7:b0:104:8066:f915 with SMTP id i7csp271525rwl; Thu, 6 Apr 2023 19:01:04 -0700 (PDT) X-Google-Smtp-Source: AKy350bdpk4zIL4Ric6Ns4+rilDt1MtDFY9gAOIKbZK6DRn0j3IJnf61zOAC6vdkr6JD0kKQDDyS X-Received: by 2002:a17:906:f18e:b0:8dd:5710:a017 with SMTP id gs14-20020a170906f18e00b008dd5710a017mr755594ejb.4.1680832864422; Thu, 06 Apr 2023 19:01:04 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1680832864; cv=none; d=google.com; s=arc-20160816; b=YgFqjeuM8/C1ZoYbAB1lG62yPAflob7PgMFuGNAR570D/TNFOlcKVc2cHz2YE5VVjs DVBycHkt/TraF660GPfjFTjdEXNoP4tSO4cEJ0TY9HDFo2H3hj0WhG5srOX0VqX0siuE ONQmbGV5pXZMo4+wC8qcih1yVgWflmtG18u0KTu0RcEd26tnTQxvX1y/HYG5VWaH4sEP G9W4Nw8D+dR4Lt4eqYPg+DeDxUnKxANgXed7B6EyXx0yRxePB+uBh763xJiMJbqGno27 4zmHKOL5755aWJi1vEljr4EmEOyljaSnnOTL7Zzatf+MYT4d1A7IKjmvdT0E4zekFNTo Oylw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:from:subject:message-id:references :mime-version:in-reply-to:date:dkim-signature; bh=9+58jKVdigi+Tne/Y0QC6raeydAklSYvu/dDGhyA9LI=; b=0kW6DnETuM2k0q61pxxNLqy2J433pIrJREZADU3MeVdErm3IUJ8bpYB68+5BhOFnIA 8b60G8PX+PiK6gfICQo5s7ZwZL5eEnABlOeS8ZF4YqT4Rx8ErSBSlgTuLJXhFDUN7Aob XCe6+E9fuYaeksrFcpTYvYGoTw9Uqg3zCqA7rUhELANjqrAUc5wYYdoOh0+JGeRsQuFD WCrrPF51xZ4RK1wBZFPvD09A6PYbVJz4ArK0XsbBT3ctC8AFk2h2GsfEo9+tOuAetN8Y hhQgkoa/QLACRot0EKvQPZaNgj42b/wd+WnD8dRfCVh6g6J7uVBg5BlkDha9YAwxCqLE cC8A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=QtXcg8Xg; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id pk6-20020a170906d7a600b009334edaed9asi2135840ejb.976.2023.04.06.19.00.39; Thu, 06 Apr 2023 19:01:04 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=QtXcg8Xg; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S239196AbjDGBfM (ORCPT + 99 others); Thu, 6 Apr 2023 21:35:12 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:39150 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230440AbjDGBfL (ORCPT ); Thu, 6 Apr 2023 21:35:11 -0400 Received: from mail-pj1-x1049.google.com (mail-pj1-x1049.google.com [IPv6:2607:f8b0:4864:20::1049]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 2AA075597 for ; Thu, 6 Apr 2023 18:35:09 -0700 (PDT) Received: by mail-pj1-x1049.google.com with SMTP id ge3-20020a17090b0e0300b00244aca4bb4aso447459pjb.1 for ; Thu, 06 Apr 2023 18:35:09 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; t=1680831308; h=cc:to:from:subject:message-id:references:mime-version:in-reply-to :date:from:to:cc:subject:date:message-id:reply-to; bh=9+58jKVdigi+Tne/Y0QC6raeydAklSYvu/dDGhyA9LI=; b=QtXcg8XgBdeJFclfdJA1Gf40qCN5Njp44/cYv+xnps8qZm4RxwJvBPd/ZXB2O1kZxt fI37KySC7rXT/vtJcCJ0YPSh3zLPDBHF5+0u9VVGq/JQ6VxWVwLTZMbKkSnB4Jj7ewKp eXeY4mdzSjSUR3e1I2UFxyuH7KkGVzF+rTqbz0Q1GtBpuqoSv3hUivl3Es0uArsVPn5k u/7dCWKGMyrqBo5Y9Fdtah13J8Tw6wL1XeVcx0v6mSlnhc0bLh8hOQ/gJWv0SwIh7b4D xkPJw+eVloe+TNpyvtLqmrxzg1DmhvRTYHcMCXS9p6DFl9TAqBKSXcLqrl1tOM8BC954 9ArQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; t=1680831308; h=cc:to:from:subject:message-id:references:mime-version:in-reply-to :date:x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=9+58jKVdigi+Tne/Y0QC6raeydAklSYvu/dDGhyA9LI=; b=YclJrWwS9N29X5lTDTmrYQVVOISahJnZcM8jHEWYmH/HBuqf/T7NV2DcbfwtWYwgMf Zc59RKa4UPkdZUDa3biaDoZFWP3d9fO3H8IP08W33UIxI5NphdGIxiyBvI40QBcXFHbu KvvT3cgb0shgaUQEIj3ccjJ8A3WiVqKC+rH9gB4UOdR8lX1XvvjEm0xUgdFWiUdGCuvc 0EiVTMhWpIWXbKfmfzc8lELf1FQcJYgudIcYDj83oKrdYFoRwA00u9OEbp/sPaFLPwnW lvD4YuWhy9QOPY1gl1jIfxfszmrge5JmttBzb4rmvSZj6sf3HCvi1S+rQwyg+xplLQw2 wnqQ== X-Gm-Message-State: AAQBX9dXpLsDNdXW1/CdwhzJFYg6L4fOp2gL8l4G9/EqWYskANVI24ZJ kKBXgxCoE7Ae6nzNJDA9s1uK4n9oUIQ= X-Received: from zagreus.c.googlers.com ([fda3:e722:ac3:cc00:7f:e700:c0a8:5c37]) (user=seanjc job=sendgmr) by 2002:a17:90a:b316:b0:23d:bd2:ab35 with SMTP id d22-20020a17090ab31600b0023d0bd2ab35mr163222pjr.3.1680831308682; Thu, 06 Apr 2023 18:35:08 -0700 (PDT) Date: Thu, 6 Apr 2023 18:35:06 -0700 In-Reply-To: <20230214050757.9623-12-likexu@tencent.com> Mime-Version: 1.0 References: <20230214050757.9623-1-likexu@tencent.com> <20230214050757.9623-12-likexu@tencent.com> Message-ID: Subject: Re: [PATCH v4 11/12] KVM: x86/svm/pmu: Add AMD PerfMonV2 support From: Sean Christopherson To: Like Xu Cc: Paolo Bonzini , kvm@vger.kernel.org, linux-kernel@vger.kernel.org, Sandipan Das Content-Type: text/plain; charset="us-ascii" X-Spam-Status: No, score=-7.7 required=5.0 tests=DKIMWL_WL_MED,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_PASS,USER_IN_DEF_DKIM_WL autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Feb 14, 2023, Like Xu wrote: > + case MSR_AMD64_PERF_CNTR_GLOBAL_STATUS: > + if (!msr_info->host_initiated) > + return 0; /* Writes are ignored */ Where is the "writes ignored" behavior documented? I can't find anything in the APM that defines write behavior. > > pmu->global_status = data; > return 0; > case MSR_CORE_PERF_GLOBAL_CTRL: > if (!kvm_valid_perf_global_ctrl(pmu, data)) > return 1; > - > + fallthrough; This _definitely_ needs a comment. Hmm, and I would prefer to reverse these, i.e. case MSR_AMD64_PERF_CNTR_GLOBAL_CTL: data &= ~pmu->global_ctrl_mask; fallthrough; case MSR_CORE_PERF_GLOBAL_CTRL: if (!kvm_valid_perf_global_ctrl(pmu, data)) return 1; It's a bit arbitrary, but either Intel or AMD is going to end up with extra code, and IMO skipping a validity check is more alarming than skipping clearing of reserved bits, i.e. will look like a bug to future readers. > + case MSR_AMD64_PERF_CNTR_GLOBAL_CTL: > + data &= ~pmu->global_ctrl_mask; > if (pmu->global_ctrl != data) { > diff = pmu->global_ctrl ^ data; > pmu->global_ctrl = data; > @@ -616,7 +625,8 @@ int kvm_pmu_set_msr(struct kvm_vcpu *vcpu, struct msr_data *msr_info) > case MSR_CORE_PERF_GLOBAL_OVF_CTRL: > if (data & pmu->global_ovf_ctrl_mask) > return 1; > - > + fallthrough; Here too. Argh, the APM doesn't actually define what happens on reserved bits, it just says "WO". I vote to be conservative and ignore writes to reserved bits. And then we can have one comment for the whole block, e.g. /* * Note, AMD ignores writes to read-only PMU MSRs/bits, whereas Intel * generates #GP on attempts to write reserved bits or RO MSRs. */ switch (msr) { case MSR_CORE_PERF_GLOBAL_STATUS: if (!msr_info->host_initiated) return 1; /* RO MSR */ fallthrough; case MSR_AMD64_PERF_CNTR_GLOBAL_STATUS: if (!msr_info->host_initiated) break; pmu->global_status = data; break; case MSR_AMD64_PERF_CNTR_GLOBAL_CTL: data &= ~pmu->global_ctrl_mask; fallthrough; case MSR_CORE_PERF_GLOBAL_CTRL: if (!kvm_valid_perf_global_ctrl(pmu, data)) return 1; if (pmu->global_ctrl != data) { diff = pmu->global_ctrl ^ data; pmu->global_ctrl = data; reprogram_counters(pmu, diff); } break; case MSR_AMD64_PERF_CNTR_GLOBAL_STATUS_CLR: fallthrough; case MSR_CORE_PERF_GLOBAL_OVF_CTRL: if (data & pmu->global_ovf_ctrl_mask) return 1; if (!msr_info->host_initiated) pmu->global_status &= ~data; break; default: kvm_pmu_mark_pmc_in_use(vcpu, msr_info->index); return static_call(kvm_x86_pmu_set_msr)(vcpu, msr_info); } return 0; > @@ -164,20 +181,34 @@ static int amd_pmu_set_msr(struct kvm_vcpu *vcpu, struct msr_data *msr_info) > static void amd_pmu_refresh(struct kvm_vcpu *vcpu) > { > struct kvm_pmu *pmu = vcpu_to_pmu(vcpu); > + struct kvm_cpuid_entry2 *entry; > + union cpuid_0x80000022_ebx ebx; > > - if (guest_cpuid_has(vcpu, X86_FEATURE_PERFCTR_CORE)) > + pmu->version = 1; > + if (guest_cpuid_has(vcpu, X86_FEATURE_PERFMON_V2)) { > + pmu->version = 2; > + entry = kvm_find_cpuid_entry_index(vcpu, 0x80000022, 0); No need for the intermediate "entry". > + ebx.full = entry->ebx; Oof, at first glance this looks like a potential null-pointer deref bug. I believe we can do /* * Note, PERFMON_V2 is also in 0x80000022.0x0, i.e. the guest * CPUID entry is guaranteed to be non-NULL. */ BUILD_BUG_ON(x86_feature_cpuid(X86_FEATURE_PERFMON_V2).function != 0x80000022 || x86_feature_cpuid(X86_FEATURE_PERFMON_V2).index != 0x80000022); ebx.full = kvm_find_cpuid_entry_index(vcpu, 0x80000022, 0)->ebx; > + pmu->nr_arch_gp_counters = min_t(unsigned int, > + ebx.split.num_core_pmc, > + kvm_pmu_cap.num_counters_gp); > + } else if (guest_cpuid_has(vcpu, X86_FEATURE_PERFCTR_CORE)) { > pmu->nr_arch_gp_counters = AMD64_NUM_COUNTERS_CORE; This needs to be sanitized, no? E.g. if KVM only has access to 4 counters, but userspace sets X86_FEATURE_PERFCTR_CORE anyways. Hrm, unless I'm missing something, that's a pre-existing bug. If I'm right, can you add a patch to cap nr_arch_gp_counters at kvm_pmu_cap.num_counters_gp in the common flow, i.e. after this if-else block? Then there is no change needed in this patch, e.g. we'll naturally end up with: union cpuid_0x80000022_ebx ebx; pmu->version = 1; if (guest_cpuid_has(vcpu, X86_FEATURE_PERFMON_V2)) { pmu->version = 2; /* * Note, PERFMON_V2 is also in 0x80000022.0x0, i.e. the guest * CPUID entry is guaranteed to be non-NULL. */ BUILD_BUG_ON(x86_feature_cpuid(X86_FEATURE_PERFMON_V2).function != 0x80000022 || x86_feature_cpuid(X86_FEATURE_PERFMON_V2).index); ebx.full = kvm_find_cpuid_entry_index(vcpu, 0x80000022, 0)->ebx; pmu->nr_arch_gp_counters = ebx.split.num_core_pmc; } else if (guest_cpuid_has(vcpu, X86_FEATURE_PERFCTR_CORE)) { pmu->nr_arch_gp_counters = AMD64_NUM_COUNTERS_CORE; } else { pmu->nr_arch_gp_counters = AMD64_NUM_COUNTERS; } pmu->nr_arch_gp_counters = min_t(unsigned int, pmu->nr_arch_gp_counters, kvm_pmu_cap.num_counters_gp);