From: Like Xu
To: Peter Zijlstra, Paolo Bonzini
Cc: Borislav Petkov, Sean Christopherson, Vitaly Kuznetsov, Wanpeng Li,
 Jim Mattson, Joerg Roedel, weijiang.yang@intel.com, Kan Liang,
 ak@linux.intel.com, wei.w.wang@intel.com, eranian@google.com,
 liuxiangdong5@huawei.com, linux-kernel@vger.kernel.org, x86@kernel.org,
 kvm@vger.kernel.org, Like Xu
Subject: [PATCH v6 07/16] KVM: x86/pmu: Reprogram PEBS event to
 emulate guest PEBS counter
Date: Tue, 11 May 2021 10:42:05 +0800
Message-Id: <20210511024214.280733-8-like.xu@linux.intel.com>
X-Mailer: git-send-email 2.31.1
In-Reply-To: <20210511024214.280733-1-like.xu@linux.intel.com>
References: <20210511024214.280733-1-like.xu@linux.intel.com>
X-Mailing-List: linux-kernel@vger.kernel.org

When a guest counter is configured as a PEBS counter through
IA32_PEBS_ENABLE, a guest PEBS event will be reprogrammed by
configuring a non-zero precision level in the perf_event_attr.

The guest PEBS overflow PMI bit is set in the guest GLOBAL_STATUS MSR
when the PEBS facility generates a PEBS overflow PMI based on the
guest IA32_DS_AREA MSR.

Even with the same counter index and the same event code and mask,
guest PEBS events will not be reused for non-PEBS events.

Originally-by: Andi Kleen
Co-developed-by: Kan Liang
Signed-off-by: Kan Liang
Signed-off-by: Like Xu
---
 arch/x86/kvm/pmu.c | 34 ++++++++++++++++++++++++++++++++--
 1 file changed, 32 insertions(+), 2 deletions(-)

diff --git a/arch/x86/kvm/pmu.c b/arch/x86/kvm/pmu.c
index 827886c12c16..0f86c1142f17 100644
--- a/arch/x86/kvm/pmu.c
+++ b/arch/x86/kvm/pmu.c
@@ -74,11 +74,21 @@ static void kvm_perf_overflow_intr(struct perf_event *perf_event,
 {
 	struct kvm_pmc *pmc = perf_event->overflow_handler_context;
 	struct kvm_pmu *pmu = pmc_to_pmu(pmc);
+	bool skip_pmi = false;
 
 	if (!test_and_set_bit(pmc->idx, pmu->reprogram_pmi)) {
-		__set_bit(pmc->idx, (unsigned long *)&pmu->global_status);
+		if (perf_event->attr.precise_ip) {
+			/* Indicate PEBS overflow PMI to guest. */
+			skip_pmi = __test_and_set_bit(GLOBAL_STATUS_BUFFER_OVF_BIT,
+						      (unsigned long *)&pmu->global_status);
+		} else {
+			__set_bit(pmc->idx, (unsigned long *)&pmu->global_status);
+		}
 		kvm_make_request(KVM_REQ_PMU, pmc->vcpu);
 
+		if (skip_pmi)
+			return;
+
 		/*
 		 * Inject PMI. If vcpu was in a guest mode during NMI PMI
 		 * can be ejected on a guest mode re-entry. Otherwise we can't
@@ -99,6 +109,7 @@ static void pmc_reprogram_counter(struct kvm_pmc *pmc, u32 type,
 				  bool exclude_kernel, bool intr,
 				  bool in_tx, bool in_tx_cp)
 {
+	struct kvm_pmu *pmu = vcpu_to_pmu(pmc->vcpu);
 	struct perf_event *event;
 	struct perf_event_attr attr = {
 		.type = type,
@@ -110,6 +121,7 @@ static void pmc_reprogram_counter(struct kvm_pmc *pmc, u32 type,
 		.exclude_kernel = exclude_kernel,
 		.config = config,
 	};
+	bool pebs = test_bit(pmc->idx, (unsigned long *)&pmu->pebs_enable);
 
 	attr.sample_period = get_sample_period(pmc, pmc->counter);
 
@@ -124,9 +136,23 @@ static void pmc_reprogram_counter(struct kvm_pmc *pmc, u32 type,
 		attr.sample_period = 0;
 		attr.config |= HSW_IN_TX_CHECKPOINTED;
 	}
+	if (pebs) {
+		/*
+		 * The non-zero precision level of guest event makes the ordinary
+		 * guest event becomes a guest PEBS event and triggers the host
+		 * PEBS PMI handler to determine whether the PEBS overflow PMI
+		 * comes from the host counters or the guest.
+		 *
+		 * For most PEBS hardware events, the difference in the software
+		 * precision levels of guest and host PEBS events will not affect
+		 * the accuracy of the PEBS profiling result, because the "event IP"
+		 * in the PEBS record is calibrated on the guest side.
+		 */
+		attr.precise_ip = 1;
+	}
 
 	event = perf_event_create_kernel_counter(&attr, -1, current,
-						 intr ? kvm_perf_overflow_intr :
+						 (intr || pebs) ? kvm_perf_overflow_intr :
 						 kvm_perf_overflow, pmc);
 	if (IS_ERR(event)) {
 		pr_debug_ratelimited("kvm_pmu: event creation failed %ld for pmc->idx = %d\n",
@@ -161,6 +187,10 @@ static bool pmc_resume_counter(struct kvm_pmc *pmc)
 			      get_sample_period(pmc, pmc->counter)))
 		return false;
 
+	if (!test_bit(pmc->idx, (unsigned long *)&pmc_to_pmu(pmc)->pebs_enable) &&
+	    pmc->perf_event->attr.precise_ip)
+		return false;
+
 	/* reuse perf_event to serve as pmc_reprogram_counter() does*/
 	perf_event_enable(pmc->perf_event);
 
-- 
2.31.1
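
For readers following the overflow path in kvm_perf_overflow_intr(), here
is a rough userspace sketch of the status-bit decision the first hunk
introduces. It is not kernel code: struct toy_pmu, toy_test_and_set() and
toy_overflow() are hypothetical names made up for illustration, and the
model assumes a plain 64-bit word in place of the kernel bitops. Only
GLOBAL_STATUS_BUFFER_OVF_BIT (bit 62) is taken from the kernel headers.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Bit 62 of the global status MSR reports a DS (PEBS) buffer overflow. */
#define GLOBAL_STATUS_BUFFER_OVF_BIT 62

/* Hypothetical stand-in for the two kvm_pmu fields the handler touches. */
struct toy_pmu {
	uint64_t global_status;
	uint64_t reprogram_pmi;
};

/* Set a bit and report whether it was already set (non-atomic model). */
static bool toy_test_and_set(uint64_t *word, unsigned int bit)
{
	assert(bit < 64);
	uint64_t mask = UINT64_C(1) << bit;
	bool was_set = (*word & mask) != 0;

	*word |= mask;
	return was_set;
}

/*
 * Model of the overflow decision. Returns true when a PMI would be
 * injected, false when the overflow is dropped or folded into a
 * PEBS overflow indication that is already pending.
 */
static bool toy_overflow(struct toy_pmu *pmu, unsigned int idx, bool precise)
{
	bool skip_pmi = false;

	/* A counter already queued for reprogramming reports nothing new. */
	if (toy_test_and_set(&pmu->reprogram_pmi, idx))
		return false;

	if (precise) {
		/* PEBS event: raise the shared buffer-overflow bit once. */
		skip_pmi = toy_test_and_set(&pmu->global_status,
					    GLOBAL_STATUS_BUFFER_OVF_BIT);
	} else {
		/* Ordinary event: raise the per-counter overflow bit. */
		pmu->global_status |= UINT64_C(1) << idx;
	}

	return !skip_pmi;
}
```

In this model a second PEBS overflow on a different counter finds bit 62
already set and does not ask for another PMI, which mirrors the skip_pmi
early return in the patch.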