Received: by 2002:a05:6a10:17d3:0:0:0:0 with SMTP id hz19csp2319201pxb; Mon, 19 Apr 2021 02:54:17 -0700 (PDT) X-Google-Smtp-Source: ABdhPJydInKJ7kbGGKnC95gMhWGYpk4hs1XK1v8wIJc29P19p3H096N7i2pf+LsgKAyAqqUq2khm X-Received: by 2002:a65:62d7:: with SMTP id m23mr11121763pgv.244.1618826057651; Mon, 19 Apr 2021 02:54:17 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1618826057; cv=none; d=google.com; s=arc-20160816; b=A1xc4GOctx9Ap84UJiJz1UrOofFpCKrL6uAo9a0W0enVuzEmwYN/mWvY1peeqyPoF7 FtqW2WZ++sQ2AA0Zyp2Ny0WhK4KPLz6AR1dJvWcZhMf5ySlJt+orx3qHWwdQ+FqkCaMu L1xZWkKarWAIV4ybqS/KwQbsp7Si95j5TURKZOtmIJovm87KHJLVuHdUfsAEZyL5WjBo kT6AwNJpi19FJFMMUksyIcs7m9EfMtZtpTM6Ulea9WPtWAsEhZr80DalrwclP2f7iKpA K47F5aA1H9LOLwFD6ax26sZUySETbi5oV3xGzJpMz5VOesaH581ofiSvkFqESqs7mvbC gh/w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-language:content-transfer-encoding :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject:ironport-sdr:ironport-sdr; bh=ZPZe8FmQH3wPuLMc9euv1JSGEHyt9OCsXuPoxcjfjRQ=; b=OO4J35dTbp58TedOkCyg28/yDpzkN6v+4gpNhJgu+x+TP0cAp3IMNdLzLtJ+V893Pu +R56ExYAG1lSt81/CcB+FtssU6498dcA53GLvgTh+SwO+vtvFc9Mr908860AJMXyypCf SGTRW9oK5DjvXU57klU3OZk9whOq/S+hbd31UaR25M+2vuyGxk+6GWyS0gXQmRh7a8QC CxGkxyskBaj5Db4IGyClsZR1/6mv43YFWcuaarCOEU8Q9KR3AQx4bWrPRQ3sUSR8XYzd ouqz1/1vcQRLxLZZOQI5fa1yyP6QhDCApjKYNJ/bIllKwtYzH+tjUQzAF6c+Pnkf6dXi tqMg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id k18si17232478pll.324.2021.04.19.02.54.05; Mon, 19 Apr 2021 02:54:17 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S236756AbhDSISb (ORCPT + 99 others); Mon, 19 Apr 2021 04:18:31 -0400 Received: from mga07.intel.com ([134.134.136.100]:34272 "EHLO mga07.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S238169AbhDSISb (ORCPT ); Mon, 19 Apr 2021 04:18:31 -0400 IronPort-SDR: 0EOm6+UG79k5SAbgTEj/e+kg9JfhEzmAC1bY5VUVpGj5LDNMWFEmQE95ocHJt84L/x60n6p4fT pvNl1WO0b+gg== X-IronPort-AV: E=McAfee;i="6200,9189,9958"; a="259245845" X-IronPort-AV: E=Sophos;i="5.82,233,1613462400"; d="scan'208";a="259245845" Received: from orsmga008.jf.intel.com ([10.7.209.65]) by orsmga105.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 19 Apr 2021 01:17:59 -0700 IronPort-SDR: oR6E+ZXMvs/PYgGpV4EEpUULAIie472Ka8gep1tdPIvDEr/cSiwNovLvRE0wx8loZ1vV26mS7H EPbUH9jV31uQ== X-IronPort-AV: E=Sophos;i="5.82,233,1613462400"; d="scan'208";a="426419123" Received: from likexu-mobl1.ccr.corp.intel.com (HELO [10.238.4.93]) ([10.238.4.93]) by orsmga008-auth.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 19 Apr 2021 01:17:57 -0700 Subject: Re: [PATCH v5 06/16] KVM: x86/pmu: Reprogram PEBS event to emulate guest PEBS counter To: Liuxiangdong Cc: kvm@vger.kernel.org, x86@kernel.org, linux-kernel@vger.kernel.org, Andi Kleen , Like Xu , Xiexiangyou , "Fangyi (Eric)" References: <20210415032016.166201-1-like.xu@linux.intel.com> <20210415032016.166201-7-like.xu@linux.intel.com> <607D3B26.5020904@huawei.com> From: "Xu, Like" Message-ID: <478c630b-1dc7-e042-2884-21086c417d99@intel.com> Date: Mon, 19 Apr 2021 16:17:54 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.9.1 MIME-Version: 1.0 In-Reply-To: <607D3B26.5020904@huawei.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Content-Language: en-US Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2021/4/19 16:11, Liuxiangdong wrote: > > > On 2021/4/15 11:20, Like Xu wrote: >> When a guest counter is configured as a PEBS counter through >> IA32_PEBS_ENABLE, a guest PEBS event will be reprogrammed by >> configuring a non-zero precision level in the perf_event_attr. >> >> The guest PEBS overflow PMI bit would be set in the guest >> GLOBAL_STATUS MSR when PEBS facility generates a PEBS >> overflow PMI based on guest IA32_DS_AREA MSR. >> >> Even with the same counter index and the same event code and >> mask, guest PEBS events will not be reused for non-PEBS events. >> >> Originally-by: Andi Kleen >> Co-developed-by: Kan Liang >> Signed-off-by: Kan Liang >> Signed-off-by: Like Xu >> --- >>   arch/x86/kvm/pmu.c | 34 ++++++++++++++++++++++++++++++++-- >>   1 file changed, 32 insertions(+), 2 deletions(-) >> >> diff --git a/arch/x86/kvm/pmu.c b/arch/x86/kvm/pmu.c >> index 827886c12c16..0f86c1142f17 100644 >> --- a/arch/x86/kvm/pmu.c >> +++ b/arch/x86/kvm/pmu.c >> @@ -74,11 +74,21 @@ static void kvm_perf_overflow_intr(struct perf_event >> *perf_event, >>   { >>       struct kvm_pmc *pmc = perf_event->overflow_handler_context; >>       struct kvm_pmu *pmu = pmc_to_pmu(pmc); >> +    bool skip_pmi = false; >>         if (!test_and_set_bit(pmc->idx, pmu->reprogram_pmi)) { >> -        __set_bit(pmc->idx, (unsigned long *)&pmu->global_status); >> +        if (perf_event->attr.precise_ip) { >> +            /* Indicate PEBS overflow PMI to guest. */ >> +            skip_pmi = __test_and_set_bit(GLOBAL_STATUS_BUFFER_OVF_BIT, >> +                              (unsigned long *)&pmu->global_status); >> +        } else { >> +            __set_bit(pmc->idx, (unsigned long *)&pmu->global_status); >> +        } >>           kvm_make_request(KVM_REQ_PMU, pmc->vcpu); >>   +        if (skip_pmi) >> +            return; >> + >>           /* >>            * Inject PMI. If vcpu was in a guest mode during NMI PMI >>            * can be ejected on a guest mode re-entry. Otherwise we can't >> @@ -99,6 +109,7 @@ static void pmc_reprogram_counter(struct kvm_pmc >> *pmc, u32 type, >>                     bool exclude_kernel, bool intr, >>                     bool in_tx, bool in_tx_cp) >>   { >> +    struct kvm_pmu *pmu = vcpu_to_pmu(pmc->vcpu); >>       struct perf_event *event; >>       struct perf_event_attr attr = { >>           .type = type, >> @@ -110,6 +121,7 @@ static void pmc_reprogram_counter(struct kvm_pmc >> *pmc, u32 type, >>           .exclude_kernel = exclude_kernel, >>           .config = config, >>       }; >> +    bool pebs = test_bit(pmc->idx, (unsigned long *)&pmu->pebs_enable); > > pebs_enable is defined in patch 07, but used here(in patch 06). > Maybe we can change the patches order in next patch version if necessary. Thanks! I'll fix it. > >>       attr.sample_period = get_sample_period(pmc, pmc->counter); >>   @@ -124,9 +136,23 @@ static void pmc_reprogram_counter(struct kvm_pmc >> *pmc, u32 type, >>           attr.sample_period = 0; >>           attr.config |= HSW_IN_TX_CHECKPOINTED; >>       } >> +    if (pebs) { >> +        /* >> +         * The non-zero precision level of guest event makes the ordinary >> +         * guest event becomes a guest PEBS event and triggers the host >> +         * PEBS PMI handler to determine whether the PEBS overflow PMI >> +         * comes from the host counters or the guest. >> +         * >> +         * For most PEBS hardware events, the difference in the software >> +         * precision levels of guest and host PEBS events will not affect >> +         * the accuracy of the PEBS profiling result, because the >> "event IP" >> +         * in the PEBS record is calibrated on the guest side. >> +         */ >> +        attr.precise_ip = 1; >> +    } >>         event = perf_event_create_kernel_counter(&attr, -1, current, >> -                         intr ? kvm_perf_overflow_intr : >> +                         (intr || pebs) ? kvm_perf_overflow_intr : >>                            kvm_perf_overflow, pmc); >>       if (IS_ERR(event)) { >>           pr_debug_ratelimited("kvm_pmu: event creation failed %ld for >> pmc->idx = %d\n", >> @@ -161,6 +187,10 @@ static bool pmc_resume_counter(struct kvm_pmc *pmc) >>                     get_sample_period(pmc, pmc->counter))) >>           return false; >>   +    if (!test_bit(pmc->idx, (unsigned long >> *)&pmc_to_pmu(pmc)->pebs_enable) && >> +        pmc->perf_event->attr.precise_ip) >> +        return false; >> + >>       /* reuse perf_event to serve as pmc_reprogram_counter() does*/ >>       perf_event_enable(pmc->perf_event); >