Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp3605860imu; Mon, 7 Jan 2019 06:25:20 -0800 (PST) X-Google-Smtp-Source: ALg8bN5k8FFSkgOYpB8rXeRxDyef6havikhBh7FfrkuRclKnKR7Nt1j3hEF777wEKSa8FczFn0GN X-Received: by 2002:a17:902:b090:: with SMTP id p16mr61532893plr.190.1546871120840; Mon, 07 Jan 2019 06:25:20 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1546871120; cv=none; d=google.com; s=arc-20160816; b=QFUSyY7DKHobK3DvVKvn4DltqbI2RLUWjauE3NJ5tdNbWsHUYfpSreedx3SPutoWHM PT1OafDX2PHx3PuQM0TFnGtilWqqwslejWcCbfirbI5JDqrv2YjquY8+NlS9dBQdqRa7 uWxxZ1+MTlXJUuOKLWH9Kqk7UPgzW8GiEpIYwr1H6QSqH8PlYCcJ+lQ80Lwa98Kit8dR IOJPWLsrc6936Iz/wJBM+Lkv31gKo9R3dlRSkxBJ795h4Pr5RmUtj2heXOcxhLf1l47D zaCEJVaB1t3vvpxpDjnZz0MVr2zXZN6orvUzl5Uk0S17BAXbCzenEI4sPFs+9gEEZnaY Xu9A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=s0J8P/KRmkl8TPJO1VNt0IN5Cx0/d+wfwlg5iShMaHg=; b=bmKuGhBlXai949DAcI6Gjza7N51BUZSCBW3gHqAujoMy/+XR3bjj8yM53xmCzwQ7WQ pzPx4Wz96Kl9z3gXq4ry+hAbFCtt5fugIhd2ONI28OHGKQ4roajVAb9cCl9eRLTNh7vI QOZn5STaRWB1VMt/zIt1Z7YE/JgAXH1EpeIzyXVajFEEcbembqqsMI8PRoBikEB5P8sT I3wcPY9QxfzG4IzduB7xhXxaCCoprmMTwEDlo6zSu3wLLa0Li6+D2x+acXCkhTjIzanI XNfis94Wy2qSRghLc3OOw3mc9vo131xRiWm6UWpIzPdmj7NnR/tcY+ZBx1X9c32bOZK4 vVkw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id w11si11064288pgp.161.2019.01.07.06.25.04; Mon, 07 Jan 2019 06:25:20 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727958AbfAGOWv (ORCPT + 99 others); Mon, 7 Jan 2019 09:22:51 -0500 Received: from mga18.intel.com ([134.134.136.126]:6731 "EHLO mga18.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726817AbfAGOWv (ORCPT ); Mon, 7 Jan 2019 09:22:51 -0500 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga003.jf.intel.com ([10.7.209.27]) by orsmga106.jf.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 07 Jan 2019 06:22:50 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.56,451,1539673200"; d="scan'208";a="116106680" Received: from linux.intel.com ([10.54.29.200]) by orsmga003.jf.intel.com with ESMTP; 07 Jan 2019 06:22:50 -0800 Received: from [10.251.7.1] (kliang2-mobl1.ccr.corp.intel.com [10.251.7.1]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by linux.intel.com (Postfix) with ESMTPS id 345955803DC; Mon, 7 Jan 2019 06:22:49 -0800 (PST) Subject: Re: [PATCH v4 04/10] KVM/x86: intel_pmu_lbr_enable To: Wei Wang , linux-kernel@vger.kernel.org, kvm@vger.kernel.org, pbonzini@redhat.com, ak@linux.intel.com, peterz@infradead.org Cc: kan.liang@intel.com, mingo@redhat.com, rkrcmar@redhat.com, like.xu@intel.com, jannh@google.com, arei.gonglei@huawei.com References: <1545816338-1171-1-git-send-email-wei.w.wang@intel.com> <1545816338-1171-5-git-send-email-wei.w.wang@intel.com> <5a04d8ea-b788-6018-8b34-ebd528578916@linux.intel.com> <5C2F2E3E.7020306@intel.com> <4e5cd929-8a28-461d-7f8f-79a2f9301b7c@linux.intel.com> <5C308277.3090005@intel.com> From: "Liang, Kan" Message-ID: Date: Mon, 7 Jan 2019 09:22:47 -0500 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: <5C308277.3090005@intel.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 1/5/2019 5:09 AM, Wei Wang wrote: > On 01/04/2019 11:57 PM, Liang, Kan wrote: >> >> >> On 1/4/2019 4:58 AM, Wei Wang wrote: >>> On 01/03/2019 12:33 AM, Liang, Kan wrote: >>>> >>>> >>>> On 12/26/2018 4:25 AM, Wei Wang wrote: >>>>> + >>>>> +    /* >>>>> +     * It could be possible that people have vcpus of old model >>>>> run on >>>>> +     * physcal cpus of newer model, for example a BDW guest on a SKX >>>>> +     * machine (but not possible to be the other way around). >>>>> +     * The BDW guest may not get accurate results on a SKX machine >>>>> as it >>>>> +     * only reads 16 entries of the lbr stack while there are 32 >>>>> entries >>>>> +     * of recordings. So we currently forbid the lbr enabling when >>>>> the >>>>> +     * vcpu and physical cpu see different lbr stack entries. >>>> >>>> I think it's not enough to only check number of entries. The LBR >>>> from/to MSRs may be different even the number of entries is the >>>> same, e.g SLM and KNL. >>> >>> Yes, we could add the comparison of the FROM msrs. >>> >>>> >>>>> +     */ >>>>> +    switch (vcpu_model) { >>>> >>>> That's a duplicate of intel_pmu_init(). I think it's better to >>>> factor out the common part if you want to check LBR MSRs and >>>> entries. Then we don't need to add the same codes in two different >>>> places when enabling new platforms. >>>> >>> >>> >>> Yes, I thought about this, but intel_pmu_init() does a lot more >>> things in each "Case xx". Any thought about how to factor them out? >>> >> >> I think we may only move the "switch (boot_cpu_data.x86_model) { ... >> }" to a new function, e.g. __intel_pmu_init(int model, struct x86_pmu >> *x86_pmu) >> >> In __intel_pmu_init, if the model != boot_cpu_data.x86_model, you only >> need to update x86_pmu.*. Just ignore global settings, e.g >> hw_cache_event_ids, mem_attr, extra_attr etc. > > Thanks for sharing. I understand the point of maintaining those models > at one place, > but this factor-out doesn't seem very elegant to me, like below > > __intel_pmu_init (int model, struct x86_pmu *x86_pmu) > { > ... > switch (model) > case INTEL_FAM6_NEHALEM: > case INTEL_FAM6_NEHALEM_EP: > case INTEL_FAM6_NEHALEM_EX: >     intel_pmu_lbr_init(x86_pmu); >     if (model != boot_cpu_data.x86_model) >         return; > >     /* Other a lot of things init like below..*/ >     memcpy(hw_cache_event_ids, nehalem_hw_cache_event_ids, >                    sizeof(hw_cache_event_ids)); >     memcpy(hw_cache_extra_regs, nehalem_hw_cache_extra_regs, >                    sizeof(hw_cache_extra_regs)); >     x86_pmu.event_constraints = intel_nehalem_event_constraints; >                 x86_pmu.pebs_constraints = > intel_nehalem_pebs_event_constraints; >                 x86_pmu.enable_all = intel_pmu_nhm_enable_all; >                 x86_pmu.extra_regs = intel_nehalem_extra_regs; >  ... > > Case... > } > We need insert "if (model != boot_cpu_data.x86_model)" in every "Case xx". > > What would be the rationale that we only do lbr_init for "x86_pmu" > when model != boot_cpu_data.x86_model? > (It looks more like a workaround to factor-out the function and get what > we want) I thought the new function may be extended to support fake pmu as below. It's not only for lbr. PMU has many CPU specific features. It can be used for other features, if you want to check the compatibility in future. But I don't have an example now. __intel_pmu_init (int model, struct x86_pmu *x86_pmu) { bool fake_pmu = (model != boot_cpu_data.x86_model) ? true : false; ... switch (model) case INTEL_FAM6_NEHALEM: case INTEL_FAM6_NEHALEM_EP: case INTEL_FAM6_NEHALEM_EX: intel_pmu_lbr_init(x86_pmu); x86_pmu->event_constraints = intel_nehalem_event_constraints; x86_pmu->pebs_constraints = intel_nehalem_pebs_event_constraints; x86_pmu->enable_all = intel_pmu_nhm_enable_all; x86_pmu->extra_regs = intel_nehalem_extra_regs; if (fake_pmu) return; /* Global variables should not be updated for fake PMU */ memcpy(hw_cache_event_ids, nehalem_hw_cache_event_ids, sizeof(hw_cache_event_ids)); memcpy(hw_cache_extra_regs, nehalem_hw_cache_extra_regs, sizeof(hw_cache_extra_regs)); > > I would prefer having them separated as this patch for now - it is > logically more clear to me. > But it will be a problem for maintenance. Perf developer probably forget to update the list in KVM. I think you have to regularly check the perf code. Thanks, Kan