Received: by 2002:a25:4158:0:0:0:0:0 with SMTP id o85csp1994832yba; Thu, 25 Apr 2019 08:59:24 -0700 (PDT) X-Google-Smtp-Source: APXvYqyd9iz+fqwuuXrGm0dQk30XDt+L5aU2WrgDEssUc/UDV+IE4iTBPwpV/6Fe7nHoZvFTW7y3 X-Received: by 2002:aa7:8282:: with SMTP id s2mr40809657pfm.7.1556207964141; Thu, 25 Apr 2019 08:59:24 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1556207964; cv=none; d=google.com; s=arc-20160816; b=cjgqoYhor1AB2gPY6tmiV5TFfP4k7zvY3KOGSjqTmuJyARm2yxpa6tU31EFxrClLkg UKAVBzW0cqJq6aSSJjn7qQ1PZNdzIlsDW/w4+3uB1nEWJlrQ+7RwREDBk387EK+1Hggf MLVpyi6p+SdGfWvlZFZm83OKy8eY9lFFcIYw0c/Ud8INpAzffJwDOpvVeu6GH7xEt9C9 gbeEZ6Lr51w6cFxFA0JKaO8dNZjNT56YI89obe8cRiy2E5pocrXddpG+i4DMzOYQCu/P rw3A6jTsVCOkS42wgpQY10tUR0Q5IDkNnGOeSi7NoD6QMjIcCQmY9Ty1jj4BK/2T1UYh rbdA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:organization:from:references:cc:to:subject; bh=A3cHwuwhR1njDuETJ8Si39BjFW+gyG8IOIAYZeE47PA=; b=RIVW8kEmqrX+MTguNLx7T3fCEnYA90hmSLmoPZpu10bXNr3/celU9BlXKm190WdaHi AgYSubVS94x1+gsCA2sb9EStYQFQPBSL5+QhYO8a6GqkZaGEANll1lu5yKrqlxfhlMUZ FFjtS/ZWHszvF8pJTateREbJjVNvILRkorEjtcq3kVpRDm8el6suQTEL9yno3eClce5/ p/z5hC/f3751ET8Q7/RwM525/p6V/Lc/SrzuW1l58x345AYJwhqvI18PnRZUKY4M069a 7EOEAKPANocM86333dJMHkSbsbXg23GUMTmwlTEtcVe/Dh/2wCLUlKa/hrXhlJhtmvZY BUpQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r2si5038170pfg.93.2019.04.25.08.59.09; Thu, 25 Apr 2019 08:59:24 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730186AbfDYPd1 (ORCPT + 99 others); Thu, 25 Apr 2019 11:33:27 -0400 Received: from mga17.intel.com ([192.55.52.151]:53982 "EHLO mga17.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1730136AbfDYPdY (ORCPT ); Thu, 25 Apr 2019 11:33:24 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga008.jf.intel.com ([10.7.209.65]) by fmsmga107.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 25 Apr 2019 08:33:23 -0700 X-IronPort-AV: E=Sophos;i="5.60,394,1549958400"; d="scan'208";a="137410325" Received: from likexu-mobl1.ccr.corp.intel.com (HELO [10.255.28.123]) ([10.255.28.123]) by orsmga008-auth.jf.intel.com with ESMTP/TLS/AES128-SHA; 25 Apr 2019 08:33:21 -0700 Subject: Re: [PATCH] KVM: x86: Add Intel CPUID.1F cpuid emulation support To: Sean Christopherson Cc: Xiaoyao Li , kvm@vger.kernel.org, Paolo Bonzini , Thomas Gleixner , Len Brown , linux-kernel@vger.kernel.org References: <1555915234-2536-1-git-send-email-like.xu@linux.intel.com> <20190424143238.GB18442@linux.intel.com> <30857e40-05b6-0f4c-d07c-919de08c90ac@linux.intel.com> <7568a794-2e91-c84c-5b64-d33e394f0e2b@linux.intel.com> <20190425141944.GA31197@linux.intel.com> From: Like Xu Organization: Intel OTC Message-ID: <5203b6ee-4a14-64b7-0237-dae2a977054d@linux.intel.com> Date: Thu, 25 Apr 2019 23:33:19 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.6.1 MIME-Version: 1.0 In-Reply-To: <20190425141944.GA31197@linux.intel.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2019/4/25 22:19, Sean Christopherson wrote: > On Thu, Apr 25, 2019 at 03:07:35PM +0800, Like Xu wrote: >> On 2019/4/25 14:30, Xiaoyao Li wrote: >>>>> Besides, the problem of simply using cpuid_exc(0x1f) in Sean's codes is >>>>> that we cannot assmue the reserved bits 31:16 of ECX is always 0 for the >>>>> future generation. > > It doesn't matter if future CPUs use 31:16 for other things since this > code only cares about whether or not CPUID.1F exists. The check > entry->eax >= 0x1f ensures that CPUID.1F will return zeros if the leaf > does *NOT* exist, ergo the check against CPUID.1F.ECX only needs to > look for a non-zero value. CPUID.1F.ECX is the logical choice for > detecting support because it is guaranteed to be non-zero if the leaf > actually exists (because bits 15:8 will be non-zero). Whether or not > bits 31:16 are non-zero is irrelevant. Here is a case: one future CPU may have "cpuid.0.eax > 0x1f" but not use multi-chip packaging technology thus its CPUID.1F leaf **could** not exist to expose multi-die info and it still use cpuid.0B leaf. So the entry->eax >= 0x1f check is true and cpuid_ecx(0x1f) check is true as well due to default return value. (one of my machines for cpuid.1f.ecx would return 0x00000064 without cpuid.1f support). When we only cares about whether or not CPUID.1F exists, we may need (cpuid_ecx(0x1f) & 0x0000ff00) != 0 or just follow what host check_extended_topology_leaf() does. > >>>> >>>> It's true cause the statement in public spec is not "Reserved = 0" but >>>> "Bits 31 - 16: Reserved". >>>> >>>>> >>>>> In my opinion, Sean's codes is OK and much simple and clear. >>>> >>>> If the host cpuid.0.eax is greater than 0x1f but actually it doesn't >>>> have multi-chip packaging technology and we may want to expose >>>> entry->eax to some value smaller than 0x1f but greater than 0x14, much >>>> effort needs to apply on Sean's code. >>>> >>>> My improvement is good to overwrite cpuid.0.eax in future usage >>> >from the perspective of kvm feature setting not just from value check. >>> >>> Alright, there is something wrong in your code that you haven't realised. >>> >>> When you do >>> entry->eax = min(entry->eax, (u32)(f_intel_pt ? 0x14 : 0xd)); >>> >>> it changes the entry->eax if entry->eax > 0x14. So you cannot directly use >>> cpuid_ecx(0x1f). At least, you need to cache the value of entry->eax, like: >>> >>> u32 max_leaf = entry->eax; >>> entry->eax = min(entry->eax, (u32)(f_intel_pt ? 0x14 : 0xd)); >>> >>> //...leaf between 0x14 and 0x1f >>> >>> if (max_leaf >= 0x1f && (cpuid_ecx(0x1f) & 0x0000ff00)) >>> entry->eax = 0x1f; >> >> The cache value make no sense on this. > > Xiaoyao is pointing out that by limiting entry->eax to 0x14 (or 0xd), then > it can't be used to detect support for CPUID.1F since KVM will have lost > the required information. The point is that its existence does not really depend on cpuid.0.eax> = 0x1f. My test machine (CLX-AP, two die in one package) has cpuid.0.eax = 0x16 but does have CPUID.1F support (a bit strange on BIOS). > >>> However, handling in increasing order in totally wrong. Since it's to report the >>> max the leaf supported, we should handle in descending order, which is what Sean >>> does. >> >> There is no need to check "entry->eax >= 0x1f" before "setting entry->eax = >> 0x1f" if and only if cpuid_ecx(0x1f) meets requirements. > > entry->eax absolutely needs to be checked, otherwise you have no idea what > CPUID leaf is actually being consumed. For example, a CPU with a maximum > basic CPUID leaf of 0xB will report information that is indistinguishable > from the actual CPUID.1F, i.e. KVM will incorrectly think the CPU supports > CPUID.1F regardless of what heuristic is used to detect a "valid" CPUID.1F > if it only looks at the result of CPUID.1F. Based on what I mentioned, just reconsider this proposal: case 0: entry->eax = min(entry->eax, (u32)(f_intel_pt ? 0x14 : 0xd)); if ((cpuid_ecx(0x1f) & 0x0000ff00)) != 0) entry->eax = 0x1f; It increases the cpuid.0.ecx value along with the minimized feature requirement in a natural and readable way. Why don't we design a little bit future ahead of time? > >> An increasing manner helps to overwrite this value on demand in a flat code >> flow (easy to understand and maintain) not an if-else_if-else flow. > > Maybe in the future the code will need to be refactored as more cases are > added, but for now an if-else is quite readable. Worry about the future > when it happens. :-) >