Subject: Re: [RFC PATCH 23/41] KVM: x86/pmu: Implement the save/restore of PMU state for Intel CPU
To: Sean Christopherson
Cc: Mingwei Zhang, Dapeng Mi, Xiong Zhang, pbonzini@redhat.com,
 peterz@infradead.org, kan.liang@intel.com, zhenyuw@linux.intel.com,
 jmattson@google.com, kvm@vger.kernel.org,
 linux-perf-users@vger.kernel.org, linux-kernel@vger.kernel.org,
 zhiyuan.lv@intel.com, eranian@google.com, irogers@google.com,
 samantha.alt@intel.com, like.xu.linux@gmail.com, chao.gao@intel.com
References: <18b19dd4-6d76-4ed8-b784-32436ab93d06@linux.intel.com>
 <4c47b975-ad30-4be9-a0a9-f0989d1fa395@linux.intel.com>
 <737f0c66-2237-4ed3-8999-19fe9cca9ecc@linux.intel.com>
 <4d60384a-11e0-2f2b-a568-517b40c91b25@loongson.cn>
From: maobibo
Date: Tue, 23 Apr 2024 09:01:22 +0800

On 2024/4/23 1:01 AM, Sean Christopherson wrote:
> On Mon, Apr 22, 2024, maobibo wrote:
>> On 2024/4/16 6:45 AM, Sean Christopherson wrote:
>>> On Mon, Apr 15, 2024, Mingwei Zhang wrote:
>>>> On Mon, Apr 15, 2024 at 10:38 AM Sean Christopherson wrote:
>>>>> One of my biggest complaints with the current vPMU code is that the roles and
>>>>> responsibilities between KVM and perf are poorly defined, which leads to suboptimal
>>>>> and hard to maintain code.
>>>>>
>>>>> Case in point, I'm pretty sure leaving guest values in PMCs _would_ leak guest
>>>>> state to userspace processes that have RDPMC permissions, as the PMCs might not
>>>>> be dirty from perf's perspective (see perf_clear_dirty_counters()).
>>>>>
>>>>> Blindly clearing PMCs in KVM "solves" that problem, but in doing so makes the
>>>>> overall code brittle because it's not clear whether KVM _needs_ to clear PMCs,
>>>>> or if KVM is just being paranoid.
>>>>
>>>> So once this rolls out, perf and vPMU are clients directly to PMU HW.
>>>
>>> I don't think this is a statement we want to make, as it opens a discussion
>>> that we won't win. Nor do I think it's one we *need* to make. KVM doesn't need
>>> to be on equal footing with perf in terms of owning/managing PMU hardware, KVM
>>> just needs a few APIs to allow faithfully and accurately virtualizing a guest PMU.
>>>
>>>> Faithful cleaning (blind cleaning) has to be the baseline
>>>> implementation, until both clients agree to a "deal" between them.
>>>> Currently, there is no such deal, but I believe we could have one via
>>>> future discussion.
>>>
>>> What I am saying is that there needs to be a "deal" in place before this code
>>> is merged. It doesn't need to be anything fancy, e.g. perf can still pave over
>>> PMCs it doesn't immediately load, as opposed to using cpu_hw_events.dirty to lazily
>>> do the clearing. But perf and KVM need to work together from the get go, i.e. I
>>> don't want KVM doing something without regard to what perf does, and vice versa.
>>>
>> There is a similar issue on LoongArch vPMU, where the VM can directly access PMU
>> hardware and the PMU hw is shared between guest and host. Besides context switch,
>> there are other places where perf core will access the PMU hw, such as the tick
>> timer/hrtimer/IPI function call, and KVM can only intercept context switch.
>
> Two questions:
>
> 1) Can KVM prevent the guest from accessing the PMU?
>
> 2) If so, can KVM grant partial access to the PMU, or is it all or nothing?
>
> If the answer to both questions is "yes", then it sounds like LoongArch *requires*
> mediated/passthrough support in order to virtualize its PMU.

Hi Sean,

Thanks for your quick response.

Yes, KVM can prevent the guest from accessing the PMU, and it can grant
the guest access to part or all of the PMU. However, once a PMU event is
granted to the VM, the host can no longer access that event. A PMU event
switch is required if the host wants to use it.

>
>> Can we add a callback handler in structure kvm_guest_cbs? Just like this:
>> @@ -6403,6 +6403,7 @@ static struct perf_guest_info_callbacks kvm_guest_cbs = {
>>         .state                  = kvm_guest_state,
>>         .get_ip                 = kvm_guest_get_ip,
>>         .handle_intel_pt_intr   = NULL,
>> +       .lose_pmu               = kvm_guest_lose_pmu,
>>  };
>>
>> By the way, I do not know whether the callback handler should be triggered in perf
>> core or in the specific PMU hw driver. In the ARM PMU hw driver, it is triggered in
>> the driver itself, e.g. in function kvm_vcpu_pmu_resync_el0,
>> but I think it will be better if it is done in perf core.
>
> I don't think we want to take the approach of perf and KVM guests "fighting" over
> the PMU. That's effectively what we have today, and it's a mess for KVM because
> it's impossible to provide consistent, deterministic behavior for the guest. And
> it's just as messy for perf, which ends up having weird, cumbersome flows that
> exist purely to try to play nice with KVM.

With the existing PMU core code, the host may access the PMU hw from the
tick timer interrupt or the IPI function call interrupt while the VM is
running and the PMU is already granted to the guest. KVM cannot intercept
host IPI/timer interrupts, so no PMU context switch happens on those
paths, and that is where the problem arises.

Regards
Bibo Mao
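For illustration only, the callback idea proposed in the quoted diff can be modelled in user space roughly as follows. This is a minimal sketch under stated assumptions, not kernel code: struct guest_pmu_callbacks, struct toy_vcpu, toy_guest_lose_pmu() and host_prepare_pmu_access() are all hypothetical names, and only the .lose_pmu member name is taken from the diff. It shows a host-side path (tick, hrtimer or IPI) invoking the hook before it touches the PMU, so that the guest gives the PMU back first.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for a callback table; NOT perf_guest_info_callbacks. */
struct guest_pmu_callbacks {
	/* Optional hook: the host asks the hypervisor to hand the PMU back. */
	void (*lose_pmu)(void *vcpu);
};

/* Toy vCPU state: tracks PMU ownership and counts ownership switches. */
struct toy_vcpu {
	bool guest_owns_pmu;
	unsigned int pmu_switches;
};

/* Hypothetical handler: return PMU ownership from the guest to the host. */
static void toy_guest_lose_pmu(void *data)
{
	struct toy_vcpu *vcpu = data;

	if (!vcpu->guest_owns_pmu)
		return;
	/* A real implementation would save the guest counters here. */
	vcpu->guest_owns_pmu = false;
	vcpu->pmu_switches++;
}

static const struct guest_pmu_callbacks toy_guest_cbs = {
	.lose_pmu = toy_guest_lose_pmu,
};

/*
 * Model of a host path that is about to access PMU hardware. If the guest
 * owns the PMU and a hook is registered, call it first. Returns true when
 * the host may safely proceed, false when the guest still owns the PMU.
 */
static bool host_prepare_pmu_access(const struct guest_pmu_callbacks *cbs,
				    struct toy_vcpu *vcpu)
{
	assert(vcpu != NULL);
	if (vcpu->guest_owns_pmu && cbs && cbs->lose_pmu)
		cbs->lose_pmu(vcpu);
	return !vcpu->guest_owns_pmu;
}
```

Without a registered hook, host_prepare_pmu_access() reports that the guest still owns the PMU, which corresponds to the unhandled tick/IPI case described above. The sketch only illustrates the proposal. It does not address the objection raised in the thread that perf and KVM should not be "fighting" over the PMU.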