Received: by 2002:a05:7412:251c:b0:e2:908c:2ebd with SMTP id w28csp2600584rda; Wed, 25 Oct 2023 07:22:49 -0700 (PDT) X-Google-Smtp-Source: AGHT+IEjXpv4HE48jvdrukak9pn6hWwkLgxy1UzIv7f4a5OarwmseJEKgSRgBkLo8jy+yS3TFwOf X-Received: by 2002:a05:6830:2054:b0:6c4:ae52:9596 with SMTP id f20-20020a056830205400b006c4ae529596mr16425557otp.18.1698243768730; Wed, 25 Oct 2023 07:22:48 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1698243768; cv=none; d=google.com; s=arc-20160816; b=Wy9fGkaAk5EQhTWjwimTI8iDolb3kSVzI/VU+++AtyLsyVgQFXuilKjAlhtkUMr9FQ dAMW5Ebesv9AZKAWNL7AVCxmMRNzaAIDfQy4VfA7rWkGxWz5Y0kf8uJueg/EX/HT4GH5 GTtvUruvVT20ECj9CSqPtp/3cqfjjkp6jq41yCYABnmY0iZiA0QJE/684Kh4HPMJ+DpE zPkxCsxf9Gm0zQGFJKIgGR7H2iYi5rLo7GSpGYulk1vloTB5mBKGKlANq+eXb0sMAh66 2wbDAuhb7zdIO5qZxri8egnHEfLJ27jyjI8Oi0JveQuujNKmmM6f5XriFLFlYyyKFVic FppQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:from:subject:message-id:references :mime-version:in-reply-to:date:dkim-signature; bh=UBKcY3PTFSrO9ZLn4V0uAyxFec2EF562aCVgDc00c5Q=; fh=Iod4dZELMLg2Np14yggjnFR8sjOEq1tEC8xH/57c2zc=; b=TmtFpnJXSr73FPR6aKVPlUqg7n9OyGvQFuCAAV/+gaJlf73CIFYW+/WwxtOy9rB4zd 4qjXsqmYeACXmZFJF3+VxGcI/cioK1qHmeaJtbO2my/WXpfj/NuNkGaqs2r0ZGooKphm WSHok6uLekAxIH2yf6+MZ0sf49rSpnw7/z7Q//bW4tNiBgkHdPT29Y+48eYpB6dW2lpX 6m5SQ67HBLPWP2f1K1KvuFXDFRupCAkCpoQQ4VL0Td3qas33LvhC2TiMNMly/oGV84r4 EnHkarC5Z3zxJqjMq/T+OuNKW6KsTu+GNSxFEQ2GwLjl+kELWZlYgBTBMXCV2XIxyx+M QzCw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20230601 header.b=n2SD6gVq; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:3 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from lipwig.vger.email (lipwig.vger.email. [2620:137:e000::3:3]) by mx.google.com with ESMTPS id t67-20020a0dd146000000b00579e8e0eba0si10349485ywd.556.2023.10.25.07.22.48 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 25 Oct 2023 07:22:48 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:3 as permitted sender) client-ip=2620:137:e000::3:3; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20230601 header.b=n2SD6gVq; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:3 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by lipwig.vger.email (Postfix) with ESMTP id F2936814595A; Wed, 25 Oct 2023 07:22:45 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at lipwig.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1344550AbjJYOWY (ORCPT + 99 others); Wed, 25 Oct 2023 10:22:24 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:40660 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1344569AbjJYOWX (ORCPT ); Wed, 25 Oct 2023 10:22:23 -0400 Received: from mail-pl1-x64a.google.com (mail-pl1-x64a.google.com [IPv6:2607:f8b0:4864:20::64a]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id BC65A132 for ; Wed, 25 Oct 2023 07:22:20 -0700 (PDT) Received: by mail-pl1-x64a.google.com with SMTP id d9443c01a7336-1c9f973d319so45637955ad.0 for ; Wed, 25 Oct 2023 07:22:20 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20230601; t=1698243740; x=1698848540; darn=vger.kernel.org; h=cc:to:from:subject:message-id:references:mime-version:in-reply-to :date:from:to:cc:subject:date:message-id:reply-to; bh=UBKcY3PTFSrO9ZLn4V0uAyxFec2EF562aCVgDc00c5Q=; b=n2SD6gVqGtK7M5570KDR+FkUDZRFLvYjAmz5Buf7L5oQkTc5XyXSvzt/fyWRJD5mmM DH23+7m/z4XpCrYvQl7mgGV2mRhKG1+euGFkqlJxeVOFvcvEB5ZGl0ShxhEOaYW1jZun 09CL7yy4kjtnb4cnvTxC5VPT5STQSmbj5X6ijwfLr3BdOg9TCioskEpfCVDs6YVh80Xc GhUPJd+KqBNBN9sIFEYWNrdZ1+sFNsV5ODU92wcuKyQHO6JcV5mDD7MftYTGLzWmnUfG 1f74RCGOPHtYaf4ZipuHzb5F4/HnMEfMAFqSWLIfDUuHmu7SGP3QmNL63+dlVBisy5iY ieiA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1698243740; x=1698848540; h=cc:to:from:subject:message-id:references:mime-version:in-reply-to :date:x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=UBKcY3PTFSrO9ZLn4V0uAyxFec2EF562aCVgDc00c5Q=; b=UbXS9N/ylK1cI2L60dWOqFzlcjEEGlO9PE88WCh29HKO21/XXsUUFV2dOcfY34P5OT H1IoaCvPryAX/UCaj4RNvPD1sfyO+b8EB1ug8gVOmrW2lwr9uqomUw3Ds2Gx7vgnwX8b PGgDC+hQqdfg4lcbcYMsUwt7K2T6q7hDJqQBwEPV0dWAn/1Yw4DyFwYy+/FnguuXYoIj YdZq8Y4/pXVCpAPmVHCGYdsbFvVxiT9A2vp5SFRQlF1o5AScO0kOOEi9aDBbSB3tnGhr wwfmfDDQSv3h6TMPHJvoTkdtDSzz+H2c3wlkOzI0j0cEv8R89h7U2723U1Ctxo4M0k4f CTCw== X-Gm-Message-State: AOJu0YwcHV4zp0xYOz7W8XzAXx5tcjr2Lc4Muw8Mo/MEZr46YOV76QQg sG+Qvashir3+g/JXBVPzc79CrOcMzM4= X-Received: from zagreus.c.googlers.com ([fda3:e722:ac3:cc00:7f:e700:c0a8:5c37]) (user=seanjc job=sendgmr) by 2002:a17:903:200a:b0:1ca:a251:786e with SMTP id s10-20020a170903200a00b001caa251786emr245057pla.6.1698243740202; Wed, 25 Oct 2023 07:22:20 -0700 (PDT) Date: Wed, 25 Oct 2023 14:22:18 +0000 In-Reply-To: <87a5s73w53.fsf@redhat.com> Mime-Version: 1.0 References: <20231025055914.1201792-1-xiaoyao.li@intel.com> <20231025055914.1201792-2-xiaoyao.li@intel.com> <87a5s73w53.fsf@redhat.com> Message-ID: Subject: Re: [PATCH v2 1/2] x86/kvm/async_pf: Use separate percpu variable to track the enabling of asyncpf From: Sean Christopherson To: Vitaly Kuznetsov Cc: Xiaoyao Li , Paolo Bonzini , Thomas Gleixner , Ingo Molnar , Borislav Petkov , Dave Hansen , Jonathan Corbet , Wanpeng Li , x86@kernel.org, kvm@vger.kernel.org, linux-kernel@vger.kernel.org Content-Type: text/plain; charset="us-ascii" X-Spam-Status: No, score=-8.4 required=5.0 tests=DKIMWL_WL_MED,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,USER_IN_DEF_DKIM_WL autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lipwig.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (lipwig.vger.email [0.0.0.0]); Wed, 25 Oct 2023 07:22:46 -0700 (PDT) On Wed, Oct 25, 2023, Vitaly Kuznetsov wrote: > Xiaoyao Li writes: > > diff --git a/arch/x86/kernel/kvm.c b/arch/x86/kernel/kvm.c > > index b8ab9ee5896c..388a3fdd3cad 100644 > > --- a/arch/x86/kernel/kvm.c > > +++ b/arch/x86/kernel/kvm.c > > @@ -65,6 +65,7 @@ static int __init parse_no_stealacc(char *arg) > > > > early_param("no-steal-acc", parse_no_stealacc); > > > > +static DEFINE_PER_CPU_READ_MOSTLY(bool, async_pf_enabled); > > Would it make a difference is we replace this with a cpumask? I realize > that we need to access it on all CPUs from hotpaths but this mask will > rarely change so maybe there's no real perfomance hit? FWIW, I personally prefer per-CPU booleans from a readability perspective. I doubt there is a meaningful performance difference for a bitmap vs. individual booleans, the check is already gated by a static key, i.e. kernels that are NOT running as KVM guests don't care. Actually, if there's performance gains to be had, optimizing kvm_read_and_reset_apf_flags() to read the "enabled" flag if and only if it's necessary is a more likely candidate. Assuming the host isn't being malicious/stupid, then apf_reason.flags will be '0' if PV async #PFs are disabled. The only question is whether or not apf_reason.flags is predictable enough for the CPU. Aha! In practice, the CPU already needs to resolve a branch based on apf_reason.flags, it's just "hidden" up in __kvm_handle_async_pf(). If we really want to micro-optimize, provide an __always_inline inner helper so that __kvm_handle_async_pf() doesn't need to make a CALL just to read the flags. Then in the common case where a #PF isn't due to the host swapping out a page, the paravirt happy path doesn't need a taken branch and never reads the enabled variable. E.g. the below generates: 0xffffffff81939ed0 <+0>: 41 54 push %r12 0xffffffff81939ed2 <+2>: 31 c0 xor %eax,%eax 0xffffffff81939ed4 <+4>: 55 push %rbp 0xffffffff81939ed5 <+5>: 53 push %rbx 0xffffffff81939ed6 <+6>: 48 83 ec 08 sub $0x8,%rsp 0xffffffff81939eda <+10>: 65 8b 2d df 81 6f 7e mov %gs:0x7e6f81df(%rip),%ebp # 0x320c0 0xffffffff81939ee1 <+17>: 85 ed test %ebp,%ebp 0xffffffff81939ee3 <+19>: 75 09 jne 0xffffffff81939eee <__kvm_handle_async_pf+30> 0xffffffff81939ee5 <+21>: 48 83 c4 08 add $0x8,%rsp 0xffffffff81939ee9 <+25>: 5b pop %rbx 0xffffffff81939eea <+26>: 5d pop %rbp 0xffffffff81939eeb <+27>: 41 5c pop %r12 0xffffffff81939eed <+29>: c3 ret diff --git a/arch/x86/kernel/kvm.c b/arch/x86/kernel/kvm.c index b8ab9ee5896c..b24133dc0731 100644 --- a/arch/x86/kernel/kvm.c +++ b/arch/x86/kernel/kvm.c @@ -240,22 +240,29 @@ void kvm_async_pf_task_wake(u32 token) } EXPORT_SYMBOL_GPL(kvm_async_pf_task_wake); -noinstr u32 kvm_read_and_reset_apf_flags(void) +static __always_inline u32 __kvm_read_and_reset_apf_flags(void) { - u32 flags = 0; + u32 flags = __this_cpu_read(apf_reason.flags); - if (__this_cpu_read(apf_reason.enabled)) { - flags = __this_cpu_read(apf_reason.flags); - __this_cpu_write(apf_reason.flags, 0); + if (unlikely(flags)) { + if (likely(__this_cpu_read(apf_reason.enabled))) + __this_cpu_write(apf_reason.flags, 0); + else + flags = 0; } return flags; } + +u32 kvm_read_and_reset_apf_flags(void) +{ + return __kvm_read_and_reset_apf_flags(); +} EXPORT_SYMBOL_GPL(kvm_read_and_reset_apf_flags); noinstr bool __kvm_handle_async_pf(struct pt_regs *regs, u32 token) { - u32 flags = kvm_read_and_reset_apf_flags(); + u32 flags = __kvm_read_and_reset_apf_flags(); irqentry_state_t state; if (!flags) > > static DEFINE_PER_CPU_DECRYPTED(struct kvm_vcpu_pv_apf_data, apf_reason) __aligned(64); > > DEFINE_PER_CPU_DECRYPTED(struct kvm_steal_time, steal_time) __aligned(64) __visible; > > static int has_steal_clock = 0; > > @@ -244,7 +245,7 @@ noinstr u32 kvm_read_and_reset_apf_flags(void) > > { > > u32 flags = 0; > > > > - if (__this_cpu_read(apf_reason.enabled)) { > > + if (__this_cpu_read(async_pf_enabled)) { > > flags = __this_cpu_read(apf_reason.flags); > > __this_cpu_write(apf_reason.flags, 0); > > } > > @@ -295,7 +296,7 @@ DEFINE_IDTENTRY_SYSVEC(sysvec_kvm_asyncpf_interrupt) > > > > inc_irq_stat(irq_hv_callback_count); > > > > - if (__this_cpu_read(apf_reason.enabled)) { > > + if (__this_cpu_read(async_pf_enabled)) { > > token = __this_cpu_read(apf_reason.token); > > kvm_async_pf_task_wake(token); > > __this_cpu_write(apf_reason.token, 0); > > @@ -362,7 +363,7 @@ static void kvm_guest_cpu_init(void) > > wrmsrl(MSR_KVM_ASYNC_PF_INT, HYPERVISOR_CALLBACK_VECTOR); > > > > wrmsrl(MSR_KVM_ASYNC_PF_EN, pa); > > - __this_cpu_write(apf_reason.enabled, 1); > > + __this_cpu_write(async_pf_enabled, 1); > > As 'async_pf_enabled' is bool, it would probably be more natural to > write > > __this_cpu_write(async_pf_enabled, true); +1000