Received: by 2002:a05:6358:d09b:b0:dc:cd0c:909e with SMTP id jc27csp206108rwb; Tue, 13 Dec 2022 16:19:17 -0800 (PST) X-Google-Smtp-Source: AA0mqf79+ADmT2RaQbO+3PnCjdltK+4RrcPAD+eqOKf4bYk03vNo2TgwsKYKeK+3p/VgcrXxb5iZ X-Received: by 2002:a17:902:c948:b0:190:cd21:1ebe with SMTP id i8-20020a170902c94800b00190cd211ebemr5103456pla.1.1670977157649; Tue, 13 Dec 2022 16:19:17 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1670977157; cv=none; d=google.com; s=arc-20160816; b=n3N3HMRs3JIRZCo6Bk0oGbT3VLOI7xi+V2VBHp2vOf5X2flDkDP1ob3ZJRBQ3dt/tM pQT8jGG+l6L6f9jq+ywiZbg6lwn+oV0yATjFbys6D30nIm1bnDP6XJmlkKCz8FVgzz1/ n3llvRGi5z142zV90pHQwXITdL8RDhim0WNeLPHiIV2I0bXnZcffOJUsJtIqorba4rLs DIlvssmr3CQCB93DGeEZbIj8xl8uWDmnJkG3bPmoqG0Zmlgv5gEst5qa13/OyoK1rUzm v6yQ2x/Ip08thUXPqYL8CDz67N3L4IpQjj+5Gn0efu9rBOmSgES46n7ESypeO3drrmGj y5Lg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=HrQ1LqblXXg6sKYW9Y0qqf5MQwpOKAM3y8Jy58CaLXM=; b=u09XF5pxGBA3K/a5Do7kqJ98zvb0welmgBlQ5/LaYvCO1zadzWdv5xwHBYyvfH2P// 4tHxE3GW340cQ9AZXWecyEbzvKmeQiiKI0P8LmBKx0mEkVfNSisWsQSIzlOxndLwjWzP c13YkgvuPbDdMHP/OHGFsjcCjmRg6B573DIKLGRFT/j+cuv4fmIcTdWZUOdwkUAY/Utv sEZ0cCumrEl0zTUbRLR5ZZF5oR4fb8GrRcyb64cKMjpf/IjGpQdyXFN57w2rePCBKZSz ujRVbo45GM30lAkcxhniRNd1YIhDou750lwo4lCmYNBmKnusf5ROjd5q5CPGFq7pTP3G pHHQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=lM4BKLR0; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id n8-20020a170902e54800b0018907d64909si1267813plf.325.2022.12.13.16.19.08; Tue, 13 Dec 2022 16:19:17 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=lM4BKLR0; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S237943AbiLNANg (ORCPT + 73 others); Tue, 13 Dec 2022 19:13:36 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:43022 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S238118AbiLNANM (ORCPT ); Tue, 13 Dec 2022 19:13:12 -0500 Received: from mail-qv1-xf33.google.com (mail-qv1-xf33.google.com [IPv6:2607:f8b0:4864:20::f33]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id EF9C728702 for ; Tue, 13 Dec 2022 16:11:12 -0800 (PST) Received: by mail-qv1-xf33.google.com with SMTP id pp21so10720262qvb.5 for ; Tue, 13 Dec 2022 16:11:12 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=HrQ1LqblXXg6sKYW9Y0qqf5MQwpOKAM3y8Jy58CaLXM=; b=lM4BKLR0etyUsOEMu97Z7Dgs5Ml2TnW7SAJmolxITMCmqMOORsB8NE3XySG/DbgYno iQUWLv/5FPzr9RZi24827fwrgzoJ5eerNqgj1sdH+kyoIUzByCwNcxBTocqj7wnPg7Og q3ncZT1ptFYv6mLyy10K0NX2yO5PykJrPFtEv4mZum7tDnbEmE7In0XkLBxUiOcmUrBV K3+C0P/RpuzJ8UQD3Ivr3WM2T23LXLYJ0ciADc2w/UIN4qiihAP5L0KvBxOys1Xg/TSf dCMyfg9p4sGrrynqGByOfan38p+xJFDiJucSsrhsAFUaw7quWEtFdDvJvCzEJD8f0Qx3 do4Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=HrQ1LqblXXg6sKYW9Y0qqf5MQwpOKAM3y8Jy58CaLXM=; b=CgxCfXloASfFEnKpvMqfI+w0muTm/Yqayz7XOJq5KKj7oShekdGYm3HVxjABPBwpaQ T8zu8SGa+wHiQ1KCmJVjZZcu+fauoJOfTmjBJ9kCEHeQThSTd2X8Z+QTPa31NDrXOaFZ sbGKPi3a4pC+OGg8u/74gebu7HmrobY67g9x4X+92hGo4CQdcWXv03D3wui0p6Eq0MlN JL0yuwI3Roorrt9FLrTqP21qZ3rtnj66NeEH1/cmfEtGOY1ie0KKPVq1LGt/HwfmCN/E N/rlnyGGkC7+WH4XZBhKXa8TC+7qw7sHuekkcD1nSwtkg0G5m9fGxk3eEJpnjHnzsftz kZvQ== X-Gm-Message-State: ANoB5pkuWqAqNtQTWuOTxqJziHzVrioRxVkwVTGaT9HHA5BZFtbS0IoF a595mLwxGsKUvBthUOR8YyZ4FfXbLD6Ttj9tr3kYei1Mx3f5UQXM X-Received: by 2002:a05:6214:2c21:b0:4df:59a8:b6e6 with SMTP id lc33-20020a0562142c2100b004df59a8b6e6mr634993qvb.28.1670976671859; Tue, 13 Dec 2022 16:11:11 -0800 (PST) MIME-Version: 1.0 References: <20221206173601.549281-1-bgardon@google.com> <20221206173601.549281-2-bgardon@google.com> In-Reply-To: From: Ben Gardon Date: Tue, 13 Dec 2022 16:11:00 -0800 Message-ID: Subject: Re: [PATCH 1/7] KVM: x86/MMU: Move pte_list operations to rmap.c To: Vipin Sharma Cc: linux-kernel@vger.kernel.org, kvm@vger.kernel.org, Paolo Bonzini , Peter Xu , Sean Christopherson , David Matlack Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-17.6 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF, ENV_AND_HDR_SPF_MATCH,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS, USER_IN_DEF_DKIM_WL,USER_IN_DEF_SPF_WL autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Dec 7, 2022 at 2:58 PM Vipin Sharma wrote: > > On Tue, Dec 6, 2022 at 9:36 AM Ben Gardon wrote: > > > > In the interest of eventually splitting the Shadow MMU out of mmu.c, > > start by moving some of the operations for manipulating pte_lists out of > > mmu.c and into a new pair of files: rmap.c and rmap.h. > > > > No functional change intended. > > > > Signed-off-by: Ben Gardon > > --- > > arch/x86/kvm/Makefile | 2 +- > > arch/x86/kvm/debugfs.c | 1 + > > arch/x86/kvm/mmu/mmu.c | 152 +------------------------------- > > arch/x86/kvm/mmu/mmu_internal.h | 1 - > > arch/x86/kvm/mmu/rmap.c | 141 +++++++++++++++++++++++++++++ > > arch/x86/kvm/mmu/rmap.h | 34 +++++++ > > 6 files changed, 179 insertions(+), 152 deletions(-) > > create mode 100644 arch/x86/kvm/mmu/rmap.c > > create mode 100644 arch/x86/kvm/mmu/rmap.h > > > > diff --git a/arch/x86/kvm/Makefile b/arch/x86/kvm/Makefile > > index 80e3fe184d17..9f766eebeddf 100644 > > --- a/arch/x86/kvm/Makefile > > +++ b/arch/x86/kvm/Makefile > > @@ -12,7 +12,7 @@ include $(srctree)/virt/kvm/Makefile.kvm > > kvm-y += x86.o emulate.o i8259.o irq.o lapic.o \ > > i8254.o ioapic.o irq_comm.o cpuid.o pmu.o mtrr.o \ > > hyperv.o debugfs.o mmu/mmu.o mmu/page_track.o \ > > - mmu/spte.o > > + mmu/spte.o mmu/rmap.o > > > > ifdef CONFIG_HYPERV > > kvm-y += kvm_onhyperv.o > > diff --git a/arch/x86/kvm/debugfs.c b/arch/x86/kvm/debugfs.c > > index c1390357126a..29f692ecd6f3 100644 > > --- a/arch/x86/kvm/debugfs.c > > +++ b/arch/x86/kvm/debugfs.c > > @@ -9,6 +9,7 @@ > > #include "lapic.h" > > #include "mmu.h" > > #include "mmu/mmu_internal.h" > > +#include "mmu/rmap.h" > > > > static int vcpu_get_timer_advance_ns(void *data, u64 *val) > > { > > diff --git a/arch/x86/kvm/mmu/mmu.c b/arch/x86/kvm/mmu/mmu.c > > index 4736d7849c60..90b3735d6064 100644 > > --- a/arch/x86/kvm/mmu/mmu.c > > +++ b/arch/x86/kvm/mmu/mmu.c > > @@ -26,6 +26,7 @@ > > #include "kvm_emulate.h" > > #include "cpuid.h" > > #include "spte.h" > > +#include "rmap.h" > > > > #include > > #include > > @@ -112,24 +113,6 @@ module_param(dbg, bool, 0644); > > > > #include > > > > -/* make pte_list_desc fit well in cache lines */ > > -#define PTE_LIST_EXT 14 > > - > > -/* > > - * Slight optimization of cacheline layout, by putting `more' and `spte_count' > > - * at the start; then accessing it will only use one single cacheline for > > - * either full (entries==PTE_LIST_EXT) case or entries<=6. > > - */ > > -struct pte_list_desc { > > - struct pte_list_desc *more; > > - /* > > - * Stores number of entries stored in the pte_list_desc. No need to be > > - * u64 but just for easier alignment. When PTE_LIST_EXT, means full. > > - */ > > - u64 spte_count; > > - u64 *sptes[PTE_LIST_EXT]; > > -}; > > - > > struct kvm_shadow_walk_iterator { > > u64 addr; > > hpa_t shadow_addr; > > @@ -155,7 +138,6 @@ struct kvm_shadow_walk_iterator { > > ({ spte = mmu_spte_get_lockless(_walker.sptep); 1; }); \ > > __shadow_walk_next(&(_walker), spte)) > > > > -static struct kmem_cache *pte_list_desc_cache; > > struct kmem_cache *mmu_page_header_cache; > > static struct percpu_counter kvm_total_used_mmu_pages; > > > > @@ -674,11 +656,6 @@ static void mmu_free_memory_caches(struct kvm_vcpu *vcpu) > > kvm_mmu_free_memory_cache(&vcpu->arch.mmu_page_header_cache); > > } > > > > -static void mmu_free_pte_list_desc(struct pte_list_desc *pte_list_desc) > > -{ > > - kmem_cache_free(pte_list_desc_cache, pte_list_desc); > > -} > > - > > static bool sp_has_gptes(struct kvm_mmu_page *sp); > > > > static gfn_t kvm_mmu_page_get_gfn(struct kvm_mmu_page *sp, int index) > > @@ -878,111 +855,6 @@ gfn_to_memslot_dirty_bitmap(struct kvm_vcpu *vcpu, gfn_t gfn, > > return slot; > > } > > > > -/* > > - * About rmap_head encoding: > > - * > > - * If the bit zero of rmap_head->val is clear, then it points to the only spte > > - * in this rmap chain. Otherwise, (rmap_head->val & ~1) points to a struct > > - * pte_list_desc containing more mappings. > > - */ > > - > > -/* > > - * Returns the number of pointers in the rmap chain, not counting the new one. > > - */ > > -static int pte_list_add(struct kvm_mmu_memory_cache *cache, u64 *spte, > > - struct kvm_rmap_head *rmap_head) > > -{ > > - struct pte_list_desc *desc; > > - int count = 0; > > - > > - if (!rmap_head->val) { > > - rmap_printk("%p %llx 0->1\n", spte, *spte); > > - rmap_head->val = (unsigned long)spte; > > - } else if (!(rmap_head->val & 1)) { > > - rmap_printk("%p %llx 1->many\n", spte, *spte); > > - desc = kvm_mmu_memory_cache_alloc(cache); > > - desc->sptes[0] = (u64 *)rmap_head->val; > > - desc->sptes[1] = spte; > > - desc->spte_count = 2; > > - rmap_head->val = (unsigned long)desc | 1; > > - ++count; > > - } else { > > - rmap_printk("%p %llx many->many\n", spte, *spte); > > - desc = (struct pte_list_desc *)(rmap_head->val & ~1ul); > > - while (desc->spte_count == PTE_LIST_EXT) { > > - count += PTE_LIST_EXT; > > - if (!desc->more) { > > - desc->more = kvm_mmu_memory_cache_alloc(cache); > > - desc = desc->more; > > - desc->spte_count = 0; > > - break; > > - } > > - desc = desc->more; > > - } > > - count += desc->spte_count; > > - desc->sptes[desc->spte_count++] = spte; > > - } > > - return count; > > -} > > - > > -static void > > -pte_list_desc_remove_entry(struct kvm_rmap_head *rmap_head, > > - struct pte_list_desc *desc, int i, > > - struct pte_list_desc *prev_desc) > > -{ > > - int j = desc->spte_count - 1; > > - > > - desc->sptes[i] = desc->sptes[j]; > > - desc->sptes[j] = NULL; > > - desc->spte_count--; > > - if (desc->spte_count) > > - return; > > - if (!prev_desc && !desc->more) > > - rmap_head->val = 0; > > - else > > - if (prev_desc) > > - prev_desc->more = desc->more; > > - else > > - rmap_head->val = (unsigned long)desc->more | 1; > > - mmu_free_pte_list_desc(desc); > > -} > > - > > -static void pte_list_remove(u64 *spte, struct kvm_rmap_head *rmap_head) > > -{ > > - struct pte_list_desc *desc; > > - struct pte_list_desc *prev_desc; > > - int i; > > - > > - if (!rmap_head->val) { > > - pr_err("%s: %p 0->BUG\n", __func__, spte); > > - BUG(); > > - } else if (!(rmap_head->val & 1)) { > > - rmap_printk("%p 1->0\n", spte); > > - if ((u64 *)rmap_head->val != spte) { > > - pr_err("%s: %p 1->BUG\n", __func__, spte); > > - BUG(); > > - } > > - rmap_head->val = 0; > > - } else { > > - rmap_printk("%p many->many\n", spte); > > - desc = (struct pte_list_desc *)(rmap_head->val & ~1ul); > > - prev_desc = NULL; > > - while (desc) { > > - for (i = 0; i < desc->spte_count; ++i) { > > - if (desc->sptes[i] == spte) { > > - pte_list_desc_remove_entry(rmap_head, > > - desc, i, prev_desc); > > - return; > > - } > > - } > > - prev_desc = desc; > > - desc = desc->more; > > - } > > - pr_err("%s: %p many->many\n", __func__, spte); > > - BUG(); > > - } > > -} > > - > > static void kvm_zap_one_rmap_spte(struct kvm *kvm, > > struct kvm_rmap_head *rmap_head, u64 *sptep) > > { > > @@ -1011,7 +883,7 @@ static bool kvm_zap_all_rmap_sptes(struct kvm *kvm, > > for (i = 0; i < desc->spte_count; i++) > > mmu_spte_clear_track_bits(kvm, desc->sptes[i]); > > next = desc->more; > > - mmu_free_pte_list_desc(desc); > > + free_pte_list_desc(desc); > > } > > out: > > /* rmap_head is meaningless now, remember to reset it */ > > @@ -1019,26 +891,6 @@ static bool kvm_zap_all_rmap_sptes(struct kvm *kvm, > > return true; > > } > > > > -unsigned int pte_list_count(struct kvm_rmap_head *rmap_head) > > -{ > > - struct pte_list_desc *desc; > > - unsigned int count = 0; > > - > > - if (!rmap_head->val) > > - return 0; > > - else if (!(rmap_head->val & 1)) > > - return 1; > > - > > - desc = (struct pte_list_desc *)(rmap_head->val & ~1ul); > > - > > - while (desc) { > > - count += desc->spte_count; > > - desc = desc->more; > > - } > > - > > - return count; > > -} > > - > > static struct kvm_rmap_head *gfn_to_rmap(gfn_t gfn, int level, > > const struct kvm_memory_slot *slot) > > { > > diff --git a/arch/x86/kvm/mmu/mmu_internal.h b/arch/x86/kvm/mmu/mmu_internal.h > > index dbaf6755c5a7..cd1c8f32269d 100644 > > --- a/arch/x86/kvm/mmu/mmu_internal.h > > +++ b/arch/x86/kvm/mmu/mmu_internal.h > > @@ -166,7 +166,6 @@ bool kvm_mmu_slot_gfn_write_protect(struct kvm *kvm, > > int min_level); > > void kvm_flush_remote_tlbs_with_address(struct kvm *kvm, > > u64 start_gfn, u64 pages); > > -unsigned int pte_list_count(struct kvm_rmap_head *rmap_head); > > > > extern int nx_huge_pages; > > static inline bool is_nx_huge_page_enabled(struct kvm *kvm) > > diff --git a/arch/x86/kvm/mmu/rmap.c b/arch/x86/kvm/mmu/rmap.c > > new file mode 100644 > > index 000000000000..daa99dee0709 > > --- /dev/null > > +++ b/arch/x86/kvm/mmu/rmap.c > > @@ -0,0 +1,141 @@ > > +// SPDX-License-Identifier: GPL-2.0 > > + > > A comment would be nice to write expectations from this file and what > code lives here. I'll add one. > > > +#include "mmu.h" > > +#include "mmu_internal.h" > > +#include "mmutrace.h" > > +#include "rmap.h" > > +#include "spte.h" > > + > > +#include > > +#include > > + > > +/* > > + * About rmap_head encoding: > > + * > > + * If the bit zero of rmap_head->val is clear, then it points to the only spte > > + * in this rmap chain. Otherwise, (rmap_head->val & ~1) points to a struct > > + * pte_list_desc containing more mappings. > > + */ > > + > > +/* > > + * Returns the number of pointers in the rmap chain, not counting the new one. > > + */ > > +int pte_list_add(struct kvm_mmu_memory_cache *cache, u64 *spte, > > + struct kvm_rmap_head *rmap_head) > > +{ > > + struct pte_list_desc *desc; > > + int count = 0; > > + > > + if (!rmap_head->val) { > > + rmap_printk("%p %llx 0->1\n", spte, *spte); > > + rmap_head->val = (unsigned long)spte; > > + } else if (!(rmap_head->val & 1)) { > > + rmap_printk("%p %llx 1->many\n", spte, *spte); > > + desc = kvm_mmu_memory_cache_alloc(cache); > > + desc->sptes[0] = (u64 *)rmap_head->val; > > + desc->sptes[1] = spte; > > + desc->spte_count = 2; > > + rmap_head->val = (unsigned long)desc | 1; > > + ++count; > > + } else { > > + rmap_printk("%p %llx many->many\n", spte, *spte); > > + desc = (struct pte_list_desc *)(rmap_head->val & ~1ul); > > + while (desc->spte_count == PTE_LIST_EXT) { > > + count += PTE_LIST_EXT; > > + if (!desc->more) { > > + desc->more = kvm_mmu_memory_cache_alloc(cache); > > + desc = desc->more; > > + desc->spte_count = 0; > > + break; > > + } > > + desc = desc->more; > > + } > > + count += desc->spte_count; > > + desc->sptes[desc->spte_count++] = spte; > > + } > > + return count; > > +} > > + > > +void free_pte_list_desc(struct pte_list_desc *pte_list_desc) > > +{ > > + kmem_cache_free(pte_list_desc_cache, pte_list_desc); > > +} > > + > > +static void > > +pte_list_desc_remove_entry(struct kvm_rmap_head *rmap_head, > > + struct pte_list_desc *desc, int i, > > + struct pte_list_desc *prev_desc) > > +{ > > + int j = desc->spte_count - 1; > > + > > + desc->sptes[i] = desc->sptes[j]; > > + desc->sptes[j] = NULL; > > + desc->spte_count--; > > + if (desc->spte_count) > > + return; > > + if (!prev_desc && !desc->more) > > + rmap_head->val = 0; > > + else > > + if (prev_desc) > > + prev_desc->more = desc->more; > > + else > > + rmap_head->val = (unsigned long)desc->more | 1; > > + free_pte_list_desc(desc); > > +} > > + > > +void pte_list_remove(u64 *spte, struct kvm_rmap_head *rmap_head) > > +{ > > + struct pte_list_desc *desc; > > + struct pte_list_desc *prev_desc; > > + int i; > > + > > + if (!rmap_head->val) { > > + pr_err("%s: %p 0->BUG\n", __func__, spte); > > + BUG(); > > + } else if (!(rmap_head->val & 1)) { > > + rmap_printk("%p 1->0\n", spte); > > + if ((u64 *)rmap_head->val != spte) { > > + pr_err("%s: %p 1->BUG\n", __func__, spte); > > + BUG(); > > + } > > + rmap_head->val = 0; > > + } else { > > + rmap_printk("%p many->many\n", spte); > > + desc = (struct pte_list_desc *)(rmap_head->val & ~1ul); > > + prev_desc = NULL; > > + while (desc) { > > + for (i = 0; i < desc->spte_count; ++i) { > > + if (desc->sptes[i] == spte) { > > + pte_list_desc_remove_entry(rmap_head, > > + desc, i, prev_desc); > > + return; > > + } > > + } > > + prev_desc = desc; > > + desc = desc->more; > > + } > > + pr_err("%s: %p many->many\n", __func__, spte); > > + BUG(); > > + } > > +} > > + > > +unsigned int pte_list_count(struct kvm_rmap_head *rmap_head) > > +{ > > + struct pte_list_desc *desc; > > + unsigned int count = 0; > > + > > + if (!rmap_head->val) > > + return 0; > > + else if (!(rmap_head->val & 1)) > > + return 1; > > + > > + desc = (struct pte_list_desc *)(rmap_head->val & ~1ul); > > + > > + while (desc) { > > + count += desc->spte_count; > > + desc = desc->more; > > + } > > + > > + return count; > > +} > > + > > diff --git a/arch/x86/kvm/mmu/rmap.h b/arch/x86/kvm/mmu/rmap.h > > new file mode 100644 > > index 000000000000..059765b6e066 > > --- /dev/null > > +++ b/arch/x86/kvm/mmu/rmap.h > > @@ -0,0 +1,34 @@ > > +// SPDX-License-Identifier: GPL-2.0 > > + > > +#ifndef __KVM_X86_MMU_RMAP_H > > +#define __KVM_X86_MMU_RMAP_H > > + > > +#include > > + > > +/* make pte_list_desc fit well in cache lines */ > > +#define PTE_LIST_EXT 14 > > + > > +/* > > + * Slight optimization of cacheline layout, by putting `more' and `spte_count' > > + * at the start; then accessing it will only use one single cacheline for > > + * either full (entries==PTE_LIST_EXT) case or entries<=6. > > + */ > > +struct pte_list_desc { > > + struct pte_list_desc *more; > > + /* > > + * Stores number of entries stored in the pte_list_desc. No need to be > > + * u64 but just for easier alignment. When PTE_LIST_EXT, means full. > > + */ > > + u64 spte_count; > > + u64 *sptes[PTE_LIST_EXT]; > > +}; > > + > > +static struct kmem_cache *pte_list_desc_cache; > > Does it make sense to make it non static and extern here. Also, you > can provide an init function which can be called from mmu.c? Going to follow David's suggestion and leave it in mmu.c for now. > > > > + > > +int pte_list_add(struct kvm_mmu_memory_cache *cache, u64 *spte, > > + struct kvm_rmap_head *rmap_head); > > +void free_pte_list_desc(struct pte_list_desc *pte_list_desc); > > +void pte_list_remove(u64 *spte, struct kvm_rmap_head *rmap_head); > > +unsigned int pte_list_count(struct kvm_rmap_head *rmap_head); > > + > > Similar to tdp_mmu, and other rmap functions in next patches in the > series should above functions be prefixed with "rmap_"? I think I'm going to abandon the idea of having a seperate file for rmap stuff and just have one, larger shadow mmu file with a variety of names. I'll clean up the naming at the end of the series once everything is moved over and the set of things being exported from the shadow_mmu.c file has stabilized. > > > > +#endif /* __KVM_X86_MMU_RMAP_H */ > > -- > > 2.39.0.rc0.267.gcb52ba06e7-goog > >