2022-02-26 01:42:05

by Sean Christopherson

Subject: [PATCH v3 00/28] KVM: x86/mmu: Overhaul TDP MMU zapping and flushing

Overhaul TDP MMU's handling of zapping and TLB flushing to reduce the
number of TLB flushes, fix soft lockups and RCU stalls, avoid blocking
vCPUs for long durations while zapping paging structures, and to clean up
the zapping code.

The largest cleanup is to separate the flows for zapping roots (zap
_everything_), zapping leaf SPTEs (zap guest mappings for whatever reason),
and zapping a specific SP (NX recovery). They're currently smushed into a
single zap_gfn_range(), which was a good idea at the time, but became a
mess when trying to handle the different rules, e.g. TLB flushes aren't
needed when zapping a root because KVM can safely zap a root if and only
if it's unreachable.

To solve the soft lockups, stalls, and vCPU performance issues:

- Defer remote TLB flushes to the caller when zapping TDP MMU shadow
pages by relying on RCU to ensure the paging structure isn't freed
until all vCPUs have exited the guest.

- Allow yielding when zapping TDP MMU roots in response to the root's
last reference being put. This requires a bit of trickery to ensure
the root is reachable via mmu_notifier, but it's not too gross.

- Zap roots in two passes to avoid holding RCU for potentially hundreds of
seconds when zapping a guest with terabytes of memory that is backed
entirely by 4kb SPTEs (see the sketch after this list).

- Zap defunct roots asynchronously via the common work_queue so that a
vCPU doesn't get stuck doing the work if the vCPU happens to drop the
last reference to a root.
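
As a conceptual sketch of the two-pass approach called out above (the helper
name and level choices here are illustrative, not the literal patch):

/*
 * Zap the root in two passes so that RCU can be dropped on yield between
 * chunks of work.  Pass 1 zaps the lower levels, which covers the vast
 * majority of SPTEs; pass 2 zaps the top-level entries, which by then
 * point at mostly-empty page tables and are cheap to recurse through.
 */
static void example_zap_root(struct kvm *kvm, struct kvm_mmu_page *root,
			     bool shared)
{
	example_zap_root_at_level(kvm, root, shared, PG_LEVEL_1G);
	example_zap_root_at_level(kvm, root, shared, root->role.level);
}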

The selftest at the end allows populating a guest with the max amount of
memory allowed by the underlying architecture. The most I've tested is
~64tb (MAXPHYADDR=46) as I don't have easy access to a system with
MAXPHYADDR=52. The selftest compiles on arm64 and s390x, but otherwise
hasn't been tested outside of x86-64. It will hopefully do something
useful as is, but there's a non-zero chance it won't get past init with
a high max memory. Running on x86 without the TDP MMU is comically slow.

v3:
- Drop patches that were applied.
- Rebase to latest kvm/queue.
- Collect a review. [David]
- Use helper instead of goto to zap roots in two passes. [David]
- Add patches to disallow REMOVED "old" SPTE when atomically
setting SPTE.

v2:
- https://lore.kernel.org/all/[email protected]
- Drop patches that were applied.
- Collect reviews for patches that weren't modified. [Ben]
- Abandon the idea of taking invalid roots off the list of roots.
- Add a patch to fix misleading/wrong comments with respect to KVM's
responsibilities in the "fast zap" flow, specifically that all SPTEs
must be dropped before the zap completes.
- Rework yielding in kvm_tdp_mmu_put_root() to keep the root visible
while yielding.
- Add patch to zap roots in two passes. [Mingwei, David]
- Add a patch to asynchronously zap defunct roots.
- Add the selftest.

v1: https://lore.kernel.org/all/[email protected]


Sean Christopherson (28):
KVM: x86/mmu: Use common iterator for walking invalid TDP MMU roots
KVM: x86/mmu: Check for present SPTE when clearing dirty bit in TDP
MMU
KVM: x86/mmu: Fix wrong/misleading comments in TDP MMU fast zap
KVM: x86/mmu: Formalize TDP MMU's (unintended?) deferred TLB flush
logic
KVM: x86/mmu: Document that zapping invalidated roots doesn't need to
flush
KVM: x86/mmu: Require mmu_lock be held for write in unyielding root
iter
KVM: x86/mmu: Check for !leaf=>leaf, not PFN change, in TDP MMU SP
removal
KVM: x86/mmu: Batch TLB flushes from TDP MMU for MMU notifier
change_spte
KVM: x86/mmu: Drop RCU after processing each root in MMU notifier
hooks
KVM: x86/mmu: Add helpers to read/write TDP MMU SPTEs and document RCU
KVM: x86/mmu: WARN if old _or_ new SPTE is REMOVED in non-atomic path
KVM: x86/mmu: Refactor low-level TDP MMU set SPTE helper to take raw
vals
KVM: x86/mmu: Zap only the target TDP MMU shadow page in NX recovery
KVM: x86/mmu: Skip remote TLB flush when zapping all of TDP MMU
KVM: x86/mmu: Add dedicated helper to zap TDP MMU root shadow page
KVM: x86/mmu: Require mmu_lock be held for write to zap TDP MMU range
KVM: x86/mmu: Zap only TDP MMU leafs in kvm_zap_gfn_range()
KVM: x86/mmu: Do remote TLB flush before dropping RCU in TDP MMU
resched
KVM: x86/mmu: Defer TLB flush to caller when freeing TDP MMU shadow
pages
KVM: x86/mmu: Allow yielding when zapping GFNs for defunct TDP MMU
root
KVM: x86/mmu: Zap roots in two passes to avoid inducing RCU stalls
KVM: x86/mmu: Zap defunct roots via asynchronous worker
KVM: x86/mmu: Check for a REMOVED leaf SPTE before making the SPTE
KVM: x86/mmu: WARN on any attempt to atomically update REMOVED SPTE
KVM: selftests: Move raw KVM_SET_USER_MEMORY_REGION helper to utils
KVM: selftests: Split out helper to allocate guest mem via memfd
KVM: selftests: Define cpu_relax() helpers for s390 and x86
KVM: selftests: Add test to populate a VM with the max possible guest
mem

arch/x86/kvm/mmu/mmu.c | 42 +-
arch/x86/kvm/mmu/mmu_internal.h | 15 +-
arch/x86/kvm/mmu/tdp_iter.c | 6 +-
arch/x86/kvm/mmu/tdp_iter.h | 15 +-
arch/x86/kvm/mmu/tdp_mmu.c | 595 ++++++++++++------
arch/x86/kvm/mmu/tdp_mmu.h | 26 +-
tools/testing/selftests/kvm/.gitignore | 1 +
tools/testing/selftests/kvm/Makefile | 3 +
.../selftests/kvm/include/kvm_util_base.h | 5 +
.../selftests/kvm/include/s390x/processor.h | 8 +
.../selftests/kvm/include/x86_64/processor.h | 5 +
tools/testing/selftests/kvm/lib/kvm_util.c | 66 +-
.../selftests/kvm/max_guest_memory_test.c | 292 +++++++++
.../selftests/kvm/set_memory_region_test.c | 35 +-
14 files changed, 832 insertions(+), 282 deletions(-)
create mode 100644 tools/testing/selftests/kvm/max_guest_memory_test.c


base-commit: 625e7ef7da1a4addd8db41c2504fe8a25b93acd5
--
2.35.1.574.g5d30c73bfb-goog


2022-02-26 01:46:23

by Sean Christopherson

Subject: [PATCH v3 19/28] KVM: x86/mmu: Defer TLB flush to caller when freeing TDP MMU shadow pages

Defer TLB flushes to the caller when freeing TDP MMU shadow pages instead
of immediately flushing. Because the shadow pages are freed in an RCU
callback, so long as at least one CPU holds RCU, all CPUs are protected.
For vCPUs running in the guest, i.e. consuming TLB entries, KVM only
needs to ensure the caller services the pending TLB flush before dropping
its RCU protections. I.e. use the caller's RCU as a proxy for all vCPUs
running in the guest.

Deferring the flushes allows batching flushes, e.g. when installing a
1gb hugepage and zapping a pile of SPs. And when zapping an entire root,
deferring flushes allows skipping the flush entirely (because flushes are
not needed in that case).

Avoiding flushes when zapping an entire root is especially important as
synchronizing with other CPUs via IPI after zapping every shadow page can
cause significant performance issues for large VMs. The issue is
exacerbated by KVM zapping entire top-level entries without dropping
RCU protection, which can lead to RCU stalls even when zapping roots
backing relatively "small" amounts of guest memory, e.g. 2tb. Removing
the IPI bottleneck largely mitigates the RCU issues, though it's likely
still a problem for 5-level paging. A future patch will further address
the problem by zapping roots in multiple passes to avoid holding RCU for
an extended duration.
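
To make the new contract concrete, here is a minimal caller-side sketch
(example_zap_tdp_mmu_sptes() is a placeholder; the kvm_recover_nx_lpages()
hunk below is the real instance of this pattern):

/*
 * The zap helpers only report that a flush is needed; the caller must
 * service the flush before dropping its RCU read lock, as the zapped
 * shadow pages are freed via RCU callback once all readers are done.
 */
rcu_read_lock();

flush = example_zap_tdp_mmu_sptes(kvm);	/* may batch many zaps */

if (flush)
	kvm_flush_remote_tlbs(kvm);

rcu_read_unlock();	/* only now can the freed shadow pages be reclaimed */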

Reviewed-by: Ben Gardon <[email protected]>
Signed-off-by: Sean Christopherson <[email protected]>
---
arch/x86/kvm/mmu/mmu.c | 12 ++++++++++++
arch/x86/kvm/mmu/tdp_iter.h | 7 +++----
arch/x86/kvm/mmu/tdp_mmu.c | 20 ++++++++++----------
3 files changed, 25 insertions(+), 14 deletions(-)

diff --git a/arch/x86/kvm/mmu/mmu.c b/arch/x86/kvm/mmu/mmu.c
index 92fe1ba27213..c1deaec795c2 100644
--- a/arch/x86/kvm/mmu/mmu.c
+++ b/arch/x86/kvm/mmu/mmu.c
@@ -6290,6 +6290,12 @@ static void kvm_recover_nx_lpages(struct kvm *kvm)
rcu_idx = srcu_read_lock(&kvm->srcu);
write_lock(&kvm->mmu_lock);

+ /*
+ * Zapping TDP MMU shadow pages, including the remote TLB flush, must
+ * be done under RCU protection, because the pages are freed via RCU callback.
+ */
+ rcu_read_lock();
+
ratio = READ_ONCE(nx_huge_pages_recovery_ratio);
to_zap = ratio ? DIV_ROUND_UP(nx_lpage_splits, ratio) : 0;
for ( ; to_zap; --to_zap) {
@@ -6314,12 +6320,18 @@ static void kvm_recover_nx_lpages(struct kvm *kvm)

if (need_resched() || rwlock_needbreak(&kvm->mmu_lock)) {
kvm_mmu_remote_flush_or_zap(kvm, &invalid_list, flush);
+ rcu_read_unlock();
+
cond_resched_rwlock_write(&kvm->mmu_lock);
flush = false;
+
+ rcu_read_lock();
}
}
kvm_mmu_remote_flush_or_zap(kvm, &invalid_list, flush);

+ rcu_read_unlock();
+
write_unlock(&kvm->mmu_lock);
srcu_read_unlock(&kvm->srcu, rcu_idx);
}
diff --git a/arch/x86/kvm/mmu/tdp_iter.h b/arch/x86/kvm/mmu/tdp_iter.h
index e2a7e267a77d..b1eaf6ec0e0b 100644
--- a/arch/x86/kvm/mmu/tdp_iter.h
+++ b/arch/x86/kvm/mmu/tdp_iter.h
@@ -9,10 +9,9 @@

/*
* TDP MMU SPTEs are RCU protected to allow paging structures (non-leaf SPTEs)
- * to be zapped while holding mmu_lock for read. Holding RCU isn't required for
- * correctness if mmu_lock is held for write, but plumbing "struct kvm" down to
- * the lower depths of the TDP MMU just to make lockdep happy is a nightmare, so
- * all accesses to SPTEs are done under RCU protection.
+ * to be zapped while holding mmu_lock for read, and to allow TLB flushes to be
+ * batched without having to collect the list of zapped SPs. Flows that can
+ * remove SPs must service pending TLB flushes prior to dropping RCU protection.
*/
static inline u64 kvm_tdp_mmu_read_spte(tdp_ptep_t sptep)
{
diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index ef594af246f5..3031b42c27a6 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -405,9 +405,6 @@ static void handle_removed_pt(struct kvm *kvm, tdp_ptep_t pt, bool shared)
shared);
}

- kvm_flush_remote_tlbs_with_address(kvm, base_gfn,
- KVM_PAGES_PER_HPAGE(level + 1));
-
call_rcu(&sp->rcu_head, tdp_mmu_free_sp_rcu_callback);
}

@@ -831,19 +828,13 @@ bool kvm_tdp_mmu_zap_sp(struct kvm *kvm, struct kvm_mmu_page *sp)
if (WARN_ON_ONCE(!sp->ptep))
return false;

- rcu_read_lock();
-
old_spte = kvm_tdp_mmu_read_spte(sp->ptep);
- if (WARN_ON_ONCE(!is_shadow_present_pte(old_spte))) {
- rcu_read_unlock();
+ if (WARN_ON_ONCE(!is_shadow_present_pte(old_spte)))
return false;
- }

__tdp_mmu_set_spte(kvm, kvm_mmu_page_as_id(sp), sp->ptep, old_spte, 0,
sp->gfn, sp->role.level + 1, true, true);

- rcu_read_unlock();
-
return true;
}

@@ -884,6 +875,11 @@ static bool tdp_mmu_zap_leafs(struct kvm *kvm, struct kvm_mmu_page *root,
}

rcu_read_unlock();
+
+ /*
+ * Because this flow zaps _only_ leaf SPTEs, the caller doesn't need
+ * to provide RCU protection as no 'struct kvm_mmu_page' will be freed.
+ */
return flush;
}

@@ -1013,6 +1009,10 @@ static int tdp_mmu_map_handle_target_level(struct kvm_vcpu *vcpu,
ret = RET_PF_SPURIOUS;
else if (tdp_mmu_set_spte_atomic(vcpu->kvm, iter, new_spte))
return RET_PF_RETRY;
+ else if (is_shadow_present_pte(iter->old_spte) &&
+ !is_last_spte(iter->old_spte, iter->level))
+ kvm_flush_remote_tlbs_with_address(vcpu->kvm, sp->gfn,
+ KVM_PAGES_PER_HPAGE(iter->level + 1));

/*
* If the page fault was caused by a write but the page is write
--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 01:49:01

by Sean Christopherson

Subject: [PATCH v3 02/28] KVM: x86/mmu: Check for present SPTE when clearing dirty bit in TDP MMU

Explicitly check for present SPTEs when clearing dirty bits in the TDP
MMU. This isn't strictly required for correctness, as setting the dirty
bit in a defunct SPTE will not change the SPTE from !PRESENT to PRESENT.
However, the guarded MMU_WARN_ON() in spte_ad_need_write_protect() would
complain if anyone actually turned on KVM's MMU debugging.

Fixes: a6a0b05da9f3 ("kvm: x86/mmu: Support dirty logging for the TDP MMU")
Cc: Ben Gardon <[email protected]>
Signed-off-by: Sean Christopherson <[email protected]>
Reviewed-by: Ben Gardon <[email protected]>
---
arch/x86/kvm/mmu/tdp_mmu.c | 3 +++
1 file changed, 3 insertions(+)

diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index 25148e8b711d..9357780ec28f 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -1446,6 +1446,9 @@ static bool clear_dirty_gfn_range(struct kvm *kvm, struct kvm_mmu_page *root,
if (tdp_mmu_iter_cond_resched(kvm, &iter, false, true))
continue;

+ if (!is_shadow_present_pte(iter.old_spte))
+ continue;
+
if (spte_ad_need_write_protect(iter.old_spte)) {
if (is_writable_pte(iter.old_spte))
new_spte = iter.old_spte & ~PT_WRITABLE_MASK;
--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 01:50:18

by Sean Christopherson

Subject: [PATCH v3 22/28] KVM: x86/mmu: Zap defunct roots via asynchronous worker

Zap defunct roots, a.k.a. roots that have been invalidated after their
last reference was initially dropped, asynchronously via the system work
queue instead of forcing the work upon the unfortunate task that happened
to drop the last reference.

If a vCPU task drops the last reference, the vCPU is effectively blocked
by the host for the entire duration of the zap. If the root being zapped
happens to be fully populated with 4kb leaf SPTEs, e.g. due to dirty logging
being active, the zap can take several hundred seconds. Unsurprisingly,
most guests are unhappy if a vCPU disappears for hundreds of seconds.

E.g. running a synthetic selftest that triggers a vCPU root zap with
~64tb of guest memory and 4kb SPTEs blocks the vCPU for 900+ seconds.
Offloading the zap to a worker drops the block time to <100ms.

Signed-off-by: Sean Christopherson <[email protected]>
---
arch/x86/kvm/mmu/mmu_internal.h | 8 +++-
arch/x86/kvm/mmu/tdp_mmu.c | 65 ++++++++++++++++++++++++++++-----
2 files changed, 63 insertions(+), 10 deletions(-)

diff --git a/arch/x86/kvm/mmu/mmu_internal.h b/arch/x86/kvm/mmu/mmu_internal.h
index be063b6c91b7..1bff453f7cbe 100644
--- a/arch/x86/kvm/mmu/mmu_internal.h
+++ b/arch/x86/kvm/mmu/mmu_internal.h
@@ -65,7 +65,13 @@ struct kvm_mmu_page {
struct kvm_rmap_head parent_ptes; /* rmap pointers to parent sptes */
tdp_ptep_t ptep;
};
- DECLARE_BITMAP(unsync_child_bitmap, 512);
+ union {
+ DECLARE_BITMAP(unsync_child_bitmap, 512);
+ struct {
+ struct work_struct tdp_mmu_async_work;
+ void *tdp_mmu_async_data;
+ };
+ };

struct list_head lpage_disallowed_link;
#ifdef CONFIG_X86_32
diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index ec28a88c6376..4151e61245a7 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -81,6 +81,38 @@ static void tdp_mmu_free_sp_rcu_callback(struct rcu_head *head)
static void tdp_mmu_zap_root(struct kvm *kvm, struct kvm_mmu_page *root,
bool shared);

+static void tdp_mmu_zap_root_async(struct work_struct *work)
+{
+ struct kvm_mmu_page *root = container_of(work, struct kvm_mmu_page,
+ tdp_mmu_async_work);
+ struct kvm *kvm = root->tdp_mmu_async_data;
+
+ read_lock(&kvm->mmu_lock);
+
+ /*
+ * A TLB flush is not necessary as KVM performs a local TLB flush when
+ * allocating a new root (see kvm_mmu_load()), and when migrating vCPU
+ * to a different pCPU. Note, the local TLB flush on reuse also
+ * invalidates any paging-structure-cache entries, i.e. TLB entries for
+ * intermediate paging structures, that may be zapped, as such entries
+ * are associated with the ASID on both VMX and SVM.
+ */
+ tdp_mmu_zap_root(kvm, root, true);
+
+ /*
+ * Drop the refcount using kvm_tdp_mmu_put_root() to test its logic for
+ * avoiding an infinite loop. By design, the root is reachable while
+ * it's being asynchronously zapped, thus a different task can put its
+ * last reference, i.e. flowing through kvm_tdp_mmu_put_root() for an
+ * asynchronously zapped root is unavoidable.
+ */
+ kvm_tdp_mmu_put_root(kvm, root, true);
+
+ read_unlock(&kvm->mmu_lock);
+
+ kvm_put_kvm(kvm);
+}
+
void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
bool shared)
{
@@ -142,15 +174,26 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
refcount_set(&root->tdp_mmu_root_count, 1);

/*
- * Zap the root, then put the refcount "acquired" above. Recursively
- * call kvm_tdp_mmu_put_root() to test the above logic for avoiding an
- * infinite loop by freeing invalid roots. By design, the root is
- * reachable while it's being zapped, thus a different task can put its
- * last reference, i.e. flowing through kvm_tdp_mmu_put_root() for a
- * defunct root is unavoidable.
+ * Attempt to acquire a reference to KVM itself. If KVM is alive, then
+ * zap the root asynchronously in a worker, otherwise it must be zapped
+ * directly here. Wait to do this check until after the refcount is
+ * reset so that tdp_mmu_zap_root() can safely yield.
+ *
+ * In both flows, zap the root, then put the refcount "acquired" above.
+ * When putting the reference, use kvm_tdp_mmu_put_root() to test the
+ * above logic for avoiding an infinite loop by freeing invalid roots.
+ * By design, the root is reachable while it's being zapped, thus a
+ * different task can put its last reference, i.e. flowing through
+ * kvm_tdp_mmu_put_root() for a defunct root is unavoidable.
*/
- tdp_mmu_zap_root(kvm, root, shared);
- kvm_tdp_mmu_put_root(kvm, root, shared);
+ if (kvm_get_kvm_safe(kvm)) {
+ root->tdp_mmu_async_data = kvm;
+ INIT_WORK(&root->tdp_mmu_async_work, tdp_mmu_zap_root_async);
+ schedule_work(&root->tdp_mmu_async_work);
+ } else {
+ tdp_mmu_zap_root(kvm, root, shared);
+ kvm_tdp_mmu_put_root(kvm, root, shared);
+ }
}

enum tdp_mmu_roots_iter_type {
@@ -954,7 +997,11 @@ void kvm_tdp_mmu_zap_all(struct kvm *kvm)

/*
* Zap all roots, including invalid roots, as all SPTEs must be dropped
- * before returning to the caller.
+ * before returning to the caller. Zap directly even if the root is
+ * also being zapped by a worker. Walking zapped top-level SPTEs isn't
+ * all that expensive and mmu_lock is already held, which means the
+ * worker has yielded, i.e. flushing the work instead of zapping here
+ * isn't guaranteed to be any faster.
*
* A TLB flush is unnecessary, KVM zaps everything if and only if the VM
* is being destroyed or the userspace VMM has exited. In both cases,
--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 01:52:44

by Sean Christopherson

Subject: [PATCH v3 26/28] KVM: selftests: Split out helper to allocate guest mem via memfd

Extract the code for allocating guest memory via memfd out of
vm_userspace_mem_region_add() and into a new helper, kvm_memfd_alloc().
A future selftest to populate a guest with the maximum amount of guest
memory will abuse KVM's memslots to alias guest memory regions to a
single memfd-backed host region, i.e. needs to back a guest with memfd
memory without a 1:1 association between a memslot and a memfd instance.

No functional change intended.
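
A hypothetical usage sketch (not the actual max_guest_memory_test code) of how
a test can alias one memfd across many memslots; 'vm', 'first_slot', 'nr_slots'
and 'gpa_base' are assumed to exist, and vm_set_user_memory_region() is the raw
helper moved to common code earlier in this series:

size_t size = 2ul * 1024 * 1024 * 1024;
int fd = kvm_memfd_alloc(size, false);
int i;

for (i = 0; i < nr_slots; i++) {
	/* Every mapping aliases the same host memory backing the memfd. */
	void *hva = mmap(NULL, size, PROT_READ | PROT_WRITE, MAP_SHARED,
			 fd, 0);

	TEST_ASSERT(hva != MAP_FAILED, "mmap() failed");
	vm_set_user_memory_region(vm, first_slot + i, 0,
				  gpa_base + (uint64_t)i * size, size, hva);
}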

Signed-off-by: Sean Christopherson <[email protected]>
---
.../selftests/kvm/include/kvm_util_base.h | 1 +
tools/testing/selftests/kvm/lib/kvm_util.c | 42 +++++++++++--------
2 files changed, 25 insertions(+), 18 deletions(-)

diff --git a/tools/testing/selftests/kvm/include/kvm_util_base.h b/tools/testing/selftests/kvm/include/kvm_util_base.h
index 573de0354175..92cef0ffb19e 100644
--- a/tools/testing/selftests/kvm/include/kvm_util_base.h
+++ b/tools/testing/selftests/kvm/include/kvm_util_base.h
@@ -123,6 +123,7 @@ int kvm_memcmp_hva_gva(void *hva, struct kvm_vm *vm, const vm_vaddr_t gva,
size_t len);

void kvm_vm_elf_load(struct kvm_vm *vm, const char *filename);
+int kvm_memfd_alloc(size_t size, bool hugepages);

void vm_dump(FILE *stream, struct kvm_vm *vm, uint8_t indent);

diff --git a/tools/testing/selftests/kvm/lib/kvm_util.c b/tools/testing/selftests/kvm/lib/kvm_util.c
index dcb8e96c6a54..1665a220abcb 100644
--- a/tools/testing/selftests/kvm/lib/kvm_util.c
+++ b/tools/testing/selftests/kvm/lib/kvm_util.c
@@ -718,6 +718,27 @@ void kvm_vm_free(struct kvm_vm *vmp)
free(vmp);
}

+int kvm_memfd_alloc(size_t size, bool hugepages)
+{
+ int memfd_flags = MFD_CLOEXEC;
+ int fd, r;
+
+ if (hugepages)
+ memfd_flags |= MFD_HUGETLB;
+
+ fd = memfd_create("kvm_selftest", memfd_flags);
+ TEST_ASSERT(fd != -1, "memfd_create() failed, errno: %i (%s)",
+ errno, strerror(errno));
+
+ r = ftruncate(fd, size);
+ TEST_ASSERT(!r, "ftruncate() failed, errno: %i (%s)", errno, strerror(errno));
+
+ r = fallocate(fd, FALLOC_FL_PUNCH_HOLE | FALLOC_FL_KEEP_SIZE, 0, size);
+ TEST_ASSERT(!r, "fallocate() failed, errno: %i (%s)", errno, strerror(errno));
+
+ return fd;
+}
+
/*
* Memory Compare, host virtual to guest virtual
*
@@ -970,24 +991,9 @@ void vm_userspace_mem_region_add(struct kvm_vm *vm,
region->mmap_size += alignment;

region->fd = -1;
- if (backing_src_is_shared(src_type)) {
- int memfd_flags = MFD_CLOEXEC;
-
- if (src_type == VM_MEM_SRC_SHARED_HUGETLB)
- memfd_flags |= MFD_HUGETLB;
-
- region->fd = memfd_create("kvm_selftest", memfd_flags);
- TEST_ASSERT(region->fd != -1,
- "memfd_create failed, errno: %i", errno);
-
- ret = ftruncate(region->fd, region->mmap_size);
- TEST_ASSERT(ret == 0, "ftruncate failed, errno: %i", errno);
-
- ret = fallocate(region->fd,
- FALLOC_FL_PUNCH_HOLE | FALLOC_FL_KEEP_SIZE, 0,
- region->mmap_size);
- TEST_ASSERT(ret == 0, "fallocate failed, errno: %i", errno);
- }
+ if (backing_src_is_shared(src_type))
+ region->fd = kvm_memfd_alloc(region->mmap_size,
+ src_type == VM_MEM_SRC_SHARED_HUGETLB);

region->mmap_start = mmap(NULL, region->mmap_size,
PROT_READ | PROT_WRITE,
--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 01:53:30

by Sean Christopherson

Subject: [PATCH v3 04/28] KVM: x86/mmu: Formalize TDP MMU's (unintended?) deferred TLB flush logic

Explicitly ignore the result of zap_gfn_range() when putting the last
reference to a TDP MMU root, and add a pile of comments to formalize the
TDP MMU's behavior of deferring TLB flushes to alloc/reuse. Note, this
only affects the !shared case, as zap_gfn_range() subtly never returns
true for "flush" as the flush is handled by tdp_mmu_zap_spte_atomic().

Putting the root without a flush is ok because even if there are stale
references to the root in the TLB, they are unreachable because KVM will
not run the guest with the same ASID without first flushing (where ASID
in this context refers to both SVM's explicit ASID and Intel's implicit
ASID that is constructed from VPID+PCID+EPT4A+etc...).

Signed-off-by: Sean Christopherson <[email protected]>
---
arch/x86/kvm/mmu/mmu.c | 8 ++++++++
arch/x86/kvm/mmu/tdp_mmu.c | 10 +++++++++-
2 files changed, 17 insertions(+), 1 deletion(-)

diff --git a/arch/x86/kvm/mmu/mmu.c b/arch/x86/kvm/mmu/mmu.c
index 80607513a1f2..5a931c89d27b 100644
--- a/arch/x86/kvm/mmu/mmu.c
+++ b/arch/x86/kvm/mmu/mmu.c
@@ -5069,6 +5069,14 @@ int kvm_mmu_load(struct kvm_vcpu *vcpu)
kvm_mmu_sync_roots(vcpu);

kvm_mmu_load_pgd(vcpu);
+
+ /*
+ * Flush any TLB entries for the new root, the provenance of the root
+ * is unknown. In theory, even if KVM ensures there are no stale TLB
+ * entries for a freed root, an out-of-tree hypervisor could
+ * have left stale entries. Flushing on alloc also allows KVM to skip
+ * the TLB flush when freeing a root (see kvm_tdp_mmu_put_root()).
+ */
static_call(kvm_x86_flush_tlb_current)(vcpu);
out:
return r;
diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index 12866113fb4f..e35bd88d92fd 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -93,7 +93,15 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
list_del_rcu(&root->link);
spin_unlock(&kvm->arch.tdp_mmu_pages_lock);

- zap_gfn_range(kvm, root, 0, -1ull, false, false, shared);
+ /*
+ * A TLB flush is not necessary as KVM performs a local TLB flush when
+ * allocating a new root (see kvm_mmu_load()), and when migrating vCPU
+ * to a different pCPU. Note, the local TLB flush on reuse also
+ * invalidates any paging-structure-cache entries, i.e. TLB entries for
+ * intermediate paging structures, that may be zapped, as such entries
+ * are associated with the ASID on both VMX and SVM.
+ */
+ (void)zap_gfn_range(kvm, root, 0, -1ull, false, false, shared);

call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
}
--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 01:56:44

by Sean Christopherson

Subject: [PATCH v3 17/28] KVM: x86/mmu: Zap only TDP MMU leafs in kvm_zap_gfn_range()

Zap only leaf SPTEs in the TDP MMU's zap_gfn_range(), and rename various
functions accordingly. When removing mappings for functional correctness
(except for the stupid VFIO GPU passthrough memslots bug), zapping the
leaf SPTEs is sufficient as the paging structures themselves do not point
at guest memory and do not directly impact the final translation (in the
TDP MMU).

Note, this aligns the TDP MMU with the legacy/full MMU, which zaps only
the rmaps, a.k.a. leaf SPTEs, in kvm_zap_gfn_range() and
kvm_unmap_gfn_range().

Signed-off-by: Sean Christopherson <[email protected]>
Reviewed-by: Ben Gardon <[email protected]>
---
arch/x86/kvm/mmu/mmu.c | 4 ++--
arch/x86/kvm/mmu/tdp_mmu.c | 41 ++++++++++----------------------------
arch/x86/kvm/mmu/tdp_mmu.h | 8 +-------
3 files changed, 14 insertions(+), 39 deletions(-)

diff --git a/arch/x86/kvm/mmu/mmu.c b/arch/x86/kvm/mmu/mmu.c
index 1c4b84e80841..92fe1ba27213 100644
--- a/arch/x86/kvm/mmu/mmu.c
+++ b/arch/x86/kvm/mmu/mmu.c
@@ -5775,8 +5775,8 @@ void kvm_zap_gfn_range(struct kvm *kvm, gfn_t gfn_start, gfn_t gfn_end)

if (is_tdp_mmu_enabled(kvm)) {
for (i = 0; i < KVM_ADDRESS_SPACE_NUM; i++)
- flush = kvm_tdp_mmu_zap_gfn_range(kvm, i, gfn_start,
- gfn_end, flush);
+ flush = kvm_tdp_mmu_zap_leafs(kvm, i, gfn_start,
+ gfn_end, true, flush);
}

if (flush)
diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index b55eb7bec308..42803d5dbbf9 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -848,10 +848,8 @@ bool kvm_tdp_mmu_zap_sp(struct kvm *kvm, struct kvm_mmu_page *sp)
}

/*
- * Tears down the mappings for the range of gfns, [start, end), and frees the
- * non-root pages mapping GFNs strictly within that range. Returns true if
- * SPTEs have been cleared and a TLB flush is needed before releasing the
- * MMU lock.
+ * Zap leaf SPTEs for the range of gfns, [start, end). Returns true if SPTEs
+ * have been cleared and a TLB flush is needed before releasing the MMU lock.
*
* If can_yield is true, will release the MMU lock and reschedule if the
* scheduler needs the CPU or there is contention on the MMU lock. If this
@@ -859,42 +857,25 @@ bool kvm_tdp_mmu_zap_sp(struct kvm *kvm, struct kvm_mmu_page *sp)
* the caller must ensure it does not supply too large a GFN range, or the
* operation can cause a soft lockup.
*/
-static bool zap_gfn_range(struct kvm *kvm, struct kvm_mmu_page *root,
- gfn_t start, gfn_t end, bool can_yield, bool flush)
+static bool tdp_mmu_zap_leafs(struct kvm *kvm, struct kvm_mmu_page *root,
+ gfn_t start, gfn_t end, bool can_yield, bool flush)
{
- bool zap_all = (start == 0 && end >= tdp_mmu_max_gfn_host());
struct tdp_iter iter;

- /*
- * No need to try to step down in the iterator when zapping all SPTEs,
- * zapping the top-level non-leaf SPTEs will recurse on their children.
- */
- int min_level = zap_all ? root->role.level : PG_LEVEL_4K;
-
end = min(end, tdp_mmu_max_gfn_host());

lockdep_assert_held_write(&kvm->mmu_lock);

rcu_read_lock();

- for_each_tdp_pte_min_level(iter, root, min_level, start, end) {
+ for_each_tdp_pte_min_level(iter, root, PG_LEVEL_4K, start, end) {
if (can_yield &&
tdp_mmu_iter_cond_resched(kvm, &iter, flush, false)) {
flush = false;
continue;
}

- if (!is_shadow_present_pte(iter.old_spte))
- continue;
-
- /*
- * If this is a non-last-level SPTE that covers a larger range
- * than should be zapped, continue, and zap the mappings at a
- * lower level, except when zapping all SPTEs.
- */
- if (!zap_all &&
- (iter.gfn < start ||
- iter.gfn + KVM_PAGES_PER_HPAGE(iter.level) > end) &&
+ if (!is_shadow_present_pte(iter.old_spte) ||
!is_last_spte(iter.old_spte, iter.level))
continue;

@@ -912,13 +893,13 @@ static bool zap_gfn_range(struct kvm *kvm, struct kvm_mmu_page *root,
* SPTEs have been cleared and a TLB flush is needed before releasing the
* MMU lock.
*/
-bool __kvm_tdp_mmu_zap_gfn_range(struct kvm *kvm, int as_id, gfn_t start,
- gfn_t end, bool can_yield, bool flush)
+bool kvm_tdp_mmu_zap_leafs(struct kvm *kvm, int as_id, gfn_t start, gfn_t end,
+ bool can_yield, bool flush)
{
struct kvm_mmu_page *root;

for_each_tdp_mmu_root_yield_safe(kvm, root, as_id, false)
- flush = zap_gfn_range(kvm, root, start, end, can_yield, flush);
+ flush = tdp_mmu_zap_leafs(kvm, root, start, end, can_yield, flush);

return flush;
}
@@ -1179,8 +1160,8 @@ int kvm_tdp_mmu_map(struct kvm_vcpu *vcpu, struct kvm_page_fault *fault)
bool kvm_tdp_mmu_unmap_gfn_range(struct kvm *kvm, struct kvm_gfn_range *range,
bool flush)
{
- return __kvm_tdp_mmu_zap_gfn_range(kvm, range->slot->as_id, range->start,
- range->end, range->may_block, flush);
+ return kvm_tdp_mmu_zap_leafs(kvm, range->slot->as_id, range->start,
+ range->end, range->may_block, flush);
}

typedef bool (*tdp_handler_t)(struct kvm *kvm, struct tdp_iter *iter,
diff --git a/arch/x86/kvm/mmu/tdp_mmu.h b/arch/x86/kvm/mmu/tdp_mmu.h
index 5e5ef2576c81..54bc8118c40a 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.h
+++ b/arch/x86/kvm/mmu/tdp_mmu.h
@@ -15,14 +15,8 @@ __must_check static inline bool kvm_tdp_mmu_get_root(struct kvm_mmu_page *root)
void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
bool shared);

-bool __kvm_tdp_mmu_zap_gfn_range(struct kvm *kvm, int as_id, gfn_t start,
+bool kvm_tdp_mmu_zap_leafs(struct kvm *kvm, int as_id, gfn_t start,
gfn_t end, bool can_yield, bool flush);
-static inline bool kvm_tdp_mmu_zap_gfn_range(struct kvm *kvm, int as_id,
- gfn_t start, gfn_t end, bool flush)
-{
- return __kvm_tdp_mmu_zap_gfn_range(kvm, as_id, start, end, true, flush);
-}
-
bool kvm_tdp_mmu_zap_sp(struct kvm *kvm, struct kvm_mmu_page *sp);
void kvm_tdp_mmu_zap_all(struct kvm *kvm);
void kvm_tdp_mmu_invalidate_all_roots(struct kvm *kvm);
--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 01:58:41

by Sean Christopherson

Subject: [PATCH v3 05/28] KVM: x86/mmu: Document that zapping invalidated roots doesn't need to flush

Remove the misleading flush "handling" when zapping invalidated TDP MMU
roots, and document that flushing is unnecessary for all flavors of MMUs
when zapping invalid/obsolete roots/pages. The "handling" in the TDP MMU
is dead code, as zap_gfn_range() is called with shared=true, in which
case it will never return true due to the flushing being handled by
tdp_mmu_zap_spte_atomic().

No functional change intended.

Signed-off-by: Sean Christopherson <[email protected]>
---
arch/x86/kvm/mmu/mmu.c | 10 +++++++---
arch/x86/kvm/mmu/tdp_mmu.c | 15 ++++++++++-----
2 files changed, 17 insertions(+), 8 deletions(-)

diff --git a/arch/x86/kvm/mmu/mmu.c b/arch/x86/kvm/mmu/mmu.c
index 5a931c89d27b..1c4b84e80841 100644
--- a/arch/x86/kvm/mmu/mmu.c
+++ b/arch/x86/kvm/mmu/mmu.c
@@ -5615,9 +5615,13 @@ static void kvm_zap_obsolete_pages(struct kvm *kvm)
}

/*
- * Trigger a remote TLB flush before freeing the page tables to ensure
- * KVM is not in the middle of a lockless shadow page table walk, which
- * may reference the pages.
+ * Kick all vCPUs (via remote TLB flush) before freeing the page tables
+ * to ensure KVM is not in the middle of a lockless shadow page table
+ * walk, which may reference the pages. The remote TLB flush itself is
+ * not required and is simply a convenient way to kick vCPUs as needed.
+ * KVM performs a local TLB flush when allocating a new root (see
+ * kvm_mmu_load()), and the reload in the caller ensures no vCPUs are
+ * running with an obsolete MMU.
*/
kvm_mmu_commit_zap_page(kvm, &kvm->arch.zapped_obsolete_pages);
}
diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index e35bd88d92fd..5994db5d5226 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -843,12 +843,20 @@ void kvm_tdp_mmu_zap_all(struct kvm *kvm)
void kvm_tdp_mmu_zap_invalidated_roots(struct kvm *kvm)
{
struct kvm_mmu_page *root;
- bool flush = false;

lockdep_assert_held_read(&kvm->mmu_lock);

for_each_invalid_tdp_mmu_root_yield_safe(kvm, root) {
- flush = zap_gfn_range(kvm, root, 0, -1ull, true, flush, true);
+ /*
+ * A TLB flush is unnecessary, invalidated roots are guaranteed
+ * to be unreachable by the guest (see kvm_tdp_mmu_put_root()
+ * for more details), and unlike the legacy MMU, no vCPU kick
+ * is needed to play nice with lockless shadow walks as the TDP
+ * MMU protects its paging structures via RCU. Note, zapping
+ * will still flush on yield, but that's a minor performance
+ * blip and not a functional issue.
+ */
+ (void)zap_gfn_range(kvm, root, 0, -1ull, true, false, true);

/*
* Put the reference acquired in kvm_tdp_mmu_invalidate_roots().
@@ -856,9 +864,6 @@ void kvm_tdp_mmu_zap_invalidated_roots(struct kvm *kvm)
*/
kvm_tdp_mmu_put_root(kvm, root, true);
}
-
- if (flush)
- kvm_flush_remote_tlbs(kvm);
}

/*
--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 02:05:38

by Sean Christopherson

Subject: [PATCH v3 24/28] KVM: x86/mmu: WARN on any attempt to atomically update REMOVED SPTE

Disallow calling tdp_mmu_set_spte_atomic() with a REMOVED "old" SPTE.
This solves a conundrum introduced by commit 3255530ab191 ("KVM: x86/mmu:
Automatically update iter->old_spte if cmpxchg fails"); if the helper
doesn't update old_spte in the REMOVED case, then theoretically the
caller could get stuck in an infinite loop as it will fail indefinitely
on the REMOVED SPTE. E.g. until recently, clear_dirty_gfn_range() didn't
check for a present SPTE and would have spun until getting rescheduled.

In practice, only the page fault path should "create" a new SPTE, all
other paths should only operate on existing, a.k.a. shadow present,
SPTEs. Now that the page fault path pre-checks for a REMOVED SPTE in all
cases, require all other paths to indirectly pre-check by verifying the
target SPTE is a shadow-present SPTE.

Note, this does not guarantee the actual SPTE isn't REMOVED, nor is that
scenario disallowed. The invariant is only that the caller mustn't
invoke tdp_mmu_set_spte_atomic() if the SPTE was REMOVED when last
observed by the caller.
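
As a hedged illustration of the resulting calling convention ('kvm', 'root',
'start', 'end' and a computed 'new_spte' are assumed):

struct tdp_iter iter;

/*
 * Paths other than the page fault handler operate only on SPTEs they
 * have already observed as shadow-present, which by definition excludes
 * the REMOVED marker, so the WARN in tdp_mmu_set_spte_atomic() is never
 * tripped by a known-REMOVED "old" SPTE.
 */
tdp_root_for_each_pte(iter, root, start, end) {
	if (!is_shadow_present_pte(iter.old_spte))
		continue;	/* skips both !PRESENT and REMOVED SPTEs */

	if (tdp_mmu_set_spte_atomic(kvm, &iter, new_spte))
		continue;	/* lost a race; iter.old_spte was refreshed */
}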

Cc: David Matlack <[email protected]>
Signed-off-by: Sean Christopherson <[email protected]>
---
arch/x86/kvm/mmu/tdp_mmu.c | 15 +++++++--------
1 file changed, 7 insertions(+), 8 deletions(-)

diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index 1acd12bf309f..d223870b3790 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -634,16 +634,15 @@ static inline int tdp_mmu_set_spte_atomic(struct kvm *kvm,
u64 *sptep = rcu_dereference(iter->sptep);
u64 old_spte;

- WARN_ON_ONCE(iter->yielded);
-
- lockdep_assert_held_read(&kvm->mmu_lock);
-
/*
- * Do not change removed SPTEs. Only the thread that froze the SPTE
- * may modify it.
+ * The caller is responsible for ensuring the old SPTE is not a REMOVED
+ * SPTE. KVM should never attempt to zap or manipulate a REMOVED SPTE,
+ * and pre-checking before inserting a new SPTE is advantageous as it
+ * avoids unnecessary work.
*/
- if (is_removed_spte(iter->old_spte))
- return -EBUSY;
+ WARN_ON_ONCE(iter->yielded || is_removed_spte(iter->old_spte));
+
+ lockdep_assert_held_read(&kvm->mmu_lock);

/*
* Note, fast_pf_fix_direct_spte() can also modify TDP MMU SPTEs and
--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 02:09:08

by Sean Christopherson

Subject: [PATCH v3 16/28] KVM: x86/mmu: Require mmu_lock be held for write to zap TDP MMU range

Now that all callers of zap_gfn_range() hold mmu_lock for write, drop
support for zapping with mmu_lock held for read. That all callers hold
mmu_lock for write isn't a random coincidence; now that the paths that
need to zap _everything_ have their own path, the only callers left are
those that need to zap for functional correctness. And when zapping is
required for functional correctness, mmu_lock must be held for write,
otherwise the caller has no guarantees about the state of the TDP MMU
page tables after it has run, e.g. the SPTE(s) it zapped can be
immediately replaced by a vCPU faulting in a page.

Signed-off-by: Sean Christopherson <[email protected]>
Reviewed-by: Ben Gardon <[email protected]>
---
arch/x86/kvm/mmu/tdp_mmu.c | 24 ++++++------------------
1 file changed, 6 insertions(+), 18 deletions(-)

diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index c5df9a552470..b55eb7bec308 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -858,15 +858,9 @@ bool kvm_tdp_mmu_zap_sp(struct kvm *kvm, struct kvm_mmu_page *sp)
* function cannot yield, it will not release the MMU lock or reschedule and
* the caller must ensure it does not supply too large a GFN range, or the
* operation can cause a soft lockup.
- *
- * If shared is true, this thread holds the MMU lock in read mode and must
- * account for the possibility that other threads are modifying the paging
- * structures concurrently. If shared is false, this thread should hold the
- * MMU lock in write mode.
*/
static bool zap_gfn_range(struct kvm *kvm, struct kvm_mmu_page *root,
- gfn_t start, gfn_t end, bool can_yield, bool flush,
- bool shared)
+ gfn_t start, gfn_t end, bool can_yield, bool flush)
{
bool zap_all = (start == 0 && end >= tdp_mmu_max_gfn_host());
struct tdp_iter iter;
@@ -879,14 +873,13 @@ static bool zap_gfn_range(struct kvm *kvm, struct kvm_mmu_page *root,

end = min(end, tdp_mmu_max_gfn_host());

- kvm_lockdep_assert_mmu_lock_held(kvm, shared);
+ lockdep_assert_held_write(&kvm->mmu_lock);

rcu_read_lock();

for_each_tdp_pte_min_level(iter, root, min_level, start, end) {
-retry:
if (can_yield &&
- tdp_mmu_iter_cond_resched(kvm, &iter, flush, shared)) {
+ tdp_mmu_iter_cond_resched(kvm, &iter, flush, false)) {
flush = false;
continue;
}
@@ -905,12 +898,8 @@ static bool zap_gfn_range(struct kvm *kvm, struct kvm_mmu_page *root,
!is_last_spte(iter.old_spte, iter.level))
continue;

- if (!shared) {
- tdp_mmu_set_spte(kvm, &iter, 0);
- flush = true;
- } else if (tdp_mmu_zap_spte_atomic(kvm, &iter)) {
- goto retry;
- }
+ tdp_mmu_set_spte(kvm, &iter, 0);
+ flush = true;
}

rcu_read_unlock();
@@ -929,8 +918,7 @@ bool __kvm_tdp_mmu_zap_gfn_range(struct kvm *kvm, int as_id, gfn_t start,
struct kvm_mmu_page *root;

for_each_tdp_mmu_root_yield_safe(kvm, root, as_id, false)
- flush = zap_gfn_range(kvm, root, start, end, can_yield, flush,
- false);
+ flush = zap_gfn_range(kvm, root, start, end, can_yield, flush);

return flush;
}
--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 02:13:03

by Sean Christopherson

Subject: [PATCH v3 06/28] KVM: x86/mmu: Require mmu_lock be held for write in unyielding root iter

Assert that mmu_lock is held for write by users of the yield-unfriendly
TDP iterator. The nature of a shared walk means that the caller needs to
play nice with other tasks modifying the page tables, which is more or
less the same thing as playing nice with yielding. Theoretically, KVM
could gain a flow where it could legitimately take mmu_lock for read in
a non-preemptible context, but that's highly unlikely and any such case
should be viewed with a fair amount of scrutiny.

Signed-off-by: Sean Christopherson <[email protected]>
---
arch/x86/kvm/mmu/tdp_mmu.c | 21 +++++++++++++++------
1 file changed, 15 insertions(+), 6 deletions(-)

diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index 5994db5d5226..189f21e71c36 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -29,13 +29,16 @@ bool kvm_mmu_init_tdp_mmu(struct kvm *kvm)
return true;
}

-static __always_inline void kvm_lockdep_assert_mmu_lock_held(struct kvm *kvm,
+/* Arbitrarily returns true so that this may be used in if statements. */
+static __always_inline bool kvm_lockdep_assert_mmu_lock_held(struct kvm *kvm,
bool shared)
{
if (shared)
lockdep_assert_held_read(&kvm->mmu_lock);
else
lockdep_assert_held_write(&kvm->mmu_lock);
+
+ return true;
}

void kvm_mmu_uninit_tdp_mmu(struct kvm *kvm)
@@ -187,11 +190,17 @@ static struct kvm_mmu_page *tdp_mmu_next_root(struct kvm *kvm,
#define for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared) \
__for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared, ALL_ROOTS)

-#define for_each_tdp_mmu_root(_kvm, _root, _as_id) \
- list_for_each_entry_rcu(_root, &_kvm->arch.tdp_mmu_roots, link, \
- lockdep_is_held_type(&kvm->mmu_lock, 0) || \
- lockdep_is_held(&kvm->arch.tdp_mmu_pages_lock)) \
- if (kvm_mmu_page_as_id(_root) != _as_id) { \
+/*
+ * Iterate over all TDP MMU roots. Requires that mmu_lock be held for write,
+ * the implication being that any flow that holds mmu_lock for read is
+ * inherently yield-friendly and should use the yield-safe variant above.
+ * Holding mmu_lock for write obviates the need for RCU protection as the list
+ * is guaranteed to be stable.
+ */
+#define for_each_tdp_mmu_root(_kvm, _root, _as_id) \
+ list_for_each_entry(_root, &_kvm->arch.tdp_mmu_roots, link) \
+ if (kvm_lockdep_assert_mmu_lock_held(_kvm, false) && \
+ kvm_mmu_page_as_id(_root) != _as_id) { \
} else

static struct kvm_mmu_page *tdp_mmu_alloc_sp(struct kvm_vcpu *vcpu)
--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 02:16:15

by Sean Christopherson

Subject: [PATCH v3 13/28] KVM: x86/mmu: Zap only the target TDP MMU shadow page in NX recovery

When recovering a potential hugepage that was shattered for the iTLB
multihit workaround, precisely zap only the target page instead of
iterating over the TDP MMU to find the SP that was passed in. This will
allow future simplification of zap_gfn_range() by having it zap only
leaf SPTEs.
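
For context, a rough sketch of how the NX recovery loop consumes the new
helper (not the literal mmu.c code, which this patch does not touch):

if (is_tdp_mmu_page(sp))
	flush |= kvm_tdp_mmu_zap_sp(kvm, sp);
else
	kvm_mmu_prepare_zap_page(kvm, sp, &invalid_list);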

Signed-off-by: Sean Christopherson <[email protected]>
---
arch/x86/kvm/mmu/mmu_internal.h | 7 ++++++-
arch/x86/kvm/mmu/tdp_iter.h | 2 --
arch/x86/kvm/mmu/tdp_mmu.c | 36 +++++++++++++++++++++++++++++----
arch/x86/kvm/mmu/tdp_mmu.h | 18 +----------------
4 files changed, 39 insertions(+), 24 deletions(-)

diff --git a/arch/x86/kvm/mmu/mmu_internal.h b/arch/x86/kvm/mmu/mmu_internal.h
index da6166b5c377..be063b6c91b7 100644
--- a/arch/x86/kvm/mmu/mmu_internal.h
+++ b/arch/x86/kvm/mmu/mmu_internal.h
@@ -30,6 +30,8 @@ extern bool dbg;
#define INVALID_PAE_ROOT 0
#define IS_VALID_PAE_ROOT(x) (!!(x))

+typedef u64 __rcu *tdp_ptep_t;
+
struct kvm_mmu_page {
/*
* Note, "link" through "spt" fit in a single 64 byte cache line on
@@ -59,7 +61,10 @@ struct kvm_mmu_page {
refcount_t tdp_mmu_root_count;
};
unsigned int unsync_children;
- struct kvm_rmap_head parent_ptes; /* rmap pointers to parent sptes */
+ union {
+ struct kvm_rmap_head parent_ptes; /* rmap pointers to parent sptes */
+ tdp_ptep_t ptep;
+ };
DECLARE_BITMAP(unsync_child_bitmap, 512);

struct list_head lpage_disallowed_link;
diff --git a/arch/x86/kvm/mmu/tdp_iter.h b/arch/x86/kvm/mmu/tdp_iter.h
index bb9b581f1ee4..e2a7e267a77d 100644
--- a/arch/x86/kvm/mmu/tdp_iter.h
+++ b/arch/x86/kvm/mmu/tdp_iter.h
@@ -7,8 +7,6 @@

#include "mmu.h"

-typedef u64 __rcu *tdp_ptep_t;
-
/*
* TDP MMU SPTEs are RCU protected to allow paging structures (non-leaf SPTEs)
* to be zapped while holding mmu_lock for read. Holding RCU isn't required for
diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index 9e8ba6f12ebf..c231b60e1726 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -213,13 +213,14 @@ static struct kvm_mmu_page *tdp_mmu_alloc_sp(struct kvm_vcpu *vcpu)
return sp;
}

-static void tdp_mmu_init_sp(struct kvm_mmu_page *sp, gfn_t gfn,
- union kvm_mmu_page_role role)
+static void tdp_mmu_init_sp(struct kvm_mmu_page *sp, tdp_ptep_t sptep,
+ gfn_t gfn, union kvm_mmu_page_role role)
{
set_page_private(virt_to_page(sp->spt), (unsigned long)sp);

sp->role = role;
sp->gfn = gfn;
+ sp->ptep = sptep;
sp->tdp_mmu_page = true;

trace_kvm_mmu_get_page(sp, true);
@@ -236,7 +237,7 @@ static void tdp_mmu_init_child_sp(struct kvm_mmu_page *child_sp,
role = parent_sp->role;
role.level--;

- tdp_mmu_init_sp(child_sp, iter->gfn, role);
+ tdp_mmu_init_sp(child_sp, iter->sptep, iter->gfn, role);
}

hpa_t kvm_tdp_mmu_get_vcpu_root_hpa(struct kvm_vcpu *vcpu)
@@ -258,7 +259,7 @@ hpa_t kvm_tdp_mmu_get_vcpu_root_hpa(struct kvm_vcpu *vcpu)
}

root = tdp_mmu_alloc_sp(vcpu);
- tdp_mmu_init_sp(root, 0, role);
+ tdp_mmu_init_sp(root, NULL, 0, role);

refcount_set(&root->tdp_mmu_root_count, 1);

@@ -750,6 +751,33 @@ static inline bool __must_check tdp_mmu_iter_cond_resched(struct kvm *kvm,
return iter->yielded;
}

+bool kvm_tdp_mmu_zap_sp(struct kvm *kvm, struct kvm_mmu_page *sp)
+{
+ u64 old_spte;
+
+ /*
+ * This helper intentionally doesn't allow zapping a root shadow page,
+ * which doesn't have a parent page table and thus no associated entry.
+ */
+ if (WARN_ON_ONCE(!sp->ptep))
+ return false;
+
+ rcu_read_lock();
+
+ old_spte = kvm_tdp_mmu_read_spte(sp->ptep);
+ if (WARN_ON_ONCE(!is_shadow_present_pte(old_spte))) {
+ rcu_read_unlock();
+ return false;
+ }
+
+ __tdp_mmu_set_spte(kvm, kvm_mmu_page_as_id(sp), sp->ptep, old_spte, 0,
+ sp->gfn, sp->role.level + 1, true, true);
+
+ rcu_read_unlock();
+
+ return true;
+}
+
/*
* Tears down the mappings for the range of gfns, [start, end), and frees the
* non-root pages mapping GFNs strictly within that range. Returns true if
diff --git a/arch/x86/kvm/mmu/tdp_mmu.h b/arch/x86/kvm/mmu/tdp_mmu.h
index 57c73d8f76ce..5e5ef2576c81 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.h
+++ b/arch/x86/kvm/mmu/tdp_mmu.h
@@ -22,24 +22,8 @@ static inline bool kvm_tdp_mmu_zap_gfn_range(struct kvm *kvm, int as_id,
{
return __kvm_tdp_mmu_zap_gfn_range(kvm, as_id, start, end, true, flush);
}
-static inline bool kvm_tdp_mmu_zap_sp(struct kvm *kvm, struct kvm_mmu_page *sp)
-{
- gfn_t end = sp->gfn + KVM_PAGES_PER_HPAGE(sp->role.level + 1);
-
- /*
- * Don't allow yielding, as the caller may have a flush pending. Note,
- * if mmu_lock is held for write, zapping will never yield in this case,
- * but explicitly disallow it for safety. The TDP MMU does not yield
- * until it has made forward progress (steps sideways), and when zapping
- * a single shadow page that it's guaranteed to see (thus the mmu_lock
- * requirement), its "step sideways" will always step beyond the bounds
- * of the shadow page's gfn range and stop iterating before yielding.
- */
- lockdep_assert_held_write(&kvm->mmu_lock);
- return __kvm_tdp_mmu_zap_gfn_range(kvm, kvm_mmu_page_as_id(sp),
- sp->gfn, end, false, false);
-}

+bool kvm_tdp_mmu_zap_sp(struct kvm *kvm, struct kvm_mmu_page *sp);
void kvm_tdp_mmu_zap_all(struct kvm *kvm);
void kvm_tdp_mmu_invalidate_all_roots(struct kvm *kvm);
void kvm_tdp_mmu_zap_invalidated_roots(struct kvm *kvm);
--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 02:18:44

by Sean Christopherson

Subject: [PATCH v3 27/28] KVM: selftests: Define cpu_relax() helpers for s390 and x86

Add cpu_relax() for s390 and x86 for use in arch-agnostic tests. arm64
already defines its own version.
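
A usage illustration (hypothetical snippet, not part of the patch): an
arch-agnostic busy-wait in a selftest can now be written as:

/* Spin until another thread flips 'ready', yielding pipeline resources. */
while (!READ_ONCE(ready))
	cpu_relax();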

Signed-off-by: Sean Christopherson <[email protected]>
---
tools/testing/selftests/kvm/include/s390x/processor.h | 8 ++++++++
tools/testing/selftests/kvm/include/x86_64/processor.h | 5 +++++
2 files changed, 13 insertions(+)

diff --git a/tools/testing/selftests/kvm/include/s390x/processor.h b/tools/testing/selftests/kvm/include/s390x/processor.h
index e0e96a5f608c..255c9b990f4c 100644
--- a/tools/testing/selftests/kvm/include/s390x/processor.h
+++ b/tools/testing/selftests/kvm/include/s390x/processor.h
@@ -5,6 +5,8 @@
#ifndef SELFTEST_KVM_PROCESSOR_H
#define SELFTEST_KVM_PROCESSOR_H

+#include <linux/compiler.h>
+
/* Bits in the region/segment table entry */
#define REGION_ENTRY_ORIGIN ~0xfffUL /* region/segment table origin */
#define REGION_ENTRY_PROTECT 0x200 /* region protection bit */
@@ -19,4 +21,10 @@
#define PAGE_PROTECT 0x200 /* HW read-only bit */
#define PAGE_NOEXEC 0x100 /* HW no-execute bit */

+/* Is there a portable way to do this? */
+static inline void cpu_relax(void)
+{
+ barrier();
+}
+
#endif
diff --git a/tools/testing/selftests/kvm/include/x86_64/processor.h b/tools/testing/selftests/kvm/include/x86_64/processor.h
index 8a470da7b71a..37db341d4cc5 100644
--- a/tools/testing/selftests/kvm/include/x86_64/processor.h
+++ b/tools/testing/selftests/kvm/include/x86_64/processor.h
@@ -363,6 +363,11 @@ static inline unsigned long get_xmm(int n)
return 0;
}

+static inline void cpu_relax(void)
+{
+ asm volatile("rep; nop" ::: "memory");
+}
+
bool is_intel_cpu(void);
bool is_amd_cpu(void);

--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 02:19:30

by Sean Christopherson

Subject: [PATCH v3 23/28] KVM: x86/mmu: Check for a REMOVED leaf SPTE before making the SPTE

Explicitly check for a REMOVED leaf SPTE prior to attempting to map
the final SPTE when handling a TDP MMU fault. Functionally, this is a
nop as tdp_mmu_set_spte_atomic() will eventually detect the frozen SPTE.
Pre-checking for a REMOVED SPTE is a minor optimization, but the real goal
is to allow tdp_mmu_set_spte_atomic() to have an invariant that the "old"
SPTE is never a REMOVED SPTE.

Signed-off-by: Sean Christopherson <[email protected]>
---
arch/x86/kvm/mmu/tdp_mmu.c | 6 +++++-
1 file changed, 5 insertions(+), 1 deletion(-)

diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index 4151e61245a7..1acd12bf309f 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -1250,7 +1250,11 @@ int kvm_tdp_mmu_map(struct kvm_vcpu *vcpu, struct kvm_page_fault *fault)
}
}

- if (iter.level != fault->goal_level) {
+ /*
+ * Force the guest to retry the access if the upper level SPTEs aren't
+ * in place, or if the target leaf SPTE is frozen by another CPU.
+ */
+ if (iter.level != fault->goal_level || is_removed_spte(iter.old_spte)) {
rcu_read_unlock();
return RET_PF_RETRY;
}
--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 02:19:40

by Sean Christopherson

Subject: [PATCH v3 09/28] KVM: x86/mmu: Drop RCU after processing each root in MMU notifier hooks

Drop RCU protection after processing each root when handling MMU notifier
hooks that aren't the "unmap" path, i.e. aren't zapping. Temporarily
drop RCU to let RCU do its thing between roots, and to make it clear that
there's no special behavior that relies on holding RCU across all roots.

Currently, the RCU protection is completely superficial, it's necessary
only to make rcu_dereference() of SPTE pointers happy. A future patch
will rely on holding RCU as a proxy for vCPUs in the guest, e.g. to
ensure shadow pages aren't freed before all vCPUs do a TLB flush (or
rather, acknowledge the need for a flush), but in that case RCU needs to
be held until the flush is complete if and only if the flush is needed
because a shadow page may have been removed. And except for the "unmap"
path, MMU notifier events cannot remove SPs (don't toggle PRESENT bit,
and can't change the PFN for a SP).

Signed-off-by: Sean Christopherson <[email protected]>
Reviewed-by: Ben Gardon <[email protected]>
---
arch/x86/kvm/mmu/tdp_mmu.c | 8 ++++----
1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index 634a2838e117..4f460782a848 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -1100,18 +1100,18 @@ static __always_inline bool kvm_tdp_mmu_handle_gfn(struct kvm *kvm,
struct tdp_iter iter;
bool ret = false;

- rcu_read_lock();
-
/*
* Don't support rescheduling, none of the MMU notifiers that funnel
* into this helper allow blocking; it'd be dead, wasteful code.
*/
for_each_tdp_mmu_root(kvm, root, range->slot->as_id) {
+ rcu_read_lock();
+
tdp_root_for_each_leaf_pte(iter, root, range->start, range->end)
ret |= handler(kvm, &iter, range);
- }

- rcu_read_unlock();
+ rcu_read_unlock();
+ }

return ret;
}
--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 02:28:20

by Sean Christopherson

Subject: [PATCH v3 03/28] KVM: x86/mmu: Fix wrong/misleading comments in TDP MMU fast zap

Fix misleading and arguably wrong comments in the TDP MMU's fast zap
flow. The comments, and the fact that actually zapping invalid roots was
added separately, strongly suggests that zapping invalid roots is an
optimization and not required for correctness. That is a lie.

KVM _must_ zap invalid roots before returning from kvm_mmu_zap_all_fast(),
because when it's called from kvm_mmu_invalidate_zap_pages_in_memslot(),
KVM is relying on it to fully remove all references to the memslot. Once
the memslot is gone, KVM's mmu_notifier hooks will be unable to find the
stale references as the hva=>gfn translation is done via the memslots.
If KVM doesn't immediately zap SPTEs and userspace unmaps a range after
deleting a memslot, KVM will fail to zap in response to the mmu_notifier
due to not finding a memslot corresponding to the notifier's range, which
leads to a variation of use-after-free.

The other misleading comment (and code) explicitly states that roots
without a reference should be skipped. While that's technically true,
it's also extremely misleading as it should be impossible for KVM to
encounter a defunct root on the list while holding mmu_lock for write.
Opportunistically add a WARN to enforce that invariant.

Fixes: b7cccd397f31 ("KVM: x86/mmu: Fast invalidation for TDP MMU")
Fixes: 4c6654bd160d ("KVM: x86/mmu: Tear down roots before kvm_mmu_zap_all_fast returns")
Signed-off-by: Sean Christopherson <[email protected]>
---
arch/x86/kvm/mmu/mmu.c | 8 +++++++
arch/x86/kvm/mmu/tdp_mmu.c | 46 +++++++++++++++++++++-----------------
2 files changed, 33 insertions(+), 21 deletions(-)

diff --git a/arch/x86/kvm/mmu/mmu.c b/arch/x86/kvm/mmu/mmu.c
index b2c1c4eb6007..80607513a1f2 100644
--- a/arch/x86/kvm/mmu/mmu.c
+++ b/arch/x86/kvm/mmu/mmu.c
@@ -5662,6 +5662,14 @@ static void kvm_mmu_zap_all_fast(struct kvm *kvm)

write_unlock(&kvm->mmu_lock);

+ /*
+ * Zap the invalidated TDP MMU roots, all SPTEs must be dropped before
+ * returning to the caller, e.g. if the zap is in response to a memslot
+ * deletion, mmu_notifier callbacks will be unable to reach the SPTEs
+ * associated with the deleted memslot once the update completes, and
+ * Deferring the zap until the final reference to the root is put would
+ * lead to use-after-free.
+ */
if (is_tdp_mmu_enabled(kvm)) {
read_lock(&kvm->mmu_lock);
kvm_tdp_mmu_zap_invalidated_roots(kvm);
diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index 9357780ec28f..12866113fb4f 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -826,12 +826,11 @@ void kvm_tdp_mmu_zap_all(struct kvm *kvm)
}

/*
- * Since kvm_tdp_mmu_zap_all_fast has acquired a reference to each
- * invalidated root, they will not be freed until this function drops the
- * reference. Before dropping that reference, tear down the paging
- * structure so that whichever thread does drop the last reference
- * only has to do a trivial amount of work. Since the roots are invalid,
- * no new SPTEs should be created under them.
+ * Zap all invalidated roots to ensure all SPTEs are dropped before the "fast
+ * zap" completes. Since kvm_tdp_mmu_invalidate_all_roots() has acquired a
+ * reference to each invalidated root, roots will not be freed until after this
+ * function drops the gifted reference, e.g. so that vCPUs don't get stuck with
+ * tearing down paging structures.
*/
void kvm_tdp_mmu_zap_invalidated_roots(struct kvm *kvm)
{
@@ -855,21 +854,25 @@ void kvm_tdp_mmu_zap_invalidated_roots(struct kvm *kvm)
}

/*
- * Mark each TDP MMU root as invalid so that other threads
- * will drop their references and allow the root count to
- * go to 0.
+ * Mark each TDP MMU root as invalid to prevent vCPUs from reusing a root that
+ * is about to be zapped, e.g. in response to a memslots update. The caller is
+ * responsible for invoking kvm_tdp_mmu_zap_invalidated_roots() to do the actual
+ * zapping.
*
- * Also take a reference on all roots so that this thread
- * can do the bulk of the work required to free the roots
- * once they are invalidated. Without this reference, a
- * vCPU thread might drop the last reference to a root and
- * get stuck with tearing down the entire paging structure.
+ * Take a reference on all roots to prevent the root from being freed before it
+ * is zapped by this thread. Freeing a root is not a correctness issue, but if
+ * a vCPU drops the last reference to a root prior to the root being zapped, it
+ * will get stuck with tearing down the entire paging structure.
*
- * Roots which have a zero refcount should be skipped as
- * they're already being torn down.
- * Already invalid roots should be referenced again so that
- * they aren't freed before kvm_tdp_mmu_zap_all_fast is
- * done with them.
+ * Get a reference even if the root is already invalid,
+ * kvm_tdp_mmu_zap_invalidated_roots() assumes it was gifted a reference to all
+ * invalid roots, e.g. there's no epoch to identify roots that were invalidated
+ * by a previous call. Roots stay on the list until the last reference is
+ * dropped, so even though all invalid roots are zapped, a root may not go away
+ * for quite some time, e.g. if a vCPU blocks across multiple memslot updates.
+ *
+ * Because mmu_lock is held for write, it should be impossible to observe a
+ * root with zero refcount, i.e. the list of roots cannot be stale.
*
* This has essentially the same effect for the TDP MMU
* as updating mmu_valid_gen does for the shadow MMU.
@@ -879,9 +882,10 @@ void kvm_tdp_mmu_invalidate_all_roots(struct kvm *kvm)
struct kvm_mmu_page *root;

lockdep_assert_held_write(&kvm->mmu_lock);
- list_for_each_entry(root, &kvm->arch.tdp_mmu_roots, link)
- if (refcount_inc_not_zero(&root->tdp_mmu_root_count))
+ list_for_each_entry(root, &kvm->arch.tdp_mmu_roots, link) {
+ if (!WARN_ON_ONCE(!kvm_tdp_mmu_get_root(root)))
root->role.invalid = true;
+ }
}

/*
--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 02:28:31

by Sean Christopherson

[permalink] [raw]
Subject: [PATCH v3 07/28] KVM: x86/mmu: Check for !leaf=>leaf, not PFN change, in TDP MMU SP removal

Look for a !leaf=>leaf conversion instead of a PFN change when checking
if a SPTE change removed a TDP MMU shadow page. Convert the PFN check
into a WARN, as KVM should never change the PFN of a shadow page (except
when it's being zapped or replaced).

From a purely theoretical perspective, it's not illegal to replace a SP
with a hugepage pointing at the same PFN. In practice, it's impossible
as that would require mapping guest memory overtop a kernel-allocated SP.
Either way, the check is odd.

Signed-off-by: Sean Christopherson <[email protected]>
---
arch/x86/kvm/mmu/tdp_mmu.c | 7 +++++--
1 file changed, 5 insertions(+), 2 deletions(-)

diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index 189f21e71c36..848448b65703 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -505,9 +505,12 @@ static void __handle_changed_spte(struct kvm *kvm, int as_id, gfn_t gfn,

/*
* Recursively handle child PTs if the change removed a subtree from
- * the paging structure.
+ * the paging structure. Note the WARN on the PFN changing without the
+ * SPTE being converted to a hugepage (leaf) or being zapped. Shadow
+ * pages are kernel allocations and should never be migrated.
*/
- if (was_present && !was_leaf && (pfn_changed || !is_present))
+ if (was_present && !was_leaf &&
+ (is_leaf || !is_present || WARN_ON_ONCE(pfn_changed)))
handle_removed_pt(kvm, spte_to_child_pt(old_spte, level), shared);
}

--
2.35.1.574.g5d30c73bfb-goog
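
As a reading aid, the new condition can be spelled out as a standalone
predicate. This is only an illustrative sketch, not part of the patch; the
helper name and boolean parameters are invented for clarity, the real logic
lives inline in __handle_changed_spte():

static bool child_pt_was_removed(bool was_present, bool was_leaf,
				 bool is_present, bool is_leaf,
				 bool pfn_changed)
{
	/* Only a present non-leaf SPTE can point at a child page table. */
	if (!was_present || was_leaf)
		return false;

	/*
	 * The child page table is gone if the SPTE was converted to a leaf
	 * (hugepage) or zapped.  A PFN change without either of those is a
	 * KVM bug, as shadow pages are kernel allocations and are never
	 * migrated, hence the WARN in the patch.
	 */
	return is_leaf || !is_present || WARN_ON_ONCE(pfn_changed);
}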

2022-02-26 02:32:24

by Sean Christopherson

[permalink] [raw]
Subject: [PATCH v3 08/28] KVM: x86/mmu: Batch TLB flushes from TDP MMU for MMU notifier change_spte

Batch TLB flushes (with other MMUs) when handling ->change_spte()
notifications in the TDP MMU. The MMU notifier path in question doesn't
allow yielding and correctly flushes before dropping mmu_lock.

Signed-off-by: Sean Christopherson <[email protected]>
Reviewed-by: Ben Gardon <[email protected]>
---
arch/x86/kvm/mmu/tdp_mmu.c | 13 ++++++-------
1 file changed, 6 insertions(+), 7 deletions(-)

diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index 848448b65703..634a2838e117 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -1203,13 +1203,12 @@ static bool set_spte_gfn(struct kvm *kvm, struct tdp_iter *iter,
*/
bool kvm_tdp_mmu_set_spte_gfn(struct kvm *kvm, struct kvm_gfn_range *range)
{
- bool flush = kvm_tdp_mmu_handle_gfn(kvm, range, set_spte_gfn);
-
- /* FIXME: return 'flush' instead of flushing here. */
- if (flush)
- kvm_flush_remote_tlbs_with_address(kvm, range->start, 1);
-
- return false;
+ /*
+ * No need to handle the remote TLB flush under RCU protection, the
+ * target SPTE _must_ be a leaf SPTE, i.e. cannot result in freeing a
+ * shadow page. See the WARN on pfn_changed in __handle_changed_spte().
+ */
+ return kvm_tdp_mmu_handle_gfn(kvm, range, set_spte_gfn);
}

/*
--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 02:34:38

by Sean Christopherson

[permalink] [raw]
Subject: [PATCH v3 10/28] KVM: x86/mmu: Add helpers to read/write TDP MMU SPTEs and document RCU

Add helpers to read and write TDP MMU SPTEs instead of open coding
rcu_dereference() all over the place, and to provide a convenient
location to document why KVM doesn't exempt holding mmu_lock for write
from having to hold RCU (and any future changes to the rules).

No functional change intended.

Signed-off-by: Sean Christopherson <[email protected]>
Reviewed-by: Ben Gardon <[email protected]>
---
arch/x86/kvm/mmu/tdp_iter.c | 6 +++---
arch/x86/kvm/mmu/tdp_iter.h | 16 ++++++++++++++++
arch/x86/kvm/mmu/tdp_mmu.c | 6 +++---
3 files changed, 22 insertions(+), 6 deletions(-)

diff --git a/arch/x86/kvm/mmu/tdp_iter.c b/arch/x86/kvm/mmu/tdp_iter.c
index be3f096db2eb..6d3b3e5a5533 100644
--- a/arch/x86/kvm/mmu/tdp_iter.c
+++ b/arch/x86/kvm/mmu/tdp_iter.c
@@ -12,7 +12,7 @@ static void tdp_iter_refresh_sptep(struct tdp_iter *iter)
{
iter->sptep = iter->pt_path[iter->level - 1] +
SHADOW_PT_INDEX(iter->gfn << PAGE_SHIFT, iter->level);
- iter->old_spte = READ_ONCE(*rcu_dereference(iter->sptep));
+ iter->old_spte = kvm_tdp_mmu_read_spte(iter->sptep);
}

static gfn_t round_gfn_for_level(gfn_t gfn, int level)
@@ -89,7 +89,7 @@ static bool try_step_down(struct tdp_iter *iter)
* Reread the SPTE before stepping down to avoid traversing into page
* tables that are no longer linked from this entry.
*/
- iter->old_spte = READ_ONCE(*rcu_dereference(iter->sptep));
+ iter->old_spte = kvm_tdp_mmu_read_spte(iter->sptep);

child_pt = spte_to_child_pt(iter->old_spte, iter->level);
if (!child_pt)
@@ -123,7 +123,7 @@ static bool try_step_side(struct tdp_iter *iter)
iter->gfn += KVM_PAGES_PER_HPAGE(iter->level);
iter->next_last_level_gfn = iter->gfn;
iter->sptep++;
- iter->old_spte = READ_ONCE(*rcu_dereference(iter->sptep));
+ iter->old_spte = kvm_tdp_mmu_read_spte(iter->sptep);

return true;
}
diff --git a/arch/x86/kvm/mmu/tdp_iter.h b/arch/x86/kvm/mmu/tdp_iter.h
index 216ebbe76ddd..bb9b581f1ee4 100644
--- a/arch/x86/kvm/mmu/tdp_iter.h
+++ b/arch/x86/kvm/mmu/tdp_iter.h
@@ -9,6 +9,22 @@

typedef u64 __rcu *tdp_ptep_t;

+/*
+ * TDP MMU SPTEs are RCU protected to allow paging structures (non-leaf SPTEs)
+ * to be zapped while holding mmu_lock for read. Holding RCU isn't required for
+ * correctness if mmu_lock is held for write, but plumbing "struct kvm" down to
+ * the lower depths of the TDP MMU just to make lockdep happy is a nightmare, so
+ * all accesses to SPTEs are done under RCU protection.
+ */
+static inline u64 kvm_tdp_mmu_read_spte(tdp_ptep_t sptep)
+{
+ return READ_ONCE(*rcu_dereference(sptep));
+}
+static inline void kvm_tdp_mmu_write_spte(tdp_ptep_t sptep, u64 val)
+{
+ WRITE_ONCE(*rcu_dereference(sptep), val);
+}
+
/*
* A TDP iterator performs a pre-order walk over a TDP paging structure.
*/
diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index 4f460782a848..8fbf3364f116 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -609,7 +609,7 @@ static inline int tdp_mmu_zap_spte_atomic(struct kvm *kvm,
* here since the SPTE is going from non-present
* to non-present.
*/
- WRITE_ONCE(*rcu_dereference(iter->sptep), 0);
+ kvm_tdp_mmu_write_spte(iter->sptep, 0);

return 0;
}
@@ -648,7 +648,7 @@ static inline void __tdp_mmu_set_spte(struct kvm *kvm, struct tdp_iter *iter,
*/
WARN_ON(is_removed_spte(iter->old_spte));

- WRITE_ONCE(*rcu_dereference(iter->sptep), new_spte);
+ kvm_tdp_mmu_write_spte(iter->sptep, new_spte);

__handle_changed_spte(kvm, iter->as_id, iter->gfn, iter->old_spte,
new_spte, iter->level, false);
@@ -1046,7 +1046,7 @@ int kvm_tdp_mmu_map(struct kvm_vcpu *vcpu, struct kvm_page_fault *fault)
* because the new value informs the !present
* path below.
*/
- iter.old_spte = READ_ONCE(*rcu_dereference(iter.sptep));
+ iter.old_spte = kvm_tdp_mmu_read_spte(iter.sptep);
}

if (!is_shadow_present_pte(iter.old_spte)) {
--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 02:35:43

by Sean Christopherson

[permalink] [raw]
Subject: [PATCH v3 20/28] KVM: x86/mmu: Allow yielding when zapping GFNs for defunct TDP MMU root

Allow yielding when zapping SPTEs after the last reference to a valid
root is put. Because KVM must drop all SPTEs in response to relevant
mmu_notifier events, mark defunct roots invalid and reset their refcount
prior to zapping the root. Keeping the refcount elevated while the zap
is in-progress ensures the root is reachable via mmu_notifier until the
zap completes and the last reference to the invalid, defunct root is put.

Allowing kvm_tdp_mmu_put_root() to yield fixes soft lockup issues if the
root being put has a massive paging structure, e.g. zapping a root
that is backed entirely by 4kb pages for a guest with 32tb of memory can
take hundreds of seconds to complete.

watchdog: BUG: soft lockup - CPU#49 stuck for 485s! [max_guest_memor:52368]
RIP: 0010:kvm_set_pfn_dirty+0x30/0x50 [kvm]
__handle_changed_spte+0x1b2/0x2f0 [kvm]
handle_removed_tdp_mmu_page+0x1a7/0x2b8 [kvm]
__handle_changed_spte+0x1f4/0x2f0 [kvm]
handle_removed_tdp_mmu_page+0x1a7/0x2b8 [kvm]
__handle_changed_spte+0x1f4/0x2f0 [kvm]
tdp_mmu_zap_root+0x307/0x4d0 [kvm]
kvm_tdp_mmu_put_root+0x7c/0xc0 [kvm]
kvm_mmu_free_roots+0x22d/0x350 [kvm]
kvm_mmu_reset_context+0x20/0x60 [kvm]
kvm_arch_vcpu_ioctl_set_sregs+0x5a/0xc0 [kvm]
kvm_vcpu_ioctl+0x5bd/0x710 [kvm]
__se_sys_ioctl+0x77/0xc0
__x64_sys_ioctl+0x1d/0x20
do_syscall_64+0x44/0xa0
entry_SYSCALL_64_after_hwframe+0x44/0xae

KVM currently doesn't put a root from a non-preemptible context, so other
than the mmu_notifier wrinkle, yielding when putting a root is safe.

Yield-unfriendly iteration uses for_each_tdp_mmu_root(), which doesn't
take a reference to each root (it requires mmu_lock be held for the
entire duration of the walk).

tdp_mmu_next_root() is used only by the yield-friendly iterator.

kvm_tdp_mmu_zap_invalidated_roots() is explicitly yield friendly.

kvm_mmu_free_roots() => mmu_free_root_page() is a much bigger fan-out,
but is still yield-friendly in all call sites, as all callers can be
traced back to some combination of vcpu_run(), kvm_destroy_vm(), and/or
kvm_create_vm().

Signed-off-by: Sean Christopherson <[email protected]>
---
arch/x86/kvm/mmu/tdp_mmu.c | 122 ++++++++++++++++++++++++-------------
1 file changed, 81 insertions(+), 41 deletions(-)

diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index 3031b42c27a6..b838cfa984ad 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -91,21 +91,66 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,

WARN_ON(!root->tdp_mmu_page);

- spin_lock(&kvm->arch.tdp_mmu_pages_lock);
- list_del_rcu(&root->link);
- spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
+ /*
+ * Ensure root->role.invalid is read after the refcount reaches zero to
+ * avoid zapping the root multiple times, e.g. if a different task
+ * acquires a reference (after the root was marked invalid) and puts
+ * the last reference, all while holding mmu_lock for read. Pairs
+ * with the smp_mb__before_atomic() below.
+ */
+ smp_mb__after_atomic();
+
+ /*
+ * Free the root if it's already invalid. Invalid roots must be zapped
+ * before their last reference is put, i.e. there's no work to be done,
+ * and all roots must be invalidated (see below) before they're freed.
+ * Re-zapping invalid roots would put KVM into an infinite loop (again,
+ * see below).
+ */
+ if (root->role.invalid) {
+ spin_lock(&kvm->arch.tdp_mmu_pages_lock);
+ list_del_rcu(&root->link);
+ spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
+
+ call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
+ return;
+ }
+
+ /*
+ * Invalidate the root to prevent it from being reused by a vCPU, and
+ * so that KVM doesn't re-zap the root when its last reference is put
+ * again (see above).
+ */
+ root->role.invalid = true;
+
+ /*
+ * Ensure role.invalid is visible if a concurrent reader acquires a
+ * reference after the root's refcount is reset. Pairs with the
+ * smp_mb__after_atomic() above.
+ */
+ smp_mb__before_atomic();
+
+ /*
+ * Note, if mmu_lock is held for read this can race with other readers,
+ * e.g. they may acquire a reference without seeing the root as invalid,
+ * and the refcount may be reset after the root is skipped. Both races
+ * are benign, as flows that must visit all roots, e.g. need to zap
+ * SPTEs for correctness, must take mmu_lock for write to block page
+ * faults, and the only flow that must not consume an invalid root is
+ * allocating a new root for a vCPU, which also takes mmu_lock for write.
+ */
+ refcount_set(&root->tdp_mmu_root_count, 1);

/*
- * A TLB flush is not necessary as KVM performs a local TLB flush when
- * allocating a new root (see kvm_mmu_load()), and when migrating vCPU
- * to a different pCPU. Note, the local TLB flush on reuse also
- * invalidates any paging-structure-cache entries, i.e. TLB entries for
- * intermediate paging structures, that may be zapped, as such entries
- * are associated with the ASID on both VMX and SVM.
+ * Zap the root, then put the refcount "acquired" above. Recursively
+ * call kvm_tdp_mmu_put_root() to test the above logic for avoiding an
+ * infinite loop by freeing invalid roots. By design, the root is
+ * reachable while it's being zapped, thus a different task can put its
+ * last reference, i.e. flowing through kvm_tdp_mmu_put_root() for a
+ * defunct root is unavoidable.
*/
tdp_mmu_zap_root(kvm, root, shared);
-
- call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
+ kvm_tdp_mmu_put_root(kvm, root, shared);
}

enum tdp_mmu_roots_iter_type {
@@ -760,12 +805,23 @@ static inline gfn_t tdp_mmu_max_gfn_host(void)
static void tdp_mmu_zap_root(struct kvm *kvm, struct kvm_mmu_page *root,
bool shared)
{
- bool root_is_unreachable = !refcount_read(&root->tdp_mmu_root_count);
struct tdp_iter iter;

gfn_t end = tdp_mmu_max_gfn_host();
gfn_t start = 0;

+ /*
+ * The root must have an elevated refcount so that it's reachable via
+ * mmu_notifier callbacks, which allows this path to yield and drop
+ * mmu_lock. When handling an unmap/release mmu_notifier command, KVM
+ * must drop all references to relevant pages prior to completing the
+ * callback. Dropping mmu_lock with an unreachable root would result
+ * in zapping SPTEs after a relevant mmu_notifier callback completes
+ * and lead to use-after-free as zapping a SPTE triggers "writeback" of
+ * dirty accessed bits to the SPTE's associated struct page.
+ */
+ WARN_ON_ONCE(!refcount_read(&root->tdp_mmu_root_count));
+
kvm_lockdep_assert_mmu_lock_held(kvm, shared);

rcu_read_lock();
@@ -776,42 +832,16 @@ static void tdp_mmu_zap_root(struct kvm *kvm, struct kvm_mmu_page *root,
*/
for_each_tdp_pte_min_level(iter, root, root->role.level, start, end) {
retry:
- /*
- * Yielding isn't allowed when zapping an unreachable root as
- * the root won't be processed by mmu_notifier callbacks. When
- * handling an unmap/release mmu_notifier command, KVM must
- * drop all references to relevant pages prior to completing
- * the callback. Dropping mmu_lock can result in zapping SPTEs
- * for an unreachable root after a relevant callback completes,
- * which leads to use-after-free as zapping a SPTE triggers
- * "writeback" of dirty/accessed bits to the SPTE's associated
- * struct page.
- */
- if (!root_is_unreachable &&
- tdp_mmu_iter_cond_resched(kvm, &iter, false, shared))
+ if (tdp_mmu_iter_cond_resched(kvm, &iter, false, shared))
continue;

if (!is_shadow_present_pte(iter.old_spte))
continue;

- if (!shared) {
+ if (!shared)
tdp_mmu_set_spte(kvm, &iter, 0);
- } else if (tdp_mmu_set_spte_atomic(kvm, &iter, 0)) {
- /*
- * cmpxchg() shouldn't fail if the root is unreachable.
- * Retry so as not to leak the page and its children.
- */
- WARN_ONCE(root_is_unreachable,
- "Contended TDP MMU SPTE in unreachable root.");
+ else if (tdp_mmu_set_spte_atomic(kvm, &iter, 0))
goto retry;
- }
-
- /*
- * WARN if the root is invalid and is unreachable, all SPTEs
- * should've been zapped by kvm_tdp_mmu_zap_invalidated_roots(),
- * and inserting new SPTEs under an invalid root is a KVM bug.
- */
- WARN_ON_ONCE(root_is_unreachable && root->role.invalid);
}

rcu_read_unlock();
@@ -906,6 +936,9 @@ void kvm_tdp_mmu_zap_all(struct kvm *kvm)
int i;

/*
+ * Zap all roots, including invalid roots, as all SPTEs must be dropped
+ * before returning to the caller.
+ *
* A TLB flush is unnecessary, KVM zaps everything if and only if the VM
* is being destroyed or the userspace VMM has exited. In both cases,
* KVM_RUN is unreachable, i.e. no vCPUs will ever service the request.
@@ -931,6 +964,13 @@ void kvm_tdp_mmu_zap_invalidated_roots(struct kvm *kvm)

for_each_invalid_tdp_mmu_root_yield_safe(kvm, root) {
/*
+ * Zap the root regardless of what marked it invalid, e.g. even
+ * if the root was marked invalid by kvm_tdp_mmu_put_root() due
+ * to its last reference being put. All SPTEs must be dropped
+ * before returning to the caller, e.g. if a memslot is deleted
+ * or moved, the memslot's associated SPTEs are unreachable via
+ * the mmu_notifier once the memslot update completes.
+ *
* A TLB flush is unnecessary, invalidated roots are guaranteed
* to be unreachable by the guest (see kvm_tdp_mmu_put_root()
* for more details), and unlike the legacy MMU, no vCPU kick
--
2.35.1.574.g5d30c73bfb-goog
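
Condensed, the new flow in kvm_tdp_mmu_put_root() boils down to the
following. This is a simplified orientation sketch only; the memory barriers
and the pages_lock are elided and the surrounding code is paraphrased, see
the patch above for the real ordering requirements:

	if (!refcount_dec_and_test(&root->tdp_mmu_root_count))
		return;

	if (root->role.invalid) {
		/* Second put: the root was already zapped, just free it. */
		list_del_rcu(&root->link);
		call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
		return;
	}

	/*
	 * First put: mark the root invalid, re-elevate the refcount so the
	 * root stays reachable (and thus yield-friendly) while it's zapped,
	 * then put the root again, which lands in the branch above.
	 */
	root->role.invalid = true;
	refcount_set(&root->tdp_mmu_root_count, 1);
	tdp_mmu_zap_root(kvm, root, shared);
	kvm_tdp_mmu_put_root(kvm, root, shared);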

2022-02-26 02:37:04

by Sean Christopherson

[permalink] [raw]
Subject: [PATCH v3 14/28] KVM: x86/mmu: Skip remote TLB flush when zapping all of TDP MMU

Don't flush the TLBs when zapping all TDP MMU pages, as the only time KVM
uses the slow version of "zap everything" is when the VM is being
destroyed or the owning mm has exited. In either case, KVM_RUN is
unreachable for the VM, i.e. the guest TLB entries cannot be consumed.

Signed-off-by: Sean Christopherson <[email protected]>
---
arch/x86/kvm/mmu/tdp_mmu.c | 11 ++++++-----
1 file changed, 6 insertions(+), 5 deletions(-)

diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index c231b60e1726..87706e9cc6f3 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -874,14 +874,15 @@ bool __kvm_tdp_mmu_zap_gfn_range(struct kvm *kvm, int as_id, gfn_t start,

void kvm_tdp_mmu_zap_all(struct kvm *kvm)
{
- bool flush = false;
int i;

+ /*
+ * A TLB flush is unnecessary, KVM zaps everything if and only if the VM
+ * is being destroyed or the userspace VMM has exited. In both cases,
+ * KVM_RUN is unreachable, i.e. no vCPUs will ever service the request.
+ */
for (i = 0; i < KVM_ADDRESS_SPACE_NUM; i++)
- flush = kvm_tdp_mmu_zap_gfn_range(kvm, i, 0, -1ull, flush);
-
- if (flush)
- kvm_flush_remote_tlbs(kvm);
+ (void)kvm_tdp_mmu_zap_gfn_range(kvm, i, 0, -1ull, false);
}

/*
--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 02:39:44

by Sean Christopherson

[permalink] [raw]
Subject: [PATCH v3 11/28] KVM: x86/mmu: WARN if old _or_ new SPTE is REMOVED in non-atomic path

WARN if the new_spte being set by __tdp_mmu_set_spte() is a REMOVED_SPTE,
which is called out by the comment as being disallowed but not actually
checked. Keep the WARN on the old_spte as well, because overwriting a
REMOVED_SPTE in the non-atomic path is also disallowed (as evidenced by the
lack of splats with the existing WARN).

Fixes: 08f07c800e9d ("KVM: x86/mmu: Flush TLBs after zap in TDP MMU PF handler")
Cc: Ben Gardon <[email protected]>
Signed-off-by: Sean Christopherson <[email protected]>
Reviewed-by: Ben Gardon <[email protected]>
---
arch/x86/kvm/mmu/tdp_mmu.c | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index 8fbf3364f116..1dcdf1a4fcc1 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -640,13 +640,13 @@ static inline void __tdp_mmu_set_spte(struct kvm *kvm, struct tdp_iter *iter,
lockdep_assert_held_write(&kvm->mmu_lock);

/*
- * No thread should be using this function to set SPTEs to the
+ * No thread should be using this function to set SPTEs to or from the
* temporary removed SPTE value.
* If operating under the MMU lock in read mode, tdp_mmu_set_spte_atomic
* should be used. If operating under the MMU lock in write mode, the
* use of the removed SPTE should not be necessary.
*/
- WARN_ON(is_removed_spte(iter->old_spte));
+ WARN_ON(is_removed_spte(iter->old_spte) || is_removed_spte(new_spte));

kvm_tdp_mmu_write_spte(iter->sptep, new_spte);

--
2.35.1.574.g5d30c73bfb-goog

2022-02-26 02:41:10

by Sean Christopherson

[permalink] [raw]
Subject: [PATCH v3 01/28] KVM: x86/mmu: Use common iterator for walking invalid TDP MMU roots

Now that tdp_mmu_next_root() can process both valid and invalid roots,
extend it to be able to process _only_ invalid roots, add yet another
iterator macro for walking invalid roots, and use the new macro in
kvm_tdp_mmu_zap_invalidated_roots().

No functional change intended.

Reviewed-by: David Matlack <[email protected]>
Signed-off-by: Sean Christopherson <[email protected]>
---
arch/x86/kvm/mmu/tdp_mmu.c | 74 ++++++++++++++------------------------
1 file changed, 26 insertions(+), 48 deletions(-)

diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index debf08212f12..25148e8b711d 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -98,6 +98,12 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
}

+enum tdp_mmu_roots_iter_type {
+ ALL_ROOTS = -1,
+ VALID_ROOTS = 0,
+ INVALID_ROOTS = 1,
+};
+
/*
* Returns the next root after @prev_root (or the first root if @prev_root is
* NULL). A reference to the returned root is acquired, and the reference to
@@ -110,10 +116,16 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
*/
static struct kvm_mmu_page *tdp_mmu_next_root(struct kvm *kvm,
struct kvm_mmu_page *prev_root,
- bool shared, bool only_valid)
+ bool shared,
+ enum tdp_mmu_roots_iter_type type)
{
struct kvm_mmu_page *next_root;

+ kvm_lockdep_assert_mmu_lock_held(kvm, shared);
+
+ /* Ensure correctness for the below comparison against role.invalid. */
+ BUILD_BUG_ON(!!VALID_ROOTS || !INVALID_ROOTS);
+
rcu_read_lock();

if (prev_root)
@@ -125,7 +137,7 @@ static struct kvm_mmu_page *tdp_mmu_next_root(struct kvm *kvm,
typeof(*next_root), link);

while (next_root) {
- if ((!only_valid || !next_root->role.invalid) &&
+ if ((type == ALL_ROOTS || (type == !!next_root->role.invalid)) &&
kvm_tdp_mmu_get_root(next_root))
break;

@@ -151,18 +163,21 @@ static struct kvm_mmu_page *tdp_mmu_next_root(struct kvm *kvm,
* mode. In the unlikely event that this thread must free a root, the lock
* will be temporarily dropped and reacquired in write mode.
*/
-#define __for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared, _only_valid)\
- for (_root = tdp_mmu_next_root(_kvm, NULL, _shared, _only_valid); \
+#define __for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared, _type) \
+ for (_root = tdp_mmu_next_root(_kvm, NULL, _shared, _type); \
_root; \
- _root = tdp_mmu_next_root(_kvm, _root, _shared, _only_valid)) \
- if (kvm_mmu_page_as_id(_root) != _as_id) { \
+ _root = tdp_mmu_next_root(_kvm, _root, _shared, _type)) \
+ if (_as_id > 0 && kvm_mmu_page_as_id(_root) != _as_id) { \
} else

+#define for_each_invalid_tdp_mmu_root_yield_safe(_kvm, _root) \
+ __for_each_tdp_mmu_root_yield_safe(_kvm, _root, -1, true, INVALID_ROOTS)
+
#define for_each_valid_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared) \
- __for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared, true)
+ __for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared, VALID_ROOTS)

#define for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared) \
- __for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared, false)
+ __for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared, ALL_ROOTS)

#define for_each_tdp_mmu_root(_kvm, _root, _as_id) \
list_for_each_entry_rcu(_root, &_kvm->arch.tdp_mmu_roots, link, \
@@ -810,28 +825,6 @@ void kvm_tdp_mmu_zap_all(struct kvm *kvm)
kvm_flush_remote_tlbs(kvm);
}

-static struct kvm_mmu_page *next_invalidated_root(struct kvm *kvm,
- struct kvm_mmu_page *prev_root)
-{
- struct kvm_mmu_page *next_root;
-
- if (prev_root)
- next_root = list_next_or_null_rcu(&kvm->arch.tdp_mmu_roots,
- &prev_root->link,
- typeof(*prev_root), link);
- else
- next_root = list_first_or_null_rcu(&kvm->arch.tdp_mmu_roots,
- typeof(*next_root), link);
-
- while (next_root && !(next_root->role.invalid &&
- refcount_read(&next_root->tdp_mmu_root_count)))
- next_root = list_next_or_null_rcu(&kvm->arch.tdp_mmu_roots,
- &next_root->link,
- typeof(*next_root), link);
-
- return next_root;
-}
-
/*
* Since kvm_tdp_mmu_zap_all_fast has acquired a reference to each
* invalidated root, they will not be freed until this function drops the
@@ -842,36 +835,21 @@ static struct kvm_mmu_page *next_invalidated_root(struct kvm *kvm,
*/
void kvm_tdp_mmu_zap_invalidated_roots(struct kvm *kvm)
{
- struct kvm_mmu_page *next_root;
struct kvm_mmu_page *root;
bool flush = false;

lockdep_assert_held_read(&kvm->mmu_lock);

- rcu_read_lock();
-
- root = next_invalidated_root(kvm, NULL);
-
- while (root) {
- next_root = next_invalidated_root(kvm, root);
-
- rcu_read_unlock();
-
+ for_each_invalid_tdp_mmu_root_yield_safe(kvm, root) {
flush = zap_gfn_range(kvm, root, 0, -1ull, true, flush, true);

/*
- * Put the reference acquired in
- * kvm_tdp_mmu_invalidate_roots
+ * Put the reference acquired in kvm_tdp_mmu_invalidate_roots().
+ * Note, the iterator holds its own reference.
*/
kvm_tdp_mmu_put_root(kvm, root, true);
-
- root = next_root;
-
- rcu_read_lock();
}

- rcu_read_unlock();
-
if (flush)
kvm_flush_remote_tlbs(kvm);
}
--
2.35.1.574.g5d30c73bfb-goog
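
The trick that makes the filtering work is that the enum values line up with
role.invalid, which the BUILD_BUG_ON() above enforces. A minimal sketch of
the check in tdp_mmu_next_root(), with a helper name invented purely for
illustration:

enum tdp_mmu_roots_iter_type {
	ALL_ROOTS = -1,
	VALID_ROOTS = 0,
	INVALID_ROOTS = 1,
};

/*
 * VALID_ROOTS == 0 == !!false and INVALID_ROOTS == 1 == !!true, so
 * "type == !!invalid" selects exactly the requested subset.  ALL_ROOTS is
 * -1 and can never equal !!invalid, i.e. it matches every root via the
 * first half of the OR.
 */
static bool root_matches_iter_type(bool invalid,
				   enum tdp_mmu_roots_iter_type type)
{
	return type == ALL_ROOTS || type == !!invalid;
}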

2022-02-26 02:42:12

by Sean Christopherson

[permalink] [raw]
Subject: [PATCH v3 28/28] KVM: selftests: Add test to populate a VM with the max possible guest mem

Add a selftest that enables populating a VM with the maximum amount of
guest memory allowed by the underlying architecture. Abuse KVM's
memslots by mapping a single host memory region into multiple memslots so
that the selftest doesn't require a system with terabytes of RAM.

Default to 512gb of guest memory, which isn't all that interesting, but
should work on all MMUs and doesn't take an exorbitant amount of memory
or time. E.g. testing with ~64tb of guest memory takes the better part
of an hour, and requires 200gb of memory for KVM's page tables when using
4kb pages.

To inflict maximum abuse on KVM's MMU, default to 4kb pages (or whatever
the not-hugepage size is) in the backing store (memfd). Use memfd for
the host backing store to ensure that hugepages are guaranteed when
requested, and to give the user explicit control of the size of hugepage
being tested.

By default, spin up as many vCPUs as there are available to the selftest,
and distribute the work of dirtying each 4kb chunk of memory across all
vCPUs. Dirtying guest memory forces KVM to populate its page tables, and
also forces KVM to write back accessed/dirty information to struct page
when the guest memory is freed.

On x86, perform two passes with a MMU context reset between each pass to
coerce KVM into dropping all references to the MMU root, e.g. to emulate
a vCPU dropping the last reference. Perform both passes and all
rendezvous on all architectures in the hope that arm64 and s390x can gain
similar shenanigans in the future.

Measure and report the duration of each operation, which is helpful not
only to verify the test is working as intended, but also to easily
evaluate the performance differences between different page sizes.

Provide command line options to limit the amount of guest memory, set the
size of each slot (i.e. of the host memory region), set the number of
vCPUs, and to enable usage of hugepages.

Signed-off-by: Sean Christopherson <[email protected]>
---
tools/testing/selftests/kvm/.gitignore | 1 +
tools/testing/selftests/kvm/Makefile | 3 +
.../selftests/kvm/max_guest_memory_test.c | 292 ++++++++++++++++++
3 files changed, 296 insertions(+)
create mode 100644 tools/testing/selftests/kvm/max_guest_memory_test.c

diff --git a/tools/testing/selftests/kvm/.gitignore b/tools/testing/selftests/kvm/.gitignore
index 052ddfe4b23a..9b67343dc4ab 100644
--- a/tools/testing/selftests/kvm/.gitignore
+++ b/tools/testing/selftests/kvm/.gitignore
@@ -58,6 +58,7 @@
/hardware_disable_test
/kvm_create_max_vcpus
/kvm_page_table_test
+/max_guest_memory_test
/memslot_modification_stress_test
/memslot_perf_test
/rseq_test
diff --git a/tools/testing/selftests/kvm/Makefile b/tools/testing/selftests/kvm/Makefile
index f7fa5655e535..c06b1f8bc649 100644
--- a/tools/testing/selftests/kvm/Makefile
+++ b/tools/testing/selftests/kvm/Makefile
@@ -93,6 +93,7 @@ TEST_GEN_PROGS_x86_64 += dirty_log_perf_test
TEST_GEN_PROGS_x86_64 += hardware_disable_test
TEST_GEN_PROGS_x86_64 += kvm_create_max_vcpus
TEST_GEN_PROGS_x86_64 += kvm_page_table_test
+TEST_GEN_PROGS_x86_64 += max_guest_memory_test
TEST_GEN_PROGS_x86_64 += memslot_modification_stress_test
TEST_GEN_PROGS_x86_64 += memslot_perf_test
TEST_GEN_PROGS_x86_64 += rseq_test
@@ -112,6 +113,7 @@ TEST_GEN_PROGS_aarch64 += dirty_log_test
TEST_GEN_PROGS_aarch64 += dirty_log_perf_test
TEST_GEN_PROGS_aarch64 += kvm_create_max_vcpus
TEST_GEN_PROGS_aarch64 += kvm_page_table_test
+TEST_GEN_PROGS_aarch64 += max_guest_memory_test
TEST_GEN_PROGS_aarch64 += memslot_modification_stress_test
TEST_GEN_PROGS_aarch64 += memslot_perf_test
TEST_GEN_PROGS_aarch64 += rseq_test
@@ -127,6 +129,7 @@ TEST_GEN_PROGS_s390x += demand_paging_test
TEST_GEN_PROGS_s390x += dirty_log_test
TEST_GEN_PROGS_s390x += kvm_create_max_vcpus
TEST_GEN_PROGS_s390x += kvm_page_table_test
+TEST_GEN_PROGS_s390x += max_guest_memory_test
TEST_GEN_PROGS_s390x += rseq_test
TEST_GEN_PROGS_s390x += set_memory_region_test
TEST_GEN_PROGS_s390x += kvm_binary_stats_test
diff --git a/tools/testing/selftests/kvm/max_guest_memory_test.c b/tools/testing/selftests/kvm/max_guest_memory_test.c
new file mode 100644
index 000000000000..360c88288295
--- /dev/null
+++ b/tools/testing/selftests/kvm/max_guest_memory_test.c
@@ -0,0 +1,292 @@
+// SPDX-License-Identifier: GPL-2.0
+#define _GNU_SOURCE
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <pthread.h>
+#include <semaphore.h>
+#include <sys/types.h>
+#include <signal.h>
+#include <errno.h>
+#include <linux/bitmap.h>
+#include <linux/bitops.h>
+#include <linux/atomic.h>
+
+#include "kvm_util.h"
+#include "test_util.h"
+#include "guest_modes.h"
+#include "processor.h"
+
+static void guest_code(uint64_t start_gpa, uint64_t end_gpa, uint64_t stride)
+{
+ uint64_t gpa;
+
+ for (gpa = start_gpa; gpa < end_gpa; gpa += stride)
+ *((volatile uint64_t *)gpa) = gpa;
+
+ GUEST_DONE();
+}
+
+struct vcpu_info {
+ struct kvm_vm *vm;
+ uint32_t id;
+ uint64_t start_gpa;
+ uint64_t end_gpa;
+};
+
+static int nr_vcpus;
+static atomic_t rendezvous;
+
+static void rendezvous_with_boss(void)
+{
+ int orig = atomic_read(&rendezvous);
+
+ if (orig > 0) {
+ atomic_dec_and_test(&rendezvous);
+ while (atomic_read(&rendezvous) > 0)
+ cpu_relax();
+ } else {
+ atomic_inc(&rendezvous);
+ while (atomic_read(&rendezvous) < 0)
+ cpu_relax();
+ }
+}
+
+static void run_vcpu(struct kvm_vm *vm, uint32_t vcpu_id)
+{
+ vcpu_run(vm, vcpu_id);
+ ASSERT_EQ(get_ucall(vm, vcpu_id, NULL), UCALL_DONE);
+}
+
+static void *vcpu_worker(void *data)
+{
+ struct vcpu_info *vcpu = data;
+ struct kvm_vm *vm = vcpu->vm;
+ struct kvm_sregs sregs;
+ struct kvm_regs regs;
+
+ vcpu_args_set(vm, vcpu->id, 3, vcpu->start_gpa, vcpu->end_gpa,
+ vm_get_page_size(vm));
+
+ /* Snapshot regs before the first run. */
+ vcpu_regs_get(vm, vcpu->id, &regs);
+ rendezvous_with_boss();
+
+ run_vcpu(vm, vcpu->id);
+ rendezvous_with_boss();
+ vcpu_regs_set(vm, vcpu->id, &regs);
+ vcpu_sregs_get(vm, vcpu->id, &sregs);
+#ifdef __x86_64__
+ /* Toggle CR0.WP to trigger a MMU context reset. */
+ sregs.cr0 ^= X86_CR0_WP;
+#endif
+ vcpu_sregs_set(vm, vcpu->id, &sregs);
+ rendezvous_with_boss();
+
+ run_vcpu(vm, vcpu->id);
+ rendezvous_with_boss();
+
+ return NULL;
+}
+
+static pthread_t *spawn_workers(struct kvm_vm *vm, uint64_t start_gpa,
+ uint64_t end_gpa)
+{
+ struct vcpu_info *info;
+ uint64_t gpa, nr_bytes;
+ pthread_t *threads;
+ int i;
+
+ threads = malloc(nr_vcpus * sizeof(*threads));
+ TEST_ASSERT(threads, "Failed to allocate vCPU threads");
+
+ info = malloc(nr_vcpus * sizeof(*info));
+ TEST_ASSERT(info, "Failed to allocate vCPU gpa ranges");
+
+ nr_bytes = ((end_gpa - start_gpa) / nr_vcpus) &
+ ~((uint64_t)vm_get_page_size(vm) - 1);
+ TEST_ASSERT(nr_bytes, "C'mon, no way you have %d CPUs", nr_vcpus);
+
+ for (i = 0, gpa = start_gpa; i < nr_vcpus; i++, gpa += nr_bytes) {
+ info[i].vm = vm;
+ info[i].id = i;
+ info[i].start_gpa = gpa;
+ info[i].end_gpa = gpa + nr_bytes;
+ pthread_create(&threads[i], NULL, vcpu_worker, &info[i]);
+ }
+ return threads;
+}
+
+static void rendezvous_with_vcpus(struct timespec *time, const char *name)
+{
+ int i, rendezvoused;
+
+ pr_info("Waiting for vCPUs to finish %s...\n", name);
+
+ rendezvoused = atomic_read(&rendezvous);
+ for (i = 0; abs(rendezvoused) != 1; i++) {
+ usleep(100);
+ if (!(i & 0x3f))
+ pr_info("\r%d vCPUs haven't rendezvoused...",
+ abs(rendezvoused) - 1);
+ rendezvoused = atomic_read(&rendezvous);
+ }
+
+ clock_gettime(CLOCK_MONOTONIC, time);
+
+ /* Release the vCPUs after getting the time of the previous action. */
+ pr_info("\rAll vCPUs finished %s, releasing...\n", name);
+ if (rendezvoused > 0)
+ atomic_set(&rendezvous, -nr_vcpus - 1);
+ else
+ atomic_set(&rendezvous, nr_vcpus + 1);
+}
+
+static void calc_default_nr_vcpus(void)
+{
+ cpu_set_t possible_mask;
+ int r;
+
+ r = sched_getaffinity(0, sizeof(possible_mask), &possible_mask);
+ TEST_ASSERT(!r, "sched_getaffinity failed, errno = %d (%s)",
+ errno, strerror(errno));
+
+ nr_vcpus = CPU_COUNT(&possible_mask);
+ TEST_ASSERT(nr_vcpus > 0, "Uh, no CPUs?");
+}
+
+int main(int argc, char *argv[])
+{
+ /*
+ * Skip the first 4gb and slot0. slot0 maps <1gb and is used to back
+ * the guest's code, stack, and page tables. Because selftests creates
+ * an IRQCHIP, a.k.a. a local APIC, KVM creates an internal memslot
+ * just below the 4gb boundary. This test could create memory at
+ * 1gb-3gb, but it's simpler to skip straight to 4gb.
+ */
+ const uint64_t size_1gb = (1 << 30);
+ const uint64_t start_gpa = (4ull * size_1gb);
+ const int first_slot = 1;
+
+ struct timespec time_start, time_run1, time_reset, time_run2;
+ uint64_t max_gpa, gpa, slot_size, max_mem, i;
+ int max_slots, slot, opt, fd;
+ bool hugepages = false;
+ pthread_t *threads;
+ struct kvm_vm *vm;
+ void *mem;
+
+ /*
+ * Default to 2gb so that maxing out systems with MAXPHYADDR=46, which
+ * are quite common for x86, requires changing only max_mem (KVM allows
+ * 32k memslots, 32k * 2gb == ~64tb of guest memory).
+ */
+ slot_size = 2 * size_1gb;
+
+ max_slots = kvm_check_cap(KVM_CAP_NR_MEMSLOTS);
+ TEST_ASSERT(max_slots > first_slot, "KVM is broken");
+
+ /* All KVM MMUs should be able to survive a 512gb guest. */
+ max_mem = 512 * size_1gb;
+
+ calc_default_nr_vcpus();
+
+ while ((opt = getopt(argc, argv, "c:h:m:s:u")) != -1) {
+ switch (opt) {
+ case 'c':
+ nr_vcpus = atoi(optarg);
+ TEST_ASSERT(nr_vcpus, "#DE");
+ break;
+ case 'm':
+ max_mem = atoi(optarg) * size_1gb;
+ TEST_ASSERT(max_mem, "#DE");
+ break;
+ case 's':
+ slot_size = atoi(optarg) * size_1gb;
+ TEST_ASSERT(slot_size, "#DE");
+ break;
+ case 'u':
+ hugepages = true;
+ break;
+ case 'h':
+ default:
+ printf("usage: %s [-c nr_vcpus] [-m max_mem_in_gb] [-s slot_size_in_gb] [-u [huge_page_size]]\n", argv[0]);
+ exit(1);
+ }
+ }
+
+ vm = vm_create_default_with_vcpus(nr_vcpus, 0, 0, guest_code, NULL);
+
+ max_gpa = vm_get_max_gfn(vm) << vm_get_page_shift(vm);
+ TEST_ASSERT(max_gpa > (4 * slot_size), "MAXPHYADDR <4gb ");
+
+ fd = kvm_memfd_alloc(slot_size, hugepages);
+ mem = mmap(NULL, slot_size, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
+ TEST_ASSERT(mem != MAP_FAILED, "mmap() failed");
+
+ TEST_ASSERT(!madvise(mem, slot_size, MADV_NOHUGEPAGE), "madvise() failed");
+
+ /* Pre-fault the memory to avoid taking mmap_sem on guest page faults. */
+ for (i = 0; i < slot_size; i += vm_get_page_size(vm))
+ ((uint8_t *)mem)[i] = 0xaa;
+
+ gpa = 0;
+ for (slot = first_slot; slot < max_slots; slot++) {
+ gpa = start_gpa + ((slot - first_slot) * slot_size);
+ if (gpa + slot_size > max_gpa)
+ break;
+
+ if ((gpa - start_gpa) >= max_mem)
+ break;
+
+ vm_set_user_memory_region(vm, slot, 0, gpa, slot_size, mem);
+
+#ifdef __x86_64__
+ /* Identity map memory in the guest using 1gb pages. */
+ for (i = 0; i < slot_size; i += size_1gb)
+ __virt_pg_map(vm, gpa + i, gpa + i, X86_PAGE_SIZE_1G);
+#else
+ for (i = 0; i < slot_size; i += vm_get_page_size(vm))
+ virt_pg_map(vm, gpa + i, gpa + i);
+#endif
+ }
+
+ atomic_set(&rendezvous, nr_vcpus + 1);
+ threads = spawn_workers(vm, start_gpa, gpa);
+
+ pr_info("Running with %lugb of guest memory and %u vCPUs\n",
+ (gpa - start_gpa) / size_1gb, nr_vcpus);
+
+ rendezvous_with_vcpus(&time_start, "spawning");
+ rendezvous_with_vcpus(&time_run1, "run 1");
+ rendezvous_with_vcpus(&time_reset, "reset");
+ rendezvous_with_vcpus(&time_run2, "run 2");
+
+ time_run2 = timespec_sub(time_run2, time_reset);
+ time_reset = timespec_sub(time_reset, time_run1);
+ time_run1 = timespec_sub(time_run1, time_start);
+
+ pr_info("run1 = %ld.%.9lds, reset = %ld.%.9lds, run2 = %ld.%.9lds\n",
+ time_run1.tv_sec, time_run1.tv_nsec,
+ time_reset.tv_sec, time_reset.tv_nsec,
+ time_run2.tv_sec, time_run2.tv_nsec);
+
+ /*
+ * Delete even numbered slots (arbitrary) and unmap the first half of
+ * the backing (also arbitrary) to verify KVM correctly drops all
+ * references to the removed regions.
+ */
+ for (slot = (slot - 1) & ~1ull; slot >= first_slot; slot -= 2)
+ vm_set_user_memory_region(vm, slot, 0, 0, 0, NULL);
+
+ munmap(mem, slot_size / 2);
+
+ /* Sanity check that the vCPUs actually ran. */
+ for (i = 0; i < nr_vcpus; i++)
+ pthread_join(threads[i], NULL);
+
+ /*
+ * Deliberately exit without deleting the remaining memslots or closing
+ * kvm_fd to test cleanup via mmu_notifier.release.
+ */
+}
--
2.35.1.574.g5d30c73bfb-goog

2022-02-28 23:28:15

by Ben Gardon

[permalink] [raw]
Subject: Re: [PATCH v3 03/28] KVM: x86/mmu: Fix wrong/misleading comments in TDP MMU fast zap

On Fri, Feb 25, 2022 at 4:16 PM Sean Christopherson <[email protected]> wrote:
>
> Fix misleading and arguably wrong comments in the TDP MMU's fast zap
> flow. The comments, and the fact that actually zapping invalid roots was
> added separately, strongly suggests that zapping invalid roots is an
> optimization and not required for correctness. That is a lie.
>
> KVM _must_ zap invalid roots before returning from kvm_mmu_zap_all_fast(),
> because when it's called from kvm_mmu_invalidate_zap_pages_in_memslot(),
> KVM is relying on it to fully remove all references to the memslot. Once
> the memslot is gone, KVM's mmu_notifier hooks will be unable to find the
> stale references as the hva=>gfn translation is done via the memslots.
> If KVM doesn't immediately zap SPTEs and userspace unmaps a range after
> deleting a memslot, KVM will fail to zap in response to the mmu_notifier
> due to not finding a memslot corresponding to the notifier's range, which
> leads to a variation of use-after-free.
>
> The other misleading comment (and code) explicitly states that roots
> without a reference should be skipped. While that's technically true,
> it's also extremely misleading as it should be impossible for KVM to
> encounter a defunct root on the list while holding mmu_lock for write.
> Opportunistically add a WARN to enforce that invariant.
>
> Fixes: b7cccd397f31 ("KVM: x86/mmu: Fast invalidation for TDP MMU")
> Fixes: 4c6654bd160d ("KVM: x86/mmu: Tear down roots before kvm_mmu_zap_all_fast returns")
> Signed-off-by: Sean Christopherson <[email protected]>

A couple nits about missing words, but otherwise looks good.

Reviewed-by: Ben Gardon <[email protected]>

> ---
> arch/x86/kvm/mmu/mmu.c | 8 +++++++
> arch/x86/kvm/mmu/tdp_mmu.c | 46 +++++++++++++++++++++-----------------
> 2 files changed, 33 insertions(+), 21 deletions(-)
>
> diff --git a/arch/x86/kvm/mmu/mmu.c b/arch/x86/kvm/mmu/mmu.c
> index b2c1c4eb6007..80607513a1f2 100644
> --- a/arch/x86/kvm/mmu/mmu.c
> +++ b/arch/x86/kvm/mmu/mmu.c
> @@ -5662,6 +5662,14 @@ static void kvm_mmu_zap_all_fast(struct kvm *kvm)
>
> write_unlock(&kvm->mmu_lock);
>
> + /*
> + * Zap the invalidated TDP MMU roots, all SPTEs must be dropped before
> + * returning to the caller, e.g. if the zap is in response to a memslot
> + * deletion, mmu_notifier callbacks will be unable to reach the SPTEs
> + * associated with the deleted memslot once the update completes, and
> + * deletion, mmu_notifier callbacks will be unable to reach the SPTEs
> + * lead to use-after-free.
> + */
> if (is_tdp_mmu_enabled(kvm)) {
> read_lock(&kvm->mmu_lock);
> kvm_tdp_mmu_zap_invalidated_roots(kvm);
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index 9357780ec28f..12866113fb4f 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -826,12 +826,11 @@ void kvm_tdp_mmu_zap_all(struct kvm *kvm)
> }
>
> /*
> - * Since kvm_tdp_mmu_zap_all_fast has acquired a reference to each
> - * invalidated root, they will not be freed until this function drops the
> - * reference. Before dropping that reference, tear down the paging
> - * structure so that whichever thread does drop the last reference
> - * only has to do a trivial amount of work. Since the roots are invalid,
> - * no new SPTEs should be created under them.
> + * Zap all invalidated roots to ensure all SPTEs are dropped before the "fast
> + * zap" completes. Since kvm_tdp_mmu_invalidate_all_roots() has acquired a
> + * reference to each invalidated root, roots will not be freed until after this
> + * function drops the gifted reference, e.g. so that vCPUs don't get stuck with
> + * tearing paging structures.

Nit: tearing down paging structures

> */
> void kvm_tdp_mmu_zap_invalidated_roots(struct kvm *kvm)
> {
> @@ -855,21 +854,25 @@ void kvm_tdp_mmu_zap_invalidated_roots(struct kvm *kvm)
> }
>
> /*
> - * Mark each TDP MMU root as invalid so that other threads
> - * will drop their references and allow the root count to
> - * go to 0.
> + * Mark each TDP MMU root as invalid to prevent vCPUs from reusing a root that
> + * is about to be zapped, e.g. in response to a memslots update. The caller is
> + * responsible for invoking kvm_tdp_mmu_zap_invalidated_roots() to the actual

Nit: to do

> + * zapping.
> *
> - * Also take a reference on all roots so that this thread
> - * can do the bulk of the work required to free the roots
> - * once they are invalidated. Without this reference, a
> - * vCPU thread might drop the last reference to a root and
> - * get stuck with tearing down the entire paging structure.
> + * Take a reference on all roots to prevent the root from being freed before it
> + * is zapped by this thread. Freeing a root is not a correctness issue, but if
> + * a vCPU drops the last reference to a root prior to the root being zapped, it
> + * will get stuck with tearing down the entire paging structure.
> *
> - * Roots which have a zero refcount should be skipped as
> - * they're already being torn down.
> - * Already invalid roots should be referenced again so that
> - * they aren't freed before kvm_tdp_mmu_zap_all_fast is
> - * done with them.
> + * Get a reference even if the root is already invalid,
> + * kvm_tdp_mmu_zap_invalidated_roots() assumes it was gifted a reference to all
> + * invalid roots, e.g. there's no epoch to identify roots that were invalidated
> + * by a previous call. Roots stay on the list until the last reference is
> + * dropped, so even though all invalid roots are zapped, a root may not go away
> + * for quite some time, e.g. if a vCPU blocks across multiple memslot updates.
> + *
> + * Because mmu_lock is held for write, it should be impossible to observe a
> + * root with zero refcount, i.e. the list of roots cannot be stale.
> *
> * This has essentially the same effect for the TDP MMU
> * as updating mmu_valid_gen does for the shadow MMU.
> @@ -879,9 +882,10 @@ void kvm_tdp_mmu_invalidate_all_roots(struct kvm *kvm)
> struct kvm_mmu_page *root;
>
> lockdep_assert_held_write(&kvm->mmu_lock);
> - list_for_each_entry(root, &kvm->arch.tdp_mmu_roots, link)
> - if (refcount_inc_not_zero(&root->tdp_mmu_root_count))
> + list_for_each_entry(root, &kvm->arch.tdp_mmu_roots, link) {
> + if (!WARN_ON_ONCE(!kvm_tdp_mmu_get_root(root)))
> root->role.invalid = true;
> + }
> }
>
> /*
> --
> 2.35.1.574.g5d30c73bfb-goog
>

2022-02-28 23:45:39

by Ben Gardon

[permalink] [raw]
Subject: Re: [PATCH v3 06/28] KVM: x86/mmu: Require mmu_lock be held for write in unyielding root iter

On Fri, Feb 25, 2022 at 4:16 PM Sean Christopherson <[email protected]> wrote:
>
> Assert that mmu_lock is held for write by users of the yield-unfriendly
> TDP iterator. The nature of a shared walk means that the caller needs to
> play nice with other tasks modifying the page tables, which is more or
> less the same thing as playing nice with yielding. Theoretically, KVM
> could gain a flow where it could legitimately take mmu_lock for read in
> a non-preemptible context, but that's highly unlikely and any such case
> should be viewed with a fair amount of scrutiny.
>
> Signed-off-by: Sean Christopherson <[email protected]>

Reviewed-by: Ben Gardon <[email protected]>

> ---
> arch/x86/kvm/mmu/tdp_mmu.c | 21 +++++++++++++++------
> 1 file changed, 15 insertions(+), 6 deletions(-)
>
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index 5994db5d5226..189f21e71c36 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -29,13 +29,16 @@ bool kvm_mmu_init_tdp_mmu(struct kvm *kvm)
> return true;
> }
>
> -static __always_inline void kvm_lockdep_assert_mmu_lock_held(struct kvm *kvm,
> +/* Arbitrarily returns true so that this may be used in if statements. */
> +static __always_inline bool kvm_lockdep_assert_mmu_lock_held(struct kvm *kvm,
> bool shared)
> {
> if (shared)
> lockdep_assert_held_read(&kvm->mmu_lock);
> else
> lockdep_assert_held_write(&kvm->mmu_lock);
> +
> + return true;
> }
>
> void kvm_mmu_uninit_tdp_mmu(struct kvm *kvm)
> @@ -187,11 +190,17 @@ static struct kvm_mmu_page *tdp_mmu_next_root(struct kvm *kvm,
> #define for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared) \
> __for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared, ALL_ROOTS)
>
> -#define for_each_tdp_mmu_root(_kvm, _root, _as_id) \
> - list_for_each_entry_rcu(_root, &_kvm->arch.tdp_mmu_roots, link, \
> - lockdep_is_held_type(&kvm->mmu_lock, 0) || \
> - lockdep_is_held(&kvm->arch.tdp_mmu_pages_lock)) \
> - if (kvm_mmu_page_as_id(_root) != _as_id) { \
> +/*
> + * Iterate over all TDP MMU roots. Requires that mmu_lock be held for write,
> + * the implication being that any flow that holds mmu_lock for read is
> + * inherently yield-friendly and should use the yield-safe variant above.
> + * Holding mmu_lock for write obviates the need for RCU protection as the list
> + * is guaranteed to be stable.
> + */
> +#define for_each_tdp_mmu_root(_kvm, _root, _as_id) \
> + list_for_each_entry(_root, &_kvm->arch.tdp_mmu_roots, link) \
> + if (kvm_lockdep_assert_mmu_lock_held(_kvm, false) && \
> + kvm_mmu_page_as_id(_root) != _as_id) { \
> } else
>
> static struct kvm_mmu_page *tdp_mmu_alloc_sp(struct kvm_vcpu *vcpu)
> --
> 2.35.1.574.g5d30c73bfb-goog
>

2022-03-01 00:46:38

by Ben Gardon

[permalink] [raw]
Subject: Re: [PATCH v3 07/28] KVM: x86/mmu: Check for !leaf=>leaf, not PFN change, in TDP MMU SP removal

On Fri, Feb 25, 2022 at 4:16 PM Sean Christopherson <[email protected]> wrote:
>
> Look for a !leaf=>leaf conversion instead of a PFN change when checking
> if a SPTE change removed a TDP MMU shadow page. Convert the PFN check
> into a WARN, as KVM should never change the PFN of a shadow page (except
> when it's being zapped or replaced).
>
> From a purely theoretical perspective, it's not illegal to replace a SP
> with a hugepage pointing at the same PFN. In practice, it's impossible
> as that would require mapping guest memory overtop a kernel-allocated SP.
> Either way, the check is odd.
>
> Signed-off-by: Sean Christopherson <[email protected]>

Reviewed-by: Ben Gardon <[email protected]>

> ---
> arch/x86/kvm/mmu/tdp_mmu.c | 7 +++++--
> 1 file changed, 5 insertions(+), 2 deletions(-)
>
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index 189f21e71c36..848448b65703 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -505,9 +505,12 @@ static void __handle_changed_spte(struct kvm *kvm, int as_id, gfn_t gfn,
>
> /*
> * Recursively handle child PTs if the change removed a subtree from
> - * the paging structure.
> + * the paging structure. Note the WARN on the PFN changing without the
> + * SPTE being converted to a hugepage (leaf) or being zapped. Shadow
> + * pages are kernel allocations and should never be migrated.
> */
> - if (was_present && !was_leaf && (pfn_changed || !is_present))
> + if (was_present && !was_leaf &&
> + (is_leaf || !is_present || WARN_ON_ONCE(pfn_changed)))
> handle_removed_pt(kvm, spte_to_child_pt(old_spte, level), shared);
> }
>
> --
> 2.35.1.574.g5d30c73bfb-goog
>

2022-03-01 01:13:08

by David Woodhouse

[permalink] [raw]
Subject: Re: [PATCH v3 26/28] KVM: selftests: Split out helper to allocate guest mem via memfd

On Sat, 2022-02-26 at 00:15 +0000, Sean Christopherson wrote:
> Extract the code for allocating guest memory via memfd out of
> vm_userspace_mem_region_add() and into a new helper, kvm_memfd_alloc().
> A future selftest to populate a guest with the maximum amount of guest
> memory will abuse KVM's memslots to alias guest memory regions to a
> single memfd-backed host region, i.e. needs to back a guest with memfd
> memory without a 1:1 association between a memslot and a memfd instance.
>
> No functional change intended.
>
> Signed-off-by: Sean Christopherson <[email protected]>

While we're at it, please can we make the whole thing go away and just
return failure #ifndef MFD_CLOEXEC, instead of breaking the build on
older userspace?


Attachments:
smime.p7s (5.83 kB)
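
For concreteness, one shape the suggestion could take; this is purely a
hypothetical sketch, not the actual helper (the signature mirrors the
kvm_memfd_alloc() call in the selftest, but the body, the failure
convention, and the headers are assumptions):

/*
 * Hypothetical sketch: report failure at runtime when the userspace headers
 * are too old to define MFD_CLOEXEC, instead of failing the build.  Assumes
 * <sys/mman.h> (or <linux/memfd.h>) for memfd_create() and the MFD_* flags.
 */
int kvm_memfd_alloc(size_t size, bool hugepages)
{
#ifndef MFD_CLOEXEC
	pr_info("memfd_create() not supported by these headers, skipping\n");
	return -1;
#else
	unsigned int flags = MFD_CLOEXEC;
	int fd, r;

	if (hugepages)
		flags |= MFD_HUGETLB;

	fd = memfd_create("kvm_selftest", flags);
	TEST_ASSERT(fd != -1, "memfd_create() failed, errno = %d (%s)",
		    errno, strerror(errno));

	r = ftruncate(fd, size);
	TEST_ASSERT(!r, "ftruncate() failed, errno = %d (%s)",
		    errno, strerror(errno));

	return fd;
#endif
}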

2022-03-01 01:20:54

by Ben Gardon

[permalink] [raw]
Subject: Re: [PATCH v3 05/28] KVM: x86/mmu: Document that zapping invalidated roots doesn't need to flush

On Fri, Feb 25, 2022 at 4:16 PM Sean Christopherson <[email protected]> wrote:
>
> Remove the misleading flush "handling" when zapping invalidated TDP MMU
> roots, and document that flushing is unnecessary for all flavors of MMUs
> when zapping invalid/obsolete roots/pages. The "handling" in the TDP MMU
> is dead code, as zap_gfn_range() is called with shared=true, in which
> case it will never return true due to the flushing being handled by
> tdp_mmu_zap_spte_atomic().
>
> No functional change intended.
>
> Signed-off-by: Sean Christopherson <[email protected]>

Reviewed-by: Ben Gardon <[email protected]>

> ---
> arch/x86/kvm/mmu/mmu.c | 10 +++++++---
> arch/x86/kvm/mmu/tdp_mmu.c | 15 ++++++++++-----
> 2 files changed, 17 insertions(+), 8 deletions(-)
>
> diff --git a/arch/x86/kvm/mmu/mmu.c b/arch/x86/kvm/mmu/mmu.c
> index 5a931c89d27b..1c4b84e80841 100644
> --- a/arch/x86/kvm/mmu/mmu.c
> +++ b/arch/x86/kvm/mmu/mmu.c
> @@ -5615,9 +5615,13 @@ static void kvm_zap_obsolete_pages(struct kvm *kvm)
> }
>
> /*
> - * Trigger a remote TLB flush before freeing the page tables to ensure
> - * KVM is not in the middle of a lockless shadow page table walk, which
> - * may reference the pages.
> + * Kick all vCPUs (via remote TLB flush) before freeing the page tables
> + * to ensure KVM is not in the middle of a lockless shadow page table
> + * walk, which may reference the pages. The remote TLB flush itself is
> + * not required and is simply a convenient way to kick vCPUs as needed.
> + * KVM performs a local TLB flush when allocating a new root (see
> + * kvm_mmu_load()), and the reload in the caller ensures no vCPUs are
> + * running with an obsolete MMU.
> */
> kvm_mmu_commit_zap_page(kvm, &kvm->arch.zapped_obsolete_pages);
> }
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index e35bd88d92fd..5994db5d5226 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -843,12 +843,20 @@ void kvm_tdp_mmu_zap_all(struct kvm *kvm)
> void kvm_tdp_mmu_zap_invalidated_roots(struct kvm *kvm)
> {
> struct kvm_mmu_page *root;
> - bool flush = false;
>
> lockdep_assert_held_read(&kvm->mmu_lock);
>
> for_each_invalid_tdp_mmu_root_yield_safe(kvm, root) {
> - flush = zap_gfn_range(kvm, root, 0, -1ull, true, flush, true);
> + /*
> + * A TLB flush is unnecessary, invalidated roots are guaranteed
> + * to be unreachable by the guest (see kvm_tdp_mmu_put_root()
> + * for more details), and unlike the legacy MMU, no vCPU kick
> + * is needed to play nice with lockless shadow walks as the TDP
> + * MMU protects its paging structures via RCU. Note, zapping
> + * will still flush on yield, but that's a minor performance
> + * blip and not a functional issue.
> + */
> + (void)zap_gfn_range(kvm, root, 0, -1ull, true, false, true);
>
> /*
> * Put the reference acquired in kvm_tdp_mmu_invalidate_roots().
> @@ -856,9 +864,6 @@ void kvm_tdp_mmu_zap_invalidated_roots(struct kvm *kvm)
> */
> kvm_tdp_mmu_put_root(kvm, root, true);
> }
> -
> - if (flush)
> - kvm_flush_remote_tlbs(kvm);
> }
>
> /*
> --
> 2.35.1.574.g5d30c73bfb-goog
>

2022-03-01 07:26:43

by Ben Gardon

[permalink] [raw]
Subject: Re: [PATCH v3 14/28] KVM: x86/mmu: Skip remote TLB flush when zapping all of TDP MMU

On Fri, Feb 25, 2022 at 4:16 PM Sean Christopherson <[email protected]> wrote:
>
> Don't flush the TLBs when zapping all TDP MMU pages, as the only time KVM
> uses the slow version of "zap everything" is when the VM is being
> destroyed or the owning mm has exited. In either case, KVM_RUN is
> unreachable for the VM, i.e. the guest TLB entries cannot be consumed.
>
> Signed-off-by: Sean Christopherson <[email protected]>

Reviewed-by: Ben Gardon <[email protected]>

> ---
> arch/x86/kvm/mmu/tdp_mmu.c | 11 ++++++-----
> 1 file changed, 6 insertions(+), 5 deletions(-)
>
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index c231b60e1726..87706e9cc6f3 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -874,14 +874,15 @@ bool __kvm_tdp_mmu_zap_gfn_range(struct kvm *kvm, int as_id, gfn_t start,
>
> void kvm_tdp_mmu_zap_all(struct kvm *kvm)
> {
> - bool flush = false;
> int i;
>
> + /*
> + * A TLB flush is unnecessary, KVM zaps everything if and only if the VM
> + * is being destroyed or the userspace VMM has exited. In both cases,
> + * KVM_RUN is unreachable, i.e. no vCPUs will ever service the request.
> + */
> for (i = 0; i < KVM_ADDRESS_SPACE_NUM; i++)
> - flush = kvm_tdp_mmu_zap_gfn_range(kvm, i, 0, -1ull, flush);
> -
> - if (flush)
> - kvm_flush_remote_tlbs(kvm);
> + (void)kvm_tdp_mmu_zap_gfn_range(kvm, i, 0, -1ull, false);
> }
>
> /*
> --
> 2.35.1.574.g5d30c73bfb-goog
>

2022-03-01 19:12:46

by Paolo Bonzini

[permalink] [raw]
Subject: Re: [PATCH v3 20/28] KVM: x86/mmu: Allow yielding when zapping GFNs for defunct TDP MMU root

On 2/26/22 01:15, Sean Christopherson wrote:
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index 3031b42c27a6..b838cfa984ad 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -91,21 +91,66 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
>
> WARN_ON(!root->tdp_mmu_page);
>
> - spin_lock(&kvm->arch.tdp_mmu_pages_lock);
> - list_del_rcu(&root->link);
> - spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
> + /*
> + * Ensure root->role.invalid is read after the refcount reaches zero to
> + * avoid zapping the root multiple times, e.g. if a different task
> + * acquires a reference (after the root was marked invalid) and puts
> + * the last reference, all while holding mmu_lock for read. Pairs
> + * with the smp_mb__before_atomic() below.
> + */
> + smp_mb__after_atomic();
> +
> + /*
> + * Free the root if it's already invalid. Invalid roots must be zapped
> + * before their last reference is put, i.e. there's no work to be done,
> + * and all roots must be invalidated (see below) before they're freed.
> + * Re-zapping invalid roots would put KVM into an infinite loop (again,
> + * see below).
> + */
> + if (root->role.invalid) {
> + spin_lock(&kvm->arch.tdp_mmu_pages_lock);
> + list_del_rcu(&root->link);
> + spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
> +
> + call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
> + return;
> + }
> +
> + /*
> + * Invalidate the root to prevent it from being reused by a vCPU, and
> + * so that KVM doesn't re-zap the root when its last reference is put
> + * again (see above).
> + */
> + root->role.invalid = true;
> +
> + /*
> + * Ensure role.invalid is visible if a concurrent reader acquires a
> + * reference after the root's refcount is reset. Pairs with the
> + * smp_mb__after_atomic() above.
> + */
> + smp_mb__before_atomic();

I have reviewed the series and I only have very minor comments... but
this part is beyond me. The lavish comments don't explain what is an
optimization and what is a requirement, and after spending quite some
time I wonder if all this should just be

if (refcount_dec_not_one(&root->tdp_mmu_root_count))
return;

if (!xchg(&root->role.invalid, true) {
tdp_mmu_zap_root(kvm, root, shared);

/*
* Do not assume the refcount is still 1: because
* tdp_mmu_zap_root can yield, a different task
* might have grabbed a reference to this root.
*
if (refcount_dec_not_one(&root->tdp_mmu_root_count))
return;
}

/*
* The root is invalid, and its reference count has reached
* zero. It must have been zapped either in the "if" above or
* by someone else, and we're definitely the last thread to see
* it apart from RCU-protected page table walks.
*/
refcount_set(&root->tdp_mmu_root_count, 0);

spin_lock(&kvm->arch.tdp_mmu_pages_lock);
list_del_rcu(&root->link);
spin_unlock(&kvm->arch.tdp_mmu_pages_lock);

call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);

(Yay for xchg's implicit memory barriers)

Paolo

> + /*
> + * Note, if mmu_lock is held for read this can race with other readers,
> + * e.g. they may acquire a reference without seeing the root as invalid,
> + * and the refcount may be reset after the root is skipped. Both races
> + * are benign, as flows that must visit all roots, e.g. need to zap
> + * SPTEs for correctness, must take mmu_lock for write to block page
> + * faults, and the only flow that must not consume an invalid root is
> + * allocating a new root for a vCPU, which also takes mmu_lock for write.
> + */
> + refcount_set(&root->tdp_mmu_root_count, 1);
>
> /*
> - * A TLB flush is not necessary as KVM performs a local TLB flush when
> - * allocating a new root (see kvm_mmu_load()), and when migrating vCPU
> - * to a different pCPU. Note, the local TLB flush on reuse also
> - * invalidates any paging-structure-cache entries, i.e. TLB entries for
> - * intermediate paging structures, that may be zapped, as such entries
> - * are associated with the ASID on both VMX and SVM.
> + * Zap the root, then put the refcount "acquired" above. Recursively
> + * call kvm_tdp_mmu_put_root() to test the above logic for avoiding an
> + * infinite loop by freeing invalid roots. By design, the root is
> + * reachable while it's being zapped, thus a different task can put its
> + * last reference, i.e. flowing through kvm_tdp_mmu_put_root() for a
> + * defunct root is unavoidable.
> */
> tdp_mmu_zap_root(kvm, root, shared);
> -
> - call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
> + kvm_tdp_mmu_put_root(kvm, root, shared);
> }
>
> enum tdp_mmu_roots_iter_type {

2022-03-01 19:59:17

by Sean Christopherson

[permalink] [raw]
Subject: Re: [PATCH v3 20/28] KVM: x86/mmu: Allow yielding when zapping GFNs for defunct TDP MMU root

On Tue, Mar 01, 2022, Paolo Bonzini wrote:
> On 2/26/22 01:15, Sean Christopherson wrote:
> > diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> > index 3031b42c27a6..b838cfa984ad 100644
> > --- a/arch/x86/kvm/mmu/tdp_mmu.c
> > +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> > @@ -91,21 +91,66 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
> > WARN_ON(!root->tdp_mmu_page);
> > - spin_lock(&kvm->arch.tdp_mmu_pages_lock);
> > - list_del_rcu(&root->link);
> > - spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
> > + /*
> > + * Ensure root->role.invalid is read after the refcount reaches zero to
> > + * avoid zapping the root multiple times, e.g. if a different task
> > + * acquires a reference (after the root was marked invalid) and puts
> > + * the last reference, all while holding mmu_lock for read. Pairs
> > + * with the smp_mb__before_atomic() below.
> > + */
> > + smp_mb__after_atomic();
> > +
> > + /*
> > + * Free the root if it's already invalid. Invalid roots must be zapped
> > + * before their last reference is put, i.e. there's no work to be done,
> > + * and all roots must be invalidated (see below) before they're freed.
> > + * Re-zapping invalid roots would put KVM into an infinite loop (again,
> > + * see below).
> > + */
> > + if (root->role.invalid) {
> > + spin_lock(&kvm->arch.tdp_mmu_pages_lock);
> > + list_del_rcu(&root->link);
> > + spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
> > +
> > + call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
> > + return;
> > + }
> > +
> > + /*
> > + * Invalidate the root to prevent it from being reused by a vCPU, and
> > + * so that KVM doesn't re-zap the root when its last reference is put
> > + * again (see above).
> > + */
> > + root->role.invalid = true;
> > +
> > + /*
> > + * Ensure role.invalid is visible if a concurrent reader acquires a
> > + * reference after the root's refcount is reset. Pairs with the
> > + * smp_mb__after_atomic() above.
> > + */
> > + smp_mb__before_atomic();
>
> I have reviewed the series and I only have very minor comments... but this
> part is beyond me. The lavish comments don't explain what is an
> optimization and what is a requirement,

Ah, they're all requirements, but the invalid part also optimizes the case where
a root was marked invalid before its last reference was ever put.

What I really meant to refer to by "zapping" was the entire sequence of restoring
the refcount to '1', zapping the root, and recursively re-dropping that ref. Avoiding
that "zap" is a requirement, otherwise KVM would get stuck in an infinite loop.

> and after spending quite some time I wonder if all this should just be
>
> if (refcount_dec_not_one(&root->tdp_mmu_root_count))
> return;
>
> if (!xchg(&root->role.invalid, true) {

The refcount being '1' means there's another task currently using root, marking
the root invalid will mean checks on the root's validity are non-deterministic
for the other task.

> tdp_mmu_zap_root(kvm, root, shared);
>
> /*
> * Do not assume the refcount is still 1: because
> * tdp_mmu_zap_root can yield, a different task
> * might have grabbed a reference to this root.
> *
> if (refcount_dec_not_one(&root->tdp_mmu_root_count))

This is wrong, _this_ task can't drop a reference taken by the other task.

> return;
> }
>
> /*
> * The root is invalid, and its reference count has reached
> * zero. It must have been zapped either in the "if" above or
> * by someone else, and we're definitely the last thread to see
> * it apart from RCU-protected page table walks.
> */
> refcount_set(&root->tdp_mmu_root_count, 0);

Not sure what you intended here, KVM should never force a refcount to '0'.

> spin_lock(&kvm->arch.tdp_mmu_pages_lock);
> list_del_rcu(&root->link);
> spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
>
> call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
>
> (Yay for xchg's implicit memory barriers)

xchg() is a very good idea. The smp_mb_*() stuff was carried over from the previous
version where this sequence set another flag in addition to role.invalid.

Is this less funky (untested)?

/*
* Invalidate the root to prevent it from being reused by a vCPU while
* the root is being zapped, i.e. to allow yielding while zapping the
* root (see below).
*
* Free the root if it's already invalid. Invalid roots must be zapped
* before their last reference is put, i.e. there's no work to be done,
* and all roots must be invalidated before they're freed (this code).
* Re-zapping invalid roots would put KVM into an infinite loop.
*
* Note, xchg() provides an implicit barrier to ensure role.invalid is
* visible if a concurrent reader acquires a reference after the root's
* refcount is reset.
*/
if (xchg(root->role.invalid, true))
spin_lock(&kvm->arch.tdp_mmu_pages_lock);
list_del_rcu(&root->link);
spin_unlock(&kvm->arch.tdp_mmu_pages_lock);

call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
return;
}


2022-03-02 03:28:00

by Ben Gardon

[permalink] [raw]
Subject: Re: [PATCH v3 22/28] KVM: x86/mmu: Zap defunct roots via asynchronous worker

On Fri, Feb 25, 2022 at 4:16 PM Sean Christopherson <[email protected]> wrote:
>
> Zap defunct roots, a.k.a. roots that have been invalidated after their
> last reference was initially dropped, asynchronously via the system work
> queue instead of forcing the work upon the unfortunate task that happened
> to drop the last reference.
>
> If a vCPU task drops the last reference, the vCPU is effectively blocked
> by the host for the entire duration of the zap. If the root being zapped
> happens to be fully populated with 4kb leaf SPTEs, e.g. due to dirty logging
> being active, the zap can take several hundred seconds. Unsurprisingly,
> most guests are unhappy if a vCPU disappears for hundreds of seconds.
>
> E.g. running a synthetic selftest that triggers a vCPU root zap with
> ~64tb of guest memory and 4kb SPTEs blocks the vCPU for 900+ seconds.
> Offloading the zap to a worker drops the block time to <100ms.
>
> Signed-off-by: Sean Christopherson <[email protected]>

Reviewed-by: Ben Gardon <[email protected]>

> ---
> arch/x86/kvm/mmu/mmu_internal.h | 8 +++-
> arch/x86/kvm/mmu/tdp_mmu.c | 65 ++++++++++++++++++++++++++++-----
> 2 files changed, 63 insertions(+), 10 deletions(-)
>
> diff --git a/arch/x86/kvm/mmu/mmu_internal.h b/arch/x86/kvm/mmu/mmu_internal.h
> index be063b6c91b7..1bff453f7cbe 100644
> --- a/arch/x86/kvm/mmu/mmu_internal.h
> +++ b/arch/x86/kvm/mmu/mmu_internal.h
> @@ -65,7 +65,13 @@ struct kvm_mmu_page {
> struct kvm_rmap_head parent_ptes; /* rmap pointers to parent sptes */
> tdp_ptep_t ptep;
> };
> - DECLARE_BITMAP(unsync_child_bitmap, 512);
> + union {
> + DECLARE_BITMAP(unsync_child_bitmap, 512);
> + struct {
> + struct work_struct tdp_mmu_async_work;
> + void *tdp_mmu_async_data;
> + };
> + };

At some point (probably not in this series since it's so long already)
it would be good to organize kvm_mmu_page. It looks like we've got
quite a few anonymous unions in there for TDP / Shadow MMU fields.

>
> struct list_head lpage_disallowed_link;
> #ifdef CONFIG_X86_32
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index ec28a88c6376..4151e61245a7 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -81,6 +81,38 @@ static void tdp_mmu_free_sp_rcu_callback(struct rcu_head *head)
> static void tdp_mmu_zap_root(struct kvm *kvm, struct kvm_mmu_page *root,
> bool shared);
>
> +static void tdp_mmu_zap_root_async(struct work_struct *work)
> +{
> + struct kvm_mmu_page *root = container_of(work, struct kvm_mmu_page,
> + tdp_mmu_async_work);
> + struct kvm *kvm = root->tdp_mmu_async_data;
> +
> + read_lock(&kvm->mmu_lock);
> +
> + /*
> + * A TLB flush is not necessary as KVM performs a local TLB flush when
> + * allocating a new root (see kvm_mmu_load()), and when migrating vCPU
> + * to a different pCPU. Note, the local TLB flush on reuse also
> + * invalidates any paging-structure-cache entries, i.e. TLB entries for
> + * intermediate paging structures, that may be zapped, as such entries
> + * are associated with the ASID on both VMX and SVM.
> + */
> + tdp_mmu_zap_root(kvm, root, true);
> +
> + /*
> + * Drop the refcount using kvm_tdp_mmu_put_root() to test its logic for
> + * avoiding an infinite loop. By design, the root is reachable while
> + * it's being asynchronously zapped, thus a different task can put its
> + * last reference, i.e. flowing through kvm_tdp_mmu_put_root() for an
> + * asynchronously zapped root is unavoidable.
> + */
> + kvm_tdp_mmu_put_root(kvm, root, true);
> +
> + read_unlock(&kvm->mmu_lock);
> +
> + kvm_put_kvm(kvm);
> +}
> +
> void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
> bool shared)
> {
> @@ -142,15 +174,26 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
> refcount_set(&root->tdp_mmu_root_count, 1);
>
> /*
> - * Zap the root, then put the refcount "acquired" above. Recursively
> - * call kvm_tdp_mmu_put_root() to test the above logic for avoiding an
> - * infinite loop by freeing invalid roots. By design, the root is
> - * reachable while it's being zapped, thus a different task can put its
> - * last reference, i.e. flowing through kvm_tdp_mmu_put_root() for a
> - * defunct root is unavoidable.
> + * Attempt to acquire a reference to KVM itself. If KVM is alive, then
> + * zap the root asynchronously in a worker, otherwise it must be zapped
> + * directly here. Wait to do this check until after the refcount is
> + * reset so that tdp_mmu_zap_root() can safely yield.
> + *
> + * In both flows, zap the root, then put the refcount "acquired" above.
> + * When putting the reference, use kvm_tdp_mmu_put_root() to test the
> + * above logic for avoiding an infinite loop by freeing invalid roots.
> + * By design, the root is reachable while it's being zapped, thus a
> + * different task can put its last reference, i.e. flowing through
> + * kvm_tdp_mmu_put_root() for a defunct root is unavoidable.
> */
> - tdp_mmu_zap_root(kvm, root, shared);
> - kvm_tdp_mmu_put_root(kvm, root, shared);
> + if (kvm_get_kvm_safe(kvm)) {
> + root->tdp_mmu_async_data = kvm;
> + INIT_WORK(&root->tdp_mmu_async_work, tdp_mmu_zap_root_async);
> + schedule_work(&root->tdp_mmu_async_work);
> + } else {
> + tdp_mmu_zap_root(kvm, root, shared);
> + kvm_tdp_mmu_put_root(kvm, root, shared);
> + }
> }
>
> enum tdp_mmu_roots_iter_type {
> @@ -954,7 +997,11 @@ void kvm_tdp_mmu_zap_all(struct kvm *kvm)
>
> /*
> * Zap all roots, including invalid roots, as all SPTEs must be dropped
> - * before returning to the caller.
> + * before returning to the caller. Zap directly even if the root is
> + * also being zapped by a worker. Walking zapped top-level SPTEs isn't
> + * all that expensive and mmu_lock is already held, which means the
> + * worker has yielded, i.e. flushing the work instead of zapping here
> + * isn't guaranteed to be any faster.
> *
> * A TLB flush is unnecessary, KVM zaps everything if and only if the VM
> * is being destroyed or the userspace VMM has exited. In both cases,
> --
> 2.35.1.574.g5d30c73bfb-goog
>

2022-03-02 04:36:22

by Sean Christopherson

[permalink] [raw]
Subject: Re: [PATCH v3 20/28] KVM: x86/mmu: Allow yielding when zapping GFNs for defunct TDP MMU root

/facepalm

After typing up all of the below, I actually tried the novel idea of compiling
the code... and we can't do xchg() on role.invalid because it occupies a single
bit, it's not a standalone boolean. I completely agree that the xchg() code is
far, far cleaner, but we'd have to sacrifice using a full byte for "smm" _and_
write some rather ugly code for retrieving a pointer to "invalid".

TL;DR: this

smp_mb__after_atomic();

if (root->role.invalid) {
return;
}

root->role.invalid = true;

smp_mb__before_atomic();

is just a weirdly open coded xchg() that operates on a single bit field.
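
A minimal sketch of the constraint, using a simplified stand-in for kvm_mmu_page_role rather than the real layout: xchg() needs an addressable word and a C bit-field has no address, so the exchange has to operate on the containing word, which is essentially what Paolo proposes further down the thread:

    #include <linux/atomic.h>
    #include <linux/types.h>

    /* Simplified stand-in for kvm_mmu_page_role; not the real definition. */
    union demo_role {
            u32 word;
            struct {
                    unsigned level:4;
                    unsigned invalid:1;     /* one bit wide, not addressable */
                    unsigned smm:8;
            };
    };

    static bool demo_mark_invalid(union demo_role *role)
    {
            union demo_role old, new = *role;

            new.invalid = 1;

            /*
             * xchg(&role->invalid, 1) does not compile because a bit-field
             * has no address; exchanging the containing word works and still
             * reports the old value of the bit.
             */
            old.word = xchg(&role->word, new.word);
            return old.invalid;
    }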


On Tue, Mar 01, 2022, Paolo Bonzini wrote:
> On 3/1/22 20:43, Sean Christopherson wrote:
> > > and after spending quite some time I wonder if all this should just be
> > >
> > > if (refcount_dec_not_one(&root->tdp_mmu_root_count))
> > > return;
> > >
> > > if (!xchg(&root->role.invalid, true) {
> >
> > The refcount being '1' means there's another task currently using root, marking
> > the root invalid will mean checks on the root's validity are non-deterministic
> > for the other task.
>
> Do you mean it's not possible to use refcount_dec_not_one, otherwise
> kvm_tdp_mmu_get_root is not guaranteed to reject the root?

Scratch my objection, KVM already assumes concurrent readers may or may not see
role.invalid as true. I deliberately went that route so as to avoid having to
require specific ordering between checking role.invalid and getting a reference.

As my comment further down states, "allocating" a new root is the only flow that
absolutely cannot consume a soon-to-be-invalid root, and it takes mmu_lock for
write so it can't be running concurrently.

So, we don't need to rely on xchg() for barriers, the only consumers of the barriers
are kvm_tdp_mmu_put_root() and they'll obviously always do an atomic xchg().

Invalidating the root while its refcount is >=1 is also ok, but I think that's
flawed for a different reason (see comments on refcount_set(..., 0)).

> > > tdp_mmu_zap_root(kvm, root, shared);
> > >
> > > /*
> > > * Do not assume the refcount is still 1: because
> > > * tdp_mmu_zap_root can yield, a different task
> > > * might have grabbed a reference to this root.
> > > *
> > > if (refcount_dec_not_one(&root->tdp_mmu_root_count))
> >
> > This is wrong, _this_ task can't drop a reference taken by the other task.
>
> This is essentially the "kvm_tdp_mmu_put_root(kvm, root, shared);" (or "goto
> beginning_of_function;") part of your patch.

Gah, I didn't read the code/comments for refcount_dec_not_one(). I assumed it
was "decrement and return true if the result is not '1'", not "decrement unless
the count is already '1', and return true if there was a decrement". In hindsight,
the former makes no sense at all...
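
For reference, a tiny sketch of the two primitives as described above; demo_put() is a hypothetical helper, not code from the series:

    #include <linux/refcount.h>

    /*
     *   refcount_dec_not_one(r)  - decrements unless the count is 1;
     *                              returns true if it decremented.
     *   refcount_dec_and_test(r) - always decrements; returns true if the
     *                              new count is 0 (last reference dropped).
     */
    static bool demo_put(refcount_t *r)
    {
            if (refcount_dec_not_one(r))
                    return false;   /* other references remain */

            /* The count was exactly 1; this decrement takes it to 0. */
            return refcount_dec_and_test(r);
    }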

> > > return;
> > > }
> > >
> > > /*
> > > * The root is invalid, and its reference count has reached
> > > * zero. It must have been zapped either in the "if" above or
> > > * by someone else, and we're definitely the last thread to see
> > > * it apart from RCU-protected page table walks.
> > > */
> > > refcount_set(&root->tdp_mmu_root_count, 0);
> >
> > Not sure what you intended here, KVM should never force a refcount to '0'.
>
> It's turning a refcount_dec_not_one into a refcount_dec_and_test. It seems
> legit to me, because the only refcount_inc_not_zero is in a write-side
> critical section. If the refcount goes to zero on the read-side, the root
> is gone for good.

The issue is that by using refcount_dec_not_one() above, there's no guarantee that
this task is the last one to see it as kvm_tdp_mmu_get_root() can succeed and bump
the refcount between refcount_dec_not_one() and here. Freeing the root would lead
to use-after-free because iterators (rightly) assume that RCU protection isn't
needed once they have a reference. RCU protection is needed only if the user of the
iterator wants to dereference page table memory.

> > xchg() is a very good idea. The smp_mb_*() stuff was carried over from the previous
> > version where this sequence set another flag in addition to role.invalid.
> >
> > Is this less funky (untested)?
> >
> > /*
> > * Invalidate the root to prevent it from being reused by a vCPU while
> > * the root is being zapped, i.e. to allow yielding while zapping the
> > * root (see below).
> > *
> > * Free the root if it's already invalid. Invalid roots must be zapped
> > * before their last reference is put, i.e. there's no work to be done,
> > * and all roots must be invalidated before they're freed (this code).
> > * Re-zapping invalid roots would put KVM into an infinite loop.
> > *
> > * Note, xchg() provides an implicit barrier to ensure role.invalid is
> > * visible if a concurrent reader acquires a reference after the root's
> > * refcount is reset.
> > */
> > if (xchg(root->role.invalid, true))
> > spin_lock(&kvm->arch.tdp_mmu_pages_lock);
> > list_del_rcu(&root->link);
> > spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
> >
> > call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
> > return;
> > }
>
> Based on my own version, I guess you mean (without comments due to family
> NMI):
>
> if (!refcount_dec_and_test(&root->tdp_mmu_root_count))
> return;
>
> if (!xchg(&root->role.invalid, true) {
> > refcount_set(&root->tdp_mmu_root_count, 1);
> tdp_mmu_zap_root(kvm, root, shared);
> if (!refcount_dec_and_test(&root->tdp_mmu_root_count))
> return;
> }
>
> spin_lock(&kvm->arch.tdp_mmu_pages_lock);
> list_del_rcu(&root->link);
> spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
> call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);

That would work, though I'd prefer to recurse on kvm_tdp_mmu_put_root() instead
of open coding refcount_dec_and_test() so that we get coverage of the xchg()
doing the right thing.

I still slightly prefer having the "free" path be inside the xchg(). To me, even
though the "free" path is the only one that's guaranteed to be reached for every root,
the fall-through to resetting the refcount and zapping the root is the "normal" path,
and the "free" path is the exception.

2022-03-02 04:45:27

by Ben Gardon

[permalink] [raw]
Subject: Re: [PATCH v3 23/28] KVM: x86/mmu: Check for a REMOVED leaf SPTE before making the SPTE

On Fri, Feb 25, 2022 at 4:16 PM Sean Christopherson <[email protected]> wrote:
>
> Explicitly check for a REMOVED leaf SPTE prior to attempting to map
> the final SPTE when handling a TDP MMU fault. Functionally, this is a
> nop as tdp_mmu_set_spte_atomic() will eventually detect the frozen SPTE.
> Pre-checking for a REMOVED SPTE is a minor optimization, but the real goal
> is to allow tdp_mmu_set_spte_atomic() to have an invariant that the "old"
> SPTE is never a REMOVED SPTE.
>
> Signed-off-by: Sean Christopherson <[email protected]>

Reviewed-by: Ben Gardon <[email protected]>

> ---
> arch/x86/kvm/mmu/tdp_mmu.c | 6 +++++-
> 1 file changed, 5 insertions(+), 1 deletion(-)
>
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index 4151e61245a7..1acd12bf309f 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -1250,7 +1250,11 @@ int kvm_tdp_mmu_map(struct kvm_vcpu *vcpu, struct kvm_page_fault *fault)
> }
> }
>
> - if (iter.level != fault->goal_level) {
> + /*
> + * Force the guest to retry the access if the upper level SPTEs aren't
> + * in place, or if the target leaf SPTE is frozen by another CPU.
> + */
> + if (iter.level != fault->goal_level || is_removed_spte(iter.old_spte)) {
> rcu_read_unlock();
> return RET_PF_RETRY;
> }
> --
> 2.35.1.574.g5d30c73bfb-goog
>

2022-03-02 09:09:51

by Paolo Bonzini

[permalink] [raw]
Subject: Re: [PATCH v3 20/28] KVM: x86/mmu: Allow yielding when zapping GFNs for defunct TDP MMU root

On 3/1/22 20:43, Sean Christopherson wrote:
>> and after spending quite some time I wonder if all this should just be
>>
>> if (refcount_dec_not_one(&root->tdp_mmu_root_count))
>> return;
>>
>> if (!xchg(&root->role.invalid, true) {
>
> The refcount being '1' means there's another task currently using root, marking
> the root invalid will mean checks on the root's validity are non-deterministic
> for the other task.

Do you mean it's not possible to use refcount_dec_not_one, otherwise
kvm_tdp_mmu_get_root is not guaranteed to reject the root?

>> tdp_mmu_zap_root(kvm, root, shared);
>>
>> /*
>> * Do not assume the refcount is still 1: because
>> * tdp_mmu_zap_root can yield, a different task
>> * might have grabbed a reference to this root.
>> *
>> if (refcount_dec_not_one(&root->tdp_mmu_root_count))
>
> This is wrong, _this_ task can't drop a reference taken by the other task.

This is essentially the "kvm_tdp_mmu_put_root(kvm, root, shared);" (or
"goto beginning_of_function;") part of your patch.

>> return;
>> }
>>
>> /*
>> * The root is invalid, and its reference count has reached
>> * zero. It must have been zapped either in the "if" above or
>> * by someone else, and we're definitely the last thread to see
>> * it apart from RCU-protected page table walks.
>> */
>> refcount_set(&root->tdp_mmu_root_count, 0);
>
> Not sure what you intended here, KVM should never force a refcount to '0'.

It's turning a refcount_dec_not_one into a refcount_dec_and_test. It
seems legit to me, because the only refcount_inc_not_zero is in a
write-side critical section. If the refcount goes to zero on the
read-side, the root is gone for good.

> xchg() is a very good idea. The smp_mb_*() stuff was carried over from the previous
> version where this sequence set another flag in addition to role.invalid.
>
> Is this less funky (untested)?
>
> /*
> * Invalidate the root to prevent it from being reused by a vCPU while
> * the root is being zapped, i.e. to allow yielding while zapping the
> * root (see below).
> *
> * Free the root if it's already invalid. Invalid roots must be zapped
> * before their last reference is put, i.e. there's no work to be done,
> * and all roots must be invalidated before they're freed (this code).
> * Re-zapping invalid roots would put KVM into an infinite loop.
> *
> * Note, xchg() provides an implicit barrier to ensure role.invalid is
> * visible if a concurrent reader acquires a reference after the root's
> * refcount is reset.
> */
> if (xchg(root->role.invalid, true))
> spin_lock(&kvm->arch.tdp_mmu_pages_lock);
> list_del_rcu(&root->link);
> spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
>
> call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
> return;
> }

Based on my own version, I guess you mean (without comments due to
family NMI):

if (!refcount_dec_and_test(&root->tdp_mmu_root_count))
return;

if (!xchg(&root->role.invalid, true) {
refcount_set(&root->tdp_mmu_root_count, 1);
tdp_mmu_zap_root(kvm, root, shared);
if (!refcount_dec_and_test(&root->tdp_mmu_root_count))
return;
}

spin_lock(&kvm->arch.tdp_mmu_pages_lock);
list_del_rcu(&root->link);
spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);

Paolo

2022-03-02 20:48:39

by Paolo Bonzini

[permalink] [raw]
Subject: Re: [PATCH v3 22/28] KVM: x86/mmu: Zap defunct roots via asynchronous worker

On 2/26/22 01:15, Sean Christopherson wrote:
> Zap defunct roots, a.k.a. roots that have been invalidated after their
> last reference was initially dropped, asynchronously via the system work
> queue instead of forcing the work upon the unfortunate task that happened
> to drop the last reference.
>
> If a vCPU task drops the last reference, the vCPU is effectively blocked
> by the host for the entire duration of the zap. If the root being zapped
> happens to be fully populated with 4kb leaf SPTEs, e.g. due to dirty logging
> being active, the zap can take several hundred seconds. Unsurprisingly,
> most guests are unhappy if a vCPU disappears for hundreds of seconds.
>
> E.g. running a synthetic selftest that triggers a vCPU root zap with
> ~64tb of guest memory and 4kb SPTEs blocks the vCPU for 900+ seconds.
> Offloading the zap to a worker drops the block time to <100ms.
>
> Signed-off-by: Sean Christopherson <[email protected]>
> ---

Do we even need kvm_tdp_mmu_zap_invalidated_roots() now? That is,
something like the following:

diff --git a/arch/x86/kvm/mmu/mmu.c b/arch/x86/kvm/mmu/mmu.c
index bd3625a875ef..5fd8bc858c6f 100644
--- a/arch/x86/kvm/mmu/mmu.c
+++ b/arch/x86/kvm/mmu/mmu.c
@@ -5698,6 +5698,16 @@ static void kvm_mmu_zap_all_fast(struct kvm *kvm)
{
lockdep_assert_held(&kvm->slots_lock);

+ /*
+ * kvm_tdp_mmu_invalidate_all_roots() needs a nonzero reference
+ * count. If we're dying, zap everything as it's going to happen
+ * soon anyway.
+ */
+ if (!refcount_read(&kvm->users_count)) {
+ kvm_mmu_zap_all(kvm);
+ return;
+ }
+
write_lock(&kvm->mmu_lock);
trace_kvm_mmu_zap_all_fast(kvm);

@@ -5732,20 +5742,6 @@ static void kvm_mmu_zap_all_fast(struct kvm *kvm)
kvm_zap_obsolete_pages(kvm);

write_unlock(&kvm->mmu_lock);
-
- /*
- * Zap the invalidated TDP MMU roots, all SPTEs must be dropped before
- * returning to the caller, e.g. if the zap is in response to a memslot
- * deletion, mmu_notifier callbacks will be unable to reach the SPTEs
- * associated with the deleted memslot once the update completes, and
- * Deferring the zap until the final reference to the root is put would
- * lead to use-after-free.
- */
- if (is_tdp_mmu_enabled(kvm)) {
- read_lock(&kvm->mmu_lock);
- kvm_tdp_mmu_zap_invalidated_roots(kvm);
- read_unlock(&kvm->mmu_lock);
- }
}

static bool kvm_has_zapped_obsolete_pages(struct kvm *kvm)
diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index cd1bf68e7511..af9db5b8f713 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -142,10 +142,12 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
WARN_ON(!root->tdp_mmu_page);

/*
- * The root now has refcount=0 and is valid. Readers cannot acquire
- * a reference to it (they all visit valid roots only, except for
- * kvm_tdp_mmu_zap_invalidated_roots() which however does not acquire
- * any reference itself.
+ * The root now has refcount=0. It is valid, but readers already
+ * cannot acquire a reference to it because kvm_tdp_mmu_get_root()
+ * rejects it. This remains true for the rest of the execution
+ * of this function, because readers visit valid roots only
+ * (except for tdp_mmu_zap_root_work(), which however operates only
+ * on one specific root and does not acquire any reference itself).

*
* Even though there are flows that need to visit all roots for
* correctness, they all take mmu_lock for write, so they cannot yet
@@ -996,103 +994,16 @@ void kvm_tdp_mmu_zap_all(struct kvm *kvm)
}
}

-static struct kvm_mmu_page *next_invalidated_root(struct kvm *kvm,
- struct kvm_mmu_page *prev_root)
-{
- struct kvm_mmu_page *next_root;
-
- if (prev_root)
- next_root = list_next_or_null_rcu(&kvm->arch.tdp_mmu_roots,
- &prev_root->link,
- typeof(*prev_root), link);
- else
- next_root = list_first_or_null_rcu(&kvm->arch.tdp_mmu_roots,
- typeof(*next_root), link);
-
- while (next_root && !(next_root->role.invalid &&
- refcount_read(&next_root->tdp_mmu_root_count)))
- next_root = list_next_or_null_rcu(&kvm->arch.tdp_mmu_roots,
- &next_root->link,
- typeof(*next_root), link);
-
- return next_root;
-}
-
-/*
- * Zap all invalidated roots to ensure all SPTEs are dropped before the "fast
- * zap" completes. Since kvm_tdp_mmu_invalidate_all_roots() has acquired a
- * reference to each invalidated root, roots will not be freed until after this
- * function drops the gifted reference, e.g. so that vCPUs don't get stuck with
- * tearing paging structures.
- */
-void kvm_tdp_mmu_zap_invalidated_roots(struct kvm *kvm)
-{
- struct kvm_mmu_page *next_root;
- struct kvm_mmu_page *root;
-
- lockdep_assert_held_read(&kvm->mmu_lock);
-
- rcu_read_lock();
-
- root = next_invalidated_root(kvm, NULL);
-
- while (root) {
- next_root = next_invalidated_root(kvm, root);
-
- rcu_read_unlock();
-
- /*
- * Zap the root regardless of what marked it invalid, e.g. even
- * if the root was marked invalid by kvm_tdp_mmu_put_root() due
- * to its last reference being put. All SPTEs must be dropped
- * before returning to the caller, e.g. if a memslot is deleted
- * or moved, the memslot's associated SPTEs are unreachable via
- * the mmu_notifier once the memslot update completes.
- *
- * A TLB flush is unnecessary, invalidated roots are guaranteed
- * to be unreachable by the guest (see kvm_tdp_mmu_put_root()
- * for more details), and unlike the legacy MMU, no vCPU kick
- * is needed to play nice with lockless shadow walks as the TDP
- * MMU protects its paging structures via RCU. Note, zapping
- * will still flush on yield, but that's a minor performance
- * blip and not a functional issue.
- */
- tdp_mmu_zap_root(kvm, root, true);
-
- /*
- * Put the reference acquired in
- * kvm_tdp_mmu_invalidate_roots
- */
- kvm_tdp_mmu_put_root(kvm, root, true);
-
- root = next_root;
-
- rcu_read_lock();
- }
-
- rcu_read_unlock();
-}
-
/*
* Mark each TDP MMU root as invalid to prevent vCPUs from reusing a root that
- * is about to be zapped, e.g. in response to a memslots update. The caller is
- * responsible for invoking kvm_tdp_mmu_zap_invalidated_roots() to the actual
- * zapping.
- *
- * Take a reference on all roots to prevent the root from being freed before it
- * is zapped by this thread. Freeing a root is not a correctness issue, but if
- * a vCPU drops the last reference to a root prior to the root being zapped, it
- * will get stuck with tearing down the entire paging structure.
- *
- * Get a reference even if the root is already invalid,
- * kvm_tdp_mmu_zap_invalidated_roots() assumes it was gifted a reference to all
- * invalid roots, e.g. there's no epoch to identify roots that were invalidated
- * by a previous call. Roots stay on the list until the last reference is
- * dropped, so even though all invalid roots are zapped, a root may not go away
- * for quite some time, e.g. if a vCPU blocks across multiple memslot updates.
+ * is about to be zapped, e.g. in response to a memslots update. The actual
+ * zapping is performed asynchronously, so a reference is taken on all roots
+ * as well as (once per root) on the struct kvm.
*
- * Because mmu_lock is held for write, it should be impossible to observe a
- * root with zero refcount, i.e. the list of roots cannot be stale.
+ * Get a reference even if the root is already invalid, the asynchronous worker
+ * assumes it was gifted a reference to the root it processes. Because mmu_lock
+ * is held for write, it should be impossible to observe a root with zero refcount,
+ * i.e. the list of roots cannot be stale.
*
* This has essentially the same effect for the TDP MMU
* as updating mmu_valid_gen does for the shadow MMU.
@@ -1103,8 +1014,11 @@ void kvm_tdp_mmu_invalidate_all_roots(struct kvm *kvm)

lockdep_assert_held_write(&kvm->mmu_lock);
list_for_each_entry(root, &kvm->arch.tdp_mmu_roots, link) {
- if (!WARN_ON_ONCE(!kvm_tdp_mmu_get_root(root)))
+ if (!WARN_ON_ONCE(!kvm_tdp_mmu_get_root(root))) {
root->role.invalid = true;
+ kvm_get_kvm(kvm);
+ tdp_mmu_schedule_zap_root(kvm, root);
+ }
}
}


It passes a smoke test, and also resolves the debate on the fate of patch 1.

However, I think we now need a module_get/module_put when creating/destroying
a VM; the workers can outlive kvm_vm_release and therefore any reference
automatically taken by VFS's fops_get/fops_put.

Paolo
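
A rough sketch of the module-pinning idea; which module pointer gets pinned (THIS_MODULE here) and where the hooks live are assumptions, not the fix that was actually posted:

    #include <linux/module.h>

    /* The point is only that workers holding a kvm reference must not
     * outlive the module text. */
    static int demo_vm_created(void)
    {
            if (!try_module_get(THIS_MODULE))
                    return -ENODEV;
            return 0;
    }

    static void demo_vm_destroyed(void)
    {
            module_put(THIS_MODULE);
    }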

2022-03-02 21:30:57

by Sean Christopherson

[permalink] [raw]
Subject: Re: [PATCH v3 22/28] KVM: x86/mmu: Zap defunct roots via asynchronous worker

On Wed, Mar 02, 2022, Paolo Bonzini wrote:
> On 3/2/22 19:01, Sean Christopherson wrote:
> > > It passes a smoke test, and also resolves the debate on the fate of patch 1.
> > +1000, I love this approach. Do you want me to work on a v3, or shall I let you
> > have the honors?
>
> I'm already running the usual battery of tests, so I should be able to post
> it either tomorrow (early in my evening) or Friday morning.

Gah, now I remember why I didn't use an async worker. kvm_mmu_zap_all_fast()
must ensure all SPTEs are zapped and their dirty/accessed data written back to
the primary MMU prior to returning. Once the memslot update completes, the old
deleted/moved memslot is no longer reachable by the mmu_notifier. If an mmu_notifier
zaps pfns reachable via the root, KVM will do nothing because there's no relevant
memslot.

So we can use workers, but kvm_mmu_zap_all_fast() would need to flush all workers
before returning, which ends up being no different than putting the invalid roots
on a different list.

What about that idea? Put roots invalidated by "fast zap" on _another_ list?
My very original idea of moving the roots to a separate list didn't work because
the roots needed to be reachable by the mmu_notifier. But we could just add
another list_head (inside the unsync_child_bitmap union) and add the roots to
_that_ list.

Let me go resurrect that patch from v1 and tweak it to keep the roots on the old
list, but add them to a new list as well. That would get rid of the invalid
root iterator stuff.
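
A rough sketch of the "second list" idea under discussion; the type, field, and list names below are hypothetical, not from any posted patch:

    #include <linux/bitmap.h>
    #include <linux/list.h>

    struct demo_mmu_page {
            union {
                    DECLARE_BITMAP(unsync_child_bitmap, 512);
                    struct list_head invalid_link;  /* new, aliases the bitmap */
            };
    };

    static LIST_HEAD(demo_invalid_roots);

    static void demo_track_invalid_root(struct demo_mmu_page *root)
    {
            /*
             * The root stays on the main roots list so the mmu_notifier can
             * still reach it; it is additionally linked here so the "fast
             * zap" only walks the roots it invalidated.
             */
            list_add_tail(&root->invalid_link, &demo_invalid_roots);
    }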

2022-03-02 22:23:00

by Paolo Bonzini

[permalink] [raw]
Subject: Re: [PATCH v3 22/28] KVM: x86/mmu: Zap defunct roots via asynchronous worker

On 3/2/22 19:33, David Matlack wrote:
> On Wed, Mar 2, 2022 at 9:35 AM Sean Christopherson <[email protected]> wrote:
>>
>> On Wed, Mar 02, 2022, Paolo Bonzini wrote:
>>> However, I think we now need a module_get/module_put when creating/destroying
>>> a VM; the workers can outlive kvm_vm_release and therefore any reference
>>> automatically taken by VFS's fops_get/fops_put.
>>
>> Haven't read the rest of the patch, but this caught my eye. We _already_ need
>> to handle this scenario. As you noted, any worker, i.e. anything that takes a
>> reference via kvm_get_kvm() without any additional guarantee that the module can't
>> be unloaded is suspect. x86 is mostly fine, though kvm_setup_async_pf() is likely
>> affected, and other architectures seem to have bugs.
>>
>> Google has an internal patch that addresses this. I believe David is going to post
>> the fix... David?
>
> This was towards the back of my queue but I can bump it to the front.
> I'll have the patches out this week.

Thanks!

Paolo

2022-03-02 22:33:10

by Sean Christopherson

[permalink] [raw]
Subject: Re: [PATCH v3 22/28] KVM: x86/mmu: Zap defunct roots via asynchronous worker

On Wed, Mar 02, 2022, Paolo Bonzini wrote:
> However, I think we now need a module_get/module_put when creating/destroying
> a VM; the workers can outlive kvm_vm_release and therefore any reference
> automatically taken by VFS's fops_get/fops_put.

Haven't read the rest of the patch, but this caught my eye. We _already_ need
to handle this scenario. As you noted, any worker, i.e. anything that takes a
reference via kvm_get_kvm() without any additional guarantee that the module can't
be unloaded is suspect. x86 is mostly fine, though kvm_setup_async_pf() is likely
affected, and other architectures seem to have bugs.

Google has an internal patch that addresses this. I believe David is going to post
the fix... David?

2022-03-02 22:41:23

by Paolo Bonzini

[permalink] [raw]
Subject: Re: [PATCH v3 22/28] KVM: x86/mmu: Zap defunct roots via asynchronous worker

On 3/2/22 21:47, Sean Christopherson wrote:
> On Wed, Mar 02, 2022, Paolo Bonzini wrote:
>> For now let's do it the simple but ugly way. Keeping
>> next_invalidated_root() does not make things worse than the status quo, and
>> further work will be easier to review if it's kept separate from this
>> already-complex work.
>
> Oof, that's not gonna work. My approach here in v3 doesn't work either. I finally
> remembered why I had the dedicated tdp_mmu_defunct_root flag and thus the smp_mb_*()
> dance.
>
> kvm_tdp_mmu_zap_invalidated_roots() assumes that it was gifted a reference to
> _all_ invalid roots by kvm_tdp_mmu_invalidate_all_roots(). This works in the
> current code base only because kvm->slots_lock is held for the entire duration,
> i.e. roots can't become invalid between the end of kvm_tdp_mmu_invalidate_all_roots()
> and the end of kvm_tdp_mmu_zap_invalidated_roots().

Yeah, of course that doesn't work if kvm_tdp_mmu_zap_invalidated_roots()
calls kvm_tdp_mmu_put_root() and the worker also does the same
kvm_tdp_mmu_put_root().

But, it seems to me that we were so close to something that works and is
elegant with the worker idea. It does avoid the possibility of two
"puts", because the work item is created on the valid->invalid
transition. What do you think of having a separate workqueue for each
struct kvm, so that kvm_tdp_mmu_zap_invalidated_roots() can be replaced
with a flush? I can probably do it next Friday.

Paolo
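
A minimal sketch of the per-VM workqueue idea, with hypothetical names and a file-scope pointer standing in for a field in struct kvm:

    #include <linux/errno.h>
    #include <linux/workqueue.h>

    static struct workqueue_struct *demo_zap_wq;

    static int demo_init_zap_wq(void)
    {
            demo_zap_wq = alloc_workqueue("kvm-tdp-mmu-zap", WQ_UNBOUND, 0);
            return demo_zap_wq ? 0 : -ENOMEM;
    }

    static void demo_schedule_zap_root(struct work_struct *work)
    {
            /* Queued once per root on the valid->invalid transition. */
            queue_work(demo_zap_wq, work);
    }

    static void demo_zap_invalidated_roots(void)
    {
            /*
             * Instead of walking invalidated roots itself, the "fast zap"
             * just waits for every queued zap to finish before the memslot
             * update is allowed to complete.
             */
            flush_workqueue(demo_zap_wq);
    }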

>
> Marking a root invalid in kvm_tdp_mmu_put_root() breaks that assumption, e.g. if a
> new root is created and then dropped, it will be marked invalid but the "fast zap"
> will not have a reference. The "defunct" flag prevents this scenario by allowing
> the "fast zap" path to identify invalid roots for which it did not take a reference.
> By virtue of holding a reference, "fast zap" also guarantees that the roots it needs
> to invalidate and put can't become defunct.
>
> My preference would be to either go back to a variant of v2, or to implement my
> "second list" idea.
>
> I also need to figure out why I didn't encounter errors in v3, because I distinctly
> remember underflowing the refcount before adding the defunct flag...

2022-03-02 23:20:04

by David Woodhouse

[permalink] [raw]
Subject: Re: [PATCH v3 26/28] KVM: selftests: Split out helper to allocate guest mem via memfd

On Wed, 2022-03-02 at 19:36 +0100, Paolo Bonzini wrote:
> On 3/1/22 00:36, David Woodhouse wrote:
> > On Sat, 2022-02-26 at 00:15 +0000, Sean Christopherson wrote:
> > > Extract the code for allocating guest memory via memfd out of
> > > vm_userspace_mem_region_add() and into a new helper, kvm_memfd_alloc().
> > > A future selftest to populate a guest with the maximum amount of guest
> > > memory will abuse KVM's memslots to alias guest memory regions to a
> > > single memfd-backed host region, i.e. needs to back a guest with memfd
> > > memory without a 1:1 association between a memslot and a memfd instance.
> > >
> > > No functional change intended.
> > >
> > > Signed-off-by: Sean Christopherson <[email protected]>
> >
> > While we're at it, please can we make the whole thing go away and just
> > return failure #ifndef MFD_CLOEXEC, instead of breaking the build on
> > older userspace?
>
> We can just use old school F_SETFD if that's helpful for you.

The system on which I had that problem doesn't have memfd at all. As a
local hack just to build the TSC test case on a system that actually
supports TSC scaling, I just made it all go away thus:

--- a/tools/testing/selftests/kvm/lib/kvm_util.c
+++ b/tools/testing/selftests/kvm/lib/kvm_util.c
@@ -938,6 +938,7 @@ void vm_userspace_mem_region_add(struct kvm_vm *vm,

region->fd = -1;
if (backing_src_is_shared(src_type)) {
+#ifdef MFD_CLOEXEC
int memfd_flags = MFD_CLOEXEC;

if (src_type == VM_MEM_SRC_SHARED_HUGETLB)
@@ -954,6 +955,9 @@ void vm_userspace_mem_region_add(struct kvm_vm *vm,
FALLOC_FL_PUNCH_HOLE | FALLOC_FL_KEEP_SIZE, 0,
region->mmap_size);
TEST_ASSERT(ret == 0, "fallocate failed, errno: %i", errno);
+#else
+ TEST_ASSERT(false, "memfd support not present");
+#endif
}

region->mmap_start = mmap(NULL, region->mmap_size,



2022-03-02 23:21:18

by Paolo Bonzini

[permalink] [raw]
Subject: Re: [PATCH v3 22/28] KVM: x86/mmu: Zap defunct roots via asynchronous worker

On 3/2/22 19:01, Sean Christopherson wrote:
>> + */
>> + if (!refcount_read(&kvm->users_count)) {
>> + kvm_mmu_zap_all(kvm);
>> + return;
>> + }
>
> I'd prefer we make this an assertion and shove this logic to set_nx_huge_pages(),
> because in that case there's no need to zap anything, the guest can never run
> again. E.g. (I'm trying to remember why I didn't do this before...)

I did it this way because it seemed like a reasonable fallback for any
present or future caller.

> One thing that keeps tripping me up is the "readers" verbiage. I get confused
> because taking mmu_lock for read vs. write doesn't really have anything to do with
> reading or writing state, e.g. "readers" still write SPTEs, and so I keep thinking
> "readers" means anything iterating over the set of roots. Not sure if there's a
> shorthand that won't be confusing.

Not that I know of. You really need to know that the rwlock is being
used for its shared/exclusive locking behavior. But even on other OSes
that use shared/exclusive instead of read/write, there are no analogous
nouns, and people end up using readers/writers anyway.

>> It passes a smoke test, and also resolves the debate on the fate of patch 1.
> +1000, I love this approach. Do you want me to work on a v3, or shall I let you
> have the honors?

I'm already running the usual battery of tests, so I should be able to
post it either tomorrow (early in my evening) or Friday morning.

Paolo

2022-03-02 23:28:01

by Paolo Bonzini

[permalink] [raw]
Subject: Re: [PATCH v3 22/28] KVM: x86/mmu: Zap defunct roots via asynchronous worker

On 3/2/22 20:33, Sean Christopherson wrote:
> What about that idea? Put roots invalidated by "fast zap" on _another_ list?
> My very original idea of moving the roots to a separate list didn't work because
> the roots needed to be reachable by the mmu_notifier. But we could just add
> another list_head (inside the unsync_child_bitmap union) and add the roots to
> _that_ list.

Perhaps the "separate list" idea could be extended to have a single
worker for all kvm_tdp_mmu_put_root() work, and then indeed replace
kvm_tdp_mmu_zap_invalidated_roots() with a flush of _that_ worker. The
disadvantage is a little less parallelism in zapping invalidated roots;
but what is good for kvm_tdp_mmu_zap_invalidated_roots() is just as good
for kvm_tdp_mmu_put_root(), I suppose. If one wants separate work
items, KVM could have its own workqueue, and then you flush that workqueue.

For now let's do it the simple but ugly way. Keeping
next_invalidated_root() does not make things worse than the status quo,
and further work will be easier to review if it's kept separate from
this already-complex work.

Paolo

2022-03-02 23:38:14

by David Matlack

[permalink] [raw]
Subject: Re: [PATCH v3 22/28] KVM: x86/mmu: Zap defunct roots via asynchronous worker

On Wed, Mar 2, 2022 at 9:35 AM Sean Christopherson <[email protected]> wrote:
>
> On Wed, Mar 02, 2022, Paolo Bonzini wrote:
> > However, I think we now need a module_get/module_put when creating/destroying
> > a VM; the workers can outlive kvm_vm_release and therefore any reference
> > automatically taken by VFS's fops_get/fops_put.
>
> Haven't read the rest of the patch, but this caught my eye. We _already_ need
> to handle this scenario. As you noted, any worker, i.e. anything that takes a
> reference via kvm_get_kvm() without any additional guarantee that the module can't
> be unloaded is suspect. x86 is mostly fine, though kvm_setup_async_pf() is likely
> affected, and other architectures seem to have bugs.
>
> Google has an internal patch that addresses this. I believe David is going to post
> the fix... David?

This was towards the back of my queue but I can bump it to the front.
I'll have the patches out this week.

2022-03-02 23:55:06

by Sean Christopherson

[permalink] [raw]
Subject: Re: [PATCH v3 01/28] KVM: x86/mmu: Use common iterator for walking invalid TDP MMU roots

On Wed, Mar 02, 2022, Mingwei Zhang wrote:
> On Sat, Feb 26, 2022, Sean Christopherson wrote:
> > Now that tdp_mmu_next_root() can process both valid and invalid roots,
> > extend it to be able to process _only_ invalid roots, add yet another
> > iterator macro for walking invalid roots, and use the new macro in
> > kvm_tdp_mmu_zap_invalidated_roots().
> >
> > No functional change intended.
> >
> > Reviewed-by: David Matlack <[email protected]>
> > Signed-off-by: Sean Christopherson <[email protected]>
> > ---
> > arch/x86/kvm/mmu/tdp_mmu.c | 74 ++++++++++++++------------------------
> > 1 file changed, 26 insertions(+), 48 deletions(-)
> >
> > diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> > index debf08212f12..25148e8b711d 100644
> > --- a/arch/x86/kvm/mmu/tdp_mmu.c
> > +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> > @@ -98,6 +98,12 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
> > call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
> > }
> >
> > +enum tdp_mmu_roots_iter_type {
> > + ALL_ROOTS = -1,
> > + VALID_ROOTS = 0,
> > + INVALID_ROOTS = 1,
> > +};
>
> I am wondering what the trick is to start from -1?

-1 is arbitrary, any non-zero value would work. More below.

> > /*
> > * Returns the next root after @prev_root (or the first root if @prev_root is
> > * NULL). A reference to the returned root is acquired, and the reference to
> > @@ -110,10 +116,16 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
> > */
> > static struct kvm_mmu_page *tdp_mmu_next_root(struct kvm *kvm,
> > struct kvm_mmu_page *prev_root,
> > - bool shared, bool only_valid)
> > + bool shared,
> > + enum tdp_mmu_roots_iter_type type)
> > {
> > struct kvm_mmu_page *next_root;
> >
> > + kvm_lockdep_assert_mmu_lock_held(kvm, shared);
> > +
> > + /* Ensure correctness for the below comparison against role.invalid. */
> > + BUILD_BUG_ON(!!VALID_ROOTS || !INVALID_ROOTS);
> > +
> > rcu_read_lock();
> >
> > if (prev_root)
> > @@ -125,7 +137,7 @@ static struct kvm_mmu_page *tdp_mmu_next_root(struct kvm *kvm,
> > typeof(*next_root), link);
> >
> > while (next_root) {
> > - if ((!only_valid || !next_root->role.invalid) &&
> > + if ((type == ALL_ROOTS || (type == !!next_root->role.invalid)) &&

This is the code that deals with the enums. It's making the type a tri-state,
where the values of VALID_ROOTS and INVALID_ROOTS align with converting role.invalid
to a boolean (always '0' or '1') so that they can be directly compared as above.

Any value for ALL_ROOTS (other than '0' or '1' obviously) would work since the
above logic requires ALL_ROOTS to be explicitly checked first.
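
Spelled out as a hypothetical helper (the patch open codes this in the while loop), the filter reads:

    #include <linux/types.h>

    enum demo_roots_iter_type {
            DEMO_ALL_ROOTS = -1,
            DEMO_VALID_ROOTS = 0,
            DEMO_INVALID_ROOTS = 1,
    };

    static bool demo_root_matches(enum demo_roots_iter_type type, bool invalid)
    {
            /*
             * ALL_ROOTS (-1) never equals 0 or 1, so it is checked first;
             * otherwise the enum value doubles as the expected value of
             * !!role.invalid: 0 selects valid roots, 1 invalid ones.
             */
            return type == DEMO_ALL_ROOTS || type == invalid;
    }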

> > kvm_tdp_mmu_get_root(next_root))
> > break;
> >

2022-03-02 23:58:10

by Sean Christopherson

[permalink] [raw]
Subject: Re: [PATCH v3 20/28] KVM: x86/mmu: Allow yielding when zapping GFNs for defunct TDP MMU root

On Wed, Mar 02, 2022, Paolo Bonzini wrote:
> On 3/2/22 03:13, Sean Christopherson wrote:
> > That would work, though I'd prefer to recurse on kvm_tdp_mmu_put_root() instead
> > of open coding refcount_dec_and_test() so that we get coverage of the xchg()
> > doing the right thing.
> >
> > I still slightly prefer having the "free" path be inside the xchg(). To me, even
> > though the "free" path is the only one that's guaranteed to be reached for every root,
> > the fall-through to resetting the refcount and zapping the root is the "normal" path,
> > and the "free" path is the exception.
>
> Hmm I can see how that makes sense, especially once you add in the worker logic.
> But it seems to me that the "basic" logic should be "do both the xchg and the
> free", and coding the function with tail recursion obfuscates this. Even with
> the worker, you grow an
>
> + if (kvm_get_kvm_safe(kvm)) {
> + ... let the worker do it ...
> + return;
> + }
> +
> tdp_mmu_zap_root(kvm, root, shared);
>
> but you still have a downwards flow that matches what happens even if multiple
> threads pick up different parts of the job.
>
> So, I tried a bunch of alternatives including with gotos and with if/else, but
> really the above one remains my favorite.

Works for me.

> My plan would be:
>
> 1) splice the mini series I'm attaching before this patch, and
> remove patch 1 of this series. next_invalidated_root() is a
> bit yucky, but notably it doesn't need to add/remove a reference
> in kvm_tdp_mmu_zap_invalidated_roots().
>
> Together, these two steps ensure that readers never acquire a
> reference to either refcount=0/valid or invalid pages. In other
> words, the three states that kvm_tdp_mmu_put_root moves the root
> through (refcount=0/valid -> refcount=0/invalid -> refcount=1/invalid)
> are exactly the same to readers, and there are essentially no races
> to worry about.
>
> In other other words, it's trading slightly uglier code for simpler
> invariants.

I'm glad you thought of the "immediately send to a worker" idea, I don't know if
I could stomach next_invalidated_root() continuing to live :-)

> @@ -879,7 +879,8 @@ bool kvm_tdp_mmu_zap_leafs(struct kvm *kvm, int as_id, gfn_t start, gfn_t end,
> {
> struct kvm_mmu_page *root;
>
> - for_each_tdp_mmu_root_yield_safe(kvm, root, as_id, false)
> + lockdep_assert_held_write(&kvm->mmu_lock);
> + for_each_tdp_mmu_root_yield_safe(kvm, root, as_id)
> flush = tdp_mmu_zap_leafs(kvm, root, start, end, can_yield, false);
>
> return flush;
> @@ -895,8 +896,9 @@ void kvm_tdp_mmu_zap_all(struct kvm *kvm)
> * is being destroyed or the userspace VMM has exited. In both cases,
> * KVM_RUN is unreachable, i.e. no vCPUs will ever service the request.
> */
> + lockdep_assert_held_write(&kvm->mmu_lock);
> for (i = 0; i < KVM_ADDRESS_SPACE_NUM; i++) {
> - for_each_tdp_mmu_root_yield_safe(kvm, root, i, false)
> + for_each_tdp_mmu_root_yield_safe(kvm, root, i)
> tdp_mmu_zap_root(kvm, root, false);
> }
> }
> --

If you hoist the patch "KVM: x86/mmu: Require mmu_lock be held for write in unyielding
root iter" to be the first in the series, then you can instead add the lockdep
assertion to the iterator itself. Slotted in earlier in the series, it becomes...

From: Paolo Bonzini <[email protected]>
Date: Wed, 2 Mar 2022 09:25:30 -0800
Subject: [PATCH] KVM: x86/mmu: do not allow readers to acquire references to
invalid roots

Remove the "shared" argument of for_each_tdp_mmu_root_yield_safe, thus
ensuring that readers do not ever acquire a reference to an invalid root.
After this patch, all readers except kvm_tdp_mmu_zap_invalidated_roots()
treat refcount=0/valid, refcount=0/invalid and refcount=1/invalid in
exactly the same way. kvm_tdp_mmu_zap_invalidated_roots() is different
but it also does not acquire a reference to the invalid root, and it
cannot see refcount=0/invalid because it is guaranteed to run after
kvm_tdp_mmu_invalidate_all_roots().

Opportunistically add a lockdep assertion to the yield-safe iterator.

Signed-off-by: Paolo Bonzini <[email protected]>
---
arch/x86/kvm/mmu/tdp_mmu.c | 9 +++++----
1 file changed, 5 insertions(+), 4 deletions(-)

diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index 0cb834aa5406..46752093b79c 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -158,14 +158,15 @@ static struct kvm_mmu_page *tdp_mmu_next_root(struct kvm *kvm,
for (_root = tdp_mmu_next_root(_kvm, NULL, _shared, _only_valid); \
_root; \
_root = tdp_mmu_next_root(_kvm, _root, _shared, _only_valid)) \
- if (kvm_mmu_page_as_id(_root) != _as_id) { \
+ if (kvm_lockdep_assert_mmu_lock_held(_kvm, _shared) && \
+ kvm_mmu_page_as_id(_root) != _as_id) { \
} else

#define for_each_valid_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared) \
__for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared, true)

-#define for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared) \
- __for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared, false)
+#define for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id) \
+ __for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, false, false)

/*
* Iterate over all TDP MMU roots. Requires that mmu_lock be held for write,
@@ -800,7 +801,7 @@ bool __kvm_tdp_mmu_zap_gfn_range(struct kvm *kvm, int as_id, gfn_t start,
{
struct kvm_mmu_page *root;

- for_each_tdp_mmu_root_yield_safe(kvm, root, as_id, false)
+ for_each_tdp_mmu_root_yield_safe(kvm, root, as_id)
flush = zap_gfn_range(kvm, root, start, end, can_yield, flush,
false);


base-commit: 57352b7efee6b38eb9eb899b8f013652afe4e110
--

2022-03-02 23:58:40

by Paolo Bonzini

[permalink] [raw]
Subject: Re: [PATCH v3 26/28] KVM: selftests: Split out helper to allocate guest mem via memfd

On 3/1/22 00:36, David Woodhouse wrote:
> On Sat, 2022-02-26 at 00:15 +0000, Sean Christopherson wrote:
>> Extract the code for allocating guest memory via memfd out of
>> vm_userspace_mem_region_add() and into a new helper, kvm_memfd_alloc().
>> A future selftest to populate a guest with the maximum amount of guest
>> memory will abuse KVM's memslots to alias guest memory regions to a
>> single memfd-backed host region, i.e. needs to back a guest with memfd
>> memory without a 1:1 association between a memslot and a memfd instance.
>>
>> No functional change intended.
>>
>> Signed-off-by: Sean Christopherson <[email protected]>
>
> While we're at it, please can we make the whole thing go away and just
> return failure #ifndef MFD_CLOEXEC, instead of breaking the build on
> older userspace?

We can just use old school F_SETFD if that's helpful for you.

Paolo
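
A sketch of the F_SETFD alternative, purely illustrative; it only covers the missing-MFD_CLOEXEC case and still assumes memfd_create() itself is available:

    #define _GNU_SOURCE
    #include <fcntl.h>
    #include <sys/mman.h>
    #include <unistd.h>

    /* Not the code that went into kvm_memfd_alloc(). */
    static int demo_memfd_alloc(const char *name, size_t size)
    {
            int fd = memfd_create(name, 0);

            if (fd < 0)
                    return fd;

            /* Same effect as passing MFD_CLOEXEC at creation time. */
            if (fcntl(fd, F_SETFD, FD_CLOEXEC) || ftruncate(fd, size)) {
                    close(fd);
                    return -1;
            }
            return fd;
    }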

2022-03-03 00:00:24

by Paolo Bonzini

[permalink] [raw]
Subject: Re: [PATCH v3 20/28] KVM: x86/mmu: Allow yielding when zapping GFNs for defunct TDP MMU root

On 3/2/22 03:13, Sean Christopherson wrote:
> After typing up all of the below, I actually tried the novel idea of compiling
> the code... and we can't do xchg() on role.invalid because it occupies a single
> bit, it's not a standalone boolean.

Yeah I thought the same right after sending the email, but I'll just say it
was pseudocode. :) We can do

static inline bool kvm_tdp_root_mark_invalid(struct kvm_mmu_page *page)
{
union kvm_mmu_page_role role = page->role;
role.invalid = true;

/* No need to use cmpxchg, only the invalid bit can change. */
role.word = xchg(&page->role.word, role.word);
return role.invalid;
}

Either way, barriers or xchg, it needs to be a separate function.

> by using refcount_dec_not_one() above, there's no guarantee that this
> task is the last one to see it as kvm_tdp_mmu_get_root() can succeed
> and bump the refcount between refcount_dec_not_one() and here.
Yep, I agree refcount_dec_and_test is needed.

>> Based on my own version, I guess you mean (without comments due to family
>> NMI):
>>
>> if (!refcount_dec_and_test(&root->tdp_mmu_root_count))
>> return;
>>
>> if (!xchg(&root->role.invalid, true) {
>> refcount_set(&root->tdp_mmu_root_count, 1);
>> tdp_mmu_zap_root(kvm, root, shared);
>> if (!refcount_dec_and_test(&root->tdp_mmu_root_count))
>> return;
>> }
>>
>> spin_lock(&kvm->arch.tdp_mmu_pages_lock);
>> list_del_rcu(&root->link);
>> spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
>> call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
>
> That would work, though I'd prefer to recurse on kvm_tdp_mmu_put_root() instead
> of open coding refcount_dec_and_test() so that we get coverage of the xchg()
> doing the right thing.
>
> I still slightly prefer having the "free" path be inside the xchg(). To me, even
> though the "free" path is the only one that's guaranteed to be reached for every root,
> the fall-through to resetting the refcount and zapping the root is the "normal" path,
> and the "free" path is the exception.

Hmm, I can see how that especially makes sense once you add in the worker logic.
But it seems to me that the "basic" logic should be "do both the xchg and the
free", and coding the function with tail recursion obfuscates this. Even with
the worker, you grow an

+ if (kvm_get_kvm_safe(kvm)) {
+ ... let the worker do it ...
+ return;
+ }
+
tdp_mmu_zap_root(kvm, root, shared);

but you still have a downwards flow that matches what happens even if multiple
threads pick up different parts of the job.

So, I tried a bunch of alternatives including with gotos and with if/else, but
really the above one remains my favorite.

My plan would be:

1) splice the mini series I'm attaching before this patch, and
remove patch 1 of this series. next_invalidated_root() is a
bit yucky, but notably it doesn't need to add/remove a reference
in kvm_tdp_mmu_zap_invalidated_roots().

Together, these two steps ensure that readers never acquire a
reference to either refcount=0/valid or invalid pages. In other
words, the three states that kvm_tdp_mmu_put_root() moves the root
through (refcount=0/valid -> refcount=0/invalid -> refcount=1/invalid)
are exactly the same to readers, and there are essentially no races
to worry about.

In other other words, it's trading slightly uglier code for simpler
invariants.

2) here, replace the change to kvm_tdp_mmu_put_root with the following:

diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
index b6ffa91fb9d7..aa0669f54d96 100644
--- a/arch/x86/kvm/mmu/tdp_mmu.c
+++ b/arch/x86/kvm/mmu/tdp_mmu.c
@@ -81,6 +81,16 @@ static void tdp_mmu_free_sp_rcu_callback(struct rcu_head *head)
static void tdp_mmu_zap_root(struct kvm *kvm, struct kvm_mmu_page *root,
bool shared);

+static inline bool kvm_tdp_root_mark_invalid(struct kvm_mmu_page *page)
+{
+ union kvm_mmu_page_role role = page->role;
+ role.invalid = true;
+
+ /* No need to use cmpxchg, only the invalid bit can change. */
+ role.word = xchg(&page->role.word, role.word);
+ return role.invalid;
+}
+
void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
bool shared)
{
@@ -91,20 +101,44 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,

WARN_ON(!root->tdp_mmu_page);

+ /*
+ * The root now has refcount=0 and is valid. Readers cannot acquire
+ * a reference to it (they all visit valid roots only, except for
+ * kvm_tdp_mmu_zap_invalidated_roots() which however does not acquire
+ * any reference itself.
+ *
+ * Even though there are flows that need to visit all roots for
+ * correctness, they all take mmu_lock for write, so they cannot yet
+ * run concurrently. The same is true after kvm_tdp_root_mark_invalid,
+ * since the root still has refcount=0.
+ *
+ * However, tdp_mmu_zap_root can yield, and writers do not expect to
+ * see refcount=0 (see for example kvm_tdp_mmu_invalidate_all_roots()).
+ * So the root temporarily gets an extra reference, going to refcount=1
+ * while staying invalid. Readers still cannot acquire any reference;
+ * but writers are now allowed to run if tdp_mmu_zap_root yields and
+ * they might take an extra reference if they themselves yield. Therefore,
+ * when the reference is given back after tdp_mmu_zap_root terminates,
+ * there is no guarantee that the refcount is still 1. If not, whoever
+ * puts the last reference will free the page, but they will not have to
+ * zap the root because a root cannot go from invalid to valid.
+ */
+ if (!kvm_tdp_root_mark_invalid(root)) {
+ refcount_set(&root->tdp_mmu_root_count, 1);
+ tdp_mmu_zap_root(kvm, root, shared);
+
+ /*
+ * Give back the reference that was added back above. We now
+ * know that the root is invalid, so go ahead and free it if
+ * no one has taken a reference in the meanwhile.
+ */
+ if (!refcount_dec_and_test(&root->tdp_mmu_root_count))
+ return;
+ }
+
spin_lock(&kvm->arch.tdp_mmu_pages_lock);
list_del_rcu(&root->link);
spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
-
- /*
- * A TLB flush is not necessary as KVM performs a local TLB flush when
- * allocating a new root (see kvm_mmu_load()), and when migrating vCPU
- * to a different pCPU. Note, the local TLB flush on reuse also
- * invalidates any paging-structure-cache entries, i.e. TLB entries for
- * intermediate paging structures, that may be zapped, as such entries
- * are associated with the ASID on both VMX and SVM.
- */
- tdp_mmu_zap_root(kvm, root, shared);
-
call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
}


3) for the worker patch, the idea would be

+static void tdp_mmu_zap_root_work(struct work_struct *work)
+{
+ ...
+}
+
+
+static void tdp_mmu_schedule_zap_root(struct kvm *kvm, struct kvm_mmu_page *root)
+{
+ root->tdp_mmu_async_data = kvm;
+ INIT_WORK(&root->tdp_mmu_async_work, tdp_mmu_zap_root_work);
+ schedule_work(&root->tdp_mmu_async_work);
+}
+
static inline bool kvm_tdp_root_mark_invalid(struct kvm_mmu_page *page)
{
union kvm_mmu_page_role role = page->role;
@@ -125,13 +165,24 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
*/
if (!kvm_tdp_root_mark_invalid(root)) {
refcount_set(&root->tdp_mmu_root_count, 1);
- tdp_mmu_zap_root(kvm, root, shared);

/*
- * Give back the reference that was added back above. We now
+ * If the struct kvm is alive, we might as well zap the root
+ * in a worker. The worker takes ownership of the reference we
+ * have just added to root as well as the new reference to kvm.
+ */
+ if (kvm_get_kvm_safe(kvm)) {
+ tdp_mmu_schedule_zap_root(kvm, root);
+ return;
+ }
+
+ /*
+ * The struct kvm is being destroyed, zap synchronously and give
+ * back immediately the reference that was added above. We now
* know that the root is invalid, so go ahead and free it if
* no one has taken a reference in the meanwhile.
*/
+ tdp_mmu_zap_root(kvm, root, shared);
if (!refcount_dec_and_test(&root->tdp_mmu_root_count))
return;
}


Again, I appreciate the idea behind the recursive call, but I think
overall it's clearer to have a clear flow from the beginning to the
end of the function, with the exceptions and optimizations noted as
early returns.

Let me know what you think. Tomorrow I have a day off, but later
today I will have my changes tested and pushed to kvm/queue for you
to look at.

Thanks,

Paolo


Attachments:
readers.patch (3.68 kB)

2022-03-03 00:10:23

by Sean Christopherson

[permalink] [raw]
Subject: Re: [PATCH v3 22/28] KVM: x86/mmu: Zap defunct roots via asynchronous worker

On Wed, Mar 02, 2022, Paolo Bonzini wrote:
> On 2/26/22 01:15, Sean Christopherson wrote:
> > Zap defunct roots, a.k.a. roots that have been invalidated after their
> > last reference was initially dropped, asynchronously via the system work
> > queue instead of forcing the work upon the unfortunate task that happened
> > to drop the last reference.
> >
> > If a vCPU task drops the last reference, the vCPU is effectively blocked
> > by the host for the entire duration of the zap. If the root being zapped
> > happens to be fully populated with 4kb leaf SPTEs, e.g. due to dirty logging
> > being active, the zap can take several hundred seconds. Unsurprisingly,
> > most guests are unhappy if a vCPU disappears for hundreds of seconds.
> >
> > E.g. running a synthetic selftest that triggers a vCPU root zap with
> > ~64tb of guest memory and 4kb SPTEs blocks the vCPU for 900+ seconds.
> > Offloading the zap to a worker drops the block time to <100ms.
> >
> > Signed-off-by: Sean Christopherson <[email protected]>
> > ---
>
> Do we even need kvm_tdp_mmu_zap_invalidated_roots() now? That is,
> something like the following:

Nice! I initially did something similar (moving invalidated roots to a separate
list), but never circled back to the idea after implementing the worker stuff.

> diff --git a/arch/x86/kvm/mmu/mmu.c b/arch/x86/kvm/mmu/mmu.c
> index bd3625a875ef..5fd8bc858c6f 100644
> --- a/arch/x86/kvm/mmu/mmu.c
> +++ b/arch/x86/kvm/mmu/mmu.c
> @@ -5698,6 +5698,16 @@ static void kvm_mmu_zap_all_fast(struct kvm *kvm)
> {
> lockdep_assert_held(&kvm->slots_lock);
> + /*
> + * kvm_tdp_mmu_invalidate_all_roots() needs a nonzero reference
> + * count. If we're dying, zap everything as it's going to happen
> + * soon anyway.
> + */
> + if (!refcount_read(&kvm->users_count)) {
> + kvm_mmu_zap_all(kvm);
> + return;
> + }

I'd prefer we make this an assertion and shove this logic into set_nx_huge_pages(),
because in that case there's no need to zap anything; the guest can never run
again. E.g. (I'm trying to remember why I didn't do this before...)

diff --git a/arch/x86/kvm/mmu/mmu.c b/arch/x86/kvm/mmu/mmu.c
index b2c1c4eb6007..d4d25ab88ae7 100644
--- a/arch/x86/kvm/mmu/mmu.c
+++ b/arch/x86/kvm/mmu/mmu.c
@@ -6132,7 +6132,8 @@ static int set_nx_huge_pages(const char *val, const struct kernel_param *kp)

list_for_each_entry(kvm, &vm_list, vm_list) {
mutex_lock(&kvm->slots_lock);
- kvm_mmu_zap_all_fast(kvm);
+ if (refcount_read(&kvm->users_count))
+ kvm_mmu_zap_all_fast(kvm);
mutex_unlock(&kvm->slots_lock);

wake_up_process(kvm->arch.nx_lpage_recovery_thread);


> +
> write_lock(&kvm->mmu_lock);
> trace_kvm_mmu_zap_all_fast(kvm);
> @@ -5732,20 +5742,6 @@ static void kvm_mmu_zap_all_fast(struct kvm *kvm)
> kvm_zap_obsolete_pages(kvm);
> write_unlock(&kvm->mmu_lock);
> -
> - /*
> - * Zap the invalidated TDP MMU roots, all SPTEs must be dropped before
> - * returning to the caller, e.g. if the zap is in response to a memslot
> - * deletion, mmu_notifier callbacks will be unable to reach the SPTEs
> - * associated with the deleted memslot once the update completes, and
> - * Deferring the zap until the final reference to the root is put would
> - * lead to use-after-free.
> - */
> - if (is_tdp_mmu_enabled(kvm)) {
> - read_lock(&kvm->mmu_lock);
> - kvm_tdp_mmu_zap_invalidated_roots(kvm);
> - read_unlock(&kvm->mmu_lock);
> - }
> }
> static bool kvm_has_zapped_obsolete_pages(struct kvm *kvm)
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index cd1bf68e7511..af9db5b8f713 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -142,10 +142,12 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
> WARN_ON(!root->tdp_mmu_page);
> /*
> - * The root now has refcount=0 and is valid. Readers cannot acquire
> - * a reference to it (they all visit valid roots only, except for
> - * kvm_tdp_mmu_zap_invalidated_roots() which however does not acquire
> - * any reference itself.
> + * The root now has refcount=0. It is valid, but readers already
> + * cannot acquire a reference to it because kvm_tdp_mmu_get_root()
> + * rejects it. This remains true for the rest of the execution
> + * of this function, because readers visit valid roots only

One thing that keeps tripping me up is the "readers" verbiage. I get confused
because taking mmu_lock for read vs. write doesn't really have anything to do with
reading or writing state, e.g. "readers" still write SPTEs, and so I keep thinking
"readers" means anything iterating over the set of roots. Not sure if there's a
shorthand that won't be confusing.

> + * (except for tdp_mmu_zap_root_work(), which however operates only
> + * on one specific root and does not acquire any reference itself).
>
> *
> * Even though there are flows that need to visit all roots for
> * correctness, they all take mmu_lock for write, so they cannot yet

...

> It passes a smoke test, and also resolves the debate on the fate of patch 1.

+1000, I love this approach. Do you want me to work on a v3, or shall I let you
have the honors?

2022-03-03 00:13:26

by Mingwei Zhang

[permalink] [raw]
Subject: Re: [PATCH v3 02/28] KVM: x86/mmu: Check for present SPTE when clearing dirty bit in TDP MMU

On Sat, Feb 26, 2022, Sean Christopherson wrote:
> Explicitly check for present SPTEs when clearing dirty bits in the TDP
> MMU. This isn't strictly required for correctness, as setting the dirty
> bit in a defunct SPTE will not change the SPTE from !PRESENT to PRESENT.
> However, the guarded MMU_WARN_ON() in spte_ad_need_write_protect() would
> complain if anyone actually turned on KVM's MMU debugging.
>
> Fixes: a6a0b05da9f3 ("kvm: x86/mmu: Support dirty logging for the TDP MMU")
> Cc: Ben Gardon <[email protected]>
> Signed-off-by: Sean Christopherson <[email protected]>
> Reviewed-by: Ben Gardon <[email protected]>
Reviewed-by: Mingwei Zhang <[email protected]>
> ---
> arch/x86/kvm/mmu/tdp_mmu.c | 3 +++
> 1 file changed, 3 insertions(+)
>
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index 25148e8b711d..9357780ec28f 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -1446,6 +1446,9 @@ static bool clear_dirty_gfn_range(struct kvm *kvm, struct kvm_mmu_page *root,
> if (tdp_mmu_iter_cond_resched(kvm, &iter, false, true))
> continue;
>
> + if (!is_shadow_present_pte(iter.old_spte))
> + continue;
> +
> if (spte_ad_need_write_protect(iter.old_spte)) {
> if (is_writable_pte(iter.old_spte))
> new_spte = iter.old_spte & ~PT_WRITABLE_MASK;
> --
> 2.35.1.574.g5d30c73bfb-goog
>

2022-03-03 00:17:38

by Mingwei Zhang

[permalink] [raw]
Subject: Re: [PATCH v3 01/28] KVM: x86/mmu: Use common iterator for walking invalid TDP MMU roots

On Sat, Feb 26, 2022, Sean Christopherson wrote:
> Now that tdp_mmu_next_root() can process both valid and invalid roots,
> extend it to be able to process _only_ invalid roots, add yet another
> iterator macro for walking invalid roots, and use the new macro in
> kvm_tdp_mmu_zap_invalidated_roots().
>
> No functional change intended.
>
> Reviewed-by: David Matlack <[email protected]>
> Signed-off-by: Sean Christopherson <[email protected]>
> ---
> arch/x86/kvm/mmu/tdp_mmu.c | 74 ++++++++++++++------------------------
> 1 file changed, 26 insertions(+), 48 deletions(-)
>
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index debf08212f12..25148e8b711d 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -98,6 +98,12 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
> call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
> }
>
> +enum tdp_mmu_roots_iter_type {
> + ALL_ROOTS = -1,
> + VALID_ROOTS = 0,
> + INVALID_ROOTS = 1,
> +};

I am wondering what the trick is to start from -1?
> +
> /*
> * Returns the next root after @prev_root (or the first root if @prev_root is
> * NULL). A reference to the returned root is acquired, and the reference to
> @@ -110,10 +116,16 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
> */
> static struct kvm_mmu_page *tdp_mmu_next_root(struct kvm *kvm,
> struct kvm_mmu_page *prev_root,
> - bool shared, bool only_valid)
> + bool shared,
> + enum tdp_mmu_roots_iter_type type)
> {
> struct kvm_mmu_page *next_root;
>
> + kvm_lockdep_assert_mmu_lock_held(kvm, shared);
> +
> + /* Ensure correctness for the below comparison against role.invalid. */
> + BUILD_BUG_ON(!!VALID_ROOTS || !INVALID_ROOTS);
> +
> rcu_read_lock();
>
> if (prev_root)
> @@ -125,7 +137,7 @@ static struct kvm_mmu_page *tdp_mmu_next_root(struct kvm *kvm,
> typeof(*next_root), link);
>
> while (next_root) {
> - if ((!only_valid || !next_root->role.invalid) &&
> + if ((type == ALL_ROOTS || (type == !!next_root->role.invalid)) &&
> kvm_tdp_mmu_get_root(next_root))
> break;
>
> @@ -151,18 +163,21 @@ static struct kvm_mmu_page *tdp_mmu_next_root(struct kvm *kvm,
> * mode. In the unlikely event that this thread must free a root, the lock
> * will be temporarily dropped and reacquired in write mode.
> */
> -#define __for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared, _only_valid)\
> - for (_root = tdp_mmu_next_root(_kvm, NULL, _shared, _only_valid); \
> +#define __for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared, _type) \
> + for (_root = tdp_mmu_next_root(_kvm, NULL, _shared, _type); \
> _root; \
> - _root = tdp_mmu_next_root(_kvm, _root, _shared, _only_valid)) \
> - if (kvm_mmu_page_as_id(_root) != _as_id) { \
> + _root = tdp_mmu_next_root(_kvm, _root, _shared, _type)) \
> + if (_as_id > 0 && kvm_mmu_page_as_id(_root) != _as_id) { \
> } else
>
> +#define for_each_invalid_tdp_mmu_root_yield_safe(_kvm, _root) \
> + __for_each_tdp_mmu_root_yield_safe(_kvm, _root, -1, true, INVALID_ROOTS)
> +
> #define for_each_valid_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared) \
> - __for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared, true)
> + __for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared, VALID_ROOTS)
>
> #define for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared) \
> - __for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared, false)
> + __for_each_tdp_mmu_root_yield_safe(_kvm, _root, _as_id, _shared, ALL_ROOTS)
>
> #define for_each_tdp_mmu_root(_kvm, _root, _as_id) \
> list_for_each_entry_rcu(_root, &_kvm->arch.tdp_mmu_roots, link, \
> @@ -810,28 +825,6 @@ void kvm_tdp_mmu_zap_all(struct kvm *kvm)
> kvm_flush_remote_tlbs(kvm);
> }
>
> -static struct kvm_mmu_page *next_invalidated_root(struct kvm *kvm,
> - struct kvm_mmu_page *prev_root)
> -{
> - struct kvm_mmu_page *next_root;
> -
> - if (prev_root)
> - next_root = list_next_or_null_rcu(&kvm->arch.tdp_mmu_roots,
> - &prev_root->link,
> - typeof(*prev_root), link);
> - else
> - next_root = list_first_or_null_rcu(&kvm->arch.tdp_mmu_roots,
> - typeof(*next_root), link);
> -
> - while (next_root && !(next_root->role.invalid &&
> - refcount_read(&next_root->tdp_mmu_root_count)))
> - next_root = list_next_or_null_rcu(&kvm->arch.tdp_mmu_roots,
> - &next_root->link,
> - typeof(*next_root), link);
> -
> - return next_root;
> -}
> -
> /*
> * Since kvm_tdp_mmu_zap_all_fast has acquired a reference to each
> * invalidated root, they will not be freed until this function drops the
> @@ -842,36 +835,21 @@ static struct kvm_mmu_page *next_invalidated_root(struct kvm *kvm,
> */
> void kvm_tdp_mmu_zap_invalidated_roots(struct kvm *kvm)
> {
> - struct kvm_mmu_page *next_root;
> struct kvm_mmu_page *root;
> bool flush = false;
>
> lockdep_assert_held_read(&kvm->mmu_lock);
>
> - rcu_read_lock();
> -
> - root = next_invalidated_root(kvm, NULL);
> -
> - while (root) {
> - next_root = next_invalidated_root(kvm, root);
> -
> - rcu_read_unlock();
> -
> + for_each_invalid_tdp_mmu_root_yield_safe(kvm, root) {
> flush = zap_gfn_range(kvm, root, 0, -1ull, true, flush, true);
>
> /*
> - * Put the reference acquired in
> - * kvm_tdp_mmu_invalidate_roots
> + * Put the reference acquired in kvm_tdp_mmu_invalidate_roots().
> + * Note, the iterator holds its own reference.
> */
> kvm_tdp_mmu_put_root(kvm, root, true);
> -
> - root = next_root;
> -
> - rcu_read_lock();
> }
>
> - rcu_read_unlock();
> -
> if (flush)
> kvm_flush_remote_tlbs(kvm);
> }
> --
> 2.35.1.574.g5d30c73bfb-goog
>

2022-03-03 00:19:14

by Sean Christopherson

[permalink] [raw]
Subject: Re: [PATCH v3 22/28] KVM: x86/mmu: Zap defunct roots via asynchronous worker

On Wed, Mar 02, 2022, Paolo Bonzini wrote:
> On 3/2/22 20:33, Sean Christopherson wrote:
> > What about that idea? Put roots invalidated by "fast zap" on _another_ list?
> > My very original idea of moving the roots to a separate list didn't work because
> > the roots needed to be reachable by the mmu_notifier. But we could just add
> > another list_head (inside the unsync_child_bitmap union) and add the roots to
> > _that_ list.
>
> Perhaps the "separate list" idea could be extended to have a single worker
> for all kvm_tdp_mmu_put_root() work, and then indeed replace
> kvm_tdp_mmu_zap_invalidated_roots() with a flush of _that_ worker. The
> disadvantage is a little less parallelism in zapping invalidated roots; but
> what is good for kvm_tdp_mmu_zap_invalidated_roots() is just as good for
> kvm_tdp_mmu_put_root(), I suppose. If one wants separate work items, KVM
> could have its own workqueue, and then you flush that workqueue.
>
> For now let's do it the simple but ugly way. Keeping
> next_invalidated_root() does not make things worse than the status quo, and
> further work will be easier to review if it's kept separate from this
> already-complex work.

Oof, that's not gonna work. My approach here in v3 doesn't work either. I finally
remembered why I had the dedicated tdp_mmu_defunct_root flag and thus the smp_mb_*()
dance.

kvm_tdp_mmu_zap_invalidated_roots() assumes that it was gifted a reference to
_all_ invalid roots by kvm_tdp_mmu_invalidate_all_roots(). This works in the
current code base only because kvm->slots_lock is held for the entire duration,
i.e. roots can't become invalid between the end of kvm_tdp_mmu_invalidate_all_roots()
and the end of kvm_tdp_mmu_zap_invalidated_roots().

Marking a root invalid in kvm_tdp_mmu_put_root() breaks that assumption, e.g. if a
new root is created and then dropped, it will be marked invalid but the "fast zap"
will not have a reference. The "defunct" flag prevents this scenario by allowing
the "fast zap" path to identify invalid roots for which it did not take a reference.
By virtue of holding a reference, "fast zap" also guarantees that the roots it needs
to invalidate and put can't become defunct.
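
Spelled out as a sequence, purely to illustrate the broken assumption:

  1. kvm_tdp_mmu_invalidate_all_roots() marks the existing roots invalid and
     takes a reference to each of them.
  2. A vCPU allocates a new root, then drops the last reference to it;
     kvm_tdp_mmu_put_root() marks that root invalid too and temporarily bumps
     its refcount while zapping it.
  3. kvm_tdp_mmu_zap_invalidated_roots() walks _all_ invalid roots and puts a
     reference on each, including the root from step 2 for which it never took
     one, so its puts are no longer balanced against its gets.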

My preference would be to either go back to a variant of v2, or to implement my
"second list" idea.

I also need to figure out why I didn't encounter errors in v3, because I distinctly
remember underflowing the refcount before adding the defunct flag...

2022-03-03 00:31:44

by Sean Christopherson

[permalink] [raw]
Subject: Re: [PATCH v3 22/28] KVM: x86/mmu: Zap defunct roots via asynchronous worker

On Wed, Mar 02, 2022, Paolo Bonzini wrote:
> On 3/2/22 21:47, Sean Christopherson wrote:
> > On Wed, Mar 02, 2022, Paolo Bonzini wrote:
> > > For now let's do it the simple but ugly way. Keeping
> > > next_invalidated_root() does not make things worse than the status quo, and
> > > further work will be easier to review if it's kept separate from this
> > > already-complex work.
> >
> > Oof, that's not gonna work. My approach here in v3 doesn't work either. I finally
> > remembered why I had the dedicated tdp_mmu_defunct_root flag and thus the smp_mb_*()
> > dance.
> >
> > kvm_tdp_mmu_zap_invalidated_roots() assumes that it was gifted a reference to
> > _all_ invalid roots by kvm_tdp_mmu_invalidate_all_roots(). This works in the
> > current code base only because kvm->slots_lock is held for the entire duration,
> > i.e. roots can't become invalid between the end of kvm_tdp_mmu_invalidate_all_roots()
> > and the end of kvm_tdp_mmu_zap_invalidated_roots().
>
> Yeah, of course that doesn't work if kvm_tdp_mmu_zap_invalidated_roots()
> calls kvm_tdp_mmu_put_root() and the worker also does the same
> kvm_tdp_mmu_put_root().
>
> But, it seems to me that we were so close to something that works and is
> elegant with the worker idea. It does avoid the possibility of two "puts",
> because the work item is created on the valid->invalid transition. What do
> you think of having a separate workqueue for each struct kvm, so that
> kvm_tdp_mmu_zap_invalidated_roots() can be replaced with a flush?

I definitely like the idea, but I'm getting another feeling of deja vu. Ah, I
think the mess I created was zapping via async worker without a dedicated workqueue,
and so the flush became very annoying/painful.

I have the "dedicated list" idea coded up. If testing looks good, I'll post it as
a v3.5 (without your xchg() magic or other kvm_tdp_mmu_put_root() changes). That
way we have a less-awful backup (and/or an intermediate step) if the workqueue
idea is delayed or doesn't work. Assuming it works, it's much prettier than having
a defunct flag.

> I can probably do it next Friday.

Early-ish warning, I'll be offline March 11th - March 23rd inclusive.

FWIW, other than saving me from another painful rebase, there's no urgent need to
get this series into 5.18.

2022-03-03 00:43:03

by Mingwei Zhang

[permalink] [raw]
Subject: Re: [PATCH v3 04/28] KVM: x86/mmu: Formalize TDP MMU's (unintended?) deferred TLB flush logic

On Sat, Feb 26, 2022, Sean Christopherson wrote:
> Explicitly ignore the result of zap_gfn_range() when putting the last
> reference to a TDP MMU root, and add a pile of comments to formalize the
> TDP MMU's behavior of deferring TLB flushes to alloc/reuse. Note, this
> only affects the !shared case, as zap_gfn_range() subtly never returns
> true for "flush" as the flush is handled by tdp_mmu_zap_spte_atomic().
>
> Putting the root without a flush is ok because even if there are stale
> references to the root in the TLB, they are unreachable because KVM will
> not run the guest with the same ASID without first flushing (where ASID
> in this context refers to both SVM's explicit ASID and Intel's implicit
> ASID that is constructed from VPID+PCID+EPT4A+etc...).
>
> Signed-off-by: Sean Christopherson <[email protected]>
Reviewed-by: Mingwei Zhang <[email protected]>
> ---
> arch/x86/kvm/mmu/mmu.c | 8 ++++++++
> arch/x86/kvm/mmu/tdp_mmu.c | 10 +++++++++-
> 2 files changed, 17 insertions(+), 1 deletion(-)
>
> diff --git a/arch/x86/kvm/mmu/mmu.c b/arch/x86/kvm/mmu/mmu.c
> index 80607513a1f2..5a931c89d27b 100644
> --- a/arch/x86/kvm/mmu/mmu.c
> +++ b/arch/x86/kvm/mmu/mmu.c
> @@ -5069,6 +5069,14 @@ int kvm_mmu_load(struct kvm_vcpu *vcpu)
> kvm_mmu_sync_roots(vcpu);
>
> kvm_mmu_load_pgd(vcpu);
> +
> + /*
> + * Flush any TLB entries for the new root, the provenance of the root
> + * is unknown. Even if KVM ensures there are no stale TLB
> + * entries for a freed root, in theory, an out-of-tree hypervisor could
> + * have left stale entries. Flushing on alloc also allows KVM to skip
> + * the TLB flush when freeing a root (see kvm_tdp_mmu_put_root()).
> + */
> static_call(kvm_x86_flush_tlb_current)(vcpu);
> out:
> return r;
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index 12866113fb4f..e35bd88d92fd 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -93,7 +93,15 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
> list_del_rcu(&root->link);
> spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
>
> - zap_gfn_range(kvm, root, 0, -1ull, false, false, shared);
> + /*
> + * A TLB flush is not necessary as KVM performs a local TLB flush when
> + * allocating a new root (see kvm_mmu_load()), and when migrating vCPU
> + * to a different pCPU. Note, the local TLB flush on reuse also
> + * invalidates any paging-structure-cache entries, i.e. TLB entries for
> + * intermediate paging structures, that may be zapped, as such entries
> + * are associated with the ASID on both VMX and SVM.
> + */
> + (void)zap_gfn_range(kvm, root, 0, -1ull, false, false, shared);

Understood that we could avoid the TLB flush here. Just curious why the
"(void)" is needed here? Is it for compile time reason?
>
> call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
> }
> --
> 2.35.1.574.g5d30c73bfb-goog
>

2022-03-03 00:49:59

by Sean Christopherson

[permalink] [raw]
Subject: Re: [PATCH v3 04/28] KVM: x86/mmu: Formalize TDP MMU's (unintended?) deferred TLB flush logic

On Wed, Mar 02, 2022, Mingwei Zhang wrote:
> On Sat, Feb 26, 2022, Sean Christopherson wrote:
> > diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> > index 12866113fb4f..e35bd88d92fd 100644
> > --- a/arch/x86/kvm/mmu/tdp_mmu.c
> > +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> > @@ -93,7 +93,15 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
> > list_del_rcu(&root->link);
> > spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
> >
> > - zap_gfn_range(kvm, root, 0, -1ull, false, false, shared);
> > + /*
> > + * A TLB flush is not necessary as KVM performs a local TLB flush when
> > + * allocating a new root (see kvm_mmu_load()), and when migrating vCPU
> > + * to a different pCPU. Note, the local TLB flush on reuse also
> > + * invalidates any paging-structure-cache entries, i.e. TLB entries for
> > + * intermediate paging structures, that may be zapped, as such entries
> > + * are associated with the ASID on both VMX and SVM.
> > + */
> > + (void)zap_gfn_range(kvm, root, 0, -1ull, false, false, shared);
>
> Understood that we could avoid the TLB flush here. Just curious why the
> "(void)" is needed here? Is it for compile time reason?

Nope, no functional purpose, though there might be some "advanced" warning or
static checkers that care.

The "(void)" is to communicate to human readers that the result is intentionally
ignored, e.g. to reduce the probability of someone "fixing" the code by acting on
the result of zap_gfn_range(). The comment should suffice, but it's nice to have
the code be self-documenting as much as possible.

2022-03-03 01:22:20

by Mingwei Zhang

[permalink] [raw]
Subject: Re: [PATCH v3 01/28] KVM: x86/mmu: Use common iterator for walking invalid TDP MMU roots

On Wed, Mar 02, 2022, Sean Christopherson wrote:
> On Wed, Mar 02, 2022, Mingwei Zhang wrote:
> > On Sat, Feb 26, 2022, Sean Christopherson wrote:
> > > Now that tdp_mmu_next_root() can process both valid and invalid roots,
> > > extend it to be able to process _only_ invalid roots, add yet another
> > > iterator macro for walking invalid roots, and use the new macro in
> > > kvm_tdp_mmu_zap_invalidated_roots().
> > >
> > > No functional change intended.
> > >
> > > Reviewed-by: David Matlack <[email protected]>
> > > Signed-off-by: Sean Christopherson <[email protected]>
> > > ---
> > > arch/x86/kvm/mmu/tdp_mmu.c | 74 ++++++++++++++------------------------
> > > 1 file changed, 26 insertions(+), 48 deletions(-)
> > >
> > > diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> > > index debf08212f12..25148e8b711d 100644
> > > --- a/arch/x86/kvm/mmu/tdp_mmu.c
> > > +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> > > @@ -98,6 +98,12 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
> > > call_rcu(&root->rcu_head, tdp_mmu_free_sp_rcu_callback);
> > > }
> > >
> > > +enum tdp_mmu_roots_iter_type {
> > > + ALL_ROOTS = -1,
> > > + VALID_ROOTS = 0,
> > > + INVALID_ROOTS = 1,
> > > +};
> >
> > I am wondering what the trick is to start from -1?
>
> -1 is arbitrary, any non-zero value would work. More below.
>
> > > /*
> > > * Returns the next root after @prev_root (or the first root if @prev_root is
> > > * NULL). A reference to the returned root is acquired, and the reference to
> > > @@ -110,10 +116,16 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
> > > */
> > > static struct kvm_mmu_page *tdp_mmu_next_root(struct kvm *kvm,
> > > struct kvm_mmu_page *prev_root,
> > > - bool shared, bool only_valid)
> > > + bool shared,
> > > + enum tdp_mmu_roots_iter_type type)
> > > {
> > > struct kvm_mmu_page *next_root;
> > >
> > > + kvm_lockdep_assert_mmu_lock_held(kvm, shared);
> > > +
> > > + /* Ensure correctness for the below comparison against role.invalid. */
> > > + BUILD_BUG_ON(!!VALID_ROOTS || !INVALID_ROOTS);
> > > +
> > > rcu_read_lock();
> > >
> > > if (prev_root)
> > > @@ -125,7 +137,7 @@ static struct kvm_mmu_page *tdp_mmu_next_root(struct kvm *kvm,
> > > typeof(*next_root), link);
> > >
> > > while (next_root) {
> > > - if ((!only_valid || !next_root->role.invalid) &&
> > > + if ((type == ALL_ROOTS || (type == !!next_root->role.invalid)) &&
>
> This is the code that deals with the enums. It's making the type a tri-state,
> where the values of VALID_ROOTS and INVALID_ROOTS align with converting role.invalid
> to a boolean (always '0' or '1') so that they can be directly compared as above.
>
> Any value for ALL_ROOTS (other than '0' or '1' obviously) would work since the
> above logic requires ALL_ROOTS to be explicitly checked first.
>
Yeah, I see that. The other thing that feels strange to me is that VALID_ROOTS
is _0_ while INVALID_ROOTS is _1_. But when I see !!next_root->role.invalid,
that resolves my concern.
> > > kvm_tdp_mmu_get_root(next_root))
> > > break;
> > >
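
A compact way to read that tri-state check (illustration only; the helper name
is made up, the condition is lifted straight from the macro):

        /*
         * type == !!role.invalid:
         *   VALID_ROOTS   (0) matches roots with role.invalid == 0
         *   INVALID_ROOTS (1) matches roots with role.invalid == 1
         *   ALL_ROOTS    (-1) never equals !!role.invalid, so it must be
         *                     (and is) checked explicitly first.
         */
        static bool root_matches(struct kvm_mmu_page *root,
                                 enum tdp_mmu_roots_iter_type type)
        {
                return type == ALL_ROOTS || type == !!root->role.invalid;
        }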

2022-03-03 01:34:05

by Mingwei Zhang

[permalink] [raw]
Subject: Re: [PATCH v3 04/28] KVM: x86/mmu: Formalize TDP MMU's (unintended?) deferred TLB flush logic

On Thu, Mar 03, 2022, Sean Christopherson wrote:
> On Wed, Mar 02, 2022, Mingwei Zhang wrote:
> > On Sat, Feb 26, 2022, Sean Christopherson wrote:
> > > diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> > > index 12866113fb4f..e35bd88d92fd 100644
> > > --- a/arch/x86/kvm/mmu/tdp_mmu.c
> > > +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> > > @@ -93,7 +93,15 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
> > > list_del_rcu(&root->link);
> > > spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
> > >
> > > - zap_gfn_range(kvm, root, 0, -1ull, false, false, shared);
> > > + /*
> > > + * A TLB flush is not necessary as KVM performs a local TLB flush when
> > > + * allocating a new root (see kvm_mmu_load()), and when migrating vCPU
> > > + * to a different pCPU. Note, the local TLB flush on reuse also
> > > + * invalidates any paging-structure-cache entries, i.e. TLB entries for
> > > + * intermediate paging structures, that may be zapped, as such entries
> > > + * are associated with the ASID on both VMX and SVM.
> > > + */
> > > + (void)zap_gfn_range(kvm, root, 0, -1ull, false, false, shared);
> >
> > Understood that we could avoid the TLB flush here. Just curious why the
> > "(void)" is needed here? Is it for compile time reason?
>
> Nope, no functional purpose, though there might be some "advanced" warning or
> static checkers that care.
>
> The "(void)" is to communicate to human readers that the result is intentionally
> ignored, e.g. to reduce the probability of someone "fixing" the code by acting on
> the result of zap_gfn_range(). The comment should suffice, but it's nice to have
> the code be self-documenting as much as possible.

Right, I got the point. Thanks.

Coming back. It seems that I pretended to understand that we should
avoid the TLB flush without really knowing why.

I mean, leaving (part of the) stale TLB entries unflushed will still be
dangerous right? Or am I missing something that guarantees to flush the
local TLB before returning to the guest? For instance,
kvm_mmu_{re,}load()?

2022-03-03 01:41:34

by Sean Christopherson

[permalink] [raw]
Subject: Re: [PATCH v3 04/28] KVM: x86/mmu: Formalize TDP MMU's (unintended?) deferred TLB flush logic

On Thu, Mar 03, 2022, Mingwei Zhang wrote:
> On Thu, Mar 03, 2022, Sean Christopherson wrote:
> > On Wed, Mar 02, 2022, Mingwei Zhang wrote:
> > > On Sat, Feb 26, 2022, Sean Christopherson wrote:
> > > > diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> > > > index 12866113fb4f..e35bd88d92fd 100644
> > > > --- a/arch/x86/kvm/mmu/tdp_mmu.c
> > > > +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> > > > @@ -93,7 +93,15 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
> > > > list_del_rcu(&root->link);
> > > > spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
> > > >
> > > > - zap_gfn_range(kvm, root, 0, -1ull, false, false, shared);
> > > > + /*
> > > > + * A TLB flush is not necessary as KVM performs a local TLB flush when
> > > > + * allocating a new root (see kvm_mmu_load()), and when migrating vCPU
> > > > + * to a different pCPU. Note, the local TLB flush on reuse also
> > > > + * invalidates any paging-structure-cache entries, i.e. TLB entries for
> > > > + * intermediate paging structures, that may be zapped, as such entries
> > > > + * are associated with the ASID on both VMX and SVM.
> > > > + */
> > > > + (void)zap_gfn_range(kvm, root, 0, -1ull, false, false, shared);
> > >
> > > Understood that we could avoid the TLB flush here. Just curious why the
> > > "(void)" is needed here? Is it for compile time reason?
> >
> > Nope, no functional purpose, though there might be some "advanced" warning or
> > static checkers that care.
> >
> > The "(void)" is to communicate to human readers that the result is intentionally
> > ignored, e.g. to reduce the probability of someone "fixing" the code by acting on
> > the result of zap_gfn_range(). The comment should suffice, but it's nice to have
> > the code be self-documenting as much as possible.
>
> Right, I got the point. Thanks.
>
> Coming back. It seems that I pretended to understand that we should
> avoid the TLB flush without really knowing why.
>
> I mean, leaving (part of the) stale TLB entries unflushed will still be
> dangerous right? Or am I missing something that guarantees to flush the
> local TLB before returning to the guest? For instance,
> kvm_mmu_{re,}load()?

Heh, if SVM's ASID management wasn't a mess[*], it'd be totally fine. The idea,
and what EPT architectures mandates, is that each TDP root is associated with an
ASID. So even though there may be stale entries in the TLB for a root, because
that root is no longer used those stale entries are unreachable. And if KVM ever
happens to reallocate the same physical page for a root, that's ok because KVM must
be paranoid and flush that root (see code comment in this patch).

What we're missing on SVM is proper ASID handling. If KVM uses ASIDs the way AMD
intends them to be used, then this works as intended because each root is again
associated with a specific ASID, and KVM just needs to flush when (re)allocating
a root and when reusing an ASID (which it already handles).

[*] https://lore.kernel.org/all/Yh%[email protected]

2022-03-03 06:10:27

by Mingwei Zhang

[permalink] [raw]
Subject: Re: [PATCH v3 04/28] KVM: x86/mmu: Formalize TDP MMU's (unintended?) deferred TLB flush logic

On Thu, Mar 03, 2022, Sean Christopherson wrote:
> On Thu, Mar 03, 2022, Mingwei Zhang wrote:
> > On Thu, Mar 03, 2022, Sean Christopherson wrote:
> > > On Wed, Mar 02, 2022, Mingwei Zhang wrote:
> > > > On Sat, Feb 26, 2022, Sean Christopherson wrote:
> > > > > diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> > > > > index 12866113fb4f..e35bd88d92fd 100644
> > > > > --- a/arch/x86/kvm/mmu/tdp_mmu.c
> > > > > +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> > > > > @@ -93,7 +93,15 @@ void kvm_tdp_mmu_put_root(struct kvm *kvm, struct kvm_mmu_page *root,
> > > > > list_del_rcu(&root->link);
> > > > > spin_unlock(&kvm->arch.tdp_mmu_pages_lock);
> > > > >
> > > > > - zap_gfn_range(kvm, root, 0, -1ull, false, false, shared);
> > > > > + /*
> > > > > + * A TLB flush is not necessary as KVM performs a local TLB flush when
> > > > > + * allocating a new root (see kvm_mmu_load()), and when migrating vCPU
> > > > > + * to a different pCPU. Note, the local TLB flush on reuse also
> > > > > + * invalidates any paging-structure-cache entries, i.e. TLB entries for
> > > > > + * intermediate paging structures, that may be zapped, as such entries
> > > > > + * are associated with the ASID on both VMX and SVM.
> > > > > + */
> > > > > + (void)zap_gfn_range(kvm, root, 0, -1ull, false, false, shared);
> > > >
> > > > Understood that we could avoid the TLB flush here. Just curious why the
> > > > "(void)" is needed here? Is it for compile time reason?
> > >
> > > Nope, no functional purpose, though there might be some "advanced" warning or
> > > static checkers that care.
> > >
> > > The "(void)" is to communicate to human readers that the result is intentionally
> > > ignored, e.g. to reduce the probability of someone "fixing" the code by acting on
> > > the result of zap_gfn_range(). The comment should suffice, but it's nice to have
> > > the code be self-documenting as much as possible.
> >
> > Right, I got the point. Thanks.
> >
> > Coming back. It seems that I pretended to understand that we should
> > avoid the TLB flush without really knowing why.
> >
> > I mean, leaving (part of the) stale TLB entries unflushed will still be
> > dangerous right? Or am I missing something that guarantees to flush the
> > local TLB before returning to the guest? For instance,
> > kvm_mmu_{re,}load()?
>
> Heh, if SVM's ASID management wasn't a mess[*], it'd be totally fine. The idea,
> and what the EPT architecture mandates, is that each TDP root is associated with an
> ASID. So even though there may be stale entries in the TLB for a root, because
> that root is no longer used those stale entries are unreachable. And if KVM ever
> happens to reallocate the same physical page for a root, that's ok because KVM must
> be paranoid and flush that root (see code comment in this patch).
>
> What we're missing on SVM is proper ASID handling. If KVM uses ASIDs the way AMD
> intends them to be used, then this works as intended because each root is again
> associated with a specific ASID, and KVM just needs to flush when (re)allocating
> a root and when reusing an ASID (which it already handles).
>
> [*] https://lore.kernel.org/all/Yh%[email protected]

Oh, putting AMD issues aside for now.

I think I was too narrowly focused on the zapping logic previously. So,
I originally thought that anytime we want to zap, we have to do the following
things in strict order:

1) zap SPTEs.
2) flush TLBs.
3) flush cache (AMD SEV only).
4) deallocate shadow pages.

However, if you have already invalidated EPTP (pgd ptr), then step 2)
becomes optional, since those stale TLBs are no longer useable by the
guest due to the change of ASID.

Am I understanding the point correctly? So, for all invalidated roots,
the assumption is that we have already called "kvm_reload_remote_mmus()",
which basically updates the ASID.

2022-03-03 17:25:01

by Sean Christopherson

[permalink] [raw]
Subject: Re: [PATCH v3 04/28] KVM: x86/mmu: Formalize TDP MMU's (unintended?) deferred TLB flush logic

On Thu, Mar 03, 2022, Mingwei Zhang wrote:
> On Thu, Mar 03, 2022, Sean Christopherson wrote:
> > Heh, if SVM's ASID management wasn't a mess[*], it'd be totally fine. The idea,
> > and what the EPT architecture mandates, is that each TDP root is associated with an
> > ASID. So even though there may be stale entries in the TLB for a root, because
> > that root is no longer used those stale entries are unreachable. And if KVM ever
> > happens to reallocate the same physical page for a root, that's ok because KVM must
> > be paranoid and flush that root (see code comment in this patch).
> >
> > What we're missing on SVM is proper ASID handling. If KVM uses ASIDs the way AMD
> > intends them to be used, then this works as intended because each root is again
> > associated with a specific ASID, and KVM just needs to flush when (re)allocating
> > a root and when reusing an ASID (which it already handles).
> >
> > [*] https://lore.kernel.org/all/Yh%[email protected]
>
> Oh, putting AMD issues aside for now.
>
> I think I was too narrowly focused on the zapping logic previously. So,
> I originally thought that anytime we want to zap, we have to do the following
> things in strict order:
>
> 1) zap SPTEs.
> 2) flush TLBs.
> 3) flush cache (AMD SEV only).
> 4) deallocate shadow pages.

Not necessarily. 1-3 are actually all optional. E.g. for #1, if KVM somehow
knew that the host didn't care about A/D bits (no writeback needed, no LRU info
needed), then KVM could skip straight to freeing the shadow pages when destroying
a VM.

Flushing the TLB before freeing pages is optional because KVM only needs to ensure
the guest can no longer access the memory. E.g. at kvm_mmu_notifier_release(),
because KVM disallows KVM_RUN from a different mm, KVM knows that the guest will
never run again and so can skip the TLB flushes.

For the TLB, that does mean KVM needs to flush when using an ASID/EPT4A for the
first time, but KVM needs to do that regardless to guard against a different
hypervisor being loaded previously (where a "different" hypervisor could very
well be an older, buggier version of KVM).

> However, if you have already invalidated EPTP (pgd ptr), then step 2)
> becomes optional, since those stale TLBs are no longer useable by the
> guest due to the change of ASID.

Mostly. It doesn't require an "invalidated EPTP", just a different EPT4A (or
ASID on SVM).

> Am I understanding the point correctly? So, for all invalidated roots,
> the assumption is that we have already called "kvm_reload_remote_mmus()",
> which basically updates the ASID.

No, the assumption (though I'd describe it as a requirement) is that vCPUs can no
longer consume the TLB entries. That could be due to a reload, but as above it
could also be due to KVM knowing KVM_RUN is unreachable.

2022-03-03 19:10:05

by Mingwei Zhang

[permalink] [raw]
Subject: Re: [PATCH v3 09/28] KVM: x86/mmu: Drop RCU after processing each root in MMU notifier hooks

On Sat, Feb 26, 2022, Sean Christopherson wrote:
> Drop RCU protection after processing each root when handling MMU notifier
> hooks that aren't the "unmap" path, i.e. aren't zapping. Temporarily
> drop RCU to let RCU do its thing between roots, and to make it clear that
> there's no special behavior that relies on holding RCU across all roots.
>
> Currently, the RCU protection is completely superficial, it's necessary
> only to make rcu_dereference() of SPTE pointers happy. A future patch
> will rely on holding RCU as a proxy for vCPUs in the guest, e.g. to
> ensure shadow pages aren't freed before all vCPUs do a TLB flush (or
> rather, acknowledge the need for a flush), but in that case RCU needs to
> be held until the flush is complete if and only if the flush is needed
> because a shadow page may have been removed. And except for the "unmap"
> path, MMU notifier events cannot remove SPs (don't toggle PRESENT bit,
> and can't change the PFN for a SP).
>
> Signed-off-by: Sean Christopherson <[email protected]>
> Reviewed-by: Ben Gardon <[email protected]>

Reviewed-by: Mingwei Zhang <[email protected]>
> ---
> arch/x86/kvm/mmu/tdp_mmu.c | 8 ++++----
> 1 file changed, 4 insertions(+), 4 deletions(-)
>
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index 634a2838e117..4f460782a848 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -1100,18 +1100,18 @@ static __always_inline bool kvm_tdp_mmu_handle_gfn(struct kvm *kvm,
> struct tdp_iter iter;
> bool ret = false;
>
> - rcu_read_lock();
> -
> /*
> * Don't support rescheduling, none of the MMU notifiers that funnel
> * into this helper allow blocking; it'd be dead, wasteful code.
> */
> for_each_tdp_mmu_root(kvm, root, range->slot->as_id) {
> + rcu_read_lock();
> +
> tdp_root_for_each_leaf_pte(iter, root, range->start, range->end)
> ret |= handler(kvm, &iter, range);
> - }
>
> - rcu_read_unlock();
> + rcu_read_unlock();
> + }
>
> return ret;
> }
> --
> 2.35.1.574.g5d30c73bfb-goog
>

2022-03-03 19:22:03

by Mingwei Zhang

[permalink] [raw]
Subject: Re: [PATCH v3 11/28] KVM: x86/mmu: WARN if old _or_ new SPTE is REMOVED in non-atomic path

On Sat, Feb 26, 2022, Sean Christopherson wrote:
> WARN if the new_spte being set by __tdp_mmu_set_spte() is a REMOVED_SPTE,
> which is called out by the comment as being disallowed but not actually
> checked. Keep the WARN on the old_spte as well, because overwriting a
> REMOVED_SPTE in the non-atomic path is also disallowed (as evidence by
> lack of splats with the existing WARN).
>
> Fixes: 08f07c800e9d ("KVM: x86/mmu: Flush TLBs after zap in TDP MMU PF handler")
> Cc: Ben Gardon <[email protected]>
> Signed-off-by: Sean Christopherson <[email protected]>
> Reviewed-by: Ben Gardon <[email protected]>

Reviewed-by: Mingwei Zhang <[email protected]>

> ---
> arch/x86/kvm/mmu/tdp_mmu.c | 4 ++--
> 1 file changed, 2 insertions(+), 2 deletions(-)
>
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index 8fbf3364f116..1dcdf1a4fcc1 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -640,13 +640,13 @@ static inline void __tdp_mmu_set_spte(struct kvm *kvm, struct tdp_iter *iter,
> lockdep_assert_held_write(&kvm->mmu_lock);
>
> /*
> - * No thread should be using this function to set SPTEs to the
> + * No thread should be using this function to set SPTEs to or from the
> * temporary removed SPTE value.
> * If operating under the MMU lock in read mode, tdp_mmu_set_spte_atomic
> * should be used. If operating under the MMU lock in write mode, the
> * use of the removed SPTE should not be necessary.
> */
> - WARN_ON(is_removed_spte(iter->old_spte));
> + WARN_ON(is_removed_spte(iter->old_spte) || is_removed_spte(new_spte));
>
> kvm_tdp_mmu_write_spte(iter->sptep, new_spte);
>
> --
> 2.35.1.574.g5d30c73bfb-goog
>

2022-03-03 19:40:05

by Mingwei Zhang

[permalink] [raw]
Subject: Re: [PATCH v3 14/28] KVM: x86/mmu: Skip remote TLB flush when zapping all of TDP MMU

On Sat, Feb 26, 2022, Sean Christopherson wrote:
> Don't flush the TLBs when zapping all TDP MMU pages, as the only time KVM
> uses the slow version of "zap everything" is when the VM is being
> destroyed or the owning mm has exited. In either case, KVM_RUN is
> unreachable for the VM, i.e. the guest TLB entries cannot be consumed.
>
> Signed-off-by: Sean Christopherson <[email protected]>

Reviewed-by: Mingwei Zhang <[email protected]>
> ---
> arch/x86/kvm/mmu/tdp_mmu.c | 11 ++++++-----
> 1 file changed, 6 insertions(+), 5 deletions(-)
>
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index c231b60e1726..87706e9cc6f3 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -874,14 +874,15 @@ bool __kvm_tdp_mmu_zap_gfn_range(struct kvm *kvm, int as_id, gfn_t start,
>
> void kvm_tdp_mmu_zap_all(struct kvm *kvm)
> {
> - bool flush = false;
> int i;
>
> + /*
> + * A TLB flush is unnecessary, KVM zaps everything if and only if the VM
> + * is being destroyed or the userspace VMM has exited. In both cases,
> + * KVM_RUN is unreachable, i.e. no vCPUs will ever service the request.
> + */
> for (i = 0; i < KVM_ADDRESS_SPACE_NUM; i++)
> - flush = kvm_tdp_mmu_zap_gfn_range(kvm, i, 0, -1ull, flush);
> -
> - if (flush)
> - kvm_flush_remote_tlbs(kvm);
> + (void)kvm_tdp_mmu_zap_gfn_range(kvm, i, 0, -1ull, false);
> }
>
> /*
> --
> 2.35.1.574.g5d30c73bfb-goog
>

2022-03-03 19:40:35

by Mingwei Zhang

[permalink] [raw]
Subject: Re: [PATCH v3 07/28] KVM: x86/mmu: Check for !leaf=>leaf, not PFN change, in TDP MMU SP removal

On Sat, Feb 26, 2022, Sean Christopherson wrote:
> Look for a !leaf=>leaf conversion instead of a PFN change when checking
> if a SPTE change removed a TDP MMU shadow page. Convert the PFN check
> into a WARN, as KVM should never change the PFN of a shadow page (except
> when it's being zapped or replaced).
>
> From a purely theoretical perspective, it's not illegal to replace a SP
> with a hugepage pointing at the same PFN. In practice, it's impossible
> as that would require mapping guest memory overtop a kernel-allocated SP.
> Either way, the check is odd.
>
> Signed-off-by: Sean Christopherson <[email protected]>
Reviewed-by: Mingwei Zhang <[email protected]>
> ---
> arch/x86/kvm/mmu/tdp_mmu.c | 7 +++++--
> 1 file changed, 5 insertions(+), 2 deletions(-)
>
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index 189f21e71c36..848448b65703 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -505,9 +505,12 @@ static void __handle_changed_spte(struct kvm *kvm, int as_id, gfn_t gfn,
>
> /*
> * Recursively handle child PTs if the change removed a subtree from
> - * the paging structure.
> + * the paging structure. Note the WARN on the PFN changing without the
> + * SPTE being converted to a hugepage (leaf) or being zapped. Shadow
> + * pages are kernel allocations and should never be migrated.
> */
> - if (was_present && !was_leaf && (pfn_changed || !is_present))
> + if (was_present && !was_leaf &&
> + (is_leaf || !is_present || WARN_ON_ONCE(pfn_changed)))
> handle_removed_pt(kvm, spte_to_child_pt(old_spte, level), shared);
> }
>
> --
> 2.35.1.574.g5d30c73bfb-goog
>

2022-03-03 22:44:48

by Mingwei Zhang

[permalink] [raw]
Subject: Re: [PATCH v3 09/28] KVM: x86/mmu: Drop RCU after processing each root in MMU notifier hooks

On Sat, Feb 26, 2022, Sean Christopherson wrote:
> Drop RCU protection after processing each root when handling MMU notifier
> hooks that aren't the "unmap" path, i.e. aren't zapping. Temporarily
> drop RCU to let RCU do its thing between roots, and to make it clear that
> there's no special behavior that relies on holding RCU across all roots.
>
> Currently, the RCU protection is completely superficial, it's necessary
> only to make rcu_dereference() of SPTE pointers happy. A future patch
> will rely on holding RCU as a proxy for vCPUs in the guest, e.g. to
> ensure shadow pages aren't freed before all vCPUs do a TLB flush (or
> rather, acknowledge the need for a flush), but in that case RCU needs to
> be held until the flush is complete if and only if the flush is needed
> because a shadow page may have been removed. And except for the "unmap"
> path, MMU notifier events cannot remove SPs (don't toggle PRESENT bit,
> and can't change the PFN for a SP).
>
> Signed-off-by: Sean Christopherson <[email protected]>
> Reviewed-by: Ben Gardon <[email protected]>

Reviewed-by: Mingwei Zhang <[email protected]>

> ---
> arch/x86/kvm/mmu/tdp_mmu.c | 8 ++++----
> 1 file changed, 4 insertions(+), 4 deletions(-)
>
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index 634a2838e117..4f460782a848 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -1100,18 +1100,18 @@ static __always_inline bool kvm_tdp_mmu_handle_gfn(struct kvm *kvm,
> struct tdp_iter iter;
> bool ret = false;
>
> - rcu_read_lock();
> -
> /*
> * Don't support rescheduling, none of the MMU notifiers that funnel
> * into this helper allow blocking; it'd be dead, wasteful code.
> */
> for_each_tdp_mmu_root(kvm, root, range->slot->as_id) {
> + rcu_read_lock();
> +
> tdp_root_for_each_leaf_pte(iter, root, range->start, range->end)
> ret |= handler(kvm, &iter, range);
> - }
>
> - rcu_read_unlock();
> + rcu_read_unlock();
> + }
>
> return ret;
> }
> --
> 2.35.1.574.g5d30c73bfb-goog
>

2022-03-04 02:08:40

by Mingwei Zhang

[permalink] [raw]
Subject: Re: [PATCH v3 10/28] KVM: x86/mmu: Add helpers to read/write TDP MMU SPTEs and document RCU

On Sat, Feb 26, 2022, Sean Christopherson wrote:
> Add helpers to read and write TDP MMU SPTEs instead of open coding
> rcu_dereference() all over the place, and to provide a convenient
> location to document why KVM doesn't exempt holding mmu_lock for write
> from having to hold RCU (and any future changes to the rules).
>
> No functional change intended.
>
> Signed-off-by: Sean Christopherson <[email protected]>
> Reviewed-by: Ben Gardon <[email protected]>

Reviewed-by: Mingwei Zhang <[email protected]>
> ---
> arch/x86/kvm/mmu/tdp_iter.c | 6 +++---
> arch/x86/kvm/mmu/tdp_iter.h | 16 ++++++++++++++++
> arch/x86/kvm/mmu/tdp_mmu.c | 6 +++---
> 3 files changed, 22 insertions(+), 6 deletions(-)
>
> diff --git a/arch/x86/kvm/mmu/tdp_iter.c b/arch/x86/kvm/mmu/tdp_iter.c
> index be3f096db2eb..6d3b3e5a5533 100644
> --- a/arch/x86/kvm/mmu/tdp_iter.c
> +++ b/arch/x86/kvm/mmu/tdp_iter.c
> @@ -12,7 +12,7 @@ static void tdp_iter_refresh_sptep(struct tdp_iter *iter)
> {
> iter->sptep = iter->pt_path[iter->level - 1] +
> SHADOW_PT_INDEX(iter->gfn << PAGE_SHIFT, iter->level);
> - iter->old_spte = READ_ONCE(*rcu_dereference(iter->sptep));
> + iter->old_spte = kvm_tdp_mmu_read_spte(iter->sptep);
> }
>
> static gfn_t round_gfn_for_level(gfn_t gfn, int level)
> @@ -89,7 +89,7 @@ static bool try_step_down(struct tdp_iter *iter)
> * Reread the SPTE before stepping down to avoid traversing into page
> * tables that are no longer linked from this entry.
> */
> - iter->old_spte = READ_ONCE(*rcu_dereference(iter->sptep));
> + iter->old_spte = kvm_tdp_mmu_read_spte(iter->sptep);
>
> child_pt = spte_to_child_pt(iter->old_spte, iter->level);
> if (!child_pt)
> @@ -123,7 +123,7 @@ static bool try_step_side(struct tdp_iter *iter)
> iter->gfn += KVM_PAGES_PER_HPAGE(iter->level);
> iter->next_last_level_gfn = iter->gfn;
> iter->sptep++;
> - iter->old_spte = READ_ONCE(*rcu_dereference(iter->sptep));
> + iter->old_spte = kvm_tdp_mmu_read_spte(iter->sptep);
>
> return true;
> }
> diff --git a/arch/x86/kvm/mmu/tdp_iter.h b/arch/x86/kvm/mmu/tdp_iter.h
> index 216ebbe76ddd..bb9b581f1ee4 100644
> --- a/arch/x86/kvm/mmu/tdp_iter.h
> +++ b/arch/x86/kvm/mmu/tdp_iter.h
> @@ -9,6 +9,22 @@
>
> typedef u64 __rcu *tdp_ptep_t;
>
> +/*
> + * TDP MMU SPTEs are RCU protected to allow paging structures (non-leaf SPTEs)
> + * to be zapped while holding mmu_lock for read. Holding RCU isn't required for
> + * correctness if mmu_lock is held for write, but plumbing "struct kvm" down to
> + * the lower depths of the TDP MMU just to make lockdep happy is a nightmare, so
> + * all accesses to SPTEs are done under RCU protection.
> + */
> +static inline u64 kvm_tdp_mmu_read_spte(tdp_ptep_t sptep)
> +{
> + return READ_ONCE(*rcu_dereference(sptep));
> +}
> +static inline void kvm_tdp_mmu_write_spte(tdp_ptep_t sptep, u64 val)
> +{
> + WRITE_ONCE(*rcu_dereference(sptep), val);
> +}
> +
> /*
> * A TDP iterator performs a pre-order walk over a TDP paging structure.
> */
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index 4f460782a848..8fbf3364f116 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -609,7 +609,7 @@ static inline int tdp_mmu_zap_spte_atomic(struct kvm *kvm,
> * here since the SPTE is going from non-present
> * to non-present.
> */
> - WRITE_ONCE(*rcu_dereference(iter->sptep), 0);
> + kvm_tdp_mmu_write_spte(iter->sptep, 0);
>
> return 0;
> }
> @@ -648,7 +648,7 @@ static inline void __tdp_mmu_set_spte(struct kvm *kvm, struct tdp_iter *iter,
> */
> WARN_ON(is_removed_spte(iter->old_spte));
>
> - WRITE_ONCE(*rcu_dereference(iter->sptep), new_spte);
> + kvm_tdp_mmu_write_spte(iter->sptep, new_spte);
>
> __handle_changed_spte(kvm, iter->as_id, iter->gfn, iter->old_spte,
> new_spte, iter->level, false);
> @@ -1046,7 +1046,7 @@ int kvm_tdp_mmu_map(struct kvm_vcpu *vcpu, struct kvm_page_fault *fault)
> * because the new value informs the !present
> * path below.
> */
> - iter.old_spte = READ_ONCE(*rcu_dereference(iter.sptep));
> + iter.old_spte = kvm_tdp_mmu_read_spte(iter.sptep);
> }
>
> if (!is_shadow_present_pte(iter.old_spte)) {
> --
> 2.35.1.574.g5d30c73bfb-goog
>

2022-03-04 05:18:33

by Mingwei Zhang

[permalink] [raw]
Subject: Re: [PATCH v3 08/28] KVM: x86/mmu: Batch TLB flushes from TDP MMU for MMU notifier change_spte

On Sat, Feb 26, 2022, Sean Christopherson wrote:
> Batch TLB flushes (with other MMUs) when handling ->change_spte()
> notifications in the TDP MMU. The MMU notifier path in question doesn't
> allow yielding and correcty flushes before dropping mmu_lock.
nit: correctly
>
> Signed-off-by: Sean Christopherson <[email protected]>
> Reviewed-by: Ben Gardon <[email protected]>
Reviewed-by: Mingwei Zhang <[email protected]>
> ---
> arch/x86/kvm/mmu/tdp_mmu.c | 13 ++++++-------
> 1 file changed, 6 insertions(+), 7 deletions(-)
>
> diff --git a/arch/x86/kvm/mmu/tdp_mmu.c b/arch/x86/kvm/mmu/tdp_mmu.c
> index 848448b65703..634a2838e117 100644
> --- a/arch/x86/kvm/mmu/tdp_mmu.c
> +++ b/arch/x86/kvm/mmu/tdp_mmu.c
> @@ -1203,13 +1203,12 @@ static bool set_spte_gfn(struct kvm *kvm, struct tdp_iter *iter,
> */
> bool kvm_tdp_mmu_set_spte_gfn(struct kvm *kvm, struct kvm_gfn_range *range)
> {
> - bool flush = kvm_tdp_mmu_handle_gfn(kvm, range, set_spte_gfn);
> -
> - /* FIXME: return 'flush' instead of flushing here. */
> - if (flush)
> - kvm_flush_remote_tlbs_with_address(kvm, range->start, 1);
> -
> - return false;
> + /*
> + * No need to handle the remote TLB flush under RCU protection, the
> + * target SPTE _must_ be a leaf SPTE, i.e. cannot result in freeing a
> + * shadow page. See the WARN on pfn_changed in __handle_changed_spte().
> + */
> + return kvm_tdp_mmu_handle_gfn(kvm, range, set_spte_gfn);
> }
>
> /*
> --
> 2.35.1.574.g5d30c73bfb-goog
>