Received: by 2002:a05:6358:9144:b0:117:f937:c515 with SMTP id r4csp10988277rwr; Fri, 12 May 2023 16:50:51 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ4DaOKfA7geB9knBQpTOwR+ZPaZKgQSFjXUkr412LzyG2j0Nje80c0+rWwqAbI4G6+VOgRb X-Received: by 2002:a17:902:b907:b0:1ac:84dd:6d1f with SMTP id bf7-20020a170902b90700b001ac84dd6d1fmr17325884plb.1.1683935450835; Fri, 12 May 2023 16:50:50 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1683935450; cv=none; d=google.com; s=arc-20160816; b=YWm0s9mWBsuWYiMPkfC7Xe6oJa5AcD73GcDoaXI0Q3sp4oB63tetyxFbeLgqNz+yfD 7sXcA6NpO8eDtZH058XWZefJL1ZzsLNngBLWVvGNmBl+a0PNEwParDmJaXk9o1yHqHFf YhZVWtfxCdRTpuDE367ddo/4S9oR0FsxCQsx1AVZs1BDEO7439WbnZYsm71Kff/ijO4i 4wGyxX48awHiJ6dWxWoLXGClDI40mMhg6CRMuM1gUnhlSUQkRlFp37FFRiuuiNlHqhxw dyexfmBVJzM8Gg4A5KFZO0Bymi+kW0Zko9OXWE+fw0qgyyCnKzBCyW+tw6YORx+4K5fR ERSA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:from:subject:message-id:references :mime-version:in-reply-to:date:dkim-signature; bh=rdSWNdPBzCvQZue8s+nJ8+4j93W1/3mOYRyvy9I/esk=; b=Wji/jx1LUN9tYs20qlla77M7tMLfy+gEnnZ9v5Zq8eaDY6cOud4aSgbWTKP3zp6KWn 7EtjP39QAPwN0f0doQzzH0Axn3xq4FWZJWsIQnBaMCE9BNlcAqSIxVPj2gKPtytnKuAD exUKMPbFpLoBpYwTD+YkRIEq/Lk/zCKWWxyNQuWPcLscptOkMUsC/POzq9osQ67Jfi71 V4kJY0kH2KwsJEWWlIq6DwyWomP7Fw/92IMeKLNfjNvxsgDc+3y+U7XEU7mtr/blrFkY MvOm29eBskFwIciovbZ3zhjNP0hHBEmNgJzaqC80EXwYiPUsd6/gX4S1OGTa3+brmc1W uTEg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20221208 header.b=s748YbYX; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id u13-20020a17090341cd00b001ac4c0de994si10947084ple.564.2023.05.12.16.50.38; Fri, 12 May 2023 16:50:50 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20221208 header.b=s748YbYX; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S240832AbjELXlD (ORCPT + 99 others); Fri, 12 May 2023 19:41:03 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:47258 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S240666AbjELXlA (ORCPT ); Fri, 12 May 2023 19:41:00 -0400 Received: from mail-pg1-x54a.google.com (mail-pg1-x54a.google.com [IPv6:2607:f8b0:4864:20::54a]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id B8F181FC1 for ; Fri, 12 May 2023 16:40:59 -0700 (PDT) Received: by mail-pg1-x54a.google.com with SMTP id 41be03b00d2f7-518d6f87a47so5259695a12.3 for ; Fri, 12 May 2023 16:40:59 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20221208; t=1683934859; x=1686526859; h=cc:to:from:subject:message-id:references:mime-version:in-reply-to :date:from:to:cc:subject:date:message-id:reply-to; bh=rdSWNdPBzCvQZue8s+nJ8+4j93W1/3mOYRyvy9I/esk=; b=s748YbYXBKYE0u5/i9t2zN0VCehO3hUO821bHn0KtVX4LXrWfGzZ4Jbc6Q9rO3V43b RiyXOdLXeaLu1852urhveSmkofnGqK7qbxSK014EGoO6mUFQdrZhTVh7xP56qFs/ATrE MVCFF+tfhGcmsE/KSSsyntK9K+1Z48J82yBZDZCMbNAcXrXE5esiP7TaLOp2ffybU5c3 if3wbDGggFxaCeOy3aqvsUdvpZtLMT/bklfy6heYp/Wg3hXEsrFldIoHxeZAA2BayQHp 3Fhz7FG2ldpIJhdmn4rkX9lUdmSuRINZLW+58knj9s89UwMfOzRBvTXpwOeXTBrz3txA JCBQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1683934859; x=1686526859; h=cc:to:from:subject:message-id:references:mime-version:in-reply-to :date:x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=rdSWNdPBzCvQZue8s+nJ8+4j93W1/3mOYRyvy9I/esk=; b=ihn1ydqubotgx/eKc4LqllLdsc46/MLNuzz3+jBmgB0dlP4WxCj3QS124ahQBpSvGz nVWAmqjKLOTYpnINlB0QX2kw7AaQkeDosSBxqC+TkT47Uzx94hZxD4rqHk8l25+Wy22X aVnpyldVP0pQjmI0J97iHFXjgq3JBml3T9OyAvO5MQrYAZb/9k0ccKUOR7FjVlaslruL /o22E46P7JKnV3DEKPTbK0nkfpnECfEQUcwkljLRat/qihR/bLG221noMfStiqgFY3XD JQDNygeDELS5dg0yXN/rD6FWOtvyjQItxxYseKtIt4bXtyU5MXjn4hCmw3m63B76crUR PMfQ== X-Gm-Message-State: AC+VfDz+iKAWWWilVjWglmh+7OcnVL7irAvR4Kjr7DSY1i/D2jvibYYT yQ3k1L52Acr5r0HTvIYF4VzRILYS/yw= X-Received: from zagreus.c.googlers.com ([fda3:e722:ac3:cc00:7f:e700:c0a8:5c37]) (user=seanjc job=sendgmr) by 2002:a63:2b04:0:b0:521:62ef:9b38 with SMTP id r4-20020a632b04000000b0052162ef9b38mr7266189pgr.3.1683934859334; Fri, 12 May 2023 16:40:59 -0700 (PDT) Date: Fri, 12 May 2023 16:40:57 -0700 In-Reply-To: Mime-Version: 1.0 References: <20230511235917.639770-1-seanjc@google.com> <20230511235917.639770-7-seanjc@google.com> Message-ID: Subject: Re: [PATCH 6/9] KVM: x86/mmu: Bug the VM if a vCPU ends up in long mode without PAE enabled From: Sean Christopherson To: David Matlack Cc: Paolo Bonzini , kvm@vger.kernel.org, linux-kernel@vger.kernel.org, Mingwei Zhang , Jim Mattson Content-Type: text/plain; charset="us-ascii" X-Spam-Status: No, score=-9.6 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE,USER_IN_DEF_DKIM_WL autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, May 12, 2023, David Matlack wrote: > On Thu, May 11, 2023 at 04:59:14PM -0700, Sean Christopherson wrote: > > Promote the ASSERT(), which is quite dead code in KVM, into a KVM_BUG_ON() > > for KVM's sanity check that CR4.PAE=1 if the vCPU is in long mode when > > performing a walk of guest page tables. The sanity is quite cheap since > > neither EFER nor CR4.PAE requires a VMREAD, especially relative to the > > cost of walking the guest page tables. > > > > More importantly, the sanity check would have prevented the true badness > > fixed by commit 112e66017bff ("KVM: nVMX: add missing consistency checks > > for CR0 and CR4"). The missed consistency check resulted in some versions > > of KVM corrupting the on-stack guest_walker structure due to KVM thinking > > there are 4/5 levels of page tables, but wiring up the MMU hooks to point > > at the paging32 implementation, which only allocates space for two levels > > of page tables in "struct guest_walker32". > > > > Queue a page fault for injection if the assertion fails, as the sole > > caller, FNAME(gva_to_gpa), assumes that walker.fault contains sane info > > FNAME(page_fault)->FNAME(walk_addr)->FNAME(walk_addr_generic) is another > caller but I think the same reasoning here applies. Huh. No idea what I was doing. Missed the super obvious use case... I'll make sure the call from walk_addr() does something not awful. > > on a walk failure, i.e. avoid making the situation worse between the time > > the assertion fails and when KVM kicks the vCPU out to userspace (because > > the VM is bugged). > > > > Move the check below the initialization of "pte_access" so that the > > aforementioned to-be-injected page fault doesn't consume uninitialized > > stack data. The information _shouldn't_ reach the guest or userspace, > > but there's zero downside to being paranoid in this case. > > > > Signed-off-by: Sean Christopherson > > --- > > arch/x86/kvm/mmu/paging_tmpl.h | 5 ++++- > > 1 file changed, 4 insertions(+), 1 deletion(-) > > > > diff --git a/arch/x86/kvm/mmu/paging_tmpl.h b/arch/x86/kvm/mmu/paging_tmpl.h > > index a3fc7c1a7f8d..f297e9311dcd 100644 > > --- a/arch/x86/kvm/mmu/paging_tmpl.h > > +++ b/arch/x86/kvm/mmu/paging_tmpl.h > > @@ -338,7 +338,6 @@ static int FNAME(walk_addr_generic)(struct guest_walker *walker, > > } > > #endif > > walker->max_level = walker->level; > > - ASSERT(!(is_long_mode(vcpu) && !is_pae(vcpu))); > > > > /* > > * FIXME: on Intel processors, loads of the PDPTE registers for PAE paging > > @@ -348,6 +347,10 @@ static int FNAME(walk_addr_generic)(struct guest_walker *walker, > > nested_access = (have_ad ? PFERR_WRITE_MASK : 0) | PFERR_USER_MASK; > > > > pte_access = ~0; > > + > > + if (KVM_BUG_ON(is_long_mode(vcpu) && !is_pae(vcpu), vcpu->kvm)) > > + goto error; > > This if() deserves a comment since it's queueing a page fault for what > is likely a KVM bug. As a reader that'd be pretty jarring to see. Will add.