Received: by 2002:a05:6a10:a852:0:0:0:0 with SMTP id d18csp2998350pxy; Mon, 3 May 2021 12:39:36 -0700 (PDT) X-Google-Smtp-Source: ABdhPJy5BvQqc90NYrs1d9EGQB9ZsEGiSxUxzPpf4H6N6aR30BzHwWIljR+uJHEM2BoRDwFthf3n X-Received: by 2002:a17:906:c0c3:: with SMTP id bn3mr3488672ejb.498.1620070776411; Mon, 03 May 2021 12:39:36 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1620070776; cv=none; d=google.com; s=arc-20160816; b=YWHn0RdccL5JuGNFO2QEEO5RmNrE+8UbvdeoaS0WusxQ4F4pPr7lQzWomLJXmdMuFD 6GUFhZ53FzrUcOqILZ1L+f0RWMGa1X2dtOUdkHflnqHtfZqXs5d0kLVb7gkT3tjxRQt4 leIrUSIK5psQpMKhhMxu+UkKkOVzPHpGak99qvB6E4smaLbyZOthJwWSu0zp8frOil4+ DirctiMSKENiORiVeFlWTc/hln7Zo6PZBGCse06TOLoqHtUXrJjd+CUzkgyPZZA2xpMB f9P0/BGTWt5PTiTeFepxOKMxYd8RHABMc4QywgJvclIAkDI+UeWghBcOPogUgwSSmX38 qWZw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:mime-version:message-id:date:references :in-reply-to:subject:cc:to:dkim-signature:dkim-signature:from; bh=QRjECRjxk0gCtINki1a2W5i2rPrFEb8mTYWTcf6OIYs=; b=Nb+KeFSWgg+up/pjPMWxx5gDy0FstKMphEC31HgZF2ibYTxPB02tA1jpqCBriAjTs8 jTxaLFoqUQwCuwGHXNz/URfWZWvzbc+pp/yPCOLnaPRLLaZtcOQCPc9cKI2OkAbMyfGO o8GFfWVuY1YAhn3mXBh2BhfNU6pRBoCYbW0AoMs2ean4m2teDmcQ1bxeV26nOBqpuqoO oFZI6Iahfwh/cFcICdDjVb7bIK9NC3W6vgepjPFn3V3OA0nfh3ZS1NF6oSumcUQA0eLm joxm7sSh8+NEJTmsuYQtv+0o/A/XQLLrURHZoxuVxTHY1qGgEwf3t8g7zq1+HpPs56pG Li0w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linutronix.de header.s=2020 header.b=BoiaW9it; dkim=neutral (no key) header.i=@linutronix.de; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=linutronix.de Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id m21si659610ejo.633.2021.05.03.12.39.11; Mon, 03 May 2021 12:39:36 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@linutronix.de header.s=2020 header.b=BoiaW9it; dkim=neutral (no key) header.i=@linutronix.de; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=linutronix.de Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229727AbhECTiq (ORCPT + 99 others); Mon, 3 May 2021 15:38:46 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:34656 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229472AbhECTiq (ORCPT ); Mon, 3 May 2021 15:38:46 -0400 Received: from galois.linutronix.de (Galois.linutronix.de [IPv6:2a0a:51c0:0:12e:550::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id B4467C06174A; Mon, 3 May 2021 12:37:52 -0700 (PDT) From: Thomas Gleixner DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linutronix.de; s=2020; t=1620070670; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=QRjECRjxk0gCtINki1a2W5i2rPrFEb8mTYWTcf6OIYs=; b=BoiaW9itpBPtVgzdTOBhFgLCo2NGqyL/URF7vVqPcVeHk7/YCM8i86r8kzW0R+jfmnyZjS aTtphJo5qWBkOkMei4JbuBaDLusvwtJfvSjVG6UZEa1/vburSP2lioSO9DMVpFU291coFj jQwFEK6CqwP40lEI5QMCW5YNBgZt16W6r2Nl/3N2oZdB69ImUHRFAi0I26AI6B1F8K9L5o +pKAYZNMafT8flf+Gk998Hn0TLSiKatGDggBcQKj6pp81NMTmln/x177nHuhaOv+SbAI8r iJOdATkRrpftiKERsYzMqkMNc/YpUkWxgLCGck8eNiAdlKByuFgm6p4gurAB5Q== DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=linutronix.de; s=2020e; t=1620070670; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=QRjECRjxk0gCtINki1a2W5i2rPrFEb8mTYWTcf6OIYs=; b=8QTDNKcjtGKkCGlBhiQmL2RmwBgHZ1+ZygEdoaXTJxrRk9Ns3aJGKiqRbrhPNmfYwIpvip /s3Y9tYdmMl0u1DA== To: Lai Jiangshan , linux-kernel@vger.kernel.org Cc: Lai Jiangshan , Paolo Bonzini , Sean Christopherson , Steven Rostedt , Andi Kleen , Andy Lutomirski , Vitaly Kuznetsov , Wanpeng Li , Jim Mattson , Joerg Roedel , kvm@vger.kernel.org, Josh Poimboeuf , Uros Bizjak , Maxim Levitsky , Ingo Molnar , Borislav Petkov , x86@kernel.org, "H. Peter Anvin" , Peter Zijlstra Subject: Re: [PATCH 3/4] KVM/VMX: Invoke NMI non-IST entry instead of IST entry In-Reply-To: <20210426230949.3561-4-jiangshanlai@gmail.com> References: <20210426230949.3561-1-jiangshanlai@gmail.com> <20210426230949.3561-4-jiangshanlai@gmail.com> Date: Mon, 03 May 2021 21:37:49 +0200 Message-ID: <87k0ofk3qq.ffs@nanos.tec.linutronix.de> MIME-Version: 1.0 Content-Type: text/plain Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Apr 27 2021 at 07:09, Lai Jiangshan wrote: > From: Lai Jiangshan > > In VMX, the NMI handler needs to be invoked after NMI VM-Exit. > > Before the commit 1a5488ef0dcf6 ("KVM: VMX: Invoke NMI handler via > indirect call instead of INTn"), the work is done by INTn ("int $2"). > > But INTn microcode is relatively expensive, so the commit reworked > NMI VM-Exit handling to invoke the kernel handler by function call. > And INTn doesn't set the NMI blocked flag required by the linux kernel > NMI entry. So moving away from INTn are very reasonable. > > Yet some details were missed. After the said commit applied, the NMI > entry pointer is fetched from the IDT table and called from the kernel > stack. But the NMI entry pointer installed on the IDT table is > asm_exc_nmi() which expects to be invoked on the IST stack by the ISA. > And it relies on the "NMI executing" variable on the IST stack to work > correctly. When it is unexpectedly called from the kernel stack, the > RSP-located "NMI executing" variable is also on the kernel stack and > is "uninitialized" and can cause the NMI entry to run in the wrong way. > > So we should not used the NMI entry installed on the IDT table. Rather, > we should use the NMI entry allowed to be used on the kernel stack which > is asm_noist_exc_nmi() which is also used for XENPV and early booting. It's not used by XENPV. XENPV only uses the C entry point, but the ASM entry is separate. Thanks, tglx