Received: by 2002:a05:6a10:a852:0:0:0:0 with SMTP id d18csp1702293pxy; Thu, 6 May 2021 13:45:16 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzNJ1KX8t14ekbKT0jCMDwfF9MZ99LylTVN4bPkoVUSkYpkEpre93ac3V2Uk0rCtCaC1iVR X-Received: by 2002:a17:90a:c8:: with SMTP id v8mr19725730pjd.18.1620333916136; Thu, 06 May 2021 13:45:16 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1620333916; cv=none; d=google.com; s=arc-20160816; b=Pa2+G1WTg6y7mWqUq43eQFg+TzesdQuHEL+OfAUHdgHa9ChGd24P1jOdYh2Syxe1fH 1mpIIbxiETxfzzvyeWaq/owaxlLv4u/sqO2bhXh/IAr0ZrqMXsUjQHg0j4/5FERZxNQ2 2HWOgjw5N51oDU0Ef0xLNL+firxQL55hDnq6KutuLK2h23MmoHnvOzygxRDAJ3UEUo/6 of6UjQ6AJM0k6kLp5gyGNPgVn6hl/sSqqj0uhZzI4CzXtM0bjmKyA8ME1zUH9cinYfuI sGnzjErZ13dORrrdgQACtICV/Q/8J7tq44sWqpv4Buo72k4SRmG9tbQ8RtRSKV9x+KC0 +Xwg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date; bh=EJubi0y+oYsVmoQJpM2e8vSc2Cery4eiyclNOgAN7d8=; b=buyVd29vKT+TwAMy47fLczugcmTXJGufrwTCRat4DpryeMNEhSbewNxD6QeT90A9PL 9Hl1z3x7wUH7DdnVla3lUYFBOh+icPw0qgjlkoBeAvke9BLhk43i+wcEnkBchEyhZwVi jPVNGHlLxmAjqQYIYNB5l6U6ITWSj/MCpAN4FX8Eyfmb0nw4nHtQTPdRojnVO+yCKWXu A1Wv+WkYvf+lssFgENfRn76X9f+Hh3mKg57ewSpUgQVjvYicT7MgQUOJoY2uTMEOEzQQ VFS2BK0ZHZRDyBnBah/mWvnfdRX0qIr1lrsUvXSCw31XRrw2KjhONC+pEK5CSGNiCSPq CKvA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=8bytes.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id f12si4780002pfj.138.2021.05.06.13.44.58; Thu, 06 May 2021 13:45:16 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=8bytes.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233401AbhEFUmI (ORCPT + 99 others); Thu, 6 May 2021 16:42:08 -0400 Received: from 8bytes.org ([81.169.241.247]:37800 "EHLO theia.8bytes.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231200AbhEFUmH (ORCPT ); Thu, 6 May 2021 16:42:07 -0400 Received: by theia.8bytes.org (Postfix, from userid 1000) id E58A0312; Thu, 6 May 2021 22:41:07 +0200 (CEST) Date: Thu, 6 May 2021 22:41:05 +0200 From: Joerg Roedel To: "Eric W. Biederman" Cc: Joerg Roedel , x86@kernel.org, kexec@lists.infradead.org, stable@vger.kernel.org, hpa@zytor.com, Andy Lutomirski , Dave Hansen , Peter Zijlstra , Jiri Slaby , Dan Williams , Tom Lendacky , Juergen Gross , Kees Cook , David Rientjes , Cfir Cohen , Erdem Aktas , Masami Hiramatsu , Mike Stunes , Sean Christopherson , Martin Radev , Arvind Sankar , linux-coco@lists.linux.dev, linux-kernel@vger.kernel.org, kvm@vger.kernel.org, virtualization@lists.linux-foundation.org Subject: Re: [PATCH 2/2] x86/kexec/64: Forbid kexec when running as an SEV-ES guest Message-ID: References: <20210506093122.28607-1-joro@8bytes.org> <20210506093122.28607-3-joro@8bytes.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, May 06, 2021 at 01:59:42PM -0500, Eric W. Biederman wrote: > Joerg Roedel writes: > Why does it need that? > > Would it not make sense to instead teach kexec how to pass a cpu from > one kernel to another. We could use that everywhere. > > Even the kexec-on-panic case should work as even in that case we have > to touch the cpus as they go down. > > The hardware simply worked well enough that it hasn't mattered enough > for us to do something like that, but given that we need to do something > anyway. It seems like it would make most sense do something that > will work everywhere, and does not introduce unnecessary dependencies > on hypervisors. Well, I guess we could implement something that even works for non SEV-ES guests and bare-metal. The question is what benefit we get from that. Is the SIPI sequence currently used not reliable enough? The benefit of being able to rely on the SIPI sequence is that the kexec'ed kernel can use the same method to bring up APs as the first kernel did. Btw, the same is true for SEV-ES guests, The goal is bring the APs of an SEV-ES guest into a state where they will use the SEV-ES method of AP-bringup when they wake up again. This method involves a firmware-owned page called the AP-jump-table, which contains the reset vector for the AP in its first 4 bytes. > > As I said above, for protocol version 1 it will stay disabled, so it is > > not only a temporary hack. > > Why does bringing up a cpu need hypervisor support? When a CPU is taken offline under SEV-ES it will do a special hypercall named AP-reset-hold. The hypervisor will put the CPU into a halt state until the next SIPI arrives. In protocol version 1 this hypercall requires a GHCB shared page to be set up for the CPU doing the hypercall and upon CPU wakeup the HV will write to that shared page. Problem with that is that the page which contains the GHCB is already owned by the new kernel then and is probably not shared anymore, which can cause data corruption in the new kernel. Version 2 of the protocol adds a purely MSR based version of the AP-reset-hold hypercall. This one does not need a GHCB page and has to be used to bring the APs offline before doing the kexec. That is the reason I plan to only support kexec when the hypervisor provides version 2 of the protocol. Regards, Joerg