Received: by 2002:a25:ad19:0:0:0:0:0 with SMTP id y25csp3490155ybi; Fri, 19 Jul 2019 04:09:41 -0700 (PDT) X-Google-Smtp-Source: APXvYqxJv2fbGe6n3Dy8nZhZFw0lDPi8RLZtcOTSTDXkI+yG8Otr7cSdWE16Ejj8caluhpJzLxq9 X-Received: by 2002:a17:902:8696:: with SMTP id g22mr54959528plo.249.1563534581498; Fri, 19 Jul 2019 04:09:41 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1563534581; cv=none; d=google.com; s=arc-20160816; b=gdc+yzqLK3aCWDrVF35uANh5RZSlYJz3XT+DW49zOnnZM7g6DmMYgAo67MDqWq2Sjx Hr+giiSUk/mV6zpzKgFQXvPBI3YHPFV8UQLrL4awr+xaaDmBYWVYXjbijKQGVSg52kx7 bl+iLGABQLrGvGIFl3rOvD2P9yMKIJqB8vERexcEuLArSMq6wujlpR0Pg4zMG30ZfdR+ hCKyLB1x50dStwWxyd0jjrY9QMTX9nXnQo3H5xnA/yeSHvXcLxUrYg4Xg8tTFU/2E1Ax DU4u29bq9oguLE1aUJ+ABDtq1kbRJvzE8p7akCo9wImqp6gkhi0yBsMUK9dm0z2m/vqC plvA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:openpgp:from:references:cc:to:subject; bh=jZTpt124xkvIC199mqq/Y1JoK7GIwu0z9IqI5bP/sDw=; b=sSXE67SlmtDs9wK2/lLhtADhCcrR9OwGODLFHNcSpdS8Jculi2qX/BMUlQEDXvZLtR /PROrS/EIsyG3tkTXLHN5of0TmgweHEMrXAs27XUahpHzVs2szLSguZVGW1BXivcJPmB IHqMS4g27mT4JqPTLGfuwgmiDmpGFedIkATriRb4sqRbq9D65XCL/BussEQ2XV1XVnBU wjiyQFJz7D+4QMV8L0HLT3jVAQxYkk+//72BSIET3vIJ0YUfzS7Hw6KyFoQoG21C4j7M F3tzCtQw9LpybvpSb5uq4p/NQnGTzkNzQ3pXf7DOxCmz+gruo2SoklsZX5wKLwpzldC5 3Bwg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id f1si1635324pjs.86.2019.07.19.04.09.25; Fri, 19 Jul 2019 04:09:41 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727486AbfGSLJH (ORCPT + 99 others); Fri, 19 Jul 2019 07:09:07 -0400 Received: from mail-wm1-f67.google.com ([209.85.128.67]:50447 "EHLO mail-wm1-f67.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726075AbfGSLJH (ORCPT ); Fri, 19 Jul 2019 07:09:07 -0400 Received: by mail-wm1-f67.google.com with SMTP id v15so28426914wml.0 for ; Fri, 19 Jul 2019 04:09:05 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:openpgp:message-id :date:user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=jZTpt124xkvIC199mqq/Y1JoK7GIwu0z9IqI5bP/sDw=; b=Xr3k9i5F5DJu1/S7/rCxV4AX6IUw0oz8z1WAHK4jJuBlI22iSiuyo3c2j88U82TD8C m0WMYwrzxenCXAT65hUgUKZpPxX2UXyil+6x1xli09trz91/g22Ojk67n4MY3y3DYyG6 ovOsMqowWafm1te2BdSU9g8N+FTvnMqpj4oXvUHG+dbv8f+eBhzmq9fZxNUAqms2diJC jIXW1HpdEfRNZrBp8SH5Z++Fiw+uga/mHk47jIzZVFN01EcPjY2P1pEB3N18i2PqR4rw RZiWrUKkclhuBRzcgf+PGKfSXE/yDaxHnr9hE8YyQXm19f3UqlV6K5BwakPTc+SySvVh gkOg== X-Gm-Message-State: APjAAAWywainqqz39MaNq+WYwALdiHY3T995Bn+u2uT5WBnZPTwAXCR2 nwAoUi7+RP+lidFORHKkM67zCQ== X-Received: by 2002:a7b:c215:: with SMTP id x21mr47986773wmi.38.1563534544929; Fri, 19 Jul 2019 04:09:04 -0700 (PDT) Received: from ?IPv6:2001:b07:6468:f312:8501:6b03:f18c:74f8? ([2001:b07:6468:f312:8501:6b03:f18c:74f8]) by smtp.gmail.com with ESMTPSA id l2sm20503956wmj.4.2019.07.19.04.09.03 (version=TLS1_3 cipher=AEAD-AES128-GCM-SHA256 bits=128/128); Fri, 19 Jul 2019 04:09:04 -0700 (PDT) Subject: Re: [5.2 regression] x86/fpu changes cause crashes in KVM guest To: Wanpeng Li , Thomas Lambertz Cc: Sebastian Andrzej Siewior , Rik van Riel , Dave Hansen , Thomas Gleixner , Ingo Molnar , Borislav Petkov , "H. Peter Anvin" , the arch/x86 maintainers , LKML , Radim Krcmar , kvm , Peter Zijlstra , Marc Orr , Dave Hansen References: <217248af-e980-9cb0-ff0d-9773413b9d38@thomaslambertz.de> From: Paolo Bonzini Openpgp: preference=signencrypt Message-ID: <3ae96202-a121-70a9-fe00-4b5bb4970242@redhat.com> Date: Fri, 19 Jul 2019 13:09:03 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 19/07/19 10:59, Wanpeng Li wrote: > https://lkml.org/lkml/2017/11/14/891, "The scheduler will save the > guest fpu context when a vCPU thread is preempted, and restore it when > it is scheduled back in." But I can't find any scheduler codes do > this. That's because applying commit 240c35a37 was completely wrong. The idea before commit 240c35a37 was that you have the following FPU states: userspace (QEMU) guest --------------------------------------------------------------------------- processor vcpu->arch.guest_fpu >>> KVM_RUN: kvm_load_guest_fpu vcpu->arch.user_fpu processor >>> preempt out vcpu->arch.user_fpu current->thread.fpu >>> preempt in vcpu->arch.user_fpu processor >>> back to userspace >>> kvm_put_guest_fpu processor vcpu->arch.guest_fpu --------------------------------------------------------------------------- After removing user_fpu, QEMU's FPU state is destroyed when KVM_RUN is preempted. So that's already messed up (I'll send a revert), and given the diagram above your patch makes total sense. With the new lazy model we want to hook into kvm_vcpu_arch_load and get the state back to the processor from current->thread.fpu, and indeed switch_fpu_return is essentially copy_kernel_to_fpregs(¤t->thread. fpu->state). However I would keep the fpregs_assert_state_consistent in kvm_arch_vcpu_load, and also WARN_ON_ONCE(test_thread_flag(TIF_NEED_FPU_LOAD)) in vcpu_enter_guest. Paolo