Date: Thu, 23 Nov 2017 16:59:46 +0100
From: Radim Krčmář
To: Marc Haber
Cc: LKML, "KVM-ML (kvm@vger.kernel.org)", Wanpeng Li
Subject: Re: VMs freezing when host is running 4.14
Message-ID: <20171123155946.GC21184@flask>
In-Reply-To: <20171123152024.7xsc7lesv2qyujng@torres.zugschlus.de>
X-Mailing-List: linux-kernel@vger.kernel.org

2017-11-23 16:20+0100, Marc Haber:
> On Wed, Nov 22, 2017 at 05:43:13PM +0100, Radim Krčmář wrote:
> > 2017-11-22 16:52+0100, Marc Haber:
> > > On Wed, Nov 22, 2017 at 04:04:42PM +0100, 王金浦 wrote:
> > > > So all guest kernels are 4.14, or also other older kernel?
> > >
> > > Guest kernels are also 4.14, but the issue disappears when the host is
> > > downgraded to an older kernel. I therefore reckoned that the guest
> > > kernel doesn't matter, but that was before I saw the trace in the log.
> >
> > The two most suspicious patches since 4.13 (which I assume works) are
> >
> >   664f8e26b00c ("KVM: X86: Fix loss of exception which has not yet been
> >   injected")
>
> That one does not revert cleanly, the line in questions seems to have
> been removed a bit later.
>
> Reject is:
> 141 [24/5001]mh@fan:~/linux/git/linux ((v4.14.1) %) $ cat arch/x86/kvm/vmx.c.rej
> --- arch/x86/kvm/vmx.c
> +++ arch/x86/kvm/vmx.c
> @@ -2516,7 +2516,7 @@ static void vmx_queue_exception(struct kvm_vcpu *vcpu)
>  	struct vcpu_vmx *vmx = to_vmx(vcpu);
>  	unsigned nr = vcpu->arch.exception.nr;
>  	bool has_error_code = vcpu->arch.exception.has_error_code;
> -	bool reinject = vcpu->arch.exception.injected;
> +	bool reinject = vcpu->arch.exception.reinject;
>  	u32 error_code = vcpu->arch.exception.error_code;
>  	u32 intr_info = nr | INTR_INFO_VALID_MASK;

This line can be deleted, as reinject isn't used in the function.

Btw. there have already been many fixes from Liran Alon for that patch, and
your case could be the one addressed in

  https://www.spinics.net/lists/kvm/msg159158.html

The patch is incorrect, but you might be able to see only its benefits.

> > and
> >
> >   9a6e7c39810e ("KVM: async_pf: Fix #DF due to inject "Page not Present"
> >   and "Page Ready" exceptions simultaneously")
> >
> > please try reverting them to see if it helps,
>
> That one reverted cleanly. I am now running the new kernel on the
> affected machine, and I think that a second machine has joined the
> market of being affected.

That one had much lower chances of being the culprit.

> Would this matter on the host only or on the guests as well?

Only on the host.

Thanks.