Subject: Re: Unexpected interrupt received in Guest OS when booting after "system_reset"
From: Marc Zyngier
Organization: ARM Ltd
To: Heyi Guo, Christoffer Dall
Cc: linux-arm-kernel@lists.infradead.org, kvmarm@lists.cs.columbia.edu, linux-kernel@vger.kernel.org, wanghaibin 00208455
Date: Fri, 29 Mar 2019 10:54:49 +0000
References: <9a6ece7e-9984-dc9e-8fa2-df9736393dd2@arm.com> <9694b5f2-80bd-b85c-8fc5-bd1d917e1b33@huawei.com> <2e8f8bfb-4ae2-5fc6-4022-222a8f44e1f6@huawei.com>
In-Reply-To: <2e8f8bfb-4ae2-5fc6-4022-222a8f44e1f6@huawei.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On 29/03/2019 09:19, Heyi Guo wrote:
> Hi Marc,
>
> The patch works. I tested for 1.5 hours and 52 VM resets. There were
> 16 times that a virtual LPI was left in the ap_list (seen by an
> additional printk) during reset and we never saw "Unexpected
> interrupt received" any more.

Thanks for testing, much appreciated.

> Just a minor comment: how about replacing /vcpu->arch.vgic_cpu./ with
> /vgic_cpu->/ in the lock/unlock code line, to reduce some words?

Well, as I said, the patch is wrong in other ways, so I wouldn't bother
with that. It only serves as a test for my theory. I think I'm slowly
warming up to your initial proposal to hook things into the
PROPBASER/PENDBASER registers, as the LPIs do have a life outside of the
ITS itself.
I'll try to respin something next week.

Thanks,

	M.

>
> Thanks,
>
> Heyi
>
> On 2019/3/29 9:19, Heyi Guo wrote:
>>
>>
>> On 2019/3/29 1:18, Marc Zyngier wrote:
>>> [Please do not send HTML emails]
>> Sorry; will keep in mind next time :)
>>>
>>> On 28/03/2019 15:44, Heyi Guo wrote:
>>>> Hi Marc and Christoffer,
>>>>
>>>> When we issue "system_reset" from qemu monitor to a running VM, guest
>>>> Linux will occasionally get "Unexpected interrupt" after rebooting, with
>>>> kernel message at the bottom.
>>>>
>>>> After some investigation, we found it might be caused by the
>>>> preservation of virtual LPI during system reset: it seems the virtual
>>>> LPI remains in the ap_list during VM reset, as well as its "enabled" and
>>>> "pending_latch" status, and this causes the virtual LPI to be injected
>>>> wrongly after VCPU reboots and enables interrupts.
>>>>
>>>> We propose to clear "enabled" flag of virtual LPI when PROPBASER (or
>>>> GICR_CTLR) of virtual GICR is written to 0, and update virtual LPI
>>>> properties when GICR_CTLR.EnableLPIs is set to 1 again.
>>>>
>>>> Any advice? Or did we miss something?
>>> We're clearly missing a trick here, but I'm not convinced of your
>>> approach.
>> To be honest, we were not fully convinced ourselves either. I was
>> worrying about the guest switching GICR_CTLR or GICR_PROPBASER at
>> runtime, which would probably cause issues for our rough approach.
>>
>>> What should happen is that the redistributors should be reset
>>> as well, and that this should recall any LPI that has been made pending.
>>> Unfortunately, we don't seem to have such code in place, which is
>>> embarrassing.
>>>
>>> Can you give the following, untested patch a go? It isn't right either,
>>> but it should have the right effect. If you confirm that it solves your
>>> problem, we can look at adding the right hooks...
>> Thanks, I'll test this and get back to you.
>> Heyi
>>
>>> Thanks,
>>>
>>> M.
>>>
>>> diff --git a/virt/kvm/arm/vgic/vgic-its.c b/virt/kvm/arm/vgic/vgic-its.c
>>> index ab3f47745d9c..bd9a9250f323 100644
>>> --- a/virt/kvm/arm/vgic/vgic-its.c
>>> +++ b/virt/kvm/arm/vgic/vgic-its.c
>>> @@ -2403,8 +2403,32 @@ static int vgic_its_commit_v0(struct vgic_its *its)
>>>  	return 0;
>>>  }
>>> +static void vgic_nuke_pending_lpis(struct kvm_vcpu *vcpu)
>>> +{
>>> +	struct vgic_cpu *vgic_cpu = &vcpu->arch.vgic_cpu;
>>> +	struct vgic_irq *irq, *tmp;
>>> +	unsigned long flags;
>>> +
>>> +	raw_spin_lock_irqsave(&vcpu->arch.vgic_cpu.ap_list_lock, flags);
>>> +
>>> +	list_for_each_entry_safe(irq, tmp, &vgic_cpu->ap_list_head, ap_list) {
>>> +		if (irq->intid >= VGIC_MIN_LPI) {
>>> +			list_del(&irq->ap_list);
>>> +			vgic_put_irq(vcpu->kvm, irq);
>>> +		}
>>> +	}
>>> +
>>> +	raw_spin_unlock_irqrestore(&vcpu->arch.vgic_cpu.ap_list_lock, flags);
>>> +}
>>> +
>>>  static void vgic_its_reset(struct kvm *kvm, struct vgic_its *its)
>>>  {
>>> +	struct kvm_vcpu *vcpu;
>>> +	int c;
>>> +
>>> +	kvm_for_each_vcpu(c, vcpu, kvm)
>>> +		vgic_nuke_pending_lpis(vcpu);
>>> +
>>>  	/* We need to keep the ABI specific field values */
>>>  	its->baser_coll_table &= ~GITS_BASER_VALID;
>>>  	its->baser_device_table &= ~GITS_BASER_VALID;
>>>
>>
>>
>>
>> .
>>
>
>
-- 
Jazz is not dead. It just smells funny...