Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751366AbdH0MfN (ORCPT ); Sun, 27 Aug 2017 08:35:13 -0400 Received: from tartarus.angband.pl ([89.206.35.136]:52864 "EHLO tartarus.angband.pl" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751152AbdH0MfL (ORCPT ); Sun, 27 Aug 2017 08:35:11 -0400 Date: Sun, 27 Aug 2017 14:35:05 +0200 From: Adam Borowski To: Paolo Bonzini Cc: Wanpeng Li , Radim =?utf-8?B?S3LEjW3DocWZ?= , kvm , "linux-kernel@vger.kernel.org" Subject: Re: kvm splat in mmu_spte_clear_track_bits Message-ID: <20170827123505.u4kb24kigjqwa2t2@angband.pl> References: <20170820231302.s732zclznrqxwr46@angband.pl> <20170821191203.jospdwqpnixlotx3@angband.pl> <20170821195833.GA696@flask> <20170821223228.edc6jrm7bpybtqlj@angband.pl> <1c270e76-05be-6f5f-29c6-9cb31f37f71d@redhat.com> <20170825131419.r5lzm6oluauu65nx@angband.pl> <0a85df4b-ca0a-7e70-51dc-90bd1c460c85@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <0a85df4b-ca0a-7e70-51dc-90bd1c460c85@redhat.com> X-Junkbait: aaron@angband.pl, zzyx@angband.pl User-Agent: NeoMutt/20170113 (1.7.2) X-SA-Exim-Connect-IP: X-SA-Exim-Mail-From: kilobyte@angband.pl X-SA-Exim-Scanned: No (on tartarus.angband.pl); SAEximRunCond expanded to false Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1848 Lines: 56 On Fri, Aug 25, 2017 at 03:40:50PM +0200, Paolo Bonzini wrote: > On 25/08/2017 15:14, Adam Borowski wrote: > >>> I would also try commit 1372324b328cd5dabaef5e345e37ad48c63df2a9 to > >>> identify whether it was caused by a KVM change in 4.13 or something > >>> else. > > I've ran different guests for a couple of hours, no explosions. Thus it > > looks like updating Cornelia's email address isn't the cause. > > > > Too bad, there's 15k commits between 1372324b and 7f680d7ec315. > > So: > > - 4.12 works > > - 1372324b328cd5dabaef5e345e37ad48c63df2a9 works > > - 4.13-rc6 fails > > The next ones to test are going to be > c136b84393d4e340e1b53fc7f737dd5827b19ee5 and 4.13-rc1. c136b84393d4e340e1b53fc7f737dd5827b19ee5 works 4.13-rc1 works Which actually isn't so surprising, as I pull from linus/master every Sunday night/Monday morning, so I'd would have noticed KVM breakage sooner. 4.13-rc4 works 4.13-rc5 works (but ↓) v4.13-rc5-173-g58d4e450a490 works... uh oh. It shouldn't -- it's merge-base between mainline and what I was running on the initial crash, and I'm sure anything non-mainline I had isn't the culprit. After a couple of hours of running various loads inside and outside KVM, I finally got it to crash. 4.13-rc5 retested fails Crashed only after two hours or so of testing. 4.13-rc4 apparently works It survived several hours of varied tests (like 5 debian-installer runs, a win10 point release upgrade, some hurd package building, openbsd, etc), all while the host was likewise busy. Thus: to the best of my knowledge, the problem is between 4.13-rc4 and 4.13-rc5 but I wouldn't bet my life on it. Meow! -- ⢀⣴⠾⠻⢶⣦⠀ ⣾⠁⢰⠒⠀⣿⡁ Vat kind uf sufficiently advanced technology iz dis!? ⢿⡄⠘⠷⠚⠋⠀ -- Genghis Ht'rok'din ⠈⠳⣄⠀⠀⠀⠀