Received: by 2002:a17:90a:1609:0:0:0:0 with SMTP id n9csp131799pja; Thu, 9 Apr 2020 16:01:53 -0700 (PDT) X-Google-Smtp-Source: APiQypKfFevS1Kov/k9bvhv4sKGP73FslyYFhYTK0Q/JFXmaxhxVmTrvI1uGDmcMYrEFAddqY1kI X-Received: by 2002:ae9:efca:: with SMTP id d193mr1252992qkg.465.1586473313742; Thu, 09 Apr 2020 16:01:53 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1586473313; cv=none; d=google.com; s=arc-20160816; b=nB1Snevu8SbmUHEfninhEKmjDO4k/uf8LVykZcEqRnXXY3GErSyOKIg9vXfWmynLp/ vN1XnTG0FswOB+a76QnWqRXuUBKlZB4Et+04ZjoDY6qr5kE+HM4+7bACaZTJQcsnVkGF Pdu9f/2gv4fnfFFkAE6NDh/A3XuOpVexi6MTJHHwxIfBuv26YXTLHWzbdpxi8y4Es6mJ lpYX840Yb8bTESZykslla+4NHX6iocCyNPzlbWpQFVt2iZLc/jAG3k/NJWl546ySC5Qd t57Jy01Ooc1nu1FHiFUDOHCMoFRvIv/if70LdwkvG0Hy/qz5yLLuel+Dp/jVm+4Txpzc LSxQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:to:references:message-id :content-transfer-encoding:cc:date:in-reply-to:from:subject :mime-version:dkim-signature; bh=9za1cgL5saDGUL4ncdk/vOmVKH89NTfUqWctXJV+wKg=; b=cJlhbNEB54QepSyAXjgP5UHaCAyPF0D0KJBZ2jpvbWUUKcSeOOqwprwTDti750ku4s n/0zHfgGbmkiUUQgxO0spuUgW0+QJU2Y/4EjKqTpN5ruEcu7LR52coIMVspH/1yrYXkH 3WystrkdfyH6spyPEP0ZmoVWkEqD2ZzovX1A285nOrxzNKS2PpRBURoyACLh9NxILg1s MkXskOjwYIul0f+Hpy/I1qWLtS07EClYXrakwiJk58zxcBBzPe75Dm+ploY+LJ9rZf02 CzWIOYlrbI/e/m8IQBCTYi9+tycXuQ1mfaK3fwlbNZVjdbhIW4Vahic5XDYdAviB8Zom ZXwQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@lca.pw header.s=google header.b=E55hQkfn; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id d29si106319qtg.116.2020.04.09.16.01.19; Thu, 09 Apr 2020 16:01:53 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@lca.pw header.s=google header.b=E55hQkfn; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726779AbgDIXAe (ORCPT + 99 others); Thu, 9 Apr 2020 19:00:34 -0400 Received: from mail-qt1-f196.google.com ([209.85.160.196]:33698 "EHLO mail-qt1-f196.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726722AbgDIXAe (ORCPT ); Thu, 9 Apr 2020 19:00:34 -0400 Received: by mail-qt1-f196.google.com with SMTP id x2so263711qtr.0 for ; Thu, 09 Apr 2020 16:00:33 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=lca.pw; s=google; h=mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=9za1cgL5saDGUL4ncdk/vOmVKH89NTfUqWctXJV+wKg=; b=E55hQkfn3hQcBkJrMLs3sEnbbLpotAAeVJqDeDykwBSnbW0eLW6xgjXoIWc2skM5lr qGc8uv0RURI88S2jAw6upuDwEtQI09MviDMZR9fIR8NjL8PTWvnrUJOgvGIyLBYS2cR1 f+NHmTKB9MgRErhKbEdkCHgyJ0gBiVSS1SPM1Xbkrjes67HLpXSIa8Oo1w6rdrjEYUFX 0Z+NQhrLstrEmFjchg9kT+TcmmUewZIPmh8J79bIN7Pgv/8IH4ZCYBhoTi9o0alszVmm LhOgV+yhdrMnO01IN9AJVrQKMs9kQsMDh2WItUR55E3y99PWT7e1YM2WTSQCRP/UOuc7 4Iow== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=9za1cgL5saDGUL4ncdk/vOmVKH89NTfUqWctXJV+wKg=; b=mjV9denXTvavVWgCbtEOYRjdvJfCrUuImq/FbX+gzae+pSDj7TmTsZgjpiCWUs+lNb Kx4+MQCGuZiZu/9hu7tHhCrVWejKZUO5XpySdwolk2uKcAwYlse7uB3eECaZrug788kC prjVb0VVhn0fkBrusPUYVpoBc1TjXrP6Yh5MX93uXx+Jops6GAQDf8RuETN47L27blfq c1xYeBgMHHtIBrqzBJ/JvjueEF2U1AONVeJxT5cCN0Qr2UjE2Qa6fzemndFHJZA3fR+X mxe6ahtajLa28ofj8la6N6TG67gusRgSlS+VqRMsATNtTMcdErILxj7S3/YtPpV8nHr/ bW+w== X-Gm-Message-State: AGi0PuZyOhym8UCCVTL2edyPuOsqhg+eEy53bOMOkD0+IMMuIDdZEL40 QX+6Y/QZWQ8TKZ4fIIZJjPpFtA== X-Received: by 2002:ac8:7286:: with SMTP id v6mr1775597qto.299.1586473233327; Thu, 09 Apr 2020 16:00:33 -0700 (PDT) Received: from [192.168.1.153] (pool-71-184-117-43.bstnma.fios.verizon.net. [71.184.117.43]) by smtp.gmail.com with ESMTPSA id 69sm226385qki.131.2020.04.09.16.00.32 (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Thu, 09 Apr 2020 16:00:32 -0700 (PDT) Content-Type: text/plain; charset=utf-8 Mime-Version: 1.0 (Mac OS X Mail 13.4 \(3608.80.23.2.2\)) Subject: Re: KCSAN + KVM = host reset From: Qian Cai In-Reply-To: Date: Thu, 9 Apr 2020 19:00:31 -0400 Cc: Paolo Bonzini , "paul E. McKenney" , kasan-dev , LKML , kvm@vger.kernel.org Content-Transfer-Encoding: quoted-printable Message-Id: References: <017E692B-4791-46AD-B9ED-25B887ECB56B@lca.pw> <2730C0CC-B8B5-4A65-A4ED-9DFAAE158AA6@lca.pw> To: Marco Elver X-Mailer: Apple Mail (2.3608.80.23.2.2) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org > On Apr 9, 2020, at 5:28 PM, Qian Cai wrote: >=20 >=20 >=20 >> On Apr 9, 2020, at 12:03 PM, Marco Elver wrote: >>=20 >> On Thu, 9 Apr 2020 at 17:30, Qian Cai wrote: >>>=20 >>>=20 >>>=20 >>>> On Apr 9, 2020, at 11:22 AM, Marco Elver wrote: >>>>=20 >>>> On Thu, 9 Apr 2020 at 17:10, Qian Cai wrote: >>>>>=20 >>>>>=20 >>>>>=20 >>>>>> On Apr 9, 2020, at 3:03 AM, Marco Elver wrote: >>>>>>=20 >>>>>> On Wed, 8 Apr 2020 at 23:29, Qian Cai wrote: >>>>>>>=20 >>>>>>>=20 >>>>>>>=20 >>>>>>>> On Apr 8, 2020, at 5:25 PM, Paolo Bonzini = wrote: >>>>>>>>=20 >>>>>>>> On 08/04/20 22:59, Qian Cai wrote: >>>>>>>>> Running a simple thing on this AMD host would trigger a reset = right away. >>>>>>>>> Unselect KCSAN kconfig makes everything work fine (the host = would also >>>>>>>>> reset If only "echo off > /sys/kernel/debug/kcsan=E2=80=9D = before running qemu-kvm). >>>>>>>>=20 >>>>>>>> Is this a regression or something you've just started to play = with? (If >>>>>>>> anything, the assembly language conversion of the AMD world = switch that >>>>>>>> is in linux-next could have reduced the likelihood of such a = failure, >>>>>>>> not increased it). >>>>>>>=20 >>>>>>> I don=E2=80=99t remember I had tried this combination before, so = don=E2=80=99t know if it is a >>>>>>> regression or not. >>>>>>=20 >>>>>> What happens with KASAN? My guess is that, since it also happens = with >>>>>> "off", something that should not be instrumented is being >>>>>> instrumented. >>>>>=20 >>>>> No, KASAN + KVM works fine. >>>>>=20 >>>>>>=20 >>>>>> What happens if you put a 'KCSAN_SANITIZE :=3D n' into >>>>>> arch/x86/kvm/Makefile? Since it's hard for me to reproduce on = this >>>>>=20 >>>>> Yes, that works, but this below alone does not work, >>>>>=20 >>>>> KCSAN_SANITIZE_kvm-amd.o :=3D n >>>>=20 >>>> There are some other files as well, that you could try until you = hit >>>> the right one. >>>>=20 >>>> But since this is in arch, 'KCSAN_SANITIZE :=3D n' wouldn't be too = bad >>>> for now. If you can't narrow it down further, do you want to send a >>>> patch? >>>=20 >>> No, that would be pretty bad because it will disable KCSAN for Intel >>> KVM as well which is working perfectly fine right now. It is only = AMD >>> is broken. >>=20 >> Interesting. Unfortunately I don't have access to an AMD machine = right now. >>=20 >> Actually I think it should be: >>=20 >> KCSAN_SANITIZE_svm.o :=3D n >> KCSAN_SANITIZE_pmu_amd.o :=3D n >>=20 >> If you want to disable KCSAN for kvm-amd. >=20 > KCSAN_SANITIZE_svm.o :=3D n >=20 > That alone works fine. I am wondering which functions there could = trigger > perhaps some kind of recursing with KCSAN? Another data point is set CONFIG_KCSAN_INTERRUPT_WATCHER=3Dn alone also fixed the issue. I saw quite a few interrupt related function in = svm.c, so some interrupt-related recursion going on?=