Received: by 2002:a25:4158:0:0:0:0:0 with SMTP id o85csp3277769yba; Sun, 28 Apr 2019 22:22:27 -0700 (PDT) X-Google-Smtp-Source: APXvYqyl3Unuuiyy/50GXXBfZIVCbK/PlDO/RjAkbKBMDFj2OpdD1PLAgszUVjfcNHq5SCvPpLjF X-Received: by 2002:a63:1462:: with SMTP id 34mr12510337pgu.155.1556515347827; Sun, 28 Apr 2019 22:22:27 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1556515347; cv=none; d=google.com; s=arc-20160816; b=zldknFcEO25DmorirKVZTOmCJh9xzwFjG5e/elYn2s1PbU9A423xosjX+u7dt+o/4P Df4iDwG8JOaKz/4314cE5gGR5ihmQjmNfWyoFZwiuNYi7yo1h95+Shqk4SHpdqR0bvYc LLfYITx25iu7IEhRjIWyyTf3VvG49BwNNW+zQ62b65e9oNCxisQfM2dSHsyBXxX7r8/W kbbWuLi5/ZvRsoftaNWqV924cFI02Y2S9EK/mbRFKgK8OwZ0pqKgy47lSIdgbNZ2lqMY A6m6sRLzfagYFWCRwKauRCRu7jRtdqxw5UYQ2pyXnXrvuBYjwT3eTKPaypn1osIYO4zv KhuA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=QTI6XcWzDifUFN6sxdaZfIeCnC2pdItFPxlanxa9l5E=; b=uke/4BUe+5aANpbqwIjd+kk8pLEOqShCiElMiw+gRurYklWbwMHqS3LT6B0iax0jis zw4WZMwe+DwMf5W4MHteVOiGG+r6KbuSLg7HIThINh1jnonvD69XV0Rr1dUDDzmsOupX zkVvoRDqkvGy/voYNRrQxQJhHiFEoiEH9iItwjzn1h7E2SmGLtX3lvJB4Cdb58TQHThf x3L7Jhz6xBNhDW13myKXOgZWDgTYYr47iumRbo8ADJfRzM2y3+yhrz/X31DuXrn2lfHb GCSqxLVxdNjRGKZErWPFwX7uaBDiocc2ot00C4Sb/Qq59JcF3A0X1eNLiQ1P0syPrEb6 U4EQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 72si31950260plb.165.2019.04.28.22.22.12; Sun, 28 Apr 2019 22:22:27 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727289AbfD2FVU (ORCPT + 99 others); Mon, 29 Apr 2019 01:21:20 -0400 Received: from mga17.intel.com ([192.55.52.151]:10549 "EHLO mga17.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726585AbfD2FVU (ORCPT ); Mon, 29 Apr 2019 01:21:20 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from fmsmga005.fm.intel.com ([10.253.24.32]) by fmsmga107.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 28 Apr 2019 22:21:20 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.60,408,1549958400"; d="scan'208";a="341652909" Received: from xiaoyaol-mobl.ccr.corp.intel.com (HELO [10.239.13.123]) ([10.239.13.123]) by fmsmga005.fm.intel.com with ESMTP; 28 Apr 2019 22:21:18 -0700 Subject: Re: [PATCH v8 12/15] kvm/vmx: Emulate MSR TEST_CTL To: Thomas Gleixner , Xiaoyao Li , Paolo Bonzini Cc: Fenghua Yu , Ingo Molnar , Borislav Petkov , H Peter Anvin , Dave Hansen , Ashok Raj , Peter Zijlstra , Ravi V Shankar , Christopherson Sean J , Kalle Valo , Michael Chan , linux-kernel , x86 , kvm@vger.kernel.org, netdev@vger.kernel.org, linux-wireless@vger.kernel.org References: <1556134382-58814-1-git-send-email-fenghua.yu@intel.com> <1556134382-58814-13-git-send-email-fenghua.yu@intel.com> <7395908840acfbf806146f5f20d3509342771a19.camel@linux.intel.com> From: Xiaoyao Li Message-ID: <725e3442-949d-efe6-a60c-1ca3716428fb@linux.intel.com> Date: Mon, 29 Apr 2019 13:21:16 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.6.1 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, Thomas, Base on your comments, I plan to make the design as following: 1) When host enables this feature, there is no switch between host and guest that guest running with it enabled by force. Since #AC in exception bitmap is set in current kvm, every #AC in guest will be trapped. And in handle_exception() handler in kvm, if #AC is caused by alignment check, kvm injects #AC back to guest; if #AC is caused by split lock, kvm sends a SIGBUS to userspace. 2) When host disables this feature, there needs atomic switch between host and guest if different. And in handle_exception() handler in kvm, we can just inject #AC back to guest, and let guest to handle it. Besides, I think there might be an optimization for case #1. When host has it enabled and guest also has it enabled, I think it's OK to inject #AC back to guest, not directly kill the guest. Because guest kernel has it enabled means it knows what this feature is and it also want to get aware of and fault every split lock. At this point, if guest has it enabled, we can leave it to guest. Only when guest's configuration is having it disabled, can it be regards as potentially harmful that we kill the guest once there is a #AC due to split lock. How do you think about the design and this optimization? Hi, Paolo, What's your opinion about this design of split lock in KVM? Thanks. On 4/28/2019 3:09 PM, Thomas Gleixner wrote: > On Sat, 27 Apr 2019, Xiaoyao Li wrote: >> On Thu, 2019-04-25 at 09:42 +0200, Thomas Gleixner wrote: >>> But the way more interesting question is why are you exposing the MSR and >>> the bit to the guest at all if the host has split lock detection enabled? >>> >>> That does not make any sense as you basically allow the guest to switch it >>> off and then launch a slowdown attack. If the host has it enabled, then a >>> guest has to be treated like any other process and the #AC trap has to be >>> caught by the hypervisor which then kills the guest. >>> >>> Only if the host has split lock detection disabled, then you can expose it >>> and allow the guest to turn it on and handle it on its own. >> >> Indeed, if we use split lock detection for protection purpose, when host >> has it enabled we should directly pass it to guest and forbid guest from >> disabling it. And only when host disables split lock detection, we can >> expose it and allow the guest to turn it on. > ? >> If it is used for protection purpose, then it should follow what you said and >> this feature needs to be disabled by default. Because there are split lock >> issues in old/current kernels and BIOS. That will cause the existing guest >> booting failure and killed due to those split lock. > > Rightfully so. > >> If it is only used for debug purpose, I think it might be OK to enable this >> feature by default and make it indepedent between host and guest? > > No. It does not make sense. > >> So I think how to handle this feature between host and guest depends on how we >> use it? Once you give me a decision, I will follow it in next version. > > As I said: The host kernel makes the decision. > > If the host kernel has it enabled then the guest is not allowed to change > it. If the guest triggers an #AC it will be killed. > > If the host kernel has it disabled then the guest can enable it for it's > own purposes. > > Thanks, > > tglx >