In-Reply-To: <20201020061859.18385-1-kirill.shutemov@linux.intel.com>
From: Andy Lutomirski
Date: Wed, 21 Oct 2020 11:20:56 -0700
Subject: Re: [RFCv2 00/16] KVM protected memory extension
To: "Kirill A. Shutemov"
Cc: Dave Hansen, Andy Lutomirski, Peter Zijlstra, Paolo Bonzini,
    Sean Christopherson, Vitaly Kuznetsov, Wanpeng Li, Jim Mattson,
    Joerg Roedel, David Rientjes, Andrea Arcangeli, Kees Cook, Will Drewry,
    "Edgecombe, Rick P", "Kleen, Andi", Liran Alon, Mike Rapoport, X86 ML,
    kvm list, Linux-MM, LKML, "Kirill A. Shutemov"
X-Mailing-List: linux-kernel@vger.kernel.org

> On Oct 19, 2020, at 11:19 PM, Kirill A. Shutemov wrote:
> For removing the userspace mapping, use a trick similar to what NUMA
> balancing does: convert memory that belongs to KVM memory slots to
> PROT_NONE: all existing entries converted to PROT_NONE with mprotect() and
> the newly faulted in pages get PROT_NONE from the updated vm_page_prot.
> The new VMA flag -- VM_KVM_PROTECTED -- indicates that the pages in the
> VMA must be treated in a special way in the GUP and fault paths.
> The flag allows GUP to return the page even though it is mapped with
> PROT_NONE, but only if the new GUP flag -- FOLL_KVM -- is specified. Any
> userspace access to the memory would result in SIGBUS. Any GUP access
> without FOLL_KVM would result in -EFAULT.
>

I definitely like the direction this patchset is going in, and I think
that allowing KVM guests to have memory that is inaccessible to QEMU
is a great idea.

I do wonder, though: do we really want to do this with these PROT_NONE
tricks, or should we actually come up with a way to have KVM guests map
memory that isn't mapped into QEMU's mm_struct at all?  As an example
of the latter, I mean something a bit like this:

https://lkml.kernel.org/r/CALCETrUSUp_7svg8EHNTk3nQ0x9sdzMCU=h8G-Sy6=SODq5GHg@mail.gmail.com

I don't mean to say that this is a requirement of any kind of
protected memory like this, but I do think we should understand the
tradeoffs, in terms of what a full implementation looks like, the
effort and time frames involved, and the maintenance burden of
supporting whatever gets merged going forward.
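
For reference, the access rules in the quoted description can be modeled
as a small truth table. This is only a userspace sketch, not code from
the patchset: the TOY_* names, their bit values, and the function names
are made up for illustration, and the treatment of ordinary VMAs is an
assumption. Only the outcomes for protected memory (SIGBUS for userspace
access, -EFAULT for GUP without FOLL_KVM) come from the quoted text.

```c
#include <assert.h>
#include <errno.h>
#include <signal.h>

/*
 * Hypothetical stand-ins for VM_KVM_PROTECTED and FOLL_KVM.
 * The bit values are arbitrary and are not the kernel's.
 */
#define TOY_VM_KVM_PROTECTED	(1u << 0)
#define TOY_FOLL_KVM		(1u << 0)

/* GUP on a VMA: 0 means the page is returned, -EFAULT means refused. */
static int toy_gup(unsigned int vma_flags, unsigned int gup_flags)
{
	if (!(vma_flags & TOY_VM_KVM_PROTECTED))
		return 0;	/* ordinary VMA: out of scope, assumed to succeed */

	/* Protected VMA: only callers passing the KVM flag get the page. */
	return (gup_flags & TOY_FOLL_KVM) ? 0 : -EFAULT;
}

/* Userspace touching the memory: the signal delivered, or 0 for none. */
static int toy_user_access(unsigned int vma_flags)
{
	return (vma_flags & TOY_VM_KVM_PROTECTED) ? SIGBUS : 0;
}

/* Walks the cases spelled out in the quoted description. */
static void toy_self_check(void)
{
	assert(toy_gup(TOY_VM_KVM_PROTECTED, TOY_FOLL_KVM) == 0);
	assert(toy_gup(TOY_VM_KVM_PROTECTED, 0) == -EFAULT);
	assert(toy_user_access(TOY_VM_KVM_PROTECTED) == SIGBUS);
	assert(toy_user_access(0) == 0);
}
```

The model leaves out everything about how the PROT_NONE entries are
installed and faulted in. It only captures who is allowed to reach the
page once the VMA is marked protected.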