From: Nitesh Narayan Lal
To: kvm@vger.kernel.org, linux-kernel@vger.kernel.org, pbonzini@redhat.com,
    lcapitulino@redhat.com, pagupta@redhat.com, wei.w.wang@intel.com,
    yang.zhang.wz@gmail.com, riel@surriel.com, david@redhat.com,
    mst@redhat.com, dodgen@google.com, konrad.wilk@oracle.com,
    dhildenb@redhat.com, aarcange@redhat.com
Subject: [RFC][Patch v8 1/7] KVM: Support for guest free page hinting
Date: Mon, 4 Feb 2019 15:18:48 -0500
Message-Id: <20190204201854.2328-2-nitesh@redhat.com>
In-Reply-To: <20190204201854.2328-1-nitesh@redhat.com>
References: <20190204201854.2328-1-nitesh@redhat.com>
X-Mailing-List: linux-kernel@vger.kernel.org

This patch includes the following:
1. Basic skeleton for the support
2. Enablement of x86 platform to use the same

Signed-off-by: Nitesh Narayan Lal
---
 arch/x86/Kbuild              |  2 +-
 arch/x86/kvm/Kconfig         |  8 ++++++++
 arch/x86/kvm/Makefile        |  2 ++
 include/linux/gfp.h          |  9 +++++++++
 include/linux/page_hinting.h | 17 +++++++++++++++++
 virt/kvm/page_hinting.c      | 36 ++++++++++++++++++++++++++++++++++++
 6 files changed, 73 insertions(+), 1 deletion(-)
 create mode 100644 include/linux/page_hinting.h
 create mode 100644 virt/kvm/page_hinting.c

diff --git a/arch/x86/Kbuild b/arch/x86/Kbuild
index c625f57472f7..3244df4ee311 100644
--- a/arch/x86/Kbuild
+++ b/arch/x86/Kbuild
@@ -2,7 +2,7 @@ obj-y += entry/

 obj-$(CONFIG_PERF_EVENTS) += events/

-obj-$(CONFIG_KVM) += kvm/
+obj-$(subst m,y,$(CONFIG_KVM)) += kvm/

 # Xen paravirtualization support
 obj-$(CONFIG_XEN) += xen/
diff --git a/arch/x86/kvm/Kconfig b/arch/x86/kvm/Kconfig
index 72fa955f4a15..2fae31459706 100644
--- a/arch/x86/kvm/Kconfig
+++ b/arch/x86/kvm/Kconfig
@@ -96,6 +96,14 @@ config KVM_MMU_AUDIT
 	 This option adds a R/W kVM module parameter 'mmu_audit', which allows
 	 auditing of KVM MMU events at runtime.

+# KVM_FREE_PAGE_HINTING will allow the guest to report the free pages to the
+# host in regular interval of time.
+config KVM_FREE_PAGE_HINTING
+	def_bool y
+	depends on KVM
+	select VIRTIO
+	select VIRTIO_BALLOON
+
 # OK, it's a little counter-intuitive to do this, but it puts it neatly under
 # the virtualization menu.
 source "drivers/vhost/Kconfig"
diff --git a/arch/x86/kvm/Makefile b/arch/x86/kvm/Makefile
index 69b3a7c30013..78640a80501e 100644
--- a/arch/x86/kvm/Makefile
+++ b/arch/x86/kvm/Makefile
@@ -16,6 +16,8 @@ kvm-y += x86.o mmu.o emulate.o i8259.o irq.o lapic.o \
 	i8254.o ioapic.o irq_comm.o cpuid.o pmu.o mtrr.o \
 	hyperv.o page_track.o debugfs.o

+obj-$(CONFIG_KVM_FREE_PAGE_HINTING) += $(KVM)/page_hinting.o
+
 kvm-intel-y += vmx/vmx.o vmx/vmenter.o vmx/pmu_intel.o vmx/vmcs12.o vmx/evmcs.o vmx/nested.o
 kvm-amd-y += svm.o pmu_amd.o

diff --git a/include/linux/gfp.h b/include/linux/gfp.h
index 5f5e25fd6149..e596527284ba 100644
--- a/include/linux/gfp.h
+++ b/include/linux/gfp.h
@@ -7,6 +7,7 @@
 #include
 #include
 #include
+#include

 struct vm_area_struct;

@@ -456,6 +457,14 @@ static inline struct zonelist *node_zonelist(int nid, gfp_t flags)
 	return NODE_DATA(nid)->node_zonelists + gfp_zonelist(flags);
 }

+#ifdef CONFIG_KVM_FREE_PAGE_HINTING
+#define HAVE_ARCH_FREE_PAGE
+static inline void arch_free_page(struct page *page, int order)
+{
+	guest_free_page(page, order);
+}
+#endif
+
 #ifndef HAVE_ARCH_FREE_PAGE
 static inline void arch_free_page(struct page *page, int order) { }
 #endif
diff --git a/include/linux/page_hinting.h b/include/linux/page_hinting.h
new file mode 100644
index 000000000000..b54f7428f348
--- /dev/null
+++ b/include/linux/page_hinting.h
@@ -0,0 +1,17 @@
+/*
+ * Size of the array which is used to store the freed pages is defined by
+ * MAX_FGPT_ENTRIES. If possible, we have to find a better way using which
+ * we can get rid of the hardcoded array size.
+ */
+#define MAX_FGPT_ENTRIES	1000
+/*
+ * hypervisor_pages - It is a dummy structure passed with the hypercall.
+ * @pfn: page frame number for the page which needs to be sent to the host.
+ * @order: order of the page needs to be reported to the host.
+ */
+struct hypervisor_pages {
+	unsigned long pfn;
+	unsigned int order;
+};
+
+void guest_free_page(struct page *page, int order);
diff --git a/virt/kvm/page_hinting.c b/virt/kvm/page_hinting.c
new file mode 100644
index 000000000000..818bd6b84e0c
--- /dev/null
+++ b/virt/kvm/page_hinting.c
@@ -0,0 +1,36 @@
+#include
+#include
+#include
+
+/*
+ * struct kvm_free_pages - Tracks the pages which are freed by the guest.
+ * @pfn: page frame number for the page which is freed.
+ * @order: order corresponding to the page freed.
+ * @zonenum: zone number to which the freed page belongs.
+ */
+struct kvm_free_pages {
+	unsigned long pfn;
+	unsigned int order;
+	int zonenum;
+};
+
+/*
+ * struct page_hinting - holds array objects for the structures used to track
+ * guest free pages, along with an index variable for each of them.
+ * @kvm_pt: array object for the structure kvm_free_pages.
+ * @kvm_pt_idx: index for kvm_free_pages object.
+ * @hypervisor_pagelist: array object for the structure hypervisor_pages.
+ * @hyp_idx: index for hypervisor_pages object.
+ */
+struct page_hinting {
+	struct kvm_free_pages kvm_pt[MAX_FGPT_ENTRIES];
+	int kvm_pt_idx;
+	struct hypervisor_pages hypervisor_pagelist[MAX_FGPT_ENTRIES];
+	int hyp_idx;
+};
+
+DEFINE_PER_CPU(struct page_hinting, hinting_obj);
+
+void guest_free_page(struct page *page, int order)
+{
+}
-- 
2.17.2
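
Illustration only, not part of this patch: guest_free_page() is left empty here, so the sketch below is a userspace model of how a freed page could be appended to the kvm_pt array until MAX_FGPT_ENTRIES is reached. It assumes a single instance instead of the per-CPU hinting_obj. The names page_hinting_model and model_record_free_page are hypothetical, and what happens when the array is full is an assumption (the call simply reports failure).

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MAX_FGPT_ENTRIES 1000

/* Same fields as struct kvm_free_pages in the patch. */
struct kvm_free_pages {
	unsigned long pfn;
	unsigned int order;
	int zonenum;
};

/*
 * Hypothetical single-instance stand-in for the per-CPU struct page_hinting;
 * only the kvm_pt half is modelled.
 */
struct page_hinting_model {
	struct kvm_free_pages kvm_pt[MAX_FGPT_ENTRIES];
	int kvm_pt_idx;
};

/*
 * Hypothetical helper: append one freed page to kvm_pt.
 * Returns false, leaving the array untouched, once it is full (assumed
 * behaviour; the patch does not define what happens in that case).
 */
static bool model_record_free_page(struct page_hinting_model *obj,
				   unsigned long pfn, unsigned int order,
				   int zonenum)
{
	assert(obj != NULL);

	if (obj->kvm_pt_idx >= MAX_FGPT_ENTRIES)
		return false;

	obj->kvm_pt[obj->kvm_pt_idx] =
		(struct kvm_free_pages){ .pfn = pfn, .order = order,
					 .zonenum = zonenum };
	obj->kvm_pt_idx++;
	return true;
}
```

A caller would zero-initialise one page_hinting_model and call model_record_free_page() once per freed page; a false return marks the point where the fixed-size array would have to be drained or replaced.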