Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp6696024imm; Tue, 24 Jul 2018 01:10:08 -0700 (PDT) X-Google-Smtp-Source: AAOMgpesxdnfqnD2qm+qLy/na5Rn/lja5qqWQZF2Ah1tqquZTmQwF5V5deKfjdVAtlStge0BsOoR X-Received: by 2002:a62:d842:: with SMTP id e63-v6mr16876119pfg.88.1532419808277; Tue, 24 Jul 2018 01:10:08 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1532419808; cv=none; d=google.com; s=arc-20160816; b=CcUI0CswxXvCV2oRtZLe6zLEAl3jGvt+KplHEPnJ0vjpeyy1Sd75dExPCXJr9dlcQh 7lV/OLQ30/Ndaqk9M7TWVzbRjYMhkciElcNluKKhLPF3siGAr+mqz2T/z7NfZw8cNmss ZxDsE9BR2ipDcsUtdCsnvbLcpaBj87YAngUPrGn7kzUQ6xObPsCueaewwzNoEXC5WCjY mx/6iLFbx6u58wyLAi8y8i2V49aXLXbz8fdQQCTVcrYLW+JFzM9IvNbZXi+mggiesavp jLYAHinJYIpd4GIsTzGErqq43dIUQUgNpuElVZloAlS8jQMYrtAIeacDcoKOPqYPanez d4NQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:in-reply-to :references:subject:cc:to:mime-version:user-agent:from:date :message-id:arc-authentication-results; bh=DkIyA/dMop+X9sg3Ik4YC9I/iOYt+LU7R/kdjzi/lhA=; b=MP4HRuY2wLkAJMMwFmR7Fpj8GGC06TGFRtY8Ka9wH2ER9Vpr2GlbfFtXefRWPeWxzY j8pqL6E1PBJ10+fFUPRX9p+e19T3EZf21tLJT5CIMj+HNujm38/qacDSPmmihvBGq7zd K7TahOPlUbIX6S4x37/N1IG37p41pWBrPCF8bMb3nSymHjTBOZm6lJb2QiSuTtgjkVOn pnuQglMZTmynTSr7EjXOxxCfeMsscgCyWDilD78Sn2iHqb6TtTkH0D9UFi8/FzolW2KR 2MM30U/+n7PdUPp5q3TaGG3x9k2arfGVqsPL9MtQH3zWSPy3c4Vo9mNTMJrCKDesIGVF b4EQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id o8-v6si2645821pll.193.2018.07.24.01.09.53; Tue, 24 Jul 2018 01:10:08 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2388613AbeGXJOL (ORCPT + 99 others); Tue, 24 Jul 2018 05:14:11 -0400 Received: from mga02.intel.com ([134.134.136.20]:1798 "EHLO mga02.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2388515AbeGXJOL (ORCPT ); Tue, 24 Jul 2018 05:14:11 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga008.jf.intel.com ([10.7.209.65]) by orsmga101.jf.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 24 Jul 2018 01:08:56 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.51,397,1526367600"; d="scan'208";a="59325753" Received: from unknown (HELO [10.239.13.97]) ([10.239.13.97]) by orsmga008.jf.intel.com with ESMTP; 24 Jul 2018 01:08:32 -0700 Message-ID: <5B56DF81.4030606@intel.com> Date: Tue, 24 Jul 2018 16:12:49 +0800 From: Wei Wang User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.7.0 MIME-Version: 1.0 To: "Dr. David Alan Gilbert" , "Michael S. Tsirkin" CC: virtio-dev@lists.oasis-open.org, linux-kernel@vger.kernel.org, virtualization@lists.linux-foundation.org, kvm@vger.kernel.org, linux-mm@kvack.org, mhocko@kernel.org, akpm@linux-foundation.org, torvalds@linux-foundation.org, pbonzini@redhat.com, liliang.opensource@gmail.com, yang.zhang.wz@gmail.com, quan.xu0@gmail.com, nilal@redhat.com, riel@redhat.com, peterx@redhat.com Subject: Re: [PATCH v36 0/5] Virtio-balloon: support free page reporting References: <1532075585-39067-1-git-send-email-wei.w.wang@intel.com> <20180723122342-mutt-send-email-mst@kernel.org> <20180723143604.GB2457@work-vm> In-Reply-To: <20180723143604.GB2457@work-vm> Content-Type: text/plain; charset=windows-1252; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 07/23/2018 10:36 PM, Dr. David Alan Gilbert wrote: > * Michael S. Tsirkin (mst@redhat.com) wrote: >> On Fri, Jul 20, 2018 at 04:33:00PM +0800, Wei Wang wrote: >>> This patch series is separated from the previous "Virtio-balloon >>> Enhancement" series. The new feature, VIRTIO_BALLOON_F_FREE_PAGE_HINT, >>> implemented by this series enables the virtio-balloon driver to report >>> hints of guest free pages to the host. It can be used to accelerate live >>> migration of VMs. Here is an introduction of this usage: >>> >>> Live migration needs to transfer the VM's memory from the source machine >>> to the destination round by round. For the 1st round, all the VM's memory >>> is transferred. From the 2nd round, only the pieces of memory that were >>> written by the guest (after the 1st round) are transferred. One method >>> that is popularly used by the hypervisor to track which part of memory is >>> written is to write-protect all the guest memory. >>> >>> This feature enables the optimization by skipping the transfer of guest >>> free pages during VM live migration. It is not concerned that the memory >>> pages are used after they are given to the hypervisor as a hint of the >>> free pages, because they will be tracked by the hypervisor and transferred >>> in the subsequent round if they are used and written. >>> >>> * Tests >>> - Test Environment >>> Host: Intel(R) Xeon(R) CPU E5-2699 v4 @ 2.20GHz >>> Guest: 8G RAM, 4 vCPU >>> Migration setup: migrate_set_speed 100G, migrate_set_downtime 2 second >>> >>> - Test Results >>> - Idle Guest Live Migration Time (results are averaged over 10 runs): >>> - Optimization v.s. Legacy = 409ms vs 1757ms --> ~77% reduction >>> (setting page poisoning zero and enabling ksm don't affect the >>> comparison result) >>> - Guest with Linux Compilation Workload (make bzImage -j4): >>> - Live Migration Time (average) >>> Optimization v.s. Legacy = 1407ms v.s. 2528ms --> ~44% reduction >>> - Linux Compilation Time >>> Optimization v.s. Legacy = 5min4s v.s. 5min12s >>> --> no obvious difference >> I'd like to see dgilbert's take on whether this kind of gain >> justifies adding a PV interfaces, and what kind of guest workload >> is appropriate. >> >> Cc'd. > Well, 44% is great ... although the measurement is a bit weird. > > a) A 2 second downtime is very large; 300-500ms is more normal No problem, I will set downtime to 400ms for the tests. > b) I'm not sure what the 'average' is - is that just between a bunch of > repeated migrations? Yes, just repeatedly ("source<---->destination" migration) do the tests and get an averaged result. > c) What load was running in the guest during the live migration? The first one above just uses a guest without running any specific workload (named idle guests). The second one uses a guest with the Linux compilation workload running. > > An interesting measurement to add would be to do the same test but > with a VM with a lot more RAM but the same load; you'd hope the gain > would be even better. > It would be interesting, especially because the users who are interested > are people creating VMs allocated with lots of extra memory (for the > worst case) but most of the time migrating when it's fairly idle. OK. I will add tests of a guest with larger memory. Best, Wei