Received: by 2002:a25:ad19:0:0:0:0:0 with SMTP id y25csp10675431ybi; Thu, 25 Jul 2019 03:27:40 -0700 (PDT) X-Google-Smtp-Source: APXvYqx1Tx1lGBw76F5KiyuSjlxlKM4XVAS5fbni7rXzUMqoS+Rdz1jLXjtYKFOeBD20aT7SOfiC X-Received: by 2002:a62:4e58:: with SMTP id c85mr15944237pfb.176.1564050460440; Thu, 25 Jul 2019 03:27:40 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1564050460; cv=none; d=google.com; s=arc-20160816; b=vxAb9SFx/I9++4zP/3wk0Zq+bYVCpNW7Da4qIZQpOR1wC8gCh9qrDt46AcMvox/IFQ UELg6T0DZprUjf4d4dLan2OUIYbP3q8URo6zknq31Sn/9RhFFzH04qBK1IYJ2Jp+r96F iWrg6W6cxBRxh/3EI5ASnWQEJUZ2YL9QDo4ZUB3cDaXxu100hheMuy/qVJw4okKUMX20 C4DyJKGm1K2IWOOsE8Hfa+xXgb8fdbcI+vQiupOj9TJcthU4kle2RakpdJqKLKCOPalk GxnjHXxtuF6veMSpE3yWcRkyw/y/dFNGIE0xjG6efWuV+vZzdG5ow8vMvVLlxAFTAYgF KfJA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:date:cc:to:from:subject :message-id; bh=5EHb0DmPHlH6TD4NdiPQtwNlAcPsCzOoRyix/eQ2Fuw=; b=tCrJxuLihG3MQGL0m9F274jV3z5mmJ/xsR29n4TTZJisY7MdzH7var8SbxNfvuTLsG VNUkooAY1kRXNgyTPDyCMtJhuFCBuCC2ld+q2n4nT2JlT5ZUddYhCcfnGf3vs/MLMLce zrYkH9YVWVxOLLb9iB0Ud6hRKerVzjwkEDaPRGV2g105CxBRNqABI+q57e56FOzfAg0G 6wNRQ6XrxNupvK0GnW49qdth0HGGibMxBP6PBrAH2F7pIJI/L83rVuA/mk5qJ36uXtnQ R0Ci18vpxznmPhjjTIc/NyKHTUjs1lQlqkJIogcaizrPOm4CJCDqlwmaON8HfthdeTBv zCPA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id q8si16267325pfc.155.2019.07.25.03.27.21; Thu, 25 Jul 2019 03:27:40 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729312AbfGXW1i (ORCPT + 99 others); Wed, 24 Jul 2019 18:27:38 -0400 Received: from mga01.intel.com ([192.55.52.88]:52369 "EHLO mga01.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727620AbfGXW1h (ORCPT ); Wed, 24 Jul 2019 18:27:37 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga008.jf.intel.com ([10.7.209.65]) by fmsmga101.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 24 Jul 2019 15:27:37 -0700 X-IronPort-AV: E=Sophos;i="5.64,304,1559545200"; d="scan'208";a="163991221" Received: from ahduyck-desk1.jf.intel.com ([10.7.198.76]) by orsmga008-auth.jf.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 24 Jul 2019 15:27:37 -0700 Message-ID: <6bbead1f2d7b3aa77a8e78ffc6bbbb6d0d68c12e.camel@linux.intel.com> Subject: Re: [PATCH v2 QEMU] virtio-balloon: Provide a interface for "bubble hinting" From: Alexander Duyck To: "Michael S. Tsirkin" Cc: Alexander Duyck , nitesh@redhat.com, kvm@vger.kernel.org, david@redhat.com, dave.hansen@intel.com, linux-kernel@vger.kernel.org, linux-mm@kvack.org, akpm@linux-foundation.org, yang.zhang.wz@gmail.com, pagupta@redhat.com, riel@surriel.com, konrad.wilk@oracle.com, lcapitulino@redhat.com, wei.w.wang@intel.com, aarcange@redhat.com, pbonzini@redhat.com, dan.j.williams@intel.com Date: Wed, 24 Jul 2019 15:27:37 -0700 In-Reply-To: <20190724180552-mutt-send-email-mst@kernel.org> References: <20190724165158.6685.87228.stgit@localhost.localdomain> <20190724171050.7888.62199.stgit@localhost.localdomain> <20190724173403-mutt-send-email-mst@kernel.org> <20190724180552-mutt-send-email-mst@kernel.org> Content-Type: text/plain; charset="UTF-8" User-Agent: Evolution 3.30.5 (3.30.5-1.fc29) MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, 2019-07-24 at 18:08 -0400, Michael S. Tsirkin wrote: > On Wed, Jul 24, 2019 at 03:03:56PM -0700, Alexander Duyck wrote: > > On Wed, 2019-07-24 at 17:38 -0400, Michael S. Tsirkin wrote: > > > On Wed, Jul 24, 2019 at 10:12:10AM -0700, Alexander Duyck wrote: > > > > From: Alexander Duyck > > > > > > > > Add support for what I am referring to as "bubble hinting". Basically the > > > > idea is to function very similar to how the balloon works in that we > > > > basically end up madvising the page as not being used. However we don't > > > > really need to bother with any deflate type logic since the page will be > > > > faulted back into the guest when it is read or written to. > > > > > > > > This is meant to be a simplification of the existing balloon interface > > > > to use for providing hints to what memory needs to be freed. I am assuming > > > > this is safe to do as the deflate logic does not actually appear to do very > > > > much other than tracking what subpages have been released and which ones > > > > haven't. > > > > > > > > Signed-off-by: Alexander Duyck > > > > > > BTW I wonder about migration here. When we migrate we lose all hints > > > right? Well destination could be smarter, detect that page is full of > > > 0s and just map a zero page. Then we don't need a hint as such - but I > > > don't think it's done like that ATM. > > > > I was wondering about that a bit myself. If you migrate with a balloon > > active what currently happens with the pages in the balloon? Do you > > actually migrate them, or do you ignore them and just assume a zero page? > > Ignore and assume zero page. > > > I'm just reusing the ram_block_discard_range logic that was being used for > > the balloon inflation so I would assume the behavior would be the same. > > > > > I also wonder about interaction with deflate. ATM deflate will add > > > pages to the free list, then balloon will come right back and report > > > them as free. > > > > I don't know how likely it is that somebody who is getting the free page > > reporting is likely to want to also use the balloon to take up memory. > > Why not? The two functions are essentially doing the same thing. The only real difference is enforcement. If the balloon takes the pages the guest cannot get them back. I suppose there might be some advantage if you are wanting for force shrink a guest but that would be about it. > > However hinting on a page that came out of deflate might make sense when > > you consider that the balloon operates on 4K pages and the hints are on 2M > > pages. You are likely going to lose track of it all anyway as you have to > > work to merge the 4K pages up to the higher order page. > > Right - we need to fix inflate/deflate anyway. > When we do, we can do whatever :) One thing we could probably look at for the future would be to more closely merge the balloon and this reporting logic. Ideally the balloon would grab pages that were already hinted in order to enforce a certain size limit on the guest, and then when it gave the pages back they would retain their hinted status if possible. The only problem is that right now both of those require that hinting/reporting be active for the zone being accessed since we otherwise don't have pointers to the pages at the head of the "hinted" list.