Received: by 2002:ac0:946b:0:0:0:0:0 with SMTP id j40csp3036006imj; Mon, 11 Feb 2019 12:41:54 -0800 (PST) X-Google-Smtp-Source: AHgI3Ib0FhL1VDVnO2TeO7jMjlr4T4sLf67LFpwZd57TnEzdFYlsNGvyl5fQX7VlW4j7YoEJesWq X-Received: by 2002:a62:2fc3:: with SMTP id v186mr143494pfv.82.1549917714678; Mon, 11 Feb 2019 12:41:54 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1549917714; cv=none; d=google.com; s=arc-20160816; b=KnUtcPnIwsnIkQacRoNyJxx2ggHMFxraLzkhCtch4rIOg/5TnLZTNI8I/6ubAHfKu6 OTT6hCCcqcaLkZVPy/C3f6oinM1FSdZTb+BtHge+XQE52I8AipXSLPvryq4+3ZczkgaM 64bW549lgIu24QT37/SYv5FwsJndLFLwUxO1T82s5UnNP7oTMaEOoh/heM3UVTObBh9Z 4qkQxOCApY4xbpITCnVzUsE/TMfWnC8UVvzQVx6hI9e6+3WPgRhRPhEiNdvfFBzxhbVY NshYHpMggfmVqJq9LSnivY+C7Pll1Poehq7J5pRza4HTb9emcGL3/D9HchfoY16ZKLYV fjRQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date; bh=zE1Ol7a3GppKMtEDRVuBxnEUwh6Dn7Ves8xXxbBZo0Q=; b=TlU6MSNtjqPTaiKwi4JVFENkZrvWODRPdYD3HvsjIuS0ZmsiUOg5QW0RBaWyO+NZ/X b4MTze4GkLZrJg/bGTejk42KK2PzdkPZWICc5jn+3wC2sYnWSm8lhnFpcZA0kbWBxbjX metu/ZW98AUOzPtf6/7HQCQuVzPWU3HXNYcpN29K0e9nbd71g/sGmbqS6YKKSxHmT5M0 7p4EGNibR17z7An0AuEnOLWDVeb5SjuSRlXCJ1/pRp9o/BB09G7ST8XMGWGbpLyHz6jP a2zyLo+1U3aZqv3HhQK7gSw37aIFrTLBYkqYGdScqUOKhBFiiJ4Gi9/NqfKTXOeME1wg i2ag== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id n17si9800693pgv.485.2019.02.11.12.41.38; Mon, 11 Feb 2019 12:41:54 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730759AbfBKTZC (ORCPT + 99 others); Mon, 11 Feb 2019 14:25:02 -0500 Received: from mx1.redhat.com ([209.132.183.28]:64056 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1730255AbfBKTZC (ORCPT ); Mon, 11 Feb 2019 14:25:02 -0500 Received: from smtp.corp.redhat.com (int-mx03.intmail.prod.int.phx2.redhat.com [10.5.11.13]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id EFDC57E9E1; Mon, 11 Feb 2019 19:25:00 +0000 (UTC) Received: from redhat.com (ovpn-120-40.rdu2.redhat.com [10.10.120.40]) by smtp.corp.redhat.com (Postfix) with SMTP id F3C31611D1; Mon, 11 Feb 2019 19:24:58 +0000 (UTC) Date: Mon, 11 Feb 2019 14:24:58 -0500 From: "Michael S. Tsirkin" To: Alexander Duyck Cc: Dave Hansen , Alexander Duyck , linux-mm@kvack.org, linux-kernel@vger.kernel.org, kvm@vger.kernel.org, rkrcmar@redhat.com, x86@kernel.org, mingo@redhat.com, bp@alien8.de, hpa@zytor.com, pbonzini@redhat.com, tglx@linutronix.de, akpm@linux-foundation.org Subject: Re: [RFC PATCH 2/4] kvm: Add host side support for free memory hints Message-ID: <20190211141811-mutt-send-email-mst@kernel.org> References: <20190204181118.12095.38300.stgit@localhost.localdomain> <20190204181546.12095.81356.stgit@localhost.localdomain> <20190209194108-mutt-send-email-mst@kernel.org> <39c915a7-e317-db01-0286-579230f37da2@intel.com> <20190211124203-mutt-send-email-mst@kernel.org> <58e57acd628f2d6535fc45a028af50855158fda6.camel@linux.intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <58e57acd628f2d6535fc45a028af50855158fda6.camel@linux.intel.com> X-Scanned-By: MIMEDefang 2.79 on 10.5.11.13 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.26]); Mon, 11 Feb 2019 19:25:01 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Feb 11, 2019 at 10:30:10AM -0800, Alexander Duyck wrote: > On Mon, 2019-02-11 at 12:48 -0500, Michael S. Tsirkin wrote: > > On Mon, Feb 11, 2019 at 09:41:19AM -0800, Dave Hansen wrote: > > > On 2/9/19 4:44 PM, Michael S. Tsirkin wrote: > > > > So the policy should not leak into host/guest interface. > > > > Instead it is better to just keep the pages pinned and > > > > ignore the hint for now. > > > > > > It does seems a bit silly to have guests forever hinting about freed > > > memory when the host never has a hope of doing anything about it. > > > > > > Is that part fixable? > > > > > > Yes just not with existing IOMMU APIs. > > > > It's in the paragraph just above that you cut out: > > Yes right now assignment is not smart enough but generally > > you can protect the unused page in the IOMMU and that's it, > > it's safe. > > > > So e.g. > > extern int iommu_remap(struct iommu_domain *domain, unsigned long iova, > > phys_addr_t paddr, size_t size, int prot); > > > > > > I can elaborate if you like but generally we would need an API that > > allows you to atomically update a mapping for a specific page without > > perturbing the mapping for other pages. > > > > I still don't see how this would solve anything unless you have the > guest somehow hinting on what pages it is providing to the devices. > > You would have to have the host invalidating the pages when the hint is > provided, and have a new hint tied to arch_alloc_page that would > rebuild the IOMMU mapping when a page is allocated. > > I'm pretty certain that the added cost of that would make the hinting > pretty pointless as my experience has been that the IOMMU is too much > of a bottleneck to have multiple CPUs trying to create and invalidate > mappings simultaneously. I agree it's a concern. Another option would involve passing these hints in the DMA API. How about the option of removing the device by hotplug when host needs overcommit? That would involve either buffering on host, or requesting free pages after device is removed along the lines of existing balloon code. That btw seems to be an argument for making this hinting part of balloon. -- MST