Message-ID: <4C2E3331.3090405@codeaurora.org>
Date: Fri, 02 Jul 2010 11:42:57 -0700
From: Zach Pfeffer
To: Andi Kleen
Cc: Daniel Walker, Randy Dunlap, mel@csn.ul.ie, linux-mm@kvack.org,
    linux-kernel@vger.kernel.org, linux-arm-msm@vger.kernel.org,
    linux-omap@vger.kernel.org, linux-arm-kernel@lists.infradead.org
Subject: Re: [RFC 3/3] mm: iommu: The Virtual Contiguous Memory Manager
In-Reply-To: <20100702082225.GA12221@basil.fritz.box>

Andi Kleen wrote:
> On Thu, Jul 01, 2010 at 11:17:34PM -0700, Zach Pfeffer wrote:
>> Andi Kleen wrote:
>>>> The VCMM provides a more abstract, global view with finer-grained
>>>> control of each mapping a user wants to create. For instance, the
>>>> semantics of iommu_map preclude its use in setting up just the IOMMU
>>>> side of a mapping. With a one-sided map, two IOMMU devices can be
>>> Hmm? dma_map_* does not change any CPU mappings. It only sets up
>>> DMA mapping(s).
>> Sure, but I was saying that iommu_map() doesn't just set up the IOMMU
>> mappings; it sets up both the IOMMU and kernel buffer mappings.
>
> Normally the data is already in the kernel mappings, so why
> would you need another CPU mapping too? Sometimes the CPU
> code has to scatter-gather, but that is considered acceptable
> (and if it really cannot be rewritten to support sg it's better
> to have an explicit vmap operation)
>
> In general on larger systems with many CPUs changing CPU mappings
> also gets expensive (because you have to communicate with all cores),
> and is not a good idea on frequent IO paths.

That's all true, but a VCMM lets the user make these trade-offs for
future systems. It may not be too expensive to change the IO path
around on future chips, or the user may accept the performance
penalty. A VCMM doesn't enforce a policy on the user; it lets the user
set their own policy.
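To make that concrete, here is a rough sketch of what a one-sided,
multi-device flow could look like. The vcm_* names and signatures are
illustrative stand-ins only and are not claimed to match the exact API
proposed in these patches; the point is that the IOMMU side of the
mapping is built without touching the kernel's CPU mappings, and that
one VCM can serve more than one device:

#include <linux/device.h>
#include <linux/sizes.h>

/*
 * Illustrative sketch only: the vcm_* calls are hypothetical
 * stand-ins, not the exact interface proposed in this patch set.
 * Error handling is omitted for brevity.
 */
static int one_sided_map_example(struct device *dev_a,
				 struct device *dev_b)
{
	struct vcm *vcm;
	struct vcm_res *res;
	struct vcm_phys *phys;

	/* One device-virtual region, independent of any particular
	 * IOMMU's page-table format. */
	vcm = vcm_create(0, SZ_256M);

	/* The same region can be associated with several devices,
	 * even if their IOMMUs use different page-table formats. */
	vcm_assign_dev(vcm, dev_a);
	vcm_assign_dev(vcm, dev_b);

	/* Reserve device-virtual space; nothing is mapped yet and no
	 * CPU mapping is created. */
	res = vcm_reserve(vcm, SZ_1M, 0);

	/* Allocate physical memory, possibly as a mix of block sizes. */
	phys = vcm_phys_alloc(SZ_1M, 0);

	/* Back the reservation: only the IOMMU-side tables are written. */
	vcm_back(res, phys);

	return 0;
}

The contrast with dma_map_single() is that dma_map_single() starts
from a kernel virtual address, i.e. a buffer that already has a CPU
mapping, whereas in the sketch above the CPU side is never touched
unless the user explicitly asks for it.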
>>>> Additionally, the current IOMMU interface does not allow users to
>>>> associate one page table with multiple IOMMUs unless the user explicitly
>>> That assumes that all the IOMMUs on the system support the same page table
>>> format, right?
>> Actually no. Since the VCMM abstracts a page-table as a Virtual
>> Contiguous Region (VCM), a VCM can be associated with any device,
>> regardless of its individual page-table format.
>
> But then there is no real page table sharing, isn't it?
> The real information should be in the page tables, nowhere else.

Yeah, and the implementation ensures that. The VCMM just adds a few
fields like start_addr, len and the device. The device still manages
its own page tables.

>>> The standard Linux approach to such a problem is to write
>>> a library that drivers can use for common functionality, not put a middle
>>> layer in between. Libraries are much more flexible than layers.
>> That's true up to the "is this middle layer so useful that it's worth
>> it" point. The VM is a middle layer; you could make the same argument
>> about it: "the mapping code isn't too hard, just map in the memory
>> that you need and be done with it". But the VM middle layer provides a
>> clean separation between page frames and pages which turns out to be
>
> Actually we use both PFNs and struct page *s in many layers up
> and down, there's not really any layering in that.

Sure, but the PFNs and the struct page *s are the middle layer. It's
just that things haven't been layered on top of them. A VCMM is the
higher-level abstraction, since it allows the size of the physical
frames to vary and the consumers of the VCMs to be determined at
run time.

--
Sent by an employee of the Qualcomm Innovation Center, Inc.
The Qualcomm Innovation Center, Inc. is a member of the Code Aurora Forum.
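As an appendix to the point about PFNs and struct page being the
existing middle layer: the conversions below use the current kernel
helpers, while vcm_phys_part_sketch is a purely hypothetical sketch
(not an existing kernel structure) of the kind of variable-size
physical-frame descriptor a VCMM could layer on top of them.

#include <linux/mm.h>

/*
 * Existing middle layer: fixed-size physical frames, converted
 * between PFN and struct page with the standard helpers.
 */
static void existing_layer_example(struct page *page)
{
	unsigned long pfn = page_to_pfn(page);	/* frame number of the page */
	struct page *same = pfn_to_page(pfn);	/* and back again */

	(void)same;
}

/*
 * Hypothetical sketch only: a VCMM-style physical descriptor in which
 * the "frame" size may vary per entry, e.g. mixing 4 KB, 64 KB and
 * 1 MB blocks behind a single reservation.
 */
struct vcm_phys_part_sketch {
	phys_addr_t	start;	/* start of this physical block */
	size_t		size;	/* block size; need not be PAGE_SIZE */
};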