Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp7239585imu; Thu, 31 Jan 2019 07:12:29 -0800 (PST) X-Google-Smtp-Source: ALg8bN4vZZWGrKuSWtkzajp/1DWHKeH5GnyagvHgcAJHw6rfK/AWHTm5bnnsX7o56Yj+y4BgdmTO X-Received: by 2002:a17:902:4a0c:: with SMTP id w12mr35504680pld.8.1548947549878; Thu, 31 Jan 2019 07:12:29 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1548947549; cv=none; d=google.com; s=arc-20160816; b=eSsHB10LUbRMnlcqliKoFLvuwKyu+L3ZasCZFron7mGSUwyQeh5tIItgxg4zyqilZF KWJlAVia6UfZ/7kzUMxGzsUMdhK3pZ0DuXjmDmV4EjlupwyPyi0BXzdHBCU+wAhbCmDc yGZpm1i6XqOYw4cKmtnKZSZxHMy/4P6qcaqDD50YJoxtn0WKbBdS/LSrjedf6LVLsf+g iBgYeFUDqPOCiPR7zUResUbwq15ooJXm8QwsS7IDVObAoaPEB1BmJP0EOfbTUdCxtRzM WuiSuN6gwu0qLW9y4poa9Aik1Nj7rql0f+wdM4Ro9v3R2PW3dYnLtp1sU/NVWY6m4SHP oeKw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-transfer-encoding:content-disposition:mime-version :references:message-id:subject:cc:to:from:date; bh=v1MB0ckzpQvR/8C4mV91Di5lPryfud4kqcp7DSn5uAE=; b=uK2rIueG+LqlZouDCe844SGtvLqLgJyz6O5fU1VAx1Xblp1XwTNTpj0Bfn/AkVtoIX 2No7RKwca6+fCkac2nomI+LpF2EtPjcT4d4rrV5/xONruEaeHv5z2HiQsHYCmjBEPZWE DrBeKhNCjpjLlRvpuSIfSia8hKcS9bZhyTZONdB55HBff4FrsJ49LStQ0rgIVwbWVjvf PcI99tZ0y49V1ikcAtQ7gwgrjNM0wCz8db537vsGY3/ucPNb+xNhjcXsOiTmB/oxhUI4 m8e5PtbjLQougTef3f8Bv8MqR9m6ijIOPyAW2SQSVXtYTdL0g7rd4rfX2mDP5tY0H9jr Mcuw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id k91si4687376pld.283.2019.01.31.07.12.11; Thu, 31 Jan 2019 07:12:29 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727465AbfAaPLu (ORCPT + 99 others); Thu, 31 Jan 2019 10:11:50 -0500 Received: from mx1.redhat.com ([209.132.183.28]:5855 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725855AbfAaPLu (ORCPT ); Thu, 31 Jan 2019 10:11:50 -0500 Received: from smtp.corp.redhat.com (int-mx01.intmail.prod.int.phx2.redhat.com [10.5.11.11]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id B4551A08F0; Thu, 31 Jan 2019 15:11:49 +0000 (UTC) Received: from redhat.com (unknown [10.20.6.236]) by smtp.corp.redhat.com (Postfix) with ESMTPS id 38D0660152; Thu, 31 Jan 2019 15:11:47 +0000 (UTC) Date: Thu, 31 Jan 2019 10:11:45 -0500 From: Jerome Glisse To: Christoph Hellwig Cc: Jason Gunthorpe , Logan Gunthorpe , "linux-mm@kvack.org" , "linux-kernel@vger.kernel.org" , Greg Kroah-Hartman , "Rafael J . Wysocki" , Bjorn Helgaas , Christian Koenig , Felix Kuehling , "linux-pci@vger.kernel.org" , "dri-devel@lists.freedesktop.org" , Marek Szyprowski , Robin Murphy , Joerg Roedel , "iommu@lists.linux-foundation.org" Subject: Re: [RFC PATCH 3/5] mm/vma: add support for peer to peer to device vma Message-ID: <20190131151145.GC4619@redhat.com> References: <20190129193250.GK10108@mellanox.com> <99c228c6-ef96-7594-cb43-78931966c75d@deltatee.com> <20190129205827.GM10108@mellanox.com> <20190130080208.GC29665@lst.de> <20190130174424.GA17080@mellanox.com> <20190130191946.GD17080@mellanox.com> <3793c115-2451-1479-29a9-04bed2831e4b@deltatee.com> <20190130204414.GH17080@mellanox.com> <20190131080501.GB26495@lst.de> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20190131080501.GB26495@lst.de> User-Agent: Mutt/1.10.0 (2018-05-17) X-Scanned-By: MIMEDefang 2.79 on 10.5.11.11 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.39]); Thu, 31 Jan 2019 15:11:50 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Jan 31, 2019 at 09:05:01AM +0100, Christoph Hellwig wrote: > On Wed, Jan 30, 2019 at 08:44:20PM +0000, Jason Gunthorpe wrote: > > Not really, for MRs most drivers care about DMA addresses only. The > > only reason struct page ever gets involved is because it is part of > > the GUP, SGL and dma_map family of APIs. > > And the only way you get the DMA address is through the dma mapping > APIs. Which except for the little oddball dma_map_resource expect > a struct page in some form. And dma_map_resource isn't really up > to speed for full blown P2P. > > Now we could and maybe eventually should change all this. But that > is a pre-requisitive for doing anything more fancy, and not something > to be hacked around. > > > O_DIRECT seems to be the justification for struct page, but nobody is > > signing up to make O_DIRECT have the required special GUP/SGL/P2P flow > > that would be needed to *actually* make that work - so it really isn't > > a justification today. > > O_DIRECT is just the messenger. Anything using GUP will need a struct > page, which is all our interfaces that do I/O directly to user pages. I do not want to allow GUP to pin I/O space this would open a pandora box that we do not want to open at all. Many driver manage their IO space and if they get random pinning because some other kernel bits they never heard of starts to do GUP on their stuff it is gonna cause havoc. So far mmap of device file have always been special and it has been reflected to userspace in all the instance i know of (media and GPU). Pretending we can handle them like any other vma is a lie because they were never designed that way in the first place and it would be disruptive to all those driver. Minimum disruption with minimun changes is what we should aim for and is what i am trying to do with this patchset. Using struct page and allowing GUP would mean rewritting huge chunk of GPU drivers (pretty much rewritting their whole memory management) with no benefit at the end. When something is special it is better to leave it that way. Cheers, J?r?me