Date: Fri, 16 Aug 2019 12:54:45 -0400
From: Jerome Glisse
To: Jan Kara
Cc: Vlastimil Babka, John Hubbard, Ira Weiny, Andrew Morton,
 Christoph Hellwig, Dan Williams, Dave Chinner, Jason Gunthorpe, LKML,
 linux-mm@kvack.org, linux-fsdevel@vger.kernel.org,
 linux-rdma@vger.kernel.org
Subject: Re: [RFC PATCH 2/2] mm/gup: introduce vaddr_pin_pages_remote()
Message-ID: <20190816165445.GD3149@redhat.com>
References: <90e5cd11-fb34-6913-351b-a5cc6e24d85d@nvidia.com>
 <20190814234959.GA463@iweiny-DESK2.sc.intel.com>
 <2cbdf599-2226-99ae-b4d5-8909a0a1eadf@nvidia.com>
 <20190815132622.GG14313@quack2.suse.cz>
 <20190815133510.GA21302@quack2.suse.cz>
 <0d6797d8-1e04-1ebe-80a7-3d6895fe71b0@suse.cz>
 <20190816154404.GF3041@quack2.suse.cz>
 <20190816155220.GC3149@redhat.com>
 <20190816161355.GL3041@quack2.suse.cz>
In-Reply-To: <20190816161355.GL3041@quack2.suse.cz>
X-Mailing-List: linux-kernel@vger.kernel.org

On Fri, Aug 16, 2019 at 06:13:55PM +0200, Jan Kara wrote:
> On Fri 16-08-19 11:52:20, Jerome Glisse wrote:
> > On Fri, Aug 16, 2019 at 05:44:04PM +0200, Jan Kara wrote:
> > > On Fri 16-08-19 10:47:21, Vlastimil Babka wrote:
> > > > On 8/15/19 3:35 PM, Jan Kara wrote:
> > > > >>
> > > > >> So when the GUP user uses MMU notifiers to stop writing to pages whenever
> > > > >> they are writeprotected with page_mkclean(), they don't really need page
> > > > >> pin - their access is then fully equivalent to any other mmap userspace
> > > > >> access and filesystem knows how to deal with those. I forgot about this case
> > > > >> when I wrote the above sentence.
> > > > >>
> > > > >> So to sum up there are three cases:
> > > > >> 1) DIO case - GUP references to pages serving as DIO buffers are needed for
> > > > >>    relatively short time, no special synchronization with page_mkclean() or
> > > > >>    munmap() => needs FOLL_PIN
> > > > >> 2) RDMA case - GUP references to pages serving as DMA buffers needed for a
> > > > >>    long time, no special synchronization with page_mkclean() or munmap()
> > > > >>    => needs FOLL_PIN | FOLL_LONGTERM
> > > > >>    This case has also a special case when the pages are actually DAX.
> > > > >>    Then the caller additionally needs file lease and additional file_pin
> > > > >>    structure is used for tracking this usage.
> > > > >> 3) ODP case - GUP references to pages serving as DMA buffers, MMU notifiers
> > > > >>    used to synchronize with page_mkclean() and munmap() => normal page
> > > > >>    references are fine.
> > > >
> > > > IMHO the munlock lesson told us about another one, that's in the end equivalent
> > > > to 3)
> > > >
> > > > 4) pinning for struct page manipulation only => normal page references
> > > > are fine
> > >
> > > Right, it's good to have this for clarity.
> > >
> > > > > I want to add that I'd like to convert users in cases 1) and 2) from using
> > > > > GUP to using differently named function. Users in case 3) can stay as they
> > > > > are for now although ultimately I'd like to denote such use cases in a
> > > > > special way as well...
> > > >
> > > > So after 1/2/3 is renamed/specially denoted, only 4) keeps the current
> > > > interface?
> > >
> > > Well, munlock() code doesn't even use GUP, just follow_page(). I'd wait to
> > > see what's left after handling cases 1), 2), and 3) to decide about the
> > > interface for the remainder.
> > >
> >
> > For 3 we do not need to take a reference at all :) So just forget about 3
> > it does not exist. For 3 the reference is the reference the CPU page table
> > has on the page and that's it. GUP is no longer involved in ODP or anything
> > like that.
>
> Yes, I understand. But the fact is that GUP calls are currently still there
> e.g. in ODP code. If you can make the code work without taking a page
> reference at all, I'm only happy :)

Already in rdma next AFAIK, so in 5.4 it will be gone :) I have been removing
all GUP users that do not need a reference. The Intel i915 driver is a
leftover; I will work some more with them to get rid of it too.

Cheers,
Jérôme