Received: by 2002:a25:8b12:0:0:0:0:0 with SMTP id i18csp1124287ybl; Fri, 16 Aug 2019 09:15:31 -0700 (PDT) X-Google-Smtp-Source: APXvYqxi1qbPDudqh6o2nn8Z00caQwpgxVo0MQ/7af1ydjRQn1qlDzcSAp2s0+K6oLrlyvgNysC/ X-Received: by 2002:a63:2744:: with SMTP id n65mr8378970pgn.277.1565972131032; Fri, 16 Aug 2019 09:15:31 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1565972131; cv=none; d=google.com; s=arc-20160816; b=GavITF9DSNG+aPm5xNIUb7YhKj2rtauTbu/DiSryK/LJfxutYDzuVpxKwrAHZwIF3M S+80p6KI9whnLs7/QZsl0U0LB6/QNkQqTADDtKouTLOietKdGwjRLybggD9Ja1Saj+bP BCxhEPiLC9xRBR3dFtNScg4YrBZHd/Ea+M18yoAvgFZnprAUMIHX7pYWTmh9c4NhIPF0 IH7DcCMdfWaHnGN2PwRBdkeBoeEGSed2ihCseg6y+DNsRyMfmeJKIo8jQoX4Ra1ZMZSg bS8SSLZITs/2vcWvpmRYqEi5Au1Oa5nK28zGoOilmQIZh/AxMRzVRm/B4yc6suFKOgNr BdzQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=6C2i5DfqwG9iCEze2u6c3qjRG6Gm0aVZKjRIiuuVQE0=; b=SjR+cU7Po/5qgzeeZs5/Ie/DN/77+BoPbtAcd+PG1yIf3IIKuX71HrD9CeENjSkG0Q 2UEWJsRoMFwtrbi05b3E8cc4nkc6ghcyzENVPs5icIMg1G24eeHfWeiVo7zXCUWm2Gi6 fGGgYfrW7fitVu5Yq0076nWrYLNyxaBvXlCpgdDkR6SMpJ7tv7asYmHDmtykfYau7R86 tC5W7Rd0AfqqkMaFRkFqN1grx1xreo1x1CvUdMjgiq6DoD6ozcsCU1qeqlr0nHPd7iKs 1tYYG18MFycG11RtoGassn9YOst69/Z5Z80xseG6Z06wXw7qELxxXpGhnp1IgwIxCsfn 0YTg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id j12si4239694plk.272.2019.08.16.09.15.15; Fri, 16 Aug 2019 09:15:31 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727518AbfHPQOY (ORCPT + 99 others); Fri, 16 Aug 2019 12:14:24 -0400 Received: from mx2.suse.de ([195.135.220.15]:45384 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726550AbfHPQOY (ORCPT ); Fri, 16 Aug 2019 12:14:24 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id BDA24ABF6; Fri, 16 Aug 2019 16:14:21 +0000 (UTC) Received: by quack2.suse.cz (Postfix, from userid 1000) id F03601E4009; Fri, 16 Aug 2019 18:13:55 +0200 (CEST) Date: Fri, 16 Aug 2019 18:13:55 +0200 From: Jan Kara To: Jerome Glisse Cc: Jan Kara , Vlastimil Babka , John Hubbard , Ira Weiny , Andrew Morton , Christoph Hellwig , Dan Williams , Dave Chinner , Jason Gunthorpe , LKML , linux-mm@kvack.org, linux-fsdevel@vger.kernel.org, linux-rdma@vger.kernel.org Subject: Re: [RFC PATCH 2/2] mm/gup: introduce vaddr_pin_pages_remote() Message-ID: <20190816161355.GL3041@quack2.suse.cz> References: <90e5cd11-fb34-6913-351b-a5cc6e24d85d@nvidia.com> <20190814234959.GA463@iweiny-DESK2.sc.intel.com> <2cbdf599-2226-99ae-b4d5-8909a0a1eadf@nvidia.com> <20190815132622.GG14313@quack2.suse.cz> <20190815133510.GA21302@quack2.suse.cz> <0d6797d8-1e04-1ebe-80a7-3d6895fe71b0@suse.cz> <20190816154404.GF3041@quack2.suse.cz> <20190816155220.GC3149@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20190816155220.GC3149@redhat.com> User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri 16-08-19 11:52:20, Jerome Glisse wrote: > On Fri, Aug 16, 2019 at 05:44:04PM +0200, Jan Kara wrote: > > On Fri 16-08-19 10:47:21, Vlastimil Babka wrote: > > > On 8/15/19 3:35 PM, Jan Kara wrote: > > > >> > > > >> So when the GUP user uses MMU notifiers to stop writing to pages whenever > > > >> they are writeprotected with page_mkclean(), they don't really need page > > > >> pin - their access is then fully equivalent to any other mmap userspace > > > >> access and filesystem knows how to deal with those. I forgot out this case > > > >> when I wrote the above sentence. > > > >> > > > >> So to sum up there are three cases: > > > >> 1) DIO case - GUP references to pages serving as DIO buffers are needed for > > > >> relatively short time, no special synchronization with page_mkclean() or > > > >> munmap() => needs FOLL_PIN > > > >> 2) RDMA case - GUP references to pages serving as DMA buffers needed for a > > > >> long time, no special synchronization with page_mkclean() or munmap() > > > >> => needs FOLL_PIN | FOLL_LONGTERM > > > >> This case has also a special case when the pages are actually DAX. Then > > > >> the caller additionally needs file lease and additional file_pin > > > >> structure is used for tracking this usage. > > > >> 3) ODP case - GUP references to pages serving as DMA buffers, MMU notifiers > > > >> used to synchronize with page_mkclean() and munmap() => normal page > > > >> references are fine. > > > > > > IMHO the munlock lesson told us about another one, that's in the end equivalent > > > to 3) > > > > > > 4) pinning for struct page manipulation only => normal page references > > > are fine > > > > Right, it's good to have this for clarity. > > > > > > I want to add that I'd like to convert users in cases 1) and 2) from using > > > > GUP to using differently named function. Users in case 3) can stay as they > > > > are for now although ultimately I'd like to denote such use cases in a > > > > special way as well... > > > > > > So after 1/2/3 is renamed/specially denoted, only 4) keeps the current > > > interface? > > > > Well, munlock() code doesn't even use GUP, just follow_page(). I'd wait to > > see what's left after handling cases 1), 2), and 3) to decide about the > > interface for the remainder. > > > > For 3 we do not need to take a reference at all :) So just forget about 3 > it does not exist. For 3 the reference is the reference the CPU page table > has on the page and that's it. GUP is no longer involve in ODP or anything > like that. Yes, I understand. But the fact is that GUP calls are currently still there e.g. in ODP code. If you can make the code work without taking a page reference at all, I'm only happy :) Honza -- Jan Kara SUSE Labs, CR