Received: by 2002:a25:8b12:0:0:0:0:0 with SMTP id i18csp52120ybl; Fri, 9 Aug 2019 02:19:47 -0700 (PDT) X-Google-Smtp-Source: APXvYqxJEwLUaKjodO38FXJDEPuJ4Tg6tiqCMxddiJauiC82DsDTnF1x0rNah/kUxHONq8dxqA2D X-Received: by 2002:a17:902:2ec5:: with SMTP id r63mr17903466plb.21.1565342387799; Fri, 09 Aug 2019 02:19:47 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1565342387; cv=none; d=google.com; s=arc-20160816; b=zNRemxbTEa+qwVMJ0sipQAA3P2t/RKKP+8q41vTA+/zgSuqKZOfy+A/mHsqTuaKFxa QCURO1orXVsKV1z8kmhU+UuLCYQ3VtrgLsMcBmv/oD9tdDjYgJ+9s9SoIHppkIplLrsC TMdn2HCFqSHMVWHH3WqF1dnHmtCQ66xRriFE1K30AiKWKW2Z+tcLjKu7judpnOt3gnqs MA4x0hNPuULSaVBML8iYbo2l9mMgXtJ4JnVe+51zr1uWIdFU5rjAZ716/mpM9lmN+D5q qa22LS0EJylJ0pMAlNgqqX1izOuRWEdm62uOeUMZlM6gnl5dbHADbGOgQzuJBMVd1GSN YpsQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=MA1fK0KtLP5wo+bM1c0XFXmTq958i6JcPGSsXrTh1nY=; b=Dgdn+ge3ivgMsAEhYnsTHBhphiugh5MpFTkJwugWLzZRreC2MIRkdrrjzu7qcrugwM p5Vh35kufchZJ0+MhQ4QzpTJiiJNFHwTf1hY/NamKAiyzjgMfhdLFX4kmm6++UhipPfk 7uXWj6qKukZoOAFO8VpLzwM9TpxATps1vw52w8Tp2druQ7rF6qq7QJrX6tQm/bGNx8YG BuGiIr8EFssfpjZFl4TfTfxN6A26SdUbOl+Sl+tm7UglLXR+cvBdHuwanUeUhbm17GwB hp9BpfEDe2ViOJVkSyOLto8Xr4SBE2hwAZnH0K9b7CI/4CfB1AgHJekNeCFVaKDBY5qn hz1g== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id c12si24944716pgl.252.2019.08.09.02.19.31; Fri, 09 Aug 2019 02:19:47 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2406031AbfHIJQR (ORCPT + 99 others); Fri, 9 Aug 2019 05:16:17 -0400 Received: from mx2.suse.de ([195.135.220.15]:60362 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S2405641AbfHIJQR (ORCPT ); Fri, 9 Aug 2019 05:16:17 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id EAA78B011; Fri, 9 Aug 2019 09:16:14 +0000 (UTC) Date: Fri, 9 Aug 2019 11:16:14 +0200 From: Michal Hocko To: John Hubbard Cc: Vlastimil Babka , Andrew Morton , Christoph Hellwig , Ira Weiny , Jan Kara , Jason Gunthorpe , Jerome Glisse , LKML , linux-mm@kvack.org, linux-fsdevel@vger.kernel.org, Dan Williams , Daniel Black , Matthew Wilcox , Mike Kravetz Subject: Re: [PATCH 1/3] mm/mlock.c: convert put_page() to put_user_page*() Message-ID: <20190809091614.GO18351@dhcp22.suse.cz> References: <20190805222019.28592-2-jhubbard@nvidia.com> <20190807110147.GT11812@dhcp22.suse.cz> <01b5ed91-a8f7-6b36-a068-31870c05aad6@nvidia.com> <20190808062155.GF11812@dhcp22.suse.cz> <875dca95-b037-d0c7-38bc-4b4c4deea2c7@suse.cz> <306128f9-8cc6-761b-9b05-578edf6cce56@nvidia.com> <420a5039-a79c-3872-38ea-807cedca3b8a@suse.cz> <20190809082307.GL18351@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri 09-08-19 02:05:15, John Hubbard wrote: > On 8/9/19 1:23 AM, Michal Hocko wrote: > > On Fri 09-08-19 10:12:48, Vlastimil Babka wrote: > > > On 8/9/19 12:59 AM, John Hubbard wrote: > > > > > > That's true. However, I'm not sure munlocking is where the > > > > > > put_user_page() machinery is intended to be used anyway? These are > > > > > > short-term pins for struct page manipulation, not e.g. dirtying of page > > > > > > contents. Reading commit fc1d8e7cca2d I don't think this case falls > > > > > > within the reasoning there. Perhaps not all GUP users should be > > > > > > converted to the planned separate GUP tracking, and instead we should > > > > > > have a GUP/follow_page_mask() variant that keeps using get_page/put_page? > > > > > > > > > > Interesting. So far, the approach has been to get all the gup callers to > > > > > release via put_user_page(), but if we add in Jan's and Ira's vaddr_pin_pages() > > > > > wrapper, then maybe we could leave some sites unconverted. > > > > > > > > > > However, in order to do so, we would have to change things so that we have > > > > > one set of APIs (gup) that do *not* increment a pin count, and another set > > > > > (vaddr_pin_pages) that do. > > > > > > > > > > Is that where we want to go...? > > > > > > > > > > > We already have a FOLL_LONGTERM flag, isn't that somehow related? And if > > > it's not exactly the same thing, perhaps a new gup flag to distinguish > > > which kind of pinning to use? > > > > Agreed. This is a shiny example how forcing all existing gup users into > > the new scheme is subotimal at best. Not the mention the overal > > fragility mention elsewhere. I dislike the conversion even more now. > > > > Sorry if this was already discussed already but why the new pinning is > > not bound to FOLL_LONGTERM (ideally hidden by an interface so that users > > do not have to care about the flag) only? > > > > Oh, it's been discussed alright, but given how some of the discussions have gone, > I certainly am not surprised that there are still questions and criticisms! > Especially since I may have misunderstood some of the points, along the way. > It's been quite a merry go round. :) Yeah, I've tried to follow them but just gave up at some point. > Anyway, what I'm hearing now is: for gup(FOLL_LONGTERM), apply the pinned tracking. > And therefore only do put_user_page() on pages that were pinned with > FOLL_LONGTERM. For short term pins, let the locking do what it will: > things can briefly block and all will be well. > > Also, that may or may not come with a wrapper function, courtesy of Jan > and Ira. > > Is that about right? It's late here, but I don't immediately recall any > problems with doing it that way... Yes that makes more sense to me. Whoever needs that tracking should opt-in for it. Otherwise you just risk problems like the one discussed in the mlock path (because we do a strange stuff in the name of performance) and a never ending whack a mole where new users do not follow the new API usage and that results in all sorts of weird issues. Thanks! -- Michal Hocko SUSE Labs