Received: by 2002:a25:ab43:0:0:0:0:0 with SMTP id u61csp446594ybi; Fri, 31 May 2019 04:07:21 -0700 (PDT) X-Google-Smtp-Source: APXvYqxOLa2FaH359zNUZKKk5QGGOSEOsOlT02y4aso0w6hhLKx2GZi58VjdLrY0pG9u0jzCoA1y X-Received: by 2002:a63:f710:: with SMTP id x16mr8379623pgh.216.1559300841174; Fri, 31 May 2019 04:07:21 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1559300841; cv=none; d=google.com; s=arc-20160816; b=zO1pwW1wELBVZK5K6CtY3RyBlhtXogSFGzmEnE1+wjVvpLFgbeQKsPPZO+evUdG2cc GrtHuHP4W5K2ZTbjJnvb2WeErHsBhLLYomt/U2Sn1XrPBky/hi+bfcuw94rEFQu2yYD2 Lwl9MByItEp39SGPtLrUXMBOlBgts7J1Ntqe2Knhdpu38OSpJC+Fpg7ZXi6dROBH3TrB OOHQmDS9i6wJhAF8m87q7qgV+CUURp7vPWWtEGV4nVD9a3Fhh1WUUGUTm0uN9HTync0u i6pKHx1rp4Jx2Ssn5SEhN0jUNJrxZ1tG7WnDMCqb3peHRTEomEWNzgi3Aga4AqwgBHt9 xS0Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=60Qw700xF164wG7p5BsIJw5jAg3eroN4jWr2z/QxvPU=; b=lPvciYrQdWkbcUJbBIlzIseR6F1mnf0iuMsG8wnBBxaZ/KutJf/c6JkjC7mZ+NKYPM KLuHqIkHDuXFkB0CYQgz+snJy1QCfGWPfoSll9lOckqVnNjopzr3ARLIlPl71EqXHaoz sYurVvhiWKscOHzyaXTsOzAB4jgqrE8IzZLs3ycE4p8URMEvcTTTdc8XJmz9ARvU5jGE CsBMClOqoUZPsJyNHCYh6BlSyV3sMvdj1UgD2r4FERYI/fbTpP7aAOZtJjHQICm8tXOA JJvCZ9TlnS6f+uylXgYbxJJlq6ed26FvSjc7KW6/HJ+YrFs5o4uWQuHcN9DBJVRUI7Kx +2lw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=a4mRQqxd; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id l96si5521424plb.115.2019.05.31.04.06.52; Fri, 31 May 2019 04:07:21 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=a4mRQqxd; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726518AbfEaLFk (ORCPT + 99 others); Fri, 31 May 2019 07:05:40 -0400 Received: from mail-io1-f65.google.com ([209.85.166.65]:42365 "EHLO mail-io1-f65.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726280AbfEaLFj (ORCPT ); Fri, 31 May 2019 07:05:39 -0400 Received: by mail-io1-f65.google.com with SMTP id g16so7801032iom.9 for ; Fri, 31 May 2019 04:05:39 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=60Qw700xF164wG7p5BsIJw5jAg3eroN4jWr2z/QxvPU=; b=a4mRQqxdQjhQGZEE+NmBCzyeQc+ZVPtaxK/OqZYYdmzyZw/ms7KnKg28+WenzXoqQ9 RbLiH8SECZHXZl8On7zljNcq9ZbjamcvHSHwwyZoYdNkt+LT5GrkwRdWOLKBNdrnV38S RIaZKQbm0SF/w7k8E3L2VoTpXgVwf/PDydUHQ0xZuEeua1Mhz9MrrMT3PGQXVYhjCk7h d5oDC2ML2yXo7Y4seRpm1X93vpVz2nzjRDhFxdO/Vjt+gUWgXQF1zdHCTqGhA6XRTLTq ikGo2JCpjGG+ey24sLxCJYHy6LbiQnz9qnul40bEzX/emxK25pjJC0HHewOFr6oTYZfM gRbg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=60Qw700xF164wG7p5BsIJw5jAg3eroN4jWr2z/QxvPU=; b=fKQ8FgHavAWDtxFUxkmJWP/EjqF/O6F3bOP1Ju6Kv8icxS7P8pIdQXsh6RQMunxx5Q eu/DRqg3NF8EeJq+W/I0Gxm/wsTtNZ85daMMCQIeTTcCBOt+fYHH+ZfugD4XEicAb/xj kyD+y24C2W3TYeqbfDpY4bmcK+PmBjzJlTkgGsNbp0s75ewWUrKUM3m/nAfh5pliGB+9 aCc5n2oOPQxUgy2iEdL5I3VV6oW066yETc9ECF2XfnXLEizgnQJuiQPhIy0r1ur0IYzv x5AuomBViuUjowBjc99MBLtmdL7/eyG72Txi2ywPmFrXzjkDrUX25fsiyCHpu0iJ2cjd VWPQ== X-Gm-Message-State: APjAAAVX4CP7HVLIs7FsvIzEuH2gcnDSUmDJUNOje8ImuBL+OjCLfq8B rM/EITzfZWw/f4MbSq8LVcDQEIUEV+vkPvKMeQ== X-Received: by 2002:a5e:d70c:: with SMTP id v12mr5592967iom.12.1559300738725; Fri, 31 May 2019 04:05:38 -0700 (PDT) MIME-Version: 1.0 References: <1559170444-3304-1-git-send-email-kernelfans@gmail.com> <20190530214726.GA14000@iweiny-DESK2.sc.intel.com> <1497636a-8658-d3ff-f7cd-05230fdead19@nvidia.com> In-Reply-To: <1497636a-8658-d3ff-f7cd-05230fdead19@nvidia.com> From: Pingfan Liu Date: Fri, 31 May 2019 19:05:27 +0800 Message-ID: Subject: Re: [PATCH] mm/gup: fix omission of check on FOLL_LONGTERM in get_user_pages_fast() To: John Hubbard Cc: Ira Weiny , linux-mm@kvack.org, Andrew Morton , Mike Rapoport , Dan Williams , Matthew Wilcox , "Aneesh Kumar K.V" , Keith Busch , LKML Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, May 31, 2019 at 7:21 AM John Hubbard wrote: > > On 5/30/19 2:47 PM, Ira Weiny wrote: > > On Thu, May 30, 2019 at 06:54:04AM +0800, Pingfan Liu wrote: > [...] > >> + for (j = i; j < nr; j++) > >> + put_page(pages[j]); > > > > Should be put_user_page() now. For now that just calls put_page() but it is > > slated to change soon. > > > > I also wonder if this would be more efficient as a check as we are walking the > > page tables and bail early. > > > > Perhaps the code complexity is not worth it? > > Good point, it might be worth it. Because now we've got two loops that > we run, after the interrupts-off page walk, and it's starting to look like > a potential performance concern. > > > > >> + nr = i; > > > > Why not just break from the loop here? > > > > Or better yet just use 'i' in the inner loop... > > > > ...but if you do end up putting in the after-the-fact check, then we can > go one or two steps further in cleaning it up, by: > > * hiding the visible #ifdef that was slicing up gup_fast, > > * using put_user_pages() instead of either put_page or put_user_page, > thus getting rid of j entirely, and > > * renaming an ancient minor confusion: nr --> nr_pinned), > > we could have this, which is looks cleaner and still does the same thing: > > diff --git a/mm/gup.c b/mm/gup.c > index f173fcbaf1b2..0c1f36be1863 100644 > --- a/mm/gup.c > +++ b/mm/gup.c > @@ -1486,6 +1486,33 @@ static __always_inline long __gup_longterm_locked(struct task_struct *tsk, > } > #endif /* CONFIG_FS_DAX || CONFIG_CMA */ > > +#ifdef CONFIG_CMA > +/* > + * Returns the number of pages that were *not* rejected. This makes it > + * exactly compatible with its callers. > + */ > +static int reject_cma_pages(int nr_pinned, unsigned gup_flags, > + struct page **pages) > +{ > + int i = 0; > + if (unlikely(gup_flags & FOLL_LONGTERM)) { > + > + for (i = 0; i < nr_pinned; i++) > + if (is_migrate_cma_page(pages[i])) { > + put_user_pages(&pages[i], nr_pinned - i); > + break; > + } > + } > + return i; > +} > +#else > +static int reject_cma_pages(int nr_pinned, unsigned gup_flags, > + struct page **pages) > +{ > + return nr_pinned; > +} > +#endif > + > /* > * This is the same as get_user_pages_remote(), just with a > * less-flexible calling convention where we assume that the task > @@ -2216,7 +2243,7 @@ int get_user_pages_fast(unsigned long start, int nr_pages, > unsigned int gup_flags, struct page **pages) > { > unsigned long addr, len, end; > - int nr = 0, ret = 0; > + int nr_pinned = 0, ret = 0; > > start &= PAGE_MASK; > addr = start; > @@ -2231,25 +2258,27 @@ int get_user_pages_fast(unsigned long start, int nr_pages, > > if (gup_fast_permitted(start, nr_pages)) { > local_irq_disable(); > - gup_pgd_range(addr, end, gup_flags, pages, &nr); > + gup_pgd_range(addr, end, gup_flags, pages, &nr_pinned); > local_irq_enable(); > - ret = nr; > + ret = nr_pinned; > } > > - if (nr < nr_pages) { > + nr_pinned = reject_cma_pages(nr_pinned, gup_flags, pages); > + > + if (nr_pinned < nr_pages) { > /* Try to get the remaining pages with get_user_pages */ > - start += nr << PAGE_SHIFT; > - pages += nr; > + start += nr_pinned << PAGE_SHIFT; > + pages += nr_pinned; > > - ret = __gup_longterm_unlocked(start, nr_pages - nr, > + ret = __gup_longterm_unlocked(start, nr_pages - nr_pinned, > gup_flags, pages); > > /* Have to be a bit careful with return values */ > - if (nr > 0) { > + if (nr_pinned > 0) { > if (ret < 0) > - ret = nr; > + ret = nr_pinned; > else > - ret += nr; > + ret += nr_pinned; > } > } > > > Rather lightly tested...I've compile-tested with CONFIG_CMA and !CONFIG_CMA, > and boot tested with CONFIG_CMA, but could use a second set of eyes on whether > I've added any off-by-one errors, or worse. :) > Do you mind I send V2 based on your above patch? Anyway, it is a simple bug fix. Thanks, Pingfan