Received: by 2002:ac0:a581:0:0:0:0:0 with SMTP id m1-v6csp6255038imm; Wed, 27 Jun 2018 04:59:55 -0700 (PDT) X-Google-Smtp-Source: AAOMgpfHNNP//H1Gx7fCOUoG2ppXol8b8NpLHGlX9aaaR4MN94/ii/VedQGO4OkNqt39Qa2qtWv6 X-Received: by 2002:a65:5545:: with SMTP id t5-v6mr4107474pgr.157.1530100795671; Wed, 27 Jun 2018 04:59:55 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1530100795; cv=none; d=google.com; s=arc-20160816; b=b+tD7/jEtZJ3F96/xcIp9sIXMMxIu3eUxLHYuRXhZhjeXZSZlbwloktha/jQa951la MtEJDzOri1jEiE63ycCTO8/AO7mEeKOkv6zVxfOq7cdaksygQkce3//KomFNbLzRcugr M8sZSy/5d5XcrzLQDLOEsL9kCvNwmY9gIBYsxov5SuUdOz5GdLl+bKbSks3dKZ+NlPd3 nOqpO1OFuxZy7XTFUEHGnhFh8ZnQmXJjJfqnhXg8UlP6/PZyh/9R+op9WzA0CjJTKbom lxj3EpxCAbIXN1NiyCvLwcoXHLgjp21Dw862868mUrl5nOlYzcAYPsENkE7rFq4Q3d0o urRA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=RF6g9P9tCMoltFrby1eeB2bFYNPxyVLY+6+AaQQOpWM=; b=ZtfE958NYAow4cyKcTcoahpZ8keeDGflc7noFKAvgUPJ36XG3gRnWnkY7xzdG0c1cH Fbl87zIEHG4PYRvtOClY6cksKrdTO817bdYQRCUyskYpkS8T9PF3qRCft12C5ZSbGyjB UYxYhHl44OfIDkVY8xUM4ehOZuHETyr+wNkUap/sW/aE+HxVqnek7c4IOJkIoCdds0fP nd4F4U7n7pba8ZqLZUFInBRkaRE86AKSVwpsgs5XiuNWb6t7zzV2fhiybRMB6vmlH08X 2VPnjnP68K4hgg22jE6Jt5uMODQe1K9AkNmtgOJNO2nSRifySZHwUl9mLaL32bV+oLKk LzLw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id k9-v6si3820404pfc.129.2018.06.27.04.59.41; Wed, 27 Jun 2018 04:59:55 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S934192AbeF0Lx7 (ORCPT + 99 others); Wed, 27 Jun 2018 07:53:59 -0400 Received: from mx2.suse.de ([195.135.220.15]:45811 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S934162AbeF0Lxv (ORCPT ); Wed, 27 Jun 2018 07:53:51 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (charybdis-ext-too.suse.de [195.135.220.254]) by mx2.suse.de (Postfix) with ESMTP id 6C33CADF5; Wed, 27 Jun 2018 11:53:50 +0000 (UTC) Received: by quack2.suse.cz (Postfix, from userid 1000) id DE6B61E3C15; Wed, 27 Jun 2018 13:53:49 +0200 (CEST) Date: Wed, 27 Jun 2018 13:53:49 +0200 From: Jan Kara To: Michal Hocko Cc: Jan Kara , Dan Williams , John Hubbard , Christoph Hellwig , Jason Gunthorpe , John Hubbard , Matthew Wilcox , Christopher Lameter , Linux MM , LKML , linux-rdma Subject: Re: [PATCH 2/2] mm: set PG_dma_pinned on get_user_pages*() Message-ID: <20180627115349.cu2k3ainqqdrrepz@quack2.suse.cz> References: <311eba48-60f1-b6cc-d001-5cc3ed4d76a9@nvidia.com> <20180618081258.GB16991@lst.de> <3898ef6b-2fa0-e852-a9ac-d904b47320d5@nvidia.com> <20180626134757.GY28965@dhcp22.suse.cz> <20180626164825.fz4m2lv6hydbdrds@quack2.suse.cz> <20180627113221.GO32348@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180627113221.GO32348@dhcp22.suse.cz> User-Agent: NeoMutt/20170912 (1.9.0) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed 27-06-18 13:32:21, Michal Hocko wrote: > On Tue 26-06-18 18:48:25, Jan Kara wrote: > > On Tue 26-06-18 15:47:57, Michal Hocko wrote: > > > On Mon 18-06-18 12:21:46, Dan Williams wrote: > > > [...] > > > > I do think we should explore a page flag for pages that are "long > > > > term" pinned. Michal asked for something along these lines at LSF / MM > > > > so that the core-mm can give up on pages that the kernel has lost > > > > lifetime control. Michal, did I capture your ask correctly? > > > > > > I am sorry to be late. I didn't ask for a page flag exactly. I've asked > > > for a way to query for the pin to be temporal or permanent. How that is > > > achieved is another question. Maybe we have some more spare room after > > > recent struct page reorganization but I dunno, to be honest. Maybe we > > > can have an _count offset for these longterm pins. It is not like we are > > > using the whole ref count space, right? > > > > Matthew had an interesting idea to pull pinned pages completely out from > > any LRU and reuse that space in struct page for pinned refcounts. From some > > initial investigation (read on elsewhere in this thread) it looks doable. I > > was considering offsetting in refcount as well but on 32-bit architectures > > there's not that many bits that I'd be really comfortable with that > > solution... > > I am really slow at following up this discussion. The problem I would > see with off-lru pages is that this can quickly turn into a weird > reclaim behavior. Especially when we are talking about a lot of memory. > It is true that such pages wouldn't be reclaimable directly but could > poke them in some way if we see too many of them while scanning LRU. > > Not that this is a fundamental block stopper but this is the first thing > that popped out when thinking about such a solution. Maybe it is a good > start though. > > Appart from that, do we really care about 32b here? Big DIO, IB users > seem to be 64b only AFAIU. IMO it is a bad habit to leave unpriviledged-user-triggerable oops in the kernel even for uncommon platforms... Honza -- Jan Kara SUSE Labs, CR