Received: by 2002:ac0:8bc7:0:0:0:0:0 with SMTP id o7csp82323ima; Wed, 6 Feb 2019 21:33:31 -0800 (PST) X-Google-Smtp-Source: AHgI3IYxD7PKcXf7L9fE6nZM7wnpze4svU1uo16YAusCWkB/Rxndrk3wCHP67n+nYBjECKJU8LiI X-Received: by 2002:a62:fb07:: with SMTP id x7mr14309453pfm.71.1549517611061; Wed, 06 Feb 2019 21:33:31 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1549517611; cv=none; d=google.com; s=arc-20160816; b=B/YhkJQkE5f0ris1NdLpwhnCCR47XIjt1dYJaL1hBAf+SZ7R3B3zBOMtcQkII3GuMK h0/ZoUxerGA7J4THmNpg0PaU/SEjz3h+6RoktA27AmlAPlHyw/QIRWUj13yB3fHJpUwl diPvKp/vTq1/Q0UB9OOU84dIhOdZJDQLMD7M+30DuGLW5HJ+MHrlG4dNL11OGCN5jG/C PkKE7skTDqNvQr4ZwXcI0RI9buVkbEPSiCKTOcg3nghoSqr6sfW6T66ojZXy4buQ0m0B Dvkp8Gv/VHwmhBXj95TNDWVxTXtQWj+q8bq+4K8HqA8p+IFKWLI7/ZOjdfQeAoy8j4nc MA7Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature; bh=1dhO9WZwCYTMt9fJvM1nNU2p7dvIM4HOJh/eYKYBoFs=; b=SgzQhLUiQSGkMDsEtHblF1YYZuGpYZVaubX5aomftUFt2XjvFTB1pra+dBzERHxFSP l0wZ153QxBVwEt6bVh+I82URGoFtnYiP9q0pcT47HzO5Qq3R5pGy8zyAdABypnih9Pm6 fX1iMoJOhPsVHNKdhAU5snOKkY06nZmEwzWtem9ARKzDagUR3Z8TtOfVBmrNi2S7pXpP WNIn6Jy3+QPHzo7oZYEcozSG519EwUiDhQvcCRhmi/oYzu9F2JzaIHjwUyVSBFrzZEHj 54tu2XfStN1MOiAgdlUYRih3FHZYsmJisgfWkl5NzDnILvzdrfEil0SNiB0E4S/TzkEf OZrA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@ziepe.ca header.s=google header.b=O6JuEwwu; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e68si8158982pfb.101.2019.02.06.21.33.12; Wed, 06 Feb 2019 21:33:31 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@ziepe.ca header.s=google header.b=O6JuEwwu; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726758AbfBGFdJ (ORCPT + 99 others); Thu, 7 Feb 2019 00:33:09 -0500 Received: from mail-pl1-f196.google.com ([209.85.214.196]:36153 "EHLO mail-pl1-f196.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726293AbfBGFdJ (ORCPT ); Thu, 7 Feb 2019 00:33:09 -0500 Received: by mail-pl1-f196.google.com with SMTP id g9so4286106plo.3 for ; Wed, 06 Feb 2019 21:33:09 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ziepe.ca; s=google; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=1dhO9WZwCYTMt9fJvM1nNU2p7dvIM4HOJh/eYKYBoFs=; b=O6JuEwwu6hKxxjXIys11QD16tgPe+A/2DnTr+z70AF3t0OVv1xdrIWoLuBKwz7HbIf iK+grMOhLiKkNbqOjg5yXtOoS4+o+jyUzZun6Nii5nU6IzXwJNZl1sWS+zC5vGNdbQ9N Q1jQHPwzNejWMzbh1nqmfapYsNafb5dvk/ipc36uEXNaD4va1okaOSVX+enNwIWTzdoV TAuJCe+EfVZna2FV4lI+R69LJDjhKWFr5CNS4i3vmw9NTidXYJviCPvM55UC+Eq4Z5Dc J5Mr45BsFeby+nhsnN//IzTSVyqB5FTN1uph3WPH9WKxgnp6M8RPR9qTpXPrgyb64Ivg vngg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to:user-agent; bh=1dhO9WZwCYTMt9fJvM1nNU2p7dvIM4HOJh/eYKYBoFs=; b=RLbYuExSyblLVIorisX/z1SwHmVodlbhVd5QD4WfAZGPoDFJo50qR3K1KRyctWbuW+ vBC7SxsbQCzJPg6SVdfx4yPeltOJ49SFZugECvQfQ/1XAZ/wTX1ccy8RlB1ajrkqy3tz Is9eSEozMluR6CkyvPUyzRsHbXNe87o71cY+JruUNweyrn1NtXa/odBoAPHTJv+HcAcE wTCcwiFFL0rk3FkzGG3TsgMp32SlhZYUmDktMD1buVa3IzgePZb4WR/ozn5WHFzVr+8F rq8Gx5TmNyO9JqL7wPjFM7/ZkWqo1j1akPwFJ6rqZvsnl98VwPiLeUt5oPbMqmMR3TZn yy9A== X-Gm-Message-State: AHQUAuZz3TJo30A0WTiFQFqXhsQ/zYQXb2lfgaV0987UywLRMWq4LW1g hCZnyDnVPGAYb+ogLc/O9w9vJQ== X-Received: by 2002:a17:902:722:: with SMTP id 31mr14770884pli.271.1549517588507; Wed, 06 Feb 2019 21:33:08 -0800 (PST) Received: from ziepe.ca (S010614cc2056d97f.ed.shawcable.net. [174.3.196.123]) by smtp.gmail.com with ESMTPSA id 196sm22681710pfc.77.2019.02.06.21.33.07 (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Wed, 06 Feb 2019 21:33:07 -0800 (PST) Received: from jgg by mlx.ziepe.ca with local (Exim 4.90_1) (envelope-from ) id 1grcJ9-00067N-8V; Wed, 06 Feb 2019 22:33:07 -0700 Date: Wed, 6 Feb 2019 22:33:07 -0700 From: Jason Gunthorpe To: Dan Williams Cc: Doug Ledford , Dave Chinner , Christopher Lameter , Matthew Wilcox , Jan Kara , Ira Weiny , lsf-pc@lists.linux-foundation.org, linux-rdma , Linux MM , Linux Kernel Mailing List , John Hubbard , Jerome Glisse , Michal Hocko , linux-nvdimm Subject: Re: [LSF/MM TOPIC] Discuss least bad options for resolving longterm-GUP usage by RDMA Message-ID: <20190207053307.GB22726@ziepe.ca> References: <47820c4d696aee41225854071ec73373a273fd4a.camel@redhat.com> <01000168c43d594c-7979fcf8-b9c1-4bda-b29a-500efe001d66-000000@email.amazonses.com> <20190206210356.GZ6173@dastard> <20190206220828.GJ12227@ziepe.ca> <0c868bc615a60c44d618fb0183fcbe0c418c7c83.camel@redhat.com> <20190206232130.GK12227@ziepe.ca> <20190206234132.GB15234@ziepe.ca> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.9.4 (2018-02-28) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Feb 06, 2019 at 04:22:16PM -0800, Dan Williams wrote: > On Wed, Feb 6, 2019 at 3:41 PM Jason Gunthorpe wrote: > [..] > > > You're describing the current situation, i.e. Linux already implements > > > this, it's called Device-DAX and some users of RDMA find it > > > insufficient. The choices are to continue to tell them "no", or say > > > "yes, but you need to submit to lease coordination". > > > > Device-DAX is not what I'm imagining when I say XFS--. > > > > I mean more like XFS with all features that require rellocation of > > blocks disabled. > > > > Forbidding hold punch, reflink, cow, etc, doesn't devolve back to > > device-dax. > > True, not all the way, but the distinction loses significance as you > lose fs features. > > Filesystems mark DAX functionality experimental [1] precisely because > it forbids otherwise typical operations that work in the nominal page > cache case. An approach that says "lets cement the list of things a > filesystem or a core-memory-mangement facility can't do because RDMA > finds it awkward" is bad precedent. I'm not saying these rules should apply globaly. I'm suggesting you could have a FS that supports gup_longterm by design, and a FS that doesn't. And that is OK. They can have different rules. Obviously the golden case here is to use ODP (which doesn't call gup_longterm at all) - that works for both. Supporting non-ODP is a trade off case - users that want to run on limited HW must accept limited functionality. Limited functionality is better than no-funtionality. Linux has many of these user-choose tradeoffs. This is how it supports such a wide range of HW capabilities. Not all HW can do all things. Some features really do need HW support. It has always been that way. Jason