Received: by 2002:a25:ab43:0:0:0:0:0 with SMTP id u61csp272432ybi; Fri, 7 Jun 2019 07:51:25 -0700 (PDT) X-Google-Smtp-Source: APXvYqysSwrgZ5afSbmXHPLK0eRNzNUUMxyvSbIYoDcmwZuUbW3T+pFYp/V0B0OVwtcp/oEl8Gal X-Received: by 2002:a63:1a5e:: with SMTP id a30mr2967268pgm.433.1559919085573; Fri, 07 Jun 2019 07:51:25 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1559919085; cv=none; d=google.com; s=arc-20160816; b=eKjybNng/pHdfkuJaKyKmczLTA3yM1+j4gt4yhhilvMPY31DhIowG4LEZgnK142KVX Oo/qUim2/DfXH4txvX6P1EDChvryUATMJWINef9PjM7jiiL1DbDFPBMjhR9hRJIpjkL8 HQphowF8HM/Gqp6bTGyTgHATETgkgoPS0NlKhB2wUGJyqTJ+xaa9RFfvhRFo7SKTF4+9 D3xI/vJEMfxLjnA6uMI4fJfBvPvZqpw6CCad/axC/VcsclTMTzx7fLxFa+9t6QYA1sAG 1qM0YvwEDq0/8TgUITvIoOfJj7o2582HIpAznTb/P0R4bxE8uHiASCXn0cC+mNKVs7mI gCBA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=ONrEX0pLSNG7X0Q/0w97Q79cpcqRNXaRPFhLlTP43jc=; b=0CXcQ/QXcNkgTGrQU65JkqkBQUZz60B1XtAWPfE38DC3JHeWJenJVZIZ95sg0wHdkL rI2q/lIecw/wrJyQ+uF8T62j6YfJxXRui1mpgpTvDl/28+Xq1pT+IwqQZC77pHYKltLX NS9BTZEXjbY1hUvhHuZBmBzIcIjRh/Z5Ijfgo3bZMjlSvQeXF8S442+/atin80U6oxAV Fe+Kd3KTGp0YgOqqXtcepmaRhImORzWX3MjYY4RNVYlGRUbh/0CNrab/MLH//+R3ZDC2 tUa713W+D4neguY3+K4jve8LwE5BCfPI9DCu7xuBZA+DzeCwIEeZR37k2fhJ3Y1Ys1UF zk1w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-ext4-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e11si1850070pgs.31.2019.06.07.07.51.03; Fri, 07 Jun 2019 07:51:25 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-ext4-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-ext4-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728464AbfFGOvB (ORCPT + 99 others); Fri, 7 Jun 2019 10:51:01 -0400 Received: from mga18.intel.com ([134.134.136.126]:24997 "EHLO mga18.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728446AbfFGOvB (ORCPT ); Fri, 7 Jun 2019 10:51:01 -0400 X-Amp-Result: UNSCANNABLE X-Amp-File-Uploaded: False Received: from fmsmga002.fm.intel.com ([10.253.24.26]) by orsmga106.jf.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 07 Jun 2019 07:51:00 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.63,563,1557212400"; d="scan'208";a="182683628" Received: from iweiny-desk2.sc.intel.com ([10.3.52.157]) by fmsmga002.fm.intel.com with ESMTP; 07 Jun 2019 07:51:00 -0700 Date: Fri, 7 Jun 2019 07:52:13 -0700 From: Ira Weiny To: Jason Gunthorpe Cc: Jan Kara , Dan Williams , Theodore Ts'o , Jeff Layton , Dave Chinner , Matthew Wilcox , linux-xfs@vger.kernel.org, Andrew Morton , John Hubbard , =?iso-8859-1?B?Suly9G1l?= Glisse , linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, linux-nvdimm@lists.01.org, linux-ext4@vger.kernel.org, linux-mm@kvack.org Subject: Re: [PATCH RFC 00/10] RDMA/FS DAX truncate proposal Message-ID: <20190607145213.GB14559@iweiny-DESK2.sc.intel.com> References: <20190606014544.8339-1-ira.weiny@intel.com> <20190606104203.GF7433@quack2.suse.cz> <20190606195114.GA30714@ziepe.ca> <20190606222228.GB11698@iweiny-DESK2.sc.intel.com> <20190607103636.GA12765@quack2.suse.cz> <20190607121729.GA14802@ziepe.ca> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20190607121729.GA14802@ziepe.ca> User-Agent: Mutt/1.11.1 (2018-12-01) Sender: linux-ext4-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-ext4@vger.kernel.org On Fri, Jun 07, 2019 at 09:17:29AM -0300, Jason Gunthorpe wrote: > On Fri, Jun 07, 2019 at 12:36:36PM +0200, Jan Kara wrote: > > > Because the pins would be invisible to sysadmin from that point on. > > It is not invisible, it just shows up in a rdma specific kernel > interface. You have to use rdma netlink to see the kernel object > holding this pin. > > If this visibility is the main sticking point I suggest just enhancing > the existing MR reporting to include the file info for current GUP > pins and teaching lsof to collect information from there as well so it > is easy to use. > > If the ownership of the lease transfers to the MR, and we report that > ownership to userspace in a way lsof can find, then I think all the > concerns that have been raised are met, right? I was contemplating some new lsof feature yesterday. But what I don't think we want is sysadmins to have multiple tools for multiple subsystems. Or even have to teach lsof something new for every potential new subsystem user of GUP pins. I was thinking more along the lines of reporting files which have GUP pins on them directly somewhere (dare I say procfs?) and teaching lsof to report that information. That would cover any subsystem which does a longterm pin. > > > ugly to live so we have to come up with something better. The best I can > > currently come up with is to have a method associated with the lease that > > would invalidate the RDMA context that holds the pins in the same way that > > a file close would do it. > > This is back to requiring all RDMA HW to have some new behavior they > currently don't have.. > > The main objection to the current ODP & DAX solution is that very > little HW can actually implement it, having the alternative still > require HW support doesn't seem like progress. > > I think we will eventually start seein some HW be able to do this > invalidation, but it won't be universal, and I'd rather leave it > optional, for recovery from truely catastrophic errors (ie my DAX is > on fire, I need to unplug it). Agreed. I think software wise there is not much some of the devices can do with such an "invalidate". Ira