Received: by 2002:ac0:946b:0:0:0:0:0 with SMTP id j40csp1099281imj; Thu, 7 Feb 2019 17:44:55 -0800 (PST) X-Google-Smtp-Source: AHgI3IbRAoS380t78bl5Hm0COOnSJSzuUjJY86NzpQFRKc+WllUI6i3qPAoBv6Q5CwK6J0UJKURk X-Received: by 2002:a62:9917:: with SMTP id d23mr854224pfe.88.1549590295742; Thu, 07 Feb 2019 17:44:55 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1549590295; cv=none; d=google.com; s=arc-20160816; b=Y0DbhcpdMf2V43YpIiOc7jKQc/lvwWkQxtjyiAJOTP46NVgf4vfZquM+uOADwikXlv q20OaN6V+HLqXdYQxc0Po8O4fGAutNrUBu4IqlqeFkZJz7JSh8aDzZiOpx+oNSLxgxPP m7hw1c05q9PmcvmzXjKf1my30x01w3yr8kBJ9Ii/R+8iQqN7N+fBrhOnBw4fBTBWVlo8 Asvl1cylWU4Ekw9Y0mA1MNQhOzy8JivJokuWvzyfvP9ci0e/t4UOXKW/2ARPBjplN7CM hFxL0CUqS2UUB/SfcHzzOi4Cisk4lvx4YfNgPS6ffOUDFAanVmebCqMhU0Yw1p7X/FHA Aoyw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=NREgLueQYiNpeDvopa6HKK4WeC+DDbVHKpVnYRgFxAA=; b=H7f5N8kGTcHzoAixi+A4GZg26mrcXgdBOe1g1bkDFEfClkgbLTV1YnmUznq8ntCTBW eWmQGocl/QMQMT/O5eMYbRxhtHnpMsL2z5OzXPXsrjk27jLGsGNJ8UCDAuShcRW46A28 kwqKIQCUKw+0cuXG4GhKffi3EYJl5YGyZFZi4YXFUKlq+OuD4UM5pRm80vEIqD22iwQ4 FlRdPstJTUTUjCjEqgdaydbNRqFXBne5UdqfdDAvDjkOaI48xZGX8sT9on9bASq2EH2V sWl9bqNOERj3Ge/pbc4AW5tQLkt1gAMhJoK3os90UCwvfclofBGeTAeDE/orKQyj3tTW GllQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id a81si644605pfj.195.2019.02.07.17.44.39; Thu, 07 Feb 2019 17:44:55 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727076AbfBHBoW (ORCPT + 99 others); Thu, 7 Feb 2019 20:44:22 -0500 Received: from mga12.intel.com ([192.55.52.136]:39460 "EHLO mga12.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726876AbfBHBoW (ORCPT ); Thu, 7 Feb 2019 20:44:22 -0500 X-Amp-Result: UNKNOWN X-Amp-Original-Verdict: FILE UNKNOWN X-Amp-File-Uploaded: False Received: from orsmga005.jf.intel.com ([10.7.209.41]) by fmsmga106.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 07 Feb 2019 17:44:20 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.58,346,1544515200"; d="scan'208";a="298118289" Received: from iweiny-desk2.sc.intel.com ([10.3.52.157]) by orsmga005.jf.intel.com with ESMTP; 07 Feb 2019 17:44:20 -0800 Date: Thu, 7 Feb 2019 17:44:04 -0800 From: Ira Weiny To: Dan Williams Cc: Jason Gunthorpe , Dave Chinner , Doug Ledford , Christopher Lameter , Matthew Wilcox , Jan Kara , lsf-pc@lists.linux-foundation.org, linux-rdma , Linux MM , Linux Kernel Mailing List , John Hubbard , Jerome Glisse , Michal Hocko Subject: Re: [LSF/MM TOPIC] Discuss least bad options for resolving longterm-GUP usage by RDMA Message-ID: <20190208014403.GA32701@iweiny-DESK2.sc.intel.com> References: <47820c4d696aee41225854071ec73373a273fd4a.camel@redhat.com> <01000168c43d594c-7979fcf8-b9c1-4bda-b29a-500efe001d66-000000@email.amazonses.com> <20190206210356.GZ6173@dastard> <20190206220828.GJ12227@ziepe.ca> <0c868bc615a60c44d618fb0183fcbe0c418c7c83.camel@redhat.com> <20190207035258.GD6173@dastard> <20190207052310.GA22726@ziepe.ca> <20190207171736.GD22726@ziepe.ca> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.11.1 (2018-12-01) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Feb 07, 2019 at 03:54:58PM -0800, Dan Williams wrote: > On Thu, Feb 7, 2019 at 9:17 AM Jason Gunthorpe wrote: > > > > Insisting to run RDMA & DAX without ODP and building an elaborate > > revoke mechanism to support non-ODP HW is inherently baroque. > > > > Use the HW that supports ODP. > > > > Since no HW can do disable of a MR, the escalation path is SIGKILL > > which makes it a non-production toy. > > > > What you keep missing is that for people doing this - the RDMA is a > > critical compoment of the system, you can't just say the kernel will > > randomly degrade/kill RDMA processes - that is a 'toy' configuration > > that is not production worthy. > > > > Especially since this revoke idea is basically a DOS engine for the > > RDMA protocol if another process can do actions to trigger revoke. Now > > we have a new class of security problems. (again, screams non > > production toy) > > > > The only production worthy way is to have the FS be a partner in > > making this work without requiring revoke, so the critical RDMA > > traffic can operate safely. > > > > Otherwise we need to stick to ODP. > > Thanks for this it clears a lot of things up for me... > > ...but this statement: > > > The only production worthy way is to have the FS be a partner in > > making this work without requiring revoke, so the critical RDMA > > traffic can operate safely. > > ...belies a path forward. Just swap out "FS be a partner" with "system > administrator be a partner". In other words, If the RDMA stack can't > tolerate an MR being disabled then the administrator needs to actively > disable the paths that would trigger it. Turn off reflink, don't > truncate, avoid any future FS feature that might generate unwanted > lease breaks. We would need to make sure that lease notifications > include the information to identify the lease breaker to debug escapes > that might happen, but it is a solution that can be qualified to not > lease break. In any event, this lets end users pick their filesystem > (modulo RDMA incompatible features), provides an enumeration of lease > break sources in the kernel, and opens up FS-DAX to a wider array of > RDMA adapters. In general this is what Linux has historically done, > give end users technology freedom. To back off the details of this thread a bit... The details of limitations imposed and how they would be tracked within the kernel would be a great thing to discuss face to face. Hence the reason for my proposal as a topic. Ira