Date: Mon, 28 Nov 2016 15:24:52 -0700
From: Jason Gunthorpe
To: Serguei Sagalovitch
Cc: Logan Gunthorpe, Haggai Eran, Christian König, Dan Williams,
	"Deucher, Alexander", "linux-nvdimm@lists.01.org",
	"linux-rdma@vger.kernel.org", "linux-pci@vger.kernel.org",
	"Kuehling, Felix", "Bridgman, John", "linux-kernel@vger.kernel.org",
	"dri-devel@lists.freedesktop.org", "Sander, Ben",
	"Suthikulpanit, Suravee", "Blinzer, Paul",
	"Linux-media@vger.kernel.org", Max Gurtovoy
Subject: Re: Enabling peer to peer device transactions for PCIe devices
Message-ID: <20161128222452.GA744@obsidianresearch.com>
References: <91d28749-bc64-622f-56a1-26c00e6b462a@deltatee.com>
	<20161124164249.GD20818@obsidianresearch.com>
	<3f2d2db3-fb75-2422-2a18-a8497fd5d70e@amd.com>
	<20161125193252.GC16504@obsidianresearch.com>
	<20161128165751.GB28381@obsidianresearch.com>
	<0d3d56e2-4d2b-85b7-9487-b7ae2aaea610@amd.com>
	<1ac2f9e7-f1ee-a2c9-0134-ffaa28c706af@amd.com>
In-Reply-To: <1ac2f9e7-f1ee-a2c9-0134-ffaa28c706af@amd.com>

On Mon, Nov 28, 2016 at 04:55:23PM -0500, Serguei Sagalovitch wrote:
> >We haven't touched this in a long time and perhaps it changed, but there
> >definitely was a callback in the PeerDirect API to allow the GPU to
> >invalidate the mapping. That's what we don't want.
> I assume that you are talking about the "invalidate_peer_memory()" callback?
> I was told that it is the "last resort" because HCA (and driver) is not
> able to handle it in a safe manner so it is basically "abort" everything.

If it is a last resort to save system stability, then kill the impacted
process; that will release the MRs.

Jason