From: Alan Cox Subject: Re: [RFC PATCH 0/7] A General Accelerator Framework, WarpDrive Date: Fri, 3 Aug 2018 15:20:43 +0100 Message-ID: <20180803152043.40f88947@alans-desktop> References: <20180801102221.5308-1-nek.in.cn@gmail.com> <20180801165644.GA3820@redhat.com> <20180802111000.4649d9ed@alans-desktop> <20180802144627.GB3481@redhat.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Cc: "Tian, Kevin" , Alex Williamson , Herbert Xu , "kvm-u79uwXL29TY76Z2rM5mHXA@public.gmane.org" , Jonathan Corbet , Greg Kroah-Hartman , Zaibo Xu , "linux-doc-u79uwXL29TY76Z2rM5mHXA@public.gmane.org" , "Kumar, Sanjay K" , Hao Fang , "linux-kernel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org" , "linuxarm-hv44wF8Li93QT0dZR+AlfA@public.gmane.org" , "iommu-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org" , "linux-crypto-u79uwXL29TY76Z2rM5mHXA@public.gmane.org" , Kenneth Lee , Philippe Ombredanne , Thomas Gleixner , Ross Zwisler , Kenneth Lee , "David S . Mille To: Jerome Glisse Return-path: In-Reply-To: <20180802144627.GB3481-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: iommu-bounces-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org Errors-To: iommu-bounces-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org List-Id: linux-crypto.vger.kernel.org > If we are going to have any kind of general purpose accelerator API then > > it has to be able to implement things like > > Why is the existing driver model not good enough ? So you want > a device with function X you look into /dev/X (for instance > for GPU you look in /dev/dri) Except when my GPU is in an FPGA in which case it might be somewhere else or it's a general purpose accelerator that happens to be usable as a GPU. Unusual today in big computer space but you'll find it in microcontrollers. > Each of those device need a userspace driver and thus this > user space driver can easily knows where to look. I do not > expect that every application will reimplement those drivers > but instead use some kind of library that provide a high > level API for each of those devices. Think about it from the user level. You have a pipeline of things you wish to execute, you need to get the right accelerator combinations and they need to fit together to meet system constraints like number of IOMMU ids the accelerator supports, where they are connected. > Now you have a hierarchy of memory for the CPU (HBM, local > node main memory aka you DDR dimm, persistent memory) each It's not a heirarchy, it's a graph. There's no fundamental reason two accelerators can't be close to two different CPU cores but have shared HBM that is far from each processor. There are physical reasons it tends to look more like a heirarchy today. > Anyway i think finding devices and finding relation between > devices and memory is 2 separate problems and as such should > be handled separatly. At a certain level they are deeply intertwined because you need a common API. It's not good if I want a particular accelerator and need to then see which API its under on this machine and which interface I have to use, and maybe have a mix of FPGA, WarpDrive and Google ASIC interfaces all different. The job of the kernel is to impose some kind of sanity and unity on this lot. All of it in the end comes down to 'Somehow glue some chunk of memory into my address space and find any supporting driver I need' plus virtualization of the above. That bit's easy - but making it usable is a different story. Alan