Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751332AbdILBMb (ORCPT ); Mon, 11 Sep 2017 21:12:31 -0400 Received: from szxga04-in.huawei.com ([45.249.212.190]:6010 "EHLO szxga04-in.huawei.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750999AbdILBM3 (ORCPT ); Mon, 11 Sep 2017 21:12:29 -0400 Subject: Re: [PATCH 0/6] Cache coherent device memory (CDM) with HMM v5 To: Jerome Glisse , Bob Liu References: <20170718153816.GA3135@redhat.com> <20170719022537.GA6911@redhat.com> <20170720150305.GA2767@redhat.com> <20170721014106.GB25991@redhat.com> <20170905193644.GD19397@redhat.com> <20170911233649.GA4892@redhat.com> CC: Dan Williams , "linux-kernel@vger.kernel.org" , Linux MM , John Hubbard , David Nellans , Balbir Singh , Michal Hocko , Andrew Morton From: Bob Liu Message-ID: <905f3242-e17b-a4c1-dd03-36f64161fa02@huawei.com> Date: Tue, 12 Sep 2017 09:02:19 +0800 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:45.0) Gecko/20100101 Thunderbird/45.8.0 MIME-Version: 1.0 In-Reply-To: <20170911233649.GA4892@redhat.com> Content-Type: text/plain; charset="windows-1252" Content-Transfer-Encoding: 8bit X-Originating-IP: [10.142.83.150] X-CFilter-Loop: Reflected X-Mirapoint-Virus-RAPID-Raw: score=unknown(0), refid=str=0001.0A020201.59B7322C.00D8,ss=1,re=0.000,recu=0.000,reip=0.000,cl=1,cld=1,fgs=0, ip=0.0.0.0, so=2014-11-16 11:51:01, dmn=2013-03-21 17:37:32 X-Mirapoint-Loop-Id: d90c064be42052f9d76254321560d092 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3558 Lines: 79 On 2017/9/12 7:36, Jerome Glisse wrote: > On Sun, Sep 10, 2017 at 07:22:58AM +0800, Bob Liu wrote: >> On Wed, Sep 6, 2017 at 3:36 AM, Jerome Glisse wrote: >>> On Thu, Jul 20, 2017 at 08:48:20PM -0700, Dan Williams wrote: >>>> On Thu, Jul 20, 2017 at 6:41 PM, Jerome Glisse wrote: >>>>> On Fri, Jul 21, 2017 at 09:15:29AM +0800, Bob Liu wrote: >>>>>> On 2017/7/20 23:03, Jerome Glisse wrote: >>>>>>> On Wed, Jul 19, 2017 at 05:09:04PM +0800, Bob Liu wrote: >>>>>>>> On 2017/7/19 10:25, Jerome Glisse wrote: >>>>>>>>> On Wed, Jul 19, 2017 at 09:46:10AM +0800, Bob Liu wrote: >>>>>>>>>> On 2017/7/18 23:38, Jerome Glisse wrote: >>>>>>>>>>> On Tue, Jul 18, 2017 at 11:26:51AM +0800, Bob Liu wrote: >>>>>>>>>>>> On 2017/7/14 5:15, J?r?me Glisse wrote: >>> >>> [...] >>> >>>>>>> Second device driver are not integrated that closely within mm and the >>>>>>> scheduler kernel code to allow to efficiently plug in device access >>>>>>> notification to page (ie to update struct page so that numa worker >>>>>>> thread can migrate memory base on accurate informations). >>>>>>> >>>>>>> Third it can be hard to decide who win between CPU and device access >>>>>>> when it comes to updating thing like last CPU id. >>>>>>> >>>>>>> Fourth there is no such thing like device id ie equivalent of CPU id. >>>>>>> If we were to add something the CPU id field in flags of struct page >>>>>>> would not be big enough so this can have repercusion on struct page >>>>>>> size. This is not an easy sell. >>>>>>> >>>>>>> They are other issues i can't think of right now. I think for now it >>>>>> >>>>>> My opinion is most of the issues are the same no matter use CDM or HMM-CDM. >>>>>> I just care about a more complete solution no matter CDM,HMM-CDM or other ways. >>>>>> HMM or HMM-CDM depends on device driver, but haven't see a public/full driver to >>>>>> demonstrate the whole solution works fine. >>>>> >>>>> I am working with NVidia close source driver team to make sure that it works >>>>> well for them. I am also working on nouveau open source driver for same NVidia >>>>> hardware thought it will be of less use as what is missing there is a solid >>>>> open source userspace to leverage this. Nonetheless open source driver are in >>>>> the work. >>>> >>>> Can you point to the nouveau patches? I still find these HMM patches >>>> un-reviewable without an upstream consumer. >>> >>> So i pushed a branch with WIP for nouveau to use HMM: >>> >>> https://cgit.freedesktop.org/~glisse/linux/log/?h=hmm-nouveau >>> >> >> Nice to see that. >> Btw, do you have any plan for a CDM-HMM driver? CPU can write to >> Device memory directly without extra copy. > > Yes nouveau CDM support on PPC (which is the only CDM platform commercialy > available today) is on the TODO list. Note that the driver changes for CDM > are minimal (probably less than 100 lines of code). From the driver point > of view this is memory and it doesn't matter if it is CDM or not. > > The real burden is on the application developpers who need to update their > code to leverage this. > Why it's not transparent to application? Application just use system malloc() and don't care whether the data is copied or not. > > Also as a data point you want to avoid CPU access to CDM device memory as > much as possible. The overhead for single cache line access are high (this > is PCIE or derivative protocol and it is a packet protocol). > Thank you for the hint, we are going to follow cdm-hmm since HMM already merged into upstream. -- Thanks, Bob