Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752693AbdIGB6v (ORCPT ); Wed, 6 Sep 2017 21:58:51 -0400 Received: from szxga05-in.huawei.com ([45.249.212.191]:5566 "EHLO szxga05-in.huawei.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751287AbdIGB6t (ORCPT ); Wed, 6 Sep 2017 21:58:49 -0400 Subject: Re: [RFC PATCH 0/6] Add platform device SVM support for ARM SMMUv3 To: Jean-Philippe Brucker , Yisheng Xie References: <1504167642-14922-1-git-send-email-xieyisheng1@huawei.com> <95d1a9e2-1816-ff7d-9a8d-98406a6c2c22@arm.com> CC: , , , , , , , , , , , , , , , , , , , From: Bob Liu Message-ID: <8e4764f5-0e5d-7fd6-529b-35914e1e3668@huawei.com> Date: Thu, 7 Sep 2017 09:55:49 +0800 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:45.0) Gecko/20100101 Thunderbird/45.8.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 7bit X-Originating-IP: [10.142.83.150] X-CFilter-Loop: Reflected X-Mirapoint-Virus-RAPID-Raw: score=unknown(0), refid=str=0001.0A020201.59B0A7BD.00D2,ss=1,re=0.000,recu=0.000,reip=0.000,cl=1,cld=1,fgs=0, ip=0.0.0.0, so=2014-11-16 11:51:01, dmn=2013-03-21 17:37:32 X-Mirapoint-Loop-Id: d7765235516df6b4b4b943d0efef56cd Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3854 Lines: 87 On 2017/9/6 17:59, Jean-Philippe Brucker wrote: > On 06/09/17 02:16, Yisheng Xie wrote: >> Hi Jean-Philippe, >> >> On 2017/9/5 20:56, Jean-Philippe Brucker wrote: >>> On 31/08/17 09:20, Yisheng Xie wrote: >>>> Jean-Philippe has post a patchset for Adding PCIe SVM support to ARM SMMUv3: >>>> https://www.spinics.net/lists/arm-kernel/msg565155.html >>>> >>>> But for some platform devices(aka on-chip integrated devices), there is also >>>> SVM requirement, which works based on the SMMU stall mode. >>>> Jean-Philippe has prepared a prototype patchset to support it: >>>> git://linux-arm.org/linux-jpb.git svm/stall >>> >>> Only meant for testing at that point, and unfit even for an RFC. >> >> Sorry about that, I should ask you before send it out. It's my mistake. For I also >> have some question about this patchset. >> >> We have related device, and would like to do some help about it. Do you have >> any plan about upstream ? >> >>> >>>> We tested this patchset with some fixes on a on-chip integrated device. The >>>> basic function is ok, so I just send them out for review, although this >>>> patchset heavily depends on the former patchset (PCIe SVM support for ARM >>>> SMMUv3), which is still under discussion. >>>> >>>> Patch Overview: >>>> *1 to 3 prepare for device tree or acpi get the device stall ability and pasid bits >>>> *4 is to realise the SVM function for platform device >>>> *5 is fix a bug when test SVM function while SMMU donnot support this feature >>>> *6 avoid ILLEGAL setting of STE and CD entry about stall >>>> >>>> Acctually here, I also have some questions about SVM on SMMUv3: >>>> >>>> 1. Why the SVM feature on SMMUv3 depends on BTM feature? when bind a task to device, >>>> it will register a mmu_notify. Therefore, when a page range is invalid, we can >>>> send TLBI or ATC invalid without BTM? >>> >>> We could, but the end goal for SVM is to perfectly mirror the CPU page >>> tables. So for platform SVM we would like to get rid of MMU notifiers >>> entirely. >> >> I see, but for some SMMU which do not support BTM, it cannot benefit from SVM. >> >> Meanwhile, do you mean even with BTM feature, the PCI-e device also need to send a >> ATC invalid by MMU notify? It seems not fair, why not hardware do the entirely work >> in this case? It may costly for send ATC invalid and sync. > > It will certainly be costly. But there are major problems with > transforming broadcast TLB maintenance into ATC invalidations in HW: > > * VMID:ASID to SID:SSID conversion. TLBIs use VMID:ASID, while ATCIs use > SID:SSID. > > * Most importantly, ATC invalidations accounting. Each endpoint has a > limited number of in-flight ATC invalidate requests. The conversion module > would have to buffer incoming invalidations and wait for in-flight ATC > invalidation to complete before sending the next ones. In case of > overflow, either we lose invalidation (which opens security holes) or we > somehow put back-pressure on the interconnect (no idea how feasible this > is, I suspect really hard). > > Solving the last one is also quite difficult in software, but at least we > can still invalidate a range. In hardware we would invalidate the ATC > page-by-page and quickly jam the bus. > Speak to the invalidation, I have one more question. There is a time window between 1) modify page table; 2) tlb invalidate; ARM-CPU Device 1. modify page table ^^^^^ Can still write data through smmu tlb even page table was already modified. (At this point, the same virtual addr may not point to the same thing for CPU and device!!! I'm afraid there may be some data-loss or other potential problems if this situation happens.) 2. tlb invalidate range -- Thanks, Bob