Date: Sat, 24 Nov 2018 12:13:21 +0800
From: Kenneth Lee
To: Jason Gunthorpe
CC: Leon Romanovsky, Kenneth Lee, Tim Sell, linux-doc@vger.kernel.org,
	Alexander Shishkin, Zaibo Xu, zhangfei.gao@foxmail.com,
	linuxarm@huawei.com, haojian.zhuang@linaro.org, Christoph Lameter,
	Hao Fang, Gavin Schenk, RDMA mailing list, Zhou Wang, Doug Ledford,
	Uwe Kleine-König, David Kershner, Johan Hovold, Cyrille Pitchen,
	Sagar Dharia, Jens Axboe, guodong.xu@linaro.org, linux-netdev,
	Randy Dunlap, linux-kernel@vger.kernel.org, Vinod Koul,
	linux-crypto@vger.kernel.org, Philippe Ombredanne, Sanyog Kale,
	David S. Miller, linux-accelerators@lists.ozlabs.org
Subject: Re: [RFCv3 PATCH 1/6] uacce: Add documents for WarpDrive/uacce
Message-ID: <20181124041321.GL157308@Turing-Arch-b>
References: <20181115085109.GD157308@Turing-Arch-b>
	<20181115145455.GN3759@mtr-leonro.mtl.com>
	<20181119091405.GE157308@Turing-Arch-b>
	<20181119184954.GB4890@ziepe.ca>
	<20181120030702.GH157308@Turing-Arch-b>
	<20181120032939.GR4890@ziepe.ca>
	<20181121060805.GJ157308@Turing-Arch-b>
	<20181122025840.GB19938@ziepe.ca>
	<20181123080242.GK157308@Turing-Arch-b>
	<20181123180504.GA3395@ziepe.ca>
In-Reply-To: <20181123180504.GA3395@ziepe.ca>
X-Mailing-List: linux-kernel@vger.kernel.org

On Fri, Nov 23, 2018 at 11:05:04AM -0700, Jason Gunthorpe wrote:
> On Fri, Nov 23, 2018 at 04:02:42PM +0800, Kenneth Lee wrote:
>
> > It is already part of Jean's patchset.
> > And that's why I built my solution on VFIO in the first place. But I think
> > the concept of SVA and PASID is not compatible with the original VFIO
> > concept space. You would not share your whole address space with a device
> > in a virtual machine manager, would you?
>
> Why not? That seems to fit VFIO's space just fine to me.. You might
> need a new upcall to create a full MM registration, but that doesn't
> seem unsuited.

Because the VM manager (such as qemu) does not want to share its whole
address space with the device. It is a security problem.

>
> Part of the point here is you should try to make sensible revisions to
> existing subsystems before just inventing a new thing...
>
> VFIO is deeply connected to the IOMMU, so enabling more general IOMMU
> based approaches seems perfectly fine to me..
>
> > > Once the VFIO driver knows about this as a generic capability then the
> > > device it exposes to userspace would use CPU addresses instead of DMA
> > > addresses.
> > >
> > > The question is if your driver needs much more than the device
> > > agnostic generic services VFIO provides.
> > >
> > > I'm not sure what you have in mind with resource management.. It is
> > > hard to revoke resources from userspace, unless you are doing
> > > kernel syscalls, but then why do all this?
> >
> > Say, I have 1024 queues in my accelerator. I can get one by opening the
> > device and attaching it to the fd. If the process exits by any means, the
> > queue can be returned with the release of the fd. But if it is an mdev, it
> > will still be there and someone has to tell the allocator it is available
> > again. This is not easy to design in user space.
>
> ?? why wouldn't the mdev track the queues assigned using the existing
> open/close/ioctl callbacks?
>
> That is basic flow I would expect:
>
>   open(/dev/vfio)
>   ioctl(unity map entire process MM to mdev with IOMMU)
>
>   // Create a HW queue and link the PASID in the HW to this HW queue
>   struct hw queue[..];
>   ioctl(create HW queue)
>
>   // Get BAR doorbell memory for the queue
>   bar = mmap()
>
>   // Submit work to the queue using CPU addresses
>   queue[0] = ...
>   writel(bar [..], &queue);
>
>   // Queue, SVA, etc is cleaned up when the VFIO closes
>   close()

This is not the way that you can use mdev. To use mdev, you have to:

1. unbind the kernel driver from the device, and rebind it to the vfio driver
2. for each of the 1024 queues: echo uuid > /sys/.../the_dev/mdev/create
   to create all the mdevs
3. a virtual iommu_group will then be created in /dev/vfio/* for every mdev

Now you can do this in your application (even without considering the PASID):

	container = open(/dev/vfio);
	ioctl(container, setting);
	group = open(/dev/vfio/my_group_for_particular_mdev);
	ioctl(container, attach_group, group);
	device = ioctl(group, get_device);
	mmap(device);
	ioctl(container, set_dma_operation);

Then you have to decide how to find an available mdev to use and how to
return it.

We have considered creating only one mdev and allocating a queue when the
device is opened. But the VFIO maintainer, Alex, did not agree and said it
broke the original idea of VFIO.

-Kenneth

>
> Presumably the kernel has to handle the PASID and related for security
> reasons, so they shouldn't go to userspace?
>
> If there is something missing in vfio to do this it looks pretty
> small to me..
>
> Jason

-- 
			-Kenneth(Hisilicon)
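
To make the queue lifetime argument concrete, here is a minimal user-space
sketch. It is not uacce or VFIO code. It assumes a hypothetical pool of 1024
queues in which every queue records the handle that owns it, so that releasing
the handle (standing in for the release of the fd) returns its queues without
anyone having to notify the allocator separately. All names (queue_pool,
pool_open, pool_release, pool_free_count) are made up for illustration.

```c
#include <stddef.h>

#define NUM_QUEUES 1024   /* queue count used as the example in this thread */
#define QUEUE_FREE (-1)   /* marker for a queue that nobody owns */

/* Hypothetical bookkeeping: owner[i] is the handle that holds queue i. */
struct queue_pool {
	int owner[NUM_QUEUES];
};

/* Mark every queue as free. */
void pool_init(struct queue_pool *p)
{
	for (size_t i = 0; i < NUM_QUEUES; i++)
		p->owner[i] = QUEUE_FREE;
}

/*
 * Stand-in for open(): bind the first free queue to the handle.
 * Returns the queue index, or -1 if the handle is negative or no
 * queue is free.
 */
int pool_open(struct queue_pool *p, int handle)
{
	if (handle < 0)
		return -1;

	for (size_t i = 0; i < NUM_QUEUES; i++) {
		if (p->owner[i] == QUEUE_FREE) {
			p->owner[i] = handle;
			return (int)i;
		}
	}
	return -1;
}

/*
 * Stand-in for the fd release: give back every queue owned by the handle.
 * Returns the number of queues returned to the pool.
 */
int pool_release(struct queue_pool *p, int handle)
{
	int freed = 0;

	if (handle < 0)
		return 0;

	for (size_t i = 0; i < NUM_QUEUES; i++) {
		if (p->owner[i] == handle) {
			p->owner[i] = QUEUE_FREE;
			freed++;
		}
	}
	return freed;
}

/* Count the queues that are currently free. */
int pool_free_count(const struct queue_pool *p)
{
	int n = 0;

	for (size_t i = 0; i < NUM_QUEUES; i++)
		if (p->owner[i] == QUEUE_FREE)
			n++;
	return n;
}
```

In the fd-based model the kernel would call the equivalent of pool_release()
from the release path, even if the process is killed. In the mdev model as
described above, nothing makes that call automatically, so some other party has
to tell the allocator that the mdev is available again.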