Received: by 2002:ac0:a5b6:0:0:0:0:0 with SMTP id m51-v6csp726821imm; Wed, 6 Jun 2018 05:11:06 -0700 (PDT) X-Google-Smtp-Source: ADUXVKIvy0lFmZusIC6hGGglvWpL1sAUXkB409QDUZ2zPbeYapsRRhr/Z/5G3rtpXIA8SCCBM3nE X-Received: by 2002:a17:902:bb0d:: with SMTP id l13-v6mr3029904pls.115.1528287066277; Wed, 06 Jun 2018 05:11:06 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1528287066; cv=none; d=google.com; s=arc-20160816; b=W4Dw/DU5RN9X0wR2nPkERNgG5l5dHEid53+Ryk2BH/Ma0NxOLP47q1PdtUO5WLOsn3 jm3QvKc2pV4noquKfoRJt+JyjlJmTpLECdFXAnUTxbCQzAXoBLUlCBL0Sy3CrNYNoxsE kZHS6S1lflD95shmfWJCOT3ToZeRmvcinOlV73TOMxUI1dkoP8WAcBKNUwa5F1c7EgDX YZAsLpl/oRDKHU6zXee6OatIl5CNJJmN9XFaYmwJUcRAL7Nn18YYiV3Z+jWL338hji45 Ht+mA0KyBqbrSHX6rVMP8g7v32NYOyu8TX1AnrTbSwbbtEKKceuwY4JxdpGfqB0EsQFf P1pA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:dkim-signature :arc-authentication-results; bh=ftHt6eZ80zgiEgsvs9LoljpwcQ0P2BNKMZ7OCma4EAE=; b=aFRzo3cB7MpxiArVne+M1UNFvH3FmD2Qc8TFa+s9SZsT+a3xCIRESgPZ+fQn6IWAvD 7vabLRYqBp+QaQ8HIFuFPUBKBbWisQVzYoCkhtb5sr8zeyh6zcBAsJk81zbLMXBJyOz5 AIvdDBtyUCNnpiAEcD4eoKLV4uEN7ZWG9YxaHQIKnDzRUIPpPiFikexcZEt6vwTPrtqV wo1IbDaiheBeXPbielLYSPVIb6E/MWD6+ZEa/SKs4mHtxwieAXdEGtKyjv2NNVnD+uL9 dn5xq8DIatHHKUamLr2beuDyMuEV4Vb+5eT75ZL2cjzJhOTYrEm2yzwDCS9m9skyBjSi O6Ig== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=uDdCBpou; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id b60-v6si51512654plc.270.2018.06.06.05.10.51; Wed, 06 Jun 2018 05:11:06 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=uDdCBpou; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752077AbeFFMKT (ORCPT + 99 others); Wed, 6 Jun 2018 08:10:19 -0400 Received: from mail-lf0-f67.google.com ([209.85.215.67]:33779 "EHLO mail-lf0-f67.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750864AbeFFMKR (ORCPT ); Wed, 6 Jun 2018 08:10:17 -0400 Received: by mail-lf0-f67.google.com with SMTP id y20-v6so8764271lfy.0; Wed, 06 Jun 2018 05:10:16 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=subject:to:cc:references:from:message-id:date:user-agent :mime-version:in-reply-to:content-transfer-encoding:content-language; bh=ftHt6eZ80zgiEgsvs9LoljpwcQ0P2BNKMZ7OCma4EAE=; b=uDdCBpouU/lE5gFHp2vLhqbPqlcRw70Qx4gi+tySOEmmV77C1imtnf/VImbCkAu2m3 ILoeSqEEFnD5hdH14JdWUoqsA4PZN7UH1U9xtaEgtVkUZJrrm1RB0ruZoUU9olKsNXv+ 7bQbN87q10pKng8kWZ8zRPeM2PU7rcAtdESOkbuzsrW3cUBDXbbpilfiynkkRIe9yrei 9DJ3qLAmYSoDQrVCKPZsvZeio8m5gv4+qCd1m8i8XhgC4taZFtL94WHVjFJseOmRVav2 3YlOjY2NgtTjpZ0y3goeA5QKnMm7xbyi7sPycbYLxr2ZsV6XItZKqRPzKD5EoSjZJZnJ A86w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-transfer-encoding :content-language; bh=ftHt6eZ80zgiEgsvs9LoljpwcQ0P2BNKMZ7OCma4EAE=; b=Hg7pkvLnOnksc885i7ulRoAvMpCpGGdXorcU4cS9VBdycA87GXRYaBMcMkWGc4J7Gr 94extHVzD0RGlcCePrFJ8QV0q0QNMo51Bp7iQc7RRl9POtIMMV3hAbEQnlf/o7L0qeOK ujIqEYOCXX/gOvb/8Q/6LAdO5PkulOmL8T31z08/hfBnSaNwILXs+yB5I7k+Pplqlvio y2NpSCMAWOz+TtSpWFNpBg4YshQgzkGKUD0iN+ZcU+wn3RWjPXIL8TTkbFRu2BTGoyfz /B8VY548NsVBRG1k3Gb3r336AEvHHujeb+AaY5LwoRuH+YBVQGkIkqk/meF90qJQrW9D QoAQ== X-Gm-Message-State: APt69E3m39Y/XnEe53P/XyGFrVyeF2t9ZBLgqUAYJAf7Le3peJJIEfsW 5qcbfALYU84ONpkef0GRbJE= X-Received: by 2002:a2e:594d:: with SMTP id n74-v6mr2028175ljb.128.1528287015096; Wed, 06 Jun 2018 05:10:15 -0700 (PDT) Received: from [10.17.182.9] (ll-55.209.223.85.sovam.net.ua. [85.223.209.55]) by smtp.gmail.com with ESMTPSA id k12-v6sm4963584lje.94.2018.06.06.05.10.13 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Wed, 06 Jun 2018 05:10:13 -0700 (PDT) Subject: Re: [PATCH v2 7/9] xen/gntdev: Implement dma-buf export functionality To: Boris Ostrovsky , xen-devel@lists.xenproject.org, linux-kernel@vger.kernel.org, dri-devel@lists.freedesktop.org, linux-media@vger.kernel.org, jgross@suse.com, konrad.wilk@oracle.com Cc: daniel.vetter@intel.com, dongwon.kim@intel.com, matthew.d.roper@intel.com, Oleksandr Andrushchenko References: <20180601114132.22596-1-andr2000@gmail.com> <20180601114132.22596-8-andr2000@gmail.com> <96dd30f5-6ac6-498f-06e7-352e46994576@oracle.com> From: Oleksandr Andrushchenko Message-ID: <117e05b3-69f6-b879-50d9-0cddd8e4c313@gmail.com> Date: Wed, 6 Jun 2018 15:10:13 +0300 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.8.0 MIME-Version: 1.0 In-Reply-To: <96dd30f5-6ac6-498f-06e7-352e46994576@oracle.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Content-Language: en-US Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 06/05/2018 01:07 AM, Boris Ostrovsky wrote: > On 06/01/2018 07:41 AM, Oleksandr Andrushchenko wrote: >> From: Oleksandr Andrushchenko >> >> 1. Create a dma-buf from grant references provided by the foreign >> domain. By default dma-buf is backed by system memory pages, but >> by providing GNTDEV_DMA_FLAG_XXX flags it can also be created >> as a DMA write-combine/coherent buffer, e.g. allocated with >> corresponding dma_alloc_xxx API. >> Export the resulting buffer as a new dma-buf. >> >> 2. Implement waiting for the dma-buf to be released: block until the >> dma-buf with the file descriptor provided is released. >> If within the time-out provided the buffer is not released then >> -ETIMEDOUT error is returned. If the buffer with the file descriptor >> does not exist or has already been released, then -ENOENT is >> returned. For valid file descriptors this must not be treated as >> error. >> >> Signed-off-by: Oleksandr Andrushchenko >> --- >> drivers/xen/gntdev-dmabuf.c | 393 +++++++++++++++++++++++++++++++++++- >> drivers/xen/gntdev-dmabuf.h | 9 +- >> drivers/xen/gntdev.c | 90 ++++++++- >> 3 files changed, 486 insertions(+), 6 deletions(-) >> >> diff --git a/drivers/xen/gntdev-dmabuf.c b/drivers/xen/gntdev-dmabuf.c >> index 6bedd1387bd9..f612468879b4 100644 >> --- a/drivers/xen/gntdev-dmabuf.c >> +++ b/drivers/xen/gntdev-dmabuf.c >> @@ -3,15 +3,58 @@ >> /* >> * Xen dma-buf functionality for gntdev. >> * >> + * DMA buffer implementation is based on drivers/gpu/drm/drm_prime.c. >> + * >> * Copyright (c) 2018 Oleksandr Andrushchenko, EPAM Systems Inc. >> */ >> >> +#include >> #include >> >> #include "gntdev-dmabuf.h" >> >> +struct gntdev_dmabuf { >> + struct gntdev_dmabuf_priv *priv; >> + struct dma_buf *dmabuf; >> + struct list_head next; >> + int fd; >> + >> + union { >> + struct { >> + /* Exported buffers are reference counted. */ >> + struct kref refcount; >> + >> + struct gntdev_priv *priv; >> + struct grant_map *map; >> + void (*release)(struct gntdev_priv *priv, >> + struct grant_map *map); >> + } exp; >> + } u; >> + >> + /* Number of pages this buffer has. */ >> + int nr_pages; >> + /* Pages of this buffer. */ >> + struct page **pages; >> +}; >> + >> +struct gntdev_dmabuf_wait_obj { >> + struct list_head next; >> + struct gntdev_dmabuf *gntdev_dmabuf; >> + struct completion completion; >> +}; >> + >> +struct gntdev_dmabuf_attachment { >> + struct sg_table *sgt; >> + enum dma_data_direction dir; >> +}; >> + >> struct gntdev_dmabuf_priv { >> - int dummy; >> + /* List of exported DMA buffers. */ >> + struct list_head exp_list; >> + /* List of wait objects. */ >> + struct list_head exp_wait_list; >> + /* This is the lock which protects dma_buf_xxx lists. */ >> + struct mutex lock; >> }; >> >> /* ------------------------------------------------------------------ */ >> @@ -22,19 +65,359 @@ struct gntdev_dmabuf_priv { >> /* Implementation of wait for exported DMA buffer to be released. */ >> /* ------------------------------------------------------------------ */ >> >> +static void dmabuf_exp_release(struct kref *kref); >> + >> +static struct gntdev_dmabuf_wait_obj * >> +dmabuf_exp_wait_obj_new(struct gntdev_dmabuf_priv *priv, >> + struct gntdev_dmabuf *gntdev_dmabuf) >> +{ >> + struct gntdev_dmabuf_wait_obj *obj; >> + >> + obj = kzalloc(sizeof(*obj), GFP_KERNEL); >> + if (!obj) >> + return ERR_PTR(-ENOMEM); >> + >> + init_completion(&obj->completion); >> + obj->gntdev_dmabuf = gntdev_dmabuf; >> + >> + mutex_lock(&priv->lock); >> + list_add(&obj->next, &priv->exp_wait_list); >> + /* Put our reference and wait for gntdev_dmabuf's release to fire. */ >> + kref_put(&gntdev_dmabuf->u.exp.refcount, dmabuf_exp_release); >> + mutex_unlock(&priv->lock); >> + return obj; >> +} >> + >> +static void dmabuf_exp_wait_obj_free(struct gntdev_dmabuf_priv *priv, >> + struct gntdev_dmabuf_wait_obj *obj) >> +{ >> + struct gntdev_dmabuf_wait_obj *cur_obj, *q; >> + >> + mutex_lock(&priv->lock); >> + list_for_each_entry_safe(cur_obj, q, &priv->exp_wait_list, next) >> + if (cur_obj == obj) { >> + list_del(&obj->next); >> + kfree(obj); >> + break; >> + } >> + mutex_unlock(&priv->lock); >> +} >> + >> +static int dmabuf_exp_wait_obj_wait(struct gntdev_dmabuf_wait_obj *obj, >> + u32 wait_to_ms) >> +{ >> + if (wait_for_completion_timeout(&obj->completion, >> + msecs_to_jiffies(wait_to_ms)) <= 0) >> + return -ETIMEDOUT; >> + >> + return 0; >> +} >> + >> +static void dmabuf_exp_wait_obj_signal(struct gntdev_dmabuf_priv *priv, >> + struct gntdev_dmabuf *gntdev_dmabuf) >> +{ >> + struct gntdev_dmabuf_wait_obj *obj, *q; >> + >> + list_for_each_entry_safe(obj, q, &priv->exp_wait_list, next) >> + if (obj->gntdev_dmabuf == gntdev_dmabuf) { >> + pr_debug("Found gntdev_dmabuf in the wait list, wake\n"); >> + complete_all(&obj->completion); > break ? sure, thank you >> + } >> +} >> + >> +static struct gntdev_dmabuf * >> +dmabuf_exp_wait_obj_get_by_fd(struct gntdev_dmabuf_priv *priv, int fd) >> +{ >> + struct gntdev_dmabuf *q, *gntdev_dmabuf, *ret = ERR_PTR(-ENOENT); >> + >> + mutex_lock(&priv->lock); >> + list_for_each_entry_safe(gntdev_dmabuf, q, &priv->exp_list, next) >> + if (gntdev_dmabuf->fd == fd) { >> + pr_debug("Found gntdev_dmabuf in the wait list\n"); >> + kref_get(&gntdev_dmabuf->u.exp.refcount); >> + ret = gntdev_dmabuf; >> + break; >> + } >> + mutex_unlock(&priv->lock); >> + return ret; >> +} >> + >> int gntdev_dmabuf_exp_wait_released(struct gntdev_dmabuf_priv *priv, int fd, >> int wait_to_ms) >> { >> - return -EINVAL; >> + struct gntdev_dmabuf *gntdev_dmabuf; >> + struct gntdev_dmabuf_wait_obj *obj; >> + int ret; >> + >> + pr_debug("Will wait for dma-buf with fd %d\n", fd); >> + /* >> + * Try to find the DMA buffer: if not found means that >> + * either the buffer has already been released or file descriptor >> + * provided is wrong. >> + */ >> + gntdev_dmabuf = dmabuf_exp_wait_obj_get_by_fd(priv, fd); >> + if (IS_ERR(gntdev_dmabuf)) >> + return PTR_ERR(gntdev_dmabuf); >> + >> + /* >> + * gntdev_dmabuf still exists and is reference count locked by us now, >> + * so prepare to wait: allocate wait object and add it to the wait list, >> + * so we can find it on release. >> + */ >> + obj = dmabuf_exp_wait_obj_new(priv, gntdev_dmabuf); >> + if (IS_ERR(obj)) { >> + pr_err("Failed to setup wait object, ret %ld\n", PTR_ERR(obj)); > > No need for pr_err. We are out of memory. Will remove > >> + return PTR_ERR(obj); >> +} >> + >> + ret = dmabuf_exp_wait_obj_wait(obj, wait_to_ms); >> + dmabuf_exp_wait_obj_free(priv, obj); >> + return ret; >> } >> >> /* ------------------------------------------------------------------ */ >> /* DMA buffer export support. */ >> /* ------------------------------------------------------------------ */ >> >> +static struct sg_table * >> +dmabuf_pages_to_sgt(struct page **pages, unsigned int nr_pages) >> +{ >> + struct sg_table *sgt; >> + int ret; >> + >> + sgt = kmalloc(sizeof(*sgt), GFP_KERNEL); >> + if (!sgt) { >> + ret = -ENOMEM; >> + goto out; >> + } >> + >> + ret = sg_alloc_table_from_pages(sgt, pages, nr_pages, 0, >> + nr_pages << PAGE_SHIFT, >> + GFP_KERNEL); >> + if (ret) >> + goto out; >> + >> + return sgt; >> + >> +out: >> + kfree(sgt); >> + return ERR_PTR(ret); >> +} >> + >> +static int dmabuf_exp_ops_attach(struct dma_buf *dma_buf, >> + struct device *target_dev, >> + struct dma_buf_attachment *attach) >> +{ >> + struct gntdev_dmabuf_attachment *gntdev_dmabuf_attach; >> + >> + gntdev_dmabuf_attach = kzalloc(sizeof(*gntdev_dmabuf_attach), >> + GFP_KERNEL); >> + if (!gntdev_dmabuf_attach) >> + return -ENOMEM; >> + >> + gntdev_dmabuf_attach->dir = DMA_NONE; >> + attach->priv = gntdev_dmabuf_attach; >> + /* Might need to pin the pages of the buffer now. */ > > Who is supposed to pin the pages? The caller? Ok, as we do not implement .mmap for Xen dma-buf and there is no plan to mmap kernel memory (either ballooned or dma_alloc'ed), then we can remove this comment as there is no need to pin/unpin pages. > >> + return 0; >> +} >> + >> +static void dmabuf_exp_ops_detach(struct dma_buf *dma_buf, >> + struct dma_buf_attachment *attach) >> +{ >> + struct gntdev_dmabuf_attachment *gntdev_dmabuf_attach = attach->priv; >> + >> + if (gntdev_dmabuf_attach) { >> + struct sg_table *sgt = gntdev_dmabuf_attach->sgt; >> + >> + if (sgt) { >> + if (gntdev_dmabuf_attach->dir != DMA_NONE) >> + dma_unmap_sg_attrs(attach->dev, sgt->sgl, >> + sgt->nents, >> + gntdev_dmabuf_attach->dir, >> + DMA_ATTR_SKIP_CPU_SYNC); >> + sg_free_table(sgt); >> + } >> + >> + kfree(sgt); >> + kfree(gntdev_dmabuf_attach); >> + attach->priv = NULL; >> + } >> + /* Might need to unpin the pages of the buffer now. */ > Same question. Please see above >> +} >> + >> +static struct sg_table * >> +dmabuf_exp_ops_map_dma_buf(struct dma_buf_attachment *attach, >> + enum dma_data_direction dir) >> +{ >> + struct gntdev_dmabuf_attachment *gntdev_dmabuf_attach = attach->priv; >> + struct gntdev_dmabuf *gntdev_dmabuf = attach->dmabuf->priv; >> + struct sg_table *sgt; >> + >> + pr_debug("Mapping %d pages for dev %p\n", gntdev_dmabuf->nr_pages, >> + attach->dev); >> + >> + if (WARN_ON(dir == DMA_NONE || !gntdev_dmabuf_attach)) > > WARN_ON_ONCE. Here and elsewhere. Why? The UAPI may be used by different applications, thus we might lose warnings for some of them. Having WARN_ON will show problems for multiple users, not for the first one. Does this make sense to still use WARN_ON? > >> + return ERR_PTR(-EINVAL); >> + >> + /* Return the cached mapping when possible. */ >> + if (gntdev_dmabuf_attach->dir == dir) >> + return gntdev_dmabuf_attach->sgt; >> + >> + /* >> + * Two mappings with different directions for the same attachment are >> + * not allowed. >> + */ >> + if (WARN_ON(gntdev_dmabuf_attach->dir != DMA_NONE)) >> + return ERR_PTR(-EBUSY); >> + >> + sgt = dmabuf_pages_to_sgt(gntdev_dmabuf->pages, >> + gntdev_dmabuf->nr_pages); >> + if (!IS_ERR(sgt)) { >> + if (!dma_map_sg_attrs(attach->dev, sgt->sgl, sgt->nents, dir, >> + DMA_ATTR_SKIP_CPU_SYNC)) { >> + sg_free_table(sgt); >> + kfree(sgt); >> + sgt = ERR_PTR(-ENOMEM); >> + } else { >> + gntdev_dmabuf_attach->sgt = sgt; >> + gntdev_dmabuf_attach->dir = dir; >> + } >> + } >> + if (IS_ERR(sgt)) >> + pr_err("Failed to map sg table for dev %p\n", attach->dev); >> + return sgt; >> +} >> + >> +static void dmabuf_exp_ops_unmap_dma_buf(struct dma_buf_attachment *attach, >> + struct sg_table *sgt, >> + enum dma_data_direction dir) >> +{ >> + /* Not implemented. The unmap is done at dmabuf_exp_ops_detach(). */ >> +} >> + >> +static void dmabuf_exp_release(struct kref *kref) >> +{ >> + struct gntdev_dmabuf *gntdev_dmabuf = >> + container_of(kref, struct gntdev_dmabuf, u.exp.refcount); >> + >> + dmabuf_exp_wait_obj_signal(gntdev_dmabuf->priv, gntdev_dmabuf); >> + list_del(&gntdev_dmabuf->next); >> + kfree(gntdev_dmabuf); >> +} >> + >> +static void dmabuf_exp_ops_release(struct dma_buf *dma_buf) >> +{ >> + struct gntdev_dmabuf *gntdev_dmabuf = dma_buf->priv; >> + struct gntdev_dmabuf_priv *priv = gntdev_dmabuf->priv; >> + >> + gntdev_dmabuf->u.exp.release(gntdev_dmabuf->u.exp.priv, >> + gntdev_dmabuf->u.exp.map); >> + mutex_lock(&priv->lock); >> + kref_put(&gntdev_dmabuf->u.exp.refcount, dmabuf_exp_release); >> + mutex_unlock(&priv->lock); >> +} >> + >> +static void *dmabuf_exp_ops_kmap_atomic(struct dma_buf *dma_buf, >> + unsigned long page_num) >> +{ >> + /* Not implemented. */ >> + return NULL; >> +} >> + >> +static void dmabuf_exp_ops_kunmap_atomic(struct dma_buf *dma_buf, >> + unsigned long page_num, void *addr) >> +{ >> + /* Not implemented. */ >> +} >> + >> +static void *dmabuf_exp_ops_kmap(struct dma_buf *dma_buf, >> + unsigned long page_num) >> +{ >> + /* Not implemented. */ >> + return NULL; >> +} >> + >> +static void dmabuf_exp_ops_kunmap(struct dma_buf *dma_buf, >> + unsigned long page_num, void *addr) >> +{ >> + /* Not implemented. */ >> +} >> + >> +static int dmabuf_exp_ops_mmap(struct dma_buf *dma_buf, >> + struct vm_area_struct *vma) >> +{ >> + /* Not implemented. */ >> + return 0; >> +} >> + >> +static const struct dma_buf_ops dmabuf_exp_ops = { >> + .attach = dmabuf_exp_ops_attach, >> + .detach = dmabuf_exp_ops_detach, >> + .map_dma_buf = dmabuf_exp_ops_map_dma_buf, >> + .unmap_dma_buf = dmabuf_exp_ops_unmap_dma_buf, >> + .release = dmabuf_exp_ops_release, >> + .map = dmabuf_exp_ops_kmap, >> + .map_atomic = dmabuf_exp_ops_kmap_atomic, >> + .unmap = dmabuf_exp_ops_kunmap, >> + .unmap_atomic = dmabuf_exp_ops_kunmap_atomic, >> + .mmap = dmabuf_exp_ops_mmap, >> +}; >> + >> int gntdev_dmabuf_exp_from_pages(struct gntdev_dmabuf_export_args *args) >> { >> - return -EINVAL; >> + DEFINE_DMA_BUF_EXPORT_INFO(exp_info); >> + struct gntdev_dmabuf *gntdev_dmabuf; >> + int ret = 0; >> + >> + gntdev_dmabuf = kzalloc(sizeof(*gntdev_dmabuf), GFP_KERNEL); >> + if (!gntdev_dmabuf) >> + return -ENOMEM; >> + >> + kref_init(&gntdev_dmabuf->u.exp.refcount); >> + >> + gntdev_dmabuf->priv = args->dmabuf_priv; >> + gntdev_dmabuf->nr_pages = args->count; >> + gntdev_dmabuf->pages = args->pages; >> + gntdev_dmabuf->u.exp.priv = args->priv; >> + gntdev_dmabuf->u.exp.map = args->map; >> + gntdev_dmabuf->u.exp.release = args->release; >> + >> + exp_info.exp_name = KBUILD_MODNAME; >> + if (args->dev->driver && args->dev->driver->owner) >> + exp_info.owner = args->dev->driver->owner; >> + else >> + exp_info.owner = THIS_MODULE; >> + exp_info.ops = &dmabuf_exp_ops; >> + exp_info.size = args->count << PAGE_SHIFT; >> + exp_info.flags = O_RDWR; >> + exp_info.priv = gntdev_dmabuf; >> + >> + gntdev_dmabuf->dmabuf = dma_buf_export(&exp_info); >> + if (IS_ERR(gntdev_dmabuf->dmabuf)) { >> + ret = PTR_ERR(gntdev_dmabuf->dmabuf); >> + gntdev_dmabuf->dmabuf = NULL; >> + goto fail; >> + } >> + >> + ret = dma_buf_fd(gntdev_dmabuf->dmabuf, O_CLOEXEC); >> + if (ret < 0) >> + goto fail; >> + >> + gntdev_dmabuf->fd = ret; >> + args->fd = ret; >> + >> + pr_debug("Exporting DMA buffer with fd %d\n", ret); >> + >> + mutex_lock(&args->dmabuf_priv->lock); >> + list_add(&gntdev_dmabuf->next, &args->dmabuf_priv->exp_list); >> + mutex_unlock(&args->dmabuf_priv->lock); >> + return 0; >> + >> +fail: >> + if (gntdev_dmabuf->dmabuf) >> + dma_buf_put(gntdev_dmabuf->dmabuf); >> + kfree(gntdev_dmabuf); >> + return ret; >> } >> >> /* ------------------------------------------------------------------ */ >> @@ -66,6 +449,10 @@ struct gntdev_dmabuf_priv *gntdev_dmabuf_init(void) >> if (!priv) >> return ERR_PTR(-ENOMEM); >> >> + mutex_init(&priv->lock); >> + INIT_LIST_HEAD(&priv->exp_list); >> + INIT_LIST_HEAD(&priv->exp_wait_list); >> + >> return priv; >> } >> >> diff --git a/drivers/xen/gntdev-dmabuf.h b/drivers/xen/gntdev-dmabuf.h >> index 040b2de904ac..95c23a24f640 100644 >> --- a/drivers/xen/gntdev-dmabuf.h >> +++ b/drivers/xen/gntdev-dmabuf.h >> @@ -18,7 +18,14 @@ struct gntdev_dmabuf; >> struct device; >> >> struct gntdev_dmabuf_export_args { >> - int dummy; >> + struct gntdev_priv *priv; >> + struct grant_map *map; >> + void (*release)(struct gntdev_priv *priv, struct grant_map *map); >> + struct gntdev_dmabuf_priv *dmabuf_priv; >> + struct device *dev; >> + int count; >> + struct page **pages; >> + u32 fd; >> }; >> >> struct gntdev_dmabuf_priv *gntdev_dmabuf_init(void); >> diff --git a/drivers/xen/gntdev.c b/drivers/xen/gntdev.c >> index 7d58dfb3e5e8..cf255d45f20f 100644 >> --- a/drivers/xen/gntdev.c >> +++ b/drivers/xen/gntdev.c >> @@ -319,6 +319,16 @@ static void gntdev_put_map(struct gntdev_priv *priv, struct grant_map *map) >> gntdev_free_map(map); >> } >> >> +#ifdef CONFIG_XEN_GNTDEV_DMABUF >> +static void gntdev_remove_map(struct gntdev_priv *priv, struct grant_map *map) >> +{ >> + mutex_lock(&priv->lock); >> + list_del(&map->next); >> + gntdev_put_map(NULL /* already removed */, map); >> + mutex_unlock(&priv->lock); >> +} >> +#endif >> + >> /* ------------------------------------------------------------------ */ >> >> static int find_grant_ptes(pte_t *pte, pgtable_t token, >> @@ -1063,12 +1073,88 @@ static long gntdev_ioctl_grant_copy(struct gntdev_priv *priv, void __user *u) >> /* DMA buffer export support. */ >> /* ------------------------------------------------------------------ */ >> >> +static struct grant_map * >> +dmabuf_exp_alloc_backing_storage(struct gntdev_priv *priv, int dmabuf_flags, >> + int count) >> +{ >> + struct grant_map *map; >> + >> + if (unlikely(count <= 0)) >> + return ERR_PTR(-EINVAL); >> + >> + if ((dmabuf_flags & GNTDEV_DMA_FLAG_WC) && >> + (dmabuf_flags & GNTDEV_DMA_FLAG_COHERENT)) { >> + pr_err("Wrong dma-buf flags: either WC or coherent, not both\n"); >> + return ERR_PTR(-EINVAL); >> + } >> + >> + map = gntdev_alloc_map(priv, count, dmabuf_flags); >> + if (!map) >> + return ERR_PTR(-ENOMEM); >> + >> + if (unlikely(atomic_add_return(count, &pages_mapped) > limit)) { >> + pr_err("can't map: over limit\n"); >> + gntdev_put_map(NULL, map); >> + return ERR_PTR(-ENOMEM); >> + } >> + return map; >> +} >> + >> int gntdev_dmabuf_exp_from_refs(struct gntdev_priv *priv, int flags, >> int count, u32 domid, u32 *refs, u32 *fd) >> { >> - /* XXX: this will need to work with gntdev's map, so leave it here. */ >> + struct grant_map *map; >> + struct gntdev_dmabuf_export_args args; >> + int i, ret; >> + >> *fd = -1; >> - return -EINVAL; >> + >> + if (use_ptemod) { >> + pr_err("Cannot provide dma-buf: use_ptemode %d\n", >> + use_ptemod); > No pr_err here please. This can potentially become a DoS vector as it > comes directly from ioctl. > > I would, in fact, revisit other uses of pr_err in this file. Sure, all of pr_err can actually be pr_debug... >> + return -EINVAL; >> + } >> + >> + map = dmabuf_exp_alloc_backing_storage(priv, flags, count); > > @count comes from userspace. dmabuf_exp_alloc_backing_storage only > checks for it to be >0. Should it be checked for some sane max value? This is not easy as it is hard to tell what could be that max value. For DMA buffers if count is too big then allocation will fail, so need to check for max hereĀ  (dma_alloc_{xxx} will filter out too big allocations). For Xen balloon allocations I cannot tell what could be that max value neither. Tough question how to limit. > > -boris Thank you, Oleksandr >> + if (IS_ERR(map)) >> + return PTR_ERR(map); >> + >> + for (i = 0; i < count; i++) { >> + map->grants[i].domid = domid; >> + map->grants[i].ref = refs[i]; >> + } >> + >> + mutex_lock(&priv->lock); >> + gntdev_add_map(priv, map); >> + mutex_unlock(&priv->lock); >> + >> + map->flags |= GNTMAP_host_map; >> +#if defined(CONFIG_X86) >> + map->flags |= GNTMAP_device_map; >> +#endif >> + >> + ret = map_grant_pages(map); >> + if (ret < 0) >> + goto out; >> + >> + args.priv = priv; >> + args.map = map; >> + args.release = gntdev_remove_map; >> + args.dev = priv->dma_dev; >> + args.dmabuf_priv = priv->dmabuf_priv; >> + args.count = map->count; >> + args.pages = map->pages; >> + >> + ret = gntdev_dmabuf_exp_from_pages(&args); >> + if (ret < 0) >> + goto out; >> + >> + *fd = args.fd; >> + return 0; >> + >> +out: >> + gntdev_remove_map(priv, map); >> + return ret; >> } >> >> /* ------------------------------------------------------------------ */