2023-06-29 22:33:37

by Danilo Krummrich

[permalink] [raw]
Subject: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings

Add infrastructure to keep track of GPU virtual address (VA) mappings
with a decicated VA space manager implementation.

New UAPIs, motivated by Vulkan sparse memory bindings graphics drivers
start implementing, allow userspace applications to request multiple and
arbitrary GPU VA mappings of buffer objects. The DRM GPU VA manager is
intended to serve the following purposes in this context.

1) Provide infrastructure to track GPU VA allocations and mappings,
making use of the maple_tree.

2) Generically connect GPU VA mappings to their backing buffers, in
particular DRM GEM objects.

3) Provide a common implementation to perform more complex mapping
operations on the GPU VA space. In particular splitting and merging
of GPU VA mappings, e.g. for intersecting mapping requests or partial
unmap requests.

Tested-by: Donald Robson <[email protected]>
Reviewed-by: Boris Brezillon <[email protected]>
Suggested-by: Dave Airlie <[email protected]>
Signed-off-by: Danilo Krummrich <[email protected]>
---
Documentation/gpu/drm-mm.rst | 36 +
drivers/gpu/drm/Makefile | 1 +
drivers/gpu/drm/drm_gem.c | 3 +
drivers/gpu/drm/drm_gpuva_mgr.c | 1743 +++++++++++++++++++++++++++++++
include/drm/drm_drv.h | 6 +
include/drm/drm_gem.h | 52 +
include/drm/drm_gpuva_mgr.h | 756 ++++++++++++++
7 files changed, 2597 insertions(+)
create mode 100644 drivers/gpu/drm/drm_gpuva_mgr.c
create mode 100644 include/drm/drm_gpuva_mgr.h

diff --git a/Documentation/gpu/drm-mm.rst b/Documentation/gpu/drm-mm.rst
index a52e6f4117d6..3d5dc9dc1bfe 100644
--- a/Documentation/gpu/drm-mm.rst
+++ b/Documentation/gpu/drm-mm.rst
@@ -466,6 +466,42 @@ DRM MM Range Allocator Function References
.. kernel-doc:: drivers/gpu/drm/drm_mm.c
:export:

+DRM GPU VA Manager
+==================
+
+Overview
+--------
+
+.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
+ :doc: Overview
+
+Split and Merge
+---------------
+
+.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
+ :doc: Split and Merge
+
+Locking
+-------
+
+.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
+ :doc: Locking
+
+Examples
+--------
+
+.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
+ :doc: Examples
+
+DRM GPU VA Manager Function References
+--------------------------------------
+
+.. kernel-doc:: include/drm/drm_gpuva_mgr.h
+ :internal:
+
+.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
+ :export:
+
DRM Buddy Allocator
===================

diff --git a/drivers/gpu/drm/Makefile b/drivers/gpu/drm/Makefile
index 414855e2a463..6d6c9dec66e8 100644
--- a/drivers/gpu/drm/Makefile
+++ b/drivers/gpu/drm/Makefile
@@ -45,6 +45,7 @@ drm-y := \
drm_vblank.o \
drm_vblank_work.o \
drm_vma_manager.o \
+ drm_gpuva_mgr.o \
drm_writeback.o
drm-$(CONFIG_DRM_LEGACY) += \
drm_agpsupport.o \
diff --git a/drivers/gpu/drm/drm_gem.c b/drivers/gpu/drm/drm_gem.c
index 1a5a2cd0d4ec..cd878ebddbd0 100644
--- a/drivers/gpu/drm/drm_gem.c
+++ b/drivers/gpu/drm/drm_gem.c
@@ -164,6 +164,9 @@ void drm_gem_private_object_init(struct drm_device *dev,
if (!obj->resv)
obj->resv = &obj->_resv;

+ if (drm_core_check_feature(dev, DRIVER_GEM_GPUVA))
+ drm_gem_gpuva_init(obj);
+
drm_vma_node_reset(&obj->vma_node);
INIT_LIST_HEAD(&obj->lru_node);
}
diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
new file mode 100644
index 000000000000..4414990c05cc
--- /dev/null
+++ b/drivers/gpu/drm/drm_gpuva_mgr.c
@@ -0,0 +1,1743 @@
+// SPDX-License-Identifier: GPL-2.0
+/*
+ * Copyright (c) 2022 Red Hat.
+ *
+ * Permission is hereby granted, free of charge, to any person obtaining a
+ * copy of this software and associated documentation files (the "Software"),
+ * to deal in the Software without restriction, including without limitation
+ * the rights to use, copy, modify, merge, publish, distribute, sublicense,
+ * and/or sell copies of the Software, and to permit persons to whom the
+ * Software is furnished to do so, subject to the following conditions:
+ *
+ * The above copyright notice and this permission notice shall be included in
+ * all copies or substantial portions of the Software.
+ *
+ * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+ * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+ * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL
+ * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, DAMAGES OR
+ * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE,
+ * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR
+ * OTHER DEALINGS IN THE SOFTWARE.
+ *
+ * Authors:
+ * Danilo Krummrich <[email protected]>
+ *
+ */
+
+#include <drm/drm_gpuva_mgr.h>
+
+#include <linux/interval_tree_generic.h>
+#include <linux/mm.h>
+
+/**
+ * DOC: Overview
+ *
+ * The DRM GPU VA Manager, represented by struct drm_gpuva_manager keeps track
+ * of a GPU's virtual address (VA) space and manages the corresponding virtual
+ * mappings represented by &drm_gpuva objects. It also keeps track of the
+ * mapping's backing &drm_gem_object buffers.
+ *
+ * &drm_gem_object buffers maintain a list of &drm_gpuva objects representing
+ * all existent GPU VA mappings using this &drm_gem_object as backing buffer.
+ *
+ * GPU VAs can be flagged as sparse, such that drivers may use GPU VAs to also
+ * keep track of sparse PTEs in order to support Vulkan 'Sparse Resources'.
+ *
+ * The GPU VA manager internally uses a rb-tree to manage the
+ * &drm_gpuva mappings within a GPU's virtual address space.
+ *
+ * The &drm_gpuva_manager contains a special &drm_gpuva representing the
+ * portion of VA space reserved by the kernel. This node is initialized together
+ * with the GPU VA manager instance and removed when the GPU VA manager is
+ * destroyed.
+ *
+ * In a typical application drivers would embed struct drm_gpuva_manager and
+ * struct drm_gpuva within their own driver specific structures, there won't be
+ * any memory allocations of it's own nor memory allocations of &drm_gpuva
+ * entries.
+ *
+ * The data structures needed to store &drm_gpuvas within the &drm_gpuva_manager
+ * are contained within struct drm_gpuva already. Hence, for inserting
+ * &drm_gpuva entries from within dma-fence signalling critical sections it is
+ * enough to pre-allocate the &drm_gpuva structures.
+ */
+
+/**
+ * DOC: Split and Merge
+ *
+ * Besides it's capability to manage and represent a GPU VA space, the
+ * &drm_gpuva_manager also provides functions to let the &drm_gpuva_manager
+ * calculate a sequence of operations to satisfy a given map or unmap request.
+ *
+ * Therefore the DRM GPU VA manager provides an algorithm implementing splitting
+ * and merging of existent GPU VA mappings with the ones that are requested to
+ * be mapped or unmapped. This feature is required by the Vulkan API to
+ * implement Vulkan 'Sparse Memory Bindings' - drivers UAPIs often refer to this
+ * as VM BIND.
+ *
+ * Drivers can call drm_gpuva_sm_map() to receive a sequence of callbacks
+ * containing map, unmap and remap operations for a given newly requested
+ * mapping. The sequence of callbacks represents the set of operations to
+ * execute in order to integrate the new mapping cleanly into the current state
+ * of the GPU VA space.
+ *
+ * Depending on how the new GPU VA mapping intersects with the existent mappings
+ * of the GPU VA space the &drm_gpuva_fn_ops callbacks contain an arbitrary
+ * amount of unmap operations, a maximum of two remap operations and a single
+ * map operation. The caller might receive no callback at all if no operation is
+ * required, e.g. if the requested mapping already exists in the exact same way.
+ *
+ * The single map operation represents the original map operation requested by
+ * the caller.
+ *
+ * &drm_gpuva_op_unmap contains a 'keep' field, which indicates whether the
+ * &drm_gpuva to unmap is physically contiguous with the original mapping
+ * request. Optionally, if 'keep' is set, drivers may keep the actual page table
+ * entries for this &drm_gpuva, adding the missing page table entries only and
+ * update the &drm_gpuva_manager's view of things accordingly.
+ *
+ * Drivers may do the same optimization, namely delta page table updates, also
+ * for remap operations. This is possible since &drm_gpuva_op_remap consists of
+ * one unmap operation and one or two map operations, such that drivers can
+ * derive the page table update delta accordingly.
+ *
+ * Note that there can't be more than two existent mappings to split up, one at
+ * the beginning and one at the end of the new mapping, hence there is a
+ * maximum of two remap operations.
+ *
+ * Analogous to drm_gpuva_sm_map() drm_gpuva_sm_unmap() uses &drm_gpuva_fn_ops
+ * to call back into the driver in order to unmap a range of GPU VA space. The
+ * logic behind this function is way simpler though: For all existent mappings
+ * enclosed by the given range unmap operations are created. For mappings which
+ * are only partically located within the given range, remap operations are
+ * created such that those mappings are split up and re-mapped partically.
+ *
+ * As an alternative to drm_gpuva_sm_map() and drm_gpuva_sm_unmap(),
+ * drm_gpuva_sm_map_ops_create() and drm_gpuva_sm_unmap_ops_create() can be used
+ * to directly obtain an instance of struct drm_gpuva_ops containing a list of
+ * &drm_gpuva_op, which can be iterated with drm_gpuva_for_each_op(). This list
+ * contains the &drm_gpuva_ops analogous to the callbacks one would receive when
+ * calling drm_gpuva_sm_map() or drm_gpuva_sm_unmap(). While this way requires
+ * more memory (to allocate the &drm_gpuva_ops), it provides drivers a way to
+ * iterate the &drm_gpuva_op multiple times, e.g. once in a context where memory
+ * allocations are possible (e.g. to allocate GPU page tables) and once in the
+ * dma-fence signalling critical path.
+ *
+ * To update the &drm_gpuva_manager's view of the GPU VA space
+ * drm_gpuva_insert() and drm_gpuva_remove() may be used. These functions can
+ * safely be used from &drm_gpuva_fn_ops callbacks originating from
+ * drm_gpuva_sm_map() or drm_gpuva_sm_unmap(). However, it might be more
+ * convenient to use the provided helper functions drm_gpuva_map(),
+ * drm_gpuva_remap() and drm_gpuva_unmap() instead.
+ *
+ * The following diagram depicts the basic relationships of existent GPU VA
+ * mappings, a newly requested mapping and the resulting mappings as implemented
+ * by drm_gpuva_sm_map() - it doesn't cover any arbitrary combinations of these.
+ *
+ * 1) Requested mapping is identical. Replace it, but indicate the backing PTEs
+ * could be kept.
+ *
+ * ::
+ *
+ * 0 a 1
+ * old: |-----------| (bo_offset=n)
+ *
+ * 0 a 1
+ * req: |-----------| (bo_offset=n)
+ *
+ * 0 a 1
+ * new: |-----------| (bo_offset=n)
+ *
+ *
+ * 2) Requested mapping is identical, except for the BO offset, hence replace
+ * the mapping.
+ *
+ * ::
+ *
+ * 0 a 1
+ * old: |-----------| (bo_offset=n)
+ *
+ * 0 a 1
+ * req: |-----------| (bo_offset=m)
+ *
+ * 0 a 1
+ * new: |-----------| (bo_offset=m)
+ *
+ *
+ * 3) Requested mapping is identical, except for the backing BO, hence replace
+ * the mapping.
+ *
+ * ::
+ *
+ * 0 a 1
+ * old: |-----------| (bo_offset=n)
+ *
+ * 0 b 1
+ * req: |-----------| (bo_offset=n)
+ *
+ * 0 b 1
+ * new: |-----------| (bo_offset=n)
+ *
+ *
+ * 4) Existent mapping is a left aligned subset of the requested one, hence
+ * replace the existent one.
+ *
+ * ::
+ *
+ * 0 a 1
+ * old: |-----| (bo_offset=n)
+ *
+ * 0 a 2
+ * req: |-----------| (bo_offset=n)
+ *
+ * 0 a 2
+ * new: |-----------| (bo_offset=n)
+ *
+ * .. note::
+ * We expect to see the same result for a request with a different BO
+ * and/or non-contiguous BO offset.
+ *
+ *
+ * 5) Requested mapping's range is a left aligned subset of the existent one,
+ * but backed by a different BO. Hence, map the requested mapping and split
+ * the existent one adjusting it's BO offset.
+ *
+ * ::
+ *
+ * 0 a 2
+ * old: |-----------| (bo_offset=n)
+ *
+ * 0 b 1
+ * req: |-----| (bo_offset=n)
+ *
+ * 0 b 1 a' 2
+ * new: |-----|-----| (b.bo_offset=n, a.bo_offset=n+1)
+ *
+ * .. note::
+ * We expect to see the same result for a request with a different BO
+ * and/or non-contiguous BO offset.
+ *
+ *
+ * 6) Existent mapping is a superset of the requested mapping. Split it up, but
+ * indicate that the backing PTEs could be kept.
+ *
+ * ::
+ *
+ * 0 a 2
+ * old: |-----------| (bo_offset=n)
+ *
+ * 0 a 1
+ * req: |-----| (bo_offset=n)
+ *
+ * 0 a 1 a' 2
+ * new: |-----|-----| (a.bo_offset=n, a'.bo_offset=n+1)
+ *
+ *
+ * 7) Requested mapping's range is a right aligned subset of the existent one,
+ * but backed by a different BO. Hence, map the requested mapping and split
+ * the existent one, without adjusting the BO offset.
+ *
+ * ::
+ *
+ * 0 a 2
+ * old: |-----------| (bo_offset=n)
+ *
+ * 1 b 2
+ * req: |-----| (bo_offset=m)
+ *
+ * 0 a 1 b 2
+ * new: |-----|-----| (a.bo_offset=n,b.bo_offset=m)
+ *
+ *
+ * 8) Existent mapping is a superset of the requested mapping. Split it up, but
+ * indicate that the backing PTEs could be kept.
+ *
+ * ::
+ *
+ * 0 a 2
+ * old: |-----------| (bo_offset=n)
+ *
+ * 1 a 2
+ * req: |-----| (bo_offset=n+1)
+ *
+ * 0 a' 1 a 2
+ * new: |-----|-----| (a'.bo_offset=n, a.bo_offset=n+1)
+ *
+ *
+ * 9) Existent mapping is overlapped at the end by the requested mapping backed
+ * by a different BO. Hence, map the requested mapping and split up the
+ * existent one, without adjusting the BO offset.
+ *
+ * ::
+ *
+ * 0 a 2
+ * old: |-----------| (bo_offset=n)
+ *
+ * 1 b 3
+ * req: |-----------| (bo_offset=m)
+ *
+ * 0 a 1 b 3
+ * new: |-----|-----------| (a.bo_offset=n,b.bo_offset=m)
+ *
+ *
+ * 10) Existent mapping is overlapped by the requested mapping, both having the
+ * same backing BO with a contiguous offset. Indicate the backing PTEs of
+ * the old mapping could be kept.
+ *
+ * ::
+ *
+ * 0 a 2
+ * old: |-----------| (bo_offset=n)
+ *
+ * 1 a 3
+ * req: |-----------| (bo_offset=n+1)
+ *
+ * 0 a' 1 a 3
+ * new: |-----|-----------| (a'.bo_offset=n, a.bo_offset=n+1)
+ *
+ *
+ * 11) Requested mapping's range is a centered subset of the existent one
+ * having a different backing BO. Hence, map the requested mapping and split
+ * up the existent one in two mappings, adjusting the BO offset of the right
+ * one accordingly.
+ *
+ * ::
+ *
+ * 0 a 3
+ * old: |-----------------| (bo_offset=n)
+ *
+ * 1 b 2
+ * req: |-----| (bo_offset=m)
+ *
+ * 0 a 1 b 2 a' 3
+ * new: |-----|-----|-----| (a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2)
+ *
+ *
+ * 12) Requested mapping is a contiguous subset of the existent one. Split it
+ * up, but indicate that the backing PTEs could be kept.
+ *
+ * ::
+ *
+ * 0 a 3
+ * old: |-----------------| (bo_offset=n)
+ *
+ * 1 a 2
+ * req: |-----| (bo_offset=n+1)
+ *
+ * 0 a' 1 a 2 a'' 3
+ * old: |-----|-----|-----| (a'.bo_offset=n, a.bo_offset=n+1, a''.bo_offset=n+2)
+ *
+ *
+ * 13) Existent mapping is a right aligned subset of the requested one, hence
+ * replace the existent one.
+ *
+ * ::
+ *
+ * 1 a 2
+ * old: |-----| (bo_offset=n+1)
+ *
+ * 0 a 2
+ * req: |-----------| (bo_offset=n)
+ *
+ * 0 a 2
+ * new: |-----------| (bo_offset=n)
+ *
+ * .. note::
+ * We expect to see the same result for a request with a different bo
+ * and/or non-contiguous bo_offset.
+ *
+ *
+ * 14) Existent mapping is a centered subset of the requested one, hence
+ * replace the existent one.
+ *
+ * ::
+ *
+ * 1 a 2
+ * old: |-----| (bo_offset=n+1)
+ *
+ * 0 a 3
+ * req: |----------------| (bo_offset=n)
+ *
+ * 0 a 3
+ * new: |----------------| (bo_offset=n)
+ *
+ * .. note::
+ * We expect to see the same result for a request with a different bo
+ * and/or non-contiguous bo_offset.
+ *
+ *
+ * 15) Existent mappings is overlapped at the beginning by the requested mapping
+ * backed by a different BO. Hence, map the requested mapping and split up
+ * the existent one, adjusting it's BO offset accordingly.
+ *
+ * ::
+ *
+ * 1 a 3
+ * old: |-----------| (bo_offset=n)
+ *
+ * 0 b 2
+ * req: |-----------| (bo_offset=m)
+ *
+ * 0 b 2 a' 3
+ * new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2)
+ */
+
+/**
+ * DOC: Locking
+ *
+ * Generally, the GPU VA manager does not take care of locking itself, it is
+ * the drivers responsibility to take care about locking. Drivers might want to
+ * protect the following operations: inserting, removing and iterating
+ * &drm_gpuva objects as well as generating all kinds of operations, such as
+ * split / merge or prefetch.
+ *
+ * The GPU VA manager also does not take care of the locking of the backing
+ * &drm_gem_object buffers GPU VA lists by itself; drivers are responsible to
+ * enforce mutual exclusion using either the GEMs dma_resv lock or alternatively
+ * a driver specific external lock by setting the @DRM_GPUVA_MANAGER_LOCK_EXTERN
+ * flag.
+ *
+ * For the latter, functions such as drm_gpuva_link() or drm_gpuva_unlink()
+ * contain lockdep checks to indicate locking issues. For this to work drivers
+ * must provide (in case the @DRM_GPUVA_MANAGER_LOCK_EXTERN flag is set) their
+ * external lock with drm_gpuva_manager_set_ext_lock() after initialization.
+ */
+
+/**
+ * DOC: Examples
+ *
+ * This section gives two examples on how to let the DRM GPUVA Manager generate
+ * &drm_gpuva_op in order to satisfy a given map or unmap request and how to
+ * make use of them.
+ *
+ * The below code is strictly limited to illustrate the generic usage pattern.
+ * To maintain simplicitly, it doesn't make use of any abstractions for common
+ * code, different (asyncronous) stages with fence signalling critical paths,
+ * any other helpers or error handling in terms of freeing memory and dropping
+ * previously taken locks.
+ *
+ * 1) Obtain a list of &drm_gpuva_op to create a new mapping::
+ *
+ * // Allocates a new &drm_gpuva.
+ * struct drm_gpuva * driver_gpuva_alloc(void);
+ *
+ * // Typically drivers would embedd the &drm_gpuva_manager and &drm_gpuva
+ * // structure in individual driver structures and lock the dma-resv with
+ * // drm_exec or similar helpers.
+ * int driver_mapping_create(struct drm_gpuva_manager *mgr,
+ * u64 addr, u64 range,
+ * struct drm_gem_object *obj, u64 offset)
+ * {
+ * struct drm_gpuva_ops *ops;
+ * struct drm_gpuva_op *op
+ *
+ * driver_lock_va_space();
+ * ops = drm_gpuva_sm_map_ops_create(mgr, addr, range,
+ * obj, offset);
+ * if (IS_ERR(ops))
+ * return PTR_ERR(ops);
+ *
+ * drm_gpuva_for_each_op(op, ops) {
+ * struct drm_gpuva *va;
+ *
+ * switch (op->op) {
+ * case DRM_GPUVA_OP_MAP:
+ * va = driver_gpuva_alloc();
+ * if (!va)
+ * ; // unwind previous VA space updates,
+ * // free memory and unlock
+ *
+ * driver_vm_map();
+ * drm_gpuva_map(mgr, va, &op->map);
+ * drm_gpuva_link(va);
+ *
+ * break;
+ * case DRM_GPUVA_OP_REMAP: {
+ * struct drm_gpuva *prev = NULL, *next = NULL;
+ *
+ * va = op->remap.unmap->va;
+ *
+ * if (op->remap.prev) {
+ * prev = driver_gpuva_alloc();
+ * if (!prev)
+ * ; // unwind previous VA space
+ * // updates, free memory and
+ * // unlock
+ * }
+ *
+ * if (op->remap.next) {
+ * next = driver_gpuva_alloc();
+ * if (!next)
+ * ; // unwind previous VA space
+ * // updates, free memory and
+ * // unlock
+ * }
+ *
+ * driver_vm_remap();
+ * drm_gpuva_remap(prev, next, &op->remap);
+ *
+ * drm_gpuva_unlink(va);
+ * if (prev)
+ * drm_gpuva_link(prev);
+ * if (next)
+ * drm_gpuva_link(next);
+ *
+ * break;
+ * }
+ * case DRM_GPUVA_OP_UNMAP:
+ * va = op->unmap->va;
+ *
+ * driver_vm_unmap();
+ * drm_gpuva_unlink(va);
+ * drm_gpuva_unmap(&op->unmap);
+ *
+ * break;
+ * default:
+ * break;
+ * }
+ * }
+ * driver_unlock_va_space();
+ *
+ * return 0;
+ * }
+ *
+ * 2) Receive a callback for each &drm_gpuva_op to create a new mapping::
+ *
+ * struct driver_context {
+ * struct drm_gpuva_manager *mgr;
+ * struct drm_gpuva *new_va;
+ * struct drm_gpuva *prev_va;
+ * struct drm_gpuva *next_va;
+ * };
+ *
+ * // ops to pass to drm_gpuva_manager_init()
+ * static const struct drm_gpuva_fn_ops driver_gpuva_ops = {
+ * .sm_step_map = driver_gpuva_map,
+ * .sm_step_remap = driver_gpuva_remap,
+ * .sm_step_unmap = driver_gpuva_unmap,
+ * };
+ *
+ * // Typically drivers would embedd the &drm_gpuva_manager and &drm_gpuva
+ * // structure in individual driver structures and lock the dma-resv with
+ * // drm_exec or similar helpers.
+ * int driver_mapping_create(struct drm_gpuva_manager *mgr,
+ * u64 addr, u64 range,
+ * struct drm_gem_object *obj, u64 offset)
+ * {
+ * struct driver_context ctx;
+ * struct drm_gpuva_ops *ops;
+ * struct drm_gpuva_op *op;
+ * int ret = 0;
+ *
+ * ctx.mgr = mgr;
+ *
+ * ctx.new_va = kzalloc(sizeof(*ctx.new_va), GFP_KERNEL);
+ * ctx.prev_va = kzalloc(sizeof(*ctx.prev_va), GFP_KERNEL);
+ * ctx.next_va = kzalloc(sizeof(*ctx.next_va), GFP_KERNEL);
+ * if (!ctx.new_va || !ctx.prev_va || !ctx.next_va) {
+ * ret = -ENOMEM;
+ * goto out;
+ * }
+ *
+ * driver_lock_va_space();
+ * ret = drm_gpuva_sm_map(mgr, &ctx, addr, range, obj, offset);
+ * driver_unlock_va_space();
+ *
+ * out:
+ * kfree(ctx.new_va);
+ * kfree(ctx.prev_va);
+ * kfree(ctx.next_va);
+ * return ret;
+ * }
+ *
+ * int driver_gpuva_map(struct drm_gpuva_op *op, void *__ctx)
+ * {
+ * struct driver_context *ctx = __ctx;
+ *
+ * drm_gpuva_map(ctx->mgr, ctx->new_va, &op->map);
+ *
+ * drm_gpuva_link(ctx->new_va);
+ *
+ * // prevent the new GPUVA from being freed in
+ * // driver_mapping_create()
+ * ctx->new_va = NULL;
+ *
+ * return 0;
+ * }
+ *
+ * int driver_gpuva_remap(struct drm_gpuva_op *op, void *__ctx)
+ * {
+ * struct driver_context *ctx = __ctx;
+ *
+ * drm_gpuva_remap(ctx->prev_va, ctx->next_va, &op->remap);
+ *
+ * drm_gpuva_unlink(op->remap.unmap->va);
+ * kfree(op->remap.unmap->va);
+ *
+ * if (op->remap.prev) {
+ * drm_gpuva_link(ctx->prev_va);
+ * ctx->prev_va = NULL;
+ * }
+ *
+ * if (op->remap.next) {
+ * drm_gpuva_link(ctx->next_va);
+ * ctx->next_va = NULL;
+ * }
+ *
+ * return 0;
+ * }
+ *
+ * int driver_gpuva_unmap(struct drm_gpuva_op *op, void *__ctx)
+ * {
+ * drm_gpuva_unlink(op->unmap.va);
+ * drm_gpuva_unmap(&op->unmap);
+ * kfree(op->unmap.va);
+ *
+ * return 0;
+ * }
+ */
+
+#define to_drm_gpuva(__node) container_of((__node), struct drm_gpuva, rb.node)
+
+#define GPUVA_START(node) ((node)->va.addr)
+#define GPUVA_LAST(node) ((node)->va.addr + (node)->va.range - 1)
+
+/* We do not actually use drm_gpuva_it_next(), tell the compiler to not complain
+ * about this.
+ */
+INTERVAL_TREE_DEFINE(struct drm_gpuva, rb.node, u64, rb.__subtree_last,
+ GPUVA_START, GPUVA_LAST, static __attribute__((unused)),
+ drm_gpuva_it)
+
+static int __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
+ struct drm_gpuva *va);
+static void __drm_gpuva_remove(struct drm_gpuva *va);
+
+static inline bool
+drm_gpuva_check_overflow(u64 addr, u64 range)
+{
+ u64 end;
+
+ return WARN(check_add_overflow(addr, range, &end),
+ "GPUVA address limited to %lu bytes.\n", sizeof(end));
+}
+
+static inline bool
+drm_gpuva_in_mm_range(struct drm_gpuva_manager *mgr, u64 addr, u64 range)
+{
+ u64 end = addr + range;
+ u64 mm_start = mgr->mm_start;
+ u64 mm_end = mm_start + mgr->mm_range;
+
+ return addr >= mm_start && end <= mm_end;
+}
+
+static inline bool
+drm_gpuva_in_kernel_node(struct drm_gpuva_manager *mgr, u64 addr, u64 range)
+{
+ u64 end = addr + range;
+ u64 kstart = mgr->kernel_alloc_node.va.addr;
+ u64 krange = mgr->kernel_alloc_node.va.range;
+ u64 kend = kstart + krange;
+
+ return krange && addr < kend && kstart < end;
+}
+
+static inline bool
+drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
+ u64 addr, u64 range)
+{
+
+ return !drm_gpuva_check_overflow(addr, range) &&
+ drm_gpuva_in_mm_range(mgr, addr, range) &&
+ !drm_gpuva_in_kernel_node(mgr, addr, range);
+}
+
+/**
+ * drm_gpuva_manager_init - initialize a &drm_gpuva_manager
+ * @mgr: pointer to the &drm_gpuva_manager to initialize
+ * @name: the name of the GPU VA space
+ * @start_offset: the start offset of the GPU VA space
+ * @range: the size of the GPU VA space
+ * @reserve_offset: the start of the kernel reserved GPU VA area
+ * @reserve_range: the size of the kernel reserved GPU VA area
+ * @ops: &drm_gpuva_fn_ops called on &drm_gpuva_sm_map / &drm_gpuva_sm_unmap
+ * @flags: the feature flags for the &drm_gpuva_manager
+ *
+ * The &drm_gpuva_manager must be initialized with this function before use.
+ *
+ * Note that @mgr must be cleared to 0 before calling this function. The given
+ * &name is expected to be managed by the surrounding driver structures.
+ */
+void
+drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
+ const char *name,
+ u64 start_offset, u64 range,
+ u64 reserve_offset, u64 reserve_range,
+ const struct drm_gpuva_fn_ops *ops,
+ enum drm_gpuva_manager_flags flags)
+{
+ mgr->rb.tree = RB_ROOT_CACHED;
+ INIT_LIST_HEAD(&mgr->rb.list);
+
+ drm_gpuva_check_overflow(start_offset, range);
+ mgr->mm_start = start_offset;
+ mgr->mm_range = range;
+
+ mgr->name = name ? name : "unknown";
+ mgr->flags = flags;
+ mgr->ops = ops;
+
+ memset(&mgr->kernel_alloc_node, 0, sizeof(struct drm_gpuva));
+
+ if (reserve_range) {
+ mgr->kernel_alloc_node.va.addr = reserve_offset;
+ mgr->kernel_alloc_node.va.range = reserve_range;
+
+ if (likely(!drm_gpuva_check_overflow(reserve_offset,
+ reserve_range)))
+ __drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
+ }
+
+}
+EXPORT_SYMBOL(drm_gpuva_manager_init);
+
+/**
+ * drm_gpuva_manager_destroy - cleanup a &drm_gpuva_manager
+ * @mgr: pointer to the &drm_gpuva_manager to clean up
+ *
+ * Note that it is a bug to call this function on a manager that still
+ * holds GPU VA mappings.
+ */
+void
+drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
+{
+ mgr->name = NULL;
+
+ if (mgr->kernel_alloc_node.va.range)
+ __drm_gpuva_remove(&mgr->kernel_alloc_node);
+
+ WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
+ "GPUVA tree is not empty, potentially leaking memory.");
+}
+EXPORT_SYMBOL(drm_gpuva_manager_destroy);
+
+static int
+__drm_gpuva_insert(struct drm_gpuva_manager *mgr,
+ struct drm_gpuva *va)
+{
+ struct rb_node *node;
+ struct list_head *head;
+
+ if (drm_gpuva_it_iter_first(&mgr->rb.tree,
+ GPUVA_START(va),
+ GPUVA_LAST(va)))
+ return -EEXIST;
+
+ va->mgr = mgr;
+
+ drm_gpuva_it_insert(va, &mgr->rb.tree);
+
+ node = rb_prev(&va->rb.node);
+ if (node)
+ head = &(to_drm_gpuva(node))->rb.entry;
+ else
+ head = &mgr->rb.list;
+
+ list_add(&va->rb.entry, head);
+
+ return 0;
+}
+
+/**
+ * drm_gpuva_insert - insert a &drm_gpuva
+ * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in
+ * @va: the &drm_gpuva to insert
+ *
+ * Insert a &drm_gpuva with a given address and range into a
+ * &drm_gpuva_manager.
+ *
+ * It is safe to use this function using the safe versions of iterating the GPU
+ * VA space, such as drm_gpuva_for_each_va_safe() and
+ * drm_gpuva_for_each_va_range_safe().
+ *
+ * Returns: 0 on success, negative error code on failure.
+ */
+int
+drm_gpuva_insert(struct drm_gpuva_manager *mgr,
+ struct drm_gpuva *va)
+{
+ u64 addr = va->va.addr;
+ u64 range = va->va.range;
+
+ if (unlikely(!drm_gpuva_range_valid(mgr, addr, range)))
+ return -EINVAL;
+
+ return __drm_gpuva_insert(mgr, va);
+}
+EXPORT_SYMBOL(drm_gpuva_insert);
+
+static void
+__drm_gpuva_remove(struct drm_gpuva *va)
+{
+ drm_gpuva_it_remove(va, &va->mgr->rb.tree);
+ list_del_init(&va->rb.entry);
+}
+
+/**
+ * drm_gpuva_remove - remove a &drm_gpuva
+ * @va: the &drm_gpuva to remove
+ *
+ * This removes the given &va from the underlaying tree.
+ *
+ * It is safe to use this function using the safe versions of iterating the GPU
+ * VA space, such as drm_gpuva_for_each_va_safe() and
+ * drm_gpuva_for_each_va_range_safe().
+ */
+void
+drm_gpuva_remove(struct drm_gpuva *va)
+{
+ struct drm_gpuva_manager *mgr = va->mgr;
+
+ if (unlikely(va == &mgr->kernel_alloc_node)) {
+ WARN(1, "Can't destroy kernel reserved node.\n");
+ return;
+ }
+
+ __drm_gpuva_remove(va);
+}
+EXPORT_SYMBOL(drm_gpuva_remove);
+
+/**
+ * drm_gpuva_link - link a &drm_gpuva
+ * @va: the &drm_gpuva to link
+ *
+ * This adds the given &va to the GPU VA list of the &drm_gem_object it is
+ * associated with.
+ *
+ * This function expects the caller to protect the GEM's GPUVA list against
+ * concurrent access using the GEMs dma_resv lock.
+ */
+void
+drm_gpuva_link(struct drm_gpuva *va)
+{
+ struct drm_gpuva_manager *mgr = va->mgr;
+ struct drm_gem_object *obj = va->gem.obj;
+
+ if (unlikely(!obj))
+ return;
+
+ if (drm_gpuva_manager_external_lock(mgr))
+ drm_gpuva_manager_ext_assert_held(mgr);
+ else
+ dma_resv_assert_held(obj->resv);
+
+ list_add_tail(&va->gem.entry, &obj->gpuva.list);
+}
+EXPORT_SYMBOL(drm_gpuva_link);
+
+/**
+ * drm_gpuva_unlink - unlink a &drm_gpuva
+ * @va: the &drm_gpuva to unlink
+ *
+ * This removes the given &va from the GPU VA list of the &drm_gem_object it is
+ * associated with.
+ *
+ * This function expects the caller to protect the GEM's GPUVA list against
+ * concurrent access using the GEMs dma_resv lock.
+ */
+void
+drm_gpuva_unlink(struct drm_gpuva *va)
+{
+ struct drm_gpuva_manager *mgr = va->mgr;
+ struct drm_gem_object *obj = va->gem.obj;
+
+ if (unlikely(!obj))
+ return;
+
+ if (drm_gpuva_manager_external_lock(mgr))
+ drm_gpuva_manager_ext_assert_held(mgr);
+ else
+ dma_resv_assert_held(obj->resv);
+
+ list_del_init(&va->gem.entry);
+}
+EXPORT_SYMBOL(drm_gpuva_unlink);
+
+/**
+ * drm_gpuva_find_first - find the first &drm_gpuva in the given range
+ * @mgr: the &drm_gpuva_manager to search in
+ * @addr: the &drm_gpuvas address
+ * @range: the &drm_gpuvas range
+ *
+ * Returns: the first &drm_gpuva within the given range
+ */
+struct drm_gpuva *
+drm_gpuva_find_first(struct drm_gpuva_manager *mgr,
+ u64 addr, u64 range)
+{
+ u64 last = addr + range - 1;
+
+ return drm_gpuva_it_iter_first(&mgr->rb.tree, addr, last);
+}
+EXPORT_SYMBOL(drm_gpuva_find_first);
+
+/**
+ * drm_gpuva_find - find a &drm_gpuva
+ * @mgr: the &drm_gpuva_manager to search in
+ * @addr: the &drm_gpuvas address
+ * @range: the &drm_gpuvas range
+ *
+ * Returns: the &drm_gpuva at a given &addr and with a given &range
+ */
+struct drm_gpuva *
+drm_gpuva_find(struct drm_gpuva_manager *mgr,
+ u64 addr, u64 range)
+{
+ struct drm_gpuva *va;
+
+ va = drm_gpuva_find_first(mgr, addr, range);
+ if (!va)
+ goto out;
+
+ if (va->va.addr != addr ||
+ va->va.range != range)
+ goto out;
+
+ return va;
+
+out:
+ return NULL;
+}
+EXPORT_SYMBOL(drm_gpuva_find);
+
+/**
+ * drm_gpuva_find_prev - find the &drm_gpuva before the given address
+ * @mgr: the &drm_gpuva_manager to search in
+ * @start: the given GPU VA's start address
+ *
+ * Find the adjacent &drm_gpuva before the GPU VA with given &start address.
+ *
+ * Note that if there is any free space between the GPU VA mappings no mapping
+ * is returned.
+ *
+ * Returns: a pointer to the found &drm_gpuva or NULL if none was found
+ */
+struct drm_gpuva *
+drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start)
+{
+ if (!drm_gpuva_range_valid(mgr, start - 1, 1))
+ return NULL;
+
+ return drm_gpuva_it_iter_first(&mgr->rb.tree, start - 1, start);
+}
+EXPORT_SYMBOL(drm_gpuva_find_prev);
+
+/**
+ * drm_gpuva_find_next - find the &drm_gpuva after the given address
+ * @mgr: the &drm_gpuva_manager to search in
+ * @end: the given GPU VA's end address
+ *
+ * Find the adjacent &drm_gpuva after the GPU VA with given &end address.
+ *
+ * Note that if there is any free space between the GPU VA mappings no mapping
+ * is returned.
+ *
+ * Returns: a pointer to the found &drm_gpuva or NULL if none was found
+ */
+struct drm_gpuva *
+drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end)
+{
+ if (!drm_gpuva_range_valid(mgr, end, 1))
+ return NULL;
+
+ return drm_gpuva_it_iter_first(&mgr->rb.tree, end, end + 1);
+}
+EXPORT_SYMBOL(drm_gpuva_find_next);
+
+/**
+ * drm_gpuva_interval_empty - indicate whether a given interval of the VA space
+ * is empty
+ * @mgr: the &drm_gpuva_manager to check the range for
+ * @addr: the start address of the range
+ * @range: the range of the interval
+ *
+ * Returns: true if the interval is empty, false otherwise
+ */
+bool
+drm_gpuva_interval_empty(struct drm_gpuva_manager *mgr, u64 addr, u64 range)
+{
+ return !drm_gpuva_find_first(mgr, addr, range);
+}
+EXPORT_SYMBOL(drm_gpuva_interval_empty);
+
+/**
+ * drm_gpuva_map - helper to insert a &drm_gpuva according to a
+ * &drm_gpuva_op_map
+ * @mgr: the &drm_gpuva_manager
+ * @va: the &drm_gpuva to insert
+ * @op: the &drm_gpuva_op_map to initialize @va with
+ *
+ * Initializes the @va from the @op and inserts it into the given @mgr.
+ */
+void
+drm_gpuva_map(struct drm_gpuva_manager *mgr,
+ struct drm_gpuva *va,
+ struct drm_gpuva_op_map *op)
+{
+ drm_gpuva_init_from_op(va, op);
+ drm_gpuva_insert(mgr, va);
+}
+EXPORT_SYMBOL(drm_gpuva_map);
+
+/**
+ * drm_gpuva_remap - helper to remap a &drm_gpuva according to a
+ * &drm_gpuva_op_remap
+ * @prev: the &drm_gpuva to remap when keeping the start of a mapping
+ * @next: the &drm_gpuva to remap when keeping the end of a mapping
+ * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
+ *
+ * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
+ * @next.
+ */
+void
+drm_gpuva_remap(struct drm_gpuva *prev,
+ struct drm_gpuva *next,
+ struct drm_gpuva_op_remap *op)
+{
+ struct drm_gpuva *curr = op->unmap->va;
+ struct drm_gpuva_manager *mgr = curr->mgr;
+ struct drm_gpuva_op_map *map;
+
+ drm_gpuva_remove(curr);
+
+ if ((map = op->prev)) {
+ drm_gpuva_init_from_op(prev, map);
+ drm_gpuva_insert(mgr, prev);
+ }
+
+ if ((map = op->next)) {
+ drm_gpuva_init_from_op(next, map);
+ drm_gpuva_insert(mgr, next);
+ }
+}
+EXPORT_SYMBOL(drm_gpuva_remap);
+
+/**
+ * drm_gpuva_unmap - helper to remove a &drm_gpuva according to a
+ * &drm_gpuva_op_unmap
+ * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
+ *
+ * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap.
+ */
+void
+drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
+{
+ drm_gpuva_remove(op->va);
+}
+EXPORT_SYMBOL(drm_gpuva_unmap);
+
+static int
+op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
+ u64 addr, u64 range,
+ struct drm_gem_object *obj, u64 offset)
+{
+ struct drm_gpuva_op op = {};
+
+ op.op = DRM_GPUVA_OP_MAP;
+ op.map.va.addr = addr;
+ op.map.va.range = range;
+ op.map.gem.obj = obj;
+ op.map.gem.offset = offset;
+
+ return fn->sm_step_map(&op, priv);
+}
+
+static int
+op_remap_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
+ struct drm_gpuva_op_map *prev,
+ struct drm_gpuva_op_map *next,
+ struct drm_gpuva_op_unmap *unmap)
+{
+ struct drm_gpuva_op op = {};
+ struct drm_gpuva_op_remap *r;
+
+ op.op = DRM_GPUVA_OP_REMAP;
+ r = &op.remap;
+ r->prev = prev;
+ r->next = next;
+ r->unmap = unmap;
+
+ return fn->sm_step_remap(&op, priv);
+}
+
+static int
+op_unmap_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
+ struct drm_gpuva *va, bool merge)
+{
+ struct drm_gpuva_op op = {};
+
+ op.op = DRM_GPUVA_OP_UNMAP;
+ op.unmap.va = va;
+ op.unmap.keep = merge;
+
+ return fn->sm_step_unmap(&op, priv);
+}
+
+static int
+__drm_gpuva_sm_map(struct drm_gpuva_manager *mgr,
+ const struct drm_gpuva_fn_ops *ops, void *priv,
+ u64 req_addr, u64 req_range,
+ struct drm_gem_object *req_obj, u64 req_offset)
+{
+ struct drm_gpuva *va, *next, *prev = NULL;
+ u64 req_end = req_addr + req_range;
+ int ret;
+
+ if (unlikely(!drm_gpuva_range_valid(mgr, req_addr, req_range)))
+ return -EINVAL;
+
+ drm_gpuva_for_each_va_range_safe(va, next, mgr, req_addr, req_end) {
+ struct drm_gem_object *obj = va->gem.obj;
+ u64 offset = va->gem.offset;
+ u64 addr = va->va.addr;
+ u64 range = va->va.range;
+ u64 end = addr + range;
+ bool merge = !!va->gem.obj;
+
+ if (addr == req_addr) {
+ merge &= obj == req_obj &&
+ offset == req_offset;
+
+ if (end == req_end) {
+ ret = op_unmap_cb(ops, priv, va, merge);
+ if (ret)
+ return ret;
+ break;
+ }
+
+ if (end < req_end) {
+ ret = op_unmap_cb(ops, priv, va, merge);
+ if (ret)
+ return ret;
+ goto next;
+ }
+
+ if (end > req_end) {
+ struct drm_gpuva_op_map n = {
+ .va.addr = req_end,
+ .va.range = range - req_range,
+ .gem.obj = obj,
+ .gem.offset = offset + req_range,
+ };
+ struct drm_gpuva_op_unmap u = {
+ .va = va,
+ .keep = merge,
+ };
+
+ ret = op_remap_cb(ops, priv, NULL, &n, &u);
+ if (ret)
+ return ret;
+ break;
+ }
+ } else if (addr < req_addr) {
+ u64 ls_range = req_addr - addr;
+ struct drm_gpuva_op_map p = {
+ .va.addr = addr,
+ .va.range = ls_range,
+ .gem.obj = obj,
+ .gem.offset = offset,
+ };
+ struct drm_gpuva_op_unmap u = { .va = va };
+
+ merge &= obj == req_obj &&
+ offset + ls_range == req_offset;
+ u.keep = merge;
+
+ if (end == req_end) {
+ ret = op_remap_cb(ops, priv, &p, NULL, &u);
+ if (ret)
+ return ret;
+ break;
+ }
+
+ if (end < req_end) {
+ ret = op_remap_cb(ops, priv, &p, NULL, &u);
+ if (ret)
+ return ret;
+ goto next;
+ }
+
+ if (end > req_end) {
+ struct drm_gpuva_op_map n = {
+ .va.addr = req_end,
+ .va.range = end - req_end,
+ .gem.obj = obj,
+ .gem.offset = offset + ls_range +
+ req_range,
+ };
+
+ ret = op_remap_cb(ops, priv, &p, &n, &u);
+ if (ret)
+ return ret;
+ break;
+ }
+ } else if (addr > req_addr) {
+ merge &= obj == req_obj &&
+ offset == req_offset +
+ (addr - req_addr);
+
+ if (end == req_end) {
+ ret = op_unmap_cb(ops, priv, va, merge);
+ if (ret)
+ return ret;
+ break;
+ }
+
+ if (end < req_end) {
+ ret = op_unmap_cb(ops, priv, va, merge);
+ if (ret)
+ return ret;
+ goto next;
+ }
+
+ if (end > req_end) {
+ struct drm_gpuva_op_map n = {
+ .va.addr = req_end,
+ .va.range = end - req_end,
+ .gem.obj = obj,
+ .gem.offset = offset + req_end - addr,
+ };
+ struct drm_gpuva_op_unmap u = {
+ .va = va,
+ .keep = merge,
+ };
+
+ ret = op_remap_cb(ops, priv, NULL, &n, &u);
+ if (ret)
+ return ret;
+ break;
+ }
+ }
+next:
+ prev = va;
+ }
+
+ return op_map_cb(ops, priv,
+ req_addr, req_range,
+ req_obj, req_offset);
+}
+
+static int
+__drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr,
+ const struct drm_gpuva_fn_ops *ops, void *priv,
+ u64 req_addr, u64 req_range)
+{
+ struct drm_gpuva *va, *next;
+ u64 req_end = req_addr + req_range;
+ int ret;
+
+ if (unlikely(!drm_gpuva_range_valid(mgr, req_addr, req_range)))
+ return -EINVAL;
+
+ drm_gpuva_for_each_va_range_safe(va, next, mgr, req_addr, req_end) {
+ struct drm_gpuva_op_map prev = {}, next = {};
+ bool prev_split = false, next_split = false;
+ struct drm_gem_object *obj = va->gem.obj;
+ u64 offset = va->gem.offset;
+ u64 addr = va->va.addr;
+ u64 range = va->va.range;
+ u64 end = addr + range;
+
+ if (addr < req_addr) {
+ prev.va.addr = addr;
+ prev.va.range = req_addr - addr;
+ prev.gem.obj = obj;
+ prev.gem.offset = offset;
+
+ prev_split = true;
+ }
+
+ if (end > req_end) {
+ next.va.addr = req_end;
+ next.va.range = end - req_end;
+ next.gem.obj = obj;
+ next.gem.offset = offset + (req_end - addr);
+
+ next_split = true;
+ }
+
+ if (prev_split || next_split) {
+ struct drm_gpuva_op_unmap unmap = { .va = va };
+
+ ret = op_remap_cb(ops, priv,
+ prev_split ? &prev : NULL,
+ next_split ? &next : NULL,
+ &unmap);
+ if (ret)
+ return ret;
+ } else {
+ ret = op_unmap_cb(ops, priv, va, false);
+ if (ret)
+ return ret;
+ }
+ }
+
+ return 0;
+}
+
+/**
+ * drm_gpuva_sm_map - creates the &drm_gpuva_op split/merge steps
+ * @mgr: the &drm_gpuva_manager representing the GPU VA space
+ * @req_addr: the start address of the new mapping
+ * @req_range: the range of the new mapping
+ * @req_obj: the &drm_gem_object to map
+ * @req_offset: the offset within the &drm_gem_object
+ * @priv: pointer to a driver private data structure
+ *
+ * This function iterates the given range of the GPU VA space. It utilizes the
+ * &drm_gpuva_fn_ops to call back into the driver providing the split and merge
+ * steps.
+ *
+ * Drivers may use these callbacks to update the GPU VA space right away within
+ * the callback. In case the driver decides to copy and store the operations for
+ * later processing neither this function nor &drm_gpuva_sm_unmap is allowed to
+ * be called before the &drm_gpuva_manager's view of the GPU VA space was
+ * updated with the previous set of operations. To update the
+ * &drm_gpuva_manager's view of the GPU VA space drm_gpuva_insert(),
+ * drm_gpuva_destroy_locked() and/or drm_gpuva_destroy_unlocked() should be
+ * used.
+ *
+ * A sequence of callbacks can contain map, unmap and remap operations, but
+ * the sequence of callbacks might also be empty if no operation is required,
+ * e.g. if the requested mapping already exists in the exact same way.
+ *
+ * There can be an arbitrary amount of unmap operations, a maximum of two remap
+ * operations and a single map operation. The latter one represents the original
+ * map operation requested by the caller.
+ *
+ * Returns: 0 on success or a negative error code
+ */
+int
+drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
+ u64 req_addr, u64 req_range,
+ struct drm_gem_object *req_obj, u64 req_offset)
+{
+ const struct drm_gpuva_fn_ops *ops = mgr->ops;
+
+ if (unlikely(!(ops && ops->sm_step_map &&
+ ops->sm_step_remap &&
+ ops->sm_step_unmap)))
+ return -EINVAL;
+
+ return __drm_gpuva_sm_map(mgr, ops, priv,
+ req_addr, req_range,
+ req_obj, req_offset);
+}
+EXPORT_SYMBOL(drm_gpuva_sm_map);
+
+/**
+ * drm_gpuva_sm_unmap - creates the &drm_gpuva_ops to split on unmap
+ * @mgr: the &drm_gpuva_manager representing the GPU VA space
+ * @priv: pointer to a driver private data structure
+ * @req_addr: the start address of the range to unmap
+ * @req_range: the range of the mappings to unmap
+ *
+ * This function iterates the given range of the GPU VA space. It utilizes the
+ * &drm_gpuva_fn_ops to call back into the driver providing the operations to
+ * unmap and, if required, split existent mappings.
+ *
+ * Drivers may use these callbacks to update the GPU VA space right away within
+ * the callback. In case the driver decides to copy and store the operations for
+ * later processing neither this function nor &drm_gpuva_sm_map is allowed to be
+ * called before the &drm_gpuva_manager's view of the GPU VA space was updated
+ * with the previous set of operations. To update the &drm_gpuva_manager's view
+ * of the GPU VA space drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or
+ * drm_gpuva_destroy_unlocked() should be used.
+ *
+ * A sequence of callbacks can contain unmap and remap operations, depending on
+ * whether there are actual overlapping mappings to split.
+ *
+ * There can be an arbitrary amount of unmap operations and a maximum of two
+ * remap operations.
+ *
+ * Returns: 0 on success or a negative error code
+ */
+int
+drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
+ u64 req_addr, u64 req_range)
+{
+ const struct drm_gpuva_fn_ops *ops = mgr->ops;
+
+ if (unlikely(!(ops && ops->sm_step_remap &&
+ ops->sm_step_unmap)))
+ return -EINVAL;
+
+ return __drm_gpuva_sm_unmap(mgr, ops, priv,
+ req_addr, req_range);
+}
+EXPORT_SYMBOL(drm_gpuva_sm_unmap);
+
+static struct drm_gpuva_op *
+gpuva_op_alloc(struct drm_gpuva_manager *mgr)
+{
+ const struct drm_gpuva_fn_ops *fn = mgr->ops;
+ struct drm_gpuva_op *op;
+
+ if (fn && fn->op_alloc)
+ op = fn->op_alloc();
+ else
+ op = kzalloc(sizeof(*op), GFP_KERNEL);
+
+ if (unlikely(!op))
+ return NULL;
+
+ return op;
+}
+
+static void
+gpuva_op_free(struct drm_gpuva_manager *mgr,
+ struct drm_gpuva_op *op)
+{
+ const struct drm_gpuva_fn_ops *fn = mgr->ops;
+
+ if (fn && fn->op_free)
+ fn->op_free(op);
+ else
+ kfree(op);
+}
+
+static int
+drm_gpuva_sm_step(struct drm_gpuva_op *__op,
+ void *priv)
+{
+ struct {
+ struct drm_gpuva_manager *mgr;
+ struct drm_gpuva_ops *ops;
+ } *args = priv;
+ struct drm_gpuva_manager *mgr = args->mgr;
+ struct drm_gpuva_ops *ops = args->ops;
+ struct drm_gpuva_op *op;
+
+ op = gpuva_op_alloc(mgr);
+ if (unlikely(!op))
+ goto err;
+
+ memcpy(op, __op, sizeof(*op));
+
+ if (op->op == DRM_GPUVA_OP_REMAP) {
+ struct drm_gpuva_op_remap *__r = &__op->remap;
+ struct drm_gpuva_op_remap *r = &op->remap;
+
+ r->unmap = kmemdup(__r->unmap, sizeof(*r->unmap),
+ GFP_KERNEL);
+ if (unlikely(!r->unmap))
+ goto err_free_op;
+
+ if (__r->prev) {
+ r->prev = kmemdup(__r->prev, sizeof(*r->prev),
+ GFP_KERNEL);
+ if (unlikely(!r->prev))
+ goto err_free_unmap;
+ }
+
+ if (__r->next) {
+ r->next = kmemdup(__r->next, sizeof(*r->next),
+ GFP_KERNEL);
+ if (unlikely(!r->next))
+ goto err_free_prev;
+ }
+ }
+
+ list_add_tail(&op->entry, &ops->list);
+
+ return 0;
+
+err_free_unmap:
+ kfree(op->remap.unmap);
+err_free_prev:
+ kfree(op->remap.prev);
+err_free_op:
+ gpuva_op_free(mgr, op);
+err:
+ return -ENOMEM;
+}
+
+static const struct drm_gpuva_fn_ops gpuva_list_ops = {
+ .sm_step_map = drm_gpuva_sm_step,
+ .sm_step_remap = drm_gpuva_sm_step,
+ .sm_step_unmap = drm_gpuva_sm_step,
+};
+
+/**
+ * drm_gpuva_sm_map_ops_create - creates the &drm_gpuva_ops to split and merge
+ * @mgr: the &drm_gpuva_manager representing the GPU VA space
+ * @req_addr: the start address of the new mapping
+ * @req_range: the range of the new mapping
+ * @req_obj: the &drm_gem_object to map
+ * @req_offset: the offset within the &drm_gem_object
+ *
+ * This function creates a list of operations to perform splitting and merging
+ * of existent mapping(s) with the newly requested one.
+ *
+ * The list can be iterated with &drm_gpuva_for_each_op and must be processed
+ * in the given order. It can contain map, unmap and remap operations, but it
+ * also can be empty if no operation is required, e.g. if the requested mapping
+ * already exists is the exact same way.
+ *
+ * There can be an arbitrary amount of unmap operations, a maximum of two remap
+ * operations and a single map operation. The latter one represents the original
+ * map operation requested by the caller.
+ *
+ * Note that before calling this function again with another mapping request it
+ * is necessary to update the &drm_gpuva_manager's view of the GPU VA space. The
+ * previously obtained operations must be either processed or abandoned. To
+ * update the &drm_gpuva_manager's view of the GPU VA space drm_gpuva_insert(),
+ * drm_gpuva_destroy_locked() and/or drm_gpuva_destroy_unlocked() should be
+ * used.
+ *
+ * After the caller finished processing the returned &drm_gpuva_ops, they must
+ * be freed with &drm_gpuva_ops_free.
+ *
+ * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure
+ */
+struct drm_gpuva_ops *
+drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr,
+ u64 req_addr, u64 req_range,
+ struct drm_gem_object *req_obj, u64 req_offset)
+{
+ struct drm_gpuva_ops *ops;
+ struct {
+ struct drm_gpuva_manager *mgr;
+ struct drm_gpuva_ops *ops;
+ } args;
+ int ret;
+
+ ops = kzalloc(sizeof(*ops), GFP_KERNEL);
+ if (unlikely(!ops))
+ return ERR_PTR(-ENOMEM);
+
+ INIT_LIST_HEAD(&ops->list);
+
+ args.mgr = mgr;
+ args.ops = ops;
+
+ ret = __drm_gpuva_sm_map(mgr, &gpuva_list_ops, &args,
+ req_addr, req_range,
+ req_obj, req_offset);
+ if (ret)
+ goto err_free_ops;
+
+ return ops;
+
+err_free_ops:
+ drm_gpuva_ops_free(mgr, ops);
+ return ERR_PTR(ret);
+}
+EXPORT_SYMBOL(drm_gpuva_sm_map_ops_create);
+
+/**
+ * drm_gpuva_sm_unmap_ops_create - creates the &drm_gpuva_ops to split on unmap
+ * @mgr: the &drm_gpuva_manager representing the GPU VA space
+ * @req_addr: the start address of the range to unmap
+ * @req_range: the range of the mappings to unmap
+ *
+ * This function creates a list of operations to perform unmapping and, if
+ * required, splitting of the mappings overlapping the unmap range.
+ *
+ * The list can be iterated with &drm_gpuva_for_each_op and must be processed
+ * in the given order. It can contain unmap and remap operations, depending on
+ * whether there are actual overlapping mappings to split.
+ *
+ * There can be an arbitrary amount of unmap operations and a maximum of two
+ * remap operations.
+ *
+ * Note that before calling this function again with another range to unmap it
+ * is necessary to update the &drm_gpuva_manager's view of the GPU VA space. The
+ * previously obtained operations must be processed or abandoned. To update the
+ * &drm_gpuva_manager's view of the GPU VA space drm_gpuva_insert(),
+ * drm_gpuva_destroy_locked() and/or drm_gpuva_destroy_unlocked() should be
+ * used.
+ *
+ * After the caller finished processing the returned &drm_gpuva_ops, they must
+ * be freed with &drm_gpuva_ops_free.
+ *
+ * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure
+ */
+struct drm_gpuva_ops *
+drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr,
+ u64 req_addr, u64 req_range)
+{
+ struct drm_gpuva_ops *ops;
+ struct {
+ struct drm_gpuva_manager *mgr;
+ struct drm_gpuva_ops *ops;
+ } args;
+ int ret;
+
+ ops = kzalloc(sizeof(*ops), GFP_KERNEL);
+ if (unlikely(!ops))
+ return ERR_PTR(-ENOMEM);
+
+ INIT_LIST_HEAD(&ops->list);
+
+ args.mgr = mgr;
+ args.ops = ops;
+
+ ret = __drm_gpuva_sm_unmap(mgr, &gpuva_list_ops, &args,
+ req_addr, req_range);
+ if (ret)
+ goto err_free_ops;
+
+ return ops;
+
+err_free_ops:
+ drm_gpuva_ops_free(mgr, ops);
+ return ERR_PTR(ret);
+}
+EXPORT_SYMBOL(drm_gpuva_sm_unmap_ops_create);
+
+/**
+ * drm_gpuva_prefetch_ops_create - creates the &drm_gpuva_ops to prefetch
+ * @mgr: the &drm_gpuva_manager representing the GPU VA space
+ * @addr: the start address of the range to prefetch
+ * @range: the range of the mappings to prefetch
+ *
+ * This function creates a list of operations to perform prefetching.
+ *
+ * The list can be iterated with &drm_gpuva_for_each_op and must be processed
+ * in the given order. It can contain prefetch operations.
+ *
+ * There can be an arbitrary amount of prefetch operations.
+ *
+ * After the caller finished processing the returned &drm_gpuva_ops, they must
+ * be freed with &drm_gpuva_ops_free.
+ *
+ * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure
+ */
+struct drm_gpuva_ops *
+drm_gpuva_prefetch_ops_create(struct drm_gpuva_manager *mgr,
+ u64 addr, u64 range)
+{
+ struct drm_gpuva_ops *ops;
+ struct drm_gpuva_op *op;
+ struct drm_gpuva *va;
+ u64 end = addr + range;
+ int ret;
+
+ ops = kzalloc(sizeof(*ops), GFP_KERNEL);
+ if (!ops)
+ return ERR_PTR(-ENOMEM);
+
+ INIT_LIST_HEAD(&ops->list);
+
+ drm_gpuva_for_each_va_range(va, mgr, addr, end) {
+ op = gpuva_op_alloc(mgr);
+ if (!op) {
+ ret = -ENOMEM;
+ goto err_free_ops;
+ }
+
+ op->op = DRM_GPUVA_OP_PREFETCH;
+ op->prefetch.va = va;
+ list_add_tail(&op->entry, &ops->list);
+ }
+
+ return ops;
+
+err_free_ops:
+ drm_gpuva_ops_free(mgr, ops);
+ return ERR_PTR(ret);
+}
+EXPORT_SYMBOL(drm_gpuva_prefetch_ops_create);
+
+/**
+ * drm_gpuva_gem_unmap_ops_create - creates the &drm_gpuva_ops to unmap a GEM
+ * @mgr: the &drm_gpuva_manager representing the GPU VA space
+ * @obj: the &drm_gem_object to unmap
+ *
+ * This function creates a list of operations to perform unmapping for every
+ * GPUVA attached to a GEM.
+ *
+ * The list can be iterated with &drm_gpuva_for_each_op and consists out of an
+ * arbitrary amount of unmap operations.
+ *
+ * After the caller finished processing the returned &drm_gpuva_ops, they must
+ * be freed with &drm_gpuva_ops_free.
+ *
+ * It is the callers responsibility to protect the GEMs GPUVA list against
+ * concurrent access using the GEMs dma_resv lock.
+ *
+ * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure
+ */
+struct drm_gpuva_ops *
+drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
+ struct drm_gem_object *obj)
+{
+ struct drm_gpuva_ops *ops;
+ struct drm_gpuva_op *op;
+ struct drm_gpuva *va;
+ int ret;
+
+ if (drm_gpuva_manager_external_lock(mgr))
+ drm_gpuva_manager_ext_assert_held(mgr);
+ else
+ dma_resv_assert_held(obj->resv);
+
+ ops = kzalloc(sizeof(*ops), GFP_KERNEL);
+ if (!ops)
+ return ERR_PTR(-ENOMEM);
+
+ INIT_LIST_HEAD(&ops->list);
+
+ drm_gem_for_each_gpuva(va, obj) {
+ op = gpuva_op_alloc(mgr);
+ if (!op) {
+ ret = -ENOMEM;
+ goto err_free_ops;
+ }
+
+ op->op = DRM_GPUVA_OP_UNMAP;
+ op->unmap.va = va;
+ list_add_tail(&op->entry, &ops->list);
+ }
+
+ return ops;
+
+err_free_ops:
+ drm_gpuva_ops_free(mgr, ops);
+ return ERR_PTR(ret);
+}
+EXPORT_SYMBOL(drm_gpuva_gem_unmap_ops_create);
+
+
+/**
+ * drm_gpuva_ops_free - free the given &drm_gpuva_ops
+ * @mgr: the &drm_gpuva_manager the ops were created for
+ * @ops: the &drm_gpuva_ops to free
+ *
+ * Frees the given &drm_gpuva_ops structure including all the ops associated
+ * with it.
+ */
+void
+drm_gpuva_ops_free(struct drm_gpuva_manager *mgr,
+ struct drm_gpuva_ops *ops)
+{
+ struct drm_gpuva_op *op, *next;
+
+ drm_gpuva_for_each_op_safe(op, next, ops) {
+ list_del(&op->entry);
+
+ if (op->op == DRM_GPUVA_OP_REMAP) {
+ kfree(op->remap.prev);
+ kfree(op->remap.next);
+ kfree(op->remap.unmap);
+ }
+
+ gpuva_op_free(mgr, op);
+ }
+
+ kfree(ops);
+}
+EXPORT_SYMBOL(drm_gpuva_ops_free);
diff --git a/include/drm/drm_drv.h b/include/drm/drm_drv.h
index 89e2706cac56..04dbe223b1a5 100644
--- a/include/drm/drm_drv.h
+++ b/include/drm/drm_drv.h
@@ -104,6 +104,12 @@ enum drm_driver_feature {
* acceleration should be handled by two drivers that are connected using auxiliary bus.
*/
DRIVER_COMPUTE_ACCEL = BIT(7),
+ /**
+ * @DRIVER_GEM_GPUVA:
+ *
+ * Driver supports user defined GPU VA bindings for GEM objects.
+ */
+ DRIVER_GEM_GPUVA = BIT(8),

/* IMPORTANT: Below are all the legacy flags, add new ones above. */

diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
index bbc721870c13..5ec8148a30ee 100644
--- a/include/drm/drm_gem.h
+++ b/include/drm/drm_gem.h
@@ -36,6 +36,8 @@

#include <linux/kref.h>
#include <linux/dma-resv.h>
+#include <linux/list.h>
+#include <linux/mutex.h>

#include <drm/drm_vma_manager.h>

@@ -379,6 +381,18 @@ struct drm_gem_object {
*/
struct dma_resv _resv;

+ /**
+ * @gpuva:
+ *
+ * Provides the list of GPU VAs attached to this GEM object.
+ *
+ * Drivers should lock list accesses with the GEMs &dma_resv lock
+ * (&drm_gem_object.resv).
+ */
+ struct {
+ struct list_head list;
+ } gpuva;
+
/**
* @funcs:
*
@@ -526,4 +540,42 @@ unsigned long drm_gem_lru_scan(struct drm_gem_lru *lru,

int drm_gem_evict(struct drm_gem_object *obj);

+/**
+ * drm_gem_gpuva_init - initialize the gpuva list of a GEM object
+ * @obj: the &drm_gem_object
+ *
+ * This initializes the &drm_gem_object's &drm_gpuva list.
+ *
+ * Calling this function is only necessary for drivers intending to support the
+ * &drm_driver_feature DRIVER_GEM_GPUVA.
+ */
+static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
+{
+ INIT_LIST_HEAD(&obj->gpuva.list);
+}
+
+/**
+ * drm_gem_for_each_gpuva - iternator to walk over a list of gpuvas
+ * @entry: &drm_gpuva structure to assign to in each iteration step
+ * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with
+ *
+ * This iterator walks over all &drm_gpuva structures associated with the
+ * &drm_gpuva_manager.
+ */
+#define drm_gem_for_each_gpuva(entry__, obj__) \
+ list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
+
+/**
+ * drm_gem_for_each_gpuva_safe - iternator to safely walk over a list of gpuvas
+ * @entry: &drm_gpuva structure to assign to in each iteration step
+ * @next: &next &drm_gpuva to store the next step
+ * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with
+ *
+ * This iterator walks over all &drm_gpuva structures associated with the
+ * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
+ * it is save against removal of elements.
+ */
+#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
+ list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
+
#endif /* __DRM_GEM_H__ */
diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
new file mode 100644
index 000000000000..4f23aaf726dd
--- /dev/null
+++ b/include/drm/drm_gpuva_mgr.h
@@ -0,0 +1,756 @@
+/* SPDX-License-Identifier: GPL-2.0 */
+
+#ifndef __DRM_GPUVA_MGR_H__
+#define __DRM_GPUVA_MGR_H__
+
+/*
+ * Copyright (c) 2022 Red Hat.
+ *
+ * Permission is hereby granted, free of charge, to any person obtaining a
+ * copy of this software and associated documentation files (the "Software"),
+ * to deal in the Software without restriction, including without limitation
+ * the rights to use, copy, modify, merge, publish, distribute, sublicense,
+ * and/or sell copies of the Software, and to permit persons to whom the
+ * Software is furnished to do so, subject to the following conditions:
+ *
+ * The above copyright notice and this permission notice shall be included in
+ * all copies or substantial portions of the Software.
+ *
+ * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+ * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+ * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL
+ * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, DAMAGES OR
+ * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE,
+ * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR
+ * OTHER DEALINGS IN THE SOFTWARE.
+ */
+
+#include <linux/list.h>
+#include <linux/rbtree.h>
+#include <linux/types.h>
+
+#include <drm/drm_gem.h>
+
+struct drm_gpuva_manager;
+struct drm_gpuva_fn_ops;
+
+/**
+ * enum drm_gpuva_flags - flags for struct drm_gpuva
+ */
+enum drm_gpuva_flags {
+ /**
+ * @DRM_GPUVA_INVALIDATED:
+ *
+ * Flag indicating that the &drm_gpuva's backing GEM is invalidated.
+ */
+ DRM_GPUVA_INVALIDATED = (1 << 0),
+
+ /**
+ * @DRM_GPUVA_SPARSE:
+ *
+ * Flag indicating that the &drm_gpuva is a sparse mapping.
+ */
+ DRM_GPUVA_SPARSE = (1 << 1),
+
+ /**
+ * @DRM_GPUVA_USERBITS: user defined bits
+ */
+ DRM_GPUVA_USERBITS = (1 << 2),
+};
+
+/**
+ * struct drm_gpuva - structure to track a GPU VA mapping
+ *
+ * This structure represents a GPU VA mapping and is associated with a
+ * &drm_gpuva_manager.
+ *
+ * Typically, this structure is embedded in bigger driver structures.
+ */
+struct drm_gpuva {
+ /**
+ * @mgr: the &drm_gpuva_manager this object is associated with
+ */
+ struct drm_gpuva_manager *mgr;
+
+ /**
+ * @flags: the &drm_gpuva_flags for this mapping
+ */
+ enum drm_gpuva_flags flags;
+
+ /**
+ * @va: structure containing the address and range of the &drm_gpuva
+ */
+ struct {
+ /**
+ * @addr: the start address
+ */
+ u64 addr;
+
+ /*
+ * @range: the range
+ */
+ u64 range;
+ } va;
+
+ /**
+ * @gem: structure containing the &drm_gem_object and it's offset
+ */
+ struct {
+ /**
+ * @offset: the offset within the &drm_gem_object
+ */
+ u64 offset;
+
+ /**
+ * @obj: the mapped &drm_gem_object
+ */
+ struct drm_gem_object *obj;
+
+ /**
+ * @entry: the &list_head to attach this object to a &drm_gem_object
+ */
+ struct list_head entry;
+ } gem;
+
+ /**
+ * @rb: structure containing data to store &drm_gpuvas in a rb-tree
+ */
+ struct {
+ /**
+ * @rb: the rb-tree node
+ */
+ struct rb_node node;
+
+ /**
+ * @entry: The &list_head to additionally connect &drm_gpuvas
+ * in the same order they appear in the interval tree. This is
+ * useful to keep iterating &drm_gpuvas from a start node found
+ * through the rb-tree while doing modifications on the rb-tree
+ * itself.
+ */
+ struct list_head entry;
+
+ /**
+ * @__subtree_last: needed by the interval tree, holding last-in-subtree
+ */
+ u64 __subtree_last;
+ } rb;
+};
+
+int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
+void drm_gpuva_remove(struct drm_gpuva *va);
+
+void drm_gpuva_link(struct drm_gpuva *va);
+void drm_gpuva_unlink(struct drm_gpuva *va);
+
+struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
+ u64 addr, u64 range);
+struct drm_gpuva *drm_gpuva_find_first(struct drm_gpuva_manager *mgr,
+ u64 addr, u64 range);
+struct drm_gpuva *drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start);
+struct drm_gpuva *drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end);
+
+bool drm_gpuva_interval_empty(struct drm_gpuva_manager *mgr, u64 addr, u64 range);
+
+static inline void drm_gpuva_init(struct drm_gpuva *va, u64 addr, u64 range,
+ struct drm_gem_object *obj, u64 offset)
+{
+ va->va.addr = addr;
+ va->va.range = range;
+ va->gem.obj = obj;
+ va->gem.offset = offset;
+}
+
+/**
+ * drm_gpuva_invalidate - sets whether the backing GEM of this &drm_gpuva is
+ * invalidated
+ * @va: the &drm_gpuva to set the invalidate flag for
+ * @invalidate: indicates whether the &drm_gpuva is invalidated
+ */
+static inline void drm_gpuva_invalidate(struct drm_gpuva *va, bool invalidate)
+{
+ if (invalidate)
+ va->flags |= DRM_GPUVA_INVALIDATED;
+ else
+ va->flags &= ~DRM_GPUVA_INVALIDATED;
+}
+
+/**
+ * drm_gpuva_invalidated - indicates whether the backing BO of this &drm_gpuva
+ * is invalidated
+ * @va: the &drm_gpuva to check
+ */
+static inline bool drm_gpuva_invalidated(struct drm_gpuva *va)
+{
+ return va->flags & DRM_GPUVA_INVALIDATED;
+}
+
+#ifdef CONFIG_LOCKDEP
+typedef struct lockdep_map *lockdep_map_p;
+#define drm_gpuva_manager_ext_assert_held(mgr) \
+ lockdep_assert(lock_is_held((mgr)->ext_lock) != LOCK_STATE_NOT_HELD)
+/**
+ * drm_gpuva_manager_set_ext_lock - set the external lock according to
+ * @DRM_GPUVA_MANAGER_LOCK_EXTERN
+ * @mgr: the &drm_gpuva_manager to set the lock for
+ * @lock: the lock to set
+ *
+ * If @DRM_GPUVA_MANAGER_LOCK_EXTERN is set, drivers need to call this function
+ * to provide the lock used to lock linking and unlinking of &drm_gpuvas to the
+ * &drm_gem_objects GPUVA list.
+ */
+#define drm_gpuva_manager_set_ext_lock(mgr, lock) \
+ (mgr)->ext_lock = &(lock)->dep_map
+#else
+typedef struct { /* nothing */ } lockdep_map_p;
+#define drm_gpuva_manager_ext_assert_held(mgr) do { (void)(mgr); } while (0)
+#define drm_gpuva_manager_set_ext_lock(mgr, lock) do { } while (0)
+#endif
+
+/**
+ * enum drm_gpuva_manager_flags - the feature flags for the &drm_gpuva_manager
+ */
+enum drm_gpuva_manager_flags {
+ /**
+ * @DRM_GPUVA_MANAGER_LOCK_EXTERN:
+ *
+ * Indicates the driver has it's own external lock for linking and
+ * unlinking &drm_gpuvas to the &drm_gem_objects GPUVA list.
+ *
+ * When setting this flag it is rquired to set a lock via
+ * drm_gpuva_set_ext_lock().
+ */
+ DRM_GPUVA_MANAGER_LOCK_EXTERN = (1 << 0),
+};
+
+/**
+ * struct drm_gpuva_manager - DRM GPU VA Manager
+ *
+ * The DRM GPU VA Manager keeps track of a GPU's virtual address space by using
+ * &maple_tree structures. Typically, this structure is embedded in bigger
+ * driver structures.
+ *
+ * Drivers can pass addresses and ranges in an arbitrary unit, e.g. bytes or
+ * pages.
+ *
+ * There should be one manager instance per GPU virtual address space.
+ */
+struct drm_gpuva_manager {
+ /**
+ * @name: the name of the DRM GPU VA space
+ */
+ const char *name;
+
+ /**
+ * @flags: the feature flags of the &drm_gpuva_manager
+ */
+ enum drm_gpuva_manager_flags flags;
+
+ /**
+ * @mm_start: start of the VA space
+ */
+ u64 mm_start;
+
+ /**
+ * @mm_range: length of the VA space
+ */
+ u64 mm_range;
+
+ /**
+ * @rb: structures to track &drm_gpuva entries
+ */
+ struct {
+ /**
+ * @tree: the rb-tree to track GPU VA mappings
+ */
+ struct rb_root_cached tree;
+
+ /**
+ * @list: the &list_head to track GPU VA mappings
+ */
+ struct list_head list;
+ } rb;
+
+ /**
+ * @kernel_alloc_node:
+ *
+ * &drm_gpuva representing the address space cutout reserved for
+ * the kernel
+ */
+ struct drm_gpuva kernel_alloc_node;
+
+ /**
+ * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
+ */
+ const struct drm_gpuva_fn_ops *ops;
+
+ /**
+ * @ext_lock: &lockdep_map according to @DRM_GPUVA_MANAGER_LOCK_EXTERN
+ */
+ lockdep_map_p ext_lock;
+};
+
+void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
+ const char *name,
+ u64 start_offset, u64 range,
+ u64 reserve_offset, u64 reserve_range,
+ const struct drm_gpuva_fn_ops *ops,
+ enum drm_gpuva_manager_flags flags);
+void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
+
+/**
+ * drm_gpuva_manager_external_lock - indicates whether the
+ * @DRM_GPUVA_MANAGER_LOCK_EXTERN flag is set
+ * @mgr: the &drm_gpuva_manager to check the flag for
+ */
+static inline bool drm_gpuva_manager_external_lock(struct drm_gpuva_manager *mgr)
+{
+ return mgr->flags & DRM_GPUVA_MANAGER_LOCK_EXTERN;
+}
+
+/**
+ * drm_gpuva_for_each_va_range - iternator to walk over a range of &drm_gpuvas
+ * @va__: &drm_gpuva structure to assign to in each iteration step
+ * @mgr__: &drm_gpuva_manager to walk over
+ * @start__: starting offset, the first gpuva will overlap this
+ * @end__: ending offset, the last gpuva will start before this (but may
+ * overlap)
+ *
+ * This iterator walks over all &drm_gpuvas in the &drm_gpuva_manager that lie
+ * between @start__ and @end__. It is implemented similarly to list_for_each(),
+ * but is using the &drm_gpuva_manager's internal interval tree to accelerate
+ * the search for the starting &drm_gpuva, and hence isn't safe against removal
+ * of elements. It assumes that @end__ is within (or is the upper limit of) the
+ * &drm_gpuva_manager. This iterator does not skip over the &drm_gpuva_manager's
+ * @kernel_alloc_node.
+ */
+#define drm_gpuva_for_each_va_range(va__, mgr__, start__, end__) \
+ for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__)); \
+ va__ && (va__->va.addr < (end__)) && \
+ !list_entry_is_head(va__, &(mgr__)->rb.list, rb.entry); \
+ va__ = list_next_entry(va__, rb.entry))
+
+/**
+ * drm_gpuva_for_each_va_range_safe - iternator to safely walk over a range of
+ * &drm_gpuvas
+ * @va__: &drm_gpuva to assign to in each iteration step
+ * @next__: another &drm_gpuva to use as temporary storage
+ * @mgr__: &drm_gpuva_manager to walk over
+ * @start__: starting offset, the first gpuva will overlap this
+ * @end__: ending offset, the last gpuva will start before this (but may
+ * overlap)
+ *
+ * This iterator walks over all &drm_gpuvas in the &drm_gpuva_manager that lie
+ * between @start__ and @end__. It is implemented similarly to
+ * list_for_each_safe(), but is using the &drm_gpuva_manager's internal interval
+ * tree to accelerate the search for the starting &drm_gpuva, and hence is safe
+ * against removal of elements. It assumes that @end__ is within (or is the
+ * upper limit of) the &drm_gpuva_manager. This iterator does not skip over the
+ * &drm_gpuva_manager's @kernel_alloc_node.
+ */
+#define drm_gpuva_for_each_va_range_safe(va__, next__, mgr__, start__, end__) \
+ for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__)), \
+ next__ = va ? list_next_entry(va__, rb.entry) : NULL; \
+ va__ && (va__->va.addr < (end__)) && \
+ !list_entry_is_head(va__, &(mgr__)->rb.list, rb.entry); \
+ va__ = next__, next__ = list_next_entry(va__, rb.entry))
+
+/**
+ * drm_gpuva_for_each_va - iternator to walk over all &drm_gpuvas
+ * @va__: &drm_gpuva to assign to in each iteration step
+ * @mgr__: &drm_gpuva_manager to walk over
+ *
+ * This iterator walks over all &drm_gpuva structures associated with the given
+ * &drm_gpuva_manager.
+ */
+#define drm_gpuva_for_each_va(va__, mgr__) \
+ list_for_each_entry(va__, &(mgr__)->rb.list, rb.entry)
+
+/**
+ * drm_gpuva_for_each_va_safe - iternator to safely walk over all &drm_gpuvas
+ * @va__: &drm_gpuva to assign to in each iteration step
+ * @next__: another &drm_gpuva to use as temporary storage
+ * @mgr__: &drm_gpuva_manager to walk over
+ *
+ * This iterator walks over all &drm_gpuva structures associated with the given
+ * &drm_gpuva_manager. It is implemented with list_for_each_entry_safe(), and
+ * hence safe against the removal of elements.
+ */
+#define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
+ list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
+
+/**
+ * enum drm_gpuva_op_type - GPU VA operation type
+ *
+ * Operations to alter the GPU VA mappings tracked by the &drm_gpuva_manager.
+ */
+enum drm_gpuva_op_type {
+ /**
+ * @DRM_GPUVA_OP_MAP: the map op type
+ */
+ DRM_GPUVA_OP_MAP,
+
+ /**
+ * @DRM_GPUVA_OP_REMAP: the remap op type
+ */
+ DRM_GPUVA_OP_REMAP,
+
+ /**
+ * @DRM_GPUVA_OP_UNMAP: the unmap op type
+ */
+ DRM_GPUVA_OP_UNMAP,
+
+ /**
+ * @DRM_GPUVA_OP_PREFETCH: the prefetch op type
+ */
+ DRM_GPUVA_OP_PREFETCH,
+};
+
+/**
+ * struct drm_gpuva_op_map - GPU VA map operation
+ *
+ * This structure represents a single map operation generated by the
+ * DRM GPU VA manager.
+ */
+struct drm_gpuva_op_map {
+ /**
+ * @va: structure containing address and range of a map
+ * operation
+ */
+ struct {
+ /**
+ * @addr: the base address of the new mapping
+ */
+ u64 addr;
+
+ /**
+ * @range: the range of the new mapping
+ */
+ u64 range;
+ } va;
+
+ /**
+ * @gem: structure containing the &drm_gem_object and it's offset
+ */
+ struct {
+ /**
+ * @offset: the offset within the &drm_gem_object
+ */
+ u64 offset;
+
+ /**
+ * @obj: the &drm_gem_object to map
+ */
+ struct drm_gem_object *obj;
+ } gem;
+};
+
+/**
+ * struct drm_gpuva_op_unmap - GPU VA unmap operation
+ *
+ * This structure represents a single unmap operation generated by the
+ * DRM GPU VA manager.
+ */
+struct drm_gpuva_op_unmap {
+ /**
+ * @va: the &drm_gpuva to unmap
+ */
+ struct drm_gpuva *va;
+
+ /**
+ * @keep:
+ *
+ * Indicates whether this &drm_gpuva is physically contiguous with the
+ * original mapping request.
+ *
+ * Optionally, if &keep is set, drivers may keep the actual page table
+ * mappings for this &drm_gpuva, adding the missing page table entries
+ * only and update the &drm_gpuva_manager accordingly.
+ */
+ bool keep;
+};
+
+/**
+ * struct drm_gpuva_op_remap - GPU VA remap operation
+ *
+ * This represents a single remap operation generated by the DRM GPU VA manager.
+ *
+ * A remap operation is generated when an existing GPU VA mmapping is split up
+ * by inserting a new GPU VA mapping or by partially unmapping existent
+ * mapping(s), hence it consists of a maximum of two map and one unmap
+ * operation.
+ *
+ * The @unmap operation takes care of removing the original existing mapping.
+ * @prev is used to remap the preceding part, @next the subsequent part.
+ *
+ * If either a new mapping's start address is aligned with the start address
+ * of the old mapping or the new mapping's end address is aligned with the
+ * end address of the old mapping, either @prev or @next is NULL.
+ *
+ * Note, the reason for a dedicated remap operation, rather than arbitrary
+ * unmap and map operations, is to give drivers the chance of extracting driver
+ * specific data for creating the new mappings from the unmap operations's
+ * &drm_gpuva structure which typically is embedded in larger driver specific
+ * structures.
+ */
+struct drm_gpuva_op_remap {
+ /**
+ * @prev: the preceding part of a split mapping
+ */
+ struct drm_gpuva_op_map *prev;
+
+ /**
+ * @next: the subsequent part of a split mapping
+ */
+ struct drm_gpuva_op_map *next;
+
+ /**
+ * @unmap: the unmap operation for the original existing mapping
+ */
+ struct drm_gpuva_op_unmap *unmap;
+};
+
+/**
+ * struct drm_gpuva_op_prefetch - GPU VA prefetch operation
+ *
+ * This structure represents a single prefetch operation generated by the
+ * DRM GPU VA manager.
+ */
+struct drm_gpuva_op_prefetch {
+ /**
+ * @va: the &drm_gpuva to prefetch
+ */
+ struct drm_gpuva *va;
+};
+
+/**
+ * struct drm_gpuva_op - GPU VA operation
+ *
+ * This structure represents a single generic operation.
+ *
+ * The particular type of the operation is defined by @op.
+ */
+struct drm_gpuva_op {
+ /**
+ * @entry:
+ *
+ * The &list_head used to distribute instances of this struct within
+ * &drm_gpuva_ops.
+ */
+ struct list_head entry;
+
+ /**
+ * @op: the type of the operation
+ */
+ enum drm_gpuva_op_type op;
+
+ union {
+ /**
+ * @map: the map operation
+ */
+ struct drm_gpuva_op_map map;
+
+ /**
+ * @remap: the remap operation
+ */
+ struct drm_gpuva_op_remap remap;
+
+ /**
+ * @unmap: the unmap operation
+ */
+ struct drm_gpuva_op_unmap unmap;
+
+ /**
+ * @prefetch: the prefetch operation
+ */
+ struct drm_gpuva_op_prefetch prefetch;
+ };
+};
+
+/**
+ * struct drm_gpuva_ops - wraps a list of &drm_gpuva_op
+ */
+struct drm_gpuva_ops {
+ /**
+ * @list: the &list_head
+ */
+ struct list_head list;
+};
+
+/**
+ * drm_gpuva_for_each_op - iterator to walk over &drm_gpuva_ops
+ * @op: &drm_gpuva_op to assign in each iteration step
+ * @ops: &drm_gpuva_ops to walk
+ *
+ * This iterator walks over all ops within a given list of operations.
+ */
+#define drm_gpuva_for_each_op(op, ops) list_for_each_entry(op, &(ops)->list, entry)
+
+/**
+ * drm_gpuva_for_each_op_safe - iterator to safely walk over &drm_gpuva_ops
+ * @op: &drm_gpuva_op to assign in each iteration step
+ * @next: &next &drm_gpuva_op to store the next step
+ * @ops: &drm_gpuva_ops to walk
+ *
+ * This iterator walks over all ops within a given list of operations. It is
+ * implemented with list_for_each_safe(), so save against removal of elements.
+ */
+#define drm_gpuva_for_each_op_safe(op, next, ops) \
+ list_for_each_entry_safe(op, next, &(ops)->list, entry)
+
+/**
+ * drm_gpuva_for_each_op_from_reverse - iterate backwards from the given point
+ * @op: &drm_gpuva_op to assign in each iteration step
+ * @ops: &drm_gpuva_ops to walk
+ *
+ * This iterator walks over all ops within a given list of operations beginning
+ * from the given operation in reverse order.
+ */
+#define drm_gpuva_for_each_op_from_reverse(op, ops) \
+ list_for_each_entry_from_reverse(op, &(ops)->list, entry)
+
+/**
+ * drm_gpuva_first_op - returns the first &drm_gpuva_op from &drm_gpuva_ops
+ * @ops: the &drm_gpuva_ops to get the fist &drm_gpuva_op from
+ */
+#define drm_gpuva_first_op(ops) \
+ list_first_entry(&(ops)->list, struct drm_gpuva_op, entry)
+
+/**
+ * drm_gpuva_last_op - returns the last &drm_gpuva_op from &drm_gpuva_ops
+ * @ops: the &drm_gpuva_ops to get the last &drm_gpuva_op from
+ */
+#define drm_gpuva_last_op(ops) \
+ list_last_entry(&(ops)->list, struct drm_gpuva_op, entry)
+
+/**
+ * drm_gpuva_prev_op - previous &drm_gpuva_op in the list
+ * @op: the current &drm_gpuva_op
+ */
+#define drm_gpuva_prev_op(op) list_prev_entry(op, entry)
+
+/**
+ * drm_gpuva_next_op - next &drm_gpuva_op in the list
+ * @op: the current &drm_gpuva_op
+ */
+#define drm_gpuva_next_op(op) list_next_entry(op, entry)
+
+struct drm_gpuva_ops *
+drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr,
+ u64 addr, u64 range,
+ struct drm_gem_object *obj, u64 offset);
+struct drm_gpuva_ops *
+drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr,
+ u64 addr, u64 range);
+
+struct drm_gpuva_ops *
+drm_gpuva_prefetch_ops_create(struct drm_gpuva_manager *mgr,
+ u64 addr, u64 range);
+
+struct drm_gpuva_ops *
+drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
+ struct drm_gem_object *obj);
+
+void drm_gpuva_ops_free(struct drm_gpuva_manager *mgr,
+ struct drm_gpuva_ops *ops);
+
+static inline void drm_gpuva_init_from_op(struct drm_gpuva *va,
+ struct drm_gpuva_op_map *op)
+{
+ drm_gpuva_init(va, op->va.addr, op->va.range,
+ op->gem.obj, op->gem.offset);
+}
+
+/**
+ * struct drm_gpuva_fn_ops - callbacks for split/merge steps
+ *
+ * This structure defines the callbacks used by &drm_gpuva_sm_map and
+ * &drm_gpuva_sm_unmap to provide the split/merge steps for map and unmap
+ * operations to drivers.
+ */
+struct drm_gpuva_fn_ops {
+ /**
+ * @op_alloc: called when the &drm_gpuva_manager allocates
+ * a struct drm_gpuva_op
+ *
+ * Some drivers may want to embed struct drm_gpuva_op into driver
+ * specific structures. By implementing this callback drivers can
+ * allocate memory accordingly.
+ *
+ * This callback is optional.
+ */
+ struct drm_gpuva_op *(*op_alloc)(void);
+
+ /**
+ * @op_free: called when the &drm_gpuva_manager frees a
+ * struct drm_gpuva_op
+ *
+ * Some drivers may want to embed struct drm_gpuva_op into driver
+ * specific structures. By implementing this callback drivers can
+ * free the previously allocated memory accordingly.
+ *
+ * This callback is optional.
+ */
+ void (*op_free)(struct drm_gpuva_op *op);
+
+ /**
+ * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
+ * mapping once all previous steps were completed
+ *
+ * The &priv pointer matches the one the driver passed to
+ * &drm_gpuva_sm_map or &drm_gpuva_sm_unmap, respectively.
+ *
+ * Can be NULL if &drm_gpuva_sm_map is used.
+ */
+ int (*sm_step_map)(struct drm_gpuva_op *op, void *priv);
+
+ /**
+ * @sm_step_remap: called from &drm_gpuva_sm_map and
+ * &drm_gpuva_sm_unmap to split up an existent mapping
+ *
+ * This callback is called when existent mapping needs to be split up.
+ * This is the case when either a newly requested mapping overlaps or
+ * is enclosed by an existent mapping or a partial unmap of an existent
+ * mapping is requested.
+ *
+ * The &priv pointer matches the one the driver passed to
+ * &drm_gpuva_sm_map or &drm_gpuva_sm_unmap, respectively.
+ *
+ * Can be NULL if neither &drm_gpuva_sm_map nor &drm_gpuva_sm_unmap is
+ * used.
+ */
+ int (*sm_step_remap)(struct drm_gpuva_op *op, void *priv);
+
+ /**
+ * @sm_step_unmap: called from &drm_gpuva_sm_map and
+ * &drm_gpuva_sm_unmap to unmap an existent mapping
+ *
+ * This callback is called when existent mapping needs to be unmapped.
+ * This is the case when either a newly requested mapping encloses an
+ * existent mapping or an unmap of an existent mapping is requested.
+ *
+ * The &priv pointer matches the one the driver passed to
+ * &drm_gpuva_sm_map or &drm_gpuva_sm_unmap, respectively.
+ *
+ * Can be NULL if neither &drm_gpuva_sm_map nor &drm_gpuva_sm_unmap is
+ * used.
+ */
+ int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
+};
+
+int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
+ u64 addr, u64 range,
+ struct drm_gem_object *obj, u64 offset);
+
+int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
+ u64 addr, u64 range);
+
+void drm_gpuva_map(struct drm_gpuva_manager *mgr,
+ struct drm_gpuva *va,
+ struct drm_gpuva_op_map *op);
+void drm_gpuva_remap(struct drm_gpuva *prev,
+ struct drm_gpuva *next,
+ struct drm_gpuva_op_remap *op);
+void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
+
+#endif /* __DRM_GPUVA_MGR_H__ */
--
2.41.0



2023-06-30 08:23:47

by Boris Brezillon

[permalink] [raw]
Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings

On Fri, 30 Jun 2023 10:02:52 +0200
Boris Brezillon <[email protected]> wrote:

> Hi Danilo,
>
> On Fri, 30 Jun 2023 00:25:18 +0200
> Danilo Krummrich <[email protected]> wrote:
>
> > + * int driver_gpuva_remap(struct drm_gpuva_op *op, void *__ctx)
> > + * {
> > + * struct driver_context *ctx = __ctx;
> > + *
> > + * drm_gpuva_remap(ctx->prev_va, ctx->next_va, &op->remap);
> > + *
> > + * drm_gpuva_unlink(op->remap.unmap->va);
> > + * kfree(op->remap.unmap->va);
> > + *
> > + * if (op->remap.prev) {
> > + * drm_gpuva_link(ctx->prev_va);
>
> I ended up switching to dma_resv-based locking for the GEMs and I
> wonder what the locking is supposed to look like in the async-mapping
> case, where we insert/remove the VA nodes in the drm_sched::run_job()
> path.
>
> What I have right now is something like:
>
> dma_resv_lock(vm->resv);
>
> // split done in drm_gpuva_sm_map(), each iteration
> // of the loop is a call to the driver ->[re,un]map()
> // hook
> for_each_sub_op() {
>
> // Private BOs have their resv field pointing to the
> // VM resv and we take the VM resv lock before calling
> // drm_gpuva_sm_map()
> if (vm->resv != gem->resv)
> dma_resv_lock(gem->resv);
>
> drm_gpuva_[un]link(va);
> gem_[un]pin(gem);
>
> if (vm->resv != gem->resv)
> dma_resv_unlock(gem->resv);
> }
>
> dma_resv_unlock(vm->resv);
>
> In practice, I don't expect things to deadlock, because the VM resv is
> not supposed to be taken outside the VM context and the locking order
> is always the same (VM lock first, and then each shared BO
> taken/released independently), but I'm not super thrilled by this
> nested lock, and I'm wondering if we shouldn't have a pass collecting
> locks in a drm_exec context first, and then have
> the operations executed. IOW, something like that:
>
> drm_exec_init(exec, DRM_EXEC_IGNORE_DUPLICATES)
> drm_exec_until_all_locked(exec) {
> // Dummy GEM is the dummy GEM object I use to make the VM
> // participate in the locking without having to teach
> // drm_exec how to deal with raw dma_resv objects.
> ret = drm_exec_lock_obj(exec, vm->dummy_gem);
> drm_exec_retry_on_contention(exec);
> if (ret)
> return ret;
>
> // Could take the form of drm_gpuva_sm_[un]map_acquire_locks()
> // helpers
> for_each_sub_op() {
> ret = drm_exec_lock_obj(exec, gem);
> if (ret)
> return ret;
> }
> }
>
> // each iteration of the loop is a call to the driver
> // ->[re,un]map() hook
> for_each_sub_op() {
> ...
> gem_[un]pin_locked(gem);

Just wanted to clarify that the pages have been pinned at VM_BIND job
creation time, so this gem_pin_locked() call is effectively just a
pin_count++, not the whole page allocation, which we don't want to
happen in a dma-signaling path.

> drm_gpuva_[un]link(va);
> ...
> }
>
> drm_exec_fini(exec);

2023-06-30 08:39:10

by Boris Brezillon

[permalink] [raw]
Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings

Hi Danilo,

On Fri, 30 Jun 2023 00:25:18 +0200
Danilo Krummrich <[email protected]> wrote:

> + * int driver_gpuva_remap(struct drm_gpuva_op *op, void *__ctx)
> + * {
> + * struct driver_context *ctx = __ctx;
> + *
> + * drm_gpuva_remap(ctx->prev_va, ctx->next_va, &op->remap);
> + *
> + * drm_gpuva_unlink(op->remap.unmap->va);
> + * kfree(op->remap.unmap->va);
> + *
> + * if (op->remap.prev) {
> + * drm_gpuva_link(ctx->prev_va);

I ended up switching to dma_resv-based locking for the GEMs and I
wonder what the locking is supposed to look like in the async-mapping
case, where we insert/remove the VA nodes in the drm_sched::run_job()
path.

What I have right now is something like:

dma_resv_lock(vm->resv);

// split done in drm_gpuva_sm_map(), each iteration
// of the loop is a call to the driver ->[re,un]map()
// hook
for_each_sub_op() {

// Private BOs have their resv field pointing to the
// VM resv and we take the VM resv lock before calling
// drm_gpuva_sm_map()
if (vm->resv != gem->resv)
dma_resv_lock(gem->resv);

drm_gpuva_[un]link(va);
gem_[un]pin(gem);

if (vm->resv != gem->resv)
dma_resv_unlock(gem->resv);
}

dma_resv_unlock(vm->resv);

In practice, I don't expect things to deadlock, because the VM resv is
not supposed to be taken outside the VM context and the locking order
is always the same (VM lock first, and then each shared BO
taken/released independently), but I'm not super thrilled by this
nested lock, and I'm wondering if we shouldn't have a pass collecting
locks in a drm_exec context first, and then have
the operations executed. IOW, something like that:

drm_exec_init(exec, DRM_EXEC_IGNORE_DUPLICATES)
drm_exec_until_all_locked(exec) {
// Dummy GEM is the dummy GEM object I use to make the VM
// participate in the locking without having to teach
// drm_exec how to deal with raw dma_resv objects.
ret = drm_exec_lock_obj(exec, vm->dummy_gem);
drm_exec_retry_on_contention(exec);
if (ret)
return ret;

// Could take the form of drm_gpuva_sm_[un]map_acquire_locks()
// helpers
for_each_sub_op() {
ret = drm_exec_lock_obj(exec, gem);
if (ret)
return ret;
}
}

// each iteration of the loop is a call to the driver
// ->[re,un]map() hook
for_each_sub_op() {
...
gem_[un]pin_locked(gem);
drm_gpuva_[un]link(va);
...
}

drm_exec_fini(exec);

Don't know if I got this right, or if I'm just confused again by how
the drm_gpuva API is supposed to be used.

Regards,

Boris

> + * ctx->prev_va = NULL;
> + * }
> + *
> + * if (op->remap.next) {
> + * drm_gpuva_link(ctx->next_va);
> + * ctx->next_va = NULL;
> + * }
> + *
> + * return 0;
> + * }

2023-06-30 09:55:26

by Boris Brezillon

[permalink] [raw]
Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings

On Fri, 30 Jun 2023 10:02:52 +0200
Boris Brezillon <[email protected]> wrote:

> In practice, I don't expect things to deadlock, because the VM resv is
> not supposed to be taken outside the VM context and the locking order
> is always the same (VM lock first, and then each shared BO
> taken/released independently), but I'm not super thrilled by this
> nested lock, and I'm wondering if we shouldn't have a pass collecting
> locks in a drm_exec context first, and then have
> the operations executed. IOW, something like that:
>
> drm_exec_init(exec, DRM_EXEC_IGNORE_DUPLICATES)
> drm_exec_until_all_locked(exec) {
> // Dummy GEM is the dummy GEM object I use to make the VM
> // participate in the locking without having to teach
> // drm_exec how to deal with raw dma_resv objects.
> ret = drm_exec_lock_obj(exec, vm->dummy_gem);
> drm_exec_retry_on_contention(exec);
> if (ret)
> return ret;
>
> // Could take the form of drm_gpuva_sm_[un]map_acquire_locks()
> // helpers

Nevermind, I implemented a driver specific acquire_op_locks(), and it's
fairly simple with the gpuva iter (we just have to iterate over all VAs
covered by the operation range and call drm_exec_lock_obj() on the GEM
attached to these VAs), so it's probably not worth providing a generic
helper for that.

Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings

Hi, Danilo

Some review comments below:

On 6/30/23 00:25, Danilo Krummrich wrote:
> Add infrastructure to keep track of GPU virtual address (VA) mappings
> with a decicated VA space manager implementation.
>
> New UAPIs, motivated by Vulkan sparse memory bindings graphics drivers
> start implementing, allow userspace applications to request multiple and
> arbitrary GPU VA mappings of buffer objects. The DRM GPU VA manager is
> intended to serve the following purposes in this context.
>
> 1) Provide infrastructure to track GPU VA allocations and mappings,
> making use of the maple_tree.

It looks like we're not using the maple tree anymore, but rather an
instantiation of an interval tree.

(Perhaps as a follow-up it makes sense to provide a pre-instantiated
common u64 version of the interval tree in addition to the unsigned long
one since it appears to be used in multiple places in graphics drivers).

> 2) Generically connect GPU VA mappings to their backing buffers, in
> particular DRM GEM objects.
>
> 3) Provide a common implementation to perform more complex mapping
> operations on the GPU VA space. In particular splitting and merging
> of GPU VA mappings, e.g. for intersecting mapping requests or partial
> unmap requests.
>
> Tested-by: Donald Robson<[email protected]>
> Reviewed-by: Boris Brezillon<[email protected]>
> Suggested-by: Dave Airlie<[email protected]>
> Signed-off-by: Danilo Krummrich<[email protected]>
> ---
> Documentation/gpu/drm-mm.rst | 36 +
> drivers/gpu/drm/Makefile | 1 +
> drivers/gpu/drm/drm_gem.c | 3 +
> drivers/gpu/drm/drm_gpuva_mgr.c | 1743 +++++++++++++++++++++++++++++++
> include/drm/drm_drv.h | 6 +
> include/drm/drm_gem.h | 52 +
> include/drm/drm_gpuva_mgr.h | 756 ++++++++++++++
> 7 files changed, 2597 insertions(+)
> create mode 100644 drivers/gpu/drm/drm_gpuva_mgr.c
> create mode 100644 include/drm/drm_gpuva_mgr.h
>
> diff --git a/Documentation/gpu/drm-mm.rst b/Documentation/gpu/drm-mm.rst
> index a52e6f4117d6..3d5dc9dc1bfe 100644
> --- a/Documentation/gpu/drm-mm.rst
> +++ b/Documentation/gpu/drm-mm.rst
> @@ -466,6 +466,42 @@ DRM MM Range Allocator Function References
> .. kernel-doc:: drivers/gpu/drm/drm_mm.c
> :export:
>
> +DRM GPU VA Manager
> +==================
> +
> +Overview
> +--------
> +
> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
> + :doc: Overview
> +
> +Split and Merge
> +---------------
> +
> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
> + :doc: Split and Merge
> +
> +Locking
> +-------
> +
> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
> + :doc: Locking
> +
> +Examples
> +--------
> +
> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
> + :doc: Examples
> +
> +DRM GPU VA Manager Function References
> +--------------------------------------
> +
> +.. kernel-doc:: include/drm/drm_gpuva_mgr.h
> + :internal:
> +
> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
> + :export:
> +
> DRM Buddy Allocator
> ===================
>
> diff --git a/drivers/gpu/drm/Makefile b/drivers/gpu/drm/Makefile
> index 414855e2a463..6d6c9dec66e8 100644
> --- a/drivers/gpu/drm/Makefile
> +++ b/drivers/gpu/drm/Makefile
> @@ -45,6 +45,7 @@ drm-y := \
> drm_vblank.o \
> drm_vblank_work.o \
> drm_vma_manager.o \
> + drm_gpuva_mgr.o \
> drm_writeback.o
> drm-$(CONFIG_DRM_LEGACY) += \
> drm_agpsupport.o \
> diff --git a/drivers/gpu/drm/drm_gem.c b/drivers/gpu/drm/drm_gem.c
> index 1a5a2cd0d4ec..cd878ebddbd0 100644
> --- a/drivers/gpu/drm/drm_gem.c
> +++ b/drivers/gpu/drm/drm_gem.c
> @@ -164,6 +164,9 @@ void drm_gem_private_object_init(struct drm_device *dev,
> if (!obj->resv)
> obj->resv = &obj->_resv;
>
> + if (drm_core_check_feature(dev, DRIVER_GEM_GPUVA))
> + drm_gem_gpuva_init(obj);
> +
> drm_vma_node_reset(&obj->vma_node);
> INIT_LIST_HEAD(&obj->lru_node);
> }
> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
> new file mode 100644
> index 000000000000..4414990c05cc
> --- /dev/null
> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
> @@ -0,0 +1,1743 @@
> +// SPDX-License-Identifier: GPL-2.0
> +/*
SPDX-License is GPL-2.0 but below looks like MIT. Can we use "GPL-2.0 OR
MIT" or does something restrict it to GPL-only?
> + * Copyright (c) 2022 Red Hat.
> + *
> + * Permission is hereby granted, free of charge, to any person obtaining a
> + * copy of this software and associated documentation files (the "Software"),
> + * to deal in the Software without restriction, including without limitation
> + * the rights to use, copy, modify, merge, publish, distribute, sublicense,
> + * and/or sell copies of the Software, and to permit persons to whom the
> + * Software is furnished to do so, subject to the following conditions:
> + *
> + * The above copyright notice and this permission notice shall be included in
> + * all copies or substantial portions of the Software.
> + *
> + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
> + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
> + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL
> + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, DAMAGES OR
> + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE,
> + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR
> + * OTHER DEALINGS IN THE SOFTWARE.
> + *
> + * Authors:
> + * Danilo Krummrich<[email protected]>
> + *
> + */
> +
> +#include <drm/drm_gpuva_mgr.h>
> +
> +#include <linux/interval_tree_generic.h>
> +#include <linux/mm.h>
> +
> +/**
> + * DOC: Overview
> + *
> + * The DRM GPU VA Manager, represented by struct drm_gpuva_manager keeps track
> + * of a GPU's virtual address (VA) space and manages the corresponding virtual
> + * mappings represented by &drm_gpuva objects. It also keeps track of the
> + * mapping's backing &drm_gem_object buffers.
> + *
> + * &drm_gem_object buffers maintain a list of &drm_gpuva objects representing
> + * all existent GPU VA mappings using this &drm_gem_object as backing buffer.
> + *
> + * GPU VAs can be flagged as sparse, such that drivers may use GPU VAs to also
> + * keep track of sparse PTEs in order to support Vulkan 'Sparse Resources'.
> + *
> + * The GPU VA manager internally uses a rb-tree to manage the
> + * &drm_gpuva mappings within a GPU's virtual address space.
> + *
> + * The &drm_gpuva_manager contains a special &drm_gpuva representing the
> + * portion of VA space reserved by the kernel. This node is initialized together
> + * with the GPU VA manager instance and removed when the GPU VA manager is
> + * destroyed.
> + *
> + * In a typical application drivers would embed struct drm_gpuva_manager and
> + * struct drm_gpuva within their own driver specific structures, there won't be
> + * any memory allocations of it's own nor memory allocations of &drm_gpuva
s/it's/its/
> + * entries.
> + *
> + * The data structures needed to store &drm_gpuvas within the &drm_gpuva_manager
> + * are contained within struct drm_gpuva already. Hence, for inserting
> + * &drm_gpuva entries from within dma-fence signalling critical sections it is
> + * enough to pre-allocate the &drm_gpuva structures.
> + */
> +
> +/**
> + * DOC: Split and Merge
> + *
> + * Besides it's capability to manage and represent a GPU VA space, the
s/it's/its/
> + * &drm_gpuva_manager also provides functions to let the &drm_gpuva_manager
> + * calculate a sequence of operations to satisfy a given map or unmap request.
> + *
> + * Therefore the DRM GPU VA manager provides an algorithm implementing splitting
> + * and merging of existent GPU VA mappings with the ones that are requested to
> + * be mapped or unmapped. This feature is required by the Vulkan API to
> + * implement Vulkan 'Sparse Memory Bindings' - drivers UAPIs often refer to this
> + * as VM BIND.
> + *
> + * Drivers can call drm_gpuva_sm_map() to receive a sequence of callbacks
> + * containing map, unmap and remap operations for a given newly requested
> + * mapping. The sequence of callbacks represents the set of operations to
> + * execute in order to integrate the new mapping cleanly into the current state
> + * of the GPU VA space.
> + *
> + * Depending on how the new GPU VA mapping intersects with the existent mappings
> + * of the GPU VA space the &drm_gpuva_fn_ops callbacks contain an arbitrary
> + * amount of unmap operations, a maximum of two remap operations and a single
> + * map operation. The caller might receive no callback at all if no operation is
> + * required, e.g. if the requested mapping already exists in the exact same way.
> + *
> + * The single map operation represents the original map operation requested by
> + * the caller.
> + *
> + * &drm_gpuva_op_unmap contains a 'keep' field, which indicates whether the
> + * &drm_gpuva to unmap is physically contiguous with the original mapping
> + * request. Optionally, if 'keep' is set, drivers may keep the actual page table
> + * entries for this &drm_gpuva, adding the missing page table entries only and
> + * update the &drm_gpuva_manager's view of things accordingly.
> + *
> + * Drivers may do the same optimization, namely delta page table updates, also
> + * for remap operations. This is possible since &drm_gpuva_op_remap consists of
> + * one unmap operation and one or two map operations, such that drivers can
> + * derive the page table update delta accordingly.
> + *
> + * Note that there can't be more than two existent mappings to split up, one at
> + * the beginning and one at the end of the new mapping, hence there is a
> + * maximum of two remap operations.
> + *
> + * Analogous to drm_gpuva_sm_map() drm_gpuva_sm_unmap() uses &drm_gpuva_fn_ops
> + * to call back into the driver in order to unmap a range of GPU VA space. The
> + * logic behind this function is way simpler though: For all existent mappings
> + * enclosed by the given range unmap operations are created. For mappings which
> + * are only partically located within the given range, remap operations are
> + * created such that those mappings are split up and re-mapped partically.
> + *
> + * As an alternative to drm_gpuva_sm_map() and drm_gpuva_sm_unmap(),
> + * drm_gpuva_sm_map_ops_create() and drm_gpuva_sm_unmap_ops_create() can be used
> + * to directly obtain an instance of struct drm_gpuva_ops containing a list of
> + * &drm_gpuva_op, which can be iterated with drm_gpuva_for_each_op(). This list
> + * contains the &drm_gpuva_ops analogous to the callbacks one would receive when
> + * calling drm_gpuva_sm_map() or drm_gpuva_sm_unmap(). While this way requires
> + * more memory (to allocate the &drm_gpuva_ops), it provides drivers a way to
> + * iterate the &drm_gpuva_op multiple times, e.g. once in a context where memory
> + * allocations are possible (e.g. to allocate GPU page tables) and once in the
> + * dma-fence signalling critical path.
> + *
> + * To update the &drm_gpuva_manager's view of the GPU VA space
> + * drm_gpuva_insert() and drm_gpuva_remove() may be used. These functions can
> + * safely be used from &drm_gpuva_fn_ops callbacks originating from
> + * drm_gpuva_sm_map() or drm_gpuva_sm_unmap(). However, it might be more
> + * convenient to use the provided helper functions drm_gpuva_map(),
> + * drm_gpuva_remap() and drm_gpuva_unmap() instead.
> + *
> + * The following diagram depicts the basic relationships of existent GPU VA
> + * mappings, a newly requested mapping and the resulting mappings as implemented
> + * by drm_gpuva_sm_map() - it doesn't cover any arbitrary combinations of these.
> + *
> + * 1) Requested mapping is identical. Replace it, but indicate the backing PTEs
> + * could be kept.
> + *
> + * ::
> + *
> + * 0 a 1
> + * old: |-----------| (bo_offset=n)
> + *
> + * 0 a 1
> + * req: |-----------| (bo_offset=n)
> + *
> + * 0 a 1
> + * new: |-----------| (bo_offset=n)
> + *
> + *
> + * 2) Requested mapping is identical, except for the BO offset, hence replace
> + * the mapping.
> + *
> + * ::
> + *
> + * 0 a 1
> + * old: |-----------| (bo_offset=n)
> + *
> + * 0 a 1
> + * req: |-----------| (bo_offset=m)
> + *
> + * 0 a 1
> + * new: |-----------| (bo_offset=m)
> + *
> + *
> + * 3) Requested mapping is identical, except for the backing BO, hence replace
> + * the mapping.
> + *
> + * ::
> + *
> + * 0 a 1
> + * old: |-----------| (bo_offset=n)
> + *
> + * 0 b 1
> + * req: |-----------| (bo_offset=n)
> + *
> + * 0 b 1
> + * new: |-----------| (bo_offset=n)
> + *
> + *
> + * 4) Existent mapping is a left aligned subset of the requested one, hence
> + * replace the existent one.
> + *
> + * ::
> + *
> + * 0 a 1
> + * old: |-----| (bo_offset=n)
> + *
> + * 0 a 2
> + * req: |-----------| (bo_offset=n)
> + *
> + * 0 a 2
> + * new: |-----------| (bo_offset=n)
> + *
> + * .. note::
> + * We expect to see the same result for a request with a different BO
> + * and/or non-contiguous BO offset.
> + *
> + *
> + * 5) Requested mapping's range is a left aligned subset of the existent one,
> + * but backed by a different BO. Hence, map the requested mapping and split
> + * the existent one adjusting it's BO offset.
Typo: s/it's/its/ above and in multiple places below.
> + *
> + * ::
> + *
> + * 0 a 2
> + * old: |-----------| (bo_offset=n)
> + *
> + * 0 b 1
> + * req: |-----| (bo_offset=n)
> + *
> + * 0 b 1 a' 2
> + * new: |-----|-----| (b.bo_offset=n, a.bo_offset=n+1)
> + *
> + * .. note::
> + * We expect to see the same result for a request with a different BO
> + * and/or non-contiguous BO offset.
> + *
> + *
> + * 6) Existent mapping is a superset of the requested mapping. Split it up, but
> + * indicate that the backing PTEs could be kept.
> + *
> + * ::
> + *
> + * 0 a 2
> + * old: |-----------| (bo_offset=n)
> + *
> + * 0 a 1
> + * req: |-----| (bo_offset=n)
> + *
> + * 0 a 1 a' 2
> + * new: |-----|-----| (a.bo_offset=n, a'.bo_offset=n+1)
> + *
> + *
> + * 7) Requested mapping's range is a right aligned subset of the existent one,
> + * but backed by a different BO. Hence, map the requested mapping and split
> + * the existent one, without adjusting the BO offset.
> + *
> + * ::
> + *
> + * 0 a 2
> + * old: |-----------| (bo_offset=n)
> + *
> + * 1 b 2
> + * req: |-----| (bo_offset=m)
> + *
> + * 0 a 1 b 2
> + * new: |-----|-----| (a.bo_offset=n,b.bo_offset=m)
> + *
> + *
> + * 8) Existent mapping is a superset of the requested mapping. Split it up, but
> + * indicate that the backing PTEs could be kept.
> + *
> + * ::
> + *
> + * 0 a 2
> + * old: |-----------| (bo_offset=n)
> + *
> + * 1 a 2
> + * req: |-----| (bo_offset=n+1)
> + *
> + * 0 a' 1 a 2
> + * new: |-----|-----| (a'.bo_offset=n, a.bo_offset=n+1)
> + *
> + *
> + * 9) Existent mapping is overlapped at the end by the requested mapping backed
> + * by a different BO. Hence, map the requested mapping and split up the
> + * existent one, without adjusting the BO offset.
> + *
> + * ::
> + *
> + * 0 a 2
> + * old: |-----------| (bo_offset=n)
> + *
> + * 1 b 3
> + * req: |-----------| (bo_offset=m)
> + *
> + * 0 a 1 b 3
> + * new: |-----|-----------| (a.bo_offset=n,b.bo_offset=m)
> + *
> + *
> + * 10) Existent mapping is overlapped by the requested mapping, both having the
> + * same backing BO with a contiguous offset. Indicate the backing PTEs of
> + * the old mapping could be kept.
> + *
> + * ::
> + *
> + * 0 a 2
> + * old: |-----------| (bo_offset=n)
> + *
> + * 1 a 3
> + * req: |-----------| (bo_offset=n+1)
> + *
> + * 0 a' 1 a 3
> + * new: |-----|-----------| (a'.bo_offset=n, a.bo_offset=n+1)
> + *
> + *
> + * 11) Requested mapping's range is a centered subset of the existent one
> + * having a different backing BO. Hence, map the requested mapping and split
> + * up the existent one in two mappings, adjusting the BO offset of the right
> + * one accordingly.
> + *
> + * ::
> + *
> + * 0 a 3
> + * old: |-----------------| (bo_offset=n)
> + *
> + * 1 b 2
> + * req: |-----| (bo_offset=m)
> + *
> + * 0 a 1 b 2 a' 3
> + * new: |-----|-----|-----| (a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2)
> + *
> + *
> + * 12) Requested mapping is a contiguous subset of the existent one. Split it
> + * up, but indicate that the backing PTEs could be kept.
> + *
> + * ::
> + *
> + * 0 a 3
> + * old: |-----------------| (bo_offset=n)
> + *
> + * 1 a 2
> + * req: |-----| (bo_offset=n+1)
> + *
> + * 0 a' 1 a 2 a'' 3
> + * old: |-----|-----|-----| (a'.bo_offset=n, a.bo_offset=n+1, a''.bo_offset=n+2)
> + *
> + *
> + * 13) Existent mapping is a right aligned subset of the requested one, hence
> + * replace the existent one.
> + *
> + * ::
> + *
> + * 1 a 2
> + * old: |-----| (bo_offset=n+1)
> + *
> + * 0 a 2
> + * req: |-----------| (bo_offset=n)
> + *
> + * 0 a 2
> + * new: |-----------| (bo_offset=n)
> + *
> + * .. note::
> + * We expect to see the same result for a request with a different bo
> + * and/or non-contiguous bo_offset.
> + *
> + *
> + * 14) Existent mapping is a centered subset of the requested one, hence
> + * replace the existent one.
> + *
> + * ::
> + *
> + * 1 a 2
> + * old: |-----| (bo_offset=n+1)
> + *
> + * 0 a 3
> + * req: |----------------| (bo_offset=n)
> + *
> + * 0 a 3
> + * new: |----------------| (bo_offset=n)
> + *
> + * .. note::
> + * We expect to see the same result for a request with a different bo
> + * and/or non-contiguous bo_offset.
> + *
> + *
> + * 15) Existent mappings is overlapped at the beginning by the requested mapping
> + * backed by a different BO. Hence, map the requested mapping and split up
> + * the existent one, adjusting it's BO offset accordingly.
> + *
> + * ::
> + *
> + * 1 a 3
> + * old: |-----------| (bo_offset=n)
> + *
> + * 0 b 2
> + * req: |-----------| (bo_offset=m)
> + *
> + * 0 b 2 a' 3
> + * new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2)
> + */
> +
> +/**
> + * DOC: Locking
> + *
> + * Generally, the GPU VA manager does not take care of locking itself, it is
> + * the drivers responsibility to take care about locking. Drivers might want to
> + * protect the following operations: inserting, removing and iterating
> + * &drm_gpuva objects as well as generating all kinds of operations, such as
> + * split / merge or prefetch.
> + *
> + * The GPU VA manager also does not take care of the locking of the backing
> + * &drm_gem_object buffers GPU VA lists by itself; drivers are responsible to
> + * enforce mutual exclusion using either the GEMs dma_resv lock or alternatively
> + * a driver specific external lock by setting the @DRM_GPUVA_MANAGER_LOCK_EXTERN
> + * flag.

Is the external lock used or anticipated by any WIP implementation? If
not I suggest to leave it out until there is a user.

> + *
> + * For the latter, functions such as drm_gpuva_link() or drm_gpuva_unlink()
> + * contain lockdep checks to indicate locking issues. For this to work drivers
> + * must provide (in case the @DRM_GPUVA_MANAGER_LOCK_EXTERN flag is set) their
> + * external lock with drm_gpuva_manager_set_ext_lock() after initialization.
> + */
> +
> +/**
> + * DOC: Examples
> + *
> + * This section gives two examples on how to let the DRM GPUVA Manager generate
> + * &drm_gpuva_op in order to satisfy a given map or unmap request and how to
> + * make use of them.
> + *
> + * The below code is strictly limited to illustrate the generic usage pattern.
> + * To maintain simplicitly, it doesn't make use of any abstractions for common
> + * code, different (asyncronous) stages with fence signalling critical paths,
> + * any other helpers or error handling in terms of freeing memory and dropping
> + * previously taken locks.
> + *
> + * 1) Obtain a list of &drm_gpuva_op to create a new mapping::
> + *
> + * // Allocates a new &drm_gpuva.
> + * struct drm_gpuva * driver_gpuva_alloc(void);
> + *
> + * // Typically drivers would embedd the &drm_gpuva_manager and &drm_gpuva
> + * // structure in individual driver structures and lock the dma-resv with
> + * // drm_exec or similar helpers.
> + * int driver_mapping_create(struct drm_gpuva_manager *mgr,
> + * u64 addr, u64 range,
> + * struct drm_gem_object *obj, u64 offset)
> + * {
> + * struct drm_gpuva_ops *ops;
> + * struct drm_gpuva_op *op
> + *
> + * driver_lock_va_space();
> + * ops = drm_gpuva_sm_map_ops_create(mgr, addr, range,
> + * obj, offset);
> + * if (IS_ERR(ops))
> + * return PTR_ERR(ops);
> + *
> + * drm_gpuva_for_each_op(op, ops) {
> + * struct drm_gpuva *va;
> + *
> + * switch (op->op) {
> + * case DRM_GPUVA_OP_MAP:
> + * va = driver_gpuva_alloc();
> + * if (!va)
> + * ; // unwind previous VA space updates,
> + * // free memory and unlock
> + *
> + * driver_vm_map();
> + * drm_gpuva_map(mgr, va, &op->map);
> + * drm_gpuva_link(va);
> + *
> + * break;
> + * case DRM_GPUVA_OP_REMAP: {
> + * struct drm_gpuva *prev = NULL, *next = NULL;
> + *
> + * va = op->remap.unmap->va;
> + *
> + * if (op->remap.prev) {
> + * prev = driver_gpuva_alloc();
> + * if (!prev)
> + * ; // unwind previous VA space
> + * // updates, free memory and
> + * // unlock
> + * }
> + *
> + * if (op->remap.next) {
> + * next = driver_gpuva_alloc();
> + * if (!next)
> + * ; // unwind previous VA space
> + * // updates, free memory and
> + * // unlock
> + * }
> + *
> + * driver_vm_remap();
> + * drm_gpuva_remap(prev, next, &op->remap);
> + *
> + * drm_gpuva_unlink(va);
> + * if (prev)
> + * drm_gpuva_link(prev);
> + * if (next)
> + * drm_gpuva_link(next);
> + *
> + * break;
> + * }
> + * case DRM_GPUVA_OP_UNMAP:
> + * va = op->unmap->va;
> + *
> + * driver_vm_unmap();
> + * drm_gpuva_unlink(va);
> + * drm_gpuva_unmap(&op->unmap);
> + *
> + * break;
> + * default:
> + * break;
> + * }
> + * }
> + * driver_unlock_va_space();
> + *
> + * return 0;
> + * }
> + *
> + * 2) Receive a callback for each &drm_gpuva_op to create a new mapping::
> + *
> + * struct driver_context {
> + * struct drm_gpuva_manager *mgr;
> + * struct drm_gpuva *new_va;
> + * struct drm_gpuva *prev_va;
> + * struct drm_gpuva *next_va;
> + * };
> + *
> + * // ops to pass to drm_gpuva_manager_init()
> + * static const struct drm_gpuva_fn_ops driver_gpuva_ops = {
> + * .sm_step_map = driver_gpuva_map,
> + * .sm_step_remap = driver_gpuva_remap,
> + * .sm_step_unmap = driver_gpuva_unmap,
> + * };
> + *
> + * // Typically drivers would embedd the &drm_gpuva_manager and &drm_gpuva
> + * // structure in individual driver structures and lock the dma-resv with
> + * // drm_exec or similar helpers.
> + * int driver_mapping_create(struct drm_gpuva_manager *mgr,
> + * u64 addr, u64 range,
> + * struct drm_gem_object *obj, u64 offset)
> + * {
> + * struct driver_context ctx;
> + * struct drm_gpuva_ops *ops;
> + * struct drm_gpuva_op *op;
> + * int ret = 0;
> + *
> + * ctx.mgr = mgr;
> + *
> + * ctx.new_va = kzalloc(sizeof(*ctx.new_va), GFP_KERNEL);
> + * ctx.prev_va = kzalloc(sizeof(*ctx.prev_va), GFP_KERNEL);
> + * ctx.next_va = kzalloc(sizeof(*ctx.next_va), GFP_KERNEL);
> + * if (!ctx.new_va || !ctx.prev_va || !ctx.next_va) {
> + * ret = -ENOMEM;
> + * goto out;
> + * }
> + *
> + * driver_lock_va_space();
> + * ret = drm_gpuva_sm_map(mgr, &ctx, addr, range, obj, offset);
> + * driver_unlock_va_space();
> + *
> + * out:
> + * kfree(ctx.new_va);
> + * kfree(ctx.prev_va);
> + * kfree(ctx.next_va);
> + * return ret;
> + * }
> + *
> + * int driver_gpuva_map(struct drm_gpuva_op *op, void *__ctx)
> + * {
> + * struct driver_context *ctx = __ctx;
> + *
> + * drm_gpuva_map(ctx->mgr, ctx->new_va, &op->map);
> + *
> + * drm_gpuva_link(ctx->new_va);
> + *
> + * // prevent the new GPUVA from being freed in
> + * // driver_mapping_create()
> + * ctx->new_va = NULL;
> + *
> + * return 0;
> + * }
> + *
> + * int driver_gpuva_remap(struct drm_gpuva_op *op, void *__ctx)
> + * {
> + * struct driver_context *ctx = __ctx;
> + *
> + * drm_gpuva_remap(ctx->prev_va, ctx->next_va, &op->remap);
> + *
> + * drm_gpuva_unlink(op->remap.unmap->va);
> + * kfree(op->remap.unmap->va);
> + *
> + * if (op->remap.prev) {
> + * drm_gpuva_link(ctx->prev_va);
> + * ctx->prev_va = NULL;
> + * }
> + *
> + * if (op->remap.next) {
> + * drm_gpuva_link(ctx->next_va);
> + * ctx->next_va = NULL;
> + * }
> + *
> + * return 0;
> + * }
> + *
> + * int driver_gpuva_unmap(struct drm_gpuva_op *op, void *__ctx)
> + * {
> + * drm_gpuva_unlink(op->unmap.va);
> + * drm_gpuva_unmap(&op->unmap);
> + * kfree(op->unmap.va);
> + *
> + * return 0;
> + * }
> + */
> +
> +#define to_drm_gpuva(__node) container_of((__node), struct drm_gpuva, rb.node)
> +
> +#define GPUVA_START(node) ((node)->va.addr)
> +#define GPUVA_LAST(node) ((node)->va.addr + (node)->va.range - 1)
> +
> +/* We do not actually use drm_gpuva_it_next(), tell the compiler to not complain
> + * about this.
> + */
> +INTERVAL_TREE_DEFINE(struct drm_gpuva, rb.node, u64, rb.__subtree_last,
> + GPUVA_START, GPUVA_LAST, static __attribute__((unused)),

Would  s/__attribute__((unused))/__maybe_unused/ work here?

> + drm_gpuva_it)
> +
> +static int __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
> + struct drm_gpuva *va);
> +static void __drm_gpuva_remove(struct drm_gpuva *va);
> +
> +static inline bool
"static inline" is typically used only in header files, since the
compiler should be smart enough to inline if beneficial. Prefer
"static". Also in multiple places below.
> +drm_gpuva_check_overflow(u64 addr, u64 range)
> +{
> + u64 end;
> +
> + return WARN(check_add_overflow(addr, range, &end),
> + "GPUVA address limited to %lu bytes.\n", sizeof(end));
> +}
> +
> +static inline bool
> +drm_gpuva_in_mm_range(struct drm_gpuva_manager *mgr, u64 addr, u64 range)
> +{
> + u64 end = addr + range;
> + u64 mm_start = mgr->mm_start;
> + u64 mm_end = mm_start + mgr->mm_range;
> +
> + return addr >= mm_start && end <= mm_end;
> +}
> +
> +static inline bool
> +drm_gpuva_in_kernel_node(struct drm_gpuva_manager *mgr, u64 addr, u64 range)
> +{
> + u64 end = addr + range;
> + u64 kstart = mgr->kernel_alloc_node.va.addr;
> + u64 krange = mgr->kernel_alloc_node.va.range;
> + u64 kend = kstart + krange;
> +
> + return krange && addr < kend && kstart < end;
> +}
> +
> +static inline bool
> +drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
> + u64 addr, u64 range)
> +{
> +
> + return !drm_gpuva_check_overflow(addr, range) &&
> + drm_gpuva_in_mm_range(mgr, addr, range) &&
> + !drm_gpuva_in_kernel_node(mgr, addr, range);
> +}
> +
> +/**
> + * drm_gpuva_manager_init - initialize a &drm_gpuva_manager
Function kerneldoc names should end with "()": drm_gpuva_manager_init().
General comment across the patch.
> + * @mgr: pointer to the &drm_gpuva_manager to initialize
> + * @name: the name of the GPU VA space
> + * @start_offset: the start offset of the GPU VA space
> + * @range: the size of the GPU VA space
> + * @reserve_offset: the start of the kernel reserved GPU VA area
> + * @reserve_range: the size of the kernel reserved GPU VA area
> + * @ops: &drm_gpuva_fn_ops called on &drm_gpuva_sm_map / &drm_gpuva_sm_unmap
> + * @flags: the feature flags for the &drm_gpuva_manager
> + *
> + * The &drm_gpuva_manager must be initialized with this function before use.
> + *
> + * Note that @mgr must be cleared to 0 before calling this function. The given
> + * &name is expected to be managed by the surrounding driver structures.
> + */
> +void
> +drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> + const char *name,
> + u64 start_offset, u64 range,
> + u64 reserve_offset, u64 reserve_range,
> + const struct drm_gpuva_fn_ops *ops,
> + enum drm_gpuva_manager_flags flags)
> +{
> + mgr->rb.tree = RB_ROOT_CACHED;
> + INIT_LIST_HEAD(&mgr->rb.list);
> +
> + drm_gpuva_check_overflow(start_offset, range);
> + mgr->mm_start = start_offset;
> + mgr->mm_range = range;
> +
> + mgr->name = name ? name : "unknown";
> + mgr->flags = flags;
> + mgr->ops = ops;
> +
> + memset(&mgr->kernel_alloc_node, 0, sizeof(struct drm_gpuva));
> +
> + if (reserve_range) {
> + mgr->kernel_alloc_node.va.addr = reserve_offset;
> + mgr->kernel_alloc_node.va.range = reserve_range;
> +
> + if (likely(!drm_gpuva_check_overflow(reserve_offset,
> + reserve_range)))
> + __drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
> + }
> +
> +}
> +EXPORT_SYMBOL(drm_gpuva_manager_init);
> +
> +/**
> + * drm_gpuva_manager_destroy - cleanup a &drm_gpuva_manager
> + * @mgr: pointer to the &drm_gpuva_manager to clean up
> + *
> + * Note that it is a bug to call this function on a manager that still
> + * holds GPU VA mappings.
> + */
> +void
> +drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
> +{
> + mgr->name = NULL;
> +
> + if (mgr->kernel_alloc_node.va.range)
> + __drm_gpuva_remove(&mgr->kernel_alloc_node);
> +
> + WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
> + "GPUVA tree is not empty, potentially leaking memory.");
> +}
> +EXPORT_SYMBOL(drm_gpuva_manager_destroy);
> +
> +static int
> +__drm_gpuva_insert(struct drm_gpuva_manager *mgr,
> + struct drm_gpuva *va)
> +{
> + struct rb_node *node;
> + struct list_head *head;
> +
> + if (drm_gpuva_it_iter_first(&mgr->rb.tree,
> + GPUVA_START(va),
> + GPUVA_LAST(va)))
> + return -EEXIST;
> +
> + va->mgr = mgr;
> +
> + drm_gpuva_it_insert(va, &mgr->rb.tree);
> +
> + node = rb_prev(&va->rb.node);
> + if (node)
> + head = &(to_drm_gpuva(node))->rb.entry;
> + else
> + head = &mgr->rb.list;
> +
> + list_add(&va->rb.entry, head);
> +
> + return 0;
> +}
> +
> +/**
> + * drm_gpuva_insert - insert a &drm_gpuva
> + * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in
> + * @va: the &drm_gpuva to insert
> + *
> + * Insert a &drm_gpuva with a given address and range into a
> + * &drm_gpuva_manager.
> + *
> + * It is safe to use this function using the safe versions of iterating the GPU
> + * VA space, such as drm_gpuva_for_each_va_safe() and
> + * drm_gpuva_for_each_va_range_safe().
> + *
> + * Returns: 0 on success, negative error code on failure.
> + */
> +int
> +drm_gpuva_insert(struct drm_gpuva_manager *mgr,
> + struct drm_gpuva *va)
> +{
> + u64 addr = va->va.addr;
> + u64 range = va->va.range;
> +
> + if (unlikely(!drm_gpuva_range_valid(mgr, addr, range)))
> + return -EINVAL;
> +
> + return __drm_gpuva_insert(mgr, va);
> +}
> +EXPORT_SYMBOL(drm_gpuva_insert);
> +
> +static void
> +__drm_gpuva_remove(struct drm_gpuva *va)
> +{
> + drm_gpuva_it_remove(va, &va->mgr->rb.tree);
> + list_del_init(&va->rb.entry);
> +}
> +
> +/**
> + * drm_gpuva_remove - remove a &drm_gpuva
> + * @va: the &drm_gpuva to remove
> + *
> + * This removes the given &va from the underlaying tree.
> + *
> + * It is safe to use this function using the safe versions of iterating the GPU
> + * VA space, such as drm_gpuva_for_each_va_safe() and
> + * drm_gpuva_for_each_va_range_safe().
> + */
> +void
> +drm_gpuva_remove(struct drm_gpuva *va)
> +{
> + struct drm_gpuva_manager *mgr = va->mgr;
> +
> + if (unlikely(va == &mgr->kernel_alloc_node)) {
> + WARN(1, "Can't destroy kernel reserved node.\n");
> + return;
> + }
> +
> + __drm_gpuva_remove(va);
> +}
> +EXPORT_SYMBOL(drm_gpuva_remove);
> +
> +/**
> + * drm_gpuva_link - link a &drm_gpuva
> + * @va: the &drm_gpuva to link
> + *
> + * This adds the given &va to the GPU VA list of the &drm_gem_object it is
> + * associated with.
> + *
> + * This function expects the caller to protect the GEM's GPUVA list against
> + * concurrent access using the GEMs dma_resv lock.
> + */
> +void
> +drm_gpuva_link(struct drm_gpuva *va)
> +{
> + struct drm_gpuva_manager *mgr = va->mgr;
> + struct drm_gem_object *obj = va->gem.obj;
> +
> + if (unlikely(!obj))
> + return;
> +
> + if (drm_gpuva_manager_external_lock(mgr))
> + drm_gpuva_manager_ext_assert_held(mgr);
> + else
> + dma_resv_assert_held(obj->resv);
> +
> + list_add_tail(&va->gem.entry, &obj->gpuva.list);
> +}
> +EXPORT_SYMBOL(drm_gpuva_link);
> +
> +/**
> + * drm_gpuva_unlink - unlink a &drm_gpuva
> + * @va: the &drm_gpuva to unlink
> + *
> + * This removes the given &va from the GPU VA list of the &drm_gem_object it is
> + * associated with.
> + *
> + * This function expects the caller to protect the GEM's GPUVA list against
> + * concurrent access using the GEMs dma_resv lock.
> + */
> +void
> +drm_gpuva_unlink(struct drm_gpuva *va)
> +{
> + struct drm_gpuva_manager *mgr = va->mgr;
> + struct drm_gem_object *obj = va->gem.obj;
> +
> + if (unlikely(!obj))
> + return;
> +
> + if (drm_gpuva_manager_external_lock(mgr))
> + drm_gpuva_manager_ext_assert_held(mgr);
> + else
> + dma_resv_assert_held(obj->resv);
> +
> + list_del_init(&va->gem.entry);
> +}
> +EXPORT_SYMBOL(drm_gpuva_unlink);
> +
> +/**
> + * drm_gpuva_find_first - find the first &drm_gpuva in the given range
> + * @mgr: the &drm_gpuva_manager to search in
> + * @addr: the &drm_gpuvas address
> + * @range: the &drm_gpuvas range
> + *
> + * Returns: the first &drm_gpuva within the given range
> + */
> +struct drm_gpuva *
> +drm_gpuva_find_first(struct drm_gpuva_manager *mgr,
> + u64 addr, u64 range)
> +{
> + u64 last = addr + range - 1;
> +
> + return drm_gpuva_it_iter_first(&mgr->rb.tree, addr, last);
> +}
> +EXPORT_SYMBOL(drm_gpuva_find_first);
> +
> +/**
> + * drm_gpuva_find - find a &drm_gpuva
> + * @mgr: the &drm_gpuva_manager to search in
> + * @addr: the &drm_gpuvas address
> + * @range: the &drm_gpuvas range
> + *
> + * Returns: the &drm_gpuva at a given &addr and with a given &range
> + */
> +struct drm_gpuva *
> +drm_gpuva_find(struct drm_gpuva_manager *mgr,
> + u64 addr, u64 range)
> +{
> + struct drm_gpuva *va;
> +
> + va = drm_gpuva_find_first(mgr, addr, range);
> + if (!va)
> + goto out;
> +
> + if (va->va.addr != addr ||
> + va->va.range != range)
> + goto out;
> +
> + return va;
> +
> +out:
> + return NULL;
> +}
> +EXPORT_SYMBOL(drm_gpuva_find);
> +
> +/**
> + * drm_gpuva_find_prev - find the &drm_gpuva before the given address
> + * @mgr: the &drm_gpuva_manager to search in
> + * @start: the given GPU VA's start address
> + *
> + * Find the adjacent &drm_gpuva before the GPU VA with given &start address.
> + *
> + * Note that if there is any free space between the GPU VA mappings no mapping
> + * is returned.
> + *
> + * Returns: a pointer to the found &drm_gpuva or NULL if none was found
> + */
> +struct drm_gpuva *
> +drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start)
> +{
> + if (!drm_gpuva_range_valid(mgr, start - 1, 1))
> + return NULL;
> +
> + return drm_gpuva_it_iter_first(&mgr->rb.tree, start - 1, start);
> +}
> +EXPORT_SYMBOL(drm_gpuva_find_prev);
> +
> +/**
> + * drm_gpuva_find_next - find the &drm_gpuva after the given address
> + * @mgr: the &drm_gpuva_manager to search in
> + * @end: the given GPU VA's end address
> + *
> + * Find the adjacent &drm_gpuva after the GPU VA with given &end address.
> + *
> + * Note that if there is any free space between the GPU VA mappings no mapping
> + * is returned.
> + *
> + * Returns: a pointer to the found &drm_gpuva or NULL if none was found
> + */
> +struct drm_gpuva *
> +drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end)
> +{
> + if (!drm_gpuva_range_valid(mgr, end, 1))
> + return NULL;
> +
> + return drm_gpuva_it_iter_first(&mgr->rb.tree, end, end + 1);
> +}
> +EXPORT_SYMBOL(drm_gpuva_find_next);
> +
> +/**
> + * drm_gpuva_interval_empty - indicate whether a given interval of the VA space
> + * is empty
> + * @mgr: the &drm_gpuva_manager to check the range for
> + * @addr: the start address of the range
> + * @range: the range of the interval
> + *
> + * Returns: true if the interval is empty, false otherwise
> + */
> +bool
> +drm_gpuva_interval_empty(struct drm_gpuva_manager *mgr, u64 addr, u64 range)
> +{
> + return !drm_gpuva_find_first(mgr, addr, range);
> +}
> +EXPORT_SYMBOL(drm_gpuva_interval_empty);
> +
> +/**
> + * drm_gpuva_map - helper to insert a &drm_gpuva according to a
> + * &drm_gpuva_op_map
> + * @mgr: the &drm_gpuva_manager
> + * @va: the &drm_gpuva to insert
> + * @op: the &drm_gpuva_op_map to initialize @va with
> + *
> + * Initializes the @va from the @op and inserts it into the given @mgr.
> + */
> +void
> +drm_gpuva_map(struct drm_gpuva_manager *mgr,
> + struct drm_gpuva *va,
> + struct drm_gpuva_op_map *op)
> +{
> + drm_gpuva_init_from_op(va, op);
> + drm_gpuva_insert(mgr, va);
> +}
> +EXPORT_SYMBOL(drm_gpuva_map);
> +
> +/**
> + * drm_gpuva_remap - helper to remap a &drm_gpuva according to a
> + * &drm_gpuva_op_remap
> + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
> + * @next: the &drm_gpuva to remap when keeping the end of a mapping
> + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
> + *
> + * Removes the currently mapped &drm_gpuva and remaps it using @prev and/or
> + * @next.
> + */
> +void
> +drm_gpuva_remap(struct drm_gpuva *prev,
> + struct drm_gpuva *next,
> + struct drm_gpuva_op_remap *op)
> +{
> + struct drm_gpuva *curr = op->unmap->va;
> + struct drm_gpuva_manager *mgr = curr->mgr;
> + struct drm_gpuva_op_map *map;
> +
> + drm_gpuva_remove(curr);
> +
> + if ((map = op->prev)) {
> + drm_gpuva_init_from_op(prev, map);
> + drm_gpuva_insert(mgr, prev);
> + }
> +
> + if ((map = op->next)) {
> + drm_gpuva_init_from_op(next, map);
> + drm_gpuva_insert(mgr, next);
> + }
> +}
> +EXPORT_SYMBOL(drm_gpuva_remap);
> +
> +/**
> + * drm_gpuva_unmap - helper to remove a &drm_gpuva according to a
> + * &drm_gpuva_op_unmap
> + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
> + *
> + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap.
> + */
> +void
> +drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
> +{
> + drm_gpuva_remove(op->va);
> +}
> +EXPORT_SYMBOL(drm_gpuva_unmap);
> +
> +static int
> +op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
> + u64 addr, u64 range,
> + struct drm_gem_object *obj, u64 offset)
> +{
> + struct drm_gpuva_op op = {};
> +
> + op.op = DRM_GPUVA_OP_MAP;
> + op.map.va.addr = addr;
> + op.map.va.range = range;
> + op.map.gem.obj = obj;
> + op.map.gem.offset = offset;
> +
> + return fn->sm_step_map(&op, priv);
> +}
> +
> +static int
> +op_remap_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
> + struct drm_gpuva_op_map *prev,
> + struct drm_gpuva_op_map *next,
> + struct drm_gpuva_op_unmap *unmap)
> +{
> + struct drm_gpuva_op op = {};
> + struct drm_gpuva_op_remap *r;
> +
> + op.op = DRM_GPUVA_OP_REMAP;
> + r = &op.remap;
> + r->prev = prev;
> + r->next = next;
> + r->unmap = unmap;
> +
> + return fn->sm_step_remap(&op, priv);
> +}
> +
> +static int
> +op_unmap_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
> + struct drm_gpuva *va, bool merge)
> +{
> + struct drm_gpuva_op op = {};
> +
> + op.op = DRM_GPUVA_OP_UNMAP;
> + op.unmap.va = va;
> + op.unmap.keep = merge;
> +
> + return fn->sm_step_unmap(&op, priv);
> +}
> +
> +static int
> +__drm_gpuva_sm_map(struct drm_gpuva_manager *mgr,
> + const struct drm_gpuva_fn_ops *ops, void *priv,
> + u64 req_addr, u64 req_range,
> + struct drm_gem_object *req_obj, u64 req_offset)
> +{
> + struct drm_gpuva *va, *next, *prev = NULL;
> + u64 req_end = req_addr + req_range;
> + int ret;
> +
> + if (unlikely(!drm_gpuva_range_valid(mgr, req_addr, req_range)))
> + return -EINVAL;
> +
> + drm_gpuva_for_each_va_range_safe(va, next, mgr, req_addr, req_end) {
> + struct drm_gem_object *obj = va->gem.obj;
> + u64 offset = va->gem.offset;
> + u64 addr = va->va.addr;
> + u64 range = va->va.range;
> + u64 end = addr + range;
> + bool merge = !!va->gem.obj;
> +
> + if (addr == req_addr) {
> + merge &= obj == req_obj &&
> + offset == req_offset;
> +
> + if (end == req_end) {
> + ret = op_unmap_cb(ops, priv, va, merge);
> + if (ret)
> + return ret;
> + break;
> + }
> +
> + if (end < req_end) {
> + ret = op_unmap_cb(ops, priv, va, merge);
> + if (ret)
> + return ret;
> + goto next;
> + }
> +
> + if (end > req_end) {
> + struct drm_gpuva_op_map n = {
> + .va.addr = req_end,
> + .va.range = range - req_range,
> + .gem.obj = obj,
> + .gem.offset = offset + req_range,
> + };
> + struct drm_gpuva_op_unmap u = {
> + .va = va,
> + .keep = merge,
> + };
> +
> + ret = op_remap_cb(ops, priv, NULL, &n, &u);
> + if (ret)
> + return ret;
> + break;
> + }
> + } else if (addr < req_addr) {
> + u64 ls_range = req_addr - addr;
> + struct drm_gpuva_op_map p = {
> + .va.addr = addr,
> + .va.range = ls_range,
> + .gem.obj = obj,
> + .gem.offset = offset,
> + };
> + struct drm_gpuva_op_unmap u = { .va = va };
> +
> + merge &= obj == req_obj &&
> + offset + ls_range == req_offset;
> + u.keep = merge;
> +
> + if (end == req_end) {
> + ret = op_remap_cb(ops, priv, &p, NULL, &u);
> + if (ret)
> + return ret;
> + break;
> + }
> +
> + if (end < req_end) {
> + ret = op_remap_cb(ops, priv, &p, NULL, &u);
> + if (ret)
> + return ret;
> + goto next;
> + }
> +
> + if (end > req_end) {
> + struct drm_gpuva_op_map n = {
> + .va.addr = req_end,
> + .va.range = end - req_end,
> + .gem.obj = obj,
> + .gem.offset = offset + ls_range +
> + req_range,
> + };
> +
> + ret = op_remap_cb(ops, priv, &p, &n, &u);
> + if (ret)
> + return ret;
> + break;
> + }
> + } else if (addr > req_addr) {
> + merge &= obj == req_obj &&
> + offset == req_offset +
> + (addr - req_addr);
> +
> + if (end == req_end) {
> + ret = op_unmap_cb(ops, priv, va, merge);
> + if (ret)
> + return ret;
> + break;
> + }
> +
> + if (end < req_end) {
> + ret = op_unmap_cb(ops, priv, va, merge);
> + if (ret)
> + return ret;
> + goto next;
> + }
> +
> + if (end > req_end) {
> + struct drm_gpuva_op_map n = {
> + .va.addr = req_end,
> + .va.range = end - req_end,
> + .gem.obj = obj,
> + .gem.offset = offset + req_end - addr,
> + };
> + struct drm_gpuva_op_unmap u = {
> + .va = va,
> + .keep = merge,
> + };
> +
> + ret = op_remap_cb(ops, priv, NULL, &n, &u);
> + if (ret)
> + return ret;
> + break;
> + }
> + }
> +next:
> + prev = va;
> + }
> +
> + return op_map_cb(ops, priv,
> + req_addr, req_range,
> + req_obj, req_offset);
> +}
> +
> +static int
> +__drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr,
> + const struct drm_gpuva_fn_ops *ops, void *priv,
> + u64 req_addr, u64 req_range)
> +{
> + struct drm_gpuva *va, *next;
> + u64 req_end = req_addr + req_range;
> + int ret;
> +
> + if (unlikely(!drm_gpuva_range_valid(mgr, req_addr, req_range)))
> + return -EINVAL;
> +
> + drm_gpuva_for_each_va_range_safe(va, next, mgr, req_addr, req_end) {
> + struct drm_gpuva_op_map prev = {}, next = {};
> + bool prev_split = false, next_split = false;
> + struct drm_gem_object *obj = va->gem.obj;
> + u64 offset = va->gem.offset;
> + u64 addr = va->va.addr;
> + u64 range = va->va.range;
> + u64 end = addr + range;
> +
> + if (addr < req_addr) {
> + prev.va.addr = addr;
> + prev.va.range = req_addr - addr;
> + prev.gem.obj = obj;
> + prev.gem.offset = offset;
> +
> + prev_split = true;
> + }
> +
> + if (end > req_end) {
> + next.va.addr = req_end;
> + next.va.range = end - req_end;
> + next.gem.obj = obj;
> + next.gem.offset = offset + (req_end - addr);
> +
> + next_split = true;
> + }
> +
> + if (prev_split || next_split) {
> + struct drm_gpuva_op_unmap unmap = { .va = va };
> +
> + ret = op_remap_cb(ops, priv,
> + prev_split ? &prev : NULL,
> + next_split ? &next : NULL,
> + &unmap);
> + if (ret)
> + return ret;
> + } else {
> + ret = op_unmap_cb(ops, priv, va, false);
> + if (ret)
> + return ret;
> + }
> + }
> +
> + return 0;
> +}
> +
> +/**
> + * drm_gpuva_sm_map - creates the &drm_gpuva_op split/merge steps
> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
> + * @req_addr: the start address of the new mapping
> + * @req_range: the range of the new mapping
> + * @req_obj: the &drm_gem_object to map
> + * @req_offset: the offset within the &drm_gem_object
> + * @priv: pointer to a driver private data structure
> + *
> + * This function iterates the given range of the GPU VA space. It utilizes the
> + * &drm_gpuva_fn_ops to call back into the driver providing the split and merge
> + * steps.
> + *
> + * Drivers may use these callbacks to update the GPU VA space right away within
> + * the callback. In case the driver decides to copy and store the operations for
> + * later processing neither this function nor &drm_gpuva_sm_unmap is allowed to
> + * be called before the &drm_gpuva_manager's view of the GPU VA space was
> + * updated with the previous set of operations. To update the
> + * &drm_gpuva_manager's view of the GPU VA space drm_gpuva_insert(),
> + * drm_gpuva_destroy_locked() and/or drm_gpuva_destroy_unlocked() should be
> + * used.
> + *
> + * A sequence of callbacks can contain map, unmap and remap operations, but
> + * the sequence of callbacks might also be empty if no operation is required,
> + * e.g. if the requested mapping already exists in the exact same way.
> + *
> + * There can be an arbitrary amount of unmap operations, a maximum of two remap
> + * operations and a single map operation. The latter one represents the original
> + * map operation requested by the caller.
> + *
> + * Returns: 0 on success or a negative error code
> + */
> +int
> +drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
> + u64 req_addr, u64 req_range,
> + struct drm_gem_object *req_obj, u64 req_offset)
> +{
> + const struct drm_gpuva_fn_ops *ops = mgr->ops;
> +
> + if (unlikely(!(ops && ops->sm_step_map &&
> + ops->sm_step_remap &&
> + ops->sm_step_unmap)))
> + return -EINVAL;
> +
> + return __drm_gpuva_sm_map(mgr, ops, priv,
> + req_addr, req_range,
> + req_obj, req_offset);
> +}
> +EXPORT_SYMBOL(drm_gpuva_sm_map);
> +
> +/**
> + * drm_gpuva_sm_unmap - creates the &drm_gpuva_ops to split on unmap
> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
> + * @priv: pointer to a driver private data structure
> + * @req_addr: the start address of the range to unmap
> + * @req_range: the range of the mappings to unmap
> + *
> + * This function iterates the given range of the GPU VA space. It utilizes the
> + * &drm_gpuva_fn_ops to call back into the driver providing the operations to
> + * unmap and, if required, split existent mappings.
> + *
> + * Drivers may use these callbacks to update the GPU VA space right away within
> + * the callback. In case the driver decides to copy and store the operations for
> + * later processing neither this function nor &drm_gpuva_sm_map is allowed to be
> + * called before the &drm_gpuva_manager's view of the GPU VA space was updated
> + * with the previous set of operations. To update the &drm_gpuva_manager's view
> + * of the GPU VA space drm_gpuva_insert(), drm_gpuva_destroy_locked() and/or
> + * drm_gpuva_destroy_unlocked() should be used.
> + *
> + * A sequence of callbacks can contain unmap and remap operations, depending on
> + * whether there are actual overlapping mappings to split.
> + *
> + * There can be an arbitrary amount of unmap operations and a maximum of two
> + * remap operations.
> + *
> + * Returns: 0 on success or a negative error code
> + */
> +int
> +drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
> + u64 req_addr, u64 req_range)
> +{
> + const struct drm_gpuva_fn_ops *ops = mgr->ops;
> +
> + if (unlikely(!(ops && ops->sm_step_remap &&
> + ops->sm_step_unmap)))
> + return -EINVAL;
> +
> + return __drm_gpuva_sm_unmap(mgr, ops, priv,
> + req_addr, req_range);
> +}
> +EXPORT_SYMBOL(drm_gpuva_sm_unmap);
> +
> +static struct drm_gpuva_op *
> +gpuva_op_alloc(struct drm_gpuva_manager *mgr)
> +{
> + const struct drm_gpuva_fn_ops *fn = mgr->ops;
> + struct drm_gpuva_op *op;
> +
> + if (fn && fn->op_alloc)
> + op = fn->op_alloc();
> + else
> + op = kzalloc(sizeof(*op), GFP_KERNEL);
> +
> + if (unlikely(!op))
> + return NULL;
> +
> + return op;
> +}
> +
> +static void
> +gpuva_op_free(struct drm_gpuva_manager *mgr,
> + struct drm_gpuva_op *op)
> +{
> + const struct drm_gpuva_fn_ops *fn = mgr->ops;
> +
> + if (fn && fn->op_free)
> + fn->op_free(op);
> + else
> + kfree(op);
> +}
> +
> +static int
> +drm_gpuva_sm_step(struct drm_gpuva_op *__op,
> + void *priv)
> +{
> + struct {
> + struct drm_gpuva_manager *mgr;
> + struct drm_gpuva_ops *ops;
> + } *args = priv;
> + struct drm_gpuva_manager *mgr = args->mgr;
> + struct drm_gpuva_ops *ops = args->ops;
> + struct drm_gpuva_op *op;
> +
> + op = gpuva_op_alloc(mgr);
> + if (unlikely(!op))
> + goto err;
> +
> + memcpy(op, __op, sizeof(*op));
> +
> + if (op->op == DRM_GPUVA_OP_REMAP) {
> + struct drm_gpuva_op_remap *__r = &__op->remap;
> + struct drm_gpuva_op_remap *r = &op->remap;
> +
> + r->unmap = kmemdup(__r->unmap, sizeof(*r->unmap),
> + GFP_KERNEL);
> + if (unlikely(!r->unmap))
> + goto err_free_op;
> +
> + if (__r->prev) {
> + r->prev = kmemdup(__r->prev, sizeof(*r->prev),
> + GFP_KERNEL);
> + if (unlikely(!r->prev))
> + goto err_free_unmap;
> + }
> +
> + if (__r->next) {
> + r->next = kmemdup(__r->next, sizeof(*r->next),
> + GFP_KERNEL);
> + if (unlikely(!r->next))
> + goto err_free_prev;
> + }
> + }
> +
> + list_add_tail(&op->entry, &ops->list);
> +
> + return 0;
> +
> +err_free_unmap:
> + kfree(op->remap.unmap);
> +err_free_prev:
> + kfree(op->remap.prev);
> +err_free_op:
> + gpuva_op_free(mgr, op);
> +err:
> + return -ENOMEM;
> +}
> +
> +static const struct drm_gpuva_fn_ops gpuva_list_ops = {
> + .sm_step_map = drm_gpuva_sm_step,
> + .sm_step_remap = drm_gpuva_sm_step,
> + .sm_step_unmap = drm_gpuva_sm_step,
> +};
> +
> +/**
> + * drm_gpuva_sm_map_ops_create - creates the &drm_gpuva_ops to split and merge
> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
> + * @req_addr: the start address of the new mapping
> + * @req_range: the range of the new mapping
> + * @req_obj: the &drm_gem_object to map
> + * @req_offset: the offset within the &drm_gem_object
> + *
> + * This function creates a list of operations to perform splitting and merging
> + * of existent mapping(s) with the newly requested one.
> + *
> + * The list can be iterated with &drm_gpuva_for_each_op and must be processed
> + * in the given order. It can contain map, unmap and remap operations, but it
> + * also can be empty if no operation is required, e.g. if the requested mapping
> + * already exists is the exact same way.
> + *
> + * There can be an arbitrary amount of unmap operations, a maximum of two remap
> + * operations and a single map operation. The latter one represents the original
> + * map operation requested by the caller.
> + *
> + * Note that before calling this function again with another mapping request it
> + * is necessary to update the &drm_gpuva_manager's view of the GPU VA space. The
> + * previously obtained operations must be either processed or abandoned. To
> + * update the &drm_gpuva_manager's view of the GPU VA space drm_gpuva_insert(),
> + * drm_gpuva_destroy_locked() and/or drm_gpuva_destroy_unlocked() should be
> + * used.
> + *
> + * After the caller finished processing the returned &drm_gpuva_ops, they must
> + * be freed with &drm_gpuva_ops_free.
> + *
> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure
> + */
> +struct drm_gpuva_ops *
> +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr,
> + u64 req_addr, u64 req_range,
> + struct drm_gem_object *req_obj, u64 req_offset)
> +{
> + struct drm_gpuva_ops *ops;
> + struct {
> + struct drm_gpuva_manager *mgr;
> + struct drm_gpuva_ops *ops;
> + } args;
> + int ret;
> +
> + ops = kzalloc(sizeof(*ops), GFP_KERNEL);
> + if (unlikely(!ops))
> + return ERR_PTR(-ENOMEM);
> +
> + INIT_LIST_HEAD(&ops->list);
> +
> + args.mgr = mgr;
> + args.ops = ops;
> +
> + ret = __drm_gpuva_sm_map(mgr, &gpuva_list_ops, &args,
> + req_addr, req_range,
> + req_obj, req_offset);
> + if (ret)
> + goto err_free_ops;
> +
> + return ops;
> +
> +err_free_ops:
> + drm_gpuva_ops_free(mgr, ops);
> + return ERR_PTR(ret);
> +}
> +EXPORT_SYMBOL(drm_gpuva_sm_map_ops_create);
> +
> +/**
> + * drm_gpuva_sm_unmap_ops_create - creates the &drm_gpuva_ops to split on unmap
> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
> + * @req_addr: the start address of the range to unmap
> + * @req_range: the range of the mappings to unmap
> + *
> + * This function creates a list of operations to perform unmapping and, if
> + * required, splitting of the mappings overlapping the unmap range.
> + *
> + * The list can be iterated with &drm_gpuva_for_each_op and must be processed
> + * in the given order. It can contain unmap and remap operations, depending on
> + * whether there are actual overlapping mappings to split.
> + *
> + * There can be an arbitrary amount of unmap operations and a maximum of two
> + * remap operations.
> + *
> + * Note that before calling this function again with another range to unmap it
> + * is necessary to update the &drm_gpuva_manager's view of the GPU VA space. The
> + * previously obtained operations must be processed or abandoned. To update the
> + * &drm_gpuva_manager's view of the GPU VA space drm_gpuva_insert(),
> + * drm_gpuva_destroy_locked() and/or drm_gpuva_destroy_unlocked() should be
> + * used.
> + *
> + * After the caller finished processing the returned &drm_gpuva_ops, they must
> + * be freed with &drm_gpuva_ops_free.
> + *
> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure
> + */
> +struct drm_gpuva_ops *
> +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr,
> + u64 req_addr, u64 req_range)
> +{
> + struct drm_gpuva_ops *ops;
> + struct {
> + struct drm_gpuva_manager *mgr;
> + struct drm_gpuva_ops *ops;
> + } args;
> + int ret;
> +
> + ops = kzalloc(sizeof(*ops), GFP_KERNEL);
> + if (unlikely(!ops))
> + return ERR_PTR(-ENOMEM);
> +
> + INIT_LIST_HEAD(&ops->list);
> +
> + args.mgr = mgr;
> + args.ops = ops;
> +
> + ret = __drm_gpuva_sm_unmap(mgr, &gpuva_list_ops, &args,
> + req_addr, req_range);
> + if (ret)
> + goto err_free_ops;
> +
> + return ops;
> +
> +err_free_ops:
> + drm_gpuva_ops_free(mgr, ops);
> + return ERR_PTR(ret);
> +}
> +EXPORT_SYMBOL(drm_gpuva_sm_unmap_ops_create);
> +
> +/**
> + * drm_gpuva_prefetch_ops_create - creates the &drm_gpuva_ops to prefetch
> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
> + * @addr: the start address of the range to prefetch
> + * @range: the range of the mappings to prefetch
> + *
> + * This function creates a list of operations to perform prefetching.
> + *
> + * The list can be iterated with &drm_gpuva_for_each_op and must be processed
> + * in the given order. It can contain prefetch operations.
> + *
> + * There can be an arbitrary amount of prefetch operations.
> + *
> + * After the caller finished processing the returned &drm_gpuva_ops, they must
> + * be freed with &drm_gpuva_ops_free.
> + *
> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure
> + */
> +struct drm_gpuva_ops *
> +drm_gpuva_prefetch_ops_create(struct drm_gpuva_manager *mgr,
> + u64 addr, u64 range)
> +{
> + struct drm_gpuva_ops *ops;
> + struct drm_gpuva_op *op;
> + struct drm_gpuva *va;
> + u64 end = addr + range;
> + int ret;
> +
> + ops = kzalloc(sizeof(*ops), GFP_KERNEL);
> + if (!ops)
> + return ERR_PTR(-ENOMEM);
> +
> + INIT_LIST_HEAD(&ops->list);
> +
> + drm_gpuva_for_each_va_range(va, mgr, addr, end) {
> + op = gpuva_op_alloc(mgr);
> + if (!op) {
> + ret = -ENOMEM;
> + goto err_free_ops;
> + }
> +
> + op->op = DRM_GPUVA_OP_PREFETCH;
> + op->prefetch.va = va;
> + list_add_tail(&op->entry, &ops->list);
> + }
> +
> + return ops;
> +
> +err_free_ops:
> + drm_gpuva_ops_free(mgr, ops);
> + return ERR_PTR(ret);
> +}
> +EXPORT_SYMBOL(drm_gpuva_prefetch_ops_create);
> +
> +/**
> + * drm_gpuva_gem_unmap_ops_create - creates the &drm_gpuva_ops to unmap a GEM
> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
> + * @obj: the &drm_gem_object to unmap
> + *
> + * This function creates a list of operations to perform unmapping for every
> + * GPUVA attached to a GEM.
> + *
> + * The list can be iterated with &drm_gpuva_for_each_op and consists out of an
> + * arbitrary amount of unmap operations.
> + *
> + * After the caller finished processing the returned &drm_gpuva_ops, they must
> + * be freed with &drm_gpuva_ops_free.
> + *
> + * It is the callers responsibility to protect the GEMs GPUVA list against
> + * concurrent access using the GEMs dma_resv lock.
> + *
> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on failure
> + */
> +struct drm_gpuva_ops *
> +drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> + struct drm_gem_object *obj)
> +{
> + struct drm_gpuva_ops *ops;
> + struct drm_gpuva_op *op;
> + struct drm_gpuva *va;
> + int ret;
> +
> + if (drm_gpuva_manager_external_lock(mgr))
> + drm_gpuva_manager_ext_assert_held(mgr);
> + else
> + dma_resv_assert_held(obj->resv);
> +
> + ops = kzalloc(sizeof(*ops), GFP_KERNEL);
> + if (!ops)
> + return ERR_PTR(-ENOMEM);
> +
> + INIT_LIST_HEAD(&ops->list);
> +
> + drm_gem_for_each_gpuva(va, obj) {
> + op = gpuva_op_alloc(mgr);
> + if (!op) {
> + ret = -ENOMEM;
> + goto err_free_ops;
> + }
> +
> + op->op = DRM_GPUVA_OP_UNMAP;
> + op->unmap.va = va;
> + list_add_tail(&op->entry, &ops->list);
> + }
> +
> + return ops;
> +
> +err_free_ops:
> + drm_gpuva_ops_free(mgr, ops);
> + return ERR_PTR(ret);
> +}
> +EXPORT_SYMBOL(drm_gpuva_gem_unmap_ops_create);
> +
> +
> +/**
> + * drm_gpuva_ops_free - free the given &drm_gpuva_ops
> + * @mgr: the &drm_gpuva_manager the ops were created for
> + * @ops: the &drm_gpuva_ops to free
> + *
> + * Frees the given &drm_gpuva_ops structure including all the ops associated
> + * with it.
> + */
> +void
> +drm_gpuva_ops_free(struct drm_gpuva_manager *mgr,
> + struct drm_gpuva_ops *ops)
> +{
> + struct drm_gpuva_op *op, *next;
> +
> + drm_gpuva_for_each_op_safe(op, next, ops) {
> + list_del(&op->entry);
> +
> + if (op->op == DRM_GPUVA_OP_REMAP) {
> + kfree(op->remap.prev);
> + kfree(op->remap.next);
> + kfree(op->remap.unmap);
> + }
> +
> + gpuva_op_free(mgr, op);
> + }
> +
> + kfree(ops);
> +}
> +EXPORT_SYMBOL(drm_gpuva_ops_free);
> diff --git a/include/drm/drm_drv.h b/include/drm/drm_drv.h
> index 89e2706cac56..04dbe223b1a5 100644
> --- a/include/drm/drm_drv.h
> +++ b/include/drm/drm_drv.h
> @@ -104,6 +104,12 @@ enum drm_driver_feature {
> * acceleration should be handled by two drivers that are connected using auxiliary bus.
> */
> DRIVER_COMPUTE_ACCEL = BIT(7),
> + /**
> + * @DRIVER_GEM_GPUVA:
> + *
> + * Driver supports user defined GPU VA bindings for GEM objects.
> + */
> + DRIVER_GEM_GPUVA = BIT(8),
>
> /* IMPORTANT: Below are all the legacy flags, add new ones above. */
>
> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
> index bbc721870c13..5ec8148a30ee 100644
> --- a/include/drm/drm_gem.h
> +++ b/include/drm/drm_gem.h
> @@ -36,6 +36,8 @@
>
> #include <linux/kref.h>
> #include <linux/dma-resv.h>
> +#include <linux/list.h>
> +#include <linux/mutex.h>
>
> #include <drm/drm_vma_manager.h>
>
> @@ -379,6 +381,18 @@ struct drm_gem_object {
> */
> struct dma_resv _resv;
>
> + /**
> + * @gpuva:
> + *
> + * Provides the list of GPU VAs attached to this GEM object.
> + *
> + * Drivers should lock list accesses with the GEMs &dma_resv lock
> + * (&drm_gem_object.resv).
> + */
> + struct {
> + struct list_head list;
> + } gpuva;
> +
> /**
> * @funcs:
> *
> @@ -526,4 +540,42 @@ unsigned long drm_gem_lru_scan(struct drm_gem_lru *lru,
>
> int drm_gem_evict(struct drm_gem_object *obj);
>
> +/**
> + * drm_gem_gpuva_init - initialize the gpuva list of a GEM object
> + * @obj: the &drm_gem_object
> + *
> + * This initializes the &drm_gem_object's &drm_gpuva list.
> + *
> + * Calling this function is only necessary for drivers intending to support the
> + * &drm_driver_feature DRIVER_GEM_GPUVA.
> + */
> +static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
> +{
> + INIT_LIST_HEAD(&obj->gpuva.list);
> +}
> +
> +/**
> + * drm_gem_for_each_gpuva - iternator to walk over a list of gpuvas

s/iternator/iterator/. (multiple places) Although since "iterator"
typically refers to an object being iterated over, perhaps

"drm_gem_for_each_gpuva() - iterate over a list of gpuvas", (general
comment across the patch).

> + * @entry: &drm_gpuva structure to assign to in each iteration step
> + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the
> + * &drm_gpuva_manager.
> + */
> +#define drm_gem_for_each_gpuva(entry__, obj__) \
> + list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
> +
> +/**
> + * drm_gem_for_each_gpuva_safe - iternator to safely walk over a list of gpuvas
> + * @entry: &drm_gpuva structure to assign to in each iteration step
> + * @next: &next &drm_gpuva to store the next step
> + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the
> + * &drm_gem_object. It is implemented with list_for_each_entry_safe(), hence
> + * it is save against removal of elements.
> + */
> +#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
> + list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list, gem.entry)
> +
> #endif /* __DRM_GEM_H__ */
> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> new file mode 100644
> index 000000000000..4f23aaf726dd
> --- /dev/null
> +++ b/include/drm/drm_gpuva_mgr.h
> @@ -0,0 +1,756 @@
> +/* SPDX-License-Identifier: GPL-2.0 */
> +
> +#ifndef __DRM_GPUVA_MGR_H__
> +#define __DRM_GPUVA_MGR_H__
> +
> +/*
> + * Copyright (c) 2022 Red Hat.
> + *
> + * Permission is hereby granted, free of charge, to any person obtaining a
> + * copy of this software and associated documentation files (the "Software"),
> + * to deal in the Software without restriction, including without limitation
> + * the rights to use, copy, modify, merge, publish, distribute, sublicense,
> + * and/or sell copies of the Software, and to permit persons to whom the
> + * Software is furnished to do so, subject to the following conditions:
> + *
> + * The above copyright notice and this permission notice shall be included in
> + * all copies or substantial portions of the Software.
> + *
> + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
> + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
> + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL
> + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM, DAMAGES OR
> + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE,
> + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR
> + * OTHER DEALINGS IN THE SOFTWARE.
> + */
> +
> +#include <linux/list.h>
> +#include <linux/rbtree.h>
> +#include <linux/types.h>
> +
> +#include <drm/drm_gem.h>
> +
> +struct drm_gpuva_manager;
> +struct drm_gpuva_fn_ops;
> +
> +/**
> + * enum drm_gpuva_flags - flags for struct drm_gpuva
> + */
> +enum drm_gpuva_flags {
> + /**
> + * @DRM_GPUVA_INVALIDATED:
> + *
> + * Flag indicating that the &drm_gpuva's backing GEM is invalidated.
> + */
> + DRM_GPUVA_INVALIDATED = (1 << 0),
> +
> + /**
> + * @DRM_GPUVA_SPARSE:
> + *
> + * Flag indicating that the &drm_gpuva is a sparse mapping.
> + */
> + DRM_GPUVA_SPARSE = (1 << 1),
> +
> + /**
> + * @DRM_GPUVA_USERBITS: user defined bits
> + */
> + DRM_GPUVA_USERBITS = (1 << 2),
> +};
> +
> +/**
> + * struct drm_gpuva - structure to track a GPU VA mapping
> + *
> + * This structure represents a GPU VA mapping and is associated with a
> + * &drm_gpuva_manager.
> + *
> + * Typically, this structure is embedded in bigger driver structures.
> + */
> +struct drm_gpuva {
> + /**
> + * @mgr: the &drm_gpuva_manager this object is associated with
> + */
> + struct drm_gpuva_manager *mgr;
> +
> + /**
> + * @flags: the &drm_gpuva_flags for this mapping
> + */
> + enum drm_gpuva_flags flags;
> +
> + /**
> + * @va: structure containing the address and range of the &drm_gpuva
> + */
> + struct {
> + /**
> + * @addr: the start address
> + */
> + u64 addr;
> +
> + /*
> + * @range: the range
> + */
> + u64 range;
> + } va;
> +
> + /**
> + * @gem: structure containing the &drm_gem_object and it's offset
> + */
> + struct {
> + /**
> + * @offset: the offset within the &drm_gem_object
> + */
> + u64 offset;
> +
> + /**
> + * @obj: the mapped &drm_gem_object
> + */
> + struct drm_gem_object *obj;
> +
> + /**
> + * @entry: the &list_head to attach this object to a &drm_gem_object
> + */
> + struct list_head entry;
> + } gem;
> +
> + /**
> + * @rb: structure containing data to store &drm_gpuvas in a rb-tree
> + */
> + struct {
> + /**
> + * @rb: the rb-tree node
> + */
> + struct rb_node node;
> +
> + /**
> + * @entry: The &list_head to additionally connect &drm_gpuvas
> + * in the same order they appear in the interval tree. This is
> + * useful to keep iterating &drm_gpuvas from a start node found
> + * through the rb-tree while doing modifications on the rb-tree
> + * itself.
> + */
> + struct list_head entry;
> +
> + /**
> + * @__subtree_last: needed by the interval tree, holding last-in-subtree
> + */
> + u64 __subtree_last;
> + } rb;
> +};
> +
> +int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva *va);
> +void drm_gpuva_remove(struct drm_gpuva *va);
> +
> +void drm_gpuva_link(struct drm_gpuva *va);
> +void drm_gpuva_unlink(struct drm_gpuva *va);
> +
> +struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
> + u64 addr, u64 range);
> +struct drm_gpuva *drm_gpuva_find_first(struct drm_gpuva_manager *mgr,
> + u64 addr, u64 range);
> +struct drm_gpuva *drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start);
> +struct drm_gpuva *drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end);
> +
> +bool drm_gpuva_interval_empty(struct drm_gpuva_manager *mgr, u64 addr, u64 range);
> +
> +static inline void drm_gpuva_init(struct drm_gpuva *va, u64 addr, u64 range,
> + struct drm_gem_object *obj, u64 offset)
> +{
> + va->va.addr = addr;
> + va->va.range = range;
> + va->gem.obj = obj;
> + va->gem.offset = offset;
> +}
> +
> +/**
> + * drm_gpuva_invalidate - sets whether the backing GEM of this &drm_gpuva is
> + * invalidated
> + * @va: the &drm_gpuva to set the invalidate flag for
> + * @invalidate: indicates whether the &drm_gpuva is invalidated
> + */
> +static inline void drm_gpuva_invalidate(struct drm_gpuva *va, bool invalidate)
> +{
> + if (invalidate)
> + va->flags |= DRM_GPUVA_INVALIDATED;
> + else
> + va->flags &= ~DRM_GPUVA_INVALIDATED;
> +}
> +
> +/**
> + * drm_gpuva_invalidated - indicates whether the backing BO of this &drm_gpuva
> + * is invalidated
> + * @va: the &drm_gpuva to check
> + */
> +static inline bool drm_gpuva_invalidated(struct drm_gpuva *va)
> +{
> + return va->flags & DRM_GPUVA_INVALIDATED;
> +}
> +
> +#ifdef CONFIG_LOCKDEP
> +typedef struct lockdep_map *lockdep_map_p;
> +#define drm_gpuva_manager_ext_assert_held(mgr) \
> + lockdep_assert(lock_is_held((mgr)->ext_lock) != LOCK_STATE_NOT_HELD)
> +/**
> + * drm_gpuva_manager_set_ext_lock - set the external lock according to
> + * @DRM_GPUVA_MANAGER_LOCK_EXTERN
> + * @mgr: the &drm_gpuva_manager to set the lock for
> + * @lock: the lock to set
> + *
> + * If @DRM_GPUVA_MANAGER_LOCK_EXTERN is set, drivers need to call this function
> + * to provide the lock used to lock linking and unlinking of &drm_gpuvas to the
> + * &drm_gem_objects GPUVA list.
> + */
> +#define drm_gpuva_manager_set_ext_lock(mgr, lock) \
> + (mgr)->ext_lock = &(lock)->dep_map
> +#else
> +typedef struct { /* nothing */ } lockdep_map_p;
> +#define drm_gpuva_manager_ext_assert_held(mgr) do { (void)(mgr); } while (0)
> +#define drm_gpuva_manager_set_ext_lock(mgr, lock) do { } while (0)
> +#endif
> +
> +/**
> + * enum drm_gpuva_manager_flags - the feature flags for the &drm_gpuva_manager
> + */
> +enum drm_gpuva_manager_flags {
> + /**
> + * @DRM_GPUVA_MANAGER_LOCK_EXTERN:
> + *
> + * Indicates the driver has it's own external lock for linking and
> + * unlinking &drm_gpuvas to the &drm_gem_objects GPUVA list.
> + *
> + * When setting this flag it is rquired to set a lock via
> + * drm_gpuva_set_ext_lock().
> + */
> + DRM_GPUVA_MANAGER_LOCK_EXTERN = (1 << 0),
> +};
> +
> +/**
> + * struct drm_gpuva_manager - DRM GPU VA Manager
> + *
> + * The DRM GPU VA Manager keeps track of a GPU's virtual address space by using
> + * &maple_tree structures. Typically, this structure is embedded in bigger
> + * driver structures.
> + *
> + * Drivers can pass addresses and ranges in an arbitrary unit, e.g. bytes or
> + * pages.
> + *
> + * There should be one manager instance per GPU virtual address space.
> + */
> +struct drm_gpuva_manager {
> + /**
> + * @name: the name of the DRM GPU VA space
> + */
> + const char *name;
> +
> + /**
> + * @flags: the feature flags of the &drm_gpuva_manager
> + */
> + enum drm_gpuva_manager_flags flags;
> +
> + /**
> + * @mm_start: start of the VA space
> + */
> + u64 mm_start;
> +
> + /**
> + * @mm_range: length of the VA space
> + */
> + u64 mm_range;
> +
> + /**
> + * @rb: structures to track &drm_gpuva entries
> + */
> + struct {
> + /**
> + * @tree: the rb-tree to track GPU VA mappings
> + */
> + struct rb_root_cached tree;
> +
> + /**
> + * @list: the &list_head to track GPU VA mappings
> + */
> + struct list_head list;
> + } rb;
> +
> + /**
> + * @kernel_alloc_node:
> + *
> + * &drm_gpuva representing the address space cutout reserved for
> + * the kernel
> + */
> + struct drm_gpuva kernel_alloc_node;
> +
> + /**
> + * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> + */
> + const struct drm_gpuva_fn_ops *ops;
> +
> + /**
> + * @ext_lock: &lockdep_map according to @DRM_GPUVA_MANAGER_LOCK_EXTERN
> + */
> + lockdep_map_p ext_lock;
> +};
> +
> +void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> + const char *name,
> + u64 start_offset, u64 range,
> + u64 reserve_offset, u64 reserve_range,
> + const struct drm_gpuva_fn_ops *ops,
> + enum drm_gpuva_manager_flags flags);
> +void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
> +
> +/**
> + * drm_gpuva_manager_external_lock - indicates whether the
> + * @DRM_GPUVA_MANAGER_LOCK_EXTERN flag is set
> + * @mgr: the &drm_gpuva_manager to check the flag for
> + */
> +static inline bool drm_gpuva_manager_external_lock(struct drm_gpuva_manager *mgr)
> +{
> + return mgr->flags & DRM_GPUVA_MANAGER_LOCK_EXTERN;
> +}
> +
> +/**
> + * drm_gpuva_for_each_va_range - iternator to walk over a range of &drm_gpuvas
> + * @va__: &drm_gpuva structure to assign to in each iteration step
> + * @mgr__: &drm_gpuva_manager to walk over
> + * @start__: starting offset, the first gpuva will overlap this
> + * @end__: ending offset, the last gpuva will start before this (but may
> + * overlap)
> + *
> + * This iterator walks over all &drm_gpuvas in the &drm_gpuva_manager that lie
> + * between @start__ and @end__. It is implemented similarly to list_for_each(),
> + * but is using the &drm_gpuva_manager's internal interval tree to accelerate
> + * the search for the starting &drm_gpuva, and hence isn't safe against removal
> + * of elements. It assumes that @end__ is within (or is the upper limit of) the
> + * &drm_gpuva_manager. This iterator does not skip over the &drm_gpuva_manager's
> + * @kernel_alloc_node.
> + */
> +#define drm_gpuva_for_each_va_range(va__, mgr__, start__, end__) \
> + for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__)); \
> + va__ && (va__->va.addr < (end__)) && \
> + !list_entry_is_head(va__, &(mgr__)->rb.list, rb.entry); \
> + va__ = list_next_entry(va__, rb.entry))
> +
> +/**
> + * drm_gpuva_for_each_va_range_safe - iternator to safely walk over a range of
> + * &drm_gpuvas
> + * @va__: &drm_gpuva to assign to in each iteration step
> + * @next__: another &drm_gpuva to use as temporary storage
> + * @mgr__: &drm_gpuva_manager to walk over
> + * @start__: starting offset, the first gpuva will overlap this
> + * @end__: ending offset, the last gpuva will start before this (but may
> + * overlap)
> + *
> + * This iterator walks over all &drm_gpuvas in the &drm_gpuva_manager that lie
> + * between @start__ and @end__. It is implemented similarly to
> + * list_for_each_safe(), but is using the &drm_gpuva_manager's internal interval
> + * tree to accelerate the search for the starting &drm_gpuva, and hence is safe
> + * against removal of elements. It assumes that @end__ is within (or is the
> + * upper limit of) the &drm_gpuva_manager. This iterator does not skip over the
> + * &drm_gpuva_manager's @kernel_alloc_node.
> + */
> +#define drm_gpuva_for_each_va_range_safe(va__, next__, mgr__, start__, end__) \
> + for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__)), \
> + next__ = va ? list_next_entry(va__, rb.entry) : NULL; \
> + va__ && (va__->va.addr < (end__)) && \
> + !list_entry_is_head(va__, &(mgr__)->rb.list, rb.entry); \
> + va__ = next__, next__ = list_next_entry(va__, rb.entry))
> +
> +/**
> + * drm_gpuva_for_each_va - iternator to walk over all &drm_gpuvas
> + * @va__: &drm_gpuva to assign to in each iteration step
> + * @mgr__: &drm_gpuva_manager to walk over
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the given
> + * &drm_gpuva_manager.
> + */
> +#define drm_gpuva_for_each_va(va__, mgr__) \
> + list_for_each_entry(va__, &(mgr__)->rb.list, rb.entry)
> +
> +/**
> + * drm_gpuva_for_each_va_safe - iternator to safely walk over all &drm_gpuvas
> + * @va__: &drm_gpuva to assign to in each iteration step
> + * @next__: another &drm_gpuva to use as temporary storage
> + * @mgr__: &drm_gpuva_manager to walk over
> + *
> + * This iterator walks over all &drm_gpuva structures associated with the given
> + * &drm_gpuva_manager. It is implemented with list_for_each_entry_safe(), and
> + * hence safe against the removal of elements.
> + */
> +#define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
> + list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
> +
> +/**
> + * enum drm_gpuva_op_type - GPU VA operation type
> + *
> + * Operations to alter the GPU VA mappings tracked by the &drm_gpuva_manager.
> + */
> +enum drm_gpuva_op_type {
> + /**
> + * @DRM_GPUVA_OP_MAP: the map op type
> + */
> + DRM_GPUVA_OP_MAP,
> +
> + /**
> + * @DRM_GPUVA_OP_REMAP: the remap op type
> + */
> + DRM_GPUVA_OP_REMAP,
> +
> + /**
> + * @DRM_GPUVA_OP_UNMAP: the unmap op type
> + */
> + DRM_GPUVA_OP_UNMAP,
> +
> + /**
> + * @DRM_GPUVA_OP_PREFETCH: the prefetch op type
> + */
> + DRM_GPUVA_OP_PREFETCH,
> +};
> +
> +/**
> + * struct drm_gpuva_op_map - GPU VA map operation
> + *
> + * This structure represents a single map operation generated by the
> + * DRM GPU VA manager.
> + */
> +struct drm_gpuva_op_map {
> + /**
> + * @va: structure containing address and range of a map
> + * operation
> + */
> + struct {
> + /**
> + * @addr: the base address of the new mapping
> + */
> + u64 addr;
> +
> + /**
> + * @range: the range of the new mapping
> + */
> + u64 range;
> + } va;
> +
> + /**
> + * @gem: structure containing the &drm_gem_object and it's offset
> + */
> + struct {
> + /**
> + * @offset: the offset within the &drm_gem_object
> + */
> + u64 offset;
> +
> + /**
> + * @obj: the &drm_gem_object to map
> + */
> + struct drm_gem_object *obj;
> + } gem;
> +};
> +
> +/**
> + * struct drm_gpuva_op_unmap - GPU VA unmap operation
> + *
> + * This structure represents a single unmap operation generated by the
> + * DRM GPU VA manager.
> + */
> +struct drm_gpuva_op_unmap {
> + /**
> + * @va: the &drm_gpuva to unmap
> + */
> + struct drm_gpuva *va;
> +
> + /**
> + * @keep:
> + *
> + * Indicates whether this &drm_gpuva is physically contiguous with the
> + * original mapping request.
> + *
> + * Optionally, if &keep is set, drivers may keep the actual page table
> + * mappings for this &drm_gpuva, adding the missing page table entries
> + * only and update the &drm_gpuva_manager accordingly.
> + */
> + bool keep;
> +};
> +
> +/**
> + * struct drm_gpuva_op_remap - GPU VA remap operation
> + *
> + * This represents a single remap operation generated by the DRM GPU VA manager.
> + *
> + * A remap operation is generated when an existing GPU VA mmapping is split up
> + * by inserting a new GPU VA mapping or by partially unmapping existent
> + * mapping(s), hence it consists of a maximum of two map and one unmap
> + * operation.
> + *
> + * The @unmap operation takes care of removing the original existing mapping.
> + * @prev is used to remap the preceding part, @next the subsequent part.
> + *
> + * If either a new mapping's start address is aligned with the start address
> + * of the old mapping or the new mapping's end address is aligned with the
> + * end address of the old mapping, either @prev or @next is NULL.
> + *
> + * Note, the reason for a dedicated remap operation, rather than arbitrary
> + * unmap and map operations, is to give drivers the chance of extracting driver
> + * specific data for creating the new mappings from the unmap operations's
> + * &drm_gpuva structure which typically is embedded in larger driver specific
> + * structures.
> + */
> +struct drm_gpuva_op_remap {
> + /**
> + * @prev: the preceding part of a split mapping
> + */
> + struct drm_gpuva_op_map *prev;
> +
> + /**
> + * @next: the subsequent part of a split mapping
> + */
> + struct drm_gpuva_op_map *next;
> +
> + /**
> + * @unmap: the unmap operation for the original existing mapping
> + */
> + struct drm_gpuva_op_unmap *unmap;
> +};
> +
> +/**
> + * struct drm_gpuva_op_prefetch - GPU VA prefetch operation
> + *
> + * This structure represents a single prefetch operation generated by the
> + * DRM GPU VA manager.
> + */
> +struct drm_gpuva_op_prefetch {
> + /**
> + * @va: the &drm_gpuva to prefetch
> + */
> + struct drm_gpuva *va;
> +};
> +
> +/**
> + * struct drm_gpuva_op - GPU VA operation
> + *
> + * This structure represents a single generic operation.
> + *
> + * The particular type of the operation is defined by @op.
> + */
> +struct drm_gpuva_op {
> + /**
> + * @entry:
> + *
> + * The &list_head used to distribute instances of this struct within
> + * &drm_gpuva_ops.
> + */
> + struct list_head entry;
> +
> + /**
> + * @op: the type of the operation
> + */
> + enum drm_gpuva_op_type op;
> +
> + union {
> + /**
> + * @map: the map operation
> + */
> + struct drm_gpuva_op_map map;
> +
> + /**
> + * @remap: the remap operation
> + */
> + struct drm_gpuva_op_remap remap;
> +
> + /**
> + * @unmap: the unmap operation
> + */
> + struct drm_gpuva_op_unmap unmap;
> +
> + /**
> + * @prefetch: the prefetch operation
> + */
> + struct drm_gpuva_op_prefetch prefetch;
> + };
> +};
> +
> +/**
> + * struct drm_gpuva_ops - wraps a list of &drm_gpuva_op
> + */
> +struct drm_gpuva_ops {
> + /**
> + * @list: the &list_head
> + */
> + struct list_head list;
> +};
> +
> +/**
> + * drm_gpuva_for_each_op - iterator to walk over &drm_gpuva_ops
> + * @op: &drm_gpuva_op to assign in each iteration step
> + * @ops: &drm_gpuva_ops to walk
> + *
> + * This iterator walks over all ops within a given list of operations.
> + */
> +#define drm_gpuva_for_each_op(op, ops) list_for_each_entry(op, &(ops)->list, entry)
> +
> +/**
> + * drm_gpuva_for_each_op_safe - iterator to safely walk over &drm_gpuva_ops
> + * @op: &drm_gpuva_op to assign in each iteration step
> + * @next: &next &drm_gpuva_op to store the next step
> + * @ops: &drm_gpuva_ops to walk
> + *
> + * This iterator walks over all ops within a given list of operations. It is
> + * implemented with list_for_each_safe(), so save against removal of elements.
> + */
> +#define drm_gpuva_for_each_op_safe(op, next, ops) \
> + list_for_each_entry_safe(op, next, &(ops)->list, entry)
> +
> +/**
> + * drm_gpuva_for_each_op_from_reverse - iterate backwards from the given point
> + * @op: &drm_gpuva_op to assign in each iteration step
> + * @ops: &drm_gpuva_ops to walk
> + *
> + * This iterator walks over all ops within a given list of operations beginning
> + * from the given operation in reverse order.
> + */
> +#define drm_gpuva_for_each_op_from_reverse(op, ops) \
> + list_for_each_entry_from_reverse(op, &(ops)->list, entry)
> +
> +/**
> + * drm_gpuva_first_op - returns the first &drm_gpuva_op from &drm_gpuva_ops
> + * @ops: the &drm_gpuva_ops to get the fist &drm_gpuva_op from
> + */
> +#define drm_gpuva_first_op(ops) \
> + list_first_entry(&(ops)->list, struct drm_gpuva_op, entry)
> +
> +/**
> + * drm_gpuva_last_op - returns the last &drm_gpuva_op from &drm_gpuva_ops
> + * @ops: the &drm_gpuva_ops to get the last &drm_gpuva_op from
> + */
> +#define drm_gpuva_last_op(ops) \
> + list_last_entry(&(ops)->list, struct drm_gpuva_op, entry)
> +
> +/**
> + * drm_gpuva_prev_op - previous &drm_gpuva_op in the list
> + * @op: the current &drm_gpuva_op
> + */
> +#define drm_gpuva_prev_op(op) list_prev_entry(op, entry)
> +
> +/**
> + * drm_gpuva_next_op - next &drm_gpuva_op in the list
> + * @op: the current &drm_gpuva_op
> + */
> +#define drm_gpuva_next_op(op) list_next_entry(op, entry)
> +
> +struct drm_gpuva_ops *
> +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr,
> + u64 addr, u64 range,
> + struct drm_gem_object *obj, u64 offset);
> +struct drm_gpuva_ops *
> +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr,
> + u64 addr, u64 range);
> +
> +struct drm_gpuva_ops *
> +drm_gpuva_prefetch_ops_create(struct drm_gpuva_manager *mgr,
> + u64 addr, u64 range);
> +
> +struct drm_gpuva_ops *
> +drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> + struct drm_gem_object *obj);
> +
> +void drm_gpuva_ops_free(struct drm_gpuva_manager *mgr,
> + struct drm_gpuva_ops *ops);
> +
> +static inline void drm_gpuva_init_from_op(struct drm_gpuva *va,
> + struct drm_gpuva_op_map *op)
> +{
> + drm_gpuva_init(va, op->va.addr, op->va.range,
> + op->gem.obj, op->gem.offset);
> +}
> +
> +/**
> + * struct drm_gpuva_fn_ops - callbacks for split/merge steps
> + *
> + * This structure defines the callbacks used by &drm_gpuva_sm_map and
> + * &drm_gpuva_sm_unmap to provide the split/merge steps for map and unmap
> + * operations to drivers.
> + */
> +struct drm_gpuva_fn_ops {
> + /**
> + * @op_alloc: called when the &drm_gpuva_manager allocates
> + * a struct drm_gpuva_op
> + *
> + * Some drivers may want to embed struct drm_gpuva_op into driver
> + * specific structures. By implementing this callback drivers can
> + * allocate memory accordingly.
> + *
> + * This callback is optional.
> + */
> + struct drm_gpuva_op *(*op_alloc)(void);
> +
> + /**
> + * @op_free: called when the &drm_gpuva_manager frees a
> + * struct drm_gpuva_op
> + *
> + * Some drivers may want to embed struct drm_gpuva_op into driver
> + * specific structures. By implementing this callback drivers can
> + * free the previously allocated memory accordingly.
> + *
> + * This callback is optional.
> + */
> + void (*op_free)(struct drm_gpuva_op *op);
> +
> + /**
> + * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
> + * mapping once all previous steps were completed
> + *
> + * The &priv pointer matches the one the driver passed to
> + * &drm_gpuva_sm_map or &drm_gpuva_sm_unmap, respectively.
> + *
> + * Can be NULL if &drm_gpuva_sm_map is used.
> + */
> + int (*sm_step_map)(struct drm_gpuva_op *op, void *priv);
> +
> + /**
> + * @sm_step_remap: called from &drm_gpuva_sm_map and
> + * &drm_gpuva_sm_unmap to split up an existent mapping
> + *
> + * This callback is called when existent mapping needs to be split up.
> + * This is the case when either a newly requested mapping overlaps or
> + * is enclosed by an existent mapping or a partial unmap of an existent
> + * mapping is requested.
> + *
> + * The &priv pointer matches the one the driver passed to
> + * &drm_gpuva_sm_map or &drm_gpuva_sm_unmap, respectively.
> + *
> + * Can be NULL if neither &drm_gpuva_sm_map nor &drm_gpuva_sm_unmap is
> + * used.
> + */
> + int (*sm_step_remap)(struct drm_gpuva_op *op, void *priv);
> +
> + /**
> + * @sm_step_unmap: called from &drm_gpuva_sm_map and
> + * &drm_gpuva_sm_unmap to unmap an existent mapping
> + *
> + * This callback is called when existent mapping needs to be unmapped.
> + * This is the case when either a newly requested mapping encloses an
> + * existent mapping or an unmap of an existent mapping is requested.
> + *
> + * The &priv pointer matches the one the driver passed to
> + * &drm_gpuva_sm_map or &drm_gpuva_sm_unmap, respectively.
> + *
> + * Can be NULL if neither &drm_gpuva_sm_map nor &drm_gpuva_sm_unmap is
> + * used.
> + */
> + int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
> +};
> +
> +int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
> + u64 addr, u64 range,
> + struct drm_gem_object *obj, u64 offset);
> +
> +int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
> + u64 addr, u64 range);
> +
> +void drm_gpuva_map(struct drm_gpuva_manager *mgr,
> + struct drm_gpuva *va,
> + struct drm_gpuva_op_map *op);
Missing newline
> +void drm_gpuva_remap(struct drm_gpuva *prev,
> + struct drm_gpuva *next,
> + struct drm_gpuva_op_remap *op);
Missing newline
> +void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
> +
> +#endif /* __DRM_GPUVA_MGR_H__ */

2023-07-06 15:43:51

by Danilo Krummrich

[permalink] [raw]
Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings

Hi Boris,

On 6/30/23 10:02, Boris Brezillon wrote:
> Hi Danilo,
>
> On Fri, 30 Jun 2023 00:25:18 +0200
> Danilo Krummrich <[email protected]> wrote:
>
>> + * int driver_gpuva_remap(struct drm_gpuva_op *op, void *__ctx)
>> + * {
>> + * struct driver_context *ctx = __ctx;
>> + *
>> + * drm_gpuva_remap(ctx->prev_va, ctx->next_va, &op->remap);
>> + *
>> + * drm_gpuva_unlink(op->remap.unmap->va);
>> + * kfree(op->remap.unmap->va);
>> + *
>> + * if (op->remap.prev) {
>> + * drm_gpuva_link(ctx->prev_va);
>
> I ended up switching to dma_resv-based locking for the GEMs and I
> wonder what the locking is supposed to look like in the async-mapping
> case, where we insert/remove the VA nodes in the drm_sched::run_job()
> path.

If you decide to pick the interface where you just call
drm_gpuva_sm_[un]map() and receive a callback for each operation it
takes to fulfill the request, you probably do this because you want to
do everything one shot, updating the VA space, link/unlink GPUVAs
to/from its corresponding backing GEMs, do the actual GPU mappings.

This has a few advantages over generating a list of operations when the
job is submitted. You've pointed out one of them, when you noticed that
with a list of operations one can't sneak in a synchronous job between
already queued up asynchronous jobs.

However, for the asynchronous path it has the limitation that the
dma-resv lock can't be used to link/unlink GPUVAs to/from its
corresponding backing GEMs, since this would happen in the fence
signalling critical path and we're not allowed to hold the dma-resv lock
there. Hence, as we discussed I added the option for drivers to provide
an external lock for that, just to be able to keep some lockdep checks.

>
> What I have right now is something like:
>
> dma_resv_lock(vm->resv);
>
> // split done in drm_gpuva_sm_map(), each iteration
> // of the loop is a call to the driver ->[re,un]map()
> // hook
> for_each_sub_op() {
>
> // Private BOs have their resv field pointing to the
> // VM resv and we take the VM resv lock before calling
> // drm_gpuva_sm_map()
> if (vm->resv != gem->resv)
> dma_resv_lock(gem->resv);
>
> drm_gpuva_[un]link(va);
> gem_[un]pin(gem);
>
> if (vm->resv != gem->resv)
> dma_resv_unlock(gem->resv);
> }
>
> dma_resv_unlock(vm->resv);
>

I'm not sure I get this code right, reading "for_each_sub_op()" and
"drm_gpuva_sm_map()" looks a bit like things are mixed up?

Or do you mean to represent the sum of all callbacks with
"for_each_sub_op()"? In this case I assume this code runs in
drm_sched::run_job() and hence isn't allowed to take the dma-resv lock.

> In practice, I don't expect things to deadlock, because the VM resv is
> not supposed to be taken outside the VM context and the locking order
> is always the same (VM lock first, and then each shared BO
> taken/released independently), but I'm not super thrilled by this
> nested lock, and I'm wondering if we shouldn't have a pass collecting
> locks in a drm_exec context first, and then have
> the operations executed. IOW, something like that:
>
> drm_exec_init(exec, DRM_EXEC_IGNORE_DUPLICATES)
> drm_exec_until_all_locked(exec) {
> // Dummy GEM is the dummy GEM object I use to make the VM
> // participate in the locking without having to teach
> // drm_exec how to deal with raw dma_resv objects.
> ret = drm_exec_lock_obj(exec, vm->dummy_gem);
> drm_exec_retry_on_contention(exec);
> if (ret)
> return ret;
>
> // Could take the form of drm_gpuva_sm_[un]map_acquire_locks()
> // helpers
> for_each_sub_op() {
> ret = drm_exec_lock_obj(exec, gem);
> if (ret)
> return ret;
> }
> }
>
> // each iteration of the loop is a call to the driver
> // ->[re,un]map() hook
> for_each_sub_op() {
> ...
> gem_[un]pin_locked(gem);
> drm_gpuva_[un]link(va);
> ...
> }
>
> drm_exec_fini(exec);

I have a follow-up patch (still WIP) in the queue to generalize dma-resv
handling, fence handling and GEM validation within the GPUVA manager as
optional helper functions:
https://gitlab.freedesktop.org/nouvelles/kernel/-/commit/a5fc29f3b1edbf3f96fb5a21b858ffe00a3f2584

This was suggested by Matt Brost.

- Danilo

>
> Don't know if I got this right, or if I'm just confused again by how
> the drm_gpuva API is supposed to be used.
>
> Regards,
>
> Boris
>
>> + * ctx->prev_va = NULL;
>> + * }
>> + *
>> + * if (op->remap.next) {
>> + * drm_gpuva_link(ctx->next_va);
>> + * ctx->next_va = NULL;
>> + * }
>> + *
>> + * return 0;
>> + * }
>


2023-07-06 16:16:52

by Danilo Krummrich

[permalink] [raw]
Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings

Hi Thomas,

On 7/6/23 10:49, Thomas Hellström (Intel) wrote:
> Hi, Danilo
>
> Some review comments below:
>
> On 6/30/23 00:25, Danilo Krummrich wrote:
>> Add infrastructure to keep track of GPU virtual address (VA) mappings
>> with a decicated VA space manager implementation.
>>
>> New UAPIs, motivated by Vulkan sparse memory bindings graphics drivers
>> start implementing, allow userspace applications to request multiple and
>> arbitrary GPU VA mappings of buffer objects. The DRM GPU VA manager is
>> intended to serve the following purposes in this context.
>>
>> 1) Provide infrastructure to track GPU VA allocations and mappings,
>>     making use of the maple_tree.
>
> It looks like we're not using the maple tree anymore, but rather an
> instantiation of an interval tree.
>
> (Perhaps as a follow-up it makes sense to provide a pre-instantiated
> common u64 version of the interval tree in addition to the unsigned long
> one since it appears to be used in multiple places in graphics drivers).
>
>> 2) Generically connect GPU VA mappings to their backing buffers, in
>>     particular DRM GEM objects.
>>
>> 3) Provide a common implementation to perform more complex mapping
>>     operations on the GPU VA space. In particular splitting and merging
>>     of GPU VA mappings, e.g. for intersecting mapping requests or partial
>>     unmap requests.
>>
>> Tested-by: Donald Robson<[email protected]>
>> Reviewed-by: Boris Brezillon<[email protected]>
>> Suggested-by: Dave Airlie<[email protected]>
>> Signed-off-by: Danilo Krummrich<[email protected]>
>> ---
>>   Documentation/gpu/drm-mm.rst    |   36 +
>>   drivers/gpu/drm/Makefile        |    1 +
>>   drivers/gpu/drm/drm_gem.c       |    3 +
>>   drivers/gpu/drm/drm_gpuva_mgr.c | 1743 +++++++++++++++++++++++++++++++
>>   include/drm/drm_drv.h           |    6 +
>>   include/drm/drm_gem.h           |   52 +
>>   include/drm/drm_gpuva_mgr.h     |  756 ++++++++++++++
>>   7 files changed, 2597 insertions(+)
>>   create mode 100644 drivers/gpu/drm/drm_gpuva_mgr.c
>>   create mode 100644 include/drm/drm_gpuva_mgr.h
>>
>> diff --git a/Documentation/gpu/drm-mm.rst b/Documentation/gpu/drm-mm.rst
>> index a52e6f4117d6..3d5dc9dc1bfe 100644
>> --- a/Documentation/gpu/drm-mm.rst
>> +++ b/Documentation/gpu/drm-mm.rst
>> @@ -466,6 +466,42 @@ DRM MM Range Allocator Function References
>>   .. kernel-doc:: drivers/gpu/drm/drm_mm.c
>>      :export:
>> +DRM GPU VA Manager
>> +==================
>> +
>> +Overview
>> +--------
>> +
>> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
>> +   :doc: Overview
>> +
>> +Split and Merge
>> +---------------
>> +
>> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
>> +   :doc: Split and Merge
>> +
>> +Locking
>> +-------
>> +
>> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
>> +   :doc: Locking
>> +
>> +Examples
>> +--------
>> +
>> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
>> +   :doc: Examples
>> +
>> +DRM GPU VA Manager Function References
>> +--------------------------------------
>> +
>> +.. kernel-doc:: include/drm/drm_gpuva_mgr.h
>> +   :internal:
>> +
>> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
>> +   :export:
>> +
>>   DRM Buddy Allocator
>>   ===================
>> diff --git a/drivers/gpu/drm/Makefile b/drivers/gpu/drm/Makefile
>> index 414855e2a463..6d6c9dec66e8 100644
>> --- a/drivers/gpu/drm/Makefile
>> +++ b/drivers/gpu/drm/Makefile
>> @@ -45,6 +45,7 @@ drm-y := \
>>       drm_vblank.o \
>>       drm_vblank_work.o \
>>       drm_vma_manager.o \
>> +    drm_gpuva_mgr.o \
>>       drm_writeback.o
>>   drm-$(CONFIG_DRM_LEGACY) += \
>>       drm_agpsupport.o \
>> diff --git a/drivers/gpu/drm/drm_gem.c b/drivers/gpu/drm/drm_gem.c
>> index 1a5a2cd0d4ec..cd878ebddbd0 100644
>> --- a/drivers/gpu/drm/drm_gem.c
>> +++ b/drivers/gpu/drm/drm_gem.c
>> @@ -164,6 +164,9 @@ void drm_gem_private_object_init(struct drm_device
>> *dev,
>>       if (!obj->resv)
>>           obj->resv = &obj->_resv;
>> +    if (drm_core_check_feature(dev, DRIVER_GEM_GPUVA))
>> +        drm_gem_gpuva_init(obj);
>> +
>>       drm_vma_node_reset(&obj->vma_node);
>>       INIT_LIST_HEAD(&obj->lru_node);
>>   }
>> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c
>> b/drivers/gpu/drm/drm_gpuva_mgr.c
>> new file mode 100644
>> index 000000000000..4414990c05cc
>> --- /dev/null
>> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
>> @@ -0,0 +1,1743 @@
>> +// SPDX-License-Identifier: GPL-2.0
>> +/*
> SPDX-License is GPL-2.0 but below looks like MIT. Can we use "GPL-2.0 OR
> MIT" or does something restrict it to GPL-only?
>> + * Copyright (c) 2022 Red Hat.
>> + *
>> + * Permission is hereby granted, free of charge, to any person
>> obtaining a
>> + * copy of this software and associated documentation files (the
>> "Software"),
>> + * to deal in the Software without restriction, including without
>> limitation
>> + * the rights to use, copy, modify, merge, publish, distribute,
>> sublicense,
>> + * and/or sell copies of the Software, and to permit persons to whom the
>> + * Software is furnished to do so, subject to the following conditions:
>> + *
>> + * The above copyright notice and this permission notice shall be
>> included in
>> + * all copies or substantial portions of the Software.
>> + *
>> + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
>> EXPRESS OR
>> + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
>> MERCHANTABILITY,
>> + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.  IN NO EVENT
>> SHALL
>> + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM,
>> DAMAGES OR
>> + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE,
>> + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR
>> + * OTHER DEALINGS IN THE SOFTWARE.
>> + *
>> + * Authors:
>> + *     Danilo Krummrich<[email protected]>
>> + *
>> + */
>> +
>> +#include <drm/drm_gpuva_mgr.h>
>> +
>> +#include <linux/interval_tree_generic.h>
>> +#include <linux/mm.h>
>> +
>> +/**
>> + * DOC: Overview
>> + *
>> + * The DRM GPU VA Manager, represented by struct drm_gpuva_manager
>> keeps track
>> + * of a GPU's virtual address (VA) space and manages the
>> corresponding virtual
>> + * mappings represented by &drm_gpuva objects. It also keeps track of
>> the
>> + * mapping's backing &drm_gem_object buffers.
>> + *
>> + * &drm_gem_object buffers maintain a list of &drm_gpuva objects
>> representing
>> + * all existent GPU VA mappings using this &drm_gem_object as backing
>> buffer.
>> + *
>> + * GPU VAs can be flagged as sparse, such that drivers may use GPU
>> VAs to also
>> + * keep track of sparse PTEs in order to support Vulkan 'Sparse
>> Resources'.
>> + *
>> + * The GPU VA manager internally uses a rb-tree to manage the
>> + * &drm_gpuva mappings within a GPU's virtual address space.
>> + *
>> + * The &drm_gpuva_manager contains a special &drm_gpuva representing the
>> + * portion of VA space reserved by the kernel. This node is
>> initialized together
>> + * with the GPU VA manager instance and removed when the GPU VA
>> manager is
>> + * destroyed.
>> + *
>> + * In a typical application drivers would embed struct
>> drm_gpuva_manager and
>> + * struct drm_gpuva within their own driver specific structures,
>> there won't be
>> + * any memory allocations of it's own nor memory allocations of
>> &drm_gpuva
> s/it's/its/
>> + * entries.
>> + *
>> + * The data structures needed to store &drm_gpuvas within the
>> &drm_gpuva_manager
>> + * are contained within struct drm_gpuva already. Hence, for inserting
>> + * &drm_gpuva entries from within dma-fence signalling critical
>> sections it is
>> + * enough to pre-allocate the &drm_gpuva structures.
>> + */
>> +
>> +/**
>> + * DOC: Split and Merge
>> + *
>> + * Besides it's capability to manage and represent a GPU VA space, the
> s/it's/its/
>> + * &drm_gpuva_manager also provides functions to let the
>> &drm_gpuva_manager
>> + * calculate a sequence of operations to satisfy a given map or unmap
>> request.
>> + *
>> + * Therefore the DRM GPU VA manager provides an algorithm
>> implementing splitting
>> + * and merging of existent GPU VA mappings with the ones that are
>> requested to
>> + * be mapped or unmapped. This feature is required by the Vulkan API to
>> + * implement Vulkan 'Sparse Memory Bindings' - drivers UAPIs often
>> refer to this
>> + * as VM BIND.
>> + *
>> + * Drivers can call drm_gpuva_sm_map() to receive a sequence of
>> callbacks
>> + * containing map, unmap and remap operations for a given newly
>> requested
>> + * mapping. The sequence of callbacks represents the set of
>> operations to
>> + * execute in order to integrate the new mapping cleanly into the
>> current state
>> + * of the GPU VA space.
>> + *
>> + * Depending on how the new GPU VA mapping intersects with the
>> existent mappings
>> + * of the GPU VA space the &drm_gpuva_fn_ops callbacks contain an
>> arbitrary
>> + * amount of unmap operations, a maximum of two remap operations and
>> a single
>> + * map operation. The caller might receive no callback at all if no
>> operation is
>> + * required, e.g. if the requested mapping already exists in the
>> exact same way.
>> + *
>> + * The single map operation represents the original map operation
>> requested by
>> + * the caller.
>> + *
>> + * &drm_gpuva_op_unmap contains a 'keep' field, which indicates
>> whether the
>> + * &drm_gpuva to unmap is physically contiguous with the original
>> mapping
>> + * request. Optionally, if 'keep' is set, drivers may keep the actual
>> page table
>> + * entries for this &drm_gpuva, adding the missing page table entries
>> only and
>> + * update the &drm_gpuva_manager's view of things accordingly.
>> + *
>> + * Drivers may do the same optimization, namely delta page table
>> updates, also
>> + * for remap operations. This is possible since &drm_gpuva_op_remap
>> consists of
>> + * one unmap operation and one or two map operations, such that
>> drivers can
>> + * derive the page table update delta accordingly.
>> + *
>> + * Note that there can't be more than two existent mappings to split
>> up, one at
>> + * the beginning and one at the end of the new mapping, hence there is a
>> + * maximum of two remap operations.
>> + *
>> + * Analogous to drm_gpuva_sm_map() drm_gpuva_sm_unmap() uses
>> &drm_gpuva_fn_ops
>> + * to call back into the driver in order to unmap a range of GPU VA
>> space. The
>> + * logic behind this function is way simpler though: For all existent
>> mappings
>> + * enclosed by the given range unmap operations are created. For
>> mappings which
>> + * are only partically located within the given range, remap
>> operations are
>> + * created such that those mappings are split up and re-mapped
>> partically.
>> + *
>> + * As an alternative to drm_gpuva_sm_map() and drm_gpuva_sm_unmap(),
>> + * drm_gpuva_sm_map_ops_create() and drm_gpuva_sm_unmap_ops_create()
>> can be used
>> + * to directly obtain an instance of struct drm_gpuva_ops containing
>> a list of
>> + * &drm_gpuva_op, which can be iterated with drm_gpuva_for_each_op().
>> This list
>> + * contains the &drm_gpuva_ops analogous to the callbacks one would
>> receive when
>> + * calling drm_gpuva_sm_map() or drm_gpuva_sm_unmap(). While this way
>> requires
>> + * more memory (to allocate the &drm_gpuva_ops), it provides drivers
>> a way to
>> + * iterate the &drm_gpuva_op multiple times, e.g. once in a context
>> where memory
>> + * allocations are possible (e.g. to allocate GPU page tables) and
>> once in the
>> + * dma-fence signalling critical path.
>> + *
>> + * To update the &drm_gpuva_manager's view of the GPU VA space
>> + * drm_gpuva_insert() and drm_gpuva_remove() may be used. These
>> functions can
>> + * safely be used from &drm_gpuva_fn_ops callbacks originating from
>> + * drm_gpuva_sm_map() or drm_gpuva_sm_unmap(). However, it might be more
>> + * convenient to use the provided helper functions drm_gpuva_map(),
>> + * drm_gpuva_remap() and drm_gpuva_unmap() instead.
>> + *
>> + * The following diagram depicts the basic relationships of existent
>> GPU VA
>> + * mappings, a newly requested mapping and the resulting mappings as
>> implemented
>> + * by drm_gpuva_sm_map() - it doesn't cover any arbitrary
>> combinations of these.
>> + *
>> + * 1) Requested mapping is identical. Replace it, but indicate the
>> backing PTEs
>> + *    could be kept.
>> + *
>> + *    ::
>> + *
>> + *         0     a     1
>> + *    old: |-----------| (bo_offset=n)
>> + *
>> + *         0     a     1
>> + *    req: |-----------| (bo_offset=n)
>> + *
>> + *         0     a     1
>> + *    new: |-----------| (bo_offset=n)
>> + *
>> + *
>> + * 2) Requested mapping is identical, except for the BO offset, hence
>> replace
>> + *    the mapping.
>> + *
>> + *    ::
>> + *
>> + *         0     a     1
>> + *    old: |-----------| (bo_offset=n)
>> + *
>> + *         0     a     1
>> + *    req: |-----------| (bo_offset=m)
>> + *
>> + *         0     a     1
>> + *    new: |-----------| (bo_offset=m)
>> + *
>> + *
>> + * 3) Requested mapping is identical, except for the backing BO,
>> hence replace
>> + *    the mapping.
>> + *
>> + *    ::
>> + *
>> + *         0     a     1
>> + *    old: |-----------| (bo_offset=n)
>> + *
>> + *         0     b     1
>> + *    req: |-----------| (bo_offset=n)
>> + *
>> + *         0     b     1
>> + *    new: |-----------| (bo_offset=n)
>> + *
>> + *
>> + * 4) Existent mapping is a left aligned subset of the requested one,
>> hence
>> + *    replace the existent one.
>> + *
>> + *    ::
>> + *
>> + *         0  a  1
>> + *    old: |-----|       (bo_offset=n)
>> + *
>> + *         0     a     2
>> + *    req: |-----------| (bo_offset=n)
>> + *
>> + *         0     a     2
>> + *    new: |-----------| (bo_offset=n)
>> + *
>> + *    .. note::
>> + *       We expect to see the same result for a request with a
>> different BO
>> + *       and/or non-contiguous BO offset.
>> + *
>> + *
>> + * 5) Requested mapping's range is a left aligned subset of the
>> existent one,
>> + *    but backed by a different BO. Hence, map the requested mapping
>> and split
>> + *    the existent one adjusting it's BO offset.
> Typo: s/it's/its/ above and in multiple places below.
>> + *
>> + *    ::
>> + *
>> + *         0     a     2
>> + *    old: |-----------| (bo_offset=n)
>> + *
>> + *         0  b  1
>> + *    req: |-----|       (bo_offset=n)
>> + *
>> + *         0  b  1  a' 2
>> + *    new: |-----|-----| (b.bo_offset=n, a.bo_offset=n+1)
>> + *
>> + *    .. note::
>> + *       We expect to see the same result for a request with a
>> different BO
>> + *       and/or non-contiguous BO offset.
>> + *
>> + *
>> + * 6) Existent mapping is a superset of the requested mapping. Split
>> it up, but
>> + *    indicate that the backing PTEs could be kept.
>> + *
>> + *    ::
>> + *
>> + *         0     a     2
>> + *    old: |-----------| (bo_offset=n)
>> + *
>> + *         0  a  1
>> + *    req: |-----|       (bo_offset=n)
>> + *
>> + *         0  a  1  a' 2
>> + *    new: |-----|-----| (a.bo_offset=n, a'.bo_offset=n+1)
>> + *
>> + *
>> + * 7) Requested mapping's range is a right aligned subset of the
>> existent one,
>> + *    but backed by a different BO. Hence, map the requested mapping
>> and split
>> + *    the existent one, without adjusting the BO offset.
>> + *
>> + *    ::
>> + *
>> + *         0     a     2
>> + *    old: |-----------| (bo_offset=n)
>> + *
>> + *               1  b  2
>> + *    req:       |-----| (bo_offset=m)
>> + *
>> + *         0  a  1  b  2
>> + *    new: |-----|-----| (a.bo_offset=n,b.bo_offset=m)
>> + *
>> + *
>> + * 8) Existent mapping is a superset of the requested mapping. Split
>> it up, but
>> + *    indicate that the backing PTEs could be kept.
>> + *
>> + *    ::
>> + *
>> + *          0     a     2
>> + *    old: |-----------| (bo_offset=n)
>> + *
>> + *               1  a  2
>> + *    req:       |-----| (bo_offset=n+1)
>> + *
>> + *         0  a' 1  a  2
>> + *    new: |-----|-----| (a'.bo_offset=n, a.bo_offset=n+1)
>> + *
>> + *
>> + * 9) Existent mapping is overlapped at the end by the requested
>> mapping backed
>> + *    by a different BO. Hence, map the requested mapping and split
>> up the
>> + *    existent one, without adjusting the BO offset.
>> + *
>> + *    ::
>> + *
>> + *         0     a     2
>> + *    old: |-----------|       (bo_offset=n)
>> + *
>> + *               1     b     3
>> + *    req:       |-----------| (bo_offset=m)
>> + *
>> + *         0  a  1     b     3
>> + *    new: |-----|-----------| (a.bo_offset=n,b.bo_offset=m)
>> + *
>> + *
>> + * 10) Existent mapping is overlapped by the requested mapping, both
>> having the
>> + *     same backing BO with a contiguous offset. Indicate the backing
>> PTEs of
>> + *     the old mapping could be kept.
>> + *
>> + *     ::
>> + *
>> + *          0     a     2
>> + *     old: |-----------|       (bo_offset=n)
>> + *
>> + *                1     a     3
>> + *     req:       |-----------| (bo_offset=n+1)
>> + *
>> + *          0  a' 1     a     3
>> + *     new: |-----|-----------| (a'.bo_offset=n, a.bo_offset=n+1)
>> + *
>> + *
>> + * 11) Requested mapping's range is a centered subset of the existent
>> one
>> + *     having a different backing BO. Hence, map the requested
>> mapping and split
>> + *     up the existent one in two mappings, adjusting the BO offset
>> of the right
>> + *     one accordingly.
>> + *
>> + *     ::
>> + *
>> + *          0        a        3
>> + *     old: |-----------------| (bo_offset=n)
>> + *
>> + *                1  b  2
>> + *     req:       |-----|       (bo_offset=m)
>> + *
>> + *          0  a  1  b  2  a' 3
>> + *     new: |-----|-----|-----|
>> (a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2)
>> + *
>> + *
>> + * 12) Requested mapping is a contiguous subset of the existent one.
>> Split it
>> + *     up, but indicate that the backing PTEs could be kept.
>> + *
>> + *     ::
>> + *
>> + *          0        a        3
>> + *     old: |-----------------| (bo_offset=n)
>> + *
>> + *                1  a  2
>> + *     req:       |-----|       (bo_offset=n+1)
>> + *
>> + *          0  a' 1  a  2 a'' 3
>> + *     old: |-----|-----|-----| (a'.bo_offset=n, a.bo_offset=n+1,
>> a''.bo_offset=n+2)
>> + *
>> + *
>> + * 13) Existent mapping is a right aligned subset of the requested
>> one, hence
>> + *     replace the existent one.
>> + *
>> + *     ::
>> + *
>> + *                1  a  2
>> + *     old:       |-----| (bo_offset=n+1)
>> + *
>> + *          0     a     2
>> + *     req: |-----------| (bo_offset=n)
>> + *
>> + *          0     a     2
>> + *     new: |-----------| (bo_offset=n)
>> + *
>> + *     .. note::
>> + *        We expect to see the same result for a request with a
>> different bo
>> + *        and/or non-contiguous bo_offset.
>> + *
>> + *
>> + * 14) Existent mapping is a centered subset of the requested one, hence
>> + *     replace the existent one.
>> + *
>> + *     ::
>> + *
>> + *                1  a  2
>> + *     old:       |-----| (bo_offset=n+1)
>> + *
>> + *          0        a       3
>> + *     req: |----------------| (bo_offset=n)
>> + *
>> + *          0        a       3
>> + *     new: |----------------| (bo_offset=n)
>> + *
>> + *     .. note::
>> + *        We expect to see the same result for a request with a
>> different bo
>> + *        and/or non-contiguous bo_offset.
>> + *
>> + *
>> + * 15) Existent mappings is overlapped at the beginning by the
>> requested mapping
>> + *     backed by a different BO. Hence, map the requested mapping and
>> split up
>> + *     the existent one, adjusting it's BO offset accordingly.
>> + *
>> + *     ::
>> + *
>> + *                1     a     3
>> + *     old:       |-----------| (bo_offset=n)
>> + *
>> + *          0     b     2
>> + *     req: |-----------|       (bo_offset=m)
>> + *
>> + *          0     b     2  a' 3
>> + *     new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2)
>> + */
>> +
>> +/**
>> + * DOC: Locking
>> + *
>> + * Generally, the GPU VA manager does not take care of locking
>> itself, it is
>> + * the drivers responsibility to take care about locking. Drivers
>> might want to
>> + * protect the following operations: inserting, removing and iterating
>> + * &drm_gpuva objects as well as generating all kinds of operations,
>> such as
>> + * split / merge or prefetch.
>> + *
>> + * The GPU VA manager also does not take care of the locking of the
>> backing
>> + * &drm_gem_object buffers GPU VA lists by itself; drivers are
>> responsible to
>> + * enforce mutual exclusion using either the GEMs dma_resv lock or
>> alternatively
>> + * a driver specific external lock by setting the
>> @DRM_GPUVA_MANAGER_LOCK_EXTERN
>> + * flag.
>
> Is the external lock used or anticipated by any WIP implementation? If
> not I suggest to leave it out until there is a user.

I think the PowerVR driver requires this. However, it's generally useful
for any driver using direct callbacks rather than drm_gpuva_ops.

Once the page table handling in Nouveau is re-worked, and direct
callbacks can be used, I probably want to use this in Nouveau as well.

Gonna fix up all other comments.

- Danilo

>
>> + *
>> + * For the latter, functions such as drm_gpuva_link() or
>> drm_gpuva_unlink()
>> + * contain lockdep checks to indicate locking issues. For this to
>> work drivers
>> + * must provide (in case the @DRM_GPUVA_MANAGER_LOCK_EXTERN flag is
>> set) their
>> + * external lock with drm_gpuva_manager_set_ext_lock() after
>> initialization.
>> + */
>> +
>> +/**
>> + * DOC: Examples
>> + *
>> + * This section gives two examples on how to let the DRM GPUVA
>> Manager generate
>> + * &drm_gpuva_op in order to satisfy a given map or unmap request and
>> how to
>> + * make use of them.
>> + *
>> + * The below code is strictly limited to illustrate the generic usage
>> pattern.
>> + * To maintain simplicitly, it doesn't make use of any abstractions
>> for common
>> + * code, different (asyncronous) stages with fence signalling
>> critical paths,
>> + * any other helpers or error handling in terms of freeing memory and
>> dropping
>> + * previously taken locks.
>> + *
>> + * 1) Obtain a list of &drm_gpuva_op to create a new mapping::
>> + *
>> + *    // Allocates a new &drm_gpuva.
>> + *    struct drm_gpuva * driver_gpuva_alloc(void);
>> + *
>> + *    // Typically drivers would embedd the &drm_gpuva_manager and
>> &drm_gpuva
>> + *    // structure in individual driver structures and lock the
>> dma-resv with
>> + *    // drm_exec or similar helpers.
>> + *    int driver_mapping_create(struct drm_gpuva_manager *mgr,
>> + *                  u64 addr, u64 range,
>> + *                  struct drm_gem_object *obj, u64 offset)
>> + *    {
>> + *        struct drm_gpuva_ops *ops;
>> + *        struct drm_gpuva_op *op
>> + *
>> + *        driver_lock_va_space();
>> + *        ops = drm_gpuva_sm_map_ops_create(mgr, addr, range,
>> + *                          obj, offset);
>> + *        if (IS_ERR(ops))
>> + *            return PTR_ERR(ops);
>> + *
>> + *        drm_gpuva_for_each_op(op, ops) {
>> + *            struct drm_gpuva *va;
>> + *
>> + *            switch (op->op) {
>> + *            case DRM_GPUVA_OP_MAP:
>> + *                va = driver_gpuva_alloc();
>> + *                if (!va)
>> + *                    ; // unwind previous VA space updates,
>> + *                      // free memory and unlock
>> + *
>> + *                driver_vm_map();
>> + *                drm_gpuva_map(mgr, va, &op->map);
>> + *                drm_gpuva_link(va);
>> + *
>> + *                break;
>> + *            case DRM_GPUVA_OP_REMAP: {
>> + *                struct drm_gpuva *prev = NULL, *next = NULL;
>> + *
>> + *                va = op->remap.unmap->va;
>> + *
>> + *                if (op->remap.prev) {
>> + *                    prev = driver_gpuva_alloc();
>> + *                    if (!prev)
>> + *                        ; // unwind previous VA space
>> + *                          // updates, free memory and
>> + *                          // unlock
>> + *                }
>> + *
>> + *                if (op->remap.next) {
>> + *                    next = driver_gpuva_alloc();
>> + *                    if (!next)
>> + *                        ; // unwind previous VA space
>> + *                          // updates, free memory and
>> + *                          // unlock
>> + *                }
>> + *
>> + *                driver_vm_remap();
>> + *                drm_gpuva_remap(prev, next, &op->remap);
>> + *
>> + *                drm_gpuva_unlink(va);
>> + *                if (prev)
>> + *                    drm_gpuva_link(prev);
>> + *                if (next)
>> + *                    drm_gpuva_link(next);
>> + *
>> + *                break;
>> + *            }
>> + *            case DRM_GPUVA_OP_UNMAP:
>> + *                va = op->unmap->va;
>> + *
>> + *                driver_vm_unmap();
>> + *                drm_gpuva_unlink(va);
>> + *                drm_gpuva_unmap(&op->unmap);
>> + *
>> + *                break;
>> + *            default:
>> + *                break;
>> + *            }
>> + *        }
>> + *        driver_unlock_va_space();
>> + *
>> + *        return 0;
>> + *    }
>> + *
>> + * 2) Receive a callback for each &drm_gpuva_op to create a new
>> mapping::
>> + *
>> + *    struct driver_context {
>> + *        struct drm_gpuva_manager *mgr;
>> + *        struct drm_gpuva *new_va;
>> + *        struct drm_gpuva *prev_va;
>> + *        struct drm_gpuva *next_va;
>> + *    };
>> + *
>> + *    // ops to pass to drm_gpuva_manager_init()
>> + *    static const struct drm_gpuva_fn_ops driver_gpuva_ops = {
>> + *        .sm_step_map = driver_gpuva_map,
>> + *        .sm_step_remap = driver_gpuva_remap,
>> + *        .sm_step_unmap = driver_gpuva_unmap,
>> + *    };
>> + *
>> + *    // Typically drivers would embedd the &drm_gpuva_manager and
>> &drm_gpuva
>> + *    // structure in individual driver structures and lock the
>> dma-resv with
>> + *    // drm_exec or similar helpers.
>> + *    int driver_mapping_create(struct drm_gpuva_manager *mgr,
>> + *                  u64 addr, u64 range,
>> + *                  struct drm_gem_object *obj, u64 offset)
>> + *    {
>> + *        struct driver_context ctx;
>> + *        struct drm_gpuva_ops *ops;
>> + *        struct drm_gpuva_op *op;
>> + *        int ret = 0;
>> + *
>> + *        ctx.mgr = mgr;
>> + *
>> + *        ctx.new_va = kzalloc(sizeof(*ctx.new_va), GFP_KERNEL);
>> + *        ctx.prev_va = kzalloc(sizeof(*ctx.prev_va), GFP_KERNEL);
>> + *        ctx.next_va = kzalloc(sizeof(*ctx.next_va), GFP_KERNEL);
>> + *        if (!ctx.new_va || !ctx.prev_va || !ctx.next_va) {
>> + *            ret = -ENOMEM;
>> + *            goto out;
>> + *        }
>> + *
>> + *        driver_lock_va_space();
>> + *        ret = drm_gpuva_sm_map(mgr, &ctx, addr, range, obj, offset);
>> + *        driver_unlock_va_space();
>> + *
>> + *    out:
>> + *        kfree(ctx.new_va);
>> + *        kfree(ctx.prev_va);
>> + *        kfree(ctx.next_va);
>> + *        return ret;
>> + *    }
>> + *
>> + *    int driver_gpuva_map(struct drm_gpuva_op *op, void *__ctx)
>> + *    {
>> + *        struct driver_context *ctx = __ctx;
>> + *
>> + *        drm_gpuva_map(ctx->mgr, ctx->new_va, &op->map);
>> + *
>> + *        drm_gpuva_link(ctx->new_va);
>> + *
>> + *        // prevent the new GPUVA from being freed in
>> + *        // driver_mapping_create()
>> + *        ctx->new_va = NULL;
>> + *
>> + *        return 0;
>> + *    }
>> + *
>> + *    int driver_gpuva_remap(struct drm_gpuva_op *op, void *__ctx)
>> + *    {
>> + *        struct driver_context *ctx = __ctx;
>> + *
>> + *        drm_gpuva_remap(ctx->prev_va, ctx->next_va, &op->remap);
>> + *
>> + *        drm_gpuva_unlink(op->remap.unmap->va);
>> + *        kfree(op->remap.unmap->va);
>> + *
>> + *        if (op->remap.prev) {
>> + *            drm_gpuva_link(ctx->prev_va);
>> + *            ctx->prev_va = NULL;
>> + *        }
>> + *
>> + *        if (op->remap.next) {
>> + *            drm_gpuva_link(ctx->next_va);
>> + *            ctx->next_va = NULL;
>> + *        }
>> + *
>> + *        return 0;
>> + *    }
>> + *
>> + *    int driver_gpuva_unmap(struct drm_gpuva_op *op, void *__ctx)
>> + *    {
>> + *        drm_gpuva_unlink(op->unmap.va);
>> + *        drm_gpuva_unmap(&op->unmap);
>> + *        kfree(op->unmap.va);
>> + *
>> + *        return 0;
>> + *    }
>> + */
>> +
>> +#define to_drm_gpuva(__node)    container_of((__node), struct
>> drm_gpuva, rb.node)
>> +
>> +#define GPUVA_START(node) ((node)->va.addr)
>> +#define GPUVA_LAST(node) ((node)->va.addr + (node)->va.range - 1)
>> +
>> +/* We do not actually use drm_gpuva_it_next(), tell the compiler to
>> not complain
>> + * about this.
>> + */
>> +INTERVAL_TREE_DEFINE(struct drm_gpuva, rb.node, u64, rb.__subtree_last,
>> +             GPUVA_START, GPUVA_LAST, static __attribute__((unused)),
>
> Would  s/__attribute__((unused))/__maybe_unused/ work here?
>
>> +             drm_gpuva_it)
>> +
>> +static int __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
>> +                  struct drm_gpuva *va);
>> +static void __drm_gpuva_remove(struct drm_gpuva *va);
>> +
>> +static inline bool
> "static inline" is typically used only in header files, since the
> compiler should be smart enough to inline if beneficial. Prefer
> "static". Also in multiple places below.
>> +drm_gpuva_check_overflow(u64 addr, u64 range)
>> +{
>> +    u64 end;
>> +
>> +    return WARN(check_add_overflow(addr, range, &end),
>> +            "GPUVA address limited to %lu bytes.\n", sizeof(end));
>> +}
>> +
>> +static inline bool
>> +drm_gpuva_in_mm_range(struct drm_gpuva_manager *mgr, u64 addr, u64
>> range)
>> +{
>> +    u64 end = addr + range;
>> +    u64 mm_start = mgr->mm_start;
>> +    u64 mm_end = mm_start + mgr->mm_range;
>> +
>> +    return addr >= mm_start && end <= mm_end;
>> +}
>> +
>> +static inline bool
>> +drm_gpuva_in_kernel_node(struct drm_gpuva_manager *mgr, u64 addr, u64
>> range)
>> +{
>> +    u64 end = addr + range;
>> +    u64 kstart = mgr->kernel_alloc_node.va.addr;
>> +    u64 krange = mgr->kernel_alloc_node.va.range;
>> +    u64 kend = kstart + krange;
>> +
>> +    return krange && addr < kend && kstart < end;
>> +}
>> +
>> +static inline bool
>> +drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>> +              u64 addr, u64 range)
>> +{
>> +
>> +    return !drm_gpuva_check_overflow(addr, range) &&
>> +           drm_gpuva_in_mm_range(mgr, addr, range) &&
>> +           !drm_gpuva_in_kernel_node(mgr, addr, range);
>> +}
>> +
>> +/**
>> + * drm_gpuva_manager_init - initialize a &drm_gpuva_manager
> Function kerneldoc names should end with "()": drm_gpuva_manager_init().
> General comment across the patch.
>> + * @mgr: pointer to the &drm_gpuva_manager to initialize
>> + * @name: the name of the GPU VA space
>> + * @start_offset: the start offset of the GPU VA space
>> + * @range: the size of the GPU VA space
>> + * @reserve_offset: the start of the kernel reserved GPU VA area
>> + * @reserve_range: the size of the kernel reserved GPU VA area
>> + * @ops: &drm_gpuva_fn_ops called on &drm_gpuva_sm_map /
>> &drm_gpuva_sm_unmap
>> + * @flags: the feature flags for the &drm_gpuva_manager
>> + *
>> + * The &drm_gpuva_manager must be initialized with this function
>> before use.
>> + *
>> + * Note that @mgr must be cleared to 0 before calling this function.
>> The given
>> + * &name is expected to be managed by the surrounding driver structures.
>> + */
>> +void
>> +drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>> +               const char *name,
>> +               u64 start_offset, u64 range,
>> +               u64 reserve_offset, u64 reserve_range,
>> +               const struct drm_gpuva_fn_ops *ops,
>> +               enum drm_gpuva_manager_flags flags)
>> +{
>> +    mgr->rb.tree = RB_ROOT_CACHED;
>> +    INIT_LIST_HEAD(&mgr->rb.list);
>> +
>> +    drm_gpuva_check_overflow(start_offset, range);
>> +    mgr->mm_start = start_offset;
>> +    mgr->mm_range = range;
>> +
>> +    mgr->name = name ? name : "unknown";
>> +    mgr->flags = flags;
>> +    mgr->ops = ops;
>> +
>> +    memset(&mgr->kernel_alloc_node, 0, sizeof(struct drm_gpuva));
>> +
>> +    if (reserve_range) {
>> +        mgr->kernel_alloc_node.va.addr = reserve_offset;
>> +        mgr->kernel_alloc_node.va.range = reserve_range;
>> +
>> +        if (likely(!drm_gpuva_check_overflow(reserve_offset,
>> +                             reserve_range)))
>> +            __drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
>> +    }
>> +
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_manager_init);
>> +
>> +/**
>> + * drm_gpuva_manager_destroy - cleanup a &drm_gpuva_manager
>> + * @mgr: pointer to the &drm_gpuva_manager to clean up
>> + *
>> + * Note that it is a bug to call this function on a manager that still
>> + * holds GPU VA mappings.
>> + */
>> +void
>> +drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
>> +{
>> +    mgr->name = NULL;
>> +
>> +    if (mgr->kernel_alloc_node.va.range)
>> +        __drm_gpuva_remove(&mgr->kernel_alloc_node);
>> +
>> +    WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
>> +         "GPUVA tree is not empty, potentially leaking memory.");
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_manager_destroy);
>> +
>> +static int
>> +__drm_gpuva_insert(struct drm_gpuva_manager *mgr,
>> +           struct drm_gpuva *va)
>> +{
>> +    struct rb_node *node;
>> +    struct list_head *head;
>> +
>> +    if (drm_gpuva_it_iter_first(&mgr->rb.tree,
>> +                    GPUVA_START(va),
>> +                    GPUVA_LAST(va)))
>> +        return -EEXIST;
>> +
>> +    va->mgr = mgr;
>> +
>> +    drm_gpuva_it_insert(va, &mgr->rb.tree);
>> +
>> +    node = rb_prev(&va->rb.node);
>> +    if (node)
>> +        head = &(to_drm_gpuva(node))->rb.entry;
>> +    else
>> +        head = &mgr->rb.list;
>> +
>> +    list_add(&va->rb.entry, head);
>> +
>> +    return 0;
>> +}
>> +
>> +/**
>> + * drm_gpuva_insert - insert a &drm_gpuva
>> + * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in
>> + * @va: the &drm_gpuva to insert
>> + *
>> + * Insert a &drm_gpuva with a given address and range into a
>> + * &drm_gpuva_manager.
>> + *
>> + * It is safe to use this function using the safe versions of
>> iterating the GPU
>> + * VA space, such as drm_gpuva_for_each_va_safe() and
>> + * drm_gpuva_for_each_va_range_safe().
>> + *
>> + * Returns: 0 on success, negative error code on failure.
>> + */
>> +int
>> +drm_gpuva_insert(struct drm_gpuva_manager *mgr,
>> +         struct drm_gpuva *va)
>> +{
>> +    u64 addr = va->va.addr;
>> +    u64 range = va->va.range;
>> +
>> +    if (unlikely(!drm_gpuva_range_valid(mgr, addr, range)))
>> +        return -EINVAL;
>> +
>> +    return __drm_gpuva_insert(mgr, va);
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_insert);
>> +
>> +static void
>> +__drm_gpuva_remove(struct drm_gpuva *va)
>> +{
>> +    drm_gpuva_it_remove(va, &va->mgr->rb.tree);
>> +    list_del_init(&va->rb.entry);
>> +}
>> +
>> +/**
>> + * drm_gpuva_remove - remove a &drm_gpuva
>> + * @va: the &drm_gpuva to remove
>> + *
>> + * This removes the given &va from the underlaying tree.
>> + *
>> + * It is safe to use this function using the safe versions of
>> iterating the GPU
>> + * VA space, such as drm_gpuva_for_each_va_safe() and
>> + * drm_gpuva_for_each_va_range_safe().
>> + */
>> +void
>> +drm_gpuva_remove(struct drm_gpuva *va)
>> +{
>> +    struct drm_gpuva_manager *mgr = va->mgr;
>> +
>> +    if (unlikely(va == &mgr->kernel_alloc_node)) {
>> +        WARN(1, "Can't destroy kernel reserved node.\n");
>> +        return;
>> +    }
>> +
>> +    __drm_gpuva_remove(va);
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_remove);
>> +
>> +/**
>> + * drm_gpuva_link - link a &drm_gpuva
>> + * @va: the &drm_gpuva to link
>> + *
>> + * This adds the given &va to the GPU VA list of the &drm_gem_object
>> it is
>> + * associated with.
>> + *
>> + * This function expects the caller to protect the GEM's GPUVA list
>> against
>> + * concurrent access using the GEMs dma_resv lock.
>> + */
>> +void
>> +drm_gpuva_link(struct drm_gpuva *va)
>> +{
>> +    struct drm_gpuva_manager *mgr = va->mgr;
>> +    struct drm_gem_object *obj = va->gem.obj;
>> +
>> +    if (unlikely(!obj))
>> +        return;
>> +
>> +    if (drm_gpuva_manager_external_lock(mgr))
>> +        drm_gpuva_manager_ext_assert_held(mgr);
>> +    else
>> +        dma_resv_assert_held(obj->resv);
>> +
>> +    list_add_tail(&va->gem.entry, &obj->gpuva.list);
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_link);
>> +
>> +/**
>> + * drm_gpuva_unlink - unlink a &drm_gpuva
>> + * @va: the &drm_gpuva to unlink
>> + *
>> + * This removes the given &va from the GPU VA list of the
>> &drm_gem_object it is
>> + * associated with.
>> + *
>> + * This function expects the caller to protect the GEM's GPUVA list
>> against
>> + * concurrent access using the GEMs dma_resv lock.
>> + */
>> +void
>> +drm_gpuva_unlink(struct drm_gpuva *va)
>> +{
>> +    struct drm_gpuva_manager *mgr = va->mgr;
>> +    struct drm_gem_object *obj = va->gem.obj;
>> +
>> +    if (unlikely(!obj))
>> +        return;
>> +
>> +    if (drm_gpuva_manager_external_lock(mgr))
>> +        drm_gpuva_manager_ext_assert_held(mgr);
>> +    else
>> +        dma_resv_assert_held(obj->resv);
>> +
>> +    list_del_init(&va->gem.entry);
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_unlink);
>> +
>> +/**
>> + * drm_gpuva_find_first - find the first &drm_gpuva in the given range
>> + * @mgr: the &drm_gpuva_manager to search in
>> + * @addr: the &drm_gpuvas address
>> + * @range: the &drm_gpuvas range
>> + *
>> + * Returns: the first &drm_gpuva within the given range
>> + */
>> +struct drm_gpuva *
>> +drm_gpuva_find_first(struct drm_gpuva_manager *mgr,
>> +             u64 addr, u64 range)
>> +{
>> +    u64 last = addr + range - 1;
>> +
>> +    return drm_gpuva_it_iter_first(&mgr->rb.tree, addr, last);
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_find_first);
>> +
>> +/**
>> + * drm_gpuva_find - find a &drm_gpuva
>> + * @mgr: the &drm_gpuva_manager to search in
>> + * @addr: the &drm_gpuvas address
>> + * @range: the &drm_gpuvas range
>> + *
>> + * Returns: the &drm_gpuva at a given &addr and with a given &range
>> + */
>> +struct drm_gpuva *
>> +drm_gpuva_find(struct drm_gpuva_manager *mgr,
>> +           u64 addr, u64 range)
>> +{
>> +    struct drm_gpuva *va;
>> +
>> +    va = drm_gpuva_find_first(mgr, addr, range);
>> +    if (!va)
>> +        goto out;
>> +
>> +    if (va->va.addr != addr ||
>> +        va->va.range != range)
>> +        goto out;
>> +
>> +    return va;
>> +
>> +out:
>> +    return NULL;
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_find);
>> +
>> +/**
>> + * drm_gpuva_find_prev - find the &drm_gpuva before the given address
>> + * @mgr: the &drm_gpuva_manager to search in
>> + * @start: the given GPU VA's start address
>> + *
>> + * Find the adjacent &drm_gpuva before the GPU VA with given &start
>> address.
>> + *
>> + * Note that if there is any free space between the GPU VA mappings
>> no mapping
>> + * is returned.
>> + *
>> + * Returns: a pointer to the found &drm_gpuva or NULL if none was found
>> + */
>> +struct drm_gpuva *
>> +drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start)
>> +{
>> +    if (!drm_gpuva_range_valid(mgr, start - 1, 1))
>> +        return NULL;
>> +
>> +    return drm_gpuva_it_iter_first(&mgr->rb.tree, start - 1, start);
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_find_prev);
>> +
>> +/**
>> + * drm_gpuva_find_next - find the &drm_gpuva after the given address
>> + * @mgr: the &drm_gpuva_manager to search in
>> + * @end: the given GPU VA's end address
>> + *
>> + * Find the adjacent &drm_gpuva after the GPU VA with given &end
>> address.
>> + *
>> + * Note that if there is any free space between the GPU VA mappings
>> no mapping
>> + * is returned.
>> + *
>> + * Returns: a pointer to the found &drm_gpuva or NULL if none was found
>> + */
>> +struct drm_gpuva *
>> +drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end)
>> +{
>> +    if (!drm_gpuva_range_valid(mgr, end, 1))
>> +        return NULL;
>> +
>> +    return drm_gpuva_it_iter_first(&mgr->rb.tree, end, end + 1);
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_find_next);
>> +
>> +/**
>> + * drm_gpuva_interval_empty - indicate whether a given interval of
>> the VA space
>> + * is empty
>> + * @mgr: the &drm_gpuva_manager to check the range for
>> + * @addr: the start address of the range
>> + * @range: the range of the interval
>> + *
>> + * Returns: true if the interval is empty, false otherwise
>> + */
>> +bool
>> +drm_gpuva_interval_empty(struct drm_gpuva_manager *mgr, u64 addr, u64
>> range)
>> +{
>> +    return !drm_gpuva_find_first(mgr, addr, range);
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_interval_empty);
>> +
>> +/**
>> + * drm_gpuva_map - helper to insert a &drm_gpuva according to a
>> + * &drm_gpuva_op_map
>> + * @mgr: the &drm_gpuva_manager
>> + * @va: the &drm_gpuva to insert
>> + * @op: the &drm_gpuva_op_map to initialize @va with
>> + *
>> + * Initializes the @va from the @op and inserts it into the given @mgr.
>> + */
>> +void
>> +drm_gpuva_map(struct drm_gpuva_manager *mgr,
>> +          struct drm_gpuva *va,
>> +          struct drm_gpuva_op_map *op)
>> +{
>> +    drm_gpuva_init_from_op(va, op);
>> +    drm_gpuva_insert(mgr, va);
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_map);
>> +
>> +/**
>> + * drm_gpuva_remap - helper to remap a &drm_gpuva according to a
>> + * &drm_gpuva_op_remap
>> + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
>> + * @next: the &drm_gpuva to remap when keeping the end of a mapping
>> + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
>> + *
>> + * Removes the currently mapped &drm_gpuva and remaps it using @prev
>> and/or
>> + * @next.
>> + */
>> +void
>> +drm_gpuva_remap(struct drm_gpuva *prev,
>> +        struct drm_gpuva *next,
>> +        struct drm_gpuva_op_remap *op)
>> +{
>> +    struct drm_gpuva *curr = op->unmap->va;
>> +    struct drm_gpuva_manager *mgr = curr->mgr;
>> +    struct drm_gpuva_op_map *map;
>> +
>> +    drm_gpuva_remove(curr);
>> +
>> +    if ((map = op->prev)) {
>> +        drm_gpuva_init_from_op(prev, map);
>> +        drm_gpuva_insert(mgr, prev);
>> +    }
>> +
>> +    if ((map = op->next)) {
>> +        drm_gpuva_init_from_op(next, map);
>> +        drm_gpuva_insert(mgr, next);
>> +    }
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_remap);
>> +
>> +/**
>> + * drm_gpuva_unmap - helper to remove a &drm_gpuva according to a
>> + * &drm_gpuva_op_unmap
>> + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
>> + *
>> + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap.
>> + */
>> +void
>> +drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
>> +{
>> +    drm_gpuva_remove(op->va);
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_unmap);
>> +
>> +static int
>> +op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
>> +      u64 addr, u64 range,
>> +      struct drm_gem_object *obj, u64 offset)
>> +{
>> +    struct drm_gpuva_op op = {};
>> +
>> +    op.op = DRM_GPUVA_OP_MAP;
>> +    op.map.va.addr = addr;
>> +    op.map.va.range = range;
>> +    op.map.gem.obj = obj;
>> +    op.map.gem.offset = offset;
>> +
>> +    return fn->sm_step_map(&op, priv);
>> +}
>> +
>> +static int
>> +op_remap_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
>> +        struct drm_gpuva_op_map *prev,
>> +        struct drm_gpuva_op_map *next,
>> +        struct drm_gpuva_op_unmap *unmap)
>> +{
>> +    struct drm_gpuva_op op = {};
>> +    struct drm_gpuva_op_remap *r;
>> +
>> +    op.op = DRM_GPUVA_OP_REMAP;
>> +    r = &op.remap;
>> +    r->prev = prev;
>> +    r->next = next;
>> +    r->unmap = unmap;
>> +
>> +    return fn->sm_step_remap(&op, priv);
>> +}
>> +
>> +static int
>> +op_unmap_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
>> +        struct drm_gpuva *va, bool merge)
>> +{
>> +    struct drm_gpuva_op op = {};
>> +
>> +    op.op = DRM_GPUVA_OP_UNMAP;
>> +    op.unmap.va = va;
>> +    op.unmap.keep = merge;
>> +
>> +    return fn->sm_step_unmap(&op, priv);
>> +}
>> +
>> +static int
>> +__drm_gpuva_sm_map(struct drm_gpuva_manager *mgr,
>> +           const struct drm_gpuva_fn_ops *ops, void *priv,
>> +           u64 req_addr, u64 req_range,
>> +           struct drm_gem_object *req_obj, u64 req_offset)
>> +{
>> +    struct drm_gpuva *va, *next, *prev = NULL;
>> +    u64 req_end = req_addr + req_range;
>> +    int ret;
>> +
>> +    if (unlikely(!drm_gpuva_range_valid(mgr, req_addr, req_range)))
>> +        return -EINVAL;
>> +
>> +    drm_gpuva_for_each_va_range_safe(va, next, mgr, req_addr, req_end) {
>> +        struct drm_gem_object *obj = va->gem.obj;
>> +        u64 offset = va->gem.offset;
>> +        u64 addr = va->va.addr;
>> +        u64 range = va->va.range;
>> +        u64 end = addr + range;
>> +        bool merge = !!va->gem.obj;
>> +
>> +        if (addr == req_addr) {
>> +            merge &= obj == req_obj &&
>> +                 offset == req_offset;
>> +
>> +            if (end == req_end) {
>> +                ret = op_unmap_cb(ops, priv, va, merge);
>> +                if (ret)
>> +                    return ret;
>> +                break;
>> +            }
>> +
>> +            if (end < req_end) {
>> +                ret = op_unmap_cb(ops, priv, va, merge);
>> +                if (ret)
>> +                    return ret;
>> +                goto next;
>> +            }
>> +
>> +            if (end > req_end) {
>> +                struct drm_gpuva_op_map n = {
>> +                    .va.addr = req_end,
>> +                    .va.range = range - req_range,
>> +                    .gem.obj = obj,
>> +                    .gem.offset = offset + req_range,
>> +                };
>> +                struct drm_gpuva_op_unmap u = {
>> +                    .va = va,
>> +                    .keep = merge,
>> +                };
>> +
>> +                ret = op_remap_cb(ops, priv, NULL, &n, &u);
>> +                if (ret)
>> +                    return ret;
>> +                break;
>> +            }
>> +        } else if (addr < req_addr) {
>> +            u64 ls_range = req_addr - addr;
>> +            struct drm_gpuva_op_map p = {
>> +                .va.addr = addr,
>> +                .va.range = ls_range,
>> +                .gem.obj = obj,
>> +                .gem.offset = offset,
>> +            };
>> +            struct drm_gpuva_op_unmap u = { .va = va };
>> +
>> +            merge &= obj == req_obj &&
>> +                 offset + ls_range == req_offset;
>> +            u.keep = merge;
>> +
>> +            if (end == req_end) {
>> +                ret = op_remap_cb(ops, priv, &p, NULL, &u);
>> +                if (ret)
>> +                    return ret;
>> +                break;
>> +            }
>> +
>> +            if (end < req_end) {
>> +                ret = op_remap_cb(ops, priv, &p, NULL, &u);
>> +                if (ret)
>> +                    return ret;
>> +                goto next;
>> +            }
>> +
>> +            if (end > req_end) {
>> +                struct drm_gpuva_op_map n = {
>> +                    .va.addr = req_end,
>> +                    .va.range = end - req_end,
>> +                    .gem.obj = obj,
>> +                    .gem.offset = offset + ls_range +
>> +                              req_range,
>> +                };
>> +
>> +                ret = op_remap_cb(ops, priv, &p, &n, &u);
>> +                if (ret)
>> +                    return ret;
>> +                break;
>> +            }
>> +        } else if (addr > req_addr) {
>> +            merge &= obj == req_obj &&
>> +                 offset == req_offset +
>> +                       (addr - req_addr);
>> +
>> +            if (end == req_end) {
>> +                ret = op_unmap_cb(ops, priv, va, merge);
>> +                if (ret)
>> +                    return ret;
>> +                break;
>> +            }
>> +
>> +            if (end < req_end) {
>> +                ret = op_unmap_cb(ops, priv, va, merge);
>> +                if (ret)
>> +                    return ret;
>> +                goto next;
>> +            }
>> +
>> +            if (end > req_end) {
>> +                struct drm_gpuva_op_map n = {
>> +                    .va.addr = req_end,
>> +                    .va.range = end - req_end,
>> +                    .gem.obj = obj,
>> +                    .gem.offset = offset + req_end - addr,
>> +                };
>> +                struct drm_gpuva_op_unmap u = {
>> +                    .va = va,
>> +                    .keep = merge,
>> +                };
>> +
>> +                ret = op_remap_cb(ops, priv, NULL, &n, &u);
>> +                if (ret)
>> +                    return ret;
>> +                break;
>> +            }
>> +        }
>> +next:
>> +        prev = va;
>> +    }
>> +
>> +    return op_map_cb(ops, priv,
>> +             req_addr, req_range,
>> +             req_obj, req_offset);
>> +}
>> +
>> +static int
>> +__drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr,
>> +             const struct drm_gpuva_fn_ops *ops, void *priv,
>> +             u64 req_addr, u64 req_range)
>> +{
>> +    struct drm_gpuva *va, *next;
>> +    u64 req_end = req_addr + req_range;
>> +    int ret;
>> +
>> +    if (unlikely(!drm_gpuva_range_valid(mgr, req_addr, req_range)))
>> +        return -EINVAL;
>> +
>> +    drm_gpuva_for_each_va_range_safe(va, next, mgr, req_addr, req_end) {
>> +        struct drm_gpuva_op_map prev = {}, next = {};
>> +        bool prev_split = false, next_split = false;
>> +        struct drm_gem_object *obj = va->gem.obj;
>> +        u64 offset = va->gem.offset;
>> +        u64 addr = va->va.addr;
>> +        u64 range = va->va.range;
>> +        u64 end = addr + range;
>> +
>> +        if (addr < req_addr) {
>> +            prev.va.addr = addr;
>> +            prev.va.range = req_addr - addr;
>> +            prev.gem.obj = obj;
>> +            prev.gem.offset = offset;
>> +
>> +            prev_split = true;
>> +        }
>> +
>> +        if (end > req_end) {
>> +            next.va.addr = req_end;
>> +            next.va.range = end - req_end;
>> +            next.gem.obj = obj;
>> +            next.gem.offset = offset + (req_end - addr);
>> +
>> +            next_split = true;
>> +        }
>> +
>> +        if (prev_split || next_split) {
>> +            struct drm_gpuva_op_unmap unmap = { .va = va };
>> +
>> +            ret = op_remap_cb(ops, priv,
>> +                      prev_split ? &prev : NULL,
>> +                      next_split ? &next : NULL,
>> +                      &unmap);
>> +            if (ret)
>> +                return ret;
>> +        } else {
>> +            ret = op_unmap_cb(ops, priv, va, false);
>> +            if (ret)
>> +                return ret;
>> +        }
>> +    }
>> +
>> +    return 0;
>> +}
>> +
>> +/**
>> + * drm_gpuva_sm_map - creates the &drm_gpuva_op split/merge steps
>> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
>> + * @req_addr: the start address of the new mapping
>> + * @req_range: the range of the new mapping
>> + * @req_obj: the &drm_gem_object to map
>> + * @req_offset: the offset within the &drm_gem_object
>> + * @priv: pointer to a driver private data structure
>> + *
>> + * This function iterates the given range of the GPU VA space. It
>> utilizes the
>> + * &drm_gpuva_fn_ops to call back into the driver providing the split
>> and merge
>> + * steps.
>> + *
>> + * Drivers may use these callbacks to update the GPU VA space right
>> away within
>> + * the callback. In case the driver decides to copy and store the
>> operations for
>> + * later processing neither this function nor &drm_gpuva_sm_unmap is
>> allowed to
>> + * be called before the &drm_gpuva_manager's view of the GPU VA space
>> was
>> + * updated with the previous set of operations. To update the
>> + * &drm_gpuva_manager's view of the GPU VA space drm_gpuva_insert(),
>> + * drm_gpuva_destroy_locked() and/or drm_gpuva_destroy_unlocked()
>> should be
>> + * used.
>> + *
>> + * A sequence of callbacks can contain map, unmap and remap
>> operations, but
>> + * the sequence of callbacks might also be empty if no operation is
>> required,
>> + * e.g. if the requested mapping already exists in the exact same way.
>> + *
>> + * There can be an arbitrary amount of unmap operations, a maximum of
>> two remap
>> + * operations and a single map operation. The latter one represents
>> the original
>> + * map operation requested by the caller.
>> + *
>> + * Returns: 0 on success or a negative error code
>> + */
>> +int
>> +drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
>> +         u64 req_addr, u64 req_range,
>> +         struct drm_gem_object *req_obj, u64 req_offset)
>> +{
>> +    const struct drm_gpuva_fn_ops *ops = mgr->ops;
>> +
>> +    if (unlikely(!(ops && ops->sm_step_map &&
>> +               ops->sm_step_remap &&
>> +               ops->sm_step_unmap)))
>> +        return -EINVAL;
>> +
>> +    return __drm_gpuva_sm_map(mgr, ops, priv,
>> +                  req_addr, req_range,
>> +                  req_obj, req_offset);
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_sm_map);
>> +
>> +/**
>> + * drm_gpuva_sm_unmap - creates the &drm_gpuva_ops to split on unmap
>> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
>> + * @priv: pointer to a driver private data structure
>> + * @req_addr: the start address of the range to unmap
>> + * @req_range: the range of the mappings to unmap
>> + *
>> + * This function iterates the given range of the GPU VA space. It
>> utilizes the
>> + * &drm_gpuva_fn_ops to call back into the driver providing the
>> operations to
>> + * unmap and, if required, split existent mappings.
>> + *
>> + * Drivers may use these callbacks to update the GPU VA space right
>> away within
>> + * the callback. In case the driver decides to copy and store the
>> operations for
>> + * later processing neither this function nor &drm_gpuva_sm_map is
>> allowed to be
>> + * called before the &drm_gpuva_manager's view of the GPU VA space
>> was updated
>> + * with the previous set of operations. To update the
>> &drm_gpuva_manager's view
>> + * of the GPU VA space drm_gpuva_insert(), drm_gpuva_destroy_locked()
>> and/or
>> + * drm_gpuva_destroy_unlocked() should be used.
>> + *
>> + * A sequence of callbacks can contain unmap and remap operations,
>> depending on
>> + * whether there are actual overlapping mappings to split.
>> + *
>> + * There can be an arbitrary amount of unmap operations and a maximum
>> of two
>> + * remap operations.
>> + *
>> + * Returns: 0 on success or a negative error code
>> + */
>> +int
>> +drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
>> +           u64 req_addr, u64 req_range)
>> +{
>> +    const struct drm_gpuva_fn_ops *ops = mgr->ops;
>> +
>> +    if (unlikely(!(ops && ops->sm_step_remap &&
>> +               ops->sm_step_unmap)))
>> +        return -EINVAL;
>> +
>> +    return __drm_gpuva_sm_unmap(mgr, ops, priv,
>> +                    req_addr, req_range);
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_sm_unmap);
>> +
>> +static struct drm_gpuva_op *
>> +gpuva_op_alloc(struct drm_gpuva_manager *mgr)
>> +{
>> +    const struct drm_gpuva_fn_ops *fn = mgr->ops;
>> +    struct drm_gpuva_op *op;
>> +
>> +    if (fn && fn->op_alloc)
>> +        op = fn->op_alloc();
>> +    else
>> +        op = kzalloc(sizeof(*op), GFP_KERNEL);
>> +
>> +    if (unlikely(!op))
>> +        return NULL;
>> +
>> +    return op;
>> +}
>> +
>> +static void
>> +gpuva_op_free(struct drm_gpuva_manager *mgr,
>> +          struct drm_gpuva_op *op)
>> +{
>> +    const struct drm_gpuva_fn_ops *fn = mgr->ops;
>> +
>> +    if (fn && fn->op_free)
>> +        fn->op_free(op);
>> +    else
>> +        kfree(op);
>> +}
>> +
>> +static int
>> +drm_gpuva_sm_step(struct drm_gpuva_op *__op,
>> +          void *priv)
>> +{
>> +    struct {
>> +        struct drm_gpuva_manager *mgr;
>> +        struct drm_gpuva_ops *ops;
>> +    } *args = priv;
>> +    struct drm_gpuva_manager *mgr = args->mgr;
>> +    struct drm_gpuva_ops *ops = args->ops;
>> +    struct drm_gpuva_op *op;
>> +
>> +    op = gpuva_op_alloc(mgr);
>> +    if (unlikely(!op))
>> +        goto err;
>> +
>> +    memcpy(op, __op, sizeof(*op));
>> +
>> +    if (op->op == DRM_GPUVA_OP_REMAP) {
>> +        struct drm_gpuva_op_remap *__r = &__op->remap;
>> +        struct drm_gpuva_op_remap *r = &op->remap;
>> +
>> +        r->unmap = kmemdup(__r->unmap, sizeof(*r->unmap),
>> +                   GFP_KERNEL);
>> +        if (unlikely(!r->unmap))
>> +            goto err_free_op;
>> +
>> +        if (__r->prev) {
>> +            r->prev = kmemdup(__r->prev, sizeof(*r->prev),
>> +                      GFP_KERNEL);
>> +            if (unlikely(!r->prev))
>> +                goto err_free_unmap;
>> +        }
>> +
>> +        if (__r->next) {
>> +            r->next = kmemdup(__r->next, sizeof(*r->next),
>> +                      GFP_KERNEL);
>> +            if (unlikely(!r->next))
>> +                goto err_free_prev;
>> +        }
>> +    }
>> +
>> +    list_add_tail(&op->entry, &ops->list);
>> +
>> +    return 0;
>> +
>> +err_free_unmap:
>> +    kfree(op->remap.unmap);
>> +err_free_prev:
>> +    kfree(op->remap.prev);
>> +err_free_op:
>> +    gpuva_op_free(mgr, op);
>> +err:
>> +    return -ENOMEM;
>> +}
>> +
>> +static const struct drm_gpuva_fn_ops gpuva_list_ops = {
>> +    .sm_step_map = drm_gpuva_sm_step,
>> +    .sm_step_remap = drm_gpuva_sm_step,
>> +    .sm_step_unmap = drm_gpuva_sm_step,
>> +};
>> +
>> +/**
>> + * drm_gpuva_sm_map_ops_create - creates the &drm_gpuva_ops to split
>> and merge
>> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
>> + * @req_addr: the start address of the new mapping
>> + * @req_range: the range of the new mapping
>> + * @req_obj: the &drm_gem_object to map
>> + * @req_offset: the offset within the &drm_gem_object
>> + *
>> + * This function creates a list of operations to perform splitting
>> and merging
>> + * of existent mapping(s) with the newly requested one.
>> + *
>> + * The list can be iterated with &drm_gpuva_for_each_op and must be
>> processed
>> + * in the given order. It can contain map, unmap and remap
>> operations, but it
>> + * also can be empty if no operation is required, e.g. if the
>> requested mapping
>> + * already exists is the exact same way.
>> + *
>> + * There can be an arbitrary amount of unmap operations, a maximum of
>> two remap
>> + * operations and a single map operation. The latter one represents
>> the original
>> + * map operation requested by the caller.
>> + *
>> + * Note that before calling this function again with another mapping
>> request it
>> + * is necessary to update the &drm_gpuva_manager's view of the GPU VA
>> space. The
>> + * previously obtained operations must be either processed or
>> abandoned. To
>> + * update the &drm_gpuva_manager's view of the GPU VA space
>> drm_gpuva_insert(),
>> + * drm_gpuva_destroy_locked() and/or drm_gpuva_destroy_unlocked()
>> should be
>> + * used.
>> + *
>> + * After the caller finished processing the returned &drm_gpuva_ops,
>> they must
>> + * be freed with &drm_gpuva_ops_free.
>> + *
>> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on
>> failure
>> + */
>> +struct drm_gpuva_ops *
>> +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr,
>> +                u64 req_addr, u64 req_range,
>> +                struct drm_gem_object *req_obj, u64 req_offset)
>> +{
>> +    struct drm_gpuva_ops *ops;
>> +    struct {
>> +        struct drm_gpuva_manager *mgr;
>> +        struct drm_gpuva_ops *ops;
>> +    } args;
>> +    int ret;
>> +
>> +    ops = kzalloc(sizeof(*ops), GFP_KERNEL);
>> +    if (unlikely(!ops))
>> +        return ERR_PTR(-ENOMEM);
>> +
>> +    INIT_LIST_HEAD(&ops->list);
>> +
>> +    args.mgr = mgr;
>> +    args.ops = ops;
>> +
>> +    ret = __drm_gpuva_sm_map(mgr, &gpuva_list_ops, &args,
>> +                 req_addr, req_range,
>> +                 req_obj, req_offset);
>> +    if (ret)
>> +        goto err_free_ops;
>> +
>> +    return ops;
>> +
>> +err_free_ops:
>> +    drm_gpuva_ops_free(mgr, ops);
>> +    return ERR_PTR(ret);
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_sm_map_ops_create);
>> +
>> +/**
>> + * drm_gpuva_sm_unmap_ops_create - creates the &drm_gpuva_ops to
>> split on unmap
>> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
>> + * @req_addr: the start address of the range to unmap
>> + * @req_range: the range of the mappings to unmap
>> + *
>> + * This function creates a list of operations to perform unmapping
>> and, if
>> + * required, splitting of the mappings overlapping the unmap range.
>> + *
>> + * The list can be iterated with &drm_gpuva_for_each_op and must be
>> processed
>> + * in the given order. It can contain unmap and remap operations,
>> depending on
>> + * whether there are actual overlapping mappings to split.
>> + *
>> + * There can be an arbitrary amount of unmap operations and a maximum
>> of two
>> + * remap operations.
>> + *
>> + * Note that before calling this function again with another range to
>> unmap it
>> + * is necessary to update the &drm_gpuva_manager's view of the GPU VA
>> space. The
>> + * previously obtained operations must be processed or abandoned. To
>> update the
>> + * &drm_gpuva_manager's view of the GPU VA space drm_gpuva_insert(),
>> + * drm_gpuva_destroy_locked() and/or drm_gpuva_destroy_unlocked()
>> should be
>> + * used.
>> + *
>> + * After the caller finished processing the returned &drm_gpuva_ops,
>> they must
>> + * be freed with &drm_gpuva_ops_free.
>> + *
>> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on
>> failure
>> + */
>> +struct drm_gpuva_ops *
>> +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr,
>> +                  u64 req_addr, u64 req_range)
>> +{
>> +    struct drm_gpuva_ops *ops;
>> +    struct {
>> +        struct drm_gpuva_manager *mgr;
>> +        struct drm_gpuva_ops *ops;
>> +    } args;
>> +    int ret;
>> +
>> +    ops = kzalloc(sizeof(*ops), GFP_KERNEL);
>> +    if (unlikely(!ops))
>> +        return ERR_PTR(-ENOMEM);
>> +
>> +    INIT_LIST_HEAD(&ops->list);
>> +
>> +    args.mgr = mgr;
>> +    args.ops = ops;
>> +
>> +    ret = __drm_gpuva_sm_unmap(mgr, &gpuva_list_ops, &args,
>> +                   req_addr, req_range);
>> +    if (ret)
>> +        goto err_free_ops;
>> +
>> +    return ops;
>> +
>> +err_free_ops:
>> +    drm_gpuva_ops_free(mgr, ops);
>> +    return ERR_PTR(ret);
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_sm_unmap_ops_create);
>> +
>> +/**
>> + * drm_gpuva_prefetch_ops_create - creates the &drm_gpuva_ops to
>> prefetch
>> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
>> + * @addr: the start address of the range to prefetch
>> + * @range: the range of the mappings to prefetch
>> + *
>> + * This function creates a list of operations to perform prefetching.
>> + *
>> + * The list can be iterated with &drm_gpuva_for_each_op and must be
>> processed
>> + * in the given order. It can contain prefetch operations.
>> + *
>> + * There can be an arbitrary amount of prefetch operations.
>> + *
>> + * After the caller finished processing the returned &drm_gpuva_ops,
>> they must
>> + * be freed with &drm_gpuva_ops_free.
>> + *
>> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on
>> failure
>> + */
>> +struct drm_gpuva_ops *
>> +drm_gpuva_prefetch_ops_create(struct drm_gpuva_manager *mgr,
>> +                  u64 addr, u64 range)
>> +{
>> +    struct drm_gpuva_ops *ops;
>> +    struct drm_gpuva_op *op;
>> +    struct drm_gpuva *va;
>> +    u64 end = addr + range;
>> +    int ret;
>> +
>> +    ops = kzalloc(sizeof(*ops), GFP_KERNEL);
>> +    if (!ops)
>> +        return ERR_PTR(-ENOMEM);
>> +
>> +    INIT_LIST_HEAD(&ops->list);
>> +
>> +    drm_gpuva_for_each_va_range(va, mgr, addr, end) {
>> +        op = gpuva_op_alloc(mgr);
>> +        if (!op) {
>> +            ret = -ENOMEM;
>> +            goto err_free_ops;
>> +        }
>> +
>> +        op->op = DRM_GPUVA_OP_PREFETCH;
>> +        op->prefetch.va = va;
>> +        list_add_tail(&op->entry, &ops->list);
>> +    }
>> +
>> +    return ops;
>> +
>> +err_free_ops:
>> +    drm_gpuva_ops_free(mgr, ops);
>> +    return ERR_PTR(ret);
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_prefetch_ops_create);
>> +
>> +/**
>> + * drm_gpuva_gem_unmap_ops_create - creates the &drm_gpuva_ops to
>> unmap a GEM
>> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
>> + * @obj: the &drm_gem_object to unmap
>> + *
>> + * This function creates a list of operations to perform unmapping
>> for every
>> + * GPUVA attached to a GEM.
>> + *
>> + * The list can be iterated with &drm_gpuva_for_each_op and consists
>> out of an
>> + * arbitrary amount of unmap operations.
>> + *
>> + * After the caller finished processing the returned &drm_gpuva_ops,
>> they must
>> + * be freed with &drm_gpuva_ops_free.
>> + *
>> + * It is the callers responsibility to protect the GEMs GPUVA list
>> against
>> + * concurrent access using the GEMs dma_resv lock.
>> + *
>> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR on
>> failure
>> + */
>> +struct drm_gpuva_ops *
>> +drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>> +                   struct drm_gem_object *obj)
>> +{
>> +    struct drm_gpuva_ops *ops;
>> +    struct drm_gpuva_op *op;
>> +    struct drm_gpuva *va;
>> +    int ret;
>> +
>> +    if (drm_gpuva_manager_external_lock(mgr))
>> +        drm_gpuva_manager_ext_assert_held(mgr);
>> +    else
>> +        dma_resv_assert_held(obj->resv);
>> +
>> +    ops = kzalloc(sizeof(*ops), GFP_KERNEL);
>> +    if (!ops)
>> +        return ERR_PTR(-ENOMEM);
>> +
>> +    INIT_LIST_HEAD(&ops->list);
>> +
>> +    drm_gem_for_each_gpuva(va, obj) {
>> +        op = gpuva_op_alloc(mgr);
>> +        if (!op) {
>> +            ret = -ENOMEM;
>> +            goto err_free_ops;
>> +        }
>> +
>> +        op->op = DRM_GPUVA_OP_UNMAP;
>> +        op->unmap.va = va;
>> +        list_add_tail(&op->entry, &ops->list);
>> +    }
>> +
>> +    return ops;
>> +
>> +err_free_ops:
>> +    drm_gpuva_ops_free(mgr, ops);
>> +    return ERR_PTR(ret);
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_gem_unmap_ops_create);
>> +
>> +
>> +/**
>> + * drm_gpuva_ops_free - free the given &drm_gpuva_ops
>> + * @mgr: the &drm_gpuva_manager the ops were created for
>> + * @ops: the &drm_gpuva_ops to free
>> + *
>> + * Frees the given &drm_gpuva_ops structure including all the ops
>> associated
>> + * with it.
>> + */
>> +void
>> +drm_gpuva_ops_free(struct drm_gpuva_manager *mgr,
>> +           struct drm_gpuva_ops *ops)
>> +{
>> +    struct drm_gpuva_op *op, *next;
>> +
>> +    drm_gpuva_for_each_op_safe(op, next, ops) {
>> +        list_del(&op->entry);
>> +
>> +        if (op->op == DRM_GPUVA_OP_REMAP) {
>> +            kfree(op->remap.prev);
>> +            kfree(op->remap.next);
>> +            kfree(op->remap.unmap);
>> +        }
>> +
>> +        gpuva_op_free(mgr, op);
>> +    }
>> +
>> +    kfree(ops);
>> +}
>> +EXPORT_SYMBOL(drm_gpuva_ops_free);
>> diff --git a/include/drm/drm_drv.h b/include/drm/drm_drv.h
>> index 89e2706cac56..04dbe223b1a5 100644
>> --- a/include/drm/drm_drv.h
>> +++ b/include/drm/drm_drv.h
>> @@ -104,6 +104,12 @@ enum drm_driver_feature {
>>        * acceleration should be handled by two drivers that are
>> connected using auxiliary bus.
>>        */
>>       DRIVER_COMPUTE_ACCEL            = BIT(7),
>> +    /**
>> +     * @DRIVER_GEM_GPUVA:
>> +     *
>> +     * Driver supports user defined GPU VA bindings for GEM objects.
>> +     */
>> +    DRIVER_GEM_GPUVA        = BIT(8),
>>       /* IMPORTANT: Below are all the legacy flags, add new ones
>> above. */
>> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
>> index bbc721870c13..5ec8148a30ee 100644
>> --- a/include/drm/drm_gem.h
>> +++ b/include/drm/drm_gem.h
>> @@ -36,6 +36,8 @@
>>   #include <linux/kref.h>
>>   #include <linux/dma-resv.h>
>> +#include <linux/list.h>
>> +#include <linux/mutex.h>
>>   #include <drm/drm_vma_manager.h>
>> @@ -379,6 +381,18 @@ struct drm_gem_object {
>>        */
>>       struct dma_resv _resv;
>> +    /**
>> +     * @gpuva:
>> +     *
>> +     * Provides the list of GPU VAs attached to this GEM object.
>> +     *
>> +     * Drivers should lock list accesses with the GEMs &dma_resv lock
>> +     * (&drm_gem_object.resv).
>> +     */
>> +    struct {
>> +        struct list_head list;
>> +    } gpuva;
>> +
>>       /**
>>        * @funcs:
>>        *
>> @@ -526,4 +540,42 @@ unsigned long drm_gem_lru_scan(struct drm_gem_lru
>> *lru,
>>   int drm_gem_evict(struct drm_gem_object *obj);
>> +/**
>> + * drm_gem_gpuva_init - initialize the gpuva list of a GEM object
>> + * @obj: the &drm_gem_object
>> + *
>> + * This initializes the &drm_gem_object's &drm_gpuva list.
>> + *
>> + * Calling this function is only necessary for drivers intending to
>> support the
>> + * &drm_driver_feature DRIVER_GEM_GPUVA.
>> + */
>> +static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
>> +{
>> +    INIT_LIST_HEAD(&obj->gpuva.list);
>> +}
>> +
>> +/**
>> + * drm_gem_for_each_gpuva - iternator to walk over a list of gpuvas
>
> s/iternator/iterator/. (multiple places) Although since "iterator"
> typically refers to an object being iterated over, perhaps
>
> "drm_gem_for_each_gpuva() - iterate over a list of gpuvas", (general
> comment across the patch).
>
>> + * @entry: &drm_gpuva structure to assign to in each iteration step
>> + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with
>> + *
>> + * This iterator walks over all &drm_gpuva structures associated with
>> the
>> + * &drm_gpuva_manager.
>> + */
>> +#define drm_gem_for_each_gpuva(entry__, obj__) \
>> +    list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
>> +
>> +/**
>> + * drm_gem_for_each_gpuva_safe - iternator to safely walk over a list
>> of gpuvas
>> + * @entry: &drm_gpuva structure to assign to in each iteration step
>> + * @next: &next &drm_gpuva to store the next step
>> + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated with
>> + *
>> + * This iterator walks over all &drm_gpuva structures associated with
>> the
>> + * &drm_gem_object. It is implemented with
>> list_for_each_entry_safe(), hence
>> + * it is save against removal of elements.
>> + */
>> +#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
>> +    list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list,
>> gem.entry)
>> +
>>   #endif /* __DRM_GEM_H__ */
>> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
>> new file mode 100644
>> index 000000000000..4f23aaf726dd
>> --- /dev/null
>> +++ b/include/drm/drm_gpuva_mgr.h
>> @@ -0,0 +1,756 @@
>> +/* SPDX-License-Identifier: GPL-2.0 */
>> +
>> +#ifndef __DRM_GPUVA_MGR_H__
>> +#define __DRM_GPUVA_MGR_H__
>> +
>> +/*
>> + * Copyright (c) 2022 Red Hat.
>> + *
>> + * Permission is hereby granted, free of charge, to any person
>> obtaining a
>> + * copy of this software and associated documentation files (the
>> "Software"),
>> + * to deal in the Software without restriction, including without
>> limitation
>> + * the rights to use, copy, modify, merge, publish, distribute,
>> sublicense,
>> + * and/or sell copies of the Software, and to permit persons to whom the
>> + * Software is furnished to do so, subject to the following conditions:
>> + *
>> + * The above copyright notice and this permission notice shall be
>> included in
>> + * all copies or substantial portions of the Software.
>> + *
>> + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
>> EXPRESS OR
>> + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
>> MERCHANTABILITY,
>> + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.  IN NO EVENT
>> SHALL
>> + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM,
>> DAMAGES OR
>> + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE,
>> + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR
>> + * OTHER DEALINGS IN THE SOFTWARE.
>> + */
>> +
>> +#include <linux/list.h>
>> +#include <linux/rbtree.h>
>> +#include <linux/types.h>
>> +
>> +#include <drm/drm_gem.h>
>> +
>> +struct drm_gpuva_manager;
>> +struct drm_gpuva_fn_ops;
>> +
>> +/**
>> + * enum drm_gpuva_flags - flags for struct drm_gpuva
>> + */
>> +enum drm_gpuva_flags {
>> +    /**
>> +     * @DRM_GPUVA_INVALIDATED:
>> +     *
>> +     * Flag indicating that the &drm_gpuva's backing GEM is invalidated.
>> +     */
>> +    DRM_GPUVA_INVALIDATED = (1 << 0),
>> +
>> +    /**
>> +     * @DRM_GPUVA_SPARSE:
>> +     *
>> +     * Flag indicating that the &drm_gpuva is a sparse mapping.
>> +     */
>> +    DRM_GPUVA_SPARSE = (1 << 1),
>> +
>> +    /**
>> +     * @DRM_GPUVA_USERBITS: user defined bits
>> +     */
>> +    DRM_GPUVA_USERBITS = (1 << 2),
>> +};
>> +
>> +/**
>> + * struct drm_gpuva - structure to track a GPU VA mapping
>> + *
>> + * This structure represents a GPU VA mapping and is associated with a
>> + * &drm_gpuva_manager.
>> + *
>> + * Typically, this structure is embedded in bigger driver structures.
>> + */
>> +struct drm_gpuva {
>> +    /**
>> +     * @mgr: the &drm_gpuva_manager this object is associated with
>> +     */
>> +    struct drm_gpuva_manager *mgr;
>> +
>> +    /**
>> +     * @flags: the &drm_gpuva_flags for this mapping
>> +     */
>> +    enum drm_gpuva_flags flags;
>> +
>> +    /**
>> +     * @va: structure containing the address and range of the &drm_gpuva
>> +     */
>> +    struct {
>> +        /**
>> +         * @addr: the start address
>> +         */
>> +        u64 addr;
>> +
>> +        /*
>> +         * @range: the range
>> +         */
>> +        u64 range;
>> +    } va;
>> +
>> +    /**
>> +     * @gem: structure containing the &drm_gem_object and it's offset
>> +     */
>> +    struct {
>> +        /**
>> +         * @offset: the offset within the &drm_gem_object
>> +         */
>> +        u64 offset;
>> +
>> +        /**
>> +         * @obj: the mapped &drm_gem_object
>> +         */
>> +        struct drm_gem_object *obj;
>> +
>> +        /**
>> +         * @entry: the &list_head to attach this object to a
>> &drm_gem_object
>> +         */
>> +        struct list_head entry;
>> +    } gem;
>> +
>> +    /**
>> +     * @rb: structure containing data to store &drm_gpuvas in a rb-tree
>> +     */
>> +    struct {
>> +        /**
>> +         * @rb: the rb-tree node
>> +         */
>> +        struct rb_node node;
>> +
>> +        /**
>> +         * @entry: The &list_head to additionally connect &drm_gpuvas
>> +         * in the same order they appear in the interval tree. This is
>> +         * useful to keep iterating &drm_gpuvas from a start node found
>> +         * through the rb-tree while doing modifications on the rb-tree
>> +         * itself.
>> +         */
>> +        struct list_head entry;
>> +
>> +        /**
>> +         * @__subtree_last: needed by the interval tree, holding
>> last-in-subtree
>> +         */
>> +        u64 __subtree_last;
>> +    } rb;
>> +};
>> +
>> +int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct drm_gpuva
>> *va);
>> +void drm_gpuva_remove(struct drm_gpuva *va);
>> +
>> +void drm_gpuva_link(struct drm_gpuva *va);
>> +void drm_gpuva_unlink(struct drm_gpuva *va);
>> +
>> +struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
>> +                 u64 addr, u64 range);
>> +struct drm_gpuva *drm_gpuva_find_first(struct drm_gpuva_manager *mgr,
>> +                       u64 addr, u64 range);
>> +struct drm_gpuva *drm_gpuva_find_prev(struct drm_gpuva_manager *mgr,
>> u64 start);
>> +struct drm_gpuva *drm_gpuva_find_next(struct drm_gpuva_manager *mgr,
>> u64 end);
>> +
>> +bool drm_gpuva_interval_empty(struct drm_gpuva_manager *mgr, u64
>> addr, u64 range);
>> +
>> +static inline void drm_gpuva_init(struct drm_gpuva *va, u64 addr, u64
>> range,
>> +                  struct drm_gem_object *obj, u64 offset)
>> +{
>> +    va->va.addr = addr;
>> +    va->va.range = range;
>> +    va->gem.obj = obj;
>> +    va->gem.offset = offset;
>> +}
>> +
>> +/**
>> + * drm_gpuva_invalidate - sets whether the backing GEM of this
>> &drm_gpuva is
>> + * invalidated
>> + * @va: the &drm_gpuva to set the invalidate flag for
>> + * @invalidate: indicates whether the &drm_gpuva is invalidated
>> + */
>> +static inline void drm_gpuva_invalidate(struct drm_gpuva *va, bool
>> invalidate)
>> +{
>> +    if (invalidate)
>> +        va->flags |= DRM_GPUVA_INVALIDATED;
>> +    else
>> +        va->flags &= ~DRM_GPUVA_INVALIDATED;
>> +}
>> +
>> +/**
>> + * drm_gpuva_invalidated - indicates whether the backing BO of this
>> &drm_gpuva
>> + * is invalidated
>> + * @va: the &drm_gpuva to check
>> + */
>> +static inline bool drm_gpuva_invalidated(struct drm_gpuva *va)
>> +{
>> +    return va->flags & DRM_GPUVA_INVALIDATED;
>> +}
>> +
>> +#ifdef CONFIG_LOCKDEP
>> +typedef struct lockdep_map *lockdep_map_p;
>> +#define drm_gpuva_manager_ext_assert_held(mgr)        \
>> +    lockdep_assert(lock_is_held((mgr)->ext_lock) != LOCK_STATE_NOT_HELD)
>> +/**
>> + * drm_gpuva_manager_set_ext_lock - set the external lock according to
>> + * @DRM_GPUVA_MANAGER_LOCK_EXTERN
>> + * @mgr: the &drm_gpuva_manager to set the lock for
>> + * @lock: the lock to set
>> + *
>> + * If @DRM_GPUVA_MANAGER_LOCK_EXTERN is set, drivers need to call
>> this function
>> + * to provide the lock used to lock linking and unlinking of
>> &drm_gpuvas to the
>> + * &drm_gem_objects GPUVA list.
>> + */
>> +#define drm_gpuva_manager_set_ext_lock(mgr, lock)    \
>> +    (mgr)->ext_lock = &(lock)->dep_map
>> +#else
>> +typedef struct { /* nothing */ } lockdep_map_p;
>> +#define drm_gpuva_manager_ext_assert_held(mgr)        do {
>> (void)(mgr); } while (0)
>> +#define drm_gpuva_manager_set_ext_lock(mgr, lock)    do { } while (0)
>> +#endif
>> +
>> +/**
>> + * enum drm_gpuva_manager_flags - the feature flags for the
>> &drm_gpuva_manager
>> + */
>> +enum drm_gpuva_manager_flags {
>> +    /**
>> +     * @DRM_GPUVA_MANAGER_LOCK_EXTERN:
>> +     *
>> +     * Indicates the driver has it's own external lock for linking and
>> +     * unlinking &drm_gpuvas to the &drm_gem_objects GPUVA list.
>> +     *
>> +     * When setting this flag it is rquired to set a lock via
>> +     * drm_gpuva_set_ext_lock().
>> +     */
>> +    DRM_GPUVA_MANAGER_LOCK_EXTERN = (1 << 0),
>> +};
>> +
>> +/**
>> + * struct drm_gpuva_manager - DRM GPU VA Manager
>> + *
>> + * The DRM GPU VA Manager keeps track of a GPU's virtual address
>> space by using
>> + * &maple_tree structures. Typically, this structure is embedded in
>> bigger
>> + * driver structures.
>> + *
>> + * Drivers can pass addresses and ranges in an arbitrary unit, e.g.
>> bytes or
>> + * pages.
>> + *
>> + * There should be one manager instance per GPU virtual address space.
>> + */
>> +struct drm_gpuva_manager {
>> +    /**
>> +     * @name: the name of the DRM GPU VA space
>> +     */
>> +    const char *name;
>> +
>> +    /**
>> +     * @flags: the feature flags of the &drm_gpuva_manager
>> +     */
>> +    enum drm_gpuva_manager_flags flags;
>> +
>> +    /**
>> +     * @mm_start: start of the VA space
>> +     */
>> +    u64 mm_start;
>> +
>> +    /**
>> +     * @mm_range: length of the VA space
>> +     */
>> +    u64 mm_range;
>> +
>> +    /**
>> +     * @rb: structures to track &drm_gpuva entries
>> +     */
>> +    struct {
>> +        /**
>> +         * @tree: the rb-tree to track GPU VA mappings
>> +         */
>> +        struct rb_root_cached tree;
>> +
>> +        /**
>> +         * @list: the &list_head to track GPU VA mappings
>> +         */
>> +        struct list_head list;
>> +    } rb;
>> +
>> +    /**
>> +     * @kernel_alloc_node:
>> +     *
>> +     * &drm_gpuva representing the address space cutout reserved for
>> +     * the kernel
>> +     */
>> +    struct drm_gpuva kernel_alloc_node;
>> +
>> +    /**
>> +     * @ops: &drm_gpuva_fn_ops providing the split/merge steps to
>> drivers
>> +     */
>> +    const struct drm_gpuva_fn_ops *ops;
>> +
>> +    /**
>> +     * @ext_lock: &lockdep_map according to
>> @DRM_GPUVA_MANAGER_LOCK_EXTERN
>> +     */
>> +    lockdep_map_p ext_lock;
>> +};
>> +
>> +void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>> +                const char *name,
>> +                u64 start_offset, u64 range,
>> +                u64 reserve_offset, u64 reserve_range,
>> +                const struct drm_gpuva_fn_ops *ops,
>> +                enum drm_gpuva_manager_flags flags);
>> +void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>> +
>> +/**
>> + * drm_gpuva_manager_external_lock - indicates whether the
>> + * @DRM_GPUVA_MANAGER_LOCK_EXTERN flag is set
>> + * @mgr: the &drm_gpuva_manager to check the flag for
>> + */
>> +static inline bool drm_gpuva_manager_external_lock(struct
>> drm_gpuva_manager *mgr)
>> +{
>> +    return mgr->flags & DRM_GPUVA_MANAGER_LOCK_EXTERN;
>> +}
>> +
>> +/**
>> + * drm_gpuva_for_each_va_range - iternator to walk over a range of
>> &drm_gpuvas
>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>> + * @mgr__: &drm_gpuva_manager to walk over
>> + * @start__: starting offset, the first gpuva will overlap this
>> + * @end__: ending offset, the last gpuva will start before this (but may
>> + * overlap)
>> + *
>> + * This iterator walks over all &drm_gpuvas in the &drm_gpuva_manager
>> that lie
>> + * between @start__ and @end__. It is implemented similarly to
>> list_for_each(),
>> + * but is using the &drm_gpuva_manager's internal interval tree to
>> accelerate
>> + * the search for the starting &drm_gpuva, and hence isn't safe
>> against removal
>> + * of elements. It assumes that @end__ is within (or is the upper
>> limit of) the
>> + * &drm_gpuva_manager. This iterator does not skip over the
>> &drm_gpuva_manager's
>> + * @kernel_alloc_node.
>> + */
>> +#define drm_gpuva_for_each_va_range(va__, mgr__, start__, end__) \
>> +    for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__)); \
>> +         va__ && (va__->va.addr < (end__)) && \
>> +         !list_entry_is_head(va__, &(mgr__)->rb.list, rb.entry); \
>> +         va__ = list_next_entry(va__, rb.entry))
>> +
>> +/**
>> + * drm_gpuva_for_each_va_range_safe - iternator to safely walk over a
>> range of
>> + * &drm_gpuvas
>> + * @va__: &drm_gpuva to assign to in each iteration step
>> + * @next__: another &drm_gpuva to use as temporary storage
>> + * @mgr__: &drm_gpuva_manager to walk over
>> + * @start__: starting offset, the first gpuva will overlap this
>> + * @end__: ending offset, the last gpuva will start before this (but may
>> + * overlap)
>> + *
>> + * This iterator walks over all &drm_gpuvas in the &drm_gpuva_manager
>> that lie
>> + * between @start__ and @end__. It is implemented similarly to
>> + * list_for_each_safe(), but is using the &drm_gpuva_manager's
>> internal interval
>> + * tree to accelerate the search for the starting &drm_gpuva, and
>> hence is safe
>> + * against removal of elements. It assumes that @end__ is within (or
>> is the
>> + * upper limit of) the &drm_gpuva_manager. This iterator does not
>> skip over the
>> + * &drm_gpuva_manager's @kernel_alloc_node.
>> + */
>> +#define drm_gpuva_for_each_va_range_safe(va__, next__, mgr__,
>> start__, end__) \
>> +    for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__)), \
>> +         next__ = va ? list_next_entry(va__, rb.entry) : NULL; \
>> +         va__ && (va__->va.addr < (end__)) && \
>> +         !list_entry_is_head(va__, &(mgr__)->rb.list, rb.entry); \
>> +         va__ = next__, next__ = list_next_entry(va__, rb.entry))
>> +
>> +/**
>> + * drm_gpuva_for_each_va - iternator to walk over all &drm_gpuvas
>> + * @va__: &drm_gpuva to assign to in each iteration step
>> + * @mgr__: &drm_gpuva_manager to walk over
>> + *
>> + * This iterator walks over all &drm_gpuva structures associated with
>> the given
>> + * &drm_gpuva_manager.
>> + */
>> +#define drm_gpuva_for_each_va(va__, mgr__) \
>> +    list_for_each_entry(va__, &(mgr__)->rb.list, rb.entry)
>> +
>> +/**
>> + * drm_gpuva_for_each_va_safe - iternator to safely walk over all
>> &drm_gpuvas
>> + * @va__: &drm_gpuva to assign to in each iteration step
>> + * @next__: another &drm_gpuva to use as temporary storage
>> + * @mgr__: &drm_gpuva_manager to walk over
>> + *
>> + * This iterator walks over all &drm_gpuva structures associated with
>> the given
>> + * &drm_gpuva_manager. It is implemented with
>> list_for_each_entry_safe(), and
>> + * hence safe against the removal of elements.
>> + */
>> +#define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
>> +    list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list, rb.entry)
>> +
>> +/**
>> + * enum drm_gpuva_op_type - GPU VA operation type
>> + *
>> + * Operations to alter the GPU VA mappings tracked by the
>> &drm_gpuva_manager.
>> + */
>> +enum drm_gpuva_op_type {
>> +    /**
>> +     * @DRM_GPUVA_OP_MAP: the map op type
>> +     */
>> +    DRM_GPUVA_OP_MAP,
>> +
>> +    /**
>> +     * @DRM_GPUVA_OP_REMAP: the remap op type
>> +     */
>> +    DRM_GPUVA_OP_REMAP,
>> +
>> +    /**
>> +     * @DRM_GPUVA_OP_UNMAP: the unmap op type
>> +     */
>> +    DRM_GPUVA_OP_UNMAP,
>> +
>> +    /**
>> +     * @DRM_GPUVA_OP_PREFETCH: the prefetch op type
>> +     */
>> +    DRM_GPUVA_OP_PREFETCH,
>> +};
>> +
>> +/**
>> + * struct drm_gpuva_op_map - GPU VA map operation
>> + *
>> + * This structure represents a single map operation generated by the
>> + * DRM GPU VA manager.
>> + */
>> +struct drm_gpuva_op_map {
>> +    /**
>> +     * @va: structure containing address and range of a map
>> +     * operation
>> +     */
>> +    struct {
>> +        /**
>> +         * @addr: the base address of the new mapping
>> +         */
>> +        u64 addr;
>> +
>> +        /**
>> +         * @range: the range of the new mapping
>> +         */
>> +        u64 range;
>> +    } va;
>> +
>> +    /**
>> +     * @gem: structure containing the &drm_gem_object and it's offset
>> +     */
>> +    struct {
>> +        /**
>> +         * @offset: the offset within the &drm_gem_object
>> +         */
>> +        u64 offset;
>> +
>> +        /**
>> +         * @obj: the &drm_gem_object to map
>> +         */
>> +        struct drm_gem_object *obj;
>> +    } gem;
>> +};
>> +
>> +/**
>> + * struct drm_gpuva_op_unmap - GPU VA unmap operation
>> + *
>> + * This structure represents a single unmap operation generated by the
>> + * DRM GPU VA manager.
>> + */
>> +struct drm_gpuva_op_unmap {
>> +    /**
>> +     * @va: the &drm_gpuva to unmap
>> +     */
>> +    struct drm_gpuva *va;
>> +
>> +    /**
>> +     * @keep:
>> +     *
>> +     * Indicates whether this &drm_gpuva is physically contiguous
>> with the
>> +     * original mapping request.
>> +     *
>> +     * Optionally, if &keep is set, drivers may keep the actual page
>> table
>> +     * mappings for this &drm_gpuva, adding the missing page table
>> entries
>> +     * only and update the &drm_gpuva_manager accordingly.
>> +     */
>> +    bool keep;
>> +};
>> +
>> +/**
>> + * struct drm_gpuva_op_remap - GPU VA remap operation
>> + *
>> + * This represents a single remap operation generated by the DRM GPU
>> VA manager.
>> + *
>> + * A remap operation is generated when an existing GPU VA mmapping is
>> split up
>> + * by inserting a new GPU VA mapping or by partially unmapping existent
>> + * mapping(s), hence it consists of a maximum of two map and one unmap
>> + * operation.
>> + *
>> + * The @unmap operation takes care of removing the original existing
>> mapping.
>> + * @prev is used to remap the preceding part, @next the subsequent part.
>> + *
>> + * If either a new mapping's start address is aligned with the start
>> address
>> + * of the old mapping or the new mapping's end address is aligned
>> with the
>> + * end address of the old mapping, either @prev or @next is NULL.
>> + *
>> + * Note, the reason for a dedicated remap operation, rather than
>> arbitrary
>> + * unmap and map operations, is to give drivers the chance of
>> extracting driver
>> + * specific data for creating the new mappings from the unmap
>> operations's
>> + * &drm_gpuva structure which typically is embedded in larger driver
>> specific
>> + * structures.
>> + */
>> +struct drm_gpuva_op_remap {
>> +    /**
>> +     * @prev: the preceding part of a split mapping
>> +     */
>> +    struct drm_gpuva_op_map *prev;
>> +
>> +    /**
>> +     * @next: the subsequent part of a split mapping
>> +     */
>> +    struct drm_gpuva_op_map *next;
>> +
>> +    /**
>> +     * @unmap: the unmap operation for the original existing mapping
>> +     */
>> +    struct drm_gpuva_op_unmap *unmap;
>> +};
>> +
>> +/**
>> + * struct drm_gpuva_op_prefetch - GPU VA prefetch operation
>> + *
>> + * This structure represents a single prefetch operation generated by
>> the
>> + * DRM GPU VA manager.
>> + */
>> +struct drm_gpuva_op_prefetch {
>> +    /**
>> +     * @va: the &drm_gpuva to prefetch
>> +     */
>> +    struct drm_gpuva *va;
>> +};
>> +
>> +/**
>> + * struct drm_gpuva_op - GPU VA operation
>> + *
>> + * This structure represents a single generic operation.
>> + *
>> + * The particular type of the operation is defined by @op.
>> + */
>> +struct drm_gpuva_op {
>> +    /**
>> +     * @entry:
>> +     *
>> +     * The &list_head used to distribute instances of this struct within
>> +     * &drm_gpuva_ops.
>> +     */
>> +    struct list_head entry;
>> +
>> +    /**
>> +     * @op: the type of the operation
>> +     */
>> +    enum drm_gpuva_op_type op;
>> +
>> +    union {
>> +        /**
>> +         * @map: the map operation
>> +         */
>> +        struct drm_gpuva_op_map map;
>> +
>> +        /**
>> +         * @remap: the remap operation
>> +         */
>> +        struct drm_gpuva_op_remap remap;
>> +
>> +        /**
>> +         * @unmap: the unmap operation
>> +         */
>> +        struct drm_gpuva_op_unmap unmap;
>> +
>> +        /**
>> +         * @prefetch: the prefetch operation
>> +         */
>> +        struct drm_gpuva_op_prefetch prefetch;
>> +    };
>> +};
>> +
>> +/**
>> + * struct drm_gpuva_ops - wraps a list of &drm_gpuva_op
>> + */
>> +struct drm_gpuva_ops {
>> +    /**
>> +     * @list: the &list_head
>> +     */
>> +    struct list_head list;
>> +};
>> +
>> +/**
>> + * drm_gpuva_for_each_op - iterator to walk over &drm_gpuva_ops
>> + * @op: &drm_gpuva_op to assign in each iteration step
>> + * @ops: &drm_gpuva_ops to walk
>> + *
>> + * This iterator walks over all ops within a given list of operations.
>> + */
>> +#define drm_gpuva_for_each_op(op, ops) list_for_each_entry(op,
>> &(ops)->list, entry)
>> +
>> +/**
>> + * drm_gpuva_for_each_op_safe - iterator to safely walk over
>> &drm_gpuva_ops
>> + * @op: &drm_gpuva_op to assign in each iteration step
>> + * @next: &next &drm_gpuva_op to store the next step
>> + * @ops: &drm_gpuva_ops to walk
>> + *
>> + * This iterator walks over all ops within a given list of
>> operations. It is
>> + * implemented with list_for_each_safe(), so save against removal of
>> elements.
>> + */
>> +#define drm_gpuva_for_each_op_safe(op, next, ops) \
>> +    list_for_each_entry_safe(op, next, &(ops)->list, entry)
>> +
>> +/**
>> + * drm_gpuva_for_each_op_from_reverse - iterate backwards from the
>> given point
>> + * @op: &drm_gpuva_op to assign in each iteration step
>> + * @ops: &drm_gpuva_ops to walk
>> + *
>> + * This iterator walks over all ops within a given list of operations
>> beginning
>> + * from the given operation in reverse order.
>> + */
>> +#define drm_gpuva_for_each_op_from_reverse(op, ops) \
>> +    list_for_each_entry_from_reverse(op, &(ops)->list, entry)
>> +
>> +/**
>> + * drm_gpuva_first_op - returns the first &drm_gpuva_op from
>> &drm_gpuva_ops
>> + * @ops: the &drm_gpuva_ops to get the fist &drm_gpuva_op from
>> + */
>> +#define drm_gpuva_first_op(ops) \
>> +    list_first_entry(&(ops)->list, struct drm_gpuva_op, entry)
>> +
>> +/**
>> + * drm_gpuva_last_op - returns the last &drm_gpuva_op from
>> &drm_gpuva_ops
>> + * @ops: the &drm_gpuva_ops to get the last &drm_gpuva_op from
>> + */
>> +#define drm_gpuva_last_op(ops) \
>> +    list_last_entry(&(ops)->list, struct drm_gpuva_op, entry)
>> +
>> +/**
>> + * drm_gpuva_prev_op - previous &drm_gpuva_op in the list
>> + * @op: the current &drm_gpuva_op
>> + */
>> +#define drm_gpuva_prev_op(op) list_prev_entry(op, entry)
>> +
>> +/**
>> + * drm_gpuva_next_op - next &drm_gpuva_op in the list
>> + * @op: the current &drm_gpuva_op
>> + */
>> +#define drm_gpuva_next_op(op) list_next_entry(op, entry)
>> +
>> +struct drm_gpuva_ops *
>> +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr,
>> +                u64 addr, u64 range,
>> +                struct drm_gem_object *obj, u64 offset);
>> +struct drm_gpuva_ops *
>> +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr,
>> +                  u64 addr, u64 range);
>> +
>> +struct drm_gpuva_ops *
>> +drm_gpuva_prefetch_ops_create(struct drm_gpuva_manager *mgr,
>> +                 u64 addr, u64 range);
>> +
>> +struct drm_gpuva_ops *
>> +drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>> +                   struct drm_gem_object *obj);
>> +
>> +void drm_gpuva_ops_free(struct drm_gpuva_manager *mgr,
>> +            struct drm_gpuva_ops *ops);
>> +
>> +static inline void drm_gpuva_init_from_op(struct drm_gpuva *va,
>> +                      struct drm_gpuva_op_map *op)
>> +{
>> +    drm_gpuva_init(va, op->va.addr, op->va.range,
>> +               op->gem.obj, op->gem.offset);
>> +}
>> +
>> +/**
>> + * struct drm_gpuva_fn_ops - callbacks for split/merge steps
>> + *
>> + * This structure defines the callbacks used by &drm_gpuva_sm_map and
>> + * &drm_gpuva_sm_unmap to provide the split/merge steps for map and
>> unmap
>> + * operations to drivers.
>> + */
>> +struct drm_gpuva_fn_ops {
>> +    /**
>> +     * @op_alloc: called when the &drm_gpuva_manager allocates
>> +     * a struct drm_gpuva_op
>> +     *
>> +     * Some drivers may want to embed struct drm_gpuva_op into driver
>> +     * specific structures. By implementing this callback drivers can
>> +     * allocate memory accordingly.
>> +     *
>> +     * This callback is optional.
>> +     */
>> +    struct drm_gpuva_op *(*op_alloc)(void);
>> +
>> +    /**
>> +     * @op_free: called when the &drm_gpuva_manager frees a
>> +     * struct drm_gpuva_op
>> +     *
>> +     * Some drivers may want to embed struct drm_gpuva_op into driver
>> +     * specific structures. By implementing this callback drivers can
>> +     * free the previously allocated memory accordingly.
>> +     *
>> +     * This callback is optional.
>> +     */
>> +    void (*op_free)(struct drm_gpuva_op *op);
>> +
>> +    /**
>> +     * @sm_step_map: called from &drm_gpuva_sm_map to finally insert the
>> +     * mapping once all previous steps were completed
>> +     *
>> +     * The &priv pointer matches the one the driver passed to
>> +     * &drm_gpuva_sm_map or &drm_gpuva_sm_unmap, respectively.
>> +     *
>> +     * Can be NULL if &drm_gpuva_sm_map is used.
>> +     */
>> +    int (*sm_step_map)(struct drm_gpuva_op *op, void *priv);
>> +
>> +    /**
>> +     * @sm_step_remap: called from &drm_gpuva_sm_map and
>> +     * &drm_gpuva_sm_unmap to split up an existent mapping
>> +     *
>> +     * This callback is called when existent mapping needs to be
>> split up.
>> +     * This is the case when either a newly requested mapping
>> overlaps or
>> +     * is enclosed by an existent mapping or a partial unmap of an
>> existent
>> +     * mapping is requested.
>> +     *
>> +     * The &priv pointer matches the one the driver passed to
>> +     * &drm_gpuva_sm_map or &drm_gpuva_sm_unmap, respectively.
>> +     *
>> +     * Can be NULL if neither &drm_gpuva_sm_map nor
>> &drm_gpuva_sm_unmap is
>> +     * used.
>> +     */
>> +    int (*sm_step_remap)(struct drm_gpuva_op *op, void *priv);
>> +
>> +    /**
>> +     * @sm_step_unmap: called from &drm_gpuva_sm_map and
>> +     * &drm_gpuva_sm_unmap to unmap an existent mapping
>> +     *
>> +     * This callback is called when existent mapping needs to be
>> unmapped.
>> +     * This is the case when either a newly requested mapping
>> encloses an
>> +     * existent mapping or an unmap of an existent mapping is requested.
>> +     *
>> +     * The &priv pointer matches the one the driver passed to
>> +     * &drm_gpuva_sm_map or &drm_gpuva_sm_unmap, respectively.
>> +     *
>> +     * Can be NULL if neither &drm_gpuva_sm_map nor
>> &drm_gpuva_sm_unmap is
>> +     * used.
>> +     */
>> +    int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
>> +};
>> +
>> +int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
>> +             u64 addr, u64 range,
>> +             struct drm_gem_object *obj, u64 offset);
>> +
>> +int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
>> +               u64 addr, u64 range);
>> +
>> +void drm_gpuva_map(struct drm_gpuva_manager *mgr,
>> +           struct drm_gpuva *va,
>> +           struct drm_gpuva_op_map *op);
> Missing newline
>> +void drm_gpuva_remap(struct drm_gpuva *prev,
>> +             struct drm_gpuva *next,
>> +             struct drm_gpuva_op_remap *op);
> Missing newline
>> +void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
>> +
>> +#endif /* __DRM_GPUVA_MGR_H__ */
>


2023-07-06 16:17:10

by Danilo Krummrich

[permalink] [raw]
Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings

Hi Donald,

On 7/6/23 17:45, Donald Robson wrote:
> On Fri, 2023-06-30 at 00:25 +0200, Danilo Krummrich wrote:
>>
>> +#ifdef CONFIG_LOCKDEP
>> +typedef struct lockdep_map *lockdep_map_p;
>> +#define drm_gpuva_manager_ext_assert_held(mgr) \
>> + lockdep_assert(lock_is_held((mgr)->ext_lock) != LOCK_STATE_NOT_HELD)
>> +/**
>> + * drm_gpuva_manager_set_ext_lock - set the external lock according to
>> + * @DRM_GPUVA_MANAGER_LOCK_EXTERN
>> + * @mgr: the &drm_gpuva_manager to set the lock for
>> + * @lock: the lock to set
>> + *
>> + * If @DRM_GPUVA_MANAGER_LOCK_EXTERN is set, drivers need to call this function
>> + * to provide the lock used to lock linking and unlinking of &drm_gpuvas to the
>> + * &drm_gem_objects GPUVA list.
>> + */
>> +#define drm_gpuva_manager_set_ext_lock(mgr, lock) \
>> + (mgr)->ext_lock = &(lock)->dep_map
>> +#else
>> +typedef struct { /* nothing */ } lockdep_map_p;
>
> lockdep_map_p conflicts with an identical typedef in maple_tree.h when CONFIG_LOCKDEP is
> not set (it's being pulled in by mm.h in drm_vma_manager.h). I'll just comment the line
> out for now.

Good catch! I got this trick from maple_tree.h and intended to move it
to the lockdep header in a separate patch to avoid such collisions.
Unfortunately, I forgot about it. Gonna fix it up.

- Danilo

>
>> +#define drm_gpuva_manager_ext_assert_held(mgr) do { (void)(mgr); } while (0)
>> +#define drm_gpuva_manager_set_ext_lock(mgr, lock) do { } while (0)
>> +#endif
>> +


2023-07-06 16:30:22

by Boris Brezillon

[permalink] [raw]
Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings

On Thu, 6 Jul 2023 17:06:08 +0200
Danilo Krummrich <[email protected]> wrote:

> Hi Boris,
>
> On 6/30/23 10:02, Boris Brezillon wrote:
> > Hi Danilo,
> >
> > On Fri, 30 Jun 2023 00:25:18 +0200
> > Danilo Krummrich <[email protected]> wrote:
> >
> >> + * int driver_gpuva_remap(struct drm_gpuva_op *op, void *__ctx)
> >> + * {
> >> + * struct driver_context *ctx = __ctx;
> >> + *
> >> + * drm_gpuva_remap(ctx->prev_va, ctx->next_va, &op->remap);
> >> + *
> >> + * drm_gpuva_unlink(op->remap.unmap->va);
> >> + * kfree(op->remap.unmap->va);
> >> + *
> >> + * if (op->remap.prev) {
> >> + * drm_gpuva_link(ctx->prev_va);
> >
> > I ended up switching to dma_resv-based locking for the GEMs and I
> > wonder what the locking is supposed to look like in the async-mapping
> > case, where we insert/remove the VA nodes in the drm_sched::run_job()
> > path.
>
> If you decide to pick the interface where you just call
> drm_gpuva_sm_[un]map() and receive a callback for each operation it
> takes to fulfill the request, you probably do this because you want to
> do everything one shot, updating the VA space, link/unlink GPUVAs
> to/from its corresponding backing GEMs, do the actual GPU mappings.
>
> This has a few advantages over generating a list of operations when the
> job is submitted. You've pointed out one of them, when you noticed that
> with a list of operations one can't sneak in a synchronous job between
> already queued up asynchronous jobs.
>
> However, for the asynchronous path it has the limitation that the
> dma-resv lock can't be used to link/unlink GPUVAs to/from its
> corresponding backing GEMs, since this would happen in the fence
> signalling critical path and we're not allowed to hold the dma-resv lock
> there. Hence, as we discussed I added the option for drivers to provide
> an external lock for that, just to be able to keep some lockdep checks.

Uh, okay, I guess that means I need to go back to a custom lock for VM
operations then.

>
> >
> > What I have right now is something like:
> >
> > dma_resv_lock(vm->resv);
> >
> > // split done in drm_gpuva_sm_map(), each iteration
> > // of the loop is a call to the driver ->[re,un]map()
> > // hook
> > for_each_sub_op() {
> >
> > // Private BOs have their resv field pointing to the
> > // VM resv and we take the VM resv lock before calling
> > // drm_gpuva_sm_map()
> > if (vm->resv != gem->resv)
> > dma_resv_lock(gem->resv);
> >
> > drm_gpuva_[un]link(va);
> > gem_[un]pin(gem);
> >
> > if (vm->resv != gem->resv)
> > dma_resv_unlock(gem->resv);
> > }
> >
> > dma_resv_unlock(vm->resv);
> >
>
> I'm not sure I get this code right, reading "for_each_sub_op()" and
> "drm_gpuva_sm_map()" looks a bit like things are mixed up?
>
> Or do you mean to represent the sum of all callbacks with
> "for_each_sub_op()"?

That ^.

> In this case I assume this code runs in
> drm_sched::run_job() and hence isn't allowed to take the dma-resv lock.

Yeah, I didn't realize that taking the dma-resv lock in the
dma-signaling path was forbidden. I think it's fine for the drm_gpuva
destroy operation (which calls drm_gem_shmem_unpin(), which in turns
acquires the resv lock) because I can move that to a worker and get it
out of the dma-signaling path. The problem remains for remap operations
though. I need to call drm_gem_shmem_pin() so we retain the pages even
after the unmapped gpuva object that's in the middle of a mapping is
released. I guess one option would be to use an atomic_t for
drm_shmem_gem_object::pages_use_count, and
have something like:

int drm_gem_shmem_pin(struct drm_gem_shmem_object *shmem)
{
int ret;

if (atomic_inc_not_zero(&shmem->pages_use_count))
return 0;

dma_resv_lock(shmem->base.resv, NULL);
ret = drm_gem_shmem_pin_locked(shmem);
dma_resv_unlock(shmem->base.resv);

return ret;
}

Given the object already had its pages pinned when we remap, we're sure
the fast path will be taken, and no dma-resv lock aquired.

>
> > In practice, I don't expect things to deadlock, because the VM resv is
> > not supposed to be taken outside the VM context and the locking order
> > is always the same (VM lock first, and then each shared BO
> > taken/released independently), but I'm not super thrilled by this
> > nested lock, and I'm wondering if we shouldn't have a pass collecting
> > locks in a drm_exec context first, and then have
> > the operations executed. IOW, something like that:
> >
> > drm_exec_init(exec, DRM_EXEC_IGNORE_DUPLICATES)
> > drm_exec_until_all_locked(exec) {
> > // Dummy GEM is the dummy GEM object I use to make the VM
> > // participate in the locking without having to teach
> > // drm_exec how to deal with raw dma_resv objects.
> > ret = drm_exec_lock_obj(exec, vm->dummy_gem);
> > drm_exec_retry_on_contention(exec);
> > if (ret)
> > return ret;
> >
> > // Could take the form of drm_gpuva_sm_[un]map_acquire_locks()
> > // helpers
> > for_each_sub_op() {
> > ret = drm_exec_lock_obj(exec, gem);
> > if (ret)
> > return ret;
> > }
> > }
> >
> > // each iteration of the loop is a call to the driver
> > // ->[re,un]map() hook
> > for_each_sub_op() {
> > ...
> > gem_[un]pin_locked(gem);
> > drm_gpuva_[un]link(va);
> > ...
> > }
> >
> > drm_exec_fini(exec);
>
> I have a follow-up patch (still WIP) in the queue to generalize dma-resv
> handling, fence handling and GEM validation within the GPUVA manager as
> optional helper functions:
> https://gitlab.freedesktop.org/nouvelles/kernel/-/commit/a5fc29f3b1edbf3f96fb5a21b858ffe00a3f2584

Thanks for the heads-up. That's more or less what I have, except I'm
attaching a dummy_gem object to the VM so it can be passed to drm_exec
directly (instead of having a separate ww_acquire_ctx).

2023-07-06 16:42:04

by Donald Robson

[permalink] [raw]
Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings

On Fri, 2023-06-30 at 00:25 +0200, Danilo Krummrich wrote:
>
> +#ifdef CONFIG_LOCKDEP
> +typedef struct lockdep_map *lockdep_map_p;
> +#define drm_gpuva_manager_ext_assert_held(mgr) \
> + lockdep_assert(lock_is_held((mgr)->ext_lock) != LOCK_STATE_NOT_HELD)
> +/**
> + * drm_gpuva_manager_set_ext_lock - set the external lock according to
> + * @DRM_GPUVA_MANAGER_LOCK_EXTERN
> + * @mgr: the &drm_gpuva_manager to set the lock for
> + * @lock: the lock to set
> + *
> + * If @DRM_GPUVA_MANAGER_LOCK_EXTERN is set, drivers need to call this function
> + * to provide the lock used to lock linking and unlinking of &drm_gpuvas to the
> + * &drm_gem_objects GPUVA list.
> + */
> +#define drm_gpuva_manager_set_ext_lock(mgr, lock) \
> + (mgr)->ext_lock = &(lock)->dep_map
> +#else
> +typedef struct { /* nothing */ } lockdep_map_p;

lockdep_map_p conflicts with an identical typedef in maple_tree.h when CONFIG_LOCKDEP is
not set (it's being pulled in by mm.h in drm_vma_manager.h). I'll just comment the line
out for now.

> +#define drm_gpuva_manager_ext_assert_held(mgr) do { (void)(mgr); } while (0)
> +#define drm_gpuva_manager_set_ext_lock(mgr, lock) do { } while (0)
> +#endif
> +

Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings


On 7/6/23 17:48, Danilo Krummrich wrote:
> Hi Thomas,
>
> On 7/6/23 10:49, Thomas Hellström (Intel) wrote:
>> Hi, Danilo
>>
>> Some review comments below:
>>
>> On 6/30/23 00:25, Danilo Krummrich wrote:
>>> Add infrastructure to keep track of GPU virtual address (VA) mappings
>>> with a decicated VA space manager implementation.
>>>
>>> New UAPIs, motivated by Vulkan sparse memory bindings graphics drivers
>>> start implementing, allow userspace applications to request multiple
>>> and
>>> arbitrary GPU VA mappings of buffer objects. The DRM GPU VA manager is
>>> intended to serve the following purposes in this context.
>>>
>>> 1) Provide infrastructure to track GPU VA allocations and mappings,
>>>     making use of the maple_tree.
>>
>> It looks like we're not using the maple tree anymore, but rather an
>> instantiation of an interval tree.
>>
>> (Perhaps as a follow-up it makes sense to provide a pre-instantiated
>> common u64 version of the interval tree in addition to the unsigned
>> long one since it appears to be used in multiple places in graphics
>> drivers).
>>
>>> 2) Generically connect GPU VA mappings to their backing buffers, in
>>>     particular DRM GEM objects.
>>>
>>> 3) Provide a common implementation to perform more complex mapping
>>>     operations on the GPU VA space. In particular splitting and merging
>>>     of GPU VA mappings, e.g. for intersecting mapping requests or
>>> partial
>>>     unmap requests.
>>>
>>> Tested-by: Donald Robson<[email protected]>
>>> Reviewed-by: Boris Brezillon<[email protected]>
>>> Suggested-by: Dave Airlie<[email protected]>
>>> Signed-off-by: Danilo Krummrich<[email protected]>
>>> ---
>>>   Documentation/gpu/drm-mm.rst    |   36 +
>>>   drivers/gpu/drm/Makefile        |    1 +
>>>   drivers/gpu/drm/drm_gem.c       |    3 +
>>>   drivers/gpu/drm/drm_gpuva_mgr.c | 1743
>>> +++++++++++++++++++++++++++++++
>>>   include/drm/drm_drv.h           |    6 +
>>>   include/drm/drm_gem.h           |   52 +
>>>   include/drm/drm_gpuva_mgr.h     |  756 ++++++++++++++
>>>   7 files changed, 2597 insertions(+)
>>>   create mode 100644 drivers/gpu/drm/drm_gpuva_mgr.c
>>>   create mode 100644 include/drm/drm_gpuva_mgr.h
>>>
>>> diff --git a/Documentation/gpu/drm-mm.rst
>>> b/Documentation/gpu/drm-mm.rst
>>> index a52e6f4117d6..3d5dc9dc1bfe 100644
>>> --- a/Documentation/gpu/drm-mm.rst
>>> +++ b/Documentation/gpu/drm-mm.rst
>>> @@ -466,6 +466,42 @@ DRM MM Range Allocator Function References
>>>   .. kernel-doc:: drivers/gpu/drm/drm_mm.c
>>>      :export:
>>> +DRM GPU VA Manager
>>> +==================
>>> +
>>> +Overview
>>> +--------
>>> +
>>> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
>>> +   :doc: Overview
>>> +
>>> +Split and Merge
>>> +---------------
>>> +
>>> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
>>> +   :doc: Split and Merge
>>> +
>>> +Locking
>>> +-------
>>> +
>>> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
>>> +   :doc: Locking
>>> +
>>> +Examples
>>> +--------
>>> +
>>> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
>>> +   :doc: Examples
>>> +
>>> +DRM GPU VA Manager Function References
>>> +--------------------------------------
>>> +
>>> +.. kernel-doc:: include/drm/drm_gpuva_mgr.h
>>> +   :internal:
>>> +
>>> +.. kernel-doc:: drivers/gpu/drm/drm_gpuva_mgr.c
>>> +   :export:
>>> +
>>>   DRM Buddy Allocator
>>>   ===================
>>> diff --git a/drivers/gpu/drm/Makefile b/drivers/gpu/drm/Makefile
>>> index 414855e2a463..6d6c9dec66e8 100644
>>> --- a/drivers/gpu/drm/Makefile
>>> +++ b/drivers/gpu/drm/Makefile
>>> @@ -45,6 +45,7 @@ drm-y := \
>>>       drm_vblank.o \
>>>       drm_vblank_work.o \
>>>       drm_vma_manager.o \
>>> +    drm_gpuva_mgr.o \
>>>       drm_writeback.o
>>>   drm-$(CONFIG_DRM_LEGACY) += \
>>>       drm_agpsupport.o \
>>> diff --git a/drivers/gpu/drm/drm_gem.c b/drivers/gpu/drm/drm_gem.c
>>> index 1a5a2cd0d4ec..cd878ebddbd0 100644
>>> --- a/drivers/gpu/drm/drm_gem.c
>>> +++ b/drivers/gpu/drm/drm_gem.c
>>> @@ -164,6 +164,9 @@ void drm_gem_private_object_init(struct
>>> drm_device *dev,
>>>       if (!obj->resv)
>>>           obj->resv = &obj->_resv;
>>> +    if (drm_core_check_feature(dev, DRIVER_GEM_GPUVA))
>>> +        drm_gem_gpuva_init(obj);
>>> +
>>>       drm_vma_node_reset(&obj->vma_node);
>>>       INIT_LIST_HEAD(&obj->lru_node);
>>>   }
>>> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c
>>> b/drivers/gpu/drm/drm_gpuva_mgr.c
>>> new file mode 100644
>>> index 000000000000..4414990c05cc
>>> --- /dev/null
>>> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
>>> @@ -0,0 +1,1743 @@
>>> +// SPDX-License-Identifier: GPL-2.0
>>> +/*
>> SPDX-License is GPL-2.0 but below looks like MIT. Can we use "GPL-2.0
>> OR MIT" or does something restrict it to GPL-only?
>>> + * Copyright (c) 2022 Red Hat.
>>> + *
>>> + * Permission is hereby granted, free of charge, to any person
>>> obtaining a
>>> + * copy of this software and associated documentation files (the
>>> "Software"),
>>> + * to deal in the Software without restriction, including without
>>> limitation
>>> + * the rights to use, copy, modify, merge, publish, distribute,
>>> sublicense,
>>> + * and/or sell copies of the Software, and to permit persons to
>>> whom the
>>> + * Software is furnished to do so, subject to the following
>>> conditions:
>>> + *
>>> + * The above copyright notice and this permission notice shall be
>>> included in
>>> + * all copies or substantial portions of the Software.
>>> + *
>>> + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
>>> EXPRESS OR
>>> + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
>>> MERCHANTABILITY,
>>> + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.  IN NO
>>> EVENT SHALL
>>> + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM,
>>> DAMAGES OR
>>> + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR
>>> OTHERWISE,
>>> + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE
>>> USE OR
>>> + * OTHER DEALINGS IN THE SOFTWARE.
>>> + *
>>> + * Authors:
>>> + *     Danilo Krummrich<[email protected]>
>>> + *
>>> + */
>>> +
>>> +#include <drm/drm_gpuva_mgr.h>
>>> +
>>> +#include <linux/interval_tree_generic.h>
>>> +#include <linux/mm.h>
>>> +
>>> +/**
>>> + * DOC: Overview
>>> + *
>>> + * The DRM GPU VA Manager, represented by struct drm_gpuva_manager
>>> keeps track
>>> + * of a GPU's virtual address (VA) space and manages the
>>> corresponding virtual
>>> + * mappings represented by &drm_gpuva objects. It also keeps track
>>> of the
>>> + * mapping's backing &drm_gem_object buffers.
>>> + *
>>> + * &drm_gem_object buffers maintain a list of &drm_gpuva objects
>>> representing
>>> + * all existent GPU VA mappings using this &drm_gem_object as
>>> backing buffer.
>>> + *
>>> + * GPU VAs can be flagged as sparse, such that drivers may use GPU
>>> VAs to also
>>> + * keep track of sparse PTEs in order to support Vulkan 'Sparse
>>> Resources'.
>>> + *
>>> + * The GPU VA manager internally uses a rb-tree to manage the
>>> + * &drm_gpuva mappings within a GPU's virtual address space.
>>> + *
>>> + * The &drm_gpuva_manager contains a special &drm_gpuva
>>> representing the
>>> + * portion of VA space reserved by the kernel. This node is
>>> initialized together
>>> + * with the GPU VA manager instance and removed when the GPU VA
>>> manager is
>>> + * destroyed.
>>> + *
>>> + * In a typical application drivers would embed struct
>>> drm_gpuva_manager and
>>> + * struct drm_gpuva within their own driver specific structures,
>>> there won't be
>>> + * any memory allocations of it's own nor memory allocations of
>>> &drm_gpuva
>> s/it's/its/
>>> + * entries.
>>> + *
>>> + * The data structures needed to store &drm_gpuvas within the
>>> &drm_gpuva_manager
>>> + * are contained within struct drm_gpuva already. Hence, for inserting
>>> + * &drm_gpuva entries from within dma-fence signalling critical
>>> sections it is
>>> + * enough to pre-allocate the &drm_gpuva structures.
>>> + */
>>> +
>>> +/**
>>> + * DOC: Split and Merge
>>> + *
>>> + * Besides it's capability to manage and represent a GPU VA space, the
>> s/it's/its/
>>> + * &drm_gpuva_manager also provides functions to let the
>>> &drm_gpuva_manager
>>> + * calculate a sequence of operations to satisfy a given map or
>>> unmap request.
>>> + *
>>> + * Therefore the DRM GPU VA manager provides an algorithm
>>> implementing splitting
>>> + * and merging of existent GPU VA mappings with the ones that are
>>> requested to
>>> + * be mapped or unmapped. This feature is required by the Vulkan
>>> API to
>>> + * implement Vulkan 'Sparse Memory Bindings' - drivers UAPIs often
>>> refer to this
>>> + * as VM BIND.
>>> + *
>>> + * Drivers can call drm_gpuva_sm_map() to receive a sequence of
>>> callbacks
>>> + * containing map, unmap and remap operations for a given newly
>>> requested
>>> + * mapping. The sequence of callbacks represents the set of
>>> operations to
>>> + * execute in order to integrate the new mapping cleanly into the
>>> current state
>>> + * of the GPU VA space.
>>> + *
>>> + * Depending on how the new GPU VA mapping intersects with the
>>> existent mappings
>>> + * of the GPU VA space the &drm_gpuva_fn_ops callbacks contain an
>>> arbitrary
>>> + * amount of unmap operations, a maximum of two remap operations
>>> and a single
>>> + * map operation. The caller might receive no callback at all if no
>>> operation is
>>> + * required, e.g. if the requested mapping already exists in the
>>> exact same way.
>>> + *
>>> + * The single map operation represents the original map operation
>>> requested by
>>> + * the caller.
>>> + *
>>> + * &drm_gpuva_op_unmap contains a 'keep' field, which indicates
>>> whether the
>>> + * &drm_gpuva to unmap is physically contiguous with the original
>>> mapping
>>> + * request. Optionally, if 'keep' is set, drivers may keep the
>>> actual page table
>>> + * entries for this &drm_gpuva, adding the missing page table
>>> entries only and
>>> + * update the &drm_gpuva_manager's view of things accordingly.
>>> + *
>>> + * Drivers may do the same optimization, namely delta page table
>>> updates, also
>>> + * for remap operations. This is possible since &drm_gpuva_op_remap
>>> consists of
>>> + * one unmap operation and one or two map operations, such that
>>> drivers can
>>> + * derive the page table update delta accordingly.
>>> + *
>>> + * Note that there can't be more than two existent mappings to
>>> split up, one at
>>> + * the beginning and one at the end of the new mapping, hence there
>>> is a
>>> + * maximum of two remap operations.
>>> + *
>>> + * Analogous to drm_gpuva_sm_map() drm_gpuva_sm_unmap() uses
>>> &drm_gpuva_fn_ops
>>> + * to call back into the driver in order to unmap a range of GPU VA
>>> space. The
>>> + * logic behind this function is way simpler though: For all
>>> existent mappings
>>> + * enclosed by the given range unmap operations are created. For
>>> mappings which
>>> + * are only partically located within the given range, remap
>>> operations are
>>> + * created such that those mappings are split up and re-mapped
>>> partically.
>>> + *
>>> + * As an alternative to drm_gpuva_sm_map() and drm_gpuva_sm_unmap(),
>>> + * drm_gpuva_sm_map_ops_create() and
>>> drm_gpuva_sm_unmap_ops_create() can be used
>>> + * to directly obtain an instance of struct drm_gpuva_ops
>>> containing a list of
>>> + * &drm_gpuva_op, which can be iterated with
>>> drm_gpuva_for_each_op(). This list
>>> + * contains the &drm_gpuva_ops analogous to the callbacks one would
>>> receive when
>>> + * calling drm_gpuva_sm_map() or drm_gpuva_sm_unmap(). While this
>>> way requires
>>> + * more memory (to allocate the &drm_gpuva_ops), it provides
>>> drivers a way to
>>> + * iterate the &drm_gpuva_op multiple times, e.g. once in a context
>>> where memory
>>> + * allocations are possible (e.g. to allocate GPU page tables) and
>>> once in the
>>> + * dma-fence signalling critical path.
>>> + *
>>> + * To update the &drm_gpuva_manager's view of the GPU VA space
>>> + * drm_gpuva_insert() and drm_gpuva_remove() may be used. These
>>> functions can
>>> + * safely be used from &drm_gpuva_fn_ops callbacks originating from
>>> + * drm_gpuva_sm_map() or drm_gpuva_sm_unmap(). However, it might be
>>> more
>>> + * convenient to use the provided helper functions drm_gpuva_map(),
>>> + * drm_gpuva_remap() and drm_gpuva_unmap() instead.
>>> + *
>>> + * The following diagram depicts the basic relationships of
>>> existent GPU VA
>>> + * mappings, a newly requested mapping and the resulting mappings
>>> as implemented
>>> + * by drm_gpuva_sm_map() - it doesn't cover any arbitrary
>>> combinations of these.
>>> + *
>>> + * 1) Requested mapping is identical. Replace it, but indicate the
>>> backing PTEs
>>> + *    could be kept.
>>> + *
>>> + *    ::
>>> + *
>>> + *         0     a     1
>>> + *    old: |-----------| (bo_offset=n)
>>> + *
>>> + *         0     a     1
>>> + *    req: |-----------| (bo_offset=n)
>>> + *
>>> + *         0     a     1
>>> + *    new: |-----------| (bo_offset=n)
>>> + *
>>> + *
>>> + * 2) Requested mapping is identical, except for the BO offset,
>>> hence replace
>>> + *    the mapping.
>>> + *
>>> + *    ::
>>> + *
>>> + *         0     a     1
>>> + *    old: |-----------| (bo_offset=n)
>>> + *
>>> + *         0     a     1
>>> + *    req: |-----------| (bo_offset=m)
>>> + *
>>> + *         0     a     1
>>> + *    new: |-----------| (bo_offset=m)
>>> + *
>>> + *
>>> + * 3) Requested mapping is identical, except for the backing BO,
>>> hence replace
>>> + *    the mapping.
>>> + *
>>> + *    ::
>>> + *
>>> + *         0     a     1
>>> + *    old: |-----------| (bo_offset=n)
>>> + *
>>> + *         0     b     1
>>> + *    req: |-----------| (bo_offset=n)
>>> + *
>>> + *         0     b     1
>>> + *    new: |-----------| (bo_offset=n)
>>> + *
>>> + *
>>> + * 4) Existent mapping is a left aligned subset of the requested
>>> one, hence
>>> + *    replace the existent one.
>>> + *
>>> + *    ::
>>> + *
>>> + *         0  a  1
>>> + *    old: |-----|       (bo_offset=n)
>>> + *
>>> + *         0     a     2
>>> + *    req: |-----------| (bo_offset=n)
>>> + *
>>> + *         0     a     2
>>> + *    new: |-----------| (bo_offset=n)
>>> + *
>>> + *    .. note::
>>> + *       We expect to see the same result for a request with a
>>> different BO
>>> + *       and/or non-contiguous BO offset.
>>> + *
>>> + *
>>> + * 5) Requested mapping's range is a left aligned subset of the
>>> existent one,
>>> + *    but backed by a different BO. Hence, map the requested
>>> mapping and split
>>> + *    the existent one adjusting it's BO offset.
>> Typo: s/it's/its/ above and in multiple places below.
>>> + *
>>> + *    ::
>>> + *
>>> + *         0     a     2
>>> + *    old: |-----------| (bo_offset=n)
>>> + *
>>> + *         0  b  1
>>> + *    req: |-----|       (bo_offset=n)
>>> + *
>>> + *         0  b  1  a' 2
>>> + *    new: |-----|-----| (b.bo_offset=n, a.bo_offset=n+1)
>>> + *
>>> + *    .. note::
>>> + *       We expect to see the same result for a request with a
>>> different BO
>>> + *       and/or non-contiguous BO offset.
>>> + *
>>> + *
>>> + * 6) Existent mapping is a superset of the requested mapping.
>>> Split it up, but
>>> + *    indicate that the backing PTEs could be kept.
>>> + *
>>> + *    ::
>>> + *
>>> + *         0     a     2
>>> + *    old: |-----------| (bo_offset=n)
>>> + *
>>> + *         0  a  1
>>> + *    req: |-----|       (bo_offset=n)
>>> + *
>>> + *         0  a  1  a' 2
>>> + *    new: |-----|-----| (a.bo_offset=n, a'.bo_offset=n+1)
>>> + *
>>> + *
>>> + * 7) Requested mapping's range is a right aligned subset of the
>>> existent one,
>>> + *    but backed by a different BO. Hence, map the requested
>>> mapping and split
>>> + *    the existent one, without adjusting the BO offset.
>>> + *
>>> + *    ::
>>> + *
>>> + *         0     a     2
>>> + *    old: |-----------| (bo_offset=n)
>>> + *
>>> + *               1  b  2
>>> + *    req:       |-----| (bo_offset=m)
>>> + *
>>> + *         0  a  1  b  2
>>> + *    new: |-----|-----| (a.bo_offset=n,b.bo_offset=m)
>>> + *
>>> + *
>>> + * 8) Existent mapping is a superset of the requested mapping.
>>> Split it up, but
>>> + *    indicate that the backing PTEs could be kept.
>>> + *
>>> + *    ::
>>> + *
>>> + *          0     a     2
>>> + *    old: |-----------| (bo_offset=n)
>>> + *
>>> + *               1  a  2
>>> + *    req:       |-----| (bo_offset=n+1)
>>> + *
>>> + *         0  a' 1  a  2
>>> + *    new: |-----|-----| (a'.bo_offset=n, a.bo_offset=n+1)
>>> + *
>>> + *
>>> + * 9) Existent mapping is overlapped at the end by the requested
>>> mapping backed
>>> + *    by a different BO. Hence, map the requested mapping and split
>>> up the
>>> + *    existent one, without adjusting the BO offset.
>>> + *
>>> + *    ::
>>> + *
>>> + *         0     a     2
>>> + *    old: |-----------|       (bo_offset=n)
>>> + *
>>> + *               1     b     3
>>> + *    req:       |-----------| (bo_offset=m)
>>> + *
>>> + *         0  a  1     b     3
>>> + *    new: |-----|-----------| (a.bo_offset=n,b.bo_offset=m)
>>> + *
>>> + *
>>> + * 10) Existent mapping is overlapped by the requested mapping,
>>> both having the
>>> + *     same backing BO with a contiguous offset. Indicate the
>>> backing PTEs of
>>> + *     the old mapping could be kept.
>>> + *
>>> + *     ::
>>> + *
>>> + *          0     a     2
>>> + *     old: |-----------|       (bo_offset=n)
>>> + *
>>> + *                1     a     3
>>> + *     req:       |-----------| (bo_offset=n+1)
>>> + *
>>> + *          0  a' 1     a     3
>>> + *     new: |-----|-----------| (a'.bo_offset=n, a.bo_offset=n+1)
>>> + *
>>> + *
>>> + * 11) Requested mapping's range is a centered subset of the
>>> existent one
>>> + *     having a different backing BO. Hence, map the requested
>>> mapping and split
>>> + *     up the existent one in two mappings, adjusting the BO offset
>>> of the right
>>> + *     one accordingly.
>>> + *
>>> + *     ::
>>> + *
>>> + *          0        a        3
>>> + *     old: |-----------------| (bo_offset=n)
>>> + *
>>> + *                1  b  2
>>> + *     req:       |-----|       (bo_offset=m)
>>> + *
>>> + *          0  a  1  b  2  a' 3
>>> + *     new: |-----|-----|-----|
>>> (a.bo_offset=n,b.bo_offset=m,a'.bo_offset=n+2)
>>> + *
>>> + *
>>> + * 12) Requested mapping is a contiguous subset of the existent
>>> one. Split it
>>> + *     up, but indicate that the backing PTEs could be kept.
>>> + *
>>> + *     ::
>>> + *
>>> + *          0        a        3
>>> + *     old: |-----------------| (bo_offset=n)
>>> + *
>>> + *                1  a  2
>>> + *     req:       |-----|       (bo_offset=n+1)
>>> + *
>>> + *          0  a' 1  a  2 a'' 3
>>> + *     old: |-----|-----|-----| (a'.bo_offset=n, a.bo_offset=n+1,
>>> a''.bo_offset=n+2)
>>> + *
>>> + *
>>> + * 13) Existent mapping is a right aligned subset of the requested
>>> one, hence
>>> + *     replace the existent one.
>>> + *
>>> + *     ::
>>> + *
>>> + *                1  a  2
>>> + *     old:       |-----| (bo_offset=n+1)
>>> + *
>>> + *          0     a     2
>>> + *     req: |-----------| (bo_offset=n)
>>> + *
>>> + *          0     a     2
>>> + *     new: |-----------| (bo_offset=n)
>>> + *
>>> + *     .. note::
>>> + *        We expect to see the same result for a request with a
>>> different bo
>>> + *        and/or non-contiguous bo_offset.
>>> + *
>>> + *
>>> + * 14) Existent mapping is a centered subset of the requested one,
>>> hence
>>> + *     replace the existent one.
>>> + *
>>> + *     ::
>>> + *
>>> + *                1  a  2
>>> + *     old:       |-----| (bo_offset=n+1)
>>> + *
>>> + *          0        a       3
>>> + *     req: |----------------| (bo_offset=n)
>>> + *
>>> + *          0        a       3
>>> + *     new: |----------------| (bo_offset=n)
>>> + *
>>> + *     .. note::
>>> + *        We expect to see the same result for a request with a
>>> different bo
>>> + *        and/or non-contiguous bo_offset.
>>> + *
>>> + *
>>> + * 15) Existent mappings is overlapped at the beginning by the
>>> requested mapping
>>> + *     backed by a different BO. Hence, map the requested mapping
>>> and split up
>>> + *     the existent one, adjusting it's BO offset accordingly.
>>> + *
>>> + *     ::
>>> + *
>>> + *                1     a     3
>>> + *     old:       |-----------| (bo_offset=n)
>>> + *
>>> + *          0     b     2
>>> + *     req: |-----------|       (bo_offset=m)
>>> + *
>>> + *          0     b     2  a' 3
>>> + *     new: |-----------|-----| (b.bo_offset=m,a.bo_offset=n+2)
>>> + */
>>> +
>>> +/**
>>> + * DOC: Locking
>>> + *
>>> + * Generally, the GPU VA manager does not take care of locking
>>> itself, it is
>>> + * the drivers responsibility to take care about locking. Drivers
>>> might want to
>>> + * protect the following operations: inserting, removing and iterating
>>> + * &drm_gpuva objects as well as generating all kinds of
>>> operations, such as
>>> + * split / merge or prefetch.
>>> + *
>>> + * The GPU VA manager also does not take care of the locking of the
>>> backing
>>> + * &drm_gem_object buffers GPU VA lists by itself; drivers are
>>> responsible to
>>> + * enforce mutual exclusion using either the GEMs dma_resv lock or
>>> alternatively
>>> + * a driver specific external lock by setting the
>>> @DRM_GPUVA_MANAGER_LOCK_EXTERN
>>> + * flag.
>>
>> Is the external lock used or anticipated by any WIP implementation?
>> If not I suggest to leave it out until there is a user.
>
> I think the PowerVR driver requires this. However, it's generally
> useful for any driver using direct callbacks rather than drm_gpuva_ops.
>
> Once the page table handling in Nouveau is re-worked, and direct
> callbacks can be used, I probably want to use this in Nouveau as well.
>
> Gonna fix up all other comments.

Sounds good to me. The main concern here with the external lock was
testing coverage if no user. I don't have anything against it per se.

Going on vacation on Monday, but that Acked-by: once addressed on IRC
still holds.

Thanks,

Thomas




>
> - Danilo
>
>>
>>> + *
>>> + * For the latter, functions such as drm_gpuva_link() or
>>> drm_gpuva_unlink()
>>> + * contain lockdep checks to indicate locking issues. For this to
>>> work drivers
>>> + * must provide (in case the @DRM_GPUVA_MANAGER_LOCK_EXTERN flag is
>>> set) their
>>> + * external lock with drm_gpuva_manager_set_ext_lock() after
>>> initialization.
>>> + */
>>> +
>>> +/**
>>> + * DOC: Examples
>>> + *
>>> + * This section gives two examples on how to let the DRM GPUVA
>>> Manager generate
>>> + * &drm_gpuva_op in order to satisfy a given map or unmap request
>>> and how to
>>> + * make use of them.
>>> + *
>>> + * The below code is strictly limited to illustrate the generic
>>> usage pattern.
>>> + * To maintain simplicitly, it doesn't make use of any abstractions
>>> for common
>>> + * code, different (asyncronous) stages with fence signalling
>>> critical paths,
>>> + * any other helpers or error handling in terms of freeing memory
>>> and dropping
>>> + * previously taken locks.
>>> + *
>>> + * 1) Obtain a list of &drm_gpuva_op to create a new mapping::
>>> + *
>>> + *    // Allocates a new &drm_gpuva.
>>> + *    struct drm_gpuva * driver_gpuva_alloc(void);
>>> + *
>>> + *    // Typically drivers would embedd the &drm_gpuva_manager and
>>> &drm_gpuva
>>> + *    // structure in individual driver structures and lock the
>>> dma-resv with
>>> + *    // drm_exec or similar helpers.
>>> + *    int driver_mapping_create(struct drm_gpuva_manager *mgr,
>>> + *                  u64 addr, u64 range,
>>> + *                  struct drm_gem_object *obj, u64 offset)
>>> + *    {
>>> + *        struct drm_gpuva_ops *ops;
>>> + *        struct drm_gpuva_op *op
>>> + *
>>> + *        driver_lock_va_space();
>>> + *        ops = drm_gpuva_sm_map_ops_create(mgr, addr, range,
>>> + *                          obj, offset);
>>> + *        if (IS_ERR(ops))
>>> + *            return PTR_ERR(ops);
>>> + *
>>> + *        drm_gpuva_for_each_op(op, ops) {
>>> + *            struct drm_gpuva *va;
>>> + *
>>> + *            switch (op->op) {
>>> + *            case DRM_GPUVA_OP_MAP:
>>> + *                va = driver_gpuva_alloc();
>>> + *                if (!va)
>>> + *                    ; // unwind previous VA space updates,
>>> + *                      // free memory and unlock
>>> + *
>>> + *                driver_vm_map();
>>> + *                drm_gpuva_map(mgr, va, &op->map);
>>> + *                drm_gpuva_link(va);
>>> + *
>>> + *                break;
>>> + *            case DRM_GPUVA_OP_REMAP: {
>>> + *                struct drm_gpuva *prev = NULL, *next = NULL;
>>> + *
>>> + *                va = op->remap.unmap->va;
>>> + *
>>> + *                if (op->remap.prev) {
>>> + *                    prev = driver_gpuva_alloc();
>>> + *                    if (!prev)
>>> + *                        ; // unwind previous VA space
>>> + *                          // updates, free memory and
>>> + *                          // unlock
>>> + *                }
>>> + *
>>> + *                if (op->remap.next) {
>>> + *                    next = driver_gpuva_alloc();
>>> + *                    if (!next)
>>> + *                        ; // unwind previous VA space
>>> + *                          // updates, free memory and
>>> + *                          // unlock
>>> + *                }
>>> + *
>>> + *                driver_vm_remap();
>>> + *                drm_gpuva_remap(prev, next, &op->remap);
>>> + *
>>> + *                drm_gpuva_unlink(va);
>>> + *                if (prev)
>>> + *                    drm_gpuva_link(prev);
>>> + *                if (next)
>>> + *                    drm_gpuva_link(next);
>>> + *
>>> + *                break;
>>> + *            }
>>> + *            case DRM_GPUVA_OP_UNMAP:
>>> + *                va = op->unmap->va;
>>> + *
>>> + *                driver_vm_unmap();
>>> + *                drm_gpuva_unlink(va);
>>> + *                drm_gpuva_unmap(&op->unmap);
>>> + *
>>> + *                break;
>>> + *            default:
>>> + *                break;
>>> + *            }
>>> + *        }
>>> + *        driver_unlock_va_space();
>>> + *
>>> + *        return 0;
>>> + *    }
>>> + *
>>> + * 2) Receive a callback for each &drm_gpuva_op to create a new
>>> mapping::
>>> + *
>>> + *    struct driver_context {
>>> + *        struct drm_gpuva_manager *mgr;
>>> + *        struct drm_gpuva *new_va;
>>> + *        struct drm_gpuva *prev_va;
>>> + *        struct drm_gpuva *next_va;
>>> + *    };
>>> + *
>>> + *    // ops to pass to drm_gpuva_manager_init()
>>> + *    static const struct drm_gpuva_fn_ops driver_gpuva_ops = {
>>> + *        .sm_step_map = driver_gpuva_map,
>>> + *        .sm_step_remap = driver_gpuva_remap,
>>> + *        .sm_step_unmap = driver_gpuva_unmap,
>>> + *    };
>>> + *
>>> + *    // Typically drivers would embedd the &drm_gpuva_manager and
>>> &drm_gpuva
>>> + *    // structure in individual driver structures and lock the
>>> dma-resv with
>>> + *    // drm_exec or similar helpers.
>>> + *    int driver_mapping_create(struct drm_gpuva_manager *mgr,
>>> + *                  u64 addr, u64 range,
>>> + *                  struct drm_gem_object *obj, u64 offset)
>>> + *    {
>>> + *        struct driver_context ctx;
>>> + *        struct drm_gpuva_ops *ops;
>>> + *        struct drm_gpuva_op *op;
>>> + *        int ret = 0;
>>> + *
>>> + *        ctx.mgr = mgr;
>>> + *
>>> + *        ctx.new_va = kzalloc(sizeof(*ctx.new_va), GFP_KERNEL);
>>> + *        ctx.prev_va = kzalloc(sizeof(*ctx.prev_va), GFP_KERNEL);
>>> + *        ctx.next_va = kzalloc(sizeof(*ctx.next_va), GFP_KERNEL);
>>> + *        if (!ctx.new_va || !ctx.prev_va || !ctx.next_va) {
>>> + *            ret = -ENOMEM;
>>> + *            goto out;
>>> + *        }
>>> + *
>>> + *        driver_lock_va_space();
>>> + *        ret = drm_gpuva_sm_map(mgr, &ctx, addr, range, obj, offset);
>>> + *        driver_unlock_va_space();
>>> + *
>>> + *    out:
>>> + *        kfree(ctx.new_va);
>>> + *        kfree(ctx.prev_va);
>>> + *        kfree(ctx.next_va);
>>> + *        return ret;
>>> + *    }
>>> + *
>>> + *    int driver_gpuva_map(struct drm_gpuva_op *op, void *__ctx)
>>> + *    {
>>> + *        struct driver_context *ctx = __ctx;
>>> + *
>>> + *        drm_gpuva_map(ctx->mgr, ctx->new_va, &op->map);
>>> + *
>>> + *        drm_gpuva_link(ctx->new_va);
>>> + *
>>> + *        // prevent the new GPUVA from being freed in
>>> + *        // driver_mapping_create()
>>> + *        ctx->new_va = NULL;
>>> + *
>>> + *        return 0;
>>> + *    }
>>> + *
>>> + *    int driver_gpuva_remap(struct drm_gpuva_op *op, void *__ctx)
>>> + *    {
>>> + *        struct driver_context *ctx = __ctx;
>>> + *
>>> + *        drm_gpuva_remap(ctx->prev_va, ctx->next_va, &op->remap);
>>> + *
>>> + *        drm_gpuva_unlink(op->remap.unmap->va);
>>> + *        kfree(op->remap.unmap->va);
>>> + *
>>> + *        if (op->remap.prev) {
>>> + *            drm_gpuva_link(ctx->prev_va);
>>> + *            ctx->prev_va = NULL;
>>> + *        }
>>> + *
>>> + *        if (op->remap.next) {
>>> + *            drm_gpuva_link(ctx->next_va);
>>> + *            ctx->next_va = NULL;
>>> + *        }
>>> + *
>>> + *        return 0;
>>> + *    }
>>> + *
>>> + *    int driver_gpuva_unmap(struct drm_gpuva_op *op, void *__ctx)
>>> + *    {
>>> + *        drm_gpuva_unlink(op->unmap.va);
>>> + *        drm_gpuva_unmap(&op->unmap);
>>> + *        kfree(op->unmap.va);
>>> + *
>>> + *        return 0;
>>> + *    }
>>> + */
>>> +
>>> +#define to_drm_gpuva(__node)    container_of((__node), struct
>>> drm_gpuva, rb.node)
>>> +
>>> +#define GPUVA_START(node) ((node)->va.addr)
>>> +#define GPUVA_LAST(node) ((node)->va.addr + (node)->va.range - 1)
>>> +
>>> +/* We do not actually use drm_gpuva_it_next(), tell the compiler to
>>> not complain
>>> + * about this.
>>> + */
>>> +INTERVAL_TREE_DEFINE(struct drm_gpuva, rb.node, u64,
>>> rb.__subtree_last,
>>> +             GPUVA_START, GPUVA_LAST, static __attribute__((unused)),
>>
>> Would  s/__attribute__((unused))/__maybe_unused/ work here?
>>
>>> +             drm_gpuva_it)
>>> +
>>> +static int __drm_gpuva_insert(struct drm_gpuva_manager *mgr,
>>> +                  struct drm_gpuva *va);
>>> +static void __drm_gpuva_remove(struct drm_gpuva *va);
>>> +
>>> +static inline bool
>> "static inline" is typically used only in header files, since the
>> compiler should be smart enough to inline if beneficial. Prefer
>> "static". Also in multiple places below.
>>> +drm_gpuva_check_overflow(u64 addr, u64 range)
>>> +{
>>> +    u64 end;
>>> +
>>> +    return WARN(check_add_overflow(addr, range, &end),
>>> +            "GPUVA address limited to %lu bytes.\n", sizeof(end));
>>> +}
>>> +
>>> +static inline bool
>>> +drm_gpuva_in_mm_range(struct drm_gpuva_manager *mgr, u64 addr, u64
>>> range)
>>> +{
>>> +    u64 end = addr + range;
>>> +    u64 mm_start = mgr->mm_start;
>>> +    u64 mm_end = mm_start + mgr->mm_range;
>>> +
>>> +    return addr >= mm_start && end <= mm_end;
>>> +}
>>> +
>>> +static inline bool
>>> +drm_gpuva_in_kernel_node(struct drm_gpuva_manager *mgr, u64 addr,
>>> u64 range)
>>> +{
>>> +    u64 end = addr + range;
>>> +    u64 kstart = mgr->kernel_alloc_node.va.addr;
>>> +    u64 krange = mgr->kernel_alloc_node.va.range;
>>> +    u64 kend = kstart + krange;
>>> +
>>> +    return krange && addr < kend && kstart < end;
>>> +}
>>> +
>>> +static inline bool
>>> +drm_gpuva_range_valid(struct drm_gpuva_manager *mgr,
>>> +              u64 addr, u64 range)
>>> +{
>>> +
>>> +    return !drm_gpuva_check_overflow(addr, range) &&
>>> +           drm_gpuva_in_mm_range(mgr, addr, range) &&
>>> +           !drm_gpuva_in_kernel_node(mgr, addr, range);
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_manager_init - initialize a &drm_gpuva_manager
>> Function kerneldoc names should end with "()":
>> drm_gpuva_manager_init(). General comment across the patch.
>>> + * @mgr: pointer to the &drm_gpuva_manager to initialize
>>> + * @name: the name of the GPU VA space
>>> + * @start_offset: the start offset of the GPU VA space
>>> + * @range: the size of the GPU VA space
>>> + * @reserve_offset: the start of the kernel reserved GPU VA area
>>> + * @reserve_range: the size of the kernel reserved GPU VA area
>>> + * @ops: &drm_gpuva_fn_ops called on &drm_gpuva_sm_map /
>>> &drm_gpuva_sm_unmap
>>> + * @flags: the feature flags for the &drm_gpuva_manager
>>> + *
>>> + * The &drm_gpuva_manager must be initialized with this function
>>> before use.
>>> + *
>>> + * Note that @mgr must be cleared to 0 before calling this
>>> function. The given
>>> + * &name is expected to be managed by the surrounding driver
>>> structures.
>>> + */
>>> +void
>>> +drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>> +               const char *name,
>>> +               u64 start_offset, u64 range,
>>> +               u64 reserve_offset, u64 reserve_range,
>>> +               const struct drm_gpuva_fn_ops *ops,
>>> +               enum drm_gpuva_manager_flags flags)
>>> +{
>>> +    mgr->rb.tree = RB_ROOT_CACHED;
>>> +    INIT_LIST_HEAD(&mgr->rb.list);
>>> +
>>> +    drm_gpuva_check_overflow(start_offset, range);
>>> +    mgr->mm_start = start_offset;
>>> +    mgr->mm_range = range;
>>> +
>>> +    mgr->name = name ? name : "unknown";
>>> +    mgr->flags = flags;
>>> +    mgr->ops = ops;
>>> +
>>> +    memset(&mgr->kernel_alloc_node, 0, sizeof(struct drm_gpuva));
>>> +
>>> +    if (reserve_range) {
>>> +        mgr->kernel_alloc_node.va.addr = reserve_offset;
>>> +        mgr->kernel_alloc_node.va.range = reserve_range;
>>> +
>>> +        if (likely(!drm_gpuva_check_overflow(reserve_offset,
>>> +                             reserve_range)))
>>> +            __drm_gpuva_insert(mgr, &mgr->kernel_alloc_node);
>>> +    }
>>> +
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_manager_init);
>>> +
>>> +/**
>>> + * drm_gpuva_manager_destroy - cleanup a &drm_gpuva_manager
>>> + * @mgr: pointer to the &drm_gpuva_manager to clean up
>>> + *
>>> + * Note that it is a bug to call this function on a manager that still
>>> + * holds GPU VA mappings.
>>> + */
>>> +void
>>> +drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr)
>>> +{
>>> +    mgr->name = NULL;
>>> +
>>> +    if (mgr->kernel_alloc_node.va.range)
>>> +        __drm_gpuva_remove(&mgr->kernel_alloc_node);
>>> +
>>> +    WARN(!RB_EMPTY_ROOT(&mgr->rb.tree.rb_root),
>>> +         "GPUVA tree is not empty, potentially leaking memory.");
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_manager_destroy);
>>> +
>>> +static int
>>> +__drm_gpuva_insert(struct drm_gpuva_manager *mgr,
>>> +           struct drm_gpuva *va)
>>> +{
>>> +    struct rb_node *node;
>>> +    struct list_head *head;
>>> +
>>> +    if (drm_gpuva_it_iter_first(&mgr->rb.tree,
>>> +                    GPUVA_START(va),
>>> +                    GPUVA_LAST(va)))
>>> +        return -EEXIST;
>>> +
>>> +    va->mgr = mgr;
>>> +
>>> +    drm_gpuva_it_insert(va, &mgr->rb.tree);
>>> +
>>> +    node = rb_prev(&va->rb.node);
>>> +    if (node)
>>> +        head = &(to_drm_gpuva(node))->rb.entry;
>>> +    else
>>> +        head = &mgr->rb.list;
>>> +
>>> +    list_add(&va->rb.entry, head);
>>> +
>>> +    return 0;
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_insert - insert a &drm_gpuva
>>> + * @mgr: the &drm_gpuva_manager to insert the &drm_gpuva in
>>> + * @va: the &drm_gpuva to insert
>>> + *
>>> + * Insert a &drm_gpuva with a given address and range into a
>>> + * &drm_gpuva_manager.
>>> + *
>>> + * It is safe to use this function using the safe versions of
>>> iterating the GPU
>>> + * VA space, such as drm_gpuva_for_each_va_safe() and
>>> + * drm_gpuva_for_each_va_range_safe().
>>> + *
>>> + * Returns: 0 on success, negative error code on failure.
>>> + */
>>> +int
>>> +drm_gpuva_insert(struct drm_gpuva_manager *mgr,
>>> +         struct drm_gpuva *va)
>>> +{
>>> +    u64 addr = va->va.addr;
>>> +    u64 range = va->va.range;
>>> +
>>> +    if (unlikely(!drm_gpuva_range_valid(mgr, addr, range)))
>>> +        return -EINVAL;
>>> +
>>> +    return __drm_gpuva_insert(mgr, va);
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_insert);
>>> +
>>> +static void
>>> +__drm_gpuva_remove(struct drm_gpuva *va)
>>> +{
>>> +    drm_gpuva_it_remove(va, &va->mgr->rb.tree);
>>> +    list_del_init(&va->rb.entry);
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_remove - remove a &drm_gpuva
>>> + * @va: the &drm_gpuva to remove
>>> + *
>>> + * This removes the given &va from the underlaying tree.
>>> + *
>>> + * It is safe to use this function using the safe versions of
>>> iterating the GPU
>>> + * VA space, such as drm_gpuva_for_each_va_safe() and
>>> + * drm_gpuva_for_each_va_range_safe().
>>> + */
>>> +void
>>> +drm_gpuva_remove(struct drm_gpuva *va)
>>> +{
>>> +    struct drm_gpuva_manager *mgr = va->mgr;
>>> +
>>> +    if (unlikely(va == &mgr->kernel_alloc_node)) {
>>> +        WARN(1, "Can't destroy kernel reserved node.\n");
>>> +        return;
>>> +    }
>>> +
>>> +    __drm_gpuva_remove(va);
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_remove);
>>> +
>>> +/**
>>> + * drm_gpuva_link - link a &drm_gpuva
>>> + * @va: the &drm_gpuva to link
>>> + *
>>> + * This adds the given &va to the GPU VA list of the
>>> &drm_gem_object it is
>>> + * associated with.
>>> + *
>>> + * This function expects the caller to protect the GEM's GPUVA list
>>> against
>>> + * concurrent access using the GEMs dma_resv lock.
>>> + */
>>> +void
>>> +drm_gpuva_link(struct drm_gpuva *va)
>>> +{
>>> +    struct drm_gpuva_manager *mgr = va->mgr;
>>> +    struct drm_gem_object *obj = va->gem.obj;
>>> +
>>> +    if (unlikely(!obj))
>>> +        return;
>>> +
>>> +    if (drm_gpuva_manager_external_lock(mgr))
>>> +        drm_gpuva_manager_ext_assert_held(mgr);
>>> +    else
>>> +        dma_resv_assert_held(obj->resv);
>>> +
>>> +    list_add_tail(&va->gem.entry, &obj->gpuva.list);
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_link);
>>> +
>>> +/**
>>> + * drm_gpuva_unlink - unlink a &drm_gpuva
>>> + * @va: the &drm_gpuva to unlink
>>> + *
>>> + * This removes the given &va from the GPU VA list of the
>>> &drm_gem_object it is
>>> + * associated with.
>>> + *
>>> + * This function expects the caller to protect the GEM's GPUVA list
>>> against
>>> + * concurrent access using the GEMs dma_resv lock.
>>> + */
>>> +void
>>> +drm_gpuva_unlink(struct drm_gpuva *va)
>>> +{
>>> +    struct drm_gpuva_manager *mgr = va->mgr;
>>> +    struct drm_gem_object *obj = va->gem.obj;
>>> +
>>> +    if (unlikely(!obj))
>>> +        return;
>>> +
>>> +    if (drm_gpuva_manager_external_lock(mgr))
>>> +        drm_gpuva_manager_ext_assert_held(mgr);
>>> +    else
>>> +        dma_resv_assert_held(obj->resv);
>>> +
>>> +    list_del_init(&va->gem.entry);
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_unlink);
>>> +
>>> +/**
>>> + * drm_gpuva_find_first - find the first &drm_gpuva in the given range
>>> + * @mgr: the &drm_gpuva_manager to search in
>>> + * @addr: the &drm_gpuvas address
>>> + * @range: the &drm_gpuvas range
>>> + *
>>> + * Returns: the first &drm_gpuva within the given range
>>> + */
>>> +struct drm_gpuva *
>>> +drm_gpuva_find_first(struct drm_gpuva_manager *mgr,
>>> +             u64 addr, u64 range)
>>> +{
>>> +    u64 last = addr + range - 1;
>>> +
>>> +    return drm_gpuva_it_iter_first(&mgr->rb.tree, addr, last);
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_find_first);
>>> +
>>> +/**
>>> + * drm_gpuva_find - find a &drm_gpuva
>>> + * @mgr: the &drm_gpuva_manager to search in
>>> + * @addr: the &drm_gpuvas address
>>> + * @range: the &drm_gpuvas range
>>> + *
>>> + * Returns: the &drm_gpuva at a given &addr and with a given &range
>>> + */
>>> +struct drm_gpuva *
>>> +drm_gpuva_find(struct drm_gpuva_manager *mgr,
>>> +           u64 addr, u64 range)
>>> +{
>>> +    struct drm_gpuva *va;
>>> +
>>> +    va = drm_gpuva_find_first(mgr, addr, range);
>>> +    if (!va)
>>> +        goto out;
>>> +
>>> +    if (va->va.addr != addr ||
>>> +        va->va.range != range)
>>> +        goto out;
>>> +
>>> +    return va;
>>> +
>>> +out:
>>> +    return NULL;
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_find);
>>> +
>>> +/**
>>> + * drm_gpuva_find_prev - find the &drm_gpuva before the given address
>>> + * @mgr: the &drm_gpuva_manager to search in
>>> + * @start: the given GPU VA's start address
>>> + *
>>> + * Find the adjacent &drm_gpuva before the GPU VA with given &start
>>> address.
>>> + *
>>> + * Note that if there is any free space between the GPU VA mappings
>>> no mapping
>>> + * is returned.
>>> + *
>>> + * Returns: a pointer to the found &drm_gpuva or NULL if none was
>>> found
>>> + */
>>> +struct drm_gpuva *
>>> +drm_gpuva_find_prev(struct drm_gpuva_manager *mgr, u64 start)
>>> +{
>>> +    if (!drm_gpuva_range_valid(mgr, start - 1, 1))
>>> +        return NULL;
>>> +
>>> +    return drm_gpuva_it_iter_first(&mgr->rb.tree, start - 1, start);
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_find_prev);
>>> +
>>> +/**
>>> + * drm_gpuva_find_next - find the &drm_gpuva after the given address
>>> + * @mgr: the &drm_gpuva_manager to search in
>>> + * @end: the given GPU VA's end address
>>> + *
>>> + * Find the adjacent &drm_gpuva after the GPU VA with given &end
>>> address.
>>> + *
>>> + * Note that if there is any free space between the GPU VA mappings
>>> no mapping
>>> + * is returned.
>>> + *
>>> + * Returns: a pointer to the found &drm_gpuva or NULL if none was
>>> found
>>> + */
>>> +struct drm_gpuva *
>>> +drm_gpuva_find_next(struct drm_gpuva_manager *mgr, u64 end)
>>> +{
>>> +    if (!drm_gpuva_range_valid(mgr, end, 1))
>>> +        return NULL;
>>> +
>>> +    return drm_gpuva_it_iter_first(&mgr->rb.tree, end, end + 1);
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_find_next);
>>> +
>>> +/**
>>> + * drm_gpuva_interval_empty - indicate whether a given interval of
>>> the VA space
>>> + * is empty
>>> + * @mgr: the &drm_gpuva_manager to check the range for
>>> + * @addr: the start address of the range
>>> + * @range: the range of the interval
>>> + *
>>> + * Returns: true if the interval is empty, false otherwise
>>> + */
>>> +bool
>>> +drm_gpuva_interval_empty(struct drm_gpuva_manager *mgr, u64 addr,
>>> u64 range)
>>> +{
>>> +    return !drm_gpuva_find_first(mgr, addr, range);
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_interval_empty);
>>> +
>>> +/**
>>> + * drm_gpuva_map - helper to insert a &drm_gpuva according to a
>>> + * &drm_gpuva_op_map
>>> + * @mgr: the &drm_gpuva_manager
>>> + * @va: the &drm_gpuva to insert
>>> + * @op: the &drm_gpuva_op_map to initialize @va with
>>> + *
>>> + * Initializes the @va from the @op and inserts it into the given
>>> @mgr.
>>> + */
>>> +void
>>> +drm_gpuva_map(struct drm_gpuva_manager *mgr,
>>> +          struct drm_gpuva *va,
>>> +          struct drm_gpuva_op_map *op)
>>> +{
>>> +    drm_gpuva_init_from_op(va, op);
>>> +    drm_gpuva_insert(mgr, va);
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_map);
>>> +
>>> +/**
>>> + * drm_gpuva_remap - helper to remap a &drm_gpuva according to a
>>> + * &drm_gpuva_op_remap
>>> + * @prev: the &drm_gpuva to remap when keeping the start of a mapping
>>> + * @next: the &drm_gpuva to remap when keeping the end of a mapping
>>> + * @op: the &drm_gpuva_op_remap to initialize @prev and @next with
>>> + *
>>> + * Removes the currently mapped &drm_gpuva and remaps it using
>>> @prev and/or
>>> + * @next.
>>> + */
>>> +void
>>> +drm_gpuva_remap(struct drm_gpuva *prev,
>>> +        struct drm_gpuva *next,
>>> +        struct drm_gpuva_op_remap *op)
>>> +{
>>> +    struct drm_gpuva *curr = op->unmap->va;
>>> +    struct drm_gpuva_manager *mgr = curr->mgr;
>>> +    struct drm_gpuva_op_map *map;
>>> +
>>> +    drm_gpuva_remove(curr);
>>> +
>>> +    if ((map = op->prev)) {
>>> +        drm_gpuva_init_from_op(prev, map);
>>> +        drm_gpuva_insert(mgr, prev);
>>> +    }
>>> +
>>> +    if ((map = op->next)) {
>>> +        drm_gpuva_init_from_op(next, map);
>>> +        drm_gpuva_insert(mgr, next);
>>> +    }
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_remap);
>>> +
>>> +/**
>>> + * drm_gpuva_unmap - helper to remove a &drm_gpuva according to a
>>> + * &drm_gpuva_op_unmap
>>> + * @op: the &drm_gpuva_op_unmap specifying the &drm_gpuva to remove
>>> + *
>>> + * Removes the &drm_gpuva associated with the &drm_gpuva_op_unmap.
>>> + */
>>> +void
>>> +drm_gpuva_unmap(struct drm_gpuva_op_unmap *op)
>>> +{
>>> +    drm_gpuva_remove(op->va);
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_unmap);
>>> +
>>> +static int
>>> +op_map_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
>>> +      u64 addr, u64 range,
>>> +      struct drm_gem_object *obj, u64 offset)
>>> +{
>>> +    struct drm_gpuva_op op = {};
>>> +
>>> +    op.op = DRM_GPUVA_OP_MAP;
>>> +    op.map.va.addr = addr;
>>> +    op.map.va.range = range;
>>> +    op.map.gem.obj = obj;
>>> +    op.map.gem.offset = offset;
>>> +
>>> +    return fn->sm_step_map(&op, priv);
>>> +}
>>> +
>>> +static int
>>> +op_remap_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
>>> +        struct drm_gpuva_op_map *prev,
>>> +        struct drm_gpuva_op_map *next,
>>> +        struct drm_gpuva_op_unmap *unmap)
>>> +{
>>> +    struct drm_gpuva_op op = {};
>>> +    struct drm_gpuva_op_remap *r;
>>> +
>>> +    op.op = DRM_GPUVA_OP_REMAP;
>>> +    r = &op.remap;
>>> +    r->prev = prev;
>>> +    r->next = next;
>>> +    r->unmap = unmap;
>>> +
>>> +    return fn->sm_step_remap(&op, priv);
>>> +}
>>> +
>>> +static int
>>> +op_unmap_cb(const struct drm_gpuva_fn_ops *fn, void *priv,
>>> +        struct drm_gpuva *va, bool merge)
>>> +{
>>> +    struct drm_gpuva_op op = {};
>>> +
>>> +    op.op = DRM_GPUVA_OP_UNMAP;
>>> +    op.unmap.va = va;
>>> +    op.unmap.keep = merge;
>>> +
>>> +    return fn->sm_step_unmap(&op, priv);
>>> +}
>>> +
>>> +static int
>>> +__drm_gpuva_sm_map(struct drm_gpuva_manager *mgr,
>>> +           const struct drm_gpuva_fn_ops *ops, void *priv,
>>> +           u64 req_addr, u64 req_range,
>>> +           struct drm_gem_object *req_obj, u64 req_offset)
>>> +{
>>> +    struct drm_gpuva *va, *next, *prev = NULL;
>>> +    u64 req_end = req_addr + req_range;
>>> +    int ret;
>>> +
>>> +    if (unlikely(!drm_gpuva_range_valid(mgr, req_addr, req_range)))
>>> +        return -EINVAL;
>>> +
>>> +    drm_gpuva_for_each_va_range_safe(va, next, mgr, req_addr,
>>> req_end) {
>>> +        struct drm_gem_object *obj = va->gem.obj;
>>> +        u64 offset = va->gem.offset;
>>> +        u64 addr = va->va.addr;
>>> +        u64 range = va->va.range;
>>> +        u64 end = addr + range;
>>> +        bool merge = !!va->gem.obj;
>>> +
>>> +        if (addr == req_addr) {
>>> +            merge &= obj == req_obj &&
>>> +                 offset == req_offset;
>>> +
>>> +            if (end == req_end) {
>>> +                ret = op_unmap_cb(ops, priv, va, merge);
>>> +                if (ret)
>>> +                    return ret;
>>> +                break;
>>> +            }
>>> +
>>> +            if (end < req_end) {
>>> +                ret = op_unmap_cb(ops, priv, va, merge);
>>> +                if (ret)
>>> +                    return ret;
>>> +                goto next;
>>> +            }
>>> +
>>> +            if (end > req_end) {
>>> +                struct drm_gpuva_op_map n = {
>>> +                    .va.addr = req_end,
>>> +                    .va.range = range - req_range,
>>> +                    .gem.obj = obj,
>>> +                    .gem.offset = offset + req_range,
>>> +                };
>>> +                struct drm_gpuva_op_unmap u = {
>>> +                    .va = va,
>>> +                    .keep = merge,
>>> +                };
>>> +
>>> +                ret = op_remap_cb(ops, priv, NULL, &n, &u);
>>> +                if (ret)
>>> +                    return ret;
>>> +                break;
>>> +            }
>>> +        } else if (addr < req_addr) {
>>> +            u64 ls_range = req_addr - addr;
>>> +            struct drm_gpuva_op_map p = {
>>> +                .va.addr = addr,
>>> +                .va.range = ls_range,
>>> +                .gem.obj = obj,
>>> +                .gem.offset = offset,
>>> +            };
>>> +            struct drm_gpuva_op_unmap u = { .va = va };
>>> +
>>> +            merge &= obj == req_obj &&
>>> +                 offset + ls_range == req_offset;
>>> +            u.keep = merge;
>>> +
>>> +            if (end == req_end) {
>>> +                ret = op_remap_cb(ops, priv, &p, NULL, &u);
>>> +                if (ret)
>>> +                    return ret;
>>> +                break;
>>> +            }
>>> +
>>> +            if (end < req_end) {
>>> +                ret = op_remap_cb(ops, priv, &p, NULL, &u);
>>> +                if (ret)
>>> +                    return ret;
>>> +                goto next;
>>> +            }
>>> +
>>> +            if (end > req_end) {
>>> +                struct drm_gpuva_op_map n = {
>>> +                    .va.addr = req_end,
>>> +                    .va.range = end - req_end,
>>> +                    .gem.obj = obj,
>>> +                    .gem.offset = offset + ls_range +
>>> +                              req_range,
>>> +                };
>>> +
>>> +                ret = op_remap_cb(ops, priv, &p, &n, &u);
>>> +                if (ret)
>>> +                    return ret;
>>> +                break;
>>> +            }
>>> +        } else if (addr > req_addr) {
>>> +            merge &= obj == req_obj &&
>>> +                 offset == req_offset +
>>> +                       (addr - req_addr);
>>> +
>>> +            if (end == req_end) {
>>> +                ret = op_unmap_cb(ops, priv, va, merge);
>>> +                if (ret)
>>> +                    return ret;
>>> +                break;
>>> +            }
>>> +
>>> +            if (end < req_end) {
>>> +                ret = op_unmap_cb(ops, priv, va, merge);
>>> +                if (ret)
>>> +                    return ret;
>>> +                goto next;
>>> +            }
>>> +
>>> +            if (end > req_end) {
>>> +                struct drm_gpuva_op_map n = {
>>> +                    .va.addr = req_end,
>>> +                    .va.range = end - req_end,
>>> +                    .gem.obj = obj,
>>> +                    .gem.offset = offset + req_end - addr,
>>> +                };
>>> +                struct drm_gpuva_op_unmap u = {
>>> +                    .va = va,
>>> +                    .keep = merge,
>>> +                };
>>> +
>>> +                ret = op_remap_cb(ops, priv, NULL, &n, &u);
>>> +                if (ret)
>>> +                    return ret;
>>> +                break;
>>> +            }
>>> +        }
>>> +next:
>>> +        prev = va;
>>> +    }
>>> +
>>> +    return op_map_cb(ops, priv,
>>> +             req_addr, req_range,
>>> +             req_obj, req_offset);
>>> +}
>>> +
>>> +static int
>>> +__drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr,
>>> +             const struct drm_gpuva_fn_ops *ops, void *priv,
>>> +             u64 req_addr, u64 req_range)
>>> +{
>>> +    struct drm_gpuva *va, *next;
>>> +    u64 req_end = req_addr + req_range;
>>> +    int ret;
>>> +
>>> +    if (unlikely(!drm_gpuva_range_valid(mgr, req_addr, req_range)))
>>> +        return -EINVAL;
>>> +
>>> +    drm_gpuva_for_each_va_range_safe(va, next, mgr, req_addr,
>>> req_end) {
>>> +        struct drm_gpuva_op_map prev = {}, next = {};
>>> +        bool prev_split = false, next_split = false;
>>> +        struct drm_gem_object *obj = va->gem.obj;
>>> +        u64 offset = va->gem.offset;
>>> +        u64 addr = va->va.addr;
>>> +        u64 range = va->va.range;
>>> +        u64 end = addr + range;
>>> +
>>> +        if (addr < req_addr) {
>>> +            prev.va.addr = addr;
>>> +            prev.va.range = req_addr - addr;
>>> +            prev.gem.obj = obj;
>>> +            prev.gem.offset = offset;
>>> +
>>> +            prev_split = true;
>>> +        }
>>> +
>>> +        if (end > req_end) {
>>> +            next.va.addr = req_end;
>>> +            next.va.range = end - req_end;
>>> +            next.gem.obj = obj;
>>> +            next.gem.offset = offset + (req_end - addr);
>>> +
>>> +            next_split = true;
>>> +        }
>>> +
>>> +        if (prev_split || next_split) {
>>> +            struct drm_gpuva_op_unmap unmap = { .va = va };
>>> +
>>> +            ret = op_remap_cb(ops, priv,
>>> +                      prev_split ? &prev : NULL,
>>> +                      next_split ? &next : NULL,
>>> +                      &unmap);
>>> +            if (ret)
>>> +                return ret;
>>> +        } else {
>>> +            ret = op_unmap_cb(ops, priv, va, false);
>>> +            if (ret)
>>> +                return ret;
>>> +        }
>>> +    }
>>> +
>>> +    return 0;
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_sm_map - creates the &drm_gpuva_op split/merge steps
>>> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
>>> + * @req_addr: the start address of the new mapping
>>> + * @req_range: the range of the new mapping
>>> + * @req_obj: the &drm_gem_object to map
>>> + * @req_offset: the offset within the &drm_gem_object
>>> + * @priv: pointer to a driver private data structure
>>> + *
>>> + * This function iterates the given range of the GPU VA space. It
>>> utilizes the
>>> + * &drm_gpuva_fn_ops to call back into the driver providing the
>>> split and merge
>>> + * steps.
>>> + *
>>> + * Drivers may use these callbacks to update the GPU VA space right
>>> away within
>>> + * the callback. In case the driver decides to copy and store the
>>> operations for
>>> + * later processing neither this function nor &drm_gpuva_sm_unmap
>>> is allowed to
>>> + * be called before the &drm_gpuva_manager's view of the GPU VA
>>> space was
>>> + * updated with the previous set of operations. To update the
>>> + * &drm_gpuva_manager's view of the GPU VA space drm_gpuva_insert(),
>>> + * drm_gpuva_destroy_locked() and/or drm_gpuva_destroy_unlocked()
>>> should be
>>> + * used.
>>> + *
>>> + * A sequence of callbacks can contain map, unmap and remap
>>> operations, but
>>> + * the sequence of callbacks might also be empty if no operation is
>>> required,
>>> + * e.g. if the requested mapping already exists in the exact same way.
>>> + *
>>> + * There can be an arbitrary amount of unmap operations, a maximum
>>> of two remap
>>> + * operations and a single map operation. The latter one represents
>>> the original
>>> + * map operation requested by the caller.
>>> + *
>>> + * Returns: 0 on success or a negative error code
>>> + */
>>> +int
>>> +drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
>>> +         u64 req_addr, u64 req_range,
>>> +         struct drm_gem_object *req_obj, u64 req_offset)
>>> +{
>>> +    const struct drm_gpuva_fn_ops *ops = mgr->ops;
>>> +
>>> +    if (unlikely(!(ops && ops->sm_step_map &&
>>> +               ops->sm_step_remap &&
>>> +               ops->sm_step_unmap)))
>>> +        return -EINVAL;
>>> +
>>> +    return __drm_gpuva_sm_map(mgr, ops, priv,
>>> +                  req_addr, req_range,
>>> +                  req_obj, req_offset);
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_sm_map);
>>> +
>>> +/**
>>> + * drm_gpuva_sm_unmap - creates the &drm_gpuva_ops to split on unmap
>>> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
>>> + * @priv: pointer to a driver private data structure
>>> + * @req_addr: the start address of the range to unmap
>>> + * @req_range: the range of the mappings to unmap
>>> + *
>>> + * This function iterates the given range of the GPU VA space. It
>>> utilizes the
>>> + * &drm_gpuva_fn_ops to call back into the driver providing the
>>> operations to
>>> + * unmap and, if required, split existent mappings.
>>> + *
>>> + * Drivers may use these callbacks to update the GPU VA space right
>>> away within
>>> + * the callback. In case the driver decides to copy and store the
>>> operations for
>>> + * later processing neither this function nor &drm_gpuva_sm_map is
>>> allowed to be
>>> + * called before the &drm_gpuva_manager's view of the GPU VA space
>>> was updated
>>> + * with the previous set of operations. To update the
>>> &drm_gpuva_manager's view
>>> + * of the GPU VA space drm_gpuva_insert(),
>>> drm_gpuva_destroy_locked() and/or
>>> + * drm_gpuva_destroy_unlocked() should be used.
>>> + *
>>> + * A sequence of callbacks can contain unmap and remap operations,
>>> depending on
>>> + * whether there are actual overlapping mappings to split.
>>> + *
>>> + * There can be an arbitrary amount of unmap operations and a
>>> maximum of two
>>> + * remap operations.
>>> + *
>>> + * Returns: 0 on success or a negative error code
>>> + */
>>> +int
>>> +drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
>>> +           u64 req_addr, u64 req_range)
>>> +{
>>> +    const struct drm_gpuva_fn_ops *ops = mgr->ops;
>>> +
>>> +    if (unlikely(!(ops && ops->sm_step_remap &&
>>> +               ops->sm_step_unmap)))
>>> +        return -EINVAL;
>>> +
>>> +    return __drm_gpuva_sm_unmap(mgr, ops, priv,
>>> +                    req_addr, req_range);
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_sm_unmap);
>>> +
>>> +static struct drm_gpuva_op *
>>> +gpuva_op_alloc(struct drm_gpuva_manager *mgr)
>>> +{
>>> +    const struct drm_gpuva_fn_ops *fn = mgr->ops;
>>> +    struct drm_gpuva_op *op;
>>> +
>>> +    if (fn && fn->op_alloc)
>>> +        op = fn->op_alloc();
>>> +    else
>>> +        op = kzalloc(sizeof(*op), GFP_KERNEL);
>>> +
>>> +    if (unlikely(!op))
>>> +        return NULL;
>>> +
>>> +    return op;
>>> +}
>>> +
>>> +static void
>>> +gpuva_op_free(struct drm_gpuva_manager *mgr,
>>> +          struct drm_gpuva_op *op)
>>> +{
>>> +    const struct drm_gpuva_fn_ops *fn = mgr->ops;
>>> +
>>> +    if (fn && fn->op_free)
>>> +        fn->op_free(op);
>>> +    else
>>> +        kfree(op);
>>> +}
>>> +
>>> +static int
>>> +drm_gpuva_sm_step(struct drm_gpuva_op *__op,
>>> +          void *priv)
>>> +{
>>> +    struct {
>>> +        struct drm_gpuva_manager *mgr;
>>> +        struct drm_gpuva_ops *ops;
>>> +    } *args = priv;
>>> +    struct drm_gpuva_manager *mgr = args->mgr;
>>> +    struct drm_gpuva_ops *ops = args->ops;
>>> +    struct drm_gpuva_op *op;
>>> +
>>> +    op = gpuva_op_alloc(mgr);
>>> +    if (unlikely(!op))
>>> +        goto err;
>>> +
>>> +    memcpy(op, __op, sizeof(*op));
>>> +
>>> +    if (op->op == DRM_GPUVA_OP_REMAP) {
>>> +        struct drm_gpuva_op_remap *__r = &__op->remap;
>>> +        struct drm_gpuva_op_remap *r = &op->remap;
>>> +
>>> +        r->unmap = kmemdup(__r->unmap, sizeof(*r->unmap),
>>> +                   GFP_KERNEL);
>>> +        if (unlikely(!r->unmap))
>>> +            goto err_free_op;
>>> +
>>> +        if (__r->prev) {
>>> +            r->prev = kmemdup(__r->prev, sizeof(*r->prev),
>>> +                      GFP_KERNEL);
>>> +            if (unlikely(!r->prev))
>>> +                goto err_free_unmap;
>>> +        }
>>> +
>>> +        if (__r->next) {
>>> +            r->next = kmemdup(__r->next, sizeof(*r->next),
>>> +                      GFP_KERNEL);
>>> +            if (unlikely(!r->next))
>>> +                goto err_free_prev;
>>> +        }
>>> +    }
>>> +
>>> +    list_add_tail(&op->entry, &ops->list);
>>> +
>>> +    return 0;
>>> +
>>> +err_free_unmap:
>>> +    kfree(op->remap.unmap);
>>> +err_free_prev:
>>> +    kfree(op->remap.prev);
>>> +err_free_op:
>>> +    gpuva_op_free(mgr, op);
>>> +err:
>>> +    return -ENOMEM;
>>> +}
>>> +
>>> +static const struct drm_gpuva_fn_ops gpuva_list_ops = {
>>> +    .sm_step_map = drm_gpuva_sm_step,
>>> +    .sm_step_remap = drm_gpuva_sm_step,
>>> +    .sm_step_unmap = drm_gpuva_sm_step,
>>> +};
>>> +
>>> +/**
>>> + * drm_gpuva_sm_map_ops_create - creates the &drm_gpuva_ops to
>>> split and merge
>>> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
>>> + * @req_addr: the start address of the new mapping
>>> + * @req_range: the range of the new mapping
>>> + * @req_obj: the &drm_gem_object to map
>>> + * @req_offset: the offset within the &drm_gem_object
>>> + *
>>> + * This function creates a list of operations to perform splitting
>>> and merging
>>> + * of existent mapping(s) with the newly requested one.
>>> + *
>>> + * The list can be iterated with &drm_gpuva_for_each_op and must be
>>> processed
>>> + * in the given order. It can contain map, unmap and remap
>>> operations, but it
>>> + * also can be empty if no operation is required, e.g. if the
>>> requested mapping
>>> + * already exists is the exact same way.
>>> + *
>>> + * There can be an arbitrary amount of unmap operations, a maximum
>>> of two remap
>>> + * operations and a single map operation. The latter one represents
>>> the original
>>> + * map operation requested by the caller.
>>> + *
>>> + * Note that before calling this function again with another
>>> mapping request it
>>> + * is necessary to update the &drm_gpuva_manager's view of the GPU
>>> VA space. The
>>> + * previously obtained operations must be either processed or
>>> abandoned. To
>>> + * update the &drm_gpuva_manager's view of the GPU VA space
>>> drm_gpuva_insert(),
>>> + * drm_gpuva_destroy_locked() and/or drm_gpuva_destroy_unlocked()
>>> should be
>>> + * used.
>>> + *
>>> + * After the caller finished processing the returned
>>> &drm_gpuva_ops, they must
>>> + * be freed with &drm_gpuva_ops_free.
>>> + *
>>> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR
>>> on failure
>>> + */
>>> +struct drm_gpuva_ops *
>>> +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr,
>>> +                u64 req_addr, u64 req_range,
>>> +                struct drm_gem_object *req_obj, u64 req_offset)
>>> +{
>>> +    struct drm_gpuva_ops *ops;
>>> +    struct {
>>> +        struct drm_gpuva_manager *mgr;
>>> +        struct drm_gpuva_ops *ops;
>>> +    } args;
>>> +    int ret;
>>> +
>>> +    ops = kzalloc(sizeof(*ops), GFP_KERNEL);
>>> +    if (unlikely(!ops))
>>> +        return ERR_PTR(-ENOMEM);
>>> +
>>> +    INIT_LIST_HEAD(&ops->list);
>>> +
>>> +    args.mgr = mgr;
>>> +    args.ops = ops;
>>> +
>>> +    ret = __drm_gpuva_sm_map(mgr, &gpuva_list_ops, &args,
>>> +                 req_addr, req_range,
>>> +                 req_obj, req_offset);
>>> +    if (ret)
>>> +        goto err_free_ops;
>>> +
>>> +    return ops;
>>> +
>>> +err_free_ops:
>>> +    drm_gpuva_ops_free(mgr, ops);
>>> +    return ERR_PTR(ret);
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_sm_map_ops_create);
>>> +
>>> +/**
>>> + * drm_gpuva_sm_unmap_ops_create - creates the &drm_gpuva_ops to
>>> split on unmap
>>> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
>>> + * @req_addr: the start address of the range to unmap
>>> + * @req_range: the range of the mappings to unmap
>>> + *
>>> + * This function creates a list of operations to perform unmapping
>>> and, if
>>> + * required, splitting of the mappings overlapping the unmap range.
>>> + *
>>> + * The list can be iterated with &drm_gpuva_for_each_op and must be
>>> processed
>>> + * in the given order. It can contain unmap and remap operations,
>>> depending on
>>> + * whether there are actual overlapping mappings to split.
>>> + *
>>> + * There can be an arbitrary amount of unmap operations and a
>>> maximum of two
>>> + * remap operations.
>>> + *
>>> + * Note that before calling this function again with another range
>>> to unmap it
>>> + * is necessary to update the &drm_gpuva_manager's view of the GPU
>>> VA space. The
>>> + * previously obtained operations must be processed or abandoned.
>>> To update the
>>> + * &drm_gpuva_manager's view of the GPU VA space drm_gpuva_insert(),
>>> + * drm_gpuva_destroy_locked() and/or drm_gpuva_destroy_unlocked()
>>> should be
>>> + * used.
>>> + *
>>> + * After the caller finished processing the returned
>>> &drm_gpuva_ops, they must
>>> + * be freed with &drm_gpuva_ops_free.
>>> + *
>>> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR
>>> on failure
>>> + */
>>> +struct drm_gpuva_ops *
>>> +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr,
>>> +                  u64 req_addr, u64 req_range)
>>> +{
>>> +    struct drm_gpuva_ops *ops;
>>> +    struct {
>>> +        struct drm_gpuva_manager *mgr;
>>> +        struct drm_gpuva_ops *ops;
>>> +    } args;
>>> +    int ret;
>>> +
>>> +    ops = kzalloc(sizeof(*ops), GFP_KERNEL);
>>> +    if (unlikely(!ops))
>>> +        return ERR_PTR(-ENOMEM);
>>> +
>>> +    INIT_LIST_HEAD(&ops->list);
>>> +
>>> +    args.mgr = mgr;
>>> +    args.ops = ops;
>>> +
>>> +    ret = __drm_gpuva_sm_unmap(mgr, &gpuva_list_ops, &args,
>>> +                   req_addr, req_range);
>>> +    if (ret)
>>> +        goto err_free_ops;
>>> +
>>> +    return ops;
>>> +
>>> +err_free_ops:
>>> +    drm_gpuva_ops_free(mgr, ops);
>>> +    return ERR_PTR(ret);
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_sm_unmap_ops_create);
>>> +
>>> +/**
>>> + * drm_gpuva_prefetch_ops_create - creates the &drm_gpuva_ops to
>>> prefetch
>>> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
>>> + * @addr: the start address of the range to prefetch
>>> + * @range: the range of the mappings to prefetch
>>> + *
>>> + * This function creates a list of operations to perform prefetching.
>>> + *
>>> + * The list can be iterated with &drm_gpuva_for_each_op and must be
>>> processed
>>> + * in the given order. It can contain prefetch operations.
>>> + *
>>> + * There can be an arbitrary amount of prefetch operations.
>>> + *
>>> + * After the caller finished processing the returned
>>> &drm_gpuva_ops, they must
>>> + * be freed with &drm_gpuva_ops_free.
>>> + *
>>> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR
>>> on failure
>>> + */
>>> +struct drm_gpuva_ops *
>>> +drm_gpuva_prefetch_ops_create(struct drm_gpuva_manager *mgr,
>>> +                  u64 addr, u64 range)
>>> +{
>>> +    struct drm_gpuva_ops *ops;
>>> +    struct drm_gpuva_op *op;
>>> +    struct drm_gpuva *va;
>>> +    u64 end = addr + range;
>>> +    int ret;
>>> +
>>> +    ops = kzalloc(sizeof(*ops), GFP_KERNEL);
>>> +    if (!ops)
>>> +        return ERR_PTR(-ENOMEM);
>>> +
>>> +    INIT_LIST_HEAD(&ops->list);
>>> +
>>> +    drm_gpuva_for_each_va_range(va, mgr, addr, end) {
>>> +        op = gpuva_op_alloc(mgr);
>>> +        if (!op) {
>>> +            ret = -ENOMEM;
>>> +            goto err_free_ops;
>>> +        }
>>> +
>>> +        op->op = DRM_GPUVA_OP_PREFETCH;
>>> +        op->prefetch.va = va;
>>> +        list_add_tail(&op->entry, &ops->list);
>>> +    }
>>> +
>>> +    return ops;
>>> +
>>> +err_free_ops:
>>> +    drm_gpuva_ops_free(mgr, ops);
>>> +    return ERR_PTR(ret);
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_prefetch_ops_create);
>>> +
>>> +/**
>>> + * drm_gpuva_gem_unmap_ops_create - creates the &drm_gpuva_ops to
>>> unmap a GEM
>>> + * @mgr: the &drm_gpuva_manager representing the GPU VA space
>>> + * @obj: the &drm_gem_object to unmap
>>> + *
>>> + * This function creates a list of operations to perform unmapping
>>> for every
>>> + * GPUVA attached to a GEM.
>>> + *
>>> + * The list can be iterated with &drm_gpuva_for_each_op and
>>> consists out of an
>>> + * arbitrary amount of unmap operations.
>>> + *
>>> + * After the caller finished processing the returned
>>> &drm_gpuva_ops, they must
>>> + * be freed with &drm_gpuva_ops_free.
>>> + *
>>> + * It is the callers responsibility to protect the GEMs GPUVA list
>>> against
>>> + * concurrent access using the GEMs dma_resv lock.
>>> + *
>>> + * Returns: a pointer to the &drm_gpuva_ops on success, an ERR_PTR
>>> on failure
>>> + */
>>> +struct drm_gpuva_ops *
>>> +drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>>> +                   struct drm_gem_object *obj)
>>> +{
>>> +    struct drm_gpuva_ops *ops;
>>> +    struct drm_gpuva_op *op;
>>> +    struct drm_gpuva *va;
>>> +    int ret;
>>> +
>>> +    if (drm_gpuva_manager_external_lock(mgr))
>>> +        drm_gpuva_manager_ext_assert_held(mgr);
>>> +    else
>>> +        dma_resv_assert_held(obj->resv);
>>> +
>>> +    ops = kzalloc(sizeof(*ops), GFP_KERNEL);
>>> +    if (!ops)
>>> +        return ERR_PTR(-ENOMEM);
>>> +
>>> +    INIT_LIST_HEAD(&ops->list);
>>> +
>>> +    drm_gem_for_each_gpuva(va, obj) {
>>> +        op = gpuva_op_alloc(mgr);
>>> +        if (!op) {
>>> +            ret = -ENOMEM;
>>> +            goto err_free_ops;
>>> +        }
>>> +
>>> +        op->op = DRM_GPUVA_OP_UNMAP;
>>> +        op->unmap.va = va;
>>> +        list_add_tail(&op->entry, &ops->list);
>>> +    }
>>> +
>>> +    return ops;
>>> +
>>> +err_free_ops:
>>> +    drm_gpuva_ops_free(mgr, ops);
>>> +    return ERR_PTR(ret);
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_gem_unmap_ops_create);
>>> +
>>> +
>>> +/**
>>> + * drm_gpuva_ops_free - free the given &drm_gpuva_ops
>>> + * @mgr: the &drm_gpuva_manager the ops were created for
>>> + * @ops: the &drm_gpuva_ops to free
>>> + *
>>> + * Frees the given &drm_gpuva_ops structure including all the ops
>>> associated
>>> + * with it.
>>> + */
>>> +void
>>> +drm_gpuva_ops_free(struct drm_gpuva_manager *mgr,
>>> +           struct drm_gpuva_ops *ops)
>>> +{
>>> +    struct drm_gpuva_op *op, *next;
>>> +
>>> +    drm_gpuva_for_each_op_safe(op, next, ops) {
>>> +        list_del(&op->entry);
>>> +
>>> +        if (op->op == DRM_GPUVA_OP_REMAP) {
>>> +            kfree(op->remap.prev);
>>> +            kfree(op->remap.next);
>>> +            kfree(op->remap.unmap);
>>> +        }
>>> +
>>> +        gpuva_op_free(mgr, op);
>>> +    }
>>> +
>>> +    kfree(ops);
>>> +}
>>> +EXPORT_SYMBOL(drm_gpuva_ops_free);
>>> diff --git a/include/drm/drm_drv.h b/include/drm/drm_drv.h
>>> index 89e2706cac56..04dbe223b1a5 100644
>>> --- a/include/drm/drm_drv.h
>>> +++ b/include/drm/drm_drv.h
>>> @@ -104,6 +104,12 @@ enum drm_driver_feature {
>>>        * acceleration should be handled by two drivers that are
>>> connected using auxiliary bus.
>>>        */
>>>       DRIVER_COMPUTE_ACCEL            = BIT(7),
>>> +    /**
>>> +     * @DRIVER_GEM_GPUVA:
>>> +     *
>>> +     * Driver supports user defined GPU VA bindings for GEM objects.
>>> +     */
>>> +    DRIVER_GEM_GPUVA        = BIT(8),
>>>       /* IMPORTANT: Below are all the legacy flags, add new ones
>>> above. */
>>> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
>>> index bbc721870c13..5ec8148a30ee 100644
>>> --- a/include/drm/drm_gem.h
>>> +++ b/include/drm/drm_gem.h
>>> @@ -36,6 +36,8 @@
>>>   #include <linux/kref.h>
>>>   #include <linux/dma-resv.h>
>>> +#include <linux/list.h>
>>> +#include <linux/mutex.h>
>>>   #include <drm/drm_vma_manager.h>
>>> @@ -379,6 +381,18 @@ struct drm_gem_object {
>>>        */
>>>       struct dma_resv _resv;
>>> +    /**
>>> +     * @gpuva:
>>> +     *
>>> +     * Provides the list of GPU VAs attached to this GEM object.
>>> +     *
>>> +     * Drivers should lock list accesses with the GEMs &dma_resv lock
>>> +     * (&drm_gem_object.resv).
>>> +     */
>>> +    struct {
>>> +        struct list_head list;
>>> +    } gpuva;
>>> +
>>>       /**
>>>        * @funcs:
>>>        *
>>> @@ -526,4 +540,42 @@ unsigned long drm_gem_lru_scan(struct
>>> drm_gem_lru *lru,
>>>   int drm_gem_evict(struct drm_gem_object *obj);
>>> +/**
>>> + * drm_gem_gpuva_init - initialize the gpuva list of a GEM object
>>> + * @obj: the &drm_gem_object
>>> + *
>>> + * This initializes the &drm_gem_object's &drm_gpuva list.
>>> + *
>>> + * Calling this function is only necessary for drivers intending to
>>> support the
>>> + * &drm_driver_feature DRIVER_GEM_GPUVA.
>>> + */
>>> +static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
>>> +{
>>> +    INIT_LIST_HEAD(&obj->gpuva.list);
>>> +}
>>> +
>>> +/**
>>> + * drm_gem_for_each_gpuva - iternator to walk over a list of gpuvas
>>
>> s/iternator/iterator/. (multiple places) Although since "iterator"
>> typically refers to an object being iterated over, perhaps
>>
>> "drm_gem_for_each_gpuva() - iterate over a list of gpuvas", (general
>> comment across the patch).
>>
>>> + * @entry: &drm_gpuva structure to assign to in each iteration step
>>> + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated
>>> with
>>> + *
>>> + * This iterator walks over all &drm_gpuva structures associated
>>> with the
>>> + * &drm_gpuva_manager.
>>> + */
>>> +#define drm_gem_for_each_gpuva(entry__, obj__) \
>>> +    list_for_each_entry(entry__, &(obj__)->gpuva.list, gem.entry)
>>> +
>>> +/**
>>> + * drm_gem_for_each_gpuva_safe - iternator to safely walk over a
>>> list of gpuvas
>>> + * @entry: &drm_gpuva structure to assign to in each iteration step
>>> + * @next: &next &drm_gpuva to store the next step
>>> + * @obj: the &drm_gem_object the &drm_gpuvas to walk are associated
>>> with
>>> + *
>>> + * This iterator walks over all &drm_gpuva structures associated
>>> with the
>>> + * &drm_gem_object. It is implemented with
>>> list_for_each_entry_safe(), hence
>>> + * it is save against removal of elements.
>>> + */
>>> +#define drm_gem_for_each_gpuva_safe(entry__, next__, obj__) \
>>> +    list_for_each_entry_safe(entry__, next__, &(obj__)->gpuva.list,
>>> gem.entry)
>>> +
>>>   #endif /* __DRM_GEM_H__ */
>>> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
>>> new file mode 100644
>>> index 000000000000..4f23aaf726dd
>>> --- /dev/null
>>> +++ b/include/drm/drm_gpuva_mgr.h
>>> @@ -0,0 +1,756 @@
>>> +/* SPDX-License-Identifier: GPL-2.0 */
>>> +
>>> +#ifndef __DRM_GPUVA_MGR_H__
>>> +#define __DRM_GPUVA_MGR_H__
>>> +
>>> +/*
>>> + * Copyright (c) 2022 Red Hat.
>>> + *
>>> + * Permission is hereby granted, free of charge, to any person
>>> obtaining a
>>> + * copy of this software and associated documentation files (the
>>> "Software"),
>>> + * to deal in the Software without restriction, including without
>>> limitation
>>> + * the rights to use, copy, modify, merge, publish, distribute,
>>> sublicense,
>>> + * and/or sell copies of the Software, and to permit persons to
>>> whom the
>>> + * Software is furnished to do so, subject to the following
>>> conditions:
>>> + *
>>> + * The above copyright notice and this permission notice shall be
>>> included in
>>> + * all copies or substantial portions of the Software.
>>> + *
>>> + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
>>> EXPRESS OR
>>> + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
>>> MERCHANTABILITY,
>>> + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.  IN NO
>>> EVENT SHALL
>>> + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM,
>>> DAMAGES OR
>>> + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR
>>> OTHERWISE,
>>> + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE
>>> USE OR
>>> + * OTHER DEALINGS IN THE SOFTWARE.
>>> + */
>>> +
>>> +#include <linux/list.h>
>>> +#include <linux/rbtree.h>
>>> +#include <linux/types.h>
>>> +
>>> +#include <drm/drm_gem.h>
>>> +
>>> +struct drm_gpuva_manager;
>>> +struct drm_gpuva_fn_ops;
>>> +
>>> +/**
>>> + * enum drm_gpuva_flags - flags for struct drm_gpuva
>>> + */
>>> +enum drm_gpuva_flags {
>>> +    /**
>>> +     * @DRM_GPUVA_INVALIDATED:
>>> +     *
>>> +     * Flag indicating that the &drm_gpuva's backing GEM is
>>> invalidated.
>>> +     */
>>> +    DRM_GPUVA_INVALIDATED = (1 << 0),
>>> +
>>> +    /**
>>> +     * @DRM_GPUVA_SPARSE:
>>> +     *
>>> +     * Flag indicating that the &drm_gpuva is a sparse mapping.
>>> +     */
>>> +    DRM_GPUVA_SPARSE = (1 << 1),
>>> +
>>> +    /**
>>> +     * @DRM_GPUVA_USERBITS: user defined bits
>>> +     */
>>> +    DRM_GPUVA_USERBITS = (1 << 2),
>>> +};
>>> +
>>> +/**
>>> + * struct drm_gpuva - structure to track a GPU VA mapping
>>> + *
>>> + * This structure represents a GPU VA mapping and is associated with a
>>> + * &drm_gpuva_manager.
>>> + *
>>> + * Typically, this structure is embedded in bigger driver structures.
>>> + */
>>> +struct drm_gpuva {
>>> +    /**
>>> +     * @mgr: the &drm_gpuva_manager this object is associated with
>>> +     */
>>> +    struct drm_gpuva_manager *mgr;
>>> +
>>> +    /**
>>> +     * @flags: the &drm_gpuva_flags for this mapping
>>> +     */
>>> +    enum drm_gpuva_flags flags;
>>> +
>>> +    /**
>>> +     * @va: structure containing the address and range of the
>>> &drm_gpuva
>>> +     */
>>> +    struct {
>>> +        /**
>>> +         * @addr: the start address
>>> +         */
>>> +        u64 addr;
>>> +
>>> +        /*
>>> +         * @range: the range
>>> +         */
>>> +        u64 range;
>>> +    } va;
>>> +
>>> +    /**
>>> +     * @gem: structure containing the &drm_gem_object and it's offset
>>> +     */
>>> +    struct {
>>> +        /**
>>> +         * @offset: the offset within the &drm_gem_object
>>> +         */
>>> +        u64 offset;
>>> +
>>> +        /**
>>> +         * @obj: the mapped &drm_gem_object
>>> +         */
>>> +        struct drm_gem_object *obj;
>>> +
>>> +        /**
>>> +         * @entry: the &list_head to attach this object to a
>>> &drm_gem_object
>>> +         */
>>> +        struct list_head entry;
>>> +    } gem;
>>> +
>>> +    /**
>>> +     * @rb: structure containing data to store &drm_gpuvas in a
>>> rb-tree
>>> +     */
>>> +    struct {
>>> +        /**
>>> +         * @rb: the rb-tree node
>>> +         */
>>> +        struct rb_node node;
>>> +
>>> +        /**
>>> +         * @entry: The &list_head to additionally connect &drm_gpuvas
>>> +         * in the same order they appear in the interval tree. This is
>>> +         * useful to keep iterating &drm_gpuvas from a start node
>>> found
>>> +         * through the rb-tree while doing modifications on the
>>> rb-tree
>>> +         * itself.
>>> +         */
>>> +        struct list_head entry;
>>> +
>>> +        /**
>>> +         * @__subtree_last: needed by the interval tree, holding
>>> last-in-subtree
>>> +         */
>>> +        u64 __subtree_last;
>>> +    } rb;
>>> +};
>>> +
>>> +int drm_gpuva_insert(struct drm_gpuva_manager *mgr, struct
>>> drm_gpuva *va);
>>> +void drm_gpuva_remove(struct drm_gpuva *va);
>>> +
>>> +void drm_gpuva_link(struct drm_gpuva *va);
>>> +void drm_gpuva_unlink(struct drm_gpuva *va);
>>> +
>>> +struct drm_gpuva *drm_gpuva_find(struct drm_gpuva_manager *mgr,
>>> +                 u64 addr, u64 range);
>>> +struct drm_gpuva *drm_gpuva_find_first(struct drm_gpuva_manager *mgr,
>>> +                       u64 addr, u64 range);
>>> +struct drm_gpuva *drm_gpuva_find_prev(struct drm_gpuva_manager
>>> *mgr, u64 start);
>>> +struct drm_gpuva *drm_gpuva_find_next(struct drm_gpuva_manager
>>> *mgr, u64 end);
>>> +
>>> +bool drm_gpuva_interval_empty(struct drm_gpuva_manager *mgr, u64
>>> addr, u64 range);
>>> +
>>> +static inline void drm_gpuva_init(struct drm_gpuva *va, u64 addr,
>>> u64 range,
>>> +                  struct drm_gem_object *obj, u64 offset)
>>> +{
>>> +    va->va.addr = addr;
>>> +    va->va.range = range;
>>> +    va->gem.obj = obj;
>>> +    va->gem.offset = offset;
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_invalidate - sets whether the backing GEM of this
>>> &drm_gpuva is
>>> + * invalidated
>>> + * @va: the &drm_gpuva to set the invalidate flag for
>>> + * @invalidate: indicates whether the &drm_gpuva is invalidated
>>> + */
>>> +static inline void drm_gpuva_invalidate(struct drm_gpuva *va, bool
>>> invalidate)
>>> +{
>>> +    if (invalidate)
>>> +        va->flags |= DRM_GPUVA_INVALIDATED;
>>> +    else
>>> +        va->flags &= ~DRM_GPUVA_INVALIDATED;
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_invalidated - indicates whether the backing BO of this
>>> &drm_gpuva
>>> + * is invalidated
>>> + * @va: the &drm_gpuva to check
>>> + */
>>> +static inline bool drm_gpuva_invalidated(struct drm_gpuva *va)
>>> +{
>>> +    return va->flags & DRM_GPUVA_INVALIDATED;
>>> +}
>>> +
>>> +#ifdef CONFIG_LOCKDEP
>>> +typedef struct lockdep_map *lockdep_map_p;
>>> +#define drm_gpuva_manager_ext_assert_held(mgr)        \
>>> +    lockdep_assert(lock_is_held((mgr)->ext_lock) !=
>>> LOCK_STATE_NOT_HELD)
>>> +/**
>>> + * drm_gpuva_manager_set_ext_lock - set the external lock according to
>>> + * @DRM_GPUVA_MANAGER_LOCK_EXTERN
>>> + * @mgr: the &drm_gpuva_manager to set the lock for
>>> + * @lock: the lock to set
>>> + *
>>> + * If @DRM_GPUVA_MANAGER_LOCK_EXTERN is set, drivers need to call
>>> this function
>>> + * to provide the lock used to lock linking and unlinking of
>>> &drm_gpuvas to the
>>> + * &drm_gem_objects GPUVA list.
>>> + */
>>> +#define drm_gpuva_manager_set_ext_lock(mgr, lock)    \
>>> +    (mgr)->ext_lock = &(lock)->dep_map
>>> +#else
>>> +typedef struct { /* nothing */ } lockdep_map_p;
>>> +#define drm_gpuva_manager_ext_assert_held(mgr)        do {
>>> (void)(mgr); } while (0)
>>> +#define drm_gpuva_manager_set_ext_lock(mgr, lock)    do { } while (0)
>>> +#endif
>>> +
>>> +/**
>>> + * enum drm_gpuva_manager_flags - the feature flags for the
>>> &drm_gpuva_manager
>>> + */
>>> +enum drm_gpuva_manager_flags {
>>> +    /**
>>> +     * @DRM_GPUVA_MANAGER_LOCK_EXTERN:
>>> +     *
>>> +     * Indicates the driver has it's own external lock for linking and
>>> +     * unlinking &drm_gpuvas to the &drm_gem_objects GPUVA list.
>>> +     *
>>> +     * When setting this flag it is rquired to set a lock via
>>> +     * drm_gpuva_set_ext_lock().
>>> +     */
>>> +    DRM_GPUVA_MANAGER_LOCK_EXTERN = (1 << 0),
>>> +};
>>> +
>>> +/**
>>> + * struct drm_gpuva_manager - DRM GPU VA Manager
>>> + *
>>> + * The DRM GPU VA Manager keeps track of a GPU's virtual address
>>> space by using
>>> + * &maple_tree structures. Typically, this structure is embedded in
>>> bigger
>>> + * driver structures.
>>> + *
>>> + * Drivers can pass addresses and ranges in an arbitrary unit, e.g.
>>> bytes or
>>> + * pages.
>>> + *
>>> + * There should be one manager instance per GPU virtual address space.
>>> + */
>>> +struct drm_gpuva_manager {
>>> +    /**
>>> +     * @name: the name of the DRM GPU VA space
>>> +     */
>>> +    const char *name;
>>> +
>>> +    /**
>>> +     * @flags: the feature flags of the &drm_gpuva_manager
>>> +     */
>>> +    enum drm_gpuva_manager_flags flags;
>>> +
>>> +    /**
>>> +     * @mm_start: start of the VA space
>>> +     */
>>> +    u64 mm_start;
>>> +
>>> +    /**
>>> +     * @mm_range: length of the VA space
>>> +     */
>>> +    u64 mm_range;
>>> +
>>> +    /**
>>> +     * @rb: structures to track &drm_gpuva entries
>>> +     */
>>> +    struct {
>>> +        /**
>>> +         * @tree: the rb-tree to track GPU VA mappings
>>> +         */
>>> +        struct rb_root_cached tree;
>>> +
>>> +        /**
>>> +         * @list: the &list_head to track GPU VA mappings
>>> +         */
>>> +        struct list_head list;
>>> +    } rb;
>>> +
>>> +    /**
>>> +     * @kernel_alloc_node:
>>> +     *
>>> +     * &drm_gpuva representing the address space cutout reserved for
>>> +     * the kernel
>>> +     */
>>> +    struct drm_gpuva kernel_alloc_node;
>>> +
>>> +    /**
>>> +     * @ops: &drm_gpuva_fn_ops providing the split/merge steps to
>>> drivers
>>> +     */
>>> +    const struct drm_gpuva_fn_ops *ops;
>>> +
>>> +    /**
>>> +     * @ext_lock: &lockdep_map according to
>>> @DRM_GPUVA_MANAGER_LOCK_EXTERN
>>> +     */
>>> +    lockdep_map_p ext_lock;
>>> +};
>>> +
>>> +void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
>>> +                const char *name,
>>> +                u64 start_offset, u64 range,
>>> +                u64 reserve_offset, u64 reserve_range,
>>> +                const struct drm_gpuva_fn_ops *ops,
>>> +                enum drm_gpuva_manager_flags flags);
>>> +void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>>> +
>>> +/**
>>> + * drm_gpuva_manager_external_lock - indicates whether the
>>> + * @DRM_GPUVA_MANAGER_LOCK_EXTERN flag is set
>>> + * @mgr: the &drm_gpuva_manager to check the flag for
>>> + */
>>> +static inline bool drm_gpuva_manager_external_lock(struct
>>> drm_gpuva_manager *mgr)
>>> +{
>>> +    return mgr->flags & DRM_GPUVA_MANAGER_LOCK_EXTERN;
>>> +}
>>> +
>>> +/**
>>> + * drm_gpuva_for_each_va_range - iternator to walk over a range of
>>> &drm_gpuvas
>>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>>> + * @mgr__: &drm_gpuva_manager to walk over
>>> + * @start__: starting offset, the first gpuva will overlap this
>>> + * @end__: ending offset, the last gpuva will start before this
>>> (but may
>>> + * overlap)
>>> + *
>>> + * This iterator walks over all &drm_gpuvas in the
>>> &drm_gpuva_manager that lie
>>> + * between @start__ and @end__. It is implemented similarly to
>>> list_for_each(),
>>> + * but is using the &drm_gpuva_manager's internal interval tree to
>>> accelerate
>>> + * the search for the starting &drm_gpuva, and hence isn't safe
>>> against removal
>>> + * of elements. It assumes that @end__ is within (or is the upper
>>> limit of) the
>>> + * &drm_gpuva_manager. This iterator does not skip over the
>>> &drm_gpuva_manager's
>>> + * @kernel_alloc_node.
>>> + */
>>> +#define drm_gpuva_for_each_va_range(va__, mgr__, start__, end__) \
>>> +    for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__)); \
>>> +         va__ && (va__->va.addr < (end__)) && \
>>> +         !list_entry_is_head(va__, &(mgr__)->rb.list, rb.entry); \
>>> +         va__ = list_next_entry(va__, rb.entry))
>>> +
>>> +/**
>>> + * drm_gpuva_for_each_va_range_safe - iternator to safely walk over
>>> a range of
>>> + * &drm_gpuvas
>>> + * @va__: &drm_gpuva to assign to in each iteration step
>>> + * @next__: another &drm_gpuva to use as temporary storage
>>> + * @mgr__: &drm_gpuva_manager to walk over
>>> + * @start__: starting offset, the first gpuva will overlap this
>>> + * @end__: ending offset, the last gpuva will start before this
>>> (but may
>>> + * overlap)
>>> + *
>>> + * This iterator walks over all &drm_gpuvas in the
>>> &drm_gpuva_manager that lie
>>> + * between @start__ and @end__. It is implemented similarly to
>>> + * list_for_each_safe(), but is using the &drm_gpuva_manager's
>>> internal interval
>>> + * tree to accelerate the search for the starting &drm_gpuva, and
>>> hence is safe
>>> + * against removal of elements. It assumes that @end__ is within
>>> (or is the
>>> + * upper limit of) the &drm_gpuva_manager. This iterator does not
>>> skip over the
>>> + * &drm_gpuva_manager's @kernel_alloc_node.
>>> + */
>>> +#define drm_gpuva_for_each_va_range_safe(va__, next__, mgr__,
>>> start__, end__) \
>>> +    for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__)), \
>>> +         next__ = va ? list_next_entry(va__, rb.entry) : NULL; \
>>> +         va__ && (va__->va.addr < (end__)) && \
>>> +         !list_entry_is_head(va__, &(mgr__)->rb.list, rb.entry); \
>>> +         va__ = next__, next__ = list_next_entry(va__, rb.entry))
>>> +
>>> +/**
>>> + * drm_gpuva_for_each_va - iternator to walk over all &drm_gpuvas
>>> + * @va__: &drm_gpuva to assign to in each iteration step
>>> + * @mgr__: &drm_gpuva_manager to walk over
>>> + *
>>> + * This iterator walks over all &drm_gpuva structures associated
>>> with the given
>>> + * &drm_gpuva_manager.
>>> + */
>>> +#define drm_gpuva_for_each_va(va__, mgr__) \
>>> +    list_for_each_entry(va__, &(mgr__)->rb.list, rb.entry)
>>> +
>>> +/**
>>> + * drm_gpuva_for_each_va_safe - iternator to safely walk over all
>>> &drm_gpuvas
>>> + * @va__: &drm_gpuva to assign to in each iteration step
>>> + * @next__: another &drm_gpuva to use as temporary storage
>>> + * @mgr__: &drm_gpuva_manager to walk over
>>> + *
>>> + * This iterator walks over all &drm_gpuva structures associated
>>> with the given
>>> + * &drm_gpuva_manager. It is implemented with
>>> list_for_each_entry_safe(), and
>>> + * hence safe against the removal of elements.
>>> + */
>>> +#define drm_gpuva_for_each_va_safe(va__, next__, mgr__) \
>>> +    list_for_each_entry_safe(va__, next__, &(mgr__)->rb.list,
>>> rb.entry)
>>> +
>>> +/**
>>> + * enum drm_gpuva_op_type - GPU VA operation type
>>> + *
>>> + * Operations to alter the GPU VA mappings tracked by the
>>> &drm_gpuva_manager.
>>> + */
>>> +enum drm_gpuva_op_type {
>>> +    /**
>>> +     * @DRM_GPUVA_OP_MAP: the map op type
>>> +     */
>>> +    DRM_GPUVA_OP_MAP,
>>> +
>>> +    /**
>>> +     * @DRM_GPUVA_OP_REMAP: the remap op type
>>> +     */
>>> +    DRM_GPUVA_OP_REMAP,
>>> +
>>> +    /**
>>> +     * @DRM_GPUVA_OP_UNMAP: the unmap op type
>>> +     */
>>> +    DRM_GPUVA_OP_UNMAP,
>>> +
>>> +    /**
>>> +     * @DRM_GPUVA_OP_PREFETCH: the prefetch op type
>>> +     */
>>> +    DRM_GPUVA_OP_PREFETCH,
>>> +};
>>> +
>>> +/**
>>> + * struct drm_gpuva_op_map - GPU VA map operation
>>> + *
>>> + * This structure represents a single map operation generated by the
>>> + * DRM GPU VA manager.
>>> + */
>>> +struct drm_gpuva_op_map {
>>> +    /**
>>> +     * @va: structure containing address and range of a map
>>> +     * operation
>>> +     */
>>> +    struct {
>>> +        /**
>>> +         * @addr: the base address of the new mapping
>>> +         */
>>> +        u64 addr;
>>> +
>>> +        /**
>>> +         * @range: the range of the new mapping
>>> +         */
>>> +        u64 range;
>>> +    } va;
>>> +
>>> +    /**
>>> +     * @gem: structure containing the &drm_gem_object and it's offset
>>> +     */
>>> +    struct {
>>> +        /**
>>> +         * @offset: the offset within the &drm_gem_object
>>> +         */
>>> +        u64 offset;
>>> +
>>> +        /**
>>> +         * @obj: the &drm_gem_object to map
>>> +         */
>>> +        struct drm_gem_object *obj;
>>> +    } gem;
>>> +};
>>> +
>>> +/**
>>> + * struct drm_gpuva_op_unmap - GPU VA unmap operation
>>> + *
>>> + * This structure represents a single unmap operation generated by the
>>> + * DRM GPU VA manager.
>>> + */
>>> +struct drm_gpuva_op_unmap {
>>> +    /**
>>> +     * @va: the &drm_gpuva to unmap
>>> +     */
>>> +    struct drm_gpuva *va;
>>> +
>>> +    /**
>>> +     * @keep:
>>> +     *
>>> +     * Indicates whether this &drm_gpuva is physically contiguous
>>> with the
>>> +     * original mapping request.
>>> +     *
>>> +     * Optionally, if &keep is set, drivers may keep the actual
>>> page table
>>> +     * mappings for this &drm_gpuva, adding the missing page table
>>> entries
>>> +     * only and update the &drm_gpuva_manager accordingly.
>>> +     */
>>> +    bool keep;
>>> +};
>>> +
>>> +/**
>>> + * struct drm_gpuva_op_remap - GPU VA remap operation
>>> + *
>>> + * This represents a single remap operation generated by the DRM
>>> GPU VA manager.
>>> + *
>>> + * A remap operation is generated when an existing GPU VA mmapping
>>> is split up
>>> + * by inserting a new GPU VA mapping or by partially unmapping
>>> existent
>>> + * mapping(s), hence it consists of a maximum of two map and one unmap
>>> + * operation.
>>> + *
>>> + * The @unmap operation takes care of removing the original
>>> existing mapping.
>>> + * @prev is used to remap the preceding part, @next the subsequent
>>> part.
>>> + *
>>> + * If either a new mapping's start address is aligned with the
>>> start address
>>> + * of the old mapping or the new mapping's end address is aligned
>>> with the
>>> + * end address of the old mapping, either @prev or @next is NULL.
>>> + *
>>> + * Note, the reason for a dedicated remap operation, rather than
>>> arbitrary
>>> + * unmap and map operations, is to give drivers the chance of
>>> extracting driver
>>> + * specific data for creating the new mappings from the unmap
>>> operations's
>>> + * &drm_gpuva structure which typically is embedded in larger
>>> driver specific
>>> + * structures.
>>> + */
>>> +struct drm_gpuva_op_remap {
>>> +    /**
>>> +     * @prev: the preceding part of a split mapping
>>> +     */
>>> +    struct drm_gpuva_op_map *prev;
>>> +
>>> +    /**
>>> +     * @next: the subsequent part of a split mapping
>>> +     */
>>> +    struct drm_gpuva_op_map *next;
>>> +
>>> +    /**
>>> +     * @unmap: the unmap operation for the original existing mapping
>>> +     */
>>> +    struct drm_gpuva_op_unmap *unmap;
>>> +};
>>> +
>>> +/**
>>> + * struct drm_gpuva_op_prefetch - GPU VA prefetch operation
>>> + *
>>> + * This structure represents a single prefetch operation generated
>>> by the
>>> + * DRM GPU VA manager.
>>> + */
>>> +struct drm_gpuva_op_prefetch {
>>> +    /**
>>> +     * @va: the &drm_gpuva to prefetch
>>> +     */
>>> +    struct drm_gpuva *va;
>>> +};
>>> +
>>> +/**
>>> + * struct drm_gpuva_op - GPU VA operation
>>> + *
>>> + * This structure represents a single generic operation.
>>> + *
>>> + * The particular type of the operation is defined by @op.
>>> + */
>>> +struct drm_gpuva_op {
>>> +    /**
>>> +     * @entry:
>>> +     *
>>> +     * The &list_head used to distribute instances of this struct
>>> within
>>> +     * &drm_gpuva_ops.
>>> +     */
>>> +    struct list_head entry;
>>> +
>>> +    /**
>>> +     * @op: the type of the operation
>>> +     */
>>> +    enum drm_gpuva_op_type op;
>>> +
>>> +    union {
>>> +        /**
>>> +         * @map: the map operation
>>> +         */
>>> +        struct drm_gpuva_op_map map;
>>> +
>>> +        /**
>>> +         * @remap: the remap operation
>>> +         */
>>> +        struct drm_gpuva_op_remap remap;
>>> +
>>> +        /**
>>> +         * @unmap: the unmap operation
>>> +         */
>>> +        struct drm_gpuva_op_unmap unmap;
>>> +
>>> +        /**
>>> +         * @prefetch: the prefetch operation
>>> +         */
>>> +        struct drm_gpuva_op_prefetch prefetch;
>>> +    };
>>> +};
>>> +
>>> +/**
>>> + * struct drm_gpuva_ops - wraps a list of &drm_gpuva_op
>>> + */
>>> +struct drm_gpuva_ops {
>>> +    /**
>>> +     * @list: the &list_head
>>> +     */
>>> +    struct list_head list;
>>> +};
>>> +
>>> +/**
>>> + * drm_gpuva_for_each_op - iterator to walk over &drm_gpuva_ops
>>> + * @op: &drm_gpuva_op to assign in each iteration step
>>> + * @ops: &drm_gpuva_ops to walk
>>> + *
>>> + * This iterator walks over all ops within a given list of operations.
>>> + */
>>> +#define drm_gpuva_for_each_op(op, ops) list_for_each_entry(op,
>>> &(ops)->list, entry)
>>> +
>>> +/**
>>> + * drm_gpuva_for_each_op_safe - iterator to safely walk over
>>> &drm_gpuva_ops
>>> + * @op: &drm_gpuva_op to assign in each iteration step
>>> + * @next: &next &drm_gpuva_op to store the next step
>>> + * @ops: &drm_gpuva_ops to walk
>>> + *
>>> + * This iterator walks over all ops within a given list of
>>> operations. It is
>>> + * implemented with list_for_each_safe(), so save against removal
>>> of elements.
>>> + */
>>> +#define drm_gpuva_for_each_op_safe(op, next, ops) \
>>> +    list_for_each_entry_safe(op, next, &(ops)->list, entry)
>>> +
>>> +/**
>>> + * drm_gpuva_for_each_op_from_reverse - iterate backwards from the
>>> given point
>>> + * @op: &drm_gpuva_op to assign in each iteration step
>>> + * @ops: &drm_gpuva_ops to walk
>>> + *
>>> + * This iterator walks over all ops within a given list of
>>> operations beginning
>>> + * from the given operation in reverse order.
>>> + */
>>> +#define drm_gpuva_for_each_op_from_reverse(op, ops) \
>>> +    list_for_each_entry_from_reverse(op, &(ops)->list, entry)
>>> +
>>> +/**
>>> + * drm_gpuva_first_op - returns the first &drm_gpuva_op from
>>> &drm_gpuva_ops
>>> + * @ops: the &drm_gpuva_ops to get the fist &drm_gpuva_op from
>>> + */
>>> +#define drm_gpuva_first_op(ops) \
>>> +    list_first_entry(&(ops)->list, struct drm_gpuva_op, entry)
>>> +
>>> +/**
>>> + * drm_gpuva_last_op - returns the last &drm_gpuva_op from
>>> &drm_gpuva_ops
>>> + * @ops: the &drm_gpuva_ops to get the last &drm_gpuva_op from
>>> + */
>>> +#define drm_gpuva_last_op(ops) \
>>> +    list_last_entry(&(ops)->list, struct drm_gpuva_op, entry)
>>> +
>>> +/**
>>> + * drm_gpuva_prev_op - previous &drm_gpuva_op in the list
>>> + * @op: the current &drm_gpuva_op
>>> + */
>>> +#define drm_gpuva_prev_op(op) list_prev_entry(op, entry)
>>> +
>>> +/**
>>> + * drm_gpuva_next_op - next &drm_gpuva_op in the list
>>> + * @op: the current &drm_gpuva_op
>>> + */
>>> +#define drm_gpuva_next_op(op) list_next_entry(op, entry)
>>> +
>>> +struct drm_gpuva_ops *
>>> +drm_gpuva_sm_map_ops_create(struct drm_gpuva_manager *mgr,
>>> +                u64 addr, u64 range,
>>> +                struct drm_gem_object *obj, u64 offset);
>>> +struct drm_gpuva_ops *
>>> +drm_gpuva_sm_unmap_ops_create(struct drm_gpuva_manager *mgr,
>>> +                  u64 addr, u64 range);
>>> +
>>> +struct drm_gpuva_ops *
>>> +drm_gpuva_prefetch_ops_create(struct drm_gpuva_manager *mgr,
>>> +                 u64 addr, u64 range);
>>> +
>>> +struct drm_gpuva_ops *
>>> +drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
>>> +                   struct drm_gem_object *obj);
>>> +
>>> +void drm_gpuva_ops_free(struct drm_gpuva_manager *mgr,
>>> +            struct drm_gpuva_ops *ops);
>>> +
>>> +static inline void drm_gpuva_init_from_op(struct drm_gpuva *va,
>>> +                      struct drm_gpuva_op_map *op)
>>> +{
>>> +    drm_gpuva_init(va, op->va.addr, op->va.range,
>>> +               op->gem.obj, op->gem.offset);
>>> +}
>>> +
>>> +/**
>>> + * struct drm_gpuva_fn_ops - callbacks for split/merge steps
>>> + *
>>> + * This structure defines the callbacks used by &drm_gpuva_sm_map and
>>> + * &drm_gpuva_sm_unmap to provide the split/merge steps for map and
>>> unmap
>>> + * operations to drivers.
>>> + */
>>> +struct drm_gpuva_fn_ops {
>>> +    /**
>>> +     * @op_alloc: called when the &drm_gpuva_manager allocates
>>> +     * a struct drm_gpuva_op
>>> +     *
>>> +     * Some drivers may want to embed struct drm_gpuva_op into driver
>>> +     * specific structures. By implementing this callback drivers can
>>> +     * allocate memory accordingly.
>>> +     *
>>> +     * This callback is optional.
>>> +     */
>>> +    struct drm_gpuva_op *(*op_alloc)(void);
>>> +
>>> +    /**
>>> +     * @op_free: called when the &drm_gpuva_manager frees a
>>> +     * struct drm_gpuva_op
>>> +     *
>>> +     * Some drivers may want to embed struct drm_gpuva_op into driver
>>> +     * specific structures. By implementing this callback drivers can
>>> +     * free the previously allocated memory accordingly.
>>> +     *
>>> +     * This callback is optional.
>>> +     */
>>> +    void (*op_free)(struct drm_gpuva_op *op);
>>> +
>>> +    /**
>>> +     * @sm_step_map: called from &drm_gpuva_sm_map to finally
>>> insert the
>>> +     * mapping once all previous steps were completed
>>> +     *
>>> +     * The &priv pointer matches the one the driver passed to
>>> +     * &drm_gpuva_sm_map or &drm_gpuva_sm_unmap, respectively.
>>> +     *
>>> +     * Can be NULL if &drm_gpuva_sm_map is used.
>>> +     */
>>> +    int (*sm_step_map)(struct drm_gpuva_op *op, void *priv);
>>> +
>>> +    /**
>>> +     * @sm_step_remap: called from &drm_gpuva_sm_map and
>>> +     * &drm_gpuva_sm_unmap to split up an existent mapping
>>> +     *
>>> +     * This callback is called when existent mapping needs to be
>>> split up.
>>> +     * This is the case when either a newly requested mapping
>>> overlaps or
>>> +     * is enclosed by an existent mapping or a partial unmap of an
>>> existent
>>> +     * mapping is requested.
>>> +     *
>>> +     * The &priv pointer matches the one the driver passed to
>>> +     * &drm_gpuva_sm_map or &drm_gpuva_sm_unmap, respectively.
>>> +     *
>>> +     * Can be NULL if neither &drm_gpuva_sm_map nor
>>> &drm_gpuva_sm_unmap is
>>> +     * used.
>>> +     */
>>> +    int (*sm_step_remap)(struct drm_gpuva_op *op, void *priv);
>>> +
>>> +    /**
>>> +     * @sm_step_unmap: called from &drm_gpuva_sm_map and
>>> +     * &drm_gpuva_sm_unmap to unmap an existent mapping
>>> +     *
>>> +     * This callback is called when existent mapping needs to be
>>> unmapped.
>>> +     * This is the case when either a newly requested mapping
>>> encloses an
>>> +     * existent mapping or an unmap of an existent mapping is
>>> requested.
>>> +     *
>>> +     * The &priv pointer matches the one the driver passed to
>>> +     * &drm_gpuva_sm_map or &drm_gpuva_sm_unmap, respectively.
>>> +     *
>>> +     * Can be NULL if neither &drm_gpuva_sm_map nor
>>> &drm_gpuva_sm_unmap is
>>> +     * used.
>>> +     */
>>> +    int (*sm_step_unmap)(struct drm_gpuva_op *op, void *priv);
>>> +};
>>> +
>>> +int drm_gpuva_sm_map(struct drm_gpuva_manager *mgr, void *priv,
>>> +             u64 addr, u64 range,
>>> +             struct drm_gem_object *obj, u64 offset);
>>> +
>>> +int drm_gpuva_sm_unmap(struct drm_gpuva_manager *mgr, void *priv,
>>> +               u64 addr, u64 range);
>>> +
>>> +void drm_gpuva_map(struct drm_gpuva_manager *mgr,
>>> +           struct drm_gpuva *va,
>>> +           struct drm_gpuva_op_map *op);
>> Missing newline
>>> +void drm_gpuva_remap(struct drm_gpuva *prev,
>>> +             struct drm_gpuva *next,
>>> +             struct drm_gpuva_op_remap *op);
>> Missing newline
>>> +void drm_gpuva_unmap(struct drm_gpuva_op_unmap *op);
>>> +
>>> +#endif /* __DRM_GPUVA_MGR_H__ */
>>

2023-07-06 18:37:51

by Boris Brezillon

[permalink] [raw]
Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings

On Fri, 30 Jun 2023 00:25:18 +0200
Danilo Krummrich <[email protected]> wrote:

> +#ifdef CONFIG_LOCKDEP
> +typedef struct lockdep_map *lockdep_map_p;
> +#define drm_gpuva_manager_ext_assert_held(mgr) \
> + lockdep_assert(lock_is_held((mgr)->ext_lock) != LOCK_STATE_NOT_HELD)
> +/**
> + * drm_gpuva_manager_set_ext_lock - set the external lock according to
> + * @DRM_GPUVA_MANAGER_LOCK_EXTERN
> + * @mgr: the &drm_gpuva_manager to set the lock for
> + * @lock: the lock to set
> + *
> + * If @DRM_GPUVA_MANAGER_LOCK_EXTERN is set, drivers need to call this function
> + * to provide the lock used to lock linking and unlinking of &drm_gpuvas to the
> + * &drm_gem_objects GPUVA list.
> + */
> +#define drm_gpuva_manager_set_ext_lock(mgr, lock) \
> + (mgr)->ext_lock = &(lock)->dep_map

Okay, so, IIUC, this is the lock protecting the GEM's active mappings
list, meaning the lock is likely to be attached to the GEM object. Are
we expected to call drm_gpuva_manager_set_ext_lock() every time we call
drm_gpuva_[un]link(), or are we supposed to have some lock at the
device level serializing all drm_gpuva_[un]link() calls across VMs? The
later doesn't sound like a good option to me, and the former feels a bit
weird. I'm wondering if we shouldn't just drop this assert_held() check
when DRM_GPUVA_MANAGER_LOCK_EXTERN is set. Alternatively, we could say
that any driver wanting to use a custom lock (which is basically all
drivers modifying the VA space asynchronously in the ::run_job() path)
has to provide its own variant of drm_gpuva_[un]link() (maybe with its
own VA list too), which doesn't sound like a good idea either.

> +#else
> +typedef struct { /* nothing */ } lockdep_map_p;
> +#define drm_gpuva_manager_ext_assert_held(mgr) do { (void)(mgr); } while (0)
> +#define drm_gpuva_manager_set_ext_lock(mgr, lock) do { } while (0)
> +#endif

2023-07-07 08:25:31

by Boris Brezillon

[permalink] [raw]
Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings

On Thu, 6 Jul 2023 20:26:42 +0200
Boris Brezillon <[email protected]> wrote:

> On Fri, 30 Jun 2023 00:25:18 +0200
> Danilo Krummrich <[email protected]> wrote:
>
> > +#ifdef CONFIG_LOCKDEP
> > +typedef struct lockdep_map *lockdep_map_p;
> > +#define drm_gpuva_manager_ext_assert_held(mgr) \
> > + lockdep_assert(lock_is_held((mgr)->ext_lock) != LOCK_STATE_NOT_HELD)
> > +/**
> > + * drm_gpuva_manager_set_ext_lock - set the external lock according to
> > + * @DRM_GPUVA_MANAGER_LOCK_EXTERN
> > + * @mgr: the &drm_gpuva_manager to set the lock for
> > + * @lock: the lock to set
> > + *
> > + * If @DRM_GPUVA_MANAGER_LOCK_EXTERN is set, drivers need to call this function
> > + * to provide the lock used to lock linking and unlinking of &drm_gpuvas to the
> > + * &drm_gem_objects GPUVA list.
> > + */
> > +#define drm_gpuva_manager_set_ext_lock(mgr, lock) \
> > + (mgr)->ext_lock = &(lock)->dep_map
>
> Okay, so, IIUC, this is the lock protecting the GEM's active mappings
> list, meaning the lock is likely to be attached to the GEM object. Are
> we expected to call drm_gpuva_manager_set_ext_lock() every time we call
> drm_gpuva_[un]link(), or are we supposed to have some lock at the
> device level serializing all drm_gpuva_[un]link() calls across VMs? The
> later doesn't sound like a good option to me, and the former feels a bit
> weird. I'm wondering if we shouldn't just drop this assert_held() check
> when DRM_GPUVA_MANAGER_LOCK_EXTERN is set. Alternatively, we could say
> that any driver wanting to use a custom lock (which is basically all
> drivers modifying the VA space asynchronously in the ::run_job() path)
> has to provide its own variant of drm_gpuva_[un]link() (maybe with its
> own VA list too), which doesn't sound like a good idea either.

Or we could just attach the dep_map to drm_gem_object::gpuva::lock, and
let drivers overload the default lock in their GEM creation function if
they want to use a custom lock (see the following diff).

---

diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
index e47747f22126..6427c88c22ba 100644
--- a/drivers/gpu/drm/drm_gpuva_mgr.c
+++ b/drivers/gpu/drm/drm_gpuva_mgr.c
@@ -675,8 +675,7 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
const char *name,
u64 start_offset, u64 range,
u64 reserve_offset, u64 reserve_range,
- const struct drm_gpuva_fn_ops *ops,
- enum drm_gpuva_manager_flags flags)
+ const struct drm_gpuva_fn_ops *ops)
{
mgr->rb.tree = RB_ROOT_CACHED;
INIT_LIST_HEAD(&mgr->rb.list);
@@ -686,7 +685,6 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
mgr->mm_range = range;

mgr->name = name ? name : "unknown";
- mgr->flags = flags;
mgr->ops = ops;

memset(&mgr->kernel_alloc_node, 0, sizeof(struct drm_gpuva));
@@ -822,16 +820,12 @@ EXPORT_SYMBOL(drm_gpuva_remove);
void
drm_gpuva_link(struct drm_gpuva *va)
{
- struct drm_gpuva_manager *mgr = va->mgr;
struct drm_gem_object *obj = va->gem.obj;

if (unlikely(!obj))
return;

- if (drm_gpuva_manager_external_lock(mgr))
- drm_gpuva_manager_ext_assert_held(mgr);
- else
- dma_resv_assert_held(obj->resv);
+ drm_gem_gpuva_assert_lock_held(obj);

list_add_tail(&va->gem.entry, &obj->gpuva.list);
}
@@ -850,16 +844,12 @@ EXPORT_SYMBOL(drm_gpuva_link);
void
drm_gpuva_unlink(struct drm_gpuva *va)
{
- struct drm_gpuva_manager *mgr = va->mgr;
struct drm_gem_object *obj = va->gem.obj;

if (unlikely(!obj))
return;

- if (drm_gpuva_manager_external_lock(mgr))
- drm_gpuva_manager_ext_assert_held(mgr);
- else
- dma_resv_assert_held(obj->resv);
+ drm_gem_gpuva_assert_lock_held(obj);

list_del_init(&va->gem.entry);
}
@@ -1680,10 +1670,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
struct drm_gpuva *va;
int ret;

- if (drm_gpuva_manager_external_lock(mgr))
- drm_gpuva_manager_ext_assert_held(mgr);
- else
- dma_resv_assert_held(obj->resv);
+ drm_gem_gpuva_assert_lock_held(obj);

ops = kzalloc(sizeof(*ops), GFP_KERNEL);
if (!ops)
diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
index 5ec8148a30ee..572d7a538324 100644
--- a/include/drm/drm_gem.h
+++ b/include/drm/drm_gem.h
@@ -387,10 +387,14 @@ struct drm_gem_object {
* Provides the list of GPU VAs attached to this GEM object.
*
* Drivers should lock list accesses with the GEMs &dma_resv lock
- * (&drm_gem_object.resv).
+ * (&drm_gem_object.resv) or a custom lock if one is provided.
*/
struct {
struct list_head list;
+
+#ifdef CONFIG_LOCKDEP
+ struct lockdep_map *lock_dep_map;
+#endif
} gpuva;

/**
@@ -540,6 +544,26 @@ unsigned long drm_gem_lru_scan(struct drm_gem_lru *lru,

int drm_gem_evict(struct drm_gem_object *obj);

+#ifdef CONFIG_LOCKDEP
+/*
+ * drm_gem_gpuva_set_lock() - Set the lock protecting accesses to the gpuva list.
+ * @obj: the &drm_gem_object
+ * @lock: the lock used to protect the gpuva list. The locking primitive
+ * must contain a dep_map field.
+ *
+ * Call this if you're not proctecting access to the gpuva list
+ * with the resv lock, otherwise, drm_gem_gpuva_init() takes case
+ * of initializing the lock_dep_map for you.
+ */
+#define drm_gem_gpuva_set_lock(obj, lock) \
+ obj->gpuva.lock_dep_map = &(lock)->dep_map
+#define drm_gem_gpuva_assert_lock_held(obj) \
+ lockdep_assert(lock_is_held(obj->gpuva.lock_dep_map))
+#else
+#define drm_gem_gpuva_set_lock(obj, lock) do {} while(0)
+#define drm_gem_gpuva_assert_lock_held(obj) do {} while(0)
+#endif
+
/**
* drm_gem_gpuva_init - initialize the gpuva list of a GEM object
* @obj: the &drm_gem_object
@@ -552,6 +576,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
{
INIT_LIST_HEAD(&obj->gpuva.list);
+ drm_gem_gpuva_set_lock(obj, &obj->resv->lock.base);
}

/**
diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
index 4f23aaf726dd..4ad56b67e244 100644
--- a/include/drm/drm_gpuva_mgr.h
+++ b/include/drm/drm_gpuva_mgr.h
@@ -185,44 +185,6 @@ static inline bool drm_gpuva_invalidated(struct drm_gpuva *va)
return va->flags & DRM_GPUVA_INVALIDATED;
}

-#ifdef CONFIG_LOCKDEP
-typedef struct lockdep_map *lockdep_map_p;
-#define drm_gpuva_manager_ext_assert_held(mgr) \
- lockdep_assert(lock_is_held((mgr)->ext_lock) != LOCK_STATE_NOT_HELD)
-/**
- * drm_gpuva_manager_set_ext_lock - set the external lock according to
- * @DRM_GPUVA_MANAGER_LOCK_EXTERN
- * @mgr: the &drm_gpuva_manager to set the lock for
- * @lock: the lock to set
- *
- * If @DRM_GPUVA_MANAGER_LOCK_EXTERN is set, drivers need to call this function
- * to provide the lock used to lock linking and unlinking of &drm_gpuvas to the
- * &drm_gem_objects GPUVA list.
- */
-#define drm_gpuva_manager_set_ext_lock(mgr, lock) \
- (mgr)->ext_lock = &(lock)->dep_map
-#else
-typedef struct { /* nothing */ } lockdep_map_p;
-#define drm_gpuva_manager_ext_assert_held(mgr) do { (void)(mgr); } while (0)
-#define drm_gpuva_manager_set_ext_lock(mgr, lock) do { } while (0)
-#endif
-
-/**
- * enum drm_gpuva_manager_flags - the feature flags for the &drm_gpuva_manager
- */
-enum drm_gpuva_manager_flags {
- /**
- * @DRM_GPUVA_MANAGER_LOCK_EXTERN:
- *
- * Indicates the driver has it's own external lock for linking and
- * unlinking &drm_gpuvas to the &drm_gem_objects GPUVA list.
- *
- * When setting this flag it is rquired to set a lock via
- * drm_gpuva_set_ext_lock().
- */
- DRM_GPUVA_MANAGER_LOCK_EXTERN = (1 << 0),
-};
-
/**
* struct drm_gpuva_manager - DRM GPU VA Manager
*
@@ -241,11 +203,6 @@ struct drm_gpuva_manager {
*/
const char *name;

- /**
- * @flags: the feature flags of the &drm_gpuva_manager
- */
- enum drm_gpuva_manager_flags flags;
-
/**
* @mm_start: start of the VA space
*/
@@ -283,31 +240,15 @@ struct drm_gpuva_manager {
* @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
*/
const struct drm_gpuva_fn_ops *ops;
-
- /**
- * @ext_lock: &lockdep_map according to @DRM_GPUVA_MANAGER_LOCK_EXTERN
- */
- lockdep_map_p ext_lock;
};

void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
const char *name,
u64 start_offset, u64 range,
u64 reserve_offset, u64 reserve_range,
- const struct drm_gpuva_fn_ops *ops,
- enum drm_gpuva_manager_flags flags);
+ const struct drm_gpuva_fn_ops *ops);
void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);

-/**
- * drm_gpuva_manager_external_lock - indicates whether the
- * @DRM_GPUVA_MANAGER_LOCK_EXTERN flag is set
- * @mgr: the &drm_gpuva_manager to check the flag for
- */
-static inline bool drm_gpuva_manager_external_lock(struct drm_gpuva_manager *mgr)
-{
- return mgr->flags & DRM_GPUVA_MANAGER_LOCK_EXTERN;
-}
-
/**
* drm_gpuva_for_each_va_range - iternator to walk over a range of &drm_gpuvas
* @va__: &drm_gpuva structure to assign to in each iteration step

2023-07-07 11:06:17

by Boris Brezillon

[permalink] [raw]
Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings

On Fri, 30 Jun 2023 00:25:18 +0200
Danilo Krummrich <[email protected]> wrote:

> +/**
> + * drm_gpuva_for_each_va_range - iternator to walk over a range of &drm_gpuvas
> + * @va__: &drm_gpuva structure to assign to in each iteration step
> + * @mgr__: &drm_gpuva_manager to walk over
> + * @start__: starting offset, the first gpuva will overlap this
> + * @end__: ending offset, the last gpuva will start before this (but may
> + * overlap)
> + *
> + * This iterator walks over all &drm_gpuvas in the &drm_gpuva_manager that lie
> + * between @start__ and @end__. It is implemented similarly to list_for_each(),
> + * but is using the &drm_gpuva_manager's internal interval tree to accelerate
> + * the search for the starting &drm_gpuva, and hence isn't safe against removal
> + * of elements. It assumes that @end__ is within (or is the upper limit of) the
> + * &drm_gpuva_manager. This iterator does not skip over the &drm_gpuva_manager's
> + * @kernel_alloc_node.
> + */
> +#define drm_gpuva_for_each_va_range(va__, mgr__, start__, end__) \
> + for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__)); \

drm_gpuva_find_first() takes the range size as its last argument, not
the range end:

for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__) - (start__)); \


> + va__ && (va__->va.addr < (end__)) && \
> + !list_entry_is_head(va__, &(mgr__)->rb.list, rb.entry); \
> + va__ = list_next_entry(va__, rb.entry))

If you define:

static inline struct drm_gpuva *
drm_gpuva_next(struct drm_gpuva *va)
{
if (va && !list_is_last(&va->rb.entry, &va->mgr->rb.list))
return list_next_entry(va, rb.entry);

return NULL;
}

the for loop becomes a bit more readable:

for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__) - (start__)); \
va__ && (va__->va.addr < (end__)); \
va__ = drm_gpuva_next(va__))

> +
> +/**
> + * drm_gpuva_for_each_va_range_safe - iternator to safely walk over a range of
> + * &drm_gpuvas
> + * @va__: &drm_gpuva to assign to in each iteration step
> + * @next__: another &drm_gpuva to use as temporary storage
> + * @mgr__: &drm_gpuva_manager to walk over
> + * @start__: starting offset, the first gpuva will overlap this
> + * @end__: ending offset, the last gpuva will start before this (but may
> + * overlap)
> + *
> + * This iterator walks over all &drm_gpuvas in the &drm_gpuva_manager that lie
> + * between @start__ and @end__. It is implemented similarly to
> + * list_for_each_safe(), but is using the &drm_gpuva_manager's internal interval
> + * tree to accelerate the search for the starting &drm_gpuva, and hence is safe
> + * against removal of elements. It assumes that @end__ is within (or is the
> + * upper limit of) the &drm_gpuva_manager. This iterator does not skip over the
> + * &drm_gpuva_manager's @kernel_alloc_node.
> + */
> +#define drm_gpuva_for_each_va_range_safe(va__, next__, mgr__, start__, end__) \
> + for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__)), \
> + next__ = va ? list_next_entry(va__, rb.entry) : NULL; \
> + va__ && (va__->va.addr < (end__)) && \
> + !list_entry_is_head(va__, &(mgr__)->rb.list, rb.entry); \
> + va__ = next__, next__ = list_next_entry(va__, rb.entry))

And this is the safe version using the drm_gpuva_next() helper:

for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__) - (start__)), \
next__ = drm_gpuva_next(va__); \
va__ && (va__->va.addr < (end__)); \
va__ = next__, next__ = drm_gpuva_next(va__))

Those changes fixed an invalid pointer access I had in the sm_unmap()
path.

2023-07-07 12:53:15

by Danilo Krummrich

[permalink] [raw]
Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings

On 7/7/23 09:57, Boris Brezillon wrote:
> On Thu, 6 Jul 2023 20:26:42 +0200
> Boris Brezillon <[email protected]> wrote:
>
>> On Fri, 30 Jun 2023 00:25:18 +0200
>> Danilo Krummrich <[email protected]> wrote:
>>
>>> +#ifdef CONFIG_LOCKDEP
>>> +typedef struct lockdep_map *lockdep_map_p;
>>> +#define drm_gpuva_manager_ext_assert_held(mgr) \
>>> + lockdep_assert(lock_is_held((mgr)->ext_lock) != LOCK_STATE_NOT_HELD)
>>> +/**
>>> + * drm_gpuva_manager_set_ext_lock - set the external lock according to
>>> + * @DRM_GPUVA_MANAGER_LOCK_EXTERN
>>> + * @mgr: the &drm_gpuva_manager to set the lock for
>>> + * @lock: the lock to set
>>> + *
>>> + * If @DRM_GPUVA_MANAGER_LOCK_EXTERN is set, drivers need to call this function
>>> + * to provide the lock used to lock linking and unlinking of &drm_gpuvas to the
>>> + * &drm_gem_objects GPUVA list.
>>> + */
>>> +#define drm_gpuva_manager_set_ext_lock(mgr, lock) \
>>> + (mgr)->ext_lock = &(lock)->dep_map
>>
>> Okay, so, IIUC, this is the lock protecting the GEM's active mappings
>> list, meaning the lock is likely to be attached to the GEM object. Are
>> we expected to call drm_gpuva_manager_set_ext_lock() every time we call
>> drm_gpuva_[un]link(), or are we supposed to have some lock at the
>> device level serializing all drm_gpuva_[un]link() calls across VMs? The
>> later doesn't sound like a good option to me, and the former feels a bit
>> weird. I'm wondering if we shouldn't just drop this assert_held() check
>> when DRM_GPUVA_MANAGER_LOCK_EXTERN is set. Alternatively, we could say
>> that any driver wanting to use a custom lock (which is basically all
>> drivers modifying the VA space asynchronously in the ::run_job() path)
>> has to provide its own variant of drm_gpuva_[un]link() (maybe with its
>> own VA list too), which doesn't sound like a good idea either.
>
> Or we could just attach the dep_map to drm_gem_object::gpuva::lock, and
> let drivers overload the default lock in their GEM creation function if
> they want to use a custom lock (see the following diff).

Uh, I like that. Will pick it up, thanks!

>
> ---
>
> diff --git a/drivers/gpu/drm/drm_gpuva_mgr.c b/drivers/gpu/drm/drm_gpuva_mgr.c
> index e47747f22126..6427c88c22ba 100644
> --- a/drivers/gpu/drm/drm_gpuva_mgr.c
> +++ b/drivers/gpu/drm/drm_gpuva_mgr.c
> @@ -675,8 +675,7 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> const char *name,
> u64 start_offset, u64 range,
> u64 reserve_offset, u64 reserve_range,
> - const struct drm_gpuva_fn_ops *ops,
> - enum drm_gpuva_manager_flags flags)
> + const struct drm_gpuva_fn_ops *ops)
> {
> mgr->rb.tree = RB_ROOT_CACHED;
> INIT_LIST_HEAD(&mgr->rb.list);
> @@ -686,7 +685,6 @@ drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> mgr->mm_range = range;
>
> mgr->name = name ? name : "unknown";
> - mgr->flags = flags;
> mgr->ops = ops;
>
> memset(&mgr->kernel_alloc_node, 0, sizeof(struct drm_gpuva));
> @@ -822,16 +820,12 @@ EXPORT_SYMBOL(drm_gpuva_remove);
> void
> drm_gpuva_link(struct drm_gpuva *va)
> {
> - struct drm_gpuva_manager *mgr = va->mgr;
> struct drm_gem_object *obj = va->gem.obj;
>
> if (unlikely(!obj))
> return;
>
> - if (drm_gpuva_manager_external_lock(mgr))
> - drm_gpuva_manager_ext_assert_held(mgr);
> - else
> - dma_resv_assert_held(obj->resv);
> + drm_gem_gpuva_assert_lock_held(obj);
>
> list_add_tail(&va->gem.entry, &obj->gpuva.list);
> }
> @@ -850,16 +844,12 @@ EXPORT_SYMBOL(drm_gpuva_link);
> void
> drm_gpuva_unlink(struct drm_gpuva *va)
> {
> - struct drm_gpuva_manager *mgr = va->mgr;
> struct drm_gem_object *obj = va->gem.obj;
>
> if (unlikely(!obj))
> return;
>
> - if (drm_gpuva_manager_external_lock(mgr))
> - drm_gpuva_manager_ext_assert_held(mgr);
> - else
> - dma_resv_assert_held(obj->resv);
> + drm_gem_gpuva_assert_lock_held(obj);
>
> list_del_init(&va->gem.entry);
> }
> @@ -1680,10 +1670,7 @@ drm_gpuva_gem_unmap_ops_create(struct drm_gpuva_manager *mgr,
> struct drm_gpuva *va;
> int ret;
>
> - if (drm_gpuva_manager_external_lock(mgr))
> - drm_gpuva_manager_ext_assert_held(mgr);
> - else
> - dma_resv_assert_held(obj->resv);
> + drm_gem_gpuva_assert_lock_held(obj);
>
> ops = kzalloc(sizeof(*ops), GFP_KERNEL);
> if (!ops)
> diff --git a/include/drm/drm_gem.h b/include/drm/drm_gem.h
> index 5ec8148a30ee..572d7a538324 100644
> --- a/include/drm/drm_gem.h
> +++ b/include/drm/drm_gem.h
> @@ -387,10 +387,14 @@ struct drm_gem_object {
> * Provides the list of GPU VAs attached to this GEM object.
> *
> * Drivers should lock list accesses with the GEMs &dma_resv lock
> - * (&drm_gem_object.resv).
> + * (&drm_gem_object.resv) or a custom lock if one is provided.
> */
> struct {
> struct list_head list;
> +
> +#ifdef CONFIG_LOCKDEP
> + struct lockdep_map *lock_dep_map;
> +#endif
> } gpuva;
>
> /**
> @@ -540,6 +544,26 @@ unsigned long drm_gem_lru_scan(struct drm_gem_lru *lru,
>
> int drm_gem_evict(struct drm_gem_object *obj);
>
> +#ifdef CONFIG_LOCKDEP
> +/*
> + * drm_gem_gpuva_set_lock() - Set the lock protecting accesses to the gpuva list.
> + * @obj: the &drm_gem_object
> + * @lock: the lock used to protect the gpuva list. The locking primitive
> + * must contain a dep_map field.
> + *
> + * Call this if you're not proctecting access to the gpuva list
> + * with the resv lock, otherwise, drm_gem_gpuva_init() takes case
> + * of initializing the lock_dep_map for you.
> + */
> +#define drm_gem_gpuva_set_lock(obj, lock) \
> + obj->gpuva.lock_dep_map = &(lock)->dep_map
> +#define drm_gem_gpuva_assert_lock_held(obj) \
> + lockdep_assert(lock_is_held(obj->gpuva.lock_dep_map))
> +#else
> +#define drm_gem_gpuva_set_lock(obj, lock) do {} while(0)
> +#define drm_gem_gpuva_assert_lock_held(obj) do {} while(0)
> +#endif
> +
> /**
> * drm_gem_gpuva_init - initialize the gpuva list of a GEM object
> * @obj: the &drm_gem_object
> @@ -552,6 +576,7 @@ int drm_gem_evict(struct drm_gem_object *obj);
> static inline void drm_gem_gpuva_init(struct drm_gem_object *obj)
> {
> INIT_LIST_HEAD(&obj->gpuva.list);
> + drm_gem_gpuva_set_lock(obj, &obj->resv->lock.base);
> }
>
> /**
> diff --git a/include/drm/drm_gpuva_mgr.h b/include/drm/drm_gpuva_mgr.h
> index 4f23aaf726dd..4ad56b67e244 100644
> --- a/include/drm/drm_gpuva_mgr.h
> +++ b/include/drm/drm_gpuva_mgr.h
> @@ -185,44 +185,6 @@ static inline bool drm_gpuva_invalidated(struct drm_gpuva *va)
> return va->flags & DRM_GPUVA_INVALIDATED;
> }
>
> -#ifdef CONFIG_LOCKDEP
> -typedef struct lockdep_map *lockdep_map_p;
> -#define drm_gpuva_manager_ext_assert_held(mgr) \
> - lockdep_assert(lock_is_held((mgr)->ext_lock) != LOCK_STATE_NOT_HELD)
> -/**
> - * drm_gpuva_manager_set_ext_lock - set the external lock according to
> - * @DRM_GPUVA_MANAGER_LOCK_EXTERN
> - * @mgr: the &drm_gpuva_manager to set the lock for
> - * @lock: the lock to set
> - *
> - * If @DRM_GPUVA_MANAGER_LOCK_EXTERN is set, drivers need to call this function
> - * to provide the lock used to lock linking and unlinking of &drm_gpuvas to the
> - * &drm_gem_objects GPUVA list.
> - */
> -#define drm_gpuva_manager_set_ext_lock(mgr, lock) \
> - (mgr)->ext_lock = &(lock)->dep_map
> -#else
> -typedef struct { /* nothing */ } lockdep_map_p;
> -#define drm_gpuva_manager_ext_assert_held(mgr) do { (void)(mgr); } while (0)
> -#define drm_gpuva_manager_set_ext_lock(mgr, lock) do { } while (0)
> -#endif
> -
> -/**
> - * enum drm_gpuva_manager_flags - the feature flags for the &drm_gpuva_manager
> - */
> -enum drm_gpuva_manager_flags {
> - /**
> - * @DRM_GPUVA_MANAGER_LOCK_EXTERN:
> - *
> - * Indicates the driver has it's own external lock for linking and
> - * unlinking &drm_gpuvas to the &drm_gem_objects GPUVA list.
> - *
> - * When setting this flag it is rquired to set a lock via
> - * drm_gpuva_set_ext_lock().
> - */
> - DRM_GPUVA_MANAGER_LOCK_EXTERN = (1 << 0),
> -};
> -
> /**
> * struct drm_gpuva_manager - DRM GPU VA Manager
> *
> @@ -241,11 +203,6 @@ struct drm_gpuva_manager {
> */
> const char *name;
>
> - /**
> - * @flags: the feature flags of the &drm_gpuva_manager
> - */
> - enum drm_gpuva_manager_flags flags;
> -
> /**
> * @mm_start: start of the VA space
> */
> @@ -283,31 +240,15 @@ struct drm_gpuva_manager {
> * @ops: &drm_gpuva_fn_ops providing the split/merge steps to drivers
> */
> const struct drm_gpuva_fn_ops *ops;
> -
> - /**
> - * @ext_lock: &lockdep_map according to @DRM_GPUVA_MANAGER_LOCK_EXTERN
> - */
> - lockdep_map_p ext_lock;
> };
>
> void drm_gpuva_manager_init(struct drm_gpuva_manager *mgr,
> const char *name,
> u64 start_offset, u64 range,
> u64 reserve_offset, u64 reserve_range,
> - const struct drm_gpuva_fn_ops *ops,
> - enum drm_gpuva_manager_flags flags);
> + const struct drm_gpuva_fn_ops *ops);
> void drm_gpuva_manager_destroy(struct drm_gpuva_manager *mgr);
>
> -/**
> - * drm_gpuva_manager_external_lock - indicates whether the
> - * @DRM_GPUVA_MANAGER_LOCK_EXTERN flag is set
> - * @mgr: the &drm_gpuva_manager to check the flag for
> - */
> -static inline bool drm_gpuva_manager_external_lock(struct drm_gpuva_manager *mgr)
> -{
> - return mgr->flags & DRM_GPUVA_MANAGER_LOCK_EXTERN;
> -}
> -
> /**
> * drm_gpuva_for_each_va_range - iternator to walk over a range of &drm_gpuvas
> * @va__: &drm_gpuva structure to assign to in each iteration step
>


2023-07-07 13:00:29

by Danilo Krummrich

[permalink] [raw]
Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings

On 7/7/23 13:00, Boris Brezillon wrote:
> On Fri, 30 Jun 2023 00:25:18 +0200
> Danilo Krummrich <[email protected]> wrote:
>
>> +/**
>> + * drm_gpuva_for_each_va_range - iternator to walk over a range of &drm_gpuvas
>> + * @va__: &drm_gpuva structure to assign to in each iteration step
>> + * @mgr__: &drm_gpuva_manager to walk over
>> + * @start__: starting offset, the first gpuva will overlap this
>> + * @end__: ending offset, the last gpuva will start before this (but may
>> + * overlap)
>> + *
>> + * This iterator walks over all &drm_gpuvas in the &drm_gpuva_manager that lie
>> + * between @start__ and @end__. It is implemented similarly to list_for_each(),
>> + * but is using the &drm_gpuva_manager's internal interval tree to accelerate
>> + * the search for the starting &drm_gpuva, and hence isn't safe against removal
>> + * of elements. It assumes that @end__ is within (or is the upper limit of) the
>> + * &drm_gpuva_manager. This iterator does not skip over the &drm_gpuva_manager's
>> + * @kernel_alloc_node.
>> + */
>> +#define drm_gpuva_for_each_va_range(va__, mgr__, start__, end__) \
>> + for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__)); \
>
> drm_gpuva_find_first() takes the range size as its last argument, not
> the range end:
>
> for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__) - (start__)); \
>

Good catch! Originally this was

drm_gpuva_it_iter_first(&(mgr)->rb.tree, (start__), (end__) - 1)

but then I changed it since I did not want to expose the interval tree
functions directly.

>
>> + va__ && (va__->va.addr < (end__)) && \
>> + !list_entry_is_head(va__, &(mgr__)->rb.list, rb.entry); \
>> + va__ = list_next_entry(va__, rb.entry))
>
> If you define:
>
> static inline struct drm_gpuva *
> drm_gpuva_next(struct drm_gpuva *va)
> {
> if (va && !list_is_last(&va->rb.entry, &va->mgr->rb.list))
> return list_next_entry(va, rb.entry);
>
> return NULL;
> } >
> the for loop becomes a bit more readable:

Yes, it would. However, I don't want it to be confused with
drm_gpuva_find_next(). Maybe I should rename the latter to something
like drm_gpuva_find_next_neighbor() then.

>
> for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__) - (start__)); \
> va__ && (va__->va.addr < (end__)); \
> va__ = drm_gpuva_next(va__))
>
>> +
>> +/**
>> + * drm_gpuva_for_each_va_range_safe - iternator to safely walk over a range of
>> + * &drm_gpuvas
>> + * @va__: &drm_gpuva to assign to in each iteration step
>> + * @next__: another &drm_gpuva to use as temporary storage
>> + * @mgr__: &drm_gpuva_manager to walk over
>> + * @start__: starting offset, the first gpuva will overlap this
>> + * @end__: ending offset, the last gpuva will start before this (but may
>> + * overlap)
>> + *
>> + * This iterator walks over all &drm_gpuvas in the &drm_gpuva_manager that lie
>> + * between @start__ and @end__. It is implemented similarly to
>> + * list_for_each_safe(), but is using the &drm_gpuva_manager's internal interval
>> + * tree to accelerate the search for the starting &drm_gpuva, and hence is safe
>> + * against removal of elements. It assumes that @end__ is within (or is the
>> + * upper limit of) the &drm_gpuva_manager. This iterator does not skip over the
>> + * &drm_gpuva_manager's @kernel_alloc_node.
>> + */
>> +#define drm_gpuva_for_each_va_range_safe(va__, next__, mgr__, start__, end__) \
>> + for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__)), \
>> + next__ = va ? list_next_entry(va__, rb.entry) : NULL; \
>> + va__ && (va__->va.addr < (end__)) && \
>> + !list_entry_is_head(va__, &(mgr__)->rb.list, rb.entry); \
>> + va__ = next__, next__ = list_next_entry(va__, rb.entry))
>
> And this is the safe version using the drm_gpuva_next() helper:
>
> for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__) - (start__)), \
> next__ = drm_gpuva_next(va__); \
> va__ && (va__->va.addr < (end__)); \
> va__ = next__, next__ = drm_gpuva_next(va__))
>
> Those changes fixed an invalid pointer access I had in the sm_unmap()
> path.
>

Sorry you did run into this bug.

- Danilo


2023-07-07 13:07:17

by Boris Brezillon

[permalink] [raw]
Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings

On Fri, 7 Jul 2023 14:41:23 +0200
Danilo Krummrich <[email protected]> wrote:

> >> + va__ && (va__->va.addr < (end__)) && \
> >> + !list_entry_is_head(va__, &(mgr__)->rb.list, rb.entry); \
> >> + va__ = list_next_entry(va__, rb.entry))
> >
> > If you define:
> >
> > static inline struct drm_gpuva *
> > drm_gpuva_next(struct drm_gpuva *va)
> > {
> > if (va && !list_is_last(&va->rb.entry, &va->mgr->rb.list))
> > return list_next_entry(va, rb.entry);
> >
> > return NULL;
> > } >
> > the for loop becomes a bit more readable:
>
> Yes, it would. However, I don't want it to be confused with
> drm_gpuva_find_next(). Maybe I should rename the latter to something
> like drm_gpuva_find_next_neighbor() then.

If you want to keep drm_gpuva_find_next(), feel free to rename/prefix
the drm_gpuva_next() function. I was just posting it as a reference.

>
> >
> > for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__) - (start__)); \
> > va__ && (va__->va.addr < (end__)); \
> > va__ = drm_gpuva_next(va__))
> >
> >> +
> >> +/**
> >> + * drm_gpuva_for_each_va_range_safe - iternator to safely walk over a range of
> >> + * &drm_gpuvas
> >> + * @va__: &drm_gpuva to assign to in each iteration step
> >> + * @next__: another &drm_gpuva to use as temporary storage
> >> + * @mgr__: &drm_gpuva_manager to walk over
> >> + * @start__: starting offset, the first gpuva will overlap this
> >> + * @end__: ending offset, the last gpuva will start before this (but may
> >> + * overlap)
> >> + *
> >> + * This iterator walks over all &drm_gpuvas in the &drm_gpuva_manager that lie
> >> + * between @start__ and @end__. It is implemented similarly to
> >> + * list_for_each_safe(), but is using the &drm_gpuva_manager's internal interval
> >> + * tree to accelerate the search for the starting &drm_gpuva, and hence is safe
> >> + * against removal of elements. It assumes that @end__ is within (or is the
> >> + * upper limit of) the &drm_gpuva_manager. This iterator does not skip over the
> >> + * &drm_gpuva_manager's @kernel_alloc_node.
> >> + */
> >> +#define drm_gpuva_for_each_va_range_safe(va__, next__, mgr__, start__, end__) \
> >> + for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__)), \
> >> + next__ = va ? list_next_entry(va__, rb.entry) : NULL; \
> >> + va__ && (va__->va.addr < (end__)) && \
> >> + !list_entry_is_head(va__, &(mgr__)->rb.list, rb.entry); \
> >> + va__ = next__, next__ = list_next_entry(va__, rb.entry))
> >
> > And this is the safe version using the drm_gpuva_next() helper:
> >
> > for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__) - (start__)), \
> > next__ = drm_gpuva_next(va__); \
> > va__ && (va__->va.addr < (end__)); \
> > va__ = next__, next__ = drm_gpuva_next(va__))
> >
> > Those changes fixed an invalid pointer access I had in the sm_unmap()
> > path.
> >
>
> Sorry you did run into this bug.

No worries, that's what testing/debugging/reviewing is for. And I'm glad
someone decided to work on this gpuva stuff so I don't have to code it
myself, so that's the least I can do.

2023-07-08 07:31:36

by Matthew Brost

[permalink] [raw]
Subject: Re: [PATCH drm-next v6 02/13] drm: manager to keep track of GPUs VA mappings

On Fri, Jul 07, 2023 at 02:52:41PM +0200, Boris Brezillon wrote:
> On Fri, 7 Jul 2023 14:41:23 +0200
> Danilo Krummrich <[email protected]> wrote:
>
> > >> + va__ && (va__->va.addr < (end__)) && \
> > >> + !list_entry_is_head(va__, &(mgr__)->rb.list, rb.entry); \
> > >> + va__ = list_next_entry(va__, rb.entry))
> > >
> > > If you define:
> > >
> > > static inline struct drm_gpuva *
> > > drm_gpuva_next(struct drm_gpuva *va)
> > > {
> > > if (va && !list_is_last(&va->rb.entry, &va->mgr->rb.list))
> > > return list_next_entry(va, rb.entry);
> > >
> > > return NULL;
> > > } >
> > > the for loop becomes a bit more readable:
> >
> > Yes, it would. However, I don't want it to be confused with
> > drm_gpuva_find_next(). Maybe I should rename the latter to something
> > like drm_gpuva_find_next_neighbor() then.
>
> If you want to keep drm_gpuva_find_next(), feel free to rename/prefix
> the drm_gpuva_next() function. I was just posting it as a reference.
>
> >
> > >
> > > for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__) - (start__)); \
> > > va__ && (va__->va.addr < (end__)); \
> > > va__ = drm_gpuva_next(va__))
> > >
> > >> +
> > >> +/**
> > >> + * drm_gpuva_for_each_va_range_safe - iternator to safely walk over a range of
> > >> + * &drm_gpuvas
> > >> + * @va__: &drm_gpuva to assign to in each iteration step
> > >> + * @next__: another &drm_gpuva to use as temporary storage
> > >> + * @mgr__: &drm_gpuva_manager to walk over
> > >> + * @start__: starting offset, the first gpuva will overlap this
> > >> + * @end__: ending offset, the last gpuva will start before this (but may
> > >> + * overlap)
> > >> + *
> > >> + * This iterator walks over all &drm_gpuvas in the &drm_gpuva_manager that lie
> > >> + * between @start__ and @end__. It is implemented similarly to
> > >> + * list_for_each_safe(), but is using the &drm_gpuva_manager's internal interval
> > >> + * tree to accelerate the search for the starting &drm_gpuva, and hence is safe
> > >> + * against removal of elements. It assumes that @end__ is within (or is the
> > >> + * upper limit of) the &drm_gpuva_manager. This iterator does not skip over the
> > >> + * &drm_gpuva_manager's @kernel_alloc_node.
> > >> + */
> > >> +#define drm_gpuva_for_each_va_range_safe(va__, next__, mgr__, start__, end__) \
> > >> + for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__)), \
> > >> + next__ = va ? list_next_entry(va__, rb.entry) : NULL; \
> > >> + va__ && (va__->va.addr < (end__)) && \
> > >> + !list_entry_is_head(va__, &(mgr__)->rb.list, rb.entry); \
> > >> + va__ = next__, next__ = list_next_entry(va__, rb.entry))
> > >
> > > And this is the safe version using the drm_gpuva_next() helper:
> > >
> > > for (va__ = drm_gpuva_find_first((mgr__), (start__), (end__) - (start__)), \
> > > next__ = drm_gpuva_next(va__); \
> > > va__ && (va__->va.addr < (end__)); \
> > > va__ = next__, next__ = drm_gpuva_next(va__))
> > >
> > > Those changes fixed an invalid pointer access I had in the sm_unmap()
> > > path.
> > >
> >
> > Sorry you did run into this bug.
>
> No worries, that's what testing/debugging/reviewing is for. And I'm glad
> someone decided to work on this gpuva stuff so I don't have to code it
> myself, so that's the least I can do.

With Boris's changes this version works in Xe.

With that:

Acked-by: Matthew Brost <[email protected]>
Tested-by: Matthew Brost <[email protected]>