2019-11-08 20:21:34

by Eddie James

Subject: [PATCH 00/12] Aspeed: Add SCU interrupt controller and XDMA engine drivers

This series first adds a driver to control the interrupt controller provided by
the System Control Unit (SCU) on the AST2500 and AST2600 SoCs. The interrupts
made available are necessary for controlling the XDMA engine embedded in the
same Aspeed SoCs.

This series then adds a driver to control the XDMA engine. This driver was
previously sent to the list without support for the AST2600, and has been
refactored significantly to enable that support. The XDMA engine performs
automatic DMA operations between the Aspeed SoC (acting as a BMC) and a host
processor.

Eddie James (12):
dt-bindings: interrupt-controller: Add Aspeed SCU interrupt controller
irqchip: Add Aspeed SCU interrupt controller
ARM: dts: aspeed: ast2500: Add SCU interrupt controller
ARM: dts: aspeed: ast2600: Add SCU interrupt controllers
dt-bindings: soc: Add Aspeed XDMA Engine
drivers/soc: Add Aspeed XDMA Engine Driver
drivers/soc: xdma: Add user interface
ARM: dts: aspeed: ast2500: Add XDMA Engine
ARM: dts: aspeed: ast2600: Add XDMA Engine
ARM: dts: aspeed: witherspoon: Enable XDMA Engine
ARM: dts: aspeed: rainier: Enable XDMA engine
ARM: dts: aspeed: tacoma: Enable XDMA engine

.../interrupt-controller/aspeed,ast2xxx-scu-ic.txt | 26 +
.../devicetree/bindings/soc/aspeed/xdma.txt | 24 +
MAINTAINERS | 16 +
arch/arm/boot/dts/aspeed-bmc-ibm-rainier.dts | 4 +
arch/arm/boot/dts/aspeed-bmc-opp-tacoma.dts | 4 +
arch/arm/boot/dts/aspeed-bmc-opp-witherspoon.dts | 4 +
arch/arm/boot/dts/aspeed-g5.dtsi | 21 +-
arch/arm/boot/dts/aspeed-g6.dtsi | 29 +
drivers/irqchip/Makefile | 2 +-
drivers/irqchip/irq-aspeed-scu-ic.c | 233 +++++
drivers/soc/aspeed/Kconfig | 8 +
drivers/soc/aspeed/Makefile | 1 +
drivers/soc/aspeed/aspeed-xdma.c | 1079 ++++++++++++++++++++
.../interrupt-controller/aspeed-scu-ic.h | 23 +
include/uapi/linux/aspeed-xdma.h | 49 +
15 files changed, 1521 insertions(+), 2 deletions(-)
create mode 100644 Documentation/devicetree/bindings/interrupt-controller/aspeed,ast2xxx-scu-ic.txt
create mode 100644 Documentation/devicetree/bindings/soc/aspeed/xdma.txt
create mode 100644 drivers/irqchip/irq-aspeed-scu-ic.c
create mode 100644 drivers/soc/aspeed/aspeed-xdma.c
create mode 100644 include/dt-bindings/interrupt-controller/aspeed-scu-ic.h
create mode 100644 include/uapi/linux/aspeed-xdma.h

--
1.8.3.1


2019-11-08 20:23:20

by Eddie James

Subject: [PATCH 09/12] ARM: dts: aspeed: ast2600: Add XDMA Engine

Add a node for the XDMA engine with all the necessary information.

Signed-off-by: Eddie James <[email protected]>
---
arch/arm/boot/dts/aspeed-g6.dtsi | 11 +++++++++++
1 file changed, 11 insertions(+)

diff --git a/arch/arm/boot/dts/aspeed-g6.dtsi b/arch/arm/boot/dts/aspeed-g6.dtsi
index 65ea2b2..61bd717 100644
--- a/arch/arm/boot/dts/aspeed-g6.dtsi
+++ b/arch/arm/boot/dts/aspeed-g6.dtsi
@@ -3,6 +3,7 @@

#include <dt-bindings/interrupt-controller/arm-gic.h>
#include <dt-bindings/clock/ast2600-clock.h>
+#include <dt-bindings/interrupt-controller/aspeed-scu-ic.h>

/ {
model = "Aspeed BMC";
@@ -315,6 +316,16 @@
quality = <100>;
};

+ xdma: xdma@1e6e7000 {
+ compatible = "aspeed,ast2600-xdma";
+ reg = <0x1e6e7000 0x100>;
+ clocks = <&syscon ASPEED_CLK_GATE_BCLK>;
+ resets = <&syscon ASPEED_RESET_DEV_XDMA>;
+ interrupts-extended = <&gic GIC_SPI 6 IRQ_TYPE_LEVEL_HIGH>,
+ <&scu_ic0 ASPEED_AST2600_SCU_IC0_PCIE_PERST_LO_TO_HI>;
+ status = "disabled";
+ };
+
gpio0: gpio@1e780000 {
#gpio-cells = <2>;
gpio-controller;
--
1.8.3.1

2019-11-08 20:24:10

by Eddie James

Subject: [PATCH 03/12] ARM: dts: aspeed: ast2500: Add SCU interrupt controller

Add a node for the interrupt controller provided by the SCU.

Signed-off-by: Eddie James <[email protected]>
---
arch/arm/boot/dts/aspeed-g5.dtsi | 11 ++++++++++-
1 file changed, 10 insertions(+), 1 deletion(-)

diff --git a/arch/arm/boot/dts/aspeed-g5.dtsi b/arch/arm/boot/dts/aspeed-g5.dtsi
index a259c63..4579c78 100644
--- a/arch/arm/boot/dts/aspeed-g5.dtsi
+++ b/arch/arm/boot/dts/aspeed-g5.dtsi
@@ -216,8 +216,9 @@
syscon: syscon@1e6e2000 {
compatible = "aspeed,ast2500-scu", "syscon", "simple-mfd";
reg = <0x1e6e2000 0x1a8>;
+ ranges = <0 0x1e6e2000 0x1a8>;
#address-cells = <1>;
- #size-cells = <0>;
+ #size-cells = <1>;
#clock-cells = <1>;
#reset-cells = <1>;

@@ -231,6 +232,14 @@
compatible = "aspeed,ast2500-p2a-ctrl";
status = "disabled";
};
+
+ scu_ic: interrupt-controller@18 {
+ #interrupt-cells = <1>;
+ compatible = "aspeed,ast2500-scu-ic";
+ reg = <0x18 0x04>;
+ interrupts = <21>;
+ interrupt-controller;
+ };
};

rng: hwrng@1e6e2078 {
--
1.8.3.1

2019-11-08 20:24:38

by Eddie James

Subject: [PATCH 07/12] drivers/soc: xdma: Add user interface

This commit adds a miscdevice to provide a user interface to the XDMA
engine. The interface provides the write operation to start DMA
operations. The DMA parameters are passed as the data to the write call.
The actual data to transfer is NOT passed through write. Note that both
directions of DMA operation are accomplished through the write command:
BMC to host and host to BMC.

The XDMA engine is restricted to only accessing the reserved memory
space on the AST2500, typically used by the VGA. For this reason, the
VGA memory space is pooled and allocated with genalloc. Users calling
mmap allocate pages from this pool for their usage. The space allocated
by a client will be the space used in the DMA operation. For an
"upstream" (BMC to host) operation, the data in the client's area will
be transferred to the host. For a "downstream" (host to BMC) operation,
the host data will be placed in the client's memory area.

Poll is also provided in order to determine when the DMA operation is
complete for non-blocking IO.

Signed-off-by: Eddie James <[email protected]>
---
drivers/soc/aspeed/aspeed-xdma.c | 223 +++++++++++++++++++++++++++++++++++++++
1 file changed, 223 insertions(+)

diff --git a/drivers/soc/aspeed/aspeed-xdma.c b/drivers/soc/aspeed/aspeed-xdma.c
index 99041a6..3d37582 100644
--- a/drivers/soc/aspeed/aspeed-xdma.c
+++ b/drivers/soc/aspeed/aspeed-xdma.c
@@ -64,6 +64,9 @@
#define XDMA_CMDQ_SIZE PAGE_SIZE
#define XDMA_NUM_CMDS \
(XDMA_CMDQ_SIZE / sizeof(struct aspeed_xdma_cmd))
+#define XDMA_OP_SIZE_MAX sizeof(struct aspeed_xdma_op)
+#define XDMA_OP_SIZE_MIN \
+ (sizeof(struct aspeed_xdma_op) - sizeof(u64))

/* Aspeed specification requires 10ms after switching the reset line */
#define XDMA_RESET_TIME_MS 10
@@ -216,6 +219,7 @@ struct aspeed_xdma {
bool in_reset;
bool upstream;
unsigned int cmd_idx;
+ struct mutex file_lock;
struct mutex start_lock;
struct delayed_work reset_work;
spinlock_t client_lock;
@@ -230,6 +234,8 @@ struct aspeed_xdma {
dma_addr_t cmdq_vga_phys;
void *cmdq_vga_virt;
struct gen_pool *vga_pool;
+
+ struct miscdevice misc;
};

struct aspeed_xdma_client {
@@ -557,6 +563,204 @@ static irqreturn_t aspeed_xdma_pcie_irq(int irq, void *arg)
return IRQ_HANDLED;
}

+static ssize_t aspeed_xdma_write(struct file *file, const char __user *buf,
+ size_t len, loff_t *offset)
+{
+ int rc;
+ struct aspeed_xdma_op op;
+ struct aspeed_xdma_client *client = file->private_data;
+ struct aspeed_xdma *ctx = client->ctx;
+ u32 offs = client->phys ? (client->phys - ctx->vga_phys) :
+ XDMA_CMDQ_SIZE;
+
+ if (len < XDMA_OP_SIZE_MIN)
+ return -EINVAL;
+
+ if (len > XDMA_OP_SIZE_MAX)
+ len = XDMA_OP_SIZE_MAX;
+
+ rc = copy_from_user(&op, buf, len);
+ if (rc)
+ return rc;
+
+ if (op.direction == ASPEED_XDMA_DIRECTION_RESET) {
+ mutex_lock(&ctx->start_lock);
+
+ if (aspeed_xdma_reset_start(ctx)) {
+ msleep(XDMA_RESET_TIME_MS);
+
+ aspeed_xdma_reset_finish(ctx);
+ }
+
+ mutex_unlock(&ctx->start_lock);
+
+ return len;
+ } else if (op.direction > ASPEED_XDMA_DIRECTION_RESET) {
+ return -EINVAL;
+ }
+
+ if (op.len > ctx->vga_size - offs)
+ return -EINVAL;
+
+ if (file->f_flags & O_NONBLOCK) {
+ if (!mutex_trylock(&ctx->file_lock))
+ return -EAGAIN;
+
+ if (ctx->in_progress || ctx->in_reset) {
+ mutex_unlock(&ctx->file_lock);
+ return -EAGAIN;
+ }
+ } else {
+ mutex_lock(&ctx->file_lock);
+
+ rc = wait_event_interruptible(ctx->wait, !ctx->in_progress &&
+ !ctx->in_reset);
+ if (rc) {
+ mutex_unlock(&ctx->file_lock);
+ return -EINTR;
+ }
+ }
+
+ aspeed_xdma_start(ctx, &op, ctx->vga_phys + offs, client);
+
+ mutex_unlock(&ctx->file_lock);
+
+ if (!(file->f_flags & O_NONBLOCK)) {
+ rc = wait_event_interruptible(ctx->wait, !ctx->in_progress);
+ if (rc)
+ return -EINTR;
+
+ if (client->error)
+ return -EIO;
+ }
+
+ return len;
+}
+
+static __poll_t aspeed_xdma_poll(struct file *file,
+ struct poll_table_struct *wait)
+{
+ __poll_t mask = 0;
+ __poll_t req = poll_requested_events(wait);
+ struct aspeed_xdma_client *client = file->private_data;
+ struct aspeed_xdma *ctx = client->ctx;
+
+ if (req & (EPOLLIN | EPOLLRDNORM)) {
+ if (client->in_progress)
+ poll_wait(file, &ctx->wait, wait);
+
+ if (!client->in_progress) {
+ if (client->error)
+ mask |= EPOLLERR;
+ else
+ mask |= EPOLLIN | EPOLLRDNORM;
+ }
+ }
+
+ if (req & (EPOLLOUT | EPOLLWRNORM)) {
+ if (ctx->in_progress)
+ poll_wait(file, &ctx->wait, wait);
+
+ if (!ctx->in_progress)
+ mask |= EPOLLOUT | EPOLLWRNORM;
+ }
+
+ return mask;
+}
+
+static void aspeed_xdma_vma_close(struct vm_area_struct *vma)
+{
+ struct aspeed_xdma_client *client = vma->vm_private_data;
+
+ gen_pool_free(client->ctx->vga_pool, (unsigned long)client->virt,
+ client->size);
+
+ client->virt = NULL;
+ client->phys = 0;
+ client->size = 0;
+}
+
+static const struct vm_operations_struct aspeed_xdma_vm_ops = {
+ .close = aspeed_xdma_vma_close,
+};
+
+static int aspeed_xdma_mmap(struct file *file, struct vm_area_struct *vma)
+{
+ int rc;
+ struct aspeed_xdma_client *client = file->private_data;
+ struct aspeed_xdma *ctx = client->ctx;
+
+ /* restrict file to one mapping */
+ if (client->size)
+ return -ENOMEM;
+
+ client->size = vma->vm_end - vma->vm_start;
+ client->virt = gen_pool_dma_alloc(ctx->vga_pool, client->size,
+ &client->phys);
+ if (!client->virt) {
+ client->phys = 0;
+ client->size = 0;
+ return -ENOMEM;
+ }
+
+ vma->vm_pgoff = (client->phys - ctx->vga_phys) >> PAGE_SHIFT;
+ vma->vm_ops = &aspeed_xdma_vm_ops;
+ vma->vm_private_data = client;
+ vma->vm_page_prot = pgprot_noncached(vma->vm_page_prot);
+
+ rc = io_remap_pfn_range(vma, vma->vm_start, client->phys >> PAGE_SHIFT,
+ client->size, vma->vm_page_prot);
+ if (rc) {
+ gen_pool_free(ctx->vga_pool, (unsigned long)client->virt,
+ client->size);
+
+ client->virt = NULL;
+ client->phys = 0;
+ client->size = 0;
+ return rc;
+ }
+
+ dev_dbg(ctx->dev, "mmap: v[%08lx] to p[%08x], s[%08x]\n",
+ vma->vm_start, (u32)client->phys, client->size);
+
+ return 0;
+}
+
+static int aspeed_xdma_open(struct inode *inode, struct file *file)
+{
+ struct miscdevice *misc = file->private_data;
+ struct aspeed_xdma *ctx = container_of(misc, struct aspeed_xdma, misc);
+ struct aspeed_xdma_client *client = kzalloc(sizeof(*client),
+ GFP_KERNEL);
+
+ if (!client)
+ return -ENOMEM;
+
+ client->ctx = ctx;
+ file->private_data = client;
+ return 0;
+}
+
+static int aspeed_xdma_release(struct inode *inode, struct file *file)
+{
+ struct aspeed_xdma_client *client = file->private_data;
+
+ if (client->ctx->current_client == client)
+ client->ctx->current_client = NULL;
+
+ kfree(client);
+ return 0;
+}
+
+static const struct file_operations aspeed_xdma_fops = {
+ .owner = THIS_MODULE,
+ .write = aspeed_xdma_write,
+ .poll = aspeed_xdma_poll,
+ .mmap = aspeed_xdma_mmap,
+ .open = aspeed_xdma_open,
+ .release = aspeed_xdma_release,
+};
+
static int aspeed_xdma_init(struct aspeed_xdma *ctx)
{
int rc;
@@ -739,6 +943,7 @@ static int aspeed_xdma_probe(struct platform_device *pdev)

ctx->dev = dev;
platform_set_drvdata(pdev, ctx);
+ mutex_init(&ctx->file_lock);
mutex_init(&ctx->start_lock);
INIT_DELAYED_WORK(&ctx->reset_work, aspeed_xdma_reset_work);
spin_lock_init(&ctx->client_lock);
@@ -797,6 +1002,23 @@ static int aspeed_xdma_probe(struct platform_device *pdev)

aspeed_xdma_init_eng(ctx);

+ ctx->misc.minor = MISC_DYNAMIC_MINOR;
+ ctx->misc.fops = &aspeed_xdma_fops;
+ ctx->misc.name = "aspeed-xdma";
+ ctx->misc.parent = dev;
+ rc = misc_register(&ctx->misc);
+ if (rc) {
+ dev_err(dev, "Unable to register xdma miscdevice.\n");
+
+ gen_pool_free(ctx->vga_pool, (unsigned long)ctx->cmdq_vga_virt,
+ XDMA_CMDQ_SIZE);
+ iounmap(ctx->vga_virt);
+
+ reset_control_assert(ctx->reset);
+ clk_disable_unprepare(ctx->clock);
+ return rc;
+ }
+
/*
* This interrupt could fire immediately so only request it once the
* engine and driver are initialized.
@@ -818,6 +1040,7 @@ static int aspeed_xdma_remove(struct platform_device *pdev)
{
struct aspeed_xdma *ctx = platform_get_drvdata(pdev);

+ misc_deregister(&ctx->misc);
gen_pool_free(ctx->vga_pool, (unsigned long)ctx->cmdq_vga_virt,
XDMA_CMDQ_SIZE);
iounmap(ctx->vga_virt);
--
1.8.3.1

2019-11-08 20:24:47

by Eddie James

Subject: [PATCH 11/12] ARM: dts: aspeed: rainier: Enable XDMA engine

Enable the XDMA engine node.

Signed-off-by: Eddie James <[email protected]>
---
arch/arm/boot/dts/aspeed-bmc-ibm-rainier.dts | 4 ++++
1 file changed, 4 insertions(+)

diff --git a/arch/arm/boot/dts/aspeed-bmc-ibm-rainier.dts b/arch/arm/boot/dts/aspeed-bmc-ibm-rainier.dts
index c1c9cd3..2cd53d9 100644
--- a/arch/arm/boot/dts/aspeed-bmc-ibm-rainier.dts
+++ b/arch/arm/boot/dts/aspeed-bmc-ibm-rainier.dts
@@ -970,3 +970,7 @@
spi-max-frequency = <100000000>;
};
};
+
+&xdma {
+ status = "okay";
+};
--
1.8.3.1

2019-11-25 00:00:19

by Andrew Jeffery

Subject: Re: [PATCH 07/12] drivers/soc: xdma: Add user interface



On Sat, 9 Nov 2019, at 06:48, Eddie James wrote:
> This commit adds a miscdevice to provide a user interface to the XDMA
> engine. The interface provides the write operation to start DMA
> operations. The DMA parameters are passed as the data to the write call.
> The actual data to transfer is NOT passed through write. Note that both
> directions of DMA operation are accomplished through the write command;
> BMC to host and host to BMC.
>
> The XDMA engine is restricted to only accessing the reserved memory
> space on the AST2500, typically used by the VGA. For this reason, the
> VGA memory space is pooled and allocated with genalloc. Users calling
> mmap allocate pages from this pool for their usage. The space allocated
> by a client will be the space used in the DMA operation. For an
> "upstream" (BMC to host) operation, the data in the client's area will
> be transferred to the host. For a "downstream" (host to BMC) operation,
> the host data will be placed in the client's memory area.
>
> Poll is also provided in order to determine when the DMA operation is
> complete for non-blocking IO.
>
> Signed-off-by: Eddie James <[email protected]>
> ---
> drivers/soc/aspeed/aspeed-xdma.c | 223 +++++++++++++++++++++++++++++++++++++++
> 1 file changed, 223 insertions(+)
>
> diff --git a/drivers/soc/aspeed/aspeed-xdma.c b/drivers/soc/aspeed/aspeed-xdma.c
> index 99041a6..3d37582 100644
> --- a/drivers/soc/aspeed/aspeed-xdma.c
> +++ b/drivers/soc/aspeed/aspeed-xdma.c
> @@ -64,6 +64,9 @@
> #define XDMA_CMDQ_SIZE PAGE_SIZE
> #define XDMA_NUM_CMDS \
> (XDMA_CMDQ_SIZE / sizeof(struct aspeed_xdma_cmd))
> +#define XDMA_OP_SIZE_MAX sizeof(struct aspeed_xdma_op)
> +#define XDMA_OP_SIZE_MIN \
> + (sizeof(struct aspeed_xdma_op) - sizeof(u64))
>
> /* Aspeed specification requires 10ms after switching the reset line */
> #define XDMA_RESET_TIME_MS 10
> @@ -216,6 +219,7 @@ struct aspeed_xdma {
> bool in_reset;
> bool upstream;
> unsigned int cmd_idx;
> + struct mutex file_lock;

Please add documentation about what data file_lock is protecting.

> struct mutex start_lock;
> struct delayed_work reset_work;
> spinlock_t client_lock;
> @@ -230,6 +234,8 @@ struct aspeed_xdma {
> dma_addr_t cmdq_vga_phys;
> void *cmdq_vga_virt;
> struct gen_pool *vga_pool;
> +
> + struct miscdevice misc;
> };
>
> struct aspeed_xdma_client {
> @@ -557,6 +563,204 @@ static irqreturn_t aspeed_xdma_pcie_irq(int irq,
> void *arg)
> return IRQ_HANDLED;
> }
>
> +static ssize_t aspeed_xdma_write(struct file *file, const char __user *buf,
> + size_t len, loff_t *offset)
> +{
> + int rc;
> + struct aspeed_xdma_op op;
> + struct aspeed_xdma_client *client = file->private_data;
> + struct aspeed_xdma *ctx = client->ctx;
> + u32 offs = client->phys ? (client->phys - ctx->vga_phys) :
> + XDMA_CMDQ_SIZE;
> +
> + if (len < XDMA_OP_SIZE_MIN)
> + return -EINVAL;
> +
> + if (len > XDMA_OP_SIZE_MAX)
> + len = XDMA_OP_SIZE_MAX;

Isn't this an EINVAL case as well?

> +
> + rc = copy_from_user(&op, buf, len);
> + if (rc)
> + return rc;
> +
> + if (op.direction == ASPEED_XDMA_DIRECTION_RESET) {

Seems a bit abusive to use the direction field to issue a reset.

> + mutex_lock(&ctx->start_lock);
> +
> + if (aspeed_xdma_reset_start(ctx)) {
> + msleep(XDMA_RESET_TIME_MS);
> +
> + aspeed_xdma_reset_finish(ctx);
> + }
> +
> + mutex_unlock(&ctx->start_lock);
> +
> + return len;
> + } else if (op.direction > ASPEED_XDMA_DIRECTION_RESET) {
> + return -EINVAL;
> + }
> +
> + if (op.len > ctx->vga_size - offs)
> + return -EINVAL;
> +
> + if (file->f_flags & O_NONBLOCK) {
> + if (!mutex_trylock(&ctx->file_lock))
> + return -EAGAIN;
> +
> + if (ctx->in_progress || ctx->in_reset) {

ctx->in_progress was protected by a lock that isn't file_lock, so this looks wrong.

> + mutex_unlock(&ctx->file_lock);
> + return -EAGAIN;
> + }
> + } else {
> + mutex_lock(&ctx->file_lock);
> +
> + rc = wait_event_interruptible(ctx->wait, !ctx->in_progress &&
> + !ctx->in_reset);

As above.

> + if (rc) {
> + mutex_unlock(&ctx->file_lock);
> + return -EINTR;
> + }
> + }
> +
> + aspeed_xdma_start(ctx, &op, ctx->vga_phys + offs, client);
> +
> + mutex_unlock(&ctx->file_lock);
> +
> + if (!(file->f_flags & O_NONBLOCK)) {
> + rc = wait_event_interruptible(ctx->wait, !ctx->in_progress);
> + if (rc)
> + return -EINTR;
> +
> + if (client->error)
> + return -EIO;

What's the client->error value? Can it be more informative?

> + }
> +
> + return len;

We've potentially truncated len above (in the len > XDMA_OP_SIZE_MAX
case), which leads to some ambiguity with the write() syscall given that
it can potentially return less than the requested length. This is one
such case, but the caller probably shouldn't attempt a follow-up write.

This would go away if we make len > XDMA_OP_SIZE_MAX an EINVAL case
as suggested above.

> +}
> +
> +static __poll_t aspeed_xdma_poll(struct file *file,
> + struct poll_table_struct *wait)
> +{
> + __poll_t mask = 0;
> + __poll_t req = poll_requested_events(wait);
> + struct aspeed_xdma_client *client = file->private_data;
> + struct aspeed_xdma *ctx = client->ctx;
> +
> + if (req & (EPOLLIN | EPOLLRDNORM)) {
> + if (client->in_progress)
> + poll_wait(file, &ctx->wait, wait);
> +
> + if (!client->in_progress) {
> + if (client->error)
> + mask |= EPOLLERR;
> + else
> + mask |= EPOLLIN | EPOLLRDNORM;
> + }
> + }
> +
> + if (req & (EPOLLOUT | EPOLLWRNORM)) {
> + if (ctx->in_progress)
> + poll_wait(file, &ctx->wait, wait);
> +
> + if (!ctx->in_progress)
> + mask |= EPOLLOUT | EPOLLWRNORM;
> + }
> +
> + return mask;
> +}
> +
> +static void aspeed_xdma_vma_close(struct vm_area_struct *vma)
> +{
> + struct aspeed_xdma_client *client = vma->vm_private_data;
> +
> + gen_pool_free(client->ctx->vga_pool, (unsigned long)client->virt,
> + client->size);
> +
> + client->virt = NULL;
> + client->phys = 0;
> + client->size = 0;
> +}
> +
> +static const struct vm_operations_struct aspeed_xdma_vm_ops = {
> + .close = aspeed_xdma_vma_close,
> +};
> +
> +static int aspeed_xdma_mmap(struct file *file, struct vm_area_struct *vma)
> +{
> + int rc;
> + struct aspeed_xdma_client *client = file->private_data;
> + struct aspeed_xdma *ctx = client->ctx;
> +
> + /* restrict file to one mapping */
> + if (client->size)
> + return -ENOMEM;

Can we do better with the error code here?

> +
> + client->size = vma->vm_end - vma->vm_start;
> + client->virt = gen_pool_dma_alloc(ctx->vga_pool, client->size,
> + &client->phys);
> + if (!client->virt) {
> + client->phys = 0;
> + client->size = 0;
> + return -ENOMEM;
> + }
> +
> + vma->vm_pgoff = (client->phys - ctx->vga_phys) >> PAGE_SHIFT;

Where does client->phys get set?

Andrew

2019-11-25 19:46:59

by Eddie James

Subject: Re: [PATCH 07/12] drivers/soc: xdma: Add user interface


On 11/24/19 5:59 PM, Andrew Jeffery wrote:
>
> On Sat, 9 Nov 2019, at 06:48, Eddie James wrote:
>> This commit adds a miscdevice to provide a user interface to the XDMA
>> engine. The interface provides the write operation to start DMA
>> operations. The DMA parameters are passed as the data to the write call.
>> The actual data to transfer is NOT passed through write. Note that both
>> directions of DMA operation are accomplished through the write command;
>> BMC to host and host to BMC.
>>
>> The XDMA engine is restricted to only accessing the reserved memory
>> space on the AST2500, typically used by the VGA. For this reason, the
>> VGA memory space is pooled and allocated with genalloc. Users calling
>> mmap allocate pages from this pool for their usage. The space allocated
>> by a client will be the space used in the DMA operation. For an
>> "upstream" (BMC to host) operation, the data in the client's area will
>> be transferred to the host. For a "downstream" (host to BMC) operation,
>> the host data will be placed in the client's memory area.
>>
>> Poll is also provided in order to determine when the DMA operation is
>> complete for non-blocking IO.
>>
>> Signed-off-by: Eddie James <[email protected]>
>> ---
>> drivers/soc/aspeed/aspeed-xdma.c | 223 +++++++++++++++++++++++++++++++++++++++
>> 1 file changed, 223 insertions(+)
>>
>> diff --git a/drivers/soc/aspeed/aspeed-xdma.c b/drivers/soc/aspeed/aspeed-xdma.c
>> index 99041a6..3d37582 100644
>> --- a/drivers/soc/aspeed/aspeed-xdma.c
>> +++ b/drivers/soc/aspeed/aspeed-xdma.c
>> @@ -64,6 +64,9 @@
>> #define XDMA_CMDQ_SIZE PAGE_SIZE
>> #define XDMA_NUM_CMDS \
>> (XDMA_CMDQ_SIZE / sizeof(struct aspeed_xdma_cmd))
>> +#define XDMA_OP_SIZE_MAX sizeof(struct aspeed_xdma_op)
>> +#define XDMA_OP_SIZE_MIN \
>> + (sizeof(struct aspeed_xdma_op) - sizeof(u64))
>>
>> /* Aspeed specification requires 10ms after switching the reset line */
>> #define XDMA_RESET_TIME_MS 10
>> @@ -216,6 +219,7 @@ struct aspeed_xdma {
>> bool in_reset;
>> bool upstream;
>> unsigned int cmd_idx;
>> + struct mutex file_lock;
> Please add documentation about what data file_lock is protecting.
>
>> struct mutex start_lock;
>> struct delayed_work reset_work;
>> spinlock_t client_lock;
>> @@ -230,6 +234,8 @@ struct aspeed_xdma {
>> dma_addr_t cmdq_vga_phys;
>> void *cmdq_vga_virt;
>> struct gen_pool *vga_pool;
>> +
>> + struct miscdevice misc;
>> };
>>
>> struct aspeed_xdma_client {
>> @@ -557,6 +563,204 @@ static irqreturn_t aspeed_xdma_pcie_irq(int irq,
>> void *arg)
>> return IRQ_HANDLED;
>> }
>>
>> +static ssize_t aspeed_xdma_write(struct file *file, const char __user *buf,
>> + size_t len, loff_t *offset)
>> +{
>> + int rc;
>> + struct aspeed_xdma_op op;
>> + struct aspeed_xdma_client *client = file->private_data;
>> + struct aspeed_xdma *ctx = client->ctx;
>> + u32 offs = client->phys ? (client->phys - ctx->vga_phys) :
>> + XDMA_CMDQ_SIZE;
>> +
>> + if (len < XDMA_OP_SIZE_MIN)
>> + return -EINVAL;
>> +
>> + if (len > XDMA_OP_SIZE_MAX)
>> + len = XDMA_OP_SIZE_MAX;
> Isn't this an EINVAL case as well?


Perhaps so.


>
>> +
>> + rc = copy_from_user(&op, buf, len);
>> + if (rc)
>> + return rc;
>> +
>> + if (op.direction == ASPEED_XDMA_DIRECTION_RESET) {
> Seems a bit abusive to use the direction field to issue a reset.


What would you recommend instead?


>
>> + mutex_lock(&ctx->start_lock);
>> +
>> + if (aspeed_xdma_reset_start(ctx)) {
>> + msleep(XDMA_RESET_TIME_MS);
>> +
>> + aspeed_xdma_reset_finish(ctx);
>> + }
>> +
>> + mutex_unlock(&ctx->start_lock);
>> +
>> + return len;
>> + } else if (op.direction > ASPEED_XDMA_DIRECTION_RESET) {
>> + return -EINVAL;
>> + }
>> +
>> + if (op.len > ctx->vga_size - offs)
>> + return -EINVAL;
>> +
>> + if (file->f_flags & O_NONBLOCK) {
>> + if (!mutex_trylock(&ctx->file_lock))
>> + return -EAGAIN;
>> +
>> + if (ctx->in_progress || ctx->in_reset) {
> ctx->in_progress was protected by a lock that isn't file_lock, so this looks wrong.


file_lock isn't protecting in_progress. It's protecting access to the
whole engine while a transfer is in progress. in_progress isn't
protected at all; it's just better to lock before waiting for
in_progress so that multiple clients don't all see in_progress go false
and then have to wait for a mutex (particularly in the nonblocking case).


>
>> + mutex_unlock(&ctx->file_lock);
>> + return -EAGAIN;
>> + }
>> + } else {
>> + mutex_lock(&ctx->file_lock);
>> +
>> + rc = wait_event_interruptible(ctx->wait, !ctx->in_progress &&
>> + !ctx->in_reset);
> As above.
>
>> + if (rc) {
>> + mutex_unlock(&ctx->file_lock);
>> + return -EINTR;
>> + }
>> + }
>> +
>> + aspeed_xdma_start(ctx, &op, ctx->vga_phys + offs, client);
>> +
>> + mutex_unlock(&ctx->file_lock);
>> +
>> + if (!(file->f_flags & O_NONBLOCK)) {
>> + rc = wait_event_interruptible(ctx->wait, !ctx->in_progress);
>> + if (rc)
>> + return -EINTR;
>> +
>> + if (client->error)
>> + return -EIO;
> What's the client->error value? Can it be more informative?


Not really. There isn't much error information available. Basically the
only way to get an error is if the engine is reset (user- or
PCIe-initiated) while the transfer is ongoing.


>
>> + }
>> +
>> + return len;
> We've potentially truncated len above (in the len > XDMA_OP_SIZE_MAX),
> which leads to some ambiguity with the write() syscall given that it can
> potentially return less than the requested length. This is one such case, but
> the caller probably shouldn't attempt a follow-up write.
>
> This would go away if we make the len > XDMA_OP_SIZE_MAX an EINVAL
> case as suggested above.


Sure.


>
>> +}
>> +
>> +static __poll_t aspeed_xdma_poll(struct file *file,
>> + struct poll_table_struct *wait)
>> +{
>> + __poll_t mask = 0;
>> + __poll_t req = poll_requested_events(wait);
>> + struct aspeed_xdma_client *client = file->private_data;
>> + struct aspeed_xdma *ctx = client->ctx;
>> +
>> + if (req & (EPOLLIN | EPOLLRDNORM)) {
>> + if (client->in_progress)
>> + poll_wait(file, &ctx->wait, wait);
>> +
>> + if (!client->in_progress) {
>> + if (client->error)
>> + mask |= EPOLLERR;
>> + else
>> + mask |= EPOLLIN | EPOLLRDNORM;
>> + }
>> + }
>> +
>> + if (req & (EPOLLOUT | EPOLLWRNORM)) {
>> + if (ctx->in_progress)
>> + poll_wait(file, &ctx->wait, wait);
>> +
>> + if (!ctx->in_progress)
>> + mask |= EPOLLOUT | EPOLLWRNORM;
>> + }
>> +
>> + return mask;
>> +}
>> +
>> +static void aspeed_xdma_vma_close(struct vm_area_struct *vma)
>> +{
>> + struct aspeed_xdma_client *client = vma->vm_private_data;
>> +
>> + gen_pool_free(client->ctx->vga_pool, (unsigned long)client->virt,
>> + client->size);
>> +
>> + client->virt = NULL;
>> + client->phys = 0;
>> + client->size = 0;
>> +}
>> +
>> +static const struct vm_operations_struct aspeed_xdma_vm_ops = {
>> + .close = aspeed_xdma_vma_close,
>> +};
>> +
>> +static int aspeed_xdma_mmap(struct file *file, struct vm_area_struct *vma)
>> +{
>> + int rc;
>> + struct aspeed_xdma_client *client = file->private_data;
>> + struct aspeed_xdma *ctx = client->ctx;
>> +
>> + /* restrict file to one mapping */
>> + if (client->size)
>> + return -ENOMEM;
> Can we do better with the error code here?


Maybe? I'm open to suggestions...


>
>> +
>> + client->size = vma->vm_end - vma->vm_start;
>> + client->virt = gen_pool_dma_alloc(ctx->vga_pool, client->size,
>> + &client->phys);
>> + if (!client->virt) {
>> + client->phys = 0;
>> + client->size = 0;
>> + return -ENOMEM;
>> + }
>> +
>> + vma->vm_pgoff = (client->phys - ctx->vga_phys) >> PAGE_SHIFT;
> Where does client->phys get set?


gen_pool_dma_alloc sets it.


Thanks for the review!

Eddie



>
> Andrew

2019-11-26 04:56:11

by Andrew Jeffery

Subject: Re: [PATCH 07/12] drivers/soc: xdma: Add user interface



On Tue, 26 Nov 2019, at 06:14, Eddie James wrote:
>
> On 11/24/19 5:59 PM, Andrew Jeffery wrote:
> >
> > On Sat, 9 Nov 2019, at 06:48, Eddie James wrote:
> >> This commit adds a miscdevice to provide a user interface to the XDMA
> >> engine. The interface provides the write operation to start DMA
> >> operations. The DMA parameters are passed as the data to the write call.
> >> The actual data to transfer is NOT passed through write. Note that both
> >> directions of DMA operation are accomplished through the write command;
> >> BMC to host and host to BMC.
> >>
> >> The XDMA engine is restricted to only accessing the reserved memory
> >> space on the AST2500, typically used by the VGA. For this reason, the
> >> VGA memory space is pooled and allocated with genalloc. Users calling
> >> mmap allocate pages from this pool for their usage. The space allocated
> >> by a client will be the space used in the DMA operation. For an
> >> "upstream" (BMC to host) operation, the data in the client's area will
> >> be transferred to the host. For a "downstream" (host to BMC) operation,
> >> the host data will be placed in the client's memory area.
> >>
> >> Poll is also provided in order to determine when the DMA operation is
> >> complete for non-blocking IO.
> >>
> >> Signed-off-by: Eddie James <[email protected]>
> >> ---
> >> drivers/soc/aspeed/aspeed-xdma.c | 223 +++++++++++++++++++++++++++++++++++++++
> >> 1 file changed, 223 insertions(+)
> >>
> >> diff --git a/drivers/soc/aspeed/aspeed-xdma.c b/drivers/soc/aspeed/aspeed-xdma.c
> >> index 99041a6..3d37582 100644
> >> --- a/drivers/soc/aspeed/aspeed-xdma.c
> >> +++ b/drivers/soc/aspeed/aspeed-xdma.c
> >> @@ -64,6 +64,9 @@
> >> #define XDMA_CMDQ_SIZE PAGE_SIZE
> >> #define XDMA_NUM_CMDS \
> >> (XDMA_CMDQ_SIZE / sizeof(struct aspeed_xdma_cmd))
> >> +#define XDMA_OP_SIZE_MAX sizeof(struct aspeed_xdma_op)
> >> +#define XDMA_OP_SIZE_MIN \
> >> + (sizeof(struct aspeed_xdma_op) - sizeof(u64))
> >>
> >> /* Aspeed specification requires 10ms after switching the reset line */
> >> #define XDMA_RESET_TIME_MS 10
> >> @@ -216,6 +219,7 @@ struct aspeed_xdma {
> >> bool in_reset;
> >> bool upstream;
> >> unsigned int cmd_idx;
> >> + struct mutex file_lock;
> > Please add documentation about what data file_lock is protecting.
> >
> >> struct mutex start_lock;
> >> struct delayed_work reset_work;
> >> spinlock_t client_lock;
> >> @@ -230,6 +234,8 @@ struct aspeed_xdma {
> >> dma_addr_t cmdq_vga_phys;
> >> void *cmdq_vga_virt;
> >> struct gen_pool *vga_pool;
> >> +
> >> + struct miscdevice misc;
> >> };
> >>
> >> struct aspeed_xdma_client {
> >> @@ -557,6 +563,204 @@ static irqreturn_t aspeed_xdma_pcie_irq(int irq,
> >> void *arg)
> >> return IRQ_HANDLED;
> >> }
> >>
> >> +static ssize_t aspeed_xdma_write(struct file *file, const char __user *buf,
> >> + size_t len, loff_t *offset)
> >> +{
> >> + int rc;
> >> + struct aspeed_xdma_op op;
> >> + struct aspeed_xdma_client *client = file->private_data;
> >> + struct aspeed_xdma *ctx = client->ctx;
> >> + u32 offs = client->phys ? (client->phys - ctx->vga_phys) :
> >> + XDMA_CMDQ_SIZE;
> >> +
> >> + if (len < XDMA_OP_SIZE_MIN)
> >> + return -EINVAL;
> >> +
> >> + if (len > XDMA_OP_SIZE_MAX)
> >> + len = XDMA_OP_SIZE_MAX;
> > Isn't this an EINVAL case as well?
>
>
> Perhaps so.
>
>
> >
> >> +
> >> + rc = copy_from_user(&op, buf, len);
> >> + if (rc)
> >> + return rc;
> >> +
> >> + if (op.direction == ASPEED_XDMA_DIRECTION_RESET) {
> > Seems a bit abusive to use the direction field to issue a reset.
>
>
> What would you recommend instead?

Looks like an ioctl() to me. But what need do we have to reset the
device directly? Could we achieve the same by rebinding the driver if
necessary? We should only need to reset it if the driver has bugs, or
are there errata that we need to deal with? Userspace shouldn't be
handling that though?

>
>
> >
> >> + mutex_lock(&ctx->start_lock);
> >> +
> >> + if (aspeed_xdma_reset_start(ctx)) {
> >> + msleep(XDMA_RESET_TIME_MS);
> >> +
> >> + aspeed_xdma_reset_finish(ctx);
> >> + }
> >> +
> >> + mutex_unlock(&ctx->start_lock);
> >> +
> >> + return len;
> >> + } else if (op.direction > ASPEED_XDMA_DIRECTION_RESET) {
> >> + return -EINVAL;
> >> + }
> >> +
> >> + if (op.len > ctx->vga_size - offs)
> >> + return -EINVAL;
> >> +
> >> + if (file->f_flags & O_NONBLOCK) {
> >> + if (!mutex_trylock(&ctx->file_lock))
> >> + return -EAGAIN;
> >> +
> >> + if (ctx->in_progress || ctx->in_reset) {
> > ctx->in_progress was protected by a lock that isn't file_lock, so this looks wrong.
>
>
> file_lock isn't protecting in_progress. It's protecting access to the
> whole engine while a transfer is in progress.

Then when would we ever gain file_lock if in_progress was set? Shouldn't the current
client hold file_lock until we'd set in_progress to false, in which case we wouldn't
need to check in_progress if we now hold the lock?

> in_progress isn't protected at all,

Except it is, because you've acquired file_lock above before checking it (and
in_reset).

And why is in_progress written to under ctx->start_lock (which is not file_lock)
in aspeed_xdma_start() in the earlier patch if it's not protected?

> it's just better to lock

There's never a case of "it's just better to lock", as if it were optional. Either the
variable needs to be protected against concurrent access or it doesn't. If it does,
always access it under a consistent lock.

> before waiting for
> in_progress so that multiple clients don't all see in_progress go false
> and have to wait for a mutex (particularly in the nonblocking case).

Yes, so what you're suggesting is that in_progress needs to be protected against
concurrent access, so needs to be accessed under a consistent lock. Though as
suggested above it might be enough to successfully acquire file_lock.

As far as I can see we have three events that we need to care about:

1. Submission of a DMA request
2. Completion of a DMA request
3. PCIe-driven reset

For 1, multiple concurrent submissions need to be serialised, so we need a
mutex with the semantics of file_lock as you've described above.

2 should only occur if we have an event of type 1 outstanding. If 1 is outstanding
then receiving 2 should cause the process that triggered 1 to wake and release
the mutex.

3 can happen at any time which results in two cases that we need to care about:

3a. DMA request in progress (i.e. 1 but have not yet seen corresponding 2)
3b. DMA idle (no request in progress)

Events of type 1 need to be serialised against 3. 2 won't occur after 3 until 1 has
occurred, so there's no need to consider it in the serialisation for 3.

In the case of 3a. we need to reset the device, then mark the current transfer failed,
then wake the associated process. The woken process will release the mutex and
allow any queued requests to proceed.

3b is much simpler, though we need to prevent events of type 1 concurrently
accessing the device while the reset is in progress. So we need a spinlock to cover
configuring the device.

So that's two locks - a mutex to serialise process-context access to the device, and
a spinlock to serialise interrupts with respect to process-context access. Currently
your implementation contains two mutexes (start_lock and file_lock) and two
spinlocks (client_lock and reset_lock), all with fairly hazy definitions.

>
>
> >
> >> + mutex_unlock(&ctx->file_lock);
> >> + return -EAGAIN;
> >> + }
> >> + } else {
> >> + mutex_lock(&ctx->file_lock);
> >> +
> >> + rc = wait_event_interruptible(ctx->wait, !ctx->in_progress &&
> >> + !ctx->in_reset);
> > As above.
> >
> >> + if (rc) {
> >> + mutex_unlock(&ctx->file_lock);
> >> + return -EINTR;
> >> + }
> >> + }
> >> +
> >> + aspeed_xdma_start(ctx, &op, ctx->vga_phys + offs, client);
> >> +
> >> + mutex_unlock(&ctx->file_lock);
> >> +
> >> + if (!(file->f_flags & O_NONBLOCK)) {
> >> + rc = wait_event_interruptible(ctx->wait, !ctx->in_progress);
> >> + if (rc)
> >> + return -EINTR;
> >> +
> >> + if (client->error)
> >> + return -EIO;
> > What's the client->error value? Can it be more informative?
>
>
> Not really. There isn't much error information available. Basically the
> only way to get an error is if the engine is reset (user or PCIe
> initiated) while the transfer is ongoing.
>
>
> >
> >> + }
> >> +
> >> + return len;
> > We've potentially truncated len above (in the len > XDMA_OP_SIZE_MAX),
> > which leads to some ambiguity with the write() syscall given that it can
> > potentially return less than the requested length. This is one such case, but
> > the caller probably shouldn't attempt a follow-up write.
> >
> > This would go away if we make the len > XDMA_OP_SIZE_MAX an EINVAL
> > case as suggested above.
>
>
> Sure.
>
>
> >
> >> +}
> >> +
> >> +static __poll_t aspeed_xdma_poll(struct file *file,
> >> + struct poll_table_struct *wait)
> >> +{
> >> + __poll_t mask = 0;
> >> + __poll_t req = poll_requested_events(wait);
> >> + struct aspeed_xdma_client *client = file->private_data;
> >> + struct aspeed_xdma *ctx = client->ctx;
> >> +
> >> + if (req & (EPOLLIN | EPOLLRDNORM)) {
> >> + if (client->in_progress)
> >> + poll_wait(file, &ctx->wait, wait);
> >> +
> >> + if (!client->in_progress) {
> >> + if (client->error)
> >> + mask |= EPOLLERR;
> >> + else
> >> + mask |= EPOLLIN | EPOLLRDNORM;
> >> + }
> >> + }
> >> +
> >> + if (req & (EPOLLOUT | EPOLLWRNORM)) {
> >> + if (ctx->in_progress)
> >> + poll_wait(file, &ctx->wait, wait);
> >> +
> >> + if (!ctx->in_progress)
> >> + mask |= EPOLLOUT | EPOLLWRNORM;
> >> + }
> >> +
> >> + return mask;
> >> +}
> >> +
> >> +static void aspeed_xdma_vma_close(struct vm_area_struct *vma)
> >> +{
> >> + struct aspeed_xdma_client *client = vma->vm_private_data;
> >> +
> >> + gen_pool_free(client->ctx->vga_pool, (unsigned long)client->virt,
> >> + client->size);
> >> +
> >> + client->virt = NULL;
> >> + client->phys = 0;
> >> + client->size = 0;
> >> +}
> >> +
> >> +static const struct vm_operations_struct aspeed_xdma_vm_ops = {
> >> + .close = aspeed_xdma_vma_close,
> >> +};
> >> +
> >> +static int aspeed_xdma_mmap(struct file *file, struct vm_area_struct *vma)
> >> +{
> >> + int rc;
> >> + struct aspeed_xdma_client *client = file->private_data;
> >> + struct aspeed_xdma *ctx = client->ctx;
> >> +
> >> + /* restrict file to one mapping */
> >> + if (client->size)
> >> + return -ENOMEM;
> > Can we do better with the error code here?
>
>
> Maybe? I'm open to suggestions...

How about EBUSY?

>
>
> >
> >> +
> >> + client->size = vma->vm_end - vma->vm_start;
> >> + client->virt = gen_pool_dma_alloc(ctx->vga_pool, client->size,
> >> + &client->phys);
> >> + if (!client->virt) {
> >> + client->phys = 0;
> >> + client->size = 0;
> >> + return -ENOMEM;
> >> + }
> >> +
> >> + vma->vm_pgoff = (client->phys - ctx->vga_phys) >> PAGE_SHIFT;
> > Where does client->phys get set?
>
>
> gen_pool_dma_alloc sets it.

Ah, yes. Thanks.

Andrew

2019-12-04 15:28:53

by Eddie James

Subject: Re: [PATCH 07/12] drivers/soc: xdma: Add user interface


On 11/25/19 9:30 PM, Andrew Jeffery wrote:
> On Tue, 26 Nov 2019, at 06:14, Eddie James wrote:
>> On 11/24/19 5:59 PM, Andrew Jeffery wrote:
>>> On Sat, 9 Nov 2019, at 06:48, Eddie James wrote:
>>>> This commits adds a miscdevice to provide a user interface to the XDMA
>>>> engine. The interface provides the write operation to start DMA
>>>> operations. The DMA parameters are passed as the data to the write call.
>>>> The actual data to transfer is NOT passed through write. Note that both
>>>> directions of DMA operation are accomplished through the write command;
>>>> BMC to host and host to BMC.
>>>>
>>>> The XDMA engine is restricted to only accessing the reserved memory
>>>> space on the AST2500, typically used by the VGA. For this reason, the
>>>> VGA memory space is pooled and allocated with genalloc. Users calling
>>>> mmap allocate pages from this pool for their usage. The space allocated
>>>> by a client will be the space used in the DMA operation. For an
>>>> "upstream" (BMC to host) operation, the data in the client's area will
>>>> be transferred to the host. For a "downstream" (host to BMC) operation,
>>>> the host data will be placed in the client's memory area.
>>>>
>>>> Poll is also provided in order to determine when the DMA operation is
>>>> complete for non-blocking IO.
>>>>
>>>> Signed-off-by: Eddie James <[email protected]>
>>>> ---
>>>> drivers/soc/aspeed/aspeed-xdma.c | 223 +++++++++++++++++++++++++++++++++++++++
>>>> 1 file changed, 223 insertions(+)
>>>>
>>>> diff --git a/drivers/soc/aspeed/aspeed-xdma.c b/drivers/soc/aspeed/aspeed-xdma.c
>>>> index 99041a6..3d37582 100644
>>>> --- a/drivers/soc/aspeed/aspeed-xdma.c
>>>> +++ b/drivers/soc/aspeed/aspeed-xdma.c
>>>> @@ -64,6 +64,9 @@
>>>> #define XDMA_CMDQ_SIZE PAGE_SIZE
>>>> #define XDMA_NUM_CMDS \
>>>> (XDMA_CMDQ_SIZE / sizeof(struct aspeed_xdma_cmd))
>>>> +#define XDMA_OP_SIZE_MAX sizeof(struct aspeed_xdma_op)
>>>> +#define XDMA_OP_SIZE_MIN \
>>>> + (sizeof(struct aspeed_xdma_op) - sizeof(u64))
>>>>
>>>> /* Aspeed specification requires 10ms after switching the reset line */
>>>> #define XDMA_RESET_TIME_MS 10
>>>> @@ -216,6 +219,7 @@ struct aspeed_xdma {
>>>> bool in_reset;
>>>> bool upstream;
>>>> unsigned int cmd_idx;
>>>> + struct mutex file_lock;
>>> Please add documentation about what data file_lock is protecting.
>>>
>>>> struct mutex start_lock;
>>>> struct delayed_work reset_work;
>>>> spinlock_t client_lock;
>>>> @@ -230,6 +234,8 @@ struct aspeed_xdma {
>>>> dma_addr_t cmdq_vga_phys;
>>>> void *cmdq_vga_virt;
>>>> struct gen_pool *vga_pool;
>>>> +
>>>> + struct miscdevice misc;
>>>> };
>>>>
>>>> struct aspeed_xdma_client {
>>>> @@ -557,6 +563,204 @@ static irqreturn_t aspeed_xdma_pcie_irq(int irq,
>>>> void *arg)
>>>> return IRQ_HANDLED;
>>>> }
>>>>
>>>> +static ssize_t aspeed_xdma_write(struct file *file, const char __user *buf,
>>>> + size_t len, loff_t *offset)
>>>> +{
>>>> + int rc;
>>>> + struct aspeed_xdma_op op;
>>>> + struct aspeed_xdma_client *client = file->private_data;
>>>> + struct aspeed_xdma *ctx = client->ctx;
>>>> + u32 offs = client->phys ? (client->phys - ctx->vga_phys) :
>>>> + XDMA_CMDQ_SIZE;
>>>> +
>>>> + if (len < XDMA_OP_SIZE_MIN)
>>>> + return -EINVAL;
>>>> +
>>>> + if (len > XDMA_OP_SIZE_MAX)
>>>> + len = XDMA_OP_SIZE_MAX;
>>> Isn't this an EINVAL case as well?
>> Perhaps so.
>>
>>
>>>> +
>>>> + rc = copy_from_user(&op, buf, len);
>>>> + if (rc)
>>>> + return rc;
>>>> +
>>>> + if (op.direction == ASPEED_XDMA_DIRECTION_RESET) {
>>> Seems a bit abusive to use the direction field to issue a reset.
>> What would you recommend instead?
> Looks like an ioctl() to me. But what need do we have to directly reset
> the device? Could we achieve the same by rebinding the driver if
> necessary? We should only need to reset it if the driver has bugs, or
> is there some errata that we need to deal with? Userspace shouldn't
> be handling that though?


Well, it could be necessary to reset if userspace messes up and sends a
bad host address, for example, so that's why I'd like to have it
available to userspace.
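
If the reset path did move out of write(), a dedicated ioctl() might look
something like the sketch below. The magic number, request number, and
header placement are invented for illustration; nothing here is from the
actual patch.

```c
#include <linux/ioctl.h>

/*
 * Hypothetical UAPI addition for a reset request, replacing the
 * ASPEED_XDMA_DIRECTION_RESET overload of the direction field.
 * The 0xb7 magic and request number 0 are placeholders.
 */
#define ASPEED_XDMA_IOC_MAGIC	0xb7
#define ASPEED_XDMA_IOC_RESET	_IO(ASPEED_XDMA_IOC_MAGIC, 0)
```

Userspace would then issue ioctl(fd, ASPEED_XDMA_IOC_RESET) on the
miscdevice instead of encoding the reset in a write().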


>>>> + mutex_lock(&ctx->start_lock);
>>>> +
>>>> + if (aspeed_xdma_reset_start(ctx)) {
>>>> + msleep(XDMA_RESET_TIME_MS);
>>>> +
>>>> + aspeed_xdma_reset_finish(ctx);
>>>> + }
>>>> +
>>>> + mutex_unlock(&ctx->start_lock);
>>>> +
>>>> + return len;
>>>> + } else if (op.direction > ASPEED_XDMA_DIRECTION_RESET) {
>>>> + return -EINVAL;
>>>> + }
>>>> +
>>>> + if (op.len > ctx->vga_size - offs)
>>>> + return -EINVAL;
>>>> +
>>>> + if (file->f_flags & O_NONBLOCK) {
>>>> + if (!mutex_trylock(&ctx->file_lock))
>>>> + return -EAGAIN;
>>>> +
>>>> + if (ctx->in_progress || ctx->in_reset) {
>>> ctx->in_progress was protected by a lock that isn't file_lock, so this looks wrong.
>> file_lock isn't protecting in_progress. It's protecting access to the
>> whole engine while a transfer is in progress.
> Then when would we ever gain file_lock if in_progress was set? Shouldn't the current
> client hold file_lock until we'd set in_progress to false, in which case we wouldn't
> need to check in_progress if we now hold the lock?


That doesn't work for non-blocking I/O.


>> in_progress isn't protected at all,
> Except it is, because you've acquired file_lock above before checking it (and
> in_reset).
>
> And why is in_progress written to under ctx->start_lock (which is not file_lock)
> in aspeed_xdma_start() in the earlier patch if it's not protected?


OK, I just didn't quite get what you meant by protected. Yes, multiple
reads of in_progress here must be serialized so that only one thread
sees it go false at a time. This prevents multiple transfers from being
started while one is in progress. But writing it doesn't require
locking, as far as I can see.


>> it's just better to lock
> There's never a case of "it's just better to lock", as if it were optional. Either the
> variable needs to be protected against concurrent access or it doesn't. If it does,
> always access it under a consistent lock.


That's not what I meant; I was trying to say it's better to lock before
waiting (rather than after). However, now that I think about it again,
that wouldn't work to prevent multiple transfers being started while one
is in progress.


>> before waiting for
>> in_progress so that multiple clients don't all see in_progress go false
>> and have to wait for a mutex (particularly in the nonblocking case).
> Yes, so what you're suggesting is that in_progress needs to be protected against
> concurrent access, so needs to be accessed under a consistent lock. Though as
> suggested above it might be enough to successfully acquire file_lock.
> As far as I can see we have three events that we need to care about:
>
> 1. Submission of a DMA request
> 2. Completion of a DMA request
> 3. PCIe-driven reset
>
> For 1, multiple concurrent submissions need to be serialised, so we need a
> mutex with the semantics of file_lock as you've described above.
>
> 2 should only occur if we have an event of type 1 outstanding. If 1 is outstanding
> then receiving 2 should cause the process that triggered 1 to wake and release
> the mutex.


This gets more complicated than that because we need non-blocking I/O;
the mutex can't still be held when the call returns EAGAIN to the client.
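
One way to reconcile the two constraints above (in_progress always
accessed under a single consistent lock, yet no mutex held across an
EAGAIN return) is to treat the mutex as protecting only the
check-and-set of in_progress, with a wait queue for the blocking case.
The sketch below models that in userspace pthreads; all names are
illustrative, and the condition variable stands in for the driver's
wait_event_interruptible()/wake_up() pair.

```c
#include <errno.h>
#include <pthread.h>
#include <stdbool.h>

/*
 * Userspace model of the discussed scheme: the mutex protects only the
 * check-and-set of in_progress, so it is never held while a transfer
 * runs or across an EAGAIN return. Illustrative names only.
 */
struct xdma_model {
	pthread_mutex_t lock;
	pthread_cond_t idle;	/* models wait_event/wake_up */
	bool in_progress;
};

static int xdma_submit(struct xdma_model *x, bool nonblock)
{
	pthread_mutex_lock(&x->lock);

	if (nonblock && x->in_progress) {
		pthread_mutex_unlock(&x->lock);
		return -EAGAIN;
	}

	while (x->in_progress)
		pthread_cond_wait(&x->idle, &x->lock);

	x->in_progress = true;	/* engine is now ours */
	pthread_mutex_unlock(&x->lock);

	/* program the hardware here, outside the lock */
	return 0;
}

/* Models the completion interrupt (or a reset failing the transfer). */
static void xdma_complete(struct xdma_model *x)
{
	pthread_mutex_lock(&x->lock);
	x->in_progress = false;
	pthread_cond_broadcast(&x->idle);
	pthread_mutex_unlock(&x->lock);
}
```

Every reader and writer of in_progress goes through the same lock, which
addresses the consistency concern, while the non-blocking path still
bails out with EAGAIN without holding anything.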


> 3 can happen at any time which results in two cases that we need to care about:
>
> 3a. DMA request in progress (i.e. 1 but have not yet seen corresponding 2)
> 3b. DMA idle (no request in progress)
>
> Events of type 1 need to be serialised against 3. 2 won't occur after 3 until 1 has
> occurred, so there's no need to consider it in the serialisation for 3.
>
> In the case of 3a. we need to reset the device, then mark the current transfer failed,
> then wake the associated process. The woken process will release the mutex and
> allow any queued requests to proceed.
>
> 3b is much simpler, though we need to prevent events of type 1 concurrently
> accessing the device while the reset is in progress. So we need a spinlock to cover
> configuring the device.
>
> So that's two locks - a mutex to serialise process-context access to the device, and
> a spinlock to serialise interrupts with respect to process-context access. Currently
> your implementation contains two mutexes (start_lock and file_lock) and two
> spinlocks (client_lock and reset_lock), all with fairly hazy definitions.
>
>>>> + mutex_unlock(&ctx->file_lock);
>>>> + return -EAGAIN;
>>>> + }
>>>> + } else {
>>>> + mutex_lock(&ctx->file_lock);
>>>> +
>>>> + rc = wait_event_interruptible(ctx->wait, !ctx->in_progress &&
>>>> + !ctx->in_reset);
>>> As above.
>>>
>>>> + if (rc) {
>>>> + mutex_unlock(&ctx->file_lock);
>>>> + return -EINTR;
>>>> + }
>>>> + }
>>>> +
>>>> + aspeed_xdma_start(ctx, &op, ctx->vga_phys + offs, client);
>>>> +
>>>> + mutex_unlock(&ctx->file_lock);
>>>> +
>>>> + if (!(file->f_flags & O_NONBLOCK)) {
>>>> + rc = wait_event_interruptible(ctx->wait, !ctx->in_progress);
>>>> + if (rc)
>>>> + return -EINTR;
>>>> +
>>>> + if (client->error)
>>>> + return -EIO;
>>> What's the client->error value? Can it be more informative?
>> Not really. There isn't much error information available. Basically the
>> only way to get an error is if the engine is reset (user or PCIe
>> initiated) while the transfer is ongoing.
>>
>>
>>>> + }
>>>> +
>>>> + return len;
>>> We've potentially truncated len above (in the len > XDMA_OP_SIZE_MAX),
>>> which leads to some ambiguity with the write() syscall given that it can
>>> potentially return less than the requested length. This is one such case, but
>>> the caller probably shouldn't attempt a follow-up write.
>>>
>>> This would go away if we make the len > XDMA_OP_SIZE_MAX an EINVAL
>>> case as suggested above.
>> Sure.
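
The stricter check agreed here could be modelled as below. The field
layout of the op structure is assumed only so that the MIN/MAX macros
match the quoted definitions (MIN being MAX minus one 64-bit field);
the real UAPI layout may differ.

```c
#include <errno.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Model of struct aspeed_xdma_op; the fields are assumed for
 * illustration so that XDMA_OP_SIZE_MIN/MAX line up with the
 * definitions quoted above.
 */
struct aspeed_xdma_op {
	uint32_t direction;
	uint32_t len;
	uint64_t host_addr;
};

#define XDMA_OP_SIZE_MAX	sizeof(struct aspeed_xdma_op)
#define XDMA_OP_SIZE_MIN	(sizeof(struct aspeed_xdma_op) - sizeof(uint64_t))

/*
 * Agreed behaviour: reject oversized writes with EINVAL rather than
 * silently truncating, so write() never returns less than requested.
 */
static int xdma_validate_len(size_t len)
{
	if (len < XDMA_OP_SIZE_MIN || len > XDMA_OP_SIZE_MAX)
		return -EINVAL;
	return 0;
}
```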
>>
>>
>>>> +}
>>>> +
>>>> +static __poll_t aspeed_xdma_poll(struct file *file,
>>>> + struct poll_table_struct *wait)
>>>> +{
>>>> + __poll_t mask = 0;
>>>> + __poll_t req = poll_requested_events(wait);
>>>> + struct aspeed_xdma_client *client = file->private_data;
>>>> + struct aspeed_xdma *ctx = client->ctx;
>>>> +
>>>> + if (req & (EPOLLIN | EPOLLRDNORM)) {
>>>> + if (client->in_progress)
>>>> + poll_wait(file, &ctx->wait, wait);
>>>> +
>>>> + if (!client->in_progress) {
>>>> + if (client->error)
>>>> + mask |= EPOLLERR;
>>>> + else
>>>> + mask |= EPOLLIN | EPOLLRDNORM;
>>>> + }
>>>> + }
>>>> +
>>>> + if (req & (EPOLLOUT | EPOLLWRNORM)) {
>>>> + if (ctx->in_progress)
>>>> + poll_wait(file, &ctx->wait, wait);
>>>> +
>>>> + if (!ctx->in_progress)
>>>> + mask |= EPOLLOUT | EPOLLWRNORM;
>>>> + }
>>>> +
>>>> + return mask;
>>>> +}
>>>> +
>>>> +static void aspeed_xdma_vma_close(struct vm_area_struct *vma)
>>>> +{
>>>> + struct aspeed_xdma_client *client = vma->vm_private_data;
>>>> +
>>>> + gen_pool_free(client->ctx->vga_pool, (unsigned long)client->virt,
>>>> + client->size);
>>>> +
>>>> + client->virt = NULL;
>>>> + client->phys = 0;
>>>> + client->size = 0;
>>>> +}
>>>> +
>>>> +static const struct vm_operations_struct aspeed_xdma_vm_ops = {
>>>> + .close = aspeed_xdma_vma_close,
>>>> +};
>>>> +
>>>> +static int aspeed_xdma_mmap(struct file *file, struct vm_area_struct *vma)
>>>> +{
>>>> + int rc;
>>>> + struct aspeed_xdma_client *client = file->private_data;
>>>> + struct aspeed_xdma *ctx = client->ctx;
>>>> +
>>>> + /* restrict file to one mapping */
>>>> + if (client->size)
>>>> + return -ENOMEM;
>>> Can we do better with the error code here?
>> Maybe? I'm open to suggestions...
> How about EBUSY?


Sounds good.
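
A minimal model of the single-mapping guard with the agreed EBUSY return
(a sketch, not the driver's exact code):

```c
#include <errno.h>
#include <stddef.h>

/*
 * If the client already holds a mapping (size != 0), refuse a second
 * one with EBUSY rather than ENOMEM: the resource isn't exhausted,
 * it's occupied.
 */
static int xdma_check_mmap(size_t existing_size)
{
	return existing_size ? -EBUSY : 0;
}
```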


>>>> +
>>>> + client->size = vma->vm_end - vma->vm_start;
>>>> + client->virt = gen_pool_dma_alloc(ctx->vga_pool, client->size,
>>>> + &client->phys);
>>>> + if (!client->virt) {
>>>> + client->phys = 0;
>>>> + client->size = 0;
>>>> + return -ENOMEM;
>>>> + }
>>>> +
>>>> + vma->vm_pgoff = (client->phys - ctx->vga_phys) >> PAGE_SHIFT;
>>> Where does client->phys get set?
>> gen_pool_dma_alloc sets it.
> Ah, yes. Thanks.
>
> Andrew