2022-04-12 14:08:44

by Nicholas Graumann

[permalink] [raw]
Subject: [PATCH v3] dmaengine: xilinx_dma: Free descriptor lists in order

If xilinx_dma_terminate_all is called while the AXI DMA is active, the
following error messages might be seen upon restarting the DMA:

[ 72.556254] xilinx_dma_irq_handler: Channel d053cb5b has errors 10, cdr 2c049f80 tdr 2c04a000
[ 72.557370] xilinx_dma_irq_handler: Channel d053cb5b has errors 100, cdr 2c049f80 tdr 2c049f80

From then on the AXI DMA won't process any more descriptors until the
DMA channel is released and requested again.

The following sequence of events is what causes this to happen:

1. Some descriptors are prepared with xilinx_dma_tx_submit (so they get
added to pending_list).
2. The DMA is kicked off via call to xilinx_dma_tx_submit (the
descriptors are moved to active_list).
3. While the transfer is active, another descriptor is prepared with
xilinx_dma_tx_submit (so it goes onto the pending_list)
4. Before the transfers complete, xilinx_dma_terminate_all is called.
That function resets the channel then calls
xilinx_dma_free_descriptors to free the descriptors.

At that point, pending_list contains a descriptor that is newer (and
thus farther down the chain of descriptors) than the descriptors
prepared in (1). However, it gets placed onto the free_seg_list before
the older descriptors. From then on, the next pointers are no longer
valid because the order of the descriptors in free_seg_list does not
match the order in which the descriptors were allocated.

To remedy this, the descriptor lists need to be freed in order from
oldest to newest, otherwise segments could be added to the free segment
list in a different order than they were created. This is not an issue
for VDMA nor CDMA because the driver does not maintain a list of
descriptors to free.

Fixes: c0bba3a99f07 ("dmaengine: vdma: Add Support for Xilinx AXI Direct Memory Access Engine")
Signed-off-by: Nicholas Graumann <[email protected]>
Reviewed-by: Radhey Shyam Pandey <[email protected]>
---
drivers/dma/xilinx/xilinx_dma.c | 6 +++++-
1 file changed, 5 insertions(+), 1 deletion(-)

diff --git a/drivers/dma/xilinx/xilinx_dma.c b/drivers/dma/xilinx/xilinx_dma.c
index 3ff9fa3d8cd5..3b435449cd0c 100644
--- a/drivers/dma/xilinx/xilinx_dma.c
+++ b/drivers/dma/xilinx/xilinx_dma.c
@@ -884,9 +884,13 @@ static void xilinx_dma_free_descriptors(struct xilinx_dma_chan *chan)

spin_lock_irqsave(&chan->lock, flags);

- xilinx_dma_free_desc_list(chan, &chan->pending_list);
+ /*
+ * Descriptor lists must be freed from oldest to newest so that the
+ * order of free_seg_list is maintained.
+ */
xilinx_dma_free_desc_list(chan, &chan->done_list);
xilinx_dma_free_desc_list(chan, &chan->active_list);
+ xilinx_dma_free_desc_list(chan, &chan->pending_list);

spin_unlock_irqrestore(&chan->lock, flags);
}
--
2.35.1


2022-05-10 15:44:07

by Radhey Shyam Pandey

[permalink] [raw]
Subject: RE: [PATCH v3] dmaengine: xilinx_dma: Free descriptor lists in order

> -----Original Message-----
> From: Nicholas Graumann <[email protected]>
> Sent: Monday, April 11, 2022 7:34 PM
> To: Radhey Shyam Pandey <[email protected]>; Appana Durga Kedareswara
> Rao <[email protected]>; Harini Katakam <[email protected]>; Vinod
> Koul <[email protected]>; Michal Simek <[email protected]>; Appana Durga
> Kedareswara Rao <[email protected]>
> Cc: Nicholas Graumann <[email protected]>;
> [email protected]; [email protected]; linux-
> [email protected]
> Subject: [PATCH v3] dmaengine: xilinx_dma: Free descriptor lists in order
>

Minor nit - It's first version. I think we need to spin a new version fixing it?

> If xilinx_dma_terminate_all is called while the AXI DMA is active, the following
> error messages might be seen upon restarting the DMA:
>
> [ 72.556254] xilinx_dma_irq_handler: Channel d053cb5b has errors 10, cdr
> 2c049f80 tdr 2c04a000
> [ 72.557370] xilinx_dma_irq_handler: Channel d053cb5b has errors 100, cdr
> 2c049f80 tdr 2c049f80
>
> From then on the AXI DMA won't process any more descriptors until the DMA
> channel is released and requested again.
>
> The following sequence of events is what causes this to happen:
>
> 1. Some descriptors are prepared with xilinx_dma_tx_submit (so they get
> added to pending_list).
> 2. The DMA is kicked off via call to xilinx_dma_tx_submit (the
> descriptors are moved to active_list).
> 3. While the transfer is active, another descriptor is prepared with
> xilinx_dma_tx_submit (so it goes onto the pending_list) 4. Before the
> transfers complete, xilinx_dma_terminate_all is called.
> That function resets the channel then calls
> xilinx_dma_free_descriptors to free the descriptors.
>
> At that point, pending_list contains a descriptor that is newer (and thus farther
> down the chain of descriptors) than the descriptors prepared in (1). However,
> it gets placed onto the free_seg_list before the older descriptors. From then
> on, the next pointers are no longer valid because the order of the descriptors
> in free_seg_list does not match the order in which the descriptors were
> allocated.
>
> To remedy this, the descriptor lists need to be freed in order from oldest to
> newest, otherwise segments could be added to the free segment list in a
> different order than they were created. This is not an issue for VDMA nor
> CDMA because the driver does not maintain a list of descriptors to free.
>
> Fixes: c0bba3a99f07 ("dmaengine: vdma: Add Support for Xilinx AXI Direct
> Memory Access Engine")
> Signed-off-by: Nicholas Graumann <[email protected]>
> Reviewed-by: Radhey Shyam Pandey <[email protected]>
Reviewed-by: Radhey Shyam Pandey <[email protected]>

> ---
> drivers/dma/xilinx/xilinx_dma.c | 6 +++++-
> 1 file changed, 5 insertions(+), 1 deletion(-)
>
> diff --git a/drivers/dma/xilinx/xilinx_dma.c b/drivers/dma/xilinx/xilinx_dma.c
> index 3ff9fa3d8cd5..3b435449cd0c 100644
> --- a/drivers/dma/xilinx/xilinx_dma.c
> +++ b/drivers/dma/xilinx/xilinx_dma.c
> @@ -884,9 +884,13 @@ static void xilinx_dma_free_descriptors(struct
> xilinx_dma_chan *chan)
>
> spin_lock_irqsave(&chan->lock, flags);
>
> - xilinx_dma_free_desc_list(chan, &chan->pending_list);
> + /*
> + * Descriptor lists must be freed from oldest to newest so that the
> + * order of free_seg_list is maintained.
> + */
> xilinx_dma_free_desc_list(chan, &chan->done_list);
> xilinx_dma_free_desc_list(chan, &chan->active_list);
> + xilinx_dma_free_desc_list(chan, &chan->pending_list);
>
> spin_unlock_irqrestore(&chan->lock, flags); }
> --
> 2.35.1