2018-02-01 22:02:51

by Alan Tull

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:

Hi Hao,

A few comments below. Besides that, looks good.

> This patch adds fpga manager driver for FPGA Management Engine (FME). It
> implements fpga_manager_ops for FPGA Partial Reconfiguration function.
>
> Signed-off-by: Tim Whisonant <[email protected]>
> Signed-off-by: Enno Luebbers <[email protected]>
> Signed-off-by: Shiva Rao <[email protected]>
> Signed-off-by: Christopher Rauer <[email protected]>
> Signed-off-by: Kang Luwei <[email protected]>
> Signed-off-by: Xiao Guangrong <[email protected]>
> Signed-off-by: Wu Hao <[email protected]>
> ----
> v3: rename driver to dfl-fpga-fme-mgr
> implemented status callback for fpga manager
> rebased due to fpga api changes
> ---
> .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
> drivers/fpga/Kconfig | 6 +
> drivers/fpga/Makefile | 1 +
> drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
> drivers/fpga/fpga-dfl.h | 39 ++-
> 5 files changed, 371 insertions(+), 1 deletion(-)
> create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
>
> diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> new file mode 100644
> index 0000000..2d4f917
> --- /dev/null
> +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> @@ -0,0 +1,8 @@
> +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
> +Date: November 2017
> +KernelVersion: 4.15
> +Contact: Wu Hao <[email protected]>
> +Description: Read-only. It returns interface id of partial reconfiguration
> + hardware. Userspace could use this information to check if
> + current hardware is compatible with given image before FPGA
> + programming.

I'm a little confused by this. I can understand that the PR bitstream
has a dependency on the FPGA's static image, but I don't understand
the dependency of the bistream on the hardware that is used to program
the bitstream to the FPGA.

> diff --git a/drivers/fpga/Kconfig b/drivers/fpga/Kconfig
> index 57da904..0171ecb 100644
> --- a/drivers/fpga/Kconfig
> +++ b/drivers/fpga/Kconfig
> @@ -150,6 +150,12 @@ config FPGA_DFL_FME
> FPGA platform level management features. There shall be 1 FME
> per DFL based FPGA device.
>
> +config FPGA_DFL_FME_MGR
> + tristate "FPGA DFL FME Manager Driver"
> + depends on FPGA_DFL_FME
> + help
> + Say Y to enable FPGA Manager driver for FPGA Management Engine.
> +
> config INTEL_FPGA_DFL_PCI
> tristate "Intel FPGA DFL PCIe Device Driver"
> depends on PCI && FPGA_DFL
> diff --git a/drivers/fpga/Makefile b/drivers/fpga/Makefile
> index cc75bb3..6378580 100644
> --- a/drivers/fpga/Makefile
> +++ b/drivers/fpga/Makefile
> @@ -31,6 +31,7 @@ obj-$(CONFIG_OF_FPGA_REGION) += of-fpga-region.o
> # FPGA Device Feature List Support
> obj-$(CONFIG_FPGA_DFL) += fpga-dfl.o
> obj-$(CONFIG_FPGA_DFL_FME) += fpga-dfl-fme.o
> +obj-$(CONFIG_FPGA_DFL_FME_MGR) += fpga-dfl-fme-mgr.o
>
> fpga-dfl-fme-objs := dfl-fme-main.o dfl-fme-pr.o
>
> diff --git a/drivers/fpga/fpga-dfl-fme-mgr.c b/drivers/fpga/fpga-dfl-fme-mgr.c
> new file mode 100644
> index 0000000..70356ce
> --- /dev/null
> +++ b/drivers/fpga/fpga-dfl-fme-mgr.c
> @@ -0,0 +1,318 @@
> +/*
> + * FPGA Manager Driver for FPGA Management Engine (FME)
> + *
> + * Copyright (C) 2017 Intel Corporation, Inc.
> + *
> + * Authors:
> + * Kang Luwei <[email protected]>
> + * Xiao Guangrong <[email protected]>
> + * Wu Hao <[email protected]>
> + * Joseph Grecco <[email protected]>
> + * Enno Luebbers <[email protected]>
> + * Tim Whisonant <[email protected]>
> + * Ananda Ravuri <[email protected]>
> + * Christopher Rauer <[email protected]>
> + * Henry Mitchel <[email protected]>
> + *
> + * This work is licensed under the terms of the GNU GPL version 2.
> + * SPDX-License-Identifier: GPL-2.0
> + */
> +
> +#include <linux/module.h>
> +#include <linux/iopoll.h>
> +#include <linux/fpga/fpga-mgr.h>
> +
> +#include "fpga-dfl.h"
> +#include "dfl-fme.h"
> +
> +#define PR_WAIT_TIMEOUT 8000000
> +#define PR_HOST_STATUS_IDLE 0
> +
> +struct fme_mgr_priv {
> + void __iomem *ioaddr;
> + u64 pr_error;
> +};
> +
> +static ssize_t interface_id_show(struct device *dev,
> + struct device_attribute *attr, char *buf)
> +{
> + struct fpga_manager *mgr = dev_get_drvdata(dev);
> + struct fme_mgr_priv *priv = mgr->priv;
> + u64 intfc_id_l, intfc_id_h;
> +
> + intfc_id_l = readq(priv->ioaddr + FME_PR_INTFC_ID_L);
> + intfc_id_h = readq(priv->ioaddr + FME_PR_INTFC_ID_H);
> +
> + return scnprintf(buf, PAGE_SIZE, "%016llx%016llx\n",
> + (unsigned long long)intfc_id_h,
> + (unsigned long long)intfc_id_l);
> +}
> +static DEVICE_ATTR_RO(interface_id);
> +
> +static const struct attribute *fme_mgr_attrs[] = {
> + &dev_attr_interface_id.attr,
> + NULL,
> +};
> +
> +static u64 pr_error_to_mgr_status(u64 err)
> +{
> + u64 status = 0;
> +
> + if (err & FME_PR_ERR_OPERATION_ERR)
> + status |= FPGA_MGR_STATUS_OPERATION_ERR;
> + if (err & FME_PR_ERR_CRC_ERR)
> + status |= FPGA_MGR_STATUS_CRC_ERR;
> + if (err & FME_PR_ERR_INCOMPATIBLE_BS)
> + status |= FPGA_MGR_STATUS_INCOMPATIBLE_IMAGE_ERR;
> + if (err & FME_PR_ERR_PROTOCOL_ERR)
> + status |= FPGA_MGR_STATUS_IP_PROTOCOL_ERR;
> + if (err & FME_PR_ERR_FIFO_OVERFLOW)
> + status |= FPGA_MGR_STATUS_FIFO_OVERFLOW_ERR;
> +
> + return status;
> +}
> +
> +static u64 fme_mgr_pr_error_handle(void __iomem *fme_pr)
> +{
> + u64 pr_status, pr_error;
> +
> + pr_status = readq(fme_pr + FME_PR_STS);
> + if (!(pr_status & FME_PR_STS_PR_STS))
> + return 0;
> +
> + pr_error = readq(fme_pr + FME_PR_ERR);
> + writeq(pr_error, fme_pr + FME_PR_ERR);
> +
> + return pr_error;
> +}
> +
> +static int fme_mgr_write_init(struct fpga_manager *mgr,
> + struct fpga_image_info *info,
> + const char *buf, size_t count)
> +{
> + struct device *dev = &mgr->dev;
> + struct fme_mgr_priv *priv = mgr->priv;
> + void __iomem *fme_pr = priv->ioaddr;
> + u64 pr_ctrl, pr_status;
> +
> + if (!(info->flags & FPGA_MGR_PARTIAL_RECONFIG)) {
> + dev_err(dev, "only support partial reconfiguration.\n");

supports

> + return -EINVAL;
> + }
> +
> + dev_dbg(dev, "resetting PR before initiated PR\n");
> +
> + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> + pr_ctrl |= FME_PR_CTRL_PR_RST;
> + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> +
> + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> + pr_ctrl & FME_PR_CTRL_PR_RSTACK, 1,
> + PR_WAIT_TIMEOUT)) {
> + dev_err(dev, "maximum PR timeout\n");

We have two dev_err's with the same "maximum PR timeout" so the user
would not be able to see at which point the timeout occurred.

I suggest to get rid of the word 'maximum' for both. This one could
be 'reset ack timeout" or something similar.

> + return -ETIMEDOUT;
> + }
> +
> + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> + pr_ctrl &= ~FME_PR_CTRL_PR_RST;
> + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> +
> + dev_dbg(dev,
> + "waiting for PR resource in HW to be initialized and ready\n");
> +
> + if (readq_poll_timeout(fme_pr + FME_PR_STS, pr_status,
> + (pr_status & FME_PR_STS_PR_STS) ==
> + FME_PR_STS_PR_STS_IDLE, 1, PR_WAIT_TIMEOUT)) {
> + dev_err(dev, "maximum PR timeout\n");

"PR_STS timeout"? Or something better.

> + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> + return -ETIMEDOUT;
> + }
> +
> + dev_dbg(dev, "check and clear previous PR error\n");
> + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> + if (priv->pr_error)
> + dev_dbg(dev, "previous PR error detected %llx\n",
> + (unsigned long long)priv->pr_error);
> +
> + dev_dbg(dev, "set PR port ID\n");
> +
> + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> + pr_ctrl &= ~FME_PR_CTRL_PR_RGN_ID;
> + pr_ctrl |= FIELD_PREP(FME_PR_CTRL_PR_RGN_ID, info->region_id);
> + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> +
> + return 0;
> +}
> +
> +static int fme_mgr_write(struct fpga_manager *mgr,
> + const char *buf, size_t count)
> +{
> + struct device *dev = &mgr->dev;
> + struct fme_mgr_priv *priv = mgr->priv;
> + void __iomem *fme_pr = priv->ioaddr;
> + u64 pr_ctrl, pr_status, pr_data;
> + int delay = 0, pr_credit, i = 0;
> +
> + dev_dbg(dev, "start request\n");
> +
> + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> + pr_ctrl |= FME_PR_CTRL_PR_START;
> + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> +
> + dev_dbg(dev, "pushing data from bitstream to HW\n");
> +
> + pr_status = readq(fme_pr + FME_PR_STS);
> + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> +
> + while (count > 0) {
> + while (pr_credit <= 1) {
> + if (delay++ > PR_WAIT_TIMEOUT) {
> + dev_err(dev, "maximum try\n");

It looks like this is waiting for an entry in a queue and timing out
here. Could you add a some comments for pr_credit above and this
loop? Also a better error message, perhaps "PR credit timeout"?

> + return -ETIMEDOUT;
> + }
> + udelay(1);
> +
> + pr_status = readq(fme_pr + FME_PR_STS);
> + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> + }
> +
> + if (count >= 4) {
> + pr_data = 0;
> + pr_data |= FIELD_PREP(FME_PR_DATA_PR_DATA_RAW,
> + *(((u32 *)buf) + i));
> + writeq(pr_data, fme_pr + FME_PR_DATA);
> + count -= 4;
> + pr_credit--;
> + i++;
> + } else {
> + WARN_ON(1);
> + return -EINVAL;
> + }
> + }
> +
> + return 0;
> +}
> +
> +static int fme_mgr_write_complete(struct fpga_manager *mgr,
> + struct fpga_image_info *info)
> +{
> + struct device *dev = &mgr->dev;
> + struct fme_mgr_priv *priv = mgr->priv;
> + void __iomem *fme_pr = priv->ioaddr;
> + u64 pr_ctrl;
> +
> + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> + pr_ctrl |= FME_PR_CTRL_PR_COMPLETE;
> + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> +
> + dev_dbg(dev, "green bitstream push complete\n");
> + dev_dbg(dev, "waiting for HW to release PR resource\n");
> +
> + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> + !(pr_ctrl & FME_PR_CTRL_PR_START), 1,
> + PR_WAIT_TIMEOUT)) {
> + dev_err(dev, "maximum try.\n");

Some message specific to here also, please.

> + return -ETIMEDOUT;
> + }
> +
> + dev_dbg(dev, "PR operation complete, checking status\n");
> + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> + if (priv->pr_error) {
> + dev_dbg(dev, "PR error detected %llx\n",
> + (unsigned long long)priv->pr_error);
> + return -EIO;
> + }
> +
> + dev_dbg(dev, "PR done successfully\n");
> +
> + return 0;
> +}
> +
> +static enum fpga_mgr_states fme_mgr_state(struct fpga_manager *mgr)
> +{
> + return FPGA_MGR_STATE_UNKNOWN;
> +}
> +
> +static u64 fme_mgr_status(struct fpga_manager *mgr)
> +{
> + struct fme_mgr_priv *priv = mgr->priv;
> +
> + return pr_error_to_mgr_status(priv->pr_error);
> +}
> +
> +static const struct fpga_manager_ops fme_mgr_ops = {
> + .write_init = fme_mgr_write_init,
> + .write = fme_mgr_write,
> + .write_complete = fme_mgr_write_complete,
> + .state = fme_mgr_state,
> + .status = fme_mgr_status,
> +};
> +
> +static int fme_mgr_probe(struct platform_device *pdev)
> +{
> + struct device *dev = &pdev->dev;
> + struct fme_mgr_priv *priv;
> + struct fpga_manager *mgr;
> + struct resource *res;
> + int ret;
> +
> + priv = devm_kzalloc(dev, sizeof(*priv), GFP_KERNEL);
> + if (!priv)
> + return -ENOMEM;
> +
> + res = platform_get_resource(pdev, IORESOURCE_MEM, 0);
> + priv->ioaddr = devm_ioremap(dev, res->start, resource_size(res));
> + if (IS_ERR(priv->ioaddr))
> + return PTR_ERR(priv->ioaddr);
> +
> + ret = sysfs_create_files(&pdev->dev.kobj, fme_mgr_attrs);
> + if (ret)
> + return ret;
> +
> + mgr = devm_kzalloc(dev, sizeof(*mgr), GFP_KERNEL);
> + if (!mgr)
> + goto sysfs_remove_exit;
> +
> + mgr->name = "DFL FPGA Manager";
> + mgr->mops = &fme_mgr_ops;
> + mgr->priv = priv;
> + mgr->parent = dev;
> + platform_set_drvdata(pdev, mgr);
> +
> + ret = fpga_mgr_register(mgr);
> + if (ret) {
> + dev_err(dev, "unable to register FPGA manager\n");
> + goto sysfs_remove_exit;
> + }
> +
> + return 0;
> +
> +sysfs_remove_exit:
> + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> + return ret;
> +}
> +
> +static int fme_mgr_remove(struct platform_device *pdev)
> +{
> + struct fpga_manager *mgr = platform_get_drvdata(pdev);
> +
> + fpga_mgr_unregister(mgr);
> + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> +
> + return 0;
> +}
> +
> +static struct platform_driver fme_mgr_driver = {
> + .driver = {
> + .name = FPGA_DFL_FME_MGR,
> + },
> + .probe = fme_mgr_probe,
> + .remove = fme_mgr_remove,
> +};
> +
> +module_platform_driver(fme_mgr_driver);
> +
> +MODULE_DESCRIPTION("FPGA Manager for FPGA Management Engine");
> +MODULE_AUTHOR("Intel Corporation");
> +MODULE_LICENSE("GPL v2");
> +MODULE_ALIAS("platform:fpga-dfl-fme-mgr");
> diff --git a/drivers/fpga/fpga-dfl.h b/drivers/fpga/fpga-dfl.h
> index e5a1094..d45eb82 100644
> --- a/drivers/fpga/fpga-dfl.h
> +++ b/drivers/fpga/fpga-dfl.h
> @@ -130,7 +130,44 @@
>
> /* FME Partial Reconfiguration Sub Feature Register Set */
> #define FME_PR_DFH DFH
> -#define FME_PR_SIZE DFH_SIZE
> +#define FME_PR_CTRL 0x8
> +#define FME_PR_STS 0x10
> +#define FME_PR_DATA 0x18
> +#define FME_PR_ERR 0x20
> +#define FME_PR_INTFC_ID_H 0xA8
> +#define FME_PR_INTFC_ID_L 0xB0
> +#define FME_PR_SIZE 0xB8
> +
> +/* FME PR Control Register Bitfield */
> +#define FME_PR_CTRL_PR_RST BIT(0) /* Reset PR engine */
> +#define FME_PR_CTRL_PR_RSTACK BIT(4) /* Ack for PR engine reset */
> +#define FME_PR_CTRL_PR_RGN_ID GENMASK_ULL(9, 7) /* PR Region ID */
> +#define FME_PR_CTRL_PR_START BIT(12) /* Start to request for PR service */
> +#define FME_PR_CTRL_PR_COMPLETE BIT(13) /* PR data push complete notification */
> +
> +/* FME PR Status Register Bitfield */
> +/* Number of available entries in HW queue inside the PR engine. */
> +#define FME_PR_STS_PR_CREDIT GENMASK_ULL(8, 0)
> +#define FME_PR_STS_PR_STS BIT(16) /* PR operation status */
> +#define FME_PR_STS_PR_STS_IDLE 0
> +#define FME_PR_STS_PR_CTRLR_STS GENMASK_ULL(22, 20) /* Controller status */
> +#define FME_PR_STS_PR_HOST_STS GENMASK_ULL(27, 24) /* PR host status */
> +
> +/* FME PR Data Register Bitfield */
> +/* PR data from the raw-binary file. */
> +#define FME_PR_DATA_PR_DATA_RAW GENMASK_ULL(32, 0)
> +
> +/* FME PR Error Register */
> +/* Previous PR Operation errors detected. */
> +#define FME_PR_ERR_OPERATION_ERR BIT(0)
> +/* CRC error detected. */
> +#define FME_PR_ERR_CRC_ERR BIT(1)
> +/* Incompatible PR bitstream detected. */
> +#define FME_PR_ERR_INCOMPATIBLE_BS BIT(2)
> +/* PR data push protocol violated. */
> +#define FME_PR_ERR_PROTOCOL_ERR BIT(3)
> +/* PR data fifo overflow error detected */
> +#define FME_PR_ERR_FIFO_OVERFLOW BIT(4)
>
> /* FME HSSI Sub Feature Register Set */
> #define FME_HSSI_DFH DFH

I see fpga-dfl.h as enumeration code which is separate from any driver
implementation specifics other than what's required for the DFL
enumeration scheme. These PR engine #defines should move to a .h
that is dedicated to this specific PR hardware device. If someone else
adds a different PR device to the framework, their PR driver would
also have its own .h. Same for any other modules that aren't central
to DFL enumeration.

Thanks,
Alan

> --
> 1.8.3.1
>


2018-02-02 09:53:12

by Wu Hao

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

On Thu, Feb 01, 2018 at 04:00:36PM -0600, Alan Tull wrote:
> On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:
>
> Hi Hao,
>
> A few comments below. Besides that, looks good.
>
> > This patch adds fpga manager driver for FPGA Management Engine (FME). It
> > implements fpga_manager_ops for FPGA Partial Reconfiguration function.
> >
> > Signed-off-by: Tim Whisonant <[email protected]>
> > Signed-off-by: Enno Luebbers <[email protected]>
> > Signed-off-by: Shiva Rao <[email protected]>
> > Signed-off-by: Christopher Rauer <[email protected]>
> > Signed-off-by: Kang Luwei <[email protected]>
> > Signed-off-by: Xiao Guangrong <[email protected]>
> > Signed-off-by: Wu Hao <[email protected]>
> > ----
> > v3: rename driver to dfl-fpga-fme-mgr
> > implemented status callback for fpga manager
> > rebased due to fpga api changes
> > ---
> > .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
> > drivers/fpga/Kconfig | 6 +
> > drivers/fpga/Makefile | 1 +
> > drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
> > drivers/fpga/fpga-dfl.h | 39 ++-
> > 5 files changed, 371 insertions(+), 1 deletion(-)
> > create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
> >
> > diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > new file mode 100644
> > index 0000000..2d4f917
> > --- /dev/null
> > +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > @@ -0,0 +1,8 @@
> > +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
> > +Date: November 2017
> > +KernelVersion: 4.15
> > +Contact: Wu Hao <[email protected]>
> > +Description: Read-only. It returns interface id of partial reconfiguration
> > + hardware. Userspace could use this information to check if
> > + current hardware is compatible with given image before FPGA
> > + programming.
>
> I'm a little confused by this. I can understand that the PR bitstream
> has a dependency on the FPGA's static image, but I don't understand
> the dependency of the bistream on the hardware that is used to program
> the bitstream to the FPGA.

Sorry for the confusion, the interface_id is used to indicate the version of
the hardware for partial reconfiguration (it's part of the static image of
the FPGA device). Will improve the description on this.

>
> > diff --git a/drivers/fpga/Kconfig b/drivers/fpga/Kconfig
> > index 57da904..0171ecb 100644
> > --- a/drivers/fpga/Kconfig
> > +++ b/drivers/fpga/Kconfig
> > @@ -150,6 +150,12 @@ config FPGA_DFL_FME
> > FPGA platform level management features. There shall be 1 FME
> > per DFL based FPGA device.
> >
> > +config FPGA_DFL_FME_MGR
> > + tristate "FPGA DFL FME Manager Driver"
> > + depends on FPGA_DFL_FME
> > + help
> > + Say Y to enable FPGA Manager driver for FPGA Management Engine.
> > +
> > config INTEL_FPGA_DFL_PCI
> > tristate "Intel FPGA DFL PCIe Device Driver"
> > depends on PCI && FPGA_DFL
> > diff --git a/drivers/fpga/Makefile b/drivers/fpga/Makefile
> > index cc75bb3..6378580 100644
> > --- a/drivers/fpga/Makefile
> > +++ b/drivers/fpga/Makefile
> > @@ -31,6 +31,7 @@ obj-$(CONFIG_OF_FPGA_REGION) += of-fpga-region.o
> > # FPGA Device Feature List Support
> > obj-$(CONFIG_FPGA_DFL) += fpga-dfl.o
> > obj-$(CONFIG_FPGA_DFL_FME) += fpga-dfl-fme.o
> > +obj-$(CONFIG_FPGA_DFL_FME_MGR) += fpga-dfl-fme-mgr.o
> >
> > fpga-dfl-fme-objs := dfl-fme-main.o dfl-fme-pr.o
> >
> > diff --git a/drivers/fpga/fpga-dfl-fme-mgr.c b/drivers/fpga/fpga-dfl-fme-mgr.c
> > new file mode 100644
> > index 0000000..70356ce
> > --- /dev/null
> > +++ b/drivers/fpga/fpga-dfl-fme-mgr.c
> > @@ -0,0 +1,318 @@
> > +/*
> > + * FPGA Manager Driver for FPGA Management Engine (FME)
> > + *
> > + * Copyright (C) 2017 Intel Corporation, Inc.
> > + *
> > + * Authors:
> > + * Kang Luwei <[email protected]>
> > + * Xiao Guangrong <[email protected]>
> > + * Wu Hao <[email protected]>
> > + * Joseph Grecco <[email protected]>
> > + * Enno Luebbers <[email protected]>
> > + * Tim Whisonant <[email protected]>
> > + * Ananda Ravuri <[email protected]>
> > + * Christopher Rauer <[email protected]>
> > + * Henry Mitchel <[email protected]>
> > + *
> > + * This work is licensed under the terms of the GNU GPL version 2.
> > + * SPDX-License-Identifier: GPL-2.0
> > + */
> > +
> > +#include <linux/module.h>
> > +#include <linux/iopoll.h>
> > +#include <linux/fpga/fpga-mgr.h>
> > +
> > +#include "fpga-dfl.h"
> > +#include "dfl-fme.h"
> > +
> > +#define PR_WAIT_TIMEOUT 8000000
> > +#define PR_HOST_STATUS_IDLE 0
> > +
> > +struct fme_mgr_priv {
> > + void __iomem *ioaddr;
> > + u64 pr_error;
> > +};
> > +
> > +static ssize_t interface_id_show(struct device *dev,
> > + struct device_attribute *attr, char *buf)
> > +{
> > + struct fpga_manager *mgr = dev_get_drvdata(dev);
> > + struct fme_mgr_priv *priv = mgr->priv;
> > + u64 intfc_id_l, intfc_id_h;
> > +
> > + intfc_id_l = readq(priv->ioaddr + FME_PR_INTFC_ID_L);
> > + intfc_id_h = readq(priv->ioaddr + FME_PR_INTFC_ID_H);
> > +
> > + return scnprintf(buf, PAGE_SIZE, "%016llx%016llx\n",
> > + (unsigned long long)intfc_id_h,
> > + (unsigned long long)intfc_id_l);
> > +}
> > +static DEVICE_ATTR_RO(interface_id);
> > +
> > +static const struct attribute *fme_mgr_attrs[] = {
> > + &dev_attr_interface_id.attr,
> > + NULL,
> > +};
> > +
> > +static u64 pr_error_to_mgr_status(u64 err)
> > +{
> > + u64 status = 0;
> > +
> > + if (err & FME_PR_ERR_OPERATION_ERR)
> > + status |= FPGA_MGR_STATUS_OPERATION_ERR;
> > + if (err & FME_PR_ERR_CRC_ERR)
> > + status |= FPGA_MGR_STATUS_CRC_ERR;
> > + if (err & FME_PR_ERR_INCOMPATIBLE_BS)
> > + status |= FPGA_MGR_STATUS_INCOMPATIBLE_IMAGE_ERR;
> > + if (err & FME_PR_ERR_PROTOCOL_ERR)
> > + status |= FPGA_MGR_STATUS_IP_PROTOCOL_ERR;
> > + if (err & FME_PR_ERR_FIFO_OVERFLOW)
> > + status |= FPGA_MGR_STATUS_FIFO_OVERFLOW_ERR;
> > +
> > + return status;
> > +}
> > +
> > +static u64 fme_mgr_pr_error_handle(void __iomem *fme_pr)
> > +{
> > + u64 pr_status, pr_error;
> > +
> > + pr_status = readq(fme_pr + FME_PR_STS);
> > + if (!(pr_status & FME_PR_STS_PR_STS))
> > + return 0;
> > +
> > + pr_error = readq(fme_pr + FME_PR_ERR);
> > + writeq(pr_error, fme_pr + FME_PR_ERR);
> > +
> > + return pr_error;
> > +}
> > +
> > +static int fme_mgr_write_init(struct fpga_manager *mgr,
> > + struct fpga_image_info *info,
> > + const char *buf, size_t count)
> > +{
> > + struct device *dev = &mgr->dev;
> > + struct fme_mgr_priv *priv = mgr->priv;
> > + void __iomem *fme_pr = priv->ioaddr;
> > + u64 pr_ctrl, pr_status;
> > +
> > + if (!(info->flags & FPGA_MGR_PARTIAL_RECONFIG)) {
> > + dev_err(dev, "only support partial reconfiguration.\n");
>
> supports
>
> > + return -EINVAL;
> > + }
> > +
> > + dev_dbg(dev, "resetting PR before initiated PR\n");
> > +
> > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > + pr_ctrl |= FME_PR_CTRL_PR_RST;
> > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > +
> > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> > + pr_ctrl & FME_PR_CTRL_PR_RSTACK, 1,
> > + PR_WAIT_TIMEOUT)) {
> > + dev_err(dev, "maximum PR timeout\n");
>
> We have two dev_err's with the same "maximum PR timeout" so the user
> would not be able to see at which point the timeout occurred.
>
> I suggest to get rid of the word 'maximum' for both. This one could
> be 'reset ack timeout" or something similar.

Sure, will switch to a more specific message per your suggestion. Thanks.

>
> > + return -ETIMEDOUT;
> > + }
> > +
> > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > + pr_ctrl &= ~FME_PR_CTRL_PR_RST;
> > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > +
> > + dev_dbg(dev,
> > + "waiting for PR resource in HW to be initialized and ready\n");
> > +
> > + if (readq_poll_timeout(fme_pr + FME_PR_STS, pr_status,
> > + (pr_status & FME_PR_STS_PR_STS) ==
> > + FME_PR_STS_PR_STS_IDLE, 1, PR_WAIT_TIMEOUT)) {
> > + dev_err(dev, "maximum PR timeout\n");
>
> "PR_STS timeout"? Or something better.
>
> > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > + return -ETIMEDOUT;
> > + }
> > +
> > + dev_dbg(dev, "check and clear previous PR error\n");
> > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > + if (priv->pr_error)
> > + dev_dbg(dev, "previous PR error detected %llx\n",
> > + (unsigned long long)priv->pr_error);
> > +
> > + dev_dbg(dev, "set PR port ID\n");
> > +
> > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > + pr_ctrl &= ~FME_PR_CTRL_PR_RGN_ID;
> > + pr_ctrl |= FIELD_PREP(FME_PR_CTRL_PR_RGN_ID, info->region_id);
> > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > +
> > + return 0;
> > +}
> > +
> > +static int fme_mgr_write(struct fpga_manager *mgr,
> > + const char *buf, size_t count)
> > +{
> > + struct device *dev = &mgr->dev;
> > + struct fme_mgr_priv *priv = mgr->priv;
> > + void __iomem *fme_pr = priv->ioaddr;
> > + u64 pr_ctrl, pr_status, pr_data;
> > + int delay = 0, pr_credit, i = 0;
> > +
> > + dev_dbg(dev, "start request\n");
> > +
> > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > + pr_ctrl |= FME_PR_CTRL_PR_START;
> > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > +
> > + dev_dbg(dev, "pushing data from bitstream to HW\n");
> > +
> > + pr_status = readq(fme_pr + FME_PR_STS);
> > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> > +
> > + while (count > 0) {
> > + while (pr_credit <= 1) {
> > + if (delay++ > PR_WAIT_TIMEOUT) {
> > + dev_err(dev, "maximum try\n");
>
> It looks like this is waiting for an entry in a queue and timing out
> here. Could you add a some comments for pr_credit above and this
> loop? Also a better error message, perhaps "PR credit timeout"?

Driver needs to read the PR credit to know if it could push PR data
to hardware or not. I will add more description here on this PR credit,
and use "PR credit timeout" as error message.

>
> > + return -ETIMEDOUT;
> > + }
> > + udelay(1);
> > +
> > + pr_status = readq(fme_pr + FME_PR_STS);
> > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> > + }
> > +
> > + if (count >= 4) {
> > + pr_data = 0;
> > + pr_data |= FIELD_PREP(FME_PR_DATA_PR_DATA_RAW,
> > + *(((u32 *)buf) + i));
> > + writeq(pr_data, fme_pr + FME_PR_DATA);
> > + count -= 4;
> > + pr_credit--;
> > + i++;
> > + } else {
> > + WARN_ON(1);
> > + return -EINVAL;
> > + }
> > + }
> > +
> > + return 0;
> > +}
> > +
> > +static int fme_mgr_write_complete(struct fpga_manager *mgr,
> > + struct fpga_image_info *info)
> > +{
> > + struct device *dev = &mgr->dev;
> > + struct fme_mgr_priv *priv = mgr->priv;
> > + void __iomem *fme_pr = priv->ioaddr;
> > + u64 pr_ctrl;
> > +
> > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > + pr_ctrl |= FME_PR_CTRL_PR_COMPLETE;
> > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > +
> > + dev_dbg(dev, "green bitstream push complete\n");
> > + dev_dbg(dev, "waiting for HW to release PR resource\n");
> > +
> > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> > + !(pr_ctrl & FME_PR_CTRL_PR_START), 1,
> > + PR_WAIT_TIMEOUT)) {
> > + dev_err(dev, "maximum try.\n");
>
> Some message specific to here also, please.
>
> > + return -ETIMEDOUT;
> > + }
> > +
> > + dev_dbg(dev, "PR operation complete, checking status\n");
> > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > + if (priv->pr_error) {
> > + dev_dbg(dev, "PR error detected %llx\n",
> > + (unsigned long long)priv->pr_error);
> > + return -EIO;
> > + }
> > +
> > + dev_dbg(dev, "PR done successfully\n");
> > +
> > + return 0;
> > +}
> > +
> > +static enum fpga_mgr_states fme_mgr_state(struct fpga_manager *mgr)
> > +{
> > + return FPGA_MGR_STATE_UNKNOWN;
> > +}
> > +
> > +static u64 fme_mgr_status(struct fpga_manager *mgr)
> > +{
> > + struct fme_mgr_priv *priv = mgr->priv;
> > +
> > + return pr_error_to_mgr_status(priv->pr_error);
> > +}
> > +
> > +static const struct fpga_manager_ops fme_mgr_ops = {
> > + .write_init = fme_mgr_write_init,
> > + .write = fme_mgr_write,
> > + .write_complete = fme_mgr_write_complete,
> > + .state = fme_mgr_state,
> > + .status = fme_mgr_status,
> > +};
> > +
> > +static int fme_mgr_probe(struct platform_device *pdev)
> > +{
> > + struct device *dev = &pdev->dev;
> > + struct fme_mgr_priv *priv;
> > + struct fpga_manager *mgr;
> > + struct resource *res;
> > + int ret;
> > +
> > + priv = devm_kzalloc(dev, sizeof(*priv), GFP_KERNEL);
> > + if (!priv)
> > + return -ENOMEM;
> > +
> > + res = platform_get_resource(pdev, IORESOURCE_MEM, 0);
> > + priv->ioaddr = devm_ioremap(dev, res->start, resource_size(res));
> > + if (IS_ERR(priv->ioaddr))
> > + return PTR_ERR(priv->ioaddr);
> > +
> > + ret = sysfs_create_files(&pdev->dev.kobj, fme_mgr_attrs);
> > + if (ret)
> > + return ret;
> > +
> > + mgr = devm_kzalloc(dev, sizeof(*mgr), GFP_KERNEL);
> > + if (!mgr)
> > + goto sysfs_remove_exit;
> > +
> > + mgr->name = "DFL FPGA Manager";
> > + mgr->mops = &fme_mgr_ops;
> > + mgr->priv = priv;
> > + mgr->parent = dev;
> > + platform_set_drvdata(pdev, mgr);
> > +
> > + ret = fpga_mgr_register(mgr);
> > + if (ret) {
> > + dev_err(dev, "unable to register FPGA manager\n");
> > + goto sysfs_remove_exit;
> > + }
> > +
> > + return 0;
> > +
> > +sysfs_remove_exit:
> > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> > + return ret;
> > +}
> > +
> > +static int fme_mgr_remove(struct platform_device *pdev)
> > +{
> > + struct fpga_manager *mgr = platform_get_drvdata(pdev);
> > +
> > + fpga_mgr_unregister(mgr);
> > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> > +
> > + return 0;
> > +}
> > +
> > +static struct platform_driver fme_mgr_driver = {
> > + .driver = {
> > + .name = FPGA_DFL_FME_MGR,
> > + },
> > + .probe = fme_mgr_probe,
> > + .remove = fme_mgr_remove,
> > +};
> > +
> > +module_platform_driver(fme_mgr_driver);
> > +
> > +MODULE_DESCRIPTION("FPGA Manager for FPGA Management Engine");
> > +MODULE_AUTHOR("Intel Corporation");
> > +MODULE_LICENSE("GPL v2");
> > +MODULE_ALIAS("platform:fpga-dfl-fme-mgr");
> > diff --git a/drivers/fpga/fpga-dfl.h b/drivers/fpga/fpga-dfl.h
> > index e5a1094..d45eb82 100644
> > --- a/drivers/fpga/fpga-dfl.h
> > +++ b/drivers/fpga/fpga-dfl.h
> > @@ -130,7 +130,44 @@
> >
> > /* FME Partial Reconfiguration Sub Feature Register Set */
> > #define FME_PR_DFH DFH
> > -#define FME_PR_SIZE DFH_SIZE
> > +#define FME_PR_CTRL 0x8
> > +#define FME_PR_STS 0x10
> > +#define FME_PR_DATA 0x18
> > +#define FME_PR_ERR 0x20
> > +#define FME_PR_INTFC_ID_H 0xA8
> > +#define FME_PR_INTFC_ID_L 0xB0
> > +#define FME_PR_SIZE 0xB8
> > +
> > +/* FME PR Control Register Bitfield */
> > +#define FME_PR_CTRL_PR_RST BIT(0) /* Reset PR engine */
> > +#define FME_PR_CTRL_PR_RSTACK BIT(4) /* Ack for PR engine reset */
> > +#define FME_PR_CTRL_PR_RGN_ID GENMASK_ULL(9, 7) /* PR Region ID */
> > +#define FME_PR_CTRL_PR_START BIT(12) /* Start to request for PR service */
> > +#define FME_PR_CTRL_PR_COMPLETE BIT(13) /* PR data push complete notification */
> > +
> > +/* FME PR Status Register Bitfield */
> > +/* Number of available entries in HW queue inside the PR engine. */
> > +#define FME_PR_STS_PR_CREDIT GENMASK_ULL(8, 0)
> > +#define FME_PR_STS_PR_STS BIT(16) /* PR operation status */
> > +#define FME_PR_STS_PR_STS_IDLE 0
> > +#define FME_PR_STS_PR_CTRLR_STS GENMASK_ULL(22, 20) /* Controller status */
> > +#define FME_PR_STS_PR_HOST_STS GENMASK_ULL(27, 24) /* PR host status */
> > +
> > +/* FME PR Data Register Bitfield */
> > +/* PR data from the raw-binary file. */
> > +#define FME_PR_DATA_PR_DATA_RAW GENMASK_ULL(32, 0)
> > +
> > +/* FME PR Error Register */
> > +/* Previous PR Operation errors detected. */
> > +#define FME_PR_ERR_OPERATION_ERR BIT(0)
> > +/* CRC error detected. */
> > +#define FME_PR_ERR_CRC_ERR BIT(1)
> > +/* Incompatible PR bitstream detected. */
> > +#define FME_PR_ERR_INCOMPATIBLE_BS BIT(2)
> > +/* PR data push protocol violated. */
> > +#define FME_PR_ERR_PROTOCOL_ERR BIT(3)
> > +/* PR data fifo overflow error detected */
> > +#define FME_PR_ERR_FIFO_OVERFLOW BIT(4)
> >
> > /* FME HSSI Sub Feature Register Set */
> > #define FME_HSSI_DFH DFH
>
> I see fpga-dfl.h as enumeration code which is separate from any driver
> implementation specifics other than what's required for the DFL
> enumeration scheme. These PR engine #defines should move to a .h
> that is dedicated to this specific PR hardware device. If someone else
> adds a different PR device to the framework, their PR driver would
> also have its own .h. Same for any other modules that aren't central
> to DFL enumeration.

Sure, will move PR related register to a separated header file.
DFL enumeration related registers, will still be kept in fpga-dfl.h.

Thanks
Hao

>
> Thanks,
> Alan
>
> > --
> > 1.8.3.1
> >

2018-02-03 01:27:29

by Luebbers, Enno

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

Hi Hao, Alan,

On Fri, Feb 02, 2018 at 05:42:13PM +0800, Wu Hao wrote:
> On Thu, Feb 01, 2018 at 04:00:36PM -0600, Alan Tull wrote:
> > On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:
> >
> > Hi Hao,
> >
> > A few comments below. Besides that, looks good.
> >
> > > This patch adds fpga manager driver for FPGA Management Engine (FME). It
> > > implements fpga_manager_ops for FPGA Partial Reconfiguration function.
> > >
> > > Signed-off-by: Tim Whisonant <[email protected]>
> > > Signed-off-by: Enno Luebbers <[email protected]>
> > > Signed-off-by: Shiva Rao <[email protected]>
> > > Signed-off-by: Christopher Rauer <[email protected]>
> > > Signed-off-by: Kang Luwei <[email protected]>
> > > Signed-off-by: Xiao Guangrong <[email protected]>
> > > Signed-off-by: Wu Hao <[email protected]>
> > > ----
> > > v3: rename driver to dfl-fpga-fme-mgr
> > > implemented status callback for fpga manager
> > > rebased due to fpga api changes
> > > ---
> > > .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
> > > drivers/fpga/Kconfig | 6 +
> > > drivers/fpga/Makefile | 1 +
> > > drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
> > > drivers/fpga/fpga-dfl.h | 39 ++-
> > > 5 files changed, 371 insertions(+), 1 deletion(-)
> > > create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
> > >
> > > diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > new file mode 100644
> > > index 0000000..2d4f917
> > > --- /dev/null
> > > +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > @@ -0,0 +1,8 @@
> > > +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
> > > +Date: November 2017
> > > +KernelVersion: 4.15
> > > +Contact: Wu Hao <[email protected]>
> > > +Description: Read-only. It returns interface id of partial reconfiguration
> > > + hardware. Userspace could use this information to check if
> > > + current hardware is compatible with given image before FPGA
> > > + programming.
> >
> > I'm a little confused by this. I can understand that the PR bitstream
> > has a dependency on the FPGA's static image, but I don't understand
> > the dependency of the bistream on the hardware that is used to program
> > the bitstream to the FPGA.
>
> Sorry for the confusion, the interface_id is used to indicate the version of
> the hardware for partial reconfiguration (it's part of the static image of
> the FPGA device). Will improve the description on this.
>

The interface_id expresses the compatibility of the static region with PR
bitstreams generated for it. It changes every time a new static region is
generated.

Would it make more sense to have the interface_id exposed as part of the FME
device (which represents the static region)? I'm not sure - it kind of also
makes sense here, where you would have all the information in one place (if the
interface_id matches, I can use this component to program a bitstream).

Sorry for my limited understanding of the infrastructure - would this same
"fpga-dfl-fme-mgr.0" be used for PR if we had multiple PR regions? In that case
it would need to expose multiple interface_ids (or we'd have to track both
interface IDs and an identifier for the target PR region).

> >
> > > diff --git a/drivers/fpga/Kconfig b/drivers/fpga/Kconfig
> > > index 57da904..0171ecb 100644
> > > --- a/drivers/fpga/Kconfig
> > > +++ b/drivers/fpga/Kconfig
> > > @@ -150,6 +150,12 @@ config FPGA_DFL_FME
> > > FPGA platform level management features. There shall be 1 FME
> > > per DFL based FPGA device.
> > >
> > > +config FPGA_DFL_FME_MGR
> > > + tristate "FPGA DFL FME Manager Driver"
> > > + depends on FPGA_DFL_FME
> > > + help
> > > + Say Y to enable FPGA Manager driver for FPGA Management Engine.
> > > +
> > > config INTEL_FPGA_DFL_PCI
> > > tristate "Intel FPGA DFL PCIe Device Driver"
> > > depends on PCI && FPGA_DFL
> > > diff --git a/drivers/fpga/Makefile b/drivers/fpga/Makefile
> > > index cc75bb3..6378580 100644
> > > --- a/drivers/fpga/Makefile
> > > +++ b/drivers/fpga/Makefile
> > > @@ -31,6 +31,7 @@ obj-$(CONFIG_OF_FPGA_REGION) += of-fpga-region.o
> > > # FPGA Device Feature List Support
> > > obj-$(CONFIG_FPGA_DFL) += fpga-dfl.o
> > > obj-$(CONFIG_FPGA_DFL_FME) += fpga-dfl-fme.o
> > > +obj-$(CONFIG_FPGA_DFL_FME_MGR) += fpga-dfl-fme-mgr.o
> > >
> > > fpga-dfl-fme-objs := dfl-fme-main.o dfl-fme-pr.o
> > >
> > > diff --git a/drivers/fpga/fpga-dfl-fme-mgr.c b/drivers/fpga/fpga-dfl-fme-mgr.c
> > > new file mode 100644
> > > index 0000000..70356ce
> > > --- /dev/null
> > > +++ b/drivers/fpga/fpga-dfl-fme-mgr.c
> > > @@ -0,0 +1,318 @@
> > > +/*
> > > + * FPGA Manager Driver for FPGA Management Engine (FME)
> > > + *
> > > + * Copyright (C) 2017 Intel Corporation, Inc.
> > > + *
> > > + * Authors:
> > > + * Kang Luwei <[email protected]>
> > > + * Xiao Guangrong <[email protected]>
> > > + * Wu Hao <[email protected]>
> > > + * Joseph Grecco <[email protected]>
> > > + * Enno Luebbers <[email protected]>
> > > + * Tim Whisonant <[email protected]>
> > > + * Ananda Ravuri <[email protected]>
> > > + * Christopher Rauer <[email protected]>
> > > + * Henry Mitchel <[email protected]>
> > > + *
> > > + * This work is licensed under the terms of the GNU GPL version 2.
> > > + * SPDX-License-Identifier: GPL-2.0
> > > + */
> > > +
> > > +#include <linux/module.h>
> > > +#include <linux/iopoll.h>
> > > +#include <linux/fpga/fpga-mgr.h>
> > > +
> > > +#include "fpga-dfl.h"
> > > +#include "dfl-fme.h"
> > > +
> > > +#define PR_WAIT_TIMEOUT 8000000
> > > +#define PR_HOST_STATUS_IDLE 0
> > > +
> > > +struct fme_mgr_priv {
> > > + void __iomem *ioaddr;
> > > + u64 pr_error;
> > > +};
> > > +
> > > +static ssize_t interface_id_show(struct device *dev,
> > > + struct device_attribute *attr, char *buf)
> > > +{
> > > + struct fpga_manager *mgr = dev_get_drvdata(dev);
> > > + struct fme_mgr_priv *priv = mgr->priv;
> > > + u64 intfc_id_l, intfc_id_h;
> > > +
> > > + intfc_id_l = readq(priv->ioaddr + FME_PR_INTFC_ID_L);
> > > + intfc_id_h = readq(priv->ioaddr + FME_PR_INTFC_ID_H);
> > > +
> > > + return scnprintf(buf, PAGE_SIZE, "%016llx%016llx\n",
> > > + (unsigned long long)intfc_id_h,
> > > + (unsigned long long)intfc_id_l);
> > > +}
> > > +static DEVICE_ATTR_RO(interface_id);
> > > +
> > > +static const struct attribute *fme_mgr_attrs[] = {
> > > + &dev_attr_interface_id.attr,
> > > + NULL,
> > > +};
> > > +
> > > +static u64 pr_error_to_mgr_status(u64 err)
> > > +{
> > > + u64 status = 0;
> > > +
> > > + if (err & FME_PR_ERR_OPERATION_ERR)
> > > + status |= FPGA_MGR_STATUS_OPERATION_ERR;
> > > + if (err & FME_PR_ERR_CRC_ERR)
> > > + status |= FPGA_MGR_STATUS_CRC_ERR;
> > > + if (err & FME_PR_ERR_INCOMPATIBLE_BS)
> > > + status |= FPGA_MGR_STATUS_INCOMPATIBLE_IMAGE_ERR;
> > > + if (err & FME_PR_ERR_PROTOCOL_ERR)
> > > + status |= FPGA_MGR_STATUS_IP_PROTOCOL_ERR;
> > > + if (err & FME_PR_ERR_FIFO_OVERFLOW)
> > > + status |= FPGA_MGR_STATUS_FIFO_OVERFLOW_ERR;
> > > +
> > > + return status;
> > > +}
> > > +
> > > +static u64 fme_mgr_pr_error_handle(void __iomem *fme_pr)
> > > +{
> > > + u64 pr_status, pr_error;
> > > +
> > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > + if (!(pr_status & FME_PR_STS_PR_STS))
> > > + return 0;
> > > +
> > > + pr_error = readq(fme_pr + FME_PR_ERR);
> > > + writeq(pr_error, fme_pr + FME_PR_ERR);
> > > +
> > > + return pr_error;
> > > +}
> > > +
> > > +static int fme_mgr_write_init(struct fpga_manager *mgr,
> > > + struct fpga_image_info *info,
> > > + const char *buf, size_t count)
> > > +{
> > > + struct device *dev = &mgr->dev;
> > > + struct fme_mgr_priv *priv = mgr->priv;
> > > + void __iomem *fme_pr = priv->ioaddr;
> > > + u64 pr_ctrl, pr_status;
> > > +
> > > + if (!(info->flags & FPGA_MGR_PARTIAL_RECONFIG)) {
> > > + dev_err(dev, "only support partial reconfiguration.\n");
> >
> > supports
> >
> > > + return -EINVAL;
> > > + }
> > > +
> > > + dev_dbg(dev, "resetting PR before initiated PR\n");
> > > +
> > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > + pr_ctrl |= FME_PR_CTRL_PR_RST;
> > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > +
> > > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> > > + pr_ctrl & FME_PR_CTRL_PR_RSTACK, 1,
> > > + PR_WAIT_TIMEOUT)) {
> > > + dev_err(dev, "maximum PR timeout\n");
> >
> > We have two dev_err's with the same "maximum PR timeout" so the user
> > would not be able to see at which point the timeout occurred.
> >
> > I suggest to get rid of the word 'maximum' for both. This one could
> > be 'reset ack timeout" or something similar.
>
> Sure, will switch to a more specific message per your suggestion. Thanks.
>
> >
> > > + return -ETIMEDOUT;
> > > + }
> > > +
> > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > + pr_ctrl &= ~FME_PR_CTRL_PR_RST;
> > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > +
> > > + dev_dbg(dev,
> > > + "waiting for PR resource in HW to be initialized and ready\n");
> > > +
> > > + if (readq_poll_timeout(fme_pr + FME_PR_STS, pr_status,
> > > + (pr_status & FME_PR_STS_PR_STS) ==
> > > + FME_PR_STS_PR_STS_IDLE, 1, PR_WAIT_TIMEOUT)) {
> > > + dev_err(dev, "maximum PR timeout\n");
> >
> > "PR_STS timeout"? Or something better.
> >
> > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > + return -ETIMEDOUT;
> > > + }
> > > +
> > > + dev_dbg(dev, "check and clear previous PR error\n");
> > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > + if (priv->pr_error)
> > > + dev_dbg(dev, "previous PR error detected %llx\n",
> > > + (unsigned long long)priv->pr_error);
> > > +
> > > + dev_dbg(dev, "set PR port ID\n");
> > > +
> > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > + pr_ctrl &= ~FME_PR_CTRL_PR_RGN_ID;
> > > + pr_ctrl |= FIELD_PREP(FME_PR_CTRL_PR_RGN_ID, info->region_id);
> > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > +
> > > + return 0;
> > > +}
> > > +
> > > +static int fme_mgr_write(struct fpga_manager *mgr,
> > > + const char *buf, size_t count)
> > > +{
> > > + struct device *dev = &mgr->dev;
> > > + struct fme_mgr_priv *priv = mgr->priv;
> > > + void __iomem *fme_pr = priv->ioaddr;
> > > + u64 pr_ctrl, pr_status, pr_data;
> > > + int delay = 0, pr_credit, i = 0;
> > > +
> > > + dev_dbg(dev, "start request\n");
> > > +
> > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > + pr_ctrl |= FME_PR_CTRL_PR_START;
> > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > +
> > > + dev_dbg(dev, "pushing data from bitstream to HW\n");
> > > +
> > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> > > +
> > > + while (count > 0) {
> > > + while (pr_credit <= 1) {
> > > + if (delay++ > PR_WAIT_TIMEOUT) {
> > > + dev_err(dev, "maximum try\n");
> >
> > It looks like this is waiting for an entry in a queue and timing out
> > here. Could you add a some comments for pr_credit above and this
> > loop? Also a better error message, perhaps "PR credit timeout"?
>
> Driver needs to read the PR credit to know if it could push PR data
> to hardware or not. I will add more description here on this PR credit,
> and use "PR credit timeout" as error message.
>
> >
> > > + return -ETIMEDOUT;
> > > + }
> > > + udelay(1);
> > > +
> > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> > > + }
> > > +
> > > + if (count >= 4) {
> > > + pr_data = 0;
> > > + pr_data |= FIELD_PREP(FME_PR_DATA_PR_DATA_RAW,
> > > + *(((u32 *)buf) + i));
> > > + writeq(pr_data, fme_pr + FME_PR_DATA);
> > > + count -= 4;
> > > + pr_credit--;
> > > + i++;
> > > + } else {
> > > + WARN_ON(1);
> > > + return -EINVAL;
> > > + }
> > > + }
> > > +
> > > + return 0;
> > > +}
> > > +
> > > +static int fme_mgr_write_complete(struct fpga_manager *mgr,
> > > + struct fpga_image_info *info)
> > > +{
> > > + struct device *dev = &mgr->dev;
> > > + struct fme_mgr_priv *priv = mgr->priv;
> > > + void __iomem *fme_pr = priv->ioaddr;
> > > + u64 pr_ctrl;
> > > +
> > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > + pr_ctrl |= FME_PR_CTRL_PR_COMPLETE;
> > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > +
> > > + dev_dbg(dev, "green bitstream push complete\n");
> > > + dev_dbg(dev, "waiting for HW to release PR resource\n");
> > > +
> > > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> > > + !(pr_ctrl & FME_PR_CTRL_PR_START), 1,
> > > + PR_WAIT_TIMEOUT)) {
> > > + dev_err(dev, "maximum try.\n");
> >
> > Some message specific to here also, please.
> >
> > > + return -ETIMEDOUT;
> > > + }
> > > +
> > > + dev_dbg(dev, "PR operation complete, checking status\n");
> > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > + if (priv->pr_error) {
> > > + dev_dbg(dev, "PR error detected %llx\n",
> > > + (unsigned long long)priv->pr_error);
> > > + return -EIO;
> > > + }
> > > +
> > > + dev_dbg(dev, "PR done successfully\n");
> > > +
> > > + return 0;
> > > +}
> > > +
> > > +static enum fpga_mgr_states fme_mgr_state(struct fpga_manager *mgr)
> > > +{
> > > + return FPGA_MGR_STATE_UNKNOWN;
> > > +}
> > > +
> > > +static u64 fme_mgr_status(struct fpga_manager *mgr)
> > > +{
> > > + struct fme_mgr_priv *priv = mgr->priv;
> > > +
> > > + return pr_error_to_mgr_status(priv->pr_error);
> > > +}
> > > +
> > > +static const struct fpga_manager_ops fme_mgr_ops = {
> > > + .write_init = fme_mgr_write_init,
> > > + .write = fme_mgr_write,
> > > + .write_complete = fme_mgr_write_complete,
> > > + .state = fme_mgr_state,
> > > + .status = fme_mgr_status,
> > > +};
> > > +
> > > +static int fme_mgr_probe(struct platform_device *pdev)
> > > +{
> > > + struct device *dev = &pdev->dev;
> > > + struct fme_mgr_priv *priv;
> > > + struct fpga_manager *mgr;
> > > + struct resource *res;
> > > + int ret;
> > > +
> > > + priv = devm_kzalloc(dev, sizeof(*priv), GFP_KERNEL);
> > > + if (!priv)
> > > + return -ENOMEM;
> > > +
> > > + res = platform_get_resource(pdev, IORESOURCE_MEM, 0);
> > > + priv->ioaddr = devm_ioremap(dev, res->start, resource_size(res));
> > > + if (IS_ERR(priv->ioaddr))
> > > + return PTR_ERR(priv->ioaddr);
> > > +
> > > + ret = sysfs_create_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > + if (ret)
> > > + return ret;
> > > +
> > > + mgr = devm_kzalloc(dev, sizeof(*mgr), GFP_KERNEL);
> > > + if (!mgr)
> > > + goto sysfs_remove_exit;
> > > +
> > > + mgr->name = "DFL FPGA Manager";
> > > + mgr->mops = &fme_mgr_ops;
> > > + mgr->priv = priv;
> > > + mgr->parent = dev;
> > > + platform_set_drvdata(pdev, mgr);
> > > +
> > > + ret = fpga_mgr_register(mgr);
> > > + if (ret) {
> > > + dev_err(dev, "unable to register FPGA manager\n");
> > > + goto sysfs_remove_exit;
> > > + }
> > > +
> > > + return 0;
> > > +
> > > +sysfs_remove_exit:
> > > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > + return ret;
> > > +}
> > > +
> > > +static int fme_mgr_remove(struct platform_device *pdev)
> > > +{
> > > + struct fpga_manager *mgr = platform_get_drvdata(pdev);
> > > +
> > > + fpga_mgr_unregister(mgr);
> > > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > +
> > > + return 0;
> > > +}
> > > +
> > > +static struct platform_driver fme_mgr_driver = {
> > > + .driver = {
> > > + .name = FPGA_DFL_FME_MGR,
> > > + },
> > > + .probe = fme_mgr_probe,
> > > + .remove = fme_mgr_remove,
> > > +};
> > > +
> > > +module_platform_driver(fme_mgr_driver);
> > > +
> > > +MODULE_DESCRIPTION("FPGA Manager for FPGA Management Engine");
> > > +MODULE_AUTHOR("Intel Corporation");
> > > +MODULE_LICENSE("GPL v2");
> > > +MODULE_ALIAS("platform:fpga-dfl-fme-mgr");
> > > diff --git a/drivers/fpga/fpga-dfl.h b/drivers/fpga/fpga-dfl.h
> > > index e5a1094..d45eb82 100644
> > > --- a/drivers/fpga/fpga-dfl.h
> > > +++ b/drivers/fpga/fpga-dfl.h
> > > @@ -130,7 +130,44 @@
> > >
> > > /* FME Partial Reconfiguration Sub Feature Register Set */
> > > #define FME_PR_DFH DFH
> > > -#define FME_PR_SIZE DFH_SIZE
> > > +#define FME_PR_CTRL 0x8
> > > +#define FME_PR_STS 0x10
> > > +#define FME_PR_DATA 0x18
> > > +#define FME_PR_ERR 0x20
> > > +#define FME_PR_INTFC_ID_H 0xA8
> > > +#define FME_PR_INTFC_ID_L 0xB0
> > > +#define FME_PR_SIZE 0xB8
> > > +
> > > +/* FME PR Control Register Bitfield */
> > > +#define FME_PR_CTRL_PR_RST BIT(0) /* Reset PR engine */
> > > +#define FME_PR_CTRL_PR_RSTACK BIT(4) /* Ack for PR engine reset */
> > > +#define FME_PR_CTRL_PR_RGN_ID GENMASK_ULL(9, 7) /* PR Region ID */
> > > +#define FME_PR_CTRL_PR_START BIT(12) /* Start to request for PR service */
> > > +#define FME_PR_CTRL_PR_COMPLETE BIT(13) /* PR data push complete notification */
> > > +
> > > +/* FME PR Status Register Bitfield */
> > > +/* Number of available entries in HW queue inside the PR engine. */
> > > +#define FME_PR_STS_PR_CREDIT GENMASK_ULL(8, 0)
> > > +#define FME_PR_STS_PR_STS BIT(16) /* PR operation status */
> > > +#define FME_PR_STS_PR_STS_IDLE 0
> > > +#define FME_PR_STS_PR_CTRLR_STS GENMASK_ULL(22, 20) /* Controller status */
> > > +#define FME_PR_STS_PR_HOST_STS GENMASK_ULL(27, 24) /* PR host status */
> > > +
> > > +/* FME PR Data Register Bitfield */
> > > +/* PR data from the raw-binary file. */
> > > +#define FME_PR_DATA_PR_DATA_RAW GENMASK_ULL(32, 0)
> > > +
> > > +/* FME PR Error Register */
> > > +/* Previous PR Operation errors detected. */
> > > +#define FME_PR_ERR_OPERATION_ERR BIT(0)
> > > +/* CRC error detected. */
> > > +#define FME_PR_ERR_CRC_ERR BIT(1)
> > > +/* Incompatible PR bitstream detected. */
> > > +#define FME_PR_ERR_INCOMPATIBLE_BS BIT(2)
> > > +/* PR data push protocol violated. */
> > > +#define FME_PR_ERR_PROTOCOL_ERR BIT(3)
> > > +/* PR data fifo overflow error detected */
> > > +#define FME_PR_ERR_FIFO_OVERFLOW BIT(4)
> > >
> > > /* FME HSSI Sub Feature Register Set */
> > > #define FME_HSSI_DFH DFH
> >
> > I see fpga-dfl.h as enumeration code which is separate from any driver
> > implementation specifics other than what's required for the DFL
> > enumeration scheme. These PR engine #defines should move to a .h
> > that is dedicated to this specific PR hardware device. If someone else
> > adds a different PR device to the framework, their PR driver would
> > also have its own .h. Same for any other modules that aren't central
> > to DFL enumeration.
>
> Sure, will move PR related register to a separated header file.
> DFL enumeration related registers, will still be kept in fpga-dfl.h.
>
> Thanks
> Hao
>
> >
> > Thanks,
> > Alan
> >
> > > --
> > > 1.8.3.1
> > >

2018-02-03 10:56:41

by Moritz Fischer

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

Hi Hao,

On Fri, Feb 02, 2018 at 04:26:26PM -0800, Luebbers, Enno wrote:
> Hi Hao, Alan,
>
> On Fri, Feb 02, 2018 at 05:42:13PM +0800, Wu Hao wrote:
> > On Thu, Feb 01, 2018 at 04:00:36PM -0600, Alan Tull wrote:
> > > On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:
> > >
> > > Hi Hao,
> > >
> > > A few comments below. Besides that, looks good.
> > >
> > > > This patch adds fpga manager driver for FPGA Management Engine (FME). It
> > > > implements fpga_manager_ops for FPGA Partial Reconfiguration function.
> > > >
> > > > Signed-off-by: Tim Whisonant <[email protected]>
> > > > Signed-off-by: Enno Luebbers <[email protected]>
> > > > Signed-off-by: Shiva Rao <[email protected]>
> > > > Signed-off-by: Christopher Rauer <[email protected]>
> > > > Signed-off-by: Kang Luwei <[email protected]>
> > > > Signed-off-by: Xiao Guangrong <[email protected]>
> > > > Signed-off-by: Wu Hao <[email protected]>
> > > > ----
> > > > v3: rename driver to dfl-fpga-fme-mgr
> > > > implemented status callback for fpga manager
> > > > rebased due to fpga api changes
> > > > ---
> > > > .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
> > > > drivers/fpga/Kconfig | 6 +
> > > > drivers/fpga/Makefile | 1 +
> > > > drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
> > > > drivers/fpga/fpga-dfl.h | 39 ++-
> > > > 5 files changed, 371 insertions(+), 1 deletion(-)
> > > > create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > > create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
> > > >
> > > > diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > > new file mode 100644
> > > > index 0000000..2d4f917
> > > > --- /dev/null
> > > > +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > > @@ -0,0 +1,8 @@
> > > > +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
> > > > +Date: November 2017
> > > > +KernelVersion: 4.15
> > > > +Contact: Wu Hao <[email protected]>
> > > > +Description: Read-only. It returns interface id of partial reconfiguration
> > > > + hardware. Userspace could use this information to check if
> > > > + current hardware is compatible with given image before FPGA
> > > > + programming.
> > >
> > > I'm a little confused by this. I can understand that the PR bitstream
> > > has a dependency on the FPGA's static image, but I don't understand
> > > the dependency of the bistream on the hardware that is used to program
> > > the bitstream to the FPGA.
> >
> > Sorry for the confusion, the interface_id is used to indicate the version of
> > the hardware for partial reconfiguration (it's part of the static image of
> > the FPGA device). Will improve the description on this.

I'm not sure userland should be making the call on whether what you're
trying to load is compatible or not. Isn't there a way to check this in
your PR reconfiguration handler in-kernel?

> >
>
> The interface_id expresses the compatibility of the static region with PR
> bitstreams generated for it. It changes every time a new static region is
> generated.
>
> Would it make more sense to have the interface_id exposed as part of the FME
> device (which represents the static region)? I'm not sure - it kind of also
> makes sense here, where you would have all the information in one place (if the
> interface_id matches, I can use this component to program a bitstream).
>
> Sorry for my limited understanding of the infrastructure - would this same
> "fpga-dfl-fme-mgr.0" be used for PR if we had multiple PR regions? In that case
> it would need to expose multiple interface_ids (or we'd have to track both
> interface IDs and an identifier for the target PR region).
>
> > >
> > > > diff --git a/drivers/fpga/Kconfig b/drivers/fpga/Kconfig
> > > > index 57da904..0171ecb 100644
> > > > --- a/drivers/fpga/Kconfig
> > > > +++ b/drivers/fpga/Kconfig
> > > > @@ -150,6 +150,12 @@ config FPGA_DFL_FME
> > > > FPGA platform level management features. There shall be 1 FME
> > > > per DFL based FPGA device.
> > > >
> > > > +config FPGA_DFL_FME_MGR
> > > > + tristate "FPGA DFL FME Manager Driver"
> > > > + depends on FPGA_DFL_FME
> > > > + help
> > > > + Say Y to enable FPGA Manager driver for FPGA Management Engine.
> > > > +
> > > > config INTEL_FPGA_DFL_PCI
> > > > tristate "Intel FPGA DFL PCIe Device Driver"
> > > > depends on PCI && FPGA_DFL
> > > > diff --git a/drivers/fpga/Makefile b/drivers/fpga/Makefile
> > > > index cc75bb3..6378580 100644
> > > > --- a/drivers/fpga/Makefile
> > > > +++ b/drivers/fpga/Makefile
> > > > @@ -31,6 +31,7 @@ obj-$(CONFIG_OF_FPGA_REGION) += of-fpga-region.o
> > > > # FPGA Device Feature List Support
> > > > obj-$(CONFIG_FPGA_DFL) += fpga-dfl.o
> > > > obj-$(CONFIG_FPGA_DFL_FME) += fpga-dfl-fme.o
> > > > +obj-$(CONFIG_FPGA_DFL_FME_MGR) += fpga-dfl-fme-mgr.o
> > > >
> > > > fpga-dfl-fme-objs := dfl-fme-main.o dfl-fme-pr.o
> > > >
> > > > diff --git a/drivers/fpga/fpga-dfl-fme-mgr.c b/drivers/fpga/fpga-dfl-fme-mgr.c
> > > > new file mode 100644
> > > > index 0000000..70356ce
> > > > --- /dev/null
> > > > +++ b/drivers/fpga/fpga-dfl-fme-mgr.c
> > > > @@ -0,0 +1,318 @@
> > > > +/*
> > > > + * FPGA Manager Driver for FPGA Management Engine (FME)
> > > > + *
> > > > + * Copyright (C) 2017 Intel Corporation, Inc.
> > > > + *
> > > > + * Authors:
> > > > + * Kang Luwei <[email protected]>
> > > > + * Xiao Guangrong <[email protected]>
> > > > + * Wu Hao <[email protected]>
> > > > + * Joseph Grecco <[email protected]>
> > > > + * Enno Luebbers <[email protected]>
> > > > + * Tim Whisonant <[email protected]>
> > > > + * Ananda Ravuri <[email protected]>
> > > > + * Christopher Rauer <[email protected]>
> > > > + * Henry Mitchel <[email protected]>
> > > > + *
> > > > + * This work is licensed under the terms of the GNU GPL version 2.
> > > > + * SPDX-License-Identifier: GPL-2.0
> > > > + */
> > > > +
> > > > +#include <linux/module.h>
> > > > +#include <linux/iopoll.h>
> > > > +#include <linux/fpga/fpga-mgr.h>
> > > > +
> > > > +#include "fpga-dfl.h"
> > > > +#include "dfl-fme.h"
> > > > +
> > > > +#define PR_WAIT_TIMEOUT 8000000
> > > > +#define PR_HOST_STATUS_IDLE 0
> > > > +
> > > > +struct fme_mgr_priv {
> > > > + void __iomem *ioaddr;
> > > > + u64 pr_error;
> > > > +};
> > > > +
> > > > +static ssize_t interface_id_show(struct device *dev,
> > > > + struct device_attribute *attr, char *buf)
> > > > +{
> > > > + struct fpga_manager *mgr = dev_get_drvdata(dev);
> > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > + u64 intfc_id_l, intfc_id_h;
> > > > +
> > > > + intfc_id_l = readq(priv->ioaddr + FME_PR_INTFC_ID_L);
> > > > + intfc_id_h = readq(priv->ioaddr + FME_PR_INTFC_ID_H);
> > > > +
> > > > + return scnprintf(buf, PAGE_SIZE, "%016llx%016llx\n",
> > > > + (unsigned long long)intfc_id_h,
> > > > + (unsigned long long)intfc_id_l);
> > > > +}
> > > > +static DEVICE_ATTR_RO(interface_id);
> > > > +
> > > > +static const struct attribute *fme_mgr_attrs[] = {
> > > > + &dev_attr_interface_id.attr,
> > > > + NULL,
> > > > +};
> > > > +
> > > > +static u64 pr_error_to_mgr_status(u64 err)
> > > > +{
> > > > + u64 status = 0;
> > > > +
> > > > + if (err & FME_PR_ERR_OPERATION_ERR)
> > > > + status |= FPGA_MGR_STATUS_OPERATION_ERR;
> > > > + if (err & FME_PR_ERR_CRC_ERR)
> > > > + status |= FPGA_MGR_STATUS_CRC_ERR;
> > > > + if (err & FME_PR_ERR_INCOMPATIBLE_BS)
> > > > + status |= FPGA_MGR_STATUS_INCOMPATIBLE_IMAGE_ERR;
> > > > + if (err & FME_PR_ERR_PROTOCOL_ERR)
> > > > + status |= FPGA_MGR_STATUS_IP_PROTOCOL_ERR;
> > > > + if (err & FME_PR_ERR_FIFO_OVERFLOW)
> > > > + status |= FPGA_MGR_STATUS_FIFO_OVERFLOW_ERR;
> > > > +
> > > > + return status;
> > > > +}
> > > > +
> > > > +static u64 fme_mgr_pr_error_handle(void __iomem *fme_pr)
> > > > +{
> > > > + u64 pr_status, pr_error;
> > > > +
> > > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > > + if (!(pr_status & FME_PR_STS_PR_STS))
> > > > + return 0;
> > > > +
> > > > + pr_error = readq(fme_pr + FME_PR_ERR);
> > > > + writeq(pr_error, fme_pr + FME_PR_ERR);
> > > > +
> > > > + return pr_error;
> > > > +}
> > > > +
> > > > +static int fme_mgr_write_init(struct fpga_manager *mgr,
> > > > + struct fpga_image_info *info,
> > > > + const char *buf, size_t count)
> > > > +{
> > > > + struct device *dev = &mgr->dev;
> > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > + void __iomem *fme_pr = priv->ioaddr;
> > > > + u64 pr_ctrl, pr_status;
> > > > +
> > > > + if (!(info->flags & FPGA_MGR_PARTIAL_RECONFIG)) {
> > > > + dev_err(dev, "only support partial reconfiguration.\n");
> > >
> > > supports
> > >
> > > > + return -EINVAL;
> > > > + }
> > > > +
> > > > + dev_dbg(dev, "resetting PR before initiated PR\n");
> > > > +
> > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > + pr_ctrl |= FME_PR_CTRL_PR_RST;
> > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > +
> > > > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> > > > + pr_ctrl & FME_PR_CTRL_PR_RSTACK, 1,
> > > > + PR_WAIT_TIMEOUT)) {
> > > > + dev_err(dev, "maximum PR timeout\n");
> > >
> > > We have two dev_err's with the same "maximum PR timeout" so the user
> > > would not be able to see at which point the timeout occurred.
> > >
> > > I suggest to get rid of the word 'maximum' for both. This one could
> > > be 'reset ack timeout" or something similar.
> >
> > Sure, will switch to a more specific message per your suggestion. Thanks.
> >
> > >
> > > > + return -ETIMEDOUT;
> > > > + }
> > > > +
> > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > + pr_ctrl &= ~FME_PR_CTRL_PR_RST;
> > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > +
> > > > + dev_dbg(dev,
> > > > + "waiting for PR resource in HW to be initialized and ready\n");
> > > > +
> > > > + if (readq_poll_timeout(fme_pr + FME_PR_STS, pr_status,
> > > > + (pr_status & FME_PR_STS_PR_STS) ==
> > > > + FME_PR_STS_PR_STS_IDLE, 1, PR_WAIT_TIMEOUT)) {
> > > > + dev_err(dev, "maximum PR timeout\n");
> > >
> > > "PR_STS timeout"? Or something better.
> > >
> > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > > + return -ETIMEDOUT;
> > > > + }
> > > > +
> > > > + dev_dbg(dev, "check and clear previous PR error\n");
> > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > > + if (priv->pr_error)
> > > > + dev_dbg(dev, "previous PR error detected %llx\n",
> > > > + (unsigned long long)priv->pr_error);
> > > > +
> > > > + dev_dbg(dev, "set PR port ID\n");
> > > > +
> > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > + pr_ctrl &= ~FME_PR_CTRL_PR_RGN_ID;
> > > > + pr_ctrl |= FIELD_PREP(FME_PR_CTRL_PR_RGN_ID, info->region_id);
> > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > +
> > > > + return 0;
> > > > +}
> > > > +
> > > > +static int fme_mgr_write(struct fpga_manager *mgr,
> > > > + const char *buf, size_t count)
> > > > +{
> > > > + struct device *dev = &mgr->dev;
> > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > + void __iomem *fme_pr = priv->ioaddr;
> > > > + u64 pr_ctrl, pr_status, pr_data;
> > > > + int delay = 0, pr_credit, i = 0;
> > > > +
> > > > + dev_dbg(dev, "start request\n");
> > > > +
> > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > + pr_ctrl |= FME_PR_CTRL_PR_START;
> > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > +
> > > > + dev_dbg(dev, "pushing data from bitstream to HW\n");
> > > > +
> > > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> > > > +
> > > > + while (count > 0) {
> > > > + while (pr_credit <= 1) {
> > > > + if (delay++ > PR_WAIT_TIMEOUT) {
> > > > + dev_err(dev, "maximum try\n");
> > >
> > > It looks like this is waiting for an entry in a queue and timing out
> > > here. Could you add a some comments for pr_credit above and this
> > > loop? Also a better error message, perhaps "PR credit timeout"?
> >
> > Driver needs to read the PR credit to know if it could push PR data
> > to hardware or not. I will add more description here on this PR credit,
> > and use "PR credit timeout" as error message.
> >
> > >
> > > > + return -ETIMEDOUT;
> > > > + }
> > > > + udelay(1);
> > > > +
> > > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> > > > + }
> > > > +
> > > > + if (count >= 4) {
> > > > + pr_data = 0;
> > > > + pr_data |= FIELD_PREP(FME_PR_DATA_PR_DATA_RAW,
> > > > + *(((u32 *)buf) + i));
> > > > + writeq(pr_data, fme_pr + FME_PR_DATA);
> > > > + count -= 4;
> > > > + pr_credit--;
> > > > + i++;
> > > > + } else {
> > > > + WARN_ON(1);
> > > > + return -EINVAL;
> > > > + }
> > > > + }
> > > > +
> > > > + return 0;
> > > > +}
> > > > +
> > > > +static int fme_mgr_write_complete(struct fpga_manager *mgr,
> > > > + struct fpga_image_info *info)
> > > > +{
> > > > + struct device *dev = &mgr->dev;
> > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > + void __iomem *fme_pr = priv->ioaddr;
> > > > + u64 pr_ctrl;
> > > > +
> > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > + pr_ctrl |= FME_PR_CTRL_PR_COMPLETE;
> > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > +
> > > > + dev_dbg(dev, "green bitstream push complete\n");
> > > > + dev_dbg(dev, "waiting for HW to release PR resource\n");
> > > > +
> > > > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> > > > + !(pr_ctrl & FME_PR_CTRL_PR_START), 1,
> > > > + PR_WAIT_TIMEOUT)) {
> > > > + dev_err(dev, "maximum try.\n");
> > >
> > > Some message specific to here also, please.
> > >
> > > > + return -ETIMEDOUT;
> > > > + }
> > > > +
> > > > + dev_dbg(dev, "PR operation complete, checking status\n");
> > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > > + if (priv->pr_error) {
> > > > + dev_dbg(dev, "PR error detected %llx\n",
> > > > + (unsigned long long)priv->pr_error);
> > > > + return -EIO;
> > > > + }
> > > > +
> > > > + dev_dbg(dev, "PR done successfully\n");
> > > > +
> > > > + return 0;
> > > > +}
> > > > +
> > > > +static enum fpga_mgr_states fme_mgr_state(struct fpga_manager *mgr)
> > > > +{
> > > > + return FPGA_MGR_STATE_UNKNOWN;
> > > > +}
> > > > +
> > > > +static u64 fme_mgr_status(struct fpga_manager *mgr)
> > > > +{
> > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > +
> > > > + return pr_error_to_mgr_status(priv->pr_error);
> > > > +}
> > > > +
> > > > +static const struct fpga_manager_ops fme_mgr_ops = {
> > > > + .write_init = fme_mgr_write_init,
> > > > + .write = fme_mgr_write,
> > > > + .write_complete = fme_mgr_write_complete,
> > > > + .state = fme_mgr_state,
> > > > + .status = fme_mgr_status,
> > > > +};
> > > > +
> > > > +static int fme_mgr_probe(struct platform_device *pdev)
> > > > +{
> > > > + struct device *dev = &pdev->dev;
> > > > + struct fme_mgr_priv *priv;
> > > > + struct fpga_manager *mgr;
> > > > + struct resource *res;
> > > > + int ret;
> > > > +
> > > > + priv = devm_kzalloc(dev, sizeof(*priv), GFP_KERNEL);
> > > > + if (!priv)
> > > > + return -ENOMEM;
> > > > +
> > > > + res = platform_get_resource(pdev, IORESOURCE_MEM, 0);
> > > > + priv->ioaddr = devm_ioremap(dev, res->start, resource_size(res));
> > > > + if (IS_ERR(priv->ioaddr))
> > > > + return PTR_ERR(priv->ioaddr);
> > > > +
> > > > + ret = sysfs_create_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > > + if (ret)
> > > > + return ret;
> > > > +
> > > > + mgr = devm_kzalloc(dev, sizeof(*mgr), GFP_KERNEL);
> > > > + if (!mgr)
> > > > + goto sysfs_remove_exit;
> > > > +
> > > > + mgr->name = "DFL FPGA Manager";
> > > > + mgr->mops = &fme_mgr_ops;
> > > > + mgr->priv = priv;
> > > > + mgr->parent = dev;
> > > > + platform_set_drvdata(pdev, mgr);
> > > > +
> > > > + ret = fpga_mgr_register(mgr);
> > > > + if (ret) {
> > > > + dev_err(dev, "unable to register FPGA manager\n");
> > > > + goto sysfs_remove_exit;
> > > > + }
> > > > +
> > > > + return 0;
> > > > +
> > > > +sysfs_remove_exit:
> > > > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > > + return ret;
> > > > +}
> > > > +
> > > > +static int fme_mgr_remove(struct platform_device *pdev)
> > > > +{
> > > > + struct fpga_manager *mgr = platform_get_drvdata(pdev);
> > > > +
> > > > + fpga_mgr_unregister(mgr);
> > > > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > > +
> > > > + return 0;
> > > > +}
> > > > +
> > > > +static struct platform_driver fme_mgr_driver = {
> > > > + .driver = {
> > > > + .name = FPGA_DFL_FME_MGR,
> > > > + },
> > > > + .probe = fme_mgr_probe,
> > > > + .remove = fme_mgr_remove,
> > > > +};
> > > > +
> > > > +module_platform_driver(fme_mgr_driver);
> > > > +
> > > > +MODULE_DESCRIPTION("FPGA Manager for FPGA Management Engine");
> > > > +MODULE_AUTHOR("Intel Corporation");
> > > > +MODULE_LICENSE("GPL v2");
> > > > +MODULE_ALIAS("platform:fpga-dfl-fme-mgr");
> > > > diff --git a/drivers/fpga/fpga-dfl.h b/drivers/fpga/fpga-dfl.h
> > > > index e5a1094..d45eb82 100644
> > > > --- a/drivers/fpga/fpga-dfl.h
> > > > +++ b/drivers/fpga/fpga-dfl.h
> > > > @@ -130,7 +130,44 @@
> > > >
> > > > /* FME Partial Reconfiguration Sub Feature Register Set */
> > > > #define FME_PR_DFH DFH
> > > > -#define FME_PR_SIZE DFH_SIZE
> > > > +#define FME_PR_CTRL 0x8
> > > > +#define FME_PR_STS 0x10
> > > > +#define FME_PR_DATA 0x18
> > > > +#define FME_PR_ERR 0x20
> > > > +#define FME_PR_INTFC_ID_H 0xA8
> > > > +#define FME_PR_INTFC_ID_L 0xB0
> > > > +#define FME_PR_SIZE 0xB8
> > > > +
> > > > +/* FME PR Control Register Bitfield */
> > > > +#define FME_PR_CTRL_PR_RST BIT(0) /* Reset PR engine */
> > > > +#define FME_PR_CTRL_PR_RSTACK BIT(4) /* Ack for PR engine reset */
> > > > +#define FME_PR_CTRL_PR_RGN_ID GENMASK_ULL(9, 7) /* PR Region ID */
> > > > +#define FME_PR_CTRL_PR_START BIT(12) /* Start to request for PR service */
> > > > +#define FME_PR_CTRL_PR_COMPLETE BIT(13) /* PR data push complete notification */
> > > > +
> > > > +/* FME PR Status Register Bitfield */
> > > > +/* Number of available entries in HW queue inside the PR engine. */
> > > > +#define FME_PR_STS_PR_CREDIT GENMASK_ULL(8, 0)
> > > > +#define FME_PR_STS_PR_STS BIT(16) /* PR operation status */
> > > > +#define FME_PR_STS_PR_STS_IDLE 0
> > > > +#define FME_PR_STS_PR_CTRLR_STS GENMASK_ULL(22, 20) /* Controller status */
> > > > +#define FME_PR_STS_PR_HOST_STS GENMASK_ULL(27, 24) /* PR host status */
> > > > +
> > > > +/* FME PR Data Register Bitfield */
> > > > +/* PR data from the raw-binary file. */
> > > > +#define FME_PR_DATA_PR_DATA_RAW GENMASK_ULL(32, 0)
> > > > +
> > > > +/* FME PR Error Register */
> > > > +/* Previous PR Operation errors detected. */
> > > > +#define FME_PR_ERR_OPERATION_ERR BIT(0)
> > > > +/* CRC error detected. */
> > > > +#define FME_PR_ERR_CRC_ERR BIT(1)
> > > > +/* Incompatible PR bitstream detected. */
> > > > +#define FME_PR_ERR_INCOMPATIBLE_BS BIT(2)
> > > > +/* PR data push protocol violated. */
> > > > +#define FME_PR_ERR_PROTOCOL_ERR BIT(3)
> > > > +/* PR data fifo overflow error detected */
> > > > +#define FME_PR_ERR_FIFO_OVERFLOW BIT(4)
> > > >
> > > > /* FME HSSI Sub Feature Register Set */
> > > > #define FME_HSSI_DFH DFH
> > >
> > > I see fpga-dfl.h as enumeration code which is separate from any driver
> > > implementation specifics other than what's required for the DFL
> > > enumeration scheme. These PR engine #defines should move to a .h
> > > that is dedicated to this specific PR hardware device. If someone else
> > > adds a different PR device to the framework, their PR driver would
> > > also have its own .h. Same for any other modules that aren't central
> > > to DFL enumeration.
> >
> > Sure, will move PR related register to a separated header file.
> > DFL enumeration related registers, will still be kept in fpga-dfl.h.
> >
> > Thanks
> > Hao
> >
> > >
> > > Thanks,
> > > Alan
> > >
> > > > --
> > > > 1.8.3.1
> > > >

Thanks,
Moritz


Attachments:
(No filename) (22.44 kB)
signature.asc (499.00 B)
Download all attachments

2018-02-04 09:50:48

by Wu Hao

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

On Fri, Feb 02, 2018 at 04:26:26PM -0800, Luebbers, Enno wrote:
> Hi Hao, Alan,
>
> On Fri, Feb 02, 2018 at 05:42:13PM +0800, Wu Hao wrote:
> > On Thu, Feb 01, 2018 at 04:00:36PM -0600, Alan Tull wrote:
> > > On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:
> > >
> > > Hi Hao,
> > >
> > > A few comments below. Besides that, looks good.
> > >
> > > > This patch adds fpga manager driver for FPGA Management Engine (FME). It
> > > > implements fpga_manager_ops for FPGA Partial Reconfiguration function.
> > > >
> > > > Signed-off-by: Tim Whisonant <[email protected]>
> > > > Signed-off-by: Enno Luebbers <[email protected]>
> > > > Signed-off-by: Shiva Rao <[email protected]>
> > > > Signed-off-by: Christopher Rauer <[email protected]>
> > > > Signed-off-by: Kang Luwei <[email protected]>
> > > > Signed-off-by: Xiao Guangrong <[email protected]>
> > > > Signed-off-by: Wu Hao <[email protected]>
> > > > ----
> > > > v3: rename driver to dfl-fpga-fme-mgr
> > > > implemented status callback for fpga manager
> > > > rebased due to fpga api changes
> > > > ---
> > > > .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
> > > > drivers/fpga/Kconfig | 6 +
> > > > drivers/fpga/Makefile | 1 +
> > > > drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
> > > > drivers/fpga/fpga-dfl.h | 39 ++-
> > > > 5 files changed, 371 insertions(+), 1 deletion(-)
> > > > create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > > create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
> > > >
> > > > diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > > new file mode 100644
> > > > index 0000000..2d4f917
> > > > --- /dev/null
> > > > +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > > @@ -0,0 +1,8 @@
> > > > +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
> > > > +Date: November 2017
> > > > +KernelVersion: 4.15
> > > > +Contact: Wu Hao <[email protected]>
> > > > +Description: Read-only. It returns interface id of partial reconfiguration
> > > > + hardware. Userspace could use this information to check if
> > > > + current hardware is compatible with given image before FPGA
> > > > + programming.
> > >
> > > I'm a little confused by this. I can understand that the PR bitstream
> > > has a dependency on the FPGA's static image, but I don't understand
> > > the dependency of the bistream on the hardware that is used to program
> > > the bitstream to the FPGA.
> >
> > Sorry for the confusion, the interface_id is used to indicate the version of
> > the hardware for partial reconfiguration (it's part of the static image of
> > the FPGA device). Will improve the description on this.
> >
>
> The interface_id expresses the compatibility of the static region with PR
> bitstreams generated for it. It changes every time a new static region is
> generated.
>
> Would it make more sense to have the interface_id exposed as part of the FME
> device (which represents the static region)? I'm not sure - it kind of also
> makes sense here, where you would have all the information in one place (if the
> interface_id matches, I can use this component to program a bitstream).

Hi Enno

Yes, this interface is under fpga-dfl-fme-mgr.0, and fpga-dfl-fme-mgr.0 is
under fpga-dfl-fme.0. It's part of the FME device for sure. From another
point of view, it means if anyone wants to do PR on this Intel FPGA device,
he needs to find the FME device firstly, and then check if any fpga manager
created under this FME device, if yes, check the interface_id before PR via
the FME device node ioctl.

>
> Sorry for my limited understanding of the infrastructure - would this same
> "fpga-dfl-fme-mgr.0" be used for PR if we had multiple PR regions? In that case
> it would need to expose multiple interface_ids (or we'd have to track both
> interface IDs and an identifier for the target PR region).

Yes, the fpga manager could be shared with different PR regions.

Sorry, I'm not sure where we need to expose multiple interface_ids and why.

Thanks
Hao

>
> > >
> > > > diff --git a/drivers/fpga/Kconfig b/drivers/fpga/Kconfig
> > > > index 57da904..0171ecb 100644
> > > > --- a/drivers/fpga/Kconfig
> > > > +++ b/drivers/fpga/Kconfig
> > > > @@ -150,6 +150,12 @@ config FPGA_DFL_FME
> > > > FPGA platform level management features. There shall be 1 FME
> > > > per DFL based FPGA device.
> > > >
> > > > +config FPGA_DFL_FME_MGR
> > > > + tristate "FPGA DFL FME Manager Driver"
> > > > + depends on FPGA_DFL_FME
> > > > + help
> > > > + Say Y to enable FPGA Manager driver for FPGA Management Engine.
> > > > +
> > > > config INTEL_FPGA_DFL_PCI
> > > > tristate "Intel FPGA DFL PCIe Device Driver"
> > > > depends on PCI && FPGA_DFL
> > > > diff --git a/drivers/fpga/Makefile b/drivers/fpga/Makefile
> > > > index cc75bb3..6378580 100644
> > > > --- a/drivers/fpga/Makefile
> > > > +++ b/drivers/fpga/Makefile
> > > > @@ -31,6 +31,7 @@ obj-$(CONFIG_OF_FPGA_REGION) += of-fpga-region.o
> > > > # FPGA Device Feature List Support
> > > > obj-$(CONFIG_FPGA_DFL) += fpga-dfl.o
> > > > obj-$(CONFIG_FPGA_DFL_FME) += fpga-dfl-fme.o
> > > > +obj-$(CONFIG_FPGA_DFL_FME_MGR) += fpga-dfl-fme-mgr.o
> > > >
> > > > fpga-dfl-fme-objs := dfl-fme-main.o dfl-fme-pr.o
> > > >
> > > > diff --git a/drivers/fpga/fpga-dfl-fme-mgr.c b/drivers/fpga/fpga-dfl-fme-mgr.c
> > > > new file mode 100644
> > > > index 0000000..70356ce
> > > > --- /dev/null
> > > > +++ b/drivers/fpga/fpga-dfl-fme-mgr.c
> > > > @@ -0,0 +1,318 @@
> > > > +/*
> > > > + * FPGA Manager Driver for FPGA Management Engine (FME)
> > > > + *
> > > > + * Copyright (C) 2017 Intel Corporation, Inc.
> > > > + *
> > > > + * Authors:
> > > > + * Kang Luwei <[email protected]>
> > > > + * Xiao Guangrong <[email protected]>
> > > > + * Wu Hao <[email protected]>
> > > > + * Joseph Grecco <[email protected]>
> > > > + * Enno Luebbers <[email protected]>
> > > > + * Tim Whisonant <[email protected]>
> > > > + * Ananda Ravuri <[email protected]>
> > > > + * Christopher Rauer <[email protected]>
> > > > + * Henry Mitchel <[email protected]>
> > > > + *
> > > > + * This work is licensed under the terms of the GNU GPL version 2.
> > > > + * SPDX-License-Identifier: GPL-2.0
> > > > + */
> > > > +
> > > > +#include <linux/module.h>
> > > > +#include <linux/iopoll.h>
> > > > +#include <linux/fpga/fpga-mgr.h>
> > > > +
> > > > +#include "fpga-dfl.h"
> > > > +#include "dfl-fme.h"
> > > > +
> > > > +#define PR_WAIT_TIMEOUT 8000000
> > > > +#define PR_HOST_STATUS_IDLE 0
> > > > +
> > > > +struct fme_mgr_priv {
> > > > + void __iomem *ioaddr;
> > > > + u64 pr_error;
> > > > +};
> > > > +
> > > > +static ssize_t interface_id_show(struct device *dev,
> > > > + struct device_attribute *attr, char *buf)
> > > > +{
> > > > + struct fpga_manager *mgr = dev_get_drvdata(dev);
> > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > + u64 intfc_id_l, intfc_id_h;
> > > > +
> > > > + intfc_id_l = readq(priv->ioaddr + FME_PR_INTFC_ID_L);
> > > > + intfc_id_h = readq(priv->ioaddr + FME_PR_INTFC_ID_H);
> > > > +
> > > > + return scnprintf(buf, PAGE_SIZE, "%016llx%016llx\n",
> > > > + (unsigned long long)intfc_id_h,
> > > > + (unsigned long long)intfc_id_l);
> > > > +}
> > > > +static DEVICE_ATTR_RO(interface_id);
> > > > +
> > > > +static const struct attribute *fme_mgr_attrs[] = {
> > > > + &dev_attr_interface_id.attr,
> > > > + NULL,
> > > > +};
> > > > +
> > > > +static u64 pr_error_to_mgr_status(u64 err)
> > > > +{
> > > > + u64 status = 0;
> > > > +
> > > > + if (err & FME_PR_ERR_OPERATION_ERR)
> > > > + status |= FPGA_MGR_STATUS_OPERATION_ERR;
> > > > + if (err & FME_PR_ERR_CRC_ERR)
> > > > + status |= FPGA_MGR_STATUS_CRC_ERR;
> > > > + if (err & FME_PR_ERR_INCOMPATIBLE_BS)
> > > > + status |= FPGA_MGR_STATUS_INCOMPATIBLE_IMAGE_ERR;
> > > > + if (err & FME_PR_ERR_PROTOCOL_ERR)
> > > > + status |= FPGA_MGR_STATUS_IP_PROTOCOL_ERR;
> > > > + if (err & FME_PR_ERR_FIFO_OVERFLOW)
> > > > + status |= FPGA_MGR_STATUS_FIFO_OVERFLOW_ERR;
> > > > +
> > > > + return status;
> > > > +}
> > > > +
> > > > +static u64 fme_mgr_pr_error_handle(void __iomem *fme_pr)
> > > > +{
> > > > + u64 pr_status, pr_error;
> > > > +
> > > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > > + if (!(pr_status & FME_PR_STS_PR_STS))
> > > > + return 0;
> > > > +
> > > > + pr_error = readq(fme_pr + FME_PR_ERR);
> > > > + writeq(pr_error, fme_pr + FME_PR_ERR);
> > > > +
> > > > + return pr_error;
> > > > +}
> > > > +
> > > > +static int fme_mgr_write_init(struct fpga_manager *mgr,
> > > > + struct fpga_image_info *info,
> > > > + const char *buf, size_t count)
> > > > +{
> > > > + struct device *dev = &mgr->dev;
> > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > + void __iomem *fme_pr = priv->ioaddr;
> > > > + u64 pr_ctrl, pr_status;
> > > > +
> > > > + if (!(info->flags & FPGA_MGR_PARTIAL_RECONFIG)) {
> > > > + dev_err(dev, "only support partial reconfiguration.\n");
> > >
> > > supports
> > >
> > > > + return -EINVAL;
> > > > + }
> > > > +
> > > > + dev_dbg(dev, "resetting PR before initiated PR\n");
> > > > +
> > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > + pr_ctrl |= FME_PR_CTRL_PR_RST;
> > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > +
> > > > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> > > > + pr_ctrl & FME_PR_CTRL_PR_RSTACK, 1,
> > > > + PR_WAIT_TIMEOUT)) {
> > > > + dev_err(dev, "maximum PR timeout\n");
> > >
> > > We have two dev_err's with the same "maximum PR timeout" so the user
> > > would not be able to see at which point the timeout occurred.
> > >
> > > I suggest to get rid of the word 'maximum' for both. This one could
> > > be 'reset ack timeout" or something similar.
> >
> > Sure, will switch to a more specific message per your suggestion. Thanks.
> >
> > >
> > > > + return -ETIMEDOUT;
> > > > + }
> > > > +
> > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > + pr_ctrl &= ~FME_PR_CTRL_PR_RST;
> > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > +
> > > > + dev_dbg(dev,
> > > > + "waiting for PR resource in HW to be initialized and ready\n");
> > > > +
> > > > + if (readq_poll_timeout(fme_pr + FME_PR_STS, pr_status,
> > > > + (pr_status & FME_PR_STS_PR_STS) ==
> > > > + FME_PR_STS_PR_STS_IDLE, 1, PR_WAIT_TIMEOUT)) {
> > > > + dev_err(dev, "maximum PR timeout\n");
> > >
> > > "PR_STS timeout"? Or something better.
> > >
> > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > > + return -ETIMEDOUT;
> > > > + }
> > > > +
> > > > + dev_dbg(dev, "check and clear previous PR error\n");
> > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > > + if (priv->pr_error)
> > > > + dev_dbg(dev, "previous PR error detected %llx\n",
> > > > + (unsigned long long)priv->pr_error);
> > > > +
> > > > + dev_dbg(dev, "set PR port ID\n");
> > > > +
> > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > + pr_ctrl &= ~FME_PR_CTRL_PR_RGN_ID;
> > > > + pr_ctrl |= FIELD_PREP(FME_PR_CTRL_PR_RGN_ID, info->region_id);
> > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > +
> > > > + return 0;
> > > > +}
> > > > +
> > > > +static int fme_mgr_write(struct fpga_manager *mgr,
> > > > + const char *buf, size_t count)
> > > > +{
> > > > + struct device *dev = &mgr->dev;
> > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > + void __iomem *fme_pr = priv->ioaddr;
> > > > + u64 pr_ctrl, pr_status, pr_data;
> > > > + int delay = 0, pr_credit, i = 0;
> > > > +
> > > > + dev_dbg(dev, "start request\n");
> > > > +
> > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > + pr_ctrl |= FME_PR_CTRL_PR_START;
> > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > +
> > > > + dev_dbg(dev, "pushing data from bitstream to HW\n");
> > > > +
> > > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> > > > +
> > > > + while (count > 0) {
> > > > + while (pr_credit <= 1) {
> > > > + if (delay++ > PR_WAIT_TIMEOUT) {
> > > > + dev_err(dev, "maximum try\n");
> > >
> > > It looks like this is waiting for an entry in a queue and timing out
> > > here. Could you add a some comments for pr_credit above and this
> > > loop? Also a better error message, perhaps "PR credit timeout"?
> >
> > Driver needs to read the PR credit to know if it could push PR data
> > to hardware or not. I will add more description here on this PR credit,
> > and use "PR credit timeout" as error message.
> >
> > >
> > > > + return -ETIMEDOUT;
> > > > + }
> > > > + udelay(1);
> > > > +
> > > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> > > > + }
> > > > +
> > > > + if (count >= 4) {
> > > > + pr_data = 0;
> > > > + pr_data |= FIELD_PREP(FME_PR_DATA_PR_DATA_RAW,
> > > > + *(((u32 *)buf) + i));
> > > > + writeq(pr_data, fme_pr + FME_PR_DATA);
> > > > + count -= 4;
> > > > + pr_credit--;
> > > > + i++;
> > > > + } else {
> > > > + WARN_ON(1);
> > > > + return -EINVAL;
> > > > + }
> > > > + }
> > > > +
> > > > + return 0;
> > > > +}
> > > > +
> > > > +static int fme_mgr_write_complete(struct fpga_manager *mgr,
> > > > + struct fpga_image_info *info)
> > > > +{
> > > > + struct device *dev = &mgr->dev;
> > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > + void __iomem *fme_pr = priv->ioaddr;
> > > > + u64 pr_ctrl;
> > > > +
> > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > + pr_ctrl |= FME_PR_CTRL_PR_COMPLETE;
> > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > +
> > > > + dev_dbg(dev, "green bitstream push complete\n");
> > > > + dev_dbg(dev, "waiting for HW to release PR resource\n");
> > > > +
> > > > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> > > > + !(pr_ctrl & FME_PR_CTRL_PR_START), 1,
> > > > + PR_WAIT_TIMEOUT)) {
> > > > + dev_err(dev, "maximum try.\n");
> > >
> > > Some message specific to here also, please.
> > >
> > > > + return -ETIMEDOUT;
> > > > + }
> > > > +
> > > > + dev_dbg(dev, "PR operation complete, checking status\n");
> > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > > + if (priv->pr_error) {
> > > > + dev_dbg(dev, "PR error detected %llx\n",
> > > > + (unsigned long long)priv->pr_error);
> > > > + return -EIO;
> > > > + }
> > > > +
> > > > + dev_dbg(dev, "PR done successfully\n");
> > > > +
> > > > + return 0;
> > > > +}
> > > > +
> > > > +static enum fpga_mgr_states fme_mgr_state(struct fpga_manager *mgr)
> > > > +{
> > > > + return FPGA_MGR_STATE_UNKNOWN;
> > > > +}
> > > > +
> > > > +static u64 fme_mgr_status(struct fpga_manager *mgr)
> > > > +{
> > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > +
> > > > + return pr_error_to_mgr_status(priv->pr_error);
> > > > +}
> > > > +
> > > > +static const struct fpga_manager_ops fme_mgr_ops = {
> > > > + .write_init = fme_mgr_write_init,
> > > > + .write = fme_mgr_write,
> > > > + .write_complete = fme_mgr_write_complete,
> > > > + .state = fme_mgr_state,
> > > > + .status = fme_mgr_status,
> > > > +};
> > > > +
> > > > +static int fme_mgr_probe(struct platform_device *pdev)
> > > > +{
> > > > + struct device *dev = &pdev->dev;
> > > > + struct fme_mgr_priv *priv;
> > > > + struct fpga_manager *mgr;
> > > > + struct resource *res;
> > > > + int ret;
> > > > +
> > > > + priv = devm_kzalloc(dev, sizeof(*priv), GFP_KERNEL);
> > > > + if (!priv)
> > > > + return -ENOMEM;
> > > > +
> > > > + res = platform_get_resource(pdev, IORESOURCE_MEM, 0);
> > > > + priv->ioaddr = devm_ioremap(dev, res->start, resource_size(res));
> > > > + if (IS_ERR(priv->ioaddr))
> > > > + return PTR_ERR(priv->ioaddr);
> > > > +
> > > > + ret = sysfs_create_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > > + if (ret)
> > > > + return ret;
> > > > +
> > > > + mgr = devm_kzalloc(dev, sizeof(*mgr), GFP_KERNEL);
> > > > + if (!mgr)
> > > > + goto sysfs_remove_exit;
> > > > +
> > > > + mgr->name = "DFL FPGA Manager";
> > > > + mgr->mops = &fme_mgr_ops;
> > > > + mgr->priv = priv;
> > > > + mgr->parent = dev;
> > > > + platform_set_drvdata(pdev, mgr);
> > > > +
> > > > + ret = fpga_mgr_register(mgr);
> > > > + if (ret) {
> > > > + dev_err(dev, "unable to register FPGA manager\n");
> > > > + goto sysfs_remove_exit;
> > > > + }
> > > > +
> > > > + return 0;
> > > > +
> > > > +sysfs_remove_exit:
> > > > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > > + return ret;
> > > > +}
> > > > +
> > > > +static int fme_mgr_remove(struct platform_device *pdev)
> > > > +{
> > > > + struct fpga_manager *mgr = platform_get_drvdata(pdev);
> > > > +
> > > > + fpga_mgr_unregister(mgr);
> > > > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > > +
> > > > + return 0;
> > > > +}
> > > > +
> > > > +static struct platform_driver fme_mgr_driver = {
> > > > + .driver = {
> > > > + .name = FPGA_DFL_FME_MGR,
> > > > + },
> > > > + .probe = fme_mgr_probe,
> > > > + .remove = fme_mgr_remove,
> > > > +};
> > > > +
> > > > +module_platform_driver(fme_mgr_driver);
> > > > +
> > > > +MODULE_DESCRIPTION("FPGA Manager for FPGA Management Engine");
> > > > +MODULE_AUTHOR("Intel Corporation");
> > > > +MODULE_LICENSE("GPL v2");
> > > > +MODULE_ALIAS("platform:fpga-dfl-fme-mgr");
> > > > diff --git a/drivers/fpga/fpga-dfl.h b/drivers/fpga/fpga-dfl.h
> > > > index e5a1094..d45eb82 100644
> > > > --- a/drivers/fpga/fpga-dfl.h
> > > > +++ b/drivers/fpga/fpga-dfl.h
> > > > @@ -130,7 +130,44 @@
> > > >
> > > > /* FME Partial Reconfiguration Sub Feature Register Set */
> > > > #define FME_PR_DFH DFH
> > > > -#define FME_PR_SIZE DFH_SIZE
> > > > +#define FME_PR_CTRL 0x8
> > > > +#define FME_PR_STS 0x10
> > > > +#define FME_PR_DATA 0x18
> > > > +#define FME_PR_ERR 0x20
> > > > +#define FME_PR_INTFC_ID_H 0xA8
> > > > +#define FME_PR_INTFC_ID_L 0xB0
> > > > +#define FME_PR_SIZE 0xB8
> > > > +
> > > > +/* FME PR Control Register Bitfield */
> > > > +#define FME_PR_CTRL_PR_RST BIT(0) /* Reset PR engine */
> > > > +#define FME_PR_CTRL_PR_RSTACK BIT(4) /* Ack for PR engine reset */
> > > > +#define FME_PR_CTRL_PR_RGN_ID GENMASK_ULL(9, 7) /* PR Region ID */
> > > > +#define FME_PR_CTRL_PR_START BIT(12) /* Start to request for PR service */
> > > > +#define FME_PR_CTRL_PR_COMPLETE BIT(13) /* PR data push complete notification */
> > > > +
> > > > +/* FME PR Status Register Bitfield */
> > > > +/* Number of available entries in HW queue inside the PR engine. */
> > > > +#define FME_PR_STS_PR_CREDIT GENMASK_ULL(8, 0)
> > > > +#define FME_PR_STS_PR_STS BIT(16) /* PR operation status */
> > > > +#define FME_PR_STS_PR_STS_IDLE 0
> > > > +#define FME_PR_STS_PR_CTRLR_STS GENMASK_ULL(22, 20) /* Controller status */
> > > > +#define FME_PR_STS_PR_HOST_STS GENMASK_ULL(27, 24) /* PR host status */
> > > > +
> > > > +/* FME PR Data Register Bitfield */
> > > > +/* PR data from the raw-binary file. */
> > > > +#define FME_PR_DATA_PR_DATA_RAW GENMASK_ULL(32, 0)
> > > > +
> > > > +/* FME PR Error Register */
> > > > +/* Previous PR Operation errors detected. */
> > > > +#define FME_PR_ERR_OPERATION_ERR BIT(0)
> > > > +/* CRC error detected. */
> > > > +#define FME_PR_ERR_CRC_ERR BIT(1)
> > > > +/* Incompatible PR bitstream detected. */
> > > > +#define FME_PR_ERR_INCOMPATIBLE_BS BIT(2)
> > > > +/* PR data push protocol violated. */
> > > > +#define FME_PR_ERR_PROTOCOL_ERR BIT(3)
> > > > +/* PR data fifo overflow error detected */
> > > > +#define FME_PR_ERR_FIFO_OVERFLOW BIT(4)
> > > >
> > > > /* FME HSSI Sub Feature Register Set */
> > > > #define FME_HSSI_DFH DFH
> > >
> > > I see fpga-dfl.h as enumeration code which is separate from any driver
> > > implementation specifics other than what's required for the DFL
> > > enumeration scheme. These PR engine #defines should move to a .h
> > > that is dedicated to this specific PR hardware device. If someone else
> > > adds a different PR device to the framework, their PR driver would
> > > also have its own .h. Same for any other modules that aren't central
> > > to DFL enumeration.
> >
> > Sure, will move PR related register to a separated header file.
> > DFL enumeration related registers, will still be kept in fpga-dfl.h.
> >
> > Thanks
> > Hao
> >
> > >
> > > Thanks,
> > > Alan
> > >
> > > > --
> > > > 1.8.3.1
> > > >

2018-02-04 10:16:20

by Wu Hao

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

On Sat, Feb 03, 2018 at 11:41:24AM +0100, Moritz Fischer wrote:
> Hi Hao,
>
> On Fri, Feb 02, 2018 at 04:26:26PM -0800, Luebbers, Enno wrote:
> > Hi Hao, Alan,
> >
> > On Fri, Feb 02, 2018 at 05:42:13PM +0800, Wu Hao wrote:
> > > On Thu, Feb 01, 2018 at 04:00:36PM -0600, Alan Tull wrote:
> > > > On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:
> > > >
> > > > Hi Hao,
> > > >
> > > > A few comments below. Besides that, looks good.
> > > >
> > > > > This patch adds fpga manager driver for FPGA Management Engine (FME). It
> > > > > implements fpga_manager_ops for FPGA Partial Reconfiguration function.
> > > > >
> > > > > Signed-off-by: Tim Whisonant <[email protected]>
> > > > > Signed-off-by: Enno Luebbers <[email protected]>
> > > > > Signed-off-by: Shiva Rao <[email protected]>
> > > > > Signed-off-by: Christopher Rauer <[email protected]>
> > > > > Signed-off-by: Kang Luwei <[email protected]>
> > > > > Signed-off-by: Xiao Guangrong <[email protected]>
> > > > > Signed-off-by: Wu Hao <[email protected]>
> > > > > ----
> > > > > v3: rename driver to dfl-fpga-fme-mgr
> > > > > implemented status callback for fpga manager
> > > > > rebased due to fpga api changes
> > > > > ---
> > > > > .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
> > > > > drivers/fpga/Kconfig | 6 +
> > > > > drivers/fpga/Makefile | 1 +
> > > > > drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
> > > > > drivers/fpga/fpga-dfl.h | 39 ++-
> > > > > 5 files changed, 371 insertions(+), 1 deletion(-)
> > > > > create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > > > create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
> > > > >
> > > > > diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > > > new file mode 100644
> > > > > index 0000000..2d4f917
> > > > > --- /dev/null
> > > > > +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > > > @@ -0,0 +1,8 @@
> > > > > +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
> > > > > +Date: November 2017
> > > > > +KernelVersion: 4.15
> > > > > +Contact: Wu Hao <[email protected]>
> > > > > +Description: Read-only. It returns interface id of partial reconfiguration
> > > > > + hardware. Userspace could use this information to check if
> > > > > + current hardware is compatible with given image before FPGA
> > > > > + programming.
> > > >
> > > > I'm a little confused by this. I can understand that the PR bitstream
> > > > has a dependency on the FPGA's static image, but I don't understand
> > > > the dependency of the bistream on the hardware that is used to program
> > > > the bitstream to the FPGA.
> > >
> > > Sorry for the confusion, the interface_id is used to indicate the version of
> > > the hardware for partial reconfiguration (it's part of the static image of
> > > the FPGA device). Will improve the description on this.
>
> I'm not sure userland should be making the call on whether what you're
> trying to load is compatible or not. Isn't there a way to check this in
> your PR reconfiguration handler in-kernel?

Hi Moritz

Actually with current driver interface, doing a partial reconfiguration with an
incompatible image, then driver will report PR failure with error code
FPGA_MGR_STATUS_INCOMPATIBLE_IMAGE_ERR as hardware checks it, but anyway user
needs to know hardware interface_id information to find or re-generated correct
images. I think it's more flexible to leave it to userspace on using this
information exposed by driver. : )

Thanks
Hao

>
> > >
> >
> > The interface_id expresses the compatibility of the static region with PR
> > bitstreams generated for it. It changes every time a new static region is
> > generated.
> >
> > Would it make more sense to have the interface_id exposed as part of the FME
> > device (which represents the static region)? I'm not sure - it kind of also
> > makes sense here, where you would have all the information in one place (if the
> > interface_id matches, I can use this component to program a bitstream).
> >
> > Sorry for my limited understanding of the infrastructure - would this same
> > "fpga-dfl-fme-mgr.0" be used for PR if we had multiple PR regions? In that case
> > it would need to expose multiple interface_ids (or we'd have to track both
> > interface IDs and an identifier for the target PR region).
> >
> > > >
> > > > > diff --git a/drivers/fpga/Kconfig b/drivers/fpga/Kconfig
> > > > > index 57da904..0171ecb 100644
> > > > > --- a/drivers/fpga/Kconfig
> > > > > +++ b/drivers/fpga/Kconfig
> > > > > @@ -150,6 +150,12 @@ config FPGA_DFL_FME
> > > > > FPGA platform level management features. There shall be 1 FME
> > > > > per DFL based FPGA device.
> > > > >
> > > > > +config FPGA_DFL_FME_MGR
> > > > > + tristate "FPGA DFL FME Manager Driver"
> > > > > + depends on FPGA_DFL_FME
> > > > > + help
> > > > > + Say Y to enable FPGA Manager driver for FPGA Management Engine.
> > > > > +
> > > > > config INTEL_FPGA_DFL_PCI
> > > > > tristate "Intel FPGA DFL PCIe Device Driver"
> > > > > depends on PCI && FPGA_DFL
> > > > > diff --git a/drivers/fpga/Makefile b/drivers/fpga/Makefile
> > > > > index cc75bb3..6378580 100644
> > > > > --- a/drivers/fpga/Makefile
> > > > > +++ b/drivers/fpga/Makefile
> > > > > @@ -31,6 +31,7 @@ obj-$(CONFIG_OF_FPGA_REGION) += of-fpga-region.o
> > > > > # FPGA Device Feature List Support
> > > > > obj-$(CONFIG_FPGA_DFL) += fpga-dfl.o
> > > > > obj-$(CONFIG_FPGA_DFL_FME) += fpga-dfl-fme.o
> > > > > +obj-$(CONFIG_FPGA_DFL_FME_MGR) += fpga-dfl-fme-mgr.o
> > > > >
> > > > > fpga-dfl-fme-objs := dfl-fme-main.o dfl-fme-pr.o
> > > > >
> > > > > diff --git a/drivers/fpga/fpga-dfl-fme-mgr.c b/drivers/fpga/fpga-dfl-fme-mgr.c
> > > > > new file mode 100644
> > > > > index 0000000..70356ce
> > > > > --- /dev/null
> > > > > +++ b/drivers/fpga/fpga-dfl-fme-mgr.c
> > > > > @@ -0,0 +1,318 @@
> > > > > +/*
> > > > > + * FPGA Manager Driver for FPGA Management Engine (FME)
> > > > > + *
> > > > > + * Copyright (C) 2017 Intel Corporation, Inc.
> > > > > + *
> > > > > + * Authors:
> > > > > + * Kang Luwei <[email protected]>
> > > > > + * Xiao Guangrong <[email protected]>
> > > > > + * Wu Hao <[email protected]>
> > > > > + * Joseph Grecco <[email protected]>
> > > > > + * Enno Luebbers <[email protected]>
> > > > > + * Tim Whisonant <[email protected]>
> > > > > + * Ananda Ravuri <[email protected]>
> > > > > + * Christopher Rauer <[email protected]>
> > > > > + * Henry Mitchel <[email protected]>
> > > > > + *
> > > > > + * This work is licensed under the terms of the GNU GPL version 2.
> > > > > + * SPDX-License-Identifier: GPL-2.0
> > > > > + */
> > > > > +
> > > > > +#include <linux/module.h>
> > > > > +#include <linux/iopoll.h>
> > > > > +#include <linux/fpga/fpga-mgr.h>
> > > > > +
> > > > > +#include "fpga-dfl.h"
> > > > > +#include "dfl-fme.h"
> > > > > +
> > > > > +#define PR_WAIT_TIMEOUT 8000000
> > > > > +#define PR_HOST_STATUS_IDLE 0
> > > > > +
> > > > > +struct fme_mgr_priv {
> > > > > + void __iomem *ioaddr;
> > > > > + u64 pr_error;
> > > > > +};
> > > > > +
> > > > > +static ssize_t interface_id_show(struct device *dev,
> > > > > + struct device_attribute *attr, char *buf)
> > > > > +{
> > > > > + struct fpga_manager *mgr = dev_get_drvdata(dev);
> > > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > > + u64 intfc_id_l, intfc_id_h;
> > > > > +
> > > > > + intfc_id_l = readq(priv->ioaddr + FME_PR_INTFC_ID_L);
> > > > > + intfc_id_h = readq(priv->ioaddr + FME_PR_INTFC_ID_H);
> > > > > +
> > > > > + return scnprintf(buf, PAGE_SIZE, "%016llx%016llx\n",
> > > > > + (unsigned long long)intfc_id_h,
> > > > > + (unsigned long long)intfc_id_l);
> > > > > +}
> > > > > +static DEVICE_ATTR_RO(interface_id);
> > > > > +
> > > > > +static const struct attribute *fme_mgr_attrs[] = {
> > > > > + &dev_attr_interface_id.attr,
> > > > > + NULL,
> > > > > +};
> > > > > +
> > > > > +static u64 pr_error_to_mgr_status(u64 err)
> > > > > +{
> > > > > + u64 status = 0;
> > > > > +
> > > > > + if (err & FME_PR_ERR_OPERATION_ERR)
> > > > > + status |= FPGA_MGR_STATUS_OPERATION_ERR;
> > > > > + if (err & FME_PR_ERR_CRC_ERR)
> > > > > + status |= FPGA_MGR_STATUS_CRC_ERR;
> > > > > + if (err & FME_PR_ERR_INCOMPATIBLE_BS)
> > > > > + status |= FPGA_MGR_STATUS_INCOMPATIBLE_IMAGE_ERR;
> > > > > + if (err & FME_PR_ERR_PROTOCOL_ERR)
> > > > > + status |= FPGA_MGR_STATUS_IP_PROTOCOL_ERR;
> > > > > + if (err & FME_PR_ERR_FIFO_OVERFLOW)
> > > > > + status |= FPGA_MGR_STATUS_FIFO_OVERFLOW_ERR;
> > > > > +
> > > > > + return status;
> > > > > +}
> > > > > +
> > > > > +static u64 fme_mgr_pr_error_handle(void __iomem *fme_pr)
> > > > > +{
> > > > > + u64 pr_status, pr_error;
> > > > > +
> > > > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > > > + if (!(pr_status & FME_PR_STS_PR_STS))
> > > > > + return 0;
> > > > > +
> > > > > + pr_error = readq(fme_pr + FME_PR_ERR);
> > > > > + writeq(pr_error, fme_pr + FME_PR_ERR);
> > > > > +
> > > > > + return pr_error;
> > > > > +}
> > > > > +
> > > > > +static int fme_mgr_write_init(struct fpga_manager *mgr,
> > > > > + struct fpga_image_info *info,
> > > > > + const char *buf, size_t count)
> > > > > +{
> > > > > + struct device *dev = &mgr->dev;
> > > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > > + void __iomem *fme_pr = priv->ioaddr;
> > > > > + u64 pr_ctrl, pr_status;
> > > > > +
> > > > > + if (!(info->flags & FPGA_MGR_PARTIAL_RECONFIG)) {
> > > > > + dev_err(dev, "only support partial reconfiguration.\n");
> > > >
> > > > supports
> > > >
> > > > > + return -EINVAL;
> > > > > + }
> > > > > +
> > > > > + dev_dbg(dev, "resetting PR before initiated PR\n");
> > > > > +
> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > > + pr_ctrl |= FME_PR_CTRL_PR_RST;
> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > > +
> > > > > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> > > > > + pr_ctrl & FME_PR_CTRL_PR_RSTACK, 1,
> > > > > + PR_WAIT_TIMEOUT)) {
> > > > > + dev_err(dev, "maximum PR timeout\n");
> > > >
> > > > We have two dev_err's with the same "maximum PR timeout" so the user
> > > > would not be able to see at which point the timeout occurred.
> > > >
> > > > I suggest to get rid of the word 'maximum' for both. This one could
> > > > be 'reset ack timeout" or something similar.
> > >
> > > Sure, will switch to a more specific message per your suggestion. Thanks.
> > >
> > > >
> > > > > + return -ETIMEDOUT;
> > > > > + }
> > > > > +
> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > > + pr_ctrl &= ~FME_PR_CTRL_PR_RST;
> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > > +
> > > > > + dev_dbg(dev,
> > > > > + "waiting for PR resource in HW to be initialized and ready\n");
> > > > > +
> > > > > + if (readq_poll_timeout(fme_pr + FME_PR_STS, pr_status,
> > > > > + (pr_status & FME_PR_STS_PR_STS) ==
> > > > > + FME_PR_STS_PR_STS_IDLE, 1, PR_WAIT_TIMEOUT)) {
> > > > > + dev_err(dev, "maximum PR timeout\n");
> > > >
> > > > "PR_STS timeout"? Or something better.
> > > >
> > > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > > > + return -ETIMEDOUT;
> > > > > + }
> > > > > +
> > > > > + dev_dbg(dev, "check and clear previous PR error\n");
> > > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > > > + if (priv->pr_error)
> > > > > + dev_dbg(dev, "previous PR error detected %llx\n",
> > > > > + (unsigned long long)priv->pr_error);
> > > > > +
> > > > > + dev_dbg(dev, "set PR port ID\n");
> > > > > +
> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > > + pr_ctrl &= ~FME_PR_CTRL_PR_RGN_ID;
> > > > > + pr_ctrl |= FIELD_PREP(FME_PR_CTRL_PR_RGN_ID, info->region_id);
> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > > +
> > > > > + return 0;
> > > > > +}
> > > > > +
> > > > > +static int fme_mgr_write(struct fpga_manager *mgr,
> > > > > + const char *buf, size_t count)
> > > > > +{
> > > > > + struct device *dev = &mgr->dev;
> > > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > > + void __iomem *fme_pr = priv->ioaddr;
> > > > > + u64 pr_ctrl, pr_status, pr_data;
> > > > > + int delay = 0, pr_credit, i = 0;
> > > > > +
> > > > > + dev_dbg(dev, "start request\n");
> > > > > +
> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > > + pr_ctrl |= FME_PR_CTRL_PR_START;
> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > > +
> > > > > + dev_dbg(dev, "pushing data from bitstream to HW\n");
> > > > > +
> > > > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > > > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> > > > > +
> > > > > + while (count > 0) {
> > > > > + while (pr_credit <= 1) {
> > > > > + if (delay++ > PR_WAIT_TIMEOUT) {
> > > > > + dev_err(dev, "maximum try\n");
> > > >
> > > > It looks like this is waiting for an entry in a queue and timing out
> > > > here. Could you add a some comments for pr_credit above and this
> > > > loop? Also a better error message, perhaps "PR credit timeout"?
> > >
> > > Driver needs to read the PR credit to know if it could push PR data
> > > to hardware or not. I will add more description here on this PR credit,
> > > and use "PR credit timeout" as error message.
> > >
> > > >
> > > > > + return -ETIMEDOUT;
> > > > > + }
> > > > > + udelay(1);
> > > > > +
> > > > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > > > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> > > > > + }
> > > > > +
> > > > > + if (count >= 4) {
> > > > > + pr_data = 0;
> > > > > + pr_data |= FIELD_PREP(FME_PR_DATA_PR_DATA_RAW,
> > > > > + *(((u32 *)buf) + i));
> > > > > + writeq(pr_data, fme_pr + FME_PR_DATA);
> > > > > + count -= 4;
> > > > > + pr_credit--;
> > > > > + i++;
> > > > > + } else {
> > > > > + WARN_ON(1);
> > > > > + return -EINVAL;
> > > > > + }
> > > > > + }
> > > > > +
> > > > > + return 0;
> > > > > +}
> > > > > +
> > > > > +static int fme_mgr_write_complete(struct fpga_manager *mgr,
> > > > > + struct fpga_image_info *info)
> > > > > +{
> > > > > + struct device *dev = &mgr->dev;
> > > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > > + void __iomem *fme_pr = priv->ioaddr;
> > > > > + u64 pr_ctrl;
> > > > > +
> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > > + pr_ctrl |= FME_PR_CTRL_PR_COMPLETE;
> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > > +
> > > > > + dev_dbg(dev, "green bitstream push complete\n");
> > > > > + dev_dbg(dev, "waiting for HW to release PR resource\n");
> > > > > +
> > > > > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> > > > > + !(pr_ctrl & FME_PR_CTRL_PR_START), 1,
> > > > > + PR_WAIT_TIMEOUT)) {
> > > > > + dev_err(dev, "maximum try.\n");
> > > >
> > > > Some message specific to here also, please.
> > > >
> > > > > + return -ETIMEDOUT;
> > > > > + }
> > > > > +
> > > > > + dev_dbg(dev, "PR operation complete, checking status\n");
> > > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > > > + if (priv->pr_error) {
> > > > > + dev_dbg(dev, "PR error detected %llx\n",
> > > > > + (unsigned long long)priv->pr_error);
> > > > > + return -EIO;
> > > > > + }
> > > > > +
> > > > > + dev_dbg(dev, "PR done successfully\n");
> > > > > +
> > > > > + return 0;
> > > > > +}
> > > > > +
> > > > > +static enum fpga_mgr_states fme_mgr_state(struct fpga_manager *mgr)
> > > > > +{
> > > > > + return FPGA_MGR_STATE_UNKNOWN;
> > > > > +}
> > > > > +
> > > > > +static u64 fme_mgr_status(struct fpga_manager *mgr)
> > > > > +{
> > > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > > +
> > > > > + return pr_error_to_mgr_status(priv->pr_error);
> > > > > +}
> > > > > +
> > > > > +static const struct fpga_manager_ops fme_mgr_ops = {
> > > > > + .write_init = fme_mgr_write_init,
> > > > > + .write = fme_mgr_write,
> > > > > + .write_complete = fme_mgr_write_complete,
> > > > > + .state = fme_mgr_state,
> > > > > + .status = fme_mgr_status,
> > > > > +};
> > > > > +
> > > > > +static int fme_mgr_probe(struct platform_device *pdev)
> > > > > +{
> > > > > + struct device *dev = &pdev->dev;
> > > > > + struct fme_mgr_priv *priv;
> > > > > + struct fpga_manager *mgr;
> > > > > + struct resource *res;
> > > > > + int ret;
> > > > > +
> > > > > + priv = devm_kzalloc(dev, sizeof(*priv), GFP_KERNEL);
> > > > > + if (!priv)
> > > > > + return -ENOMEM;
> > > > > +
> > > > > + res = platform_get_resource(pdev, IORESOURCE_MEM, 0);
> > > > > + priv->ioaddr = devm_ioremap(dev, res->start, resource_size(res));
> > > > > + if (IS_ERR(priv->ioaddr))
> > > > > + return PTR_ERR(priv->ioaddr);
> > > > > +
> > > > > + ret = sysfs_create_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > > > + if (ret)
> > > > > + return ret;
> > > > > +
> > > > > + mgr = devm_kzalloc(dev, sizeof(*mgr), GFP_KERNEL);
> > > > > + if (!mgr)
> > > > > + goto sysfs_remove_exit;
> > > > > +
> > > > > + mgr->name = "DFL FPGA Manager";
> > > > > + mgr->mops = &fme_mgr_ops;
> > > > > + mgr->priv = priv;
> > > > > + mgr->parent = dev;
> > > > > + platform_set_drvdata(pdev, mgr);
> > > > > +
> > > > > + ret = fpga_mgr_register(mgr);
> > > > > + if (ret) {
> > > > > + dev_err(dev, "unable to register FPGA manager\n");
> > > > > + goto sysfs_remove_exit;
> > > > > + }
> > > > > +
> > > > > + return 0;
> > > > > +
> > > > > +sysfs_remove_exit:
> > > > > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > > > + return ret;
> > > > > +}
> > > > > +
> > > > > +static int fme_mgr_remove(struct platform_device *pdev)
> > > > > +{
> > > > > + struct fpga_manager *mgr = platform_get_drvdata(pdev);
> > > > > +
> > > > > + fpga_mgr_unregister(mgr);
> > > > > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > > > +
> > > > > + return 0;
> > > > > +}
> > > > > +
> > > > > +static struct platform_driver fme_mgr_driver = {
> > > > > + .driver = {
> > > > > + .name = FPGA_DFL_FME_MGR,
> > > > > + },
> > > > > + .probe = fme_mgr_probe,
> > > > > + .remove = fme_mgr_remove,
> > > > > +};
> > > > > +
> > > > > +module_platform_driver(fme_mgr_driver);
> > > > > +
> > > > > +MODULE_DESCRIPTION("FPGA Manager for FPGA Management Engine");
> > > > > +MODULE_AUTHOR("Intel Corporation");
> > > > > +MODULE_LICENSE("GPL v2");
> > > > > +MODULE_ALIAS("platform:fpga-dfl-fme-mgr");
> > > > > diff --git a/drivers/fpga/fpga-dfl.h b/drivers/fpga/fpga-dfl.h
> > > > > index e5a1094..d45eb82 100644
> > > > > --- a/drivers/fpga/fpga-dfl.h
> > > > > +++ b/drivers/fpga/fpga-dfl.h
> > > > > @@ -130,7 +130,44 @@
> > > > >
> > > > > /* FME Partial Reconfiguration Sub Feature Register Set */
> > > > > #define FME_PR_DFH DFH
> > > > > -#define FME_PR_SIZE DFH_SIZE
> > > > > +#define FME_PR_CTRL 0x8
> > > > > +#define FME_PR_STS 0x10
> > > > > +#define FME_PR_DATA 0x18
> > > > > +#define FME_PR_ERR 0x20
> > > > > +#define FME_PR_INTFC_ID_H 0xA8
> > > > > +#define FME_PR_INTFC_ID_L 0xB0
> > > > > +#define FME_PR_SIZE 0xB8
> > > > > +
> > > > > +/* FME PR Control Register Bitfield */
> > > > > +#define FME_PR_CTRL_PR_RST BIT(0) /* Reset PR engine */
> > > > > +#define FME_PR_CTRL_PR_RSTACK BIT(4) /* Ack for PR engine reset */
> > > > > +#define FME_PR_CTRL_PR_RGN_ID GENMASK_ULL(9, 7) /* PR Region ID */
> > > > > +#define FME_PR_CTRL_PR_START BIT(12) /* Start to request for PR service */
> > > > > +#define FME_PR_CTRL_PR_COMPLETE BIT(13) /* PR data push complete notification */
> > > > > +
> > > > > +/* FME PR Status Register Bitfield */
> > > > > +/* Number of available entries in HW queue inside the PR engine. */
> > > > > +#define FME_PR_STS_PR_CREDIT GENMASK_ULL(8, 0)
> > > > > +#define FME_PR_STS_PR_STS BIT(16) /* PR operation status */
> > > > > +#define FME_PR_STS_PR_STS_IDLE 0
> > > > > +#define FME_PR_STS_PR_CTRLR_STS GENMASK_ULL(22, 20) /* Controller status */
> > > > > +#define FME_PR_STS_PR_HOST_STS GENMASK_ULL(27, 24) /* PR host status */
> > > > > +
> > > > > +/* FME PR Data Register Bitfield */
> > > > > +/* PR data from the raw-binary file. */
> > > > > +#define FME_PR_DATA_PR_DATA_RAW GENMASK_ULL(32, 0)
> > > > > +
> > > > > +/* FME PR Error Register */
> > > > > +/* Previous PR Operation errors detected. */
> > > > > +#define FME_PR_ERR_OPERATION_ERR BIT(0)
> > > > > +/* CRC error detected. */
> > > > > +#define FME_PR_ERR_CRC_ERR BIT(1)
> > > > > +/* Incompatible PR bitstream detected. */
> > > > > +#define FME_PR_ERR_INCOMPATIBLE_BS BIT(2)
> > > > > +/* PR data push protocol violated. */
> > > > > +#define FME_PR_ERR_PROTOCOL_ERR BIT(3)
> > > > > +/* PR data fifo overflow error detected */
> > > > > +#define FME_PR_ERR_FIFO_OVERFLOW BIT(4)
> > > > >
> > > > > /* FME HSSI Sub Feature Register Set */
> > > > > #define FME_HSSI_DFH DFH
> > > >
> > > > I see fpga-dfl.h as enumeration code which is separate from any driver
> > > > implementation specifics other than what's required for the DFL
> > > > enumeration scheme. These PR engine #defines should move to a .h
> > > > that is dedicated to this specific PR hardware device. If someone else
> > > > adds a different PR device to the framework, their PR driver would
> > > > also have its own .h. Same for any other modules that aren't central
> > > > to DFL enumeration.
> > >
> > > Sure, will move PR related register to a separated header file.
> > > DFL enumeration related registers, will still be kept in fpga-dfl.h.
> > >
> > > Thanks
> > > Hao
> > >
> > > >
> > > > Thanks,
> > > > Alan
> > > >
> > > > > --
> > > > > 1.8.3.1
> > > > >
>
> Thanks,
> Moritz



2018-02-05 17:24:34

by Alan Tull

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

On Sun, Feb 4, 2018 at 4:05 AM, Wu Hao <[email protected]> wrote:
> On Sat, Feb 03, 2018 at 11:41:24AM +0100, Moritz Fischer wrote:
>> Hi Hao,
>>
>> On Fri, Feb 02, 2018 at 04:26:26PM -0800, Luebbers, Enno wrote:
>> > Hi Hao, Alan,
>> >
>> > On Fri, Feb 02, 2018 at 05:42:13PM +0800, Wu Hao wrote:
>> > > On Thu, Feb 01, 2018 at 04:00:36PM -0600, Alan Tull wrote:
>> > > > On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:
>> > > >
>> > > > Hi Hao,
>> > > >
>> > > > A few comments below. Besides that, looks good.
>> > > >
>> > > > > This patch adds fpga manager driver for FPGA Management Engine (FME). It
>> > > > > implements fpga_manager_ops for FPGA Partial Reconfiguration function.
>> > > > >
>> > > > > Signed-off-by: Tim Whisonant <[email protected]>
>> > > > > Signed-off-by: Enno Luebbers <[email protected]>
>> > > > > Signed-off-by: Shiva Rao <[email protected]>
>> > > > > Signed-off-by: Christopher Rauer <[email protected]>
>> > > > > Signed-off-by: Kang Luwei <[email protected]>
>> > > > > Signed-off-by: Xiao Guangrong <[email protected]>
>> > > > > Signed-off-by: Wu Hao <[email protected]>
>> > > > > ----
>> > > > > v3: rename driver to dfl-fpga-fme-mgr
>> > > > > implemented status callback for fpga manager
>> > > > > rebased due to fpga api changes
>> > > > > ---
>> > > > > .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
>> > > > > drivers/fpga/Kconfig | 6 +
>> > > > > drivers/fpga/Makefile | 1 +
>> > > > > drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
>> > > > > drivers/fpga/fpga-dfl.h | 39 ++-
>> > > > > 5 files changed, 371 insertions(+), 1 deletion(-)
>> > > > > create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
>> > > > > create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
>> > > > >
>> > > > > diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
>> > > > > new file mode 100644
>> > > > > index 0000000..2d4f917
>> > > > > --- /dev/null
>> > > > > +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
>> > > > > @@ -0,0 +1,8 @@
>> > > > > +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
>> > > > > +Date: November 2017
>> > > > > +KernelVersion: 4.15
>> > > > > +Contact: Wu Hao <[email protected]>
>> > > > > +Description: Read-only. It returns interface id of partial reconfiguration
>> > > > > + hardware. Userspace could use this information to check if
>> > > > > + current hardware is compatible with given image before FPGA
>> > > > > + programming.
>> > > >
>> > > > I'm a little confused by this. I can understand that the PR bitstream
>> > > > has a dependency on the FPGA's static image, but I don't understand
>> > > > the dependency of the bistream on the hardware that is used to program
>> > > > the bitstream to the FPGA.
>> > >
>> > > Sorry for the confusion, the interface_id is used to indicate the version of
>> > > the hardware for partial reconfiguration (it's part of the static image of
>> > > the FPGA device). Will improve the description on this.
>>
>> I'm not sure userland should be making the call on whether what you're
>> trying to load is compatible or not.

Could you explain more about what your concern was about this (unless
Hao has covered it below)?

It makes sense to me in this use case at least since userspace has a
pile of images and is choosing which one to load.

>> Isn't there a way to check this in
>> your PR reconfiguration handler in-kernel?
>
> Hi Moritz
>
> Actually with current driver interface, doing a partial reconfiguration with an
> incompatible image, then driver will report PR failure with error code
> FPGA_MGR_STATUS_INCOMPATIBLE_IMAGE_ERR as hardware checks it, but anyway user
> needs to know hardware interface_id information to find or re-generated correct
> images. I think it's more flexible to leave it to userspace on using this
> information exposed by driver. : )
>
> Thanks
> Hao
>
>>
>> > >
>> >
>> > The interface_id expresses the compatibility of the static region with PR
>> > bitstreams generated for it. It changes every time a new static region is
>> > generated.

In the near future the DFL framework will be used with SoC's that have
a hard FPGA PR manager (that's not part of the static region). The
hard FPGA manager driver won't know anything about the static region.

>> >
>> > Would it make more sense to have the interface_id exposed as part of the FME
>> > device (which represents the static region)? I'm not sure - it kind of also
>> > makes sense here, where you would have all the information in one place (if the
>> > interface_id matches, I can use this component to program a bitstream).

According to the intel-fpga.txt document, the identifier for the
static image is at

/sys/class/fpga_region/regionX/fpga-dfl-fme.n/bitstream_id

>> >
>> > Sorry for my limited understanding of the infrastructure - would this same
>> > "fpga-dfl-fme-mgr.0" be used for PR if we had multiple PR regions? In that case
>> > it would need to expose multiple interface_ids (or we'd have to track both
>> > interface IDs and an identifier for the target PR region).

interface_id sounds like it's trying to give some per-PR region
information. That could support the real case where a FPGA static
region has multiple PR regions and userspace has a bunch of images.
The images are keyed to certain regions. The reason the images are
region-specific could be that the regions have different connections
(possible but I think that's unlikely. but possible) or because the
FPGA doesn't support relocatable PR images (true of most FPGAs that
support PR, but not a problem with Stratix10).

Each interface_ids could be exposed per fpga-region (each fpga-region
maps to a PR region). That wouldn't be hard to implement, driver-wise
I think.

>> >
>> > > >
>> > > > > diff --git a/drivers/fpga/Kconfig b/drivers/fpga/Kconfig
>> > > > > index 57da904..0171ecb 100644
>> > > > > --- a/drivers/fpga/Kconfig
>> > > > > +++ b/drivers/fpga/Kconfig
>> > > > > @@ -150,6 +150,12 @@ config FPGA_DFL_FME
>> > > > > FPGA platform level management features. There shall be 1 FME
>> > > > > per DFL based FPGA device.
>> > > > >
>> > > > > +config FPGA_DFL_FME_MGR
>> > > > > + tristate "FPGA DFL FME Manager Driver"
>> > > > > + depends on FPGA_DFL_FME
>> > > > > + help
>> > > > > + Say Y to enable FPGA Manager driver for FPGA Management Engine.
>> > > > > +
>> > > > > config INTEL_FPGA_DFL_PCI
>> > > > > tristate "Intel FPGA DFL PCIe Device Driver"
>> > > > > depends on PCI && FPGA_DFL
>> > > > > diff --git a/drivers/fpga/Makefile b/drivers/fpga/Makefile
>> > > > > index cc75bb3..6378580 100644
>> > > > > --- a/drivers/fpga/Makefile
>> > > > > +++ b/drivers/fpga/Makefile
>> > > > > @@ -31,6 +31,7 @@ obj-$(CONFIG_OF_FPGA_REGION) += of-fpga-region.o
>> > > > > # FPGA Device Feature List Support
>> > > > > obj-$(CONFIG_FPGA_DFL) += fpga-dfl.o
>> > > > > obj-$(CONFIG_FPGA_DFL_FME) += fpga-dfl-fme.o
>> > > > > +obj-$(CONFIG_FPGA_DFL_FME_MGR) += fpga-dfl-fme-mgr.o
>> > > > >
>> > > > > fpga-dfl-fme-objs := dfl-fme-main.o dfl-fme-pr.o
>> > > > >
>> > > > > diff --git a/drivers/fpga/fpga-dfl-fme-mgr.c b/drivers/fpga/fpga-dfl-fme-mgr.c
>> > > > > new file mode 100644
>> > > > > index 0000000..70356ce
>> > > > > --- /dev/null
>> > > > > +++ b/drivers/fpga/fpga-dfl-fme-mgr.c
>> > > > > @@ -0,0 +1,318 @@
>> > > > > +/*
>> > > > > + * FPGA Manager Driver for FPGA Management Engine (FME)
>> > > > > + *
>> > > > > + * Copyright (C) 2017 Intel Corporation, Inc.
>> > > > > + *
>> > > > > + * Authors:
>> > > > > + * Kang Luwei <[email protected]>
>> > > > > + * Xiao Guangrong <[email protected]>
>> > > > > + * Wu Hao <[email protected]>
>> > > > > + * Joseph Grecco <[email protected]>
>> > > > > + * Enno Luebbers <[email protected]>
>> > > > > + * Tim Whisonant <[email protected]>
>> > > > > + * Ananda Ravuri <[email protected]>
>> > > > > + * Christopher Rauer <[email protected]>
>> > > > > + * Henry Mitchel <[email protected]>
>> > > > > + *
>> > > > > + * This work is licensed under the terms of the GNU GPL version 2.
>> > > > > + * SPDX-License-Identifier: GPL-2.0
>> > > > > + */
>> > > > > +
>> > > > > +#include <linux/module.h>
>> > > > > +#include <linux/iopoll.h>
>> > > > > +#include <linux/fpga/fpga-mgr.h>
>> > > > > +
>> > > > > +#include "fpga-dfl.h"
>> > > > > +#include "dfl-fme.h"
>> > > > > +
>> > > > > +#define PR_WAIT_TIMEOUT 8000000
>> > > > > +#define PR_HOST_STATUS_IDLE 0
>> > > > > +
>> > > > > +struct fme_mgr_priv {
>> > > > > + void __iomem *ioaddr;
>> > > > > + u64 pr_error;
>> > > > > +};
>> > > > > +
>> > > > > +static ssize_t interface_id_show(struct device *dev,
>> > > > > + struct device_attribute *attr, char *buf)
>> > > > > +{
>> > > > > + struct fpga_manager *mgr = dev_get_drvdata(dev);
>> > > > > + struct fme_mgr_priv *priv = mgr->priv;
>> > > > > + u64 intfc_id_l, intfc_id_h;
>> > > > > +
>> > > > > + intfc_id_l = readq(priv->ioaddr + FME_PR_INTFC_ID_L);
>> > > > > + intfc_id_h = readq(priv->ioaddr + FME_PR_INTFC_ID_H);
>> > > > > +
>> > > > > + return scnprintf(buf, PAGE_SIZE, "%016llx%016llx\n",
>> > > > > + (unsigned long long)intfc_id_h,
>> > > > > + (unsigned long long)intfc_id_l);
>> > > > > +}
>> > > > > +static DEVICE_ATTR_RO(interface_id);
>> > > > > +
>> > > > > +static const struct attribute *fme_mgr_attrs[] = {
>> > > > > + &dev_attr_interface_id.attr,
>> > > > > + NULL,
>> > > > > +};
>> > > > > +
>> > > > > +static u64 pr_error_to_mgr_status(u64 err)
>> > > > > +{
>> > > > > + u64 status = 0;
>> > > > > +
>> > > > > + if (err & FME_PR_ERR_OPERATION_ERR)
>> > > > > + status |= FPGA_MGR_STATUS_OPERATION_ERR;
>> > > > > + if (err & FME_PR_ERR_CRC_ERR)
>> > > > > + status |= FPGA_MGR_STATUS_CRC_ERR;
>> > > > > + if (err & FME_PR_ERR_INCOMPATIBLE_BS)
>> > > > > + status |= FPGA_MGR_STATUS_INCOMPATIBLE_IMAGE_ERR;
>> > > > > + if (err & FME_PR_ERR_PROTOCOL_ERR)
>> > > > > + status |= FPGA_MGR_STATUS_IP_PROTOCOL_ERR;
>> > > > > + if (err & FME_PR_ERR_FIFO_OVERFLOW)
>> > > > > + status |= FPGA_MGR_STATUS_FIFO_OVERFLOW_ERR;
>> > > > > +
>> > > > > + return status;
>> > > > > +}
>> > > > > +
>> > > > > +static u64 fme_mgr_pr_error_handle(void __iomem *fme_pr)
>> > > > > +{
>> > > > > + u64 pr_status, pr_error;
>> > > > > +
>> > > > > + pr_status = readq(fme_pr + FME_PR_STS);
>> > > > > + if (!(pr_status & FME_PR_STS_PR_STS))
>> > > > > + return 0;
>> > > > > +
>> > > > > + pr_error = readq(fme_pr + FME_PR_ERR);
>> > > > > + writeq(pr_error, fme_pr + FME_PR_ERR);
>> > > > > +
>> > > > > + return pr_error;
>> > > > > +}
>> > > > > +
>> > > > > +static int fme_mgr_write_init(struct fpga_manager *mgr,
>> > > > > + struct fpga_image_info *info,
>> > > > > + const char *buf, size_t count)
>> > > > > +{
>> > > > > + struct device *dev = &mgr->dev;
>> > > > > + struct fme_mgr_priv *priv = mgr->priv;
>> > > > > + void __iomem *fme_pr = priv->ioaddr;
>> > > > > + u64 pr_ctrl, pr_status;
>> > > > > +
>> > > > > + if (!(info->flags & FPGA_MGR_PARTIAL_RECONFIG)) {
>> > > > > + dev_err(dev, "only support partial reconfiguration.\n");
>> > > >
>> > > > supports
>> > > >
>> > > > > + return -EINVAL;
>> > > > > + }
>> > > > > +
>> > > > > + dev_dbg(dev, "resetting PR before initiated PR\n");
>> > > > > +
>> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
>> > > > > + pr_ctrl |= FME_PR_CTRL_PR_RST;
>> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
>> > > > > +
>> > > > > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
>> > > > > + pr_ctrl & FME_PR_CTRL_PR_RSTACK, 1,
>> > > > > + PR_WAIT_TIMEOUT)) {
>> > > > > + dev_err(dev, "maximum PR timeout\n");
>> > > >
>> > > > We have two dev_err's with the same "maximum PR timeout" so the user
>> > > > would not be able to see at which point the timeout occurred.
>> > > >
>> > > > I suggest to get rid of the word 'maximum' for both. This one could
>> > > > be 'reset ack timeout" or something similar.
>> > >
>> > > Sure, will switch to a more specific message per your suggestion. Thanks.
>> > >
>> > > >
>> > > > > + return -ETIMEDOUT;
>> > > > > + }
>> > > > > +
>> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
>> > > > > + pr_ctrl &= ~FME_PR_CTRL_PR_RST;
>> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
>> > > > > +
>> > > > > + dev_dbg(dev,
>> > > > > + "waiting for PR resource in HW to be initialized and ready\n");
>> > > > > +
>> > > > > + if (readq_poll_timeout(fme_pr + FME_PR_STS, pr_status,
>> > > > > + (pr_status & FME_PR_STS_PR_STS) ==
>> > > > > + FME_PR_STS_PR_STS_IDLE, 1, PR_WAIT_TIMEOUT)) {
>> > > > > + dev_err(dev, "maximum PR timeout\n");
>> > > >
>> > > > "PR_STS timeout"? Or something better.
>> > > >
>> > > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
>> > > > > + return -ETIMEDOUT;
>> > > > > + }
>> > > > > +
>> > > > > + dev_dbg(dev, "check and clear previous PR error\n");
>> > > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
>> > > > > + if (priv->pr_error)
>> > > > > + dev_dbg(dev, "previous PR error detected %llx\n",
>> > > > > + (unsigned long long)priv->pr_error);
>> > > > > +
>> > > > > + dev_dbg(dev, "set PR port ID\n");
>> > > > > +
>> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
>> > > > > + pr_ctrl &= ~FME_PR_CTRL_PR_RGN_ID;
>> > > > > + pr_ctrl |= FIELD_PREP(FME_PR_CTRL_PR_RGN_ID, info->region_id);
>> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
>> > > > > +
>> > > > > + return 0;
>> > > > > +}
>> > > > > +
>> > > > > +static int fme_mgr_write(struct fpga_manager *mgr,
>> > > > > + const char *buf, size_t count)
>> > > > > +{
>> > > > > + struct device *dev = &mgr->dev;
>> > > > > + struct fme_mgr_priv *priv = mgr->priv;
>> > > > > + void __iomem *fme_pr = priv->ioaddr;
>> > > > > + u64 pr_ctrl, pr_status, pr_data;
>> > > > > + int delay = 0, pr_credit, i = 0;
>> > > > > +
>> > > > > + dev_dbg(dev, "start request\n");
>> > > > > +
>> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
>> > > > > + pr_ctrl |= FME_PR_CTRL_PR_START;
>> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
>> > > > > +
>> > > > > + dev_dbg(dev, "pushing data from bitstream to HW\n");
>> > > > > +
>> > > > > + pr_status = readq(fme_pr + FME_PR_STS);
>> > > > > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
>> > > > > +
>> > > > > + while (count > 0) {
>> > > > > + while (pr_credit <= 1) {
>> > > > > + if (delay++ > PR_WAIT_TIMEOUT) {
>> > > > > + dev_err(dev, "maximum try\n");
>> > > >
>> > > > It looks like this is waiting for an entry in a queue and timing out
>> > > > here. Could you add a some comments for pr_credit above and this
>> > > > loop? Also a better error message, perhaps "PR credit timeout"?
>> > >
>> > > Driver needs to read the PR credit to know if it could push PR data
>> > > to hardware or not. I will add more description here on this PR credit,
>> > > and use "PR credit timeout" as error message.
>> > >
>> > > >
>> > > > > + return -ETIMEDOUT;
>> > > > > + }
>> > > > > + udelay(1);
>> > > > > +
>> > > > > + pr_status = readq(fme_pr + FME_PR_STS);
>> > > > > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
>> > > > > + }
>> > > > > +
>> > > > > + if (count >= 4) {
>> > > > > + pr_data = 0;
>> > > > > + pr_data |= FIELD_PREP(FME_PR_DATA_PR_DATA_RAW,
>> > > > > + *(((u32 *)buf) + i));
>> > > > > + writeq(pr_data, fme_pr + FME_PR_DATA);
>> > > > > + count -= 4;
>> > > > > + pr_credit--;
>> > > > > + i++;
>> > > > > + } else {
>> > > > > + WARN_ON(1);
>> > > > > + return -EINVAL;
>> > > > > + }
>> > > > > + }
>> > > > > +
>> > > > > + return 0;
>> > > > > +}
>> > > > > +
>> > > > > +static int fme_mgr_write_complete(struct fpga_manager *mgr,
>> > > > > + struct fpga_image_info *info)
>> > > > > +{
>> > > > > + struct device *dev = &mgr->dev;
>> > > > > + struct fme_mgr_priv *priv = mgr->priv;
>> > > > > + void __iomem *fme_pr = priv->ioaddr;
>> > > > > + u64 pr_ctrl;
>> > > > > +
>> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
>> > > > > + pr_ctrl |= FME_PR_CTRL_PR_COMPLETE;
>> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
>> > > > > +
>> > > > > + dev_dbg(dev, "green bitstream push complete\n");
>> > > > > + dev_dbg(dev, "waiting for HW to release PR resource\n");
>> > > > > +
>> > > > > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
>> > > > > + !(pr_ctrl & FME_PR_CTRL_PR_START), 1,
>> > > > > + PR_WAIT_TIMEOUT)) {
>> > > > > + dev_err(dev, "maximum try.\n");
>> > > >
>> > > > Some message specific to here also, please.
>> > > >
>> > > > > + return -ETIMEDOUT;
>> > > > > + }
>> > > > > +
>> > > > > + dev_dbg(dev, "PR operation complete, checking status\n");
>> > > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
>> > > > > + if (priv->pr_error) {
>> > > > > + dev_dbg(dev, "PR error detected %llx\n",
>> > > > > + (unsigned long long)priv->pr_error);
>> > > > > + return -EIO;
>> > > > > + }
>> > > > > +
>> > > > > + dev_dbg(dev, "PR done successfully\n");
>> > > > > +
>> > > > > + return 0;
>> > > > > +}
>> > > > > +
>> > > > > +static enum fpga_mgr_states fme_mgr_state(struct fpga_manager *mgr)
>> > > > > +{
>> > > > > + return FPGA_MGR_STATE_UNKNOWN;
>> > > > > +}
>> > > > > +
>> > > > > +static u64 fme_mgr_status(struct fpga_manager *mgr)
>> > > > > +{
>> > > > > + struct fme_mgr_priv *priv = mgr->priv;
>> > > > > +
>> > > > > + return pr_error_to_mgr_status(priv->pr_error);
>> > > > > +}
>> > > > > +
>> > > > > +static const struct fpga_manager_ops fme_mgr_ops = {
>> > > > > + .write_init = fme_mgr_write_init,
>> > > > > + .write = fme_mgr_write,
>> > > > > + .write_complete = fme_mgr_write_complete,
>> > > > > + .state = fme_mgr_state,
>> > > > > + .status = fme_mgr_status,
>> > > > > +};
>> > > > > +
>> > > > > +static int fme_mgr_probe(struct platform_device *pdev)
>> > > > > +{
>> > > > > + struct device *dev = &pdev->dev;
>> > > > > + struct fme_mgr_priv *priv;
>> > > > > + struct fpga_manager *mgr;
>> > > > > + struct resource *res;
>> > > > > + int ret;
>> > > > > +
>> > > > > + priv = devm_kzalloc(dev, sizeof(*priv), GFP_KERNEL);
>> > > > > + if (!priv)
>> > > > > + return -ENOMEM;
>> > > > > +
>> > > > > + res = platform_get_resource(pdev, IORESOURCE_MEM, 0);
>> > > > > + priv->ioaddr = devm_ioremap(dev, res->start, resource_size(res));
>> > > > > + if (IS_ERR(priv->ioaddr))
>> > > > > + return PTR_ERR(priv->ioaddr);
>> > > > > +
>> > > > > + ret = sysfs_create_files(&pdev->dev.kobj, fme_mgr_attrs);
>> > > > > + if (ret)
>> > > > > + return ret;
>> > > > > +
>> > > > > + mgr = devm_kzalloc(dev, sizeof(*mgr), GFP_KERNEL);
>> > > > > + if (!mgr)
>> > > > > + goto sysfs_remove_exit;
>> > > > > +
>> > > > > + mgr->name = "DFL FPGA Manager";
>> > > > > + mgr->mops = &fme_mgr_ops;
>> > > > > + mgr->priv = priv;
>> > > > > + mgr->parent = dev;
>> > > > > + platform_set_drvdata(pdev, mgr);
>> > > > > +
>> > > > > + ret = fpga_mgr_register(mgr);
>> > > > > + if (ret) {
>> > > > > + dev_err(dev, "unable to register FPGA manager\n");
>> > > > > + goto sysfs_remove_exit;
>> > > > > + }
>> > > > > +
>> > > > > + return 0;
>> > > > > +
>> > > > > +sysfs_remove_exit:
>> > > > > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
>> > > > > + return ret;
>> > > > > +}
>> > > > > +
>> > > > > +static int fme_mgr_remove(struct platform_device *pdev)
>> > > > > +{
>> > > > > + struct fpga_manager *mgr = platform_get_drvdata(pdev);
>> > > > > +
>> > > > > + fpga_mgr_unregister(mgr);
>> > > > > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
>> > > > > +
>> > > > > + return 0;
>> > > > > +}
>> > > > > +
>> > > > > +static struct platform_driver fme_mgr_driver = {
>> > > > > + .driver = {
>> > > > > + .name = FPGA_DFL_FME_MGR,
>> > > > > + },
>> > > > > + .probe = fme_mgr_probe,
>> > > > > + .remove = fme_mgr_remove,
>> > > > > +};
>> > > > > +
>> > > > > +module_platform_driver(fme_mgr_driver);
>> > > > > +
>> > > > > +MODULE_DESCRIPTION("FPGA Manager for FPGA Management Engine");
>> > > > > +MODULE_AUTHOR("Intel Corporation");
>> > > > > +MODULE_LICENSE("GPL v2");
>> > > > > +MODULE_ALIAS("platform:fpga-dfl-fme-mgr");
>> > > > > diff --git a/drivers/fpga/fpga-dfl.h b/drivers/fpga/fpga-dfl.h
>> > > > > index e5a1094..d45eb82 100644
>> > > > > --- a/drivers/fpga/fpga-dfl.h
>> > > > > +++ b/drivers/fpga/fpga-dfl.h
>> > > > > @@ -130,7 +130,44 @@
>> > > > >
>> > > > > /* FME Partial Reconfiguration Sub Feature Register Set */
>> > > > > #define FME_PR_DFH DFH
>> > > > > -#define FME_PR_SIZE DFH_SIZE
>> > > > > +#define FME_PR_CTRL 0x8
>> > > > > +#define FME_PR_STS 0x10
>> > > > > +#define FME_PR_DATA 0x18
>> > > > > +#define FME_PR_ERR 0x20
>> > > > > +#define FME_PR_INTFC_ID_H 0xA8
>> > > > > +#define FME_PR_INTFC_ID_L 0xB0
>> > > > > +#define FME_PR_SIZE 0xB8
>> > > > > +
>> > > > > +/* FME PR Control Register Bitfield */
>> > > > > +#define FME_PR_CTRL_PR_RST BIT(0) /* Reset PR engine */
>> > > > > +#define FME_PR_CTRL_PR_RSTACK BIT(4) /* Ack for PR engine reset */
>> > > > > +#define FME_PR_CTRL_PR_RGN_ID GENMASK_ULL(9, 7) /* PR Region ID */
>> > > > > +#define FME_PR_CTRL_PR_START BIT(12) /* Start to request for PR service */
>> > > > > +#define FME_PR_CTRL_PR_COMPLETE BIT(13) /* PR data push complete notification */
>> > > > > +
>> > > > > +/* FME PR Status Register Bitfield */
>> > > > > +/* Number of available entries in HW queue inside the PR engine. */
>> > > > > +#define FME_PR_STS_PR_CREDIT GENMASK_ULL(8, 0)
>> > > > > +#define FME_PR_STS_PR_STS BIT(16) /* PR operation status */
>> > > > > +#define FME_PR_STS_PR_STS_IDLE 0
>> > > > > +#define FME_PR_STS_PR_CTRLR_STS GENMASK_ULL(22, 20) /* Controller status */
>> > > > > +#define FME_PR_STS_PR_HOST_STS GENMASK_ULL(27, 24) /* PR host status */
>> > > > > +
>> > > > > +/* FME PR Data Register Bitfield */
>> > > > > +/* PR data from the raw-binary file. */
>> > > > > +#define FME_PR_DATA_PR_DATA_RAW GENMASK_ULL(32, 0)
>> > > > > +
>> > > > > +/* FME PR Error Register */
>> > > > > +/* Previous PR Operation errors detected. */
>> > > > > +#define FME_PR_ERR_OPERATION_ERR BIT(0)
>> > > > > +/* CRC error detected. */
>> > > > > +#define FME_PR_ERR_CRC_ERR BIT(1)
>> > > > > +/* Incompatible PR bitstream detected. */
>> > > > > +#define FME_PR_ERR_INCOMPATIBLE_BS BIT(2)
>> > > > > +/* PR data push protocol violated. */
>> > > > > +#define FME_PR_ERR_PROTOCOL_ERR BIT(3)
>> > > > > +/* PR data fifo overflow error detected */
>> > > > > +#define FME_PR_ERR_FIFO_OVERFLOW BIT(4)
>> > > > >
>> > > > > /* FME HSSI Sub Feature Register Set */
>> > > > > #define FME_HSSI_DFH DFH
>> > > >
>> > > > I see fpga-dfl.h as enumeration code which is separate from any driver
>> > > > implementation specifics other than what's required for the DFL
>> > > > enumeration scheme. These PR engine #defines should move to a .h
>> > > > that is dedicated to this specific PR hardware device. If someone else
>> > > > adds a different PR device to the framework, their PR driver would
>> > > > also have its own .h. Same for any other modules that aren't central
>> > > > to DFL enumeration.
>> > >
>> > > Sure, will move PR related register to a separated header file.
>> > > DFL enumeration related registers, will still be kept in fpga-dfl.h.
>> > >
>> > > Thanks
>> > > Hao
>> > >
>> > > >
>> > > > Thanks,
>> > > > Alan
>> > > >
>> > > > > --
>> > > > > 1.8.3.1
>> > > > >
>>
>> Thanks,
>> Moritz
>
>

2018-02-05 18:39:09

by Luebbers, Enno

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

Hi Hao,

On Sun, Feb 04, 2018 at 05:37:06PM +0800, Wu Hao wrote:
> On Fri, Feb 02, 2018 at 04:26:26PM -0800, Luebbers, Enno wrote:
> > Hi Hao, Alan,
> >
> > On Fri, Feb 02, 2018 at 05:42:13PM +0800, Wu Hao wrote:
> > > On Thu, Feb 01, 2018 at 04:00:36PM -0600, Alan Tull wrote:
> > > > On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:
> > > >
> > > > Hi Hao,
> > > >
> > > > A few comments below. Besides that, looks good.
> > > >
> > > > > This patch adds fpga manager driver for FPGA Management Engine (FME). It
> > > > > implements fpga_manager_ops for FPGA Partial Reconfiguration function.
> > > > >
> > > > > Signed-off-by: Tim Whisonant <[email protected]>
> > > > > Signed-off-by: Enno Luebbers <[email protected]>
> > > > > Signed-off-by: Shiva Rao <[email protected]>
> > > > > Signed-off-by: Christopher Rauer <[email protected]>
> > > > > Signed-off-by: Kang Luwei <[email protected]>
> > > > > Signed-off-by: Xiao Guangrong <[email protected]>
> > > > > Signed-off-by: Wu Hao <[email protected]>
> > > > > ----
> > > > > v3: rename driver to dfl-fpga-fme-mgr
> > > > > implemented status callback for fpga manager
> > > > > rebased due to fpga api changes
> > > > > ---
> > > > > .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
> > > > > drivers/fpga/Kconfig | 6 +
> > > > > drivers/fpga/Makefile | 1 +
> > > > > drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
> > > > > drivers/fpga/fpga-dfl.h | 39 ++-
> > > > > 5 files changed, 371 insertions(+), 1 deletion(-)
> > > > > create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > > > create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
> > > > >
> > > > > diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > > > new file mode 100644
> > > > > index 0000000..2d4f917
> > > > > --- /dev/null
> > > > > +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > > > @@ -0,0 +1,8 @@
> > > > > +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
> > > > > +Date: November 2017
> > > > > +KernelVersion: 4.15
> > > > > +Contact: Wu Hao <[email protected]>
> > > > > +Description: Read-only. It returns interface id of partial reconfiguration
> > > > > + hardware. Userspace could use this information to check if
> > > > > + current hardware is compatible with given image before FPGA
> > > > > + programming.
> > > >
> > > > I'm a little confused by this. I can understand that the PR bitstream
> > > > has a dependency on the FPGA's static image, but I don't understand
> > > > the dependency of the bistream on the hardware that is used to program
> > > > the bitstream to the FPGA.
> > >
> > > Sorry for the confusion, the interface_id is used to indicate the version of
> > > the hardware for partial reconfiguration (it's part of the static image of
> > > the FPGA device). Will improve the description on this.
> > >
> >
> > The interface_id expresses the compatibility of the static region with PR
> > bitstreams generated for it. It changes every time a new static region is
> > generated.
> >
> > Would it make more sense to have the interface_id exposed as part of the FME
> > device (which represents the static region)? I'm not sure - it kind of also
> > makes sense here, where you would have all the information in one place (if the
> > interface_id matches, I can use this component to program a bitstream).
>
> Hi Enno
>
> Yes, this interface is under fpga-dfl-fme-mgr.0, and fpga-dfl-fme-mgr.0 is
> under fpga-dfl-fme.0. It's part of the FME device for sure. From another
> point of view, it means if anyone wants to do PR on this Intel FPGA device,
> he needs to find the FME device firstly, and then check if any fpga manager
> created under this FME device, if yes, check the interface_id before PR via
> the FME device node ioctl.

That sounds good, thank you!

>
> >
> > Sorry for my limited understanding of the infrastructure - would this same
> > "fpga-dfl-fme-mgr.0" be used for PR if we had multiple PR regions? In that case
> > it would need to expose multiple interface_ids (or we'd have to track both
> > interface IDs and an identifier for the target PR region).
>
> Yes, the fpga manager could be shared with different PR regions.
>
> Sorry, I'm not sure where we need to expose multiple interface_ids and why.

It's basically a question of how to determine bitstream compatibility - either,
there's a separate interface_id per reconfigurable region, or there is a single
interface_id for the entire device. Both make sense from a certain perspective.

If there are multiple interface_ids per device (one per region), the driver
would need to expose all of them. If there's only a single one, the driver only
exposes that one ID - compatibility would be determined by looking at both that
single interface_id _and_ the identifier/number of the targeted region.

I would prefer a separate interface_id per region - it seems more generic and
flexible.

Thanks
- Enno

>
> Thanks
> Hao
>
> >
> > > >
> > > > > diff --git a/drivers/fpga/Kconfig b/drivers/fpga/Kconfig
> > > > > index 57da904..0171ecb 100644
> > > > > --- a/drivers/fpga/Kconfig
> > > > > +++ b/drivers/fpga/Kconfig
> > > > > @@ -150,6 +150,12 @@ config FPGA_DFL_FME
> > > > > FPGA platform level management features. There shall be 1 FME
> > > > > per DFL based FPGA device.
> > > > >
> > > > > +config FPGA_DFL_FME_MGR
> > > > > + tristate "FPGA DFL FME Manager Driver"
> > > > > + depends on FPGA_DFL_FME
> > > > > + help
> > > > > + Say Y to enable FPGA Manager driver for FPGA Management Engine.
> > > > > +
> > > > > config INTEL_FPGA_DFL_PCI
> > > > > tristate "Intel FPGA DFL PCIe Device Driver"
> > > > > depends on PCI && FPGA_DFL
> > > > > diff --git a/drivers/fpga/Makefile b/drivers/fpga/Makefile
> > > > > index cc75bb3..6378580 100644
> > > > > --- a/drivers/fpga/Makefile
> > > > > +++ b/drivers/fpga/Makefile
> > > > > @@ -31,6 +31,7 @@ obj-$(CONFIG_OF_FPGA_REGION) += of-fpga-region.o
> > > > > # FPGA Device Feature List Support
> > > > > obj-$(CONFIG_FPGA_DFL) += fpga-dfl.o
> > > > > obj-$(CONFIG_FPGA_DFL_FME) += fpga-dfl-fme.o
> > > > > +obj-$(CONFIG_FPGA_DFL_FME_MGR) += fpga-dfl-fme-mgr.o
> > > > >
> > > > > fpga-dfl-fme-objs := dfl-fme-main.o dfl-fme-pr.o
> > > > >
> > > > > diff --git a/drivers/fpga/fpga-dfl-fme-mgr.c b/drivers/fpga/fpga-dfl-fme-mgr.c
> > > > > new file mode 100644
> > > > > index 0000000..70356ce
> > > > > --- /dev/null
> > > > > +++ b/drivers/fpga/fpga-dfl-fme-mgr.c
> > > > > @@ -0,0 +1,318 @@
> > > > > +/*
> > > > > + * FPGA Manager Driver for FPGA Management Engine (FME)
> > > > > + *
> > > > > + * Copyright (C) 2017 Intel Corporation, Inc.
> > > > > + *
> > > > > + * Authors:
> > > > > + * Kang Luwei <[email protected]>
> > > > > + * Xiao Guangrong <[email protected]>
> > > > > + * Wu Hao <[email protected]>
> > > > > + * Joseph Grecco <[email protected]>
> > > > > + * Enno Luebbers <[email protected]>
> > > > > + * Tim Whisonant <[email protected]>
> > > > > + * Ananda Ravuri <[email protected]>
> > > > > + * Christopher Rauer <[email protected]>
> > > > > + * Henry Mitchel <[email protected]>
> > > > > + *
> > > > > + * This work is licensed under the terms of the GNU GPL version 2.
> > > > > + * SPDX-License-Identifier: GPL-2.0
> > > > > + */
> > > > > +
> > > > > +#include <linux/module.h>
> > > > > +#include <linux/iopoll.h>
> > > > > +#include <linux/fpga/fpga-mgr.h>
> > > > > +
> > > > > +#include "fpga-dfl.h"
> > > > > +#include "dfl-fme.h"
> > > > > +
> > > > > +#define PR_WAIT_TIMEOUT 8000000
> > > > > +#define PR_HOST_STATUS_IDLE 0
> > > > > +
> > > > > +struct fme_mgr_priv {
> > > > > + void __iomem *ioaddr;
> > > > > + u64 pr_error;
> > > > > +};
> > > > > +
> > > > > +static ssize_t interface_id_show(struct device *dev,
> > > > > + struct device_attribute *attr, char *buf)
> > > > > +{
> > > > > + struct fpga_manager *mgr = dev_get_drvdata(dev);
> > > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > > + u64 intfc_id_l, intfc_id_h;
> > > > > +
> > > > > + intfc_id_l = readq(priv->ioaddr + FME_PR_INTFC_ID_L);
> > > > > + intfc_id_h = readq(priv->ioaddr + FME_PR_INTFC_ID_H);
> > > > > +
> > > > > + return scnprintf(buf, PAGE_SIZE, "%016llx%016llx\n",
> > > > > + (unsigned long long)intfc_id_h,
> > > > > + (unsigned long long)intfc_id_l);
> > > > > +}
> > > > > +static DEVICE_ATTR_RO(interface_id);
> > > > > +
> > > > > +static const struct attribute *fme_mgr_attrs[] = {
> > > > > + &dev_attr_interface_id.attr,
> > > > > + NULL,
> > > > > +};
> > > > > +
> > > > > +static u64 pr_error_to_mgr_status(u64 err)
> > > > > +{
> > > > > + u64 status = 0;
> > > > > +
> > > > > + if (err & FME_PR_ERR_OPERATION_ERR)
> > > > > + status |= FPGA_MGR_STATUS_OPERATION_ERR;
> > > > > + if (err & FME_PR_ERR_CRC_ERR)
> > > > > + status |= FPGA_MGR_STATUS_CRC_ERR;
> > > > > + if (err & FME_PR_ERR_INCOMPATIBLE_BS)
> > > > > + status |= FPGA_MGR_STATUS_INCOMPATIBLE_IMAGE_ERR;
> > > > > + if (err & FME_PR_ERR_PROTOCOL_ERR)
> > > > > + status |= FPGA_MGR_STATUS_IP_PROTOCOL_ERR;
> > > > > + if (err & FME_PR_ERR_FIFO_OVERFLOW)
> > > > > + status |= FPGA_MGR_STATUS_FIFO_OVERFLOW_ERR;
> > > > > +
> > > > > + return status;
> > > > > +}
> > > > > +
> > > > > +static u64 fme_mgr_pr_error_handle(void __iomem *fme_pr)
> > > > > +{
> > > > > + u64 pr_status, pr_error;
> > > > > +
> > > > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > > > + if (!(pr_status & FME_PR_STS_PR_STS))
> > > > > + return 0;
> > > > > +
> > > > > + pr_error = readq(fme_pr + FME_PR_ERR);
> > > > > + writeq(pr_error, fme_pr + FME_PR_ERR);
> > > > > +
> > > > > + return pr_error;
> > > > > +}
> > > > > +
> > > > > +static int fme_mgr_write_init(struct fpga_manager *mgr,
> > > > > + struct fpga_image_info *info,
> > > > > + const char *buf, size_t count)
> > > > > +{
> > > > > + struct device *dev = &mgr->dev;
> > > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > > + void __iomem *fme_pr = priv->ioaddr;
> > > > > + u64 pr_ctrl, pr_status;
> > > > > +
> > > > > + if (!(info->flags & FPGA_MGR_PARTIAL_RECONFIG)) {
> > > > > + dev_err(dev, "only support partial reconfiguration.\n");
> > > >
> > > > supports
> > > >
> > > > > + return -EINVAL;
> > > > > + }
> > > > > +
> > > > > + dev_dbg(dev, "resetting PR before initiated PR\n");
> > > > > +
> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > > + pr_ctrl |= FME_PR_CTRL_PR_RST;
> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > > +
> > > > > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> > > > > + pr_ctrl & FME_PR_CTRL_PR_RSTACK, 1,
> > > > > + PR_WAIT_TIMEOUT)) {
> > > > > + dev_err(dev, "maximum PR timeout\n");
> > > >
> > > > We have two dev_err's with the same "maximum PR timeout" so the user
> > > > would not be able to see at which point the timeout occurred.
> > > >
> > > > I suggest to get rid of the word 'maximum' for both. This one could
> > > > be 'reset ack timeout" or something similar.
> > >
> > > Sure, will switch to a more specific message per your suggestion. Thanks.
> > >
> > > >
> > > > > + return -ETIMEDOUT;
> > > > > + }
> > > > > +
> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > > + pr_ctrl &= ~FME_PR_CTRL_PR_RST;
> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > > +
> > > > > + dev_dbg(dev,
> > > > > + "waiting for PR resource in HW to be initialized and ready\n");
> > > > > +
> > > > > + if (readq_poll_timeout(fme_pr + FME_PR_STS, pr_status,
> > > > > + (pr_status & FME_PR_STS_PR_STS) ==
> > > > > + FME_PR_STS_PR_STS_IDLE, 1, PR_WAIT_TIMEOUT)) {
> > > > > + dev_err(dev, "maximum PR timeout\n");
> > > >
> > > > "PR_STS timeout"? Or something better.
> > > >
> > > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > > > + return -ETIMEDOUT;
> > > > > + }
> > > > > +
> > > > > + dev_dbg(dev, "check and clear previous PR error\n");
> > > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > > > + if (priv->pr_error)
> > > > > + dev_dbg(dev, "previous PR error detected %llx\n",
> > > > > + (unsigned long long)priv->pr_error);
> > > > > +
> > > > > + dev_dbg(dev, "set PR port ID\n");
> > > > > +
> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > > + pr_ctrl &= ~FME_PR_CTRL_PR_RGN_ID;
> > > > > + pr_ctrl |= FIELD_PREP(FME_PR_CTRL_PR_RGN_ID, info->region_id);
> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > > +
> > > > > + return 0;
> > > > > +}
> > > > > +
> > > > > +static int fme_mgr_write(struct fpga_manager *mgr,
> > > > > + const char *buf, size_t count)
> > > > > +{
> > > > > + struct device *dev = &mgr->dev;
> > > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > > + void __iomem *fme_pr = priv->ioaddr;
> > > > > + u64 pr_ctrl, pr_status, pr_data;
> > > > > + int delay = 0, pr_credit, i = 0;
> > > > > +
> > > > > + dev_dbg(dev, "start request\n");
> > > > > +
> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > > + pr_ctrl |= FME_PR_CTRL_PR_START;
> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > > +
> > > > > + dev_dbg(dev, "pushing data from bitstream to HW\n");
> > > > > +
> > > > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > > > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> > > > > +
> > > > > + while (count > 0) {
> > > > > + while (pr_credit <= 1) {
> > > > > + if (delay++ > PR_WAIT_TIMEOUT) {
> > > > > + dev_err(dev, "maximum try\n");
> > > >
> > > > It looks like this is waiting for an entry in a queue and timing out
> > > > here. Could you add a some comments for pr_credit above and this
> > > > loop? Also a better error message, perhaps "PR credit timeout"?
> > >
> > > Driver needs to read the PR credit to know if it could push PR data
> > > to hardware or not. I will add more description here on this PR credit,
> > > and use "PR credit timeout" as error message.
> > >
> > > >
> > > > > + return -ETIMEDOUT;
> > > > > + }
> > > > > + udelay(1);
> > > > > +
> > > > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > > > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> > > > > + }
> > > > > +
> > > > > + if (count >= 4) {
> > > > > + pr_data = 0;
> > > > > + pr_data |= FIELD_PREP(FME_PR_DATA_PR_DATA_RAW,
> > > > > + *(((u32 *)buf) + i));
> > > > > + writeq(pr_data, fme_pr + FME_PR_DATA);
> > > > > + count -= 4;
> > > > > + pr_credit--;
> > > > > + i++;
> > > > > + } else {
> > > > > + WARN_ON(1);
> > > > > + return -EINVAL;
> > > > > + }
> > > > > + }
> > > > > +
> > > > > + return 0;
> > > > > +}
> > > > > +
> > > > > +static int fme_mgr_write_complete(struct fpga_manager *mgr,
> > > > > + struct fpga_image_info *info)
> > > > > +{
> > > > > + struct device *dev = &mgr->dev;
> > > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > > + void __iomem *fme_pr = priv->ioaddr;
> > > > > + u64 pr_ctrl;
> > > > > +
> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > > + pr_ctrl |= FME_PR_CTRL_PR_COMPLETE;
> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > > +
> > > > > + dev_dbg(dev, "green bitstream push complete\n");
> > > > > + dev_dbg(dev, "waiting for HW to release PR resource\n");
> > > > > +
> > > > > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> > > > > + !(pr_ctrl & FME_PR_CTRL_PR_START), 1,
> > > > > + PR_WAIT_TIMEOUT)) {
> > > > > + dev_err(dev, "maximum try.\n");
> > > >
> > > > Some message specific to here also, please.
> > > >
> > > > > + return -ETIMEDOUT;
> > > > > + }
> > > > > +
> > > > > + dev_dbg(dev, "PR operation complete, checking status\n");
> > > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > > > + if (priv->pr_error) {
> > > > > + dev_dbg(dev, "PR error detected %llx\n",
> > > > > + (unsigned long long)priv->pr_error);
> > > > > + return -EIO;
> > > > > + }
> > > > > +
> > > > > + dev_dbg(dev, "PR done successfully\n");
> > > > > +
> > > > > + return 0;
> > > > > +}
> > > > > +
> > > > > +static enum fpga_mgr_states fme_mgr_state(struct fpga_manager *mgr)
> > > > > +{
> > > > > + return FPGA_MGR_STATE_UNKNOWN;
> > > > > +}
> > > > > +
> > > > > +static u64 fme_mgr_status(struct fpga_manager *mgr)
> > > > > +{
> > > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > > +
> > > > > + return pr_error_to_mgr_status(priv->pr_error);
> > > > > +}
> > > > > +
> > > > > +static const struct fpga_manager_ops fme_mgr_ops = {
> > > > > + .write_init = fme_mgr_write_init,
> > > > > + .write = fme_mgr_write,
> > > > > + .write_complete = fme_mgr_write_complete,
> > > > > + .state = fme_mgr_state,
> > > > > + .status = fme_mgr_status,
> > > > > +};
> > > > > +
> > > > > +static int fme_mgr_probe(struct platform_device *pdev)
> > > > > +{
> > > > > + struct device *dev = &pdev->dev;
> > > > > + struct fme_mgr_priv *priv;
> > > > > + struct fpga_manager *mgr;
> > > > > + struct resource *res;
> > > > > + int ret;
> > > > > +
> > > > > + priv = devm_kzalloc(dev, sizeof(*priv), GFP_KERNEL);
> > > > > + if (!priv)
> > > > > + return -ENOMEM;
> > > > > +
> > > > > + res = platform_get_resource(pdev, IORESOURCE_MEM, 0);
> > > > > + priv->ioaddr = devm_ioremap(dev, res->start, resource_size(res));
> > > > > + if (IS_ERR(priv->ioaddr))
> > > > > + return PTR_ERR(priv->ioaddr);
> > > > > +
> > > > > + ret = sysfs_create_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > > > + if (ret)
> > > > > + return ret;
> > > > > +
> > > > > + mgr = devm_kzalloc(dev, sizeof(*mgr), GFP_KERNEL);
> > > > > + if (!mgr)
> > > > > + goto sysfs_remove_exit;
> > > > > +
> > > > > + mgr->name = "DFL FPGA Manager";
> > > > > + mgr->mops = &fme_mgr_ops;
> > > > > + mgr->priv = priv;
> > > > > + mgr->parent = dev;
> > > > > + platform_set_drvdata(pdev, mgr);
> > > > > +
> > > > > + ret = fpga_mgr_register(mgr);
> > > > > + if (ret) {
> > > > > + dev_err(dev, "unable to register FPGA manager\n");
> > > > > + goto sysfs_remove_exit;
> > > > > + }
> > > > > +
> > > > > + return 0;
> > > > > +
> > > > > +sysfs_remove_exit:
> > > > > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > > > + return ret;
> > > > > +}
> > > > > +
> > > > > +static int fme_mgr_remove(struct platform_device *pdev)
> > > > > +{
> > > > > + struct fpga_manager *mgr = platform_get_drvdata(pdev);
> > > > > +
> > > > > + fpga_mgr_unregister(mgr);
> > > > > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > > > +
> > > > > + return 0;
> > > > > +}
> > > > > +
> > > > > +static struct platform_driver fme_mgr_driver = {
> > > > > + .driver = {
> > > > > + .name = FPGA_DFL_FME_MGR,
> > > > > + },
> > > > > + .probe = fme_mgr_probe,
> > > > > + .remove = fme_mgr_remove,
> > > > > +};
> > > > > +
> > > > > +module_platform_driver(fme_mgr_driver);
> > > > > +
> > > > > +MODULE_DESCRIPTION("FPGA Manager for FPGA Management Engine");
> > > > > +MODULE_AUTHOR("Intel Corporation");
> > > > > +MODULE_LICENSE("GPL v2");
> > > > > +MODULE_ALIAS("platform:fpga-dfl-fme-mgr");
> > > > > diff --git a/drivers/fpga/fpga-dfl.h b/drivers/fpga/fpga-dfl.h
> > > > > index e5a1094..d45eb82 100644
> > > > > --- a/drivers/fpga/fpga-dfl.h
> > > > > +++ b/drivers/fpga/fpga-dfl.h
> > > > > @@ -130,7 +130,44 @@
> > > > >
> > > > > /* FME Partial Reconfiguration Sub Feature Register Set */
> > > > > #define FME_PR_DFH DFH
> > > > > -#define FME_PR_SIZE DFH_SIZE
> > > > > +#define FME_PR_CTRL 0x8
> > > > > +#define FME_PR_STS 0x10
> > > > > +#define FME_PR_DATA 0x18
> > > > > +#define FME_PR_ERR 0x20
> > > > > +#define FME_PR_INTFC_ID_H 0xA8
> > > > > +#define FME_PR_INTFC_ID_L 0xB0
> > > > > +#define FME_PR_SIZE 0xB8
> > > > > +
> > > > > +/* FME PR Control Register Bitfield */
> > > > > +#define FME_PR_CTRL_PR_RST BIT(0) /* Reset PR engine */
> > > > > +#define FME_PR_CTRL_PR_RSTACK BIT(4) /* Ack for PR engine reset */
> > > > > +#define FME_PR_CTRL_PR_RGN_ID GENMASK_ULL(9, 7) /* PR Region ID */
> > > > > +#define FME_PR_CTRL_PR_START BIT(12) /* Start to request for PR service */
> > > > > +#define FME_PR_CTRL_PR_COMPLETE BIT(13) /* PR data push complete notification */
> > > > > +
> > > > > +/* FME PR Status Register Bitfield */
> > > > > +/* Number of available entries in HW queue inside the PR engine. */
> > > > > +#define FME_PR_STS_PR_CREDIT GENMASK_ULL(8, 0)
> > > > > +#define FME_PR_STS_PR_STS BIT(16) /* PR operation status */
> > > > > +#define FME_PR_STS_PR_STS_IDLE 0
> > > > > +#define FME_PR_STS_PR_CTRLR_STS GENMASK_ULL(22, 20) /* Controller status */
> > > > > +#define FME_PR_STS_PR_HOST_STS GENMASK_ULL(27, 24) /* PR host status */
> > > > > +
> > > > > +/* FME PR Data Register Bitfield */
> > > > > +/* PR data from the raw-binary file. */
> > > > > +#define FME_PR_DATA_PR_DATA_RAW GENMASK_ULL(32, 0)
> > > > > +
> > > > > +/* FME PR Error Register */
> > > > > +/* Previous PR Operation errors detected. */
> > > > > +#define FME_PR_ERR_OPERATION_ERR BIT(0)
> > > > > +/* CRC error detected. */
> > > > > +#define FME_PR_ERR_CRC_ERR BIT(1)
> > > > > +/* Incompatible PR bitstream detected. */
> > > > > +#define FME_PR_ERR_INCOMPATIBLE_BS BIT(2)
> > > > > +/* PR data push protocol violated. */
> > > > > +#define FME_PR_ERR_PROTOCOL_ERR BIT(3)
> > > > > +/* PR data fifo overflow error detected */
> > > > > +#define FME_PR_ERR_FIFO_OVERFLOW BIT(4)
> > > > >
> > > > > /* FME HSSI Sub Feature Register Set */
> > > > > #define FME_HSSI_DFH DFH
> > > >
> > > > I see fpga-dfl.h as enumeration code which is separate from any driver
> > > > implementation specifics other than what's required for the DFL
> > > > enumeration scheme. These PR engine #defines should move to a .h
> > > > that is dedicated to this specific PR hardware device. If someone else
> > > > adds a different PR device to the framework, their PR driver would
> > > > also have its own .h. Same for any other modules that aren't central
> > > > to DFL enumeration.
> > >
> > > Sure, will move PR related register to a separated header file.
> > > DFL enumeration related registers, will still be kept in fpga-dfl.h.
> > >
> > > Thanks
> > > Hao
> > >
> > > >
> > > > Thanks,
> > > > Alan
> > > >
> > > > > --
> > > > > 1.8.3.1
> > > > >

2018-02-06 01:59:13

by Wu Hao

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

On Mon, Feb 05, 2018 at 10:36:45AM -0800, Luebbers, Enno wrote:
> Hi Hao,
>
> On Sun, Feb 04, 2018 at 05:37:06PM +0800, Wu Hao wrote:
> > On Fri, Feb 02, 2018 at 04:26:26PM -0800, Luebbers, Enno wrote:
> > > Hi Hao, Alan,
> > >
> > > On Fri, Feb 02, 2018 at 05:42:13PM +0800, Wu Hao wrote:
> > > > On Thu, Feb 01, 2018 at 04:00:36PM -0600, Alan Tull wrote:
> > > > > On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:
> > > > >
> > > > > Hi Hao,
> > > > >
> > > > > A few comments below. Besides that, looks good.
> > > > >
> > > > > > This patch adds fpga manager driver for FPGA Management Engine (FME). It
> > > > > > implements fpga_manager_ops for FPGA Partial Reconfiguration function.
> > > > > >
> > > > > > Signed-off-by: Tim Whisonant <[email protected]>
> > > > > > Signed-off-by: Enno Luebbers <[email protected]>
> > > > > > Signed-off-by: Shiva Rao <[email protected]>
> > > > > > Signed-off-by: Christopher Rauer <[email protected]>
> > > > > > Signed-off-by: Kang Luwei <[email protected]>
> > > > > > Signed-off-by: Xiao Guangrong <[email protected]>
> > > > > > Signed-off-by: Wu Hao <[email protected]>
> > > > > > ----
> > > > > > v3: rename driver to dfl-fpga-fme-mgr
> > > > > > implemented status callback for fpga manager
> > > > > > rebased due to fpga api changes
> > > > > > ---
> > > > > > .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
> > > > > > drivers/fpga/Kconfig | 6 +
> > > > > > drivers/fpga/Makefile | 1 +
> > > > > > drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
> > > > > > drivers/fpga/fpga-dfl.h | 39 ++-
> > > > > > 5 files changed, 371 insertions(+), 1 deletion(-)
> > > > > > create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > > > > create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
> > > > > >
> > > > > > diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > > > > new file mode 100644
> > > > > > index 0000000..2d4f917
> > > > > > --- /dev/null
> > > > > > +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> > > > > > @@ -0,0 +1,8 @@
> > > > > > +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
> > > > > > +Date: November 2017
> > > > > > +KernelVersion: 4.15
> > > > > > +Contact: Wu Hao <[email protected]>
> > > > > > +Description: Read-only. It returns interface id of partial reconfiguration
> > > > > > + hardware. Userspace could use this information to check if
> > > > > > + current hardware is compatible with given image before FPGA
> > > > > > + programming.
> > > > >
> > > > > I'm a little confused by this. I can understand that the PR bitstream
> > > > > has a dependency on the FPGA's static image, but I don't understand
> > > > > the dependency of the bistream on the hardware that is used to program
> > > > > the bitstream to the FPGA.
> > > >
> > > > Sorry for the confusion, the interface_id is used to indicate the version of
> > > > the hardware for partial reconfiguration (it's part of the static image of
> > > > the FPGA device). Will improve the description on this.
> > > >
> > >
> > > The interface_id expresses the compatibility of the static region with PR
> > > bitstreams generated for it. It changes every time a new static region is
> > > generated.
> > >
> > > Would it make more sense to have the interface_id exposed as part of the FME
> > > device (which represents the static region)? I'm not sure - it kind of also
> > > makes sense here, where you would have all the information in one place (if the
> > > interface_id matches, I can use this component to program a bitstream).
> >
> > Hi Enno
> >
> > Yes, this interface is under fpga-dfl-fme-mgr.0, and fpga-dfl-fme-mgr.0 is
> > under fpga-dfl-fme.0. It's part of the FME device for sure. From another
> > point of view, it means if anyone wants to do PR on this Intel FPGA device,
> > he needs to find the FME device firstly, and then check if any fpga manager
> > created under this FME device, if yes, check the interface_id before PR via
> > the FME device node ioctl.
>
> That sounds good, thank you!
>
> >
> > >
> > > Sorry for my limited understanding of the infrastructure - would this same
> > > "fpga-dfl-fme-mgr.0" be used for PR if we had multiple PR regions? In that case
> > > it would need to expose multiple interface_ids (or we'd have to track both
> > > interface IDs and an identifier for the target PR region).
> >
> > Yes, the fpga manager could be shared with different PR regions.
> >
> > Sorry, I'm not sure where we need to expose multiple interface_ids and why.
>
> It's basically a question of how to determine bitstream compatibility - either,
> there's a separate interface_id per reconfigurable region, or there is a single
> interface_id for the entire device. Both make sense from a certain perspective.
>
> If there are multiple interface_ids per device (one per region), the driver
> would need to expose all of them. If there's only a single one, the driver only
> exposes that one ID - compatibility would be determined by looking at both that
> single interface_id _and_ the identifier/number of the targeted region.
>
> I would prefer a separate interface_id per region - it seems more generic and
> flexible.

It's possible to have per region interface_id (or even both per dev interface_id
and per region interface_id at the same time), but per FME PR sub feature
implementation, it supports multiple PR regions, but only provide one interface
id, so at least in this case, it's not per-region information per my
understanding. We can consider it later when hardware really supports it. : )

Thanks
Hao

>
> Thanks
> - Enno
>
> >
> > Thanks
> > Hao
> >
> > >
> > > > >
> > > > > > diff --git a/drivers/fpga/Kconfig b/drivers/fpga/Kconfig
> > > > > > index 57da904..0171ecb 100644
> > > > > > --- a/drivers/fpga/Kconfig
> > > > > > +++ b/drivers/fpga/Kconfig
> > > > > > @@ -150,6 +150,12 @@ config FPGA_DFL_FME
> > > > > > FPGA platform level management features. There shall be 1 FME
> > > > > > per DFL based FPGA device.
> > > > > >
> > > > > > +config FPGA_DFL_FME_MGR
> > > > > > + tristate "FPGA DFL FME Manager Driver"
> > > > > > + depends on FPGA_DFL_FME
> > > > > > + help
> > > > > > + Say Y to enable FPGA Manager driver for FPGA Management Engine.
> > > > > > +
> > > > > > config INTEL_FPGA_DFL_PCI
> > > > > > tristate "Intel FPGA DFL PCIe Device Driver"
> > > > > > depends on PCI && FPGA_DFL
> > > > > > diff --git a/drivers/fpga/Makefile b/drivers/fpga/Makefile
> > > > > > index cc75bb3..6378580 100644
> > > > > > --- a/drivers/fpga/Makefile
> > > > > > +++ b/drivers/fpga/Makefile
> > > > > > @@ -31,6 +31,7 @@ obj-$(CONFIG_OF_FPGA_REGION) += of-fpga-region.o
> > > > > > # FPGA Device Feature List Support
> > > > > > obj-$(CONFIG_FPGA_DFL) += fpga-dfl.o
> > > > > > obj-$(CONFIG_FPGA_DFL_FME) += fpga-dfl-fme.o
> > > > > > +obj-$(CONFIG_FPGA_DFL_FME_MGR) += fpga-dfl-fme-mgr.o
> > > > > >
> > > > > > fpga-dfl-fme-objs := dfl-fme-main.o dfl-fme-pr.o
> > > > > >
> > > > > > diff --git a/drivers/fpga/fpga-dfl-fme-mgr.c b/drivers/fpga/fpga-dfl-fme-mgr.c
> > > > > > new file mode 100644
> > > > > > index 0000000..70356ce
> > > > > > --- /dev/null
> > > > > > +++ b/drivers/fpga/fpga-dfl-fme-mgr.c
> > > > > > @@ -0,0 +1,318 @@
> > > > > > +/*
> > > > > > + * FPGA Manager Driver for FPGA Management Engine (FME)
> > > > > > + *
> > > > > > + * Copyright (C) 2017 Intel Corporation, Inc.
> > > > > > + *
> > > > > > + * Authors:
> > > > > > + * Kang Luwei <[email protected]>
> > > > > > + * Xiao Guangrong <[email protected]>
> > > > > > + * Wu Hao <[email protected]>
> > > > > > + * Joseph Grecco <[email protected]>
> > > > > > + * Enno Luebbers <[email protected]>
> > > > > > + * Tim Whisonant <[email protected]>
> > > > > > + * Ananda Ravuri <[email protected]>
> > > > > > + * Christopher Rauer <[email protected]>
> > > > > > + * Henry Mitchel <[email protected]>
> > > > > > + *
> > > > > > + * This work is licensed under the terms of the GNU GPL version 2.
> > > > > > + * SPDX-License-Identifier: GPL-2.0
> > > > > > + */
> > > > > > +
> > > > > > +#include <linux/module.h>
> > > > > > +#include <linux/iopoll.h>
> > > > > > +#include <linux/fpga/fpga-mgr.h>
> > > > > > +
> > > > > > +#include "fpga-dfl.h"
> > > > > > +#include "dfl-fme.h"
> > > > > > +
> > > > > > +#define PR_WAIT_TIMEOUT 8000000
> > > > > > +#define PR_HOST_STATUS_IDLE 0
> > > > > > +
> > > > > > +struct fme_mgr_priv {
> > > > > > + void __iomem *ioaddr;
> > > > > > + u64 pr_error;
> > > > > > +};
> > > > > > +
> > > > > > +static ssize_t interface_id_show(struct device *dev,
> > > > > > + struct device_attribute *attr, char *buf)
> > > > > > +{
> > > > > > + struct fpga_manager *mgr = dev_get_drvdata(dev);
> > > > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > > > + u64 intfc_id_l, intfc_id_h;
> > > > > > +
> > > > > > + intfc_id_l = readq(priv->ioaddr + FME_PR_INTFC_ID_L);
> > > > > > + intfc_id_h = readq(priv->ioaddr + FME_PR_INTFC_ID_H);
> > > > > > +
> > > > > > + return scnprintf(buf, PAGE_SIZE, "%016llx%016llx\n",
> > > > > > + (unsigned long long)intfc_id_h,
> > > > > > + (unsigned long long)intfc_id_l);
> > > > > > +}
> > > > > > +static DEVICE_ATTR_RO(interface_id);
> > > > > > +
> > > > > > +static const struct attribute *fme_mgr_attrs[] = {
> > > > > > + &dev_attr_interface_id.attr,
> > > > > > + NULL,
> > > > > > +};
> > > > > > +
> > > > > > +static u64 pr_error_to_mgr_status(u64 err)
> > > > > > +{
> > > > > > + u64 status = 0;
> > > > > > +
> > > > > > + if (err & FME_PR_ERR_OPERATION_ERR)
> > > > > > + status |= FPGA_MGR_STATUS_OPERATION_ERR;
> > > > > > + if (err & FME_PR_ERR_CRC_ERR)
> > > > > > + status |= FPGA_MGR_STATUS_CRC_ERR;
> > > > > > + if (err & FME_PR_ERR_INCOMPATIBLE_BS)
> > > > > > + status |= FPGA_MGR_STATUS_INCOMPATIBLE_IMAGE_ERR;
> > > > > > + if (err & FME_PR_ERR_PROTOCOL_ERR)
> > > > > > + status |= FPGA_MGR_STATUS_IP_PROTOCOL_ERR;
> > > > > > + if (err & FME_PR_ERR_FIFO_OVERFLOW)
> > > > > > + status |= FPGA_MGR_STATUS_FIFO_OVERFLOW_ERR;
> > > > > > +
> > > > > > + return status;
> > > > > > +}
> > > > > > +
> > > > > > +static u64 fme_mgr_pr_error_handle(void __iomem *fme_pr)
> > > > > > +{
> > > > > > + u64 pr_status, pr_error;
> > > > > > +
> > > > > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > > > > + if (!(pr_status & FME_PR_STS_PR_STS))
> > > > > > + return 0;
> > > > > > +
> > > > > > + pr_error = readq(fme_pr + FME_PR_ERR);
> > > > > > + writeq(pr_error, fme_pr + FME_PR_ERR);
> > > > > > +
> > > > > > + return pr_error;
> > > > > > +}
> > > > > > +
> > > > > > +static int fme_mgr_write_init(struct fpga_manager *mgr,
> > > > > > + struct fpga_image_info *info,
> > > > > > + const char *buf, size_t count)
> > > > > > +{
> > > > > > + struct device *dev = &mgr->dev;
> > > > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > > > + void __iomem *fme_pr = priv->ioaddr;
> > > > > > + u64 pr_ctrl, pr_status;
> > > > > > +
> > > > > > + if (!(info->flags & FPGA_MGR_PARTIAL_RECONFIG)) {
> > > > > > + dev_err(dev, "only support partial reconfiguration.\n");
> > > > >
> > > > > supports
> > > > >
> > > > > > + return -EINVAL;
> > > > > > + }
> > > > > > +
> > > > > > + dev_dbg(dev, "resetting PR before initiated PR\n");
> > > > > > +
> > > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > > > + pr_ctrl |= FME_PR_CTRL_PR_RST;
> > > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > > > +
> > > > > > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> > > > > > + pr_ctrl & FME_PR_CTRL_PR_RSTACK, 1,
> > > > > > + PR_WAIT_TIMEOUT)) {
> > > > > > + dev_err(dev, "maximum PR timeout\n");
> > > > >
> > > > > We have two dev_err's with the same "maximum PR timeout" so the user
> > > > > would not be able to see at which point the timeout occurred.
> > > > >
> > > > > I suggest to get rid of the word 'maximum' for both. This one could
> > > > > be 'reset ack timeout" or something similar.
> > > >
> > > > Sure, will switch to a more specific message per your suggestion. Thanks.
> > > >
> > > > >
> > > > > > + return -ETIMEDOUT;
> > > > > > + }
> > > > > > +
> > > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > > > + pr_ctrl &= ~FME_PR_CTRL_PR_RST;
> > > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > > > +
> > > > > > + dev_dbg(dev,
> > > > > > + "waiting for PR resource in HW to be initialized and ready\n");
> > > > > > +
> > > > > > + if (readq_poll_timeout(fme_pr + FME_PR_STS, pr_status,
> > > > > > + (pr_status & FME_PR_STS_PR_STS) ==
> > > > > > + FME_PR_STS_PR_STS_IDLE, 1, PR_WAIT_TIMEOUT)) {
> > > > > > + dev_err(dev, "maximum PR timeout\n");
> > > > >
> > > > > "PR_STS timeout"? Or something better.
> > > > >
> > > > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > > > > + return -ETIMEDOUT;
> > > > > > + }
> > > > > > +
> > > > > > + dev_dbg(dev, "check and clear previous PR error\n");
> > > > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > > > > + if (priv->pr_error)
> > > > > > + dev_dbg(dev, "previous PR error detected %llx\n",
> > > > > > + (unsigned long long)priv->pr_error);
> > > > > > +
> > > > > > + dev_dbg(dev, "set PR port ID\n");
> > > > > > +
> > > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > > > + pr_ctrl &= ~FME_PR_CTRL_PR_RGN_ID;
> > > > > > + pr_ctrl |= FIELD_PREP(FME_PR_CTRL_PR_RGN_ID, info->region_id);
> > > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > > > +
> > > > > > + return 0;
> > > > > > +}
> > > > > > +
> > > > > > +static int fme_mgr_write(struct fpga_manager *mgr,
> > > > > > + const char *buf, size_t count)
> > > > > > +{
> > > > > > + struct device *dev = &mgr->dev;
> > > > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > > > + void __iomem *fme_pr = priv->ioaddr;
> > > > > > + u64 pr_ctrl, pr_status, pr_data;
> > > > > > + int delay = 0, pr_credit, i = 0;
> > > > > > +
> > > > > > + dev_dbg(dev, "start request\n");
> > > > > > +
> > > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > > > + pr_ctrl |= FME_PR_CTRL_PR_START;
> > > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > > > +
> > > > > > + dev_dbg(dev, "pushing data from bitstream to HW\n");
> > > > > > +
> > > > > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > > > > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> > > > > > +
> > > > > > + while (count > 0) {
> > > > > > + while (pr_credit <= 1) {
> > > > > > + if (delay++ > PR_WAIT_TIMEOUT) {
> > > > > > + dev_err(dev, "maximum try\n");
> > > > >
> > > > > It looks like this is waiting for an entry in a queue and timing out
> > > > > here. Could you add a some comments for pr_credit above and this
> > > > > loop? Also a better error message, perhaps "PR credit timeout"?
> > > >
> > > > Driver needs to read the PR credit to know if it could push PR data
> > > > to hardware or not. I will add more description here on this PR credit,
> > > > and use "PR credit timeout" as error message.
> > > >
> > > > >
> > > > > > + return -ETIMEDOUT;
> > > > > > + }
> > > > > > + udelay(1);
> > > > > > +
> > > > > > + pr_status = readq(fme_pr + FME_PR_STS);
> > > > > > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> > > > > > + }
> > > > > > +
> > > > > > + if (count >= 4) {
> > > > > > + pr_data = 0;
> > > > > > + pr_data |= FIELD_PREP(FME_PR_DATA_PR_DATA_RAW,
> > > > > > + *(((u32 *)buf) + i));
> > > > > > + writeq(pr_data, fme_pr + FME_PR_DATA);
> > > > > > + count -= 4;
> > > > > > + pr_credit--;
> > > > > > + i++;
> > > > > > + } else {
> > > > > > + WARN_ON(1);
> > > > > > + return -EINVAL;
> > > > > > + }
> > > > > > + }
> > > > > > +
> > > > > > + return 0;
> > > > > > +}
> > > > > > +
> > > > > > +static int fme_mgr_write_complete(struct fpga_manager *mgr,
> > > > > > + struct fpga_image_info *info)
> > > > > > +{
> > > > > > + struct device *dev = &mgr->dev;
> > > > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > > > + void __iomem *fme_pr = priv->ioaddr;
> > > > > > + u64 pr_ctrl;
> > > > > > +
> > > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> > > > > > + pr_ctrl |= FME_PR_CTRL_PR_COMPLETE;
> > > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> > > > > > +
> > > > > > + dev_dbg(dev, "green bitstream push complete\n");
> > > > > > + dev_dbg(dev, "waiting for HW to release PR resource\n");
> > > > > > +
> > > > > > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> > > > > > + !(pr_ctrl & FME_PR_CTRL_PR_START), 1,
> > > > > > + PR_WAIT_TIMEOUT)) {
> > > > > > + dev_err(dev, "maximum try.\n");
> > > > >
> > > > > Some message specific to here also, please.
> > > > >
> > > > > > + return -ETIMEDOUT;
> > > > > > + }
> > > > > > +
> > > > > > + dev_dbg(dev, "PR operation complete, checking status\n");
> > > > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> > > > > > + if (priv->pr_error) {
> > > > > > + dev_dbg(dev, "PR error detected %llx\n",
> > > > > > + (unsigned long long)priv->pr_error);
> > > > > > + return -EIO;
> > > > > > + }
> > > > > > +
> > > > > > + dev_dbg(dev, "PR done successfully\n");
> > > > > > +
> > > > > > + return 0;
> > > > > > +}
> > > > > > +
> > > > > > +static enum fpga_mgr_states fme_mgr_state(struct fpga_manager *mgr)
> > > > > > +{
> > > > > > + return FPGA_MGR_STATE_UNKNOWN;
> > > > > > +}
> > > > > > +
> > > > > > +static u64 fme_mgr_status(struct fpga_manager *mgr)
> > > > > > +{
> > > > > > + struct fme_mgr_priv *priv = mgr->priv;
> > > > > > +
> > > > > > + return pr_error_to_mgr_status(priv->pr_error);
> > > > > > +}
> > > > > > +
> > > > > > +static const struct fpga_manager_ops fme_mgr_ops = {
> > > > > > + .write_init = fme_mgr_write_init,
> > > > > > + .write = fme_mgr_write,
> > > > > > + .write_complete = fme_mgr_write_complete,
> > > > > > + .state = fme_mgr_state,
> > > > > > + .status = fme_mgr_status,
> > > > > > +};
> > > > > > +
> > > > > > +static int fme_mgr_probe(struct platform_device *pdev)
> > > > > > +{
> > > > > > + struct device *dev = &pdev->dev;
> > > > > > + struct fme_mgr_priv *priv;
> > > > > > + struct fpga_manager *mgr;
> > > > > > + struct resource *res;
> > > > > > + int ret;
> > > > > > +
> > > > > > + priv = devm_kzalloc(dev, sizeof(*priv), GFP_KERNEL);
> > > > > > + if (!priv)
> > > > > > + return -ENOMEM;
> > > > > > +
> > > > > > + res = platform_get_resource(pdev, IORESOURCE_MEM, 0);
> > > > > > + priv->ioaddr = devm_ioremap(dev, res->start, resource_size(res));
> > > > > > + if (IS_ERR(priv->ioaddr))
> > > > > > + return PTR_ERR(priv->ioaddr);
> > > > > > +
> > > > > > + ret = sysfs_create_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > > > > + if (ret)
> > > > > > + return ret;
> > > > > > +
> > > > > > + mgr = devm_kzalloc(dev, sizeof(*mgr), GFP_KERNEL);
> > > > > > + if (!mgr)
> > > > > > + goto sysfs_remove_exit;
> > > > > > +
> > > > > > + mgr->name = "DFL FPGA Manager";
> > > > > > + mgr->mops = &fme_mgr_ops;
> > > > > > + mgr->priv = priv;
> > > > > > + mgr->parent = dev;
> > > > > > + platform_set_drvdata(pdev, mgr);
> > > > > > +
> > > > > > + ret = fpga_mgr_register(mgr);
> > > > > > + if (ret) {
> > > > > > + dev_err(dev, "unable to register FPGA manager\n");
> > > > > > + goto sysfs_remove_exit;
> > > > > > + }
> > > > > > +
> > > > > > + return 0;
> > > > > > +
> > > > > > +sysfs_remove_exit:
> > > > > > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > > > > + return ret;
> > > > > > +}
> > > > > > +
> > > > > > +static int fme_mgr_remove(struct platform_device *pdev)
> > > > > > +{
> > > > > > + struct fpga_manager *mgr = platform_get_drvdata(pdev);
> > > > > > +
> > > > > > + fpga_mgr_unregister(mgr);
> > > > > > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> > > > > > +
> > > > > > + return 0;
> > > > > > +}
> > > > > > +
> > > > > > +static struct platform_driver fme_mgr_driver = {
> > > > > > + .driver = {
> > > > > > + .name = FPGA_DFL_FME_MGR,
> > > > > > + },
> > > > > > + .probe = fme_mgr_probe,
> > > > > > + .remove = fme_mgr_remove,
> > > > > > +};
> > > > > > +
> > > > > > +module_platform_driver(fme_mgr_driver);
> > > > > > +
> > > > > > +MODULE_DESCRIPTION("FPGA Manager for FPGA Management Engine");
> > > > > > +MODULE_AUTHOR("Intel Corporation");
> > > > > > +MODULE_LICENSE("GPL v2");
> > > > > > +MODULE_ALIAS("platform:fpga-dfl-fme-mgr");
> > > > > > diff --git a/drivers/fpga/fpga-dfl.h b/drivers/fpga/fpga-dfl.h
> > > > > > index e5a1094..d45eb82 100644
> > > > > > --- a/drivers/fpga/fpga-dfl.h
> > > > > > +++ b/drivers/fpga/fpga-dfl.h
> > > > > > @@ -130,7 +130,44 @@
> > > > > >
> > > > > > /* FME Partial Reconfiguration Sub Feature Register Set */
> > > > > > #define FME_PR_DFH DFH
> > > > > > -#define FME_PR_SIZE DFH_SIZE
> > > > > > +#define FME_PR_CTRL 0x8
> > > > > > +#define FME_PR_STS 0x10
> > > > > > +#define FME_PR_DATA 0x18
> > > > > > +#define FME_PR_ERR 0x20
> > > > > > +#define FME_PR_INTFC_ID_H 0xA8
> > > > > > +#define FME_PR_INTFC_ID_L 0xB0
> > > > > > +#define FME_PR_SIZE 0xB8
> > > > > > +
> > > > > > +/* FME PR Control Register Bitfield */
> > > > > > +#define FME_PR_CTRL_PR_RST BIT(0) /* Reset PR engine */
> > > > > > +#define FME_PR_CTRL_PR_RSTACK BIT(4) /* Ack for PR engine reset */
> > > > > > +#define FME_PR_CTRL_PR_RGN_ID GENMASK_ULL(9, 7) /* PR Region ID */
> > > > > > +#define FME_PR_CTRL_PR_START BIT(12) /* Start to request for PR service */
> > > > > > +#define FME_PR_CTRL_PR_COMPLETE BIT(13) /* PR data push complete notification */
> > > > > > +
> > > > > > +/* FME PR Status Register Bitfield */
> > > > > > +/* Number of available entries in HW queue inside the PR engine. */
> > > > > > +#define FME_PR_STS_PR_CREDIT GENMASK_ULL(8, 0)
> > > > > > +#define FME_PR_STS_PR_STS BIT(16) /* PR operation status */
> > > > > > +#define FME_PR_STS_PR_STS_IDLE 0
> > > > > > +#define FME_PR_STS_PR_CTRLR_STS GENMASK_ULL(22, 20) /* Controller status */
> > > > > > +#define FME_PR_STS_PR_HOST_STS GENMASK_ULL(27, 24) /* PR host status */
> > > > > > +
> > > > > > +/* FME PR Data Register Bitfield */
> > > > > > +/* PR data from the raw-binary file. */
> > > > > > +#define FME_PR_DATA_PR_DATA_RAW GENMASK_ULL(32, 0)
> > > > > > +
> > > > > > +/* FME PR Error Register */
> > > > > > +/* Previous PR Operation errors detected. */
> > > > > > +#define FME_PR_ERR_OPERATION_ERR BIT(0)
> > > > > > +/* CRC error detected. */
> > > > > > +#define FME_PR_ERR_CRC_ERR BIT(1)
> > > > > > +/* Incompatible PR bitstream detected. */
> > > > > > +#define FME_PR_ERR_INCOMPATIBLE_BS BIT(2)
> > > > > > +/* PR data push protocol violated. */
> > > > > > +#define FME_PR_ERR_PROTOCOL_ERR BIT(3)
> > > > > > +/* PR data fifo overflow error detected */
> > > > > > +#define FME_PR_ERR_FIFO_OVERFLOW BIT(4)
> > > > > >
> > > > > > /* FME HSSI Sub Feature Register Set */
> > > > > > #define FME_HSSI_DFH DFH
> > > > >
> > > > > I see fpga-dfl.h as enumeration code which is separate from any driver
> > > > > implementation specifics other than what's required for the DFL
> > > > > enumeration scheme. These PR engine #defines should move to a .h
> > > > > that is dedicated to this specific PR hardware device. If someone else
> > > > > adds a different PR device to the framework, their PR driver would
> > > > > also have its own .h. Same for any other modules that aren't central
> > > > > to DFL enumeration.
> > > >
> > > > Sure, will move PR related register to a separated header file.
> > > > DFL enumeration related registers, will still be kept in fpga-dfl.h.
> > > >
> > > > Thanks
> > > > Hao
> > > >
> > > > >
> > > > > Thanks,
> > > > > Alan
> > > > >
> > > > > > --
> > > > > > 1.8.3.1
> > > > > >

2018-02-06 02:30:06

by Wu Hao

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

On Mon, Feb 05, 2018 at 11:21:52AM -0600, Alan Tull wrote:
> On Sun, Feb 4, 2018 at 4:05 AM, Wu Hao <[email protected]> wrote:
> > On Sat, Feb 03, 2018 at 11:41:24AM +0100, Moritz Fischer wrote:
> >> Hi Hao,
> >>
> >> On Fri, Feb 02, 2018 at 04:26:26PM -0800, Luebbers, Enno wrote:
> >> > Hi Hao, Alan,
> >> >
> >> > On Fri, Feb 02, 2018 at 05:42:13PM +0800, Wu Hao wrote:
> >> > > On Thu, Feb 01, 2018 at 04:00:36PM -0600, Alan Tull wrote:
> >> > > > On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:
> >> > > >
> >> > > > Hi Hao,
> >> > > >
> >> > > > A few comments below. Besides that, looks good.
> >> > > >
> >> > > > > This patch adds fpga manager driver for FPGA Management Engine (FME). It
> >> > > > > implements fpga_manager_ops for FPGA Partial Reconfiguration function.
> >> > > > >
> >> > > > > Signed-off-by: Tim Whisonant <[email protected]>
> >> > > > > Signed-off-by: Enno Luebbers <[email protected]>
> >> > > > > Signed-off-by: Shiva Rao <[email protected]>
> >> > > > > Signed-off-by: Christopher Rauer <[email protected]>
> >> > > > > Signed-off-by: Kang Luwei <[email protected]>
> >> > > > > Signed-off-by: Xiao Guangrong <[email protected]>
> >> > > > > Signed-off-by: Wu Hao <[email protected]>
> >> > > > > ----
> >> > > > > v3: rename driver to dfl-fpga-fme-mgr
> >> > > > > implemented status callback for fpga manager
> >> > > > > rebased due to fpga api changes
> >> > > > > ---
> >> > > > > .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
> >> > > > > drivers/fpga/Kconfig | 6 +
> >> > > > > drivers/fpga/Makefile | 1 +
> >> > > > > drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
> >> > > > > drivers/fpga/fpga-dfl.h | 39 ++-
> >> > > > > 5 files changed, 371 insertions(+), 1 deletion(-)
> >> > > > > create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> >> > > > > create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
> >> > > > >
> >> > > > > diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> >> > > > > new file mode 100644
> >> > > > > index 0000000..2d4f917
> >> > > > > --- /dev/null
> >> > > > > +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> >> > > > > @@ -0,0 +1,8 @@
> >> > > > > +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
> >> > > > > +Date: November 2017
> >> > > > > +KernelVersion: 4.15
> >> > > > > +Contact: Wu Hao <[email protected]>
> >> > > > > +Description: Read-only. It returns interface id of partial reconfiguration
> >> > > > > + hardware. Userspace could use this information to check if
> >> > > > > + current hardware is compatible with given image before FPGA
> >> > > > > + programming.
> >> > > >
> >> > > > I'm a little confused by this. I can understand that the PR bitstream
> >> > > > has a dependency on the FPGA's static image, but I don't understand
> >> > > > the dependency of the bistream on the hardware that is used to program
> >> > > > the bitstream to the FPGA.
> >> > >
> >> > > Sorry for the confusion, the interface_id is used to indicate the version of
> >> > > the hardware for partial reconfiguration (it's part of the static image of
> >> > > the FPGA device). Will improve the description on this.
> >>
> >> I'm not sure userland should be making the call on whether what you're
> >> trying to load is compatible or not.
>
> Could you explain more about what your concern was about this (unless
> Hao has covered it below)?
>
> It makes sense to me in this use case at least since userspace has a
> pile of images and is choosing which one to load.
>
> >> Isn't there a way to check this in
> >> your PR reconfiguration handler in-kernel?
> >
> > Hi Moritz
> >
> > Actually with current driver interface, doing a partial reconfiguration with an
> > incompatible image, then driver will report PR failure with error code
> > FPGA_MGR_STATUS_INCOMPATIBLE_IMAGE_ERR as hardware checks it, but anyway user
> > needs to know hardware interface_id information to find or re-generated correct
> > images. I think it's more flexible to leave it to userspace on using this
> > information exposed by driver. : )
> >
> > Thanks
> > Hao
> >
> >>
> >> > >
> >> >
> >> > The interface_id expresses the compatibility of the static region with PR
> >> > bitstreams generated for it. It changes every time a new static region is
> >> > generated.
>
> In the near future the DFL framework will be used with SoC's that have
> a hard FPGA PR manager (that's not part of the static region). The
> hard FPGA manager driver won't know anything about the static region.
>
> >> >
> >> > Would it make more sense to have the interface_id exposed as part of the FME
> >> > device (which represents the static region)? I'm not sure - it kind of also
> >> > makes sense here, where you would have all the information in one place (if the
> >> > interface_id matches, I can use this component to program a bitstream).
>
> According to the intel-fpga.txt document, the identifier for the
> static image is at
>
> /sys/class/fpga_region/regionX/fpga-dfl-fme.n/bitstream_id

Hi Alan

This bitstream_id refects the full static region version. As you know, PR is
only a sub feature of the FME functional unit, it's possible that we have
different static region (different bitstream_id) but it has the exact same
PR sub feature under the FME, only changes on other sub features or function
units.

The interface_id is used to indicate the PR hardware version and express the
compatibility of the PR hardware with PR image generated for it. It updates
whenever PR hardware has been changed.

So we should use interface_id which is from the PR hardware Register for PR
compatibility check, not the bitstream_id of the full static region.

>
> >> >
> >> > Sorry for my limited understanding of the infrastructure - would this same
> >> > "fpga-dfl-fme-mgr.0" be used for PR if we had multiple PR regions? In that case
> >> > it would need to expose multiple interface_ids (or we'd have to track both
> >> > interface IDs and an identifier for the target PR region).
>
> interface_id sounds like it's trying to give some per-PR region
> information. That could support the real case where a FPGA static
> region has multiple PR regions and userspace has a bunch of images.
> The images are keyed to certain regions. The reason the images are
> region-specific could be that the regions have different connections
> (possible but I think that's unlikely. but possible) or because the
> FPGA doesn't support relocatable PR images (true of most FPGAs that
> support PR, but not a problem with Stratix10).
>
> Each interface_ids could be exposed per fpga-region (each fpga-region
> maps to a PR region). That wouldn't be hard to implement, driver-wise
> I think.

I think that wouldn't be hard to implement, but actually in our case, FME PR
sub feature supports PR for multiple regions, only provide one interface id
register, so it's not per region interface_id for sure. But anyway, we could
add it whenever needed. :)

Thanks
Hao

>
> >> >
> >> > > >
> >> > > > > diff --git a/drivers/fpga/Kconfig b/drivers/fpga/Kconfig
> >> > > > > index 57da904..0171ecb 100644
> >> > > > > --- a/drivers/fpga/Kconfig
> >> > > > > +++ b/drivers/fpga/Kconfig
> >> > > > > @@ -150,6 +150,12 @@ config FPGA_DFL_FME
> >> > > > > FPGA platform level management features. There shall be 1 FME
> >> > > > > per DFL based FPGA device.
> >> > > > >
> >> > > > > +config FPGA_DFL_FME_MGR
> >> > > > > + tristate "FPGA DFL FME Manager Driver"
> >> > > > > + depends on FPGA_DFL_FME
> >> > > > > + help
> >> > > > > + Say Y to enable FPGA Manager driver for FPGA Management Engine.
> >> > > > > +
> >> > > > > config INTEL_FPGA_DFL_PCI
> >> > > > > tristate "Intel FPGA DFL PCIe Device Driver"
> >> > > > > depends on PCI && FPGA_DFL
> >> > > > > diff --git a/drivers/fpga/Makefile b/drivers/fpga/Makefile
> >> > > > > index cc75bb3..6378580 100644
> >> > > > > --- a/drivers/fpga/Makefile
> >> > > > > +++ b/drivers/fpga/Makefile
> >> > > > > @@ -31,6 +31,7 @@ obj-$(CONFIG_OF_FPGA_REGION) += of-fpga-region.o
> >> > > > > # FPGA Device Feature List Support
> >> > > > > obj-$(CONFIG_FPGA_DFL) += fpga-dfl.o
> >> > > > > obj-$(CONFIG_FPGA_DFL_FME) += fpga-dfl-fme.o
> >> > > > > +obj-$(CONFIG_FPGA_DFL_FME_MGR) += fpga-dfl-fme-mgr.o
> >> > > > >
> >> > > > > fpga-dfl-fme-objs := dfl-fme-main.o dfl-fme-pr.o
> >> > > > >
> >> > > > > diff --git a/drivers/fpga/fpga-dfl-fme-mgr.c b/drivers/fpga/fpga-dfl-fme-mgr.c
> >> > > > > new file mode 100644
> >> > > > > index 0000000..70356ce
> >> > > > > --- /dev/null
> >> > > > > +++ b/drivers/fpga/fpga-dfl-fme-mgr.c
> >> > > > > @@ -0,0 +1,318 @@
> >> > > > > +/*
> >> > > > > + * FPGA Manager Driver for FPGA Management Engine (FME)
> >> > > > > + *
> >> > > > > + * Copyright (C) 2017 Intel Corporation, Inc.
> >> > > > > + *
> >> > > > > + * Authors:
> >> > > > > + * Kang Luwei <[email protected]>
> >> > > > > + * Xiao Guangrong <[email protected]>
> >> > > > > + * Wu Hao <[email protected]>
> >> > > > > + * Joseph Grecco <[email protected]>
> >> > > > > + * Enno Luebbers <[email protected]>
> >> > > > > + * Tim Whisonant <[email protected]>
> >> > > > > + * Ananda Ravuri <[email protected]>
> >> > > > > + * Christopher Rauer <[email protected]>
> >> > > > > + * Henry Mitchel <[email protected]>
> >> > > > > + *
> >> > > > > + * This work is licensed under the terms of the GNU GPL version 2.
> >> > > > > + * SPDX-License-Identifier: GPL-2.0
> >> > > > > + */
> >> > > > > +
> >> > > > > +#include <linux/module.h>
> >> > > > > +#include <linux/iopoll.h>
> >> > > > > +#include <linux/fpga/fpga-mgr.h>
> >> > > > > +
> >> > > > > +#include "fpga-dfl.h"
> >> > > > > +#include "dfl-fme.h"
> >> > > > > +
> >> > > > > +#define PR_WAIT_TIMEOUT 8000000
> >> > > > > +#define PR_HOST_STATUS_IDLE 0
> >> > > > > +
> >> > > > > +struct fme_mgr_priv {
> >> > > > > + void __iomem *ioaddr;
> >> > > > > + u64 pr_error;
> >> > > > > +};
> >> > > > > +
> >> > > > > +static ssize_t interface_id_show(struct device *dev,
> >> > > > > + struct device_attribute *attr, char *buf)
> >> > > > > +{
> >> > > > > + struct fpga_manager *mgr = dev_get_drvdata(dev);
> >> > > > > + struct fme_mgr_priv *priv = mgr->priv;
> >> > > > > + u64 intfc_id_l, intfc_id_h;
> >> > > > > +
> >> > > > > + intfc_id_l = readq(priv->ioaddr + FME_PR_INTFC_ID_L);
> >> > > > > + intfc_id_h = readq(priv->ioaddr + FME_PR_INTFC_ID_H);
> >> > > > > +
> >> > > > > + return scnprintf(buf, PAGE_SIZE, "%016llx%016llx\n",
> >> > > > > + (unsigned long long)intfc_id_h,
> >> > > > > + (unsigned long long)intfc_id_l);
> >> > > > > +}
> >> > > > > +static DEVICE_ATTR_RO(interface_id);
> >> > > > > +
> >> > > > > +static const struct attribute *fme_mgr_attrs[] = {
> >> > > > > + &dev_attr_interface_id.attr,
> >> > > > > + NULL,
> >> > > > > +};
> >> > > > > +
> >> > > > > +static u64 pr_error_to_mgr_status(u64 err)
> >> > > > > +{
> >> > > > > + u64 status = 0;
> >> > > > > +
> >> > > > > + if (err & FME_PR_ERR_OPERATION_ERR)
> >> > > > > + status |= FPGA_MGR_STATUS_OPERATION_ERR;
> >> > > > > + if (err & FME_PR_ERR_CRC_ERR)
> >> > > > > + status |= FPGA_MGR_STATUS_CRC_ERR;
> >> > > > > + if (err & FME_PR_ERR_INCOMPATIBLE_BS)
> >> > > > > + status |= FPGA_MGR_STATUS_INCOMPATIBLE_IMAGE_ERR;
> >> > > > > + if (err & FME_PR_ERR_PROTOCOL_ERR)
> >> > > > > + status |= FPGA_MGR_STATUS_IP_PROTOCOL_ERR;
> >> > > > > + if (err & FME_PR_ERR_FIFO_OVERFLOW)
> >> > > > > + status |= FPGA_MGR_STATUS_FIFO_OVERFLOW_ERR;
> >> > > > > +
> >> > > > > + return status;
> >> > > > > +}
> >> > > > > +
> >> > > > > +static u64 fme_mgr_pr_error_handle(void __iomem *fme_pr)
> >> > > > > +{
> >> > > > > + u64 pr_status, pr_error;
> >> > > > > +
> >> > > > > + pr_status = readq(fme_pr + FME_PR_STS);
> >> > > > > + if (!(pr_status & FME_PR_STS_PR_STS))
> >> > > > > + return 0;
> >> > > > > +
> >> > > > > + pr_error = readq(fme_pr + FME_PR_ERR);
> >> > > > > + writeq(pr_error, fme_pr + FME_PR_ERR);
> >> > > > > +
> >> > > > > + return pr_error;
> >> > > > > +}
> >> > > > > +
> >> > > > > +static int fme_mgr_write_init(struct fpga_manager *mgr,
> >> > > > > + struct fpga_image_info *info,
> >> > > > > + const char *buf, size_t count)
> >> > > > > +{
> >> > > > > + struct device *dev = &mgr->dev;
> >> > > > > + struct fme_mgr_priv *priv = mgr->priv;
> >> > > > > + void __iomem *fme_pr = priv->ioaddr;
> >> > > > > + u64 pr_ctrl, pr_status;
> >> > > > > +
> >> > > > > + if (!(info->flags & FPGA_MGR_PARTIAL_RECONFIG)) {
> >> > > > > + dev_err(dev, "only support partial reconfiguration.\n");
> >> > > >
> >> > > > supports
> >> > > >
> >> > > > > + return -EINVAL;
> >> > > > > + }
> >> > > > > +
> >> > > > > + dev_dbg(dev, "resetting PR before initiated PR\n");
> >> > > > > +
> >> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> >> > > > > + pr_ctrl |= FME_PR_CTRL_PR_RST;
> >> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> >> > > > > +
> >> > > > > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> >> > > > > + pr_ctrl & FME_PR_CTRL_PR_RSTACK, 1,
> >> > > > > + PR_WAIT_TIMEOUT)) {
> >> > > > > + dev_err(dev, "maximum PR timeout\n");
> >> > > >
> >> > > > We have two dev_err's with the same "maximum PR timeout" so the user
> >> > > > would not be able to see at which point the timeout occurred.
> >> > > >
> >> > > > I suggest to get rid of the word 'maximum' for both. This one could
> >> > > > be 'reset ack timeout" or something similar.
> >> > >
> >> > > Sure, will switch to a more specific message per your suggestion. Thanks.
> >> > >
> >> > > >
> >> > > > > + return -ETIMEDOUT;
> >> > > > > + }
> >> > > > > +
> >> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> >> > > > > + pr_ctrl &= ~FME_PR_CTRL_PR_RST;
> >> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> >> > > > > +
> >> > > > > + dev_dbg(dev,
> >> > > > > + "waiting for PR resource in HW to be initialized and ready\n");
> >> > > > > +
> >> > > > > + if (readq_poll_timeout(fme_pr + FME_PR_STS, pr_status,
> >> > > > > + (pr_status & FME_PR_STS_PR_STS) ==
> >> > > > > + FME_PR_STS_PR_STS_IDLE, 1, PR_WAIT_TIMEOUT)) {
> >> > > > > + dev_err(dev, "maximum PR timeout\n");
> >> > > >
> >> > > > "PR_STS timeout"? Or something better.
> >> > > >
> >> > > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> >> > > > > + return -ETIMEDOUT;
> >> > > > > + }
> >> > > > > +
> >> > > > > + dev_dbg(dev, "check and clear previous PR error\n");
> >> > > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> >> > > > > + if (priv->pr_error)
> >> > > > > + dev_dbg(dev, "previous PR error detected %llx\n",
> >> > > > > + (unsigned long long)priv->pr_error);
> >> > > > > +
> >> > > > > + dev_dbg(dev, "set PR port ID\n");
> >> > > > > +
> >> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> >> > > > > + pr_ctrl &= ~FME_PR_CTRL_PR_RGN_ID;
> >> > > > > + pr_ctrl |= FIELD_PREP(FME_PR_CTRL_PR_RGN_ID, info->region_id);
> >> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> >> > > > > +
> >> > > > > + return 0;
> >> > > > > +}
> >> > > > > +
> >> > > > > +static int fme_mgr_write(struct fpga_manager *mgr,
> >> > > > > + const char *buf, size_t count)
> >> > > > > +{
> >> > > > > + struct device *dev = &mgr->dev;
> >> > > > > + struct fme_mgr_priv *priv = mgr->priv;
> >> > > > > + void __iomem *fme_pr = priv->ioaddr;
> >> > > > > + u64 pr_ctrl, pr_status, pr_data;
> >> > > > > + int delay = 0, pr_credit, i = 0;
> >> > > > > +
> >> > > > > + dev_dbg(dev, "start request\n");
> >> > > > > +
> >> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> >> > > > > + pr_ctrl |= FME_PR_CTRL_PR_START;
> >> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> >> > > > > +
> >> > > > > + dev_dbg(dev, "pushing data from bitstream to HW\n");
> >> > > > > +
> >> > > > > + pr_status = readq(fme_pr + FME_PR_STS);
> >> > > > > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> >> > > > > +
> >> > > > > + while (count > 0) {
> >> > > > > + while (pr_credit <= 1) {
> >> > > > > + if (delay++ > PR_WAIT_TIMEOUT) {
> >> > > > > + dev_err(dev, "maximum try\n");
> >> > > >
> >> > > > It looks like this is waiting for an entry in a queue and timing out
> >> > > > here. Could you add a some comments for pr_credit above and this
> >> > > > loop? Also a better error message, perhaps "PR credit timeout"?
> >> > >
> >> > > Driver needs to read the PR credit to know if it could push PR data
> >> > > to hardware or not. I will add more description here on this PR credit,
> >> > > and use "PR credit timeout" as error message.
> >> > >
> >> > > >
> >> > > > > + return -ETIMEDOUT;
> >> > > > > + }
> >> > > > > + udelay(1);
> >> > > > > +
> >> > > > > + pr_status = readq(fme_pr + FME_PR_STS);
> >> > > > > + pr_credit = FIELD_GET(FME_PR_STS_PR_CREDIT, pr_status);
> >> > > > > + }
> >> > > > > +
> >> > > > > + if (count >= 4) {
> >> > > > > + pr_data = 0;
> >> > > > > + pr_data |= FIELD_PREP(FME_PR_DATA_PR_DATA_RAW,
> >> > > > > + *(((u32 *)buf) + i));
> >> > > > > + writeq(pr_data, fme_pr + FME_PR_DATA);
> >> > > > > + count -= 4;
> >> > > > > + pr_credit--;
> >> > > > > + i++;
> >> > > > > + } else {
> >> > > > > + WARN_ON(1);
> >> > > > > + return -EINVAL;
> >> > > > > + }
> >> > > > > + }
> >> > > > > +
> >> > > > > + return 0;
> >> > > > > +}
> >> > > > > +
> >> > > > > +static int fme_mgr_write_complete(struct fpga_manager *mgr,
> >> > > > > + struct fpga_image_info *info)
> >> > > > > +{
> >> > > > > + struct device *dev = &mgr->dev;
> >> > > > > + struct fme_mgr_priv *priv = mgr->priv;
> >> > > > > + void __iomem *fme_pr = priv->ioaddr;
> >> > > > > + u64 pr_ctrl;
> >> > > > > +
> >> > > > > + pr_ctrl = readq(fme_pr + FME_PR_CTRL);
> >> > > > > + pr_ctrl |= FME_PR_CTRL_PR_COMPLETE;
> >> > > > > + writeq(pr_ctrl, fme_pr + FME_PR_CTRL);
> >> > > > > +
> >> > > > > + dev_dbg(dev, "green bitstream push complete\n");
> >> > > > > + dev_dbg(dev, "waiting for HW to release PR resource\n");
> >> > > > > +
> >> > > > > + if (readq_poll_timeout(fme_pr + FME_PR_CTRL, pr_ctrl,
> >> > > > > + !(pr_ctrl & FME_PR_CTRL_PR_START), 1,
> >> > > > > + PR_WAIT_TIMEOUT)) {
> >> > > > > + dev_err(dev, "maximum try.\n");
> >> > > >
> >> > > > Some message specific to here also, please.
> >> > > >
> >> > > > > + return -ETIMEDOUT;
> >> > > > > + }
> >> > > > > +
> >> > > > > + dev_dbg(dev, "PR operation complete, checking status\n");
> >> > > > > + priv->pr_error = fme_mgr_pr_error_handle(fme_pr);
> >> > > > > + if (priv->pr_error) {
> >> > > > > + dev_dbg(dev, "PR error detected %llx\n",
> >> > > > > + (unsigned long long)priv->pr_error);
> >> > > > > + return -EIO;
> >> > > > > + }
> >> > > > > +
> >> > > > > + dev_dbg(dev, "PR done successfully\n");
> >> > > > > +
> >> > > > > + return 0;
> >> > > > > +}
> >> > > > > +
> >> > > > > +static enum fpga_mgr_states fme_mgr_state(struct fpga_manager *mgr)
> >> > > > > +{
> >> > > > > + return FPGA_MGR_STATE_UNKNOWN;
> >> > > > > +}
> >> > > > > +
> >> > > > > +static u64 fme_mgr_status(struct fpga_manager *mgr)
> >> > > > > +{
> >> > > > > + struct fme_mgr_priv *priv = mgr->priv;
> >> > > > > +
> >> > > > > + return pr_error_to_mgr_status(priv->pr_error);
> >> > > > > +}
> >> > > > > +
> >> > > > > +static const struct fpga_manager_ops fme_mgr_ops = {
> >> > > > > + .write_init = fme_mgr_write_init,
> >> > > > > + .write = fme_mgr_write,
> >> > > > > + .write_complete = fme_mgr_write_complete,
> >> > > > > + .state = fme_mgr_state,
> >> > > > > + .status = fme_mgr_status,
> >> > > > > +};
> >> > > > > +
> >> > > > > +static int fme_mgr_probe(struct platform_device *pdev)
> >> > > > > +{
> >> > > > > + struct device *dev = &pdev->dev;
> >> > > > > + struct fme_mgr_priv *priv;
> >> > > > > + struct fpga_manager *mgr;
> >> > > > > + struct resource *res;
> >> > > > > + int ret;
> >> > > > > +
> >> > > > > + priv = devm_kzalloc(dev, sizeof(*priv), GFP_KERNEL);
> >> > > > > + if (!priv)
> >> > > > > + return -ENOMEM;
> >> > > > > +
> >> > > > > + res = platform_get_resource(pdev, IORESOURCE_MEM, 0);
> >> > > > > + priv->ioaddr = devm_ioremap(dev, res->start, resource_size(res));
> >> > > > > + if (IS_ERR(priv->ioaddr))
> >> > > > > + return PTR_ERR(priv->ioaddr);
> >> > > > > +
> >> > > > > + ret = sysfs_create_files(&pdev->dev.kobj, fme_mgr_attrs);
> >> > > > > + if (ret)
> >> > > > > + return ret;
> >> > > > > +
> >> > > > > + mgr = devm_kzalloc(dev, sizeof(*mgr), GFP_KERNEL);
> >> > > > > + if (!mgr)
> >> > > > > + goto sysfs_remove_exit;
> >> > > > > +
> >> > > > > + mgr->name = "DFL FPGA Manager";
> >> > > > > + mgr->mops = &fme_mgr_ops;
> >> > > > > + mgr->priv = priv;
> >> > > > > + mgr->parent = dev;
> >> > > > > + platform_set_drvdata(pdev, mgr);
> >> > > > > +
> >> > > > > + ret = fpga_mgr_register(mgr);
> >> > > > > + if (ret) {
> >> > > > > + dev_err(dev, "unable to register FPGA manager\n");
> >> > > > > + goto sysfs_remove_exit;
> >> > > > > + }
> >> > > > > +
> >> > > > > + return 0;
> >> > > > > +
> >> > > > > +sysfs_remove_exit:
> >> > > > > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> >> > > > > + return ret;
> >> > > > > +}
> >> > > > > +
> >> > > > > +static int fme_mgr_remove(struct platform_device *pdev)
> >> > > > > +{
> >> > > > > + struct fpga_manager *mgr = platform_get_drvdata(pdev);
> >> > > > > +
> >> > > > > + fpga_mgr_unregister(mgr);
> >> > > > > + sysfs_remove_files(&pdev->dev.kobj, fme_mgr_attrs);
> >> > > > > +
> >> > > > > + return 0;
> >> > > > > +}
> >> > > > > +
> >> > > > > +static struct platform_driver fme_mgr_driver = {
> >> > > > > + .driver = {
> >> > > > > + .name = FPGA_DFL_FME_MGR,
> >> > > > > + },
> >> > > > > + .probe = fme_mgr_probe,
> >> > > > > + .remove = fme_mgr_remove,
> >> > > > > +};
> >> > > > > +
> >> > > > > +module_platform_driver(fme_mgr_driver);
> >> > > > > +
> >> > > > > +MODULE_DESCRIPTION("FPGA Manager for FPGA Management Engine");
> >> > > > > +MODULE_AUTHOR("Intel Corporation");
> >> > > > > +MODULE_LICENSE("GPL v2");
> >> > > > > +MODULE_ALIAS("platform:fpga-dfl-fme-mgr");
> >> > > > > diff --git a/drivers/fpga/fpga-dfl.h b/drivers/fpga/fpga-dfl.h
> >> > > > > index e5a1094..d45eb82 100644
> >> > > > > --- a/drivers/fpga/fpga-dfl.h
> >> > > > > +++ b/drivers/fpga/fpga-dfl.h
> >> > > > > @@ -130,7 +130,44 @@
> >> > > > >
> >> > > > > /* FME Partial Reconfiguration Sub Feature Register Set */
> >> > > > > #define FME_PR_DFH DFH
> >> > > > > -#define FME_PR_SIZE DFH_SIZE
> >> > > > > +#define FME_PR_CTRL 0x8
> >> > > > > +#define FME_PR_STS 0x10
> >> > > > > +#define FME_PR_DATA 0x18
> >> > > > > +#define FME_PR_ERR 0x20
> >> > > > > +#define FME_PR_INTFC_ID_H 0xA8
> >> > > > > +#define FME_PR_INTFC_ID_L 0xB0
> >> > > > > +#define FME_PR_SIZE 0xB8
> >> > > > > +
> >> > > > > +/* FME PR Control Register Bitfield */
> >> > > > > +#define FME_PR_CTRL_PR_RST BIT(0) /* Reset PR engine */
> >> > > > > +#define FME_PR_CTRL_PR_RSTACK BIT(4) /* Ack for PR engine reset */
> >> > > > > +#define FME_PR_CTRL_PR_RGN_ID GENMASK_ULL(9, 7) /* PR Region ID */
> >> > > > > +#define FME_PR_CTRL_PR_START BIT(12) /* Start to request for PR service */
> >> > > > > +#define FME_PR_CTRL_PR_COMPLETE BIT(13) /* PR data push complete notification */
> >> > > > > +
> >> > > > > +/* FME PR Status Register Bitfield */
> >> > > > > +/* Number of available entries in HW queue inside the PR engine. */
> >> > > > > +#define FME_PR_STS_PR_CREDIT GENMASK_ULL(8, 0)
> >> > > > > +#define FME_PR_STS_PR_STS BIT(16) /* PR operation status */
> >> > > > > +#define FME_PR_STS_PR_STS_IDLE 0
> >> > > > > +#define FME_PR_STS_PR_CTRLR_STS GENMASK_ULL(22, 20) /* Controller status */
> >> > > > > +#define FME_PR_STS_PR_HOST_STS GENMASK_ULL(27, 24) /* PR host status */
> >> > > > > +
> >> > > > > +/* FME PR Data Register Bitfield */
> >> > > > > +/* PR data from the raw-binary file. */
> >> > > > > +#define FME_PR_DATA_PR_DATA_RAW GENMASK_ULL(32, 0)
> >> > > > > +
> >> > > > > +/* FME PR Error Register */
> >> > > > > +/* Previous PR Operation errors detected. */
> >> > > > > +#define FME_PR_ERR_OPERATION_ERR BIT(0)
> >> > > > > +/* CRC error detected. */
> >> > > > > +#define FME_PR_ERR_CRC_ERR BIT(1)
> >> > > > > +/* Incompatible PR bitstream detected. */
> >> > > > > +#define FME_PR_ERR_INCOMPATIBLE_BS BIT(2)
> >> > > > > +/* PR data push protocol violated. */
> >> > > > > +#define FME_PR_ERR_PROTOCOL_ERR BIT(3)
> >> > > > > +/* PR data fifo overflow error detected */
> >> > > > > +#define FME_PR_ERR_FIFO_OVERFLOW BIT(4)
> >> > > > >
> >> > > > > /* FME HSSI Sub Feature Register Set */
> >> > > > > #define FME_HSSI_DFH DFH
> >> > > >
> >> > > > I see fpga-dfl.h as enumeration code which is separate from any driver
> >> > > > implementation specifics other than what's required for the DFL
> >> > > > enumeration scheme. These PR engine #defines should move to a .h
> >> > > > that is dedicated to this specific PR hardware device. If someone else
> >> > > > adds a different PR device to the framework, their PR driver would
> >> > > > also have its own .h. Same for any other modules that aren't central
> >> > > > to DFL enumeration.
> >> > >
> >> > > Sure, will move PR related register to a separated header file.
> >> > > DFL enumeration related registers, will still be kept in fpga-dfl.h.
> >> > >
> >> > > Thanks
> >> > > Hao
> >> > >
> >> > > >
> >> > > > Thanks,
> >> > > > Alan
> >> > > >
> >> > > > > --
> >> > > > > 1.8.3.1
> >> > > > >
> >>
> >> Thanks,
> >> Moritz
> >
> >

2018-02-06 04:27:10

by Alan Tull

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

On Mon, Feb 5, 2018 at 8:17 PM, Wu Hao <[email protected]> wrote:
> On Mon, Feb 05, 2018 at 11:21:52AM -0600, Alan Tull wrote:
>> On Sun, Feb 4, 2018 at 4:05 AM, Wu Hao <[email protected]> wrote:
>> > On Sat, Feb 03, 2018 at 11:41:24AM +0100, Moritz Fischer wrote:
>> >> Hi Hao,
>> >>
>> >> On Fri, Feb 02, 2018 at 04:26:26PM -0800, Luebbers, Enno wrote:
>> >> > Hi Hao, Alan,
>> >> >
>> >> > On Fri, Feb 02, 2018 at 05:42:13PM +0800, Wu Hao wrote:
>> >> > > On Thu, Feb 01, 2018 at 04:00:36PM -0600, Alan Tull wrote:
>> >> > > > On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:
>> >> > > >
>> >> > > > Hi Hao,
>> >> > > >
>> >> > > > A few comments below. Besides that, looks good.
>> >> > > >
>> >> > > > > This patch adds fpga manager driver for FPGA Management Engine (FME). It
>> >> > > > > implements fpga_manager_ops for FPGA Partial Reconfiguration function.
>> >> > > > >
>> >> > > > > Signed-off-by: Tim Whisonant <[email protected]>
>> >> > > > > Signed-off-by: Enno Luebbers <[email protected]>
>> >> > > > > Signed-off-by: Shiva Rao <[email protected]>
>> >> > > > > Signed-off-by: Christopher Rauer <[email protected]>
>> >> > > > > Signed-off-by: Kang Luwei <[email protected]>
>> >> > > > > Signed-off-by: Xiao Guangrong <[email protected]>
>> >> > > > > Signed-off-by: Wu Hao <[email protected]>
>> >> > > > > ----
>> >> > > > > v3: rename driver to dfl-fpga-fme-mgr
>> >> > > > > implemented status callback for fpga manager
>> >> > > > > rebased due to fpga api changes
>> >> > > > > ---
>> >> > > > > .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
>> >> > > > > drivers/fpga/Kconfig | 6 +
>> >> > > > > drivers/fpga/Makefile | 1 +
>> >> > > > > drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
>> >> > > > > drivers/fpga/fpga-dfl.h | 39 ++-
>> >> > > > > 5 files changed, 371 insertions(+), 1 deletion(-)
>> >> > > > > create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
>> >> > > > > create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
>> >> > > > >
>> >> > > > > diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
>> >> > > > > new file mode 100644
>> >> > > > > index 0000000..2d4f917
>> >> > > > > --- /dev/null
>> >> > > > > +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
>> >> > > > > @@ -0,0 +1,8 @@
>> >> > > > > +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
>> >> > > > > +Date: November 2017
>> >> > > > > +KernelVersion: 4.15
>> >> > > > > +Contact: Wu Hao <[email protected]>
>> >> > > > > +Description: Read-only. It returns interface id of partial reconfiguration
>> >> > > > > + hardware. Userspace could use this information to check if
>> >> > > > > + current hardware is compatible with given image before FPGA
>> >> > > > > + programming.
>> >> > > >
>> >> > > > I'm a little confused by this. I can understand that the PR bitstream
>> >> > > > has a dependency on the FPGA's static image, but I don't understand
>> >> > > > the dependency of the bistream on the hardware that is used to program
>> >> > > > the bitstream to the FPGA.
>> >> > >
>> >> > > Sorry for the confusion, the interface_id is used to indicate the version of
>> >> > > the hardware for partial reconfiguration (it's part of the static image of
>> >> > > the FPGA device). Will improve the description on this.
>> >>
>> >> I'm not sure userland should be making the call on whether what you're
>> >> trying to load is compatible or not.
>>
>> Could you explain more about what your concern was about this (unless
>> Hao has covered it below)?
>>
>> It makes sense to me in this use case at least since userspace has a
>> pile of images and is choosing which one to load.
>>
>> >> Isn't there a way to check this in
>> >> your PR reconfiguration handler in-kernel?
>> >
>> > Hi Moritz
>> >
>> > Actually with current driver interface, doing a partial reconfiguration with an
>> > incompatible image, then driver will report PR failure with error code
>> > FPGA_MGR_STATUS_INCOMPATIBLE_IMAGE_ERR as hardware checks it, but anyway user
>> > needs to know hardware interface_id information to find or re-generated correct
>> > images. I think it's more flexible to leave it to userspace on using this
>> > information exposed by driver. : )
>> >
>> > Thanks
>> > Hao
>> >
>> >>
>> >> > >
>> >> >
>> >> > The interface_id expresses the compatibility of the static region with PR
>> >> > bitstreams generated for it. It changes every time a new static region is
>> >> > generated.
>>
>> In the near future the DFL framework will be used with SoC's that have
>> a hard FPGA PR manager (that's not part of the static region). The
>> hard FPGA manager driver won't know anything about the static region.
>>
>> >> >
>> >> > Would it make more sense to have the interface_id exposed as part of the FME
>> >> > device (which represents the static region)? I'm not sure - it kind of also
>> >> > makes sense here, where you would have all the information in one place (if the
>> >> > interface_id matches, I can use this component to program a bitstream).
>>
>> According to the intel-fpga.txt document, the identifier for the
>> static image is at
>>
>> /sys/class/fpga_region/regionX/fpga-dfl-fme.n/bitstream_id
>
> Hi Alan

Hi Hao,

>
> This bitstream_id refects the full static region version. As you know, PR is
> only a sub feature of the FME functional unit, it's possible that we have
> different static region (different bitstream_id) but it has the exact same
> PR sub feature under the FME, only changes on other sub features or function
> units.

OK, thanks for that explanation. That makes sense but could have
easily been different. Please document this somewhere.

Alan

2018-02-06 04:27:36

by Alan Tull

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

On Mon, Feb 5, 2018 at 7:47 PM, Wu Hao <[email protected]> wrote:
> On Mon, Feb 05, 2018 at 10:36:45AM -0800, Luebbers, Enno wrote:
>> Hi Hao,
>>
>> On Sun, Feb 04, 2018 at 05:37:06PM +0800, Wu Hao wrote:
>> > On Fri, Feb 02, 2018 at 04:26:26PM -0800, Luebbers, Enno wrote:
>> > > Hi Hao, Alan,
>> > >
>> > > On Fri, Feb 02, 2018 at 05:42:13PM +0800, Wu Hao wrote:
>> > > > On Thu, Feb 01, 2018 at 04:00:36PM -0600, Alan Tull wrote:
>> > > > > On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:
>> > > > >
>> > > > > Hi Hao,
>> > > > >
>> > > > > A few comments below. Besides that, looks good.
>> > > > >
>> > > > > > This patch adds fpga manager driver for FPGA Management Engine (FME). It
>> > > > > > implements fpga_manager_ops for FPGA Partial Reconfiguration function.
>> > > > > >
>> > > > > > Signed-off-by: Tim Whisonant <[email protected]>
>> > > > > > Signed-off-by: Enno Luebbers <[email protected]>
>> > > > > > Signed-off-by: Shiva Rao <[email protected]>
>> > > > > > Signed-off-by: Christopher Rauer <[email protected]>
>> > > > > > Signed-off-by: Kang Luwei <[email protected]>
>> > > > > > Signed-off-by: Xiao Guangrong <[email protected]>
>> > > > > > Signed-off-by: Wu Hao <[email protected]>
>> > > > > > ----
>> > > > > > v3: rename driver to dfl-fpga-fme-mgr
>> > > > > > implemented status callback for fpga manager
>> > > > > > rebased due to fpga api changes
>> > > > > > ---
>> > > > > > .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
>> > > > > > drivers/fpga/Kconfig | 6 +
>> > > > > > drivers/fpga/Makefile | 1 +
>> > > > > > drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
>> > > > > > drivers/fpga/fpga-dfl.h | 39 ++-
>> > > > > > 5 files changed, 371 insertions(+), 1 deletion(-)
>> > > > > > create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
>> > > > > > create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
>> > > > > >
>> > > > > > diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
>> > > > > > new file mode 100644
>> > > > > > index 0000000..2d4f917
>> > > > > > --- /dev/null
>> > > > > > +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
>> > > > > > @@ -0,0 +1,8 @@
>> > > > > > +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
>> > > > > > +Date: November 2017
>> > > > > > +KernelVersion: 4.15
>> > > > > > +Contact: Wu Hao <[email protected]>
>> > > > > > +Description: Read-only. It returns interface id of partial reconfiguration
>> > > > > > + hardware. Userspace could use this information to check if
>> > > > > > + current hardware is compatible with given image before FPGA
>> > > > > > + programming.
>> > > > >
>> > > > > I'm a little confused by this. I can understand that the PR bitstream
>> > > > > has a dependency on the FPGA's static image, but I don't understand
>> > > > > the dependency of the bistream on the hardware that is used to program
>> > > > > the bitstream to the FPGA.
>> > > >
>> > > > Sorry for the confusion, the interface_id is used to indicate the version of
>> > > > the hardware for partial reconfiguration (it's part of the static image of
>> > > > the FPGA device). Will improve the description on this.
>> > > >
>> > >
>> > > The interface_id expresses the compatibility of the static region with PR
>> > > bitstreams generated for it. It changes every time a new static region is
>> > > generated.
>> > >
>> > > Would it make more sense to have the interface_id exposed as part of the FME
>> > > device (which represents the static region)? I'm not sure - it kind of also
>> > > makes sense here, where you would have all the information in one place (if the
>> > > interface_id matches, I can use this component to program a bitstream).
>> >
>> > Hi Enno
>> >
>> > Yes, this interface is under fpga-dfl-fme-mgr.0, and fpga-dfl-fme-mgr.0 is
>> > under fpga-dfl-fme.0. It's part of the FME device for sure. From another
>> > point of view, it means if anyone wants to do PR on this Intel FPGA device,
>> > he needs to find the FME device firstly, and then check if any fpga manager
>> > created under this FME device, if yes, check the interface_id before PR via
>> > the FME device node ioctl.
>>
>> That sounds good, thank you!
>>
>> >
>> > >
>> > > Sorry for my limited understanding of the infrastructure - would this same
>> > > "fpga-dfl-fme-mgr.0" be used for PR if we had multiple PR regions? In that case
>> > > it would need to expose multiple interface_ids (or we'd have to track both
>> > > interface IDs and an identifier for the target PR region).
>> >
>> > Yes, the fpga manager could be shared with different PR regions.
>> >
>> > Sorry, I'm not sure where we need to expose multiple interface_ids and why.
>>
>> It's basically a question of how to determine bitstream compatibility - either,
>> there's a separate interface_id per reconfigurable region, or there is a single
>> interface_id for the entire device. Both make sense from a certain perspective.
>>
>> If there are multiple interface_ids per device (one per region), the driver
>> would need to expose all of them. If there's only a single one, the driver only
>> exposes that one ID - compatibility would be determined by looking at both that
>> single interface_id _and_ the identifier/number of the targeted region.
>>
>> I would prefer a separate interface_id per region - it seems more generic and
>> flexible.

Hi Enno,

I agree with this.

>
> It's possible to have per region interface_id (or even both per dev interface_id
> and per region interface_id at the same time), but per FME PR sub feature
> implementation, it supports multiple PR regions, but only provide one interface
> id, so at least in this case, it's not per-region information per my
> understanding. We can consider it later when hardware really supports it. : )

Hi Hao,

I understand that in the case of this PR hardware, the region to
program is selected when the region_id to program is written to a PR
hardware control register. For another example, Arria10 has a hard PR
hardware and the PR bitstream lands in the area of the FPGA for which
it was compiled. In that case, for the PR bitstream to be compatible
with a PR region, the layout of the edge connections also needs to be
compatible, so compatibility is per-region in that case instead of
per-PR hardware. And besides, as I said yesterday, the hard PR
hardware would not know what the static region ID is when this
framework is used with such a device.

That's why I think making the id per-region may be more future proof,
even if it may see unnecessary in the case of the original blue bits
this was written for.

Alan

2018-02-06 05:34:30

by Wu Hao

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

On Mon, Feb 05, 2018 at 10:25:27PM -0600, Alan Tull wrote:
> On Mon, Feb 5, 2018 at 8:17 PM, Wu Hao <[email protected]> wrote:
> > On Mon, Feb 05, 2018 at 11:21:52AM -0600, Alan Tull wrote:
> >> On Sun, Feb 4, 2018 at 4:05 AM, Wu Hao <[email protected]> wrote:
> >> > On Sat, Feb 03, 2018 at 11:41:24AM +0100, Moritz Fischer wrote:
> >> >> Hi Hao,
> >> >>
> >> >> On Fri, Feb 02, 2018 at 04:26:26PM -0800, Luebbers, Enno wrote:
> >> >> > Hi Hao, Alan,
> >> >> >
> >> >> > On Fri, Feb 02, 2018 at 05:42:13PM +0800, Wu Hao wrote:
> >> >> > > On Thu, Feb 01, 2018 at 04:00:36PM -0600, Alan Tull wrote:
> >> >> > > > On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:
> >> >> > > >
> >> >> > > > Hi Hao,
> >> >> > > >
> >> >> > > > A few comments below. Besides that, looks good.
> >> >> > > >
> >> >> > > > > This patch adds fpga manager driver for FPGA Management Engine (FME). It
> >> >> > > > > implements fpga_manager_ops for FPGA Partial Reconfiguration function.
> >> >> > > > >
> >> >> > > > > Signed-off-by: Tim Whisonant <[email protected]>
> >> >> > > > > Signed-off-by: Enno Luebbers <[email protected]>
> >> >> > > > > Signed-off-by: Shiva Rao <[email protected]>
> >> >> > > > > Signed-off-by: Christopher Rauer <[email protected]>
> >> >> > > > > Signed-off-by: Kang Luwei <[email protected]>
> >> >> > > > > Signed-off-by: Xiao Guangrong <[email protected]>
> >> >> > > > > Signed-off-by: Wu Hao <[email protected]>
> >> >> > > > > ----
> >> >> > > > > v3: rename driver to dfl-fpga-fme-mgr
> >> >> > > > > implemented status callback for fpga manager
> >> >> > > > > rebased due to fpga api changes
> >> >> > > > > ---
> >> >> > > > > .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
> >> >> > > > > drivers/fpga/Kconfig | 6 +
> >> >> > > > > drivers/fpga/Makefile | 1 +
> >> >> > > > > drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
> >> >> > > > > drivers/fpga/fpga-dfl.h | 39 ++-
> >> >> > > > > 5 files changed, 371 insertions(+), 1 deletion(-)
> >> >> > > > > create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> >> >> > > > > create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
> >> >> > > > >
> >> >> > > > > diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> >> >> > > > > new file mode 100644
> >> >> > > > > index 0000000..2d4f917
> >> >> > > > > --- /dev/null
> >> >> > > > > +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> >> >> > > > > @@ -0,0 +1,8 @@
> >> >> > > > > +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
> >> >> > > > > +Date: November 2017
> >> >> > > > > +KernelVersion: 4.15
> >> >> > > > > +Contact: Wu Hao <[email protected]>
> >> >> > > > > +Description: Read-only. It returns interface id of partial reconfiguration
> >> >> > > > > + hardware. Userspace could use this information to check if
> >> >> > > > > + current hardware is compatible with given image before FPGA
> >> >> > > > > + programming.
> >> >> > > >
> >> >> > > > I'm a little confused by this. I can understand that the PR bitstream
> >> >> > > > has a dependency on the FPGA's static image, but I don't understand
> >> >> > > > the dependency of the bistream on the hardware that is used to program
> >> >> > > > the bitstream to the FPGA.
> >> >> > >
> >> >> > > Sorry for the confusion, the interface_id is used to indicate the version of
> >> >> > > the hardware for partial reconfiguration (it's part of the static image of
> >> >> > > the FPGA device). Will improve the description on this.
> >> >>
> >> >> I'm not sure userland should be making the call on whether what you're
> >> >> trying to load is compatible or not.
> >>
> >> Could you explain more about what your concern was about this (unless
> >> Hao has covered it below)?
> >>
> >> It makes sense to me in this use case at least since userspace has a
> >> pile of images and is choosing which one to load.
> >>
> >> >> Isn't there a way to check this in
> >> >> your PR reconfiguration handler in-kernel?
> >> >
> >> > Hi Moritz
> >> >
> >> > Actually with current driver interface, doing a partial reconfiguration with an
> >> > incompatible image, then driver will report PR failure with error code
> >> > FPGA_MGR_STATUS_INCOMPATIBLE_IMAGE_ERR as hardware checks it, but anyway user
> >> > needs to know hardware interface_id information to find or re-generated correct
> >> > images. I think it's more flexible to leave it to userspace on using this
> >> > information exposed by driver. : )
> >> >
> >> > Thanks
> >> > Hao
> >> >
> >> >>
> >> >> > >
> >> >> >
> >> >> > The interface_id expresses the compatibility of the static region with PR
> >> >> > bitstreams generated for it. It changes every time a new static region is
> >> >> > generated.
> >>
> >> In the near future the DFL framework will be used with SoC's that have
> >> a hard FPGA PR manager (that's not part of the static region). The
> >> hard FPGA manager driver won't know anything about the static region.
> >>
> >> >> >
> >> >> > Would it make more sense to have the interface_id exposed as part of the FME
> >> >> > device (which represents the static region)? I'm not sure - it kind of also
> >> >> > makes sense here, where you would have all the information in one place (if the
> >> >> > interface_id matches, I can use this component to program a bitstream).
> >>
> >> According to the intel-fpga.txt document, the identifier for the
> >> static image is at
> >>
> >> /sys/class/fpga_region/regionX/fpga-dfl-fme.n/bitstream_id
> >
> > Hi Alan
>
> Hi Hao,
>
> >
> > This bitstream_id refects the full static region version. As you know, PR is
> > only a sub feature of the FME functional unit, it's possible that we have
> > different static region (different bitstream_id) but it has the exact same
> > PR sub feature under the FME, only changes on other sub features or function
> > units.
>
> OK, thanks for that explanation. That makes sense but could have
> easily been different. Please document this somewhere.

Sure, will do.

Thanks
Hao

2018-02-06 06:46:37

by Moritz Fischer

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

On Mon, Feb 05, 2018 at 10:25:27PM -0600, Alan Tull wrote:
> On Mon, Feb 5, 2018 at 8:17 PM, Wu Hao <[email protected]> wrote:
> > On Mon, Feb 05, 2018 at 11:21:52AM -0600, Alan Tull wrote:
> >> On Sun, Feb 4, 2018 at 4:05 AM, Wu Hao <[email protected]> wrote:
> >> > On Sat, Feb 03, 2018 at 11:41:24AM +0100, Moritz Fischer wrote:
> >> >> Hi Hao,
> >> >>
> >> >> On Fri, Feb 02, 2018 at 04:26:26PM -0800, Luebbers, Enno wrote:
> >> >> > Hi Hao, Alan,
> >> >> >
> >> >> > On Fri, Feb 02, 2018 at 05:42:13PM +0800, Wu Hao wrote:
> >> >> > > On Thu, Feb 01, 2018 at 04:00:36PM -0600, Alan Tull wrote:
> >> >> > > > On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:
> >> >> > > >
> >> >> > > > Hi Hao,
> >> >> > > >
> >> >> > > > A few comments below. Besides that, looks good.
> >> >> > > >
> >> >> > > > > This patch adds fpga manager driver for FPGA Management Engine (FME). It
> >> >> > > > > implements fpga_manager_ops for FPGA Partial Reconfiguration function.
> >> >> > > > >
> >> >> > > > > Signed-off-by: Tim Whisonant <[email protected]>
> >> >> > > > > Signed-off-by: Enno Luebbers <[email protected]>
> >> >> > > > > Signed-off-by: Shiva Rao <[email protected]>
> >> >> > > > > Signed-off-by: Christopher Rauer <[email protected]>
> >> >> > > > > Signed-off-by: Kang Luwei <[email protected]>
> >> >> > > > > Signed-off-by: Xiao Guangrong <[email protected]>
> >> >> > > > > Signed-off-by: Wu Hao <[email protected]>
> >> >> > > > > ----
> >> >> > > > > v3: rename driver to dfl-fpga-fme-mgr
> >> >> > > > > implemented status callback for fpga manager
> >> >> > > > > rebased due to fpga api changes
> >> >> > > > > ---
> >> >> > > > > .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
> >> >> > > > > drivers/fpga/Kconfig | 6 +
> >> >> > > > > drivers/fpga/Makefile | 1 +
> >> >> > > > > drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
> >> >> > > > > drivers/fpga/fpga-dfl.h | 39 ++-
> >> >> > > > > 5 files changed, 371 insertions(+), 1 deletion(-)
> >> >> > > > > create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> >> >> > > > > create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
> >> >> > > > >
> >> >> > > > > diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> >> >> > > > > new file mode 100644
> >> >> > > > > index 0000000..2d4f917
> >> >> > > > > --- /dev/null
> >> >> > > > > +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> >> >> > > > > @@ -0,0 +1,8 @@
> >> >> > > > > +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
> >> >> > > > > +Date: November 2017
> >> >> > > > > +KernelVersion: 4.15
> >> >> > > > > +Contact: Wu Hao <[email protected]>
> >> >> > > > > +Description: Read-only. It returns interface id of partial reconfiguration
> >> >> > > > > + hardware. Userspace could use this information to check if
> >> >> > > > > + current hardware is compatible with given image before FPGA
> >> >> > > > > + programming.
> >> >> > > >
> >> >> > > > I'm a little confused by this. I can understand that the PR bitstream
> >> >> > > > has a dependency on the FPGA's static image, but I don't understand
> >> >> > > > the dependency of the bistream on the hardware that is used to program
> >> >> > > > the bitstream to the FPGA.
> >> >> > >
> >> >> > > Sorry for the confusion, the interface_id is used to indicate the version of
> >> >> > > the hardware for partial reconfiguration (it's part of the static image of
> >> >> > > the FPGA device). Will improve the description on this.
> >> >>
> >> >> I'm not sure userland should be making the call on whether what you're
> >> >> trying to load is compatible or not.

I guess I was worried that you might take down a PR region that you
could've avoided. On second thought unless the kernel could extract that
info somehow from the bitstream before programming this is probably
anyway not possible, since the only one that knows the id -> bitstream
mapping is userland.

Hao's comments further down outline well enough that the hardware can
deal with incompatible bitstreams. So ignore my comments ;-)

> >>
> >> Could you explain more about what your concern was about this (unless
> >> Hao has covered it below)?
> >>
> >> It makes sense to me in this use case at least since userspace has a
> >> pile of images and is choosing which one to load.
> >>
> >> >> Isn't there a way to check this in
> >> >> your PR reconfiguration handler in-kernel?
> >> >
> >> > Hi Moritz
> >> >
> >> > Actually with current driver interface, doing a partial reconfiguration with an
> >> > incompatible image, then driver will report PR failure with error code
> >> > FPGA_MGR_STATUS_INCOMPATIBLE_IMAGE_ERR as hardware checks it, but anyway user
> >> > needs to know hardware interface_id information to find or re-generated correct
> >> > images. I think it's more flexible to leave it to userspace on using this
> >> > information exposed by driver. : )
> >> >
> >> > Thanks
> >> > Hao
> >> >
> >> >>
> >> >> > >
> >> >> >
> >> >> > The interface_id expresses the compatibility of the static region with PR
> >> >> > bitstreams generated for it. It changes every time a new static region is
> >> >> > generated.
> >>
> >> In the near future the DFL framework will be used with SoC's that have
> >> a hard FPGA PR manager (that's not part of the static region). The
> >> hard FPGA manager driver won't know anything about the static region.
> >>
> >> >> >
> >> >> > Would it make more sense to have the interface_id exposed as part of the FME
> >> >> > device (which represents the static region)? I'm not sure - it kind of also
> >> >> > makes sense here, where you would have all the information in one place (if the
> >> >> > interface_id matches, I can use this component to program a bitstream).
> >>
> >> According to the intel-fpga.txt document, the identifier for the
> >> static image is at
> >>
> >> /sys/class/fpga_region/regionX/fpga-dfl-fme.n/bitstream_id
> >
> > Hi Alan
>
> Hi Hao,
>
> >
> > This bitstream_id refects the full static region version. As you know, PR is
> > only a sub feature of the FME functional unit, it's possible that we have
> > different static region (different bitstream_id) but it has the exact same
> > PR sub feature under the FME, only changes on other sub features or function
> > units.
>
> OK, thanks for that explanation. That makes sense but could have
> easily been different. Please document this somewhere.
>
> Alan

Moritz

2018-02-06 06:58:44

by Wu Hao

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

On Mon, Feb 05, 2018 at 10:25:54PM -0600, Alan Tull wrote:
> On Mon, Feb 5, 2018 at 7:47 PM, Wu Hao <[email protected]> wrote:
> > On Mon, Feb 05, 2018 at 10:36:45AM -0800, Luebbers, Enno wrote:
> >> Hi Hao,
> >>
> >> On Sun, Feb 04, 2018 at 05:37:06PM +0800, Wu Hao wrote:
> >> > On Fri, Feb 02, 2018 at 04:26:26PM -0800, Luebbers, Enno wrote:
> >> > > Hi Hao, Alan,
> >> > >
> >> > > On Fri, Feb 02, 2018 at 05:42:13PM +0800, Wu Hao wrote:
> >> > > > On Thu, Feb 01, 2018 at 04:00:36PM -0600, Alan Tull wrote:
> >> > > > > On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:
> >> > > > >
> >> > > > > Hi Hao,
> >> > > > >
> >> > > > > A few comments below. Besides that, looks good.
> >> > > > >
> >> > > > > > This patch adds fpga manager driver for FPGA Management Engine (FME). It
> >> > > > > > implements fpga_manager_ops for FPGA Partial Reconfiguration function.
> >> > > > > >
> >> > > > > > Signed-off-by: Tim Whisonant <[email protected]>
> >> > > > > > Signed-off-by: Enno Luebbers <[email protected]>
> >> > > > > > Signed-off-by: Shiva Rao <[email protected]>
> >> > > > > > Signed-off-by: Christopher Rauer <[email protected]>
> >> > > > > > Signed-off-by: Kang Luwei <[email protected]>
> >> > > > > > Signed-off-by: Xiao Guangrong <[email protected]>
> >> > > > > > Signed-off-by: Wu Hao <[email protected]>
> >> > > > > > ----
> >> > > > > > v3: rename driver to dfl-fpga-fme-mgr
> >> > > > > > implemented status callback for fpga manager
> >> > > > > > rebased due to fpga api changes
> >> > > > > > ---
> >> > > > > > .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
> >> > > > > > drivers/fpga/Kconfig | 6 +
> >> > > > > > drivers/fpga/Makefile | 1 +
> >> > > > > > drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
> >> > > > > > drivers/fpga/fpga-dfl.h | 39 ++-
> >> > > > > > 5 files changed, 371 insertions(+), 1 deletion(-)
> >> > > > > > create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> >> > > > > > create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
> >> > > > > >
> >> > > > > > diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> >> > > > > > new file mode 100644
> >> > > > > > index 0000000..2d4f917
> >> > > > > > --- /dev/null
> >> > > > > > +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> >> > > > > > @@ -0,0 +1,8 @@
> >> > > > > > +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
> >> > > > > > +Date: November 2017
> >> > > > > > +KernelVersion: 4.15
> >> > > > > > +Contact: Wu Hao <[email protected]>
> >> > > > > > +Description: Read-only. It returns interface id of partial reconfiguration
> >> > > > > > + hardware. Userspace could use this information to check if
> >> > > > > > + current hardware is compatible with given image before FPGA
> >> > > > > > + programming.
> >> > > > >
> >> > > > > I'm a little confused by this. I can understand that the PR bitstream
> >> > > > > has a dependency on the FPGA's static image, but I don't understand
> >> > > > > the dependency of the bistream on the hardware that is used to program
> >> > > > > the bitstream to the FPGA.
> >> > > >
> >> > > > Sorry for the confusion, the interface_id is used to indicate the version of
> >> > > > the hardware for partial reconfiguration (it's part of the static image of
> >> > > > the FPGA device). Will improve the description on this.
> >> > > >
> >> > >
> >> > > The interface_id expresses the compatibility of the static region with PR
> >> > > bitstreams generated for it. It changes every time a new static region is
> >> > > generated.
> >> > >
> >> > > Would it make more sense to have the interface_id exposed as part of the FME
> >> > > device (which represents the static region)? I'm not sure - it kind of also
> >> > > makes sense here, where you would have all the information in one place (if the
> >> > > interface_id matches, I can use this component to program a bitstream).
> >> >
> >> > Hi Enno
> >> >
> >> > Yes, this interface is under fpga-dfl-fme-mgr.0, and fpga-dfl-fme-mgr.0 is
> >> > under fpga-dfl-fme.0. It's part of the FME device for sure. From another
> >> > point of view, it means if anyone wants to do PR on this Intel FPGA device,
> >> > he needs to find the FME device firstly, and then check if any fpga manager
> >> > created under this FME device, if yes, check the interface_id before PR via
> >> > the FME device node ioctl.
> >>
> >> That sounds good, thank you!
> >>
> >> >
> >> > >
> >> > > Sorry for my limited understanding of the infrastructure - would this same
> >> > > "fpga-dfl-fme-mgr.0" be used for PR if we had multiple PR regions? In that case
> >> > > it would need to expose multiple interface_ids (or we'd have to track both
> >> > > interface IDs and an identifier for the target PR region).
> >> >
> >> > Yes, the fpga manager could be shared with different PR regions.
> >> >
> >> > Sorry, I'm not sure where we need to expose multiple interface_ids and why.
> >>
> >> It's basically a question of how to determine bitstream compatibility - either,
> >> there's a separate interface_id per reconfigurable region, or there is a single
> >> interface_id for the entire device. Both make sense from a certain perspective.
> >>
> >> If there are multiple interface_ids per device (one per region), the driver
> >> would need to expose all of them. If there's only a single one, the driver only
> >> exposes that one ID - compatibility would be determined by looking at both that
> >> single interface_id _and_ the identifier/number of the targeted region.
> >>
> >> I would prefer a separate interface_id per region - it seems more generic and
> >> flexible.
>
> Hi Enno,
>
> I agree with this.
>
> >
> > It's possible to have per region interface_id (or even both per dev interface_id
> > and per region interface_id at the same time), but per FME PR sub feature
> > implementation, it supports multiple PR regions, but only provide one interface
> > id, so at least in this case, it's not per-region information per my
> > understanding. We can consider it later when hardware really supports it. : )
>
> Hi Hao,
>
> I understand that in the case of this PR hardware, the region to
> program is selected when the region_id to program is written to a PR
> hardware control register. For another example, Arria10 has a hard PR
> hardware and the PR bitstream lands in the area of the FPGA for which
> it was compiled. In that case, for the PR bitstream to be compatible
> with a PR region, the layout of the edge connections also needs to be
> compatible, so compatibility is per-region in that case instead of
> per-PR hardware.

Hi Alan,

Thanks a lot for the explanation. :)

I fully understand the consideration of adding per-region interface_id.

> And besides, as I said yesterday, the hard PR
> hardware would not know what the static region ID is when this
> framework is used with such a device.

Yes, is it possible that hard PR hardware with different versions, requires
different images or different methods for compatibility checking?

>
> That's why I think making the id per-region may be more future proof,
> even if it may see unnecessary in the case of the original blue bits
> this was written for.

I feel that per-PR hardware interface id is useful in some cases, and maybe
in some cases, both per-PR hardware and per-region interface ids are needed
for its compatibility checking, so shall we leave developers to decide to
implement per-PR hardware or per-region or both interface ids based on their
own hardware implementations? How do you think? :)

Thanks
Hao

>
> Alan

2018-02-06 18:55:31

by Alan Tull

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

On Tue, Feb 6, 2018 at 12:47 AM, Wu Hao <[email protected]> wrote:
> On Mon, Feb 05, 2018 at 10:25:54PM -0600, Alan Tull wrote:
>> On Mon, Feb 5, 2018 at 7:47 PM, Wu Hao <[email protected]> wrote:
>> > On Mon, Feb 05, 2018 at 10:36:45AM -0800, Luebbers, Enno wrote:
>> >> Hi Hao,
>> >>
>> >> On Sun, Feb 04, 2018 at 05:37:06PM +0800, Wu Hao wrote:
>> >> > On Fri, Feb 02, 2018 at 04:26:26PM -0800, Luebbers, Enno wrote:
>> >> > > Hi Hao, Alan,
>> >> > >
>> >> > > On Fri, Feb 02, 2018 at 05:42:13PM +0800, Wu Hao wrote:
>> >> > > > On Thu, Feb 01, 2018 at 04:00:36PM -0600, Alan Tull wrote:
>> >> > > > > On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:
>> >> > > > >
>> >> > > > > Hi Hao,
>> >> > > > >
>> >> > > > > A few comments below. Besides that, looks good.
>> >> > > > >
>> >> > > > > > This patch adds fpga manager driver for FPGA Management Engine (FME). It
>> >> > > > > > implements fpga_manager_ops for FPGA Partial Reconfiguration function.
>> >> > > > > >
>> >> > > > > > Signed-off-by: Tim Whisonant <[email protected]>
>> >> > > > > > Signed-off-by: Enno Luebbers <[email protected]>
>> >> > > > > > Signed-off-by: Shiva Rao <[email protected]>
>> >> > > > > > Signed-off-by: Christopher Rauer <[email protected]>
>> >> > > > > > Signed-off-by: Kang Luwei <[email protected]>
>> >> > > > > > Signed-off-by: Xiao Guangrong <[email protected]>
>> >> > > > > > Signed-off-by: Wu Hao <[email protected]>
>> >> > > > > > ----
>> >> > > > > > v3: rename driver to dfl-fpga-fme-mgr
>> >> > > > > > implemented status callback for fpga manager
>> >> > > > > > rebased due to fpga api changes
>> >> > > > > > ---
>> >> > > > > > .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
>> >> > > > > > drivers/fpga/Kconfig | 6 +
>> >> > > > > > drivers/fpga/Makefile | 1 +
>> >> > > > > > drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
>> >> > > > > > drivers/fpga/fpga-dfl.h | 39 ++-
>> >> > > > > > 5 files changed, 371 insertions(+), 1 deletion(-)
>> >> > > > > > create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
>> >> > > > > > create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
>> >> > > > > >
>> >> > > > > > diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
>> >> > > > > > new file mode 100644
>> >> > > > > > index 0000000..2d4f917
>> >> > > > > > --- /dev/null
>> >> > > > > > +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
>> >> > > > > > @@ -0,0 +1,8 @@
>> >> > > > > > +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
>> >> > > > > > +Date: November 2017
>> >> > > > > > +KernelVersion: 4.15
>> >> > > > > > +Contact: Wu Hao <[email protected]>
>> >> > > > > > +Description: Read-only. It returns interface id of partial reconfiguration
>> >> > > > > > + hardware. Userspace could use this information to check if
>> >> > > > > > + current hardware is compatible with given image before FPGA
>> >> > > > > > + programming.
>> >> > > > >
>> >> > > > > I'm a little confused by this. I can understand that the PR bitstream
>> >> > > > > has a dependency on the FPGA's static image, but I don't understand
>> >> > > > > the dependency of the bistream on the hardware that is used to program
>> >> > > > > the bitstream to the FPGA.
>> >> > > >
>> >> > > > Sorry for the confusion, the interface_id is used to indicate the version of
>> >> > > > the hardware for partial reconfiguration (it's part of the static image of
>> >> > > > the FPGA device). Will improve the description on this.
>> >> > > >
>> >> > >
>> >> > > The interface_id expresses the compatibility of the static region with PR
>> >> > > bitstreams generated for it. It changes every time a new static region is
>> >> > > generated.
>> >> > >
>> >> > > Would it make more sense to have the interface_id exposed as part of the FME
>> >> > > device (which represents the static region)? I'm not sure - it kind of also
>> >> > > makes sense here, where you would have all the information in one place (if the
>> >> > > interface_id matches, I can use this component to program a bitstream).
>> >> >
>> >> > Hi Enno
>> >> >
>> >> > Yes, this interface is under fpga-dfl-fme-mgr.0, and fpga-dfl-fme-mgr.0 is
>> >> > under fpga-dfl-fme.0. It's part of the FME device for sure. From another
>> >> > point of view, it means if anyone wants to do PR on this Intel FPGA device,
>> >> > he needs to find the FME device firstly, and then check if any fpga manager
>> >> > created under this FME device, if yes, check the interface_id before PR via
>> >> > the FME device node ioctl.
>> >>
>> >> That sounds good, thank you!
>> >>
>> >> >
>> >> > >
>> >> > > Sorry for my limited understanding of the infrastructure - would this same
>> >> > > "fpga-dfl-fme-mgr.0" be used for PR if we had multiple PR regions? In that case
>> >> > > it would need to expose multiple interface_ids (or we'd have to track both
>> >> > > interface IDs and an identifier for the target PR region).
>> >> >
>> >> > Yes, the fpga manager could be shared with different PR regions.
>> >> >
>> >> > Sorry, I'm not sure where we need to expose multiple interface_ids and why.
>> >>
>> >> It's basically a question of how to determine bitstream compatibility - either,
>> >> there's a separate interface_id per reconfigurable region, or there is a single
>> >> interface_id for the entire device. Both make sense from a certain perspective.
>> >>
>> >> If there are multiple interface_ids per device (one per region), the driver
>> >> would need to expose all of them. If there's only a single one, the driver only
>> >> exposes that one ID - compatibility would be determined by looking at both that
>> >> single interface_id _and_ the identifier/number of the targeted region.
>> >>
>> >> I would prefer a separate interface_id per region - it seems more generic and
>> >> flexible.
>>
>> Hi Enno,
>>
>> I agree with this.
>>
>> >
>> > It's possible to have per region interface_id (or even both per dev interface_id
>> > and per region interface_id at the same time), but per FME PR sub feature
>> > implementation, it supports multiple PR regions, but only provide one interface
>> > id, so at least in this case, it's not per-region information per my
>> > understanding. We can consider it later when hardware really supports it. : )
>>
>> Hi Hao,
>>
>> I understand that in the case of this PR hardware, the region to
>> program is selected when the region_id to program is written to a PR
>> hardware control register. For another example, Arria10 has a hard PR
>> hardware and the PR bitstream lands in the area of the FPGA for which
>> it was compiled. In that case, for the PR bitstream to be compatible
>> with a PR region, the layout of the edge connections also needs to be
>> compatible, so compatibility is per-region in that case instead of
>> per-PR hardware.
>
> Hi Alan,
>
> Thanks a lot for the explanation. :)
>
> I fully understand the consideration of adding per-region interface_id.
>
>> And besides, as I said yesterday, the hard PR
>> hardware would not know what the static region ID is when this
>> framework is used with such a device.
>
> Yes, is it possible that hard PR hardware with different versions, requires
> different images or different methods for compatibility checking?

Because it is really hardware and not something in the FPGA fabric,
the hard PR hardware isn't going to change versions very often. It
has to be designed to be flexible and not add any constraints on the
PR regions. If some feature is added or a bug is fixed, that's just a
driver issue at most and should not affect PR region compatibility.
PR region compatibility would only be dependent on the static FPGA
image and the regions that are created in it. It could be exported in
terms of a single static region ID or per-region ID.

>
>>
>> That's why I think making the id per-region may be more future proof,
>> even if it may see unnecessary in the case of the original blue bits
>> this was written for.
>
> I feel that per-PR hardware interface id is useful in some cases, and maybe
> in some cases, both per-PR hardware and per-region interface ids are needed
> for its compatibility checking, so shall we leave developers to decide to
> implement per-PR hardware or per-region or both interface ids based on their
> own hardware implementations? How do you think? :)

That gives us 3 sets of id's. Seems overly complicated and the
userspace would have to figure out which set of id's to use. I want
to see an interface that isn't more complicated than it needs to be
but still can be expected to be ok for the future (as far as we can
anticipate).

Would per-region id's cause any problems that you can see? I
understand that the region id's would all be the same value for a
given PR hardware in your use case, but that doesn't seem like it
would be hard to implement or that it opens up some possible failure.

Alan

2018-02-07 05:02:22

by Wu Hao

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

On Tue, Feb 06, 2018 at 12:53:44PM -0600, Alan Tull wrote:
> On Tue, Feb 6, 2018 at 12:47 AM, Wu Hao <[email protected]> wrote:
> > On Mon, Feb 05, 2018 at 10:25:54PM -0600, Alan Tull wrote:
> >> On Mon, Feb 5, 2018 at 7:47 PM, Wu Hao <[email protected]> wrote:
> >> > On Mon, Feb 05, 2018 at 10:36:45AM -0800, Luebbers, Enno wrote:
> >> >> Hi Hao,
> >> >>
> >> >> On Sun, Feb 04, 2018 at 05:37:06PM +0800, Wu Hao wrote:
> >> >> > On Fri, Feb 02, 2018 at 04:26:26PM -0800, Luebbers, Enno wrote:
> >> >> > > Hi Hao, Alan,
> >> >> > >
> >> >> > > On Fri, Feb 02, 2018 at 05:42:13PM +0800, Wu Hao wrote:
> >> >> > > > On Thu, Feb 01, 2018 at 04:00:36PM -0600, Alan Tull wrote:
> >> >> > > > > On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:
> >> >> > > > >
> >> >> > > > > Hi Hao,
> >> >> > > > >
> >> >> > > > > A few comments below. Besides that, looks good.
> >> >> > > > >
> >> >> > > > > > This patch adds fpga manager driver for FPGA Management Engine (FME). It
> >> >> > > > > > implements fpga_manager_ops for FPGA Partial Reconfiguration function.
> >> >> > > > > >
> >> >> > > > > > Signed-off-by: Tim Whisonant <[email protected]>
> >> >> > > > > > Signed-off-by: Enno Luebbers <[email protected]>
> >> >> > > > > > Signed-off-by: Shiva Rao <[email protected]>
> >> >> > > > > > Signed-off-by: Christopher Rauer <[email protected]>
> >> >> > > > > > Signed-off-by: Kang Luwei <[email protected]>
> >> >> > > > > > Signed-off-by: Xiao Guangrong <[email protected]>
> >> >> > > > > > Signed-off-by: Wu Hao <[email protected]>
> >> >> > > > > > ----
> >> >> > > > > > v3: rename driver to dfl-fpga-fme-mgr
> >> >> > > > > > implemented status callback for fpga manager
> >> >> > > > > > rebased due to fpga api changes
> >> >> > > > > > ---
> >> >> > > > > > .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
> >> >> > > > > > drivers/fpga/Kconfig | 6 +
> >> >> > > > > > drivers/fpga/Makefile | 1 +
> >> >> > > > > > drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
> >> >> > > > > > drivers/fpga/fpga-dfl.h | 39 ++-
> >> >> > > > > > 5 files changed, 371 insertions(+), 1 deletion(-)
> >> >> > > > > > create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> >> >> > > > > > create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
> >> >> > > > > >
> >> >> > > > > > diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> >> >> > > > > > new file mode 100644
> >> >> > > > > > index 0000000..2d4f917
> >> >> > > > > > --- /dev/null
> >> >> > > > > > +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
> >> >> > > > > > @@ -0,0 +1,8 @@
> >> >> > > > > > +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
> >> >> > > > > > +Date: November 2017
> >> >> > > > > > +KernelVersion: 4.15
> >> >> > > > > > +Contact: Wu Hao <[email protected]>
> >> >> > > > > > +Description: Read-only. It returns interface id of partial reconfiguration
> >> >> > > > > > + hardware. Userspace could use this information to check if
> >> >> > > > > > + current hardware is compatible with given image before FPGA
> >> >> > > > > > + programming.
> >> >> > > > >
> >> >> > > > > I'm a little confused by this. I can understand that the PR bitstream
> >> >> > > > > has a dependency on the FPGA's static image, but I don't understand
> >> >> > > > > the dependency of the bistream on the hardware that is used to program
> >> >> > > > > the bitstream to the FPGA.
> >> >> > > >
> >> >> > > > Sorry for the confusion, the interface_id is used to indicate the version of
> >> >> > > > the hardware for partial reconfiguration (it's part of the static image of
> >> >> > > > the FPGA device). Will improve the description on this.
> >> >> > > >
> >> >> > >
> >> >> > > The interface_id expresses the compatibility of the static region with PR
> >> >> > > bitstreams generated for it. It changes every time a new static region is
> >> >> > > generated.
> >> >> > >
> >> >> > > Would it make more sense to have the interface_id exposed as part of the FME
> >> >> > > device (which represents the static region)? I'm not sure - it kind of also
> >> >> > > makes sense here, where you would have all the information in one place (if the
> >> >> > > interface_id matches, I can use this component to program a bitstream).
> >> >> >
> >> >> > Hi Enno
> >> >> >
> >> >> > Yes, this interface is under fpga-dfl-fme-mgr.0, and fpga-dfl-fme-mgr.0 is
> >> >> > under fpga-dfl-fme.0. It's part of the FME device for sure. From another
> >> >> > point of view, it means if anyone wants to do PR on this Intel FPGA device,
> >> >> > he needs to find the FME device firstly, and then check if any fpga manager
> >> >> > created under this FME device, if yes, check the interface_id before PR via
> >> >> > the FME device node ioctl.
> >> >>
> >> >> That sounds good, thank you!
> >> >>
> >> >> >
> >> >> > >
> >> >> > > Sorry for my limited understanding of the infrastructure - would this same
> >> >> > > "fpga-dfl-fme-mgr.0" be used for PR if we had multiple PR regions? In that case
> >> >> > > it would need to expose multiple interface_ids (or we'd have to track both
> >> >> > > interface IDs and an identifier for the target PR region).
> >> >> >
> >> >> > Yes, the fpga manager could be shared with different PR regions.
> >> >> >
> >> >> > Sorry, I'm not sure where we need to expose multiple interface_ids and why.
> >> >>
> >> >> It's basically a question of how to determine bitstream compatibility - either,
> >> >> there's a separate interface_id per reconfigurable region, or there is a single
> >> >> interface_id for the entire device. Both make sense from a certain perspective.
> >> >>
> >> >> If there are multiple interface_ids per device (one per region), the driver
> >> >> would need to expose all of them. If there's only a single one, the driver only
> >> >> exposes that one ID - compatibility would be determined by looking at both that
> >> >> single interface_id _and_ the identifier/number of the targeted region.
> >> >>
> >> >> I would prefer a separate interface_id per region - it seems more generic and
> >> >> flexible.
> >>
> >> Hi Enno,
> >>
> >> I agree with this.
> >>
> >> >
> >> > It's possible to have per region interface_id (or even both per dev interface_id
> >> > and per region interface_id at the same time), but per FME PR sub feature
> >> > implementation, it supports multiple PR regions, but only provide one interface
> >> > id, so at least in this case, it's not per-region information per my
> >> > understanding. We can consider it later when hardware really supports it. : )
> >>
> >> Hi Hao,
> >>
> >> I understand that in the case of this PR hardware, the region to
> >> program is selected when the region_id to program is written to a PR
> >> hardware control register. For another example, Arria10 has a hard PR
> >> hardware and the PR bitstream lands in the area of the FPGA for which
> >> it was compiled. In that case, for the PR bitstream to be compatible
> >> with a PR region, the layout of the edge connections also needs to be
> >> compatible, so compatibility is per-region in that case instead of
> >> per-PR hardware.
> >
> > Hi Alan,
> >
> > Thanks a lot for the explanation. :)
> >
> > I fully understand the consideration of adding per-region interface_id.
> >
> >> And besides, as I said yesterday, the hard PR
> >> hardware would not know what the static region ID is when this
> >> framework is used with such a device.
> >
> > Yes, is it possible that hard PR hardware with different versions, requires
> > different images or different methods for compatibility checking?
>
> Because it is really hardware and not something in the FPGA fabric,
> the hard PR hardware isn't going to change versions very often. It
> has to be designed to be flexible and not add any constraints on the
> PR regions. If some feature is added or a bug is fixed, that's just a
> driver issue at most and should not affect PR region compatibility.
> PR region compatibility would only be dependent on the static FPGA
> image and the regions that are created in it. It could be exported in
> terms of a single static region ID or per-region ID.
>
> >
> >>
> >> That's why I think making the id per-region may be more future proof,
> >> even if it may see unnecessary in the case of the original blue bits
> >> this was written for.
> >
> > I feel that per-PR hardware interface id is useful in some cases, and maybe
> > in some cases, both per-PR hardware and per-region interface ids are needed
> > for its compatibility checking, so shall we leave developers to decide to
> > implement per-PR hardware or per-region or both interface ids based on their
> > own hardware implementations? How do you think? :)
>
> That gives us 3 sets of id's. Seems overly complicated and the
> userspace would have to figure out which set of id's to use. I want
> to see an interface that isn't more complicated than it needs to be
> but still can be expected to be ok for the future (as far as we can
> anticipate).
>
> Would per-region id's cause any problems that you can see? I
> understand that the region id's would all be the same value for a
> given PR hardware in your use case, but that doesn't seem like it
> would be hard to implement or that it opens up some possible failure.

Hi Alan

No, I don't have any concern on per-region id at all. I will remove this
sysfs interface_id interface from this driver, and another patch to add
a common sysfs interface under the fpga-region. I feel maybe we can use
"compat_id" instead of "id" here to avoid confusion for that sysfs
interface under fpga-region, as we added a region_id to fpga_image_info,
and they have totally different purposes. How do you think?

Thanks
Hao

2018-02-07 22:39:07

by Alan Tull

[permalink] [raw]
Subject: Re: [PATCH v3 14/21] fpga: dfl: add fpga manager platform driver for FME

On Tue, Feb 6, 2018 at 10:52 PM, Wu Hao <[email protected]> wrote:
> On Tue, Feb 06, 2018 at 12:53:44PM -0600, Alan Tull wrote:
>> On Tue, Feb 6, 2018 at 12:47 AM, Wu Hao <[email protected]> wrote:
>> > On Mon, Feb 05, 2018 at 10:25:54PM -0600, Alan Tull wrote:
>> >> On Mon, Feb 5, 2018 at 7:47 PM, Wu Hao <[email protected]> wrote:
>> >> > On Mon, Feb 05, 2018 at 10:36:45AM -0800, Luebbers, Enno wrote:
>> >> >> Hi Hao,
>> >> >>
>> >> >> On Sun, Feb 04, 2018 at 05:37:06PM +0800, Wu Hao wrote:
>> >> >> > On Fri, Feb 02, 2018 at 04:26:26PM -0800, Luebbers, Enno wrote:
>> >> >> > > Hi Hao, Alan,
>> >> >> > >
>> >> >> > > On Fri, Feb 02, 2018 at 05:42:13PM +0800, Wu Hao wrote:
>> >> >> > > > On Thu, Feb 01, 2018 at 04:00:36PM -0600, Alan Tull wrote:
>> >> >> > > > > On Mon, Nov 27, 2017 at 12:42 AM, Wu Hao <[email protected]> wrote:
>> >> >> > > > >
>> >> >> > > > > Hi Hao,
>> >> >> > > > >
>> >> >> > > > > A few comments below. Besides that, looks good.
>> >> >> > > > >
>> >> >> > > > > > This patch adds fpga manager driver for FPGA Management Engine (FME). It
>> >> >> > > > > > implements fpga_manager_ops for FPGA Partial Reconfiguration function.
>> >> >> > > > > >
>> >> >> > > > > > Signed-off-by: Tim Whisonant <[email protected]>
>> >> >> > > > > > Signed-off-by: Enno Luebbers <[email protected]>
>> >> >> > > > > > Signed-off-by: Shiva Rao <[email protected]>
>> >> >> > > > > > Signed-off-by: Christopher Rauer <[email protected]>
>> >> >> > > > > > Signed-off-by: Kang Luwei <[email protected]>
>> >> >> > > > > > Signed-off-by: Xiao Guangrong <[email protected]>
>> >> >> > > > > > Signed-off-by: Wu Hao <[email protected]>
>> >> >> > > > > > ----
>> >> >> > > > > > v3: rename driver to dfl-fpga-fme-mgr
>> >> >> > > > > > implemented status callback for fpga manager
>> >> >> > > > > > rebased due to fpga api changes
>> >> >> > > > > > ---
>> >> >> > > > > > .../ABI/testing/sysfs-platform-fpga-dfl-fme-mgr | 8 +
>> >> >> > > > > > drivers/fpga/Kconfig | 6 +
>> >> >> > > > > > drivers/fpga/Makefile | 1 +
>> >> >> > > > > > drivers/fpga/fpga-dfl-fme-mgr.c | 318 +++++++++++++++++++++
>> >> >> > > > > > drivers/fpga/fpga-dfl.h | 39 ++-
>> >> >> > > > > > 5 files changed, 371 insertions(+), 1 deletion(-)
>> >> >> > > > > > create mode 100644 Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
>> >> >> > > > > > create mode 100644 drivers/fpga/fpga-dfl-fme-mgr.c
>> >> >> > > > > >
>> >> >> > > > > > diff --git a/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
>> >> >> > > > > > new file mode 100644
>> >> >> > > > > > index 0000000..2d4f917
>> >> >> > > > > > --- /dev/null
>> >> >> > > > > > +++ b/Documentation/ABI/testing/sysfs-platform-fpga-dfl-fme-mgr
>> >> >> > > > > > @@ -0,0 +1,8 @@
>> >> >> > > > > > +What: /sys/bus/platform/devices/fpga-dfl-fme-mgr.0/interface_id
>> >> >> > > > > > +Date: November 2017
>> >> >> > > > > > +KernelVersion: 4.15
>> >> >> > > > > > +Contact: Wu Hao <[email protected]>
>> >> >> > > > > > +Description: Read-only. It returns interface id of partial reconfiguration
>> >> >> > > > > > + hardware. Userspace could use this information to check if
>> >> >> > > > > > + current hardware is compatible with given image before FPGA
>> >> >> > > > > > + programming.
>> >> >> > > > >
>> >> >> > > > > I'm a little confused by this. I can understand that the PR bitstream
>> >> >> > > > > has a dependency on the FPGA's static image, but I don't understand
>> >> >> > > > > the dependency of the bistream on the hardware that is used to program
>> >> >> > > > > the bitstream to the FPGA.
>> >> >> > > >
>> >> >> > > > Sorry for the confusion, the interface_id is used to indicate the version of
>> >> >> > > > the hardware for partial reconfiguration (it's part of the static image of
>> >> >> > > > the FPGA device). Will improve the description on this.
>> >> >> > > >
>> >> >> > >
>> >> >> > > The interface_id expresses the compatibility of the static region with PR
>> >> >> > > bitstreams generated for it. It changes every time a new static region is
>> >> >> > > generated.
>> >> >> > >
>> >> >> > > Would it make more sense to have the interface_id exposed as part of the FME
>> >> >> > > device (which represents the static region)? I'm not sure - it kind of also
>> >> >> > > makes sense here, where you would have all the information in one place (if the
>> >> >> > > interface_id matches, I can use this component to program a bitstream).
>> >> >> >
>> >> >> > Hi Enno
>> >> >> >
>> >> >> > Yes, this interface is under fpga-dfl-fme-mgr.0, and fpga-dfl-fme-mgr.0 is
>> >> >> > under fpga-dfl-fme.0. It's part of the FME device for sure. From another
>> >> >> > point of view, it means if anyone wants to do PR on this Intel FPGA device,
>> >> >> > he needs to find the FME device firstly, and then check if any fpga manager
>> >> >> > created under this FME device, if yes, check the interface_id before PR via
>> >> >> > the FME device node ioctl.
>> >> >>
>> >> >> That sounds good, thank you!
>> >> >>
>> >> >> >
>> >> >> > >
>> >> >> > > Sorry for my limited understanding of the infrastructure - would this same
>> >> >> > > "fpga-dfl-fme-mgr.0" be used for PR if we had multiple PR regions? In that case
>> >> >> > > it would need to expose multiple interface_ids (or we'd have to track both
>> >> >> > > interface IDs and an identifier for the target PR region).
>> >> >> >
>> >> >> > Yes, the fpga manager could be shared with different PR regions.
>> >> >> >
>> >> >> > Sorry, I'm not sure where we need to expose multiple interface_ids and why.
>> >> >>
>> >> >> It's basically a question of how to determine bitstream compatibility - either,
>> >> >> there's a separate interface_id per reconfigurable region, or there is a single
>> >> >> interface_id for the entire device. Both make sense from a certain perspective.
>> >> >>
>> >> >> If there are multiple interface_ids per device (one per region), the driver
>> >> >> would need to expose all of them. If there's only a single one, the driver only
>> >> >> exposes that one ID - compatibility would be determined by looking at both that
>> >> >> single interface_id _and_ the identifier/number of the targeted region.
>> >> >>
>> >> >> I would prefer a separate interface_id per region - it seems more generic and
>> >> >> flexible.
>> >>
>> >> Hi Enno,
>> >>
>> >> I agree with this.
>> >>
>> >> >
>> >> > It's possible to have per region interface_id (or even both per dev interface_id
>> >> > and per region interface_id at the same time), but per FME PR sub feature
>> >> > implementation, it supports multiple PR regions, but only provide one interface
>> >> > id, so at least in this case, it's not per-region information per my
>> >> > understanding. We can consider it later when hardware really supports it. : )
>> >>
>> >> Hi Hao,
>> >>
>> >> I understand that in the case of this PR hardware, the region to
>> >> program is selected when the region_id to program is written to a PR
>> >> hardware control register. For another example, Arria10 has a hard PR
>> >> hardware and the PR bitstream lands in the area of the FPGA for which
>> >> it was compiled. In that case, for the PR bitstream to be compatible
>> >> with a PR region, the layout of the edge connections also needs to be
>> >> compatible, so compatibility is per-region in that case instead of
>> >> per-PR hardware.
>> >
>> > Hi Alan,
>> >
>> > Thanks a lot for the explanation. :)
>> >
>> > I fully understand the consideration of adding per-region interface_id.
>> >
>> >> And besides, as I said yesterday, the hard PR
>> >> hardware would not know what the static region ID is when this
>> >> framework is used with such a device.
>> >
>> > Yes, is it possible that hard PR hardware with different versions, requires
>> > different images or different methods for compatibility checking?
>>
>> Because it is really hardware and not something in the FPGA fabric,
>> the hard PR hardware isn't going to change versions very often. It
>> has to be designed to be flexible and not add any constraints on the
>> PR regions. If some feature is added or a bug is fixed, that's just a
>> driver issue at most and should not affect PR region compatibility.
>> PR region compatibility would only be dependent on the static FPGA
>> image and the regions that are created in it. It could be exported in
>> terms of a single static region ID or per-region ID.
>>
>> >
>> >>
>> >> That's why I think making the id per-region may be more future proof,
>> >> even if it may see unnecessary in the case of the original blue bits
>> >> this was written for.
>> >
>> > I feel that per-PR hardware interface id is useful in some cases, and maybe
>> > in some cases, both per-PR hardware and per-region interface ids are needed
>> > for its compatibility checking, so shall we leave developers to decide to
>> > implement per-PR hardware or per-region or both interface ids based on their
>> > own hardware implementations? How do you think? :)
>>
>> That gives us 3 sets of id's. Seems overly complicated and the
>> userspace would have to figure out which set of id's to use. I want
>> to see an interface that isn't more complicated than it needs to be
>> but still can be expected to be ok for the future (as far as we can
>> anticipate).
>>
>> Would per-region id's cause any problems that you can see? I
>> understand that the region id's would all be the same value for a
>> given PR hardware in your use case, but that doesn't seem like it
>> would be hard to implement or that it opens up some possible failure.
>
> Hi Alan
>
> No, I don't have any concern on per-region id at all. I will remove this
> sysfs interface_id interface from this driver, and another patch to add
> a common sysfs interface under the fpga-region. I feel maybe we can use
> "compat_id" instead of "id" here to avoid confusion for that sysfs
> interface under fpga-region, as we added a region_id to fpga_image_info,
> and they have totally different purposes. How do you think?

Hi Hao,

Sounds good. Yes, that that would keep things clear.

Alan

>
> Thanks
> Hao