Received: by 2002:a05:6a10:206:0:0:0:0 with SMTP id 6csp1665083pxj; Wed, 19 May 2021 10:59:06 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyno+DoHpP4n5Z78US5b8kjY4uF2naf9ofxg9I3nD7hoe51vwE0N5rm2R8RDUgqTSj8K29b X-Received: by 2002:a05:6402:19a:: with SMTP id r26mr310131edv.44.1621447146251; Wed, 19 May 2021 10:59:06 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1621447146; cv=none; d=google.com; s=arc-20160816; b=y4YhbEUr8CrxofLGlQ3/p0MUeKmSB6YFFgbGambpGsmE3SCz2SwZdKzlDbMuPcuUD9 ewsiDYiXhIBGoKr+RrHGRa9I7X+BfelWxMWLwOyWkGhJzu3W+ebQe9KbXDKBGGuPYUij aCRd+nS+BOtkJtSqv/ZzqniP3C6B9Cuw0BDVuTcDuxGZ9/t6hBJBA5oKEe8t41MFKUjj LvS3jmhSP28Q4y1d7Oc61kpNFQpl2IPeuPNNUPVbMebdmLevNeDkCfG8+/mWgc6t/r8q PyDvNesi8fmC2dk0ukgsmVuA9uoBdeR/AYBL2V/X7ofc1bsPpPk8gmhKZvFrmmzk/RCY FtHg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :organization:references:in-reply-to:message-id:subject:cc:to:from :date; bh=sLhmD0yPfu2DDPZzg8i4xtcabAEDDGTjvWlwhgVjJ5A=; b=kg+Mb/jPSyB37qSJ3P+gw8thSo6dgpKuj6kFbQOGNCbN6c1tg9URNhdNtiJlfJgy2G V4JbsIG5XIxvH8hEDglzb103ivSsDrgIPMqQ9VU4GLrxRcGDUpMi2ig4Clp9zd8l6kw9 kmK9jp9jVhyGSDpviF+x9oNh/BzSwGol0VFnsUu5SDX2dWhe3MRDME+c/1x8jEMdL/Le FMlGOM+ZrKgwaeaXARlZgDyA95+1VzovVeVXs9Z1KXl6HeBAkOort2C+NdaBW8ujduMn OQEQpUTlsiumOEworrtB/N+eGnXIv2a3iYzIWmH4HRStGGm/mosmn8XGbCduMDRFbjE9 0GYA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=collabora.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id m12si8018508edr.335.2021.05.19.10.58.42; Wed, 19 May 2021 10:59:06 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=collabora.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S245009AbhERLlE (ORCPT + 99 others); Tue, 18 May 2021 07:41:04 -0400 Received: from bhuna.collabora.co.uk ([46.235.227.227]:38774 "EHLO bhuna.collabora.co.uk" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S244264AbhERLlD (ORCPT ); Tue, 18 May 2021 07:41:03 -0400 Received: from localhost (unknown [IPv6:2a01:e0a:2c:6930:5cf4:84a1:2763:fe0d]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) (Authenticated sender: bbrezillon) by bhuna.collabora.co.uk (Postfix) with ESMTPSA id 108181F42F08; Tue, 18 May 2021 12:39:44 +0100 (BST) Date: Tue, 18 May 2021 13:39:40 +0200 From: Boris Brezillon To: Cc: Mark Brown , Miquel Raynal , Vignesh Raghavendra , , Alexandre Torgue , , , , , Subject: Re: [PATCH v3 3/3] spi: stm32-qspi: add automatic poll status feature Message-ID: <20210518133940.7b14addc@collabora.com> In-Reply-To: <20210518093951.23136-4-patrice.chotard@foss.st.com> References: <20210518093951.23136-1-patrice.chotard@foss.st.com> <20210518093951.23136-4-patrice.chotard@foss.st.com> Organization: Collabora X-Mailer: Claws Mail 3.17.8 (GTK+ 2.24.33; x86_64-redhat-linux-gnu) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, 18 May 2021 11:39:51 +0200 wrote: > From: Patrice Chotard > > STM32 QSPI is able to automatically poll a specified register inside the > memory and relieve the CPU from this task. > > As example, when erasing a large memory area, we got cpu load > equal to 50%. This patch allows to perform the same operation > with a cpu load around 2%. > > Signed-off-by: Christophe Kerello > Signed-off-by: Patrice Chotard > --- > Changes in v3: > - Remove spi_mem_finalize_op() API added in v2. > > Changes in v2: > - mask and match stm32_qspi_poll_status() parameters are 2-byte wide > - Make usage of new spi_mem_finalize_op() API in stm32_qspi_wait_poll_status() > > drivers/spi/spi-stm32-qspi.c | 83 ++++++++++++++++++++++++++++++++---- > 1 file changed, 75 insertions(+), 8 deletions(-) > > diff --git a/drivers/spi/spi-stm32-qspi.c b/drivers/spi/spi-stm32-qspi.c > index 7e640ccc7e77..01168a859005 100644 > --- a/drivers/spi/spi-stm32-qspi.c > +++ b/drivers/spi/spi-stm32-qspi.c > @@ -36,6 +36,7 @@ > #define CR_FTIE BIT(18) > #define CR_SMIE BIT(19) > #define CR_TOIE BIT(20) > +#define CR_APMS BIT(22) > #define CR_PRESC_MASK GENMASK(31, 24) > > #define QSPI_DCR 0x04 > @@ -53,6 +54,7 @@ > #define QSPI_FCR 0x0c > #define FCR_CTEF BIT(0) > #define FCR_CTCF BIT(1) > +#define FCR_CSMF BIT(3) > > #define QSPI_DLR 0x10 > > @@ -107,6 +109,7 @@ struct stm32_qspi { > u32 clk_rate; > struct stm32_qspi_flash flash[STM32_QSPI_MAX_NORCHIP]; > struct completion data_completion; > + struct completion match_completion; > u32 fmode; > > struct dma_chan *dma_chtx; > @@ -115,6 +118,7 @@ struct stm32_qspi { > > u32 cr_reg; > u32 dcr_reg; > + unsigned long status_timeout; > > /* > * to protect device configuration, could be different between > @@ -128,11 +132,20 @@ static irqreturn_t stm32_qspi_irq(int irq, void *dev_id) > struct stm32_qspi *qspi = (struct stm32_qspi *)dev_id; > u32 cr, sr; > > + cr = readl_relaxed(qspi->io_base + QSPI_CR); > sr = readl_relaxed(qspi->io_base + QSPI_SR); > > + if (cr & CR_SMIE && sr & SR_SMF) { > + /* disable irq */ > + cr &= ~CR_SMIE; > + writel_relaxed(cr, qspi->io_base + QSPI_CR); > + complete(&qspi->match_completion); > + > + return IRQ_HANDLED; > + } > + > if (sr & (SR_TEF | SR_TCF)) { > /* disable irq */ > - cr = readl_relaxed(qspi->io_base + QSPI_CR); > cr &= ~CR_TCIE & ~CR_TEIE; > writel_relaxed(cr, qspi->io_base + QSPI_CR); > complete(&qspi->data_completion); > @@ -319,6 +332,24 @@ static int stm32_qspi_wait_cmd(struct stm32_qspi *qspi, > return err; > } > > +static int stm32_qspi_wait_poll_status(struct stm32_qspi *qspi, > + const struct spi_mem_op *op) > +{ > + u32 cr; > + > + reinit_completion(&qspi->match_completion); > + cr = readl_relaxed(qspi->io_base + QSPI_CR); > + writel_relaxed(cr | CR_SMIE, qspi->io_base + QSPI_CR); > + > + if (!wait_for_completion_timeout(&qspi->match_completion, > + msecs_to_jiffies(qspi->status_timeout))) > + return -ETIMEDOUT; > + > + writel_relaxed(FCR_CSMF, qspi->io_base + QSPI_FCR); > + > + return 0; > +} > + > static int stm32_qspi_get_mode(struct stm32_qspi *qspi, u8 buswidth) > { > if (buswidth == 4) > @@ -332,7 +363,7 @@ static int stm32_qspi_send(struct spi_mem *mem, const struct spi_mem_op *op) > struct stm32_qspi *qspi = spi_controller_get_devdata(mem->spi->master); > struct stm32_qspi_flash *flash = &qspi->flash[mem->spi->chip_select]; > u32 ccr, cr; > - int timeout, err = 0; > + int timeout, err = 0, err_poll_status = 0; > > dev_dbg(qspi->dev, "cmd:%#x mode:%d.%d.%d.%d addr:%#llx len:%#x\n", > op->cmd.opcode, op->cmd.buswidth, op->addr.buswidth, > @@ -378,6 +409,9 @@ static int stm32_qspi_send(struct spi_mem *mem, const struct spi_mem_op *op) > if (op->addr.nbytes && qspi->fmode != CCR_FMODE_MM) > writel_relaxed(op->addr.val, qspi->io_base + QSPI_AR); > > + if (qspi->fmode == CCR_FMODE_APM) > + err_poll_status = stm32_qspi_wait_poll_status(qspi, op); > + > err = stm32_qspi_tx(qspi, op); > > /* > @@ -387,7 +421,7 @@ static int stm32_qspi_send(struct spi_mem *mem, const struct spi_mem_op *op) > * byte of device (device size - fifo size). like device size is not > * knows, the prefetching is always stop. > */ > - if (err || qspi->fmode == CCR_FMODE_MM) > + if (err || err_poll_status || qspi->fmode == CCR_FMODE_MM) > goto abort; > > /* wait end of tx in indirect mode */ > @@ -406,15 +440,46 @@ static int stm32_qspi_send(struct spi_mem *mem, const struct spi_mem_op *op) > cr, !(cr & CR_ABORT), 1, > STM32_ABT_TIMEOUT_US); > > - writel_relaxed(FCR_CTCF, qspi->io_base + QSPI_FCR); > + writel_relaxed(FCR_CTCF | FCR_CSMF, qspi->io_base + QSPI_FCR); > > - if (err || timeout) > - dev_err(qspi->dev, "%s err:%d abort timeout:%d\n", > - __func__, err, timeout); > + if (err || err_poll_status || timeout) > + dev_err(qspi->dev, "%s err:%d err_poll_status:%d abort timeout:%d\n", > + __func__, err, err_poll_status, timeout); > > return err; > } > > +static int stm32_qspi_poll_status(struct spi_mem *mem, const struct spi_mem_op *op, > + u16 mask, u16 match, > + unsigned long initial_delay_us, > + unsigned long polling_rate_us, > + unsigned long timeout_ms) > +{ > + struct stm32_qspi *qspi = spi_controller_get_devdata(mem->spi->master); > + int ret; > + Don't you have special constraints on the op that can be passed to poll status request (does it support more than 1byte of status?)? If not, I think you should at least call spi_mem_supports_op(). > + ret = pm_runtime_get_sync(qspi->dev); > + if (ret < 0) { > + pm_runtime_put_noidle(qspi->dev); > + return ret; > + } > + > + mutex_lock(&qspi->lock); > + > + writel_relaxed(mask, qspi->io_base + QSPI_PSMKR); > + writel_relaxed(match, qspi->io_base + QSPI_PSMAR); > + qspi->fmode = CCR_FMODE_APM; > + qspi->status_timeout = timeout_ms; > + > + ret = stm32_qspi_send(mem, op); > + mutex_unlock(&qspi->lock); > + > + pm_runtime_mark_last_busy(qspi->dev); > + pm_runtime_put_autosuspend(qspi->dev); > + > + return ret; > +} > + > static int stm32_qspi_exec_op(struct spi_mem *mem, const struct spi_mem_op *op) > { > struct stm32_qspi *qspi = spi_controller_get_devdata(mem->spi->master); > @@ -527,7 +592,7 @@ static int stm32_qspi_setup(struct spi_device *spi) > flash->presc = presc; > > mutex_lock(&qspi->lock); > - qspi->cr_reg = 3 << CR_FTHRES_SHIFT | CR_SSHIFT | CR_EN; > + qspi->cr_reg = CR_APMS | 3 << CR_FTHRES_SHIFT | CR_SSHIFT | CR_EN; > writel_relaxed(qspi->cr_reg, qspi->io_base + QSPI_CR); > > /* set dcr fsize to max address */ > @@ -607,6 +672,7 @@ static const struct spi_controller_mem_ops stm32_qspi_mem_ops = { > .exec_op = stm32_qspi_exec_op, > .dirmap_create = stm32_qspi_dirmap_create, > .dirmap_read = stm32_qspi_dirmap_read, > + .poll_status = stm32_qspi_poll_status, > }; > > static int stm32_qspi_probe(struct platform_device *pdev) > @@ -661,6 +727,7 @@ static int stm32_qspi_probe(struct platform_device *pdev) > } > > init_completion(&qspi->data_completion); > + init_completion(&qspi->match_completion); > > qspi->clk = devm_clk_get(dev, NULL); > if (IS_ERR(qspi->clk)) {