Received: by 2002:a25:4158:0:0:0:0:0 with SMTP id o85csp1480956yba; Sun, 5 May 2019 07:17:45 -0700 (PDT) X-Google-Smtp-Source: APXvYqz0LBD8furR2IW0aNfK0tjiwB8s3w1Q9/m8wLIkwcpjUMKFx+6tjHOTHWWMnfDAFWz0uH/1 X-Received: by 2002:a63:f250:: with SMTP id d16mr25087409pgk.168.1557065865432; Sun, 05 May 2019 07:17:45 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1557065865; cv=none; d=google.com; s=arc-20160816; b=XypMzKvagOnPKtYpXyriYSwnk2jN5XXgWVj/RmgXYNZ2/No8URpdBCRXKNJjLjeuF0 H6Gi+aO7HropSdTvvx2yNwQgunsGsXS14eURkgVOXUr2vyO1VyvtJp0HAJRPpcrK8ARW gip4GOCzSRzWRqRP4whfQxmpkKEvbTkoGIOyDg4+mMH77PtuptkGtXQ8SDVuz50+XtVy wY/mPVg9EOs5w3ciTo5I6qWiZkLfKcdFHqEdD5MQV3uYJgYoWeJFCja/Nvn/IFH6fxZx /HvYgYKgVPFj8Jo5XK2Ho2hrSonDd5IFzLb1UWjOfhootyAk4pZWX6CBQBgiqHA7ZZYi W1gQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:references:cc:to:from:subject:dkim-signature; bh=my3CSpaG+RIjFLu6AGMT/Gf6hCTEG4E32HO0YjpZBJA=; b=hMrQ9nzgLlSxd1DpWBuyu7tBfJRNa9zlp2x3tDWmkHJTUunB1DFBUi9NaVr7gqcnJu HOPB+ifdKFkK8HasOROSEZC3MUJL442if0j+kMhAo7zY9PM9c0/XK298M4d/I/7sOAOq KSmXZ3QtgeWPsIRDc1kzfrJOACbtOg4nJWWLZceP81qIGCjtjauFm+uTO2/9Eo8DxcfL 5HOGb7aF1wGfY/4FAQD3SbMWsQ7S8hZQ5gU3CB1Dhw8Ts12OMZeS4WtH6sk8Bw+B8oCX KN0DU3HuuTWgvZDjTiQAqT9AtzzH3B2GgrLX9bcDhLqOGG7WAAbVrqGliMawwlx1DT1p pxvQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=F57lAOeK; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id i18si8756426pfd.64.2019.05.05.07.17.17; Sun, 05 May 2019 07:17:45 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=F57lAOeK; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727916AbfEENjt (ORCPT + 99 others); Sun, 5 May 2019 09:39:49 -0400 Received: from mail-it1-f194.google.com ([209.85.166.194]:37326 "EHLO mail-it1-f194.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726736AbfEENjt (ORCPT ); Sun, 5 May 2019 09:39:49 -0400 Received: by mail-it1-f194.google.com with SMTP id r85so16143454itc.2; Sun, 05 May 2019 06:39:48 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=subject:from:to:cc:references:message-id:date:user-agent :mime-version:in-reply-to:content-language:content-transfer-encoding; bh=my3CSpaG+RIjFLu6AGMT/Gf6hCTEG4E32HO0YjpZBJA=; b=F57lAOeKRicZuWnLZ5NO+0hL+F0rT8yqqduQat+twwN5kdHq3NASv+3t+FFf5fCjAy AMqDleT/M+FMIntid8gwyk/S3H5w9o2qxcNLOwafN5aAh7qCj/uuK/nKuL3T9COfqsIj CZRuxC0so4sOQrGuaU8iVSME++YjqD+qWiJ7tSg85Lfip9/NYPqrZ4iEDtwDkTuHqRof 6N4ZG0EN5FO7wClGqHLf1fhv2kBon5T2CRxW9PdtbLGbjERif3aTRALGtnwbMCFdfCx9 jUqJLJvYrLooYYQ1hgEt4SluLS9xlFmdaC2xKitA+Ts17kuNqx0vIODFjKNPCYBuggig HvQw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:from:to:cc:references:message-id:date :user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=my3CSpaG+RIjFLu6AGMT/Gf6hCTEG4E32HO0YjpZBJA=; b=oP5azT2ef13tfV5b74rFcMiIOrduQ7dWLDtU3FeJmVZCKEO6rOsjaQvlEA1S5zI34z /Lv7+ROgtG0lZgJfAMepXOcBp9+UtlVO+KzfEBTiEwjMtsTj6foeEzjHf0MJYHp/IMGT YFcseFYCK3sYqb7ZAko/RDd6OcPIvaL3Dc1HKK0Xnw7nIFaWmo2xzYj0sxWlE1BClSyw xGRsH7ei6a7bHYei9Ic1pGJyMe8ELO4Skxvy09CrrfAbliLRVF/OFMzSDXpgSeZ2fucU /zzFNrTmQhD5ijjnM0FhEkmKcpXM/Xf8IzkryHLcR3A38/wkjyW2oDGZEK4wO3StiE5U w1LQ== X-Gm-Message-State: APjAAAV7W17qayIIOZhu2EnPj7nykrpuQXuDAJ7ZQLmhFG/ag8838AKe K5FjLnIvF32EzDUdg04qZSC/38si X-Received: by 2002:a02:a890:: with SMTP id l16mr2853199jam.137.1557063587796; Sun, 05 May 2019 06:39:47 -0700 (PDT) Received: from [192.168.2.145] (ppp94-29-35-107.pppoe.spdop.ru. [94.29.35.107]) by smtp.googlemail.com with ESMTPSA id v187sm8959811ita.0.2019.05.05.06.39.45 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Sun, 05 May 2019 06:39:47 -0700 (PDT) Subject: Re: [PATCH] dma: tegra: add accurate reporting of dma state From: Dmitry Osipenko To: Ben Dooks , linux-kernel@lists.codethink.co.uk Cc: Laxman Dewangan , Jon Hunter , Vinod Koul , Dan Williams , Thierry Reding , dmaengine@vger.kernel.org, linux-tegra@vger.kernel.org, linux-kernel@vger.kernel.org References: <20190424162348.23692-1-ben.dooks@codethink.co.uk> <6631ad02-0e3c-d768-8c23-fd1f091402df@gmail.com> Message-ID: <6cceabe0-ecfa-e241-a937-5a7c9761820a@gmail.com> Date: Sun, 5 May 2019 16:39:43 +0300 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.6.1 MIME-Version: 1.0 In-Reply-To: <6631ad02-0e3c-d768-8c23-fd1f091402df@gmail.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org 04.05.2019 19:06, Dmitry Osipenko пишет: > 01.05.2019 11:58, Ben Dooks пишет: >> On 24/04/2019 19:17, Dmitry Osipenko wrote: >>> 24.04.2019 19:23, Ben Dooks пишет: >>>> The tx_status callback does not report the state of the transfer >>>> beyond complete segments. This causes problems with users such as >>>> ALSA when applications want to know accurately how much data has >>>> been moved. >>>> >>>> This patch addes a function tegra_dma_update_residual() to query >>>> the hardware and modify the residual information accordinly. It >>>> takes into account any hardware issues when trying to read the >>>> state, such as delays between finishing a buffer and signalling >>>> the interrupt. >>>> >>>> Signed-off-by: Ben Dooks >>> >>> Hello Ben, >>> >>> Thank you very much for keeping it up. I have couple comments, please >>> see them below. >>> >>>> Cc: Dmitry Osipenko >>>> Cc: Laxman Dewangan (supporter:TEGRA DMA DRIVERS) >>>> Cc: Jon Hunter (supporter:TEGRA DMA DRIVERS) >>>> Cc: Vinod Koul (maintainer:DMA GENERIC OFFLOAD >>>> ENGINE SUBSYSTEM) >>>> Cc: Dan Williams (reviewer:ASYNCHRONOUS >>>> TRANSFERS/TRANSFORMS (IOAT) API) >>>> Cc: Thierry Reding (supporter:TEGRA >>>> ARCHITECTURE SUPPORT) >>>> Cc: dmaengine@vger.kernel.org (open list:DMA GENERIC OFFLOAD ENGINE >>>> SUBSYSTEM) >>>> Cc: linux-tegra@vger.kernel.org (open list:TEGRA ARCHITECTURE SUPPORT) >>>> Cc: linux-kernel@vger.kernel.org (open list) >>>> --- >>>>   drivers/dma/tegra20-apb-dma.c | 92 ++++++++++++++++++++++++++++++++--- >>>>   1 file changed, 86 insertions(+), 6 deletions(-) >>>> >>>> diff --git a/drivers/dma/tegra20-apb-dma.c >>>> b/drivers/dma/tegra20-apb-dma.c >>>> index cf462b1abc0b..544e7273e741 100644 >>>> --- a/drivers/dma/tegra20-apb-dma.c >>>> +++ b/drivers/dma/tegra20-apb-dma.c >>>> @@ -808,6 +808,90 @@ static int tegra_dma_terminate_all(struct >>>> dma_chan *dc) >>>>       return 0; >>>>   } >>>>   +static unsigned int tegra_dma_update_residual(struct >>>> tegra_dma_channel *tdc, >>>> +                          struct tegra_dma_sg_req *sg_req, >>>> +                          struct tegra_dma_desc *dma_desc, >>>> +                          unsigned int residual) >>>> +{ >>>> +    unsigned long status = 0x0; >>>> +    unsigned long wcount; >>>> +    unsigned long ahbptr; >>>> +    unsigned long tmp = 0x0; >>>> +    unsigned int result; >>> >>> You could pre-assign ahbptr=0xffffffff and result=residual here, then >>> you could remove all the duplicated assigns below. >> >> ok, ta. >> >>>> +    int retries = TEGRA_APBDMA_BURST_COMPLETE_TIME * 10; >>>> +    int done; >>>> + >>>> +    /* if we're not the current request, then don't alter the >>>> residual */ >>>> +    if (sg_req != list_first_entry(&tdc->pending_sg_req, >>>> +                       struct tegra_dma_sg_req, node)) { >>>> +        result = residual; >>>> +        ahbptr = 0xffffffff; >>>> +        goto done; >>>> +    } >>>> + >>>> +    /* loop until we have a reliable result for residual */ >>>> +    do { >>>> +        ahbptr = tdc_read(tdc, TEGRA_APBDMA_CHAN_AHBPTR); >>>> +        status = tdc_read(tdc, TEGRA_APBDMA_CHAN_STATUS); >>>> +        tmp =  tdc_read(tdc, 0x08);    /* total count for debug */ >>> >>> The "tmp" variable isn't used anywhere in the code, please remove it. >> >> must have been left over. >> >>>> + >>>> +        /* check status, if channel isn't busy then skip */ >>>> +        if (!(status & TEGRA_APBDMA_STATUS_BUSY)) { >>>> +            result = residual; >>>> +            break; >>>> +        } >>> >>> This doesn't look correct because TRM says "Busy bit gets set as soon >>> as a channel is enabled and gets cleared after transfer completes", >>> hence a cleared BUSY bit means that all transfers are completed and >>> result=residual is incorrect here. Given that there is a check for EOC >>> bit being set below, this hunk should be removed. >> >> I'll check notes, but see below. >> >>>> + >>>> +        /* if we've got an interrupt pending on the channel, don't >>>> +         * try and deal with the residue as the hardware has likely >>>> +         * moved on to the next buffer. return all data moved. >>>> +         */ >>>> +        if (status & TEGRA_APBDMA_STATUS_ISE_EOC) { >>>> +            result = residual - sg_req->req_len; >>>> +            break; >>>> +        } >>>> + >>>> +        if (tdc->tdma->chip_data->support_separate_wcount_reg) >>>> +            wcount = tdc_read(tdc, TEGRA_APBDMA_CHAN_WORD_TRANSFER); >>>> +        else >>>> +            wcount = status; >>>> + >>>> +        /* If the request is at the full point, then there is a >>>> +         * chance that we have read the status register in the >>>> +         * middle of the hardware reloading the next buffer. >>>> +         * >>>> +         * The sequence seems to be at the end of the buffer, to >>>> +         * load the new word count before raising the EOC flag (or >>>> +         * changing the ping-pong flag which could have also been >>>> +         * used to determine a new buffer). This  means there is a >>>> +         * small window where we cannot determine zero-done for the >>>> +         * current buffer, or moved to next buffer. >>>> +         * >>>> +         * If done shows 0, then retry the load, as it may hit the >>>> +         * above hardware race. We will either get a new value which >>>> +         * is from the first buffer, or we get an EOC (new buffer) >>>> +         * or both a new value and an EOC... >>>> +         */ >>>> +        done = get_current_xferred_count(tdc, sg_req, wcount); >>>> +        if (done != 0) { >>>> +            result = residual - done; >>>> +            break; >>>> +        } >>>> + >>>> +        ndelay(100); >>> >>> Please use udelay(1) because there is no ndelay on arm32 and >>> ndelay(100) is getting rounded up to 1usec. AFAIK, arm64 doesn't have >>> reliable ndelay on Tegra either because timer rate changes with the >>> CPU frequency scaling. >> >> I'll check, but last time it was implemented. This seems a backwards step. >> >>> Secondly done=0 isn't a error case, technically this could be the case >>> when tegra_dma_update_residual() is invoked just after starting the >>> transfer. Hence I think this do-while loop and timeout checking aren't >>> needed at all since done=0 is a perfectly valid case. >> >> this is not checking for an error, it's checking for a possible >> inaccurate reading. > > If you'll change reading order of the status / words registers like I > suggested, then there won't be a case for the inaccuracy. > > The EOC bit should be set atomically once transfer is finished, you > can't get wrapped around words count and EOC bit not being set. > > For oneshot transfer that runs with interrupt being disabled, the words > counter will stop at 0 and the unset BUSY bit will indicate that the > transfer is completed. > >>> >>> Altogether seems the tegra_dma_update_residual() could be reduced to: >>> >>> static unsigned int tegra_dma_update_residual(struct tegra_dma_channel >>> *tdc, >>>                           struct tegra_dma_sg_req *sg_req, >>>                           struct tegra_dma_desc *dma_desc, >>>                           unsigned int residual) >>> { >>>     unsigned long status, wcount; >>> >>>     if (list_is_first(&sg_req->node, &tdc->pending_sg_req)) >>>         return residual; >>> >>>     if (tdc->tdma->chip_data->support_separate_wcount_reg) >>>         wcount = tdc_read(tdc, TEGRA_APBDMA_CHAN_WORD_TRANSFER); >>> >>>     status = tdc_read(tdc, TEGRA_APBDMA_CHAN_STATUS); >>> >>>     if (!tdc->tdma->chip_data->support_separate_wcount_reg) >>>         wcount = status; >>> >>>     if (status & TEGRA_APBDMA_STATUS_ISE_EOC) >>>         return residual - sg_req->req_len; >>> >>>     return residual - get_current_xferred_count(tdc, sg_req, wcount); >>> } >> >> I'm not sure if that will work all the time. It took days of testing to >> get reliable error data for the cases we're looking for here. > > Could you please tell exactly what those cases are. I don't see when the > simplified variant could fail, but maybe I already forgot some extra > details about how APB DMA works. > > I tested the variant I'm suggesting (with the fixed typos and added > check for the BUSY bit) and it works absolutely fine, audio stuttering > issue is fixed, everything else works too. Please consider to use it for > the next version of the patch if there are no objections. > Actually the BUSY bit checking shouldn't be needed. I think it's a bug in the driver that it may not enable EOC interrupt and will send a patch to fix it.