Received: by 2002:a05:6358:1087:b0:cb:c9d3:cd90 with SMTP id j7csp4388321rwi; Mon, 17 Oct 2022 05:45:39 -0700 (PDT) X-Google-Smtp-Source: AMsMyM6YrpOLULYnlk11ffoyypUjV+jclyf1sMgiTF1Cm9nKVjx3ZkSDItgNlB2F8y484UQTOxkx X-Received: by 2002:a17:907:2cd8:b0:78d:9c3c:d788 with SMTP id hg24-20020a1709072cd800b0078d9c3cd788mr8614387ejc.327.1666010739329; Mon, 17 Oct 2022 05:45:39 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1666010739; cv=none; d=google.com; s=arc-20160816; b=0pGDeviG5lbP8DhFr/ob7OuJiT2TLTx9JjTcWXdsLg4VksJxCwhf7sXLpAg9+PiXiX 8nB/4xYoAJAKa3R4OmFPWRHqzk7zNu+EXt8pPV9cqHFKfUKsZr+DUR4i0xbn9Abpgyj2 udmOi0kX/fchtlU4hMhQZOMBpEmqtXGTw0fflx+bWKKDuki+gyQbe/BNH+b/6egxapF1 npujAvrlbpi+P0GdqL2g+VM1nFPzBDzxuXwLchcCXllcZmq32bxHF//m6muLj5Kj5KhD vLmrc8D17Qzhewcnw+jmq2VHm8mq8/HNDyG44+AAJBrw++hnybAvXuyBDyWAXaR42o0M 0s6w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=CUEy7JMh05DpSgBaUDJR/zjmdMwfOrgvwqfnUrr/cFI=; b=HqMnh/D5671uFxPYdMMfj1ohaDoSzT6u6UfTYa0BpmC0P6V3dh5HA9EONUulS4zTh0 sPyYeCUWv9zLV2HFKiGlkDGMbumHHHTi8PYN8xjPV/aKbywgJvmQabwqd0iVdpyS9XoI dvDoxr05n9NQSXLNUBnJ6B1RRJN9OcBaodd12gMBBoN6kkFcZ5PfN2+95Twe1TRuC5hh QpNllvgcbezVhbcySxxUT3KBSDMAwp95vY/o0eeQSgoPSM9RxbJTJTCUsmKlirLuStzu mVTwdYjQsyNnIE/YFw9syQEzd4atyaLsGBqpRqH+DQuaBCeQUGdMI6spshv/DkKcJ8MZ SANg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=fklx5Z6I; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id k19-20020aa7c053000000b004573f60f27csi8293628edo.440.2022.10.17.05.45.12; Mon, 17 Oct 2022 05:45:39 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=fklx5Z6I; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230240AbiJQMgg (ORCPT + 99 others); Mon, 17 Oct 2022 08:36:36 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:55024 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229874AbiJQMgd (ORCPT ); Mon, 17 Oct 2022 08:36:33 -0400 Received: from dfw.source.kernel.org (dfw.source.kernel.org [IPv6:2604:1380:4641:c500::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 780941ADBD; Mon, 17 Oct 2022 05:36:32 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id 0733461117; Mon, 17 Oct 2022 12:36:32 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 69EECC4314C; Mon, 17 Oct 2022 12:36:31 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1666010191; bh=1DK6j1RdeLyk2nOKe2HKqu85PpZMd3dzDfvg7w5UV9k=; h=References:In-Reply-To:From:Date:Subject:To:Cc:From; b=fklx5Z6I9cLTunQIu/YMuvNORrweUup3qzJUSU5fjd9zz+bgbPEM77L5D4u8LkA7M +FdRPxEax2/+FzpmsJcASpA8Y2XFsLbCZeIMF8heej8xdKedyo02ZGw6N6Nfqc7nJv zjEM3hz0t9RHJ44YXaZWvcyj2N274U/+4WP9faCBb3HC2qCnWtGxCp6OI6pit8reDH 4YEwgx5fzmf0vu10R0cUnn2I+CM3a9nFga2m6RcpbCcvIcSaz5nNq4aUXEQ0t9THlE TJ7Hz5vEsykKJiodgDA7OAOe4MK/ZPMzhqR2v5o5EEo9pxF8WeMIaZ6Ab7/kRXbgPo bYiB2OE5yruzg== Received: by mail-oa1-f48.google.com with SMTP id 586e51a60fabf-1322d768ba7so13057057fac.5; Mon, 17 Oct 2022 05:36:31 -0700 (PDT) X-Gm-Message-State: ACrzQf0X/Y1edT7mWSLc6zU3qsvCN7t2U8U2nV31cw5oGD+QUeeDli6z zet28uQK5Z3WWxRqhyr5r9F3QVTLOfMZ27+JEcw= X-Received: by 2002:a05:6870:4413:b0:136:66cc:6af8 with SMTP id u19-20020a056870441300b0013666cc6af8mr15408042oah.112.1666010190362; Mon, 17 Oct 2022 05:36:30 -0700 (PDT) MIME-Version: 1.0 References: <20221003223222.448551-1-prabhakar.mahadev-lad.rj@bp.renesas.com> <20221003223222.448551-3-prabhakar.mahadev-lad.rj@bp.renesas.com> In-Reply-To: From: Guo Ren Date: Mon, 17 Oct 2022 20:36:17 +0800 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: [RFC PATCH v2 2/2] soc: renesas: Add L2 cache management for RZ/Five SoC To: "Lad, Prabhakar" Cc: Geert Uytterhoeven , Paul Walmsley , Palmer Dabbelt , Albert Ou , Rob Herring , Krzysztof Kozlowski , Magnus Damm , Heiko Stuebner , Conor Dooley , Philipp Tomsich , Nathan Chancellor , Atish Patra , Anup Patel , linux-renesas-soc@vger.kernel.org, devicetree@vger.kernel.org, linux-kernel@vger.kernel.org, linux-riscv@lists.infradead.org, Biju Das , Lad Prabhakar Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-7.4 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_HI, SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Oct 17, 2022 at 5:40 PM Lad, Prabhakar wrote: > > Hi Guo, > > On Tue, Oct 11, 2022 at 2:10 PM Guo Ren wrote: > > > > On Tue, Oct 11, 2022 at 5:39 PM Lad, Prabhakar > > wrote: > > > > > > Hi Guo, > > > > > > On Thu, Oct 6, 2022 at 1:59 AM Guo Ren wrote: > > > > > > > > On Wed, Oct 5, 2022 at 11:03 PM Lad, Prabhakar > > > > wrote: > > > > > > > > > > Hi Guo, > > > > > > > > > > On Wed, Oct 5, 2022 at 3:23 PM Guo Ren wrote: > > > > > > > > > > > > On Wed, Oct 5, 2022 at 8:54 PM Lad, Prabhakar > > > > > > wrote: > > > > > > > > > > > > > > Hi Guo, > > > > > > > > > > > > > > On Wed, Oct 5, 2022 at 2:29 AM Guo Ren wrote: > > > > > > > > > > > > > > > > On Tue, Oct 4, 2022 at 6:32 AM Prabhakar wrote: > > > > > > > > > > > > > > > > > > From: Lad Prabhakar > > > > > > > > > > > > > > > > > > On the AX45MP core, cache coherency is a specification option so it may > > > > > > > > > not be supported. In this case DMA will fail. As a workaround, firstly we > > > > > > > > > allocate a global dma coherent pool from which DMA allocations are taken > > > > > > > > > and marked as non-cacheable + bufferable using the PMA region as specified > > > > > > > > > in the device tree. Synchronization callbacks are implemented to > > > > > > > > > synchronize when doing DMA transactions. > > > > > > > > > > > > > > > > > > The Andes AX45MP core has a Programmable Physical Memory Attributes (PMA) > > > > > > > > > block that allows dynamic adjustment of memory attributes in the runtime. > > > > > > > > > It contains a configurable amount of PMA entries implemented as CSR > > > > > > > > > registers to control the attributes of memory locations in interest. > > > > > > > > > > > > > > > > > > Below are the memory attributes supported: > > > > > > > > > * Device, Non-bufferable > > > > > > > > > * Device, bufferable > > > > > > > > > * Memory, Non-cacheable, Non-bufferable > > > > > > > > > * Memory, Non-cacheable, Bufferable > > > > > > > > > * Memory, Write-back, No-allocate > > > > > > > > > * Memory, Write-back, Read-allocate > > > > > > > > > * Memory, Write-back, Write-allocate > > > > > > > > > * Memory, Write-back, Read and Write-allocate > > > > > > > > Seems Svpbmt's PMA, IO, and NC wouldn't fit your requirements, could > > > > > > > > give a map list of the types of Svpbmt? And give out what you needed, > > > > > > > > but Svpbmt can't. > > > > > > > > > > > > > > > Sorry I didn't get what you meant here, could you please elaborate. > > > > > > I know there is no pbmt in AX45MP, I am just curious how many physical > > > > > > memory attributes you would use in linux? It seems only one type used > > > > > > in the series: > > > > > > cpu_nocache_area_set -> sbi_ecall(SBI_EXT_ANDES, > > > > > > SBI_EXT_ANDES_SET_PMA, offset, vaddr, size, entry_id, 0, 0); > > > > > > > > > > > Yes, currently we only use "Memory, Non-cacheable, Bufferable". I was > > > > > wondering if we could send these options as flags from DT something > > > > > like below so that it's not hard coded in the code. > > > > > > > > > > /* PMA config */ > > > > > #define AX45MP_PMACFG_ETYP GENMASK(1, 0) > > > > > /* OFF: PMA entry is disabled */ > > > > > #define AX45MP_PMACFG_ETYP_DISABLED 0 > > > > > /* Naturally aligned power of 2 region */ > > > > > #define AX45MP_PMACFG_ETYP_NAPOT 3 > > > > > > > > > > #define AX45MP_PMACFG_MTYP GENMASK(5, 2) > > > > > /* Device, Non-bufferable */ > > > > > #define AX45MP_PMACFG_MTYP_DEV_NON_BUF (0 << 2) > > > > > /* Device, bufferable */ > > > > > #define AX45MP_PMACFG_MTYP_DEV_BUF (1 << 2) > > > > > /* Memory, Non-cacheable, Non-bufferable */ > > > > > #define AX45MP_PMACFG_MTYP_MEM_NON_CACHE_NON_BUF (2 << 2) > > > > > /* Memory, Non-cacheable, Bufferable */ > > > > > #define AX45MP_PMACFG_MTYP_MEM_NON_CACHE_BUF (3 << 2) > > > > > /* Memory, Write-back, No-allocate */ > > > > > #define AX45MP_PMACFG_MTYP_MEM_WB_NA (8 << 2) > > > > > /* Memory, Write-back, Read-allocate */ > > > > > #define AX45MP_PMACFG_MTYP_MEM_WB_RA (9 << 2) > > > > > /* Memory, Write-back, Write-allocate */ > > > > > #define AX45MP_PMACFG_MTYP_MEM_WB_WA (10 << 2) > > > > > /* Memory, Write-back, Read and Write-allocate */ > > > > > #define AX45MP_PMACFG_MTYP_MEM_WB_R_WA (11 << 2) > > > > > > > > > > /* AMO instructions are supported */ > > > > > #define AX45MP_PMACFG_NAMO_AMO_SUPPORT (0 << 6) > > > > > /* AMO instructions are not supported */ > > > > > #define AX45MP_PMACFG_NAMO_AMO_NO_SUPPORT (1 << 6) > > > > > > > > > > > > > > > pma-regions = <0x0 0x00000000 0x0 0x10000000 0x0 > > > > > AX45MP_PMACFG_ETYP_NAPOT | AX45MP_PMACFG_MTYP_MEM_NON_CACHE_BUF | > > > > > AX45MP_PMACFG_NAMO_AMO_SUPPORT>, > > > > > <0x0 0x10000000 0x0 0x04000000 0x0 > > > > > AX45MP_PMACFG_ETYP_NAPOT | AX45MP_PMACFG_MTYP_MEM_NON_CACHE_BUF | > > > > > AX45MP_PMACFG_NAMO_AMO_SUPPORT >, > > > > > <0x0 0x20000000 0x0 0x10000000 0x0 > > > > > AX45MP_PMACFG_ETYP_NAPOT | AX45MP_PMACFG_MTYP_MEM_NON_CACHE_BUF | > > > > > AX45MP_PMACFG_NAMO_AMO_SUPPORT>, > > > > > <0x0 0x58000000 0x0 0x08000000 0x0 > > > > > AX45MP_PMACFG_ETYP_NAPOT | AX45MP_PMACFG_MTYP_MEM_NON_CACHE_BUF | > > > > > AX45MP_PMACFG_NAMO_AMO_SUPPORT>; > > > > > > > > > > Does the above sound good? > > > > I've no idea. But for working around, I would give Acked-by. > > > > > > > > > > > > > > > I'm not sure how you make emmc/usb/gmac's dma ctrl desc work around > > > > > > without pbmt when they don't have cache coherency protocol. Do you > > > > > > need to inject dma_sync for desc synchronization? What's the effect of > > > > > > dynamic PMA in the patch series? > > > > > > > > > > > Currently we have setup the pma regions as below: > > > > > > > > > > l2cache: cache-controller@13400000 { > > > > > compatible = "andestech,ax45mp-cache", "cache"; > > > > > cache-size = <0x40000>; > > > > > cache-line-size = <64>; > > > > > cache-sets = <1024>; > > > > > cache-unified; > > > > > reg = <0x0 0x13400000 0x0 0x100000>; > > > > > pma-regions = <0x0 0x00000000 0x0 0x10000000 0x0 0xf>, > > > > > <0x0 0x10000000 0x0 0x04000000 0x0 0xf>, > > > > > <0x0 0x20000000 0x0 0x10000000 0x0 0xf>, > > > > > <0x0 0x58000000 0x0 0x08000000 0x0 0xf>; > > > > > interrupts = ; > > > > > }; > > > > > > > > > > The last pma-regions entry 0x58000000 is a DDR location this memory > > > > > locations is marked as shared DMA pool with below in DT, > > > > > > > > > > reserved-memory { > > > > > #address-cells = <2>; > > > > > #size-cells = <2>; > > > > > ranges; > > > > > > > > > > reserved: linux,cma@58000000 { > > > > > compatible = "shared-dma-pool"; > > > > > no-map; > > > > > linux,dma-default; > > > > > reg = <0x0 0x58000000 0x0 0x08000000>; > > > > > }; > > > > > }; > > > > > > > > > > And for ARCH_R9A07G043 we automatically select DMA_GLOBAL_POOL, so the > > > > > IP blocks (emmc/usb/gmac's) requesting DMA'able memory will > > > > > automatically fall into this region which is non-cacheable but > > > > > bufferable (set in PMA) and rest everything is taken care by clean and > > > > > flush callbacks. We dont have inject dma_sync for desc > > > > > synchronization for existing drivers (which are shared with Renesas > > > > > RZ/G2L family) > > > > Better than I thought :). The "non-cacheable but c" is "weak > > > > order," also raising the bufferable signal of AXI transactions. Right? > > > Yes, I have confirmed from the HW team it does raise bufferable signal > > > of AXI transactions. So far with the drivers (ETH/USB/DMAC) we haven't > > > seen issues so far. > > > > > > Do you foresee any issues? > > That depends on you interconnect design, most of the simple > > interconnects would ignore bufferable. Some NoC interconnects would > > buffer the transactions, which means data would be buffered in > > interconnects after CPU store instruction retired. If the CPU kicks > > the dma working with an IO reg write, hw may not guarantee the orders > > of the last data written and dma IO reg kick start. Then dma may lose > > the data. > > > I haven't see this issue, maybe to avoid this the controller register > space could be marked as non-cachebale + non-bufferable in the PMA by That sounds good, and should be. Although maybe not needed. > this way we could ensure orders. > > What do you think? > > Cheers, > Prabhakar -- Best Regards Guo Ren