Received: by 2002:a05:7412:419a:b0:f3:1519:9f41 with SMTP id i26csp3123875rdh; Mon, 27 Nov 2023 07:02:24 -0800 (PST) X-Google-Smtp-Source: AGHT+IEcwFjB65giqGFwKT+5XM/9NwfVEjy2fwe/wVSFlyxLOu/hYuiZRxrAGE3Efe2SjudY6NJQ X-Received: by 2002:a25:6894:0:b0:d9a:5859:bf14 with SMTP id d142-20020a256894000000b00d9a5859bf14mr9736288ybc.64.1701097344274; Mon, 27 Nov 2023 07:02:24 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1701097344; cv=none; d=google.com; s=arc-20160816; b=zfV+yxV1nc9KV8bXzsazHgmyRK9pT9/piyOsxOV1FjIMX5jDfjFJAEciyQi8ylAe1W gguI8a0huAMWEEy0XE0dyxQqgznsah1gwP/KnsY+nItFeGEjk4w1SUDlyFqVsvTpysza p7HDa3c76uGUwGrj142khs6YfNy+gVUKTEIaWPwSwL3sr2DGS4b5rLbl3yuNr09SvB79 SqsUITu/ObaPlBjA7XQieCo8NQhbk2qK++wlrl+ZtoKooLJoQQsUyWGInwSP1hKoa4i3 iI2mf6X5B41c1UGJHmieBqx9rbxYz+5+rQnL+FliXkYdT6Bo3YASN0MHw2Jy98mwPLw1 mWJg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date; bh=7biW1Mw8n6QLTzXDyEdH2rMwYFOOh3ORWc6XBBSz8rI=; fh=oVyZPWtm6mpH3d4qDAoXsAeagaN2dloHFHPyagpC8cc=; b=dCCwqrBIpMtRi2bQFOl+qaP7ZAzHsoKoo1PdojIp5IOUXbmoc6sHnD51hTViPQtYK2 LNJi91k+MgXvaSmcXblKGUho/qGvz4iii99cKGAoRZ6AdzgcoDrcieWYtGsCOmXdUyyc +l1Sql0+ACtsyX+pdU1n3gkJXerlR0TWc11vpz0RXhFySi5kXHccfyw36xdZ+PDAI9XE wQrhgWf2ke1xNeLXyw0eXYnVgaNhyOIduGjNFWvfPDciVopXsBloP9zbMGLe2Sd5yfex fCLOm/5qxtoPudz5oOtRUVutLzmKQNvAgH33XL/fooXetcfeGbOPL8RJiCjQOwh2aDrs ydMA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:6 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from pete.vger.email (pete.vger.email. [2620:137:e000::3:6]) by mx.google.com with ESMTPS id y6-20020a25dc06000000b00da0ae1e6408si6733600ybe.584.2023.11.27.07.02.21 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 27 Nov 2023 07:02:24 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:6 as permitted sender) client-ip=2620:137:e000::3:6; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:6 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by pete.vger.email (Postfix) with ESMTP id A4AE2809B9C0; Mon, 27 Nov 2023 07:01:31 -0800 (PST) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.11 at pete.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233668AbjK0PBJ (ORCPT + 99 others); Mon, 27 Nov 2023 10:01:09 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:49396 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233282AbjK0PBI (ORCPT ); Mon, 27 Nov 2023 10:01:08 -0500 Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 38867AA; Mon, 27 Nov 2023 07:01:14 -0800 (PST) Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 8CEEE2F4; Mon, 27 Nov 2023 07:02:01 -0800 (PST) Received: from raptor (unknown [172.31.20.19]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 947493F6C4; Mon, 27 Nov 2023 07:01:08 -0800 (PST) Date: Mon, 27 Nov 2023 15:01:05 +0000 From: Alexandru Elisei To: David Hildenbrand Cc: catalin.marinas@arm.com, will@kernel.org, oliver.upton@linux.dev, maz@kernel.org, james.morse@arm.com, suzuki.poulose@arm.com, yuzenghui@huawei.com, arnd@arndb.de, akpm@linux-foundation.org, mingo@redhat.com, peterz@infradead.org, juri.lelli@redhat.com, vincent.guittot@linaro.org, dietmar.eggemann@arm.com, rostedt@goodmis.org, bsegall@google.com, mgorman@suse.de, bristot@redhat.com, vschneid@redhat.com, mhiramat@kernel.org, rppt@kernel.org, hughd@google.com, pcc@google.com, steven.price@arm.com, anshuman.khandual@arm.com, vincenzo.frascino@arm.com, eugenis@google.com, kcc@google.com, hyesoo.yu@samsung.com, linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, kvmarm@lists.linux.dev, linux-fsdevel@vger.kernel.org, linux-arch@vger.kernel.org, linux-mm@kvack.org, linux-trace-kernel@vger.kernel.org Subject: Re: [PATCH RFC v2 12/27] arm64: mte: Add tag storage pages to the MIGRATE_CMA migratetype Message-ID: References: <20231119165721.9849-1-alexandru.elisei@arm.com> <20231119165721.9849-13-alexandru.elisei@arm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-0.8 required=5.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on pete.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (pete.vger.email [0.0.0.0]); Mon, 27 Nov 2023 07:01:31 -0800 (PST) Hi David, On Fri, Nov 24, 2023 at 08:40:55PM +0100, David Hildenbrand wrote: > On 19.11.23 17:57, Alexandru Elisei wrote: > > Add the MTE tag storage pages to the MIGRATE_CMA migratetype, which allows > > the page allocator to manage them like regular pages. > > > > Ths migratype lends the pages some very desirable properties: > > > > * They cannot be longterm pinned, meaning they will always be migratable. > > > > * The pages can be allocated explicitely by using their PFN (with > > alloc_contig_range()) when they are needed to store tags. > > > > Signed-off-by: Alexandru Elisei > > --- > > arch/arm64/Kconfig | 1 + > > arch/arm64/kernel/mte_tag_storage.c | 68 +++++++++++++++++++++++++++++ > > include/linux/mmzone.h | 5 +++ > > mm/internal.h | 3 -- > > 4 files changed, 74 insertions(+), 3 deletions(-) > > > > diff --git a/arch/arm64/Kconfig b/arch/arm64/Kconfig > > index fe8276fdc7a8..047487046e8f 100644 > > --- a/arch/arm64/Kconfig > > +++ b/arch/arm64/Kconfig > > @@ -2065,6 +2065,7 @@ config ARM64_MTE > > if ARM64_MTE > > config ARM64_MTE_TAG_STORAGE > > bool "Dynamic MTE tag storage management" > > + select CONFIG_CMA > > help > > Adds support for dynamic management of the memory used by the hardware > > for storing MTE tags. This memory, unlike normal memory, cannot be > > diff --git a/arch/arm64/kernel/mte_tag_storage.c b/arch/arm64/kernel/mte_tag_storage.c > > index fa6267ef8392..427f4f1909f3 100644 > > --- a/arch/arm64/kernel/mte_tag_storage.c > > +++ b/arch/arm64/kernel/mte_tag_storage.c > > @@ -5,10 +5,12 @@ > > * Copyright (C) 2023 ARM Ltd. > > */ > > +#include > > #include > > #include > > #include > > #include > > +#include > > #include > > #include > > #include > > @@ -189,6 +191,14 @@ static int __init fdt_init_tag_storage(unsigned long node, const char *uname, > > return ret; > > } > > + /* Pages are managed in pageblock_nr_pages chunks */ > > + if (!IS_ALIGNED(tag_range->start | range_len(tag_range), pageblock_nr_pages)) { > > + pr_err("Tag storage region 0x%llx-0x%llx not aligned to pageblock size 0x%llx", > > + PFN_PHYS(tag_range->start), PFN_PHYS(tag_range->end), > > + PFN_PHYS(pageblock_nr_pages)); > > + return -EINVAL; > > + } > > + > > ret = tag_storage_get_memory_node(node, &mem_node); > > if (ret) > > return ret; > > @@ -254,3 +264,61 @@ void __init mte_tag_storage_init(void) > > pr_info("MTE tag storage region management disabled"); > > } > > } > > + > > +static int __init mte_tag_storage_activate_regions(void) > > +{ > > + phys_addr_t dram_start, dram_end; > > + struct range *tag_range; > > + unsigned long pfn; > > + int i, ret; > > + > > + if (num_tag_regions == 0) > > + return 0; > > + > > + dram_start = memblock_start_of_DRAM(); > > + dram_end = memblock_end_of_DRAM(); > > + > > + for (i = 0; i < num_tag_regions; i++) { > > + tag_range = &tag_regions[i].tag_range; > > + /* > > + * Tag storage region was clipped by arm64_bootmem_init() > > + * enforcing addressing limits. > > + */ > > + if (PFN_PHYS(tag_range->start) < dram_start || > > + PFN_PHYS(tag_range->end) >= dram_end) { > > + pr_err("Tag storage region 0x%llx-0x%llx outside addressable memory", > > + PFN_PHYS(tag_range->start), PFN_PHYS(tag_range->end)); > > + ret = -EINVAL; > > + goto out_disabled; > > + } > > + } > > + > > + /* > > + * MTE disabled, tag storage pages can be used like any other pages. The > > + * only restriction is that the pages cannot be used by kexec because > > + * the memory remains marked as reserved in the memblock allocator. > > + */ > > + if (!system_supports_mte()) { > > + for (i = 0; i< num_tag_regions; i++) { > > + tag_range = &tag_regions[i].tag_range; > > + for (pfn = tag_range->start; pfn <= tag_range->end; pfn++) > > + free_reserved_page(pfn_to_page(pfn)); > > + } > > + ret = 0; > > + goto out_disabled; > > + } > > + > > + for (i = 0; i < num_tag_regions; i++) { > > + tag_range = &tag_regions[i].tag_range; > > + for (pfn = tag_range->start; pfn <= tag_range->end; pfn += pageblock_nr_pages) > > + init_cma_reserved_pageblock(pfn_to_page(pfn)); > > + totalcma_pages += range_len(tag_range); > > + } > > You shouldn't be doing that manually in arm code. Likely you want some cma.c > helper for something like that. If you referring to the last loop (the one that does ini_cma_reserved_pageblock()), indeed, there's already a function which does that, cma_init_reserved_areas() -> cma_activate_area(). > > But, can you elaborate on why you took this hacky (sorry) approach as > documented in the cover letter: No worries, it is indeed a bit hacky :) > > "The arm64 code manages this memory directly instead of using > cma_declare_contiguous/cma_alloc for performance reasons." > > What is the exact problem? I am referring to the performance degredation that is fixed in patch #26, "arm64: mte: Fast track reserving tag storage when the block is free" [1]. The issue is that alloc_contig_range() -> __alloc_contig_migrate_range() calls lru_cache_disable(), which IPIs all the CPUs in the system, and that leads to a 10-20% performance degradation on Chrome. It has been observed that most of the time the tag storage pages are free, and the lru_cache_disable() calls are unnecessary. The performance degradation is almost entirely eliminated by having the code take the tag storage page directly from the free list if it's free, instead of calling alloc_contig_range(). Do you believe it would be better to use the cma code, and modify it to use this fast path to take the page drectly from the buddy allocator? I can definitely try to integrate the code with cma_alloc(), but I think keeping the fast path for reserving tag storage is extremely desirable, since it makes such a huge difference to performance. [1] https://lore.kernel.org/linux-trace-kernel/20231119165721.9849-27-alexandru.elisei@arm.com/ Thanks, Alex > > -- > Cheers, > > David / dhildenb > >