Received: by 2002:a6b:fb09:0:0:0:0:0 with SMTP id h9csp3580797iog; Mon, 27 Jun 2022 20:47:11 -0700 (PDT) X-Google-Smtp-Source: AGRyM1tsaJJ94laOx1Ptt6XRFtNtPK7V5IibwNMONsw/bvopC9O84j9uD1NxDQsvj8aj6hiq6q7P X-Received: by 2002:a05:6402:c44:b0:431:52cc:f933 with SMTP id cs4-20020a0564020c4400b0043152ccf933mr20196278edb.41.1656388031456; Mon, 27 Jun 2022 20:47:11 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1656388031; cv=none; d=google.com; s=arc-20160816; b=gUKSmMdlhGg5Mg65k10QGLMC+KMD36WfAoz26wbGzim0a6Hq01nlBmCOFisvtCgdnn A1aQyAF0NMw6GIcOeWhmXN+vRCAEqBYi4bHe37lJewEMn3L7K3CE9XeZFpbbRkgJG0EX ixHYNJ3BNaMMnmnVCxBa+MmObfvWlEpRzOzSBGRztsRF/fvRQ8EAt8TvowdXRYyoJOAe 7sC3ANE9JUo0Poy2wubWhfTapB2Y9iHcLNK7x4fRpXvrZgtII9Mz8DPSiY2VErkk4rtn 2T2aEAXkB3E3UAMrfznmJCXlNUMvbiOAQT4tMD9/MShz0L82hUplEzyrkc6n/K4Q8rOU pDwQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=DRQZS+wO2+PjOUJoLYoaHvlWGefOcXK2lUi53DdAk/I=; b=nkfZbNSxrQziPE1EpBApJVALJw3QywywCb8N11ARrhRW5k9bEK+BsVYlIPq7ZFrBcB Jd+ImeTmxy2gjZgztW/wg8oJAQUUIBRJi6M2q/K4Nll1SsG9W61pxp27D8R5erBfMi+C LHEYNOsAPmNAcMIjMTwYzY/vRbmxiPhmSage9LZzTkiApvJEBHwYUH7WH1ijPR/TX+au 4EeeUTjVH5zFdcasdTEyThbI5vY0Rm6HeTQ7eOrMS/FzMuwmd3CB8zcJL81tlkt3Ton7 6cRy2p62ev17lMdHVhBjB8xbBZEnk9zYGlPudtLWO8kpGmGbcTM+7gKZEdSmem+S0k7/ FKaA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=VQrIYAO6; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id a13-20020a1709063a4d00b0070f89ef5eb5si12591714ejf.283.2022.06.27.20.46.46; Mon, 27 Jun 2022 20:47:11 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=VQrIYAO6; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231344AbiF1DWy (ORCPT + 99 others); Mon, 27 Jun 2022 23:22:54 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:43832 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231519AbiF1DWc (ORCPT ); Mon, 27 Jun 2022 23:22:32 -0400 Received: from dfw.source.kernel.org (dfw.source.kernel.org [139.178.84.217]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 8E0BD2529E; Mon, 27 Jun 2022 20:22:27 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id 23DA1615BC; Tue, 28 Jun 2022 03:22:27 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 7C638C341CE; Tue, 28 Jun 2022 03:22:26 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1656386546; bh=IX9GpJCf5zFk83qLHuc5OtFoLqfpiJ8Lh1FdnhW9AN8=; h=References:In-Reply-To:From:Date:Subject:To:Cc:From; b=VQrIYAO6vwg+696izoP5SrdL119ptxuLxDTzlMy5952K7SVAji6NzqdfgeYKpO57Q Be2+pA3wgSkglkl1si+HjfhLb2sx8Z89pnuJsoe6YdX0xyqIoeIp4jZ8fXvLgoQ4rw PtBN3woDHU2blgSxAFrIjEjaL9+U9zHkmXqbxo1dul56S+5hzgZfuJ3m0y6BPcEk+p 3mA5xVn4OYmK3xC92hv3mcftMjiqmsjALEGEwFGKs8XjGvstBkv/E9m/nkXNB7eOWJ NS169tFOgOZRQ5G+8UrwZIlb8LGuYbc3zFyEGH0VgCB44a5+7zKmopaKI/y2gStc0c ddrPGp7EbHZPw== Received: by mail-ua1-f54.google.com with SMTP id x24so4110666uaf.11; Mon, 27 Jun 2022 20:22:26 -0700 (PDT) X-Gm-Message-State: AJIora8xyXajS3bt9y7Jeq0shDR5Dtzx4gI+Uu9CrzhLHb6w/xAOPPTR 0vVdsFM6GjZv0aqbyglB8bqulzCz0AgQMubAw6A= X-Received: by 2002:a9f:2c9a:0:b0:381:c1c7:82a6 with SMTP id w26-20020a9f2c9a000000b00381c1c782a6mr4752539uaj.23.1656386545380; Mon, 27 Jun 2022 20:22:25 -0700 (PDT) MIME-Version: 1.0 References: <20220625095459.3786827-1-chenhuacai@loongson.cn> <20220625095459.3786827-2-chenhuacai@loongson.cn> In-Reply-To: From: Huacai Chen Date: Tue, 28 Jun 2022 11:22:14 +0800 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: [PATCH 2/3] LoongArch: Add sparse memory vmemmap support To: Muchun Song Cc: Huacai Chen , Arnd Bergmann , Thomas Bogendoerfer , Dave Hansen , Andy Lutomirski , Peter Zijlstra , Catalin Marinas , Will Deacon , loongarch@lists.linux.dev, linux-arch , Xuefeng Li , Guo Ren , Xuerui Wang , Jiaxun Yang , Andrew Morton , Linux Memory Management List , "open list:MIPS" , LKML , LAK , Feiyang Chen , Min Zhou Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-7.5 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_HI, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, Muchun, On Mon, Jun 27, 2022 at 6:33 PM Muchun Song wrote: > > On Sat, Jun 25, 2022 at 5:54 PM Huacai Chen wrote: > > > > From: Feiyang Chen > > > > Add sparse memory vmemmap support for LoongArch. SPARSEMEM_VMEMMAP > > uses a virtually mapped memmap to optimise pfn_to_page and page_to_pfn > > operations. This is the most efficient option when sufficient kernel > > resources are available. > > > > Signed-off-by: Min Zhou > > Signed-off-by: Huacai Chen > > Signed-off-by: Feiyang Chen > > --- > > arch/loongarch/Kconfig | 2 + > > arch/loongarch/include/asm/pgtable.h | 5 +- > > arch/loongarch/include/asm/sparsemem.h | 8 +++ > > arch/loongarch/mm/init.c | 71 +++++++++++++++++++++++++- > > include/linux/mm.h | 2 + > > mm/sparse-vmemmap.c | 10 ++++ > > 6 files changed, 96 insertions(+), 2 deletions(-) > > > > diff --git a/arch/loongarch/Kconfig b/arch/loongarch/Kconfig > > index dc19cf3071ea..8e56ca28165e 100644 > > --- a/arch/loongarch/Kconfig > > +++ b/arch/loongarch/Kconfig > > @@ -49,6 +49,7 @@ config LOONGARCH > > select ARCH_USE_QUEUED_RWLOCKS > > select ARCH_USE_QUEUED_SPINLOCKS > > select ARCH_WANT_DEFAULT_TOPDOWN_MMAP_LAYOUT > > + select ARCH_WANT_HUGETLB_PAGE_OPTIMIZE_VMEMMAP > > I think this should be a separate patch to enable HVO (HugeTLB Vmemmap > Optimization) since it is irrelevant to this patch. It seems I have misunderstood HVO, then I will remove HVO parts from this patch. Thank you. Huacai > > Thanks. > > > select ARCH_WANTS_NO_INSTR > > select BUILDTIME_TABLE_SORT > > select COMMON_CLK > > @@ -422,6 +423,7 @@ config ARCH_FLATMEM_ENABLE > > > > config ARCH_SPARSEMEM_ENABLE > > def_bool y > > + select SPARSEMEM_VMEMMAP_ENABLE > > help > > Say Y to support efficient handling of sparse physical memory, > > for architectures which are either NUMA (Non-Uniform Memory Access) > > diff --git a/arch/loongarch/include/asm/pgtable.h b/arch/loongarch/include/asm/pgtable.h > > index 9c811c3f7572..b701ec7a0309 100644 > > --- a/arch/loongarch/include/asm/pgtable.h > > +++ b/arch/loongarch/include/asm/pgtable.h > > @@ -92,7 +92,10 @@ extern unsigned long zero_page_mask; > > #define VMALLOC_START MODULES_END > > #define VMALLOC_END \ > > (vm_map_base + \ > > - min(PTRS_PER_PGD * PTRS_PER_PUD * PTRS_PER_PMD * PTRS_PER_PTE * PAGE_SIZE, (1UL << cpu_vabits)) - PMD_SIZE) > > + min(PTRS_PER_PGD * PTRS_PER_PUD * PTRS_PER_PMD * PTRS_PER_PTE * PAGE_SIZE, (1UL << cpu_vabits)) - PMD_SIZE - VMEMMAP_SIZE) > > + > > +#define vmemmap ((struct page *)((VMALLOC_END + PMD_SIZE) & PMD_MASK)) > > +#define VMEMMAP_END ((unsigned long)vmemmap + VMEMMAP_SIZE - 1) > > > > #define pte_ERROR(e) \ > > pr_err("%s:%d: bad pte %016lx.\n", __FILE__, __LINE__, pte_val(e)) > > diff --git a/arch/loongarch/include/asm/sparsemem.h b/arch/loongarch/include/asm/sparsemem.h > > index 3d18cdf1b069..a1e440f6bec7 100644 > > --- a/arch/loongarch/include/asm/sparsemem.h > > +++ b/arch/loongarch/include/asm/sparsemem.h > > @@ -11,6 +11,14 @@ > > #define SECTION_SIZE_BITS 29 /* 2^29 = Largest Huge Page Size */ > > #define MAX_PHYSMEM_BITS 48 > > > > +#ifndef CONFIG_SPARSEMEM_VMEMMAP > > +#define VMEMMAP_SIZE 0 > > +#else > > +#define VMEMMAP_SIZE (sizeof(struct page) * (1UL << (cpu_pabits + 1 - PAGE_SHIFT))) > > +#endif > > + > > +#include > > + > > #endif /* CONFIG_SPARSEMEM */ > > > > #ifdef CONFIG_MEMORY_HOTPLUG > > diff --git a/arch/loongarch/mm/init.c b/arch/loongarch/mm/init.c > > index 7094a68c9b83..35128229fe46 100644 > > --- a/arch/loongarch/mm/init.c > > +++ b/arch/loongarch/mm/init.c > > @@ -22,7 +22,7 @@ > > #include > > #include > > #include > > -#include > > +#include > > #include > > > > #include > > @@ -157,6 +157,75 @@ void arch_remove_memory(u64 start, u64 size, struct vmem_altmap *altmap) > > #endif > > #endif > > > > +#ifdef CONFIG_SPARSEMEM_VMEMMAP > > +int __meminit vmemmap_populate_hugepages(unsigned long start, unsigned long end, > > + int node, struct vmem_altmap *altmap) > > +{ > > + unsigned long addr = start; > > + unsigned long next; > > + pgd_t *pgd; > > + p4d_t *p4d; > > + pud_t *pud; > > + pmd_t *pmd; > > + > > + for (addr = start; addr < end; addr = next) { > > + next = pmd_addr_end(addr, end); > > + > > + pgd = vmemmap_pgd_populate(addr, node); > > + if (!pgd) > > + return -ENOMEM; > > + p4d = vmemmap_p4d_populate(pgd, addr, node); > > + if (!p4d) > > + return -ENOMEM; > > + pud = vmemmap_pud_populate(p4d, addr, node); > > + if (!pud) > > + return -ENOMEM; > > + > > + pmd = pmd_offset(pud, addr); > > + if (pmd_none(*pmd)) { > > + void *p = NULL; > > + > > + p = vmemmap_alloc_block_buf(PMD_SIZE, node, NULL); > > + if (p) { > > + pmd_t entry; > > + > > + entry = pfn_pmd(virt_to_pfn(p), PAGE_KERNEL); > > + pmd_val(entry) |= _PAGE_HUGE | _PAGE_HGLOBAL; > > + set_pmd_at(&init_mm, addr, pmd, entry); > > + > > + continue; > > + } > > + } else if (pmd_val(*pmd) & _PAGE_HUGE) { > > + vmemmap_verify((pte_t *)pmd, node, addr, next); > > + continue; > > + } > > + if (vmemmap_populate_basepages(addr, next, node, NULL)) > > + return -ENOMEM; > > + } > > + > > + return 0; > > +} > > + > > +#if CONFIG_PGTABLE_LEVELS == 2 > > +int __meminit vmemmap_populate(unsigned long start, unsigned long end, int node, > > + struct vmem_altmap *altmap) > > +{ > > + return vmemmap_populate_basepages(start, end, node, NULL); > > +} > > +#else > > +int __meminit vmemmap_populate(unsigned long start, unsigned long end, int node, > > + struct vmem_altmap *altmap) > > +{ > > + return vmemmap_populate_hugepages(start, end, node, NULL); > > +} > > +#endif > > + > > +void vmemmap_free(unsigned long start, unsigned long end, > > + struct vmem_altmap *altmap) > > +{ > > +} > > +#endif > > + > > /* > > * Align swapper_pg_dir in to 64K, allows its address to be loaded > > * with a single LUI instruction in the TLB handlers. If we used > > diff --git a/include/linux/mm.h b/include/linux/mm.h > > index bc8f326be0ce..3472b924a1ea 100644 > > --- a/include/linux/mm.h > > +++ b/include/linux/mm.h > > @@ -3203,6 +3203,8 @@ void *sparse_buffer_alloc(unsigned long size); > > struct page * __populate_section_memmap(unsigned long pfn, > > unsigned long nr_pages, int nid, struct vmem_altmap *altmap, > > struct dev_pagemap *pgmap); > > +void pmd_init(void *addr); > > +void pud_init(void *addr); > > pgd_t *vmemmap_pgd_populate(unsigned long addr, int node); > > p4d_t *vmemmap_p4d_populate(pgd_t *pgd, unsigned long addr, int node); > > pud_t *vmemmap_pud_populate(p4d_t *p4d, unsigned long addr, int node); > > diff --git a/mm/sparse-vmemmap.c b/mm/sparse-vmemmap.c > > index f4fa61dbbee3..33e2a1ceee72 100644 > > --- a/mm/sparse-vmemmap.c > > +++ b/mm/sparse-vmemmap.c > > @@ -587,6 +587,10 @@ pmd_t * __meminit vmemmap_pmd_populate(pud_t *pud, unsigned long addr, int node) > > return pmd; > > } > > > > +void __weak __meminit pmd_init(void *addr) > > +{ > > +} > > + > > pud_t * __meminit vmemmap_pud_populate(p4d_t *p4d, unsigned long addr, int node) > > { > > pud_t *pud = pud_offset(p4d, addr); > > @@ -594,11 +598,16 @@ pud_t * __meminit vmemmap_pud_populate(p4d_t *p4d, unsigned long addr, int node) > > void *p = vmemmap_alloc_block_zero(PAGE_SIZE, node); > > if (!p) > > return NULL; > > + pmd_init(p); > > pud_populate(&init_mm, pud, p); > > } > > return pud; > > } > > > > +void __weak __meminit pud_init(void *addr) > > +{ > > +} > > + > > p4d_t * __meminit vmemmap_p4d_populate(pgd_t *pgd, unsigned long addr, int node) > > { > > p4d_t *p4d = p4d_offset(pgd, addr); > > @@ -606,6 +615,7 @@ p4d_t * __meminit vmemmap_p4d_populate(pgd_t *pgd, unsigned long addr, int node) > > void *p = vmemmap_alloc_block_zero(PAGE_SIZE, node); > > if (!p) > > return NULL; > > + pud_init(p); > > p4d_populate(&init_mm, p4d, p); > > } > > return p4d; > > -- > > 2.27.0 > >