Received: by 2002:a05:6a11:4021:0:0:0:0 with SMTP id ky33csp1928112pxb; Mon, 20 Sep 2021 08:19:45 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwPbb3q879pvACaoCO1iG0ncxLAdwdYLHfSsn/R3XWk4FploYbRUb7K5k8ZDL987MFKOyci X-Received: by 2002:a5d:5147:: with SMTP id u7mr29626241wrt.69.1632151185577; Mon, 20 Sep 2021 08:19:45 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1632151185; cv=none; d=google.com; s=arc-20160816; b=t4Mbwk2opvNMdInt4mQWI1puA1zYeOpnl9A3dFDe7mRMwVCRPeUPoPzyYcDYKFqTzg tLYea/BNyLWcMa0XY2j4V2LGFeEsHFoniKjITAuUoxarGQJGs5bEFSwVkeVbH81SOBA5 SasnapvGTtDkD4HymfpZ/WbUUpNrSEdaKThWOEtLxCGpvTyzc4ht+Pgytc+SKkzursXF d/sWltOr49OUSKFQMimbE8OHPKhUkehsXQmZ+KWAjkUdPNYBw0lxpjSPuknrbGW13mXP /18zCjNKkG0sT60uiG20hgPz074XGcvt+m0FyBvTbK/nDMjzBSuE7vcUsPn+L6HHtw9k /kzQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date; bh=HBPtad2sw6hrPEM5CDUy0wjm2zQ2PFAzoNdR3yq+RM0=; b=zNupTjJDjnwuDfBnb8FWsDO9k+1ONPWeCJGSn44hc5W+x0z6Q62NxM701evdTG6Jet mYuD/zwENEAjsYWwaX68qmoZ8KcrTTEIc2o2FVerCjR0VVxIrNYJtLEqD3ForEZ+22ct nmyV0j2R4JO/ndyXjTOsOhFB18qM89hh+C6MBjYnNmp+d7LX7otowgsW/34OrzDe7y6T 1yvzf+IkBfjN3XjpKqHXC8ea7rRP2CE6vBVcKbJ19uATSm5n1SwGLL+Bv0iszp5zndKr 9Dh4XLHMQRT7xEWYt/g79bKkCk63xX6HjsuwI8rzvjgQUUHUGCRUy1YtCspW4YsI/ADF ls2A== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id p5si16240523ejj.74.2021.09.20.08.19.15; Mon, 20 Sep 2021 08:19:45 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234962AbhITKXH (ORCPT + 99 others); Mon, 20 Sep 2021 06:23:07 -0400 Received: from foss.arm.com ([217.140.110.172]:45444 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231738AbhITKXG (ORCPT ); Mon, 20 Sep 2021 06:23:06 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 8DC031063; Mon, 20 Sep 2021 03:21:39 -0700 (PDT) Received: from C02TD0UTHF1T.local (unknown [10.57.16.51]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 1803E3F59C; Mon, 20 Sep 2021 03:21:36 -0700 (PDT) Date: Mon, 20 Sep 2021 11:21:28 +0100 From: Mark Rutland To: Liu Shixin Cc: Alexander Potapenko , Marco Elver , Dmitry Vyukov , Catalin Marinas , Will Deacon , kasan-dev@googlegroups.com, linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH] arm64: remove page granularity limitation from KFENCE Message-ID: <20210920101938.GA13863@C02TD0UTHF1T.local> References: <20210918083849.2696287-1-liushixin2@huawei.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20210918083849.2696287-1-liushixin2@huawei.com> Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sat, Sep 18, 2021 at 04:38:49PM +0800, Liu Shixin wrote: > Currently if KFENCE is enabled in arm64, the entire linear map will be > mapped at page granularity which seems overkilled. Actually only the > kfence pool requires to be mapped at page granularity. We can remove the > restriction from KFENCE and force the linear mapping of the kfence pool > at page granularity later in arch_kfence_init_pool(). > > Signed-off-by: Liu Shixin We'd made this apply to the entire linear map because it was simpler to do so, there are other reasons to want the linear map at page granularity (e.g. RODATA_FULL), and is also the default behaviour (since RODATA_FULL_DEFAULT_ENABLED is `default y`). We also avoid live changes from block<->table mappings, since the archtitecture gives us very weak guarantees there and generally requires a Break-Before-Make sequence (though IIRC this was tightened up somewhat, so maybe going one way is supposed to work). Unless it's really necessary, I'd rather not split these block mappings while they're live. The bigger question is does this actually matter in practice? I understand that in theory this can result in better TLB usage, but does this actually affect a workload in a meaningful way? Without numbers, I'd rather leave this as-is so that we're not adding complexity and an ongoing maintenance burden. Thanks, Mark. > --- > arch/arm64/include/asm/kfence.h | 69 ++++++++++++++++++++++++++++++++- > arch/arm64/mm/mmu.c | 4 +- > 2 files changed, 70 insertions(+), 3 deletions(-) > > diff --git a/arch/arm64/include/asm/kfence.h b/arch/arm64/include/asm/kfence.h > index aa855c6a0ae6..bee101eced0b 100644 > --- a/arch/arm64/include/asm/kfence.h > +++ b/arch/arm64/include/asm/kfence.h > @@ -8,9 +8,76 @@ > #ifndef __ASM_KFENCE_H > #define __ASM_KFENCE_H > > +#include > #include > +#include > > -static inline bool arch_kfence_init_pool(void) { return true; } > +static inline int split_pud_page(pud_t *pud, unsigned long addr) > +{ > + int i; > + pmd_t *pmd = pmd_alloc_one(&init_mm, addr); > + unsigned long pfn = PFN_DOWN(__pa(addr)); > + > + if (!pmd) > + return -ENOMEM; > + > + for (i = 0; i < PTRS_PER_PMD; i++) > + set_pmd(pmd + i, pmd_mkhuge(pfn_pmd(pfn + i * PTRS_PER_PTE, PAGE_KERNEL))); > + > + smp_wmb(); /* See comment in __pte_alloc */ > + pud_populate(&init_mm, pud, pmd); > + flush_tlb_kernel_range(addr, addr + PUD_SIZE); > + return 0; > +} > + > +static inline int split_pmd_page(pmd_t *pmd, unsigned long addr) > +{ > + int i; > + pte_t *pte = pte_alloc_one_kernel(&init_mm); > + unsigned long pfn = PFN_DOWN(__pa(addr)); > + > + if (!pte) > + return -ENOMEM; > + > + for (i = 0; i < PTRS_PER_PTE; i++) > + set_pte(pte + i, pfn_pte(pfn + i, PAGE_KERNEL)); > + > + smp_wmb(); /* See comment in __pte_alloc */ > + pmd_populate_kernel(&init_mm, pmd, pte); > + > + flush_tlb_kernel_range(addr, addr + PMD_SIZE); > + return 0; > +} > + > +static inline bool arch_kfence_init_pool(void) > +{ > + unsigned long addr; > + pgd_t *pgd; > + p4d_t *p4d; > + pud_t *pud; > + pmd_t *pmd; > + > + for (addr = (unsigned long)__kfence_pool; is_kfence_address((void *)addr); > + addr += PAGE_SIZE) { > + pgd = pgd_offset(&init_mm, addr); > + if (pgd_leaf(*pgd)) > + return false; > + p4d = p4d_offset(pgd, addr); > + if (p4d_leaf(*p4d)) > + return false; > + pud = pud_offset(p4d, addr); > + if (pud_leaf(*pud)) { > + if (split_pud_page(pud, addr & PUD_MASK)) > + return false; > + } > + pmd = pmd_offset(pud, addr); > + if (pmd_leaf(*pmd)) { > + if (split_pmd_page(pmd, addr & PMD_MASK)) > + return false; > + } > + } > + return true; > +} > > static inline bool kfence_protect_page(unsigned long addr, bool protect) > { > diff --git a/arch/arm64/mm/mmu.c b/arch/arm64/mm/mmu.c > index cfd9deb347c3..b2c79ccfb1c5 100644 > --- a/arch/arm64/mm/mmu.c > +++ b/arch/arm64/mm/mmu.c > @@ -516,7 +516,7 @@ static void __init map_mem(pgd_t *pgdp) > */ > BUILD_BUG_ON(pgd_index(direct_map_end - 1) == pgd_index(direct_map_end)); > > - if (can_set_direct_map() || crash_mem_map || IS_ENABLED(CONFIG_KFENCE)) > + if (can_set_direct_map() || crash_mem_map) > flags |= NO_BLOCK_MAPPINGS | NO_CONT_MAPPINGS; > > /* > @@ -1485,7 +1485,7 @@ int arch_add_memory(int nid, u64 start, u64 size, > * KFENCE requires linear map to be mapped at page granularity, so that > * it is possible to protect/unprotect single pages in the KFENCE pool. > */ > - if (can_set_direct_map() || IS_ENABLED(CONFIG_KFENCE)) > + if (can_set_direct_map()) > flags |= NO_BLOCK_MAPPINGS | NO_CONT_MAPPINGS; > > __create_pgd_mapping(swapper_pg_dir, start, __phys_to_virt(start), > -- > 2.18.0.huawei.25 >