Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 3C013C433EF for ; Mon, 22 Nov 2021 09:00:54 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S238892AbhKVJD5 (ORCPT ); Mon, 22 Nov 2021 04:03:57 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:55620 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229716AbhKVJDz (ORCPT ); Mon, 22 Nov 2021 04:03:55 -0500 Received: from mail-qk1-x736.google.com (mail-qk1-x736.google.com [IPv6:2607:f8b0:4864:20::736]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 5F288C061574 for ; Mon, 22 Nov 2021 01:00:49 -0800 (PST) Received: by mail-qk1-x736.google.com with SMTP id 132so17345215qkj.11 for ; Mon, 22 Nov 2021 01:00:49 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=3sRZ4A1M5AudWFAyajd/b2gfytpmyoTcIYiz47gSMik=; b=DEwSAxaSMj/tJBx9+Zo70EMhHcJjmZ+c9QaWlyjHVK0/ND4qVEn5SOxg0D9Xrwp0Qf 37iDq+GbKzmYVA9FVPEV8WDA2XZ2faX9YrJQUjiUkOh/1Rm+RnN+qUzHpAzuHcrgzVcB 7BNblD4ALnF4B3VtfUkrFNHcFQWn3bEMMBrAKTrmJA2raYgA4mzHDA7f85bxPtVXjTR+ RzzJH8+b3bajZFXTeQSnNuP+kqRruHLZf6wUdc7l/KolswMF7gLB+deDqtR1TEhY5/P4 6cH4Txme/sCkC3Eb6RvyPexvTqc25WV/dVk67dltC8a5S1zszijHYcywVqJkaGTQb5Ck PBAQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=3sRZ4A1M5AudWFAyajd/b2gfytpmyoTcIYiz47gSMik=; b=ZiOFRgHJAdgriIhu3dQAmauBcvoWMhejq8AkAPKuTRgyE3IY/j5xPEohYn1vm8Rkhu r+56E9oFfF1lHCndn0Xj4qaOaqR8h6cp21LpDrxq9AyfuYeUV+lzyfuv4NPaOz7nXqTl 5ldTN5AemkYH+5uaDDLL3CCTF5lbZImHmFmoMrJPjM8UIEDKASB8O9JUTPf90TDcoNyc gMnsCrK8bKQwaBvsgNYh1YYUKm8nl986EXao+fempKg7MdE6FAJW6kk4ssobXdOf/17m Ukp+HeLzYVBF6XtlwrdwkGd16C0IK40VSiIOfiKUYijeITlfIHTPeGaXKzAVOd49egYh qWdQ== X-Gm-Message-State: AOAM530VnPRhrO22XTeC6PeDBAQv9ODuzXeerGzOPz9Ee5jUGF7t5PJS qv510uv5xmMWrVz/yzaMWytQvQPimF8ITuAR9O0= X-Google-Smtp-Source: ABdhPJz9/6N/xHx6EOH1YBXExh1TSP30qiKTygpEywhBj/RyU2TB6Pe+1yPot+SVff6jpRHuBDuecJutUxTvg3wJc1w= X-Received: by 2002:a05:620a:404c:: with SMTP id i12mr48450420qko.298.1637571648555; Mon, 22 Nov 2021 01:00:48 -0800 (PST) MIME-Version: 1.0 References: <1637558929-22971-1-git-send-email-huangzhaoyang@gmail.com> In-Reply-To: From: Zhaoyang Huang Date: Mon, 22 Nov 2021 17:00:27 +0800 Message-ID: Subject: Re: [RFC PATCH] arch: arm64: introduce RODATA_FULL_USE_PTE_CONT To: Ard Biesheuvel Cc: Catalin Marinas , Will Deacon , Anshuman Khandual , Andrew Morton , Nicholas Piggin , Mike Rapoport , Pavel Tatashin , Christophe Leroy , Jonathan Marek , Zhaoyang Huang , Linux Memory Management List , Linux Kernel Mailing List Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Nov 22, 2021 at 4:52 PM Ard Biesheuvel wrote: > > On Mon, 22 Nov 2021 at 06:29, Huangzhaoyang wrote: > > > > From: Zhaoyang Huang > > > > Kernel linear mapping will be split to the smallest granularity when > > RODATA_FULL applied, which could lead to TLB pressure. Introduce a method > > to apply PTE_CONT on pte. > > > > Signed-off-by: Zhaoyang Huang > > How would this lead to TLB pressure, and how does the use of > contiguous mappings mitigate that? The linear mapping of the kernel is > rarely used, as all normal accesses to it go via the vmalloc region, > so in which case would TLB entries be allocated for this region in a > way that could cause a measurable performance impact? In fact, the patch is about to use PTE_CONT *OUT OF* the range of kernel text. Would you please have a look at the code. It apply PTE_CONT during map_mem and then clear it when load_module change the corresponding linear mapping to the area it use in vmalloc area. > > > > --- > > arch/arm64/Kconfig | 9 +++++++++ > > arch/arm64/mm/mmu.c | 10 ++++++++-- > > arch/arm64/mm/pageattr.c | 9 +++++++++ > > 3 files changed, 26 insertions(+), 2 deletions(-) > > > > diff --git a/arch/arm64/Kconfig b/arch/arm64/Kconfig > > index fee914c..3f8fbf0 100644 > > --- a/arch/arm64/Kconfig > > +++ b/arch/arm64/Kconfig > > @@ -1198,6 +1198,15 @@ config RODATA_FULL_DEFAULT_ENABLED > > This requires the linear region to be mapped down to pages, > > which may adversely affect performance in some cases. > > > > +config RODATA_FULL_USE_PTE_CONT > > + bool "Apply PTE_CONT when RODATA_FULL_DEFAULT_ENABLED enabled" > > + depends on RODATA_FULL_DEFAULT_ENABLED > > + default y > > + help > > + Apply PTE_CONT on linear mapping as much as we can when > > + RODATA_FULL_DEFAULT_ENABLED enabled which could decrease the > > + impaction on performance by small pte granularity. > > + > > config ARM64_SW_TTBR0_PAN > > bool "Emulate Privileged Access Never using TTBR0_EL1 switching" > > help > > diff --git a/arch/arm64/mm/mmu.c b/arch/arm64/mm/mmu.c > > index cfd9deb..8017b17 100644 > > --- a/arch/arm64/mm/mmu.c > > +++ b/arch/arm64/mm/mmu.c > > @@ -124,15 +124,21 @@ static bool pgattr_change_is_safe(u64 old, u64 new) > > * The following mapping attributes may be updated in live > > * kernel mappings without the need for break-before-make. > > */ > > +#ifndef CONFIG_RODATA_FULL_USE_PTE_CONT > > pteval_t mask = PTE_PXN | PTE_RDONLY | PTE_WRITE | PTE_NG; > > +#else > > + pteval_t mask = PTE_PXN | PTE_RDONLY | PTE_WRITE | PTE_NG | PTE_CONT; > > +#endif > > > > /* creating or taking down mappings is always safe */ > > if (old == 0 || new == 0) > > return true; > > > > /* live contiguous mappings may not be manipulated at all */ > > - if ((old | new) & PTE_CONT) > > +#ifndef CONFIG_RODATA_FULL_USE_PTE_CONT > > + if (old | new) & PTE_CONT) > > return false; > > +#endif > > > > /* Transitioning from Non-Global to Global is unsafe */ > > if (old & ~new & PTE_NG) > > @@ -206,7 +212,7 @@ static void alloc_init_cont_pte(pmd_t *pmdp, unsigned long addr, > > > > /* use a contiguous mapping if the range is suitably aligned */ > > if ((((addr | next | phys) & ~CONT_PTE_MASK) == 0) && > > - (flags & NO_CONT_MAPPINGS) == 0) > > + (IS_ENABLED(CONFIG_RODATA_FULL_USE_PTE_CONT) || (flags & NO_CONT_MAPPINGS) == 0)) > > __prot = __pgprot(pgprot_val(prot) | PTE_CONT); > > > > init_pte(pmdp, addr, next, phys, __prot); > > diff --git a/arch/arm64/mm/pageattr.c b/arch/arm64/mm/pageattr.c > > index a3bacd7..88a87eb 100644 > > --- a/arch/arm64/mm/pageattr.c > > +++ b/arch/arm64/mm/pageattr.c > > @@ -99,6 +99,15 @@ static int change_memory_common(unsigned long addr, int numpages, > > if (rodata_full && (pgprot_val(set_mask) == PTE_RDONLY || > > pgprot_val(clear_mask) == PTE_RDONLY)) { > > for (i = 0; i < area->nr_pages; i++) { > > +#ifdef CONFIG_RODATA_FULL_USE_PTE_CONT > > + unsigned long cont_pte_low_bound; > > + unsigned long addr; > > + > > + addr = (u64)page_address(area->pages[i]); > > + cont_pte_low_bound = addr & CONT_PTE_MASK; > > + __change_memory_common(cont_pte_low_bound, > > + (~CONT_PTE_MASK + 1), __pgprot(0) , __pgprot(PTE_CONT)); > > +#endif > > __change_memory_common((u64)page_address(area->pages[i]), > > PAGE_SIZE, set_mask, clear_mask); > > } > > -- > > 1.9.1 > >