Received: by 2002:ab2:60d1:0:b0:1f7:5705:b850 with SMTP id i17csp980829lqm; Thu, 2 May 2024 01:20:17 -0700 (PDT) X-Forwarded-Encrypted: i=3; AJvYcCXa7j1v7UbHqpAxXCwZXc4umVgNo5X2SKUDpAVR6RGtrAxqRq7Bzx5TUrh6op6ljJM/aAAxM8HKvfRQq21iIfP/p6C5swuPjXH6UnAFTA== X-Google-Smtp-Source: AGHT+IFqBIAw+bAN32FekcbN60ouF8kojiUjucUiEW48s/2O6iqtvKWBpQBKMOcWV+zxV74GQ5A7 X-Received: by 2002:a05:620a:490d:b0:790:c69a:d3f3 with SMTP id vy13-20020a05620a490d00b00790c69ad3f3mr5239260qkn.15.1714638017150; Thu, 02 May 2024 01:20:17 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1714638017; cv=pass; d=google.com; s=arc-20160816; b=UFmkaVj1Mnt4kJu1OxoY4yjAT+RZjWLIqF1I+dkNGGNGcD/gnM+K54H0loihtiAnAh wpHMKvVpZSL/KpdFxu+qR1Dab1igir4G0x5Hib6nKfBeJCgHTdNMJR5aHP3N9DdtB76g WWJdcA4h7Qiwb0owCr1xRDkljhGixbqDExgP7+zNaUK3FoNnJN52em6YCmLJaVW4Uy/t MI1dU2rM9B8SDTRYH/c5JtE0NJ5fwCqxz42VCSbIwpxTjUJZFT8d//WqLJTcU9BclY8c nrRGDhFHAPWPo9YdqOVLXvGphSfASWAT+A3oYmUC9f/N8twYbOTJhaJ1nHGBQgvt7Pi6 w+WQ== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:in-reply-to:from:references:cc:to :content-language:subject:user-agent:mime-version:list-unsubscribe :list-subscribe:list-id:precedence:date:message-id; bh=g7MQo1RUYjQze/Mp45iIpPSqe7Wb7K7fK+NjMY5Sd48=; fh=0sI1SiHKjOXFQRse5lHBBiukz1g9ieVwiLfgI0S+TSo=; b=ng0Ue2mifm3Q4hO3Wok1yHtBXEAC115+kPgAX/oQxSy5sZPZOIEnss7vlVcH2WDkZ+ Q/aTl5x34VYtHnVqPRACUhNiwDTNPmCXsu+Vb7D6IO53b2R6pDYBNIyVOKDSLtvG0LPN bahwgE56dIaxApftsCYx4pQsMzfdz+z2lROYexthFYTC4ipjbngFNic1kH4XMzUgOshI 6E4ZPXZmZ+VVU0vGRE9r2OauWi9u8sGUHFuPh6r1GjZOHIscckG7O5M4OXF+Sj4ACQ3L k+oTOcPqFpJabwexPAg64y2g/a4Yka/VwyqIC6CMWU08vy0SoQJXKHkVU31u85/9NA69 sJdw==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; arc=pass (i=1 spf=pass spfdomain=arm.com dmarc=pass fromdomain=arm.com); spf=pass (google.com: domain of linux-kernel+bounces-166106-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.199.223 as permitted sender) smtp.mailfrom="linux-kernel+bounces-166106-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from ny.mirrors.kernel.org (ny.mirrors.kernel.org. [147.75.199.223]) by mx.google.com with ESMTPS id g25-20020a05620a219900b00790f65b82e0si410570qka.630.2024.05.02.01.20.16 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 02 May 2024 01:20:17 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel+bounces-166106-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.199.223 as permitted sender) client-ip=147.75.199.223; Authentication-Results: mx.google.com; arc=pass (i=1 spf=pass spfdomain=arm.com dmarc=pass fromdomain=arm.com); spf=pass (google.com: domain of linux-kernel+bounces-166106-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.199.223 as permitted sender) smtp.mailfrom="linux-kernel+bounces-166106-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ny.mirrors.kernel.org (Postfix) with ESMTPS id C3B301C218B4 for ; Thu, 2 May 2024 08:20:16 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 62A3A32C8E; Thu, 2 May 2024 08:20:11 +0000 (UTC) Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 339AF1BF54 for ; Thu, 2 May 2024 08:20:07 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=217.140.110.172 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1714638010; cv=none; b=MBYy/mp7k3OYtyeEuy3GTGBUlUzFtUZfaBg5wsrl+uaFAKe7mv72Yd9aOnJxE+scDH4blh0nNxC5vHQa0ME8oEF/cusa8yltIylgxXH/GD8f5Aq7huYwnbP/M6aD7oqbEp/ELojl1cjEnsxzISRjh5UFRcI3WDpCfHkUlM3Gcx0= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1714638010; c=relaxed/simple; bh=fw0KIOzlehbte2mP913i3LuGFlmrb3tLqg7bZQFpkhI=; h=Message-ID:Date:MIME-Version:Subject:To:Cc:References:From: In-Reply-To:Content-Type; b=q6bsmqEtBGNR9ovrFEW9E/neP54Ai4q61VnuQljBUhzVOuycSrlaZrFZtylZ4MBE0w25oYNeu1Cx7Q4C7C+Mk06BT7rGoxpcTY4EqfTVjCtKSlYPE71aV2PYKuCWs3hi1gk/mM26iObQCHW2eJAsnI6VFjNgvNxLPxxitjmWG2U= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=arm.com; spf=pass smtp.mailfrom=arm.com; arc=none smtp.client-ip=217.140.110.172 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=arm.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=arm.com Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 1B0292F4; Thu, 2 May 2024 01:20:33 -0700 (PDT) Received: from [10.57.65.146] (unknown [10.57.65.146]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id AA3E63F71E; Thu, 2 May 2024 01:20:05 -0700 (PDT) Message-ID: <9b2d91ac-3266-4173-a525-bbc7b087a482@arm.com> Date: Thu, 2 May 2024 09:20:04 +0100 Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH v3 2/3] arm64/mm: Move PTE_PRESENT_INVALID to overlay PTE_NG Content-Language: en-GB To: Anshuman Khandual , Catalin Marinas , Will Deacon , Joey Gouly , Ard Biesheuvel , Mark Rutland , David Hildenbrand , Peter Xu , Mike Rapoport , Shivansh Vij Cc: linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org References: <20240501145419.1390363-1-ryan.roberts@arm.com> <20240501145419.1390363-3-ryan.roberts@arm.com> <5ac445b2-5c11-407c-87d2-4ede8e212d71@arm.com> From: Ryan Roberts In-Reply-To: <5ac445b2-5c11-407c-87d2-4ede8e212d71@arm.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit On 02/05/2024 07:12, Anshuman Khandual wrote: > > > On 5/1/24 20:24, Ryan Roberts wrote: >> PTE_PRESENT_INVALID was previously occupying bit 59, which when a PTE is >> valid can either be IGNORED, PBHA[0] or AttrIndex[3], depending on the >> HW configuration. In practice this is currently not a problem because >> PTE_PRESENT_INVALID can only be 1 when PTE_VALID=0 and upstream Linux >> always requires the bit set to 0 for a valid pte. >> >> However, if in future Linux wants to use the field (e.g. AttrIndex[3]) >> then we could end up with confusion when PTE_PRESENT_INVALID comes along >> and corrupts the field - we would ideally want to preserve it even for >> an invalid (but present) pte. >> >> The other problem with bit 59 is that it prevents the offset field of a >> swap entry within a swap pte from growing beyond 51 bits. By moving >> PTE_PRESENT_INVALID to a low bit we can lay the swap pte out so that the >> offset field could grow to 52 bits in future. >> >> So let's move PTE_PRESENT_INVALID to overlay PTE_NG (bit 11). >> >> There is no need to persist NG for a present-invalid entry; it is always >> set for user mappings and is not used by SW to derive any state from the > > But the idea of present and invalid state is that all the HW used information > should be fetched successfully even though the the entry is not valid and not > being walked by the MMU. Setting and clearing PTE SW bits in such state, does > not change that, but tampering with HW bits would break the assumption around > present and invalid entry ? But we are not changing anything; NG is *always* set in valid-present user entries. And it continues to be *always* set in invalid-present user entries, because PTE_PRESENT_INVALID is the same bit. Additionally there are no helpers (that I could find) that read or write the NG bit so core-mm can't possibly be accidentally corrupting anything or reading back incorrect state. > >> pte. PTE_NS was considered instead of PTE_NG, but it is RES0 for >> non-secure SW, so there is a chance that future architecture may >> allocate the bit and we may therefore need to persist that bit for >> present-invalid ptes. > > If we are being careful around PTE_NS and even for AttrIndex[3] as mentioned > earlier to be useful during an invalid state, how can PTE_NG be used without > any such consideration. Are you suggesting that in future we may want to make global mappings available to user space? I think that's unlikely? The point about NG is that it is already architecturally defined in normal world and we are using it - kernel mappings are (usually) global, user mappings are not. So a new meaning for that bit can't come along unless we are willing to say that *all* mappings (inc kernel mappings) will become implicitly non-global. So my assertion that this bit can be reused like this (now and in future) is predicated on 1) all user mappings are and will continue to be non-global, 2) the bit will never be repurposed by the architecture (or if it is, Linux will never opt-in because we need per-pte (non-)global control). > >> >> These are both marginal benefits, but make things a bit tidier in my >> opinion. > > Apart from swap offset field expansion does this change achieve anything else ? It frees up bit 59 to be used for its architectural purpose in future. > >> >> Reviewed-by: Catalin Marinas >> Signed-off-by: Ryan Roberts >> --- >> arch/arm64/include/asm/pgtable-prot.h | 2 +- >> arch/arm64/include/asm/pgtable.h | 12 ++++++------ >> 2 files changed, 7 insertions(+), 7 deletions(-) >> >> diff --git a/arch/arm64/include/asm/pgtable-prot.h b/arch/arm64/include/asm/pgtable-prot.h >> index cd8c06f5fb02..3047d10987fd 100644 >> --- a/arch/arm64/include/asm/pgtable-prot.h >> +++ b/arch/arm64/include/asm/pgtable-prot.h >> @@ -18,7 +18,7 @@ >> #define PTE_DIRTY (_AT(pteval_t, 1) << 55) >> #define PTE_SPECIAL (_AT(pteval_t, 1) << 56) >> #define PTE_DEVMAP (_AT(pteval_t, 1) << 57) >> -#define PTE_PRESENT_INVALID (_AT(pteval_t, 1) << 59) /* only when !PTE_VALID */ >> +#define PTE_PRESENT_INVALID (PTE_NG) /* only when !PTE_VALID */ >> >> #define _PROT_DEFAULT (PTE_TYPE_PAGE | PTE_AF | PTE_SHARED) >> #define _PROT_SECT_DEFAULT (PMD_TYPE_SECT | PMD_SECT_AF | PMD_SECT_S) >> diff --git a/arch/arm64/include/asm/pgtable.h b/arch/arm64/include/asm/pgtable.h >> index c0f4471423db..7f1ff59c43ed 100644 >> --- a/arch/arm64/include/asm/pgtable.h >> +++ b/arch/arm64/include/asm/pgtable.h >> @@ -1254,15 +1254,15 @@ static inline pmd_t pmdp_establish(struct vm_area_struct *vma, >> * Encode and decode a swap entry: >> * bits 0-1: present (must be zero) >> * bits 2: remember PG_anon_exclusive >> - * bits 3-7: swap type >> - * bits 8-57: swap offset >> - * bit 59: PTE_PRESENT_INVALID (must be zero) >> + * bits 6-10: swap type >> + * bit 11: PTE_PRESENT_INVALID (must be zero) >> + * bits 12-61: swap offset >> */ >> -#define __SWP_TYPE_SHIFT 3 >> +#define __SWP_TYPE_SHIFT 6 >> #define __SWP_TYPE_BITS 5 >> -#define __SWP_OFFSET_BITS 50 >> #define __SWP_TYPE_MASK ((1 << __SWP_TYPE_BITS) - 1) >> -#define __SWP_OFFSET_SHIFT (__SWP_TYPE_BITS + __SWP_TYPE_SHIFT) >> +#define __SWP_OFFSET_SHIFT 12 >> +#define __SWP_OFFSET_BITS 50 >> #define __SWP_OFFSET_MASK ((1UL << __SWP_OFFSET_BITS) - 1) >> >> #define __swp_type(x) (((x).val >> __SWP_TYPE_SHIFT) & __SWP_TYPE_MASK)