Received: by 2002:a25:5b86:0:0:0:0:0 with SMTP id p128csp388932ybb; Thu, 28 Mar 2019 04:45:51 -0700 (PDT) X-Google-Smtp-Source: APXvYqxkDG+O6f+NJe17cBuvOyN0pWbDnYrQxL+QBYcTKURRkm1Q9E5E8N5WmTM1LKQKmcG2qriB X-Received: by 2002:a17:902:8f92:: with SMTP id z18mr34417343plo.123.1553773551215; Thu, 28 Mar 2019 04:45:51 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1553773551; cv=none; d=google.com; s=arc-20160816; b=hAytmOfSY+S4bf78ZHJffw/DkeT/Z5ts4lfPoN+Y5UmlkZahoDn2On6NX6sI+hS9t6 N1rY5GYjPBWzeYTYR1HdoO0gFyT1+nxXqQrjjOgFa24zGfrszl1+PeecrZpM7Qs3vybm 9VxOQxdCfXa0cT7SNJqxB4JCAU+uGRMm0z0yDEsOZ6zycxVHN5D22H9SUDJgYeR+2OnR cTfk15QYgXEkhPMnSBZORWPlnnGe5sV9s4c5GYRMw5hLicgwaGiUEcn6q7m6YduPcw3+ hHuMb/5cIsRWE5XALWSbf1LONIXrTrtwCWeMpSy/T+1fKDLdunWXnX78V7dRBulwa+4m i6Hw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=mX3ZNmbz3E6bL5fB7CSied1YiiuhskRa10DQGDO03lU=; b=YXNseI/GRUkimbjve8RxFFaqDYY6a9wQGcYoGnjHw9QOnr5M7mdHqKDzUayJ7FN88M sD2EELb7mbRPt2FRYPKsP8OkjwGfS1pCoGyZqTRKm6lHQNHT/ALVGZKU2N8X1Nl/rEoe w3ON47fpJsW+siyo2Pf+eX8lbbo+kTqU+fE4xhBuA24v6U+UDN6rSbMR2OD3ujg9ZEQo nC7xAQUmXECFW9AzF5kI14XPSwgjpLhqvalfp0CY33TSjLnPjBf/E14X+0qoL+7Z/3ai 4h0lkK7H+AfAn1hO4mVNHWwo0bVVU2rUg6w3p/wvV7+5Lz9an1cBKDqE4wN573y6aAPG lxXw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id k1si20736276pfi.25.2019.03.28.04.45.35; Thu, 28 Mar 2019 04:45:51 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726593AbfC1LnD (ORCPT + 99 others); Thu, 28 Mar 2019 07:43:03 -0400 Received: from mail-pl1-f195.google.com ([209.85.214.195]:33754 "EHLO mail-pl1-f195.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726439AbfC1LnD (ORCPT ); Thu, 28 Mar 2019 07:43:03 -0400 Received: by mail-pl1-f195.google.com with SMTP id bg8so4826906plb.0 for ; Thu, 28 Mar 2019 04:43:02 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=mX3ZNmbz3E6bL5fB7CSied1YiiuhskRa10DQGDO03lU=; b=i7UZ78OAtSb6ZZ0KIa1GJPZ95yQFApcFN9xutCgK3xOiAoLm6RMnhOZxqyWjdQ4gZI jfaZL5rwA4HML7yymMuqA2nagxjSUMjslQuB0DhxVyPK+XJuZZTJ6CJKAAWHkPcEKNkY o1jsZAWJy0EGtoxIDu48bxO1EnQzJQGEF69wITvNZiZf64Mgl1h9zdPlN0d3JK9B35Lm VrozAWJV225IhtRqcdehkW1nuDVWaQBBUVnD2fwMKTOmtDMRVvhhIr9uvsDiN+PWVS/g i654jPaj2GC0k79x4V2qeK9g5z48KKrk3IGBqH9+eAaVGPOr1QXU97jAIVGsLKYrY2Ta ManQ== X-Gm-Message-State: APjAAAXiECC96jBhjmhzl0n1r//m5zpgchRzje+LLfkLfkcMirNccwTX 547cajoAjFyq/wb6l1xJs+q2gA== X-Received: by 2002:a17:902:bb92:: with SMTP id m18mr25270650pls.316.1553773382483; Thu, 28 Mar 2019 04:43:02 -0700 (PDT) Received: from localhost.localdomain ([182.69.197.66]) by smtp.gmail.com with ESMTPSA id i24sm19437197pfo.85.2019.03.28.04.42.57 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Thu, 28 Mar 2019 04:43:01 -0700 (PDT) Subject: Re: [PATCH v3 1/3] arm64, vmcoreinfo : Append 'PTRS_PER_PGD' to vmcoreinfo To: James Morse Cc: linux-kernel@vger.kernel.org, bhupesh.linux@gmail.com, Mark Rutland , Will Deacon , Dave Anderson , Kazuhito Hagio , kexec@lists.infradead.org, linux-arm-kernel@lists.infradead.org, Steve Capper , Kazuhito Hagio References: <1553058574-18606-1-git-send-email-bhsharma@redhat.com> <1553058574-18606-2-git-send-email-bhsharma@redhat.com> <2757805b-61cb-8f4a-1917-0c57012f09df@arm.com> From: Bhupesh Sharma Message-ID: <58c6cda9-9fd4-3b3e-740a-7b9b80b1f634@redhat.com> Date: Thu, 28 Mar 2019 17:12:54 +0530 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.2.1 MIME-Version: 1.0 In-Reply-To: <2757805b-61cb-8f4a-1917-0c57012f09df@arm.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi James, Thanks for your review. Please see my comments inline: On 03/26/2019 10:06 PM, James Morse wrote: > Hi Bhupesh, > > On 20/03/2019 05:09, Bhupesh Sharma wrote: >> With ARMv8.2-LVA architecture extension availability, arm64 hardware >> which supports this extension can support a virtual address-space upto >> 52-bits. >> >> Since at the moment we enable the support of this extension in kernel >> via CONFIG flags, e.g. >> - User-space 52-bit LVA via CONFIG_ARM64_USER_VA_BITS_52 >> >> so, there is no clear mechanism in the user-space right now to >> determine these CONFIG flag values and hence determine the maximum >> virtual address space supported by the underlying kernel. >> >> User-space tools like 'makedumpfile' therefore are broken currently >> as they have no proper method to calculate the 'PTRS_PER_PGD' value >> which is required to perform a page table walk to determine the >> physical address of a corresponding virtual address found in >> kcore/vmcoreinfo. >> >> If one appends 'PTRS_PER_PGD' number to vmcoreinfo for arm64, >> it can be used in user-space to determine the maximum virtual address >> supported by underlying kernel. > > I don't think this really solves the problem, it feels fragile. > > I can see how vmcoreinfo tells you VA_BITS==48, PAGE_SIZE==64K and PTRS_PER_PGD=1024. > You can use this to work out that the top level page table size isn't consistent with a > 48bit VA, so 52bit VA must be in use... > > But wasn't your problem walking the kernel page tables? In particular the offset that we > apply because the tables were based on a 48bit VA shifted up in swapper_pg_dir. > > Where does the TTBR1_EL1 offset come from with this property? I assume makedumpfile > hard-codes it when it sees 52bit is in use ... somewhere. > We haven't solved the problem! But isn't the TTBR1_EL1 offset already appended by the kernel via e842dfb5a2d3 ("arm64: mm: Offset TTBR1 to allow 52-bit PTRS_PER_PGD") in case of kernel configuration where 52-bit userspace VAs are possible. I copy a snippet from the git log of the above from Steve, which explains the above: " In other words a 48-bit kernel virtual address will have a different pgd_index when using PTRS_PER_PGD = 64 and 1024. If, however, we note that: kva = 0xFFFF << 48 + lower (where lower[63:48] == 0b) and, PGDIR_SHIFT = 42 (as we are dealing with 64KB PAGE_SIZE) We can consider: (kva >> PGDIR_SHIFT) & (1024 - 1) - (kva >> PGDIR_SHIFT) & (64 - 1) = (0xFFFF << 6) & 0x3FF - (0xFFFF << 6) & 0x3F // "lower" cancels out = 0x3C0 In other words, one can switch PTRS_PER_PGD to the 52-bit value globally provided that they increment ttbr1_el1 by 0x3C0 * 8 = 0x1E00 bytes when running with 48-bit kernel VAs (TCR_EL1.T1SZ = 16). For kernel configuration where 52-bit userspace VAs are possible, this patch offsets ttbr1_el1 and sets PTRS_PER_PGD corresponding to the 52-bit value. " Accordingly we have the following assembler helper in 'arch/arm64/include/asm/assembler.h': .macro offset_ttbr1, ttbr #ifdef CONFIG_ARM64_52BIT_VA orr \ttbr, \ttbr, #TTBR1_BADDR_4852_OFFSET #endif .endm where: #ifdef CONFIG_ARM64_52BIT_VA /* Must be at least 64-byte aligned to prevent corruption of the TTBR */ #define TTBR1_BADDR_4852_OFFSET (((UL(1) << (52 - PGDIR_SHIFT)) - \ (UL(1) << (48 - PGDIR_SHIFT))) * 8) #endif And before any load TTBR1 operation in the kernel we offset ttbr1_el1, in case CONFIG_ARM64_52BIT_VA is true, for e.g in 'arch/arm64/kernel/head.S': ENTRY(__enable_mmu) <..snip..> offset_ttbr1 x1 msr ttbr1_el1, x1 // load TTBR1 <..snip..> So, the user-space (makedumpfile, for e.g.), just needs to know the PTRS_PER_PGD value and it can calculate the pgd_index for a virtual address using the following formula: pgd_index(vaddr) = (((vaddr) >> PGDIR_SHIFT) & (PTRS_PER_PGD - 1)); Note that the above computation holds true both for PTRS_PER_PGD = 64 (48-bit kernel with 48-bit User VA) and 1024 (48-bit with 52-bit User VA) cases. And these are the configurations for which we are trying to fix the user-space regressions reported (on arm64) recently. > Today __cpu_setup() sets T0SZ and T1SZ differently for 52bit VA, but in the future it > could set them the same, or different the other-way-round. > > Will makedumpfile using this value keep working once T1SZ is 52bit VA too? In this case > there would be no ttbr offset. > > If you need another vmcoreinfo flag once that happens, we've done something wrong here. I am currently experimenting with Steve's patches for 52-bit kernel VA () and will comment more on the same when I am able to get the user-space utilities like makedumpfile and kexec-tools to work with the same on both ARMv8 Fast Simulator model and older CPUs which don't support ARMv8.2 extensions. However, I think we should not hold up fixes for regressions already reported, because the 52-bit kernel VA changes probably still need some more rework. > (Not to mention what happens if the TTBR1_EL1 uses 52bit va, but TTBR0_EL1 doesn't) I am wondering if there are any real users of the above combination. So far, I have generally come across discussions where the following variations of the address spaces have been proposed/requested: - 48bit kernel VA + 48-bit User VA, - 48-bit kernel VA + 52-bit User VA, - 52-bit kernel VA + 52-bit User VA. Thanks, Bhupesh >> Suggested-by: Steve Capper > > (CC: +Steve) > > > Thanks, > > James >