Received: by 2002:a25:4158:0:0:0:0:0 with SMTP id o85csp473352yba; Fri, 12 Apr 2019 07:17:50 -0700 (PDT) X-Google-Smtp-Source: APXvYqzUHag5NCkhX+0p6lDUrnak/fWdeQW9HTtIL7Rnz6cxriKGhCmW7aYgN4V41Cf2fZOyhQ/p X-Received: by 2002:a63:5149:: with SMTP id r9mr51531429pgl.177.1555078670546; Fri, 12 Apr 2019 07:17:50 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1555078670; cv=none; d=google.com; s=arc-20160816; b=xoFmrY7qrWzjN6+eXiRDNto4E7s+TrR9fH9eCVHSW9IJ1mcBxQeWT6AxksDTubHY7b jXc35HXoQ6APVeY8Xmjru2vK4/LZAAq/v5biebKU9DFF9yeEDUJx8clw21VlNPB42mL3 EYj02b7ba2PDBwbUkOtoAqx4IL+s6ZrRCaVP14PXAp3XeY/LSnaFtyKa22YCzdbrZxww xdKpUplDVEKLUKuTxX8tuaw5vnzYTJFyRqS13vYMPrA2KbH6wGK+XYc+Set1JbQMql8X G2zAEZHtCKfdLzr3VOdpXEWyXkVQrsdJ1AViZeLzrTBWNc5qFxj0ReltuS18g5mn9mbM CdGw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=3MMU6aQ0kc9WSYs729+oGjeFb16eYqurWXI4diGGqtk=; b=XKtrqKZWqhHjA8ATFeyolQSfOl21pPbL0oZ4glEClxmvSWXLggUbAcwb42usk9QouB +MAq9+P4yQH8YZD9SFiYfdTD83e6xwC4uoHfWFEWQu+DOovnk8zPIA5J2kBwAjyt5RQ/ kf86yiX0bS8KARxMG9RXm+s4r6c8Kg9quxTwwWwOafRXToqNVhIkwsdngzAWvn6HmGUL X7Stti9P3yiRQ3384y4jNQb2wjhNqD2NnlSqg5DQY63fxlExfCdWFK3NzT+CUsU+9P4+ wCFbUaqSLyEeXvokM1clJLlG/SMAt9C3K/BsAJf4jvwAA68mFFzMBFhTbwNF/QlsX+Qw YPzQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 1si27358076plo.217.2019.04.12.07.17.34; Fri, 12 Apr 2019 07:17:50 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726998AbfDLOQe (ORCPT + 99 others); Fri, 12 Apr 2019 10:16:34 -0400 Received: from foss.arm.com ([217.140.101.70]:33794 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726714AbfDLOQe (ORCPT ); Fri, 12 Apr 2019 10:16:34 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.72.51.249]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 72D17374; Fri, 12 Apr 2019 07:16:33 -0700 (PDT) Received: from [10.1.199.35] (e107154-lin.cambridge.arm.com [10.1.199.35]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 9D0813F557; Fri, 12 Apr 2019 07:16:23 -0700 (PDT) Subject: Re: [PATCH v2 2/4] arm64: Define Documentation/arm64/elf_at_flags.txt To: Catalin Marinas Cc: Vincenzo Frascino , linux-arm-kernel@lists.infradead.org, linux-doc@vger.kernel.org, linux-mm@kvack.org, linux-arch@vger.kernel.org, linux-kselftest@vger.kernel.org, linux-kernel@vger.kernel.org, Alexander Viro , Alexei Starovoitov , Andrew Morton , Andrey Konovalov , Arnaldo Carvalho de Melo , Branislav Rankov , Chintan Pandya , Daniel Borkmann , Dave Martin , "David S. Miller" , Dmitry Vyukov , Eric Dumazet , Evgeniy Stepanov , Graeme Barnes , Greg Kroah-Hartman , Ingo Molnar , Jacob Bramley , Kate Stewart , Kees Cook , "Kirill A . Shutemov" , Kostya Serebryany , Lee Smith , Luc Van Oostenryck , Mark Rutland , Peter Zijlstra , Ramana Radhakrishnan , Robin Murphy , Ruben Ayrapetyan , Shuah Khan , Steven Rostedt , Szabolcs Nagy , Will Deacon References: <20190318163533.26838-1-vincenzo.frascino@arm.com> <20190318163533.26838-3-vincenzo.frascino@arm.com> <859341c2-b352-e914-312a-d3de652495b6@arm.com> <20190403165031.GE34351@arrakis.emea.arm.com> From: Kevin Brodsky Message-ID: Date: Fri, 12 Apr 2019 15:16:19 +0100 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.5.3 MIME-Version: 1.0 In-Reply-To: <20190403165031.GE34351@arrakis.emea.arm.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Content-Language: en-GB Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 03/04/2019 17:50, Catalin Marinas wrote: > On Fri, Mar 22, 2019 at 03:52:49PM +0000, Kevin Brodsky wrote: >> On 18/03/2019 16:35, Vincenzo Frascino wrote: >>> +2. Features exposed via AT_FLAGS >>> +-------------------------------- >>> + >>> +bit[0]: ARM64_AT_FLAGS_SYSCALL_TBI >>> + >>> + On arm64 the TCR_EL1.TBI0 bit has been always enabled on the arm64 >>> + kernel, hence the userspace (EL0) is allowed to set a non-zero value >>> + in the top byte but the resulting pointers are not allowed at the >>> + user-kernel syscall ABI boundary. >>> + When bit[0] is set to 1 the kernel is advertising to the userspace >>> + that a relaxed ABI is supported hence this type of pointers are now >>> + allowed to be passed to the syscalls, when these pointers are in >>> + memory ranges privately owned by a process and obtained by the >>> + process in accordance with the definition of "valid tagged pointer" >>> + in paragraph 3. >>> + In these cases the tag is preserved as the pointer goes through the >>> + kernel. Only when the kernel needs to check if a pointer is coming >>> + from userspace an untag operation is required. >> I would leave this last sentence out, because: >> 1. It is an implementation detail that doesn't impact this user ABI. >> 2. It is not entirely accurate: untagging the pointer may be needed for >> various kinds of address lookup (like finding the corresponding VMA), at >> which point the kernel usually already knows it is a userspace pointer. > I fully agree, the above paragraph should not be part of the user ABI > document. > >>> +3. ARM64_AT_FLAGS_SYSCALL_TBI >>> +----------------------------- >>> + >>> +From the kernel syscall interface prospective, we define, for the purposes >>> +of this document, a "valid tagged pointer" as a pointer that either it has >>> +a zero value set in the top byte or it has a non-zero value, it is in memory >>> +ranges privately owned by a userspace process and it is obtained in one of >>> +the following ways: >>> + - mmap() done by the process itself, where either: >>> + * flags = MAP_PRIVATE | MAP_ANONYMOUS >>> + * flags = MAP_PRIVATE and the file descriptor refers to a regular >>> + file or "/dev/zero" >>> + - a mapping below sbrk(0) done by the process itself >> I don't think that's very clear, this doesn't say how the mapping is >> obtained. Maybe "a mapping obtained by the process using brk() or sbrk()"? > I think what we mean here is anything in the "[heap]" section as per > /proc/*/maps (in the kernel this would be start_brk to brk). > >>> + - any memory mapped by the kernel in the process's address space during >>> + creation and following the restrictions presented above (i.e. data, bss, >>> + stack). >> With the rules above, the code section is included as well. Replacing "i.e." >> with "e.g." would avoid having to list every single section (which is >> probably not a good idea anyway). > We could mention [stack] explicitly as that's documented in the > Documentation/filesystems/proc.txt and it's likely considered ABI > already. > > The code section is MAP_PRIVATE, and can be done by the dynamic loader > (user process), so it falls under the mmap() rules listed above. I guess > we could simply drop "done by the process itself" here and allow > MAP_PRIVATE|MAP_ANONYMOUS or MAP_PRIVATE of regular file. This would > cover the [heap] and [stack] and we won't have to debate the brk() case > at all. That's probably the best option. I initially used this wording because I was worried that there could be cases where the kernel allocates "magic" memory for userspace that is MAP_PRIVATE|MAP_ANONYMOUS, but in fact it's probably not the case (presumably such mapping should always be done via install_special_mapping(), which is definitely not MAP_PRIVATE). > We probably mention somewhere (or we should in the tagged pointers doc) > that we don't support tagged PC. I think that Documentation/arm64/tagged-pointers.txt already makes it reasonably clear (anyway, with the architecture not supporting it, you can't expect much from the kernel). Kevin