Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp3727038imu; Mon, 10 Dec 2018 06:59:19 -0800 (PST) X-Google-Smtp-Source: AFSGD/W1tvGsqf2C5mnDOnyAgxG3IL+7DjZrotXc/1WjN0iPGDGOq9S3jRIJpL2Kmjh7WqjtI2SR X-Received: by 2002:a63:a16:: with SMTP id 22mr11201143pgk.318.1544453959677; Mon, 10 Dec 2018 06:59:19 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1544453959; cv=none; d=google.com; s=arc-20160816; b=FFDITs1vRdxkNZIapUx76dOycoWz7qU3Gin03CNLe2vID8PiwynewGAiyHRgm58q4z bJk0WcH1l/ZOowcEfjS/I+q3Ifgx6bd3ZzVwUYHg8Gz5tvCcaUl4/QGju4yI71dLy0/R 3ElO3NN6/oS3Q/X68vC4nDPVdFc7XOf7Y3dLp6Zk/wp/ho7gFnkQNJGcAfycDNTcil6H 358VK4Rtdxi8pdvOA/FrkEIrcL1n/jpH6eSDpO3dhTQqwUqvLW5Y7lInAw5hQouORJ3K aVryxOWpwYKwTH7fksz8qi2IZb0okKBKGo872n9INBS/Vb/fpPyy1TJcBE93yopO0v3Y Gyxg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from; bh=IZ8689zs104k5ulxQyF+gZD7raETzVJZ63Tyumg5Y6Q=; b=ZWDgx1QQAP5D/OVBRSmjALNQ81nfsGwRiiczc8QLponJu8NGlYHuDk4gx2wAcT75GP A6l6gaDGl+3QXO6DNIiJOc0nrnbBJqd+W+2G1b68X+VTKtbM+Hb+RX49aWtbUZuNBdf3 blfb56gKWiqBIshVQF9dFuH5aHRR/MXzs2nMXr2VZIAIWU/snBkm47G67Wpct2VjFrPE ngc3pNr9EkKxY8tUW5AdQCJvSqnsGnbBlZ6OdfbP7AUvbisdvvpC1GebqWDLgMnVnFbb YEYCL9aO986NjTZ6/SKDAoc0IC8bevEtfDzyFbqF7PDIaOM0DNb2tf6ny/KzcEmbfn69 NfQA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id p3si9759907pgi.0.2018.12.10.06.59.03; Mon, 10 Dec 2018 06:59:19 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727968AbeLJObb (ORCPT + 99 others); Mon, 10 Dec 2018 09:31:31 -0500 Received: from usa-sjc-mx-foss1.foss.arm.com ([217.140.101.70]:55210 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726791AbeLJOba (ORCPT ); Mon, 10 Dec 2018 09:31:30 -0500 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.72.51.249]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 4A14D165C; Mon, 10 Dec 2018 06:31:29 -0800 (PST) Received: from e119884-lin.cambridge.arm.com (e119884-lin.cambridge.arm.com [10.1.196.72]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id DA3393F575; Mon, 10 Dec 2018 06:31:24 -0800 (PST) From: Vincenzo Frascino To: linux-arm-kernel@lists.infradead.org, linux-doc@vger.kernel.org, linux-mm@kvack.org, linux-arch@vger.kernel.org, linux-kselftest@vger.kernel.org, linux-kernel@vger.kernel.org Cc: Catalin Marinas , Will Deacon , Mark Rutland , Robin Murphy , Kees Cook , Kate Stewart , Greg Kroah-Hartman , Andrew Morton , Ingo Molnar , "Kirill A . Shutemov" , Shuah Khan , Chintan Pandya , Jacob Bramley , Ruben Ayrapetyan , Andrey Konovalov , Lee Smith , Kostya Serebryany , Dmitry Vyukov , Ramana Radhakrishnan , Luc Van Oostenryck , Evgeniy Stepanov , Alexander Viro Subject: [RFC][PATCH 2/3] arm64: Define Documentation/arm64/elf_at_flags.txt Date: Mon, 10 Dec 2018 14:30:43 +0000 Message-Id: <20181210143044.12714-3-vincenzo.frascino@arm.com> X-Mailer: git-send-email 2.19.2 In-Reply-To: <20181210143044.12714-1-vincenzo.frascino@arm.com> References: <20181210143044.12714-1-vincenzo.frascino@arm.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On arm64 the TCR_EL1.TBI0 bit has been set since Linux 3.x hence the userspace (EL0) is allowed to set a non-zero value in the top byte but the resulting pointers are not allowed at the user-kernel syscall ABI boundary. With the relaxed ABI proposed through this document, it is now possible to pass tagged pointers to the syscalls, when these pointers are in memory ranges obtained by an anonymous (MAP_ANONYMOUS) mmap() or brk(). This change in the ABI requires a mechanism to inform the userspace that such an option is available. This patch specifies and documents the way on which AT_FLAGS can be used to advertise this feature to the userspace. Cc: Catalin Marinas Cc: Will Deacon CC: Andrey Konovalov Signed-off-by: Vincenzo Frascino --- Documentation/arm64/elf_at_flags.txt | 111 +++++++++++++++++++++++++++ 1 file changed, 111 insertions(+) create mode 100644 Documentation/arm64/elf_at_flags.txt diff --git a/Documentation/arm64/elf_at_flags.txt b/Documentation/arm64/elf_at_flags.txt new file mode 100644 index 000000000000..153e657c058a --- /dev/null +++ b/Documentation/arm64/elf_at_flags.txt @@ -0,0 +1,111 @@ +ARM64 ELF AT_FLAGS +================== + +This document describes the usage and semantics of AT_FLAGS on arm64. + +1. Introduction +--------------- + +AT_FLAGS is part of the Auxiliary Vector, contains the flags and it +is currently set to zero by the kernel on arm64. + +The auxiliary vector can be accessed by the userspace using the +getauxval() API provided by the C library. +getauxval() returns an unsigned long and when a flag is present in +the AT_FLAGS, the corresponding bit in the returned value is set to 1. + +The AT_FLAGS with a "defined semantic" on arm64 are exposed to the +userspace via user API (uapi/asm/atflags.h). +The AT_FLAGS bits with "undefined semantics" are set to zero by default. +This means that the AT_FLAGS bits to which this document does not assign +an explicit meaning are to be intended reserved for future use. +The kernel will populate all such bits with zero until meanings are +assigned to them. If and when meanings are assigned, it is guaranteed +that they will not impact the functional operation of existing userspace +software. Userspace software should ignore any AT_FLAGS bit whose meaning +is not defined when the software is written. + +The userspace software can test for features by acquiring the AT_FLAGS +entry of the auxiliary vector, and testing whether a relevant flag +is set. + +Example of a userspace test function: + +bool feature_x_is_present(void) +{ + unsigned long at_flags = getauxval(AT_FLAGS); + if (at_flags & FEATURE_X) + return true; + + return false; +} + +Where the software relies on a feature advertised by AT_FLAGS, it +should check that the feature is present before attempting to +use it. + +2. Features exposed via AT_FLAGS +-------------------------------- + +bit[0]: ARM64_AT_FLAGS_SYSCALL_TBI + + On arm64 the TCR_EL1.TBI0 bit has been set since Linux 3.x hence + the userspace (EL0) is allowed to set a non-zero value in the top + byte but the resulting pointers are not allowed at the user-kernel + syscall ABI boundary. + When bit[0] is set to 1 the kernel is advertising to the userspace + that a relaxed ABI is supported hence this type of pointers are now + allowed to be passed to the syscalls, when these pointers are in + memory ranges obtained by anonymous (MAP_ANONYMOUS) mmap() or brk(). + In these cases the tag is preserved as the pointer goes through the + kernel. Only when the kernel needs to check if a pointer is coming + from userspace (i.e. access_ok()) an untag operation is required. + +3. ARM64_AT_FLAGS_SYSCALL_TBI +----------------------------- + +When ARM64_AT_FLAGS_SYSCALL_TBI is enabled every syscalls can accept tagged +pointers, when these pointers are in memory ranges obtained by an anonymous +(MAP_ANONYMOUS) mmap() or brk(). + +A definition of the meaning of tagged pointers on arm64 can be found in: +Documentation/arm64/tagged-pointers.txt. + +When a pointer does not are in a memory range obtained by an anonymous mmap() +or brk(), this can not be passed to a syscall if it is tagged. + +To be more explicit: a syscall can accept pointers whose memory range is +obtained by a non-anonymous mmap() or brk() if and only if the tag encoded in +the top-byte is 0x00. + +When a new syscall is added, this can accept tagged pointers if and only if +these pointers are in memory ranges obtained by an anonymous (MAP_ANONYMOUS) +mmap() or brk(). In all the other cases, the tag encoded in the top-byte is +expected to be 0x00. + +Example of correct usage (pseudo-code) for a userspace application: + +bool arm64_syscall_tbi_is_present(void) +{ + unsigned long at_flags = getauxval(AT_FLAGS); + if (at_flags & ARM64_AT_FLAGS_SYSCALL_TBI) + return true; + + return false; +} + +void main(void) +{ + char *addr = mmap(NULL, PAGE_SIZE, PROT_READ | PROT_WRITE, + MAP_ANONYMOUS, -1, 0); + + /* Check if the relaxed ABI is supported */ + if (arm64_syscall_tbi_is_present()) { + /* Add a tag to the pointer and to the memory */ + addr = tag_pointer_and_memory(addr); + } + + /* Write to memory */ + strcpy("Hello World\n", addr); +} + -- 2.19.2