Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754741AbbDNLvA (ORCPT ); Tue, 14 Apr 2015 07:51:00 -0400 Received: from vegas.theobroma-systems.com ([144.76.126.164]:41733 "EHLO mail.theobroma-systems.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753264AbbDNLuv convert rfc822-to-8bit (ORCPT ); Tue, 14 Apr 2015 07:50:51 -0400 Content-Type: text/plain; charset=utf-8 Mime-Version: 1.0 (Mac OS X Mail 8.2 \(2070.6\)) Subject: Re: [PATCH v4 00/24] ILP32 for ARM64 From: "Dr. Philipp Tomsich" In-Reply-To: <3234795.e0Uq9k2nUp@wuerfel> Date: Tue, 14 Apr 2015 13:50:21 +0200 Cc: "Pinski, Andrew" , linux-arm-kernel , "linux-kernel@vger.kernel.org" , Andreas Kraschitzer , Benedikt Huber , Catalin Marinas , Andrew Pinski , Kumar Sankaran , Christoph Muellner Content-Transfer-Encoding: 8BIT Message-Id: <76000FE9-46E5-4883-9E4F-C65444FD406C@theobroma-systems.com> References: <17844053.vZiPCu4un3@wuerfel> <6C99D1ED-37A9-4E68-A137-BF8F78720BFE@caviumnetworks.com> <3234795.e0Uq9k2nUp@wuerfel> To: Arnd Bergmann X-Mailer: Apple Mail (2.2070.6) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 6486 Lines: 122 > On 14 Apr 2015, at 13:14, Arnd Bergmann wrote: > > On Tuesday 14 April 2015 10:45:43 Pinski, Andrew wrote: >>> On Apr 14, 2015, at 3:08 AM, Arnd Bergmann wrote: >>> >>>> On Tuesday 14 April 2015 11:33:13 Dr. Philipp Tomsich wrote: >>>> Arnd, >>>> >>>> After getting a good night’s sleep, the “reuse the existing system call table” comment >>>> makes a little more sense as I construe it as having just one merged system call table >>>> for both LP64 and ILP32 and handling the differences through a different system call >>>> numbering in unistd.h towards LP64 and ILP32 processes. >>>> >>>> If this is the intended implementation, I am not fully sold on the benefit: having a private >>>> copy of unistd.h for ARM64 seems to be a less readable and less maintenance-friendly >>>> solution to having separate tables. >>>> >>>> We’re open to input on this and—if merging the system call tables is the consensus— >>>> would like to get the change underway as soon as possible. >>> >>> There are multiple ways of doing this: >>> >>> a) separate syscall table for arm64: as you say, this is the current approach, >>> and I'd like to avoid that too >>> b) add syscalls for ilp32 as additional numbers in the normal lp64 version of >>> asm-generic/unistd.h, and share the binary tables between ilp32 and lp64 >>> on aarch64 >>> c) change asm-generic/unistd.h to generate three possible tables: instead of >>> just native (lp64 or ilp32 depending on the arch), compat (support for >>> existing ilp32 binaries on some architectures, there would also be a >>> "modern" ilp32 variant that is a mix of the two, as your table today >>> d) don't use the asm-generic/unistd.h table for aarch64-ilp32 at all, but instead >>> reuse the table from arch/arm64/include/asm/unistd32.h >>> >>> I think you are referring to approach b) or c) above, but my preferred one >>> would actually be d). >> >> D is the worst of all 4 options in my mind. The reason is when a new syscall is >> added, then you have to update that file too. > > I don't know what the miscommunication is here, but the advantage of d is > specifically that it is /less/ work to maintain: With the current approach, > each new syscall that gets added needs to be checked to see if the normal > aarch64 version works or if it needs another wrapper, while with d) we > get the update for free, because we follow exactly what aarch32 is doing. I must agree with Andrew, that (d) seems like a bad fit for ILP32… after all the ILP32 (ELF) ABI specifies that 64bit values are to be passed in a single register, but the unistd32.h assumes a 32bit kernel that receives 64bit arguments split over two registers (i.e. the 64-variants of the various system calls, such as ftruncate64). I strongly prefer (b) as this satisfies the largest number of requirements: (i) it will be a single system call table for LP64 and ILP32 (ii) it’s easy to be make use of the 64bit capable system calls (iii) it fits with the relationship of ILP32 to LP64 in the ILP32 ELF ABI definition As we don’t support ILP32 without LP64, the resulting implementation will be even simpler (i.e. no need to duplicate the entire table) than the n32 implementation on MIPS... >> Also d is worse than the rest as >> you no longer default to 64bit off_t which is not a good thing. > > That decision is up to the libc implementation, just as it is for the existing > aarch32 libc. The kernel just offers both versions and the libc can pick > one, or use the _LARGEFILE64_SOURCE hack that glibc has to also implement > both. It would probably be reasonable to use 64-bit off_t only for a libc > and ignore the old calls. The glibc implementation, as we have it today, always uses the 64bit system call and performs overflow-checking on results (on ILP32, we can’t perform overflow checking on arguments, as the callee needs to sign-extend). In other words: glibc uses the LP64 system calls and handles any pre- and post-processing in the system-call wrappers. >> B is just as bad and goes against using the generic syscall numbers. > > How so? The newly introduce syscalls then would be the generic ones. By applying the “strace-test” (i.e. “How will this affect the implementation of strace, when considering a ILP32-compiled and a LP64-compiled strace where both should be capable of tracing either ILP32 or LP64 targets?”), option (b) appears the cleanest choice, as no test on the dependent’s ABI would be necessary and all internal dispatching could be performed on syscall numbers alone. Using the same "strace-test", option (d) would be the least preferred, as it will make the internal dispatching entirely dependent on the dependent’s ABI. >> I was trying to model ilp32 so there was less maintain hassle if a new syscall was added. >> >> Also about time_t, my original patch had used 32bit but was asked to change >> it to the 64bit one. So now I am upset this being asked again to change it back. >> The review process for the linux kernel is much harder than the review process >> of gcc or even glibc now. > > For now, I'm just opening that discussion again, but the reason this > comes up again now is that a lot has happened in the meantime on this > front, and we have already decided to merge new architecture ports with > 32-bit time_t since. I think my sloppy e-mail writing blew this out of proportion: I never intended to focus on ‘time_t’, but on ‘timespec’ as a whole (i.e. keeping ‘tv_nsec’ defined as a ‘long’ in userspace). The problem, as far as I can see from the kernel source, is that kernel/compat.c assumes that tv_sec (time_t) and tv_nsec (long) are of equal size. My hope was that to resolve this by extending the compat-layer (in those places where COMPAT_USE_64BIT_TIME is tested) with a COMPAT_USE_64BIT_TIME_COMPLIANT path that supports a 64-bit time_t with a ILP32-long tv_nsecs. We should not break C11-compliant programs by changing the size of tv_nsecs. As a consequence, we shouldn’t propagate a questionable choice made for x32 into ILP32. On the other hand, I don’t want to limit time_t to 32bits, as C11 permits any reasonable define for it… --Philipp.-- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/