Received: by 2002:a25:1506:0:0:0:0:0 with SMTP id 6csp4041870ybv; Tue, 25 Feb 2020 12:01:27 -0800 (PST) X-Google-Smtp-Source: APXvYqysEcl3aulQtOPhZMI6vx6xQ+MJqLJQxfjojniiW1ypnySDta9uGo/56z/U5B2vBd1mO9Jt X-Received: by 2002:aca:cc07:: with SMTP id c7mr424336oig.165.1582660887064; Tue, 25 Feb 2020 12:01:27 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1582660887; cv=none; d=google.com; s=arc-20160816; b=LBTVxwkn77Lx648dDC4Dk6z3kgUJytkT2cuFBB4zg7xCFt37kaKdlTa3PFmMRS+O2q HqplFYNkiPuYiiemNCrLaz38qrfDQVk7aKLma5jyt9sq6DKb8iNU/dYeQTOYpIegT0f8 T3URU2z6HL9y1bJ7R0hXIyYbGDIBgBgzO63tIlhJWHCUPmkNg4BHHm8loV/3KtenRO0F LSAE4jJgvV+7dFu93CvpB0Nm8xzcfDDhN5kwQZXLYH4RJJMk7DaF1TCl8vEbPU3U0399 00j+Y2fB00E2rM3UCzvGZxMf/YYoOxNmBdWvB2+BVb0cn0fOiIK080bdDOkpMHLv1KFe PEDA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:user-agent:references :in-reply-to:subject:cc:to:from:date:content-transfer-encoding :mime-version:dkim-signature; bh=4pbXjCTmSWvRoORcqT8xdio5aWElTyodfDE1yRwxE88=; b=GUOzSaiEXECf2Av9PMXV34d8k0zOcRUK1+YQ9gVbw65eJMO3C5B6xRtCWF9SDybYlj rUUKyXwviv+M7CnmA/CUcSxpnksewHj/KFeSButNou04nAtLSQswJjbkDq5hi+S3hN+l e0EkT17hwW3mjG5s1LCm2dhUFdznbKmHJerwIGlcPqsQ1jZFCx8v45nhxOx0Tzlwzhcy UpMjJfbmrhcK45/N6xaV7FKHDmYkurkMSRpQQFLuN4D+gBL6vvIJxewZI94uIxwHZZQJ WdwJIxlXemlzzUBm5ypYvpyfOKyN6ZkPqiS6cWAFThKk4UGk8KL0jrX9to67UWyE7Q4Y KUsQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@agner.ch header.s=dkim header.b=fiBYar2G; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id t4si129814otc.160.2020.02.25.12.01.13; Tue, 25 Feb 2020 12:01:27 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@agner.ch header.s=dkim header.b=fiBYar2G; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730990AbgBYUAe (ORCPT + 99 others); Tue, 25 Feb 2020 15:00:34 -0500 Received: from mail.kmu-office.ch ([178.209.48.109]:49082 "EHLO mail.kmu-office.ch" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728051AbgBYUAe (ORCPT ); Tue, 25 Feb 2020 15:00:34 -0500 Received: from webmail.kmu-office.ch (unknown [IPv6:2a02:418:6a02::a3]) by mail.kmu-office.ch (Postfix) with ESMTPSA id 570BF5C3734; Tue, 25 Feb 2020 21:00:32 +0100 (CET) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=agner.ch; s=dkim; t=1582660832; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=4pbXjCTmSWvRoORcqT8xdio5aWElTyodfDE1yRwxE88=; b=fiBYar2G/FyAj0rGQCdcDOsJCqXNCqszfBq8dZigfTqmWS+tLIEW0neeI804TrlxLJifMa 1HPUdZmB6ic3W68C63TuiBA0p28gC+eEavJlWTzyGQtOdhiHIAyb442lak/sbUJJ3J3Xlc r4VrpgUqOqIDPIowtDZgzmq74SNHJ6g= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Date: Tue, 25 Feb 2020 21:00:32 +0100 From: Stefan Agner To: Robin Murphy , Nick Desaulniers , Ard Biesheuvel Cc: Arnd Bergmann , LKML , Jian Cai , clang-built-linux , Manoj Gupta , Russell King , Linux ARM Subject: Re: [PATCH] ARM: use assembly mnemonics for VFP register access In-Reply-To: References: <8bb16ac4b15a7e28a8e819ef9aae20bfc3f75fbc.1582266841.git.stefan@agner.ch> User-Agent: Roundcube Webmail/1.4.1 Message-ID: <0e4196b528284b07d088dec086f3378a@agner.ch> X-Sender: stefan@agner.ch Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2020-02-25 20:45, Robin Murphy wrote: > On 2020-02-25 7:33 pm, Ard Biesheuvel wrote: >> On Tue, 25 Feb 2020 at 20:10, Nick Desaulniers wrote: >>> >>> On Mon, Feb 24, 2020 at 9:22 PM Stefan Agner wrote: >>>> >>>> Clang's integrated assembler does not allow to to use the mcr >>>> instruction to access floating point co-processor registers: >>>> arch/arm/vfp/vfpmodule.c:342:2: error: invalid operand for instruction >>>> fmxr(FPEXC, fpexc & ~(FPEXC_EX|FPEXC_DEX|FPEXC_FP2V|FPEXC_VV|FPEXC_TRAP_MASK)); >>>> ^ >>>> arch/arm/vfp/vfpinstr.h:79:6: note: expanded from macro 'fmxr' >>>> asm("mcr p10, 7, %0, " vfpreg(_vfp_) ", cr0, 0 @ fmxr " #_vfp_ ", %0" \ >>>> ^ >>>> :1:6: note: instantiated into assembly here >>>> mcr p10, 7, r0, cr8, cr0, 0 @ fmxr FPEXC, r0 >>>> ^ >>>> >>>> The GNU assembler supports the .fpu directive at least since 2.17 (when >>>> documentation has been added). Since Linux requires binutils 2.21 it is >>>> safe to use .fpu directive. Use the .fpu directive and mnemonics for VFP >>>> register access. >>>> >>>> This allows to build vfpmodule.c with Clang and its integrated assembler. >>>> >>>> Link: https://github.com/ClangBuiltLinux/linux/issues/905 >>>> Signed-off-by: Stefan Agner >>>> --- >>>> arch/arm/vfp/vfpinstr.h | 12 ++++-------- >>>> 1 file changed, 4 insertions(+), 8 deletions(-) >>>> >>>> diff --git a/arch/arm/vfp/vfpinstr.h b/arch/arm/vfp/vfpinstr.h >>>> index 38dc154e39ff..799ccf065406 100644 >>>> --- a/arch/arm/vfp/vfpinstr.h >>>> +++ b/arch/arm/vfp/vfpinstr.h >>>> @@ -62,21 +62,17 @@ >>>> #define FPSCR_C (1 << 29) >>>> #define FPSCR_V (1 << 28) >>>> >>>> -/* >>>> - * Since we aren't building with -mfpu=vfp, we need to code >>>> - * these instructions using their MRC/MCR equivalents. >>>> - */ >>>> -#define vfpreg(_vfp_) #_vfp_ >>>> - >>>> #define fmrx(_vfp_) ({ \ >>>> u32 __v; \ >>>> - asm("mrc p10, 7, %0, " vfpreg(_vfp_) ", cr0, 0 @ fmrx %0, " #_vfp_ \ >>>> + asm(".fpu vfpv2\n" \ >>>> + "vmrs %0, " #_vfp_ \ >>>> : "=r" (__v) : : "cc"); \ >>>> __v; \ >>>> }) >>>> >>>> #define fmxr(_vfp_,_var_) \ >>>> - asm("mcr p10, 7, %0, " vfpreg(_vfp_) ", cr0, 0 @ fmxr " #_vfp_ ", %0" \ >>>> + asm(".fpu vfpv2\n" \ >>>> + "vmsr " #_vfp_ ", %0" \ >>>> : : "r" (_var_) : "cc") >>>> >>>> u32 vfp_single_cpdo(u32 inst, u32 fpscr); >>>> -- >>> >>> Hi Stefan, >>> Thanks for the patch. Reading through: >>> - FMRX, FMXR, and FMSTAT: >>> http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.dui0068b/Bcfbdihi.html >>> - VMRS and VMSR: >>> http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.dui0204h/Bcfbdihi.html >>> >>> Should a macro called `fmrx` that had a comment about `fmrx` be using >>> `vmrs` in place of `fmrx`? >>> >>> It looks like Clang treats them the same, but GCC keeps them separate: >>> https://godbolt.org/z/YKmSAs >>> Ah, this is only when streaming to assembly. Looks like they have the >>> same encoding, and produce the same disassembly. (Godbolt emits >>> assembly by default, and has the option to compile, then disassemble). >>> If I take my case from godbolt above: >>> >>> ➜ /tmp arm-linux-gnueabihf-gcc -O2 -c x.c >>> ➜ /tmp llvm-objdump -dr x.o >>> >>> x.o: file format elf32-arm-little >>> >>> >>> Disassembly of section .text: >>> >>> 00000000 bar: >>> 0: f1 ee 10 0a vmrs r0, fpscr >>> 4: 70 47 bx lr >>> 6: 00 bf nop >>> >>> 00000008 baz: >>> 8: f1 ee 10 0a vmrs r0, fpscr >>> c: 70 47 bx lr >>> e: 00 bf nop >>> >>> So indeed a similar encoding exists for the two different assembler >>> instructions. >> >> Does that hold for ARM (A32) instructions as well? > > It should do - they're all the same thing underneath. The UAL syntax > just renamed all the legacy VFP mnemonics from Fxxx to Vxxx form, > apart from a couple of things that were already deprecated. GAS still > accepts both regardless of ".syntax unified", and as a result GCC > never saw a reason to stop emitting the old mnemonics. > Yes this is really only a mnemonic change when unified assembler language (UAL) got introduce, the ARM ARM has a list of mnemonic changes in the appendix. Just do make sure I also did compare the disassembled object file of vfpmodule.c before and after this change. I guess we could (should?) also change the macro name, but I guess that should be a separate commit anyway. -- Stefan