Date: Tue, 21 Sep 2021 16:33:52 +0100
From: Mark Rutland
To: Ard Biesheuvel
Cc: Peter Zijlstra, Frederic Weisbecker, Catalin Marinas, Will Deacon, LKML, James Morse, Quentin Perret, Christophe Leroy
Subject: Re: [PATCH 2/4] arm64: implement support for static call trampolines
Message-ID: <20210921153352.GC35846@C02TD0UTHF1T.local>
References: <20210920233237.90463-1-frederic@kernel.org> <20210920233237.90463-3-frederic@kernel.org>
X-Mailing-List: linux-kernel@vger.kernel.org

On Tue, Sep 21, 2021 at 04:44:56PM +0200, Ard Biesheuvel wrote:
> On Tue, 21 Sept 2021 at 09:10, Peter Zijlstra wrote:
> >
> > On Tue, Sep 21, 2021 at 01:32:35AM +0200,
Frederic Weisbecker wrote:
> >
> > > +#define __ARCH_DEFINE_STATIC_CALL_TRAMP(name, target) \
> > > + asm(" .pushsection .static_call.text, \"ax\" \n" \
> > > + " .align 3 \n" \
> > > + " .globl " STATIC_CALL_TRAMP_STR(name) " \n" \
> > > + STATIC_CALL_TRAMP_STR(name) ": \n" \
> > > + " hint 34 /* BTI C */ \n" \
> > > + " adrp x16, 1f \n" \
> > > + " ldr x16, [x16, :lo12:1f] \n" \
> > > + " cbz x16, 0f \n" \
> > > + " br x16 \n" \
> > > + "0: ret \n" \
> > > + " .popsection \n" \
> > > + " .pushsection .rodata, \"a\" \n" \
> > > + " .align 3 \n" \
> > > + "1: .quad " target " \n" \
> > > + " .popsection \n")
> >
> > So I like what Christophe did for PPC32:
> >
> > https://lkml.kernel.org/r/6ec2a7865ed6a5ec54ab46d026785bafe1d837ea.1630484892.git.christophe.leroy@csgroup.eu
> >
> > Where he starts with an unconditional jmp and uses that IFF the offset
> > fits and only does the data load when it doesn't. Ard, wouldn't that
> > also make sense on ARM64? I'm thinking most in-kernel function pointers
> > would actually fit, it's just the module muck that gets to have too
> > large pointers, no?
> >
>
> Yeah, I'd have to page that back in. But it seems like the following
>
>   bti c
>
>   adrp x16,
>   ldr x16, [x16, ...]
>   br x16
>
> with either set to 'b target' for the near targets, 'ret' for
> the NULL target, and 'nop' for the far targets should work, and the
> architecture permits patching branches into NOPs and vice versa
> without special synchronization.

I think so, yes.

We can do slightly better with an inline literal pool and a PC-relative
LDR to fold the ADRP+LDR, e.g.

	.align 3
tramp:
	BTI	C
	{B | RET | NOP}
	LDR	X16, 1f
	BR	X16
1:	.quad

Since that's in the .text, it's RO for regular accesses anyway.

> But I must be missing something here, or why did we have that long
> discussion before?

I think the long discussion was because v2 had some more complex options
(mostly due to trying to use ADRP+ADD) and atomicity/preemption issues
meant we could only transition between some of those one-way, and it was
subtle/complex:

https://lore.kernel.org/linux-arm-kernel/20201028184114.6834-1-ardb@kernel.org/

For v3, that was all gone, but we didn't have a user.

Since the common case *should* be handled by {B | RET | NOP }, I reckon
it's fine to have just that and the literal pool fallback (which I'll
definitely need for the sorts of kernel I run when fuzzing, where the
kernel Image itself can be 100s of MiBs).

Thanks,
Mark.
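
For illustration only, and not part of any posted patch: a minimal C
sketch of how the value for the patchable {B | RET | NOP} slot could be
chosen. The helper name tramp_slot_insn() and its arguments are
hypothetical (pc is assumed to be the address of the slot itself, i.e.
the instruction after BTI C). The NOP, RET and B encodings and the
+/-128 MiB reach of B are the architectural A64 values.

```c
#include <assert.h>
#include <stdint.h>

/* Fixed A64 instruction encodings. */
#define A64_NOP  0xd503201fu
#define A64_RET  0xd65f03c0u   /* RET via x30 */
#define A64_B    0x14000000u   /* B <imm26>; imm26 is a signed word offset */

/* B reaches targets in [-128 MiB, +128 MiB) relative to its own address. */
#define A64_B_RANGE  (128LL * 1024 * 1024)

/*
 * Hypothetical helper: return the instruction to place in the patchable
 * slot located at address 'pc' for a static call aimed at 'target'.
 *
 *   NULL target  -> RET (the call becomes a no-op)
 *   near target  -> B target
 *   far target   -> NOP, falling through to the LDR/BR literal-pool path
 */
static uint32_t tramp_slot_insn(uint64_t pc, uint64_t target)
{
	int64_t off;

	assert((pc & 3) == 0);	/* A64 instructions are 4-byte aligned */

	if (target == 0)
		return A64_RET;

	off = (int64_t)(target - pc);
	if ((off & 3) == 0 && off >= -A64_B_RANGE && off < A64_B_RANGE)
		return A64_B | (uint32_t)(((uint64_t)off >> 2) & 0x03ffffffu);

	return A64_NOP;
}
```

With the slot at 0x1000, a target at 0x2000 gives 0x14000400 (B with a
word offset of 0x400), a NULL target gives RET, and a target 128 MiB or
more away gives NOP so that execution falls through to the literal load.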