Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754107AbbFJWR0 (ORCPT ); Wed, 10 Jun 2015 18:17:26 -0400 Received: from mx1.redhat.com ([209.132.183.28]:60409 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752427AbbFJWRR (ORCPT ); Wed, 10 Jun 2015 18:17:17 -0400 Date: Wed, 10 Jun 2015 17:17:16 -0500 From: Josh Poimboeuf To: Andy Lutomirski Cc: Michal Marek , Thomas Gleixner , Ingo Molnar , Borislav Petkov , "linux-kernel@vger.kernel.org" , Andi Kleen , live-patching@vger.kernel.org, X86 ML , "H. Peter Anvin" , Linus Torvalds , Peter Zijlstra Subject: Re: [PATCH v5 02/10] x86: Compile-time asm code validation Message-ID: <20150610221716.GF1125@treble.redhat.com> References: <20150610175320.GD29724@treble.redhat.com> <20150610185845.GA1125@treble.redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <20150610185845.GA1125@treble.redhat.com> User-Agent: Mutt/1.5.23.1-rc1 (2014-03-12) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1995 Lines: 50 On Wed, Jun 10, 2015 at 01:58:45PM -0500, Josh Poimboeuf wrote: > On Wed, Jun 10, 2015 at 11:15:19AM -0700, Andy Lutomirski wrote: > > On Wed, Jun 10, 2015 at 10:53 AM, Josh Poimboeuf wrote: > > > On Wed, Jun 10, 2015 at 10:21:36AM -0700, Andy Lutomirski wrote: > > >> GCC can generate those, and the ia32_ptregs_common label is an example > > >> of such a thing. > > >> > > >> I'd rather have the script understand tail calls and possibly require > > >> that ia32_ptregs_common have a dummy frame pointer save in front > > >> before the label if needed. > > > > > > Why do you prefer tail calls there? See patch 3 for how I handled that > > > for ia32_ptregs_common (I duplicated the code with macros). > > > > > > I think adding support for tail calls in the tooling would be tricky. > > > So I'm just trying to figure out if there's a good reason to keep them. > > > > To save code size by deduplicating common tails. The code currently > > does that, and it would be nice to avoid bloating the code to keep the > > validator happy. > > Well, I wonder whether it's really worth sacrificing code readability > and consistency, and maybe some improved i-cache locality, to save a few > hundred bytes of code size. I should also mention that my proposed ia32_ptregs_common patch, which duplicated the needed code, was more optimized for performance than code size. But if you're more worried about code size, we could turn ia32_ptregs_common into a proper callable function, and then replace jmp ia32_ptregs_common with: call ia32_ptregs_common ret So it becomes a regular call instead of a tail call. It only adds a few instructions and the function is self-contained. Would that be good enough? -- Josh -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/