Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp2469813imm; Thu, 9 Aug 2018 13:33:50 -0700 (PDT) X-Google-Smtp-Source: AA+uWPzHcN1wKZv4mkuxsbLyBGYPyOWTFk7Js/H01a1HHLqMxFXKxFQA+BSZ7nqQ5qzNTikdOwOF X-Received: by 2002:a17:902:bd93:: with SMTP id q19-v6mr3348993pls.238.1533846830292; Thu, 09 Aug 2018 13:33:50 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1533846830; cv=none; d=google.com; s=arc-20160816; b=wiFS9G/S53taTkTmPHTB7LyLoapLWbtKI72qBOlZvWkl7PgXZ4d4VqQ3bAt7gh4kDH 5c0cscug56TAVhupwQt5Gj4Kp3IiYzXbUHHcjbbWnMneRdM2esFDJsa0VPCiSwJQW1mn d8sN3fCL3TaPtfxfQ1seKsBTVDJLS+kID1jspkgJKmWFZBiOlzm97vd6l45jKnkFeNGX NVKOlg8cYe/XLfdDe8MT0pF6xKkzWRC9ldpvnu3C7hv2BrjB0v/aCRhBWykAVr+p2TDh 3b1VdHzdedfYf4a+7r4MW45nijlzliZQWKVzsJ815ACNKvEUsoT9R/loPGRzSBzlgh1t rmrg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:arc-authentication-results; bh=LizzJBNxDseqjZBqFSVbve4bqfs2q9B33vkLl+OHkp4=; b=nI5oP920xbiFQw9adyYWxRX3y+/2PxaqoOuKlGW92i/s/wfPLNh3PDsZ0JBVxu79+X q2RnVSRawGn7sA50cpqjh0yZqlGpgUDW1zAjzxC0yOEcKbF+s+FR+FpqfNp25cXXSkvu RnAsIBqHFU4amBqe0tmJIlowrEHYY10ryYu6KTvKa/8Smm6YXC7u7Rma36amuVFTlQab RJj/KILf4EFVvPYuc5b6yleHRejvEqKObQ1on7d+G6HFIIu95stqS5ArkinJi/JiUB/s 5lZVNNyOmE2n5R3s44fHaqrFEHUHu8U30Z4EOXi2uzxgCuFQTzQvfHiFMhs1AP1Sm+pX QRew== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=vmware.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id y14-v6si5623728plp.371.2018.08.09.13.33.34; Thu, 09 Aug 2018 13:33:50 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=vmware.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727245AbeHIW7K (ORCPT + 99 others); Thu, 9 Aug 2018 18:59:10 -0400 Received: from ex13-edg-ou-002.vmware.com ([208.91.0.190]:12122 "EHLO EX13-EDG-OU-002.vmware.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726953AbeHIW7K (ORCPT ); Thu, 9 Aug 2018 18:59:10 -0400 X-Greylist: delayed 905 seconds by postgrey-1.27 at vger.kernel.org; Thu, 09 Aug 2018 18:59:08 EDT Received: from sc9-mailhost3.vmware.com (10.113.161.73) by EX13-EDG-OU-002.vmware.com (10.113.208.156) with Microsoft SMTP Server id 15.0.1156.6; Thu, 9 Aug 2018 13:17:21 -0700 Received: from sc2-haas01-esx0118.eng.vmware.com (sc2-haas01-esx0118.eng.vmware.com [10.172.44.118]) by sc9-mailhost3.vmware.com (Postfix) with ESMTP id A400A406E8; Thu, 9 Aug 2018 13:17:37 -0700 (PDT) From: Nadav Amit To: Ingo Molnar CC: , Peter Zijlstra , Thomas Gleixner , , Nadav Amit , "H. Peter Anvin" , Josh Poimboeuf Subject: [PATCH v7 05/10] x86: alternatives: macrofy locks for better inlining Date: Thu, 9 Aug 2018 13:15:48 -0700 Message-ID: <20180809201554.168804-6-namit@vmware.com> X-Mailer: git-send-email 2.17.1 In-Reply-To: <20180809201554.168804-1-namit@vmware.com> References: <20180809201554.168804-1-namit@vmware.com> MIME-Version: 1.0 Content-Type: text/plain Received-SPF: None (EX13-EDG-OU-002.vmware.com: namit@vmware.com does not designate permitted sender hosts) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org GCC considers the number of statements in inlined assembly blocks, according to new-lines and semicolons, as an indication to the cost of the block in time and space. This data is distorted by the kernel code, which puts information in alternative sections. As a result, the compiler may perform incorrect inlining and branch optimizations. The solution is to set an assembly macro and call it from the inlined assembly block. As a result GCC considers the inline assembly block as a single instruction. This patch handles the LOCK prefix, allowing more aggresive inlining. text data bss dec hex filename 18140140 10225284 2957312 31322736 1ddf270 ./vmlinux before 18146889 10225380 2957312 31329581 1de0d2d ./vmlinux after (+6845) Static text symbols: Before: 40286 After: 40218 (-68) Cc: Thomas Gleixner Cc: Ingo Molnar Cc: "H. Peter Anvin" Cc: x86@kernel.org Cc: Josh Poimboeuf Acked-by: Peter Zijlstra (Intel) Signed-off-by: Nadav Amit --- arch/x86/include/asm/alternative-asm.h | 20 ++++++++++++++------ arch/x86/include/asm/alternative.h | 11 ++--------- arch/x86/kernel/macros.S | 1 + 3 files changed, 17 insertions(+), 15 deletions(-) diff --git a/arch/x86/include/asm/alternative-asm.h b/arch/x86/include/asm/alternative-asm.h index 31b627b43a8e..8e4ea39e55d0 100644 --- a/arch/x86/include/asm/alternative-asm.h +++ b/arch/x86/include/asm/alternative-asm.h @@ -7,16 +7,24 @@ #include #ifdef CONFIG_SMP - .macro LOCK_PREFIX -672: lock +.macro LOCK_PREFIX_HERE .pushsection .smp_locks,"a" .balign 4 - .long 672b - . + .long 671f - . # offset .popsection - .endm +671: +.endm + +.macro LOCK_PREFIX insn:vararg + LOCK_PREFIX_HERE + lock \insn +.endm #else - .macro LOCK_PREFIX - .endm +.macro LOCK_PREFIX_HERE +.endm + +.macro LOCK_PREFIX insn:vararg +.endm #endif /* diff --git a/arch/x86/include/asm/alternative.h b/arch/x86/include/asm/alternative.h index 4cd6a3b71824..d7faa16622d8 100644 --- a/arch/x86/include/asm/alternative.h +++ b/arch/x86/include/asm/alternative.h @@ -31,15 +31,8 @@ */ #ifdef CONFIG_SMP -#define LOCK_PREFIX_HERE \ - ".pushsection .smp_locks,\"a\"\n" \ - ".balign 4\n" \ - ".long 671f - .\n" /* offset */ \ - ".popsection\n" \ - "671:" - -#define LOCK_PREFIX LOCK_PREFIX_HERE "\n\tlock; " - +#define LOCK_PREFIX_HERE "LOCK_PREFIX_HERE\n\t" +#define LOCK_PREFIX "LOCK_PREFIX " #else /* ! CONFIG_SMP */ #define LOCK_PREFIX_HERE "" #define LOCK_PREFIX "" diff --git a/arch/x86/kernel/macros.S b/arch/x86/kernel/macros.S index f1fe1d570365..852487a9fc56 100644 --- a/arch/x86/kernel/macros.S +++ b/arch/x86/kernel/macros.S @@ -8,3 +8,4 @@ #include #include +#include -- 2.17.1