Received: by 10.213.65.68 with SMTP id h4csp176618imn; Tue, 3 Apr 2018 18:13:49 -0700 (PDT) X-Google-Smtp-Source: AIpwx4/RrMqR0qrbUcSqz00H+m+JXETXtPAzUBCXeT2jUhz+UIdawd+Olfg1n4Y6UlFyYuDj4SmK X-Received: by 2002:a17:902:128c:: with SMTP id g12-v6mr16115050pla.98.1522804429117; Tue, 03 Apr 2018 18:13:49 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1522804429; cv=none; d=google.com; s=arc-20160816; b=fhEcfo5hNhK3fOlKD2BtfqbmnQV/qkhJ0TdK8tvXnmfK62cZ9hzPw9TI0YgEw3KlIl VGbs3nKVVOekplWkwUXmqprUrtN9CbRS+h9HZSYdFmtJQJm36AeqjPviWhpvUYypJsre SPOD/hsGVOzvbjonwVRf/1G9eXB4PW9fxBA7aE4cqduxmTtp6fq+1DwznRR7ZdWitVIm ibWQ+lA5xD5cTA+j2iGQtN9fp5XLlfjXAfAXu13eX/8uoYBH0Tn7dVFePGoGL4PiS74/ or111ldjZDPSjKekoOi/ts8l1rdrwOpwpcTwSUw5k5GB41SIKIei7PXK0319MvesmMXR X1eQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:date:from:cc:to:subject :arc-authentication-results; bh=REnZmG/kSMSN8HYwn9CiVBhSmG1cxLgVMIROta30DFQ=; b=UpLTr9ddfXnZpdlZs3zINaQdGPFOOQeqLR+gb4jTAsyFNVOrW3ZON3nkvI5qJKV3cQ Z2jM/QhA4Kvv2cjuzm22c4TaBk0Bb0d8Vilab1POU4E7ri6zWkrG2F56LwZVfsQsq2G2 QsFFzq2fjFizbzVnob57pMOltTHpxKx2bmOGCpOzk3zUV3dipuaO4RKx/aPvvnPKaJMi 76xoyKR+g+4q8VWkksAIU3O8eynViCtJ5HqrFqCUqmxDLhxMgs0SAyRQuPR0Tf0wipLm tuRJMZli/HZTLtJfL88hsbzinPvnRPWPZw7z6ij707DJknFBE/gFBKA8XciFF+cT59q5 WBNg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id d92-v6si1832575pld.195.2018.04.03.18.13.34; Tue, 03 Apr 2018 18:13:49 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754245AbeDDBMc (ORCPT + 99 others); Tue, 3 Apr 2018 21:12:32 -0400 Received: from mga09.intel.com ([134.134.136.24]:6293 "EHLO mga09.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753124AbeDDBMb (ORCPT ); Tue, 3 Apr 2018 21:12:31 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga005.jf.intel.com ([10.7.209.41]) by orsmga102.jf.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 03 Apr 2018 18:12:31 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.48,403,1517904000"; d="scan'208";a="213721724" Received: from viggo.jf.intel.com (HELO localhost.localdomain) ([10.54.39.119]) by orsmga005.jf.intel.com with ESMTP; 03 Apr 2018 18:12:31 -0700 Subject: [PATCH 00/11] [v4] Use global pages with PTI To: linux-kernel@vger.kernel.org Cc: linux-mm@kvack.org, Dave Hansen , aarcange@redhat.com, luto@kernel.org, torvalds@linux-foundation.org, keescook@google.com, hughd@google.com, jgross@suse.com, x86@kernel.org, namit@vmware.com From: Dave Hansen Date: Tue, 03 Apr 2018 18:09:46 -0700 Message-Id: <20180404010946.6186729B@viggo.jf.intel.com> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Changes from v3: * Fix whitespace issue noticed by willy * Clarify comments about X86_FEATURE_PGE checks * Clarify commit message around the necessity of _PAGE_GLOBAL filtering when CR4.PGE=0 or PGE is unsupported. Changes from v2: * Add performance numbers to changelogs * Fix compile error resulting from use of x86-specific __default_kernel_pte_mask in arch-generic mm/early_ioremap.c * Delay kernel text cloning until after we are done messing with it (patch 11). * Blacklist K8 explicitly from mapping all kernel text as global (this should never happen because K8 does not use pti when pti=auto, but we on the safe side). (patch 11) -- The later versions of the KAISER patches (pre-PTI) allowed the user/kernel shared areas to be GLOBAL. The thought was that this would reduce the TLB overhead of keeping two copies of these mappings. During the switch over to PTI, we seem to have lost our ability to have GLOBAL mappings. This adds them back. To measure the benefits of this, I took a modern Atom system without PCIDs and ran a microbenchmark[1] (higher is better): No Global Lines (baseline ): 6077741 lseeks/sec 88 Global Lines (kern entry): 7528609 lseeks/sec (+23.9%) 94 Global Lines (all ktext ): 8433111 lseeks/sec (+38.8%) On a modern Skylake desktop with PCIDs, the benefits are tangible, but not huge: No Global pages (baseline): 15783951 lseeks/sec 28 Global pages (this set): 16054688 lseeks/sec +270737 lseeks/sec (+1.71%) I also double-checked with a kernel compile on the Skylake system (lower is better): No Global pages (baseline): 186.951 seconds time elapsed ( +- 0.35% ) 28 Global pages (this set): 185.756 seconds time elapsed ( +- 0.09% ) -1.195 seconds (-0.64%) 1. https://github.com/antonblanchard/will-it-scale/blob/master/tests/lseek1.c Cc: Andrea Arcangeli Cc: Andy Lutomirski Cc: Linus Torvalds Cc: Kees Cook Cc: Hugh Dickins Cc: Juergen Gross Cc: x86@kernel.org Cc: Nadav Amit