Received: by 10.213.65.68 with SMTP id h4csp1121452imn; Sat, 24 Mar 2018 04:06:50 -0700 (PDT) X-Google-Smtp-Source: AG47ELtzxIq/PXOi61pffUcbmFaDIY/t8V5PtuZn3DSboQK0eElt7q1t2lPUKZfagshejQz0FLX2 X-Received: by 2002:a17:902:5852:: with SMTP id f18-v6mr32192606plj.289.1521889610886; Sat, 24 Mar 2018 04:06:50 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1521889610; cv=none; d=google.com; s=arc-20160816; b=A//taKE3DrtVszW6GRAlPh9EmJr1R9PwFlo5aZGOFcKHWw4MbURVJv/4YNd1+6lC6u 6G4M9A4o3Ixka1oIRMgHbYW5S4jeMM3jrachSEkZWQ7ljI3D6djnPQeyCN/Xc1BNwoiS qULUVhHTK8eOgg1UuwQPox0/D7VmYjJXhiNB6s5WwSaMoRdGPR5a0L6UB8MgonFOiPjp AwLWQMKNHnEsXfvU8XTAMy2R1vlJ21QyH/gcdjafVejDx71O9hqlpt7EduHed9Ejjax/ PHvx/H6rGi5nicMTYdwDpohIM2Y8oJOlFOqkV3+aFpoHMgbanCESApg+2BSnQKTA6miC 3uqw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature:arc-authentication-results; bh=015l/NmSOglnBv7MhvzRnIXM/qNTDXzkpvqA2ZTDK0A=; b=M+ImkRRgGLJi54lB+ieK8fZCuF9eTd5/Veedc7OY7myUTRptOGrNXSXSn+aAnt9P3c EK/0+AqAgqXsQTC8VvBkFFJO6CnGKe+tXu+eRL7xvEUsDjfJ2mzsrQU1/bFQlBNsldm1 VTcMhPxKPgyU3t7oEovjMKwTtYczBzCq6Y4EqOc8ClJCOpGe8DDhg9K98Rr3xWbx7gVr YoWyucDIOKJjW0N2e5pZ/9k6XZ8voqVSfq9GhdbMmIx7DHTIPVlxOpnAd9hSpo7fZeYQ jhFmpvJH8ykewFzs/jPDzi0nq8BmscjUzAUVQYtrEF8eUPFEBA0ORB1rUgEeipxys6OF Ij8g== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail header.i=@gmail.com header.s=20161025 header.b=isqQKiDk; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id u24si8110710pfh.326.2018.03.24.04.06.36; Sat, 24 Mar 2018 04:06:50 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=fail header.i=@gmail.com header.s=20161025 header.b=isqQKiDk; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751935AbeCXLFk (ORCPT + 99 others); Sat, 24 Mar 2018 07:05:40 -0400 Received: from mail-wr0-f196.google.com ([209.85.128.196]:37127 "EHLO mail-wr0-f196.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751861AbeCXLFi (ORCPT ); Sat, 24 Mar 2018 07:05:38 -0400 Received: by mail-wr0-f196.google.com with SMTP id l49so5312801wrl.4 for ; Sat, 24 Mar 2018 04:05:38 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=sender:date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=015l/NmSOglnBv7MhvzRnIXM/qNTDXzkpvqA2ZTDK0A=; b=isqQKiDk3kh3JnGltRJbfeRVSklmv3gMIwLO0Wia8aTjepE8MZdZtvwalfjcJxJm91 /5Tkj3WLFaCjr2XxkFXQwvur/ukBYcqSye+zIOqHNHuFmHhiTxnLV7TJPg1DzpQn+hxt Zj6qoeuMPcqC78xuNbhjrmorbD0titXIfG9tywDxGYWXQjRu/QvTHc8jU//6KwfWvDZg SziXHf0B5YNyMEA/EQ9xJ8DIXQzfbjBmyx8ipRyy/wSPPicFcGzeEJWuUgK48jHuvCCH nWY6gMxha8Ckc1DnANXhrOR3qHLdqm1oaMnuIHlMefrBDRNHUm1vNm+a9luS8E1/XBNm V+NQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:sender:date:from:to:cc:subject:message-id :references:mime-version:content-disposition:in-reply-to:user-agent; bh=015l/NmSOglnBv7MhvzRnIXM/qNTDXzkpvqA2ZTDK0A=; b=CTaHvv6tPV43rm95H9eBlxLTzIwjOe74QEpSI9BNPG9zwS9WPsNTI+AnLvLnb/cpaW FgApqxqB3H5sEO0tR1H8vi8+7CcgA74FdV8ZTeLDF/PsQ9iZG+1Ts7w75ckBnAPJgJB3 6/tdYWWbHXMMoRONHFcWUMpD7b+3c2IQ4pzmnBtGbOjnCtxGMKEAZ2PPSPfLgBGvG7qk RZ4f98p1N7SjDABPSW9Qi/sc/Onk1xNGBGVbRxehKB+4MgOTRiV0QZ7TAtDLzHAmh0FB akZd+707e1BuLKnrbIsLo0wkUz4lPxoLg68Av1+rw0DzIbtJpPlbCIdsulhNBZJk97Lr ayyA== X-Gm-Message-State: AElRT7FJ+IvLQweqJ1Pc0UL3fPCyYiOZ+prvgfilOdNQ5dciscc7nD/A q3SC0l80HhKY/eVr3X1sqlY= X-Received: by 10.223.142.23 with SMTP id n23mr26938731wrb.28.1521889537329; Sat, 24 Mar 2018 04:05:37 -0700 (PDT) Received: from gmail.com (2E8B0CD5.catv.pool.telekom.hu. [46.139.12.213]) by smtp.gmail.com with ESMTPSA id s126sm13227189wmd.38.2018.03.24.04.05.35 (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Sat, 24 Mar 2018 04:05:36 -0700 (PDT) Date: Sat, 24 Mar 2018 12:05:34 +0100 From: Ingo Molnar To: Dave Hansen Cc: Linus Torvalds , Linux Kernel Mailing List , linux-mm , Andrea Arcangeli , Andrew Lutomirski , Kees Cook , Hugh Dickins , =?iso-8859-1?Q?J=FCrgen_Gro=DF?= , the arch/x86 maintainers , namit@vmware.com Subject: Re: [PATCH 00/11] Use global pages with PTI Message-ID: <20180324110534.t52m5gvn4r7kvmnj@gmail.com> References: <20180323174447.55F35636@viggo.jf.intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: NeoMutt/20170609 (1.8.3) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org * Dave Hansen wrote: > This is time doing a modestly-sized kernel compile on a 4-core Skylake > desktop. > > User Time Kernel Time Clock Elapsed > Baseline ( 0 GLB PTEs) 803.79 67.77 237.30 > w/series (28 GLB PTEs) 807.70 (+0.7%) 68.07 (+0.7%) 238.07 (+0.3%) > > Without PCIDs, it behaves the way I would expect. > > I'll ask around, but I'm open to any ideas about what the heck might be > causing this. Hm, so it's a bit weird that while user time and kernel time both increased by about 0.7%, elapsed time only increased by 0.3%? Typically kernel builds are much more parallel for that to be typical, so maybe there's some noise in the measurement? Before spending too much time on the global-TLB patch angle I'd suggest investing a bit of time into making sure that the regression you are seeing is actually real: You haven't described how you have measured kernel build times and "+0.7% regression" might turn out to be the real number, but sub-1% accuracy kernel build times are *awfully* susceptible to: - various sources of noise - systematic statistical errors which doesn't show up as measurement-to-measurement noise but which skews the results: such as the boot-to-boot memory layout of the source code and object files. - cpufreq artifacts Even repeated builds with 'make clean' inbetween can be misleading because the exact layout of key include files and binaries which get accessed the most often during a build are set into stone once they've been read into the page cache for the first time after bootup. Automated reboots between measurements can be misleading as well, if the file layout after bootup is too deterministic. So here's a pretty reliable way to measure kernel build time, which tries to avoid the various pitfalls of caching. First I make sure that cpufreq is set to 'performance': for ((cpu=0; cpu<120; cpu++)); do G=/sys/devices/system/cpu/cpu$cpu/cpufreq/scaling_governor [ -f $G ] && echo performance > $G done [ ... because it can be *really* annoying to discover that an ostensible performance regression was a cpufreq artifact ... again. ;-) ] Then I copy a kernel tree to /tmp (ramfs) as root: cd /tmp rm -rf linux git clone ~/linux linux cd linux make defconfig >/dev/null ... and then we can build the kernel in such a loop (as root again): perf stat --repeat 10 --null --pre '\ cp -a kernel ../kernel.copy.$(date +%s); \ rm -rf *; \ git checkout .; \ echo 1 > /proc/sys/vm/drop_caches; \ find ../kernel* -type f | xargs cat >/dev/null; \ make -j kernel >/dev/null; \ make clean >/dev/null 2>&1; \ sync '\ \ make -j16 >/dev/null ( I have tested these by pasting them into a terminal. Adjust the ~/linux source git tree and the '-j16' to your system. ) Notes: - the 'pre' script portion is not timed by 'perf stat', only the raw build times - we flush all caches via drop_caches and re-establish everything again, but: - we also introduce an intentional memory leak by slowly filling up ramfs with copies of 'kernel/', thus continously changing the layout of free memory, cached data such as compiler binaries and the source code hierarchy. (Note that the leak is about 8MB per iteration, so it isn't massive.) With 10 iterations this is the statistical stability I get this on a big box: Performance counter stats for 'make -j128 kernel' (10 runs): 26.346436425 seconds time elapsed (+- 0.19%) ... which, despite a high iteration count of 10, is still surprisingly noisy, right? A 0.2% stddev is probably not enough to call a 0.7% regression with good confidence, so I had to use *30* iterations to make measurement noise to be about an order of magnitude lower than the effect I'm trying to measure: Performance counter stats for 'make -j128' (30 runs): 26.334767571 seconds time elapsed (+- 0.09% ) i.e. "26.334 +- 0.023" seconds is a number we can have pretty high confidence in, on this system. And just to demonstrate that it's all real, I repeated the whole 30-iteration measurement again: Performance counter stats for 'make -j128' (30 runs): 26.311166142 seconds time elapsed (+- 0.07%) Even if in the end you get a similar result, close to the +0.7% overhead you already measured, we should have more confidence in blaming global TLBs for the performance regression. BYMMV. Thanks, Ingo [*] Note that even this doesn't eliminate certain sources of measurement error: such as the boot-to-boot variance in the layout of certain key kernel data structures - but kernel builds are mostly user-space dominated, so drop_caches should be good enough.