Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753419Ab2JARED (ORCPT ); Mon, 1 Oct 2012 13:04:03 -0400 Received: from mga02.intel.com ([134.134.136.20]:23963 "EHLO mga02.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752463Ab2JARD7 (ORCPT ); Mon, 1 Oct 2012 13:03:59 -0400 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="4.80,517,1344236400"; d="scan'208";a="217641858" Message-ID: <5069CCF9.7040309@linux.intel.com> Date: Mon, 01 Oct 2012 10:03:53 -0700 From: "H. Peter Anvin" User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:13.0) Gecko/20120605 Thunderbird/13.0 MIME-Version: 1.0 To: Andrea Arcangeli CC: "Kirill A. Shutemov" , Andrew Morton , linux-mm@kvack.org, Andi Kleen , linux-kernel@vger.kernel.org, "Kirill A. Shutemov" , Arnd Bergmann , Ingo Molnar , linux-arch@vger.kernel.org Subject: Re: [PATCH 0/3] Virtual huge zero page References: <1348875441-19561-1-git-send-email-kirill.shutemov@linux.intel.com> <20120929134811.GC26989@redhat.com> <5069B804.6040902@linux.intel.com> <20121001163118.GC18051@redhat.com> In-Reply-To: <20121001163118.GC18051@redhat.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2289 Lines: 57 On 10/01/2012 09:31 AM, Andrea Arcangeli wrote: > On Mon, Oct 01, 2012 at 08:34:28AM -0700, H. Peter Anvin wrote: >> On 09/29/2012 06:48 AM, Andrea Arcangeli wrote: >>> >>> There would be a small cache benefit here... but even then some first >>> level caches are virtually indexed IIRC (always physically tagged to >>> avoid the software to notice) and virtually indexed ones won't get any >>> benefit. >>> >> >> Not quite. The virtual indexing is limited to a few bits (e.g. three >> bits on K8); the right way to deal with that is to color the zeropage, >> both the regular one and the virtual one (the virtual one would circle >> through all the colors repeatedly.) >> >> The cache difference, therefore, is *huge*. > > Kirill measured the cache benefit and it provided a 6% gain, not very > huge but certainly significant. > >> It's a performance tradeoff, and it can, and should, be measured. > > I now measured the other side of the trade, by touching only one > character every 4k page in the range to simulate a very seeking load, > and doing so the physical huge zero page wins with a 600% margin, so > if the cache benefit is huge for the virtual zero page, the TLB > benefit is massive for the physical zero page. > > Overall I think picking the solution that risks to regress the least > (also compared to current status of no zero page) is the safest. > Something isn't quite right about that. If you look at your numbers: 1,049,134,961 LLC-loads 6,222 LLC-load-misses This is another way of saying in your benchmark the huge zero page is parked in your LLC - using up 2 MB of your LLC, typically a significant portion of said cache. In a real-life application that will squeeze out real data, but in your benchmark the system is artificially quiescent. It is well known that microbenchmarks can be horribly misleading. What led to Kirill investigating huge zero page in the first place was the fact that some applications/macrobenchmarks benefit, and I think those are the right thing to look at. -hpa -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/