Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S263133AbUKTD6E (ORCPT ); Fri, 19 Nov 2004 22:58:04 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S263072AbUKTCnB (ORCPT ); Fri, 19 Nov 2004 21:43:01 -0500 Received: from holomorphy.com ([207.189.100.168]:50816 "EHLO holomorphy.com") by vger.kernel.org with ESMTP id S263071AbUKTCe4 (ORCPT ); Fri, 19 Nov 2004 21:34:56 -0500 Date: Fri, 19 Nov 2004 18:34:43 -0800 From: William Lee Irwin III To: Nick Piggin Cc: Christoph Lameter , torvalds@osdl.org, akpm@osdl.org, Benjamin Herrenschmidt , Hugh Dickins , linux-mm@kvack.org, linux-ia64@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: page fault scalability patch V11 [0/7]: overview Message-ID: <20041120023443.GD2714@holomorphy.com> References: <419D581F.2080302@yahoo.com.au> <419D5E09.20805@yahoo.com.au> <1100848068.25520.49.camel@gaston> <20041120020401.GC2714@holomorphy.com> <419EA96E.9030206@yahoo.com.au> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <419EA96E.9030206@yahoo.com.au> Organization: The Domain of Holomorphy User-Agent: Mutt/1.5.6+20040722i Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1257 Lines: 28 William Lee Irwin III wrote: >> Split counters easily resolve the issues with both these approaches >> (and apparently your co-workers are suggesting it too, and have >> performance results backing it). On Sat, Nov 20, 2004 at 01:18:22PM +1100, Nick Piggin wrote: > Split counters still require atomic operations though. This is what > Christoph's latest effort is directed at removing. And they'll still > bounce cachelines around. (I assume we've reached the conclusion > that per-cpu split counters per-mm won't fly?). Split != per-cpu, though it may be. Counterexamples are as simple as atomic_inc(&mm->rss[smp_processor_id()>>RSS_IDX_SHIFT]); Furthermore, see Robin Holt's results regarding the performance of the atomic operations and their relation to cacheline sharing. And frankly, the argument that the space overhead of per-cpu counters is problematic is not compelling. Even at 1024 cpus it's smaller than an ia64 pagetable page, of which there are numerous instances attached to each mm. -- wli - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/