Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1757365Ab0DIIvE (ORCPT ); Fri, 9 Apr 2010 04:51:04 -0400 Received: from cantor.suse.de ([195.135.220.2]:33284 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755923Ab0DIIu6 (ORCPT ); Fri, 9 Apr 2010 04:50:58 -0400 Date: Fri, 9 Apr 2010 18:50:50 +1000 From: Nick Piggin To: Peter Zijlstra Cc: Andrea Arcangeli , Avi Kivity , Thomas Gleixner , Rik van Riel , Ingo Molnar , akpm@linux-foundation.org, Linus Torvalds , linux-kernel@vger.kernel.org, linux-arch@vger.kernel.org, Benjamin Herrenschmidt , David Miller , Hugh Dickins , Mel Gorman Subject: Re: [PATCH 00/13] mm: preemptibility -v2 Message-ID: <20100409085050.GO5683@laptop> References: <20100408191737.296180458@chello.nl> <20100409041421.GM5683@laptop> <1270802131.20295.3270.camel@laptop> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1270802131.20295.3270.camel@laptop> User-Agent: Mutt/1.5.20 (2009-06-14) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3333 Lines: 79 On Fri, Apr 09, 2010 at 10:35:31AM +0200, Peter Zijlstra wrote: > On Fri, 2010-04-09 at 14:14 +1000, Nick Piggin wrote: > > On Thu, Apr 08, 2010 at 09:17:37PM +0200, Peter Zijlstra wrote: > > > Hi, > > > > > > This (still incomplete) patch-set makes part of the mm a lot more preemptible. > > > It converts i_mmap_lock and anon_vma->lock to mutexes. On the way there it > > > also makes mmu_gather preemptible. > > > > > > The main motivation was making mm_take_all_locks() preemptible, since it > > > appears people are nesting hundreds of spinlocks there. > > > > > > The side-effects are that we can finally make mmu_gather preemptible, something > > > which lots of people have wanted to do for a long time. > > > > What's the straight-line performance impact of all this? And how about > > concurrency, I wonder. mutexes of course are double the atomics, and > > you've added a refcount which is two more again for those paths using > > it. > > > > Page faults are very important. We unfortunately have some databases > > doing a significant amount of mmap/munmap activity too. > > You think this would affect the mmap/munmap times in any significant > way? It seems to me those are relatively heavy ops to begin with. They're actually not _too_ heavy because they just setup and tear down vmas. No flushing or faulting required (well, in order to make any _use_ of them you need flushing and faulting of course). I have some microbenchmarks like this and the page fault test I could try. > > I'd like to > > see microbenchmark numbers for each of those (both anon and file backed > > for page faults). > > OK, I'll dig out that fault test used in the whole mmap_sem/rwsem thread > a while back and modify it to also do file backed faults. That'd be good. Anonymous as well of course, for non-databases :) > > kbuild does quite a few pages faults, that would be an easy thing to > > test. Not sure what reasonable kinds of cases exercise parallelism. > > > > > > > What kind of performance tests would people have me run on this to satisfy > > > their need for numbers? I've done a kernel build on x86_64 and if anything that > > > was slightly faster with these patches, but it was well within the noise > > > levels so it might be heat noise I'm looking at ;-) > > > > Is it because you're reducing the number of TLB flushes, or what > > (kbuild isn't multi threaded so on x86 TLB flushes should be really > > fast anyway). > > I'll try and get some perf stat runs to get some insight into this. But > the numbers were: > > time make O=defconfig -j48 bzImage (5x, cache hot) > > without: avg: 39.2018s +- 0.3407 > with: avg: 38.9886s +- 0.1814 Well that's interesting. Nice if it is an improvement not just some anomoly. I'd start by looking at TLB flushes maybe? For testing, it would be nice to make the flush sizes equal so you get more of a comparison of the straight line code. Other than this, I don't have a good suggestion of what to test. I mean, how far can you go? :) Some threaded workloads would probably be a good idea, though. Java? -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/