Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755201Ab1FOURz (ORCPT ); Wed, 15 Jun 2011 16:17:55 -0400 Received: from mx3.mail.elte.hu ([157.181.1.138]:58779 "EHLO mx3.mail.elte.hu" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755169Ab1FOURx (ORCPT ); Wed, 15 Jun 2011 16:17:53 -0400 Date: Wed, 15 Jun 2011 22:17:13 +0200 From: Ingo Molnar To: Tim Chen Cc: Linus Torvalds , Peter Zijlstra , Paul McKenney , Andrew Morton , Hugh Dickins , KOSAKI Motohiro , Benjamin Herrenschmidt , David Miller , Martin Schwidefsky , Russell King , Paul Mundt , Jeff Dike , Richard Weinberger , Tony Luck , KAMEZAWA Hiroyuki , Mel Gorman , Nick Piggin , Namhyung Kim , ak@linux.intel.com, shaohua.li@intel.com, alex.shi@intel.com, linux-kernel@vger.kernel.org, linux-mm@kvack.org, "Rafael J. Wysocki" Subject: Re: REGRESSION: Performance regressions from switching anon_vma->lock to mutex Message-ID: <20110615201713.GC4762@elte.hu> References: <1308097798.17300.142.camel@schen9-DESK> <1308134200.15315.32.camel@twins> <1308135495.15315.38.camel@twins> <1308168784.17300.152.camel@schen9-DESK> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1308168784.17300.152.camel@schen9-DESK> User-Agent: Mutt/1.5.20 (2009-08-17) X-ELTE-SpamScore: -2.0 X-ELTE-SpamLevel: X-ELTE-SpamCheck: no X-ELTE-SpamVersion: ELTE 2.0 X-ELTE-SpamCheck-Details: score=-2.0 required=5.9 tests=BAYES_00 autolearn=no SpamAssassin version=3.3.1 -2.0 BAYES_00 BODY: Bayes spam probability is 0 to 1% [score: 0.0000] Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1253 Lines: 32 * Tim Chen wrote: > On Wed, 2011-06-15 at 12:11 -0700, Linus Torvalds wrote: > > > > > And it results in real problems. For example, if you use "perf record" > > to see what the hell is up, the use of kernel threads for RCU > > callbacks means that the RCU cost is never even seen. I don't know how > > Tim did his profiling to figure out the costs, and I don't know how he > > decided that the spinlock to semaphore conversion was the culprit, but > > it is entirely possible that Tim didn't actually bisect the problem, > > but instead used "perf record" on the exim task, saw that the > > semaphore costs had gone up, and decided that it must be the > > conversion. > > > > Yes, I was using perf to do the profiling. I thought that the mutex > conversion was the most likely culprit based on the change in > profile. have you used callgraph profiling (perf record -g) or flat profiling? Flat profiling can be misleading when there's proxy work done. Thanks, Ingo -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/