Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756232Ab1FNLiQ (ORCPT ); Tue, 14 Jun 2011 07:38:16 -0400 Received: from mga01.intel.com ([192.55.52.88]:1041 "EHLO mga01.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755482Ab1FNLiO (ORCPT ); Tue, 14 Jun 2011 07:38:14 -0400 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="4.65,364,1304319600"; d="scan'208";a="17999128" Subject: Re: rcu: performance regression From: "Alex,Shi" To: "Li, Shaohua" Cc: "paulmck@linux.vnet.ibm.com" , Ingo Molnar , lkml , "Chen, Tim C" In-Reply-To: <1308040388.21027.1847.camel@debian> References: <1308029185.15392.147.camel@sli10-conroe> <1308040388.21027.1847.camel@debian> Content-Type: text/plain; charset="UTF-8" Date: Tue, 14 Jun 2011 19:37:54 +0800 Message-ID: <1308051474.13178.3.camel@debian> Mime-Version: 1.0 X-Mailer: Evolution 2.28.1 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2628 Lines: 55 On Tue, 2011-06-14 at 16:33 +0800, Alex,Shi wrote: > On Tue, 2011-06-14 at 13:26 +0800, Li, Shaohua wrote: > > Commit a26ac2455ffcf3(rcu: move TREE_RCU from softirq to kthread) > > introduced performance regression. In our AIM7 test, this commit caused > > about 40% regression. > > The commit runs rcu callbacks in a kthread instead of softirq. We > > observed high rate of context switch which is caused by this. Out test > > system has 64 CPUs and HZ is 1000, so we saw more than 64k context > > switch per second which is caused by the rcu thread. > > I also did trace and found when rcy thread is woken up, most time the > > thread doesn't handle any callbacks actually, I am interested how you do trace and get 64k CS more? > it just initializes new gp > > or end one gp or similar. > > From my understanding, the purpose to make rcu runs in kthread is to > > speed up rcu callbacks run (with help of rtmutex PI), not for end gp and > > so on, which runs pretty fast actually and doesn't need boost. > > To verify my findings, I had below debug patch applied. It still handles > > rcu callbacks in kthread if there is any pending callbacks, but other > > things are still running in softirq. this completely solved our > > regression. I thought this can still boost callbacks run. but I'm not > > expert in the area, so please help. > > This commit also cause hackbench process mode performance dropping, and > Shaohua's patch do recovered this. But in hackbench testing, the vmstat > show context switch have some reduce. And perf tool show > root_domain->cpupri->prio_to_cpu[]->lock has contention with the commit. > > > 11.53% hackbench [kernel] [k] > | > --- _raw_spin_lock_irqsave > cpupri_set > __enqueue_rt_entity > enqueue_rt_entity > enqueue_task_rt > enqueue_task > activate_task > ttwu_activate > ttwu_do_activate.clone.3 > try_to_wake_up > wake_up_process > invoke_rcu_cpu_kthread > rcu_check_callbacks > update_process_times > tick_sched_timer > __run_hrtimer -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/