Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751810Ab2B1FFU (ORCPT ); Tue, 28 Feb 2012 00:05:20 -0500 Received: from mailout-de.gmx.net ([213.165.64.23]:52623 "HELO mailout-de.gmx.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with SMTP id S1750695Ab2B1FFT (ORCPT ); Tue, 28 Feb 2012 00:05:19 -0500 X-Authenticated: #14349625 X-Provags-ID: V01U2FsdGVkX19Hb3MT2zpH8/XkEktLKp4jdaNzRqr9hm0En5AkW4 HjX/QHxieq/nZX Message-ID: <1330405514.17181.29.camel@marge.simpson.net> Subject: Re: sched: Avoid SMT siblings in select_idle_sibling() if possible From: Mike Galbraith To: Suresh Siddha Cc: Srivatsa Vaddagiri , Peter Zijlstra , linux-kernel , Ingo Molnar , Paul Turner Date: Tue, 28 Feb 2012 06:05:14 +0100 In-Reply-To: <1330380683.23436.16.camel@sbsiddha-desk.sc.intel.com> References: <1321495153.5100.7.camel@marge.simson.net> <1321544313.6308.25.camel@marge.simson.net> <1321545376.2495.1.camel@laptop> <1321547917.6308.48.camel@marge.simson.net> <1321551381.15339.21.camel@sbsiddha-desk.sc.intel.com> <1321629267.7080.13.camel@marge.simson.net> <1329748861.2293.345.camel@twins> <1329761661.6276.146.camel@marge.simpson.net> <20120223104959.GA8454@linux.vnet.ibm.com> <1329996064.7411.106.camel@marge.simpson.net> <20120225065403.GB12313@linux.vnet.ibm.com> <1330158655.11655.58.camel@marge.simpson.net> <1330380683.23436.16.camel@sbsiddha-desk.sc.intel.com> Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.2.1 Content-Transfer-Encoding: 7bit Mime-Version: 1.0 X-Y-GMX-Trusted: 0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1761 Lines: 38 On Mon, 2012-02-27 at 14:11 -0800, Suresh Siddha wrote: > On Sat, 2012-02-25 at 09:30 +0100, Mike Galbraith wrote: > > My less rotund config shows the L2 penalty decidedly more prominently. > > We used to have avg_overlap as a synchronous wakeup hint, but it was > > broken by preemption and whatnot, got the axe to recover some cycles. A > > reliable and dirt cheap replacement would be a good thing to have. > > > > TCP_RR and tbench are far way away from the overlap breakeven point on > > E5620, whereas with Q6600s shared L2, you can start converting overlap > > into throughput almost immediately. > > > > 2.4 GHz E5620 > > Throughput 248.994 MB/sec 1 procs SD_SHARE_PKG_RESOURCES > > Throughput 379.488 MB/sec 1 procs !SD_SHARE_PKG_RESOURCES > > > > 2.4 GHz Q6600 > > Throughput 299.049 MB/sec 1 procs SD_SHARE_PKG_RESOURCES > > Throughput 300.018 MB/sec 1 procs !SD_SHARE_PKG_RESOURCES > > > > Also it is not always about just the L2 cache being shared/not or > warm/cold etc. It also depends on the core c-states/p-states etc. It > will cost waking up an idle core and the cost will depend on the what > core-c state it is in. And also if we ping-pong between cores often, > cpufreq governor will come and request for a lower core p-state even > though the load was keeping one core or the other in the socket always > busy at any given point of time. Yeah, pinning yields a couple percent on Q6600 box, more on E5620 despite its spiffier gearbox.. likely turbo-boost doing it's thing. -Mike -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/