Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753496Ab0DTEvO (ORCPT ); Tue, 20 Apr 2010 00:51:14 -0400 Received: from e23smtp06.au.ibm.com ([202.81.31.148]:34982 "EHLO e23smtp06.au.ibm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750842Ab0DTEvN (ORCPT ); Tue, 20 Apr 2010 00:51:13 -0400 Date: Tue, 20 Apr 2010 10:20:38 +0530 From: Vaidyanathan Srinivasan To: Peter Zijlstra Cc: Salman Qazi , mingo@elte.hu, linux-kernel@vger.kernel.org, akpm@linux-foundation.org, linux-pm@lists.linux-foundation.org, arjan@infradead.org, csadler@google.com, ranjitm@google.com, kenchen@google.com, dawnchen@google.com Subject: Re: [PATCH 0/3] [idled]: Idle Cycle Injector for power capping Message-ID: <20100420045038.GB2915@dirshya.in.ibm.com> Reply-To: svaidy@linux.vnet.ibm.com References: <20100413234902.29004.41655.stgit@bumblebee1.mtv.corp.google.com> <1271317886.32749.69.camel@laptop> <1271703701.1676.235.camel@laptop> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline In-Reply-To: <1271703701.1676.235.camel@laptop> User-Agent: Mutt/1.5.20 (2009-06-14) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3519 Lines: 78 * Peter Zijlstra [2010-04-19 21:01:41]: > On Mon, 2010-04-19 at 10:20 -0700, Salman Qazi wrote: > > On Thu, Apr 15, 2010 at 12:51 AM, Peter Zijlstra wrote: > > > On Tue, 2010-04-13 at 17:08 -0700, Salman wrote: > > >> As we discussed earlier this year, Google has an implementation that it > > >> would like to share. I have finally gotten around to porting it to > > >> v2.6.33 and cleaning up the interfaces. It is provided in the following > > >> messages for your review. I realize that when we first discussed this > > >> idea, a lot of ideas were presented for enhancing it. Thanks alot for > > >> your suggestions. I haven't gotten around to implementing any of them. > > > > > > .33 is way too old to submit patches against. > > > > Will bump up the version when I refresh the change. > > > > > > > > That said, I really really dislike this approach, I would much rather > > > see it tie in with power aware scheduling. > > > > I think I can see your point: there is potentially better information > > about the power consumption of the CPU beyond the time it was busy. > > But please clarify: is your complaint the lack of use of this > > information or are you arguing for a deeper integration into the > > scheduler (I.e. implementing it as part of the scheduler rather than > > an independent thread) or both? > > Right, so the IBM folks who were looking at power aware scheduling were > working on an interface to quantify the amount of power to save. Indicating required system capacity to the loadbalance and using that information to evacuate cores or socket was the basic idea. Ref: http://lkml.org/lkml/2009/5/13/173 The challenges with that approach is the predictable evacuation or forced idleness is not guaranteed. > But their approach, was an extension of the regular power aware > load-balancer, which basically groups tasks onto sockets so that whole > sockets can go idle. Integrating with the load balancer will make the design cleaner and avoid forcefully running an idle thread. The scheduler should schedule 'nothing' so that idleness can happen and cpuidle governor can take care of idle states. > However Arjan explained to me that your approach, which idles the whole > machine, has the advantage that also memory banks can go into idle mode > and save power. Well, this is an ideal goal. Injecting some amount of idle time across all cores/threads preferably with overlapping time window will save quite a lot of power on x86. But atleast overlapping idle times among sibling threads are required to get any power savings. This proposed approach does not yet have the ability to do overlapping idle times, though they may randomly occur. > Still in the interest to cut back on power-saving interfaces it would be > nice to see if there is anything we can do to merge these things, but I > really haven't thought much about that yet. Atleast integrating this with ACPI cpu aggregation driver can be a good first step. Both the drivers and code are for the same power capping purpose using idle time injection and running an high priority idle thread for short duration. ACPI Processor Aggregator Driver for 2.6.32-rc1 Ref: http://lkml.org/lkml/2009/10/3/13 --Vaidy -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/