Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753178AbbGFGXf (ORCPT ); Mon, 6 Jul 2015 02:23:35 -0400 Received: from mga03.intel.com ([134.134.136.65]:49787 "EHLO mga03.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751147AbbGFGX0 (ORCPT ); Mon, 6 Jul 2015 02:23:26 -0400 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.15,413,1432623600"; d="scan'208";a="741232314" Date: Mon, 6 Jul 2015 06:31:44 +0800 From: Yuyang Du To: Peter Zijlstra Cc: Morten Rasmussen , Mike Galbraith , Rabin Vincent , "mingo@redhat.com" , "linux-kernel@vger.kernel.org" , Paul Turner , Ben Segall Subject: Re: [PATCH?] Livelock in pick_next_task_fair() / idle_balance() Message-ID: <20150705223144.GG5197@intel.com> References: <1435728995.9397.7.camel@gmail.com> <20150701145551.GA15690@axis.com> <20150701204404.GH25159@twins.programming.kicks-ass.net> <20150701232511.GA5197@intel.com> <1435824347.5351.18.camel@gmail.com> <20150702010539.GB5197@intel.com> <20150702114032.GA7598@e105550-lin.cambridge.arm.com> <20150702193702.GD5197@intel.com> <20150703093441.GA15477@e105550-lin.cambridge.arm.com> <20150703163831.GQ3644@twins.programming.kicks-ass.net> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20150703163831.GQ3644@twins.programming.kicks-ass.net> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2304 Lines: 48 On Fri, Jul 03, 2015 at 06:38:31PM +0200, Peter Zijlstra wrote: > > I'm not against having a policy that sits somewhere in between, we just > > have to agree it is the right policy and clean up the load-balance code > > such that the implemented policy is clear. > > Right, for balancing its a tricky question, but mixing them without > intent is, as you say, a bit of a mess. > > So clearly blocked load doesn't make sense for (new)idle balancing. OTOH > it does make some sense for the regular periodic balancing, because > there we really do care mostly about the averages, esp. so when we're > overloaded -- but there are issues there too. > > Now we can't track them both (or rather we could, but overhead). > > I like Yuyang's load tracking rewrite, but it changes exactly this part, > and I'm not sure I understand the full ramifications of that yet. Thanks. It would be a pure average policy, which is non-perfect like now, and certainly needs a mixing like now, but it is worth a starter, because it is simple and reasaonble, and based on it, the other parts can be simple and reasonable. > One way out would be to split the load balancer into 3 distinct regions; > > 1) get a task on every CPU, screw everything else. > 2) get each CPU fully utilized, still ignoring 'load' > 3) when everybody is fully utilized, consider load. > > If we make find_busiest_foo() select one of these 3, and make > calculate_imbalance() invariant to the metric passed in, and have things > like cpu_load() and task_load() return different, but coherent, numbers > depending on which region we're in, this almost sounds 'simple'. > > The devil is in the details, and the balancer is a hairy nest of details > which will make the above non-trivial. > > But for 1) we could simply 'balance' on nr_running, for 2) we can > 'balance' on runnable_avg and for 3) we'll 'balance' on load_avg (which > will then include blocked load). > > Let me go play outside for a bit so that it can sink in what kind of > nonsense my heat addled brain has just sprouted :-) -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/