Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755242AbdCPQ6n (ORCPT ); Thu, 16 Mar 2017 12:58:43 -0400 Received: from mail-vk0-f54.google.com ([209.85.213.54]:35558 "EHLO mail-vk0-f54.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754466AbdCPQ6k (ORCPT ); Thu, 16 Mar 2017 12:58:40 -0400 MIME-Version: 1.0 In-Reply-To: <20170316124420.GQ31499@e106622-lin> References: <1488292722-19410-6-git-send-email-patrick.bellasi@arm.com> <20170315114052.GB18557@e110439-lin> <20170315144449.GH31499@e106622-lin> <20170315162414.GI31499@e106622-lin> <20170316111644.GP31499@e106622-lin> <20170316122726.GA10171@e110439-lin> <20170316124420.GQ31499@e106622-lin> From: Joel Fernandes Date: Thu, 16 Mar 2017 09:58:38 -0700 Message-ID: Subject: Re: [RFC v3 5/5] sched/{core,cpufreq_schedutil}: add capacity clamping for RT/DL tasks To: Juri Lelli Cc: Patrick Bellasi , "Joel Fernandes (Google)" , Linux Kernel Mailing List , Linux PM , Ingo Molnar , Peter Zijlstra , "Rafael J . Wysocki" , Andres Oportus Content-Type: text/plain; charset=UTF-8 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2875 Lines: 67 On Thu, Mar 16, 2017 at 5:44 AM, Juri Lelli wrote: > On 16/03/17 12:27, Patrick Bellasi wrote: >> On 16-Mar 11:16, Juri Lelli wrote: >> > On 15/03/17 16:40, Joel Fernandes wrote: >> > > On Wed, Mar 15, 2017 at 9:24 AM, Juri Lelli wrote: >> > > [..] >> > > > >> > > >> > However, trying to quickly summarize how that would work (for who is >> > > >> > already somewhat familiar with reclaiming bits): >> > > >> > >> > > >> > - a task utilization contribution is accounted for (at rq level) as >> > > >> > soon as it wakes up for the first time in a new period >> > > >> > - its contribution is then removed after the 0lag time (or when the >> > > >> > task gets throttled) >> > > >> > - frequency transitions are triggered accordingly >> > > >> > >> > > >> > So, I don't see why triggering a go down request after the 0lag time >> > > >> > expired and quickly reacting to tasks waking up would have create >> > > >> > problems in your case? >> > > >> >> > > >> In my experience, the 'reacting to tasks' bit doesn't work very well. >> > > > >> > > > Humm.. but in this case we won't be 'reacting', we will be >> > > > 'anticipating' tasks' needs, right? >> > > >> > > Are you saying we will start ramping frequency before the next >> > > activation so that we're ready for it? >> > > >> > >> > I'm saying that there is no need to ramp, simply select the frequency >> > that is needed for a task (or a set of them). >> > >> > > If not, it sounds like it will only make the frequency request on the >> > > next activation when the Active bandwidth increases due to the task >> > > waking up. By then task has already started to run, right? >> > > >> > >> > When the task is enqueued back we select the frequency considering its >> > bandwidth request (and the bandwidth/utilization of the others). So, >> > when it actually starts running it will already have enough capacity to >> > finish in time. >> >> Here we are factoring out the time required to actually switch to the >> required OPP. I think Joel was referring to this time. >> Yes, that's what I meant. > > Right. But, this is an HW limitation. It seems a problem that every > scheduler driven decision will have to take into account. So, doesn't > make more sense to let the driver (or the governor shim layer) introduce > some sort of hysteresis to frequency changes if needed? The problem IMO which Hysterisis in the governor will not help is what if you had a DL task that is not waking up for several periods and then wakes up, then for that wake up, we would still be subject to the HW limitation of time taken to switch to needed OPP. Right? >> That time cannot really be eliminated but from having faster OOP >> swiching HW support. Still, jumping strating to the "optimal" OPP >> instead of rumping up is a big improvement. Yes I think so. Thanks, Joel