Received: by 2002:ac0:a594:0:0:0:0:0 with SMTP id m20-v6csp582444imm; Mon, 21 May 2018 10:42:26 -0700 (PDT) X-Google-Smtp-Source: AB8JxZoBCZcEk/bH6NRuWvzeyga5Ia4rE8tIP4bDBllSn04ANKyJDEVEFZw32UsdC/p1gACBljqm X-Received: by 2002:a17:902:3381:: with SMTP id b1-v6mr21394936plc.248.1526924546227; Mon, 21 May 2018 10:42:26 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1526924546; cv=none; d=google.com; s=arc-20160816; b=nRhDcvKq1Z2ojqHO7BXZyj5++0PRA5d7ONaNKH8YZ3ViEsEkgU1/ghWJfVj/NTFJji xumpDeinWMOTYU0OTS5mEYM6Bq/iiVra69ODfe7Qnp/DsDzVl6Rdi0IGiWZ71hA7cEVa eNbuuQ+RJKBUWwgfdB/Bl29Pdpsyt2+iSY7ePFodyUymUZ9u2cHUh0PnU06L4u57jkFU 5Bn9XU/sD/qKCTblUAZDGq4/5s02FBhrDLCHoBTGDaSWH9htnDClh7rJ9UJ640xFN2/k 0qX6wNs0/4EL3C5/PcS1JuZFW/+ZVKguuVFcEAdaSXyP1INl3zKtB7YJTqLdQHRQwPaJ jjBg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=MC7fPTit1JrdK6QshyYmIKKLFbbkNQDlmwcb5DRuT7s=; b=jjdBUQPwacH0vlIAVwjk4S07Ms9X+BoEbLoTHBvYcAHRRNvVFltdyhrBKO4IAXLO4k xTPvfIKE6EfbsjWzrpUINmptmu4J9rSx12TyfEQf7HYlc+eqxi7XojGGls2eAj7/86jA ErRTr1gq8G0tbq20bej7rfv4GqlYaupQFkapNeh0oSqdIm/1e+ywa1adR3FKbAUw8EMN ZD/lWRKNkFUvUra7MrX2jcEeeQ0ZY3nGHVDOWPNWlUdFVX55Fi7Lvj7MVx1nJ6Gixzjr cdIa4v9r1zHxkqXxMuh68NyEjt6oFEOHMMqYtes1b2BWFTbV+i79aJnrW9vEYrb7fCj+ wY2Q== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id y5-v6si8324326pgq.276.2018.05.21.10.42.11; Mon, 21 May 2018 10:42:26 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753440AbeEURmF (ORCPT + 99 others); Mon, 21 May 2018 13:42:05 -0400 Received: from usa-sjc-mx-foss1.foss.arm.com ([217.140.101.70]:54424 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753324AbeEURlr (ORCPT ); Mon, 21 May 2018 13:41:47 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.72.51.249]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 6115C1435; Mon, 21 May 2018 10:41:47 -0700 (PDT) Received: from e110439-lin (e110439-lin.cambridge.arm.com [10.1.210.68]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 2D5653F25D; Mon, 21 May 2018 10:41:45 -0700 (PDT) Date: Mon, 21 May 2018 18:41:42 +0100 From: Patrick Bellasi To: Joel Fernandes Cc: "Joel Fernandes (Google.)" , linux-kernel@vger.kernel.org, Viresh Kumar , "Rafael J . Wysocki" , Peter Zijlstra , Ingo Molnar , Juri Lelli , Luca Abeni , Todd Kjos , claudio@evidence.eu.com, kernel-team@android.com, linux-pm@vger.kernel.org Subject: Re: [PATCH v2] schedutil: Allow cpufreq requests to be made even when kthread kicked Message-ID: <20180521174142.GU30654@e110439-lin> References: <20180518185501.173552-1-joel@joelfernandes.org> <20180521105055.GQ30654@e110439-lin> <20180521154957.GA14065@joelaf.mtv.corp.google.com> <20180521170050.GT30654@e110439-lin> <20180521172001.GA21678@joelaf.mtv.corp.google.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180521172001.GA21678@joelaf.mtv.corp.google.com> User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 21-May 10:20, Joel Fernandes wrote: > Hi Patrick, > > On Mon, May 21, 2018 at 06:00:50PM +0100, Patrick Bellasi wrote: > > On 21-May 08:49, Joel Fernandes wrote: > > > On Mon, May 21, 2018 at 11:50:55AM +0100, Patrick Bellasi wrote: > > > > On 18-May 11:55, Joel Fernandes (Google.) wrote: > > > > > From: "Joel Fernandes (Google)" > > > > > > > > > > Currently there is a chance of a schedutil cpufreq update request to be > > > > > dropped if there is a pending update request. This pending request can > > > > > be delayed if there is a scheduling delay of the irq_work and the wake > > > > > up of the schedutil governor kthread. > > > > > > > > > > A very bad scenario is when a schedutil request was already just made, > > > > > such as to reduce the CPU frequency, then a newer request to increase > > > > > CPU frequency (even sched deadline urgent frequency increase requests) > > > > > can be dropped, even though the rate limits suggest that its Ok to > > > > > process a request. This is because of the way the work_in_progress flag > > > > > is used. > > > > > > > > > > This patch improves the situation by allowing new requests to happen > > > > > even though the old one is still being processed. Note that in this > > > > > approach, if an irq_work was already issued, we just update next_freq > > > > > and don't bother to queue another request so there's no extra work being > > > > > done to make this happen. > > > > > > > > Maybe I'm missing something but... is not this patch just a partial > > > > mitigation of the issue you descrive above? > > > > > > > > If a DL freq increase is queued, with this patch we store the request > > > > but we don't actually increase the frequency until the next schedutil > > > > update, which can be one tick away... isn't it? > > > > > > > > If that's the case, maybe something like the following can complete > > > > the cure? > > > > > > We already discussed this and thought of this case, I think you missed a > > > previous thread [1]. The outer loop in the kthread_work subsystem will take > > > care of calling sugov_work again incase another request was queued which we > > > happen to miss. > > > > Ok, I missed that thread... my bad. > > Sure no problem, sorry I was just pointing out the thread, not blaming you > for not reading it ;) Sure, np here too ;) > > However, [1] made me noticing that your solution works under the > > assumption that we keep queuing a new kworker job for each request we > > get, isn't it? > > Not at each request, but each request after work_in_progress was cleared by the > sugov_work. Any requests that happen between work_in_progress is set and > cleared only result in updating of the next_freq. I see, so we enqueue for the time of: mutex_lock(&sg_policy->work_lock); __cpufreq_driver_target(sg_policy->policy, freq, CPUFREQ_RELATION_L); mutex_unlock(&sg_policy->work_lock); > > If that's the case, this means that if, for example, during a > > frequency switch you get a request to reduce the frequency (e.g. > > deadline task passing the 0-lag time) and right after a request to > > increase the frequency (e.g. the current FAIR task tick)... you will > > enqueue a freq drop followed by a freq increase and actually do two > > frequency hops? > > Yes possibly, Not sure about the time window above, I can try to get some measurements tomorrow. > I see your point but I'm not sure if the tight loop around that > is worth the complexity, or atleast is within the scope of my patch. > Perhaps the problem you describe can be looked at as a future enhancement? Sure, I already have it as a patch on top of your. I can post it afterwards and we can discuss whether it makes sense or not. Still have to better check, but I think that maybe we can skip the queueing altogether if some work is already pending... in case we wanna go for a dedicated inner loop like the one I was proposing. Apart that, I think that your patch is already fixing 90% of the issue we have now. > thanks, > > - Joel -- #include Patrick Bellasi