Received: by 2002:a25:7ec1:0:0:0:0:0 with SMTP id z184csp3220484ybc; Mon, 25 Nov 2019 10:52:25 -0800 (PST) X-Google-Smtp-Source: APXvYqwNm4NEAEOmYkS0U8Ucyk+VmOe6wU0AmWiAxZp5iXmq4EFGA/rnFqOQCGNN6WMV2sPPgr3r X-Received: by 2002:a50:a7a1:: with SMTP id i30mr20358742edc.94.1574707944890; Mon, 25 Nov 2019 10:52:24 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1574707944; cv=none; d=google.com; s=arc-20160816; b=FfZYsLqteLqeMKhNCAv7UyWjA82W0hHAQktkTXY7BzzsOBw3iai2q7/gLu8MO7lh1G zVkueUCjzYAP9lXgBSCj637aiDbbgOcISvjDCSso4m02vh2OzfpVTCyHayBQwt8g2vvp 9WtREdsZYpl07E6WbeF3/pTc2ynaPqxcXB7xBJH5fp6YbgNaQjjg4pgiId9GMnxuk43M 1cWkxgFC3AebUmVJUakIKzdWbDHpJBYfB4aYkwBPTHzbKJlaCJbzv5dxkRTPqOyiY8Nl PnwhQw5Kc7RT1paW+YGAtGR0vU6nT+BLGLc7om+kQzVU47GiY71ZdpryXGqFhPgMstqZ ES7A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=6PyL3D60Gl54SMgMzjlPHgzkSmn0MVUojW9GGhwFRLs=; b=AZjkG9LvLsNZyKSCm/pGDALZtepUtXhNKAuNK5Sg/CzaODKlxy9zxx6VOgEAy4KYyd 0DzdozIOP7QYDnJB0UEBSpg0i3Gqb+KQi/webAJnC9MKSJo+u45/e3nwwiMRGylt55Ej rF0hAKqgGwuYoyHwKwIYtD/GmtispIcBrkpDdp0MWu3uaykZAZ5F/jjJkFILbySvTSUZ dwpMpRMVmUOr9/Vhcs27w+m2mkC8YUr9ZI+HHYgUNd+gry2AdygfDbMcy35I0Ua6WLtB EOm2p/4XNlkh1w72U9BePcUoBhk112j7DyW4ctQ1Wapx6Dy8R2WWnV/J+oUH40w/ZBBr ANUA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id v13si5285599ejh.118.2019.11.25.10.52.01; Mon, 25 Nov 2019 10:52:24 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729147AbfKYRdZ (ORCPT + 99 others); Mon, 25 Nov 2019 12:33:25 -0500 Received: from foss.arm.com ([217.140.110.172]:53248 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727194AbfKYRdY (ORCPT ); Mon, 25 Nov 2019 12:33:24 -0500 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id BADE831B; Mon, 25 Nov 2019 09:33:23 -0800 (PST) Received: from [10.1.194.37] (e113632-lin.cambridge.arm.com [10.1.194.37]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 9F7E93F68E; Mon, 25 Nov 2019 09:33:22 -0800 (PST) Subject: Re: [PATCH 3/3] sched/fair: Consider uclamp for "task fits capacity" checks To: Qais Yousef Cc: linux-kernel@vger.kernel.org, peterz@infradead.org, mingo@kernel.org, vincent.guittot@linaro.org, dietmar.eggemann@arm.com, patrick.bellasi@matbug.net, qperret@google.com, morten.rasmussen@arm.com References: <20191120175533.4672-1-valentin.schneider@arm.com> <20191120175533.4672-4-valentin.schneider@arm.com> <20191124222051.kbb62phfsln5ixg4@e107158-lin.cambridge.arm.com> From: Valentin Schneider Message-ID: Date: Mon, 25 Nov 2019 17:33:21 +0000 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.9.0 MIME-Version: 1.0 In-Reply-To: <20191124222051.kbb62phfsln5ixg4@e107158-lin.cambridge.arm.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 24/11/2019 22:20, Qais Yousef wrote: > On 11/20/19 17:55, Valentin Schneider wrote: >> +static inline >> +unsigned long uclamp_task_util(struct task_struct *p, unsigned long util) >> +{ >> + return clamp(util, >> + (unsigned long)uclamp_eff_value(p, UCLAMP_MIN), >> + (unsigned long)uclamp_eff_value(p, UCLAMP_MAX)); >> +} > > uclamp_eff_value() will check if a task belongs to a cgroup, and if it does > apply its uclamp. The funny thing about the cgroup settings is that they can > have a uclamp_max < uclamp_min. uclamp_util_with() does check for this > condition but this function doesn't. > So according to comments, uclamp_util_with() has to check for inversion because it is comparing task clamps with rq-wide aggregated clamps. However I have to admit I still am not clear as to how a clamp inversion can occur. Without cgroups, it all goes through __sched_setscheduler() -> uclamp_validate() which enforces uclamp.min <= uclamp.max. So the rq-wide aggregation could not lead to an inversion. With cgroups, I do recall something about allowing the cgroup *knobs* to be inverted, but AFAICT that gets sanitized when it trickles down to the scheduler via cpu_util_update_eff(): eff[UCLAMP_MIN] = min(eff[UCLAMP_MIN], eff[UCLAMP_MAX]); So I don't see how inversion could happen within uclamp_task_util(). Patrick, any chance you could light up my torch? FWIW I gave this a spin but couldn't create an inversion inside the scheduler (debug prints appended at the end): $ cgcreate -g cpu:cgtest $ ./loop.sh & $ PID=$! $ echo $PID > /sys/fs/cgroup/cpu/cgtest/cgroup.procs $ echo 20 > /sys/fs/cgroup/cpu/cgtest/cpu.uclamp.max [ 24.442204] tg=00000000b546653d clamp_id=1 value=205 [ 24.447169] css=00000000b546653d uclamp.min=0 uclamp.max=205 [ 24.452932] p=loop.sh tg.uclamp.min=0 tg.uclamp.max=205 p.uclamp.min=0 p.uclamp.max=1024 [ 24.461021] p=loop.sh uclamp.min=0 uclamp.max=205 $ echo 80 > /sys/fs/cgroup/cpu/cgtest/cpu.uclamp.min [ 25.472174] tg=00000000b546653d clamp_id=0 value=819 [ 25.477135] css=00000000b546653d uclamp.min=205 uclamp.max=205 [ 25.483066] p=loop.sh tg.uclamp.min=205 tg.uclamp.max=205 p.uclamp.min=0 p.uclamp.max=1024 [ 25.491311] p=loop.sh uclamp.min=205 uclamp.max=205 --- diff --git a/kernel/sched/core.c b/kernel/sched/core.c index 2da6a005bb3f..7b31fdc7cc90 100644 --- a/kernel/sched/core.c +++ b/kernel/sched/core.c @@ -1088,6 +1088,17 @@ uclamp_update_active_tasks(struct cgroup_subsys_state *css, if ((0x1 << clamp_id) & clamps) uclamp_update_active(p, clamp_id); } + pr_info("p=%s tg.uclamp.min=%u tg.uclamp.max=%u " + "p.uclamp.min=%u p.uclamp.max=%u\n", + p->comm, task_group(p)->uclamp[UCLAMP_MIN].value, + task_group(p)->uclamp[UCLAMP_MAX].value, + p->uclamp_req[UCLAMP_MIN].value, + p->uclamp_req[UCLAMP_MAX].value); + + + pr_info("p=%s uclamp.min=%u uclamp.max=%u\n", + p->comm, uclamp_tg_restrict(p, UCLAMP_MIN).value, + uclamp_tg_restrict(p, UCLAMP_MAX).value); } css_task_iter_end(&it); } @@ -7195,6 +7206,8 @@ static void cpu_util_update_eff(struct cgroup_subsys_state *css) /* Ensure protection is always capped by limit */ eff[UCLAMP_MIN] = min(eff[UCLAMP_MIN], eff[UCLAMP_MAX]); + pr_info("css=%p uclamp.min=%i uclamp.max=%i\n", css, eff[UCLAMP_MIN], eff[UCLAMP_MAX]); + /* Propagate most restrictive effective clamps */ clamps = 0x0; uc_se = css_tg(css)->uclamp; @@ -7273,8 +7286,10 @@ static ssize_t cpu_uclamp_write(struct kernfs_open_file *of, char *buf, rcu_read_lock(); tg = css_tg(of_css(of)); - if (tg->uclamp_req[clamp_id].value != req.util) + if (tg->uclamp_req[clamp_id].value != req.util) { + pr_info("tg=%p clamp_id=%i value=%llu\n", tg, clamp_id, req.util); uclamp_se_set(&tg->uclamp_req[clamp_id], req.util, false); + } /* * Because of not recoverable conversion rounding we keep track of the