Date: Mon, 17 Feb 2020 13:53:07 +0000
From: Qais Yousef
To: Pavan Kondeti
Cc: Ingo Molnar, Peter Zijlstra, Steven Rostedt, Dietmar Eggemann, Juri Lelli, Vincent Guittot, Ben Segall, Mel Gorman, linux-kernel@vger.kernel.org
Subject: Re: [PATCH 3/3] sched/rt: fix pushing unfit tasks to a better CPU
Message-ID: <20200217135306.cjc2225wdlwqiicu@e107158-lin.cambridge.arm.com>
References: <20200214163949.27850-1-qais.yousef@arm.com> <20200214163949.27850-4-qais.yousef@arm.com> <20200217092329.GC28029@codeaurora.org>
In-Reply-To: <20200217092329.GC28029@codeaurora.org>
X-Mailing-List: linux-kernel@vger.kernel.org

On 02/17/20 14:53, Pavan Kondeti wrote:
> Hi Qais,
>
> On Fri, Feb 14, 2020 at 04:39:49PM +0000, Qais Yousef wrote:
>
> [...]
>
> > diff --git a/kernel/sched/rt.c b/kernel/sched/rt.c
> > index 0c8bac134d3a..5ea235f2cfe8 100644
> > --- a/kernel/sched/rt.c
> > +++ b/kernel/sched/rt.c
> > @@ -1430,7 +1430,7 @@ select_task_rq_rt(struct task_struct *p, int cpu, int sd_flag, int flags)
> >  {
> >  	struct task_struct *curr;
> >  	struct rq *rq;
> > -	bool test;
> > +	bool test, fit;
> >
> >  	/* For anything but wake ups, just return the task_cpu */
> >  	if (sd_flag != SD_BALANCE_WAKE && sd_flag != SD_BALANCE_FORK)
> > @@ -1471,16 +1471,32 @@ select_task_rq_rt(struct task_struct *p, int cpu, int sd_flag, int flags)
> >  	       unlikely(rt_task(curr)) &&
> >  	       (curr->nr_cpus_allowed < 2 || curr->prio <= p->prio);
> >
> > -	if (test || !rt_task_fits_capacity(p, cpu)) {
> > +	fit = rt_task_fits_capacity(p, cpu);
> > +
> > +	if (test || !fit) {
> >  		int target = find_lowest_rq(p);
> >
> > -		/*
> > -		 * Don't bother moving it if the destination CPU is
> > -		 * not running a lower priority task.
> > -		 */
> > -		if (target != -1 &&
> > -		    p->prio < cpu_rq(target)->rt.highest_prio.curr)
> > -			cpu = target;
> > +		if (target != -1) {
> > +			/*
> > +			 * Don't bother moving it if the destination CPU is
> > +			 * not running a lower priority task.
> > +			 */
> > +			if (p->prio < cpu_rq(target)->rt.highest_prio.curr) {
> > +
> > +				cpu = target;
> > +
> > +			} else if (p->prio == cpu_rq(target)->rt.highest_prio.curr) {
> > +
> > +				/*
> > +				 * If the priority is the same and the new CPU
> > +				 * is a better fit, then move, otherwise don't
> > +				 * bother here either.
> > +				 */
> > +				fit = rt_task_fits_capacity(p, target);
> > +				if (fit)
> > +					cpu = target;
> > +			}
> > +		}
>
> I understand that we are opting for the migration when priorities are tied but
> the task can fit on the new task. But there is no guarantee that this task
> stay there. Because any CPU that drops RT prio can pull the task. Then why
> not leave it to the balancer?

This patch does help in the 2 RT task test case. Without it I can see a big
delay before the task migrates from a little CPU to a big one, even though the
big one is free.

Maybe my test is too short (1 second). The delay I've seen is 0.5-0.7s.

https://imgur.com/a/qKJk4w4

Maybe I missed the real root cause. Let me dig more.

>
> I notice a case where tasks would migrate for no reason (happens without this
> patch also). Assuming BIG cores are busy with other RT tasks. Now this RT
> task can go to *any* little CPU. There is no bias towards its previous CPU.
> I don't know if it makes any difference but I see RT task placement is too
> keen on reducing the migrations unless it is absolutely needed.

In find_lowest_rq() there's a check for whether task_cpu(p) is in the
lowest_mask, and that CPU is preferred if it is.

But yeah, I see it happening too:

https://imgur.com/a/FYqLIko

Tasks on CPU 0 and 3 swap. Note that my tasks are periodic but the plots don't
show that.

I shouldn't have changed anything that affects this bias. Do you think it's
something I introduced?

It's maybe worth digging into though. I'll try to have a look.

Thanks

--
Qais Yousef
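
For reference, the placement decision in the quoted hunk can be sketched as a standalone C function. This is only an illustration under stated assumptions, not the kernel code: struct cpu_state and pick_cpu() are hypothetical names, the fits field stands in for rt_task_fits_capacity(p, cpu), and target stands in for the return value of find_lowest_rq(p). It models only the body of the "if (test || !fit)" branch.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in for per-CPU scheduler state; not a kernel type. */
struct cpu_state {
	int highest_prio;	/* numerically lower value = higher RT priority */
	bool fits;		/* stand-in for rt_task_fits_capacity(p, cpu) */
};

/*
 * Return the CPU the task should be placed on, given the current choice
 * (cpu) and the candidate (target, or -1 if there is none).
 */
int pick_cpu(const struct cpu_state *cpus, int ncpus,
	     int task_prio, int cpu, int target)
{
	assert(cpu >= 0 && cpu < ncpus && target < ncpus);

	if (target == -1)
		return cpu;

	/* The target runs a lower priority task: move. */
	if (task_prio < cpus[target].highest_prio)
		return target;

	/* Priorities are tied: move only if the task fits the target. */
	if (task_prio == cpus[target].highest_prio && cpus[target].fits)
		return target;

	/* Otherwise don't bother moving. */
	return cpu;
}
```

With two CPUs at the same priority where only the second one fits the task, pick_cpu() moves the task to the second CPU and never moves it back. That is the tie case the patch adds. Whether the task then stays there is the separate question raised above.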