Received: by 2002:a25:1506:0:0:0:0:0 with SMTP id 6csp5245613ybv; Mon, 17 Feb 2020 15:46:17 -0800 (PST) X-Google-Smtp-Source: APXvYqzitVgevnHkaG+vfF50MvW+BJBJCHACneSzefbdEjMqMJuFuqDNKRgtcsblFeXIP459qusv X-Received: by 2002:aca:1c0d:: with SMTP id c13mr924908oic.44.1581983177826; Mon, 17 Feb 2020 15:46:17 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1581983177; cv=none; d=google.com; s=arc-20160816; b=WXV7MYw2OwOvht3d94/HzGZj0iQ9Vn3vXkc8ezgy3YcFj4BxdujVJ/afUtPpbZNa1D 1LUdlKW5AsFWx1s2Bqk14tyfdZMicdT3d9MQM4KY7YuTVa495x1Ey9rR7AAuucrFkS2S hbpj6VXuAeExqJiJpVNHRPzUxKK0XdS45R3dSwlwoP66JdPgCx8QGS2ouMpduUt3tRnv M+FfvU0Ih8mRcHf+XlYH/08NxSfwyWDnZfuxivfYUpPsJbpBNHoroePGKfcg4pkSWhPo 27J+IXuzeyIs1pHbZTZAMZE0jueOoD3p7ToD72dNxj5RRAlh48JVDihyBdAEktb9e+r4 ZQmA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=tSfkaNFaWKX3q5iIVrxiCV38lkB92/O0XRkXzJFaF8A=; b=Vpz1Nd/cBql9KuQSSl9n+mlDjABcWjDqN9ctKjgeUybX22i4HTBqnTdCR77dQdVVjs +brk42DojTWKaRViKID6yB1nyoIXgnat/7Ka8SfygDTRBs3VaouCUXNKnGMbroOZnTtc 8b2ZsweRLb27LqsZjBzRIJvN9TnXmWQ1jpXvR6g6M0mim+P4yfEqFDrXcnBZVOa6Lrvg mtMooF3xDS/mYZgSQnNF6yzAS1a05RvAeIz5BNhBjU6Hc4a4WgeLKqsWJ3N5oQaZRnxR /Lek+b+Oqs1vFLnQ7rkeHsv4yjEJvPJX2GlK2wOX5aUxMmrPHa13F10CQiBee64jTFH+ AiBA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id o18si929681otk.80.2020.02.17.15.46.03; Mon, 17 Feb 2020 15:46:17 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726069AbgBQXpy (ORCPT + 99 others); Mon, 17 Feb 2020 18:45:54 -0500 Received: from foss.arm.com ([217.140.110.172]:42850 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725987AbgBQXpy (ORCPT ); Mon, 17 Feb 2020 18:45:54 -0500 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id A7A3F30E; Mon, 17 Feb 2020 15:45:53 -0800 (PST) Received: from e107158-lin (e107158-lin.cambridge.arm.com [10.1.195.21]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 551EA3F703; Mon, 17 Feb 2020 15:45:52 -0800 (PST) Date: Mon, 17 Feb 2020 23:45:49 +0000 From: Qais Yousef To: Dietmar Eggemann Cc: Ingo Molnar , Peter Zijlstra , Steven Rostedt , Pavan Kondeti , Juri Lelli , Vincent Guittot , Ben Segall , Mel Gorman , linux-kernel@vger.kernel.org Subject: Re: [PATCH 1/3] sched/rt: cpupri_find: implement fallback mechanism for !fit case Message-ID: <20200217234549.rpv3ns7bd7l6twqu@e107158-lin> References: <20200214163949.27850-1-qais.yousef@arm.com> <20200214163949.27850-2-qais.yousef@arm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: User-Agent: NeoMutt/20171215 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 02/17/20 20:09, Dietmar Eggemann wrote: > On 14/02/2020 17:39, Qais Yousef wrote: > > [...] > > > /** > > * cpupri_find - find the best (lowest-pri) CPU in the system > > * @cp: The cpupri context > > @@ -62,80 +115,72 @@ int cpupri_find(struct cpupri *cp, struct task_struct *p, > > struct cpumask *lowest_mask, > > bool (*fitness_fn)(struct task_struct *p, int cpu)) > > { > > - int idx = 0; > > int task_pri = convert_prio(p->prio); > > + int best_unfit_idx = -1; > > + int idx = 0, cpu; > > > > BUG_ON(task_pri >= CPUPRI_NR_PRIORITIES); > > > > for (idx = 0; idx < task_pri; idx++) { > > - struct cpupri_vec *vec = &cp->pri_to_cpu[idx]; > > - int skip = 0; > > > > - if (!atomic_read(&(vec)->count)) > > - skip = 1; > > - /* > > - * When looking at the vector, we need to read the counter, > > - * do a memory barrier, then read the mask. > > - * > > - * Note: This is still all racey, but we can deal with it. > > - * Ideally, we only want to look at masks that are set. > > - * > > - * If a mask is not set, then the only thing wrong is that we > > - * did a little more work than necessary. > > - * > > - * If we read a zero count but the mask is set, because of the > > - * memory barriers, that can only happen when the highest prio > > - * task for a run queue has left the run queue, in which case, > > - * it will be followed by a pull. If the task we are processing > > - * fails to find a proper place to go, that pull request will > > - * pull this task if the run queue is running at a lower > > - * priority. > > - */ > > - smp_rmb(); > > - > > - /* Need to do the rmb for every iteration */ > > - if (skip) > > - continue; > > - > > - if (cpumask_any_and(p->cpus_ptr, vec->mask) >= nr_cpu_ids) > > + if (!__cpupri_find(cp, p, lowest_mask, idx)) > > continue; > > > > - if (lowest_mask) { > > - int cpu; > > Shouldn't we add an extra condition here? > > + if (!static_branch_unlikely(&sched_asym_cpucapacity)) > + return 1; > + > > Otherwise non-heterogeneous systems have to got through this > for_each_cpu(cpu, lowest_mask) further below for no good reason. Hmm below is the best solution I can think of at the moment. Works for you? It's independent of what this patch tries to fix, so I'll add as a separate patch to the series in the next update. Thanks -- Qais Yousef --- diff --git a/kernel/sched/rt.c b/kernel/sched/rt.c index 5ea235f2cfe8..5f2eaf3affde 100644 --- a/kernel/sched/rt.c +++ b/kernel/sched/rt.c @@ -14,6 +14,8 @@ static int do_sched_rt_period_timer(struct rt_bandwidth *rt_b, int overrun); struct rt_bandwidth def_rt_bandwidth; +typedef bool (*fitness_fn_t)(struct task_struct *p, int cpu); + static enum hrtimer_restart sched_rt_period_timer(struct hrtimer *timer) { struct rt_bandwidth *rt_b = @@ -1708,6 +1710,7 @@ static int find_lowest_rq(struct task_struct *task) struct cpumask *lowest_mask = this_cpu_cpumask_var_ptr(local_cpu_mask); int this_cpu = smp_processor_id(); int cpu = task_cpu(task); + fitness_fn_t fitness_fn; /* Make sure the mask is initialized first */ if (unlikely(!lowest_mask)) @@ -1716,8 +1719,17 @@ static int find_lowest_rq(struct task_struct *task) if (task->nr_cpus_allowed == 1) return -1; /* No other targets possible */ + /* + * Help cpupri_find avoid the cost of looking for a fitting CPU when + * not really needed. + */ + if (static_branch_unlikely(&sched_asym_cpucapacity)) + fitness_fn = rt_task_fits_capacity; + else + fitness_fn = NULL; + if (!cpupri_find(&task_rq(task)->rd->cpupri, task, lowest_mask, - rt_task_fits_capacity)) + fitness_fn)) return -1; /* No targets found */ /* > > > + if (!lowest_mask || !fitness_fn) > > + return 1; > > > > - cpumask_and(lowest_mask, p->cpus_ptr, vec->mask); > > + /* Ensure the capacity of the CPUs fit the task */ > > + for_each_cpu(cpu, lowest_mask) { > > + if (!fitness_fn(p, cpu)) > > + cpumask_clear_cpu(cpu, lowest_mask); > > + } > > [...]