Received: by 2002:a05:6a10:a841:0:0:0:0 with SMTP id d1csp666280pxy; Thu, 22 Apr 2021 10:30:19 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyNBYoN8rV2xf47K0JJkNSXWL3N7818AFRQLwtGf5YyrFS9u85N8FN5I369WY1lEc05O1NH X-Received: by 2002:a17:90a:1990:: with SMTP id 16mr1182837pji.172.1619112619635; Thu, 22 Apr 2021 10:30:19 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1619112619; cv=none; d=google.com; s=arc-20160816; b=AbAdLZmGS2wWP5Wkw3aPG6kBfJ6Nv4w9v9l4JGXZSMRmsOf/1OZCPOEZz4wUMFYGbi SRu81ZPVnqPqjPcIXVxVtAZpdwzfWU3+tIMOKiahwTPjh2oVeHCZhtmZl0Zga2ZAvwFx VS/kizqFrVEsIOSflf1WdPIGhAciW8w9x7+uQrWjCpNE+upG8CILlZV8ahd4QmbXUwfl 22Whd63p4iC5kXQatIHikqgWs1RyNkIhRxl66aPa1J+JTR+HG2ulLT67J18BVW/8fHsw fPP7sfxhfQK+VclgDtmldnKBKLPlMUkTAZyCP5O4lCjBi6yWz1b5foLU+CLMCFlFmBM1 Bygg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject; bh=ICwNjAectYEIbnoqaOEtmK699JOLQuz+ow5Yk5TvHNw=; b=bFsgWt/rWImeJw2WpClWfhSBkIilQqpxyk0bDAUzETvSDH/OVddCQt3dXPddXBRcEy 3ZVQ2nhI5EfULgv3ra3Rk+wMGrkc1y+IE4Z3aAaXU0sUhDYDwkOD7pq9ayiXfHee8+S3 bagcRedx5uji4sjTaE2MvkE+f7etGHhouunZLzXPYf0LBeB7dWPSoy5G5eMABFtmL+6n 6mS3uDw/vxFjh61aHaBfnmBVkEjnuIq7QQ3DPMzY8SGBFeWJnlEfd8+/8wE/TrgbzdVA sylHOu57cRyloKSDdnq5CFcUg9RhkI09sg88TzjhHPA+9NpKhCmIY94sjGcov/TtwZLq 2egw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id w14si3680527plc.85.2021.04.22.10.30.07; Thu, 22 Apr 2021 10:30:19 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S236668AbhDVRaH (ORCPT + 99 others); Thu, 22 Apr 2021 13:30:07 -0400 Received: from foss.arm.com ([217.140.110.172]:54372 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S236058AbhDVRaG (ORCPT ); Thu, 22 Apr 2021 13:30:06 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id D2741D6E; Thu, 22 Apr 2021 10:29:30 -0700 (PDT) Received: from [192.168.178.6] (unknown [172.31.20.19]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id EDC763F694; Thu, 22 Apr 2021 10:29:28 -0700 (PDT) Subject: Re: [PATCH 2/2] sched/fair: Relax task_hot() for misfit tasks To: Valentin Schneider , Vincent Guittot Cc: linux-kernel@vger.kernel.org, Peter Zijlstra , Ingo Molnar , Morten Rasmussen , Qais Yousef , Quentin Perret , Pavan Kondeti , Rik van Riel , Lingutla Chandrasekhar References: <20210415175846.494385-1-valentin.schneider@arm.com> <20210415175846.494385-3-valentin.schneider@arm.com> <20210416135113.GA16445@vingu-book> <87blaakxji.mognet@arm.com> <878s5bvrij.mognet@arm.com> From: Dietmar Eggemann Message-ID: Date: Thu, 22 Apr 2021 19:29:27 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.7.1 MIME-Version: 1.0 In-Reply-To: <878s5bvrij.mognet@arm.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 21/04/2021 12:52, Valentin Schneider wrote: > On 20/04/21 16:33, Vincent Guittot wrote: >> On Mon, 19 Apr 2021 at 19:13, Valentin Schneider >> wrote: >>> >>> On 16/04/21 15:51, Vincent Guittot wrote: >>>> Le jeudi 15 avril 2021 � 18:58:46 (+0100), Valentin Schneider a �crit : >>>>> + >>>>> +/* >>>>> + * What does migrating this task do to our capacity-aware scheduling criterion? >>>>> + * >>>>> + * Returns 1, if the task needs more capacity than the dst CPU can provide. >>>>> + * Returns 0, if the task needs the extra capacity provided by the dst CPU >>>>> + * Returns -1, if the task isn't impacted by the migration wrt capacity. >>>>> + */ >>>>> +static int migrate_degrades_capacity(struct task_struct *p, struct lb_env *env) >>>>> +{ >>>>> + if (!(env->sd->flags & SD_ASYM_CPUCAPACITY)) >>>>> + return -1; >>>>> + >>>>> + if (!task_fits_capacity(p, capacity_of(env->src_cpu))) { >>>>> + if (cpu_capacity_greater(env->dst_cpu, env->src_cpu)) >>>>> + return 0; >>>>> + else if (cpu_capacity_greater(env->src_cpu, env->dst_cpu)) >>>>> + return 1; >>>>> + else >>>>> + return -1; >>>>> + } >>>> >>>> Being there means that task fits src_cpu capacity so why testing p against dst_cpu ? >>>> >>> >>> Because if p fits on src_cpu, we don't want to move it to a dst_cpu on >>> which it *doesn't* fit. >> >> OK. I was confused because I thought that this was only to force >> migration in case of group_misfit_task but you tried to extend to >> other cases... I'm not convinced that you succeeded to cover all cases >> >> Also I found this function which returns 3 values a bit disturbing. >> IIUC you tried to align to migrate_degrades_capacity but you should >> have better aligned to task_hot and return only 0 or 1. -1 is not used >> > > Ack, will do. > >>>>> @@ -7672,6 +7698,15 @@ int can_migrate_task(struct task_struct *p, struct lb_env *env) >>>>> if (tsk_cache_hot == -1) >>>>> tsk_cache_hot = task_hot(p, env); >>>>> >>>>> + /* >>>>> + * On a (sane) asymmetric CPU capacity system, the increase in compute >>>>> + * capacity should offset any potential performance hit caused by a >>>>> + * migration. >>>>> + */ >>>>> + if ((env->dst_grp_type == group_has_spare) && >>>> >>>> Shouldn't it be env->src_grp_type == group_misfit_task to only care of misfit task case as >>>> stated in $subject >>>> >>> >>> Previously this was env->idle != CPU_NOT_IDLE, but I figured dst_grp_type >>> could give us a better picture. Staring at this some more, this isn't so >>> true when the group size goes up - there's no guarantees the dst_cpu is the >>> one that has spare cycles, and the other CPUs might not be able to grant >>> the capacity uplift dst_cpu can. >> >> yeah you have to keep checking for env->idle != CPU_NOT_IDLE >> >>> >>> As for not using src_grp_type == group_misfit_task, this is pretty much the >>> same as [1]. CPU-bound (misfit) task + some other task on the same rq >>> implies group_overloaded classification when balancing at MC level (no SMT, >>> so one group per CPU). >> >> Is it something that happens often or just a sporadic/transient state >> ? I mean does it really worth the extra complexity and do you see >> performance improvement ? >> > > "Unfortunately" yes, this is a relatively common scenario when running "1 > big task per CPU" types of workloads. The expected behaviour for big.LITTLE > systems is to upmigrate tasks stuck on the LITTLE CPUs as soon as a big CPU > becomes free, usually via newidle balance (which, since they process work > faster than the LITTLEs, is bound to happen), and an extra task being > enqueued at "the wrong time" can prevent this from happening. > > This usually means a misfit task can take a few dozen extra ms than it > should to be migrated - in the tests I run (which are pretty much this 1 > hog per CPU workload) this happens about ~20% of the time. > >> You should better focus on fixing the simple case of group_misfit_task >> task. This other cases looks far more complex with lot of corner cases >> >>> >>> [1]: http://lore.kernel.org/r/jhjblcuv2mo.mognet@arm.com Just to make sure I can follow the conversation ... In case you: (1) return 1 instead of -1 (2) keep the `env->idle != CPU_NOT_IDLE` check (3) and remove the `dst_grp_type == group_has_spare` check you are pretty much back to what you had in [PATCH v3 7/7] directly in task_hot() except: (4) the 'if (p fits on src_cpu && p !fits dst_cpu) => tsk_cache_hot) check?