Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp5078931imm; Tue, 31 Jul 2018 05:15:37 -0700 (PDT) X-Google-Smtp-Source: AAOMgpflfiH30RTgOoDHmK2eBejWb4Ch0TlQRnfyF5HkH9xSp5ZgbYiGwc0NiFfF0RJjmUcGmxZ9 X-Received: by 2002:a62:c8c2:: with SMTP id i63-v6mr22077101pfk.73.1533039337603; Tue, 31 Jul 2018 05:15:37 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1533039337; cv=none; d=google.com; s=arc-20160816; b=b21OL/iyLrFI8LNIdwSJFiQP9n5TKKnOV3UaICUBBilA6uszRaNITEQ3M4yecS8CYZ 6w6w3N6QKJyd/tchTgrRfXg0wtxNmd65J/tmTco8+3App/IsUxnUjXiszwemhoJVj9YN TagunAljzcYO2e/YYaDuBvBQOqkVaNij08Widy+hk6eBS2bE4e13CtKctWN9el+QirxR AQZTKZYB06lpKenzCrOIe9+0QuNXr9Zwitl+gn0iV6YdY9nW+rXFIuoKAYFV8QbbvgWF YCPuJ8G18B/80v5QAfclxVBZwDfmp1VLhSmS0QNt2yQnRy9uAEMQsSk2WQhBsuiWMvv6 pALA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature :arc-authentication-results; bh=wK+hmBriPJS0essk1f42PXyiyWgwWCsQJ3P3hHFJ8go=; b=xQf7prQ+MKQOvoEE2kxIaM7BOGP84AzoXIJgGg//D6MJSrMHk9IeQpJyquF+fZcOgf TIJLVuN5cj7SdUBNCXn5QR+892mEiVmJZpVGg8JvJxcGowt7xjVX1nAkZXipNXSrc7wb P3yq5RzV/zAyz50LSnAGj08VqO9FM2b2/HAy3hZwGUOI1CNWGwx4wWN4AG031M5Hs6ky chKanfzbS29tIL1NzM+vaeba0Tkm/1Zrfc5YHmkujuuuYi2rRH+/aXwJjd67KJ05z/ps giPZ/Y9rpK9H3rc+zyNkedbacIhIRUCAkRYN7qNl8lyRlIqrLFkeEe/LEuALhz6nJycS kyfw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=If23gzKT; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id h12-v6si14056169pfk.156.2018.07.31.05.15.23; Tue, 31 Jul 2018 05:15:37 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=If23gzKT; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1732233AbeGaNxW (ORCPT + 99 others); Tue, 31 Jul 2018 09:53:22 -0400 Received: from mail-io0-f193.google.com ([209.85.223.193]:39834 "EHLO mail-io0-f193.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1732125AbeGaNxV (ORCPT ); Tue, 31 Jul 2018 09:53:21 -0400 Received: by mail-io0-f193.google.com with SMTP id o22-v6so12787425ioh.6 for ; Tue, 31 Jul 2018 05:13:18 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=wK+hmBriPJS0essk1f42PXyiyWgwWCsQJ3P3hHFJ8go=; b=If23gzKT7yrbYP1vcWQ9gviEddnq2lkH7pa9ux1u7evw3VcrxokMDN9OfdKDKH6cRS VTFAXUDsHJTj378K7LPSgE7qoB85wGqeh3tg/k2I9YHLHGEp1XHfy8RRacj23f39f+M+ Gsg2kZYJ5f4NWZAkL19w7J9uatxJiZEmmWUag= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=wK+hmBriPJS0essk1f42PXyiyWgwWCsQJ3P3hHFJ8go=; b=aVkpkMDkkEaVKEfXiBPdIsdgjdzPsUpzoWVMRKbeagIOfe2NOUP72ToPHyadv00iRm CiLxjMZM6TQ3Rft43hd1RQVrABLXh5hkoxV4TPP0N8eaTxbEPyGYzAIHXnqFwGJuEfTj 9F1KXe0NS7DEgONvPJ1GXorAR7e1qZe8N4MVFIkOsn240L1li6nxbe7zIuelgXBhPDO2 5656wxNxVDYCUa6o5ms9I4X/+0vGIdAuhs40KeMkdG5ou/iLySVEi1TymIG4aRBfIMlT 0fsvViIKqs9pCZhT1sqpE3eGLL5e82gxNf5H5cQRmf66OfJ9Nz4o6kP2YH1ZYLIEhOyR TuLw== X-Gm-Message-State: AOUpUlG+ER2k9s1IwXqm8hUkUd/rxPXQUjeisKVyOeZv4AbjE3zoVnHj rv4XNnFxHe+m/w1d3U3IwCEKVhDPrnR6EXZ2jL/C6Q== X-Received: by 2002:a6b:ec08:: with SMTP id c8-v6mr10833182ioh.18.1533039198149; Tue, 31 Jul 2018 05:13:18 -0700 (PDT) MIME-Version: 1.0 References: <1530699470-29808-1-git-send-email-morten.rasmussen@arm.com> <20180709150839.GA19287@e105550-lin.cambridge.arm.com> <98eb0daf-4467-3790-ec80-90b746807392@arm.com> <7bad2238-369c-f61c-0a51-eb14204e0429@arm.com> In-Reply-To: <7bad2238-369c-f61c-0a51-eb14204e0429@arm.com> From: Vincent Guittot Date: Tue, 31 Jul 2018 14:13:07 +0200 Message-ID: Subject: Re: [PATCHv4 00/12] sched/fair: Migrate 'misfit' tasks on asymmetric capacity systems To: Dietmar Eggemann Cc: Valentin Schneider , Morten Rasmussen , Peter Zijlstra , Ingo Molnar , gaku.inami.xh@renesas.com, linux-kernel Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, 30 Jul 2018 at 16:30, Dietmar Eggemann wrote: > > On 07/26/2018 07:14 PM, Valentin Schneider wrote: > > Hi, > > > > On 09/07/18 16:08, Morten Rasmussen wrote: > >> On Fri, Jul 06, 2018 at 12:18:27PM +0200, Vincent Guittot wrote: > >>> Hi Morten, > >>> > >>> On Wed, 4 Jul 2018 at 12:18, Morten Rasmussen wrote: > > [...] > > > With that out of the way, I did some lmbench runs: > >> lat_mem_rd 10 1024 > > > > With ASYM_PACKING, I still see lmbench tasks remaining on LITTLE CPUs while > > bigs are free, because ASYM_PACKING only does explicit active balancing on > > CPU_NEWLY_IDLE balancing - otherwise it'll rely on the nr_balance_failed counter. > > > > However, that counter can be reset before it reaches the threshold at which > > active balance is done, which can lead to huge upmigration delays (almost a > > full second). I also see the same kind of issues on Juno r0. > > > > This could be resolved by extending ASYM_PACKING active balancing to > > non NEWLY_IDLE cases, but then we'd be thrashing everything. That's another > > argument for basing upmigration on task load-tracking signals, as we can > > determine which tasks need active balancing much faster than the > > nr_balance_failed counter way while not active balancing the world. > > The task layout of the test looks like n=85 always running tasks (each > for ~ 1.25ms on big or little) and they all get created and run one How mistfit task can make a difference for a benchmark which uses 1.25ms tasks ? > after the other. So on a big cpu, their util values go from 512 to 1024 > and from 223 to 446 on little cpu (Juno board). Latter thanks to > Quentin's 'sched/fair: Fix util_avg of new tasks for asymmetric systems'. > > root@juno:~# cat /sys/devices/system/cpu/cpu[01]/cpu_capacity > 446 > 1024 > > > (lat_mem_rd 10 1024) with ASYM_PACKING: > > ... > > 4.0 148.66 <----- > > 4.5 10.191 > ... > > 7.5 10.203 > > 8.0 154.354 <----- > > I ran the test affine to big, little and all cpus on tip/sched/core w/o > ASYM_PACKING or Misfit: > > cputype: big little all > cpumask: 0x06 0x39 0xff > > mem size <---- latency ----> > > 0.00098 3.668 3.595 3.669 > 0.00195 3.668 3.594 3.594 > 0.00293 3.668 3.593 3.595 > 0.00391 3.669 3.596 3.595 > ... > 3.75000 58.687 121.934 122.293 > 4.00000 57.054 121.771 120.489 > 4.50000 56.914 121.851 56.729 > 5.00000 57.347 121.777 56.975 > 5.50000 57.705 121.738 68.981 > 6.00000 57.935 121.728 57.542 > 6.50000 58.119 121.694 121.799 > 7.00000 58.194 121.502 57.844 > 7.50000 58.258 121.684 58.050 > 8.00000 58.293 121.725 58.030 > 9.00000 58.309 121.793 58.188 > 10.00000 58.561 122.252 122.078 > > There is no diff between big and little cpus with small memory sizes, > just with the MB range. > If I look into the trace for 'all' it turns out that their are cases in > which, even if the task only run for ~15% of the time on big, the > latency value is printed as when it was running affine to big. So using > the latency value as an indicator where the task was scheduled is IMHO > not really possible.