Received: by 2002:a05:6a10:8c0a:0:0:0:0 with SMTP id go10csp2883457pxb; Sat, 6 Feb 2021 10:56:42 -0800 (PST) X-Google-Smtp-Source: ABdhPJw68s7sbB0ZD0IXvU/4UkXGm8p91KB9wrL7gD8HX/3eIFRhRBk1lPlTv0LQTUapU9ueTXzb X-Received: by 2002:a17:906:3f8d:: with SMTP id b13mr9772964ejj.464.1612637802050; Sat, 06 Feb 2021 10:56:42 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1612637802; cv=none; d=google.com; s=arc-20160816; b=pewrMPW2a6eSlEiROwcBDH/pI+Wsv0vBlIP+oIaL0MvboJYSOpAhCwQBR5uVe7zGcN nZ2HKSaB53TklFPRPq7HX7b/3wyh09tzzyD/05oKz0PTDHMY/Z5ION0lFlViTi/8JNhN saOYr+G0ObFu26Fi5WwX4KOrDCAIJ1iWx8aP/avc399VenE3daz4c5H2zaK3oSx4u28Q mxiwrM5zN2t1hNEBaed/YXimVWv3HP5pY8QMbzs2KNnqRXarb5hZfGd9dOfxm1IpAoxD WV+yp+Zyvg0feWghStB9QrgQElRh4Dr+R9DAzIvDbf0bWregVZEhlilSO/6MddWXWiOR ga8w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:mime-version:user-agent:references:message-id :in-reply-to:subject:cc:to:from:date; bh=3f0ADfaE+qOEgVo7NgRmgPzg08zCL8I++Lg7TBfukfA=; b=v8vMp6NHBKacDCO0qRn+WGhfVRUVsGcRz3DgmfNFQYGBE5K2PLxbci0uIYa0Ya1/Ro VuGTmGIgK4UtYJ6hVF8OzWQ0nFILoZaPyQ2CtJXT2UTIifkQ5D++NXXtmYKL5XQ1u/nm PEEYEk4sHkfz1ztJV49BNpiMYKqH5W6PdjTqxfrQx92lBNjYzfDWQSqYQE6mKNXSm9iB zOCQUpcsD0Mju9Hk19YVd/lABRl39/mkpkilxd3cfsFIzNax7+bClykZWVkJWBm05OA1 N46MH0qvoQfIlamnxvjCmYgpm+sDYWpvFMfjCkaFKqilhCgdijvrHkEnaYlTvq6WVHiL qFUg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id p2si7785215edm.515.2021.02.06.10.56.18; Sat, 06 Feb 2021 10:56:42 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230218AbhBFRVl (ORCPT + 99 others); Sat, 6 Feb 2021 12:21:41 -0500 Received: from mail3-relais-sop.national.inria.fr ([192.134.164.104]:65372 "EHLO mail3-relais-sop.national.inria.fr" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229715AbhBFRVk (ORCPT ); Sat, 6 Feb 2021 12:21:40 -0500 X-IronPort-AV: E=Sophos;i="5.81,158,1610406000"; d="scan'208";a="372289270" Received: from 173.121.68.85.rev.sfr.net (HELO hadrien) ([85.68.121.173]) by mail3-relais-sop.national.inria.fr with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 06 Feb 2021 18:20:56 +0100 Date: Sat, 6 Feb 2021 18:20:56 +0100 (CET) From: Julia Lawall X-X-Sender: jll@hadrien To: Vincent Guittot cc: Mel Gorman , Ingo Molnar , Peter Zijlstra , Juri Lelli , Dietmar Eggemann , Steven Rostedt , Ben Segall , Daniel Bristot de Oliveira , linux-kernel Subject: Re: [PATCH v2] sched/fair: check for idle core In-Reply-To: Message-ID: References: <1603372550-14680-1-git-send-email-Julia.Lawall@inria.fr> <20201027091936.GS32041@suse.de> <20210125091238.GE20777@suse.de> User-Agent: Alpine 2.22 (DEB 394 2020-01-19) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, 25 Jan 2021, Vincent Guittot wrote: > On Mon, 25 Jan 2021 at 10:20, Julia Lawall wrote: > > > > > > > > On Mon, 25 Jan 2021, Mel Gorman wrote: > > > > > On Sun, Jan 24, 2021 at 09:38:14PM +0100, Julia Lawall wrote: > > > > > > > > > > > > On Tue, 27 Oct 2020, Mel Gorman wrote: > > > > > > > > > On Thu, Oct 22, 2020 at 03:15:50PM +0200, Julia Lawall wrote: > > > > > > Fixes: 11f10e5420f6 ("sched/fair: Use load instead of runnable load in wakeup path") > > > > > > Signed-off-by: Julia Lawall > > > > > > Reviewed-by Vincent Guittot > > > > > > > > > > > > > > > > While not a universal win, it was mostly a win or neutral. In few cases > > > > > where there was a problem, one benchmark I'm a bit suspicious of generally > > > > > as occasionally it generates bad results for unknown and unpredictable > > > > > reasons. In another, it was very machine specific and the differences > > > > > were small in absolte time rather than relative time. Other tests on the > > > > > same machine were fine so overall; > > > > > > > > > > Acked-by: Mel Gorman > > > > > > > > Recently, we have been testing the phoronix multicore benchmarks. On v5.9 > > > > with this patch, the preparation time of phoronix slows down, from ~23 > > > > seconds to ~28 seconds. In v5.11-rc4, we see 29 seconds. It's not yet > > > > clear what causes the problem. But perhaps the patch should be removed > > > > from v5.11, until the problem is understood. > > > > > > > > commit d8fcb81f1acf651a0e50eacecca43d0524984f87 > > > > > > > > > > I'm not 100% convinved given that it was a mix of wins and losses. In > > > the wakup path in general, universal wins almost never happen. It's not > > > 100% clear from your mail what happens during the preparation patch. If > > > it included time to download the benchmarks and install then it would be > > > inherently variable due to network time (if download) or cache hotness > > > (if installing/compiling). While preparation time can be interesting -- > > > for example, if preparation involves reading a lot of files from disk, > > > it's not universally interesting when it's not the critical phase of a > > > benchmark. > > > > The benchmark is completely downloaded prior to the runs. There seems to > > be some perturbation to the activation of containerd. Normally it is > > even: * * * * > > Does it impact the benchmark results too or only the preparation prior > to running the benchmark ? > > > > > and with the patch it becomes more like: * ** ** > > > > That is every other one is on time, and every other one is late. > > > > But I don't know why this happens. > > > > julia > > > > > > > > I think it would be better to wait until the problem is fully understood > > > to see if it's a timing artifact (e.g. a race between when prev_cpu is > > > observed to be idle and when it is busy). > > I agree that a better understanding of what is happening is necessary > before any changes The tests were incorrect. The faster ones without the patch were with schedutil. If we use powersave with the patch or without we get the same setup time and comparable values for the metrics for the actual benchmarks (some of which vary a lot, though). So there is no evidence of any problem with the patch. julia