Received: by 10.192.165.156 with SMTP id m28csp918250imm; Wed, 11 Apr 2018 09:14:36 -0700 (PDT) X-Google-Smtp-Source: AIpwx48QI57vDUUaP5yHs4ooIpjMwoSeiq8w5eUGz9Mw9qxXPxqM+vtoarNkrT9RPJgmdAgzJgyZ X-Received: by 2002:a17:902:107:: with SMTP id 7-v6mr5734583plb.374.1523463276616; Wed, 11 Apr 2018 09:14:36 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1523463276; cv=none; d=google.com; s=arc-20160816; b=l4BUJ3G6jWhwb+1LCWmZTbJV8LoBh8c5cgnkb03XB1x1kvBooWBxA2Z4bLakCnsW5F tv8L6OMPAyRLeh6vQ+0U9fLex4SiaTuH8zmiGocUevR7HJZHxhcrFhTMKZ12EKF+zJLe lSuo3aikr/HntFzaij3IuhOG+wlXZCr8kPXeKzJsS0qWLbNOstyKhxibaySCuCNL7+Ml vD7BY8pgja41ZfIYXZLcifYhctjUmooHVyH0rsVIeN5l5X9uU49GJxpYnMcRezTca4I0 i6APRnYDOmkF0XydRFHNOAeH8DK7OscuB2jBxe1SB2TwhsYgJ+kqcHyt1C223iO1eeQb nB6w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :references:in-reply-to:mime-version:dkim-signature :arc-authentication-results; bh=e6KsEEwLQsTVeVIcJqZqbSSWE8OBRwHuejhdFWR+uag=; b=TCfMVcs3ipGbOWEBT998gUI8WST/nrgZsBkhpxkbGLaYLs2g5Sk3ZXdwLyeGIqDoNy Bw3RDC5LExUJczIUcpt7Fmm0Qy8BA1q1k29K+IQNR392pHArsV66PIm/Twj+phnUOTkf /CUUUTRXCp34KsxuksfUEDZOOKH8Ozfy89T6NYApb7gGuUul0B8wCGqPzSSbKkCvm2cP aQg3zDItfVOF8+LB6yF5ZuBDaiGFJr0uuW/tyOO0xiM3fviZuA35iqojR4CECKe5PUdN B7WylkMQ+uBUxbZCSDxJ9ZeTaJ0Fn8VdTyKfJ5aokXBs15gslVV5CFuCWcZOv8aBw0QC gvtw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=JlQBt2pn; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id z7si944642pgv.803.2018.04.11.09.13.59; Wed, 11 Apr 2018 09:14:36 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=JlQBt2pn; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753129AbeDKQLK (ORCPT + 99 others); Wed, 11 Apr 2018 12:11:10 -0400 Received: from mail-it0-f50.google.com ([209.85.214.50]:52113 "EHLO mail-it0-f50.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752894AbeDKQLI (ORCPT ); Wed, 11 Apr 2018 12:11:08 -0400 Received: by mail-it0-f50.google.com with SMTP id b5-v6so3471989itj.1 for ; Wed, 11 Apr 2018 09:11:08 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc; bh=e6KsEEwLQsTVeVIcJqZqbSSWE8OBRwHuejhdFWR+uag=; b=JlQBt2pnmZpz7whHA0tgT+oT4OVjh3k9b45PT4k7o3gemk3v9po00gffuuZ0ei0qWn F+kI7iNPgn2ME3vW+JlIt6lckVY1O2MDcxBw7c41FbEbBacLP1ulrT5TM9hMSXXCQX4l yTxTNL5d1poL+p8ST7pD5Pr8241+4RfP7nKNo= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=e6KsEEwLQsTVeVIcJqZqbSSWE8OBRwHuejhdFWR+uag=; b=pBxngLMxJ4g3QzRjZeBZXIbWv5pvn+oWNUxif1uLTHu9r66tT6PXFdb5M5kzV4hxEF XNd01P1CQQOc37b85FsWq4ezFqCPHcC02YZeIMlDj3r8nzqj3hsdGFOMSHd3PeE1r2FQ HMVAfgdpFxrLUlFiu/CnlfDPC3GxhZW4KGWwwSIE3chiolcasNyo6t70ypwvBS4j/YBV I+SsjfsMq4UN/c+MS8mE0zdBN8HaZ3cLhuY7OuFx0+t5F5iHKZRkZxOb1oqZ8II4bDkJ arKoixP0fCqdqgM4VIGgJw6i68/tmSLLmfagWlDowirB9BA73PL0XBRQaFg4ldWWRRFf vXqQ== X-Gm-Message-State: ALQs6tAaunwo0os9yRSgIQIjtUxrvBABgVuNG+i/afzAT8G2Fc7nnrgx Ifd3FWMkLqv8ygayalh1FhhsoorXoDVnLaORRFVgiw== X-Received: by 2002:a24:1c11:: with SMTP id c17-v6mr4487840itc.17.1523463067916; Wed, 11 Apr 2018 09:11:07 -0700 (PDT) MIME-Version: 1.0 Received: by 10.107.222.20 with HTTP; Wed, 11 Apr 2018 09:10:47 -0700 (PDT) In-Reply-To: <20180411160000.GO4082@hirez.programming.kicks-ass.net> References: <20180406172835.20078-1-patrick.bellasi@arm.com> <20180410110412.GG14248@e110439-lin> <20180411151450.GK4043@hirez.programming.kicks-ass.net> <20180411153710.GN4082@hirez.programming.kicks-ass.net> <20180411160000.GO4082@hirez.programming.kicks-ass.net> From: Vincent Guittot Date: Wed, 11 Apr 2018 18:10:47 +0200 Message-ID: Subject: Re: [PATCH] sched/fair: schedutil: update only with all info available To: Peter Zijlstra Cc: Patrick Bellasi , linux-kernel , "open list:THERMAL" , Ingo Molnar , "Rafael J . Wysocki" , Viresh Kumar , Juri Lelli , Joel Fernandes , Steve Muckle , Dietmar Eggemann , Morten Rasmussen Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 11 April 2018 at 18:00, Peter Zijlstra wrote: > On Wed, Apr 11, 2018 at 05:41:24PM +0200, Vincent Guittot wrote: >> Yes. and to be honest I don't have any clues of the root cause :-( >> Heiner mentioned that it's much better in latest linux-next but I >> haven't seen any changes related to the code of those patches > > Yeah, it's a bit of a puzzle. Now you touch nohz, and the patches in > next that are most likely to have affected this are rjw's > cpuidle-vs-nohz patches. The common demoninator being nohz. > > Now I think rjw's patches will ensure we enter nohz _less_, they avoid > stopping the tick when we expect to go idle for a short period only. > > So if your patch makes nohz go wobbly, going nohz less will make that > better. > > Of course, I've no actual clue as to what that patch (it's the last one > in the series, right?: > > 31e77c93e432 ("sched/fair: Update blocked load when newly idle") > > ) does that is so offensive to that one machine. You never did manage to > reproduce, right? yes > > Could is be that for some reason the nohz balancer now takes a very long > time to run? Heiner mentions that is was a relatively slow celeron and he uses ondemand governor. So I was about to ask him to use performance governor to see if it can be because cpu runs slow and takes too muche time to enter idle > > Could something like the following happen (and this is really flaky > thinking here): > > last CPU goes idle, we enter idle_balance(), that kicks ilb, ilb runs, > which somehow again triggers idle_balance and around we go? > > I'm not immediately seeing how that could happen, but if we do something > daft like that we can tie up the CPU for a while, mostly with IRQs > disabled, and that would be visible as that latency he sees. > >