Received: by 2002:a05:6a10:f3d0:0:0:0:0 with SMTP id a16csp1070993pxv; Thu, 1 Jul 2021 16:37:09 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwWpJVYRLn7BczNYx0pG4gKHLTKp/pEAk/4AxU+UOcJ9eANi1DdevbdCyDrVIXpCz9DMDGs X-Received: by 2002:a17:906:52d6:: with SMTP id w22mr2429664ejn.512.1625182628873; Thu, 01 Jul 2021 16:37:08 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1625182628; cv=none; d=google.com; s=arc-20160816; b=LDQnUCl722wdziufcy9wTsaZWoxBMeRuk052v8GgoES5vz8T7wz5eXcVmSkaNMblbB +xbguKAuyQ7kggRDlD8z59r/4zxW1lVOFTN8cTPVcVptXqRhj/xiYGyVEoKrr5wQB6eQ JRv3TzVSARYVM6g3ipwQjXRe7lYikWPsYpE1oheP7wHFsXwVhHagTZLsmhgKSxYAXdw4 8HDIE6enn5v1rge2Ql2yRreyv9+OoU0+RJfiw1ZVl/dIKFNXpBhe1FCyGuX9RyeCTPi/ bF8IqMd+pPVO93nJddTrNAShYBXmNw86p249LdjznoRdyh/wEpKAoH+VoTdp7hmaavzo 1sJg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:message-id :mime-version:in-reply-to:references:cc:to:subject:from:date :dkim-signature; bh=E9l07IxNSgJt9CE4ED2nK8dEUQ3zKcbJc06F5p86Rl0=; b=d55I43cp7vnN3lHTi4U5HnSa5ELcu5qyPctKp8TmqO1lToRfX6ReVP0J0HQCCFZUUT nHBJHX6UtjpZPR5DeNXmy3lYmsyvV0du6SjwH+XGBmTuhDwgv95/8P3tLQYwNKRubQ86 NNsnF8M/pHd/InQ6xN+bpCEQV7aqTdKP+tQzKGkdoiCMV55uwjr1c44SfR57ZjICGeJJ 6YbwP93L4WjCUy2Zyj3FMhfTt4LNQECyIyA9Bx7Ke8Sl6COf9aIZieMfdVJ2kiWmdt0h bzzXsrrndH+0JH1dlyZ6NmcEGYm5uz8DTAAwGkIscJX8gATj0nB5G9NmUPyaMdGDv2Ma LJOA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b="WXM//DJ7"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id 6si1368879eje.347.2021.07.01.16.36.45; Thu, 01 Jul 2021 16:37:08 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b="WXM//DJ7"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234251AbhGAXgX (ORCPT + 99 others); Thu, 1 Jul 2021 19:36:23 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:41708 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234163AbhGAXgX (ORCPT ); Thu, 1 Jul 2021 19:36:23 -0400 Received: from mail-pf1-x430.google.com (mail-pf1-x430.google.com [IPv6:2607:f8b0:4864:20::430]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 86950C061762 for ; Thu, 1 Jul 2021 16:33:51 -0700 (PDT) Received: by mail-pf1-x430.google.com with SMTP id d12so7421114pfj.2 for ; Thu, 01 Jul 2021 16:33:51 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=date:from:subject:to:cc:references:in-reply-to:mime-version :message-id:content-transfer-encoding; bh=E9l07IxNSgJt9CE4ED2nK8dEUQ3zKcbJc06F5p86Rl0=; b=WXM//DJ7dgD9d1YB/uTZEMX3MY/V5Eu+twkQEQL781kragkKcNi2bOA6j93goQnqYp jFHqB26a3oOOWHT2vLzGWq22EX+cNBvt2qdyvq9Xza53ZHAYzXcJxga4oQPtxyt2cUWa iKiRm4CCaWru4XLN2F1kB/cpEq4vmDgJdTMf/lP9yTXEMOGvK7ftLa8Lg33yYrHLPuT/ o8XWR5hHYuMjnjuQQA8YV07yXY0y3v6TaFubMUIuHBJ6vKgRQCnevRivEHFIZLs4zja7 CMrdG5Wc4IUK6fDdLzGShdGLDlmN/cYj78987h6riqIMIkxlV2lc/tyNroW0Q8wUYK8R nWUQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:subject:to:cc:references:in-reply-to :mime-version:message-id:content-transfer-encoding; bh=E9l07IxNSgJt9CE4ED2nK8dEUQ3zKcbJc06F5p86Rl0=; b=ezAS7VIid0TPM+DgMvNNxcpod7wjDyPMgwab7Z9OYM3yLjCZ4JQB++J6t+5wSG75+W mhHqcmCTx/RR1ilNEK6zC8lwE/jHSxmvnjjNi+sov/qXDbaZLGq4WkTThGHbjVn/xYRr EGFZxrKD4F/KyG94o6CDWZ31Mi0QoOwsMXDBNjmMyuAsJjvrPCivjE1I3d7rHf6k7lXW LBUstR50ueFnqSs+LoK6Nd6gzToj+e2DdTv/IgicRKPBAjO4V+2tP5wY/we6w+Qwb9jN OKLabKpT3j1Jr0dbZ8tKENpVqO1MUwi6Oqh5PWRrQheClHv1ZSizIIdq+CdvSVUhQP16 LRHA== X-Gm-Message-State: AOAM530xVAFBE0cJhSsxMd9NJfxQT9/AW8DGKl526b7dvWDbQoy03YJv lTBI+euMpEVuVr7V/ct+WDfLisYMajw= X-Received: by 2002:a63:5c04:: with SMTP id q4mr1989071pgb.127.1625182431071; Thu, 01 Jul 2021 16:33:51 -0700 (PDT) Received: from localhost ([118.209.250.144]) by smtp.gmail.com with ESMTPSA id j3sm1157167pfe.98.2021.07.01.16.33.49 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 01 Jul 2021 16:33:50 -0700 (PDT) Date: Fri, 02 Jul 2021 09:33:45 +1000 From: Nicholas Piggin Subject: Re: [PATCH] nohz: nohz idle balancing per node To: Mel Gorman , Peter Zijlstra Cc: Daniel Bristot de Oliveira , Ben Segall , Dietmar Eggemann , Frederic Weisbecker , Juri Lelli , linux-kernel@vger.kernel.org, Ingo Molnar , Steven Rostedt , Srikar Dronamraju , Vaidyanathan Srinivasan , Vincent Guittot References: <20210701055323.2199175-1-npiggin@gmail.com> <20210701131139.GD3772@suse.de> In-Reply-To: <20210701131139.GD3772@suse.de> MIME-Version: 1.0 Message-Id: <1625181736.w1x011tuds.astroid@bobo.none> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Excerpts from Mel Gorman's message of July 1, 2021 11:11 pm: > On Thu, Jul 01, 2021 at 12:18:18PM +0200, Peter Zijlstra wrote: >> On Thu, Jul 01, 2021 at 03:53:23PM +1000, Nicholas Piggin wrote: >> > Currently a single nohz idle CPU is designated to perform balancing on >> > behalf of all other nohz idle CPUs in the system. Implement a per node >> > nohz balancer to minimize cross-node memory accesses and runqueue lock >> > acquisitions. >> >=20 >> > On a 4 node system, this improves performance by 9.3% on a 'pgbench -N= ' >> > with 32 clients/jobs (which is about where throughput maxes out due to >> > IO and contention in postgres). >>=20 >> Hmm, Suresh tried something like this around 2010 and then we ran into >> trouble that when once node went completely idle and another node was >> fully busy, the completely idle node would not run ILB and the node >> would forever stay idle. >>=20 >=20 > An effect like that *might* be visible at > https://beta.suse.com/private/mgorman/melt/v5.13/3-perf-test/sched/sched-= nohznuma-v1r1/html/network-tbench/hardy2/ > at the CPU usage heatmaps ordered by topology at the very bottom of > the page. >=20 > The heatmap covers all client counts so there are "blocks" of activity fo= r > each client count tested. The third block is for 8 thread counts so a nod= e > is not fully busy yet. I'm not sure what I'm looking at. Where are these blocks? Along the x=20 axis? > However, with the vanilla kernel, there is some > load on each node but with the patch all the load is on one node. This > did not happen on the two other test machines so the observation is not > reliable and could be a total coincidence. tbench is pretty finicky so it could be. >=20 > That said, there were some gains but large losses depending on the client > count across the 3 machines for tbench which is a concern. Other results, > like pgbench mentioned in the changelog, will not complete until tomorrow > to see if it is a general pattern or tbench-specific. >=20 > https://beta.suse.com/private/mgorman/melt/v5.13/3-perf-test/sched/sched-= nohznuma-v1r1/html/network-tbench/bing2/ > https://beta.suse.com/private/mgorman/melt/v5.13/3-perf-test/sched/sched-= nohznuma-v1r1/html/network-tbench/hardy2/ > https://beta.suse.com/private/mgorman/melt/v5.13/3-perf-test/sched/sched-= nohznuma-v1r1/html/network-tbench/marvin2/ All 2-node. How many runs does it do at each clinet count? There's a big=20 regression at one clinet with one of them, but the other two have small=20 gains. Thanks, Nick