Received: by 2002:a05:6602:18e:0:0:0:0 with SMTP id m14csp1652237ioo; Sun, 22 May 2022 23:05:22 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwXc0Lt6D3PrN5qiYm5z/8h/Ft7uxx3xf+Xd8ZIPADlx/k+5TxBfX1QxDRgXxG9Z1QfUDSO X-Received: by 2002:a65:50cd:0:b0:3f5:d0e6:7798 with SMTP id s13-20020a6550cd000000b003f5d0e67798mr18939458pgp.95.1653285922714; Sun, 22 May 2022 23:05:22 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1653285922; cv=none; d=google.com; s=arc-20160816; b=nZuAqDrT6S2ipw2tVNGCDLK+wodOCPiRvbVqBjHN2qnlm7RaJX3IS3rvLlMZqFaTSI HD9Ohz8xvK4FoSCl91HIBjehE2LTTUKgV2rAWf7Ioa3DhzZrIzlGeDaZYVWcBLwTesEd /kwKYEfcBdyP629ZxqXyNQQqfP4huTpERjK7UyGZBtIvUdSnLlW/ru8chBviqguGh9IF cv8WtsOoHwyRDcnVHlpw4xQ4BH3r8Yw20qa7OJS+RJxSVbdNpctdmy0kC5Cefs8YviCN 5Clp6M1oRF7P3bbZlgd4Z2IO3x7BERZF9/Rc4aB0RovgygBdxgYbtdsKwWbdvOESJAGe D4AA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date; bh=pR4dTv3NpFxsW5NmhNjLPufQ/MT8Bkv7NBLJ/fFIF4E=; b=NN1ibbw50qS52AckYRgYhbsR6jR4MYen/9qC8OHb6VEllWpC2u23eAzhuoyI5dKLFH jLHq7RTIvfDdf3T66IjFvgkZHbSJglZbHnx2FmMkVT6x0WtfjE30hSG63rvyyQg6o6P3 zl4W2E1unLpjoryIGyR2llAm0w+qeXZbFn0wsPOcuajjWpyoPo4sLVvSx1FlLh2lkWux kbv3szjoFUutJbgGu/fgm1sc1bQ2RvTS7IrJAUIv5xy/N5hGMw8WwbSSGfm13KNEmbdb 4a6OB4bbFB2X7TXEKXIsEJry/0iutK+FICIfP4RCVEbRYmg4uneRx4ovsMP5fME1wzRd 2t+g== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [2620:137:e000::1:18]) by mx.google.com with ESMTPS id c7-20020a655a87000000b003816043f10bsi8597748pgt.768.2022.05.22.23.05.22 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sun, 22 May 2022 23:05:22 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) client-ip=2620:137:e000::1:18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 2DE3327CD8; Sun, 22 May 2022 22:57:14 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1348077AbiETKSU (ORCPT + 99 others); Fri, 20 May 2022 06:18:20 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:35268 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S238821AbiETKSS (ORCPT ); Fri, 20 May 2022 06:18:18 -0400 Received: from outbound-smtp46.blacknight.com (outbound-smtp46.blacknight.com [46.22.136.58]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id B91D8EAD1D for ; Fri, 20 May 2022 03:18:16 -0700 (PDT) Received: from mail.blacknight.com (pemlinmail04.blacknight.ie [81.17.254.17]) by outbound-smtp46.blacknight.com (Postfix) with ESMTPS id 3D0ECFA964 for ; Fri, 20 May 2022 11:18:15 +0100 (IST) Received: (qmail 13443 invoked from network); 20 May 2022 10:18:15 -0000 Received: from unknown (HELO techsingularity.net) (mgorman@techsingularity.net@[84.203.198.246]) by 81.17.254.9 with ESMTPSA (AES256-SHA encrypted, authenticated); 20 May 2022 10:18:15 -0000 Date: Fri, 20 May 2022 11:18:12 +0100 From: Mel Gorman To: K Prateek Nayak Cc: Peter Zijlstra , Ingo Molnar , Vincent Guittot , Valentin Schneider , Aubrey Li , LKML Subject: Re: [PATCH 0/4] Mitigate inconsistent NUMA imbalance behaviour Message-ID: <20220520101812.GW3441@techsingularity.net> References: <20220511143038.4620-1-mgorman@techsingularity.net> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-15 Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,RDNS_NONE, SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, May 20, 2022 at 10:28:02AM +0530, K Prateek Nayak wrote: > Hello Mel, > > We tested the patch series on a our systems. > > tl;dr > > Results of testing: > - Benefits short running Stream tasks in NPS2 and NPS4 mode. > - Benefits seen for tbench in NPS1 mode for 8-128 worker count. > - Regression in Hackbench with 16 groups in NPS1 mode. A rerun for > same data point suggested run to run variation on patched kernel. > - Regression in case of tbench with 32 and 64 workers in NPS2 mode. > Patched kernel however seems to report more stable value for 64 > worker count compared to tip. > - Slight regression in schbench in NPS2 and NPS4 mode for large > worker count but we did spot some run to run variation with > both tip and patched kernel. > > Below are all the detailed numbers for the benchmarks. > Thanks! I looked through the results but I do not see anything that is very alarming. Some notes. o Hackbench with 16 groups on NPS1, that would likely be 640 tasks communicating unless other paramters are used. I expect it to be variable and it's a heavily overloaded scenario. Initial placement is not necessarily critical as migrations are likely to be very high. On NPS1, there is going to be random luck given that the latency to individual CPUs and the physical topology is hidden. o NPS2 with 128 workers. That's at the threshold where load is potentially evenly split between the two sockets but not perfectly split due to migrate-on-wakeup being a little unpredictable. Might be worth checking the variability there. o Same observations for tbench. I looked at my own results for NPS1 on Zen3 and what I see is that there is a small blip there but the mpstat heat map indicates that the nodes are being more evenly used than without the patch which is expected. o STREAM is interesting in that there are large differences between 10 runs and 100 hundred runs. In indicates that without pinning that STREAM can be a bit variable. The problem might be similar to NAS as reported in the leader mail with the variability due to commit c6f886546cb8 for unknown reasons. > > > > kernel/sched/fair.c | 59 ++++++++++++++++++++++++++--------------- > > kernel/sched/topology.c | 23 ++++++++++------ > > 2 files changed, 53 insertions(+), 29 deletions(-) > > > > Please let me know if you would like me to get some additional > data on the test system. Other than checking variability, the min, max and range, I don't need additional data. I suspect in some cases like what I observed with NAS that there is wide variability for reasons independent of this series. I'm of the opinion though that your results are not a barrier for merging. Do you agree? -- Mel Gorman SUSE Labs