Received: by 2002:a25:6193:0:0:0:0:0 with SMTP id v141csp808496ybb; Fri, 20 Mar 2020 08:24:43 -0700 (PDT) X-Google-Smtp-Source: ADFU+vsR5yVrMicIvxSOgMQVf359mHaJwkbRzwT4my6mhboNX4C76rsuHT68FVbseCsDc2mIEou+ X-Received: by 2002:a05:6830:6:: with SMTP id c6mr6960514otp.84.1584717882953; Fri, 20 Mar 2020 08:24:42 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1584717882; cv=none; d=google.com; s=arc-20160816; b=bQHFWkP00EU/yQheOECeiS860lhIb0HoXBwZ9Wty2n9EDvFDDYbWeWYCs+R1LbhUu9 raIXhgtrh1DLlRNEPsTQe3YXR9JYbmJVArAGnKw0Lw18XbI7snZawaniTtExArX/6jCJ M5JFzua6RqAEoMCetP8Bs5bBSUY1O5tQZ1i2/LEZHRiw9zYM5clVqnilPwPrpaxu95yZ FKLZc9ZMZDQHKNITxNhaEEbQNr935LO6pWMoDXkIFwQe5GMeToI2z5EGQ4M+ii9vSk+t dq27DAhB2pZUL8HM4UquroC+APx4gw8WWVrDSpFMn3xa3Lhsoxv+BX29i7NXNTCzx1wu kAZQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=AO4FkD2LspHwNGCy6p8fkih1ba0DNTqLebLiMj5kmCg=; b=wzFikLUrLSNEYpO97jvRYphcxPEvMegSLSIczHuNH8A74oaZKwSthjOjFNfFPjxVcP ssE0U/cTmpjNisJMZnNFgbJUVhEgtGaPvXhfzH+0OEC3lHVlSST6KMTo108Bi+7cVFeF uL/JQF0aUoXH46zs3a0us1EBw14lwfW7ydETiLJBL93MOOaltyvOEEZLpXMhypilXDf0 FmDRrlGsChXfkoV3I6QeJrLyyO9GXitW3W9wKH3tdd87qf0eKu+12FY7OGWd8C33oG8G ahID3uNJ0X2hZorOQ1GQfgP55ud94D/Au6BFpjLecZ1dXBmBSJ0UoWf4SXVwq20T/XBS WPVw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id c23si2869473oic.53.2020.03.20.08.24.29; Fri, 20 Mar 2020 08:24:42 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727060AbgCTPW4 (ORCPT + 99 others); Fri, 20 Mar 2020 11:22:56 -0400 Received: from outbound-smtp44.blacknight.com ([46.22.136.52]:58855 "EHLO outbound-smtp44.blacknight.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726843AbgCTPWz (ORCPT ); Fri, 20 Mar 2020 11:22:55 -0400 Received: from mail.blacknight.com (pemlinmail01.blacknight.ie [81.17.254.10]) by outbound-smtp44.blacknight.com (Postfix) with ESMTPS id AE32BF804D for ; Fri, 20 Mar 2020 15:22:53 +0000 (GMT) Received: (qmail 16808 invoked from network); 20 Mar 2020 15:22:53 -0000 Received: from unknown (HELO techsingularity.net) (mgorman@techsingularity.net@[84.203.18.57]) by 81.17.254.9 with ESMTPSA (AES256-SHA encrypted, authenticated); 20 Mar 2020 15:22:53 -0000 Date: Fri, 20 Mar 2020 15:22:51 +0000 From: Mel Gorman To: Jirka Hladky Cc: Phil Auld , Peter Zijlstra , Ingo Molnar , Vincent Guittot , Juri Lelli , Dietmar Eggemann , Steven Rostedt , Ben Segall , Valentin Schneider , Hillf Danton , LKML Subject: Re: [PATCH 00/13] Reconcile NUMA balancing decisions with the load balancer v6 Message-ID: <20200320152251.GC3818@techsingularity.net> References: <20200224095223.13361-1-mgorman@techsingularity.net> <20200309191233.GG10065@pauld.bos.csb> <20200309203625.GU3818@techsingularity.net> <20200312095432.GW3818@techsingularity.net> <20200312155640.GX3818@techsingularity.net> <20200312214736.GA3818@techsingularity.net> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-15 Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Mar 20, 2020 at 03:37:44PM +0100, Jirka Hladky wrote: > Hi Mel, > > just a quick update. I have increased the testing coverage and other tests > from the NAS shows a big performance drop for the low number of threads as > well: > > sp_C_x - show still the biggest drop upto 50% > bt_C_x - performance drop upto 40% > ua_C_x - performance drop upto 30% > MPI or OMP and what is a low thread count? For MPI at least, I saw a 0.4% gain on an 4-node machine for bt_C and a 3.88% regression on 8-nodes. I think it must be OMP you are using because I found I had to disable UA for MPI at some point in the past for reasons I no longer remember. > My point is that the performance drop for the low number of threads is more > common than we have initially thought. > > Let me know what you need more data. > I just a clarification on the thread count and a confirmation it's OMP. For MPI, I did note that some of the other NAS kernels shows a slight dip but it was nowhere near as severe as SP and the problem was the same as more -- two or more tasks stayed on the same node without spreading out because there was no pressure to do so. There was enough CPU and memory capacity with no obvious pattern that could be used to spread the load wide early. One possibility would be to spread wide always at clone time and assume wake_affine will pull related tasks but it's fragile because it breaks if the cloned task execs and then allocates memory from a remote node only to migrate to a local node immediately. -- Mel Gorman SUSE Labs