Received: by 2002:a25:23cc:0:0:0:0:0 with SMTP id j195csp874280ybj; Thu, 7 May 2020 09:32:11 -0700 (PDT) X-Google-Smtp-Source: APiQypJqUBbNP2R07RZma35JX2BorbOABv/D23VGb/+heFqg6XQVKL1zs5+wHDKskCjA/MzuO5t/ X-Received: by 2002:aa7:c7cb:: with SMTP id o11mr13102563eds.380.1588869131745; Thu, 07 May 2020 09:32:11 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1588869131; cv=none; d=google.com; s=arc-20160816; b=fM1sCWs+s/k2XV+3xqHtjB4tYVltCm/XrRqO+YMlJBlahnIET96cP8R8usyFa8MZDc 7t32/fwBvkAm4juz9RjX26decU2e6cQL2bGBzo3wr4OwXPTz2/Fnq5Wje+viwUFIDJgL v6t0lN0OwmE6gQxFaT4Y57BksJYtVXCnr208VXLNk/9/6W3Sp/zxc/u8ZQ5ObTLkliMk VpQGTHqHKQysCki8cBzDi9oN6WfFAjaefwrCzR7/r6xW7sw+voLxZMs8zkEZGO7u5e1i Ede677oT58zakyn7A6ONJ3DMI4oJ22n2CVbhce8UoU21siEvYQDug6wh4cJGeTA5OO8x JIBQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=e7iMCcJ0qsnQwLB3wwMjiGDP/YW9nSq1UVvNlnZHm78=; b=qw1hVzzCGtgDOSGz5PS+sYTr9t4dP6+uUawBuZGlSDHTBrPstlslH82NxOH2T8hFMk ZeRIDCCVHBjC/4nBYcoBd//sxu/FbIi5vdL2VczAi1yHDaOylarsjvD2CoR6PCg6w8L4 iMAuESf0m7XEBk1RXP0o7Dwy6lZK3EnQreYZD+C25vl7t2cwTgNTomukEh4xe9gFXVNT L0Ung63Lhe8WakJNZL40GkVy2zKO/1xtIRPqNbBJpS19q6RpN4YXg2rt/2itoEMtGX+8 ZqhXsI12LEqIgpDWqFK3jLzrgRp9/HSYzTz8Fa2PVYBHjb69nzKq8db0uicvgamc3fnZ B0sA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=WevyCnOp; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id cm14si3268564edb.546.2020.05.07.09.31.46; Thu, 07 May 2020 09:32:11 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=WevyCnOp; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727829AbgEGQaB (ORCPT + 99 others); Thu, 7 May 2020 12:30:01 -0400 Received: from us-smtp-delivery-1.mimecast.com ([207.211.31.120]:53577 "EHLO us-smtp-1.mimecast.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726308AbgEGQaA (ORCPT ); Thu, 7 May 2020 12:30:00 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1588868999; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=e7iMCcJ0qsnQwLB3wwMjiGDP/YW9nSq1UVvNlnZHm78=; b=WevyCnOpqs1K2QRjJtvprM91QoQRP2Ow6UlHmTioxFiXeXPd+88GoqFl2PoiZyguoDpaqG xlD6fn8VRT0uqBUJ8fAuFTNf4kXFlatu1NILPS3+RRNew9wmxsw1aFaOEk90MiE8b5dm+V whrdL1tn2JNEPF6t1I8QUw5wq2SJHd0= Received: from mail-ed1-f69.google.com (mail-ed1-f69.google.com [209.85.208.69]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-255-_D2I7j4LOeuuVKFW714jXA-1; Thu, 07 May 2020 12:29:57 -0400 X-MC-Unique: _D2I7j4LOeuuVKFW714jXA-1 Received: by mail-ed1-f69.google.com with SMTP id x14so2624863edv.2 for ; Thu, 07 May 2020 09:29:57 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=e7iMCcJ0qsnQwLB3wwMjiGDP/YW9nSq1UVvNlnZHm78=; b=mszSMVu6e4K2mSLcw7I86zT3rNZLAs0gEnaVoDq7/0AY1sQvAmdV/PYSPcs05ugfzD 561qexuCG3KxVJN1ybWRO6C1GbRXtWp+C5ciyzSWHztTrOsebHnLw7EXxZg9gcH55eqs PIjHj0EXJk1Zq5v+ABboi8C1jMCvmtWiV5n3POfSiespE2nvAmIfBh7B2YoFbYV1e1Cq fTSfo26LIC0/dYhg4JrRyCwHjNh0ItHjzEhawE2gRZpWzRXeXFSrBY1HW3HVB/NuSEAu YvbghwmxVoxQtAU59b6BPCEgdcLmUM+UmiFPGdcXuhMSlC5SndBkPVb29/iV2CaPP9Vy tYSw== X-Gm-Message-State: AGi0PuZVLyNCTbBx+K+j1dJ2lWhcPzTn28njDtAZOkwuOqH9tTcQYU3M kRPL2rg0Co9rSiah31bEA7QIU+x8YPfXfy7GmOwdAwIXXw4Rzgcxob2HjcYUgkrDL3OPvMoR4l7 /b9ZL0mGlVlBBJxhuD6EQZDHIc+J/O2jkWKMvMT12 X-Received: by 2002:a17:907:2069:: with SMTP id qp9mr13369331ejb.137.1588868996404; Thu, 07 May 2020 09:29:56 -0700 (PDT) X-Received: by 2002:a17:907:2069:: with SMTP id qp9mr13369301ejb.137.1588868996149; Thu, 07 May 2020 09:29:56 -0700 (PDT) MIME-Version: 1.0 References: <20200312155640.GX3818@techsingularity.net> <20200312214736.GA3818@techsingularity.net> <20200320152251.GC3818@techsingularity.net> <20200320163843.GD3818@techsingularity.net> <20200507155422.GD3758@techsingularity.net> In-Reply-To: <20200507155422.GD3758@techsingularity.net> From: Jirka Hladky Date: Thu, 7 May 2020 18:29:44 +0200 Message-ID: Subject: Re: [PATCH 00/13] Reconcile NUMA balancing decisions with the load balancer v6 To: Mel Gorman Cc: Phil Auld , Peter Zijlstra , Ingo Molnar , Vincent Guittot , Juri Lelli , Dietmar Eggemann , Steven Rostedt , Ben Segall , Valentin Schneider , Hillf Danton , LKML , Douglas Shakshober , Waiman Long , Joe Mario , Bill Gray Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Mel, we are not targeting just OMP applications. We see the performance degradation also for other workloads, like SPECjbb2005 and SPECjvm2008. Even worse, it also affects a higher number of threads. For example, comparing 5.7.0-0.rc2 against 5.6 kernel, on 4 NUMA server with 2x AMD 7351 CPU, we see performance degradation 22% for 32 threads (the system has 64 CPUs in total). We observe this degradation only when we run a single SPECjbb binary. When running 4 SPECjbb binaries in parallel, there is no change in performance between 5.6 and 5.7. That's why we are asking for the kernel tunable, which we would add to the tuned profile. We don't expect users to change this frequently but rather to set the performance profile once based on the purpose of the server. If you could prepare a patch for us, we would be more than happy to test it extensively. Based on the results, we can then evaluate if it's the way to go. Thoughts? Thanks a lot! Jirka On Thu, May 7, 2020 at 5:54 PM Mel Gorman wrote: > > On Thu, May 07, 2020 at 05:24:17PM +0200, Jirka Hladky wrote: > > Hi Mel, > > > > > > Yes, it's indeed OMP. With low threads count, I mean up to 2x number of > > > > NUMA nodes (8 threads on 4 NUMA node servers, 16 threads on 8 NUMA node > > > > servers). > > > > > > Ok, so we know it's within the imbalance threshold where a NUMA node can > > > be left idle. > > > > we have discussed today with my colleagues the performance drop for > > some workloads for low threads counts (roughly up to 2x number of NUMA > > nodes). We are worried that it can be a severe issue for some use > > cases, which require a full memory bandwidth even when only part of > > CPUs is used. > > > > We understand that scheduler cannot distinguish this type of workload > > from others automatically. However, there was an idea for a * new > > kernel tunable to control the imbalance threshold *. Based on the > > purpose of the server, users could set this tunable. See the tuned > > project, which allows creating performance profiles [1]. > > > > I'm not completely opposed to it but given that the setting is global, > I imagine it could have other consequences if two applications ran > at different times have different requirements. Given that it's OMP, > I would have imagined that an application that really cared about this > would specify what was needed using OMP_PLACES. Why would someone prefer > kernel tuning or a tuned profile over OMP_PLACES? After all, it requires > specific knowledge of the application even to know that a particular > tuned profile is needed. > > -- > Mel Gorman > SUSE Labs > -- -Jirka