Received: by 2002:a05:7412:3784:b0:e2:908c:2ebd with SMTP id jk4csp2236118rdb; Tue, 3 Oct 2023 14:52:19 -0700 (PDT) X-Google-Smtp-Source: AGHT+IEeEK7AS824Ue7oopVCleipBnFOOz1LYJYpS/aci8VS09PwEo+N0UMzkwWOmPmy0M56AvGG X-Received: by 2002:a05:6808:c3:b0:3a7:3ce0:1ad7 with SMTP id t3-20020a05680800c300b003a73ce01ad7mr673818oic.20.1696369939611; Tue, 03 Oct 2023 14:52:19 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1696369939; cv=none; d=google.com; s=arc-20160816; b=OufQefkaR68Dz8tROkzdlAdEULSc5qCDRY3G107D6N7NCUtFyF9AtftVl99AL+yzO1 jzQOnXjeRt+M5avAPyjwZ52dKUrlyUDe5+iqMGdwndQ9hYifaU2/ewOv0TpOKICYhAYc sbQEbk2BTGW3ID2xhbfAxDYyaFlPjC1rKl1I9/+/dfzCeVdOwUyM2lRNTgDz2G1akVoM ew2gGLDdFdjUDnAbePKinA7UhlgajEX81JjGcyfu6dBTlWc2UGM43f5hTT03WbtvOLmm wSmVHCo2fHcaofMUaAIJF7QkwPDdO9uM0ODaajmS65GjWxIf8byFafBcvP/EZApW3PD+ S1mQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=nnmS8DXkjdVhg68qX7c14kBClGtQYzgjiQqqX6QhaL4=; fh=zjLtq/u5fU5VXWM97UkgEi15by5qN2LM2sl6/P5lwKg=; b=Tx6kfqWM+NkoBHDL5Sa9rR5/zVeZkEv5QWkirfDs/axMiuq9OH1c55gp9/T82UM+QL krX79nCKsZDjfhQXw+IFUCOsJT8EcO76+vXmTmqg8nbYkQLrVk3V7sarwz5IX23Dc3Ik AMggUOW2/zUfdDjSFcCEqIClqxiqIVrAnvn73zddVSSZAgbFWboPR9WEpQ76HAMkI7Sz /huTU6hlXrEm/9zl6yXGXY6qc8/hM/QsHuCLbNwTL5lCfgfVAzd2JvAhT6IzCyb9+/1a PGncb37TfVKWjlOooHGA+IpixiPrG/dezfDF/nYV/P+lQqA9v6R/YDdbIYBMrgIxQheI ubMg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@infradead.org header.s=casper.20170209 header.b=HeB+7NoX; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:3 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from lipwig.vger.email (lipwig.vger.email. [2620:137:e000::3:3]) by mx.google.com with ESMTPS id g5-20020a17090a828500b00274c8c95bdesi122257pjn.46.2023.10.03.14.52.18 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 03 Oct 2023 14:52:19 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:3 as permitted sender) client-ip=2620:137:e000::3:3; Authentication-Results: mx.google.com; dkim=pass header.i=@infradead.org header.s=casper.20170209 header.b=HeB+7NoX; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:3 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by lipwig.vger.email (Postfix) with ESMTP id 0F78481AC6B8; Tue, 3 Oct 2023 14:52:16 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at lipwig.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S241267AbjJCVwJ (ORCPT + 99 others); Tue, 3 Oct 2023 17:52:09 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:33004 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232049AbjJCVwI (ORCPT ); Tue, 3 Oct 2023 17:52:08 -0400 Received: from casper.infradead.org (casper.infradead.org [IPv6:2001:8b0:10b:1236::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 11455A7 for ; Tue, 3 Oct 2023 14:52:06 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=infradead.org; s=casper.20170209; h=In-Reply-To:Content-Type:MIME-Version: References:Message-ID:Subject:Cc:To:From:Date:Sender:Reply-To: Content-Transfer-Encoding:Content-ID:Content-Description; bh=nnmS8DXkjdVhg68qX7c14kBClGtQYzgjiQqqX6QhaL4=; b=HeB+7NoXrJ4t/oTZGb8b7OFEFi V3A3z9zevjwclhDluaVDBm0i3fGqXet/tTX/csic4k0xIgGo02PMH0W8BT9zAeG22T1P2+eQbjhGx 0iEVSOJzfAXwv99OLmbX1hXuhw/4IgRdGqaApb1e7dwDumDjJpiDNRBn6HV14VCvDBdY0gc0pqv7Z KiQTRvCHJVWfz/M2Jw3s4jw+l0DpnjCGPAof3RUvoP84ag3Db04OdWLVquDRMjnjtKKAaFLwH9J7D C/UgnaS1zBMMR9sVhdzIpCNQLEKWnINUsRh7KN+m0Rw/uXH7TIBeVOiEc/NIDIEmLQv/m/0937SYL syiGWLkA==; Received: from j130084.upc-j.chello.nl ([24.132.130.84] helo=noisy.programming.kicks-ass.net) by casper.infradead.org with esmtpsa (Exim 4.94.2 #2 (Red Hat Linux)) id 1qnnJA-00HLnP-2L; Tue, 03 Oct 2023 21:52:00 +0000 Received: by noisy.programming.kicks-ass.net (Postfix, from userid 1000) id B88BB30036C; Tue, 3 Oct 2023 23:51:59 +0200 (CEST) Date: Tue, 3 Oct 2023 23:51:59 +0200 From: Peter Zijlstra To: Julia Lawall Cc: Ingo Molnar , Vincent Guittot , Dietmar Eggemann , Mel Gorman , linux-kernel@vger.kernel.org Subject: Re: EEVDF and NUMA balancing Message-ID: <20231003215159.GJ1539@noisy.programming.kicks-ass.net> References: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-0.9 required=5.0 tests=DKIM_SIGNED,DKIM_VALID, DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI, SPF_HELO_NONE,SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lipwig.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (lipwig.vger.email [0.0.0.0]); Tue, 03 Oct 2023 14:52:16 -0700 (PDT) On Tue, Oct 03, 2023 at 10:25:08PM +0200, Julia Lawall wrote: > Is it expected that the commit e8f331bcc270 should have an impact on the > frequency of NUMA balancing? Definitely not expected. The only effect of that commit was supposed to be the runqueue order of tasks. I'll go stare at it in the morning -- definitely too late for critical thinking atm. Thanks! > The NAS benchmark ua.C.x (NPB3.4-OMP, > https://github.com/mbdevpl/nas-parallel-benchmarks.git) on a 4-socket > Intel Xeon 6130 suffers from some NUMA moves that leave some sockets with > too few threads and other sockets with too many threads. Prior to the > commit e8f331bcc270, this was corrected by subsequent load balancing, > leading to run times of 20-40 seconds (around 20 seconds can be achieved > if one just turns NUMA balancing off). After commit e8f331bcc270, the > running time can go up to 150 seconds. In the worst case, I have seen a > core remain idle for 75 seconds. It seems that the load balancer at the > NUMA domain level is not able to do anything, because when a core on the > overloaded socket has multiple threads, they are tasks that were NUMA > balanced to the socket, and thus should not leave. So the "busiest" core > chosen by find_busiest_queue doesn't actually contain any stealable > threads. Maybe it could be worth stealing from a core that has only one > task in this case, in hopes that the tasks that are tied to a socket will > spread out better across it if more space is available? > > An example run is attached. The cores are renumbered according to the > sockets, so there is an overload on socket 1 and an underload on sockets > 2. > > julia