Received: by 2002:a05:7412:31a9:b0:e2:908c:2ebd with SMTP id et41csp2820030rdb; Tue, 12 Sep 2023 13:08:24 -0700 (PDT) X-Google-Smtp-Source: AGHT+IFrbquZMptLio+rT73uw0sYSU5x3HqVsL9ETIyccggBS143D+p2J+BQ7DShGjh/73ilPYTX X-Received: by 2002:a17:903:18c:b0:1b8:6cae:3570 with SMTP id z12-20020a170903018c00b001b86cae3570mr966968plg.11.1694549303790; Tue, 12 Sep 2023 13:08:23 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1694549303; cv=none; d=google.com; s=arc-20160816; b=FlQBah8hoPYvrkrq9hDTBby5ajHuB4pCymDEcNz1Jpg0lvja+aeHO6HVzrIX8HMaSi 6/lNROM6GAVDYffcwBt0wgBWnp7Kb+p0A2Offx0WFFTlIQsKildeXoEhyRcSTjzz39Xb t1xp8ZE97N3mZQUriaIoi5sQP6oPhgjBqiQ9dnp+AUPbY4l/bLikAGN/XmIIlOcg2yqC dGt9DuamCXTfZrCfsZhnGjlYKjiTVIWnMfQXMFzqFHwmK7Tfz66WOev6TJV+mzxYycnV sm6sJBqba8zQBfwyUrwyvGSNDpFdLAAx95hUtRyBaSC9TAD96AoJxDaauNi+cO0lza6a 7OZw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=+PbL7/nohR/Hbyn15dxzzqtwDxoevO/q/Oeu6EGIrkQ=; fh=hBumMKgdp07rd33RvRRiOkGat7EEsmGW7/N2mfIMmX8=; b=ACgdYriWhcBpJHrUN3+aVqvZLMLrkE2dl2r3mG6tder1p64JyuMK+XUs/qtjUh7lGB LZgIT0RoKlkHClwd3ungyVM2LiwwaCvxN9uWBkLoh+/3Bmkf+MeCUaWCz3XZrOZoOKv4 jJJSnHM9lMUnXhLiR2vfaXL+a/d8/sWgMlI5APaUIAtwnY4GBdtDg9CVIh6RnbH/B2Qs 5nMVwD2qT4PS/mSslVg3gXCasizmQaYqYELW06yy1zFfibtFRd0v2eevF9PapE0b6ux1 JXqjkwa68IQrJ6YrsQehVAYufqVRLcUi3ZPl4rygwqK0ljdbsUdZZrpl/z7TpOJYrVI4 QuAg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=fuJLYqQ4; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:2 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from agentk.vger.email (agentk.vger.email. [2620:137:e000::3:2]) by mx.google.com with ESMTPS id q4-20020a17090311c400b001befe0ac50csi8953267plh.354.2023.09.12.13.08.23 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 12 Sep 2023 13:08:23 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:2 as permitted sender) client-ip=2620:137:e000::3:2; Authentication-Results: mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=fuJLYqQ4; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:2 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by agentk.vger.email (Postfix) with ESMTP id E4A178119063; Tue, 12 Sep 2023 09:03:06 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at agentk.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235340AbjILQC7 (ORCPT + 99 others); Tue, 12 Sep 2023 12:02:59 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:60722 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234679AbjILQC6 (ORCPT ); Tue, 12 Sep 2023 12:02:58 -0400 Received: from mgamail.intel.com (mgamail.intel.com [134.134.136.126]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id D5F8B10F1; Tue, 12 Sep 2023 09:02:54 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1694534574; x=1726070574; h=date:from:to:cc:subject:message-id:references: mime-version:in-reply-to; bh=U+a6QizW/Aulka1WQ2QT9P6hpcvGBSRq2JsjI2qWUjA=; b=fuJLYqQ4CeKUIhED8SGfE6McKliYiUm22MpYXEh/XS8yZSDRHTdd239s YjmxVKAlWsv55PxPhZzDm1oPXubzJ23eXtLMuB9g7WP7utH3DEZH+DM3c d9TfF4DilNAFWg4NYE3hBWCBrxUP7xZ9EpEMPp9QFuubFzjyh6yHy8Upu DlQyQhzOa9b2/bI6gW1V3frcswT8/0icfKfiAA6FVimNhRsec0OvllhBL Yy2pAE3Enkt9m/C3ONM7Ib9e5lLq5/Hab6IqBgjoiviqXWZkdlzxKDrS1 TJ4c7nLMXnV/5zg6/uPcNxxHdT45Go0HdpeMgYoU27eC4yPYMrdWHjz1A A==; X-IronPort-AV: E=McAfee;i="6600,9927,10831"; a="363450044" X-IronPort-AV: E=Sophos;i="6.02,139,1688454000"; d="scan'208";a="363450044" Received: from fmsmga005.fm.intel.com ([10.253.24.32]) by orsmga106.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 12 Sep 2023 09:01:29 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=McAfee;i="6600,9927,10831"; a="1074597625" X-IronPort-AV: E=Sophos;i="6.02,139,1688454000"; d="scan'208";a="1074597625" Received: from agluck-desk3.sc.intel.com (HELO agluck-desk3) ([172.25.222.74]) by fmsmga005-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 12 Sep 2023 09:01:18 -0700 Date: Tue, 12 Sep 2023 09:01:17 -0700 From: Tony Luck To: Reinette Chatre Cc: Fenghua Yu , Peter Newman , Jonathan Corbet , Shuah Khan , x86@kernel.org, Shaopeng Tan , James Morse , Jamie Iles , Babu Moger , Randy Dunlap , linux-kernel@vger.kernel.org, linux-doc@vger.kernel.org, patches@lists.linux.dev Subject: Re: [PATCH v5 0/8] Add support for Sub-NUMA cluster (SNC) systems Message-ID: References: <20230722190740.326190-1-tony.luck@intel.com> <20230829234426.64421-1-tony.luck@intel.com> <45e81506-3c43-95ba-56a4-38e1bb2e42dc@intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <45e81506-3c43-95ba-56a4-38e1bb2e42dc@intel.com> Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (agentk.vger.email [0.0.0.0]); Tue, 12 Sep 2023 09:03:07 -0700 (PDT) X-Spam-Status: No, score=-0.9 required=5.0 tests=DKIMWL_WL_HIGH,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on agentk.vger.email On Mon, Sep 11, 2023 at 01:23:35PM -0700, Reinette Chatre wrote: > Hi Tony, > > On 8/29/2023 4:44 PM, Tony Luck wrote: > > The Sub-NUMA cluster feature on some Intel processors partitions > > the CPUs that share an L3 cache into two or more sets. This plays > > havoc with the Resource Director Technology (RDT) monitoring features. > > Prior to this patch Intel has advised that SNC and RDT are incompatible. > > > > Some of these CPU support an MSR that can partition the RMID > > counters in the same way. This allows for monitoring features > > to be used (with the caveat that memory accesses between different > > SNC NUMA nodes may still not be counted accuratlely. > > Same typo as in V4. Sorry. Will fix and re-post. > > > > Note that this patch series improves resctrl reporting considerably > > on systems with SNC enabled, but there will still be some anomalies > > for processes accessing memory from other sub-NUMA nodes. > > I have the same question as with V4 that was not answered in that email > thread nor in this new version. > https://lore.kernel.org/lkml/e350514e-76ed-14ea-3e74-c0852658182f@intel.com/ Non-SNC systems already have an issue when reporting memory bandwidth for a task that Linux may migrate the task to a CPU on a different node which means that logging for that task will also move to different files in the mon_data/mon_L3_*/ for the new node. With SNC enabled, migration between NUMA nodes on the same socket may happen much more frequently because: 1) The CPUs on the other NUMA nodes in the socket are in the same Linux L3 cache domain. So Linux regard the migration as "cheap". 2) The ACPI SLIT table on SNC enabled systems may also report the latency for remote access to another NUMA node on the same socket as significantly lower than the latency for cross-socket access. On my test system the SLIT distance for same socket nodes is 0xC, compared to 0x15 for cross-socket distance. This will also lead to Linux being more likely to migrate a task to a CPU on another SNC NUMA node in the same socket. To avoid migration issues, users may use sched_setaffinity(2) to bind tasks to the subset of CPUs that share an SNC NUMA node. I can write this up in a new cover letter. > I stop my review of this series here. Reinette Should I repost the whole series as v6 with the new cover letter. The only change to the patches so far is to the selftest reported by Shaopeng Tan[1]. -Tony [1] https://lore.kernel.org/all/TYAPR01MB633033C489AAC0E514CBC6688BEEA@TYAPR01MB6330.jpnprd01.prod.outlook.com/