Received: by 2002:a05:6a10:8c0a:0:0:0:0 with SMTP id go10csp3701317pxb; Tue, 26 Jan 2021 02:26:32 -0800 (PST) X-Google-Smtp-Source: ABdhPJx+zxJGA47sNPV6G6qwqnehABfxlZbPtWPcpNwlu/092AB1bKvawRD0/CucpQEVTN4y0iAd X-Received: by 2002:a05:6402:55:: with SMTP id f21mr4141095edu.38.1611656792331; Tue, 26 Jan 2021 02:26:32 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1611656792; cv=none; d=google.com; s=arc-20160816; b=raeNDahQ06RKZZbKIP6qUj4yZx3DU5ZjTwdDU0avrNNfV/vnI1on5F1bkWK58U45PN pOkUgMVDi60bEuQoGaGMkSOBZRVWsVGOjvEehkGReRP0eD4DGh9C5ZheLMTOgxrJqYa6 R6Tq3Ice1nWnSNlmQA7YpjNWrF2QiJRW0BLb859VlYwu3/BOry4Kbc5y+QiXiWv6ExAO uvnFSqLn+NDPMENQ1KC46I6YaNXWUPaF55D8fU9Z3tCzDI0HSib3AhEwiZY9aOo+bJEG UA0/X/iaqSGFFQAFzOomhBdCAV5y+RdN/oi2vLkH+CEBCalD6b7RMfrn7q1RDwXbUAx9 TBpw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:message-id:in-reply-to:references:date:from:cc :to:subject:ironport-sdr:ironport-sdr; bh=40e5/eztqJ103ZNWSm/TQIjAVNBbaIwzMSVfxiwq/DA=; b=pkJf226ayR+d0t3z+13kV+bAhS2QdkuN4EbvodhYO5td87xhdpd2SphmMuMvGj3dhg XrTnJF2ZKmknctki7E7sc8/7PvHCr/2LGmdAWnBI4RP0Vz2cH+xK9gkDlUSiD7Tz2Rjs MlBK5gm/tbff93ZAeq+PQe+xR7WFmzeNh8W0TdpwaqCFne6SRlQXNsbW0wPp0bRcIDWi KjZAKqp1C+lW6ZJaoC2QGVBuZT8ZDXoszMcon0E/H8GC788b65aTxhfbm6V5UmlfJCOA ZYeOQs2nAmsL7hKbWWthCCtRi33FoQkkfmhAuWM4kZNzDfh44ZaoNNlU5nqbNs9xpp/l luiA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id ho43si6824931ejc.421.2021.01.26.02.26.06; Tue, 26 Jan 2021 02:26:32 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2391953AbhAZKX4 (ORCPT + 99 others); Tue, 26 Jan 2021 05:23:56 -0500 Received: from mga07.intel.com ([134.134.136.100]:20190 "EHLO mga07.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1731186AbhAZB2B (ORCPT ); Mon, 25 Jan 2021 20:28:01 -0500 IronPort-SDR: weWuBJNkQov4X/6TDjx6IjPKQOsZKjtSXWOGnfZT+z6lhuqarkewQ2NzjHP8aB5xJLR2SjUFc2 IcXeedi3cOXQ== X-IronPort-AV: E=McAfee;i="6000,8403,9875"; a="243908458" X-IronPort-AV: E=Sophos;i="5.79,375,1602572400"; d="scan'208";a="243908458" Received: from fmsmga001.fm.intel.com ([10.253.24.23]) by orsmga105.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 25 Jan 2021 16:41:39 -0800 IronPort-SDR: hOJfIs/C4PU8o2XSx/YV0eVbYfsSWcLbCKOxKOfY8Xe4I5nKGX89Oj08ePyqrk4jaGgPj3LsCi GfLcZQl2rr9w== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.79,375,1602572400"; d="scan'208";a="472556473" Received: from viggo.jf.intel.com (HELO localhost.localdomain) ([10.54.77.144]) by fmsmga001.fm.intel.com with ESMTP; 25 Jan 2021 16:41:39 -0800 Subject: [RFC][PATCH 04/13] mm/numa: node demotion data structure and lookup To: linux-kernel@vger.kernel.org Cc: linux-mm@kvack.org, Dave Hansen , yang.shi@linux.alibaba.com, rientjes@google.com, ying.huang@intel.com, dan.j.williams@intel.com, david@redhat.com, osalvador@suse.de From: Dave Hansen Date: Mon, 25 Jan 2021 16:34:19 -0800 References: <20210126003411.2AC51464@viggo.jf.intel.com> In-Reply-To: <20210126003411.2AC51464@viggo.jf.intel.com> Message-Id: <20210126003419.43281680@viggo.jf.intel.com> Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Dave Hansen Prepare for the kernel to auto-migrate pages to other memory nodes with a user defined node migration table. This allows creating single migration target for each NUMA node to enable the kernel to do NUMA page migrations instead of simply reclaiming colder pages. A node with no target is a "terminal node", so reclaim acts normally there. The migration target does not fundamentally _need_ to be a single node, but this implementation starts there to limit complexity. If you consider the migration path as a graph, cycles (loops) in the graph are disallowed. This avoids wasting resources by constantly migrating (A->B, B->A, A->B ...). The expectation is that cycles will never be allowed. Signed-off-by: Dave Hansen Cc: Yang Shi Cc: David Rientjes Cc: Huang Ying Cc: Dan Williams Cc: David Hildenbrand Cc: osalvador -- changes in July 2020: - Remove loop from next_demotion_node() and get_online_mems(). This means that the node returned by next_demotion_node() might now be offline, but the worst case is that the allocation fails. That's fine since it is transient. --- b/mm/migrate.c | 16 ++++++++++++++++ 1 file changed, 16 insertions(+) diff -puN mm/migrate.c~0006-node-Define-and-export-memory-migration-path mm/migrate.c --- a/mm/migrate.c~0006-node-Define-and-export-memory-migration-path 2021-01-25 16:23:09.553866709 -0800 +++ b/mm/migrate.c 2021-01-25 16:23:09.558866709 -0800 @@ -1161,6 +1161,22 @@ out: return rc; } +static int node_demotion[MAX_NUMNODES] = {[0 ... MAX_NUMNODES - 1] = NUMA_NO_NODE}; + +/** + * next_demotion_node() - Get the next node in the demotion path + * @node: The starting node to lookup the next node + * + * @returns: node id for next memory node in the demotion path hierarchy + * from @node; NUMA_NO_NODE if @node is terminal. This does not keep + * @node online or guarantee that it *continues* to be the next demotion + * target. + */ +int next_demotion_node(int node) +{ + return node_demotion[node]; +} + /* * Obtain the lock on page, remove all ptes and migrate the page * to the newly allocated page in newpage. _