Received: by 2002:a05:6a10:1287:0:0:0:0 with SMTP id d7csp5223760pxv; Tue, 20 Jul 2021 23:44:38 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzINhBNeMlVIr4WQ+yhgS9/XET6Egmkw0qt6dOwUR2g2RVRoV2gq8jGm0V00xLyCaLFO6Qq X-Received: by 2002:a92:1942:: with SMTP id e2mr24238427ilm.4.1626849878146; Tue, 20 Jul 2021 23:44:38 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1626849878; cv=none; d=google.com; s=arc-20160816; b=iBk5VGglFA/+yIGJZAM6Z0eqtnRTxrN76A9hjWKRsRgMM1dAo8mOqIcWIGjBUvPyii FLZBZkTbi6flMuzHARp6Wghu8p27T3iV4UVuhOEK7lymVBN7Bt0PRVJ/14OO4A9hxHoZ 2JE6AMeuUnsgyZRsKi/KpbbgE8nffhZO6m01jJvil0OMzDcDZFpSK+F2IR1NQCbLqpeZ 1dLiHZeOtlEAs0svF2VCgbUtiWsdABbsH/EGQ/YCjW24DIYW9pZoL4C5wMu6s0fNsBCI SYdixy5rD5J0nmOWKyBOkDQczeMu02RUt7VcsWePXhX0H66GOrQV0I8RGT1PiN/59BZR wowQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from; bh=1arwKahyxc7tkbBnHYjMLMg09/QKyIDmTEUfyEC1qI8=; b=ubmcWyLh2f40sbKvDLcmzZa7+F/TT3S0jpbca6JQoE/9skLYXAo/Cgq26apl0Kilnb I+3YcGGdYpm2PLei31ApVpnZX++3/fGszuKeahGSJ1QqsZRkAwt2XrGgViqQ9A2RjGPx QfDnTQI4ToSIolk+B1mhaPrLNUvJ3rgGkqW+7vdy2/3zkPbeiLz/8r1kn9nYZJ4AoiLk jgtI2LYD26pE2+exanoHH721UlPKs1iKebbWivh9r2HeRUUb+9WAEdG9CvNOppm50U1H A72YCvGtcWgT9g77S85ex40mR49HpZnoBKvuPtmthT4yAygv9XIh2B5PxibiQXv+Htsw 9IqA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id s1si26704064jat.54.2021.07.20.23.44.26; Tue, 20 Jul 2021 23:44:38 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234215AbhGUGCP (ORCPT + 99 others); Wed, 21 Jul 2021 02:02:15 -0400 Received: from mga06.intel.com ([134.134.136.31]:44371 "EHLO mga06.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234182AbhGUF7C (ORCPT ); Wed, 21 Jul 2021 01:59:02 -0400 X-IronPort-AV: E=McAfee;i="6200,9189,10051"; a="272496363" X-IronPort-AV: E=Sophos;i="5.84,257,1620716400"; d="scan'208";a="272496363" Received: from fmsmga002.fm.intel.com ([10.253.24.26]) by orsmga104.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 20 Jul 2021 23:39:39 -0700 X-IronPort-AV: E=Sophos;i="5.84,257,1620716400"; d="scan'208";a="511389983" Received: from yhuang6-desk2.sh.intel.com ([10.239.159.119]) by fmsmga002-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 20 Jul 2021 23:39:35 -0700 From: Huang Ying To: Andrew Morton Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org, Dave Hansen , "Huang, Ying" , Yang Shi , Zi Yan , Michal Hocko , Wei Xu , Oscar Salvador , David Rientjes , Dan Williams , David Hildenbrand , Greg Thelen , Keith Busch , Yang Shi Subject: [PATCH -V11 2/9] mm/migrate: update node demotion order on hotplug events Date: Wed, 21 Jul 2021 14:39:19 +0800 Message-Id: <20210721063926.3024591-2-ying.huang@intel.com> X-Mailer: git-send-email 2.30.2 In-Reply-To: <20210721063926.3024591-1-ying.huang@intel.com> References: <20210721063926.3024591-1-ying.huang@intel.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Dave Hansen Reclaim-based migration is attempting to optimize data placement in memory based on the system topology. If the system changes, so must the migration ordering. The implementation is conceptually simple and entirely unoptimized. On any memory or CPU hotplug events, assume that a node was added or removed and recalculate all migration targets. This ensures that the node_demotion[] array is always ready to be used in case the new reclaim mode is enabled. This recalculation is far from optimal, most glaringly that it does not even attempt to figure out the hotplug event would have some *actual* effect on the demotion order. But, given the expected paucity of hotplug events, this should be fine. Link: https://lkml.kernel.org/r/20210715055145.195411-3-ying.huang@intel.com Signed-off-by: Dave Hansen Signed-off-by: "Huang, Ying" Reviewed-by: Yang Shi Reviewed-by: Zi Yan Cc: Michal Hocko Cc: Wei Xu Cc: Oscar Salvador Cc: David Rientjes Cc: Dan Williams Cc: David Hildenbrand Cc: Greg Thelen Cc: Keith Busch Cc: Yang Shi Signed-off-by: Andrew Morton --- mm/migrate.c | 90 +++++++++++++++++++++++++++++++++++++++++++++++++++- 1 file changed, 89 insertions(+), 1 deletion(-) diff --git a/mm/migrate.c b/mm/migrate.c index b7a40ab47648..a40c391f9ca7 100644 --- a/mm/migrate.c +++ b/mm/migrate.c @@ -49,6 +49,7 @@ #include #include #include +#include #include @@ -3057,6 +3058,7 @@ void migrate_vma_finalize(struct migrate_vma *migrate) EXPORT_SYMBOL(migrate_vma_finalize); #endif /* CONFIG_DEVICE_PRIVATE */ +#if defined(CONFIG_MEMORY_HOTPLUG) /* Disable reclaim-based migration. */ static void __disable_all_migrate_targets(void) { @@ -3191,10 +3193,96 @@ static void __set_migration_target_nodes(void) /* * For callers that do not hold get_online_mems() already. */ -__maybe_unused // <- temporay to prevent warnings during bisects static void set_migration_target_nodes(void) { get_online_mems(); __set_migration_target_nodes(); put_online_mems(); } + +/* + * React to hotplug events that might affect the migration targets + * like events that online or offline NUMA nodes. + * + * The ordering is also currently dependent on which nodes have + * CPUs. That means we need CPU on/offline notification too. + */ +static int migration_online_cpu(unsigned int cpu) +{ + set_migration_target_nodes(); + return 0; +} + +static int migration_offline_cpu(unsigned int cpu) +{ + set_migration_target_nodes(); + return 0; +} + +/* + * This leaves migrate-on-reclaim transiently disabled between + * the MEM_GOING_OFFLINE and MEM_OFFLINE events. This runs + * whether reclaim-based migration is enabled or not, which + * ensures that the user can turn reclaim-based migration at + * any time without needing to recalculate migration targets. + * + * These callbacks already hold get_online_mems(). That is why + * __set_migration_target_nodes() can be used as opposed to + * set_migration_target_nodes(). + */ +static int __meminit migrate_on_reclaim_callback(struct notifier_block *self, + unsigned long action, void *arg) +{ + switch (action) { + case MEM_GOING_OFFLINE: + /* + * Make sure there are not transient states where + * an offline node is a migration target. This + * will leave migration disabled until the offline + * completes and the MEM_OFFLINE case below runs. + */ + disable_all_migrate_targets(); + break; + case MEM_OFFLINE: + case MEM_ONLINE: + /* + * Recalculate the target nodes once the node + * reaches its final state (online or offline). + */ + __set_migration_target_nodes(); + break; + case MEM_CANCEL_OFFLINE: + /* + * MEM_GOING_OFFLINE disabled all the migration + * targets. Reenable them. + */ + __set_migration_target_nodes(); + break; + case MEM_GOING_ONLINE: + case MEM_CANCEL_ONLINE: + break; + } + + return notifier_from_errno(0); +} + +static int __init migrate_on_reclaim_init(void) +{ + int ret; + + ret = cpuhp_setup_state(CPUHP_AP_ONLINE_DYN, "migrate on reclaim", + migration_online_cpu, + migration_offline_cpu); + /* + * In the unlikely case that this fails, the automatic + * migration targets may become suboptimal for nodes + * where N_CPU changes. With such a small impact in a + * rare case, do not bother trying to do anything special. + */ + WARN_ON(ret < 0); + + hotplug_memory_notifier(migrate_on_reclaim_callback, 100); + return 0; +} +late_initcall(migrate_on_reclaim_init); +#endif /* CONFIG_MEMORY_HOTPLUG */ -- 2.30.2