From: Huang Ying
To: linux-kernel@vger.kernel.org
Cc: Huang Ying, Andrew Morton, Michal Hocko, Rik van Riel, Mel Gorman,
 Peter Zijlstra, Dave Hansen, Yang Shi, Zi Yan, Wei Xu, osalvador,
 Shakeel Butt, linux-mm@kvack.org
Subject: [PATCH -V9 1/6] NUMA Balancing: add page promotion counter
Date: Fri, 8 Oct 2021 16:39:33 +0800
Message-Id: <20211008083938.1702663-2-ying.huang@intel.com>
In-Reply-To: <20211008083938.1702663-1-ying.huang@intel.com>
References: <20211008083938.1702663-1-ying.huang@intel.com>
X-Mailing-List: linux-kernel@vger.kernel.org

In a system with multiple memory types, e.g. DRAM and PMEM, the CPU
and DRAM in one socket will be put in one NUMA node as before, while
the PMEM will be put in another NUMA node, as described in commit
c221c0b0308f ("device-dax: "Hotplug" persistent memory for use like
normal RAM"). So, the NUMA balancing mechanism will identify all PMEM
accesses as remote accesses and try to promote the PMEM pages to DRAM.

To distinguish the number of inter-type promoted pages from that of
inter-socket migrated pages, a new vmstat counter is added. The
counter is per-node (counted in the target node), so it can be used to
identify promotion imbalance among the NUMA nodes.

Signed-off-by: "Huang, Ying"
Cc: Andrew Morton
Cc: Michal Hocko
Cc: Rik van Riel
Cc: Mel Gorman
Cc: Peter Zijlstra
Cc: Dave Hansen
Cc: Yang Shi
Cc: Zi Yan
Cc: Wei Xu
Cc: osalvador
Cc: Shakeel Butt
Cc: linux-kernel@vger.kernel.org
Cc: linux-mm@kvack.org
---
 include/linux/mmzone.h |  3 +++
 include/linux/node.h   |  5 +++++
 include/linux/vmstat.h |  2 ++
 mm/migrate.c           | 10 ++++++++--
 mm/vmstat.c            |  3 +++
 5 files changed, 21 insertions(+), 2 deletions(-)

diff --git a/include/linux/mmzone.h b/include/linux/mmzone.h
index 6a1d79d84675..37ccd6158765 100644
--- a/include/linux/mmzone.h
+++ b/include/linux/mmzone.h
@@ -209,6 +209,9 @@ enum node_stat_item {
 	NR_PAGETABLE,		/* used for pagetables */
 #ifdef CONFIG_SWAP
 	NR_SWAPCACHE,
+#endif
+#ifdef CONFIG_NUMA_BALANCING
+	PGPROMOTE_SUCCESS,	/* promote successfully */
 #endif
 	NR_VM_NODE_STAT_ITEMS
 };
diff --git a/include/linux/node.h b/include/linux/node.h
index 8e5a29897936..26e96fcc66af 100644
--- a/include/linux/node.h
+++ b/include/linux/node.h
@@ -181,4 +181,9 @@ static inline void register_hugetlbfs_with_node(node_registration_func_t reg,
 
 #define to_node(device) container_of(device, struct node, dev)
 
+static inline bool node_is_toptier(int node)
+{
+	return node_state(node, N_CPU);
+}
+
 #endif /* _LINUX_NODE_H_ */
diff --git a/include/linux/vmstat.h b/include/linux/vmstat.h
index d6a6cf53b127..75c53b7d1539 100644
--- a/include/linux/vmstat.h
+++ b/include/linux/vmstat.h
@@ -112,9 +112,11 @@ static inline void vm_events_fold_cpu(int cpu)
 #ifdef CONFIG_NUMA_BALANCING
 #define count_vm_numa_event(x) count_vm_event(x)
 #define count_vm_numa_events(x, y) count_vm_events(x, y)
+#define mod_node_balancing_page_state(n, i, v) mod_node_page_state(n, i, v)
 #else
 #define count_vm_numa_event(x) do {} while (0)
 #define count_vm_numa_events(x, y) do { (void)(y); } while (0)
+#define mod_node_balancing_page_state(n, i, v) do {} while (0)
 #endif /* CONFIG_NUMA_BALANCING */
 
 #ifdef CONFIG_DEBUG_TLBFLUSH
diff --git a/mm/migrate.c b/mm/migrate.c
index a6a7743ee98f..c3affc587902 100644
--- a/mm/migrate.c
+++ b/mm/migrate.c
@@ -2148,6 +2148,7 @@ int migrate_misplaced_page(struct page *page, struct vm_area_struct *vma,
 	pg_data_t *pgdat = NODE_DATA(node);
 	int isolated;
 	int nr_remaining;
+	int nr_succeeded;
 	LIST_HEAD(migratepages);
 	new_page_t *new;
 	bool compound;
@@ -2186,7 +2187,8 @@ int migrate_misplaced_page(struct page *page, struct vm_area_struct *vma,
 
 	list_add(&page->lru, &migratepages);
 	nr_remaining = migrate_pages(&migratepages, *new, NULL, node,
-				     MIGRATE_ASYNC, MR_NUMA_MISPLACED, NULL);
+				     MIGRATE_ASYNC, MR_NUMA_MISPLACED,
+				     &nr_succeeded);
 	if (nr_remaining) {
 		if (!list_empty(&migratepages)) {
 			list_del(&page->lru);
@@ -2195,8 +2197,12 @@ int migrate_misplaced_page(struct page *page, struct vm_area_struct *vma,
 			putback_lru_page(page);
 		}
 		isolated = 0;
-	} else
+	} else {
 		count_vm_numa_events(NUMA_PAGE_MIGRATE, nr_pages);
+		if (!node_is_toptier(page_to_nid(page)) && node_is_toptier(node))
+			mod_node_balancing_page_state(
+				NODE_DATA(node), PGPROMOTE_SUCCESS, nr_succeeded);
+	}
 	BUG_ON(!list_empty(&migratepages));
 	return isolated;
 
diff --git a/mm/vmstat.c b/mm/vmstat.c
index 8ce2620344b2..fff0ec94d795 100644
--- a/mm/vmstat.c
+++ b/mm/vmstat.c
@@ -1236,6 +1236,9 @@ const char * const vmstat_text[] = {
 #ifdef CONFIG_SWAP
 	"nr_swapcached",
 #endif
+#ifdef CONFIG_NUMA_BALANCING
+	"pgpromote_success",
+#endif
 
 	/* enum writeback_stat_item counters */
 	"nr_dirty_threshold",
-- 
2.30.2
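
For anyone testing the counter from user space, here is a minimal sketch that is not part of the patch. It assumes a kernel with this patch applied and CONFIG_NUMA_BALANCING enabled, in which case a "pgpromote_success" line should show up in /proc/vmstat and, per node, in /sys/devices/system/node/nodeN/vmstat. The helper names vmstat_lookup() and vmstat_read() are hypothetical and exist only for this illustration.

```c
#include <stdio.h>
#include <string.h>

/*
 * Look up one counter in vmstat-style text ("name value" per line).
 * Returns the value, or -1 if the counter is absent.
 * Hypothetical helper, for illustration only.
 */
static long long vmstat_lookup(const char *text, const char *name)
{
	size_t len = strlen(name);
	const char *line = text;

	while (line && *line) {
		long long value;

		/* Require the full name followed by a space, so a longer
		 * counter name sharing the same prefix does not match. */
		if (strncmp(line, name, len) == 0 && line[len] == ' ' &&
		    sscanf(line + len, "%lld", &value) == 1)
			return value;

		/* Advance to the start of the next line, if any. */
		line = strchr(line, '\n');
		if (line)
			line++;
	}
	return -1;
}

/*
 * Read a vmstat file and look up one counter.
 * Returns -1 if the file cannot be opened or the counter is missing.
 * Assumes the file fits in the buffer; anything beyond it is ignored.
 */
static long long vmstat_read(const char *path, const char *name)
{
	char buf[16384];
	size_t n;
	FILE *f = fopen(path, "r");

	if (!f)
		return -1;
	n = fread(buf, 1, sizeof(buf) - 1, f);
	fclose(f);
	buf[n] = '\0';
	return vmstat_lookup(buf, name);
}
```

Calling vmstat_read("/sys/devices/system/node/node0/vmstat", "pgpromote_success") for each DRAM node and comparing the results is one way to look for the promotion imbalance mentioned in the changelog. A return value of -1 most likely means the kernel was built without the patch or without CONFIG_NUMA_BALANCING.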