Hi all, hugetlb init parallelization has now been updated to v3.
This series is tested on next-20240102 and cannot be applied to v6.7-rc8.
Update Summary:
- Select CONFIG_PADATA as we use padata_do_multithreaded
- Fix a race condition in h->next_nid_to_alloc
- Fix local variable initialization issues
- Remove RFC tag
Thanks to David Rientjes' testing, we now know that this series reduces
hugetlb 1G initialization time from 77s to 18.3s on a 12T machine[4].
# Introduction
Hugetlb initialization during boot takes up a considerable amount of time.
For instance, on a 2TB system, initializing 1,800 1GB huge pages takes 1-2
seconds out of 10 seconds. Initializing 11,776 1GB pages on a 12TB Intel
host takes more than 1 minute[1]. This is a noteworthy figure.
Inspired by [2] and [3], hugetlb initialization can also be accelerated
through parallelization. The kernel already has infrastructure such as
padata_do_multithreaded; this series uses it to achieve effective results
with minimal modifications.
[1] https://lore.kernel.org/all/[email protected]/
[2] https://lore.kernel.org/all/[email protected]/
[3] https://lore.kernel.org/all/[email protected]/
[4] https://lore.kernel.org/all/[email protected]/
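For reference, every user in this series follows the same pattern: describe
the work as a struct padata_mt_job and hand it to padata_do_multithreaded().
A minimal sketch (the worker function and sizing below are made up for
illustration; the fields are the existing ones in include/linux/padata.h):

```c
#include <linux/padata.h>
#include <linux/nodemask.h>

/* illustrative worker: initialize items in [start, end) */
static void __init demo_init_chunk(unsigned long start, unsigned long end, void *arg)
{
	/* per-chunk initialization work runs here, possibly on several CPUs */
}

static void __init demo_parallel_init(unsigned long nr_items)
{
	struct padata_mt_job job = {
		.thread_fn   = demo_init_chunk,
		.fn_arg      = NULL,
		.start       = 0,
		.size        = nr_items,
		.align       = 1,
		.min_chunk   = nr_items / num_node_state(N_MEMORY),
		.max_threads = num_node_state(N_MEMORY),
	};

	padata_do_multithreaded(&job);
}
```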
# Test result
test no patch(ms) patched(ms) saved
------------------- -------------- ------------- --------
256c2t(4 node) 1G 4745 2024 57.34%
128c1t(2 node) 1G 3358 1712 49.02%
12t 1G 77000 18300 76.23%
256c2t(4 node) 2M 3336 1051 68.52%
128c1t(2 node) 2M 1943 716 63.15%
# Change log
Changes in v3:
- Select CONFIG_PADATA as we use padata_do_multithreaded
- Fix a race condition in h->next_nid_to_alloc
- Fix local variable initialization issues
- Remove RFC tag
Changes in v2:
- https://lore.kernel.org/all/[email protected]/
- Reduce complexity with `padata_do_multithreaded`
- Support 1G hugetlb
v1:
- https://lore.kernel.org/all/[email protected]/
- parallelize 2M hugetlb initialization with workqueue
Gang Li (7):
hugetlb: code clean for hugetlb_hstate_alloc_pages
hugetlb: split hugetlb_hstate_alloc_pages
padata: dispatch works on different nodes
hugetlb: pass *next_nid_to_alloc directly to
for_each_node_mask_to_alloc
hugetlb: have CONFIG_HUGETLBFS select CONFIG_PADATA
hugetlb: parallelize 2M hugetlb allocation and initialization
hugetlb: parallelize 1G hugetlb initialization
fs/Kconfig | 1 +
include/linux/hugetlb.h | 2 +-
include/linux/padata.h | 3 +
kernel/padata.c | 8 +-
mm/hugetlb.c | 224 +++++++++++++++++++++++++++-------------
mm/mm_init.c | 1 +
6 files changed, 163 insertions(+), 76 deletions(-)
--
2.20.1
The readability of `hugetlb_hstate_alloc_pages` is poor. By cleaning the
code, its readability can be improved, facilitating future modifications.
This patch extracts two functions to reduce the complexity of
`hugetlb_hstate_alloc_pages` and has no functional changes.
- hugetlb_hstate_alloc_pages_node_specific() iterates through each
online node and performs allocation if necessary.
- hugetlb_hstate_alloc_pages_report() reports errors during allocation,
and updates h->max_huge_pages accordingly.
Signed-off-by: Gang Li <[email protected]>
---
mm/hugetlb.c | 46 +++++++++++++++++++++++++++++-----------------
1 file changed, 29 insertions(+), 17 deletions(-)
diff --git a/mm/hugetlb.c b/mm/hugetlb.c
index ed1581b670d42..2606135ec55e6 100644
--- a/mm/hugetlb.c
+++ b/mm/hugetlb.c
@@ -3482,6 +3482,33 @@ static void __init hugetlb_hstate_alloc_pages_onenode(struct hstate *h, int nid)
h->max_huge_pages_node[nid] = i;
}
+static bool __init hugetlb_hstate_alloc_pages_node_specific(struct hstate *h)
+{
+ int i;
+ bool node_specific_alloc = false;
+
+ for_each_online_node(i) {
+ if (h->max_huge_pages_node[i] > 0) {
+ hugetlb_hstate_alloc_pages_onenode(h, i);
+ node_specific_alloc = true;
+ }
+ }
+
+ return node_specific_alloc;
+}
+
+static void __init hugetlb_hstate_alloc_pages_report(unsigned long allocated, struct hstate *h)
+{
+ if (allocated < h->max_huge_pages) {
+ char buf[32];
+
+ string_get_size(huge_page_size(h), 1, STRING_UNITS_2, buf, 32);
+ pr_warn("HugeTLB: allocating %lu of page size %s failed. Only allocated %lu hugepages.\n",
+ h->max_huge_pages, buf, allocated);
+ h->max_huge_pages = allocated;
+ }
+}
+
/*
* NOTE: this routine is called in different contexts for gigantic and
* non-gigantic pages.
@@ -3499,7 +3526,6 @@ static void __init hugetlb_hstate_alloc_pages(struct hstate *h)
struct folio *folio;
LIST_HEAD(folio_list);
nodemask_t *node_alloc_noretry;
- bool node_specific_alloc = false;
/* skip gigantic hugepages allocation if hugetlb_cma enabled */
if (hstate_is_gigantic(h) && hugetlb_cma_size) {
@@ -3508,14 +3534,7 @@ static void __init hugetlb_hstate_alloc_pages(struct hstate *h)
}
/* do node specific alloc */
- for_each_online_node(i) {
- if (h->max_huge_pages_node[i] > 0) {
- hugetlb_hstate_alloc_pages_onenode(h, i);
- node_specific_alloc = true;
- }
- }
-
- if (node_specific_alloc)
+ if (hugetlb_hstate_alloc_pages_node_specific(h))
return;
/* below will do all node balanced alloc */
@@ -3558,14 +3577,7 @@ static void __init hugetlb_hstate_alloc_pages(struct hstate *h)
/* list will be empty if hstate_is_gigantic */
prep_and_add_allocated_folios(h, &folio_list);
- if (i < h->max_huge_pages) {
- char buf[32];
-
- string_get_size(huge_page_size(h), 1, STRING_UNITS_2, buf, 32);
- pr_warn("HugeTLB: allocating %lu of page size %s failed. Only allocated %lu hugepages.\n",
- h->max_huge_pages, buf, i);
- h->max_huge_pages = i;
- }
+ hugetlb_hstate_alloc_pages_report(i, h);
kfree(node_alloc_noretry);
}
--
2.20.1
1G and 2M huge pages have different allocation and initialization logic,
which leads to subtle differences in parallelization. Therefore, it is
appropriate to split hugetlb_hstate_alloc_pages into gigantic and
non-gigantic.
This patch has no functional changes.
Signed-off-by: Gang Li <[email protected]>
---
mm/hugetlb.c | 86 +++++++++++++++++++++++++++-------------------------
1 file changed, 45 insertions(+), 41 deletions(-)
diff --git a/mm/hugetlb.c b/mm/hugetlb.c
index 2606135ec55e6..92448e747991d 100644
--- a/mm/hugetlb.c
+++ b/mm/hugetlb.c
@@ -3509,6 +3509,47 @@ static void __init hugetlb_hstate_alloc_pages_report(unsigned long allocated, st
}
}
+static unsigned long __init hugetlb_hstate_alloc_pages_gigantic(struct hstate *h)
+{
+ unsigned long i;
+
+ for (i = 0; i < h->max_huge_pages; ++i) {
+ /*
+ * gigantic pages not added to list as they are not
+ * added to pools now.
+ */
+ if (!alloc_bootmem_huge_page(h, NUMA_NO_NODE))
+ break;
+ cond_resched();
+ }
+
+ return i;
+}
+
+static unsigned long __init hugetlb_hstate_alloc_pages_non_gigantic(struct hstate *h)
+{
+ unsigned long i;
+ struct folio *folio;
+ LIST_HEAD(folio_list);
+ nodemask_t node_alloc_noretry;
+
+ /* Bit mask controlling how hard we retry per-node allocations.*/
+ nodes_clear(node_alloc_noretry);
+
+ for (i = 0; i < h->max_huge_pages; ++i) {
+ folio = alloc_pool_huge_folio(h, &node_states[N_MEMORY],
+ &node_alloc_noretry);
+ if (!folio)
+ break;
+ list_add(&folio->lru, &folio_list);
+ cond_resched();
+ }
+
+ prep_and_add_allocated_folios(h, &folio_list);
+
+ return i;
+}
+
/*
* NOTE: this routine is called in different contexts for gigantic and
* non-gigantic pages.
@@ -3522,10 +3563,7 @@ static void __init hugetlb_hstate_alloc_pages_report(unsigned long allocated, st
*/
static void __init hugetlb_hstate_alloc_pages(struct hstate *h)
{
- unsigned long i;
- struct folio *folio;
- LIST_HEAD(folio_list);
- nodemask_t *node_alloc_noretry;
+ unsigned long allocated;
/* skip gigantic hugepages allocation if hugetlb_cma enabled */
if (hstate_is_gigantic(h) && hugetlb_cma_size) {
@@ -3539,46 +3577,12 @@ static void __init hugetlb_hstate_alloc_pages(struct hstate *h)
/* below will do all node balanced alloc */
if (!hstate_is_gigantic(h)) {
- /*
- * Bit mask controlling how hard we retry per-node allocations.
- * Ignore errors as lower level routines can deal with
- * node_alloc_noretry == NULL. If this kmalloc fails at boot
- * time, we are likely in bigger trouble.
- */
- node_alloc_noretry = kmalloc(sizeof(*node_alloc_noretry),
- GFP_KERNEL);
+ allocated = hugetlb_hstate_alloc_pages_non_gigantic(h);
} else {
- /* allocations done at boot time */
- node_alloc_noretry = NULL;
- }
-
- /* bit mask controlling how hard we retry per-node allocations */
- if (node_alloc_noretry)
- nodes_clear(*node_alloc_noretry);
-
- for (i = 0; i < h->max_huge_pages; ++i) {
- if (hstate_is_gigantic(h)) {
- /*
- * gigantic pages not added to list as they are not
- * added to pools now.
- */
- if (!alloc_bootmem_huge_page(h, NUMA_NO_NODE))
- break;
- } else {
- folio = alloc_pool_huge_folio(h, &node_states[N_MEMORY],
- node_alloc_noretry);
- if (!folio)
- break;
- list_add(&folio->lru, &folio_list);
- }
- cond_resched();
+ allocated = hugetlb_hstate_alloc_pages_gigantic(h);
}
- /* list will be empty if hstate_is_gigantic */
- prep_and_add_allocated_folios(h, &folio_list);
-
- hugetlb_hstate_alloc_pages_report(i, h);
- kfree(node_alloc_noretry);
+ hugetlb_hstate_alloc_pages_report(allocated, h);
}
static void __init hugetlb_init_hstates(void)
--
2.20.1
When a group of tasks that access different nodes are scheduled on the
same node, they may encounter bandwidth bottlenecks and access latency.
Thus, a numa_aware flag is introduced here, allowing tasks to be
distributed across different nodes to fully utilize the advantage of
multi-node systems.
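For illustration, a caller opts in by simply setting the new flag in its
job description (paraphrased from the 2M hugetlb patch later in this
series; hugetlb_alloc_node is the per-chunk worker added there):

```c
	struct padata_mt_job job = {
		.thread_fn  = hugetlb_alloc_node,	/* per-chunk allocation worker */
		.fn_arg     = h,			/* the hstate being populated */
		.align      = 1,
		.numa_aware = true,	/* spread work items across N_MEMORY nodes */
	};

	padata_do_multithreaded(&job);
```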
Signed-off-by: Gang Li <[email protected]>
---
include/linux/padata.h | 3 +++
kernel/padata.c | 8 ++++++--
mm/mm_init.c | 1 +
3 files changed, 10 insertions(+), 2 deletions(-)
diff --git a/include/linux/padata.h b/include/linux/padata.h
index 495b16b6b4d72..f79ccd50e7f40 100644
--- a/include/linux/padata.h
+++ b/include/linux/padata.h
@@ -137,6 +137,8 @@ struct padata_shell {
* appropriate for one worker thread to do at once.
* @max_threads: Max threads to use for the job, actual number may be less
* depending on task size and minimum chunk size.
+ * @numa_aware: Dispatch jobs to different nodes. If a node only has memory but
+ * no CPU, dispatch its jobs to a random CPU.
*/
struct padata_mt_job {
void (*thread_fn)(unsigned long start, unsigned long end, void *arg);
@@ -146,6 +148,7 @@ struct padata_mt_job {
unsigned long align;
unsigned long min_chunk;
int max_threads;
+ bool numa_aware;
};
/**
diff --git a/kernel/padata.c b/kernel/padata.c
index 179fb1518070c..1c2b3a337479e 100644
--- a/kernel/padata.c
+++ b/kernel/padata.c
@@ -485,7 +485,7 @@ void __init padata_do_multithreaded(struct padata_mt_job *job)
struct padata_work my_work, *pw;
struct padata_mt_job_state ps;
LIST_HEAD(works);
- int nworks;
+ int nworks, nid = 0;
if (job->size == 0)
return;
@@ -517,7 +517,11 @@ void __init padata_do_multithreaded(struct padata_mt_job *job)
ps.chunk_size = roundup(ps.chunk_size, job->align);
list_for_each_entry(pw, &works, pw_list)
- queue_work(system_unbound_wq, &pw->pw_work);
+ if (job->numa_aware)
+ queue_work_node((++nid % num_node_state(N_MEMORY)),
+ system_unbound_wq, &pw->pw_work);
+ else
+ queue_work(system_unbound_wq, &pw->pw_work);
/* Use the current thread, which saves starting a workqueue worker. */
padata_work_init(&my_work, padata_mt_helper, &ps, PADATA_WORK_ONSTACK);
diff --git a/mm/mm_init.c b/mm/mm_init.c
index 89dc29f1e6c6f..59fcffddf65a3 100644
--- a/mm/mm_init.c
+++ b/mm/mm_init.c
@@ -2225,6 +2225,7 @@ static int __init deferred_init_memmap(void *data)
.align = PAGES_PER_SECTION,
.min_chunk = PAGES_PER_SECTION,
.max_threads = max_threads,
+ .numa_aware = false,
};
padata_do_multithreaded(&job);
--
2.20.1
The parallelization of hugetlb allocation leads to errors when sharing
h->next_nid_to_alloc across different threads. To address this, it's
necessary to assign a separate next_nid_to_alloc for each thread.
Consequently, the hstate_next_node_to_alloc and for_each_node_mask_to_alloc
have been modified to directly accept a *next_nid_to_alloc parameter,
ensuring thread-specific allocation and avoiding concurrent access issues.
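For example, a parallel worker can now keep the cursor on its own stack,
while the existing sysfs/proc paths keep passing &h->next_nid_to_alloc
(illustrative sketch only, not part of this patch; `h` is the hstate the
worker allocates for):

```c
	int next_nid_to_alloc = 0;	/* thread-local cursor: no sharing, no race */
	nodemask_t node_alloc_noretry;
	struct folio *folio;

	nodes_clear(node_alloc_noretry);
	folio = alloc_pool_huge_folio(h, &node_states[N_MEMORY],
				      &node_alloc_noretry, &next_nid_to_alloc);
```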
Signed-off-by: Gang Li <[email protected]>
---
This patch seems not elegant, but I can't come up with anything better.
Any suggestions will be highly appreciated!
---
mm/hugetlb.c | 22 ++++++++++++----------
1 file changed, 12 insertions(+), 10 deletions(-)
diff --git a/mm/hugetlb.c b/mm/hugetlb.c
index 92448e747991d..a71bc1622b53b 100644
--- a/mm/hugetlb.c
+++ b/mm/hugetlb.c
@@ -1464,15 +1464,15 @@ static int get_valid_node_allowed(int nid, nodemask_t *nodes_allowed)
* next node from which to allocate, handling wrap at end of node
* mask.
*/
-static int hstate_next_node_to_alloc(struct hstate *h,
+static int hstate_next_node_to_alloc(int *next_nid_to_alloc,
nodemask_t *nodes_allowed)
{
int nid;
VM_BUG_ON(!nodes_allowed);
- nid = get_valid_node_allowed(h->next_nid_to_alloc, nodes_allowed);
- h->next_nid_to_alloc = next_node_allowed(nid, nodes_allowed);
+ nid = get_valid_node_allowed(*next_nid_to_alloc, nodes_allowed);
+ *next_nid_to_alloc = next_node_allowed(nid, nodes_allowed);
return nid;
}
@@ -1495,10 +1495,10 @@ static int hstate_next_node_to_free(struct hstate *h, nodemask_t *nodes_allowed)
return nid;
}
-#define for_each_node_mask_to_alloc(hs, nr_nodes, node, mask) \
+#define for_each_node_mask_to_alloc(next_nid_to_alloc, nr_nodes, node, mask) \
for (nr_nodes = nodes_weight(*mask); \
nr_nodes > 0 && \
- ((node = hstate_next_node_to_alloc(hs, mask)) || 1); \
+ ((node = hstate_next_node_to_alloc(next_nid_to_alloc, mask)) || 1); \
nr_nodes--)
#define for_each_node_mask_to_free(hs, nr_nodes, node, mask) \
@@ -2350,12 +2350,13 @@ static void prep_and_add_allocated_folios(struct hstate *h,
*/
static struct folio *alloc_pool_huge_folio(struct hstate *h,
nodemask_t *nodes_allowed,
- nodemask_t *node_alloc_noretry)
+ nodemask_t *node_alloc_noretry,
+ int *next_nid_to_alloc)
{
gfp_t gfp_mask = htlb_alloc_mask(h) | __GFP_THISNODE;
int nr_nodes, node;
- for_each_node_mask_to_alloc(h, nr_nodes, node, nodes_allowed) {
+ for_each_node_mask_to_alloc(next_nid_to_alloc, nr_nodes, node, nodes_allowed) {
struct folio *folio;
folio = only_alloc_fresh_hugetlb_folio(h, gfp_mask, node,
@@ -3310,7 +3311,7 @@ int __alloc_bootmem_huge_page(struct hstate *h, int nid)
goto found;
}
/* allocate from next node when distributing huge pages */
- for_each_node_mask_to_alloc(h, nr_nodes, node, &node_states[N_MEMORY]) {
+ for_each_node_mask_to_alloc(&h->next_nid_to_alloc, nr_nodes, node, &node_states[N_MEMORY]) {
m = memblock_alloc_try_nid_raw(
huge_page_size(h), huge_page_size(h),
0, MEMBLOCK_ALLOC_ACCESSIBLE, node);
@@ -3684,7 +3685,7 @@ static int adjust_pool_surplus(struct hstate *h, nodemask_t *nodes_allowed,
VM_BUG_ON(delta != -1 && delta != 1);
if (delta < 0) {
- for_each_node_mask_to_alloc(h, nr_nodes, node, nodes_allowed) {
+ for_each_node_mask_to_alloc(&h->next_nid_to_alloc, nr_nodes, node, nodes_allowed) {
if (h->surplus_huge_pages_node[node])
goto found;
}
@@ -3799,7 +3800,8 @@ static int set_max_huge_pages(struct hstate *h, unsigned long count, int nid,
cond_resched();
folio = alloc_pool_huge_folio(h, nodes_allowed,
- node_alloc_noretry);
+ node_alloc_noretry,
+ &h->next_nid_to_alloc);
if (!folio) {
prep_and_add_allocated_folios(h, &page_list);
spin_lock_irq(&hugetlb_lock);
--
2.20.1
Now hugetlb uses padata_do_multithreaded for parallel initialization,
so select CONFIG_PADATA.
Signed-off-by: Gang Li <[email protected]>
---
fs/Kconfig | 1 +
1 file changed, 1 insertion(+)
diff --git a/fs/Kconfig b/fs/Kconfig
index 89fdbefd1075f..a57d6e6c41e6f 100644
--- a/fs/Kconfig
+++ b/fs/Kconfig
@@ -262,6 +262,7 @@ menuconfig HUGETLBFS
depends on X86 || SPARC64 || ARCH_SUPPORTS_HUGETLBFS || BROKEN
depends on (SYSFS || SYSCTL)
select MEMFD_CREATE
+ select PADATA
help
hugetlbfs is a filesystem backing for HugeTLB pages, based on
ramfs. For architectures that support it, say Y here and read
--
2.20.1
By distributing both the allocation and the initialization tasks across
multiple threads, the initialization of 2M hugetlb pages becomes faster,
thereby improving boot speed.
Here are some test results:
test no patch(ms) patched(ms) saved
------------------- -------------- ------------- --------
256c2t(4 node) 2M 3336 1051 68.52%
128c1t(2 node) 2M 1943 716 63.15%
Signed-off-by: Gang Li <[email protected]>
---
mm/hugetlb.c | 72 ++++++++++++++++++++++++++++++++++++++--------------
1 file changed, 53 insertions(+), 19 deletions(-)
diff --git a/mm/hugetlb.c b/mm/hugetlb.c
index a71bc1622b53b..d1629df5f399f 100644
--- a/mm/hugetlb.c
+++ b/mm/hugetlb.c
@@ -35,6 +35,7 @@
#include <linux/delayacct.h>
#include <linux/memory.h>
#include <linux/mm_inline.h>
+#include <linux/padata.h>
#include <asm/page.h>
#include <asm/pgalloc.h>
@@ -3510,6 +3511,38 @@ static void __init hugetlb_hstate_alloc_pages_report(unsigned long allocated, st
}
}
+static void __init hugetlb_alloc_node(unsigned long start, unsigned long end, void *arg)
+{
+ struct hstate *h = (struct hstate *)arg;
+ int i, num = end - start;
+ nodemask_t node_alloc_noretry;
+ unsigned long flags;
+ int next_nid_to_alloc = 0;
+
+ /* Bit mask controlling how hard we retry per-node allocations.*/
+ nodes_clear(node_alloc_noretry);
+
+ for (i = 0; i < num; ++i) {
+ struct folio *folio = alloc_pool_huge_folio(h, &node_states[N_MEMORY],
+ &node_alloc_noretry, &next_nid_to_alloc);
+ if (!folio)
+ break;
+ spin_lock_irqsave(&hugetlb_lock, flags);
+ __prep_account_new_huge_page(h, folio_nid(folio));
+ enqueue_hugetlb_folio(h, folio);
+ spin_unlock_irqrestore(&hugetlb_lock, flags);
+ cond_resched();
+ }
+}
+
+static void __init hugetlb_vmemmap_optimize_node(unsigned long start, unsigned long end, void *arg)
+{
+ struct hstate *h = (struct hstate *)arg;
+ int nid = start;
+
+ hugetlb_vmemmap_optimize_folios(h, &h->hugepage_freelists[nid]);
+}
+
static unsigned long __init hugetlb_hstate_alloc_pages_gigantic(struct hstate *h)
{
unsigned long i;
@@ -3529,26 +3562,27 @@ static unsigned long __init hugetlb_hstate_alloc_pages_gigantic(struct hstate *h
static unsigned long __init hugetlb_hstate_alloc_pages_non_gigantic(struct hstate *h)
{
- unsigned long i;
- struct folio *folio;
- LIST_HEAD(folio_list);
- nodemask_t node_alloc_noretry;
-
- /* Bit mask controlling how hard we retry per-node allocations.*/
- nodes_clear(node_alloc_noretry);
-
- for (i = 0; i < h->max_huge_pages; ++i) {
- folio = alloc_pool_huge_folio(h, &node_states[N_MEMORY],
- &node_alloc_noretry);
- if (!folio)
- break;
- list_add(&folio->lru, &folio_list);
- cond_resched();
- }
-
- prep_and_add_allocated_folios(h, &folio_list);
+ struct padata_mt_job job = {
+ .fn_arg = h,
+ .align = 1,
+ .numa_aware = true
+ };
- return i;
+ job.thread_fn = hugetlb_alloc_node;
+ job.start = 0;
+ job.size = h->max_huge_pages;
+ job.min_chunk = h->max_huge_pages / num_node_state(N_MEMORY) / 2;
+ job.max_threads = num_node_state(N_MEMORY) * 2;
+ padata_do_multithreaded(&job);
+
+ job.thread_fn = hugetlb_vmemmap_optimize_node;
+ job.start = 0;
+ job.size = num_node_state(N_MEMORY);
+ job.min_chunk = 1;
+ job.max_threads = num_node_state(N_MEMORY);
+ padata_do_multithreaded(&job);
+
+ return h->nr_huge_pages;
}
/*
--
2.20.1
Optimize the initialization speed of 1G huge pages through
parallelization.
1G hugetlb pages are allocated from bootmem, a process that is already
very fast and does not currently require optimization. Therefore,
we focus on parallelizing only the initialization phase in
`gather_bootmem_prealloc`.
Here are some test results:
test no patch(ms) patched(ms) saved
------------------- -------------- ------------- --------
256c2t(4 node) 1G 4745 2024 57.34%
128c1t(2 node) 1G 3358 1712 49.02%
12t 1G 77000 18300 76.23%
Signed-off-by: Gang Li <[email protected]>
---
include/linux/hugetlb.h | 2 +-
mm/hugetlb.c | 40 +++++++++++++++++++++++++++++++++-------
2 files changed, 34 insertions(+), 8 deletions(-)
diff --git a/include/linux/hugetlb.h b/include/linux/hugetlb.h
index c1ee640d87b11..77b30a8c6076b 100644
--- a/include/linux/hugetlb.h
+++ b/include/linux/hugetlb.h
@@ -178,7 +178,7 @@ pte_t *huge_pmd_share(struct mm_struct *mm, struct vm_area_struct *vma,
struct address_space *hugetlb_page_mapping_lock_write(struct page *hpage);
extern int sysctl_hugetlb_shm_group;
-extern struct list_head huge_boot_pages;
+extern struct list_head huge_boot_pages[MAX_NUMNODES];
/* arch callbacks */
diff --git a/mm/hugetlb.c b/mm/hugetlb.c
index d1629df5f399f..e5a55707f8814 100644
--- a/mm/hugetlb.c
+++ b/mm/hugetlb.c
@@ -69,7 +69,7 @@ static bool hugetlb_cma_folio(struct folio *folio, unsigned int order)
#endif
static unsigned long hugetlb_cma_size __initdata;
-__initdata LIST_HEAD(huge_boot_pages);
+__initdata struct list_head huge_boot_pages[MAX_NUMNODES];
/* for command line parsing */
static struct hstate * __initdata parsed_hstate;
@@ -3339,7 +3339,7 @@ int __alloc_bootmem_huge_page(struct hstate *h, int nid)
huge_page_size(h) - PAGE_SIZE);
/* Put them into a private list first because mem_map is not up yet */
INIT_LIST_HEAD(&m->list);
- list_add(&m->list, &huge_boot_pages);
+ list_add(&m->list, &huge_boot_pages[node]);
m->hstate = h;
return 1;
}
@@ -3390,8 +3390,6 @@ static void __init prep_and_add_bootmem_folios(struct hstate *h,
/* Send list for bulk vmemmap optimization processing */
hugetlb_vmemmap_optimize_folios(h, folio_list);
- /* Add all new pool pages to free lists in one lock cycle */
- spin_lock_irqsave(&hugetlb_lock, flags);
list_for_each_entry_safe(folio, tmp_f, folio_list, lru) {
if (!folio_test_hugetlb_vmemmap_optimized(folio)) {
/*
@@ -3404,23 +3402,27 @@ static void __init prep_and_add_bootmem_folios(struct hstate *h,
HUGETLB_VMEMMAP_RESERVE_PAGES,
pages_per_huge_page(h));
}
+ /* Subdivide locks to achieve better parallel performance */
+ spin_lock_irqsave(&hugetlb_lock, flags);
__prep_account_new_huge_page(h, folio_nid(folio));
enqueue_hugetlb_folio(h, folio);
+ spin_unlock_irqrestore(&hugetlb_lock, flags);
}
- spin_unlock_irqrestore(&hugetlb_lock, flags);
}
/*
* Put bootmem huge pages into the standard lists after mem_map is up.
* Note: This only applies to gigantic (order > MAX_PAGE_ORDER) pages.
*/
-static void __init gather_bootmem_prealloc(void)
+static void __init __gather_bootmem_prealloc(unsigned long start, unsigned long end, void *arg)
+
{
+ int nid = start;
LIST_HEAD(folio_list);
struct huge_bootmem_page *m;
struct hstate *h = NULL, *prev_h = NULL;
- list_for_each_entry(m, &huge_boot_pages, list) {
+ list_for_each_entry(m, &huge_boot_pages[nid], list) {
struct page *page = virt_to_page(m);
struct folio *folio = (void *)page;
@@ -3453,6 +3455,22 @@ static void __init gather_bootmem_prealloc(void)
prep_and_add_bootmem_folios(h, &folio_list);
}
+static void __init gather_bootmem_prealloc(void)
+{
+ struct padata_mt_job job = {
+ .thread_fn = __gather_bootmem_prealloc,
+ .fn_arg = NULL,
+ .start = 0,
+ .size = num_node_state(N_MEMORY),
+ .align = 1,
+ .min_chunk = 1,
+ .max_threads = num_node_state(N_MEMORY),
+ .numa_aware = true,
+ };
+
+ padata_do_multithreaded(&job);
+}
+
static void __init hugetlb_hstate_alloc_pages_onenode(struct hstate *h, int nid)
{
unsigned long i;
@@ -3606,6 +3624,14 @@ static void __init hugetlb_hstate_alloc_pages(struct hstate *h)
return;
}
+ /* hugetlb_hstate_alloc_pages will be called many times, init huge_boot_pages once*/
+ if (huge_boot_pages[0].next == NULL) {
+ int i = 0;
+
+ for (i = 0; i < MAX_NUMNODES; i++)
+ INIT_LIST_HEAD(&huge_boot_pages[i]);
+ }
+
/* do node specific alloc */
if (hugetlb_hstate_alloc_pages_node_specific(h))
return;
--
2.20.1
On Tue, 2 Jan 2024, Gang Li wrote:
> The parallelization of hugetlb allocation leads to errors when sharing
> h->next_nid_to_alloc across different threads. To address this, it's
> necessary to assign a separate next_nid_to_alloc for each thread.
>
> Consequently, the hstate_next_node_to_alloc and for_each_node_mask_to_alloc
> have been modified to directly accept a *next_nid_to_alloc parameter,
> ensuring thread-specific allocation and avoiding concurrent access issues.
>
> Signed-off-by: Gang Li <[email protected]>
> ---
> This patch seems not elegant, but I can't come up with anything better.
> Any suggestions will be highly appreciated!
Same error as v2:
mm/hugetlb.c:3315:53: warning: variable 'node' is used uninitialized whenever '&&' condition is false [-Wsometimes-uninitialized]
for_each_node_mask_to_alloc(&h->next_nid_to_alloc, nr_nodes, node, &node_states[N_MEMORY]) {
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
mm/hugetlb.c:1501:3: note: expanded from macro 'for_each_node_mask_to_alloc'
nr_nodes > 0 && \
^~~~~~~~~~~~
mm/hugetlb.c:3342:38: note: uninitialized use occurs here
list_add(&m->list, &huge_boot_pages[node]);
^~~~
mm/hugetlb.c:3315:53: note: remove the '&&' if its condition is always true
for_each_node_mask_to_alloc(&h->next_nid_to_alloc, nr_nodes, node, &node_states[N_MEMORY]) {
^
mm/hugetlb.c:3310:7: warning: variable 'node' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized]
if (!m)
^~
mm/hugetlb.c:3342:38: note: uninitialized use occurs here
list_add(&m->list, &huge_boot_pages[node]);
^~~~
mm/hugetlb.c:3310:3: note: remove the 'if' if its condition is always true
if (!m)
^~~~~~~
mm/hugetlb.c:3304:20: note: initialize the variable 'node' to silence this warning
int nr_nodes, node;
^
= 0
2 warnings generated.
> ---
> mm/hugetlb.c | 22 ++++++++++++----------
> 1 file changed, 12 insertions(+), 10 deletions(-)
>
> diff --git a/mm/hugetlb.c b/mm/hugetlb.c
> index 92448e747991d..a71bc1622b53b 100644
> --- a/mm/hugetlb.c
> +++ b/mm/hugetlb.c
> @@ -1464,15 +1464,15 @@ static int get_valid_node_allowed(int nid, nodemask_t *nodes_allowed)
> * next node from which to allocate, handling wrap at end of node
> * mask.
> */
> -static int hstate_next_node_to_alloc(struct hstate *h,
> +static int hstate_next_node_to_alloc(int *next_nid_to_alloc,
> nodemask_t *nodes_allowed)
> {
> int nid;
>
> VM_BUG_ON(!nodes_allowed);
>
> - nid = get_valid_node_allowed(h->next_nid_to_alloc, nodes_allowed);
> - h->next_nid_to_alloc = next_node_allowed(nid, nodes_allowed);
> + nid = get_valid_node_allowed(*next_nid_to_alloc, nodes_allowed);
> + *next_nid_to_alloc = next_node_allowed(nid, nodes_allowed);
>
> return nid;
> }
> @@ -1495,10 +1495,10 @@ static int hstate_next_node_to_free(struct hstate *h, nodemask_t *nodes_allowed)
> return nid;
> }
>
> -#define for_each_node_mask_to_alloc(hs, nr_nodes, node, mask) \
> +#define for_each_node_mask_to_alloc(next_nid_to_alloc, nr_nodes, node, mask) \
> for (nr_nodes = nodes_weight(*mask); \
> nr_nodes > 0 && \
> - ((node = hstate_next_node_to_alloc(hs, mask)) || 1); \
> + ((node = hstate_next_node_to_alloc(next_nid_to_alloc, mask)) || 1); \
> nr_nodes--)
>
> #define for_each_node_mask_to_free(hs, nr_nodes, node, mask) \
> @@ -2350,12 +2350,13 @@ static void prep_and_add_allocated_folios(struct hstate *h,
> */
> static struct folio *alloc_pool_huge_folio(struct hstate *h,
> nodemask_t *nodes_allowed,
> - nodemask_t *node_alloc_noretry)
> + nodemask_t *node_alloc_noretry,
> + int *next_nid_to_alloc)
> {
> gfp_t gfp_mask = htlb_alloc_mask(h) | __GFP_THISNODE;
> int nr_nodes, node;
>
> - for_each_node_mask_to_alloc(h, nr_nodes, node, nodes_allowed) {
> + for_each_node_mask_to_alloc(next_nid_to_alloc, nr_nodes, node, nodes_allowed) {
> struct folio *folio;
>
> folio = only_alloc_fresh_hugetlb_folio(h, gfp_mask, node,
> @@ -3310,7 +3311,7 @@ int __alloc_bootmem_huge_page(struct hstate *h, int nid)
> goto found;
> }
> /* allocate from next node when distributing huge pages */
> - for_each_node_mask_to_alloc(h, nr_nodes, node, &node_states[N_MEMORY]) {
> + for_each_node_mask_to_alloc(&h->next_nid_to_alloc, nr_nodes, node, &node_states[N_MEMORY]) {
> m = memblock_alloc_try_nid_raw(
> huge_page_size(h), huge_page_size(h),
> 0, MEMBLOCK_ALLOC_ACCESSIBLE, node);
> @@ -3684,7 +3685,7 @@ static int adjust_pool_surplus(struct hstate *h, nodemask_t *nodes_allowed,
> VM_BUG_ON(delta != -1 && delta != 1);
>
> if (delta < 0) {
> - for_each_node_mask_to_alloc(h, nr_nodes, node, nodes_allowed) {
> + for_each_node_mask_to_alloc(&h->next_nid_to_alloc, nr_nodes, node, nodes_allowed) {
> if (h->surplus_huge_pages_node[node])
> goto found;
> }
> @@ -3799,7 +3800,8 @@ static int set_max_huge_pages(struct hstate *h, unsigned long count, int nid,
> cond_resched();
>
> folio = alloc_pool_huge_folio(h, nodes_allowed,
> - node_alloc_noretry);
> + node_alloc_noretry,
> + &h->next_nid_to_alloc);
> if (!folio) {
> prep_and_add_allocated_folios(h, &page_list);
> spin_lock_irq(&hugetlb_lock);
> --
> 2.20.1
>
>
On Tue, 2 Jan 2024, Gang Li wrote:
> Hi all, hugetlb init parallelization has now been updated to v3.
>
> This series is tested on next-20240102 and can not be applied to v6.7-rc8.
>
> Update Summary:
> - Select CONFIG_PADATA as we use padata_do_multithreaded
> - Fix a race condition in h->next_nid_to_alloc
> - Fix local variable initialization issues
> - Remove RFC tag
>
> Thanks to the testing by David Rientjes, we now know that this patch reduce
> hugetlb 1G initialization time from 77s to 18.3s on a 12T machine[4].
>
> # Introduction
> Hugetlb initialization during boot takes up a considerable amount of time.
> For instance, on a 2TB system, initializing 1,800 1GB huge pages takes 1-2
> seconds out of 10 seconds. Initializing 11,776 1GB pages on a 12TB Intel
> host takes more than 1 minute[1]. This is a noteworthy figure.
>
> Inspired by [2] and [3], hugetlb initialization can also be accelerated
> through parallelization. Kernel already has infrastructure like
> padata_do_multithreaded, this patch uses it to achieve effective results
> by minimal modifications.
>
> [1] https://lore.kernel.org/all/[email protected]/
> [2] https://lore.kernel.org/all/[email protected]/
> [3] https://lore.kernel.org/all/[email protected]/
> [4] https://lore.kernel.org/all/[email protected]/
>
> # Test result
> test no patch(ms) patched(ms) saved
> ------------------- -------------- ------------- --------
> 256c2t(4 node) 1G 4745 2024 57.34%
> 128c1t(2 node) 1G 3358 1712 49.02%
> 12t 1G 77000 18300 76.23%
>
> 256c2t(4 node) 2M 3336 1051 68.52%
> 128c1t(2 node) 2M 1943 716 63.15%
>
I tested 1GB hugetlb on a smaller AMD host with the following:
diff --git a/mm/hugetlb.c b/mm/hugetlb.c
--- a/mm/hugetlb.c
+++ b/mm/hugetlb.c
@@ -3301,7 +3301,7 @@ int alloc_bootmem_huge_page(struct hstate *h, int nid)
int __alloc_bootmem_huge_page(struct hstate *h, int nid)
{
struct huge_bootmem_page *m = NULL; /* initialize for clang */
- int nr_nodes, node;
+ int nr_nodes, node = nid;
/* do node specific alloc */
if (nid != NUMA_NO_NODE) {
After the build error is fixed, feel free to add:
Tested-by: David Rientjes <[email protected]>
to each patch. I think Andrew will probably take a build fix up as a
delta on top of patch 4 rather than sending a whole new series unless
there is other feedback that you receive.
On 2024/1/3 09:52, David Rientjes wrote:
>
> I tested 1GB hugetlb on a smaller AMD host with the following:
>
> diff --git a/mm/hugetlb.c b/mm/hugetlb.c
> --- a/mm/hugetlb.c
> +++ b/mm/hugetlb.c
> @@ -3301,7 +3301,7 @@ int alloc_bootmem_huge_page(struct hstate *h, int nid)
> int __alloc_bootmem_huge_page(struct hstate *h, int nid)
> {
> struct huge_bootmem_page *m = NULL; /* initialize for clang */
> - int nr_nodes, node;
> + int nr_nodes, node = nid;
>
> /* do node specific alloc */
> if (nid != NUMA_NO_NODE) {
>
Oh, if nid != NUMA_NO_NODE and memblock_alloc_try_nid_raw succeeds,
`node` must take the value of `nid`.
Otherwise, list_add(&m->list, &huge_boot_pages[node]) will not be
executed correctly.
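To spell out the control flow, here is a simplified sketch of
__alloc_bootmem_huge_page with that fix applied (see mm/hugetlb.c for the
real code):

```c
int __alloc_bootmem_huge_page(struct hstate *h, int nid)
{
	struct huge_bootmem_page *m = NULL;	/* initialize for clang */
	int nr_nodes, node = nid;	/* start from the requested node */

	/* do node specific alloc */
	if (nid != NUMA_NO_NODE) {
		m = memblock_alloc_try_nid_raw(huge_page_size(h), huge_page_size(h),
					       0, MEMBLOCK_ALLOC_ACCESSIBLE, nid);
		if (!m)
			return 0;
		goto found;	/* jumps over the loop, so "node" is never written there */
	}

	/* allocate from next node when distributing huge pages */
	for_each_node_mask_to_alloc(&h->next_nid_to_alloc, nr_nodes, node,
				    &node_states[N_MEMORY]) {
		m = memblock_alloc_try_nid_raw(huge_page_size(h), huge_page_size(h),
					       0, MEMBLOCK_ALLOC_ACCESSIBLE, node);
		if (m)
			break;
	}
	if (!m)
		return 0;

found:
	/* Put them into a private list first because mem_map is not up yet */
	INIT_LIST_HEAD(&m->list);
	list_add(&m->list, &huge_boot_pages[node]);	/* needs a valid "node" */
	m->hstate = h;
	return 1;
}
```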
> After the build error is fixed, feel free to add:
>
> Tested-by: David Rientjes <[email protected]>
>
Thanks!
> to each patch. I think Andrew will probably take a build fix up as a
> delta on top of patch 4 rather than sending a whole new series unless
> there is other feedback that you receive.
On 2024/1/3 09:32, David Rientjes wrote:
> Same error as v2:
>
> mm/hugetlb.c:3315:53: warning: variable 'node' is used uninitialized whenever '&&' condition is false [-Wsometimes-uninitialized]
> for_each_node_mask_to_alloc(&h->next_nid_to_alloc, nr_nodes, node, &node_states[N_MEMORY]) {
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
> mm/hugetlb.c:1501:3: note: expanded from macro 'for_each_node_mask_to_alloc'
> nr_nodes > 0 && \
> ^~~~~~~~~~~~
> mm/hugetlb.c:3342:38: note: uninitialized use occurs here
> list_add(&m->list, &huge_boot_pages[node]);
> ^~~~
> mm/hugetlb.c:3315:53: note: remove the '&&' if its condition is always true
> for_each_node_mask_to_alloc(&h->next_nid_to_alloc, nr_nodes, node, &node_states[N_MEMORY]) {
> ^
> mm/hugetlb.c:3310:7: warning: variable 'node' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized]
> if (!m)
> ^~
> mm/hugetlb.c:3342:38: note: uninitialized use occurs here
> list_add(&m->list, &huge_boot_pages[node]);
> ^~~~
> mm/hugetlb.c:3310:3: note: remove the 'if' if its condition is always true
> if (!m)
> ^~~~~~~
> mm/hugetlb.c:3304:20: note: initialize the variable 'node' to silence this warning
> int nr_nodes, node;
> ^
> = 0
> 2 warnings generated.
>
How did you get those warnings? I got nothing in my compilation.
On Wed, 3 Jan 2024, Gang Li wrote:
> On 2024/1/3 09:32, David Rientjes wrote:
> > Same error as v2:
> >
> > mm/hugetlb.c:3315:53: warning: variable 'node' is used uninitialized
> > whenever '&&' condition is false [-Wsometimes-uninitialized]
> > for_each_node_mask_to_alloc(&h->next_nid_to_alloc, nr_nodes, node,
> > &node_states[N_MEMORY]) {
> > ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
> > mm/hugetlb.c:1501:3: note: expanded from macro 'for_each_node_mask_to_alloc'
> > nr_nodes > 0 && \
> > ^~~~~~~~~~~~
> > mm/hugetlb.c:3342:38: note: uninitialized use occurs here
> > list_add(&m->list, &huge_boot_pages[node]);
> > ^~~~
> > mm/hugetlb.c:3315:53: note: remove the '&&' if its condition is always true
> > for_each_node_mask_to_alloc(&h->next_nid_to_alloc, nr_nodes, node,
> > &node_states[N_MEMORY]) {
> > ^
> > mm/hugetlb.c:3310:7: warning: variable 'node' is used uninitialized whenever
> > 'if' condition is false [-Wsometimes-uninitialized]
> > if (!m)
> > ^~
> > mm/hugetlb.c:3342:38: note: uninitialized use occurs here
> > list_add(&m->list, &huge_boot_pages[node]);
> > ^~~~
> > mm/hugetlb.c:3310:3: note: remove the 'if' if its condition is always true
> > if (!m)
> > ^~~~~~~
> > mm/hugetlb.c:3304:20: note: initialize the variable 'node' to silence this
> > warning
> > int nr_nodes, node;
> > ^
> > = 0
> > 2 warnings generated.
> >
>
> How did you get those warnings? I got nothing in my compilation.
>
I'm using clang.
You spotted the issue in your earlier reply about the potentially
uninitialized use of "node" when adding to the list.
On 2024/1/2 21:12, Gang Li wrote:
> The readability of `hugetlb_hstate_alloc_pages` is poor. By cleaning the
> code, its readability can be improved, facilitating future modifications.
>
> This patch extracts two functions to reduce the complexity of
> `hugetlb_hstate_alloc_pages` and has no functional changes.
>
> - hugetlb_hstate_alloc_pages_node_specific() to handle iterates through
> each online node and performs allocation if necessary.
> - hugetlb_hstate_alloc_pages_report() report error during allocation.
> And the value of h->max_huge_pages is updated accordingly.
>
> Signed-off-by: Gang Li <[email protected]>
> ---
> mm/hugetlb.c | 46 +++++++++++++++++++++++++++++-----------------
> 1 file changed, 29 insertions(+), 17 deletions(-)
>
> diff --git a/mm/hugetlb.c b/mm/hugetlb.c
> index ed1581b670d42..2606135ec55e6 100644
> --- a/mm/hugetlb.c
> +++ b/mm/hugetlb.c
> @@ -3482,6 +3482,33 @@ static void __init hugetlb_hstate_alloc_pages_onenode(struct hstate *h, int nid)
> h->max_huge_pages_node[nid] = i;
> }
>
> +static bool __init hugetlb_hstate_alloc_pages_node_specific(struct hstate *h)
I'd like to rename this to hugetlb_hstate_alloc_pages_specific_nodes.
Otherwise, LGTM.
Reviewed-by: Muchun Song <[email protected]>
> +{
> + int i;
> + bool node_specific_alloc = false;
> +
> + for_each_online_node(i) {
> + if (h->max_huge_pages_node[i] > 0) {
> + hugetlb_hstate_alloc_pages_onenode(h, i);
> + node_specific_alloc = true;
> + }
> + }
> +
> + return node_specific_alloc;
> +}
> +
> +static void __init hugetlb_hstate_alloc_pages_report(unsigned long allocated, struct hstate *h)
> +{
> + if (allocated < h->max_huge_pages) {
> + char buf[32];
> +
> + string_get_size(huge_page_size(h), 1, STRING_UNITS_2, buf, 32);
> + pr_warn("HugeTLB: allocating %lu of page size %s failed. Only allocated %lu hugepages.\n",
> + h->max_huge_pages, buf, allocated);
> + h->max_huge_pages = allocated;
> + }
> +}
> +
> /*
> * NOTE: this routine is called in different contexts for gigantic and
> * non-gigantic pages.
> @@ -3499,7 +3526,6 @@ static void __init hugetlb_hstate_alloc_pages(struct hstate *h)
> struct folio *folio;
> LIST_HEAD(folio_list);
> nodemask_t *node_alloc_noretry;
> - bool node_specific_alloc = false;
>
> /* skip gigantic hugepages allocation if hugetlb_cma enabled */
> if (hstate_is_gigantic(h) && hugetlb_cma_size) {
> @@ -3508,14 +3534,7 @@ static void __init hugetlb_hstate_alloc_pages(struct hstate *h)
> }
>
> /* do node specific alloc */
> - for_each_online_node(i) {
> - if (h->max_huge_pages_node[i] > 0) {
> - hugetlb_hstate_alloc_pages_onenode(h, i);
> - node_specific_alloc = true;
> - }
> - }
> -
> - if (node_specific_alloc)
> + if (hugetlb_hstate_alloc_pages_node_specific(h))
> return;
>
> /* below will do all node balanced alloc */
> @@ -3558,14 +3577,7 @@ static void __init hugetlb_hstate_alloc_pages(struct hstate *h)
> /* list will be empty if hstate_is_gigantic */
> prep_and_add_allocated_folios(h, &folio_list);
>
> - if (i < h->max_huge_pages) {
> - char buf[32];
> -
> - string_get_size(huge_page_size(h), 1, STRING_UNITS_2, buf, 32);
> - pr_warn("HugeTLB: allocating %lu of page size %s failed. Only allocated %lu hugepages.\n",
> - h->max_huge_pages, buf, i);
> - h->max_huge_pages = i;
> - }
> + hugetlb_hstate_alloc_pages_report(i, h);
> kfree(node_alloc_noretry);
> }
>
On Tue, 2024-01-02 at 21:12 +0800, Gang Li wrote:
> The readability of `hugetlb_hstate_alloc_pages` is poor. By cleaning the
> code, its readability can be improved, facilitating future modifications.
>
> This patch extracts two functions to reduce the complexity of
> `hugetlb_hstate_alloc_pages` and has no functional changes.
>
> - hugetlb_hstate_alloc_pages_node_specific() to handle iterates through
> each online node and performs allocation if necessary.
> - hugetlb_hstate_alloc_pages_report() report error during allocation.
> And the value of h->max_huge_pages is updated accordingly.
Minor nit, I think hugetlb_hstate_alloc_pages_errcheck() is more
descriptive than hugetlb_hstate_alloc_pages_report().
Otherwise
Reviewed-by: Tim Chen <[email protected]>
>
> Signed-off-by: Gang Li <[email protected]>
> ---
> mm/hugetlb.c | 46 +++++++++++++++++++++++++++++-----------------
> 1 file changed, 29 insertions(+), 17 deletions(-)
>
> diff --git a/mm/hugetlb.c b/mm/hugetlb.c
> index ed1581b670d42..2606135ec55e6 100644
> --- a/mm/hugetlb.c
> +++ b/mm/hugetlb.c
> @@ -3482,6 +3482,33 @@ static void __init hugetlb_hstate_alloc_pages_onenode(struct hstate *h, int nid)
> h->max_huge_pages_node[nid] = i;
> }
>
> +static bool __init hugetlb_hstate_alloc_pages_node_specific(struct hstate *h)
> +{
> + int i;
> + bool node_specific_alloc = false;
> +
> + for_each_online_node(i) {
> + if (h->max_huge_pages_node[i] > 0) {
> + hugetlb_hstate_alloc_pages_onenode(h, i);
> + node_specific_alloc = true;
> + }
> + }
> +
> + return node_specific_alloc;
> +}
> +
> +static void __init hugetlb_hstate_alloc_pages_report(unsigned long allocated, struct hstate *h)
> +{
> + if (allocated < h->max_huge_pages) {
> + char buf[32];
> +
> + string_get_size(huge_page_size(h), 1, STRING_UNITS_2, buf, 32);
> + pr_warn("HugeTLB: allocating %lu of page size %s failed. Only allocated %lu hugepages.\n",
> + h->max_huge_pages, buf, allocated);
> + h->max_huge_pages = allocated;
> + }
> +}
> +
> /*
> * NOTE: this routine is called in different contexts for gigantic and
> * non-gigantic pages.
> @@ -3499,7 +3526,6 @@ static void __init hugetlb_hstate_alloc_pages(struct hstate *h)
> struct folio *folio;
> LIST_HEAD(folio_list);
> nodemask_t *node_alloc_noretry;
> - bool node_specific_alloc = false;
>
> /* skip gigantic hugepages allocation if hugetlb_cma enabled */
> if (hstate_is_gigantic(h) && hugetlb_cma_size) {
> @@ -3508,14 +3534,7 @@ static void __init hugetlb_hstate_alloc_pages(struct hstate *h)
> }
>
> /* do node specific alloc */
> - for_each_online_node(i) {
> - if (h->max_huge_pages_node[i] > 0) {
> - hugetlb_hstate_alloc_pages_onenode(h, i);
> - node_specific_alloc = true;
> - }
> - }
> -
> - if (node_specific_alloc)
> + if (hugetlb_hstate_alloc_pages_node_specific(h))
> return;
>
> /* below will do all node balanced alloc */
> @@ -3558,14 +3577,7 @@ static void __init hugetlb_hstate_alloc_pages(struct hstate *h)
> /* list will be empty if hstate_is_gigantic */
> prep_and_add_allocated_folios(h, &folio_list);
>
> - if (i < h->max_huge_pages) {
> - char buf[32];
> -
> - string_get_size(huge_page_size(h), 1, STRING_UNITS_2, buf, 32);
> - pr_warn("HugeTLB: allocating %lu of page size %s failed. Only allocated %lu hugepages.\n",
> - h->max_huge_pages, buf, i);
> - h->max_huge_pages = i;
> - }
> + hugetlb_hstate_alloc_pages_report(i, h);
> kfree(node_alloc_noretry);
> }
>
On Tue, 2024-01-02 at 21:12 +0800, Gang Li wrote:
> 1G and 2M huge pages have different allocation and initialization logic,
> which leads to subtle differences in parallelization. Therefore, it is
> appropriate to split hugetlb_hstate_alloc_pages into gigantic and
> non-gigantic.
>
> This patch has no functional changes.
>
> Signed-off-by: Gang Li <[email protected]>
> ---
> mm/hugetlb.c | 86 +++++++++++++++++++++++++++-------------------------
> 1 file changed, 45 insertions(+), 41 deletions(-)
>
> diff --git a/mm/hugetlb.c b/mm/hugetlb.c
> index 2606135ec55e6..92448e747991d 100644
> --- a/mm/hugetlb.c
> +++ b/mm/hugetlb.c
> @@ -3509,6 +3509,47 @@ static void __init hugetlb_hstate_alloc_pages_report(unsigned long allocated, st
> }
> }
>
> +static unsigned long __init hugetlb_hstate_alloc_pages_gigantic(struct hstate *h)
> +{
> + unsigned long i;
> +
> + for (i = 0; i < h->max_huge_pages; ++i) {
> + /*
> + * gigantic pages not added to list as they are not
> + * added to pools now.
> + */
This comment is unnecessary now that we no longer mix the gigantic and
non-gigantic code that uses the folio list. And folio_list is not in this
routine. It can be removed.
Otherwise Reviewed-by: Tim Chen <[email protected]>
> + if (!alloc_bootmem_huge_page(h, NUMA_NO_NODE))
> + break;
> + cond_resched();
> + }
> +
> + return i;
> +}
> +
> +static unsigned long __init hugetlb_hstate_alloc_pages_non_gigantic(struct hstate *h)
> +{
> + unsigned long i;
> + struct folio *folio;
> + LIST_HEAD(folio_list);
> + nodemask_t node_alloc_noretry;
> +
> + /* Bit mask controlling how hard we retry per-node allocations.*/
> + nodes_clear(node_alloc_noretry);
> +
> + for (i = 0; i < h->max_huge_pages; ++i) {
> + folio = alloc_pool_huge_folio(h, &node_states[N_MEMORY],
> + &node_alloc_noretry);
> + if (!folio)
> + break;
> + list_add(&folio->lru, &folio_list);
> + cond_resched();
> + }
> +
> + prep_and_add_allocated_folios(h, &folio_list);
> +
> + return i;
> +}
> +
> /*
> * NOTE: this routine is called in different contexts for gigantic and
> * non-gigantic pages.
> @@ -3522,10 +3563,7 @@ static void __init hugetlb_hstate_alloc_pages_report(unsigned long allocated, st
> */
> static void __init hugetlb_hstate_alloc_pages(struct hstate *h)
> {
> - unsigned long i;
> - struct folio *folio;
> - LIST_HEAD(folio_list);
> - nodemask_t *node_alloc_noretry;
> + unsigned long allocated;
>
> /* skip gigantic hugepages allocation if hugetlb_cma enabled */
> if (hstate_is_gigantic(h) && hugetlb_cma_size) {
> @@ -3539,46 +3577,12 @@ static void __init hugetlb_hstate_alloc_pages(struct hstate *h)
>
> /* below will do all node balanced alloc */
> if (!hstate_is_gigantic(h)) {
> - /*
> - * Bit mask controlling how hard we retry per-node allocations.
> - * Ignore errors as lower level routines can deal with
> - * node_alloc_noretry == NULL. If this kmalloc fails at boot
> - * time, we are likely in bigger trouble.
> - */
> - node_alloc_noretry = kmalloc(sizeof(*node_alloc_noretry),
> - GFP_KERNEL);
> + allocated = hugetlb_hstate_alloc_pages_non_gigantic(h);
> } else {
> - /* allocations done at boot time */
> - node_alloc_noretry = NULL;
> - }
> -
> - /* bit mask controlling how hard we retry per-node allocations */
> - if (node_alloc_noretry)
> - nodes_clear(*node_alloc_noretry);
> -
> - for (i = 0; i < h->max_huge_pages; ++i) {
> - if (hstate_is_gigantic(h)) {
> - /*
> - * gigantic pages not added to list as they are not
> - * added to pools now.
> - */
> - if (!alloc_bootmem_huge_page(h, NUMA_NO_NODE))
> - break;
> - } else {
> - folio = alloc_pool_huge_folio(h, &node_states[N_MEMORY],
> - node_alloc_noretry);
> - if (!folio)
> - break;
> - list_add(&folio->lru, &folio_list);
> - }
> - cond_resched();
> + allocated = hugetlb_hstate_alloc_pages_gigantic(h);
> }
>
> - /* list will be empty if hstate_is_gigantic */
> - prep_and_add_allocated_folios(h, &folio_list);
> -
> - hugetlb_hstate_alloc_pages_report(i, h);
> - kfree(node_alloc_noretry);
> + hugetlb_hstate_alloc_pages_report(allocated, h);
> }
>
> static void __init hugetlb_init_hstates(void)
On 2024/1/10 18:19, Muchun Song wrote:
>
>
> On 2024/1/2 21:12, Gang Li wrote:
>> The readability of `hugetlb_hstate_alloc_pages` is poor. By cleaning the
>> code, its readability can be improved, facilitating future modifications.
>>
>> This patch extracts two functions to reduce the complexity of
>> `hugetlb_hstate_alloc_pages` and has no functional changes.
>>
>> - hugetlb_hstate_alloc_pages_node_specific() to handle iterates through
>> each online node and performs allocation if necessary.
>> - hugetlb_hstate_alloc_pages_report() report error during allocation.
>> And the value of h->max_huge_pages is updated accordingly.
>>
>> Signed-off-by: Gang Li <[email protected]>
>> ---
>> mm/hugetlb.c | 46 +++++++++++++++++++++++++++++-----------------
>> 1 file changed, 29 insertions(+), 17 deletions(-)
>>
>> diff --git a/mm/hugetlb.c b/mm/hugetlb.c
>> index ed1581b670d42..2606135ec55e6 100644
>> --- a/mm/hugetlb.c
>> +++ b/mm/hugetlb.c
>> @@ -3482,6 +3482,33 @@ static void __init
>> hugetlb_hstate_alloc_pages_onenode(struct hstate *h, int nid)
>> h->max_huge_pages_node[nid] = i;
>> }
>> +static bool __init hugetlb_hstate_alloc_pages_node_specific(struct
>> hstate *h)
>
> I'd like to rename this to hugetlb_hstate_alloc_pages_specific_nodes.
>
> Otherwise, LGTM.
>
> Reviewed-by: Muchun Song <[email protected]>
>
Thanks! I will adjust it in the next version.
On 2024/1/11 05:55, Tim Chen wrote:
> On Tue, 2024-01-02 at 21:12 +0800, Gang Li wrote:
>> The readability of `hugetlb_hstate_alloc_pages` is poor. By cleaning the
>> code, its readability can be improved, facilitating future modifications.
>>
>> This patch extracts two functions to reduce the complexity of
>> `hugetlb_hstate_alloc_pages` and has no functional changes.
>>
>> - hugetlb_hstate_alloc_pages_node_specific() to handle iterates through
>> each online node and performs allocation if necessary.
>> - hugetlb_hstate_alloc_pages_report() report error during allocation.
>> And the value of h->max_huge_pages is updated accordingly.
>
> Minor nit, I think hugetlb_hstate_alloc_pages_errcheck() is more
> descriptive than hugetlb_hstate_alloc_pages_report().
Thanks! This looks more intuitive.
>
> Otherwise
>
> Reviewed-by: Tim Chen <[email protected]>
>
On 2024/1/11 07:12, Tim Chen wrote:
> On Tue, 2024-01-02 at 21:12 +0800, Gang Li wrote:
>> +static unsigned long __init hugetlb_hstate_alloc_pages_gigantic(struct hstate *h)
>> +{
>> + unsigned long i;
>> +
>> + for (i = 0; i < h->max_huge_pages; ++i) {
>> + /*
>> + * gigantic pages not added to list as they are not
>> + * added to pools now.
>> + */
>
> This comment unnecessary as now we don't have mix gigantic and non-gigantic code,
> which uses foilio list. And folio_list is not in this routine.
>
> Can be removed.
>
> Otherwise Reviewed-by: Tim Chen <[email protected]>
>
Thanks!
On Tue, 2024-01-02 at 21:12 +0800, Gang Li wrote:
> When a group of tasks that access different nodes are scheduled on the
> same node, they may encounter bandwidth bottlenecks and access latency.
>
> Thus, numa_aware flag is introduced here, allowing tasks to be
> distributed across different nodes to fully utilize the advantage of
> multi-node systems.
>
> Signed-off-by: Gang Li <[email protected]>
> ---
> include/linux/padata.h | 3 +++
> kernel/padata.c | 8 ++++++--
> mm/mm_init.c | 1 +
> 3 files changed, 10 insertions(+), 2 deletions(-)
>
> diff --git a/include/linux/padata.h b/include/linux/padata.h
> index 495b16b6b4d72..f79ccd50e7f40 100644
> --- a/include/linux/padata.h
> +++ b/include/linux/padata.h
> @@ -137,6 +137,8 @@ struct padata_shell {
> * appropriate for one worker thread to do at once.
> * @max_threads: Max threads to use for the job, actual number may be less
> * depending on task size and minimum chunk size.
> + * @numa_aware: Dispatch jobs to different nodes. If a node only has memory but
> + * no CPU, dispatch its jobs to a random CPU.
> */
> struct padata_mt_job {
> void (*thread_fn)(unsigned long start, unsigned long end, void *arg);
> @@ -146,6 +148,7 @@ struct padata_mt_job {
> unsigned long align;
> unsigned long min_chunk;
> int max_threads;
> + bool numa_aware;
> };
>
> /**
> diff --git a/kernel/padata.c b/kernel/padata.c
> index 179fb1518070c..1c2b3a337479e 100644
> --- a/kernel/padata.c
> +++ b/kernel/padata.c
> @@ -485,7 +485,7 @@ void __init padata_do_multithreaded(struct padata_mt_job *job)
> struct padata_work my_work, *pw;
> struct padata_mt_job_state ps;
> LIST_HEAD(works);
> - int nworks;
> + int nworks, nid = 0;
If we always start from 0, we may be biased towards the low-numbered
nodes and not use the high-numbered nodes at all. Suggest you do
	static int nid = 0;
>
> if (job->size == 0)
> return;
> @@ -517,7 +517,11 @@ void __init padata_do_multithreaded(struct padata_mt_job *job)
> ps.chunk_size = roundup(ps.chunk_size, job->align);
>
> list_for_each_entry(pw, &works, pw_list)
> - queue_work(system_unbound_wq, &pw->pw_work);
> + if (job->numa_aware)
> + queue_work_node((++nid % num_node_state(N_MEMORY)),
> + system_unbound_wq, &pw->pw_work);
I think we should use nid = next_node(nid, node_states[N_CPU]) instead of
++nid % num_node_state(N_MEMORY). You are picking the next node with CPU
to handle the job.
Tim
> + else
> + queue_work(system_unbound_wq, &pw->pw_work);
>
> /* Use the current thread, which saves starting a workqueue worker. */
> padata_work_init(&my_work, padata_mt_helper, &ps, PADATA_WORK_ONSTACK);
> diff --git a/mm/mm_init.c b/mm/mm_init.c
> index 89dc29f1e6c6f..59fcffddf65a3 100644
> --- a/mm/mm_init.c
> +++ b/mm/mm_init.c
> @@ -2225,6 +2225,7 @@ static int __init deferred_init_memmap(void *data)
> .align = PAGES_PER_SECTION,
> .min_chunk = PAGES_PER_SECTION,
> .max_threads = max_threads,
> + .numa_aware = false,
> };
>
> padata_do_multithreaded(&job);
On Tue, 2024-01-02 at 21:12 +0800, Gang Li wrote:
> The parallelization of hugetlb allocation leads to errors when sharing
> h->next_nid_to_alloc across different threads. To address this, it's
Suggest you say
With parallelization of hugetlb allocation across different threads,
each thread works on a different node to allocate pages from, instead
of all allocating from a common node h->next_nid_to_alloc. To address this, it's
> necessary to assign a separate next_nid_to_alloc for each thread.
>
> Consequently, the hstate_next_node_to_alloc and for_each_node_mask_to_alloc
> have been modified to directly accept a *next_nid_to_alloc parameter,
> ensuring thread-specific allocation and avoiding concurrent access issues.
>
> Signed-off-by: Gang Li <[email protected]>
> ---
> This patch seems not elegant, but I can't come up with anything better.
> Any suggestions will be highly appreciated!
> ---
> mm/hugetlb.c | 22 ++++++++++++----------
> 1 file changed, 12 insertions(+), 10 deletions(-)
>
> diff --git a/mm/hugetlb.c b/mm/hugetlb.c
> index 92448e747991d..a71bc1622b53b 100644
> --- a/mm/hugetlb.c
> +++ b/mm/hugetlb.c
> @@ -1464,15 +1464,15 @@ static int get_valid_node_allowed(int nid, nodemask_t *nodes_allowed)
> * next node from which to allocate, handling wrap at end of node
> * mask.
> */
> -static int hstate_next_node_to_alloc(struct hstate *h,
> +static int hstate_next_node_to_alloc(int *next_nid_to_alloc,
> nodemask_t *nodes_allowed)
> {
> int nid;
>
> VM_BUG_ON(!nodes_allowed);
>
> - nid = get_valid_node_allowed(h->next_nid_to_alloc, nodes_allowed);
> - h->next_nid_to_alloc = next_node_allowed(nid, nodes_allowed);
> + nid = get_valid_node_allowed(*next_nid_to_alloc, nodes_allowed);
> + *next_nid_to_alloc = next_node_allowed(nid, nodes_allowed);
>
> return nid;
> }
> @@ -1495,10 +1495,10 @@ static int hstate_next_node_to_free(struct hstate *h, nodemask_t *nodes_allowed)
> return nid;
> }
>
> -#define for_each_node_mask_to_alloc(hs, nr_nodes, node, mask) \
> +#define for_each_node_mask_to_alloc(next_nid_to_alloc, nr_nodes, node, mask) \
> for (nr_nodes = nodes_weight(*mask); \
> nr_nodes > 0 && \
> - ((node = hstate_next_node_to_alloc(hs, mask)) || 1); \
> + ((node = hstate_next_node_to_alloc(next_nid_to_alloc, mask)) || 1); \
> nr_nodes--)
>
> #define for_each_node_mask_to_free(hs, nr_nodes, node, mask) \
> @@ -2350,12 +2350,13 @@ static void prep_and_add_allocated_folios(struct hstate *h,
> */
> static struct folio *alloc_pool_huge_folio(struct hstate *h,
> nodemask_t *nodes_allowed,
> - nodemask_t *node_alloc_noretry)
> + nodemask_t *node_alloc_noretry,
> + int *next_nid_to_alloc)
> {
> gfp_t gfp_mask = htlb_alloc_mask(h) | __GFP_THISNODE;
> int nr_nodes, node;
>
> - for_each_node_mask_to_alloc(h, nr_nodes, node, nodes_allowed) {
> + for_each_node_mask_to_alloc(next_nid_to_alloc, nr_nodes, node, nodes_allowed) {
> struct folio *folio;
>
> folio = only_alloc_fresh_hugetlb_folio(h, gfp_mask, node,
> @@ -3310,7 +3311,7 @@ int __alloc_bootmem_huge_page(struct hstate *h, int nid)
> goto found;
> }
> /* allocate from next node when distributing huge pages */
> - for_each_node_mask_to_alloc(h, nr_nodes, node, &node_states[N_MEMORY]) {
> + for_each_node_mask_to_alloc(&h->next_nid_to_alloc, nr_nodes, node, &node_states[N_MEMORY]) {
> m = memblock_alloc_try_nid_raw(
> huge_page_size(h), huge_page_size(h),
> 0, MEMBLOCK_ALLOC_ACCESSIBLE, node);
> @@ -3684,7 +3685,7 @@ static int adjust_pool_surplus(struct hstate *h, nodemask_t *nodes_allowed,
> VM_BUG_ON(delta != -1 && delta != 1);
>
> if (delta < 0) {
> - for_each_node_mask_to_alloc(h, nr_nodes, node, nodes_allowed) {
> + for_each_node_mask_to_alloc(&h->next_nid_to_alloc, nr_nodes, node, nodes_allowed) {
> if (h->surplus_huge_pages_node[node])
> goto found;
> }
> @@ -3799,7 +3800,8 @@ static int set_max_huge_pages(struct hstate *h, unsigned long count, int nid,
> cond_resched();
>
> folio = alloc_pool_huge_folio(h, nodes_allowed,
> - node_alloc_noretry);
> + node_alloc_noretry,
> + &h->next_nid_to_alloc);
> if (!folio) {
> prep_and_add_allocated_folios(h, &page_list);
> spin_lock_irq(&hugetlb_lock);
On Tue, 2024-01-02 at 21:12 +0800, Gang Li wrote:
> Now hugetlb uses padata_do_multithreaded for parallel initialization,
> so select CONFIG_PADATA.
Perhaps rephrase
Allow hugetlb to use padata_do_multithreaded for parallel initialization.
Select CONFIG_PADATA in this case.
>
> Signed-off-by: Gang Li <[email protected]>
> ---
> fs/Kconfig | 1 +
> 1 file changed, 1 insertion(+)
>
> diff --git a/fs/Kconfig b/fs/Kconfig
> index 89fdbefd1075f..a57d6e6c41e6f 100644
> --- a/fs/Kconfig
> +++ b/fs/Kconfig
> @@ -262,6 +262,7 @@ menuconfig HUGETLBFS
> depends on X86 || SPARC64 || ARCH_SUPPORTS_HUGETLBFS || BROKEN
> depends on (SYSFS || SYSCTL)
> select MEMFD_CREATE
> + select PADATA
> help
> hugetlbfs is a filesystem backing for HugeTLB pages, based on
> ramfs. For architectures that support it, say Y here and read
On 2024/1/12 01:50, Tim Chen wrote:
> On Tue, 2024-01-02 at 21:12 +0800, Gang Li wrote:
>> When a group of tasks that access different nodes are scheduled on the
>> same node, they may encounter bandwidth bottlenecks and access latency.
>>
>> Thus, numa_aware flag is introduced here, allowing tasks to be
>> distributed across different nodes to fully utilize the advantage of
>> multi-node systems.
>>
>> Signed-off-by: Gang Li <[email protected]>
>> ---
>> include/linux/padata.h | 3 +++
>> kernel/padata.c | 8 ++++++--
>> mm/mm_init.c | 1 +
>> 3 files changed, 10 insertions(+), 2 deletions(-)
>>
>> diff --git a/include/linux/padata.h b/include/linux/padata.h
>> index 495b16b6b4d72..f79ccd50e7f40 100644
>> --- a/include/linux/padata.h
>> +++ b/include/linux/padata.h
>> @@ -137,6 +137,8 @@ struct padata_shell {
>> * appropriate for one worker thread to do at once.
>> * @max_threads: Max threads to use for the job, actual number may be less
>> * depending on task size and minimum chunk size.
>> + * @numa_aware: Dispatch jobs to different nodes. If a node only has memory but
>> + * no CPU, dispatch its jobs to a random CPU.
>> */
>> struct padata_mt_job {
>> void (*thread_fn)(unsigned long start, unsigned long end, void *arg);
>> @@ -146,6 +148,7 @@ struct padata_mt_job {
>> unsigned long align;
>> unsigned long min_chunk;
>> int max_threads;
>> + bool numa_aware;
>> };
>>
>> /**
>> diff --git a/kernel/padata.c b/kernel/padata.c
>> index 179fb1518070c..1c2b3a337479e 100644
>> --- a/kernel/padata.c
>> +++ b/kernel/padata.c
>> @@ -485,7 +485,7 @@ void __init padata_do_multithreaded(struct padata_mt_job *job)
>> struct padata_work my_work, *pw;
>> struct padata_mt_job_state ps;
>> LIST_HEAD(works);
>> - int nworks;
>> + int nworks, nid = 0;
>
> If we always start from 0, we may be biased towards the low numbered node,
> and not use high numbered nodes at all. Suggest you do
> static nid = 0;
>
When we use `static`, multiple parallel calls to
`padata_do_multithreaded` may result in an uneven distribution of
tasks within each padata_do_multithreaded call.
We can make the following modifications to address this issue.
```
diff --git a/kernel/padata.c b/kernel/padata.c
index 1c2b3a337479e..925e48df6dd8d 100644
--- a/kernel/padata.c
+++ b/kernel/padata.c
@@ -485,7 +485,8 @@ void __init padata_do_multithreaded(struct padata_mt_job *job)
 	struct padata_work my_work, *pw;
 	struct padata_mt_job_state ps;
 	LIST_HEAD(works);
-	int nworks, nid = 0;
+	int nworks, nid;
+	static volatile int global_nid = 0;
 
 	if (job->size == 0)
 		return;
@@ -516,12 +517,15 @@ void __init padata_do_multithreaded(struct padata_mt_job *job)
 	ps.chunk_size = max(ps.chunk_size, job->min_chunk);
 	ps.chunk_size = roundup(ps.chunk_size, job->align);
 
+	nid = global_nid;
 	list_for_each_entry(pw, &works, pw_list)
-		if (job->numa_aware)
-			queue_work_node((++nid % num_node_state(N_MEMORY)),
-					system_unbound_wq, &pw->pw_work);
-		else
+		if (job->numa_aware) {
+			queue_work_node(nid, system_unbound_wq, &pw->pw_work);
+			nid = next_node(nid, node_states[N_CPU]);
+		} else
 			queue_work(system_unbound_wq, &pw->pw_work);
+	if (job->numa_aware)
+		global_nid = nid;
 
 	/* Use the current thread, which saves starting a workqueue worker. */
 	padata_work_init(&my_work, padata_mt_helper, &ps, PADATA_WORK_ONSTACK);
```
>>
>> if (job->size == 0)
>> return;
>> @@ -517,7 +517,11 @@ void __init padata_do_multithreaded(struct padata_mt_job *job)
>> ps.chunk_size = roundup(ps.chunk_size, job->align);
>>
>> list_for_each_entry(pw, &works, pw_list)
>> - queue_work(system_unbound_wq, &pw->pw_work);
>> + if (job->numa_aware)
>> + queue_work_node((++nid % num_node_state(N_MEMORY)),
>> + system_unbound_wq, &pw->pw_work);
>
> I think we should use nid = next_node(nid, node_states[N_CPU]) instead of
> ++nid % num_node_state(N_MEMORY). You are picking the next node with CPU
> to handle the job.
>
> Tim
>
I agree.
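To make the distinction concrete, here is a tiny hedged sketch (the helper name is hypothetical, not from the series): next_node() walks the actual nodemask, so sparse node IDs and memory-only nodes are skipped, which a plain `++nid % num_node_state(N_MEMORY)` cannot guarantee, since the count of nodes is not the same as the set of valid node IDs. The wrap-around question is picked up further down the thread.
```
/*
 * Illustrative only: advance a round-robin cursor over nodes that have
 * CPUs. next_node() returns MAX_NUMNODES past the last set bit, so the
 * caller must wrap explicitly (next_node_in() would wrap automatically).
 */
static int pick_next_cpu_node(int nid)
{
	nid = next_node(nid, node_states[N_CPU]);
	if (nid >= MAX_NUMNODES)
		nid = first_node(node_states[N_CPU]);
	return nid;
}
```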
On 2024/1/12 06:21, Tim Chen wrote:
> On Tue, 2024-01-02 at 21:12 +0800, Gang Li wrote:
>> The parallelization of hugetlb allocation leads to errors when sharing
>> h->next_nid_to_alloc across different threads. To address this, it's
>
> Suggest you say
> With parallelization of hugetlb allocation across different threads,
> each thread works on a different node to allocate pages from, instead
> of all allocating from a common node h->next_nid_to_alloc. To address this, it's
>
>> necessary to assign a separate next_nid_to_alloc for each thread.
LGTM, thanks.
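For illustration, a minimal sketch (not part of the series) of how a boot-time worker could drive the reworked interface with a thread-local cursor instead of the shared h->next_nid_to_alloc; the worker function and its signature are hypothetical:
```
/*
 * Hypothetical per-thread boot allocation loop: each worker keeps its own
 * next_nid_to_alloc, so concurrent workers no longer race on the shared
 * h->next_nid_to_alloc cursor.
 */
static void __init hugetlb_alloc_worker_sketch(struct hstate *h,
					       unsigned long nr_pages)
{
	LIST_HEAD(folio_list);
	nodemask_t node_alloc_noretry;
	int next_nid_to_alloc = first_node(node_states[N_MEMORY]);
	unsigned long i;

	/* Bit mask controlling how hard we retry per-node allocations. */
	nodes_clear(node_alloc_noretry);

	for (i = 0; i < nr_pages; i++) {
		struct folio *folio;

		folio = alloc_pool_huge_folio(h, &node_states[N_MEMORY],
					      &node_alloc_noretry,
					      &next_nid_to_alloc);
		if (!folio)
			break;
		list_add(&folio->lru, &folio_list);
		cond_resched();
	}

	prep_and_add_allocated_folios(h, &folio_list);
}
```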
On Fri, 2024-01-12 at 15:09 +0800, Gang Li wrote:
> On 2024/1/12 01:50, Tim Chen wrote:
> > On Tue, 2024-01-02 at 21:12 +0800, Gang Li wrote:
> > > When a group of tasks that access different nodes are scheduled on the
> > > same node, they may encounter bandwidth bottlenecks and access latency.
> > >
> > > Thus, numa_aware flag is introduced here, allowing tasks to be
> > > distributed across different nodes to fully utilize the advantage of
> > > multi-node systems.
> > >
> > > Signed-off-by: Gang Li <[email protected]>
> > > ---
> > > include/linux/padata.h | 3 +++
> > > kernel/padata.c | 8 ++++++--
> > > mm/mm_init.c | 1 +
> > > 3 files changed, 10 insertions(+), 2 deletions(-)
> > >
> > > diff --git a/include/linux/padata.h b/include/linux/padata.h
> > > index 495b16b6b4d72..f79ccd50e7f40 100644
> > > --- a/include/linux/padata.h
> > > +++ b/include/linux/padata.h
> > > @@ -137,6 +137,8 @@ struct padata_shell {
> > > * appropriate for one worker thread to do at once.
> > > * @max_threads: Max threads to use for the job, actual number may be less
> > > * depending on task size and minimum chunk size.
> > > + * @numa_aware: Dispatch jobs to different nodes. If a node only has memory but
> > > + * no CPU, dispatch its jobs to a random CPU.
> > > */
> > > struct padata_mt_job {
> > > void (*thread_fn)(unsigned long start, unsigned long end, void *arg);
> > > @@ -146,6 +148,7 @@ struct padata_mt_job {
> > > unsigned long align;
> > > unsigned long min_chunk;
> > > int max_threads;
> > > + bool numa_aware;
> > > };
> > >
> > > /**
> > > diff --git a/kernel/padata.c b/kernel/padata.c
> > > index 179fb1518070c..1c2b3a337479e 100644
> > > --- a/kernel/padata.c
> > > +++ b/kernel/padata.c
> > > @@ -485,7 +485,7 @@ void __init padata_do_multithreaded(struct padata_mt_job *job)
> > > struct padata_work my_work, *pw;
> > > struct padata_mt_job_state ps;
> > > LIST_HEAD(works);
> > > - int nworks;
> > > + int nworks, nid = 0;
> >
> > If we always start from 0, we may be biased towards the low numbered node,
> > and not use high numbered nodes at all. Suggest you do
> > static nid = 0;
> >
>
> When we use `static`, if there are multiple parallel calls to
> `padata_do_multithreaded`, it may result in an uneven distribution of
> tasks for each padata_do_multithreaded.
>
> We can make the following modifications to address this issue.
>
> ```
> diff --git a/kernel/padata.c b/kernel/padata.c
> index 1c2b3a337479e..925e48df6dd8d 100644
> --- a/kernel/padata.c
> +++ b/kernel/padata.c
> @@ -485,7 +485,8 @@ void __init padata_do_multithreaded(struct
> padata_mt_job *job)
> struct padata_work my_work, *pw;
> struct padata_mt_job_state ps;
> LIST_HEAD(works);
> - int nworks, nid = 0;
> + int nworks, nid;
> + static volatile int global_nid = 0;
>
> if (job->size == 0)
> return;
> @@ -516,12 +517,15 @@ void __init padata_do_multithreaded(struct
> padata_mt_job *job)
> ps.chunk_size = max(ps.chunk_size, job->min_chunk);
> ps.chunk_size = roundup(ps.chunk_size, job->align);
>
> + nid = global_nid;
> list_for_each_entry(pw, &works, pw_list)
> - if (job->numa_aware)
> - queue_work_node((++nid % num_node_state(N_MEMORY)),
> - system_unbound_wq, &pw->pw_work);
> - else
> + if (job->numa_aware) {
> + queue_work_node(nid, system_unbound_wq,
> &pw->pw_work);
> + nid = next_node(nid, node_states[N_CPU]);
> + } else
> queue_work(system_unbound_wq, &pw->pw_work);
> + if (job->numa_aware)
> + global_nid = nid;
Thinking more about it, there could still be multiple threads working
at the same time with a stale global_nid. We should probably do a
compare-exchange that updates global_nid with the new nid only if
global_nid was unchanged. Otherwise we should advance to the next node
from the changed global_nid before we queue the job.
Tim
On 2024/1/13 02:27, Tim Chen wrote:
> On Fri, 2024-01-12 at 15:09 +0800, Gang Li wrote:
>> On 2024/1/12 01:50, Tim Chen wrote:
>>> On Tue, 2024-01-02 at 21:12 +0800, Gang Li wrote:
>>>> When a group of tasks that access different nodes are scheduled on the
>>>> same node, they may encounter bandwidth bottlenecks and access latency.
>>>>
>>>> Thus, numa_aware flag is introduced here, allowing tasks to be
>>>> distributed across different nodes to fully utilize the advantage of
>>>> multi-node systems.
>>>>
>>>> Signed-off-by: Gang Li <[email protected]>
>>>> ---
>>>> include/linux/padata.h | 3 +++
>>>> kernel/padata.c | 8 ++++++--
>>>> mm/mm_init.c | 1 +
>>>> 3 files changed, 10 insertions(+), 2 deletions(-)
>>>>
>>>> diff --git a/include/linux/padata.h b/include/linux/padata.h
>>>> index 495b16b6b4d72..f79ccd50e7f40 100644
>>>> --- a/include/linux/padata.h
>>>> +++ b/include/linux/padata.h
>>>> @@ -137,6 +137,8 @@ struct padata_shell {
>>>> * appropriate for one worker thread to do at once.
>>>> * @max_threads: Max threads to use for the job, actual number may be less
>>>> * depending on task size and minimum chunk size.
>>>> + * @numa_aware: Dispatch jobs to different nodes. If a node only has memory but
>>>> + * no CPU, dispatch its jobs to a random CPU.
>>>> */
>>>> struct padata_mt_job {
>>>> void (*thread_fn)(unsigned long start, unsigned long end, void *arg);
>>>> @@ -146,6 +148,7 @@ struct padata_mt_job {
>>>> unsigned long align;
>>>> unsigned long min_chunk;
>>>> int max_threads;
>>>> + bool numa_aware;
>>>> };
>>>>
>>>> /**
>>>> diff --git a/kernel/padata.c b/kernel/padata.c
>>>> index 179fb1518070c..1c2b3a337479e 100644
>>>> --- a/kernel/padata.c
>>>> +++ b/kernel/padata.c
>>>> @@ -485,7 +485,7 @@ void __init padata_do_multithreaded(struct padata_mt_job *job)
>>>> struct padata_work my_work, *pw;
>>>> struct padata_mt_job_state ps;
>>>> LIST_HEAD(works);
>>>> - int nworks;
>>>> + int nworks, nid = 0;
>>>
>>> If we always start from 0, we may be biased towards the low numbered node,
>>> and not use high numbered nodes at all. Suggest you do
>>> static nid = 0;
>>>
>>
>> When we use `static`, if there are multiple parallel calls to
>> `padata_do_multithreaded`, it may result in an uneven distribution of
>> tasks for each padata_do_multithreaded.
>>
>> We can make the following modifications to address this issue.
>>
>> ```
>> diff --git a/kernel/padata.c b/kernel/padata.c
>> index 1c2b3a337479e..925e48df6dd8d 100644
>> --- a/kernel/padata.c
>> +++ b/kernel/padata.c
>> @@ -485,7 +485,8 @@ void __init padata_do_multithreaded(struct
>> padata_mt_job *job)
>> struct padata_work my_work, *pw;
>> struct padata_mt_job_state ps;
>> LIST_HEAD(works);
>> - int nworks, nid = 0;
>> + int nworks, nid;
>> + static volatile int global_nid = 0;
>>
>> if (job->size == 0)
>> return;
>> @@ -516,12 +517,15 @@ void __init padata_do_multithreaded(struct
>> padata_mt_job *job)
>> ps.chunk_size = max(ps.chunk_size, job->min_chunk);
>> ps.chunk_size = roundup(ps.chunk_size, job->align);
>>
>> + nid = global_nid;
>> list_for_each_entry(pw, &works, pw_list)
>> - if (job->numa_aware)
>> - queue_work_node((++nid % num_node_state(N_MEMORY)),
>> - system_unbound_wq, &pw->pw_work);
>> - else
>> + if (job->numa_aware) {
>> + queue_work_node(nid, system_unbound_wq,
>> &pw->pw_work);
>> + nid = next_node(nid, node_states[N_CPU]);
>> + } else
>> queue_work(system_unbound_wq, &pw->pw_work);
>> + if (job->numa_aware)
>> + global_nid = nid;
>
> Thinking more about it, there could still be multiple threads working
> at the same time with stale global_nid. We should probably do a compare
> exchange of global_nid with new nid only if the global nid was unchanged.
> Otherwise we should go to the next node with the changed global nid before
> we queue the job.
>
> Tim
>
How about:
```
nid = global_nid;
list_for_each_entry(pw, &works, pw_list)
if (job->numa_aware) {
int old_node = nid;
queue_work_node(nid, system_unbound_wq, &pw->pw_work);
nid = next_node(nid, node_states[N_CPU]);
cmpxchg(&global_nid, old_node, nid);
} else
queue_work(system_unbound_wq, &pw->pw_work);
```
On 2024/1/2 21:12, Gang Li wrote:
> 1G and 2M huge pages have different allocation and initialization logic,
> which leads to subtle differences in parallelization. Therefore, it is
> appropriate to split hugetlb_hstate_alloc_pages into gigantic and
> non-gigantic.
>
> This patch has no functional changes.
>
> Signed-off-by: Gang Li <[email protected]>
> ---
> mm/hugetlb.c | 86 +++++++++++++++++++++++++++-------------------------
> 1 file changed, 45 insertions(+), 41 deletions(-)
>
> diff --git a/mm/hugetlb.c b/mm/hugetlb.c
> index 2606135ec55e6..92448e747991d 100644
> --- a/mm/hugetlb.c
> +++ b/mm/hugetlb.c
> @@ -3509,6 +3509,47 @@ static void __init hugetlb_hstate_alloc_pages_report(unsigned long allocated, st
> }
> }
>
> +static unsigned long __init hugetlb_hstate_alloc_pages_gigantic(struct hstate *h)
The name is rather long; how about hugetlb_gigantic_pages_alloc_boot?
> +{
> + unsigned long i;
> +
> + for (i = 0; i < h->max_huge_pages; ++i) {
> + /*
> + * gigantic pages not added to list as they are not
> + * added to pools now.
> + */
> + if (!alloc_bootmem_huge_page(h, NUMA_NO_NODE))
> + break;
> + cond_resched();
> + }
> +
> + return i;
> +}
> +
> +static unsigned long __init hugetlb_hstate_alloc_pages_non_gigantic(struct hstate *h)
hugetlb_pages_alloc_boot?
> +{
> + unsigned long i;
> + struct folio *folio;
> + LIST_HEAD(folio_list);
> + nodemask_t node_alloc_noretry;
> +
> + /* Bit mask controlling how hard we retry per-node allocations.*/
> + nodes_clear(node_alloc_noretry);
> +
> + for (i = 0; i < h->max_huge_pages; ++i) {
> + folio = alloc_pool_huge_folio(h, &node_states[N_MEMORY],
> + &node_alloc_noretry);
> + if (!folio)
> + break;
> + list_add(&folio->lru, &folio_list);
> + cond_resched();
> + }
> +
> + prep_and_add_allocated_folios(h, &folio_list);
> +
> + return i;
> +}
> +
> /*
> * NOTE: this routine is called in different contexts for gigantic and
> * non-gigantic pages.
> @@ -3522,10 +3563,7 @@ static void __init hugetlb_hstate_alloc_pages_report(unsigned long allocated, st
> */
> static void __init hugetlb_hstate_alloc_pages(struct hstate *h)
> {
> - unsigned long i;
> - struct folio *folio;
> - LIST_HEAD(folio_list);
> - nodemask_t *node_alloc_noretry;
> + unsigned long allocated;
>
> /* skip gigantic hugepages allocation if hugetlb_cma enabled */
> if (hstate_is_gigantic(h) && hugetlb_cma_size) {
> @@ -3539,46 +3577,12 @@ static void __init hugetlb_hstate_alloc_pages(struct hstate *h)
>
> /* below will do all node balanced alloc */
> if (!hstate_is_gigantic(h)) {
It is unnecessary to reverse the condition. A little simpler, like the following:
if (hstate_is_gigantic(h))
/* gigantic pages */
else
/* normal pages */
> - /*
> - * Bit mask controlling how hard we retry per-node allocations.
> - * Ignore errors as lower level routines can deal with
> - * node_alloc_noretry == NULL. If this kmalloc fails at boot
> - * time, we are likely in bigger trouble.
> - */
> - node_alloc_noretry = kmalloc(sizeof(*node_alloc_noretry),
> - GFP_KERNEL);
> + allocated = hugetlb_hstate_alloc_pages_non_gigantic(h);
> } else {
> - /* allocations done at boot time */
> - node_alloc_noretry = NULL;
> - }
> -
> - /* bit mask controlling how hard we retry per-node allocations */
> - if (node_alloc_noretry)
> - nodes_clear(*node_alloc_noretry);
> -
> - for (i = 0; i < h->max_huge_pages; ++i) {
> - if (hstate_is_gigantic(h)) {
> - /*
> - * gigantic pages not added to list as they are not
> - * added to pools now.
> - */
> - if (!alloc_bootmem_huge_page(h, NUMA_NO_NODE))
> - break;
> - } else {
> - folio = alloc_pool_huge_folio(h, &node_states[N_MEMORY],
> - node_alloc_noretry);
> - if (!folio)
> - break;
> - list_add(&folio->lru, &folio_list);
> - }
> - cond_resched();
> + allocated = hugetlb_hstate_alloc_pages_gigantic(h);
> }
>
> - /* list will be empty if hstate_is_gigantic */
> - prep_and_add_allocated_folios(h, &folio_list);
> -
> - hugetlb_hstate_alloc_pages_report(i, h);
> - kfree(node_alloc_noretry);
> + hugetlb_hstate_alloc_pages_report(allocated, h);
> }
>
> static void __init hugetlb_init_hstates(void)
On 2024/1/16 15:02, Muchun Song wrote:
> On 2024/1/2 21:12, Gang Li wrote:
>> hugetlb_hstate_alloc_pages_gigantic(struct hstate *h)
>
> The name is so long, how about hugetlb_gigantic_pages_alloc_boot?
>
>> hugetlb_hstate_alloc_pages_non_gigantic(struct hstate *h)
>
> hugetlb_pages_alloc_boot?
LGTM.
> It is unnecessary to reverse the condition. A little simpler, like the following:
>
> if (hstate_is_gigantic(h))
> /* gigantic pages */
> else
> /* normal pages */
Will take it in the next version.
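As a quick check of how the function reads with both suggestions applied, here is a sketch (not the final patch) of the dispatcher with the renamed helpers and the condition no longer reversed; the earlier hugetlb_cma and node-specific checks are elided:
```
static void __init hugetlb_hstate_alloc_pages(struct hstate *h)
{
	unsigned long allocated;

	/* ... hugetlb_cma and node-specific allocation handling elided ... */

	if (hstate_is_gigantic(h))
		allocated = hugetlb_gigantic_pages_alloc_boot(h);
	else
		allocated = hugetlb_pages_alloc_boot(h);

	hugetlb_hstate_alloc_pages_report(allocated, h);
}
```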
> On Jan 2, 2024, at 21:12, Gang Li <[email protected]> wrote:
>
> Now hugetlb uses padata_do_multithreaded for parallel initialization,
> so select CONFIG_PADATA.
>
> Signed-off-by: Gang Li <[email protected]>
Reviewed-by: Muchun Song <[email protected]>
On Mon, 2024-01-15 at 16:57 +0800, Gang Li wrote:
>
> On 2024/1/13 02:27, Tim Chen wrote:
> > On Fri, 2024-01-12 at 15:09 +0800, Gang Li wrote:
> > > On 2024/1/12 01:50, Tim Chen wrote:
> > > > On Tue, 2024-01-02 at 21:12 +0800, Gang Li wrote:
> > > > > When a group of tasks that access different nodes are scheduled on the
> > > > > same node, they may encounter bandwidth bottlenecks and access latency.
> > > > >
> > > > > Thus, numa_aware flag is introduced here, allowing tasks to be
> > > > > distributed across different nodes to fully utilize the advantage of
> > > > > multi-node systems.
> > > > >
> > > > > Signed-off-by: Gang Li <[email protected]>
> > > > > ---
> > > > > include/linux/padata.h | 3 +++
> > > > > kernel/padata.c | 8 ++++++--
> > > > > mm/mm_init.c | 1 +
> > > > > 3 files changed, 10 insertions(+), 2 deletions(-)
> > > > >
> > > > > diff --git a/include/linux/padata.h b/include/linux/padata.h
> > > > > index 495b16b6b4d72..f79ccd50e7f40 100644
> > > > > --- a/include/linux/padata.h
> > > > > +++ b/include/linux/padata.h
> > > > > @@ -137,6 +137,8 @@ struct padata_shell {
> > > > > * appropriate for one worker thread to do at once.
> > > > > * @max_threads: Max threads to use for the job, actual number may be less
> > > > > * depending on task size and minimum chunk size.
> > > > > + * @numa_aware: Dispatch jobs to different nodes. If a node only has memory but
> > > > > + * no CPU, dispatch its jobs to a random CPU.
> > > > > */
> > > > > struct padata_mt_job {
> > > > > void (*thread_fn)(unsigned long start, unsigned long end, void *arg);
> > > > > @@ -146,6 +148,7 @@ struct padata_mt_job {
> > > > > unsigned long align;
> > > > > unsigned long min_chunk;
> > > > > int max_threads;
> > > > > + bool numa_aware;
> > > > > };
> > > > >
> > > > > /**
> > > > > diff --git a/kernel/padata.c b/kernel/padata.c
> > > > > index 179fb1518070c..1c2b3a337479e 100644
> > > > > --- a/kernel/padata.c
> > > > > +++ b/kernel/padata.c
> > > > > @@ -485,7 +485,7 @@ void __init padata_do_multithreaded(struct padata_mt_job *job)
> > > > > struct padata_work my_work, *pw;
> > > > > struct padata_mt_job_state ps;
> > > > > LIST_HEAD(works);
> > > > > - int nworks;
> > > > > + int nworks, nid = 0;
> > > >
> > > > If we always start from 0, we may be biased towards the low numbered node,
> > > > and not use high numbered nodes at all. Suggest you do
> > > > static nid = 0;
> > > >
> > >
> > > When we use `static`, if there are multiple parallel calls to
> > > `padata_do_multithreaded`, it may result in an uneven distribution of
> > > tasks for each padata_do_multithreaded.
> > >
> > > We can make the following modifications to address this issue.
> > >
> > > ```
> > > diff --git a/kernel/padata.c b/kernel/padata.c
> > > index 1c2b3a337479e..925e48df6dd8d 100644
> > > --- a/kernel/padata.c
> > > +++ b/kernel/padata.c
> > > @@ -485,7 +485,8 @@ void __init padata_do_multithreaded(struct
> > > padata_mt_job *job)
> > > struct padata_work my_work, *pw;
> > > struct padata_mt_job_state ps;
> > > LIST_HEAD(works);
> > > - int nworks, nid = 0;
> > > + int nworks, nid;
> > > + static volatile int global_nid = 0;
> > >
> > > if (job->size == 0)
> > > return;
> > > @@ -516,12 +517,15 @@ void __init padata_do_multithreaded(struct
> > > padata_mt_job *job)
> > > ps.chunk_size = max(ps.chunk_size, job->min_chunk);
> > > ps.chunk_size = roundup(ps.chunk_size, job->align);
> > >
> > > + nid = global_nid;
> > > list_for_each_entry(pw, &works, pw_list)
> > > - if (job->numa_aware)
> > > - queue_work_node((++nid % num_node_state(N_MEMORY)),
> > > - system_unbound_wq, &pw->pw_work);
> > > - else
> > > + if (job->numa_aware) {
> > > + queue_work_node(nid, system_unbound_wq,
> > > &pw->pw_work);
> > > + nid = next_node(nid, node_states[N_CPU]);
> > > + } else
> > > queue_work(system_unbound_wq, &pw->pw_work);
> > > + if (job->numa_aware)
> > > + global_nid = nid;
> >
> > Thinking more about it, there could still be multiple threads working
> > at the same time with stale global_nid. We should probably do a compare
> > exchange of global_nid with new nid only if the global nid was unchanged.
> > Otherwise we should go to the next node with the changed global nid before
> > we queue the job.
> >
> > Tim
> >
> How about:
> ```
> nid = global_nid;
> list_for_each_entry(pw, &works, pw_list)
> if (job->numa_aware) {
> int old_node = nid;
> queue_work_node(nid, system_unbound_wq, &pw->pw_work);
> nid = next_node(nid, node_states[N_CPU]);
> cmpxchg(&global_nid, old_node, nid);
> } else
> queue_work(system_unbound_wq, &pw->pw_work);
>
> ```
>
I am thinking something like
static volatile atomic_t last_used_nid;
list_for_each_entry(pw, &works, pw_list)
if (job->numa_aware) {
int old_node = atomic_read(&last_used_nid);
do {
nid = next_node_in(old_node, node_states[N_CPU]);
} while (!atomic_try_cmpxchg(&last_used_nid, &old_node, nid));
queue_work_node(nid, system_unbound_wq, &pw->pw_work);
} else {
queue_work(system_unbound_wq, &pw->pw_work);
}
Note that we need to use next_node_in so we'll wrap around the node mask.
Tim
Hi Tim,
On 2024/1/18 06:14, Tim Chen wrote:
> On Mon, 2024-01-15 at 16:57 +0800, Gang Li wrote:
>> How about:
>> ```
>> nid = global_nid;
>> list_for_each_entry(pw, &works, pw_list)
>> if (job->numa_aware) {
>> int old_node = nid;
>> queue_work_node(nid, system_unbound_wq, &pw->pw_work);
>> nid = next_node(nid, node_states[N_CPU]);
>> cmpxchg(&global_nid, old_node, nid);
>> } else
>> queue_work(system_unbound_wq, &pw->pw_work);
>>
>> ```
>>
My original idea was to have all tasks from a single
padata_do_multithreaded call distributed across consecutive NUMA nodes,
so the task distribution would be predictable for any single
padata_do_multithreaded call.
>
> I am thinking something like
>
> static volatile atomic_t last_used_nid;
>
> list_for_each_entry(pw, &works, pw_list)
> if (job->numa_aware) {
> int old_node = atomic_read(&last_used_nid);
>
> do {
> nid = next_node_in(old_node, node_states[N_CPU]);
> } while (!atomic_try_cmpxchg(&last_used_nid, &old_node, nid));
However, distributing the tasks from all parallel padata_do_multithreaded
calls across NUMA nodes globally is also fine by me.
I don't have a strong preference.
> queue_work_node(nid, system_unbound_wq, &pw->pw_work);
> } else {
> queue_work(system_unbound_wq, &pw->pw_work);
> }
>
> Note that we need to use next_node_in so we'll wrap around the node mask.
>
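To make the numa_aware knob concrete, here is a hedged sketch of how a caller such as the hugetlb boot allocation might fill in a padata_mt_job; the thread function name and the min_chunk/max_threads sizing are illustrative assumptions, not taken from the series:
```
/*
 * Hypothetical caller: spread h->max_huge_pages worth of work across
 * memory nodes, letting padata dispatch one worker per node when
 * numa_aware is set.
 */
static void __init hugetlb_pages_alloc_boot_parallel(struct hstate *h)
{
	struct padata_mt_job job = {
		.thread_fn	= hugetlb_pages_alloc_boot_node, /* assumed worker */
		.fn_arg		= h,
		.start		= 0,
		.size		= h->max_huge_pages,
		.align		= 1,
		.min_chunk	= h->max_huge_pages / num_node_state(N_MEMORY),
		.max_threads	= num_node_state(N_MEMORY) * 2,
		.numa_aware	= true,
	};

	padata_do_multithreaded(&job);
}
```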