From: Muchun Song
Date: Wed, 20 Jan 2021 22:22:51 +0800
Subject: Re: [External] Re: [PATCH v13 00/12] Free some vmemmap pages of HugeTLB page
To: Oscar Salvador
Cc: Mike Kravetz, Xiongchun duan, Jonathan Corbet, Thomas Gleixner,
    paulmck@kernel.org, dave.hansen@linux.intel.com,
    anshuman.khandual@arm.com, oneukum@suse.com, bp@alien8.de,
    hpa@zytor.com, x86@kernel.org, Randy Dunlap, mingo@redhat.com,
    mchehab+huawei@kernel.org, luto@kernel.org, Andrew Morton,
    viro@zeniv.linux.org.uk, Peter Zijlstra, David Rientjes, Michal Hocko,
    jroedel@suse.de, Mina Almasry, pawan.kumar.gupta@linux.intel.com,
    HORIGUCHI NAOYA(堀口 直也), David Hildenbrand,
    "Song Bao Hua (Barry Song)", linux-doc@vger.kernel.org, LKML,
    Linux Memory Management List, linux-fsdevel, Matthew Wilcox
References: <20210117151053.24600-1-songmuchun@bytedance.com> <20210120130959.GA7881@localhost.localdomain>
In-Reply-To: <20210120130959.GA7881@localhost.localdomain>
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed, Jan 20, 2021 at 9:10 PM Oscar Salvador wrote:
>
> On Wed, Jan 20, 2021 at 08:52:50PM
> +0800, Muchun Song wrote:
> > Hi Oscar and Mike,
> >
> > Any suggestions about this version? Looking forward to your
> > review. Thanks a lot.
>
> Hi Muchun,
>
> I plan to keep reviewing it in the coming days (tomorrow or Friday).
> I glanced over patch #3 when you posted the series and nothing stuck out besides
> what you have already pointed out, but I will have a further look.

OK. Thanks :)

>
> thanks
>
> > >
> > > Changelog in v11 -> v12:
> > >   - Move VM_WARN_ON_PAGE to a separate patch.
> > >   - Call __free_hugepage() with hugetlb_lock (See patch #5.) to serialize
> > >     with dissolve_free_huge_page(). It is to prepare for patch #9.
> > >   - Introduce PageHugeInflight. See patch #9.
> > >
> > > Changelog in v10 -> v11:
> > >   - Fix compiler error when !CONFIG_HUGETLB_PAGE_FREE_VMEMMAP.
> > >   - Rework some comments and commit changes.
> > >   - Rework vmemmap_remap_free() to 3 parameters.
> > >
> > >   Thanks to Oscar and Mike's suggestions and review.
> > >
> > > Changelog in v9 -> v10:
> > >   - Fix a bug in patch #11. Thanks to Oscar for pointing that out.
> > >   - Rework some commit logs or comments. Thanks Mike and Oscar for the suggestions.
> > >   - Drop VMEMMAP_TAIL_PAGE_REUSE in patch #3.
> > >
> > >   Thank you very much Mike and Oscar for reviewing the code.
> > >
> > > Changelog in v8 -> v9:
> > >   - Rework some code. Many thanks to Oscar.
> > >   - Put all the non-hugetlb vmemmap functions under sparsemem-vmemmap.c.
> > >
> > > Changelog in v7 -> v8:
> > >   - Adjust the order of patches.
> > >
> > >   Many thanks to David and Oscar. Your suggestions are very valuable.
> > >
> > > Changelog in v6 -> v7:
> > >   - Rebase to linux-next 20201130
> > >   - Do not use basepage mapping for vmemmap when this feature is disabled.
> > >   - Rework some patches.
> > >     [PATCH v6 08/16] mm/hugetlb: Free the vmemmap pages associated with each hugetlb page
> > >     [PATCH v6 10/16] mm/hugetlb: Allocate the vmemmap pages associated with each hugetlb page
> > >
> > >   Thanks to Oscar and Barry.
> > >
> > > Changelog in v5 -> v6:
> > >   - Disable PMD/huge page mapping of vmemmap if this feature is enabled.
> > >   - Simplify the first version code.
> > >
> > > Changelog in v4 -> v5:
> > >   - Rework some comments and code in [PATCH v4 04/21] and [PATCH v4 05/21].
> > >
> > >   Thanks to Mike and Oscar for their suggestions.
> > >
> > > Changelog in v3 -> v4:
> > >   - Move all the vmemmap functions to hugetlb_vmemmap.c.
> > >   - Make CONFIG_HUGETLB_PAGE_FREE_VMEMMAP default to y; to disable this
> > >     feature, use the boot/kernel command line.
> > >   - Remove vmemmap_pgtable_{init, deposit, withdraw}() helper functions.
> > >   - Initialize page table lock for vmemmap through core_initcall mechanism.
> > >
> > >   Thanks to Mike and Oscar for their suggestions.
> > >
> > > Changelog in v2 -> v3:
> > >   - Rename some helper functions. Thanks Mike.
> > >   - Rework some code. Thanks Mike and Oscar.
> > >   - Remap the tail vmemmap page with PAGE_KERNEL_RO instead of PAGE_KERNEL.
> > >     Thanks Matthew.
> > >   - Add some overhead analysis in the cover letter.
> > >   - Use vmemmap pmd table lock instead of a hugetlb specific global lock.
> > >
> > > Changelog in v1 -> v2:
> > >   - Fix do not call dissolve_compound_page in alloc_huge_page_vmemmap().
> > >   - Fix some typos and code style problems.
> > >   - Remove unused handle_vmemmap_fault().
> > >   - Merge some commits into one commit, as suggested by Mike.
> > >
> > > Muchun Song (12):
> > >   mm: memory_hotplug: factor out bootmem core functions to
> > >     bootmem_info.c
> > >   mm: hugetlb: introduce a new config HUGETLB_PAGE_FREE_VMEMMAP
> > >   mm: hugetlb: free the vmemmap pages associated with each HugeTLB page
> > >   mm: hugetlb: defer freeing of HugeTLB pages
> > >   mm: hugetlb: allocate the vmemmap pages associated with each HugeTLB
> > >     page
> > >   mm: hugetlb: set the PageHWPoison to the raw error page
> > >   mm: hugetlb: flush work when dissolving a HugeTLB page
> > >   mm: hugetlb: introduce PageHugeInflight
> > >   mm: hugetlb: add a kernel parameter hugetlb_free_vmemmap
> > >   mm: hugetlb: introduce nr_free_vmemmap_pages in the struct hstate
> > >   mm: hugetlb: gather discrete indexes of tail page
> > >   mm: hugetlb: optimize the code with the help of the compiler
> > >
> > >  Documentation/admin-guide/kernel-parameters.txt |  14 ++
> > >  Documentation/admin-guide/mm/hugetlbpage.rst    |   3 +
> > >  arch/x86/mm/init_64.c                           |  13 +-
> > >  fs/Kconfig                                      |  18 ++
> > >  include/linux/bootmem_info.h                    |  65 ++++++
> > >  include/linux/hugetlb.h                         |  37 ++++
> > >  include/linux/hugetlb_cgroup.h                  |  15 +-
> > >  include/linux/memory_hotplug.h                  |  27 ---
> > >  include/linux/mm.h                              |   5 +
> > >  mm/Makefile                                     |   2 +
> > >  mm/bootmem_info.c                               | 124 +++++++++++
> > >  mm/hugetlb.c                                    | 218 +++++++++++++++++--
> > >  mm/hugetlb_vmemmap.c                            | 278 ++++++++++++++++++++++++
> > >  mm/hugetlb_vmemmap.h                            |  45 ++++
> > >  mm/memory_hotplug.c                             | 116 ----------
> > >  mm/sparse-vmemmap.c                             | 273 +++++++++++++++++++++++
> > >  mm/sparse.c                                     |   1 +
> > >  17 files changed, 1082 insertions(+), 172 deletions(-)
> > >  create mode 100644 include/linux/bootmem_info.h
> > >  create mode 100644 mm/bootmem_info.c
> > >  create mode 100644 mm/hugetlb_vmemmap.c
> > >  create mode 100644 mm/hugetlb_vmemmap.h
> > >
> > > --
> > > 2.11.0
> > >
> >
>
> --
> Oscar Salvador
> SUSE L3