Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
	id S1754151Ab3IWMHn (ORCPT ); Mon, 23 Sep 2013 08:07:43 -0400
Received: from mga02.intel.com ([134.134.136.20]:58714 "EHLO mga02.intel.com"
	rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP
	id S1754055Ab3IWMGY (ORCPT ); Mon, 23 Sep 2013 08:06:24 -0400
X-ExtLoop1: 1
X-IronPort-AV: E=Sophos;i="4.90,962,1371106800"; d="scan'208";a="382188963"
From: "Kirill A. Shutemov"
To: Andrea Arcangeli, Andrew Morton
Cc: Al Viro, Hugh Dickins, Wu Fengguang, Jan Kara, Mel Gorman,
	linux-mm@kvack.org, Andi Kleen, Matthew Wilcox, "Kirill A. Shutemov",
	Hillf Danton, Dave Hansen, Ning Qu, Alexander Shishkin,
	linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org,
	"Kirill A. Shutemov"
Subject: [PATCHv6 04/22] thp: compile-time and sysfs knob for thp pagecache
Date: Mon, 23 Sep 2013 15:05:32 +0300
Message-Id: <1379937950-8411-5-git-send-email-kirill.shutemov@linux.intel.com>
X-Mailer: git-send-email 1.8.4.rc3
In-Reply-To: <1379937950-8411-1-git-send-email-kirill.shutemov@linux.intel.com>
References: <1379937950-8411-1-git-send-email-kirill.shutemov@linux.intel.com>
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org
Content-Length: 5263
Lines: 145

For now, TRANSPARENT_HUGEPAGE_PAGECACHE is only implemented for x86_64.
It's disabled by default.

Radix tree preload overhead can be significant on !BASE_FULL systems, so
let's add a dependency on BASE_FULL.

/sys/kernel/mm/transparent_hugepage/page_cache is the runtime knob for the
feature.

Signed-off-by: Kirill A. Shutemov
---
 Documentation/vm/transhuge.txt |  9 +++++++++
 include/linux/huge_mm.h        | 14 ++++++++++++++
 mm/Kconfig                     | 11 +++++++++++
 mm/huge_memory.c               | 23 +++++++++++++++++++++++
 4 files changed, 57 insertions(+)

diff --git a/Documentation/vm/transhuge.txt b/Documentation/vm/transhuge.txt
index 4a63953a41..4cc15c40f4 100644
--- a/Documentation/vm/transhuge.txt
+++ b/Documentation/vm/transhuge.txt
@@ -103,6 +103,15 @@ echo always >/sys/kernel/mm/transparent_hugepage/enabled
 echo madvise >/sys/kernel/mm/transparent_hugepage/enabled
 echo never >/sys/kernel/mm/transparent_hugepage/enabled
 
+If TRANSPARENT_HUGEPAGE_PAGECACHE is enabled kernel will use huge pages in
+page cache if possible. It can be disabled and re-enabled via sysfs:
+
+echo 0 >/sys/kernel/mm/transparent_hugepage/page_cache
+echo 1 >/sys/kernel/mm/transparent_hugepage/page_cache
+
+If it's disabled kernel will not add new huge pages to page cache and
+split them on mapping, but already mapped pages will stay intact.
+
 It's also possible to limit defrag efforts in the VM to generate
 hugepages in case they're not immediately free to madvise regions or
 to never try to defrag memory and simply fallback to regular pages
diff --git a/include/linux/huge_mm.h b/include/linux/huge_mm.h
index 3935428c57..fb0847572c 100644
--- a/include/linux/huge_mm.h
+++ b/include/linux/huge_mm.h
@@ -40,6 +40,7 @@ enum transparent_hugepage_flag {
 	TRANSPARENT_HUGEPAGE_DEFRAG_FLAG,
 	TRANSPARENT_HUGEPAGE_DEFRAG_REQ_MADV_FLAG,
 	TRANSPARENT_HUGEPAGE_DEFRAG_KHUGEPAGED_FLAG,
+	TRANSPARENT_HUGEPAGE_PAGECACHE,
 	TRANSPARENT_HUGEPAGE_USE_ZERO_PAGE_FLAG,
 #ifdef CONFIG_DEBUG_VM
 	TRANSPARENT_HUGEPAGE_DEBUG_COW_FLAG,
@@ -229,4 +230,17 @@ static inline int do_huge_pmd_numa_page(struct mm_struct *mm, struct vm_area_str
 
 #endif /* CONFIG_TRANSPARENT_HUGEPAGE */
 
+static inline bool transparent_hugepage_pagecache(void)
+{
+	if (!IS_ENABLED(CONFIG_TRANSPARENT_HUGEPAGE_PAGECACHE))
+		return false;
+	if (!(transparent_hugepage_flags & (1<