Received: by 2002:a05:6a10:6744:0:0:0:0 with SMTP id w4csp5414279pxu; Thu, 22 Oct 2020 01:23:07 -0700 (PDT) X-Google-Smtp-Source: ABdhPJz5KExpJbXN5RzqimotID14g9SkMuiPzN6zgS96UhDf4cH+1IrYmbpnOAyi5EBwKl74qEAa X-Received: by 2002:a50:8fc5:: with SMTP id y63mr1221448edy.10.1603354986946; Thu, 22 Oct 2020 01:23:06 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1603354986; cv=none; d=google.com; s=arc-20160816; b=CuYHiMiAxrAgrQ4Wk+/jN2C8B6EclMuqg9gRxhDAuI/NlwrBlLwZSniexfk/UmGsNw NgyRSLXczXBOaUu/I8NRSuC2DvPdrLCuI2dVeaZMRsIcbblQTLpo/CuXdz+xhvORwFnH VHjfIhmALeDgh0Pgg4RIzql+OC1AIi+cppuA0XHwCPnFohXUXr9NV2/uQqH8OzGZasbW z2avtcHfswrUXTovGUXhQ1mTBSVoJwB5iwrlGsOgWNNE0wRoHBpUcnnaUctRCDHX654E audBEA7sbyCN9V/jsVDuftGkazL9IQIGiM3xeFHPzLXk2LObigsvsfLxDCuavqpO6nCR nPcg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :message-id:subject:cc:to:from:date; bh=77pfmz0rvX0pdLEi0aiMSKPtpOT/Po3BSoKAiP6EmbM=; b=LHOBEWS/UNLfgfjmH+hfISV7fi+sey7G06KGDQwndxInfgAsCDIqKHd34HVl9v1t9/ Jqx6YipXfTWKYAJR+EEwfMjkaWJSPdSu4d5JjoU+yfHyO7yPa3TQC+mWT9eXlNAQ7Zi2 JfTI/lHDAhtaxq5EqLkrUKgR6/XOEebjx85F2K6cQIkOR3UVTSq1qhoe8wu5tL+fziyK n8bM1P8/7t9XT/MyX3u8jBth77D7CmMogT3An8LlhCXer8JXPvRMm15aqkHZq8SckEiq 9xRCDfwkPUr3qqA0NHl5do82kDoMayic+JyGvCVtEQ37J/jc9Q7qzkgCjDaSmMp2tE2s nGoA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id l90si478001edl.249.2020.10.22.01.22.44; Thu, 22 Oct 2020 01:23:06 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2895541AbgJVDs4 (ORCPT + 99 others); Wed, 21 Oct 2020 23:48:56 -0400 Received: from shelob.surriel.com ([96.67.55.147]:33498 "EHLO shelob.surriel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2443824AbgJVDs4 (ORCPT ); Wed, 21 Oct 2020 23:48:56 -0400 Received: from [2603:3005:d05:2b00:6e0b:84ff:fee2:98bb] (helo=imladris.surriel.com) by shelob.surriel.com with esmtpsa (TLS1.2) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94) (envelope-from ) id 1kVRaq-0000iQ-SX; Wed, 21 Oct 2020 23:48:48 -0400 Date: Wed, 21 Oct 2020 23:48:46 -0400 From: Rik van Riel To: Hugh Dickins Cc: Andrew Morton , linux-mm@kvack.org, linux-kernel@vger.kernel.org, kernel-team@fb.com, Mel Gorman , Andrea Arcangeli , Matthew Wilcox Subject: [PATCH] mm,thp,shmem: limit shmem THP alloc gfp_mask Message-ID: <20201021234846.5cc97e62@imladris.surriel.com> X-Mailer: Claws Mail 3.17.6 (GTK+ 2.24.32; x86_64-redhat-linux-gnu) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: riel@shelob.surriel.com Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org The allocation flags of anonymous transparent huge pages can be controlled through the files in /sys/kernel/mm/transparent_hugepage/defrag, which can help the system from getting bogged down in the page reclaim and compaction code when many THPs are getting allocated simultaneously. However, the gfp_mask for shmem THP allocations were not limited by those configuration settings, and some workloads ended up with all CPUs stuck on the LRU lock in the page reclaim code, trying to allocate dozens of THPs simultaneously. This patch applies the same configurated limitation of THPs to shmem hugepage allocations, to prevent that from happening. This way a THP defrag setting of "never" or "defer+madvise" will result in quick allocation failures without direct reclaim when no 2MB free pages are available. Signed-off-by: Rik van Riel --- diff --git a/include/linux/gfp.h b/include/linux/gfp.h index c603237e006c..0a5b164a26d9 100644 --- a/include/linux/gfp.h +++ b/include/linux/gfp.h @@ -614,6 +614,8 @@ bool gfp_pfmemalloc_allowed(gfp_t gfp_mask); extern void pm_restrict_gfp_mask(void); extern void pm_restore_gfp_mask(void); +extern gfp_t alloc_hugepage_direct_gfpmask(struct vm_area_struct *vma); + #ifdef CONFIG_PM_SLEEP extern bool pm_suspended_storage(void); #else diff --git a/mm/huge_memory.c b/mm/huge_memory.c index 9474dbc150ed..9b08ce5cc387 100644 --- a/mm/huge_memory.c +++ b/mm/huge_memory.c @@ -649,7 +649,7 @@ static vm_fault_t __do_huge_pmd_anonymous_page(struct vm_fault *vmf, * available * never: never stall for any thp allocation */ -static inline gfp_t alloc_hugepage_direct_gfpmask(struct vm_area_struct *vma) +gfp_t alloc_hugepage_direct_gfpmask(struct vm_area_struct *vma) { const bool vma_madvised = !!(vma->vm_flags & VM_HUGEPAGE); diff --git a/mm/shmem.c b/mm/shmem.c index 537c137698f8..d1290eb508e5 100644 --- a/mm/shmem.c +++ b/mm/shmem.c @@ -1545,8 +1545,11 @@ static struct page *shmem_alloc_hugepage(gfp_t gfp, return NULL; shmem_pseudo_vma_init(&pvma, info, hindex); - page = alloc_pages_vma(gfp | __GFP_COMP | __GFP_NORETRY | __GFP_NOWARN, - HPAGE_PMD_ORDER, &pvma, 0, numa_node_id(), true); + /* Limit the gfp mask according to THP configuration. */ + gfp |= __GFP_COMP | __GFP_NORETRY | __GFP_NOWARN; + gfp &= alloc_hugepage_direct_gfpmask(&pvma); + page = alloc_pages_vma(gfp, HPAGE_PMD_ORDER, &pvma, 0, numa_node_id(), + true); shmem_pseudo_vma_destroy(&pvma); if (page) prep_transhuge_page(page); -- All rights reversed.