Received: by 2002:a05:6a10:f3d0:0:0:0:0 with SMTP id a16csp3776775pxv; Mon, 28 Jun 2021 12:35:25 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzpyTZnQ2Tq8/jdoasjGztJuXCbuwRftk63np4Itampa6WhmWs9PtsR32WEcMCnlx+2qmhy X-Received: by 2002:a92:b07:: with SMTP id b7mr18133407ilf.132.1624908925150; Mon, 28 Jun 2021 12:35:25 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1624908925; cv=none; d=google.com; s=arc-20160816; b=KMbxpoM5PVzw6IhpQHrnBeb7Li2IV41qib5RAiRAvyC7bDWqqqBBQPPmD4YPq2cztZ EfBG2KOdd+I+8c+a8ioaujWWmHwtBdlFUt//67HCrcobjhF6GMpvx/TiI5TdRKq6mP11 WVfOeAZhmYR1w5PRNk4b6R8sjwH2apr8qdhsX89r+x5kK8dH6olzh9FAh/yYjgtaHQkM F/kFaHM4mos/OnZODK4YF7mB9Yt9IYuzpeEh256PCeK1AtDN41w+EyNe1ZLRoA1uTMpm yTYWELDmStN0CVffmW4p1+pNqjTA6X+2jBNUv4lKxWaNcywDQU8nZg9FVWeSwwSFYcOp Ca2w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from :dkim-signature; bh=vnvtEX1k7MQbPN64EUj/act+2DIjur/WObU76C204x0=; b=wqHHX/90jHmhQfktp4SvpeloHvxDKJxVVJh9ulo55FvfRnQ5oB7J81Q6ZijmjMDDmS 7Q2qO32/k45OGt5a0rO81UwBBWs1RPhzh32d5PI60Ld0+90GO84eyBwK4jpIbDP4+uAK MwvmkZQjk06t5mbh5WUVfBsIjKCJN2/k17rm5W76wZ09Pfu39txw3Pv1QmdLK4jNGnAP VCXzfXGQgDa5nKuxwolaA7G/VV3eHeaur+hZQBfZre7AgRhbrfM2pDji3ApC8Xi4XeDX dJ0Qrfd77/XJBuXieGJa+XBtC5dXsZhEePH2LOLI9bsAQ/fFHxLnSE0wU/ocL9+eFkiG MK2g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=X0rZ2ohd; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id c8si15122249iln.97.2021.06.28.12.35.12; Mon, 28 Jun 2021 12:35:25 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=X0rZ2ohd; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S236590AbhF1Ots (ORCPT + 99 others); Mon, 28 Jun 2021 10:49:48 -0400 Received: from mail.kernel.org ([198.145.29.99]:43014 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234727AbhF1Oh3 (ORCPT ); Mon, 28 Jun 2021 10:37:29 -0400 Received: by mail.kernel.org (Postfix) with ESMTPSA id DEBE461CB6; Mon, 28 Jun 2021 14:30:51 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1624890653; bh=C8NrAGTgym9T/0CtgOZMrTOtuVqXQdyXiyQ8zDt9Rvg=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=X0rZ2ohdwobH5j2mJ9hIzQK/tIzPi6ntMy0YHPr9w+AzVYd993Q1cU/CLdeWx+VhD 2WWDqhEYAuxVEETb5X/AHxeZ1w1SpbJVCU9pD+I1GrIQ6GyMLN0modsb29768Sc1/Q Lq2EqasOuqVBvtADItOfs0Dc2+TnkNM71OM1yeU9MG3QMlTof+7zQu6swoI1Nqfm7C HFz74a7xGRpOOt6YHrl5OYOCbFEB63m4mxxo2D0Km8l6n4+CdDVOOcR0K7IIdbj6Y1 gRDJEiNxZB+XD53IMKXH2FZF7GaNv+p2bef68R63ePpAm+C1TmyewYbH55supX+/Pk U2WL8NZHCX4DA== From: Sasha Levin To: linux-kernel@vger.kernel.org, stable@vger.kernel.org Cc: Hugh Dickins , "Kirill A . Shutemov" , Yang Shi , Alistair Popple , Jan Kara , Jue Wang , "Matthew Wilcox (Oracle)" , Miaohe Lin , Minchan Kim , Naoya Horiguchi , Oscar Salvador , Peter Xu , Ralph Campbell , Shakeel Butt , Wang Yugui , Zi Yan , Andrew Morton , Linus Torvalds , Greg Kroah-Hartman Subject: [PATCH 5.4 50/71] mm/thp: make is_huge_zero_pmd() safe and quicker Date: Mon, 28 Jun 2021 10:29:43 -0400 Message-Id: <20210628143004.32596-51-sashal@kernel.org> X-Mailer: git-send-email 2.30.2 In-Reply-To: <20210628143004.32596-1-sashal@kernel.org> References: <20210628143004.32596-1-sashal@kernel.org> MIME-Version: 1.0 X-KernelTest-Patch: http://kernel.org/pub/linux/kernel/v5.x/stable-review/patch-5.4.129-rc1.gz X-KernelTest-Tree: git://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-stable-rc.git X-KernelTest-Branch: linux-5.4.y X-KernelTest-Patches: git://git.kernel.org/pub/scm/linux/kernel/git/stable/stable-queue.git X-KernelTest-Version: 5.4.129-rc1 X-KernelTest-Deadline: 2021-06-30T14:29+00:00 X-stable: review X-Patchwork-Hint: Ignore Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Hugh Dickins commit 3b77e8c8cde581dadab9a0f1543a347e24315f11 upstream. Most callers of is_huge_zero_pmd() supply a pmd already verified present; but a few (notably zap_huge_pmd()) do not - it might be a pmd migration entry, in which the pfn is encoded differently from a present pmd: which might pass the is_huge_zero_pmd() test (though not on x86, since L1TF forced us to protect against that); or perhaps even crash in pmd_page() applied to a swap-like entry. Make it safe by adding pmd_present() check into is_huge_zero_pmd() itself; and make it quicker by saving huge_zero_pfn, so that is_huge_zero_pmd() will not need to do that pmd_page() lookup each time. __split_huge_pmd_locked() checked pmd_trans_huge() before: that worked, but is unnecessary now that is_huge_zero_pmd() checks present. Link: https://lkml.kernel.org/r/21ea9ca-a1f5-8b90-5e88-95fb1c49bbfa@google.com Fixes: e71769ae5260 ("mm: enable thp migration for shmem thp") Signed-off-by: Hugh Dickins Acked-by: Kirill A. Shutemov Reviewed-by: Yang Shi Cc: Alistair Popple Cc: Jan Kara Cc: Jue Wang Cc: "Matthew Wilcox (Oracle)" Cc: Miaohe Lin Cc: Minchan Kim Cc: Naoya Horiguchi Cc: Oscar Salvador Cc: Peter Xu Cc: Ralph Campbell Cc: Shakeel Butt Cc: Wang Yugui Cc: Zi Yan Cc: Signed-off-by: Andrew Morton Signed-off-by: Linus Torvalds Signed-off-by: Greg Kroah-Hartman --- include/linux/huge_mm.h | 8 +++++++- mm/huge_memory.c | 5 ++++- 2 files changed, 11 insertions(+), 2 deletions(-) diff --git a/include/linux/huge_mm.h b/include/linux/huge_mm.h index d8b86fd39113..d2dbe462efee 100644 --- a/include/linux/huge_mm.h +++ b/include/linux/huge_mm.h @@ -259,6 +259,7 @@ struct page *follow_devmap_pud(struct vm_area_struct *vma, unsigned long addr, extern vm_fault_t do_huge_pmd_numa_page(struct vm_fault *vmf, pmd_t orig_pmd); extern struct page *huge_zero_page; +extern unsigned long huge_zero_pfn; static inline bool is_huge_zero_page(struct page *page) { @@ -267,7 +268,7 @@ static inline bool is_huge_zero_page(struct page *page) static inline bool is_huge_zero_pmd(pmd_t pmd) { - return is_huge_zero_page(pmd_page(pmd)); + return READ_ONCE(huge_zero_pfn) == pmd_pfn(pmd) && pmd_present(pmd); } static inline bool is_huge_zero_pud(pud_t pud) @@ -398,6 +399,11 @@ static inline bool is_huge_zero_page(struct page *page) return false; } +static inline bool is_huge_zero_pmd(pmd_t pmd) +{ + return false; +} + static inline bool is_huge_zero_pud(pud_t pud) { return false; diff --git a/mm/huge_memory.c b/mm/huge_memory.c index e74c5a505e2b..47d95048f31e 100644 --- a/mm/huge_memory.c +++ b/mm/huge_memory.c @@ -61,6 +61,7 @@ static struct shrinker deferred_split_shrinker; static atomic_t huge_zero_refcount; struct page *huge_zero_page __read_mostly; +unsigned long huge_zero_pfn __read_mostly = ~0UL; bool transparent_hugepage_enabled(struct vm_area_struct *vma) { @@ -97,6 +98,7 @@ static struct page *get_huge_zero_page(void) __free_pages(zero_page, compound_order(zero_page)); goto retry; } + WRITE_ONCE(huge_zero_pfn, page_to_pfn(zero_page)); /* We take additional reference here. It will be put back by shrinker */ atomic_set(&huge_zero_refcount, 2); @@ -146,6 +148,7 @@ static unsigned long shrink_huge_zero_page_scan(struct shrinker *shrink, if (atomic_cmpxchg(&huge_zero_refcount, 1, 0) == 1) { struct page *zero_page = xchg(&huge_zero_page, NULL); BUG_ON(zero_page == NULL); + WRITE_ONCE(huge_zero_pfn, ~0UL); __free_pages(zero_page, compound_order(zero_page)); return HPAGE_PMD_NR; } @@ -2182,7 +2185,7 @@ static void __split_huge_pmd_locked(struct vm_area_struct *vma, pmd_t *pmd, return; } - if (pmd_trans_huge(*pmd) && is_huge_zero_pmd(*pmd)) { + if (is_huge_zero_pmd(*pmd)) { /* * FIXME: Do we want to invalidate secondary mmu by calling * mmu_notifier_invalidate_range() see comments below inside -- 2.30.2