Received: by 2002:ac0:a591:0:0:0:0:0 with SMTP id m17-v6csp1548481imm; Fri, 6 Jul 2018 02:03:25 -0700 (PDT) X-Google-Smtp-Source: AAOMgpcUQrquMpPsCzefyAN+fxncx4nu5epIdhGy2kHDry1KX9Z5x72ljXiguIXzgkGShAkvx2gl X-Received: by 2002:a17:902:7606:: with SMTP id k6-v6mr9378818pll.56.1530867805859; Fri, 06 Jul 2018 02:03:25 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1530867805; cv=none; d=google.com; s=arc-20160816; b=lG8z1Scxv+FI1CbblBe4jcRwCW8WATYZPk9ujrYsFOpsRnanntJXpBrTZB8Umn5wFI 4WPXBOMe7PUesOJifGNYlBlfxC819sHMCjQ/km0y/VbUqG034ymtCvbQHWBVd0dYwSSI WPlUrybHCsdLQjwMRGQZ2eAIl4b2D+ScGF53WfzxInj20Nl7PUQkk1876pKAMZnL8ZiY Trpm3JivaPSQ88zr6ka5QZqNI56sCzRlyAyfvA8dy4eUfKtCkKyAaHxw7eiJgrmaQX1R 6nSAyDlXIe8nogz302K/NVNsqFR6e76ZG1tLtQIfLxrbb/PpK8Gx+qoGtzA9Xbhpzx0G im5Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:references:in-reply-to:message-id:date :subject:cc:to:from:dkim-signature:arc-authentication-results; bh=rBt92G0faVb+l4LzkWBVpJgQsD4xrw1+OJsW6IjcYy8=; b=XczJAuqugFhxXCtwxVZe+Y0UTWJyS6cK8Fyght5TJGiJLd68S9lmoMjrvF2/fIA1AK rWGWMy0SZTZk9FT6ZuDsEMsygyUqesxckYpMWTsrGIsz+0LgxnZw7VeYcam5yJ/qJZnB jG8QgwIgJKQnjx4R4ojz4g7FA6BV3mqB8QBC8S0r1BksenTESnd6oWkHEJ+0n1cb74EM uFGc1xH2k/pIZA4Mh5SBz9xUhw/W8xUNcNLgauaZNmucJuenaemL8urxIs/yMbf2UbaW uzgDIZfbIOZmcSBrtUxibduncS0+tScAhYMXWLBpI+JWzu5TsR1SI8vlKKnMX3kUnR3l N8pA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=u0kItspC; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id x3-v6si7808254plb.478.2018.07.06.02.03.10; Fri, 06 Jul 2018 02:03:25 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=u0kItspC; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754074AbeGFJCF (ORCPT + 99 others); Fri, 6 Jul 2018 05:02:05 -0400 Received: from mail-pl0-f68.google.com ([209.85.160.68]:37444 "EHLO mail-pl0-f68.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753287AbeGFJCD (ORCPT ); Fri, 6 Jul 2018 05:02:03 -0400 Received: by mail-pl0-f68.google.com with SMTP id 31-v6so2584283plc.4 for ; Fri, 06 Jul 2018 02:02:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:to:cc:subject:date:message-id:in-reply-to:references; bh=rBt92G0faVb+l4LzkWBVpJgQsD4xrw1+OJsW6IjcYy8=; b=u0kItspC61BqUab9sXL0VtDMPQpptLfkx2ZFWY3hkl8eL8rydpEm9z8v9Fr3Tk1zvK iHRGWEjjxKRAAHbCocvGZVddgegK63JSG+oZybOg9/XZVUUhimxwDeRfam0uPdPY59yN G6K1qxivYQYRu5K/Ymase7m2uvsn1lUKxEGjEWeL04QMGuBXyDNwo+mIoTydA5K2P1Sq YCxNbVCuZcdgTbN23dugG6OUhlz3m4IXMkDOqJTW+tCtKY7zV8VAnEo1FwOe4HQDIiQX XN5MHDK+p7OVEFvCFNEXWXw86KPGdHAjPXZzbRRSUfOR/A/jZSyCRELNryHJLWg7XDNT PD6A== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references; bh=rBt92G0faVb+l4LzkWBVpJgQsD4xrw1+OJsW6IjcYy8=; b=lsFkvdt7sbDRYujw6wOu7CP5mEXXs5qpkaYT6RZ7PLnNronwwd9gzvG+7k7fbSqukx 6Tx+2oLYDHZiaf3mjXqfxLZ5Mj4biTzoc92efxbZtFlZkulSSF1kqBrcF7uZFyyhrCw0 dN51rmRtGOc+ZU7rplCB4Sku5rRm9Fhb7d1DD1/UxRb4pp3jAe2Z5/176ASgj6v/vj5X o7DTpUDILmK9OTo0na8VDM/bJgbMlhqBzEIOAZBlps3iLf66pi6H02YI/z1tJFmmpkuU ITuecKKMvNdQ7dkh0TWJts9lf1g44hPivMMY28/0vVMooc0312vZrdM2y1tyh2Gs88ov uFmA== X-Gm-Message-State: APt69E1ebQQUv394VF6JXLqwaAwdvpsWJJhvaRsB//UZozxOdwnhuKhY rCzRQwVbv1ZpmIeeIULby9tt7gTv X-Received: by 2002:a17:902:8347:: with SMTP id z7-v6mr9686011pln.290.1530867722888; Fri, 06 Jul 2018 02:02:02 -0700 (PDT) Received: from ct7host.localdomain ([38.106.11.25]) by smtp.gmail.com with ESMTPSA id e5-v6sm10837092pgs.59.2018.07.06.02.01.51 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Fri, 06 Jul 2018 02:02:02 -0700 (PDT) From: Jia He To: Russell King , Catalin Marinas , Will Deacon , Mark Rutland , Ard Biesheuvel , Andrew Morton , Michal Hocko Cc: Wei Yang , Kees Cook , Laura Abbott , Vladimir Murzin , Philip Derrin , AKASHI Takahiro , James Morse , Steve Capper , Pavel Tatashin , Gioh Kim , Vlastimil Babka , Mel Gorman , Johannes Weiner , Kemi Wang , Petr Tesarik , YASUAKI ISHIMATSU , Andrey Ryabinin , Nikolay Borisov , Daniel Jordan , Daniel Vacek , Eugeniu Rosca , linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org, Jia He Subject: [RESEND PATCH v10 2/6] mm: page_alloc: remain memblock_next_valid_pfn() on arm/arm64 Date: Fri, 6 Jul 2018 17:01:11 +0800 Message-Id: <1530867675-9018-3-git-send-email-hejianet@gmail.com> X-Mailer: git-send-email 1.8.3.1 In-Reply-To: <1530867675-9018-1-git-send-email-hejianet@gmail.com> References: <1530867675-9018-1-git-send-email-hejianet@gmail.com> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Jia He Commit b92df1de5d28 ("mm: page_alloc: skip over regions of invalid pfns where possible") optimized the loop in memmap_init_zone(). But it causes possible panic bug. So Daniel Vacek reverted it later. But as suggested by Daniel Vacek, it is fine to using memblock to skip gaps and finding next valid frame with CONFIG_HAVE_ARCH_PFN_VALID. Daniel said: "On arm and arm64, memblock is used by default. But generic version of pfn_valid() is based on mem sections and memblock_next_valid_pfn() does not always return the next valid one but skips more resulting in some valid frames to be skipped (as if they were invalid). And that's why kernel was eventually crashing on some !arm machines." About the performance consideration: As said by James in b92df1de5, "I have tested this patch on a virtual model of a Samurai CPU with a sparse memory map. The kernel boot time drops from 109 to 62 seconds." Thus it would be better if we remain memblock_next_valid_pfn on arm/arm64. Suggested-by: Daniel Vacek Signed-off-by: Jia He --- include/linux/mmzone.h | 11 +++++++++++ mm/memblock.c | 30 ++++++++++++++++++++++++++++++ mm/page_alloc.c | 5 ++++- 3 files changed, 45 insertions(+), 1 deletion(-) diff --git a/include/linux/mmzone.h b/include/linux/mmzone.h index 32699b2..57cdc42 100644 --- a/include/linux/mmzone.h +++ b/include/linux/mmzone.h @@ -1241,6 +1241,8 @@ static inline int pfn_valid(unsigned long pfn) return 0; return valid_section(__nr_to_section(pfn_to_section_nr(pfn))); } + +#define next_valid_pfn(pfn) (pfn + 1) #endif static inline int pfn_present(unsigned long pfn) @@ -1266,6 +1268,10 @@ static inline int pfn_present(unsigned long pfn) #endif #define early_pfn_valid(pfn) pfn_valid(pfn) +#ifdef CONFIG_HAVE_MEMBLOCK_PFN_VALID +extern ulong memblock_next_valid_pfn(ulong pfn); +#define next_valid_pfn(pfn) memblock_next_valid_pfn(pfn) +#endif void sparse_init(void); #else #define sparse_init() do {} while (0) @@ -1287,6 +1293,11 @@ struct mminit_pfnnid_cache { #define early_pfn_valid(pfn) (1) #endif +/* fallback to default definitions*/ +#ifndef next_valid_pfn +#define next_valid_pfn(pfn) (pfn + 1) +#endif + void memory_present(int nid, unsigned long start, unsigned long end); /* diff --git a/mm/memblock.c b/mm/memblock.c index b9cdfa0..ccad225 100644 --- a/mm/memblock.c +++ b/mm/memblock.c @@ -1139,6 +1139,36 @@ int __init_memblock memblock_set_node(phys_addr_t base, phys_addr_t size, } #endif /* CONFIG_HAVE_MEMBLOCK_NODE_MAP */ +#ifdef CONFIG_HAVE_MEMBLOCK_PFN_VALID +ulong __init_memblock memblock_next_valid_pfn(ulong pfn) +{ + struct memblock_type *type = &memblock.memory; + unsigned int right = type->cnt; + unsigned int mid, left = 0; + phys_addr_t addr = PFN_PHYS(++pfn); + + do { + mid = (right + left) / 2; + + if (addr < type->regions[mid].base) + right = mid; + else if (addr >= (type->regions[mid].base + + type->regions[mid].size)) + left = mid + 1; + else { + /* addr is within the region, so pfn is valid */ + return pfn; + } + } while (left < right); + + if (right == type->cnt) + return -1UL; + else + return PHYS_PFN(type->regions[right].base); +} +EXPORT_SYMBOL(memblock_next_valid_pfn); +#endif /*CONFIG_HAVE_MEMBLOCK_PFN_VALID*/ + static phys_addr_t __init memblock_alloc_range_nid(phys_addr_t size, phys_addr_t align, phys_addr_t start, phys_addr_t end, int nid, ulong flags) diff --git a/mm/page_alloc.c b/mm/page_alloc.c index cd3c7b9..607deff 100644 --- a/mm/page_alloc.c +++ b/mm/page_alloc.c @@ -5485,8 +5485,11 @@ void __meminit memmap_init_zone(unsigned long size, int nid, unsigned long zone, if (context != MEMMAP_EARLY) goto not_early; - if (!early_pfn_valid(pfn)) + if (!early_pfn_valid(pfn)) { + pfn = next_valid_pfn(pfn) - 1; continue; + } + if (!early_pfn_in_nid(pfn, nid)) continue; if (!update_defer_init(pgdat, pfn, end_pfn, &nr_initialised)) -- 1.8.3.1