Received: by 10.213.65.68 with SMTP id h4csp2105426imn; Mon, 2 Apr 2018 00:51:55 -0700 (PDT) X-Google-Smtp-Source: AIpwx48k/Ek+iJjpFW8SpGaTL0OfWrcXTQPR/9bhac6Hnb3tBp1XpER5O8dAiQpcUKe53ZAh7cs2 X-Received: by 10.99.123.23 with SMTP id w23mr5633145pgc.10.1522655515446; Mon, 02 Apr 2018 00:51:55 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1522655515; cv=none; d=google.com; s=arc-20160816; b=jihjCtt9jMzRWfGEfUuSXYFQk2z6SKZZqGYp9UKYPekyetgnvstSjFYM4VbfSygSt+ I55z3dVeqZZnr/gicS+OuQ2Prw0bl7VWjY2e+zfMUoi9dz6S2KpUfDlvTG3Tr4e4rqv4 WoGLlwxzty1CMDEW4/VQAs6E6Lwfof0VfYZWAn3/hasqtb7eEl8do7cC2/TMc/X8M/8f 8tn687jNq2OgIRl98sBWS4ui2IqK2GE4OtQXxVhYX2AAozAX501Kun6uwl4Dl+JcoXau E2zYzoEhHKGwVJGdCO+8/UI4UOxl3HsfzTWOnrFKAw/Zy+/rPxHA8rDqI9SCpvmhrCen oehQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:in-reply-to :mime-version:user-agent:date:message-id:from:references:cc:to :subject:dkim-signature:arc-authentication-results; bh=bt3+cXppIh5Tl8/04INU4Ruo5MhHC3MJHj0DpbpjN7w=; b=lVAr86tm1WJkF+jqgtpq8sxoS7DbMjhqDnM2tl98nGaL0yotiJWnWCqCHlk38LtSIU oeDjv2hml5r7nUyoLUxV+QAaWQkTSOg6QETJbvJCXrsNKUbMdCviFqmZIb25Ko45g7WU T7Wc+8dJ9BygX6agb8FVsCz3TBjtbyhLqHG/bBd+NHNqqZI9MAZC4VZFlGzW4/DIdNDk O5rvndqgJYFoGHd8puvwrVATIDLvhPBWKV/zxKuCpZu4GsmlsRSlHCep0Ot+MsPbHyeZ rNsziws8Lr4ByMxOgx5BOH2MJRg2UW82nr17r9KNmX1tnRVni+kzpGOfvrxjPfCO/3BF n35g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=JmloU15j; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id v11si9435229pgt.61.2018.04.02.00.51.29; Mon, 02 Apr 2018 00:51:55 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=JmloU15j; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754315AbeDBHtp (ORCPT + 99 others); Mon, 2 Apr 2018 03:49:45 -0400 Received: from mail-io0-f195.google.com ([209.85.223.195]:44255 "EHLO mail-io0-f195.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752775AbeDBHtn (ORCPT ); Mon, 2 Apr 2018 03:49:43 -0400 Received: by mail-io0-f195.google.com with SMTP id d7so17036309ioc.11 for ; Mon, 02 Apr 2018 00:49:43 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=subject:to:cc:references:from:message-id:date:user-agent :mime-version:in-reply-to:content-transfer-encoding; bh=bt3+cXppIh5Tl8/04INU4Ruo5MhHC3MJHj0DpbpjN7w=; b=JmloU15j04iITc4x/yPB8yvbViNUODPnpL8uLmX+pyFf+Z3185yyOrTRZnv/E5F+Uc rc5k2KiVROyJ4CFyU6O6N+/Vf3mwjp3oPU6ALVrg+OglLTtVloyujzvdqc1DD5OXT++S HFa85QnrT0lxRGzpqVtJM5mDjpRYh4GQEoaUIprLBFdFSiT2wUf/l8UVlCsaHQR3X2Ix fmrVKpUHDafDzyMz7Xgc/LDSi8Pa5Capkf4oghY5frO9XE1opaRi6J3PfClVZers4BGY LLQGumuTMEy179qj2r4LwBOSJ1WwXwEpZVKOxxPMgVfr3P660Me9POh7r0eQT9MxshVF mEIA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-transfer-encoding; bh=bt3+cXppIh5Tl8/04INU4Ruo5MhHC3MJHj0DpbpjN7w=; b=mNguIhbSn/scSUxjuMm6pO1CeDjTvfDePRI3nOUQR4orpY7u2WW0QUVBo1XQwPo1Bv haPM+9L9BPeNfuCVfVFxl3yFg+UFmFHdW3QK8vJSeTbJ9vLptOBErcYMxxyoH2c9VU6T 2fTCqukIyJFksmkxh+pLxbSnZRpv0tOU9g0jw1PQZA4LYObFgADnnhDenI4nwmOkfqyH VkdFUUWst/atQlldnctANcKuZY1T7mg53Fil3Qkdf62QR1AwwTodEAEJ/Y5+uItFAeg4 dGr981SyCtnmhvC3lajC1Q4lDv9DNSPYhzBZKmAEUwppJwGvkFPYS/YAn33wd6s2feuC u1kw== X-Gm-Message-State: ALQs6tCX7Qh1s0Pk+KKJNJGa03rUPxIt8PjfcqaTojO8mDvkyvsR0yB9 UdAwH5vydYeM0AWcNHyibn8= X-Received: by 10.107.129.148 with SMTP id l20mr7712160ioi.140.1522655382874; Mon, 02 Apr 2018 00:49:42 -0700 (PDT) Received: from [0.0.0.0] (67.216.217.169.16clouds.com. [67.216.217.169]) by smtp.gmail.com with ESMTPSA id a129-v6sm27635itd.34.2018.04.02.00.49.25 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Mon, 02 Apr 2018 00:49:42 -0700 (PDT) Subject: Re: [PATCH v5 1/5] mm: page_alloc: remain memblock_next_valid_pfn() on arm and arm64 To: Ard Biesheuvel Cc: Russell King , Catalin Marinas , Will Deacon , Mark Rutland , Andrew Morton , Michal Hocko , Wei Yang , Kees Cook , Laura Abbott , Vladimir Murzin , Philip Derrin , Grygorii Strashko , AKASHI Takahiro , James Morse , Steve Capper , Pavel Tatashin , Gioh Kim , Vlastimil Babka , Mel Gorman , Johannes Weiner , Kemi Wang , Petr Tesarik , YASUAKI ISHIMATSU , Andrey Ryabinin , Nikolay Borisov , Daniel Jordan , Daniel Vacek , Eugeniu Rosca , linux-arm-kernel , Linux Kernel Mailing List , Linux-MM , Jia He References: <1522636236-12625-1-git-send-email-hejianet@gmail.com> <1522636236-12625-2-git-send-email-hejianet@gmail.com> From: Jia He Message-ID: <41445229-043c-976f-3961-13770163444f@gmail.com> Date: Mon, 2 Apr 2018 15:49:23 +0800 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.6.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 4/2/2018 2:55 PM, Ard Biesheuvel Wrote: > On 2 April 2018 at 04:30, Jia He wrote: >> Commit b92df1de5d28 ("mm: page_alloc: skip over regions of invalid pfns >> where possible") optimized the loop in memmap_init_zone(). But it causes >> possible panic bug. So Daniel Vacek reverted it later. >> >> But as suggested by Daniel Vacek, it is fine to using memblock to skip >> gaps and finding next valid frame with CONFIG_HAVE_ARCH_PFN_VALID. >> >> On arm and arm64, memblock is used by default. But generic version of >> pfn_valid() is based on mem sections and memblock_next_valid_pfn() does >> not always return the next valid one but skips more resulting in some >> valid frames to be skipped (as if they were invalid). And that's why >> kernel was eventually crashing on some !arm machines. >> >> And as verified by Eugeniu Rosca, arm can benifit from commit >> b92df1de5d28. So remain the memblock_next_valid_pfn on arm{,64} and move >> the related codes to arm64 arch directory. >> >> Suggested-by: Daniel Vacek >> Signed-off-by: Jia He > Hello Jia, > > Apologies for chiming in late. no problem, thanks for your comments  ;-) > > If we are going to rearchitect this, I'd rather we change the loop in > memmap_init_zone() so that we skip to the next valid PFN directly > rather than skipping to the last invalid PFN so that the pfn++ in the hmm... Maybe this macro name makes you confused pfn = skip_to_last_invalid_pfn(pfn); how about skip_to_next_valid_pfn? > for () results in the next value. Can we replace the pfn++ there with > a function calls that defaults to 'return pfn + 1', but does the skip > for architectures that implement it? I am not sure I understand your question here. With this patch, on !arm arches, skip_to_last_invalid_pfn is equal to (pfn), and will be increased when for{} loop continue. We only *skip* to the start pfn of next valid region when CONFIG_HAVE_MEMBLOCK and CONFIG_HAVE_ARCH_PFN_VALID(arm/arm64 supports both). -- Cheers, Jia > > >> --- >> arch/arm/include/asm/page.h | 2 ++ >> arch/arm/mm/init.c | 31 ++++++++++++++++++++++++++++++- >> arch/arm64/include/asm/page.h | 2 ++ >> arch/arm64/mm/init.c | 31 ++++++++++++++++++++++++++++++- >> include/linux/mmzone.h | 1 + >> mm/page_alloc.c | 4 +++- >> 6 files changed, 68 insertions(+), 3 deletions(-) >> >> diff --git a/arch/arm/include/asm/page.h b/arch/arm/include/asm/page.h >> index 4355f0e..489875c 100644 >> --- a/arch/arm/include/asm/page.h >> +++ b/arch/arm/include/asm/page.h >> @@ -158,6 +158,8 @@ typedef struct page *pgtable_t; >> >> #ifdef CONFIG_HAVE_ARCH_PFN_VALID >> extern int pfn_valid(unsigned long); >> +extern unsigned long memblock_next_valid_pfn(unsigned long pfn); >> +#define skip_to_last_invalid_pfn(pfn) (memblock_next_valid_pfn(pfn) - 1) >> #endif >> >> #include >> diff --git a/arch/arm/mm/init.c b/arch/arm/mm/init.c >> index a1f11a7..0fb85ca 100644 >> --- a/arch/arm/mm/init.c >> +++ b/arch/arm/mm/init.c >> @@ -198,7 +198,36 @@ int pfn_valid(unsigned long pfn) >> return memblock_is_map_memory(__pfn_to_phys(pfn)); >> } >> EXPORT_SYMBOL(pfn_valid); >> -#endif >> + >> +/* HAVE_MEMBLOCK is always enabled on arm */ >> +unsigned long __init_memblock memblock_next_valid_pfn(unsigned long pfn) >> +{ >> + struct memblock_type *type = &memblock.memory; >> + unsigned int right = type->cnt; >> + unsigned int mid, left = 0; >> + phys_addr_t addr = PFN_PHYS(++pfn); >> + >> + do { >> + mid = (right + left) / 2; >> + >> + if (addr < type->regions[mid].base) >> + right = mid; >> + else if (addr >= (type->regions[mid].base + >> + type->regions[mid].size)) >> + left = mid + 1; >> + else { >> + /* addr is within the region, so pfn is valid */ >> + return pfn; >> + } >> + } while (left < right); >> + >> + if (right == type->cnt) >> + return -1UL; >> + else >> + return PHYS_PFN(type->regions[right].base); >> +} >> +EXPORT_SYMBOL(memblock_next_valid_pfn); >> +#endif /*CONFIG_HAVE_ARCH_PFN_VALID*/ >> >> #ifndef CONFIG_SPARSEMEM >> static void __init arm_memory_present(void) >> diff --git a/arch/arm64/include/asm/page.h b/arch/arm64/include/asm/page.h >> index 60d02c8..e57d3f2 100644 >> --- a/arch/arm64/include/asm/page.h >> +++ b/arch/arm64/include/asm/page.h >> @@ -39,6 +39,8 @@ typedef struct page *pgtable_t; >> >> #ifdef CONFIG_HAVE_ARCH_PFN_VALID >> extern int pfn_valid(unsigned long); >> +extern unsigned long memblock_next_valid_pfn(unsigned long pfn); >> +#define skip_to_last_invalid_pfn(pfn) (memblock_next_valid_pfn(pfn) - 1) >> #endif >> >> #include >> diff --git a/arch/arm64/mm/init.c b/arch/arm64/mm/init.c >> index 00e7b90..13e43ff 100644 >> --- a/arch/arm64/mm/init.c >> +++ b/arch/arm64/mm/init.c >> @@ -290,7 +290,36 @@ int pfn_valid(unsigned long pfn) >> return memblock_is_map_memory(pfn << PAGE_SHIFT); >> } >> EXPORT_SYMBOL(pfn_valid); >> -#endif >> + >> +/* HAVE_MEMBLOCK is always enabled on arm64 */ >> +unsigned long __init_memblock memblock_next_valid_pfn(unsigned long pfn) >> +{ >> + struct memblock_type *type = &memblock.memory; >> + unsigned int right = type->cnt; >> + unsigned int mid, left = 0; >> + phys_addr_t addr = PFN_PHYS(++pfn); >> + >> + do { >> + mid = (right + left) / 2; >> + >> + if (addr < type->regions[mid].base) >> + right = mid; >> + else if (addr >= (type->regions[mid].base + >> + type->regions[mid].size)) >> + left = mid + 1; >> + else { >> + /* addr is within the region, so pfn is valid */ >> + return pfn; >> + } >> + } while (left < right); >> + >> + if (right == type->cnt) >> + return -1UL; >> + else >> + return PHYS_PFN(type->regions[right].base); >> +} >> +EXPORT_SYMBOL(memblock_next_valid_pfn); >> +#endif /*CONFIG_HAVE_ARCH_PFN_VALID*/ >> >> #ifndef CONFIG_SPARSEMEM >> static void __init arm64_memory_present(void) >> diff --git a/include/linux/mmzone.h b/include/linux/mmzone.h >> index d797716..f9c0c46 100644 >> --- a/include/linux/mmzone.h >> +++ b/include/linux/mmzone.h >> @@ -1245,6 +1245,7 @@ static inline int pfn_valid(unsigned long pfn) >> return 0; >> return valid_section(__nr_to_section(pfn_to_section_nr(pfn))); >> } >> +#define skip_to_last_invalid_pfn(pfn) (pfn) >> #endif >> >> static inline int pfn_present(unsigned long pfn) >> diff --git a/mm/page_alloc.c b/mm/page_alloc.c >> index c19f5ac..30f7d76 100644 >> --- a/mm/page_alloc.c >> +++ b/mm/page_alloc.c >> @@ -5483,8 +5483,10 @@ void __meminit memmap_init_zone(unsigned long size, int nid, unsigned long zone, >> if (context != MEMMAP_EARLY) >> goto not_early; >> >> - if (!early_pfn_valid(pfn)) >> + if (!early_pfn_valid(pfn)) { >> + pfn = skip_to_last_invalid_pfn(pfn); >> continue; >> + } >> if (!early_pfn_in_nid(pfn, nid)) >> continue; >> if (!update_defer_init(pgdat, pfn, end_pfn, &nr_initialised)) >> -- >> 2.7.4 >>