Subject: Re: [PATCH v3 3/4] arm64: kdump: support more than one crash kernel regions
To: Mike Rapoport
References: <20190409102819.121335-1-chenzhou10@huawei.com> <20190409102819.121335-4-chenzhou10@huawei.com> <20190414121315.GD20947@rapoport-lnx>
From: Chen Zhou
Date: Mon, 15 Apr 2019 10:27:30 +0800
In-Reply-To: <20190414121315.GD20947@rapoport-lnx>
X-Mailing-List: linux-kernel@vger.kernel.org

Hi Mike,

On 2019/4/14 20:13, Mike
Rapoport wrote:
> Hi,
>
> On Tue, Apr 09, 2019 at 06:28:18PM +0800, Chen Zhou wrote:
>> After commit (arm64: kdump: support reserving crashkernel above 4G),
>> there may be two crash kernel regions, one is below 4G, the other is
>> above 4G.
>>
>> Crash dump kernel reads more than one crash kernel regions via a dtb
>> property under node /chosen,
>> linux,usable-memory-range =
>
> Somehow I've missed that previously, but how is this supposed to work on
> EFI systems?

Regardless of how the system boots, the arm64 kernel has an FDT pointer
(__fdt_pointer), and the file /sys/firmware/fdt is created in a
late_initcall. On EFI systems, kexec-tools reads and updates
/sys/firmware/fdt so that kdump can boot the capture kernel.

To support more than one crash kernel region, kexec-tools is changed
accordingly. Details are here:

http://lists.infradead.org/pipermail/kexec/2019-April/022792.html

Thanks,
Chen Zhou

>
>> Signed-off-by: Chen Zhou
>> ---
>>  arch/arm64/mm/init.c     | 66 ++++++++++++++++++++++++++++++++++++++++--------
>>  include/linux/memblock.h |  6 +++++
>>  mm/memblock.c            |  7 ++---
>>  3 files changed, 66 insertions(+), 13 deletions(-)
>>
>> diff --git a/arch/arm64/mm/init.c b/arch/arm64/mm/init.c
>> index 3bebddf..0f18665 100644
>> --- a/arch/arm64/mm/init.c
>> +++ b/arch/arm64/mm/init.c
>> @@ -65,6 +65,11 @@ phys_addr_t arm64_dma_phys_limit __ro_after_init;
>>
>>  #ifdef CONFIG_KEXEC_CORE
>>
>> +/* at most two crash kernel regions, low_region and high_region */
>> +#define CRASH_MAX_USABLE_RANGES	2
>> +#define LOW_REGION_IDX			0
>> +#define HIGH_REGION_IDX			1
>> +
>>  /*
>>   * reserve_crashkernel() - reserves memory for crash kernel
>>   *
>> @@ -297,8 +302,8 @@ static int __init early_init_dt_scan_usablemem(unsigned long node,
>>  		const char *uname, int depth, void *data)
>>  {
>>  	struct memblock_region *usablemem = data;
>> -	const __be32 *reg;
>> -	int len;
>> +	const __be32 *reg, *endp;
>> +	int len, nr = 0;
>>
>>  	if (depth != 1 || strcmp(uname, "chosen") !=
0)
>>  		return 0;
>> @@ -307,22 +312,63 @@ static int __init early_init_dt_scan_usablemem(unsigned long node,
>>  	if (!reg || (len < (dt_root_addr_cells + dt_root_size_cells)))
>>  		return 1;
>>
>> -	usablemem->base = dt_mem_next_cell(dt_root_addr_cells, &reg);
>> -	usablemem->size = dt_mem_next_cell(dt_root_size_cells, &reg);
>> +	endp = reg + (len / sizeof(__be32));
>> +	while ((endp - reg) >= (dt_root_addr_cells + dt_root_size_cells)) {
>> +		usablemem[nr].base = dt_mem_next_cell(dt_root_addr_cells, &reg);
>> +		usablemem[nr].size = dt_mem_next_cell(dt_root_size_cells, &reg);
>> +
>> +		if (++nr >= CRASH_MAX_USABLE_RANGES)
>> +			break;
>> +	}
>>
>>  	return 1;
>>  }
>>
>>  static void __init fdt_enforce_memory_region(void)
>>  {
>> -	struct memblock_region reg = {
>> -		.size = 0,
>> -	};
>> +	int i, cnt = 0;
>> +	struct memblock_region regs[CRASH_MAX_USABLE_RANGES];
>> +
>> +	memset(regs, 0, sizeof(regs));
>> +	of_scan_flat_dt(early_init_dt_scan_usablemem, regs);
>> +
>> +	for (i = 0; i < CRASH_MAX_USABLE_RANGES; i++)
>> +		if (regs[i].size)
>> +			cnt++;
>> +		else
>> +			break;
>> +
>> +	if (cnt - 1 == LOW_REGION_IDX)
>> +		memblock_cap_memory_range(regs[LOW_REGION_IDX].base,
>> +				regs[LOW_REGION_IDX].size);
>> +	else if (cnt - 1 == HIGH_REGION_IDX) {
>> +		/*
>> +		 * Two crash kernel regions, cap the memory range
>> +		 * [regs[LOW_REGION_IDX].base, regs[HIGH_REGION_IDX].end]
>> +		 * and then remove the memory range in the middle.
>> +		 */
>> +		int start_rgn, end_rgn, i, ret;
>> +		phys_addr_t mid_base, mid_size;
>> +
>> +		mid_base = regs[LOW_REGION_IDX].base + regs[LOW_REGION_IDX].size;
>> +		mid_size = regs[HIGH_REGION_IDX].base - mid_base;
>> +		ret = memblock_isolate_range(&memblock.memory, mid_base,
>> +				mid_size, &start_rgn, &end_rgn);
>>
>> -	of_scan_flat_dt(early_init_dt_scan_usablemem, &reg);
>> +		if (ret)
>> +			return;
>>
>> -	if (reg.size)
>> -		memblock_cap_memory_range(reg.base, reg.size);
>> +		memblock_cap_memory_range(regs[LOW_REGION_IDX].base,
>> +				regs[HIGH_REGION_IDX].base -
>> +				regs[LOW_REGION_IDX].base +
>> +				regs[HIGH_REGION_IDX].size);
>> +		for (i = end_rgn - 1; i >= start_rgn; i--) {
>> +			if (!memblock_is_nomap(&memblock.memory.regions[i]))
>> +				memblock_remove_region(&memblock.memory, i);
>> +		}
>> +		memblock_remove_range(&memblock.reserved, mid_base,
>> +				mid_base + mid_size);
>> +	}
>>  }
>>
>>  void __init arm64_memblock_init(void)
>> diff --git a/include/linux/memblock.h b/include/linux/memblock.h
>> index 294d5d8..787d252 100644
>> --- a/include/linux/memblock.h
>> +++ b/include/linux/memblock.h
>> @@ -110,9 +110,15 @@ void memblock_discard(void);
>>
>>  phys_addr_t memblock_find_in_range(phys_addr_t start, phys_addr_t end,
>>  				   phys_addr_t size, phys_addr_t align);
>> +void memblock_remove_region(struct memblock_type *type, unsigned long r);
>>  void memblock_allow_resize(void);
>>  int memblock_add_node(phys_addr_t base, phys_addr_t size, int nid);
>>  int memblock_add(phys_addr_t base, phys_addr_t size);
>> +int memblock_isolate_range(struct memblock_type *type,
>> +			phys_addr_t base, phys_addr_t size,
>> +			int *start_rgn, int *end_rgn);
>> +int memblock_remove_range(struct memblock_type *type,
>> +			phys_addr_t base, phys_addr_t size);
>>  int memblock_remove(phys_addr_t base, phys_addr_t size);
>>  int memblock_free(phys_addr_t base, phys_addr_t size);
>>  int memblock_reserve(phys_addr_t base, phys_addr_t size);
>> diff --git a/mm/memblock.c b/mm/memblock.c
>> index
e7665cf..1846e2d 100644
>> --- a/mm/memblock.c
>> +++ b/mm/memblock.c
>> @@ -357,7 +357,8 @@ phys_addr_t __init_memblock memblock_find_in_range(phys_addr_t start,
>>  	return ret;
>>  }
>>
>> -static void __init_memblock memblock_remove_region(struct memblock_type *type, unsigned long r)
>> +void __init_memblock memblock_remove_region(struct memblock_type *type,
>> +					unsigned long r)
>>  {
>>  	type->total_size -= type->regions[r].size;
>>  	memmove(&type->regions[r], &type->regions[r + 1],
>> @@ -724,7 +725,7 @@ int __init_memblock memblock_add(phys_addr_t base, phys_addr_t size)
>>   * Return:
>>   * 0 on success, -errno on failure.
>>   */
>> -static int __init_memblock memblock_isolate_range(struct memblock_type *type,
>> +int __init_memblock memblock_isolate_range(struct memblock_type *type,
>>  					phys_addr_t base, phys_addr_t size,
>>  					int *start_rgn, int *end_rgn)
>>  {
>> @@ -784,7 +785,7 @@ static int __init_memblock memblock_isolate_range(struct memblock_type *type,
>>  	return 0;
>>  }
>>
>> -static int __init_memblock memblock_remove_range(struct memblock_type *type,
>> +int __init_memblock memblock_remove_range(struct memblock_type *type,
>>  					  phys_addr_t base, phys_addr_t size)
>>  {
>>  	int start_rgn, end_rgn;
>> --
>> 2.7.4
>>
>
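
For anyone following the thread, the multi-range parsing in the quoted patch
can be sketched in userspace C. This is only a minimal sketch under stated
assumptions, not the kernel code and not the kexec-tools change: struct range,
MAX_RANGES, next_cells(), parse_usable_ranges() and gap_between() are
hypothetical names, and the property is assumed to hold big-endian 32-bit
cells laid out as (base, size) pairs with the low region first. Lengths here
are in bytes throughout.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical limit mirroring "at most two crash kernel regions". */
#define MAX_RANGES 2

/* Hypothetical stand-in for a (base, size) memory range. */
struct range {
	uint64_t base;
	uint64_t size;
};

/* Read `cells` big-endian 32-bit cells from *p and advance the cursor. */
static uint64_t next_cells(int cells, const uint8_t **p)
{
	uint64_t v = 0;

	while (cells--) {
		uint32_t cell = ((uint32_t)(*p)[0] << 24) |
				((uint32_t)(*p)[1] << 16) |
				((uint32_t)(*p)[2] << 8) |
				(uint32_t)(*p)[3];

		v = (v << 32) | cell;
		*p += 4;
	}
	return v;
}

/*
 * Parse up to MAX_RANGES (base, size) pairs from a raw property buffer.
 * Returns the number of complete pairs found; a trailing partial pair
 * is ignored.
 */
static int parse_usable_ranges(const uint8_t *prop, size_t len_bytes,
			       int addr_cells, int size_cells,
			       struct range *out)
{
	const uint8_t *end = prop + len_bytes;
	size_t entry_bytes = (size_t)(addr_cells + size_cells) * 4;
	int nr = 0;

	assert(out != NULL);

	while (nr < MAX_RANGES && (size_t)(end - prop) >= entry_bytes) {
		out[nr].base = next_cells(addr_cells, &prop);
		out[nr].size = next_cells(size_cells, &prop);
		nr++;
	}
	return nr;
}

/* Span between the end of the low range and the start of the high range. */
static struct range gap_between(const struct range *low,
				const struct range *high)
{
	struct range gap;

	/* Assumes the ranges are ordered and do not overlap. */
	assert(high->base >= low->base + low->size);

	gap.base = low->base + low->size;
	gap.size = high->base - gap.base;
	return gap;
}
```

With two address cells and two size cells, a 32-byte property yields two
ranges, and gap_between() then gives the span that corresponds to mid_base and
mid_size in the quoted patch.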