Received: by 2002:a25:4158:0:0:0:0:0 with SMTP id o85csp1819231yba; Sun, 14 Apr 2019 21:57:05 -0700 (PDT) X-Google-Smtp-Source: APXvYqwKNm5OMhx4MDXVM7GqQ690H6uqO3Ya7Sp5k7DdAitB65Hteg75JAh/KgWK8DbxRTYhoG1a X-Received: by 2002:a17:902:2b81:: with SMTP id l1mr73836604plb.289.1555304225349; Sun, 14 Apr 2019 21:57:05 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1555304225; cv=none; d=google.com; s=arc-20160816; b=Tcm4XnL6OUw+CVaJ2Xs3zOb4f/8qpcUfXO9IpTdIPNNfe7llzXZoSMzifIzeKIVLTH 3lTAi2ynK/60Pt6G95384f6TKmgG7MOQFx3QGGP+zzLJwP+HC//rlpRzYettg9IbEzz1 DW3Bz+9oL+z+rssoTYcrJYU3heq6MQVWLIdCuscZWuV6XxyzdYvnZ2hRzYvx2B2HYURd ND5bWj0ApugG9Tul3PQZBOurA3siNq36mmoRIZMudwt0xeeCFRamjmyp5CCMaMd3NROt M3Y4XGG6vNehhoxQsibeVwIVrTkRF20/2An98dMcZsv0ID+VkiKG6d8J+fE8VwlrMf0c sMbQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:user-agent:in-reply-to :content-disposition:mime-version:references:subject:cc:to:from:date; bh=Rm3KjdmG1YkCqDiS3Yb2YN6egSA3rxZOYlR2c+5PUCQ=; b=zP8OOoIVywZQ4KM1ZK3zXphkWEU/oxgjpEL/qT0rTaOQlBoS9u7rlFbR6Ss5UXLPU3 TVbJUleqxQW9q/NHvI1fyiSujJ/qY+VhB6BQijM/NkO0z9dlRdUObbpYC4XgsHg0Wu73 STeJfg0HnUJOIXsItjUi9XpRlHvS2IJnp8bh1GQhro+/+RkPpMBezuNzbBxhNDeiyLUe 8zanO27dhE4N8qmOUDC0w4j3f2fhoclqKB2y2Ccg34Kzb4Swk0G1yQ1eEz72WEs+gOsz Qrxte7kDZdKKr9J68OOSrxafHMHPYOe2qbD+8UF8T0JShimsmEDt36+o8FoNiAR9x/uz YsDg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id j22si31249842pfi.167.2019.04.14.21.56.35; Sun, 14 Apr 2019 21:57:05 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1725846AbfDOEzc (ORCPT + 99 others); Mon, 15 Apr 2019 00:55:32 -0400 Received: from mx0b-001b2d01.pphosted.com ([148.163.158.5]:47338 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1725789AbfDOEzb (ORCPT ); Mon, 15 Apr 2019 00:55:31 -0400 Received: from pps.filterd (m0098414.ppops.net [127.0.0.1]) by mx0b-001b2d01.pphosted.com (8.16.0.27/8.16.0.27) with SMTP id x3F4nJdC047904 for ; Mon, 15 Apr 2019 00:55:29 -0400 Received: from e06smtp07.uk.ibm.com (e06smtp07.uk.ibm.com [195.75.94.103]) by mx0b-001b2d01.pphosted.com with ESMTP id 2rvd8qmt1g-1 (version=TLSv1.2 cipher=AES256-GCM-SHA384 bits=256 verify=NOT) for ; Mon, 15 Apr 2019 00:55:29 -0400 Received: from localhost by e06smtp07.uk.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for from ; Mon, 15 Apr 2019 05:55:27 +0100 Received: from b06cxnps4075.portsmouth.uk.ibm.com (9.149.109.197) by e06smtp07.uk.ibm.com (192.168.101.137) with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted; (version=TLSv1/SSLv3 cipher=AES256-GCM-SHA384 bits=256/256) Mon, 15 Apr 2019 05:55:22 +0100 Received: from d06av23.portsmouth.uk.ibm.com (d06av23.portsmouth.uk.ibm.com [9.149.105.59]) by b06cxnps4075.portsmouth.uk.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id x3F4tL3b52428994 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Mon, 15 Apr 2019 04:55:21 GMT Received: from d06av23.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 8EB54A4051; Mon, 15 Apr 2019 04:55:21 +0000 (GMT) Received: from d06av23.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 5F64AA4057; Mon, 15 Apr 2019 04:55:20 +0000 (GMT) Received: from rapoport-lnx (unknown [9.148.8.112]) by d06av23.portsmouth.uk.ibm.com (Postfix) with ESMTPS; Mon, 15 Apr 2019 04:55:20 +0000 (GMT) Date: Mon, 15 Apr 2019 07:55:18 +0300 From: Mike Rapoport To: Chen Zhou Cc: tglx@linutronix.de, mingo@redhat.com, bp@alien8.de, ebiederm@xmission.com, catalin.marinas@arm.com, will.deacon@arm.com, akpm@linux-foundation.org, ard.biesheuvel@linaro.org, horms@verge.net.au, takahiro.akashi@linaro.org, linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, kexec@lists.infradead.org, linux-mm@kvack.org, wangkefeng.wang@huawei.com Subject: Re: [PATCH v3 3/4] arm64: kdump: support more than one crash kernel regions References: <20190409102819.121335-1-chenzhou10@huawei.com> <20190409102819.121335-4-chenzhou10@huawei.com> <20190414121315.GD20947@rapoport-lnx> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.24 (2015-08-30) X-TM-AS-GCONF: 00 x-cbid: 19041504-0028-0000-0000-00000360CBC8 X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 19041504-0029-0000-0000-0000241FFD0A Message-Id: <20190415045518.GA6167@rapoport-lnx> X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:,, definitions=2019-04-15_03:,, signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 priorityscore=1501 malwarescore=0 suspectscore=2 phishscore=0 bulkscore=0 spamscore=0 clxscore=1015 lowpriorityscore=0 mlxscore=0 impostorscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1810050000 definitions=main-1904150032 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Apr 15, 2019 at 10:27:30AM +0800, Chen Zhou wrote: > Hi Mike, > > On 2019/4/14 20:13, Mike Rapoport wrote: > > Hi, > > > > On Tue, Apr 09, 2019 at 06:28:18PM +0800, Chen Zhou wrote: > >> After commit (arm64: kdump: support reserving crashkernel above 4G), > >> there may be two crash kernel regions, one is below 4G, the other is > >> above 4G. > >> > >> Crash dump kernel reads more than one crash kernel regions via a dtb > >> property under node /chosen, > >> linux,usable-memory-range = > > > > Somehow I've missed that previously, but how is this supposed to work on > > EFI systems? > > Whatever the way in which the systems work, there is FDT pointer(__fdt_pointer) > in arm64 kernel and file /sys/firmware/fdt will be created in late_initcall. > > Kexec-tools read and update file /sys/firmware/fdt in EFI systems to support kdump to > boot capture kernel. > > For supporting more than one crash kernel regions, kexec-tools make changes accordingly. > Details are in below: > http://lists.infradead.org/pipermail/kexec/2019-April/022792.html Thanks for the clarification! > Thanks, > Chen Zhou > > > > >> Signed-off-by: Chen Zhou > >> --- > >> arch/arm64/mm/init.c | 66 ++++++++++++++++++++++++++++++++++++++++-------- > >> include/linux/memblock.h | 6 +++++ > >> mm/memblock.c | 7 ++--- > >> 3 files changed, 66 insertions(+), 13 deletions(-) > >> > >> diff --git a/arch/arm64/mm/init.c b/arch/arm64/mm/init.c > >> index 3bebddf..0f18665 100644 > >> --- a/arch/arm64/mm/init.c > >> +++ b/arch/arm64/mm/init.c > >> @@ -65,6 +65,11 @@ phys_addr_t arm64_dma_phys_limit __ro_after_init; > >> > >> #ifdef CONFIG_KEXEC_CORE > >> > >> +/* at most two crash kernel regions, low_region and high_region */ > >> +#define CRASH_MAX_USABLE_RANGES 2 > >> +#define LOW_REGION_IDX 0 > >> +#define HIGH_REGION_IDX 1 > >> + > >> /* > >> * reserve_crashkernel() - reserves memory for crash kernel > >> * > >> @@ -297,8 +302,8 @@ static int __init early_init_dt_scan_usablemem(unsigned long node, > >> const char *uname, int depth, void *data) > >> { > >> struct memblock_region *usablemem = data; > >> - const __be32 *reg; > >> - int len; > >> + const __be32 *reg, *endp; > >> + int len, nr = 0; > >> > >> if (depth != 1 || strcmp(uname, "chosen") != 0) > >> return 0; > >> @@ -307,22 +312,63 @@ static int __init early_init_dt_scan_usablemem(unsigned long node, > >> if (!reg || (len < (dt_root_addr_cells + dt_root_size_cells))) > >> return 1; > >> > >> - usablemem->base = dt_mem_next_cell(dt_root_addr_cells, ®); > >> - usablemem->size = dt_mem_next_cell(dt_root_size_cells, ®); > >> + endp = reg + (len / sizeof(__be32)); > >> + while ((endp - reg) >= (dt_root_addr_cells + dt_root_size_cells)) { > >> + usablemem[nr].base = dt_mem_next_cell(dt_root_addr_cells, ®); > >> + usablemem[nr].size = dt_mem_next_cell(dt_root_size_cells, ®); > >> + > >> + if (++nr >= CRASH_MAX_USABLE_RANGES) > >> + break; > >> + } > >> > >> return 1; > >> } > >> > >> static void __init fdt_enforce_memory_region(void) > >> { > >> - struct memblock_region reg = { > >> - .size = 0, > >> - }; > >> + int i, cnt = 0; > >> + struct memblock_region regs[CRASH_MAX_USABLE_RANGES]; > >> + > >> + memset(regs, 0, sizeof(regs)); > >> + of_scan_flat_dt(early_init_dt_scan_usablemem, regs); > >> + > >> + for (i = 0; i < CRASH_MAX_USABLE_RANGES; i++) > >> + if (regs[i].size) > >> + cnt++; > >> + else > >> + break; > >> + > >> + if (cnt - 1 == LOW_REGION_IDX) > >> + memblock_cap_memory_range(regs[LOW_REGION_IDX].base, > >> + regs[LOW_REGION_IDX].size); > >> + else if (cnt - 1 == HIGH_REGION_IDX) { > >> + /* > >> + * Two crash kernel regions, cap the memory range > >> + * [regs[LOW_REGION_IDX].base, regs[HIGH_REGION_IDX].end] > >> + * and then remove the memory range in the middle. > >> + */ > >> + int start_rgn, end_rgn, i, ret; > >> + phys_addr_t mid_base, mid_size; > >> + > >> + mid_base = regs[LOW_REGION_IDX].base + regs[LOW_REGION_IDX].size; > >> + mid_size = regs[HIGH_REGION_IDX].base - mid_base; > >> + ret = memblock_isolate_range(&memblock.memory, mid_base, > >> + mid_size, &start_rgn, &end_rgn); > >> > >> - of_scan_flat_dt(early_init_dt_scan_usablemem, ®); > >> + if (ret) > >> + return; > >> > >> - if (reg.size) > >> - memblock_cap_memory_range(reg.base, reg.size); > >> + memblock_cap_memory_range(regs[LOW_REGION_IDX].base, > >> + regs[HIGH_REGION_IDX].base - > >> + regs[LOW_REGION_IDX].base + > >> + regs[HIGH_REGION_IDX].size); > >> + for (i = end_rgn - 1; i >= start_rgn; i--) { > >> + if (!memblock_is_nomap(&memblock.memory.regions[i])) > >> + memblock_remove_region(&memblock.memory, i); > >> + } > >> + memblock_remove_range(&memblock.reserved, mid_base, > >> + mid_base + mid_size); > >> + } > >> } > >> > >> void __init arm64_memblock_init(void) > >> diff --git a/include/linux/memblock.h b/include/linux/memblock.h > >> index 294d5d8..787d252 100644 > >> --- a/include/linux/memblock.h > >> +++ b/include/linux/memblock.h > >> @@ -110,9 +110,15 @@ void memblock_discard(void); > >> > >> phys_addr_t memblock_find_in_range(phys_addr_t start, phys_addr_t end, > >> phys_addr_t size, phys_addr_t align); > >> +void memblock_remove_region(struct memblock_type *type, unsigned long r); > >> void memblock_allow_resize(void); > >> int memblock_add_node(phys_addr_t base, phys_addr_t size, int nid); > >> int memblock_add(phys_addr_t base, phys_addr_t size); > >> +int memblock_isolate_range(struct memblock_type *type, > >> + phys_addr_t base, phys_addr_t size, > >> + int *start_rgn, int *end_rgn); > >> +int memblock_remove_range(struct memblock_type *type, > >> + phys_addr_t base, phys_addr_t size); > >> int memblock_remove(phys_addr_t base, phys_addr_t size); > >> int memblock_free(phys_addr_t base, phys_addr_t size); > >> int memblock_reserve(phys_addr_t base, phys_addr_t size); > >> diff --git a/mm/memblock.c b/mm/memblock.c > >> index e7665cf..1846e2d 100644 > >> --- a/mm/memblock.c > >> +++ b/mm/memblock.c > >> @@ -357,7 +357,8 @@ phys_addr_t __init_memblock memblock_find_in_range(phys_addr_t start, > >> return ret; > >> } > >> > >> -static void __init_memblock memblock_remove_region(struct memblock_type *type, unsigned long r) > >> +void __init_memblock memblock_remove_region(struct memblock_type *type, > >> + unsigned long r) > >> { > >> type->total_size -= type->regions[r].size; > >> memmove(&type->regions[r], &type->regions[r + 1], > >> @@ -724,7 +725,7 @@ int __init_memblock memblock_add(phys_addr_t base, phys_addr_t size) > >> * Return: > >> * 0 on success, -errno on failure. > >> */ > >> -static int __init_memblock memblock_isolate_range(struct memblock_type *type, > >> +int __init_memblock memblock_isolate_range(struct memblock_type *type, > >> phys_addr_t base, phys_addr_t size, > >> int *start_rgn, int *end_rgn) > >> { > >> @@ -784,7 +785,7 @@ static int __init_memblock memblock_isolate_range(struct memblock_type *type, > >> return 0; > >> } > >> > >> -static int __init_memblock memblock_remove_range(struct memblock_type *type, > >> +int __init_memblock memblock_remove_range(struct memblock_type *type, > >> phys_addr_t base, phys_addr_t size) > >> { > >> int start_rgn, end_rgn; > >> -- > >> 2.7.4 > >> > > > -- Sincerely yours, Mike.