Received: by 2002:a05:6358:45e:b0:b5:b6eb:e1f9 with SMTP id 30csp1834831rwe; Sat, 27 Aug 2022 19:11:20 -0700 (PDT) X-Google-Smtp-Source: AA6agR6YgRIxFYnl9ok5fDNx4qaV8R8bWI1diB38kyUNX2faAjMFFm3EXPIbUl2FcY5Bt0VOLbvu X-Received: by 2002:a05:6402:2b8b:b0:43a:5475:f1ae with SMTP id fj11-20020a0564022b8b00b0043a5475f1aemr11738504edb.363.1661652680245; Sat, 27 Aug 2022 19:11:20 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1661652680; cv=none; d=google.com; s=arc-20160816; b=jI7hw0KVytJ7j8HgW+4ctPbBdr9gq9KsDf6EMoTs50xmlOY+BsIuoZxNEG1j0Ig+I9 MQiGQNhS4YqVBfQW7J33VW7otSaZIQrnakMxP/7UsSq2UeOTwrb/5UwBtMr65OrMBkbG 9jbcJxez6b0+BFjyVijMSW3cQWte67fI/xv2Zwt8voiHx1qu7RC9vJ42x3LeiLEga6m+ cokCvsHOJSrfawRalNhOGmQUfnYrwiHmX34sN3ikvxP2wVQi9zVaiosgDoEr5ZB82Q3e VmbKlWRarj7levhdIXCFImnpw262oiBR8EdQ2lr0yvQVsdHz6AJvDFgyqNnQZkAGQw+T eROg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=3fZFEPgbjjmu6J/LmASfOOq+L6kipv2vP4maHTFsNbM=; b=dr7DKpqJT8qgeKKvEja5EntnGbWJawilH14l7reqo2xY+cEwvvbe5gBkIgdJzGFWFm XyEgtR7D+legWHlaTRBmalW8HAQNiYzP8a/5OlkUYOsFxOZ6UW28Yl/cZaMHV7r4FScO jZZl04p3eVLkA7AJKdmcgp+yEPkyG6wtBQpUsKddjUwkCPFtIaz7ahL54Y9V8YT273kr bxBPcZKTP613bPXhZDA4UlDOEQ70Ro7EwDepIYLUCxv/GbPXt7lRc8uGyrHdZDcW212s iymUiOCVQ/Maj0LrpKhxRQKRvn8dBg0nbm25w2ucdGEo9S1AFlmwCAGtjedMigtqEiGW TJWw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=ipILlsGu; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id fh8-20020a1709073a8800b00738466b687dsi3484341ejc.318.2022.08.27.19.10.39; Sat, 27 Aug 2022 19:11:20 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=ipILlsGu; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229778AbiH1B5k (ORCPT + 99 others); Sat, 27 Aug 2022 21:57:40 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:38230 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229458AbiH1B5j (ORCPT ); Sat, 27 Aug 2022 21:57:39 -0400 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id C8BC73ED51 for ; Sat, 27 Aug 2022 18:57:37 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1661651856; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=3fZFEPgbjjmu6J/LmASfOOq+L6kipv2vP4maHTFsNbM=; b=ipILlsGuzhFm8jOPVEZQ9f2FBnAPZbbNpxucgl0ZXaF/ddIHdacibfBws5zqxLYh44Nqg+ XtRxQjecrjnO22qFbdnzzrPEhrZ+EeMFDTkb22hkYg1tN5LRGPKqf3Ncmv9cF/gNAdT1Yc rubUxL1l6WLBfybrcRaAGhDWd8+C+zo= Received: from mimecast-mx02.redhat.com (mimecast-mx02.redhat.com [66.187.233.88]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-376-z8xHsvapMrav6hnIavubbQ-1; Sat, 27 Aug 2022 21:57:33 -0400 X-MC-Unique: z8xHsvapMrav6hnIavubbQ-1 Received: from smtp.corp.redhat.com (int-mx09.intmail.prod.int.rdu2.redhat.com [10.11.54.9]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx02.redhat.com (Postfix) with ESMTPS id 5C727811E76; Sun, 28 Aug 2022 01:57:32 +0000 (UTC) Received: from localhost (ovpn-12-25.pek2.redhat.com [10.72.12.25]) by smtp.corp.redhat.com (Postfix) with ESMTPS id 71506492C3B; Sun, 28 Aug 2022 01:57:31 +0000 (UTC) Date: Sun, 28 Aug 2022 09:57:28 +0800 From: Baoquan He To: linux-kernel@vger.kernel.org, Nicolas Saenz Julienne Cc: linux-arm-kernel@lists.infradead.org, catalin.marinas@arm.com, ardb@kernel.org, rppt@kernel.org, guanghuifeng@linux.alibaba.com, mark.rutland@arm.com, will@kernel.org, linux-mm@kvack.org, thunder.leizhen@huawei.com, wangkefeng.wang@huawei.com, kexec@lists.infradead.org Subject: Re: [PATCH 0/2] arm64, kdump: enforce to take 4G as the crashkernel low memory end Message-ID: References: <20220828005545.94389-1-bhe@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20220828005545.94389-1-bhe@redhat.com> X-Scanned-By: MIMEDefang 2.85 on 10.11.54.9 X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Forgot adding Nicolas when sending patch, add now. On 08/28/22 at 08:55am, Baoquan He wrote: > Problem: > ======= > On arm64, block and section mapping is supported to build page tables. > However, currently it enforces to take base page mapping for the whole > linear mapping if CONFIG_ZONE_DMA or CONFIG_ZONE_DMA32 is enabled and > crashkernel kernel parameter is set. This will cause longer time of the > linear mapping process during bootup and severe performance degradation > during running time. > > Root cause: > ========== > On arm64, crashkernel reservation relies on knowing the upper limit of > low memory zone because it needs to reserve memory in the zone so that > devices' DMA addressing in kdump kernel can be satisfied. However, the > limit on arm64 is variant. And the upper limit can only be decided late > till bootmem_init() is called. > > And we need to map the crashkernel region with base page granularity when > doing linear mapping, because kdump needs to protect the crashkernel region > via set_memory_valid(,0) after kdump kernel loading. However, arm64 doesn't > support well on splitting the built block or section mapping due to some > cpu reststriction [1]. And unfortunately, the linear mapping is done before > bootmem_init(). > > To resolve the above conflict on arm64, the compromise is enforcing to > take base page mapping for the entire linear mapping if crashkernel is > set, and CONFIG_ZONE_DMA or CONFIG_ZONE_DMA32 is enabed. Hence > performance is sacrificed. > > Solution: > ========= > To fix the problem, we should always take 4G as the crashkernel low > memory end in case CONFIG_ZONE_DMA or CONFIG_ZONE_DMA32 is enabled. > With this, we don't need to defer the crashkernel reservation till > bootmem_init() is called to set the arm64_dma_phys_limit. As long as > memblock init is done, we can conclude what is the upper limit of low > memory zone. > > 1) both CONFIG_ZONE_DMA or CONFIG_ZONE_DMA32 are disabled or memblock_start_of_DRAM() > 4G > limit = PHYS_ADDR_MAX+1 (Corner cases) > 2) CONFIG_ZONE_DMA or CONFIG_ZONE_DMA32 are enabled: > limit = 4G (generic case) > > Justification: > ============== > In fact, kdump kernel doesn't need to cover all peripherals' > addressing bits. Only device taken as dump target need be taken care of > and its addressing bits need be satified. Currently, there are two kinds > of dumping, dumped to local storage disk or dumped through network card to > remove storage server. It means only storage disk or netowrk card taken > as dump target need be consider if their addressing bits are satisfied. > For saving memory, we usually generate kdump specific initramfs including > necessary kernel modules for dump target devices. All other unnecessary > kernel modules are excluded and their correspondent devices won't be > initialized during kdump kernel bootup. > > So far, only Raspberry Pi 4 has some peripherals whcih can only address > 30 bits memory range as reported in [2]. Devices on all other arm64 systems > can address 32bits memory range. > > So by enforcing to take 4G as the crashkernel low memory end, the only > risk is if RPi4 owns storage disk or network card which can't address > 32bits memory range because they could be set as dump target. Even if > RPi4 truly has storage devices or network card which can only address 30 > bits memory range, it should be a corner case. We can document it since > crashkernel is more taken as a feature on server. Besides, RPi4 still can > use crashkernel=xM@yM to sepcify a location for 32bits addressing if it > really has that kind of storage device or network card and kdump is expected. > > [1] > https://lore.kernel.org/all/YrIIJkhKWSuAqkCx@arm.com/T/#u > > [2] > [PATCH v6 0/4] Raspberry Pi 4 DMA addressing support > https://lore.kernel.org/linux-arm-kernel/20190911182546.17094-1-nsaenzjulienne@suse.de/T/ > > > ====== > Question to Nicolas: > > Hi Nicolas, > > In cover letter of [2] patchset, you told RPi4 has peripherals which > can only address 30bits range. In below sentence, do you mean "the PCIe, > V3D, GENET" can't address 32bit range, or they have wider view of > address space the same as 40-bit DMA channels? I am confused about that. > > And the storage device or network card on RPi4 can address 32bit range > or 32bit range, do we have document or do you happen to know that? > > """ > The new Raspberry Pi 4 has up to 4GB of memory but most peripherals can > only address the first GB: their DMA address range is > 0xc0000000-0xfc000000 which is aliased to the first GB of physical > memory 0x00000000-0x3c000000. Note that only some peripherals have these > limitations: the PCIe, V3D, GENET, and 40-bit DMA channels have a wider > view of the address space by virtue of being hooked up trough a second > interconnect. > """ > > > Baoquan He (2): > arm64, kdump: enforce to take 4G as the crashkernel low memory end > arm64: remove unneed defer_reserve_crashkernel() and crash_mem_map > > arch/arm64/include/asm/memory.h | 5 ---- > arch/arm64/mm/init.c | 24 ++++++++------- > arch/arm64/mm/mmu.c | 53 ++++++++++++++------------------- > 3 files changed, 36 insertions(+), 46 deletions(-) > > > base-commit: 10d4879f9ef01cc6190fafe4257d06f375bab92c > -- > 2.34.1 >