Received: by 2002:a05:6358:1087:b0:cb:c9d3:cd90 with SMTP id j7csp1627130rwi; Thu, 20 Oct 2022 15:15:35 -0700 (PDT) X-Google-Smtp-Source: AMsMyM78eMaITEmB2v/L66uya7cQ1fHS8HeKM8YMWo8T7N2V8X9AoPdQiRCcpptojPLkmE1jErY+ X-Received: by 2002:a17:902:f0d4:b0:17f:72bd:74f3 with SMTP id v20-20020a170902f0d400b0017f72bd74f3mr16531364pla.109.1666304135024; Thu, 20 Oct 2022 15:15:35 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1666304135; cv=none; d=google.com; s=arc-20160816; b=gx/F04UT5PCxfLiJb6RRtB9Q6w4LinVTo6amECAn2UeD/wjd3wlEUPUwpvlL4P1jU6 GCWPfC7A0XxQyOQL3EqZ0x0DHPFG02ztavVSnpE2xp9pcQA7j9PVrVDB7V9f483iQx++ ZilsNInvUjblzruC4j7sWP/YiVAQXM9ovA4uJ6tHHBfox3Vv63mt8Ju1moDVtAlWi/+P syjXOV+gETyLZEkBsOh1GeVM35+66k6fMgHCuQH6lr0NmK3A0pPklEe/Sn4bnLkkPyxM 1+//CyjyRP1QpGfA03wMkeXhVt5aV3HWVG7J2XS2vDz4jaOyfVcqeV9jkbxDkGYn6qK/ rHQQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from:dkim-signature; bh=XJV3aFcijl0Rz0XlRnZF5dclSHM+OiKYXagTECOXWUs=; b=P6AtYy1SpzGZHT2Nu1bIjQyEuoBY2A4C6JBnW2B2WtgV/AGRQYbeaIu/VuOWpXihNp mZPE01+U/DKES7mXCeQ7d7O3P/ouoQAQK7L9mhfnj+asXb9wlB1gdU3uQGpinrW8t9d9 BdvsK57LXhAiNP8/ENN3gbt4gY708a6QBEv7aT0AwDXhA8pMibDsUBFN1VigGeZIuLLz 0aaexp/lGeSJcZthgOYH2yHhVm0TQcMOfzoV+8RZy8QJ23KsLzkazzUT1C5qdDj2ImVS 1b5eaHDSVNGb6z0UlvGNVS+y3vjRTQqoc1tUWHzAwiyzYsgEMJO4szmr8TNN9NrBDHjK Vg9A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=mhSDba1z; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id n11-20020a635c4b000000b00456b507fb2asi22550579pgm.446.2022.10.20.15.15.13; Thu, 20 Oct 2022 15:15:35 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=mhSDba1z; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229613AbiJTVya (ORCPT + 99 others); Thu, 20 Oct 2022 17:54:30 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:53202 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229576AbiJTVy2 (ORCPT ); Thu, 20 Oct 2022 17:54:28 -0400 Received: from mail-qk1-x729.google.com (mail-qk1-x729.google.com [IPv6:2607:f8b0:4864:20::729]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 19F596438; Thu, 20 Oct 2022 14:54:21 -0700 (PDT) Received: by mail-qk1-x729.google.com with SMTP id a18so924518qko.0; Thu, 20 Oct 2022 14:54:21 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=content-transfer-encoding:mime-version:message-id:date:subject:cc :to:from:from:to:cc:subject:date:message-id:reply-to; bh=XJV3aFcijl0Rz0XlRnZF5dclSHM+OiKYXagTECOXWUs=; b=mhSDba1z9+BM+IWhMevt2Zop1ZxgI5vtmX5lEIxyybt5IFnrl63bEj9L06UWGe4Qyj Xh/VFcCdOIlTEXPIAuLMElK/9X2NLL8Xzxgxx0JErorvkGNAIpD6am2MMcie+7j5b5vO f1MRedrX5cqtmrOkuu8sxdcuN0Hwe6AP38s1kkny9SdIMjdyEfTXrbkZ6bO8KyxtESCx Mkt9SAKXBXsE89tXA+vISzKSht45tmKyLvFcnC2QPmatgyY3BBRkC/Fc+BNFns+IuIDL f3NjTc8DIXlDssWaLdR05AHBxi1y/lzgM+XYwxZOBTzai43oKzRsI7ysCjgZ6PHgXyRN rT2A== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-transfer-encoding:mime-version:message-id:date:subject:cc :to:from:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=XJV3aFcijl0Rz0XlRnZF5dclSHM+OiKYXagTECOXWUs=; b=pjCUd8U4AsJ6E2akOt5hGNNNTOQcCQIczpckBC0VrzwKC0rgjqrdQk7zAzIO5PRNqZ CJe8X4FHeHqT6LLL5P9UEjmRBFxWhVeAidoIL/hB7DYkGTOzbSNP/ZpyG6S2ss/J1c+a PIHGb0oQ8I1AKTqn6QswFNU7/uQYxckVmJGQ2iWju3B1ALlq3fRSGSEHyjKN1cixb6od e/Sk/iM16Hg4D3ynBFQZp4zIhcp+wC1zmq1VxYBvjk7zQXaTg3q0iEJEntvmy8U/Twwz P+FbEy6Wt4aaccezy1hDbI9AkRCVJucg8b3r+8E3hKhFqCV7jMcH8DQEfHMYfVoo1IGp 0DfA== X-Gm-Message-State: ACrzQf3NuhvHoeQH4MMUWGm+7fUKKeehmkf+T48WnMV9XaT41o7wmYOO 5subbbPsrD3qCOZvkEZyQC8= X-Received: by 2002:a05:620a:2697:b0:6cf:33cd:2bd2 with SMTP id c23-20020a05620a269700b006cf33cd2bd2mr11114790qkp.341.1666302861044; Thu, 20 Oct 2022 14:54:21 -0700 (PDT) Received: from stbirv-lnx-3.igp.broadcom.net ([192.19.223.252]) by smtp.gmail.com with ESMTPSA id bj41-20020a05620a192900b006bb29d932e1sm8121067qkb.105.2022.10.20.14.54.18 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 20 Oct 2022 14:54:20 -0700 (PDT) From: Doug Berger To: Andrew Morton Cc: Jonathan Corbet , Mike Rapoport , Borislav Petkov , "Paul E. McKenney" , Neeraj Upadhyay , Randy Dunlap , Damien Le Moal , Muchun Song , Vlastimil Babka , Johannes Weiner , Michal Hocko , KOSAKI Motohiro , Mel Gorman , Mike Kravetz , Florian Fainelli , David Hildenbrand , Oscar Salvador , Joonsoo Kim , linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org, Doug Berger Subject: [PATCH v3 0/9] mm: introduce Designated Movable Blocks Date: Thu, 20 Oct 2022 14:53:09 -0700 Message-Id: <20221020215318.4193269-1-opendmb@gmail.com> X-Mailer: git-send-email 2.25.1 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,FREEMAIL_FROM, RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org MOTIVATION: Some Broadcom devices (e.g. 7445, 7278) contain multiple memory controllers with each mapped in a different address range within a Uniform Memory Architecture. Some users of these systems have expressed the desire to locate ZONE_MOVABLE memory on each memory controller to allow user space intensive processing to make better use of the additional memory bandwidth. Unfortunately, the historical monotonic layout of zones would mean that if the lowest addressed memory controller contains ZONE_MOVABLE memory then all of the memory available from memory controllers at higher addresses must also be in the ZONE_MOVABLE zone. This would force all kernel memory accesses onto the lowest addressed memory controller and significantly reduce the amount of memory available for non-movable allocations. The main objective of this patch set is therefore to allow a block of memory to be designated as part of the ZONE_MOVABLE zone where it will always only be used by the kernel page allocator to satisfy requests for movable pages. The term Designated Movable Block is introduced here to represent such a block. The favored implementation allows extension of the 'movablecore' kernel parameter to allow specification of a base address and support for multiple blocks. The existing 'movablecore' mechanisms are retained. BACKGROUND: NUMA architectures support distributing movablecore memory across each node, but it is undesirable to introduce the overhead and complexities of NUMA on systems that don't have a Non-Uniform Memory Architecture. Commit 342332e6a925 ("mm/page_alloc.c: introduce kernelcore=mirror option") also depends on zone overlap to support sytems with multiple mirrored ranges. Commit c6f03e2903c9 ("mm, memory_hotplug: remove zone restrictions") embraced overlapped zones for memory hotplug. This commit set follows their lead to allow the ZONE_MOVABLE zone to overlap other zones. Designated Movable Blocks are made absent from overlapping zones and present within the ZONE_MOVABLE zone. I initially investigated an implementation using a Designated Movable migrate type in line with comments[1] made by Mel Gorman regarding a "sticky" MIGRATE_MOVABLE type to avoid using ZONE_MOVABLE. However, this approach was riskier since it was much more instrusive on the allocation paths. Ultimately, the progress made by the memory hotplug folks to expand the ZONE_MOVABLE functionality convinced me to follow this approach. Changes in v3: - removed OTHER OPPORTUNITIES and NOTES from this cover letter. - prevent the creation of empty zones instead of adding extra info to zoneinfo. - size the ZONE_MOVABLE span to the minimum necessary to cover pages within the zone to be more intuitive. - removed "real" from variable names that were consolidated. - rebased to akpm-mm/master (i.e. Linux 6.1-rc1). Changes in v2: - first three commits upstreamed separately [3], [4], and [5]. - commits 04-06 submitted separately [6]. - Corrected errors "Reported-by: kernel test robot " - Deferred commits after 15 to simplify review of the base functionality. - minor reorganization of commit 13. v2: https://lore.kernel.org/linux-mm/20220928223301.375229-1-opendmb@gmail.com/ v1: https://lore.kernel.org/linux-mm/20220913195508.3511038-1-opendmb@gmail.com/ [1] https://lore.kernel.org/lkml/20160428103927.GM2858@techsingularity.net/ [2] https://lore.kernel.org/lkml/1401260672-28339-1-git-send-email-iamjoonsoo.kim@lge.com [3] https://lore.kernel.org/linux-mm/20220914023913.1855924-1-zi.yan@sent.com [4] https://lore.kernel.org/linux-mm/20220823030209.57434-2-linmiaohe@huawei.com [5] https://lore.kernel.org/linux-mm/20220914190917.3517663-1-opendmb@gmail.com [6] https://lore.kernel.org/linux-mm/20220921223639.1152392-1-opendmb@gmail.com/ Doug Berger (9): lib/show_mem.c: display MovableOnly mm/page_alloc: calculate node_spanned_pages from pfns mm/page_alloc: prevent creation of empty zones mm/page_alloc.c: allow oversized movablecore mm/page_alloc: introduce init_reserved_pageblock() memblock: introduce MEMBLOCK_MOVABLE flag mm/dmb: Introduce Designated Movable Blocks mm/page_alloc: make alloc_contig_pages DMB aware mm/page_alloc: allow base for movablecore .../admin-guide/kernel-parameters.txt | 14 +- include/linux/dmb.h | 29 +++ include/linux/gfp.h | 5 +- include/linux/memblock.h | 8 + lib/show_mem.c | 2 +- mm/Kconfig | 12 ++ mm/Makefile | 1 + mm/cma.c | 15 +- mm/dmb.c | 91 +++++++++ mm/memblock.c | 30 ++- mm/page_alloc.c | 188 +++++++++++++----- 11 files changed, 338 insertions(+), 57 deletions(-) create mode 100644 include/linux/dmb.h create mode 100644 mm/dmb.c -- 2.25.1