Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp97349imm; Sun, 8 Jul 2018 20:32:52 -0700 (PDT) X-Google-Smtp-Source: AAOMgpeq18kr7LWhfC67SzzVO9mR1b4nFh8OU1dIdKyABvSAlcyFzPsvhoBY31Z286RthtkXAlUY X-Received: by 2002:a17:902:bd97:: with SMTP id q23-v6mr18858836pls.238.1531107172789; Sun, 08 Jul 2018 20:32:52 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1531107172; cv=none; d=google.com; s=arc-20160816; b=ywdeurcmd9HUwFl1TPNgCe6AhfIO6CD1nEDsJKsnMOCicQnEkqr2IkmKthbrP+6CA2 L65UTJ3J/MBz6L8QIm0UDAsX9iBEdo8iUPD0LHi2bosnoISeJsPQ5gqxxMaGQXQjxUrX Hj/W2mVF4D3zjZGK1YKS4i1MHZD7OuTEkty8ZTEX9IT/VmQKqMyptvqtHyFaPTHPvATw F/Y31+GKfGHS8TQMyJtVEIZY2XaLI1eCSj1+ZbZmIO1IKX8DqNS+YfuUbZpGA+n0gxaG VW1riTayA23ExH5tZQCBgz3fz6x0NFpi+alyuh8bl3D6DtgBwfFQNW78AipbCg12rXUR +f/Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:in-reply-to :mime-version:user-agent:date:message-id:from:references:cc:to :subject:dkim-signature:arc-authentication-results; bh=JcdSlO+Vd+LZVjphZzOLIi3zZFzpRrgtb1ZpuCL41S4=; b=qk2t5b+HWOl6a/GzJpIbhQNdWYqANRi+w/2INTMoL3QI8rQjMEEtSovECFMMLybDjU CzadoRQejKmlz4BdmA9EGeovFuf5sr5TzFfMJaEH81Zodxnt/NXSUelaQ1OEFf3q4MA+ cSMWO4OdAm3sSzmacaCBoFdL97boEsJsUIxP2Op4RdCjajvZBINOjqeH4XGy7P4HV83e X9gQh70GmOnyySNuX38bX+YD042vfhq5s/6qsUlI4UrPWDhh5EXfek9DvnTjaoG2uex9 oZ9vnLc/ivP6Zn7EOI48CN+f0PkP/L515g1w8ZvpXlBwi4lFifcG1mABvrWoYUcomFSn vaaQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=HitN0QRK; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id d2-v6si13584854plh.387.2018.07.08.20.32.38; Sun, 08 Jul 2018 20:32:52 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=HitN0QRK; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933165AbeGIDcA (ORCPT + 99 others); Sun, 8 Jul 2018 23:32:00 -0400 Received: from mail-pf0-f193.google.com ([209.85.192.193]:45451 "EHLO mail-pf0-f193.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932656AbeGIDb7 (ORCPT ); Sun, 8 Jul 2018 23:31:59 -0400 Received: by mail-pf0-f193.google.com with SMTP id i26-v6so1179424pfo.12 for ; Sun, 08 Jul 2018 20:31:58 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=subject:to:cc:references:from:message-id:date:user-agent :mime-version:in-reply-to:content-transfer-encoding; bh=JcdSlO+Vd+LZVjphZzOLIi3zZFzpRrgtb1ZpuCL41S4=; b=HitN0QRK/Mu3qU1gS3lxnqXkxwQmlVUaCtbNzklIGR395yaOivLsPNYwHDKWWWx2li EnI4RkabdH2Mh7Jgzgsyv5Rc4GNTxaQePr0DlvYbFvR/kgV2+c2w4WERnwIne3rJuC/Y yKfaKOcf27pmDSHEcVI29u2XdXW41ALqUwQL2aVkT+nu7iukx2Ixr/RH+gMmAtGilnxe vK3V+H2xLuBTw1k42zfqcbRK+H+AUYqHG1cYu39bGxgi7io2gBUvUn3muRhipar8TeXm M0zQCQfvtuf1+zbXbgV+mu2rAD+x3jQlRJG5aIIwGjfxo6toV0qX++8vTbaVtx1wfMNL /6Kg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-transfer-encoding; bh=JcdSlO+Vd+LZVjphZzOLIi3zZFzpRrgtb1ZpuCL41S4=; b=ahBOXkw4iXFNlH8695XS1IcDY+8+c2GGBWCFeSMssdOa1e1i11DZNqUrMYkmqxlVDw X2l6Hb3LZGY80CLFb4oYzN1Q/g254mWLWxIGyNnVucV787NdAyWYCvK9Ei/ZSdhcp23G sE4Ei2MZH50miy+sSua4YE2vUcN2tLK1RIJSdXObagcbovQmjxy0j4GIAf7K18ARvc0L xadQPd9yqvm22/+kfcugM7ndjHc4JVPOz1r+FvEvpKkiMJxXsE/PGt5Vx5iWz7QbHWPe rT9acW+LGGWWaHNi9OHZFmhL7Nj9grTVIGHeqoR18Toe3n6UDj/z7e0Kw4plowJ18erH U5ug== X-Gm-Message-State: APt69E3Fq65lsWpOhKIBja3b+KUubDVHDIf9dJ9FHLrIjVyYB3jfX60t OPop0P6FpPlswA/Edw7J4kw= X-Received: by 2002:a65:6110:: with SMTP id z16-v6mr12950810pgu.412.1531107118579; Sun, 08 Jul 2018 20:31:58 -0700 (PDT) Received: from [0.0.0.0] (65.49.234.56.16clouds.com. [65.49.234.56]) by smtp.gmail.com with ESMTPSA id d18-v6sm22077637pfn.118.2018.07.08.20.30.56 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Sun, 08 Jul 2018 20:31:57 -0700 (PDT) Subject: Re: [RESEND PATCH v10 2/6] mm: page_alloc: remain memblock_next_valid_pfn() on arm/arm64 To: Andrew Morton , Daniel Vacek Cc: Russell King , Catalin Marinas , Will Deacon , Mark Rutland , Ard Biesheuvel , Michal Hocko , Wei Yang , Kees Cook , Laura Abbott , Vladimir Murzin , Philip Derrin , AKASHI Takahiro , James Morse , Steve Capper , Pavel Tatashin , Gioh Kim , Vlastimil Babka , Mel Gorman , Johannes Weiner , Kemi Wang , Petr Tesarik , YASUAKI ISHIMATSU , Andrey Ryabinin , Nikolay Borisov , Daniel Jordan , Daniel Vacek , Eugeniu Rosca , linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org, Jia He References: <1530867675-9018-1-git-send-email-hejianet@gmail.com> <1530867675-9018-3-git-send-email-hejianet@gmail.com> <20180706153709.6bcc76b0245f239f1d1dcc8a@linux-foundation.org> From: Jia He Message-ID: <4895a92f-f4c2-b200-3c7c-4fe8c4596f32@gmail.com> Date: Mon, 9 Jul 2018 11:30:58 +0800 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.8.0 MIME-Version: 1.0 In-Reply-To: <20180706153709.6bcc76b0245f239f1d1dcc8a@linux-foundation.org> Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Andew Thanks for the comments On 7/7/2018 6:37 AM, Andrew Morton Wrote: > On Fri, 6 Jul 2018 17:01:11 +0800 Jia He wrote: > >> From: Jia He >> >> Commit b92df1de5d28 ("mm: page_alloc: skip over regions of invalid pfns >> where possible") optimized the loop in memmap_init_zone(). But it causes >> possible panic bug. So Daniel Vacek reverted it later. >> >> But as suggested by Daniel Vacek, it is fine to using memblock to skip >> gaps and finding next valid frame with CONFIG_HAVE_ARCH_PFN_VALID. >> Daniel said: >> "On arm and arm64, memblock is used by default. But generic version of >> pfn_valid() is based on mem sections and memblock_next_valid_pfn() does >> not always return the next valid one but skips more resulting in some >> valid frames to be skipped (as if they were invalid). And that's why >> kernel was eventually crashing on some !arm machines." >> >> About the performance consideration: >> As said by James in b92df1de5, >> "I have tested this patch on a virtual model of a Samurai CPU >> with a sparse memory map. The kernel boot time drops from 109 to >> 62 seconds." >> >> Thus it would be better if we remain memblock_next_valid_pfn on arm/arm64. >> > > We're making a bit of a mess here. mmzone.h: > > ... > #ifndef CONFIG_HAVE_ARCH_PFN_VALID > ... > #define next_valid_pfn(pfn) (pfn + 1) Yes, ^ this line can be removed. > #endif > ... > #ifdef CONFIG_HAVE_MEMBLOCK_PFN_VALID > #define next_valid_pfn(pfn) memblock_next_valid_pfn(pfn) > ... > #else > ... > #ifndef next_valid_pfn > #define next_valid_pfn(pfn) (pfn + 1) > #endif > > I guess it works OK, since CONFIG_HAVE_MEMBLOCK_PFN_VALID depends on > CONFIG_HAVE_ARCH_PFN_VALID. But it could all do with some cleanup and > modernization. > > - Perhaps memblock_next_valid_pfn() should just be called > pfn_valid(). So the header file's responsibility is to provide > pfn_valid() and next_valid_pfn(). > > - CONFIG_HAVE_ARCH_PFN_VALID should go away. The current way of > doing such thnigs is for the arch (or some Kconfig combination) to > define pfn_valid() and next_valid_pfn() in some fashion and to then > ensure that one of them is #defined to something, to indicate that > both of these have been set up. Or something like that. This is what I did in Patch v2, please see [1]. But Daniel opposed it [2] As he said: Now, if any other architecture defines CONFIG_HAVE_ARCH_PFN_VALID and implements it's own version of pfn_valid(), there is no guarantee that it will be based on memblock data or somehow equivalent to the arm implementation, right? I think it make sense, so I introduced the new config CONFIG_HAVE_MEMBLOCK_PFN_VALID instead of using CONFIG_HAVE_ARCH_PFN_VALID how about you ? :-) [1] https://lkml.org/lkml/2018/3/24/71 [2] https://lkml.org/lkml/2018/3/28/231 > > > Secondly, in memmap_init_zone() > >> - if (!early_pfn_valid(pfn)) >> + if (!early_pfn_valid(pfn)) { >> + pfn = next_valid_pfn(pfn) - 1; >> continue; >> + } >> + > > This is weird-looking. next_valid_pfn(pfn) is usually (pfn+1) so it's > a no-op. Sometimes we're calling memblock_next_valid_pfn() and then > backing up one, presumably because the `for' loop ends in `pfn++'. Or > something. Can this please be fully commented or cleaned up? To clean it up, maybe below is not acceptable for you and other experts ? if (!early_pfn_valid(pfn)) { #ifndef XXX continue; } #else pfn = next_valid_pfn(pfn) - 1; continue; } #endif Another way which was suggested by Ard Biesheuvel something like: for (pfn = start_pfn; pfn < end_pfn; pfn = next_valid_pfn(pfn)) ... But it might have impact on memmap_init_zone loop. E.g. context != MEMMAP_EARLY, pfn will not be checked by early_pfn_valid, thus it will change the mem hotplug logic. Sure, as you suggested, I can give more comments in all the cases of different configs/arches for this line. -- Cheers, Jia