Received: by 2002:ac0:a5a6:0:0:0:0:0 with SMTP id m35-v6csp1968998imm; Fri, 7 Sep 2018 08:51:18 -0700 (PDT) X-Google-Smtp-Source: ANB0VdaNYkaP4zVl0QSKgI42Xv0YpiQux5FsqbynDNan3ZDmj/RB9MwDW3JM7coZHhszHJIvSOiu X-Received: by 2002:a63:7107:: with SMTP id m7-v6mr8791895pgc.73.1536335478817; Fri, 07 Sep 2018 08:51:18 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1536335478; cv=none; d=google.com; s=arc-20160816; b=hoIVFkH0ooNs4saf140OxPQPSBHB8/NYX35hKrR4+gAcjioIA45TewXoWYgR6Ajw2i mSOKLmZC/RtphPP2i6kRR8YH8Ft6GMF9Fjd55NkMR0N+xIxZzKfTX6OiS9Hl08vuYNV6 Kol98M+IL69zBFovWAxD+pNhidOoOlzUibMLhELd6aasMIcYX4CyXxUQ0lGIsnHO1ry+ ZbmWZ28b0hcEVzk/zPt1QpcJVquRqEqc+YzZYBhms93hd5K/eH1HpybSDhD1iOO2lntF PanzqHGZL0oYgbG2bMnUkXLJVdfvsIBGD3p5WcEteovQpZlFWtK1CfMubvrk062DtrrZ HULQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=KDTzJr0C7T1PPejjrOC/vZ8VXEZkbC9sMc5rhaVd6Bc=; b=enIMc1WQsm9PWkDE4Burf9bEKVKcrdcxeYkGMak1PeiVPfKc9Da5LTkImF00lM4u2n KvqL6TfNwFteffLMz47lKFXn1dciwM9of7D84XFB7asyXoGVsD08WNfuT+5c8I5lo2q4 PXROayucob8VXcsd5+qCGG0dez8EvAZ8m7w6vlRgvwYIHkpAbVZtMdhh1NJ4oRUmM5R3 Z2GUdW8cP1Qxqa4o0oeLyID1t/lv4otL2eLtnbU96HqdkIFMvqoXUQ4qVf1SFAvZFKTL MXBd6f/anRezGK+BCMtGkl7h3xx9GTiTczUI5+tcFlORWBEIWvI7Xny8qsS3Eo7LsFXh 6PmQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id b66-v6si8949626pga.60.2018.09.07.08.51.03; Fri, 07 Sep 2018 08:51:18 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729947AbeIGTZs (ORCPT + 99 others); Fri, 7 Sep 2018 15:25:48 -0400 Received: from foss.arm.com ([217.140.101.70]:32980 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727821AbeIGTZr (ORCPT ); Fri, 7 Sep 2018 15:25:47 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.72.51.249]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 103D218A; Fri, 7 Sep 2018 07:44:33 -0700 (PDT) Received: from edgewater-inn.cambridge.arm.com (usa-sjc-imap-foss1.foss.arm.com [10.72.51.249]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPA id D455E3F575; Fri, 7 Sep 2018 07:44:32 -0700 (PDT) Received: by edgewater-inn.cambridge.arm.com (Postfix, from userid 1000) id 078CE1AE3828; Fri, 7 Sep 2018 15:44:47 +0100 (BST) Date: Fri, 7 Sep 2018 15:44:47 +0100 From: Will Deacon To: Ard Biesheuvel Cc: Jia He , Russell King , Catalin Marinas , Mark Rutland , Andrew Morton , Michal Hocko , Wei Yang , Kees Cook , Laura Abbott , Vladimir Murzin , Philip Derrin , AKASHI Takahiro , James Morse , Steve Capper , Gioh Kim , Vlastimil Babka , Mel Gorman , Johannes Weiner , Kemi Wang , Petr Tesarik , YASUAKI ISHIMATSU , Andrey Ryabinin , Nikolay Borisov , Daniel Jordan , Daniel Vacek , Eugeniu Rosca , linux-arm-kernel , Linux Kernel Mailing List , Linux-MM , Jia He Subject: Re: [PATCH v11 0/3] remain and optimize memblock_next_valid_pfn on arm and arm64 Message-ID: <20180907144447.GD12788@arm.com> References: <1534907237-2982-1-git-send-email-jia.he@hxt-semitech.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.23 (2014-03-12) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Sep 06, 2018 at 01:24:22PM +0200, Ard Biesheuvel wrote: > On 22 August 2018 at 05:07, Jia He wrote: > > Commit b92df1de5d28 ("mm: page_alloc: skip over regions of invalid pfns > > where possible") optimized the loop in memmap_init_zone(). But it causes > > possible panic bug. So Daniel Vacek reverted it later. > > > > But as suggested by Daniel Vacek, it is fine to using memblock to skip > > gaps and finding next valid frame with CONFIG_HAVE_ARCH_PFN_VALID. > > > > More from what Daniel said: > > "On arm and arm64, memblock is used by default. But generic version of > > pfn_valid() is based on mem sections and memblock_next_valid_pfn() does > > not always return the next valid one but skips more resulting in some > > valid frames to be skipped (as if they were invalid). And that's why > > kernel was eventually crashing on some !arm machines." > > > > About the performance consideration: > > As said by James in b92df1de5, > > "I have tested this patch on a virtual model of a Samurai CPU with a > > sparse memory map. The kernel boot time drops from 109 to 62 seconds." > > Thus it would be better if we remain memblock_next_valid_pfn on arm/arm64. > > > > Besides we can remain memblock_next_valid_pfn, there is still some room > > for improvement. After this set, I can see the time overhead of memmap_init > > is reduced from 27956us to 13537us in my armv8a server(QDF2400 with 96G > > memory, pagesize 64k). I believe arm server will benefit more if memory is > > larger than TBs > > > > OK so we can summarize the benefits of this series as follows: > - boot time on a virtual model of a Samurai CPU drops from 109 to 62 seconds > - boot time on a QDF2400 arm64 server with 96 GB of RAM drops by ~15 > *milliseconds* > > Google was not very helpful in figuring out what a Samurai CPU is and > why we should care about the boot time of Linux running on a virtual > model of it, and the 15 ms speedup is not that compelling either. > > Apologies to Jia that it took 11 revisions to reach this conclusion, > but in /my/ opinion, tweaking the fragile memblock/pfn handling code > for this reason is totally unjustified, and we're better off > disregarding these patches. Oh, we're talking about a *simulator* for the significant boot time improvement here? I didn't realise that, so I agree that the premise of this patch set looks pretty questionable given how much "fun" we've had with the memmap on arm and arm64. Will