Received: by 2002:ac0:98c7:0:0:0:0:0 with SMTP id g7-v6csp6260412imd; Wed, 31 Oct 2018 09:06:17 -0700 (PDT) X-Google-Smtp-Source: AJdET5c0XUF0PUZnfy9Dn8Kjiev3L1h4mnzAJF2tdQjaEbR3wbXg4fqx1HrhEcZSsjAAYCCakehe X-Received: by 2002:a17:902:6a8b:: with SMTP id n11-v6mr3960372plk.16.1541001977639; Wed, 31 Oct 2018 09:06:17 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1541001977; cv=none; d=google.com; s=arc-20160816; b=alfRJSkq0pwpWoRkDvtMpgXC44k0AR6D4ur2ZsuNPA532nzpcbWfT3D5o8uZ3u/sCH S1O9jda11AiQKZKxSNfJ3cVWC0fMfr7OnDUt1JfKa7VIrw/qFSZ8jtv3wumjGlAdeYAq T690eSD8v9/TFeSn9ifl/qycGK4c4V3aXTQgdTrMyB74lP0WYsRRbxPNLsN+F82buzNJ pNKJ1PR3QNsRKfAKEFopnaHvfg5YdZ00j7DHVZXasaTG3Z304vEG98QaQXMVgpLAWuD1 zl9qIVnHsu2A7A1jSR34+EYk9DZAQDWsYSOlT6teQinyN8BSNwxEhdIYvWRttPcioaUF ETpA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:date:cc:to:from:subject:message-id; bh=DkDV8qw5pJpw9em0nN+zpY5DYT79H5NNmyWSuWgSAu8=; b=h5BKsY6aekbOR5/Bhj6Y0t+unqCteTRPJhVo3SEyHipAO1m0abwJbTw1azidAavyX4 kSdNwSXMpUr6Wn7DpkIBkSit2DDlIMkhyqZtjP4r1CH224g4NIlfaxi200+i10uc7q8O JsH04dHo5f+DgkMgEl/EJnyXiniH6wdF/wYMI8o76QUiTKU/+oN35MgcGbeBYFtt4tvO Erm/8818c1cJWtfDEZSCrSpYJFtFUNV6IIfLG4E6Io2fsEQpCW8SFg++PCQBHbA8WYYA tHitmC02Aig7oDWsHTmDs/dUozIEri9albE2ZdTx3EfTi4uLWIrdFnpizFSvqqpXuwbz xkgQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id i13-v6si27073601pgo.128.2018.10.31.09.05.52; Wed, 31 Oct 2018 09:06:17 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729800AbeKABDm (ORCPT + 99 others); Wed, 31 Oct 2018 21:03:42 -0400 Received: from mga11.intel.com ([192.55.52.93]:47312 "EHLO mga11.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1729782AbeKABDm (ORCPT ); Wed, 31 Oct 2018 21:03:42 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga003.jf.intel.com ([10.7.209.27]) by fmsmga102.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 31 Oct 2018 09:05:05 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.54,447,1534834800"; d="scan'208";a="96609430" Received: from ahduyck-desk1.jf.intel.com ([10.7.198.76]) by orsmga003.jf.intel.com with ESMTP; 31 Oct 2018 09:05:04 -0700 Message-ID: Subject: Re: [mm PATCH v4 3/6] mm: Use memblock/zone specific iterator for handling deferred page init From: Alexander Duyck To: Pasha Tatashin , "linux-mm@kvack.org" , "akpm@linux-foundation.org" Cc: "mhocko@suse.com" , "dave.jiang@intel.com" , "linux-kernel@vger.kernel.org" , "willy@infradead.org" , "davem@davemloft.net" , "yi.z.zhang@linux.intel.com" , "khalid.aziz@oracle.com" , "rppt@linux.vnet.ibm.com" , "vbabka@suse.cz" , "sparclinux@vger.kernel.org" , "dan.j.williams@intel.com" , "ldufour@linux.vnet.ibm.com" , "mgorman@techsingularity.net" , "mingo@kernel.org" , "kirill.shutemov@linux.intel.com" Date: Wed, 31 Oct 2018 09:05:04 -0700 In-Reply-To: <5b937f29-a6e1-6622-b035-246229021d3e@microsoft.com> References: <20181017235043.17213.92459.stgit@localhost.localdomain> <20181017235419.17213.68425.stgit@localhost.localdomain> <5b937f29-a6e1-6622-b035-246229021d3e@microsoft.com> Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.28.5 (3.28.5-1.fc28) Mime-Version: 1.0 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, 2018-10-31 at 15:40 +0000, Pasha Tatashin wrote: > > On 10/17/18 7:54 PM, Alexander Duyck wrote: > > This patch introduces a new iterator for_each_free_mem_pfn_range_in_zone. > > > > This iterator will take care of making sure a given memory range provided > > is in fact contained within a zone. It takes are of all the bounds checking > > we were doing in deferred_grow_zone, and deferred_init_memmap. In addition > > it should help to speed up the search a bit by iterating until the end of a > > range is greater than the start of the zone pfn range, and will exit > > completely if the start is beyond the end of the zone. > > > > This patch adds yet another iterator called > > for_each_free_mem_range_in_zone_from and then uses it to support > > initializing and freeing pages in groups no larger than MAX_ORDER_NR_PAGES. > > By doing this we can greatly improve the cache locality of the pages while > > we do several loops over them in the init and freeing process. > > > > We are able to tighten the loops as a result since we only really need the > > checks for first_init_pfn in our first iteration and after that we can > > assume that all future values will be greater than this. So I have added a > > function called deferred_init_mem_pfn_range_in_zone that primes the > > iterators and if it fails we can just exit. > > > > On my x86_64 test system with 384GB of memory per node I saw a reduction in > > initialization time from 1.85s to 1.38s as a result of this patch. > > > > Signed-off-by: Alexander Duyck > > Hi Alex, > > Could you please split this patch into two parts: > > 1. Add deferred_init_maxorder() > 2. Add memblock iterator? > > This would allow a better bisecting in case of problems. Chaning two > loops into deferred_init_maxorder() while a good idea, is still > non-trivial and might lead to bugs. > > Thank you, > Pavel I can do that, but I will need to flip the order. I will add the new iterator first and then deferred_init_maxorder. Otherwise the intermediate step ends up being too much throw-away code. - Alex