Received: by 2002:a25:23cc:0:0:0:0:0 with SMTP id j195csp261034ybj; Wed, 6 May 2020 16:59:29 -0700 (PDT) X-Google-Smtp-Source: APiQypKG+1DYMU3CHLrydnNfeNaCEeQIHLUBZ0SOUc054ez9Xbn82wXgG1DMOfGePivRIz3X2x+v X-Received: by 2002:a50:9dcd:: with SMTP id l13mr9630035edk.126.1588809568952; Wed, 06 May 2020 16:59:28 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1588809568; cv=none; d=google.com; s=arc-20160816; b=h2M3J54DXwyYLEZCquJlwfBQdhy7hubF01+0BZfdaEoACvhMc321SreqwQ1Yr84jAL 33i2I9XXGxGcG0uEz1Kmz70fWQX4ZrSTrk9uM2VYcy2bNxenPlDj8s1nYdaW6RzHgbFL BnJ9zxZGX4KDXBFOKZzUDrRFVO8JWE0PG6WC/SMpRh0JRSVGsiAJtUxTD1PE6kMTLlw/ eJIdQAchPdu4DhBhSthrj6WC0WOZ+p6/CN1RVxLAN0c5Xywaz6ynZ/pB9g6/YupUzel9 I6GZvEMHNLq2nGzQSh3BJLw+xvCGPRMESK4KdKAHiw2hb8qg2DI0h75/zbya8Dlb+zUq BCNg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature; bh=b20tuqEZWMS3xbLdEetDOfYwGb0aFxf2MGC5L4I9mYw=; b=ysNuTkhIpnekicDcE8lOg8wWcuu/YjHeUoCIafhwfsEBYcUnjTmyq4AEjcbD+G0spM F7cFNWwg44KwYgxNM4q/MY7mj3eTxVHLLODlLkLYBazZwK85mSDeDkE6Xx+bLlCqazgT y7uzWGig+Wm2MJinf0GQBRixbUz3ZaL2K3kH1j7VpvjaDQkjjkPIlbT0fQuL8TZHgjDB NBJK+m1bFCq5WmReiMaK2v/TA10xuCTHxQ7qx4sT0WE+6zb/Vbz/Qx4fPaOqfMdFsZcl 5nPfgVeJQd8yFQsAZfJni7dLhn2V5m+KKLriIiVClbIw07vCUhD/pfgQebbzu50mGC/r 1UWw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@oracle.com header.s=corp-2020-01-29 header.b="LU/8X2og"; spf=pass (google.com: domain of linux-crypto-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-crypto-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=oracle.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id qh10si2262361ejb.49.2020.05.06.16.58.51; Wed, 06 May 2020 16:59:28 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-crypto-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@oracle.com header.s=corp-2020-01-29 header.b="LU/8X2og"; spf=pass (google.com: domain of linux-crypto-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-crypto-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=oracle.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729243AbgEFXCA (ORCPT + 99 others); Wed, 6 May 2020 19:02:00 -0400 Received: from userp2120.oracle.com ([156.151.31.85]:58308 "EHLO userp2120.oracle.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1730228AbgEFXCA (ORCPT ); Wed, 6 May 2020 19:02:00 -0400 Received: from pps.filterd (userp2120.oracle.com [127.0.0.1]) by userp2120.oracle.com (8.16.0.42/8.16.0.42) with SMTP id 046Mvl6A141076; Wed, 6 May 2020 23:01:31 GMT DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=oracle.com; h=date : from : to : cc : subject : message-id : references : mime-version : content-type : in-reply-to; s=corp-2020-01-29; bh=b20tuqEZWMS3xbLdEetDOfYwGb0aFxf2MGC5L4I9mYw=; b=LU/8X2ogceI0aRT+iPTv3y5AR7MLQ/zyDBCGoRzr4SwwYatnyLviW1jD9bjOBDKl8X0a RPGMy7xgjk02IbAU0EJKWCfdKLT+z8E0COtwjbl2DQ6Z90ky58g4JcYeTVnkT1jzx/B8 BwND+NE/JfVGsOB9Vt7xXNzT9C+kS0xOVsdpVojX7/Rix2hqmNTL8Ih/qmVJAyRLlIin qZqdIoxI99AtH6j1ja84ShOOkKzN33V1oTroNRKrZWI22qdlVjvOhYpKKE0oAvD3UDq/ XROjiLpzwnZOgVk+9+Vo0HoLegAVKFUyxzajeHEooP53FCTP2tIVvwl74+6LxeH/nnmc hA== Received: from userp3020.oracle.com (userp3020.oracle.com [156.151.31.79]) by userp2120.oracle.com with ESMTP id 30s1gncyrc-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Wed, 06 May 2020 23:01:31 +0000 Received: from pps.filterd (userp3020.oracle.com [127.0.0.1]) by userp3020.oracle.com (8.16.0.42/8.16.0.42) with SMTP id 046Mucts019798; Wed, 6 May 2020 23:01:31 GMT Received: from aserv0121.oracle.com (aserv0121.oracle.com [141.146.126.235]) by userp3020.oracle.com with ESMTP id 30us7numv8-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Wed, 06 May 2020 23:01:31 +0000 Received: from abhmp0018.oracle.com (abhmp0018.oracle.com [141.146.116.24]) by aserv0121.oracle.com (8.14.4/8.13.8) with ESMTP id 046N1RGX018232; Wed, 6 May 2020 23:01:27 GMT Received: from ca-dmjordan1.us.oracle.com (/10.211.9.48) by default (Oracle Beehive Gateway v4.0) with ESMTP ; Wed, 06 May 2020 16:01:26 -0700 Date: Wed, 6 May 2020 19:01:48 -0400 From: Daniel Jordan To: Alexander Duyck Cc: Daniel Jordan , Josh Triplett , Andrew Morton , Herbert Xu , Steffen Klassert , Alex Williamson , Alexander Duyck , Dan Williams , Dave Hansen , David Hildenbrand , Jason Gunthorpe , Jonathan Corbet , Kirill Tkhai , Michal Hocko , Pavel Machek , Pavel Tatashin , Peter Zijlstra , Randy Dunlap , Shile Zhang , Tejun Heo , Zi Yan , linux-crypto@vger.kernel.org, linux-mm , LKML Subject: Re: [PATCH 6/7] mm: parallelize deferred_init_memmap() Message-ID: <20200506230148.44yytsociprgenf4@ca-dmjordan1.us.oracle.com> References: <20200430201125.532129-7-daniel.m.jordan@oracle.com> <3C3C62BE-6363-41C3-834C-C3124EB3FFAB@joshtriplett.org> <20200505014844.ulp4rtih7adtcicm@ca-dmjordan1.us.oracle.com> <20200505020916.mve4ijrg4z5h7eh5@ca-dmjordan1.us.oracle.com> <20200506222127.l3p2a2vjavwz2bdl@ca-dmjordan1.us.oracle.com> <20200506224335.gv3as7vuik3rtt5w@ca-dmjordan1.us.oracle.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20200506224335.gv3as7vuik3rtt5w@ca-dmjordan1.us.oracle.com> User-Agent: NeoMutt/20180716 X-Proofpoint-Virus-Version: vendor=nai engine=6000 definitions=9613 signatures=668687 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 bulkscore=0 adultscore=0 suspectscore=0 mlxlogscore=999 malwarescore=0 phishscore=0 mlxscore=0 spamscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2003020000 definitions=main-2005060183 X-Proofpoint-Virus-Version: vendor=nai engine=6000 definitions=9613 signatures=668687 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 adultscore=0 suspectscore=0 mlxscore=0 spamscore=0 clxscore=1015 priorityscore=1501 bulkscore=0 phishscore=0 impostorscore=0 malwarescore=0 lowpriorityscore=0 mlxlogscore=999 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2003020000 definitions=main-2005060183 Sender: linux-crypto-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-crypto@vger.kernel.org On Wed, May 06, 2020 at 06:43:35PM -0400, Daniel Jordan wrote: > On Wed, May 06, 2020 at 03:36:54PM -0700, Alexander Duyck wrote: > > On Wed, May 6, 2020 at 3:21 PM Daniel Jordan wrote: > > > > > > On Tue, May 05, 2020 at 07:55:43AM -0700, Alexander Duyck wrote: > > > > One question about this data. What is the power management > > > > configuration on the systems when you are running these tests? I'm > > > > just curious if CPU frequency scaling, C states, and turbo are > > > > enabled? > > > > > > Yes, intel_pstate is loaded in active mode without hwp and with turbo enabled > > > (those power management docs are great by the way!) and intel_idle is in use > > > too. > > > > > > > I ask because that is what I have seen usually make the > > > > difference in these kind of workloads as the throughput starts > > > > dropping off as you start seeing the core frequency lower and more > > > > cores become active. > > > > > > If I follow, you're saying there's a chance performance would improve with the > > > above disabled, but how often would a system be configured that way? Even if > > > it were faster, the machine is configured how it's configured, or am I missing > > > your point? > > > > I think you might be missing my point. What I was getting at is that I > > know for performance testing sometimes C states and P states get > > disabled in order to get consistent results between runs, it sounds > > like you have them enabled though. I was just wondering if you had > > disabled them or not. If they were disabled then you wouldn't get the > > benefits of turbo and as such adding more cores wouldn't come at a > > penalty, while with it enabled the first few cores should start to > > slow down as they fell out of turbo mode. So it may be part of the > > reason why you are only hitting about 10x at full core count. I checked the memory bandwidth of the biggest system, the Skylake. Couldn't find official specs for it, all I could quickly find were stream results from a blog post of ours that quoted a range of about 123-145 GB/s over both nodes when compiling with gcc. That's with all CPUs. Again using all CPUs, multithreaded page init is doing 41 GiB/s per node assuming it's just touching the 64 bytes of each page struct, so assuming there's more memory traffic than just struct page, it seems another part of the reason for only 10x is we're bottlenecked on memory bandwidth.