Date: Wed, 31 Mar 2021 08:38:05 +0100
From: Mel Gorman <mgorman@techsingularity.net>
To: Jesper Dangaard Brouer
Cc: Linux-MM, Linux-RT-Users, LKML, Chuck Lever, Matthew Wilcox
Subject: Re: [RFC PATCH 0/6] Use local_lock for pcp protection and reduce stat overhead
Message-ID: <20210331073805.GY3697@techsingularity.net>
References: <20210329120648.19040-1-mgorman@techsingularity.net> <20210330205154.1fe1e479@carbon>
In-Reply-To: <20210330205154.1fe1e479@carbon>
X-Mailing-List: linux-kernel@vger.kernel.org

On Tue, Mar 30, 2021 at 08:51:54PM +0200, Jesper Dangaard Brouer wrote:
> On Mon, 29 Mar 2021 13:06:42 +0100
> Mel Gorman <mgorman@techsingularity.net> wrote:
>
> > This series requires patches in Andrew's tree so the series is also
> > available at
> >
> > git://git.kernel.org/pub/scm/linux/kernel/git/mel/linux.git mm-percpu-local_lock-v1r15
> >
> > tldr: Jesper and Chuck, it would be nice to verify if this series helps
> >       the allocation rate of the bulk page allocator. RT people, this
> >       *partially* addresses some problems PREEMPT_RT has with the page
> >       allocator but it needs review.
>
> I've run a new micro-benchmark[1] which shows:
> (CPU: Intel(R) Xeon(R) CPU E5-1650 v4 @ 3.60GHz)
>
> BASELINE
>  single_page alloc+put: 194 cycles(tsc) 54.106 ns
>
> ARRAY variant: time_bulk_page_alloc_free_array: step=bulk size
>  Per elem: 195 cycles(tsc) 54.225 ns (step:1)
>  Per elem: 127 cycles(tsc) 35.492 ns (step:2)
>  Per elem: 117 cycles(tsc) 32.643 ns (step:3)
>  Per elem: 111 cycles(tsc) 30.992 ns (step:4)
>  Per elem: 106 cycles(tsc) 29.606 ns (step:8)
>  Per elem: 102 cycles(tsc) 28.532 ns (step:16)
>  Per elem:  99 cycles(tsc) 27.728 ns (step:32)
>  Per elem:  98 cycles(tsc) 27.252 ns (step:64)
>  Per elem:  97 cycles(tsc) 27.090 ns (step:128)
>
> This should be seen in comparison with the older micro-benchmark[2]
> done on branch mm-bulk-rebase-v5r9.
>
> BASELINE
>  single_page alloc+put: Per elem: 199 cycles(tsc) 55.472 ns
>
> ARRAY variant: time_bulk_page_alloc_free_array: step=bulk size
>  Per elem: 202 cycles(tsc) 56.383 ns (step:1)
>  Per elem: 144 cycles(tsc) 40.047 ns (step:2)
>  Per elem: 134 cycles(tsc) 37.339 ns (step:3)
>  Per elem: 128 cycles(tsc) 35.578 ns (step:4)
>  Per elem: 120 cycles(tsc) 33.592 ns (step:8)
>  Per elem: 116 cycles(tsc) 32.362 ns (step:16)
>  Per elem: 113 cycles(tsc) 31.476 ns (step:32)
>  Per elem: 110 cycles(tsc) 30.633 ns (step:64)
>  Per elem: 110 cycles(tsc) 30.596 ns (step:128)
>

Ok, so bulk allocation is faster than allocating single pages, no
surprise there. Putting the array figures for bulk allocation into
tabular format and comparing we get:

Array variant (time to allocate a page in nanoseconds, lower is better)

        Baseline        Patched
  1     56.383          54.225 (+3.83%)
  2     40.047          35.492 (+11.38%)
  3     37.339          32.643 (+12.58%)
  4     35.578          30.992 (+12.89%)
  8     33.592          29.606 (+11.87%)
 16     32.362          28.532 (+11.85%)
 32     31.476          27.728 (+11.91%)
 64     30.633          27.252 (+11.04%)
128     30.596          27.090 (+11.46%)

The series is 11-12% faster when allocating multiple pages. That's a
fairly positive outcome and I'll include this in the series leader if
you have no objections.

Thanks Jesper!

-- 
Mel Gorman
SUSE Labs
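
As an aside for readers of the thread: the pattern the array-variant
benchmark exercises looks roughly like the sketch below. It is an
illustrative sketch only, not code from the series or from Jesper's
benchmark module, and it assumes the array form of the bulk allocator
API, alloc_pages_bulk_array(), from the related work in Andrew's tree;
the function name, signature and return semantics here are assumptions
rather than something stated in this thread.

/*
 * Illustrative sketch only: allocate a batch of order-0 pages with the
 * array variant of the bulk allocator and free them again.  Assumes
 * alloc_pages_bulk_array(gfp, nr, array), which fills empty (NULL)
 * slots in the array and returns the number of pages it holds; the
 * exact name and semantics may differ from what was eventually merged.
 */
#include <linux/errno.h>
#include <linux/gfp.h>
#include <linux/mm.h>

#define DEMO_BULK_NR	32	/* corresponds to the step:32 case above */

static int demo_bulk_alloc_free(void)
{
	struct page *pages[DEMO_BULK_NR] = { NULL };
	unsigned long nr, i;

	/*
	 * One call covers up to DEMO_BULK_NR pages, so the per-cpu list
	 * and zone lock handling is paid once per batch rather than
	 * once per page as a loop of alloc_page() calls would pay it.
	 */
	nr = alloc_pages_bulk_array(GFP_KERNEL, DEMO_BULK_NR, pages);
	if (!nr)
		return -ENOMEM;

	/* ... pages[0..nr-1] would be used here ... */

	for (i = 0; i < nr; i++)
		__free_pages(pages[i], 0);

	return 0;
}

Allocating the same pages one at a time with alloc_page(GFP_KERNEL)
takes the per-cpu/zone locking path for every page, which is the
per-call overhead the step:N numbers above show being amortised as the
bulk size grows.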