Received: by 10.223.185.116 with SMTP id b49csp8401994wrg; Fri, 2 Mar 2018 00:46:25 -0800 (PST) X-Google-Smtp-Source: AG47ELsGsuMBSMDHaNF69HoYzSdMqFH4CmdXJXLEPdeACs+ZZ0rRc+UgrTcvzFa5ahmEstRXrhBv X-Received: by 2002:a17:902:6bc2:: with SMTP id m2-v6mr4104613plt.432.1519980385442; Fri, 02 Mar 2018 00:46:25 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1519980385; cv=none; d=google.com; s=arc-20160816; b=vbaXV3uiqZ/ELFtfAs28aJiMGo7OqXT+PdGLPhjvCRJHVhdwkiWyhOYA4Tad/IT8Om zwyhb15yJiqBGmyZNQqBFgprvkIbN6DGuizxuTVJ0631wfYw1ldL+VWzHmNczO/vl8VB iZQCgvesxiWLrC7Njm14xe5FStNfL3KGZimDYqgUsLxf+y8nmShoWkzboT4piRpnyN2W CJyhHsply/KrzZfRx9lAqvLqLrO//Q8GL6K2xA5iVkrIbM4JPSwRViXcKrTtqTfDm3J+ JXcrOfHimcBlFmLB1ueriE+Z8/QFnPLZh4rk/fEs36rvLLoBCD3z05RXWh5dS0WiKTin HPmw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=hkyf9jGE6t/bYQxZQ55LdUznezuXDTY2chdjX5X4RmE=; b=llVwhhrO7JtiKKFY3iJYJLYy/e46MMVx1iJArNnESGJqCok1RoiIKGgB8vRYmSYBro b/CQmv425ca3bB1h3f818KVml0t9wDSdyQuRshyfn274KBUW1NzlgmifuOhYIC5ik71x Oti20kqEnoyzIKkXB90fw6pZ0DUfEvHw2zww+BR/4ktQr29tEk/G2eUoQIUp/s+eleeh Nf+KQ5ax7y+Q/3K9S7/rETLh8cd2pNc1GaTmu1UKdirZhUza8dRLe6HXFh0qVdHKKePw GSo4bbkHBcSnxdwVNXukzUZgsHESn7b1gEaEAxjx9ifJTyD2haxm0QfxPd8g9ve0bOpa kxhQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id a9-v6si4593526pln.89.2018.03.02.00.46.10; Fri, 02 Mar 2018 00:46:25 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1034314AbeCBIAa (ORCPT + 99 others); Fri, 2 Mar 2018 03:00:30 -0500 Received: from mga04.intel.com ([192.55.52.120]:24578 "EHLO mga04.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1030476AbeCBIA2 (ORCPT ); Fri, 2 Mar 2018 03:00:28 -0500 X-Amp-Result: UNKNOWN X-Amp-Original-Verdict: FILE UNKNOWN X-Amp-File-Uploaded: False Received: from fmsmga008.fm.intel.com ([10.253.24.58]) by fmsmga104.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 02 Mar 2018 00:00:24 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.47,411,1515484800"; d="scan'208";a="21512780" Received: from aaronlu.sh.intel.com (HELO intel.com) ([10.239.159.135]) by fmsmga008.fm.intel.com with ESMTP; 02 Mar 2018 00:00:22 -0800 Date: Fri, 2 Mar 2018 16:01:25 +0800 From: Aaron Lu To: Andrew Morton Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org, Huang Ying , Dave Hansen , Kemi Wang , Tim Chen , Andi Kleen , Michal Hocko , Vlastimil Babka , Mel Gorman , Matthew Wilcox , David Rientjes Subject: Re: [PATCH v4 2/3] mm/free_pcppages_bulk: do not hold lock when picking pages to free Message-ID: <20180302080125.GB6356@intel.com> References: <20180301062845.26038-1-aaron.lu@intel.com> <20180301062845.26038-3-aaron.lu@intel.com> <20180301160105.aca958fac871998d582307d4@linux-foundation.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180301160105.aca958fac871998d582307d4@linux-foundation.org> User-Agent: Mutt/1.9.2 (2017-12-15) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Mar 01, 2018 at 04:01:05PM -0800, Andrew Morton wrote: > On Thu, 1 Mar 2018 14:28:44 +0800 Aaron Lu wrote: > > > When freeing a batch of pages from Per-CPU-Pages(PCP) back to buddy, > > the zone->lock is held and then pages are chosen from PCP's migratetype > > list. While there is actually no need to do this 'choose part' under > > lock since it's PCP pages, the only CPU that can touch them is us and > > irq is also disabled. > > > > Moving this part outside could reduce lock held time and improve > > performance. Test with will-it-scale/page_fault1 full load: > > > > kernel Broadwell(2S) Skylake(2S) Broadwell(4S) Skylake(4S) > > v4.16-rc2+ 9034215 7971818 13667135 15677465 > > this patch 9536374 +5.6% 8314710 +4.3% 14070408 +3.0% 16675866 +6.4% > > > > What the test does is: starts $nr_cpu processes and each will repeatedly > > do the following for 5 minutes: > > 1 mmap 128M anonymouse space; > > 2 write access to that space; > > 3 munmap. > > The score is the aggregated iteration. > > But it's a loss for uniprocessor systems: it adds more code and adds an > additional pass across a list. Performance wise, I assume the loss is pretty small and can not be measured. On my Sandybridge desktop, with will-it-scale/page_fault1/single process run to emulate uniprocessor system, the score is(average of 3 runs): base(patch 1/3): 649710 this patch: 653554 +0.6% prefetch(patch 3/3): 650336 (in noise range compared to base) On 4 sockets Intel Broadwell with will-it-scale/page_fault1/single process run: base(patch 1/3): 498649 this patch: 504171 +1.1% prefetch(patch 3/3): 506334 +1.5% (compared to base) It looks like we don't need to worry too much about performance for uniprocessor system.