Received: by 10.223.185.116 with SMTP id b49csp80375wrg; Fri, 2 Mar 2018 14:03:19 -0800 (PST) X-Google-Smtp-Source: AG47ELuSaRo1jV0j3fTJe/x5bgeIC5YFycPhRL0Yy5BGdwzWYMhy6bJaeISGQX7Bh9NPXA+uQxyH X-Received: by 10.99.44.22 with SMTP id s22mr5690592pgs.111.1520028199483; Fri, 02 Mar 2018 14:03:19 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1520028199; cv=none; d=google.com; s=arc-20160816; b=ZOBshOrJKvBxynzUEHuzreq2fyCZeDUnD8hrNlRFRAQqVrDwCJcmgCFNZdTJxMMQHw Xi5Pud8MPrxxxGojwkpoBqEKzTz4v+KemP0llqyKR9yvIstAFqGAlrKfSjK5uaAzzz9V wNVr8rdzDNb3+k5QUmN6ie9nLWkQPyfNJnjPg3QGwlv/x4Y2SzCCUVEHsnbYQ++jtS/L dbmI0oKx88gAw5SiQgReop4HBX+2DNz0o3MNJbZ1w/exsq6GUsb+749Z06ege0AttJM6 /9VqMDPnmJlOvwlGNReaGx3MKXSUatccf5fcrRuDlBqx6zHvJUCSZQh+6XccGu390BqK PG3Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:subject:cc:to:from:date :arc-authentication-results; bh=D/034hZDMLZ+gzZy4ioGAPEGVlwzMEcfOQQvIe1IsHU=; b=ZtfkHmoeui7fQhMCBBCAVUMyb3crZjL5mePr9+3g1PRvusRqQOF0kTJL6mjK/FCufy VFgpF3IG/IWHOlXYktGCo/W6O2c5O7asNg4etHwqCz8bWFBt1cvtSEu/gU1UjBlbpgM3 TUwdeQDk8eLJ8S6Q+1saEgQG3qn8DgoCBhbhJ9+LAt82TWeI2T7a2PeCf/C1Q7jbgh6K 8nRztv8zexKFqkzBokny/zPGbdEAQ/SZaQHK52P26ZYAz1C5tIMGjPCO25SSYtJwIqau QF2/HjrgCJzP+i50K1kzGRTNXU3CAMm9GOo227L3uMzCV39SEGCaSm1fg/X8H04jrzj4 lhYw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r13si4515196pgp.504.2018.03.02.14.03.04; Fri, 02 Mar 2018 14:03:19 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933747AbeCBVXh (ORCPT + 99 others); Fri, 2 Mar 2018 16:23:37 -0500 Received: from mail.linuxfoundation.org ([140.211.169.12]:55404 "EHLO mail.linuxfoundation.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S933732AbeCBVXe (ORCPT ); Fri, 2 Mar 2018 16:23:34 -0500 Received: from akpm3.svl.corp.google.com (unknown [104.133.9.71]) by mail.linuxfoundation.org (Postfix) with ESMTPSA id E33F41087; Fri, 2 Mar 2018 21:23:33 +0000 (UTC) Date: Fri, 2 Mar 2018 13:23:32 -0800 From: Andrew Morton To: Aaron Lu Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org, Huang Ying , Dave Hansen , Kemi Wang , Tim Chen , Andi Kleen , Michal Hocko , Vlastimil Babka , Mel Gorman , Matthew Wilcox , David Rientjes Subject: Re: [PATCH v4 2/3] mm/free_pcppages_bulk: do not hold lock when picking pages to free Message-Id: <20180302132332.2c69559686ff24d15ff44ae8@linux-foundation.org> In-Reply-To: <20180302080125.GB6356@intel.com> References: <20180301062845.26038-1-aaron.lu@intel.com> <20180301062845.26038-3-aaron.lu@intel.com> <20180301160105.aca958fac871998d582307d4@linux-foundation.org> <20180302080125.GB6356@intel.com> X-Mailer: Sylpheed 3.6.0 (GTK+ 2.24.31; x86_64-pc-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, 2 Mar 2018 16:01:25 +0800 Aaron Lu wrote: > On Thu, Mar 01, 2018 at 04:01:05PM -0800, Andrew Morton wrote: > > On Thu, 1 Mar 2018 14:28:44 +0800 Aaron Lu wrote: > > > > > When freeing a batch of pages from Per-CPU-Pages(PCP) back to buddy, > > > the zone->lock is held and then pages are chosen from PCP's migratetype > > > list. While there is actually no need to do this 'choose part' under > > > lock since it's PCP pages, the only CPU that can touch them is us and > > > irq is also disabled. > > > > > > Moving this part outside could reduce lock held time and improve > > > performance. Test with will-it-scale/page_fault1 full load: > > > > > > kernel Broadwell(2S) Skylake(2S) Broadwell(4S) Skylake(4S) > > > v4.16-rc2+ 9034215 7971818 13667135 15677465 > > > this patch 9536374 +5.6% 8314710 +4.3% 14070408 +3.0% 16675866 +6.4% > > > > > > What the test does is: starts $nr_cpu processes and each will repeatedly > > > do the following for 5 minutes: > > > 1 mmap 128M anonymouse space; > > > 2 write access to that space; > > > 3 munmap. > > > The score is the aggregated iteration. > > > > But it's a loss for uniprocessor systems: it adds more code and adds an > > additional pass across a list. > > Performance wise, I assume the loss is pretty small and can not > be measured. > > On my Sandybridge desktop, with will-it-scale/page_fault1/single process > run to emulate uniprocessor system, the score is(average of 3 runs): > > base(patch 1/3): 649710 > this patch: 653554 +0.6% Does that mean we got faster or slower? > prefetch(patch 3/3): 650336 (in noise range compared to base) > > On 4 sockets Intel Broadwell with will-it-scale/page_fault1/single > process run: > > base(patch 1/3): 498649 > this patch: 504171 +1.1% > prefetch(patch 3/3): 506334 +1.5% (compared to base) > > It looks like we don't need to worry too much about performance for > uniprocessor system. Well. We can say that of hundreds of patches. And we end up with a fatter and slower kernel than we otherwise would. Please take a look, see if there's a tidy way of avoiding this. Probably there isn't, in which case oh well. But let's at least try.