Received: by 10.223.176.5 with SMTP id f5csp2022624wra; Wed, 31 Jan 2018 15:24:34 -0800 (PST) X-Google-Smtp-Source: AH8x22718rQGWJRvSdc2UMVUFQM6dDTdWsOcjQ3D2OmtFpQGIKsDdoFcSfDRTHooZOjweZ9Gjr7C X-Received: by 2002:a17:902:5853:: with SMTP id f19-v6mr1591844plj.116.1517441074016; Wed, 31 Jan 2018 15:24:34 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1517441073; cv=none; d=google.com; s=arc-20160816; b=WYGZCWesw8pLTLTzILAHZUnvlTz7uXC5lq4h9q52zJwXqnAQ/4cxOnC2MrZt+HVxfQ Zdk9kgQ+6UgbpWP77NC0SX0funSFqwTq3DsJLkBwAeqBRDZmI/t/Qyc3pOZMQsla7Bl5 tluDR5vwWs54GCTGSYQfSvOYe1Ax3Y7NowSk4HpqNWYRogoo0n6Q5+rCvUWAwHCeMYmm j6Iyz03VQwZqJ1v0nP0366Gh2NnbBt3EVK1o2zCMckVP0hKcXnyYhJw9xBQ1GA6bVbFs SNeNS5RandDJOmzW9Sidyjb2r3UKLlgqpPCQ6FFSaudt9fqGT8XaHiiFMt5kHgMMKwlZ aepQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:date:subject:cc:to:from :dkim-signature:arc-authentication-results; bh=iyD+1SsiJS28ZvqMmgHoN7rfI/Xh1fPKd5P89NfbVP0=; b=oNQ/CW8CGbVK4Ea93PbrszNkYenG3BSQ4MQr1aJxBymjNfoKstg7P1UXf1tY4t6Nh0 xfYQ1EJS56I/cSjugmV1Yory/P3XbdKX6lBgIZ1v6v3K6wnxU/4+RJ4qHpEhzUA2EHtP 2U9EPh3dFHGfHjwr0Cxzab8zCACFms9w4xRWGa5d2m+XDQ3ug9y5isSgFB2DoidD60LX /CaEXQyQtSPYNrcq6F+LVsPvZ3uurxUCK5rO8gscI2tplpSIJhcWNbkL+7l/AAdJIM8c 0LYTlUaQFwjB/4+kWr1ylGAyNcKf2TgH4NpO1SwxYDcrkSj2404nHZA7MCoxbXuS/aQd Bt1w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@oracle.com header.s=corp-2017-10-26 header.b=UnZAH5/g; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=oracle.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id f13si5720447pfj.28.2018.01.31.15.24.11; Wed, 31 Jan 2018 15:24:33 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@oracle.com header.s=corp-2017-10-26 header.b=UnZAH5/g; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=oracle.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754271AbeAaXEi (ORCPT + 99 others); Wed, 31 Jan 2018 18:04:38 -0500 Received: from aserp2120.oracle.com ([141.146.126.78]:58344 "EHLO aserp2120.oracle.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754063AbeAaXEc (ORCPT ); Wed, 31 Jan 2018 18:04:32 -0500 Received: from pps.filterd (aserp2120.oracle.com [127.0.0.1]) by aserp2120.oracle.com (8.16.0.22/8.16.0.22) with SMTP id w0VMpoGE130478; Wed, 31 Jan 2018 23:04:23 GMT DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=oracle.com; h=from : to : cc : subject : date : message-id; s=corp-2017-10-26; bh=iyD+1SsiJS28ZvqMmgHoN7rfI/Xh1fPKd5P89NfbVP0=; b=UnZAH5/gFXeX+mesYixOlwa6fM0JdO7q0hVTfYL03WJBrLes5A9VC1uEq8vAwB1xKQFq z2QdtRtOhDrYdasKCBSiWthpIdM1ugyTo653jOe1APIu31h0mKNAtdhvsnI7lATC6pgx w9na6XWpcnB96oAHsQhYqY56wMc5wKVG3JPrW3ZdzHjnKf54CnNoR6zgBncdlCrJmTfF xdnLynegRN91B4RANw00h1B50MHti4Qat6fEy+BftFr+majcftgEx8R7f6ALD8J04q8x xy6rJDy8koP6FJpEmhjpjvs19fbNzw9F5LvzvaqPRa+KTv0cSfi6y7royEQ565647YKM nw== Received: from userv0021.oracle.com (userv0021.oracle.com [156.151.31.71]) by aserp2120.oracle.com with ESMTP id 2fuf632mw8-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Wed, 31 Jan 2018 23:04:23 +0000 Received: from aserv0122.oracle.com (aserv0122.oracle.com [141.146.126.236]) by userv0021.oracle.com (8.14.4/8.14.4) with ESMTP id w0VN4L5w021407 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=FAIL); Wed, 31 Jan 2018 23:04:22 GMT Received: from abhmp0010.oracle.com (abhmp0010.oracle.com [141.146.116.16]) by aserv0122.oracle.com (8.14.4/8.14.4) with ESMTP id w0VN4KuE021989; Wed, 31 Jan 2018 23:04:20 GMT Received: from parnassus.us.oracle.com (/10.39.213.30) by default (Oracle Beehive Gateway v4.0) with ESMTP ; Wed, 31 Jan 2018 15:04:19 -0800 From: daniel.m.jordan@oracle.com To: linux-mm@kvack.org, linux-kernel@vger.kernel.org Cc: aaron.lu@intel.com, ak@linux.intel.com, akpm@linux-foundation.org, Dave.Dice@oracle.com, dave@stgolabs.net, khandual@linux.vnet.ibm.com, ldufour@linux.vnet.ibm.com, mgorman@suse.de, mhocko@kernel.org, pasha.tatashin@oracle.com, steven.sistare@oracle.com, yossi.lev@oracle.com Subject: [RFC PATCH v1 00/13] lru_lock scalability Date: Wed, 31 Jan 2018 18:04:00 -0500 Message-Id: <20180131230413.27653-1-daniel.m.jordan@oracle.com> X-Mailer: git-send-email 2.16.1 X-Proofpoint-Virus-Version: vendor=nai engine=5900 definitions=8791 signatures=668659 X-Proofpoint-Spam-Details: rule=notspam policy=default score=1 suspectscore=0 malwarescore=0 phishscore=0 bulkscore=0 spamscore=1 mlxscore=1 mlxlogscore=213 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1711220000 definitions=main-1801310283 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org lru_lock, a per-node* spinlock that protects an LRU list, is one of the hottest locks in the kernel. On some workloads on large machines, it shows up at the top of lock_stat. One way to improve lru_lock scalability is to introduce an array of locks, with each lock protecting certain batches of LRU pages. *ooooooooooo**ooooooooooo**ooooooooooo**oooo ... | || || || \ batch 1 / \ batch 2 / \ batch 3 / In this ASCII depiction of an LRU, a page is represented with either '*' or 'o'. An asterisk indicates a sentinel page, which is a page at the edge of a batch. An 'o' indicates a non-sentinel page. To remove a non-sentinel LRU page, only one lock from the array is required. This allows multiple threads to remove pages from different batches simultaneously. A sentinel page requires lru_lock in addition to a lock from the array. Full performance numbers appear in the last patch in this series, but this prototype allows a microbenchmark to do up to 28% more page faults per second with 16 or more concurrent processes. This work was developed in collaboration with Steve Sistare. Note: This is an early prototype. I'm submitting it now to support my request to attend LSF/MM, as well as get early feedback on the idea. Any comments appreciated. * lru_lock is actually per-memcg, but without memcg's in the picture it becomes per-node. Aaron Lu (1): mm: add a percpu_pagelist_batch sysctl interface Daniel Jordan (12): mm: allow compaction to be disabled mm: add lock array to pgdat and batch fields to struct page mm: introduce struct lru_list_head in lruvec to hold per-LRU batch info mm: add batching logic to add/delete/move API's mm: add lru_[un]lock_all APIs mm: convert to-be-refactored lru_lock callsites to lock-all API mm: temporarily convert lru_lock callsites to lock-all API mm: introduce add-only version of pagevec_lru_move_fn mm: add LRU batch lock API's mm: use lru_batch locking in release_pages mm: split up release_pages into non-sentinel and sentinel passes mm: splice local lists onto the front of the LRU include/linux/mm_inline.h | 209 +++++++++++++++++++++++++++++++++++++++++++++- include/linux/mm_types.h | 5 ++ include/linux/mmzone.h | 25 +++++- kernel/sysctl.c | 9 ++ mm/Kconfig | 1 - mm/huge_memory.c | 6 +- mm/memcontrol.c | 5 +- mm/mlock.c | 11 +-- mm/mmzone.c | 7 +- mm/page_alloc.c | 43 +++++++++- mm/page_idle.c | 4 +- mm/swap.c | 208 ++++++++++++++++++++++++++++++++++++--------- mm/vmscan.c | 49 +++++------ 13 files changed, 500 insertions(+), 82 deletions(-) -- 2.16.1