Subject: [PATCH 1/3] mm: Uncharge and keep page in pagecache on memcg reclaim
From: Kirill Tkhai
To: akpm@linux-foundation.org, hannes@cmpxchg.org, josef@toxicpanda.com,
    jack@suse.cz, hughd@google.com, ktkhai@virtuozzo.com,
    darrick.wong@oracle.com, mhocko@suse.com, aryabinin@virtuozzo.com,
    guro@fb.com, mgorman@techsingularity.net, shakeelb@google.com,
    linux-mm@kvack.org, linux-kernel@vger.kernel.org
Date: Wed, 09 Jan 2019 15:20:24 +0300
Message-ID: <154703642447.32690.5604527676583713589.stgit@localhost.localdomain>
In-Reply-To: <154703479840.32690.6504699919905946726.stgit@localhost.localdomain>
References: <154703479840.32690.6504699919905946726.stgit@localhost.localdomain>
User-Agent: StGit/0.18
X-Mailing-List: linux-kernel@vger.kernel.org

This patch makes __remove_mapping() keep a page in the pagecache on
memcg reclaim instead of removing it. After all mappings are removed
and the refcount is frozen, we uncharge the page from its memcg. A
subsequent putback_lru_page() then places the page into
root_mem_cgroup, so it remains in the pagecache until global reclaim.
This gives memcg tasks an extra chance to obtain the page from the
pagecache instead of launching IO.

The next patch makes pagecache_get_page() recharge a page if its memcg
is NULL (i.e., on first access after uncharging). It appears to be the
only function that filesystems use to obtain a pagecache page. Here we
introduce the AS_KEEP_MEMCG_RECLAIM flag to mark filesystems that have
been reviewed and confirmed to obtain pages this way. It only makes
sense to keep pages in __remove_mapping() for them. Later, once all
filesystems have been reviewed, we remove this flag.

Signed-off-by: Kirill Tkhai
---
 include/linux/pagemap.h |    1 +
 mm/vmscan.c             |   22 ++++++++++++++++++----
 2 files changed, 19 insertions(+), 4 deletions(-)

diff --git a/include/linux/pagemap.h b/include/linux/pagemap.h
index 1020e6f40880..1b880da85868 100644
--- a/include/linux/pagemap.h
+++ b/include/linux/pagemap.h
@@ -29,6 +29,7 @@ enum mapping_flags {
 	AS_EXITING	= 4, 	/* final truncate in progress */
 	/* writeback related tags are not used */
 	AS_NO_WRITEBACK_TAGS = 5,
+	AS_KEEP_MEMCG_RECLAIM = 6,
 };
 
 /**
diff --git a/mm/vmscan.c b/mm/vmscan.c
index a714c4f800e9..7237603c8973 100644
--- a/mm/vmscan.c
+++ b/mm/vmscan.c
@@ -887,7 +887,7 @@ static pageout_t pageout(struct page *page, struct address_space *mapping,
  * gets returned with a refcount of 0.
  */
 static int __remove_mapping(struct address_space *mapping, struct page *page,
-			    bool reclaimed)
+			    bool reclaimed, bool memcg_reclaim)
 {
 	unsigned long flags;
 	int refcount;
@@ -963,7 +963,20 @@ static int __remove_mapping(struct address_space *mapping, struct page *page,
 		if (reclaimed && page_is_file_cache(page) &&
 		    !mapping_exiting(mapping) && !dax_mapping(mapping))
 			shadow = workingset_eviction(mapping, page);
-		__delete_from_page_cache(page, shadow);
+#ifdef CONFIG_MEMCG
+		if (memcg_reclaim &&
+		    test_bit(AS_KEEP_MEMCG_RECLAIM, &mapping->flags)) {
+			/*
+			 * Page is not dirty/writeback/mapped, so we may avoid
+			 * taking mem_cgroup::move_lock for changing its memcg.
+			 * See mem_cgroup_move_account() for details.
+			 */
+			mem_cgroup_uncharge(page);
+			page_ref_unfreeze(page, refcount);
+			goto cannot_free;
+		} else
+#endif
+		__delete_from_page_cache(page, shadow);
 		xa_unlock_irqrestore(&mapping->i_pages, flags);
 
 		if (freepage != NULL)
@@ -985,7 +998,7 @@ static int __remove_mapping(struct address_space *mapping, struct page *page,
  */
 int remove_mapping(struct address_space *mapping, struct page *page)
 {
-	if (__remove_mapping(mapping, page, false)) {
+	if (__remove_mapping(mapping, page, false, false)) {
 		/*
 		 * Unfreezing the refcount with 1 rather than 2 effectively
 		 * drops the pagecache ref for us without requiring another
@@ -1458,7 +1471,8 @@ static unsigned long shrink_page_list(struct list_head *page_list,
 
 			count_vm_event(PGLAZYFREED);
 			count_memcg_page_event(page, PGLAZYFREED);
-		} else if (!mapping || !__remove_mapping(mapping, page, true,
+		} else if (!mapping || !__remove_mapping(mapping, page, true))
+		} else if (!mapping || !__remove_mapping(mapping, page, true,
+							 !global_reclaim(sc)))
 			goto keep_locked;
 
 		unlock_page(page);
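
For readers who want to see the intended decision in isolation, here is a
minimal userspace sketch of the branch the diff above adds to
__remove_mapping(). It is a toy model, not kernel code: struct toy_mapping,
struct toy_page, the memcg_id field, toy_remove_mapping() and
toy_selfcheck() are all hypothetical names made up for illustration. Only
the AS_KEEP_MEMCG_RECLAIM bit number (6) is taken from the patch. Locking,
refcount freezing and shadow entries are deliberately left out.

```c
#include <assert.h>
#include <stdbool.h>

/* Bit number as defined by this patch in enum mapping_flags. */
enum { AS_KEEP_MEMCG_RECLAIM = 6 };

/* Hypothetical stand-in for struct address_space. */
struct toy_mapping {
	unsigned long flags;
};

/* Hypothetical stand-in for struct page. */
struct toy_page {
	bool in_pagecache;
	int memcg_id;		/* 0 means "not charged to any memcg" */
};

/*
 * Models only the new branch in __remove_mapping().
 * Returns 1 if the page left the pagecache, 0 if it was kept.
 */
int toy_remove_mapping(struct toy_mapping *mapping, struct toy_page *page,
		       bool memcg_reclaim)
{
	if (memcg_reclaim &&
	    (mapping->flags & (1UL << AS_KEEP_MEMCG_RECLAIM))) {
		page->memcg_id = 0;	/* models mem_cgroup_uncharge() */
		return 0;		/* models "goto cannot_free" */
	}

	page->in_pagecache = false;	/* models __delete_from_page_cache() */
	return 1;
}

/* Exercises the three cases; aborts via assert() if the model misbehaves. */
void toy_selfcheck(void)
{
	struct toy_mapping reviewed = { .flags = 1UL << AS_KEEP_MEMCG_RECLAIM };
	struct toy_mapping other = { .flags = 0 };
	struct toy_page a = { .in_pagecache = true, .memcg_id = 7 };
	struct toy_page b = a, c = a;

	/* memcg reclaim, reviewed filesystem: uncharged but still cached */
	assert(toy_remove_mapping(&reviewed, &a, true) == 0);
	assert(a.in_pagecache && a.memcg_id == 0);

	/* memcg reclaim, unreviewed filesystem: removed as before */
	assert(toy_remove_mapping(&other, &b, true) == 1);
	assert(!b.in_pagecache);

	/* global reclaim: removed even when the flag is set */
	assert(toy_remove_mapping(&reviewed, &c, false) == 1);
	assert(!c.in_pagecache);
}
```

Calling toy_selfcheck() from a main() covers the three cases. The point of
the model is that the page is kept only when both conditions hold (memcg
reclaim and a reviewed mapping); every other combination falls through to
the pre-patch behaviour.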