Received: by 2002:a05:6a10:1287:0:0:0:0 with SMTP id d7csp5224880pxv; Tue, 20 Jul 2021 23:46:58 -0700 (PDT) X-Google-Smtp-Source: ABdhPJz0li8nfu8uctWTH1e8QI/PaE31HS6WgRmiSpdfNMSIWNuj2v6bGAQexJpffFEkALiVQ5/k X-Received: by 2002:aa7:db94:: with SMTP id u20mr46300858edt.381.1626850018601; Tue, 20 Jul 2021 23:46:58 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1626850018; cv=none; d=google.com; s=arc-20160816; b=sBfZrwNPgYaYAd8G0DB9w/4FL3TEf6q1ARj6VEkc2TA/er2C91bj6bLhWdkB6pw5n1 4aaddK5hAS+0B4VRZ0zqmRDrOncJIx6tmyzWbVVH/TmS6wYRIC/lZRH4/01AN3DCrQ9+ tExe2TkQLT+cN4mBBIEcdrN/EkRyDLt1jHFQoU/YzrNujKTmM6bKwPZS25m4ZhlAk5L5 m9SQVzqHR93EVbWsxchWXQsq5dPoKK/n/vJU84PPg7LnFGX/FBtLYmhqBeyWmeTxc1WP 0RNV8QbV+iTZSnT7L46y5UgyxYKiHV1FEbKRUmXFXrGLmT8ixVnacf4cfUITtQ3PwBGU gKXQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from; bh=gknsUjw5m0a1eP7HYo2T+jat+4GsZ3bzg6mJChjlkf0=; b=qixmJshNLUds3ahgYZ7n+S2Hja6fYvmpTKMUrfA//mHU/B3X7mCuOb5H1WKu2UwaV7 Qn8Z0Xv4RiXVE+lFcA4GCDKmSvuwFPHeOeGGVTE2LnIOmxRBSZamlRLd5xr1VOtl9aQL thX4C6g1Ux5JIuAaHmTqU5iVT3oXnURi0neiSYy8Oj2fJcneZWhwWgMeJbM6BMIcVPRY 3B5KkIepllFOKzhlXFQM9/jchGJrfNPhOFiBNFf16nok44ncQK+uo6xee08Cam5voMXe HksLSAunDEQf2GJA92XQvKU3Hajs67NuTT3sZ110QqBroCe2QvPmB5l1Fy9J54Lt0lLV PVqw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id a26si8561311edr.155.2021.07.20.23.46.35; Tue, 20 Jul 2021 23:46:58 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229920AbhGUGDa (ORCPT + 99 others); Wed, 21 Jul 2021 02:03:30 -0400 Received: from mga06.intel.com ([134.134.136.31]:44391 "EHLO mga06.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234258AbhGUF7U (ORCPT ); Wed, 21 Jul 2021 01:59:20 -0400 X-IronPort-AV: E=McAfee;i="6200,9189,10051"; a="272496396" X-IronPort-AV: E=Sophos;i="5.84,257,1620716400"; d="scan'208";a="272496396" Received: from fmsmga002.fm.intel.com ([10.253.24.26]) by orsmga104.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 20 Jul 2021 23:39:57 -0700 X-IronPort-AV: E=Sophos;i="5.84,257,1620716400"; d="scan'208";a="511390036" Received: from yhuang6-desk2.sh.intel.com ([10.239.159.119]) by fmsmga002-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 20 Jul 2021 23:39:53 -0700 From: Huang Ying To: Andrew Morton Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org, Keith Busch , Dave Hansen , "Huang, Ying" , Yang Shi , Michal Hocko , Zi Yan , Wei Xu , David Rientjes , Dan Williams , David Hildenbrand , Greg Thelen , Oscar Salvador , Yang Shi Subject: [PATCH -V11 7/9] mm/vmscan: Consider anonymous pages without swap Date: Wed, 21 Jul 2021 14:39:24 +0800 Message-Id: <20210721063926.3024591-7-ying.huang@intel.com> X-Mailer: git-send-email 2.30.2 In-Reply-To: <20210721063926.3024591-1-ying.huang@intel.com> References: <20210721063926.3024591-1-ying.huang@intel.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Keith Busch Reclaim anonymous pages if a migration path is available now that demotion provides a non-swap recourse for reclaiming anon pages. Note that this check is subtly different from the can_age_anon_pages() checks. This mechanism checks whether a specific page in a specific context can actually be reclaimed, given current swap space and cgroup limits. can_age_anon_pages() is a much simpler and more preliminary check which just says whether there is a possibility of future reclaim. Link: https://lkml.kernel.org/r/20210715055145.195411-8-ying.huang@intel.com Cc: Keith Busch Signed-off-by: Dave Hansen Signed-off-by: "Huang, Ying" Reviewed-by: Yang Shi Cc: Michal Hocko Cc: Zi Yan Cc: Wei Xu Cc: David Rientjes Cc: Dan Williams Cc: David Hildenbrand Cc: Greg Thelen Cc: Oscar Salvador Cc: Yang Shi Signed-off-by: Andrew Morton --- mm/vmscan.c | 34 ++++++++++++++++++++++++++++++---- 1 file changed, 30 insertions(+), 4 deletions(-) diff --git a/mm/vmscan.c b/mm/vmscan.c index d79bf91700de..67a320c6571d 100644 --- a/mm/vmscan.c +++ b/mm/vmscan.c @@ -521,7 +521,7 @@ static long add_nr_deferred(long nr, struct shrinker *shrinker, static bool can_demote(int nid, struct scan_control *sc) { - if (sc->no_demotion) + if (sc && sc->no_demotion) return false; if (next_demotion_node(nid) == NUMA_NO_NODE) return false; @@ -530,6 +530,31 @@ static bool can_demote(int nid, struct scan_control *sc) return false; } +static inline bool can_reclaim_anon_pages(struct mem_cgroup *memcg, + int nid, + struct scan_control *sc) +{ + if (memcg == NULL) { + /* + * For non-memcg reclaim, is there + * space in any swap device? + */ + if (get_nr_swap_pages() > 0) + return true; + } else { + /* Is the memcg below its swap limit? */ + if (mem_cgroup_get_nr_swap_pages(memcg) > 0) + return true; + } + + /* + * The page can not be swapped. + * + * Can it be reclaimed from this node via demotion? + */ + return can_demote(nid, sc); +} + /* * This misses isolated pages which are not accounted for to save counters. * As the data only determines if reclaim or compaction continues, it is @@ -541,7 +566,7 @@ unsigned long zone_reclaimable_pages(struct zone *zone) nr = zone_page_state_snapshot(zone, NR_ZONE_INACTIVE_FILE) + zone_page_state_snapshot(zone, NR_ZONE_ACTIVE_FILE); - if (get_nr_swap_pages() > 0) + if (can_reclaim_anon_pages(NULL, zone_to_nid(zone), NULL)) nr += zone_page_state_snapshot(zone, NR_ZONE_INACTIVE_ANON) + zone_page_state_snapshot(zone, NR_ZONE_ACTIVE_ANON); @@ -2544,6 +2569,7 @@ enum scan_balance { static void get_scan_count(struct lruvec *lruvec, struct scan_control *sc, unsigned long *nr) { + struct pglist_data *pgdat = lruvec_pgdat(lruvec); struct mem_cgroup *memcg = lruvec_memcg(lruvec); unsigned long anon_cost, file_cost, total_cost; int swappiness = mem_cgroup_swappiness(memcg); @@ -2554,7 +2580,7 @@ static void get_scan_count(struct lruvec *lruvec, struct scan_control *sc, enum lru_list lru; /* If we have no swap space, do not bother scanning anon pages. */ - if (!sc->may_swap || mem_cgroup_get_nr_swap_pages(memcg) <= 0) { + if (!sc->may_swap || !can_reclaim_anon_pages(memcg, pgdat->node_id, sc)) { scan_balance = SCAN_FILE; goto out; } @@ -2924,7 +2950,7 @@ static inline bool should_continue_reclaim(struct pglist_data *pgdat, */ pages_for_compaction = compact_gap(sc->order); inactive_lru_pages = node_page_state(pgdat, NR_INACTIVE_FILE); - if (get_nr_swap_pages() > 0) + if (can_reclaim_anon_pages(NULL, pgdat->node_id, sc)) inactive_lru_pages += node_page_state(pgdat, NR_INACTIVE_ANON); return inactive_lru_pages > pages_for_compaction; -- 2.30.2