From: Minchan Kim
To: Andrew Morton
Cc: linux-mm, LKML, linux-api@vger.kernel.org, Michal Hocko,
    Johannes Weiner, Tim Murray, Joel Fernandes, Suren Baghdasaryan,
    Daniel Colascione, Shakeel Butt, Sonny Rao, oleksandr@redhat.com,
    hdanton@sina.com, lizeb@google.com, Dave Hansen,
    "Kirill A . Shutemov", Minchan Kim
Subject: [PATCH v4 0/4] Introduce MADV_COLD and MADV_PAGEOUT
Date: Thu, 11 Jul 2019 10:25:24 +0900
Message-Id: <20190711012528.176050-1-minchan@kernel.org>
X-Mailer: git-send-email 2.22.0.410.gd8fdbe21b5-goog
X-Mailing-List: linux-kernel@vger.kernel.org

This patchset is part of a previous series:
https://lore.kernel.org/lkml/20190531064313.193437-1-minchan@kernel.org/

It was originally created for the external madvise hinting feature:
https://lkml.org/lkml/2019/5/31/463

Michal wanted to separate the discussion from the external hinting
interface, so this patchset includes only the first part of my entire
patchset: it introduces the MADV_COLD and MADV_PAGEOUT hints to madvise.
However, I am keeping the entire description so that others can more
easily understand why these kinds of hints were born.

Thanks.

This patchset is against next-20190710.

Below is the description of the entire previous patchset.

================= &< =====================

- Background

The Android terminology used for forking a new process and starting an
app from scratch is a cold start, while resuming an existing app is a
hot start. While we continually try to improve the performance of cold
starts, hot starts will always be significantly less power hungry as
well as faster, so we are trying to make hot starts more likely than
cold starts.

To increase hot starts, Android userspace manages the order in which
apps should be killed in a process called ActivityManagerService.
ActivityManagerService tracks every Android app or service that the
user could be interacting with at any time and translates that into a
ranked list for lmkd (low memory killer daemon). These apps are likely
to be killed by lmkd if the system has to reclaim memory. In that sense
they are similar to entries in any other cache. Those apps are kept
alive for opportunistic performance improvements, but those performance
improvements will vary based on the memory requirements of individual
workloads.

- Problem

Naturally, cached apps were the dominant consumers of memory on the
system. However, they were not significant consumers of swap even
though they are good candidates for swap. On investigation, we found
that swapping out only begins once the low zone watermark is hit and
kswapd wakes up, but the overall allocation rate in the system might
trip lmkd thresholds and cause a cached process to be killed. (We
measured the performance of swapping out vs. zapping the memory by
killing a process. Unsurprisingly, zapping is 10x faster even though we
use zram, which is much faster than real storage.) As a result, a kill
from lmkd will often satisfy the high zone watermark, resulting in very
few pages actually being moved to swap.
- Approach

The approach we chose was to use a new interface to allow userspace to
proactively reclaim entire processes by leveraging platform
information. This allowed us to bypass the inaccuracy of the kernel’s
LRUs for pages that are known to be cold from userspace and to avoid
races with lmkd by reclaiming apps as soon as they entered the cached
state. Additionally, it could give the platform many opportunities to
use the information it has to optimize memory efficiency.

To achieve the goal, the patchset introduces two new options for
madvise. One is MADV_COLD, which will deactivate active pages, and the
other is MADV_PAGEOUT, which will reclaim private pages instantly.
These new options complement MADV_DONTNEED and MADV_FREE by adding
non-destructive ways to gain some free memory space. MADV_PAGEOUT is
similar to MADV_DONTNEED in that it hints to the kernel that the memory
region is not currently needed and should be reclaimed immediately;
MADV_COLD is similar to MADV_FREE in that it hints to the kernel that
the memory region is not currently needed and should be reclaimed when
memory pressure rises.

* v3 - http://lore.kernel.org/lkml/20190627115405.255259-1-minchan@kernel.org
* v2 - http://lore.kernel.org/lkml/20190610111252.239156-1-minchan@kernel.org
* v1 - http://lore.kernel.org/lkml/20190603053655.127730-1-minchan@kernel.org

Minchan Kim (4):
  mm: introduce MADV_COLD
  mm: change PAGEREF_RECLAIM_CLEAN with PAGE_REFRECLAIM
  mm: account nr_isolated_xxx in [isolate|putback]_lru_page
  mm: introduce MADV_PAGEOUT

 include/linux/swap.h                   |   2 +
 include/uapi/asm-generic/mman-common.h |   2 +
 mm/compaction.c                        |   2 -
 mm/gup.c                               |   7 +-
 mm/internal.h                          |   2 +-
 mm/khugepaged.c                        |   3 -
 mm/madvise.c                           | 377 ++++++++++++++++++++++++-
 mm/memory-failure.c                    |   3 -
 mm/memory_hotplug.c                    |   4 -
 mm/mempolicy.c                         |   6 +-
 mm/migrate.c                           |  37 +--
 mm/oom_kill.c                          |   2 +-
 mm/swap.c                              |  42 +++
 mm/vmscan.c                            |  83 +++++-
 14 files changed, 507 insertions(+), 65 deletions(-)

-- 
2.22.0.410.gd8fdbe21b5-goog
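
For readers who want to see how the two hints described under Approach
might be used from userspace, here is a minimal sketch. It is not taken
from the series. The helper names hint_range() and demo_nondestructive()
are hypothetical, and the fallback values 20 and 21 are an assumption
based on what mainline Linux 5.4 and later define; this revision of the
series may use different values. The demo fills an anonymous private
mapping, marks it cold, asks for it to be paged out, and then checks
that the contents are still readable, which is the non-destructive
property that distinguishes these hints from MADV_DONTNEED.

```c
#define _DEFAULT_SOURCE
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <string.h>
#include <sys/mman.h>

/* Assumption: values as defined in mainline Linux 5.4 and later. Older
 * libc headers may lack them, and this series revision may differ. */
#ifndef MADV_COLD
#define MADV_COLD 20
#endif
#ifndef MADV_PAGEOUT
#define MADV_PAGEOUT 21
#endif

/* Hypothetical helper: apply a hint to [addr, addr + len).
 * Returns 0 if the kernel accepted the hint, 1 if it returned EINVAL
 * (for a page-aligned range this usually means the hint is not
 * supported by the running kernel), and -1 on any other error. */
static int hint_range(void *addr, size_t len, int advice)
{
	assert(len > 0);
	if (madvise(addr, len, advice) == 0)
		return 0;
	return errno == EINVAL ? 1 : -1;
}

/* Hypothetical demo: returns 0 if the data written before the hints is
 * still intact afterwards, -1 if the mapping failed or data was lost. */
static int demo_nondestructive(size_t len)
{
	unsigned char *buf = mmap(NULL, len, PROT_READ | PROT_WRITE,
				  MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
	int ret = 0;

	if (buf == MAP_FAILED)
		return -1;

	memset(buf, 0xA5, len);

	/* The hints are advisory, so this sketch does not treat a
	 * rejected hint as fatal; it only cares about the data. */
	(void)hint_range(buf, len, MADV_COLD);
	(void)hint_range(buf, len, MADV_PAGEOUT);

	/* Unlike after MADV_DONTNEED, the contents must still be there
	 * (faulted back in from swap if they were actually paged out). */
	for (size_t i = 0; ret == 0 && i < len; i++)
		if (buf[i] != 0xA5)
			ret = -1;

	munmap(buf, len);
	return ret;
}
```

A caller would typically invoke demo_nondestructive() with a multiple of
the page size, for example 4 * 4096. On a kernel without these hints
hint_range() reports 1 and the data check still passes, since nothing
was reclaimed.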