From: Stefan Roesch
To: kernel-team@fb.com
Cc: shr@devkernel.io, akpm@linux-foundation.org, david@redhat.com,
    linux-fsdevel@vger.kernel.org, hannes@cmpxchg.org, riel@surriel.com,
    linux-kernel@vger.kernel.org, linux-mm@kvack.org
Subject: [PATCH v5] proc/ksm: add ksm stats to /proc/pid/smaps
Date: Wed, 23 Aug 2023 10:01:07 -0700
Message-Id: <20230823170107.1457915-1-shr@devkernel.io>
X-Mailer: git-send-email 2.39.3
X-Mailing-List: linux-kernel@vger.kernel.org

With madvise and prctl, KSM can be enabled for different VMAs. Once it is
enabled, we can query how effective KSM is overall. However, we cannot
easily query whether an individual VMA benefits from KSM.

This commit adds a KSM section to the /proc/<pid>/smaps file. It reports
how much of the VMA's memory, in kB, is made up of KSM pages. Note that
KSM-placed zeropages are not included, only actual KSM pages.

Here is a typical output:

7f420a000000-7f421a000000 rw-p 00000000 00:00 0
Size:             262144 kB
KernelPageSize:        4 kB
MMUPageSize:           4 kB
Rss:               51212 kB
Pss:                8276 kB
Shared_Clean:        172 kB
Shared_Dirty:      42996 kB
Private_Clean:       196 kB
Private_Dirty:      7848 kB
Referenced:        15388 kB
Anonymous:         51212 kB
KSM:               41376 kB
LazyFree:              0 kB
AnonHugePages:         0 kB
ShmemPmdMapped:        0 kB
FilePmdMapped:         0 kB
Shared_Hugetlb:        0 kB
Private_Hugetlb:       0 kB
Swap:             202016 kB
SwapPss:            3882 kB
Locked:                0 kB
THPeligible:    0
ProtectionKey:         0
ksm_state:          0
ksm_skip_base:      0
ksm_skip_count:     0
VmFlags: rd wr mr mw me nr mg anon

This information also helps with the following workflow:
- First, enable KSM for all the VMAs of a process with prctl.
- Then use the above smaps report to analyze which VMAs benefit the most.
- Change the application (if possible) to add the corresponding madvise
  calls for the VMAs that benefit the most.

Signed-off-by: Stefan Roesch
---
 Documentation/filesystems/proc.rst | 4 ++++
 fs/proc/task_mmu.c                 | 6 ++++++
 2 files changed, 10 insertions(+)

diff --git a/Documentation/filesystems/proc.rst b/Documentation/filesystems/proc.rst
index 7897a7dafcbc..d49af173b5af 100644
--- a/Documentation/filesystems/proc.rst
+++ b/Documentation/filesystems/proc.rst
@@ -461,6 +461,7 @@ Memory Area, or VMA) there is a series of lines such as the following::
     Private_Dirty:         0 kB
     Referenced:          892 kB
     Anonymous:             0 kB
+    KSM:                   0 kB
     LazyFree:              0 kB
     AnonHugePages:         0 kB
     ShmemPmdMapped:        0 kB
@@ -501,6 +502,9 @@ accessed.
 a mapping associated with a file may contain anonymous pages: when MAP_PRIVATE
 and a page is modified, the file page is replaced by a private anonymous copy.
 
+"KSM" reports how many of the pages are KSM pages. Note that KSM-placed zeropages
+are not included, only actual KSM pages.
+
 "LazyFree" shows the amount of memory which is marked by madvise(MADV_FREE).
 The memory isn't freed immediately with madvise(). It's freed in memory
 pressure if the memory is clean. Please note that the printed value might
diff --git a/fs/proc/task_mmu.c b/fs/proc/task_mmu.c
index 51315133cdc2..bfd25351865d 100644
--- a/fs/proc/task_mmu.c
+++ b/fs/proc/task_mmu.c
@@ -4,6 +4,7 @@
 #include
 #include
 #include
+#include
 #include
 #include
 #include
@@ -396,6 +397,7 @@ struct mem_size_stats {
 	unsigned long swap;
 	unsigned long shared_hugetlb;
 	unsigned long private_hugetlb;
+	unsigned long ksm;
 	u64 pss;
 	u64 pss_anon;
 	u64 pss_file;
@@ -452,6 +454,9 @@ static void smaps_account(struct mem_size_stats *mss, struct page *page,
 		mss->lazyfree += size;
 	}
 
+	if (PageKsm(page))
+		mss->ksm += size;
+
 	mss->resident += size;
 	/* Accumulate the size in pages that have been accessed. */
 	if (young || page_is_young(page) || PageReferenced(page))
@@ -822,6 +827,7 @@ static void __show_smap(struct seq_file *m, const struct mem_size_stats *mss,
 	SEQ_PUT_DEC(" kB\nPrivate_Dirty:  ", mss->private_dirty);
 	SEQ_PUT_DEC(" kB\nReferenced:     ", mss->referenced);
 	SEQ_PUT_DEC(" kB\nAnonymous:      ", mss->anonymous);
+	SEQ_PUT_DEC(" kB\nKSM:            ", mss->ksm);
 	SEQ_PUT_DEC(" kB\nLazyFree:       ", mss->lazyfree);
 	SEQ_PUT_DEC(" kB\nAnonHugePages:  ", mss->anonymous_thp);
 	SEQ_PUT_DEC(" kB\nShmemPmdMapped: ", mss->shmem_thp);

base-commit: f4a280e5bb4a764a75d3215b61bc0f02b4c26417
-- 
2.39.3
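
For illustration only, and not part of this patch: the analysis step of the
workflow (finding which VMAs benefit) could be done from userspace by reading
the new "KSM:" lines from smaps. The sketch below totals one smaps field
across all VMAs of an already opened stream. The helper name sum_smaps_field
and the 1024-byte line buffer are assumptions made for this example; it also
assumes every matching line has the "Name: <value> kB" layout shown in the
sample output.

```c
#include <stdio.h>
#include <string.h>

/*
 * Sum the values of one smaps field (for example "KSM") across all VMAs
 * in an already opened smaps stream. Returns the total in kB.
 *
 * Hypothetical helper for illustration; it is not part of the patch.
 */
unsigned long sum_smaps_field(FILE *fp, const char *field)
{
	char line[1024];
	unsigned long total = 0;
	size_t len = strlen(field);

	while (fgets(line, sizeof(line), fp)) {
		unsigned long kb;

		/*
		 * Match "<field>:" at the start of the line only, so that
		 * "Swap" does not also match "SwapPss". The comparison is
		 * case-sensitive, so "KSM" does not match "ksm_state".
		 */
		if (strncmp(line, field, len) != 0 || line[len] != ':')
			continue;

		/* Skip the colon; %lu skips the padding before the value. */
		if (sscanf(line + len + 1, "%lu kB", &kb) == 1)
			total += kb;
	}

	return total;
}
```

A caller would open /proc/<pid>/smaps with fopen() and pass the stream along
with "KSM". To rank individual VMAs instead of totalling them, the same loop
could reset the counter each time it sees an address-range header line.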