Received: by 2002:a25:f815:0:0:0:0:0 with SMTP id u21csp3179180ybd; Mon, 24 Jun 2019 21:05:41 -0700 (PDT) X-Google-Smtp-Source: APXvYqw2rMd4sVRJQ8wrSGbHojx6V3l+y0HykZVauR7cLJTa6sxLDPCrCgS8BvKWqt+yreeQ0+ea X-Received: by 2002:a17:902:7297:: with SMTP id d23mr139178843pll.254.1561435541585; Mon, 24 Jun 2019 21:05:41 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1561435541; cv=none; d=google.com; s=arc-20160816; b=uWog5UcnI2ziYxFQi1JOSBa2RqsqY3QSvVio4XqM3D1JOgcMs+v/zfX3zH7wxJq6YI IofA5Keq3zIfVrBAdLDH4Qw1jnnNc+vV0r5uPXetpV3zqdMIOO9HDebVXTLDsCV9VTEV C/AKwAushp8FmGOyFktlmHOQcSFs53+PDT0YmVfVOdoz6A5nsSyqs5TrwOx61r0E5sKE QCuqt+B485wm3kqDnWyBIqjkH9V9AeWfKYV2UoahYsDjHovNAx8w+qjQSX7CMpxasSb3 e4tmWE6eZgJ9wEjiwTDYkJxF1fMx4JAPS3nT/ix11XNIicOBL7PmB8boBQQlPX3FcXGI gJlg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:message-id:date:subject :smtp-origin-cluster:cc:to:smtp-origin-hostname:from :smtp-origin-hostprefix:dkim-signature; bh=sNQn/dOnZHgVylzBkIWRNq/AXNkb7iif//pYLPbgJ2I=; b=V9BS1iGY+mT9xJr6WmWWDq8yFcImeBbsyf7HY46XirFkZrEys+3s2TUmZj2wYKtLeY 4Zm7iZRirLpS1FvdTCYuoDPAH7fgIfO5k/Xdi3tnCSwBpKRiesB+A1N9ljIEIyBDAkSV 2s1TfqMwlPyBh/D3v7YvRViFIPt0sZePvj62b/oX5xs9sY7iE/JIejyzYIzQ7ui94mc4 3S/q5xyT7p4BpFNlIjjw13W2xbLJmaRDYtKi3hO+1smukuHsv6s+1hMuBbZsyhUDB4cb u/kJ6oRIUgAfNJdWNGnrJqD7pFpkgS9FiipiuB/Z9kRNCVLJg/X+WY7B7jehYvI+fd5q j8vQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@fb.com header.s=facebook header.b="qc/1KMTv"; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=fb.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r12si1436555pjp.56.2019.06.24.21.05.25; Mon, 24 Jun 2019 21:05:41 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@fb.com header.s=facebook header.b="qc/1KMTv"; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=fb.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729522AbfFYAM6 (ORCPT + 99 others); Mon, 24 Jun 2019 20:12:58 -0400 Received: from mx0a-00082601.pphosted.com ([67.231.145.42]:21510 "EHLO mx0a-00082601.pphosted.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1729419AbfFYAMx (ORCPT ); Mon, 24 Jun 2019 20:12:53 -0400 Received: from pps.filterd (m0148461.ppops.net [127.0.0.1]) by mx0a-00082601.pphosted.com (8.16.0.27/8.16.0.27) with SMTP id x5P08VvG032472 for ; Mon, 24 Jun 2019 17:12:52 -0700 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=fb.com; h=from : to : cc : subject : date : message-id : mime-version : content-type; s=facebook; bh=sNQn/dOnZHgVylzBkIWRNq/AXNkb7iif//pYLPbgJ2I=; b=qc/1KMTvYsBus8AHq5JcKSWdamajXwVOMhECHDrdcr8x2v9V9P7a65icFGihJURUqTfW EWXTiFme7L9uOqgahFxEaUCPEDf4R0m9ijMiZL0g6+1F/XUMnYxAkqdPTgZn+DToSd9g W7hJGB09EdtzQOZoxL6zqROXb0St/40Ye6E= Received: from mail.thefacebook.com (mailout.thefacebook.com [199.201.64.23]) by mx0a-00082601.pphosted.com with ESMTP id 2tb7gur91t-3 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-SHA384 bits=256 verify=NOT) for ; Mon, 24 Jun 2019 17:12:52 -0700 Received: from mx-out.facebook.com (2620:10d:c081:10::13) by mail.thefacebook.com (2620:10d:c081:35::126) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_CBC_SHA) id 15.1.1713.5; Mon, 24 Jun 2019 17:12:51 -0700 Received: by devbig006.ftw2.facebook.com (Postfix, from userid 4523) id 680D962E206E; Mon, 24 Jun 2019 17:12:49 -0700 (PDT) Smtp-Origin-Hostprefix: devbig From: Song Liu Smtp-Origin-Hostname: devbig006.ftw2.facebook.com To: , , CC: , , , , , , Song Liu Smtp-Origin-Cluster: ftw2c04 Subject: [PATCH v9 0/6] Enable THP for text section of non-shmem files Date: Mon, 24 Jun 2019 17:12:40 -0700 Message-ID: <20190625001246.685563-1-songliubraving@fb.com> X-Mailer: git-send-email 2.17.1 X-FB-Internal: Safe MIME-Version: 1.0 Content-Type: text/plain X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:,, definitions=2019-06-24_16:,, signatures=0 X-Proofpoint-Spam-Details: rule=fb_default_notspam policy=fb_default score=0 priorityscore=1501 malwarescore=0 suspectscore=0 phishscore=0 bulkscore=0 spamscore=0 clxscore=1015 lowpriorityscore=0 mlxscore=0 impostorscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1810050000 definitions=main-1906250000 X-FB-Internal: deliver Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Changes v8 => v9: 1. Fix bad use of IS_ENABLED (kbuild test robot) Changes v7 => v8: 1. Use IS_ENABLED wherever possible (Kirill A. Shutemov); 2. Improve handling of !PageUptodate case (Kirill A. Shutemov); 3. Add comment for calling lru_add_drain (Kirill A. Shutemov); 4. Add more information about DENYWRITE dynamic (Johannes Weiner). Changes v6 => v7: 1. Avoid accessing vma without holding mmap_sem (Hillf Dayton) 2. In collapse_file() use readahead API instead of gup API. This matches better with existing logic for shmem. 3. Add inline documentation for @nr_thps (kbuild test robot) Changes v5 => v6: 1. Improve THP stats in 3/6, (Kirill). Changes v4 => v5: 1. Move the logic to drop THP from pagecache to open() path (Rik). 2. Revise description of CONFIG_READ_ONLY_THP_FOR_FS. Changes v3 => v4: 1. Put the logic to drop THP from pagecache in a separate function (Rik). 2. Move the function to drop THP from pagecache to exit_mmap(). 3. Revise confusing commit log 6/6. Changes v2 => v3: 1. Removed the limitation (cannot write to file with THP) by truncating whole file during sys_open (see 6/6); 2. Fixed a VM_BUG_ON_PAGE() in filemap_fault() (see 2/6); 3. Split function rename to a separate patch (Rik); 4. Updated condition in hugepage_vma_check() (Rik). Changes v1 => v2: 1. Fixed a missing mem_cgroup_commit_charge() for non-shmem case. This set follows up discussion at LSF/MM 2019. The motivation is to put text section of an application in THP, and thus reduces iTLB miss rate and improves performance. Both Facebook and Oracle showed strong interests to this feature. To make reviews easier, this set aims a mininal valid product. Current version of the work does not have any changes to file system specific code. This comes with some limitations (discussed later). This set enables an application to "hugify" its text section by simply running something like: madvise(0x600000, 0x80000, MADV_HUGEPAGE); Before this call, the /proc//maps looks like: 00400000-074d0000 r-xp 00000000 00:27 2006927 app After this call, part of the text section is split out and mapped to THP: 00400000-00425000 r-xp 00000000 00:27 2006927 app 00600000-00e00000 r-xp 00200000 00:27 2006927 app <<< on THP 00e00000-074d0000 r-xp 00a00000 00:27 2006927 app Limitations: 1. This only works for text section (vma with VM_DENYWRITE). 2. Original limitation #2 is removed in v3. We gated this feature with an experimental config, READ_ONLY_THP_FOR_FS. Once we get better support on the write path, we can remove the config and enable it by default. Tested cases: 1. Tested with btrfs and ext4. 2. Tested with real work application (memcache like caching service). 3. Tested with "THP aware uprobe": https://patchwork.kernel.org/project/linux-mm/list/?series=131339 This set (plus a few uprobe patches) is also available at https://github.com/liu-song-6/linux/tree/uprobe-thp Please share your comments and suggestions on this. Thanks! Song Liu (6): filemap: check compound_head(page)->mapping in filemap_fault() filemap: update offset check in filemap_fault() mm,thp: stats for file backed THP khugepaged: rename collapse_shmem() and khugepaged_scan_shmem() mm,thp: add read-only THP support for (non-shmem) FS mm,thp: avoid writes to file with THP in pagecache drivers/base/node.c | 6 +++ fs/inode.c | 3 ++ fs/namei.c | 23 +++++++- fs/proc/meminfo.c | 4 ++ fs/proc/task_mmu.c | 4 +- include/linux/fs.h | 32 +++++++++++ include/linux/mmzone.h | 2 + mm/Kconfig | 11 ++++ mm/filemap.c | 9 ++-- mm/khugepaged.c | 117 ++++++++++++++++++++++++++++++++--------- mm/rmap.c | 12 +++-- mm/vmstat.c | 2 + 12 files changed, 189 insertions(+), 36 deletions(-) -- 2.17.1