Hi,
This patchset implements a cgroup resource controller for HugeTLB
pages. The controller allows limiting HugeTLB usage per control
group and enforces the limit at page fault time. Since HugeTLB
doesn't support page reclaim, enforcing the limit at page fault
time implies that the application will get a SIGBUS signal if it
tries to access HugeTLB pages beyond its limit. This requires the
application to know beforehand how many HugeTLB pages it will
need.
The goal is to control how many HugeTLB pages a group of tasks can
allocate. It can be seen as an extension of the existing quota
interface, which limits the number of HugeTLB pages per hugetlbfs
superblock. HPC job schedulers require jobs to specify their resource
requirements in the job file; once those requirements can be met, a
scheduler such as SLURM will run the job. We need to make sure that
jobs don't consume more resources than requested. If they do, we
should either error out or kill the application.
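The enforcement semantics can be sketched with a small userspace model (plain C; all names here are invented for illustration, this is not the kernel code): a charge that would push usage past the limit fails, and in the kernel that failure surfaces as SIGBUS at fault time instead of reclaim.

```c
#include <stddef.h>

/* Hypothetical per-cgroup counter mirroring the controller's semantics. */
struct hugetlb_limit {
	unsigned long usage;	/* bytes currently charged */
	unsigned long limit;	/* hard limit in bytes */
};

/* Charge nr_bytes against the group; 0 on success, -1 when over limit.
 * In the kernel the failing case means the faulting task gets SIGBUS. */
int charge(struct hugetlb_limit *l, unsigned long nr_bytes)
{
	if (l->usage + nr_bytes > l->limit)
		return -1;
	l->usage += nr_bytes;
	return 0;
}

/* Return a previous charge, e.g. when the huge page is freed. */
void uncharge(struct hugetlb_limit *l, unsigned long nr_bytes)
{
	l->usage -= nr_bytes;
}
```

A job that preflights its requirement would size the limit above its peak usage; any charge beyond that fails rather than being reclaimed.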
Patches are on top of 731a7378b81c2f5fa88ca1ae20b83d548d5613dc
Changes from V6:
* Implement the controller as a separate HugeTLB cgroup.
* Folded fixup patches in -mm to the original patches
Changes from V5:
* Address review feedback.
Changes from V4:
* Add support for charge/uncharge during page migration
* Drop the usage of page->lru in unmap_hugepage_range.
Changes from V3:
* Address review feedback.
* Fix a bug in cgroup removal related parent charging with use_hierarchy set
Changes from V2:
* Changed the implementation to limit HugeTLB usage at page
fault time. This simplifies the extension and keeps it closer to
the memcg design. It also makes supporting cgroup removal less
complex. The only caveat is that the application must ensure its
HugeTLB usage doesn't cross the cgroup limit.
Changes from V1:
* Changed the implementation as a memcg extension. We still use
the same logic to track the cgroup and range.
Changes from RFC post:
* Added support for HugeTLB cgroup hierarchy
* Added support for task migration
* Added documentation patch
* Other bug fixes
-aneesh
From: "Aneesh Kumar K.V" <[email protected]>
The i_mmap_mutex lock was added to unmap_single_vma by 502717f4e ("hugetlb:
fix linked list corruption in unmap_hugepage_range()"), but we no longer
use page->lru in unmap_hugepage_range. The lock is also taken higher up
in the stack in some code paths, which would result in a deadlock:
unmap_mapping_range (i_mmap_mutex)
-> unmap_mapping_range_tree
-> unmap_mapping_range_vma
-> zap_page_range_single
-> unmap_single_vma
-> unmap_hugepage_range (i_mmap_mutex)
For shared page table support for huge pages, page table pages are
reference counted, so we don't need any lock during huge_pmd_unshare. We
do take i_mmap_mutex in huge_pmd_share while walking the vma_prio_tree in
the mapping (39dde65c9940c97f ("shared page table for hugetlb page")).
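The problem is the classic non-recursive-mutex one: the same lock taken again down the same call chain. A userspace pthread sketch (illustrative only; a pthread mutex stands in for the kernel's i_mmap_mutex) shows why the inner acquisition cannot succeed — a blocking lock here would hang forever:

```c
#include <pthread.h>
#include <errno.h>

/* The outer lock models unmap_mapping_range() holding i_mmap_mutex; the
 * inner trylock models unmap_hugepage_range() trying to take it again.
 * With a blocking lock the inner call would deadlock; trylock lets us
 * observe the refusal (EBUSY) instead of hanging. */
int double_acquire(pthread_mutex_t *m)
{
	int ret;

	pthread_mutex_lock(m);			/* outer: unmap_mapping_range */
	ret = pthread_mutex_trylock(m);		/* inner: unmap_hugepage_range */
	if (ret == 0)				/* not reached for a normal mutex */
		pthread_mutex_unlock(m);
	pthread_mutex_unlock(m);
	return ret;				/* EBUSY */
}
```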
Signed-off-by: Aneesh Kumar K.V <[email protected]>
Cc: KAMEZAWA Hiroyuki <[email protected]>
Cc: Hillf Danton <[email protected]>
Cc: Michal Hocko <[email protected]>
Cc: Andrea Arcangeli <[email protected]>
Cc: Johannes Weiner <[email protected]>
---
mm/memory.c | 5 +----
1 file changed, 1 insertion(+), 4 deletions(-)
diff --git a/mm/memory.c b/mm/memory.c
index 545e18a..f6bc04f 100644
--- a/mm/memory.c
+++ b/mm/memory.c
@@ -1326,11 +1326,8 @@ static void unmap_single_vma(struct mmu_gather *tlb,
* Since no pte has actually been setup, it is
* safe to do nothing in this case.
*/
- if (vma->vm_file) {
- mutex_lock(&vma->vm_file->f_mapping->i_mmap_mutex);
+ if (vma->vm_file)
__unmap_hugepage_range(tlb, vma, start, end, NULL);
- mutex_unlock(&vma->vm_file->f_mapping->i_mmap_mutex);
- }
} else
unmap_page_range(tlb, vma, start, end, details);
}
--
1.7.10
From: "Aneesh Kumar K.V" <[email protected]>
This patch implements a new controller that allows us to control HugeTLB
allocations. It allows limiting HugeTLB usage per control group and
enforces the limit at page fault time. Since HugeTLB doesn't support
page reclaim, enforcing the limit at page fault time implies that the
application will get a SIGBUS signal if it tries to access HugeTLB pages
beyond its limit. This requires the application to know beforehand how
many HugeTLB pages it will need.
The charge/uncharge calls will be added to the HugeTLB code in a later
patch. Support for cgroup removal will be added in later patches.
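The intended calling convention — charge before allocation, then commit on success or give the charge back on failure — can be sketched in userspace C (types and names below are illustrative stand-ins, not the kernel API):

```c
#include <stddef.h>

/* Toy stand-ins for struct hugetlb_cgroup and struct page. */
struct h_cgroup { long usage, limit; };
struct h_page   { struct h_cgroup *owner; };

/* Phase 1: reserve against the counter before the page exists. */
int charge_page(struct h_cgroup *cg, long sz)
{
	if (cg->usage + sz > cg->limit)
		return -1;		/* caller turns this into SIGBUS */
	cg->usage += sz;
	return 0;
}

/* Phase 2a: allocation succeeded, bind the cgroup to the page. */
void commit_charge(struct h_cgroup *cg, struct h_page *pg)
{
	pg->owner = cg;
}

/* Phase 2b: allocation failed, return the reservation to the cgroup. */
void uncharge_cgroup(struct h_cgroup *cg, long sz)
{
	cg->usage -= sz;
}
```

Splitting charge from commit keeps the counter update outside hugetlb_lock while still recording ownership on the page once allocation succeeds.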
Signed-off-by: Aneesh Kumar K.V <[email protected]>
---
include/linux/cgroup_subsys.h | 6 +
include/linux/hugetlb_cgroup.h | 79 ++++++++++++
init/Kconfig | 14 ++
mm/Makefile | 1 +
mm/hugetlb_cgroup.c | 280 ++++++++++++++++++++++++++++++++++++++++
mm/page_cgroup.c | 5 +-
6 files changed, 383 insertions(+), 2 deletions(-)
create mode 100644 include/linux/hugetlb_cgroup.h
create mode 100644 mm/hugetlb_cgroup.c
diff --git a/include/linux/cgroup_subsys.h b/include/linux/cgroup_subsys.h
index 0bd390c..895923a 100644
--- a/include/linux/cgroup_subsys.h
+++ b/include/linux/cgroup_subsys.h
@@ -72,3 +72,9 @@ SUBSYS(net_prio)
#endif
/* */
+
+#ifdef CONFIG_CGROUP_HUGETLB_RES_CTLR
+SUBSYS(hugetlb)
+#endif
+
+/* */
diff --git a/include/linux/hugetlb_cgroup.h b/include/linux/hugetlb_cgroup.h
new file mode 100644
index 0000000..5794be4
--- /dev/null
+++ b/include/linux/hugetlb_cgroup.h
@@ -0,0 +1,79 @@
+/*
+ * Copyright IBM Corporation, 2012
+ * Author Aneesh Kumar K.V <[email protected]>
+ *
+ * This program is free software; you can redistribute it and/or modify it
+ * under the terms of version 2.1 of the GNU Lesser General Public License
+ * as published by the Free Software Foundation.
+ *
+ * This program is distributed in the hope that it would be useful, but
+ * WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
+ *
+ */
+
+#ifndef _LINUX_HUGETLB_CGROUP_H
+#define _LINUX_HUGETLB_CGROUP_H
+
+#include <linux/res_counter.h>
+
+struct hugetlb_cgroup {
+ struct cgroup_subsys_state css;
+ /*
+ * the counter to account for hugepages from hugetlb.
+ */
+ struct res_counter hugepage[HUGE_MAX_HSTATE];
+};
+
+#ifdef CONFIG_CGROUP_HUGETLB_RES_CTLR
+static inline bool hugetlb_cgroup_disabled(void)
+{
+ if (hugetlb_subsys.disabled)
+ return true;
+ return false;
+}
+
+extern int hugetlb_cgroup_charge_page(int idx, unsigned long nr_pages,
+ struct hugetlb_cgroup **ptr);
+extern void hugetlb_cgroup_commit_charge(int idx, unsigned long nr_pages,
+ struct hugetlb_cgroup *h_cg,
+ struct page *page);
+extern void hugetlb_cgroup_uncharge_page(int idx, unsigned long nr_pages,
+ struct page *page);
+extern void hugetlb_cgroup_uncharge_cgroup(int idx, unsigned long nr_pages,
+ struct hugetlb_cgroup *h_cg);
+#else
+static inline bool hugetlb_cgroup_disabled(void)
+{
+ return true;
+}
+
+static inline int
+hugetlb_cgroup_charge_page(int idx, unsigned long nr_pages,
+ struct hugetlb_cgroup **ptr)
+{
+ return 0;
+}
+
+static inline void
+hugetlb_cgroup_commit_charge(int idx, unsigned long nr_pages,
+ struct hugetlb_cgroup *h_cg,
+ struct page *page)
+{
+ return;
+}
+
+static inline void
+hugetlb_cgroup_uncharge_page(int idx, unsigned long nr_pages, struct page *page)
+{
+ return;
+}
+
+static inline void
+hugetlb_cgroup_uncharge_cgroup(int idx, unsigned long nr_pages,
+ struct hugetlb_cgroup *h_cg)
+{
+ return;
+}
+#endif /* CONFIG_CGROUP_HUGETLB_RES_CTLR */
+#endif
diff --git a/init/Kconfig b/init/Kconfig
index 1363203..73b14b0 100644
--- a/init/Kconfig
+++ b/init/Kconfig
@@ -714,6 +714,20 @@ config CGROUP_MEM_RES_CTLR
This config option also selects MM_OWNER config option, which
could in turn add some fork/exit overhead.
+config CGROUP_HUGETLB_RES_CTLR
+ bool "HugeTLB Resource Controller for Control Groups"
+ depends on RESOURCE_COUNTERS && HUGETLB_PAGE && EXPERIMENTAL
+ select PAGE_CGROUP
+ default n
+ help
+	  Provides a simple cgroup Resource Controller for HugeTLB pages.
+	  When you enable this, you can put a per-cgroup limit on HugeTLB
+	  usage. The limit is enforced at page fault time. Since HugeTLB
+	  doesn't support page reclaim, enforcing the limit at page fault time
+	  implies that the application will get a SIGBUS signal if it tries to
+	  access HugeTLB pages beyond its limit. This requires the application
+	  to know beforehand how many HugeTLB pages it will need.
+
config CGROUP_MEM_RES_CTLR_SWAP
bool "Memory Resource Controller Swap Extension"
depends on CGROUP_MEM_RES_CTLR && SWAP
diff --git a/mm/Makefile b/mm/Makefile
index a70f9a9..bed4944 100644
--- a/mm/Makefile
+++ b/mm/Makefile
@@ -48,6 +48,7 @@ obj-$(CONFIG_MIGRATION) += migrate.o
obj-$(CONFIG_QUICKLIST) += quicklist.o
obj-$(CONFIG_TRANSPARENT_HUGEPAGE) += huge_memory.o
obj-$(CONFIG_CGROUP_MEM_RES_CTLR) += memcontrol.o
+obj-$(CONFIG_CGROUP_HUGETLB_RES_CTLR) += hugetlb_cgroup.o
obj-$(CONFIG_PAGE_CGROUP) += page_cgroup.o
obj-$(CONFIG_MEMORY_FAILURE) += memory-failure.o
obj-$(CONFIG_HWPOISON_INJECT) += hwpoison-inject.o
diff --git a/mm/hugetlb_cgroup.c b/mm/hugetlb_cgroup.c
new file mode 100644
index 0000000..3a288f7
--- /dev/null
+++ b/mm/hugetlb_cgroup.c
@@ -0,0 +1,280 @@
+/*
+ *
+ * Copyright IBM Corporation, 2012
+ * Author Aneesh Kumar K.V <[email protected]>
+ *
+ * This program is free software; you can redistribute it and/or modify it
+ * under the terms of version 2.1 of the GNU Lesser General Public License
+ * as published by the Free Software Foundation.
+ *
+ * This program is distributed in the hope that it would be useful, but
+ * WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
+ *
+ */
+
+#include <linux/cgroup.h>
+#include <linux/slab.h>
+#include <linux/hugetlb.h>
+#include <linux/page_cgroup.h>
+#include <linux/hugetlb_cgroup.h>
+
+struct cgroup_subsys hugetlb_subsys __read_mostly;
+struct hugetlb_cgroup *root_h_cgroup __read_mostly;
+
+static inline
+struct hugetlb_cgroup *hugetlb_cgroup_from_css(struct cgroup_subsys_state *s)
+{
+ return container_of(s, struct hugetlb_cgroup, css);
+}
+
+static inline
+struct hugetlb_cgroup *hugetlb_cgroup_from_cgroup(struct cgroup *cgroup)
+{
+ if (!cgroup)
+ return NULL;
+ return hugetlb_cgroup_from_css(cgroup_subsys_state(cgroup,
+ hugetlb_subsys_id));
+}
+
+static inline
+struct hugetlb_cgroup *hugetlb_cgroup_from_task(struct task_struct *task)
+{
+ return hugetlb_cgroup_from_css(task_subsys_state(task,
+ hugetlb_subsys_id));
+}
+
+static inline bool hugetlb_cgroup_is_root(struct hugetlb_cgroup *h_cg)
+{
+ return (h_cg == root_h_cgroup);
+}
+
+static struct hugetlb_cgroup *parent_hugetlb_cgroup(struct cgroup *cg)
+{
+ if (!cg->parent)
+ return NULL;
+ return hugetlb_cgroup_from_cgroup(cg->parent);
+}
+
+static inline bool hugetlb_cgroup_have_usage(struct cgroup *cg)
+{
+ int idx;
+ struct hugetlb_cgroup *h_cg = hugetlb_cgroup_from_cgroup(cg);
+
+ for (idx = 0; idx < HUGE_MAX_HSTATE; idx++) {
+ if ((res_counter_read_u64(&h_cg->hugepage[idx], RES_USAGE)) > 0)
+ return 1;
+ }
+ return 0;
+}
+
+static struct cgroup_subsys_state *hugetlb_cgroup_create(struct cgroup *cgroup)
+{
+ int idx;
+ struct cgroup *parent_cgroup;
+ struct hugetlb_cgroup *h_cgroup, *parent_h_cgroup;
+
+ h_cgroup = kzalloc(sizeof(*h_cgroup), GFP_KERNEL);
+ if (!h_cgroup)
+ return ERR_PTR(-ENOMEM);
+
+ parent_cgroup = cgroup->parent;
+ if (parent_cgroup) {
+ parent_h_cgroup = hugetlb_cgroup_from_cgroup(parent_cgroup);
+ for (idx = 0; idx < HUGE_MAX_HSTATE; idx++)
+ res_counter_init(&h_cgroup->hugepage[idx],
+ &parent_h_cgroup->hugepage[idx]);
+ } else {
+ root_h_cgroup = h_cgroup;
+ for (idx = 0; idx < HUGE_MAX_HSTATE; idx++)
+ res_counter_init(&h_cgroup->hugepage[idx], NULL);
+ }
+ return &h_cgroup->css;
+}
+
+static int hugetlb_cgroup_move_parent(int idx, struct cgroup *cgroup,
+ struct page *page)
+{
+ int csize, ret = 0;
+ struct page_cgroup *pc;
+ struct res_counter *counter;
+ struct res_counter *fail_res;
+ struct hugetlb_cgroup *h_cg = hugetlb_cgroup_from_cgroup(cgroup);
+ struct hugetlb_cgroup *parent = parent_hugetlb_cgroup(cgroup);
+
+ if (!get_page_unless_zero(page))
+ goto out;
+
+ pc = lookup_page_cgroup(page);
+ lock_page_cgroup(pc);
+ if (!PageCgroupUsed(pc) || pc->cgroup != cgroup)
+ goto err_out;
+
+ csize = PAGE_SIZE << compound_order(page);
+ /* If use_hierarchy == 0, we need to charge root */
+ if (!parent) {
+ parent = root_h_cgroup;
+ /* root has no limit */
+ res_counter_charge_nofail(&parent->hugepage[idx],
+ csize, &fail_res);
+ }
+ counter = &h_cg->hugepage[idx];
+ res_counter_uncharge_until(counter, counter->parent, csize);
+
+ pc->cgroup = cgroup->parent;
+err_out:
+ unlock_page_cgroup(pc);
+ put_page(page);
+out:
+ return ret;
+}
+
+/*
+ * Force the hugetlb cgroup to empty the hugetlb resources by moving them to
+ * the parent cgroup.
+ */
+static int hugetlb_cgroup_pre_destroy(struct cgroup *cgroup)
+{
+ struct hstate *h;
+ struct page *page;
+ int ret = 0, idx = 0;
+
+ do {
+ if (cgroup_task_count(cgroup) ||
+ !list_empty(&cgroup->children)) {
+ ret = -EBUSY;
+ goto out;
+ }
+ /*
+ * If the task doing the cgroup_rmdir got a signal
+ * we don't really need to loop till the hugetlb resource
+	 * usage becomes zero.
+ */
+ if (signal_pending(current)) {
+ ret = -EINTR;
+ goto out;
+ }
+ for_each_hstate(h) {
+ spin_lock(&hugetlb_lock);
+ list_for_each_entry(page, &h->hugepage_activelist, lru) {
+ ret = hugetlb_cgroup_move_parent(idx, cgroup, page);
+ if (ret) {
+ spin_unlock(&hugetlb_lock);
+ goto out;
+ }
+ }
+ spin_unlock(&hugetlb_lock);
+ idx++;
+ }
+ cond_resched();
+ } while (hugetlb_cgroup_have_usage(cgroup));
+out:
+ return ret;
+}
+
+static void hugetlb_cgroup_destroy(struct cgroup *cgroup)
+{
+ struct hugetlb_cgroup *h_cgroup;
+
+ h_cgroup = hugetlb_cgroup_from_cgroup(cgroup);
+ kfree(h_cgroup);
+}
+
+int hugetlb_cgroup_charge_page(int idx, unsigned long nr_pages,
+ struct hugetlb_cgroup **ptr)
+{
+ int ret = 0;
+ struct res_counter *fail_res;
+ struct hugetlb_cgroup *h_cg = NULL;
+ unsigned long csize = nr_pages * PAGE_SIZE;
+
+ if (hugetlb_cgroup_disabled())
+ goto done;
+again:
+ rcu_read_lock();
+ h_cg = hugetlb_cgroup_from_task(current);
+ if (!h_cg)
+ h_cg = root_h_cgroup;
+
+ if (!css_tryget(&h_cg->css)) {
+ rcu_read_unlock();
+ goto again;
+ }
+ rcu_read_unlock();
+
+ ret = res_counter_charge(&h_cg->hugepage[idx], csize, &fail_res);
+ css_put(&h_cg->css);
+done:
+ *ptr = h_cg;
+ return ret;
+}
+
+void hugetlb_cgroup_commit_charge(int idx, unsigned long nr_pages,
+ struct hugetlb_cgroup *h_cg,
+ struct page *page)
+{
+ struct page_cgroup *pc;
+
+ if (hugetlb_cgroup_disabled())
+ return;
+
+ pc = lookup_page_cgroup(page);
+ lock_page_cgroup(pc);
+ if (unlikely(PageCgroupUsed(pc))) {
+ unlock_page_cgroup(pc);
+ hugetlb_cgroup_uncharge_cgroup(idx, nr_pages, h_cg);
+ return;
+ }
+ pc->cgroup = h_cg->css.cgroup;
+ SetPageCgroupUsed(pc);
+ unlock_page_cgroup(pc);
+ return;
+}
+
+void hugetlb_cgroup_uncharge_page(int idx, unsigned long nr_pages,
+ struct page *page)
+{
+ struct page_cgroup *pc;
+ struct hugetlb_cgroup *h_cg;
+ unsigned long csize = nr_pages * PAGE_SIZE;
+
+ if (hugetlb_cgroup_disabled())
+ return;
+
+ pc = lookup_page_cgroup(page);
+ if (unlikely(!PageCgroupUsed(pc)))
+ return;
+
+ lock_page_cgroup(pc);
+ if (!PageCgroupUsed(pc)) {
+ unlock_page_cgroup(pc);
+ return;
+ }
+ h_cg = hugetlb_cgroup_from_cgroup(pc->cgroup);
+ pc->cgroup = root_h_cgroup->css.cgroup;
+ ClearPageCgroupUsed(pc);
+ unlock_page_cgroup(pc);
+
+ res_counter_uncharge(&h_cg->hugepage[idx], csize);
+ return;
+}
+
+void hugetlb_cgroup_uncharge_cgroup(int idx, unsigned long nr_pages,
+ struct hugetlb_cgroup *h_cg)
+{
+ unsigned long csize = nr_pages * PAGE_SIZE;
+
+ if (hugetlb_cgroup_disabled())
+ return;
+
+ res_counter_uncharge(&h_cg->hugepage[idx], csize);
+ return;
+}
+
+struct cgroup_subsys hugetlb_subsys = {
+ .name = "hugetlb",
+ .create = hugetlb_cgroup_create,
+ .pre_destroy = hugetlb_cgroup_pre_destroy,
+ .destroy = hugetlb_cgroup_destroy,
+ .subsys_id = hugetlb_subsys_id,
+};
diff --git a/mm/page_cgroup.c b/mm/page_cgroup.c
index 1ccbd71..26271b7 100644
--- a/mm/page_cgroup.c
+++ b/mm/page_cgroup.c
@@ -10,6 +10,7 @@
#include <linux/cgroup.h>
#include <linux/swapops.h>
#include <linux/kmemleak.h>
+#include <linux/hugetlb_cgroup.h>
static unsigned long total_usage;
@@ -68,7 +69,7 @@ void __init page_cgroup_init_flatmem(void)
int nid, fail;
- if (mem_cgroup_disabled())
+ if (mem_cgroup_disabled() && hugetlb_cgroup_disabled())
return;
for_each_online_node(nid) {
@@ -268,7 +269,7 @@ void __init page_cgroup_init(void)
unsigned long pfn;
int nid;
- if (mem_cgroup_disabled())
+ if (mem_cgroup_disabled() && hugetlb_cgroup_disabled())
return;
for_each_node_state(nid, N_HIGH_MEMORY) {
--
1.7.10
From: "Aneesh Kumar K.V" <[email protected]>
This adds the necessary charge/uncharge calls to the HugeTLB code. We
charge the hugetlb cgroup at page allocation and uncharge in the compound
page destructor. We also need to ignore HugeTLB pages in
__mem_cgroup_uncharge_common because it gets called from
delete_from_page_cache.
Signed-off-by: Aneesh Kumar K.V <[email protected]>
Reviewed-by: KAMEZAWA Hiroyuki <[email protected]>
Acked-by: Hillf Danton <[email protected]>
Cc: Michal Hocko <[email protected]>
Cc: Andrea Arcangeli <[email protected]>
Cc: Johannes Weiner <[email protected]>
---
mm/hugetlb.c | 16 +++++++++++++++-
mm/memcontrol.c | 5 +++++
2 files changed, 20 insertions(+), 1 deletion(-)
diff --git a/mm/hugetlb.c b/mm/hugetlb.c
index 6330de2..cad7a4d 100644
--- a/mm/hugetlb.c
+++ b/mm/hugetlb.c
@@ -625,6 +625,8 @@ static void free_huge_page(struct page *page)
BUG_ON(page_count(page));
BUG_ON(page_mapcount(page));
+ hugetlb_cgroup_uncharge_page(hstate_index(h),
+ pages_per_huge_page(h), page);
spin_lock(&hugetlb_lock);
if (h->surplus_huge_pages_node[nid] && huge_page_order(h) < MAX_ORDER) {
/* remove the page from active list */
@@ -1112,7 +1114,10 @@ static struct page *alloc_huge_page(struct vm_area_struct *vma,
struct hstate *h = hstate_vma(vma);
struct page *page;
long chg;
+ int ret, idx;
+ struct hugetlb_cgroup *h_cg;
+ idx = hstate_index(h);
/*
* Processes that did not create the mapping will have no
* reserves and will not have accounted against subpool
@@ -1128,6 +1133,11 @@ static struct page *alloc_huge_page(struct vm_area_struct *vma,
if (hugepage_subpool_get_pages(spool, chg))
return ERR_PTR(-ENOSPC);
+ ret = hugetlb_cgroup_charge_page(idx, pages_per_huge_page(h), &h_cg);
+ if (ret) {
+ hugepage_subpool_put_pages(spool, chg);
+ return ERR_PTR(-ENOSPC);
+ }
spin_lock(&hugetlb_lock);
page = dequeue_huge_page_vma(h, vma, addr, avoid_reserve);
spin_unlock(&hugetlb_lock);
@@ -1135,6 +1145,9 @@ static struct page *alloc_huge_page(struct vm_area_struct *vma,
if (!page) {
page = alloc_buddy_huge_page(h, NUMA_NO_NODE);
if (!page) {
+ hugetlb_cgroup_uncharge_cgroup(idx,
+ pages_per_huge_page(h),
+ h_cg);
hugepage_subpool_put_pages(spool, chg);
return ERR_PTR(-ENOSPC);
}
@@ -1143,7 +1156,8 @@ static struct page *alloc_huge_page(struct vm_area_struct *vma,
set_page_private(page, (unsigned long)spool);
vma_commit_reservation(h, vma, addr);
-
+ /* update page cgroup details */
+ hugetlb_cgroup_commit_charge(idx, pages_per_huge_page(h), h_cg, page);
return page;
}
diff --git a/mm/memcontrol.c b/mm/memcontrol.c
index 6df019b..a52780b 100644
--- a/mm/memcontrol.c
+++ b/mm/memcontrol.c
@@ -2931,6 +2931,11 @@ __mem_cgroup_uncharge_common(struct page *page, enum charge_type ctype)
if (PageSwapCache(page))
return NULL;
+ /*
+	 * HugeTLB page uncharge happens in the HugeTLB compound page destructor
+ */
+ if (PageHuge(page))
+ return NULL;
if (PageTransHuge(page)) {
nr_pages <<= compound_order(page);
--
1.7.10
From: "Aneesh Kumar K.V" <[email protected]>
The current use of VM_FAULT_* codes with ERR_PTR requires us to ensure
that VM_FAULT_* values do not exceed MAX_ERRNO. Decouple the VM_FAULT_*
values from MAX_ERRNO.
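The constraint comes from how errors are packed into pointers. A userspace re-implementation of the idiom (helpers and constants copied by hand for illustration; the real ones live in include/linux/err.h and include/linux/mm.h) shows why only values up to MAX_ERRNO survive the ERR_PTR round-trip, and how callers translate plain errno values back into fault codes:

```c
#include <errno.h>

#define MAX_ERRNO	4095
#define VM_FAULT_OOM	0x0001	/* illustrative copies of the kernel values */
#define VM_FAULT_SIGBUS	0x0002

/* ERR_PTR packs a small negative errno into the top of the address
 * space; IS_ERR recognizes anything in the last MAX_ERRNO bytes. */
static inline void *ERR_PTR(long error) { return (void *)error; }
static inline long PTR_ERR(const void *ptr) { return (long)ptr; }
static inline int IS_ERR(const void *ptr)
{
	return (unsigned long)ptr >= (unsigned long)-MAX_ERRNO;
}

/* Caller-side translation, as done in the fault paths after this patch. */
int fault_code(const void *page)
{
	return (PTR_ERR(page) == -ENOMEM) ? VM_FAULT_OOM : VM_FAULT_SIGBUS;
}
```

Returning -ENOMEM/-ENOSPC from alloc_huge_page() and translating at the call site means the VM_FAULT_* values never need to fit inside MAX_ERRNO.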
Signed-off-by: Aneesh Kumar K.V <[email protected]>
Reviewed-by: KAMEZAWA Hiroyuki <[email protected]>
Cc: Hillf Danton <[email protected]>
Cc: Michal Hocko <[email protected]>
Cc: Andrea Arcangeli <[email protected]>
Cc: Johannes Weiner <[email protected]>
---
mm/hugetlb.c | 18 +++++++++++++-----
1 file changed, 13 insertions(+), 5 deletions(-)
diff --git a/mm/hugetlb.c b/mm/hugetlb.c
index e07d4cd..8ded02d 100644
--- a/mm/hugetlb.c
+++ b/mm/hugetlb.c
@@ -1123,10 +1123,10 @@ static struct page *alloc_huge_page(struct vm_area_struct *vma,
*/
chg = vma_needs_reservation(h, vma, addr);
if (chg < 0)
- return ERR_PTR(-VM_FAULT_OOM);
+ return ERR_PTR(-ENOMEM);
if (chg)
if (hugepage_subpool_get_pages(spool, chg))
- return ERR_PTR(-VM_FAULT_SIGBUS);
+ return ERR_PTR(-ENOSPC);
spin_lock(&hugetlb_lock);
page = dequeue_huge_page_vma(h, vma, addr, avoid_reserve);
@@ -1136,7 +1136,7 @@ static struct page *alloc_huge_page(struct vm_area_struct *vma,
page = alloc_buddy_huge_page(h, NUMA_NO_NODE);
if (!page) {
hugepage_subpool_put_pages(spool, chg);
- return ERR_PTR(-VM_FAULT_SIGBUS);
+ return ERR_PTR(-ENOSPC);
}
}
@@ -2496,6 +2496,7 @@ retry_avoidcopy:
new_page = alloc_huge_page(vma, address, outside_reserve);
if (IS_ERR(new_page)) {
+ int err = PTR_ERR(new_page);
page_cache_release(old_page);
/*
@@ -2524,7 +2525,10 @@ retry_avoidcopy:
/* Caller expects lock to be held */
spin_lock(&mm->page_table_lock);
- return -PTR_ERR(new_page);
+ if (err == -ENOMEM)
+ return VM_FAULT_OOM;
+ else
+ return VM_FAULT_SIGBUS;
}
/*
@@ -2642,7 +2646,11 @@ retry:
goto out;
page = alloc_huge_page(vma, address, 0);
if (IS_ERR(page)) {
- ret = -PTR_ERR(page);
+ ret = PTR_ERR(page);
+ if (ret == -ENOMEM)
+ ret = VM_FAULT_OOM;
+ else
+ ret = VM_FAULT_SIGBUS;
goto out;
}
clear_huge_page(page, address, pages_per_huge_page(h));
--
1.7.10
From: "Aneesh Kumar K.V" <[email protected]>
Signed-off-by: Aneesh Kumar K.V <[email protected]>
---
Documentation/cgroups/hugetlb.txt | 45 +++++++++++++++++++++++++++++++++++++
1 file changed, 45 insertions(+)
create mode 100644 Documentation/cgroups/hugetlb.txt
diff --git a/Documentation/cgroups/hugetlb.txt b/Documentation/cgroups/hugetlb.txt
new file mode 100644
index 0000000..a9faaca
--- /dev/null
+++ b/Documentation/cgroups/hugetlb.txt
@@ -0,0 +1,45 @@
+HugeTLB Controller
+-------------------
+
+The HugeTLB controller allows limiting HugeTLB usage per control group
+and enforces the limit during page fault. Since HugeTLB doesn't support
+page reclaim, enforcing the limit at page fault time implies that the
+application will get a SIGBUS signal if it tries to access HugeTLB pages
+beyond its limit. This requires the application to know beforehand how
+many HugeTLB pages it will need.
+
+The HugeTLB controller can be used by first mounting the cgroup filesystem.
+
+# mount -t cgroup -o hugetlb none /sys/fs/cgroup
+
+With the above step, the initial or the parent HugeTLB group becomes
+visible at /sys/fs/cgroup. At bootup, this group includes all the tasks in
+the system. /sys/fs/cgroup/tasks lists the tasks in this cgroup.
+
+New groups can be created under the parent group /sys/fs/cgroup.
+
+# cd /sys/fs/cgroup
+# mkdir g1
+# echo $$ > g1/tasks
+
+The above steps create a new group g1 and move the current shell
+process (bash) into it.
+
+Brief summary of control files
+
+ hugetlb.<hugepagesize>.limit_in_bytes # set/show limit of "hugepagesize" hugetlb usage
+ hugetlb.<hugepagesize>.max_usage_in_bytes # show max "hugepagesize" hugetlb usage recorded
+ hugetlb.<hugepagesize>.usage_in_bytes # show current res_counter usage for "hugepagesize" hugetlb
+ hugetlb.<hugepagesize>.failcnt			# show the number of allocation failures due to the HugeTLB limit
+
+For a system supporting two hugepage sizes (16MB and 16GB), the control
+files include:
+
+hugetlb.16GB.limit_in_bytes
+hugetlb.16GB.max_usage_in_bytes
+hugetlb.16GB.usage_in_bytes
+hugetlb.16GB.failcnt
+hugetlb.16MB.limit_in_bytes
+hugetlb.16MB.max_usage_in_bytes
+hugetlb.16MB.usage_in_bytes
+hugetlb.16MB.failcnt
--
1.7.10
From: "Aneesh Kumar K.V" <[email protected]>
Add the control files for the hugetlb controller.
Signed-off-by: Aneesh Kumar K.V <[email protected]>
---
include/linux/hugetlb.h | 5 ++
include/linux/hugetlb_cgroup.h | 6 ++
mm/hugetlb.c | 2 +
mm/hugetlb_cgroup.c | 130 ++++++++++++++++++++++++++++++++++++++++
4 files changed, 143 insertions(+)
diff --git a/include/linux/hugetlb.h b/include/linux/hugetlb.h
index dcd55c7..92f75a5 100644
--- a/include/linux/hugetlb.h
+++ b/include/linux/hugetlb.h
@@ -4,6 +4,7 @@
#include <linux/mm_types.h>
#include <linux/fs.h>
#include <linux/hugetlb_inline.h>
+#include <linux/cgroup.h>
struct ctl_table;
struct user_struct;
@@ -221,6 +222,10 @@ struct hstate {
unsigned int nr_huge_pages_node[MAX_NUMNODES];
unsigned int free_huge_pages_node[MAX_NUMNODES];
unsigned int surplus_huge_pages_node[MAX_NUMNODES];
+#ifdef CONFIG_CGROUP_HUGETLB_RES_CTLR
+ /* cgroup control files */
+ struct cftype cgroup_files[5];
+#endif
char name[HSTATE_NAME_LEN];
};
diff --git a/include/linux/hugetlb_cgroup.h b/include/linux/hugetlb_cgroup.h
index 5794be4..fbf8c5f 100644
--- a/include/linux/hugetlb_cgroup.h
+++ b/include/linux/hugetlb_cgroup.h
@@ -42,6 +42,7 @@ extern void hugetlb_cgroup_uncharge_page(int idx, unsigned long nr_pages,
struct page *page);
extern void hugetlb_cgroup_uncharge_cgroup(int idx, unsigned long nr_pages,
struct hugetlb_cgroup *h_cg);
+extern int hugetlb_cgroup_file_init(int idx) __init;
#else
static inline bool hugetlb_cgroup_disabled(void)
{
@@ -75,5 +76,10 @@ hugetlb_cgroup_uncharge_cgroup(int idx, unsigned long nr_pages,
{
return;
}
+
+static inline int __init hugetlb_cgroup_file_init(int idx)
+{
+ return 0;
+}
#endif /* CONFIG_CGROUP_HUGETLB_RES_CTLR */
#endif
diff --git a/mm/hugetlb.c b/mm/hugetlb.c
index 53840dd..6330de2 100644
--- a/mm/hugetlb.c
+++ b/mm/hugetlb.c
@@ -29,6 +29,7 @@
#include <linux/io.h>
#include <linux/hugetlb.h>
#include <linux/node.h>
+#include <linux/hugetlb_cgroup.h>
#include "internal.h"
const unsigned long hugetlb_zero = 0, hugetlb_infinity = ~0UL;
@@ -1912,6 +1913,7 @@ void __init hugetlb_add_hstate(unsigned order)
h->next_nid_to_free = first_node(node_states[N_HIGH_MEMORY]);
snprintf(h->name, HSTATE_NAME_LEN, "hugepages-%lukB",
huge_page_size(h)/1024);
+ hugetlb_cgroup_file_init(hugetlb_max_hstate - 1);
parsed_hstate = h;
}
diff --git a/mm/hugetlb_cgroup.c b/mm/hugetlb_cgroup.c
index 3a288f7..49a3f20 100644
--- a/mm/hugetlb_cgroup.c
+++ b/mm/hugetlb_cgroup.c
@@ -19,6 +19,11 @@
#include <linux/page_cgroup.h>
#include <linux/hugetlb_cgroup.h>
+/* lifted from memcontrol.c */
+#define MEMFILE_PRIVATE(x, val) (((x) << 16) | (val))
+#define MEMFILE_IDX(val) (((val) >> 16) & 0xffff)
+#define MEMFILE_ATTR(val) ((val) & 0xffff)
+
struct cgroup_subsys hugetlb_subsys __read_mostly;
struct hugetlb_cgroup *root_h_cgroup __read_mostly;
@@ -271,6 +276,131 @@ void hugetlb_cgroup_uncharge_cgroup(int idx, unsigned long nr_pages,
return;
}
+static ssize_t hugetlb_cgroup_read(struct cgroup *cgroup, struct cftype *cft,
+ struct file *file, char __user *buf,
+ size_t nbytes, loff_t *ppos)
+{
+ u64 val;
+ char str[64];
+ int idx, name, len;
+ struct hugetlb_cgroup *h_cg = hugetlb_cgroup_from_cgroup(cgroup);
+
+ idx = MEMFILE_IDX(cft->private);
+ name = MEMFILE_ATTR(cft->private);
+
+ val = res_counter_read_u64(&h_cg->hugepage[idx], name);
+ len = scnprintf(str, sizeof(str), "%llu\n", (unsigned long long)val);
+ return simple_read_from_buffer(buf, nbytes, ppos, str, len);
+}
+
+static int hugetlb_cgroup_write(struct cgroup *cgroup, struct cftype *cft,
+ const char *buffer)
+{
+ int idx, name, ret;
+ unsigned long long val;
+ struct hugetlb_cgroup *h_cg = hugetlb_cgroup_from_cgroup(cgroup);
+
+ idx = MEMFILE_IDX(cft->private);
+ name = MEMFILE_ATTR(cft->private);
+
+ switch (name) {
+ case RES_LIMIT:
+ if (hugetlb_cgroup_is_root(h_cg)) {
+ /* Can't set limit on root */
+ ret = -EINVAL;
+ break;
+ }
+		/* This function does all the necessary parsing; reuse it */
+ ret = res_counter_memparse_write_strategy(buffer, &val);
+ if (ret)
+ break;
+ ret = res_counter_set_limit(&h_cg->hugepage[idx], val);
+ break;
+ default:
+ ret = -EINVAL;
+ break;
+ }
+ return ret;
+}
+
+static int hugetlb_cgroup_reset(struct cgroup *cgroup, unsigned int event)
+{
+ int idx, name, ret = 0;
+ struct hugetlb_cgroup *h_cg = hugetlb_cgroup_from_cgroup(cgroup);
+
+ idx = MEMFILE_IDX(event);
+ name = MEMFILE_ATTR(event);
+
+ switch (name) {
+ case RES_MAX_USAGE:
+ res_counter_reset_max(&h_cg->hugepage[idx]);
+ break;
+ case RES_FAILCNT:
+ res_counter_reset_failcnt(&h_cg->hugepage[idx]);
+ break;
+ default:
+ ret = -EINVAL;
+ break;
+ }
+ return ret;
+}
+
+static char *mem_fmt(char *buf, int size, unsigned long hsize)
+{
+ if (hsize >= (1UL << 30))
+ snprintf(buf, size, "%luGB", hsize >> 30);
+ else if (hsize >= (1UL << 20))
+ snprintf(buf, size, "%luMB", hsize >> 20);
+ else
+ snprintf(buf, size, "%luKB", hsize >> 10);
+ return buf;
+}
+
+int __init hugetlb_cgroup_file_init(int idx)
+{
+ char buf[32];
+ struct cftype *cft;
+ struct hstate *h = &hstates[idx];
+
+ /* format the size */
+ mem_fmt(buf, 32, huge_page_size(h));
+
+ /* Add the limit file */
+ cft = &h->cgroup_files[0];
+ snprintf(cft->name, MAX_CFTYPE_NAME, "%s.limit_in_bytes", buf);
+ cft->private = MEMFILE_PRIVATE(idx, RES_LIMIT);
+ cft->read = hugetlb_cgroup_read;
+ cft->write_string = hugetlb_cgroup_write;
+
+ /* Add the usage file */
+ cft = &h->cgroup_files[1];
+ snprintf(cft->name, MAX_CFTYPE_NAME, "%s.usage_in_bytes", buf);
+ cft->private = MEMFILE_PRIVATE(idx, RES_USAGE);
+ cft->read = hugetlb_cgroup_read;
+
+ /* Add the MAX usage file */
+ cft = &h->cgroup_files[2];
+ snprintf(cft->name, MAX_CFTYPE_NAME, "%s.max_usage_in_bytes", buf);
+ cft->private = MEMFILE_PRIVATE(idx, RES_MAX_USAGE);
+ cft->trigger = hugetlb_cgroup_reset;
+ cft->read = hugetlb_cgroup_read;
+
+	/* Add the failcnt file */
+ cft = &h->cgroup_files[3];
+ snprintf(cft->name, MAX_CFTYPE_NAME, "%s.failcnt", buf);
+ cft->private = MEMFILE_PRIVATE(idx, RES_FAILCNT);
+ cft->trigger = hugetlb_cgroup_reset;
+ cft->read = hugetlb_cgroup_read;
+
+ /* NULL terminate the last cft */
+ cft = &h->cgroup_files[4];
+ memset(cft, 0, sizeof(*cft));
+
+ WARN_ON(cgroup_add_cftypes(&hugetlb_subsys, h->cgroup_files));
+
+ return 0;
+}
+
struct cgroup_subsys hugetlb_subsys = {
.name = "hugetlb",
.create = hugetlb_cgroup_create,
--
1.7.10
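The cftype->private encoding and the size formatting in this patch are self-contained enough to exercise in userspace; the macros and mem_fmt() below are copied from the patch (the surrounding harness is ours):

```c
#include <stdio.h>
#include <string.h>

/* hstate index in the high 16 bits, resource attribute in the low 16. */
#define MEMFILE_PRIVATE(x, val)	(((x) << 16) | (val))
#define MEMFILE_IDX(val)	(((val) >> 16) & 0xffff)
#define MEMFILE_ATTR(val)	((val) & 0xffff)

/* Format a huge page size with a GB, MB or KB suffix, as in the patch. */
static char *mem_fmt(char *buf, int size, unsigned long hsize)
{
	if (hsize >= (1UL << 30))
		snprintf(buf, size, "%luGB", hsize >> 30);
	else if (hsize >= (1UL << 20))
		snprintf(buf, size, "%luMB", hsize >> 20);
	else
		snprintf(buf, size, "%luKB", hsize >> 10);
	return buf;
}
```

Packing both the hstate index and the resource attribute into one integer lets a single read/write/trigger handler serve every per-size control file.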
From: "Aneesh Kumar K.V" <[email protected]>
We will use it later to make page_cgroup track the hugetlb cgroup information.
Signed-off-by: Aneesh Kumar K.V <[email protected]>
---
include/linux/mmzone.h | 2 +-
include/linux/page_cgroup.h | 8 ++++----
init/Kconfig | 4 ++++
mm/Makefile | 3 ++-
mm/memcontrol.c | 42 +++++++++++++++++++++++++-----------------
5 files changed, 36 insertions(+), 23 deletions(-)
diff --git a/include/linux/mmzone.h b/include/linux/mmzone.h
index 2427706..2483cc5 100644
--- a/include/linux/mmzone.h
+++ b/include/linux/mmzone.h
@@ -1052,7 +1052,7 @@ struct mem_section {
/* See declaration of similar field in struct zone */
unsigned long *pageblock_flags;
-#ifdef CONFIG_CGROUP_MEM_RES_CTLR
+#ifdef CONFIG_PAGE_CGROUP
/*
* If !SPARSEMEM, pgdat doesn't have page_cgroup pointer. We use
* section. (see memcontrol.h/page_cgroup.h about this.)
diff --git a/include/linux/page_cgroup.h b/include/linux/page_cgroup.h
index a88cdba..7bbfe37 100644
--- a/include/linux/page_cgroup.h
+++ b/include/linux/page_cgroup.h
@@ -12,7 +12,7 @@ enum {
#ifndef __GENERATING_BOUNDS_H
#include <generated/bounds.h>
-#ifdef CONFIG_CGROUP_MEM_RES_CTLR
+#ifdef CONFIG_PAGE_CGROUP
#include <linux/bit_spinlock.h>
/*
@@ -24,7 +24,7 @@ enum {
*/
struct page_cgroup {
unsigned long flags;
- struct mem_cgroup *mem_cgroup;
+ struct cgroup *cgroup;
};
void __meminit pgdat_page_cgroup_init(struct pglist_data *pgdat);
@@ -82,7 +82,7 @@ static inline void unlock_page_cgroup(struct page_cgroup *pc)
bit_spin_unlock(PCG_LOCK, &pc->flags);
}
-#else /* CONFIG_CGROUP_MEM_RES_CTLR */
+#else /* CONFIG_PAGE_CGROUP */
struct page_cgroup;
static inline void __meminit pgdat_page_cgroup_init(struct pglist_data *pgdat)
@@ -102,7 +102,7 @@ static inline void __init page_cgroup_init_flatmem(void)
{
}
-#endif /* CONFIG_CGROUP_MEM_RES_CTLR */
+#endif /* CONFIG_PAGE_CGROUP */
#include <linux/swap.h>
diff --git a/init/Kconfig b/init/Kconfig
index 81816b8..1363203 100644
--- a/init/Kconfig
+++ b/init/Kconfig
@@ -687,10 +687,14 @@ config RESOURCE_COUNTERS
This option enables controller independent resource accounting
infrastructure that works with cgroups.
+config PAGE_CGROUP
+ bool
+
config CGROUP_MEM_RES_CTLR
bool "Memory Resource Controller for Control Groups"
depends on RESOURCE_COUNTERS
select MM_OWNER
+ select PAGE_CGROUP
help
Provides a memory resource controller that manages both anonymous
memory and page cache. (See Documentation/cgroups/memory.txt)
diff --git a/mm/Makefile b/mm/Makefile
index a156285..a70f9a9 100644
--- a/mm/Makefile
+++ b/mm/Makefile
@@ -47,7 +47,8 @@ obj-$(CONFIG_FS_XIP) += filemap_xip.o
obj-$(CONFIG_MIGRATION) += migrate.o
obj-$(CONFIG_QUICKLIST) += quicklist.o
obj-$(CONFIG_TRANSPARENT_HUGEPAGE) += huge_memory.o
-obj-$(CONFIG_CGROUP_MEM_RES_CTLR) += memcontrol.o page_cgroup.o
+obj-$(CONFIG_CGROUP_MEM_RES_CTLR) += memcontrol.o
+obj-$(CONFIG_PAGE_CGROUP) += page_cgroup.o
obj-$(CONFIG_MEMORY_FAILURE) += memory-failure.o
obj-$(CONFIG_HWPOISON_INJECT) += hwpoison-inject.o
obj-$(CONFIG_DEBUG_KMEMLEAK) += kmemleak.o
diff --git a/mm/memcontrol.c b/mm/memcontrol.c
index ac35bcc..6df019b 100644
--- a/mm/memcontrol.c
+++ b/mm/memcontrol.c
@@ -864,6 +864,8 @@ static void memcg_check_events(struct mem_cgroup *memcg, struct page *page)
struct mem_cgroup *mem_cgroup_from_cont(struct cgroup *cont)
{
+ if (!cont)
+ return NULL;
return container_of(cgroup_subsys_state(cont,
mem_cgroup_subsys_id), struct mem_cgroup,
css);
@@ -1097,7 +1099,7 @@ struct lruvec *mem_cgroup_page_lruvec(struct page *page, struct zone *zone)
return &zone->lruvec;
pc = lookup_page_cgroup(page);
- memcg = pc->mem_cgroup;
+ memcg = mem_cgroup_from_cont(pc->cgroup);
/*
* Surreptitiously switch any uncharged offlist page to root:
@@ -1108,8 +1110,10 @@ struct lruvec *mem_cgroup_page_lruvec(struct page *page, struct zone *zone)
* under page_cgroup lock: between them, they make all uses
* of pc->mem_cgroup safe.
*/
- if (!PageLRU(page) && !PageCgroupUsed(pc) && memcg != root_mem_cgroup)
- pc->mem_cgroup = memcg = root_mem_cgroup;
+ if (!PageLRU(page) && !PageCgroupUsed(pc) && memcg != root_mem_cgroup) {
+ memcg = root_mem_cgroup;
+ pc->cgroup = memcg->css.cgroup;
+ }
mz = page_cgroup_zoneinfo(memcg, page);
return &mz->lruvec;
@@ -1889,12 +1893,14 @@ static bool mem_cgroup_handle_oom(struct mem_cgroup *memcg, gfp_t mask,
void __mem_cgroup_begin_update_page_stat(struct page *page,
bool *locked, unsigned long *flags)
{
+ struct cgroup *cgroup;
struct mem_cgroup *memcg;
struct page_cgroup *pc;
pc = lookup_page_cgroup(page);
again:
- memcg = pc->mem_cgroup;
+ cgroup = pc->cgroup;
+ memcg = mem_cgroup_from_cont(cgroup);
if (unlikely(!memcg || !PageCgroupUsed(pc)))
return;
/*
@@ -1907,7 +1913,7 @@ again:
return;
move_lock_mem_cgroup(memcg, flags);
- if (memcg != pc->mem_cgroup || !PageCgroupUsed(pc)) {
+ if (cgroup != pc->cgroup || !PageCgroupUsed(pc)) {
move_unlock_mem_cgroup(memcg, flags);
goto again;
}
@@ -1923,7 +1929,7 @@ void __mem_cgroup_end_update_page_stat(struct page *page, unsigned long *flags)
* lock is held because a routine modifies pc->mem_cgroup
* should take move_lock_page_cgroup().
*/
- move_unlock_mem_cgroup(pc->mem_cgroup, flags);
+ move_unlock_mem_cgroup(mem_cgroup_from_cont(pc->cgroup), flags);
}
void mem_cgroup_update_page_stat(struct page *page,
@@ -1936,7 +1942,7 @@ void mem_cgroup_update_page_stat(struct page *page,
if (mem_cgroup_disabled())
return;
- memcg = pc->mem_cgroup;
+ memcg = mem_cgroup_from_cont(pc->cgroup);
if (unlikely(!memcg || !PageCgroupUsed(pc)))
return;
@@ -2444,7 +2450,7 @@ struct mem_cgroup *try_get_mem_cgroup_from_page(struct page *page)
pc = lookup_page_cgroup(page);
lock_page_cgroup(pc);
if (PageCgroupUsed(pc)) {
- memcg = pc->mem_cgroup;
+ memcg = mem_cgroup_from_cont(pc->cgroup);
if (memcg && !css_tryget(&memcg->css))
memcg = NULL;
} else if (PageSwapCache(page)) {
@@ -2491,14 +2497,15 @@ static void __mem_cgroup_commit_charge(struct mem_cgroup *memcg,
zone = page_zone(page);
spin_lock_irq(&zone->lru_lock);
if (PageLRU(page)) {
- lruvec = mem_cgroup_zone_lruvec(zone, pc->mem_cgroup);
+ lruvec = mem_cgroup_zone_lruvec(zone,
+ mem_cgroup_from_cont(pc->cgroup));
ClearPageLRU(page);
del_page_from_lru_list(page, lruvec, page_lru(page));
was_on_lru = true;
}
}
- pc->mem_cgroup = memcg;
+ pc->cgroup = memcg->css.cgroup;
/*
* We access a page_cgroup asynchronously without lock_page_cgroup().
* Especially when a page_cgroup is taken from a page, pc->mem_cgroup
@@ -2511,7 +2518,8 @@ static void __mem_cgroup_commit_charge(struct mem_cgroup *memcg,
if (lrucare) {
if (was_on_lru) {
- lruvec = mem_cgroup_zone_lruvec(zone, pc->mem_cgroup);
+ lruvec = mem_cgroup_zone_lruvec(zone,
+ mem_cgroup_from_cont(pc->cgroup));
VM_BUG_ON(PageLRU(page));
SetPageLRU(page);
add_page_to_lru_list(page, lruvec, page_lru(page));
@@ -2601,7 +2609,7 @@ static int mem_cgroup_move_account(struct page *page,
lock_page_cgroup(pc);
ret = -EINVAL;
- if (!PageCgroupUsed(pc) || pc->mem_cgroup != from)
+ if (!PageCgroupUsed(pc) || pc->cgroup != from->css.cgroup)
goto unlock;
move_lock_mem_cgroup(from, &flags);
@@ -2616,7 +2624,7 @@ static int mem_cgroup_move_account(struct page *page,
mem_cgroup_charge_statistics(from, anon, -nr_pages);
/* caller should have done css_get */
- pc->mem_cgroup = to;
+ pc->cgroup = to->css.cgroup;
mem_cgroup_charge_statistics(to, anon, nr_pages);
/*
* We charges against "to" which may not have any tasks. Then, "to"
@@ -2937,7 +2945,7 @@ __mem_cgroup_uncharge_common(struct page *page, enum charge_type ctype)
lock_page_cgroup(pc);
- memcg = pc->mem_cgroup;
+ memcg = mem_cgroup_from_cont(pc->cgroup);
if (!PageCgroupUsed(pc))
goto unlock_out;
@@ -3183,7 +3191,7 @@ int mem_cgroup_prepare_migration(struct page *page,
pc = lookup_page_cgroup(page);
lock_page_cgroup(pc);
if (PageCgroupUsed(pc)) {
- memcg = pc->mem_cgroup;
+ memcg = mem_cgroup_from_cont(pc->cgroup);
css_get(&memcg->css);
/*
* At migrating an anonymous page, its mapcount goes down
@@ -3328,7 +3336,7 @@ void mem_cgroup_replace_page_cache(struct page *oldpage,
/* fix accounting on old pages */
lock_page_cgroup(pc);
if (PageCgroupUsed(pc)) {
- memcg = pc->mem_cgroup;
+ memcg = mem_cgroup_from_cont(pc->cgroup);
mem_cgroup_charge_statistics(memcg, false, -1);
ClearPageCgroupUsed(pc);
}
@@ -5135,7 +5143,7 @@ static enum mc_target_type get_mctgt_type(struct vm_area_struct *vma,
* mem_cgroup_move_account() checks the pc is valid or not under
* the lock.
*/
- if (PageCgroupUsed(pc) && pc->mem_cgroup == mc.from) {
+ if (PageCgroupUsed(pc) && pc->cgroup == mc.from->css.cgroup) {
ret = MC_TARGET_PAGE;
if (target)
target->page = page;
--
1.7.10
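For readers unfamiliar with the indirection this patch introduces: page_cgroup now stores a generic struct cgroup pointer, and each consumer recovers its own controller state via mem_cgroup_from_cont(), which is built on container_of(). Below is a minimal userspace sketch of that recovery pattern; the struct layouts are simplified stand-ins, not the kernel definitions.

```c
#include <assert.h>
#include <stddef.h>

/* Same trick the kernel's container_of() uses: subtract the member's
 * offset from the member's address to get the enclosing object. */
#define container_of(ptr, type, member) \
	((type *)((char *)(ptr) - offsetof(type, member)))

struct cgroup_subsys_state { int refcnt; };

/* stand-in for struct mem_cgroup with its embedded css */
struct mem_cgroup {
	long usage;
	struct cgroup_subsys_state css;
};

/* Recover the owning mem_cgroup from its embedded css; returns NULL
 * for a NULL css, mirroring the NULL check the patch adds to
 * mem_cgroup_from_cont(). */
static struct mem_cgroup *mem_cgroup_from_css(struct cgroup_subsys_state *css)
{
	if (!css)
		return NULL;
	return container_of(css, struct mem_cgroup, css);
}
```

The NULL check matters because pc->cgroup can legitimately be NULL for a page that was never charged.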
From: "Aneesh Kumar K.V" <[email protected]>
With HugeTLB pages, the hugetlb cgroup is uncharged in the compound page
destructor. Since we are holding a hugepage reference, we can be sure the old
page won't get uncharged until the last put_page(). On successful migration, we
can move the hugetlb cgroup information to the new page's page_cgroup and mark
the old page's page_cgroup unused.
Signed-off-by: Aneesh Kumar K.V <[email protected]>
Cc: KAMEZAWA Hiroyuki <[email protected]>
Cc: Hillf Danton <[email protected]>
Cc: Michal Hocko <[email protected]>
Cc: Andrea Arcangeli <[email protected]>
Cc: Johannes Weiner <[email protected]>
---
include/linux/hugetlb_cgroup.h | 8 ++++++++
mm/hugetlb_cgroup.c | 30 ++++++++++++++++++++++++++++++
mm/migrate.c | 5 +++++
3 files changed, 43 insertions(+)
diff --git a/include/linux/hugetlb_cgroup.h b/include/linux/hugetlb_cgroup.h
index fbf8c5f..387cbd6 100644
--- a/include/linux/hugetlb_cgroup.h
+++ b/include/linux/hugetlb_cgroup.h
@@ -43,6 +43,8 @@ extern void hugetlb_cgroup_uncharge_page(int idx, unsigned long nr_pages,
extern void hugetlb_cgroup_uncharge_cgroup(int idx, unsigned long nr_pages,
struct hugetlb_cgroup *h_cg);
extern int hugetlb_cgroup_file_init(int idx) __init;
+extern void hugetlb_cgroup_migrate(struct page *oldhpage,
+ struct page *newhpage);
#else
static inline bool hugetlb_cgroup_disabled(void)
{
@@ -81,5 +83,11 @@ static inline int __init hugetlb_cgroup_file_init(int idx)
{
return 0;
}
+
+static inline void hugetlb_cgroup_migrate(struct page *oldhpage,
+ struct page *newhpage)
+{
+ return;
+}
#endif /* CONFIG_MEM_RES_CTLR_HUGETLB */
#endif
diff --git a/mm/hugetlb_cgroup.c b/mm/hugetlb_cgroup.c
index 49a3f20..f99007b 100644
--- a/mm/hugetlb_cgroup.c
+++ b/mm/hugetlb_cgroup.c
@@ -401,6 +401,36 @@ int __init hugetlb_cgroup_file_init(int idx)
return 0;
}
+void hugetlb_cgroup_migrate(struct page *oldhpage, struct page *newhpage)
+{
+ struct cgroup *cg;
+ struct page_cgroup *pc;
+ struct hugetlb_cgroup *h_cg;
+
+ VM_BUG_ON(!PageHuge(oldhpage));
+
+ if (hugetlb_cgroup_disabled())
+ return;
+
+ pc = lookup_page_cgroup(oldhpage);
+ lock_page_cgroup(pc);
+ cg = pc->cgroup;
+ h_cg = hugetlb_cgroup_from_cgroup(cg);
+ pc->cgroup = root_h_cgroup->css.cgroup;
+ ClearPageCgroupUsed(pc);
+ cgroup_exclude_rmdir(&h_cg->css);
+ unlock_page_cgroup(pc);
+
+ /* move the h_cg details to new cgroup */
+ pc = lookup_page_cgroup(newhpage);
+ lock_page_cgroup(pc);
+ pc->cgroup = cg;
+ SetPageCgroupUsed(pc);
+ unlock_page_cgroup(pc);
+ cgroup_release_and_wakeup_rmdir(&h_cg->css);
+ return;
+}
+
struct cgroup_subsys hugetlb_subsys = {
.name = "hugetlb",
.create = hugetlb_cgroup_create,
diff --git a/mm/migrate.c b/mm/migrate.c
index 927254c..22f414f 100644
--- a/mm/migrate.c
+++ b/mm/migrate.c
@@ -33,6 +33,7 @@
#include <linux/memcontrol.h>
#include <linux/syscalls.h>
#include <linux/hugetlb.h>
+#include <linux/hugetlb_cgroup.h>
#include <linux/gfp.h>
#include <asm/tlbflush.h>
@@ -928,6 +929,10 @@ static int unmap_and_move_huge_page(new_page_t get_new_page,
if (anon_vma)
put_anon_vma(anon_vma);
+
+ if (!rc)
+ hugetlb_cgroup_migrate(hpage, new_hpage);
+
unlock_page(hpage);
out:
put_page(new_hpage);
--
1.7.10
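The charge move in hugetlb_cgroup_migrate() above follows a simple pattern: under the page_cgroup lock, detach the cgroup from the old page (re-pointing it at root so it stays valid) and mark it unused, then attach the same cgroup to the new page and mark it used. A userspace sketch of that hand-off, with simplified stand-in types and no locking:

```c
#include <assert.h>
#include <stdbool.h>

struct cgroup { int id; };

/* stand-in for struct page_cgroup: a cgroup pointer plus the Used flag */
struct page_cgroup {
	struct cgroup *cgroup;
	bool used;
};

/* Transfer the charge from old_pc to new_pc; old_pc falls back to the
 * root cgroup, as hugetlb_cgroup_migrate() does in the patch. */
static void move_charge(struct page_cgroup *old_pc,
			struct page_cgroup *new_pc, struct cgroup *root)
{
	struct cgroup *cg = old_pc->cgroup;

	/* old page: detach, point at root, clear Used */
	old_pc->cgroup = root;
	old_pc->used = false;

	/* new page: inherit the charge, set Used */
	new_pc->cgroup = cg;
	new_pc->used = true;
}
```

In the kernel version, cgroup_exclude_rmdir()/cgroup_release_and_wakeup_rmdir() additionally keep the source cgroup from being removed while the move is in flight.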
From: "Aneesh Kumar K.V" <[email protected]>
Export hugetlb_lock, hugetlb_max_hstate and the for_each_hstate() helper; we
will use them later in hugetlb_cgroup.c.
Signed-off-by: Aneesh Kumar K.V <[email protected]>
---
include/linux/hugetlb.h | 5 +++++
mm/hugetlb.c | 7 ++-----
2 files changed, 7 insertions(+), 5 deletions(-)
diff --git a/include/linux/hugetlb.h b/include/linux/hugetlb.h
index c4353ea..dcd55c7 100644
--- a/include/linux/hugetlb.h
+++ b/include/linux/hugetlb.h
@@ -21,6 +21,11 @@ struct hugepage_subpool {
long max_hpages, used_hpages;
};
+extern spinlock_t hugetlb_lock;
+extern int hugetlb_max_hstate;
+#define for_each_hstate(h) \
+ for ((h) = hstates; (h) < &hstates[hugetlb_max_hstate]; (h)++)
+
struct hugepage_subpool *hugepage_new_subpool(long nr_blocks);
void hugepage_put_subpool(struct hugepage_subpool *spool);
diff --git a/mm/hugetlb.c b/mm/hugetlb.c
index 0f38728..53840dd 100644
--- a/mm/hugetlb.c
+++ b/mm/hugetlb.c
@@ -35,7 +35,7 @@ const unsigned long hugetlb_zero = 0, hugetlb_infinity = ~0UL;
static gfp_t htlb_alloc_mask = GFP_HIGHUSER;
unsigned long hugepages_treat_as_movable;
-static int hugetlb_max_hstate;
+int hugetlb_max_hstate;
unsigned int default_hstate_idx;
struct hstate hstates[HUGE_MAX_HSTATE];
@@ -46,13 +46,10 @@ static struct hstate * __initdata parsed_hstate;
static unsigned long __initdata default_hstate_max_huge_pages;
static unsigned long __initdata default_hstate_size;
-#define for_each_hstate(h) \
- for ((h) = hstates; (h) < &hstates[hugetlb_max_hstate]; (h)++)
-
/*
* Protects updates to hugepage_freelists, nr_huge_pages, and free_huge_pages
*/
-static DEFINE_SPINLOCK(hugetlb_lock);
+DEFINE_SPINLOCK(hugetlb_lock);
static inline void unlock_or_release_subpool(struct hugepage_subpool *spool)
{
--
1.7.10
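The newly exported for_each_hstate() just walks the global hstates[] array up to hugetlb_max_hstate. A self-contained sketch of how a caller (such as hugetlb_cgroup.c) would iterate it; the add_hstate_sketch() helper is a simplified stand-in for hugetlb_add_hstate():

```c
#include <assert.h>

struct hstate { unsigned int order; };

#define HUGE_MAX_HSTATE 4
static struct hstate hstates[HUGE_MAX_HSTATE];
static int hugetlb_max_hstate;

/* iterate only the hstates registered so far */
#define for_each_hstate(h) \
	for ((h) = hstates; (h) < &hstates[hugetlb_max_hstate]; (h)++)

/* simplified stand-in for hugetlb_add_hstate() registration */
static struct hstate *add_hstate_sketch(unsigned int order)
{
	struct hstate *h = &hstates[hugetlb_max_hstate++];

	h->order = order;
	return h;
}

/* example consumer: visit every registered hstate */
static unsigned int sum_orders(void)
{
	struct hstate *h;
	unsigned int sum = 0;

	for_each_hstate(h)
		sum += h->order;
	return sum;
}
```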
From: "Aneesh Kumar K.V" <[email protected]>
Use an mmu_gather instead of a temporary linked list for accumulating
pages when we unmap a hugepage range. This also allows us to get rid of
the i_mmap_mutex in unmap_hugepage_range in the following patch.
Signed-off-by: Aneesh Kumar K.V <[email protected]>
Reviewed-by: KAMEZAWA Hiroyuki <[email protected]>
Cc: Hillf Danton <[email protected]>
Cc: Michal Hocko <[email protected]>
Cc: Andrea Arcangeli <[email protected]>
Cc: Johannes Weiner <[email protected]>
---
fs/hugetlbfs/inode.c | 4 ++--
include/linux/hugetlb.h | 22 ++++++++++++++----
mm/hugetlb.c | 59 ++++++++++++++++++++++++++++-------------------
mm/memory.c | 7 ++++--
4 files changed, 59 insertions(+), 33 deletions(-)
diff --git a/fs/hugetlbfs/inode.c b/fs/hugetlbfs/inode.c
index cc9281b..ff233e4 100644
--- a/fs/hugetlbfs/inode.c
+++ b/fs/hugetlbfs/inode.c
@@ -416,8 +416,8 @@ hugetlb_vmtruncate_list(struct prio_tree_root *root, pgoff_t pgoff)
else
v_offset = 0;
- __unmap_hugepage_range(vma,
- vma->vm_start + v_offset, vma->vm_end, NULL);
+ unmap_hugepage_range(vma, vma->vm_start + v_offset,
+ vma->vm_end, NULL);
}
}
diff --git a/include/linux/hugetlb.h b/include/linux/hugetlb.h
index 217f528..c21e136 100644
--- a/include/linux/hugetlb.h
+++ b/include/linux/hugetlb.h
@@ -7,6 +7,7 @@
struct ctl_table;
struct user_struct;
+struct mmu_gather;
#ifdef CONFIG_HUGETLB_PAGE
@@ -40,9 +41,10 @@ int follow_hugetlb_page(struct mm_struct *, struct vm_area_struct *,
struct page **, struct vm_area_struct **,
unsigned long *, int *, int, unsigned int flags);
void unmap_hugepage_range(struct vm_area_struct *,
- unsigned long, unsigned long, struct page *);
-void __unmap_hugepage_range(struct vm_area_struct *,
- unsigned long, unsigned long, struct page *);
+ unsigned long, unsigned long, struct page *);
+void __unmap_hugepage_range(struct mmu_gather *tlb, struct vm_area_struct *vms,
+ unsigned long start, unsigned long end,
+ struct page *ref_page);
int hugetlb_prefault(struct address_space *, struct vm_area_struct *);
void hugetlb_report_meminfo(struct seq_file *);
int hugetlb_report_node_meminfo(int, char *);
@@ -98,7 +100,6 @@ static inline unsigned long hugetlb_total_pages(void)
#define follow_huge_addr(mm, addr, write) ERR_PTR(-EINVAL)
#define copy_hugetlb_page_range(src, dst, vma) ({ BUG(); 0; })
#define hugetlb_prefault(mapping, vma) ({ BUG(); 0; })
-#define unmap_hugepage_range(vma, start, end, page) BUG()
static inline void hugetlb_report_meminfo(struct seq_file *m)
{
}
@@ -112,13 +113,24 @@ static inline void hugetlb_report_meminfo(struct seq_file *m)
#define hugetlb_free_pgd_range(tlb, addr, end, floor, ceiling) ({BUG(); 0; })
#define hugetlb_fault(mm, vma, addr, flags) ({ BUG(); 0; })
#define huge_pte_offset(mm, address) 0
-#define dequeue_hwpoisoned_huge_page(page) 0
+static inline int dequeue_hwpoisoned_huge_page(struct page *page)
+{
+ return 0;
+}
+
static inline void copy_huge_page(struct page *dst, struct page *src)
{
}
#define hugetlb_change_protection(vma, address, end, newprot)
+static inline void __unmap_hugepage_range(struct mmu_gather *tlb,
+ struct vm_area_struct *vma, unsigned long start,
+ unsigned long end, struct page *ref_page)
+{
+ BUG();
+}
+
#endif /* !CONFIG_HUGETLB_PAGE */
#define HUGETLB_ANON_FILE "anon_hugepage"
diff --git a/mm/hugetlb.c b/mm/hugetlb.c
index 9b97a5c..704a269 100644
--- a/mm/hugetlb.c
+++ b/mm/hugetlb.c
@@ -24,8 +24,9 @@
#include <asm/page.h>
#include <asm/pgtable.h>
-#include <linux/io.h>
+#include <asm/tlb.h>
+#include <linux/io.h>
#include <linux/hugetlb.h>
#include <linux/node.h>
#include "internal.h"
@@ -2310,30 +2311,26 @@ static int is_hugetlb_entry_hwpoisoned(pte_t pte)
return 0;
}
-void __unmap_hugepage_range(struct vm_area_struct *vma, unsigned long start,
- unsigned long end, struct page *ref_page)
+void __unmap_hugepage_range(struct mmu_gather *tlb, struct vm_area_struct *vma,
+ unsigned long start, unsigned long end,
+ struct page *ref_page)
{
+ int force_flush = 0;
struct mm_struct *mm = vma->vm_mm;
unsigned long address;
pte_t *ptep;
pte_t pte;
struct page *page;
- struct page *tmp;
struct hstate *h = hstate_vma(vma);
unsigned long sz = huge_page_size(h);
- /*
- * A page gathering list, protected by per file i_mmap_mutex. The
- * lock is used to avoid list corruption from multiple unmapping
- * of the same page since we are using page->lru.
- */
- LIST_HEAD(page_list);
-
WARN_ON(!is_vm_hugetlb_page(vma));
BUG_ON(start & ~huge_page_mask(h));
BUG_ON(end & ~huge_page_mask(h));
+ tlb_start_vma(tlb, vma);
mmu_notifier_invalidate_range_start(mm, start, end);
+again:
spin_lock(&mm->page_table_lock);
for (address = start; address < end; address += sz) {
ptep = huge_pte_offset(mm, address);
@@ -2372,30 +2369,45 @@ void __unmap_hugepage_range(struct vm_area_struct *vma, unsigned long start,
}
pte = huge_ptep_get_and_clear(mm, address, ptep);
+ tlb_remove_tlb_entry(tlb, ptep, address);
if (pte_dirty(pte))
set_page_dirty(page);
- list_add(&page->lru, &page_list);
+ page_remove_rmap(page);
+ force_flush = !__tlb_remove_page(tlb, page);
+ if (force_flush)
+ break;
/* Bail out after unmapping reference page if supplied */
if (ref_page)
break;
}
- flush_tlb_range(vma, start, end);
spin_unlock(&mm->page_table_lock);
- mmu_notifier_invalidate_range_end(mm, start, end);
- list_for_each_entry_safe(page, tmp, &page_list, lru) {
- page_remove_rmap(page);
- list_del(&page->lru);
- put_page(page);
+ /*
+ * mmu_gather ran out of room to batch pages, we break out of
+ * the PTE lock to avoid doing the potential expensive TLB invalidate
+ * and page-free while holding it.
+ */
+ if (force_flush) {
+ force_flush = 0;
+ tlb_flush_mmu(tlb);
+ if (address < end && !ref_page)
+ goto again;
}
+ mmu_notifier_invalidate_range_end(mm, start, end);
+ tlb_end_vma(tlb, vma);
}
void unmap_hugepage_range(struct vm_area_struct *vma, unsigned long start,
unsigned long end, struct page *ref_page)
{
- mutex_lock(&vma->vm_file->f_mapping->i_mmap_mutex);
- __unmap_hugepage_range(vma, start, end, ref_page);
- mutex_unlock(&vma->vm_file->f_mapping->i_mmap_mutex);
+ struct mm_struct *mm;
+ struct mmu_gather tlb;
+
+ mm = vma->vm_mm;
+
+ tlb_gather_mmu(&tlb, mm, 0);
+ __unmap_hugepage_range(&tlb, vma, start, end, ref_page);
+ tlb_finish_mmu(&tlb, start, end);
}
/*
@@ -2440,9 +2452,8 @@ static int unmap_ref_private(struct mm_struct *mm, struct vm_area_struct *vma,
* from the time of fork. This would look like data corruption
*/
if (!is_vma_resv_set(iter_vma, HPAGE_RESV_OWNER))
- __unmap_hugepage_range(iter_vma,
- address, address + huge_page_size(h),
- page);
+ unmap_hugepage_range(iter_vma, address,
+ address + huge_page_size(h), page);
}
mutex_unlock(&mapping->i_mmap_mutex);
diff --git a/mm/memory.c b/mm/memory.c
index 1b7dc66..545e18a 100644
--- a/mm/memory.c
+++ b/mm/memory.c
@@ -1326,8 +1326,11 @@ static void unmap_single_vma(struct mmu_gather *tlb,
* Since no pte has actually been setup, it is
* safe to do nothing in this case.
*/
- if (vma->vm_file)
- unmap_hugepage_range(vma, start, end, NULL);
+ if (vma->vm_file) {
+ mutex_lock(&vma->vm_file->f_mapping->i_mmap_mutex);
+ __unmap_hugepage_range(tlb, vma, start, end, NULL);
+ mutex_unlock(&vma->vm_file->f_mapping->i_mmap_mutex);
+ }
} else
unmap_page_range(tlb, vma, start, end, details);
}
--
1.7.10
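The mmu_gather contract this patch adopts is worth spelling out: __tlb_remove_page() queues a page and reports when the batch is full; the caller then drops the page-table lock, flushes (TLB invalidate plus page frees), and resumes from where it left off. A minimal userspace sketch of that batch-and-flush loop, with stand-in types rather than the kernel mmu_gather:

```c
#include <assert.h>
#include <stdbool.h>

#define BATCH_MAX 8

struct mmu_gather_sketch {
	int nr;			/* pages queued in the current batch */
	int flushed;		/* total pages released by flushes */
};

/* queue one page; returns true while there is still room in the batch
 * (mirrors __tlb_remove_page()'s return convention) */
static bool tlb_remove_page_sketch(struct mmu_gather_sketch *tlb)
{
	tlb->nr++;
	return tlb->nr < BATCH_MAX;
}

/* flush the batch: in the kernel this invalidates the TLB and frees
 * the queued pages */
static void tlb_flush_mmu_sketch(struct mmu_gather_sketch *tlb)
{
	tlb->flushed += tlb->nr;
	tlb->nr = 0;
}

/* unmap `count` pages, flushing whenever the batch fills up */
static int unmap_range_sketch(struct mmu_gather_sketch *tlb, int count)
{
	int i, force_flush = 0;

	for (i = 0; i < count; i++) {
		force_flush = !tlb_remove_page_sketch(tlb);
		if (force_flush) {
			/* the kernel drops page_table_lock here before
			 * doing the expensive flush, then relocks */
			tlb_flush_mmu_sketch(tlb);
			force_flush = 0;
		}
	}
	tlb_flush_mmu_sketch(tlb);	/* final partial batch */
	return tlb->flushed;
}
```

This is why the patch can drop the temporary page_list and its use of page->lru: the gather structure, not the page, carries the batching state.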
From: "Aneesh Kumar K.V" <[email protected]>
Add an inline hstate_index() helper and use it in place of the open-coded
'h - hstates' pointer arithmetic.
Signed-off-by: Aneesh Kumar K.V <[email protected]>
Acked-by: Michal Hocko <[email protected]>
Reviewed-by: KAMEZAWA Hiroyuki <[email protected]>
Cc: Hillf Danton <[email protected]>
Cc: Andrea Arcangeli <[email protected]>
Cc: Johannes Weiner <[email protected]>
---
include/linux/hugetlb.h | 6 ++++++
mm/hugetlb.c | 20 +++++++++++---------
2 files changed, 17 insertions(+), 9 deletions(-)
diff --git a/include/linux/hugetlb.h b/include/linux/hugetlb.h
index d5d6bbe..217f528 100644
--- a/include/linux/hugetlb.h
+++ b/include/linux/hugetlb.h
@@ -302,6 +302,11 @@ static inline unsigned hstate_index_to_shift(unsigned index)
return hstates[index].order + PAGE_SHIFT;
}
+static inline int hstate_index(struct hstate *h)
+{
+ return h - hstates;
+}
+
#else
struct hstate {};
#define alloc_huge_page_node(h, nid) NULL
@@ -320,6 +325,7 @@ static inline unsigned int pages_per_huge_page(struct hstate *h)
return 1;
}
#define hstate_index_to_shift(index) 0
+#define hstate_index(h) 0
#endif
#endif /* _LINUX_HUGETLB_H */
diff --git a/mm/hugetlb.c b/mm/hugetlb.c
index 8ded02d..9b97a5c 100644
--- a/mm/hugetlb.c
+++ b/mm/hugetlb.c
@@ -1646,7 +1646,7 @@ static int hugetlb_sysfs_add_hstate(struct hstate *h, struct kobject *parent,
struct attribute_group *hstate_attr_group)
{
int retval;
- int hi = h - hstates;
+ int hi = hstate_index(h);
hstate_kobjs[hi] = kobject_create_and_add(h->name, parent);
if (!hstate_kobjs[hi])
@@ -1741,11 +1741,13 @@ void hugetlb_unregister_node(struct node *node)
if (!nhs->hugepages_kobj)
return; /* no hstate attributes */
- for_each_hstate(h)
- if (nhs->hstate_kobjs[h - hstates]) {
- kobject_put(nhs->hstate_kobjs[h - hstates]);
- nhs->hstate_kobjs[h - hstates] = NULL;
+ for_each_hstate(h) {
+ int idx = hstate_index(h);
+ if (nhs->hstate_kobjs[idx]) {
+ kobject_put(nhs->hstate_kobjs[idx]);
+ nhs->hstate_kobjs[idx] = NULL;
}
+ }
kobject_put(nhs->hugepages_kobj);
nhs->hugepages_kobj = NULL;
@@ -1848,7 +1850,7 @@ static void __exit hugetlb_exit(void)
hugetlb_unregister_all_nodes();
for_each_hstate(h) {
- kobject_put(hstate_kobjs[h - hstates]);
+ kobject_put(hstate_kobjs[hstate_index(h)]);
}
kobject_put(hugepages_kobj);
@@ -1869,7 +1871,7 @@ static int __init hugetlb_init(void)
if (!size_to_hstate(default_hstate_size))
hugetlb_add_hstate(HUGETLB_PAGE_ORDER);
}
- default_hstate_idx = size_to_hstate(default_hstate_size) - hstates;
+ default_hstate_idx = hstate_index(size_to_hstate(default_hstate_size));
if (default_hstate_max_huge_pages)
default_hstate.max_huge_pages = default_hstate_max_huge_pages;
@@ -2687,7 +2689,7 @@ retry:
*/
if (unlikely(PageHWPoison(page))) {
ret = VM_FAULT_HWPOISON |
- VM_FAULT_SET_HINDEX(h - hstates);
+ VM_FAULT_SET_HINDEX(hstate_index(h));
goto backout_unlocked;
}
}
@@ -2760,7 +2762,7 @@ int hugetlb_fault(struct mm_struct *mm, struct vm_area_struct *vma,
return 0;
} else if (unlikely(is_hugetlb_entry_hwpoisoned(entry)))
return VM_FAULT_HWPOISON_LARGE |
- VM_FAULT_SET_HINDEX(h - hstates);
+ VM_FAULT_SET_HINDEX(hstate_index(h));
}
ptep = huge_pte_alloc(mm, address, huge_page_size(h));
--
1.7.10
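The helper works because C pointer subtraction between elements of the same array yields an element count, not a byte count. A tiny sketch of the arithmetic, with a simplified hstate:

```c
#include <assert.h>

struct hstate { unsigned int order; };

#define HUGE_MAX_HSTATE 3
static struct hstate hstates[HUGE_MAX_HSTATE];

/* index of h within the global hstates[] array; the subtraction is
 * automatically scaled by sizeof(struct hstate) */
static int hstate_index(struct hstate *h)
{
	return h - hstates;
}
```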
From: "Aneesh Kumar K.V" <[email protected]>
hugepage_activelist will be used to track currently in-use HugeTLB pages.
We need to find the in-use HugeTLB pages to support cgroup removal; on
cgroup removal we update each such page's cgroup to point to the parent
cgroup.
Signed-off-by: Aneesh Kumar K.V <[email protected]>
Reviewed-by: KAMEZAWA Hiroyuki <[email protected]>
Cc: Hillf Danton <[email protected]>
Cc: Michal Hocko <[email protected]>
Cc: Andrea Arcangeli <[email protected]>
Cc: Johannes Weiner <[email protected]>
---
include/linux/hugetlb.h | 1 +
mm/hugetlb.c | 12 +++++++-----
2 files changed, 8 insertions(+), 5 deletions(-)
diff --git a/include/linux/hugetlb.h b/include/linux/hugetlb.h
index c21e136..c4353ea 100644
--- a/include/linux/hugetlb.h
+++ b/include/linux/hugetlb.h
@@ -211,6 +211,7 @@ struct hstate {
unsigned long resv_huge_pages;
unsigned long surplus_huge_pages;
unsigned long nr_overcommit_huge_pages;
+ struct list_head hugepage_activelist;
struct list_head hugepage_freelists[MAX_NUMNODES];
unsigned int nr_huge_pages_node[MAX_NUMNODES];
unsigned int free_huge_pages_node[MAX_NUMNODES];
diff --git a/mm/hugetlb.c b/mm/hugetlb.c
index 704a269..0f38728 100644
--- a/mm/hugetlb.c
+++ b/mm/hugetlb.c
@@ -510,7 +510,7 @@ void copy_huge_page(struct page *dst, struct page *src)
static void enqueue_huge_page(struct hstate *h, struct page *page)
{
int nid = page_to_nid(page);
- list_add(&page->lru, &h->hugepage_freelists[nid]);
+ list_move(&page->lru, &h->hugepage_freelists[nid]);
h->free_huge_pages++;
h->free_huge_pages_node[nid]++;
}
@@ -522,7 +522,7 @@ static struct page *dequeue_huge_page_node(struct hstate *h, int nid)
if (list_empty(&h->hugepage_freelists[nid]))
return NULL;
page = list_entry(h->hugepage_freelists[nid].next, struct page, lru);
- list_del(&page->lru);
+ list_move(&page->lru, &h->hugepage_activelist);
set_page_refcounted(page);
h->free_huge_pages--;
h->free_huge_pages_node[nid]--;
@@ -626,10 +626,11 @@ static void free_huge_page(struct page *page)
page->mapping = NULL;
BUG_ON(page_count(page));
BUG_ON(page_mapcount(page));
- INIT_LIST_HEAD(&page->lru);
spin_lock(&hugetlb_lock);
if (h->surplus_huge_pages_node[nid] && huge_page_order(h) < MAX_ORDER) {
+ /* remove the page from active list */
+ list_del(&page->lru);
update_and_free_page(h, page);
h->surplus_huge_pages--;
h->surplus_huge_pages_node[nid]--;
@@ -642,6 +643,7 @@ static void free_huge_page(struct page *page)
static void prep_new_huge_page(struct hstate *h, struct page *page, int nid)
{
+ INIT_LIST_HEAD(&page->lru);
set_compound_page_dtor(page, free_huge_page);
spin_lock(&hugetlb_lock);
h->nr_huge_pages++;
@@ -890,6 +892,7 @@ static struct page *alloc_buddy_huge_page(struct hstate *h, int nid)
spin_lock(&hugetlb_lock);
if (page) {
+ INIT_LIST_HEAD(&page->lru);
r_nid = page_to_nid(page);
set_compound_page_dtor(page, free_huge_page);
/*
@@ -994,7 +997,6 @@ retry:
list_for_each_entry_safe(page, tmp, &surplus_list, lru) {
if ((--needed) < 0)
break;
- list_del(&page->lru);
/*
* This page is now managed by the hugetlb allocator and has
* no users -- drop the buddy allocator's reference.
@@ -1009,7 +1011,6 @@ free:
/* Free unnecessary surplus pages to the buddy allocator */
if (!list_empty(&surplus_list)) {
list_for_each_entry_safe(page, tmp, &surplus_list, lru) {
- list_del(&page->lru);
put_page(page);
}
}
@@ -1909,6 +1910,7 @@ void __init hugetlb_add_hstate(unsigned order)
h->free_huge_pages = 0;
for (i = 0; i < MAX_NUMNODES; ++i)
INIT_LIST_HEAD(&h->hugepage_freelists[i]);
+ INIT_LIST_HEAD(&h->hugepage_activelist);
h->next_nid_to_alloc = first_node(node_states[N_HIGH_MEMORY]);
h->next_nid_to_free = first_node(node_states[N_HIGH_MEMORY]);
snprintf(h->name, HSTATE_NAME_LEN, "hugepages-%lukB",
--
1.7.10
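The switch from list_add()/list_del() to list_move() in this patch relies on an invariant the activelist introduces: a hugepage now always sits on exactly one list (free list or active list), so moving it is a single splice with no separate init/delete. A minimal userspace sketch of the kernel-style intrusive list operations involved; this is a simplified reimplementation, not the kernel's list.h:

```c
#include <assert.h>

struct list_head { struct list_head *next, *prev; };

static void INIT_LIST_HEAD(struct list_head *l)
{
	l->next = l->prev = l;
}

static void list_add(struct list_head *entry, struct list_head *head)
{
	entry->next = head->next;
	entry->prev = head;
	head->next->prev = entry;
	head->next = entry;
}

static void list_del(struct list_head *entry)
{
	entry->prev->next = entry->next;
	entry->next->prev = entry->prev;
}

/* list_move = unlink from the current list, relink onto another; valid
 * only because the entry is guaranteed to be on some list already */
static void list_move(struct list_head *entry, struct list_head *head)
{
	list_del(entry);
	list_add(entry, head);
}

static int list_empty(const struct list_head *head)
{
	return head->next == head;
}
```

This is also why free_huge_page() no longer needs INIT_LIST_HEAD(&page->lru): the page arrives still linked on the activelist and is either list_del()'d for freeing or list_move()'d back to a free list.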
From: "Aneesh Kumar K.V" <[email protected]>
Since we migrate only one hugepage at a time, don't use a linked list to
pass the page around. Instead, pass the page that needs to be migrated
directly as an argument. This also removes the use of page->lru in the
migrate path.
Signed-off-by: Aneesh Kumar K.V <[email protected]>
Cc: KAMEZAWA Hiroyuki <[email protected]>
Cc: Hillf Danton <[email protected]>
Cc: Michal Hocko <[email protected]>
Cc: Andrea Arcangeli <[email protected]>
Cc: Johannes Weiner <[email protected]>
---
include/linux/migrate.h | 4 +--
mm/memory-failure.c | 13 ++--------
mm/migrate.c | 65 +++++++++++++++--------------------------------
3 files changed, 25 insertions(+), 57 deletions(-)
diff --git a/include/linux/migrate.h b/include/linux/migrate.h
index 855c337..ce7e667 100644
--- a/include/linux/migrate.h
+++ b/include/linux/migrate.h
@@ -15,7 +15,7 @@ extern int migrate_page(struct address_space *,
extern int migrate_pages(struct list_head *l, new_page_t x,
unsigned long private, bool offlining,
enum migrate_mode mode);
-extern int migrate_huge_pages(struct list_head *l, new_page_t x,
+extern int migrate_huge_page(struct page *, new_page_t x,
unsigned long private, bool offlining,
enum migrate_mode mode);
@@ -36,7 +36,7 @@ static inline void putback_lru_pages(struct list_head *l) {}
static inline int migrate_pages(struct list_head *l, new_page_t x,
unsigned long private, bool offlining,
enum migrate_mode mode) { return -ENOSYS; }
-static inline int migrate_huge_pages(struct list_head *l, new_page_t x,
+static inline int migrate_huge_page(struct page *page, new_page_t x,
unsigned long private, bool offlining,
enum migrate_mode mode) { return -ENOSYS; }
diff --git a/mm/memory-failure.c b/mm/memory-failure.c
index ab1e714..53a1495 100644
--- a/mm/memory-failure.c
+++ b/mm/memory-failure.c
@@ -1414,7 +1414,6 @@ static int soft_offline_huge_page(struct page *page, int flags)
int ret;
unsigned long pfn = page_to_pfn(page);
struct page *hpage = compound_head(page);
- LIST_HEAD(pagelist);
ret = get_any_page(page, pfn, flags);
if (ret < 0)
@@ -1429,19 +1428,11 @@ static int soft_offline_huge_page(struct page *page, int flags)
}
/* Keep page count to indicate a given hugepage is isolated. */
-
- list_add(&hpage->lru, &pagelist);
- ret = migrate_huge_pages(&pagelist, new_page, MPOL_MF_MOVE_ALL, 0,
- true);
+ ret = migrate_huge_page(hpage, new_page, MPOL_MF_MOVE_ALL, 0, true);
+ put_page(hpage);
if (ret) {
- struct page *page1, *page2;
- list_for_each_entry_safe(page1, page2, &pagelist, lru)
- put_page(page1);
-
pr_info("soft offline: %#lx: migration failed %d, type %lx\n",
pfn, ret, page->flags);
- if (ret > 0)
- ret = -EIO;
return ret;
}
done:
diff --git a/mm/migrate.c b/mm/migrate.c
index ab81d48..927254c 100644
--- a/mm/migrate.c
+++ b/mm/migrate.c
@@ -929,15 +929,8 @@ static int unmap_and_move_huge_page(new_page_t get_new_page,
if (anon_vma)
put_anon_vma(anon_vma);
unlock_page(hpage);
-
out:
- if (rc != -EAGAIN) {
- list_del(&hpage->lru);
- put_page(hpage);
- }
-
put_page(new_hpage);
-
if (result) {
if (rc)
*result = rc;
@@ -1013,48 +1006,32 @@ out:
return nr_failed + retry;
}
-int migrate_huge_pages(struct list_head *from,
- new_page_t get_new_page, unsigned long private, bool offlining,
- enum migrate_mode mode)
+int migrate_huge_page(struct page *hpage, new_page_t get_new_page,
+ unsigned long private, bool offlining,
+ enum migrate_mode mode)
{
- int retry = 1;
- int nr_failed = 0;
- int pass = 0;
- struct page *page;
- struct page *page2;
- int rc;
-
- for (pass = 0; pass < 10 && retry; pass++) {
- retry = 0;
-
- list_for_each_entry_safe(page, page2, from, lru) {
+ int pass, rc;
+
+ for (pass = 0; pass < 10; pass++) {
+ rc = unmap_and_move_huge_page(get_new_page,
+ private, hpage, pass > 2, offlining,
+ mode);
+ switch (rc) {
+ case -ENOMEM:
+ goto out;
+ case -EAGAIN:
+ /* try again */
cond_resched();
-
- rc = unmap_and_move_huge_page(get_new_page,
- private, page, pass > 2, offlining,
- mode);
-
- switch(rc) {
- case -ENOMEM:
- goto out;
- case -EAGAIN:
- retry++;
- break;
- case 0:
- break;
- default:
- /* Permanent failure */
- nr_failed++;
- break;
- }
+ break;
+ case 0:
+ goto out;
+ default:
+ rc = -EIO;
+ goto out;
}
}
- rc = 0;
out:
- if (rc)
- return rc;
-
- return nr_failed + retry;
+ return rc;
}
#ifdef CONFIG_NUMA
--
1.7.10
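The simplified retry loop in migrate_huge_page() has three distinct exits: -ENOMEM and success stop immediately, -EAGAIN retries for up to 10 passes, and any other error is collapsed to a permanent -EIO. A userspace sketch of that control flow; attempt() is a stand-in for unmap_and_move_huge_page(), and succeed_on_third() is a hypothetical callback for illustration:

```c
#include <assert.h>

#define EAGAIN 11
#define ENOMEM 12
#define EIO	 5

/* retry a single-page migration attempt, mirroring the patch's loop */
static int migrate_one(int (*attempt)(int pass, void *ctx), void *ctx)
{
	int pass, rc = -EAGAIN;

	for (pass = 0; pass < 10; pass++) {
		rc = attempt(pass, ctx);
		switch (rc) {
		case -ENOMEM:
			goto out;	/* no point retrying */
		case -EAGAIN:
			continue;	/* transient: try again */
		case 0:
			goto out;	/* migrated */
		default:
			rc = -EIO;	/* permanent failure */
			goto out;
		}
	}
out:
	return rc;
}

static int calls;

/* example callback: transient failure twice, then success */
static int succeed_on_third(int pass, void *ctx)
{
	(void)pass;
	(void)ctx;
	return ++calls < 3 ? -EAGAIN : 0;
}
```

Collapsing permanent failures to -EIO is what lets soft_offline_huge_page() drop its "if (ret > 0) ret = -EIO" fixup.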
From: "Aneesh Kumar K.V" <[email protected]>
Rename max_hstate to hugetlb_max_hstate. We will be using this from other
subsystems, like the hugetlb controller, in later patches.
Signed-off-by: Aneesh Kumar K.V <[email protected]>
Reviewed-by: KAMEZAWA Hiroyuki <[email protected]>
Acked-by: Hillf Danton <[email protected]>
Acked-by: Michal Hocko <[email protected]>
Cc: Andrea Arcangeli <[email protected]>
Cc: Johannes Weiner <[email protected]>
---
mm/hugetlb.c | 14 +++++++-------
1 file changed, 7 insertions(+), 7 deletions(-)
diff --git a/mm/hugetlb.c b/mm/hugetlb.c
index 285a81e..e07d4cd 100644
--- a/mm/hugetlb.c
+++ b/mm/hugetlb.c
@@ -34,7 +34,7 @@ const unsigned long hugetlb_zero = 0, hugetlb_infinity = ~0UL;
static gfp_t htlb_alloc_mask = GFP_HIGHUSER;
unsigned long hugepages_treat_as_movable;
-static int max_hstate;
+static int hugetlb_max_hstate;
unsigned int default_hstate_idx;
struct hstate hstates[HUGE_MAX_HSTATE];
@@ -46,7 +46,7 @@ static unsigned long __initdata default_hstate_max_huge_pages;
static unsigned long __initdata default_hstate_size;
#define for_each_hstate(h) \
- for ((h) = hstates; (h) < &hstates[max_hstate]; (h)++)
+ for ((h) = hstates; (h) < &hstates[hugetlb_max_hstate]; (h)++)
/*
* Protects updates to hugepage_freelists, nr_huge_pages, and free_huge_pages
@@ -1897,9 +1897,9 @@ void __init hugetlb_add_hstate(unsigned order)
printk(KERN_WARNING "hugepagesz= specified twice, ignoring\n");
return;
}
- BUG_ON(max_hstate >= HUGE_MAX_HSTATE);
+ BUG_ON(hugetlb_max_hstate >= HUGE_MAX_HSTATE);
BUG_ON(order == 0);
- h = &hstates[max_hstate++];
+ h = &hstates[hugetlb_max_hstate++];
h->order = order;
h->mask = ~((1ULL << (order + PAGE_SHIFT)) - 1);
h->nr_huge_pages = 0;
@@ -1920,10 +1920,10 @@ static int __init hugetlb_nrpages_setup(char *s)
static unsigned long *last_mhp;
/*
- * !max_hstate means we haven't parsed a hugepagesz= parameter yet,
+ * !hugetlb_max_hstate means we haven't parsed a hugepagesz= parameter yet,
* so this hugepages= parameter goes to the "default hstate".
*/
- if (!max_hstate)
+ if (!hugetlb_max_hstate)
mhp = &default_hstate_max_huge_pages;
else
mhp = &parsed_hstate->max_huge_pages;
@@ -1942,7 +1942,7 @@ static int __init hugetlb_nrpages_setup(char *s)
* But we need to allocate >= MAX_ORDER hstates here early to still
* use the bootmem allocator.
*/
- if (max_hstate && parsed_hstate->order >= MAX_ORDER)
+ if (hugetlb_max_hstate && parsed_hstate->order >= MAX_ORDER)
hugetlb_hstate_alloc_pages(parsed_hstate);
last_mhp = mhp;
--
1.7.10
On Wed, May 30, 2012 at 08:08:46PM +0530, Aneesh Kumar K.V wrote:
> From: "Aneesh Kumar K.V" <[email protected]>
>
> Rename max_hstate to hugetlb_max_hstate. We will be using this from other
> subsystems like hugetlb controller in later patches.
>
> Signed-off-by: Aneesh Kumar K.V <[email protected]>
> Reviewed-by: KAMEZAWA Hiroyuki <[email protected]>
> Acked-by: Hillf Danton <[email protected]>
> Acked-by: Michal Hocko <[email protected]>
> Cc: Andrea Arcangeli <[email protected]>
> Cc: Johannes Weiner <[email protected]>
Your SOB needs to be the last thing.
> ---
> mm/hugetlb.c | 14 +++++++-------
> 1 file changed, 7 insertions(+), 7 deletions(-)
>
> diff --git a/mm/hugetlb.c b/mm/hugetlb.c
> index 285a81e..e07d4cd 100644
> --- a/mm/hugetlb.c
> +++ b/mm/hugetlb.c
> @@ -34,7 +34,7 @@ const unsigned long hugetlb_zero = 0, hugetlb_infinity = ~0UL;
> static gfp_t htlb_alloc_mask = GFP_HIGHUSER;
> unsigned long hugepages_treat_as_movable;
>
> -static int max_hstate;
> +static int hugetlb_max_hstate;
> unsigned int default_hstate_idx;
> struct hstate hstates[HUGE_MAX_HSTATE];
>
> @@ -46,7 +46,7 @@ static unsigned long __initdata default_hstate_max_huge_pages;
> static unsigned long __initdata default_hstate_size;
>
> #define for_each_hstate(h) \
> - for ((h) = hstates; (h) < &hstates[max_hstate]; (h)++)
> + for ((h) = hstates; (h) < &hstates[hugetlb_max_hstate]; (h)++)
>
> /*
> * Protects updates to hugepage_freelists, nr_huge_pages, and free_huge_pages
> @@ -1897,9 +1897,9 @@ void __init hugetlb_add_hstate(unsigned order)
> printk(KERN_WARNING "hugepagesz= specified twice, ignoring\n");
> return;
> }
> - BUG_ON(max_hstate >= HUGE_MAX_HSTATE);
> + BUG_ON(hugetlb_max_hstate >= HUGE_MAX_HSTATE);
> BUG_ON(order == 0);
> - h = &hstates[max_hstate++];
> + h = &hstates[hugetlb_max_hstate++];
> h->order = order;
> h->mask = ~((1ULL << (order + PAGE_SHIFT)) - 1);
> h->nr_huge_pages = 0;
> @@ -1920,10 +1920,10 @@ static int __init hugetlb_nrpages_setup(char *s)
> static unsigned long *last_mhp;
>
> /*
> - * !max_hstate means we haven't parsed a hugepagesz= parameter yet,
> + * !hugetlb_max_hstate means we haven't parsed a hugepagesz= parameter yet,
> * so this hugepages= parameter goes to the "default hstate".
> */
> - if (!max_hstate)
> + if (!hugetlb_max_hstate)
> mhp = &default_hstate_max_huge_pages;
> else
> mhp = &parsed_hstate->max_huge_pages;
> @@ -1942,7 +1942,7 @@ static int __init hugetlb_nrpages_setup(char *s)
> * But we need to allocate >= MAX_ORDER hstates here early to still
> * use the bootmem allocator.
> */
> - if (max_hstate && parsed_hstate->order >= MAX_ORDER)
> + if (hugetlb_max_hstate && parsed_hstate->order >= MAX_ORDER)
> hugetlb_hstate_alloc_pages(parsed_hstate);
>
> last_mhp = mhp;
> --
> 1.7.10
>
> --
> To unsubscribe, send a message with 'unsubscribe linux-mm' in
> the body to [email protected]. For more info on Linux MM,
> see: http://www.linux-mm.org/ .
> Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/
> Don't email: <a href=mailto:"[email protected]"> [email protected] </a>
>
On Wed, May 30, 2012 at 08:08:47PM +0530, Aneesh Kumar K.V wrote:
> From: "Aneesh Kumar K.V" <[email protected]>
>
> The current use of VM_FAULT_* codes with ERR_PTR requires us to ensure
> VM_FAULT_* values will not exceed MAX_ERRNO value. Decouple the
> VM_FAULT_* values from MAX_ERRNO.
>
> Signed-off-by: Aneesh Kumar K.V <[email protected]>
> Reviewed-by: KAMEZAWA Hiroyuki <[email protected]>
> Cc: Hillf Danton <[email protected]>
> Cc: Michal Hocko <[email protected]>
> Cc: Andrea Arcangeli <[email protected]>
> Cc: Johannes Weiner <[email protected]>
> ---
> mm/hugetlb.c | 18 +++++++++++++-----
> 1 file changed, 13 insertions(+), 5 deletions(-)
>
> diff --git a/mm/hugetlb.c b/mm/hugetlb.c
> index e07d4cd..8ded02d 100644
> --- a/mm/hugetlb.c
> +++ b/mm/hugetlb.c
> @@ -1123,10 +1123,10 @@ static struct page *alloc_huge_page(struct vm_area_struct *vma,
> */
> chg = vma_needs_reservation(h, vma, addr);
> if (chg < 0)
> - return ERR_PTR(-VM_FAULT_OOM);
> + return ERR_PTR(-ENOMEM);
> if (chg)
> if (hugepage_subpool_get_pages(spool, chg))
> - return ERR_PTR(-VM_FAULT_SIGBUS);
> + return ERR_PTR(-ENOSPC);
Not enough space? Why not just pass what 'hugepage_subpool_get_pages'
returns?
>
> spin_lock(&hugetlb_lock);
> page = dequeue_huge_page_vma(h, vma, addr, avoid_reserve);
> @@ -1136,7 +1136,7 @@ static struct page *alloc_huge_page(struct vm_area_struct *vma,
> page = alloc_buddy_huge_page(h, NUMA_NO_NODE);
> if (!page) {
> hugepage_subpool_put_pages(spool, chg);
> - return ERR_PTR(-VM_FAULT_SIGBUS);
> + return ERR_PTR(-ENOSPC);
-ENOMEM seems more appropriate?
> }
> }
>
> @@ -2496,6 +2496,7 @@ retry_avoidcopy:
> new_page = alloc_huge_page(vma, address, outside_reserve);
>
> if (IS_ERR(new_page)) {
> + int err = PTR_ERR(new_page);
> page_cache_release(old_page);
>
> /*
> @@ -2524,7 +2525,10 @@ retry_avoidcopy:
>
> /* Caller expects lock to be held */
> spin_lock(&mm->page_table_lock);
> - return -PTR_ERR(new_page);
> + if (err == -ENOMEM)
> + return VM_FAULT_OOM;
> + else
> + return VM_FAULT_SIGBUS;
Ah, you are doing it to translate it.
Perhaps you should return -EFAULT for the really bad case
where you need to do OOM and then for all the other cases
return SIGBUS? Or maybe the other way around? ENOSPC doesn't
seem like the right error.
> }
>
> /*
> @@ -2642,7 +2646,11 @@ retry:
> goto out;
> page = alloc_huge_page(vma, address, 0);
> if (IS_ERR(page)) {
> - ret = -PTR_ERR(page);
> + ret = PTR_ERR(page);
> + if (ret == -ENOMEM)
> + ret = VM_FAULT_OOM;
> + else
> + ret = VM_FAULT_SIGBUS;
> goto out;
> }
> clear_huge_page(page, address, pages_per_huge_page(h));
> --
> 1.7.10
>
On Wed, 30 May 2012, Aneesh Kumar K.V wrote:
> From: "Aneesh Kumar K.V" <[email protected]>
>
> Rename max_hstate to hugetlb_max_hstate. We will be using this from other
> subsystems like hugetlb controller in later patches.
>
> Signed-off-by: Aneesh Kumar K.V <[email protected]>
> Reviewed-by: KAMEZAWA Hiroyuki <[email protected]>
> Acked-by: Hillf Danton <[email protected]>
> Acked-by: Michal Hocko <[email protected]>
> Cc: Andrea Arcangeli <[email protected]>
> Cc: Johannes Weiner <[email protected]>
Acked-by: David Rientjes <[email protected]>
On Wed, 30 May 2012, Aneesh Kumar K.V wrote:
> From: "Aneesh Kumar K.V" <[email protected]>
>
> The current use of VM_FAULT_* codes with ERR_PTR requires us to ensure
> VM_FAULT_* values will not exceed MAX_ERRNO value. Decouple the
> VM_FAULT_* values from MAX_ERRNO.
>
Yeah, but is there a reason for using VM_FAULT_HWPOISON_LARGE_MASK since
that's the only VM_FAULT_* value that is greater than MAX_ERRNO? The rest
of your patch set doesn't require this, so I think this change should just
be dropped. (And PTR_ERR() still returns long, this wasn't fixed from my
original review.)
On Wed, 30 May 2012, Aneesh Kumar K.V wrote:
> From: "Aneesh Kumar K.V" <[email protected]>
>
> Add an inline helper and use it in the code.
>
> Signed-off-by: Aneesh Kumar K.V <[email protected]>
> Acked-by: Michal Hocko <[email protected]>
> Reviewed-by: KAMEZAWA Hiroyuki <[email protected]>
> Cc: Hillf Danton <[email protected]>
> Cc: Andrea Arcangeli <[email protected]>
> Cc: Johannes Weiner <[email protected]>
Acked-by: David Rientjes <[email protected]>
> +static inline bool hugetlb_cgroup_have_usage(struct cgroup *cg)
> +{
> + int idx;
> + struct hugetlb_cgroup *h_cg = hugetlb_cgroup_from_cgroup(cg);
> +
> + for (idx = 0; idx < HUGE_MAX_HSTATE; idx++) {
> + if ((res_counter_read_u64(&h_cg->hugepage[idx], RES_USAGE)) > 0)
> + return 1;
return true;
> + }
> + return 0;
And return false here
> +}
> +
> +static struct cgroup_subsys_state *hugetlb_cgroup_create(struct cgroup *cgroup)
> +{
> + int idx;
> + struct cgroup *parent_cgroup;
> + struct hugetlb_cgroup *h_cgroup, *parent_h_cgroup;
> +
> + h_cgroup = kzalloc(sizeof(*h_cgroup), GFP_KERNEL);
> + if (!h_cgroup)
> + return ERR_PTR(-ENOMEM);
> +
No need to check cgroup for NULL?
> + parent_cgroup = cgroup->parent;
> + if (parent_cgroup) {
> + parent_h_cgroup = hugetlb_cgroup_from_cgroup(parent_cgroup);
> + for (idx = 0; idx < HUGE_MAX_HSTATE; idx++)
> + res_counter_init(&h_cgroup->hugepage[idx],
> + &parent_h_cgroup->hugepage[idx]);
> + } else {
> + root_h_cgroup = h_cgroup;
> + for (idx = 0; idx < HUGE_MAX_HSTATE; idx++)
> + res_counter_init(&h_cgroup->hugepage[idx], NULL);
> + }
> + return &h_cgroup->css;
> +}
> +
> +static int hugetlb_cgroup_move_parent(int idx, struct cgroup *cgroup,
> + struct page *page)
> +{
> + int csize, ret = 0;
> + struct page_cgroup *pc;
> + struct res_counter *counter;
> + struct res_counter *fail_res;
> + struct hugetlb_cgroup *h_cg = hugetlb_cgroup_from_cgroup(cgroup);
> + struct hugetlb_cgroup *parent = parent_hugetlb_cgroup(cgroup);
> +
> + if (!get_page_unless_zero(page))
> + goto out;
Hmm, so it goes to out, and does return ret. ret is zero. Is
that correct? Should ret be set to -EBUSY or such?
> +
> + pc = lookup_page_cgroup(page);
What if pc is NULL? Or is it guaranteed that it will
never happen?
> + lock_page_cgroup(pc);
> + if (!PageCgroupUsed(pc) || pc->cgroup != cgroup)
> + goto err_out;
ret is still zero here. Is that OK? Should it be -EINVAL
or such?
> +
> + csize = PAGE_SIZE << compound_order(page);
> + /* If use_hierarchy == 0, we need to charge root */
> + if (!parent) {
> + parent = root_h_cgroup;
> + /* root has no limit */
> + res_counter_charge_nofail(&parent->hugepage[idx],
> + csize, &fail_res);
> + }
> + counter = &h_cg->hugepage[idx];
> + res_counter_uncharge_until(counter, counter->parent, csize);
> +
> + pc->cgroup = cgroup->parent;
> +err_out:
> + unlock_page_cgroup(pc);
> + put_page(page);
> +out:
> + return ret;
> +}
> +
> +/*
> + * Force the hugetlb cgroup to empty the hugetlb resources by moving them to
> + * the parent cgroup.
> + */
> +static int hugetlb_cgroup_pre_destroy(struct cgroup *cgroup)
> +{
> + struct hstate *h;
> + struct page *page;
> + int ret = 0, idx = 0;
> +
> + do {
> + if (cgroup_task_count(cgroup) ||
> + !list_empty(&cgroup->children)) {
> + ret = -EBUSY;
> + goto out;
> + }
> + /*
> + * If the task doing the cgroup_rmdir got a signal
> + * we don't really need to loop till the hugetlb resource
> + * usage become zero.
Why don't we need to loop? Is somebody else (and if so can you
say who) doing the deletion?
> + */
> + if (signal_pending(current)) {
> + ret = -EINTR;
> + goto out;
> + }
> + for_each_hstate(h) {
> + spin_lock(&hugetlb_lock);
> + list_for_each_entry(page, &h->hugepage_activelist, lru) {
> + ret = hugetlb_cgroup_move_parent(idx, cgroup, page);
> + if (ret) {
> + spin_unlock(&hugetlb_lock);
> + goto out;
> + }
> + }
> + spin_unlock(&hugetlb_lock);
> + idx++;
> + }
> + cond_resched();
> + } while (hugetlb_cgroup_have_usage(cgroup));
> +out:
> + return ret;
> +}
> +
> +static void hugetlb_cgroup_destroy(struct cgroup *cgroup)
> +{
> + struct hugetlb_cgroup *h_cgroup;
> +
> + h_cgroup = hugetlb_cgroup_from_cgroup(cgroup);
> + kfree(h_cgroup);
> +}
> +
> +int hugetlb_cgroup_charge_page(int idx, unsigned long nr_pages,
> + struct hugetlb_cgroup **ptr)
> +{
> + int ret = 0;
> + struct res_counter *fail_res;
> + struct hugetlb_cgroup *h_cg = NULL;
> + unsigned long csize = nr_pages * PAGE_SIZE;
> +
> + if (hugetlb_cgroup_disabled())
> + goto done;
> +again:
> + rcu_read_lock();
> + h_cg = hugetlb_cgroup_from_task(current);
> + if (!h_cg)
> + h_cg = root_h_cgroup;
> +
> + if (!css_tryget(&h_cg->css)) {
> + rcu_read_unlock();
> + goto again;
You don't want some form of limit on how many times you can
loop around?
> + }
> + rcu_read_unlock();
> +
> + ret = res_counter_charge(&h_cg->hugepage[idx], csize, &fail_res);
> + css_put(&h_cg->css);
> +done:
> + *ptr = h_cg;
> + return ret;
> +}
> +
On Wed, May 30, 2012 at 08:08:56PM +0530, Aneesh Kumar K.V wrote:
> From: "Aneesh Kumar K.V" <[email protected]>
>
> Add the control files for hugetlb controller
>
> Signed-off-by: Aneesh Kumar K.V <[email protected]>
> ---
> include/linux/hugetlb.h | 5 ++
> include/linux/hugetlb_cgroup.h | 6 ++
> mm/hugetlb.c | 2 +
> mm/hugetlb_cgroup.c | 130 ++++++++++++++++++++++++++++++++++++++++
> 4 files changed, 143 insertions(+)
>
> diff --git a/include/linux/hugetlb.h b/include/linux/hugetlb.h
> index dcd55c7..92f75a5 100644
> --- a/include/linux/hugetlb.h
> +++ b/include/linux/hugetlb.h
> @@ -4,6 +4,7 @@
> #include <linux/mm_types.h>
> #include <linux/fs.h>
> #include <linux/hugetlb_inline.h>
> +#include <linux/cgroup.h>
>
> struct ctl_table;
> struct user_struct;
> @@ -221,6 +222,10 @@ struct hstate {
> unsigned int nr_huge_pages_node[MAX_NUMNODES];
> unsigned int free_huge_pages_node[MAX_NUMNODES];
> unsigned int surplus_huge_pages_node[MAX_NUMNODES];
> +#ifdef CONFIG_CGROUP_HUGETLB_RES_CTLR
> + /* cgroup control files */
> + struct cftype cgroup_files[5];
Why five? Should there be a #define for this magic value?
> +#endif
> char name[HSTATE_NAME_LEN];
> };
>
> diff --git a/include/linux/hugetlb_cgroup.h b/include/linux/hugetlb_cgroup.h
> index 5794be4..fbf8c5f 100644
> --- a/include/linux/hugetlb_cgroup.h
> +++ b/include/linux/hugetlb_cgroup.h
> @@ -42,6 +42,7 @@ extern void hugetlb_cgroup_uncharge_page(int idx, unsigned long nr_pages,
> struct page *page);
> extern void hugetlb_cgroup_uncharge_cgroup(int idx, unsigned long nr_pages,
> struct hugetlb_cgroup *h_cg);
> +extern int hugetlb_cgroup_file_init(int idx) __init;
> #else
> static inline bool hugetlb_cgroup_disabled(void)
> {
> @@ -75,5 +76,10 @@ hugetlb_cgroup_uncharge_cgroup(int idx, unsigned long nr_pages,
> {
> return;
> }
> +
> +static inline int __init hugetlb_cgroup_file_init(int idx)
> +{
> + return 0;
> +}
> #endif /* CONFIG_MEM_RES_CTLR_HUGETLB */
> #endif
> diff --git a/mm/hugetlb.c b/mm/hugetlb.c
> index 53840dd..6330de2 100644
> --- a/mm/hugetlb.c
> +++ b/mm/hugetlb.c
> @@ -29,6 +29,7 @@
> #include <linux/io.h>
> #include <linux/hugetlb.h>
> #include <linux/node.h>
> +#include <linux/hugetlb_cgroup.h>
> #include "internal.h"
>
> const unsigned long hugetlb_zero = 0, hugetlb_infinity = ~0UL;
> @@ -1912,6 +1913,7 @@ void __init hugetlb_add_hstate(unsigned order)
> h->next_nid_to_free = first_node(node_states[N_HIGH_MEMORY]);
> snprintf(h->name, HSTATE_NAME_LEN, "hugepages-%lukB",
> huge_page_size(h)/1024);
> + hugetlb_cgroup_file_init(hugetlb_max_hstate - 1);
>
> parsed_hstate = h;
> }
> diff --git a/mm/hugetlb_cgroup.c b/mm/hugetlb_cgroup.c
> index 3a288f7..49a3f20 100644
> --- a/mm/hugetlb_cgroup.c
> +++ b/mm/hugetlb_cgroup.c
> @@ -19,6 +19,11 @@
> #include <linux/page_cgroup.h>
> #include <linux/hugetlb_cgroup.h>
>
> +/* lifted from mem control */
Might also include the comment from said file explaining the
purpose of these #defines.
> +#define MEMFILE_PRIVATE(x, val) (((x) << 16) | (val))
> +#define MEMFILE_IDX(val) (((val) >> 16) & 0xffff)
> +#define MEMFILE_ATTR(val) ((val) & 0xffff)
> +
> struct cgroup_subsys hugetlb_subsys __read_mostly;
> struct hugetlb_cgroup *root_h_cgroup __read_mostly;
>
> @@ -271,6 +276,131 @@ void hugetlb_cgroup_uncharge_cgroup(int idx, unsigned long nr_pages,
> return;
> }
>
> +static ssize_t hugetlb_cgroup_read(struct cgroup *cgroup, struct cftype *cft,
> + struct file *file, char __user *buf,
> + size_t nbytes, loff_t *ppos)
> +{
> + u64 val;
> + char str[64];
I would think there would be a define for this somewhere?
> + int idx, name, len;
> + struct hugetlb_cgroup *h_cg = hugetlb_cgroup_from_cgroup(cgroup);
> +
> + idx = MEMFILE_IDX(cft->private);
> + name = MEMFILE_ATTR(cft->private);
> +
> + val = res_counter_read_u64(&h_cg->hugepage[idx], name);
> + len = scnprintf(str, sizeof(str), "%llu\n", (unsigned long long)val);
> + return simple_read_from_buffer(buf, nbytes, ppos, str, len);
> +}
> +
> +static int hugetlb_cgroup_write(struct cgroup *cgroup, struct cftype *cft,
> + const char *buffer)
> +{
> + int idx, name, ret;
> + unsigned long long val;
> + struct hugetlb_cgroup *h_cg = hugetlb_cgroup_from_cgroup(cgroup);
> +
> + idx = MEMFILE_IDX(cft->private);
> + name = MEMFILE_ATTR(cft->private);
> +
> + switch (name) {
> + case RES_LIMIT:
> + if (hugetlb_cgroup_is_root(h_cg)) {
> + /* Can't set limit on root */
> + ret = -EINVAL;
> + break;
> + }
> + /* This function does all necessary parse...reuse it */
> + ret = res_counter_memparse_write_strategy(buffer, &val);
> + if (ret)
> + break;
> + ret = res_counter_set_limit(&h_cg->hugepage[idx], val);
> + break;
> + default:
> + ret = -EINVAL;
> + break;
> + }
> + return ret;
> +}
> +
> +static int hugetlb_cgroup_reset(struct cgroup *cgroup, unsigned int event)
> +{
> + int idx, name, ret = 0;
> + struct hugetlb_cgroup *h_cg = hugetlb_cgroup_from_cgroup(cgroup);
> +
> + idx = MEMFILE_IDX(event);
> + name = MEMFILE_ATTR(event);
> +
> + switch (name) {
> + case RES_MAX_USAGE:
> + res_counter_reset_max(&h_cg->hugepage[idx]);
> + break;
> + case RES_FAILCNT:
> + res_counter_reset_failcnt(&h_cg->hugepage[idx]);
> + break;
> + default:
> + ret = -EINVAL;
> + break;
> + }
> + return ret;
> +}
> +
> +static char *mem_fmt(char *buf, int size, unsigned long hsize)
> +{
> + if (hsize >= (1UL << 30))
> + snprintf(buf, size, "%luGB", hsize >> 30);
> + else if (hsize >= (1UL << 20))
> + snprintf(buf, size, "%luMB", hsize >> 20);
> + else
> + snprintf(buf, size, "%luKB", hsize >> 10);
> + return buf;
> +}
> +
> +int __init hugetlb_cgroup_file_init(int idx)
> +{
> + char buf[32];
#define pls.
> + struct cftype *cft;
> + struct hstate *h = &hstates[idx];
> +
> + /* format the size */
> + mem_fmt(buf, 32, huge_page_size(h));
> +
> + /* Add the limit file */
> + cft = &h->cgroup_files[0];
> + snprintf(cft->name, MAX_CFTYPE_NAME, "%s.limit_in_bytes", buf);
> + cft->private = MEMFILE_PRIVATE(idx, RES_LIMIT);
> + cft->read = hugetlb_cgroup_read;
> + cft->write_string = hugetlb_cgroup_write;
> +
> + /* Add the usage file */
> + cft = &h->cgroup_files[1];
> + snprintf(cft->name, MAX_CFTYPE_NAME, "%s.usage_in_bytes", buf);
> + cft->private = MEMFILE_PRIVATE(idx, RES_USAGE);
> + cft->read = hugetlb_cgroup_read;
> +
> + /* Add the MAX usage file */
> + cft = &h->cgroup_files[2];
> + snprintf(cft->name, MAX_CFTYPE_NAME, "%s.max_usage_in_bytes", buf);
> + cft->private = MEMFILE_PRIVATE(idx, RES_MAX_USAGE);
> + cft->trigger = hugetlb_cgroup_reset;
> + cft->read = hugetlb_cgroup_read;
> +
> + /* Add the failcntfile */
> + cft = &h->cgroup_files[3];
> + snprintf(cft->name, MAX_CFTYPE_NAME, "%s.failcnt", buf);
> + cft->private = MEMFILE_PRIVATE(idx, RES_FAILCNT);
> + cft->trigger = hugetlb_cgroup_reset;
> + cft->read = hugetlb_cgroup_read;
> +
> + /* NULL terminate the last cft */
> + cft = &h->cgroup_files[4];
> + memset(cft, 0, sizeof(*cft));
> +
> + WARN_ON(cgroup_add_cftypes(&hugetlb_subsys, h->cgroup_files));
> +
Wouldn't doing:
return cgroup_add_cftypes(&hugetlb_subsys, h->cgroup_files);
be more appropriate?
> + return 0;
> +}
> +
> struct cgroup_subsys hugetlb_subsys = {
> .name = "hugetlb",
> .create = hugetlb_cgroup_create,
> --
> 1.7.10
>
On Wed, 30 May 2012, Aneesh Kumar K.V wrote:
> diff --git a/fs/hugetlbfs/inode.c b/fs/hugetlbfs/inode.c
> index cc9281b..ff233e4 100644
> --- a/fs/hugetlbfs/inode.c
> +++ b/fs/hugetlbfs/inode.c
> @@ -416,8 +416,8 @@ hugetlb_vmtruncate_list(struct prio_tree_root *root, pgoff_t pgoff)
> else
> v_offset = 0;
>
> - __unmap_hugepage_range(vma,
> - vma->vm_start + v_offset, vma->vm_end, NULL);
> + unmap_hugepage_range(vma, vma->vm_start + v_offset,
> + vma->vm_end, NULL);
> }
> }
>
> diff --git a/include/linux/hugetlb.h b/include/linux/hugetlb.h
> index 217f528..c21e136 100644
> --- a/include/linux/hugetlb.h
> +++ b/include/linux/hugetlb.h
> @@ -7,6 +7,7 @@
>
> struct ctl_table;
> struct user_struct;
> +struct mmu_gather;
>
> #ifdef CONFIG_HUGETLB_PAGE
>
> @@ -40,9 +41,10 @@ int follow_hugetlb_page(struct mm_struct *, struct vm_area_struct *,
> struct page **, struct vm_area_struct **,
> unsigned long *, int *, int, unsigned int flags);
> void unmap_hugepage_range(struct vm_area_struct *,
> - unsigned long, unsigned long, struct page *);
> -void __unmap_hugepage_range(struct vm_area_struct *,
> - unsigned long, unsigned long, struct page *);
> + unsigned long, unsigned long, struct page *);
> +void __unmap_hugepage_range(struct mmu_gather *tlb, struct vm_area_struct *vms,
s/vms/vma/
> + unsigned long start, unsigned long end,
> + struct page *ref_page);
> int hugetlb_prefault(struct address_space *, struct vm_area_struct *);
> void hugetlb_report_meminfo(struct seq_file *);
> int hugetlb_report_node_meminfo(int, char *);
> @@ -98,7 +100,6 @@ static inline unsigned long hugetlb_total_pages(void)
> #define follow_huge_addr(mm, addr, write) ERR_PTR(-EINVAL)
> #define copy_hugetlb_page_range(src, dst, vma) ({ BUG(); 0; })
> #define hugetlb_prefault(mapping, vma) ({ BUG(); 0; })
> -#define unmap_hugepage_range(vma, start, end, page) BUG()
> static inline void hugetlb_report_meminfo(struct seq_file *m)
> {
> }
Why?
> @@ -112,13 +113,24 @@ static inline void hugetlb_report_meminfo(struct seq_file *m)
> #define hugetlb_free_pgd_range(tlb, addr, end, floor, ceiling) ({BUG(); 0; })
> #define hugetlb_fault(mm, vma, addr, flags) ({ BUG(); 0; })
> #define huge_pte_offset(mm, address) 0
> -#define dequeue_hwpoisoned_huge_page(page) 0
> +static inline int dequeue_hwpoisoned_huge_page(struct page *page)
> +{
> + return 0;
> +}
> +
Unrelated to this patchset.
> static inline void copy_huge_page(struct page *dst, struct page *src)
> {
> }
>
> #define hugetlb_change_protection(vma, address, end, newprot)
>
> +static inline void __unmap_hugepage_range(struct mmu_gather *tlb,
> + struct vm_area_struct *vma, unsigned long start,
> + unsigned long end, struct page *ref_page)
> +{
> + BUG();
> +}
> +
I think this should be done under the unmap_hugepage_range() definition
you removed (and change it to be a static inline function as well).
> #endif /* !CONFIG_HUGETLB_PAGE */
>
> #define HUGETLB_ANON_FILE "anon_hugepage"
> diff --git a/mm/hugetlb.c b/mm/hugetlb.c
> index 9b97a5c..704a269 100644
> --- a/mm/hugetlb.c
> +++ b/mm/hugetlb.c
> @@ -24,8 +24,9 @@
>
> #include <asm/page.h>
> #include <asm/pgtable.h>
> -#include <linux/io.h>
> +#include <asm/tlb.h>
>
> +#include <linux/io.h>
> #include <linux/hugetlb.h>
> #include <linux/node.h>
> #include "internal.h"
> @@ -2310,30 +2311,26 @@ static int is_hugetlb_entry_hwpoisoned(pte_t pte)
> return 0;
> }
>
> -void __unmap_hugepage_range(struct vm_area_struct *vma, unsigned long start,
> - unsigned long end, struct page *ref_page)
> +void __unmap_hugepage_range(struct mmu_gather *tlb, struct vm_area_struct *vma,
> + unsigned long start, unsigned long end,
> + struct page *ref_page)
> {
> + int force_flush = 0;
Can this be bool?
> struct mm_struct *mm = vma->vm_mm;
> unsigned long address;
> pte_t *ptep;
> pte_t pte;
> struct page *page;
> - struct page *tmp;
> struct hstate *h = hstate_vma(vma);
> unsigned long sz = huge_page_size(h);
>
> - /*
> - * A page gathering list, protected by per file i_mmap_mutex. The
> - * lock is used to avoid list corruption from multiple unmapping
> - * of the same page since we are using page->lru.
> - */
> - LIST_HEAD(page_list);
> -
> WARN_ON(!is_vm_hugetlb_page(vma));
> BUG_ON(start & ~huge_page_mask(h));
> BUG_ON(end & ~huge_page_mask(h));
>
> + tlb_start_vma(tlb, vma);
> mmu_notifier_invalidate_range_start(mm, start, end);
> +again:
> spin_lock(&mm->page_table_lock);
> for (address = start; address < end; address += sz) {
> ptep = huge_pte_offset(mm, address);
> @@ -2372,30 +2369,45 @@ void __unmap_hugepage_range(struct vm_area_struct *vma, unsigned long start,
> }
>
> pte = huge_ptep_get_and_clear(mm, address, ptep);
> + tlb_remove_tlb_entry(tlb, ptep, address);
> if (pte_dirty(pte))
> set_page_dirty(page);
> - list_add(&page->lru, &page_list);
>
> + page_remove_rmap(page);
> + force_flush = !__tlb_remove_page(tlb, page);
> + if (force_flush)
> + break;
> /* Bail out after unmapping reference page if supplied */
> if (ref_page)
> break;
> }
> - flush_tlb_range(vma, start, end);
> spin_unlock(&mm->page_table_lock);
> - mmu_notifier_invalidate_range_end(mm, start, end);
> - list_for_each_entry_safe(page, tmp, &page_list, lru) {
> - page_remove_rmap(page);
> - list_del(&page->lru);
> - put_page(page);
> + /*
> + * mmu_gather ran out of room to batch pages, we break out of
> + * the PTE lock to avoid doing the potential expensive TLB invalidate
> + * and page-free while holding it.
> + */
> + if (force_flush) {
> + force_flush = 0;
> + tlb_flush_mmu(tlb);
> + if (address < end && !ref_page)
> + goto again;
Shouldn't you copy "start" at the beginning of this function and then
update that copy here, using it as the loop initialization?
> }
> + mmu_notifier_invalidate_range_end(mm, start, end);
> + tlb_end_vma(tlb, vma);
> }
>
> void unmap_hugepage_range(struct vm_area_struct *vma, unsigned long start,
> unsigned long end, struct page *ref_page)
> {
> - mutex_lock(&vma->vm_file->f_mapping->i_mmap_mutex);
> - __unmap_hugepage_range(vma, start, end, ref_page);
> - mutex_unlock(&vma->vm_file->f_mapping->i_mmap_mutex);
> + struct mm_struct *mm;
> + struct mmu_gather tlb;
> +
> + mm = vma->vm_mm;
> +
> + tlb_gather_mmu(&tlb, mm, 0);
> + __unmap_hugepage_range(&tlb, vma, start, end, ref_page);
> + tlb_finish_mmu(&tlb, start, end);
> }
>
> /*
On Wed, 30 May 2012, Aneesh Kumar K.V wrote:
> From: "Aneesh Kumar K.V" <[email protected]>
>
> i_mmap_mutex lock was added in unmap_single_vma by 502717f4e ("hugetlb:
> fix linked list corruption in unmap_hugepage_range()") but we don't use
> page->lru in unmap_hugepage_range any more. Also the lock was taken
> higher up in the stack in some code path. That would result in deadlock.
>
> unmap_mapping_range (i_mmap_mutex)
> -> unmap_mapping_range_tree
> -> unmap_mapping_range_vma
> -> zap_page_range_single
> -> unmap_single_vma
> -> unmap_hugepage_range (i_mmap_mutex)
>
You should be able to show this with lockdep?
> For shared pagetable support for huge pages, since pagetable pages are ref
> counted we don't need any lock during huge_pmd_unshare. We do take
> i_mmap_mutex in huge_pmd_share while walking the vma_prio_tree in mapping.
> (39dde65c9940c97f ("shared page table for hugetlb page")).
>
I think this should be folded into patch 4, the code you're removing here
is just added in that function unnecessarily.
On Wed, May 30, 2012 at 06:57:47PM -0700, David Rientjes wrote:
> On Wed, 30 May 2012, Aneesh Kumar K.V wrote:
>
> > From: "Aneesh Kumar K.V" <[email protected]>
> >
> > i_mmap_mutex lock was added in unmap_single_vma by 502717f4e ("hugetlb:
> > fix linked list corruption in unmap_hugepage_range()") but we don't use
> > page->lru in unmap_hugepage_range any more. Also the lock was taken
> > higher up in the stack in some code path. That would result in deadlock.
> >
> > unmap_mapping_range (i_mmap_mutex)
> > -> unmap_mapping_range_tree
> > -> unmap_mapping_range_vma
> > -> zap_page_range_single
> > -> unmap_single_vma
> > -> unmap_hugepage_range (i_mmap_mutex)
> >
>
> You should be able to show this with lockdep?
I was not able to get a lockdep report.
>
> > For shared pagetable support for huge pages, since pagetable pages are ref
> > counted we don't need any lock during huge_pmd_unshare. We do take
> > i_mmap_mutex in huge_pmd_share while walking the vma_prio_tree in mapping.
> > (39dde65c9940c97f ("shared page table for hugetlb page")).
> >
>
> I think this should be folded into patch 4, the code you're removing here
> is just added in that function unnecessarily.
>
I am removing i_mmap_mutex in this patch. That is not added in patch 4.
-aneesh
On Wed, May 30, 2012 at 06:56:36PM -0700, David Rientjes wrote:
> On Wed, 30 May 2012, Aneesh Kumar K.V wrote:
>
> > diff --git a/fs/hugetlbfs/inode.c b/fs/hugetlbfs/inode.c
> > index cc9281b..ff233e4 100644
> > --- a/fs/hugetlbfs/inode.c
> > +++ b/fs/hugetlbfs/inode.c
> > @@ -416,8 +416,8 @@ hugetlb_vmtruncate_list(struct prio_tree_root *root, pgoff_t pgoff)
> > else
> > v_offset = 0;
> >
> > - __unmap_hugepage_range(vma,
> > - vma->vm_start + v_offset, vma->vm_end, NULL);
> > + unmap_hugepage_range(vma, vma->vm_start + v_offset,
> > + vma->vm_end, NULL);
> > }
> > }
> >
> > diff --git a/include/linux/hugetlb.h b/include/linux/hugetlb.h
> > index 217f528..c21e136 100644
> > --- a/include/linux/hugetlb.h
> > +++ b/include/linux/hugetlb.h
> > @@ -7,6 +7,7 @@
> >
> > struct ctl_table;
> > struct user_struct;
> > +struct mmu_gather;
> >
> > #ifdef CONFIG_HUGETLB_PAGE
> >
> > @@ -40,9 +41,10 @@ int follow_hugetlb_page(struct mm_struct *, struct vm_area_struct *,
> > struct page **, struct vm_area_struct **,
> > unsigned long *, int *, int, unsigned int flags);
> > void unmap_hugepage_range(struct vm_area_struct *,
> > - unsigned long, unsigned long, struct page *);
> > -void __unmap_hugepage_range(struct vm_area_struct *,
> > - unsigned long, unsigned long, struct page *);
> > + unsigned long, unsigned long, struct page *);
> > +void __unmap_hugepage_range(struct mmu_gather *tlb, struct vm_area_struct *vms,
>
> s/vms/vma/
done
>
> > + unsigned long start, unsigned long end,
> > + struct page *ref_page);
> > int hugetlb_prefault(struct address_space *, struct vm_area_struct *);
> > void hugetlb_report_meminfo(struct seq_file *);
> > int hugetlb_report_node_meminfo(int, char *);
> > @@ -98,7 +100,6 @@ static inline unsigned long hugetlb_total_pages(void)
> > #define follow_huge_addr(mm, addr, write) ERR_PTR(-EINVAL)
> > #define copy_hugetlb_page_range(src, dst, vma) ({ BUG(); 0; })
> > #define hugetlb_prefault(mapping, vma) ({ BUG(); 0; })
> > -#define unmap_hugepage_range(vma, start, end, page) BUG()
> > static inline void hugetlb_report_meminfo(struct seq_file *m)
> > {
> > }
>
> Why?
unmap_hugepage_range() is no longer used when CONFIG_HUGETLB_PAGE is not set.
>
> > @@ -112,13 +113,24 @@ static inline void hugetlb_report_meminfo(struct seq_file *m)
> > #define hugetlb_free_pgd_range(tlb, addr, end, floor, ceiling) ({BUG(); 0; })
> > #define hugetlb_fault(mm, vma, addr, flags) ({ BUG(); 0; })
> > #define huge_pte_offset(mm, address) 0
> > -#define dequeue_hwpoisoned_huge_page(page) 0
> > +static inline int dequeue_hwpoisoned_huge_page(struct page *page)
> > +{
> > + return 0;
> > +}
> > +
>
> Unrelated from this patchset.
It throws a warning. Yes, it could be a separate patch, but I was not
sure whether to move that one-line change out of this series.
>
> > static inline void copy_huge_page(struct page *dst, struct page *src)
> > {
> > }
> >
> > #define hugetlb_change_protection(vma, address, end, newprot)
> >
> > +static inline void __unmap_hugepage_range(struct mmu_gather *tlb,
> > + struct vm_area_struct *vma, unsigned long start,
> > + unsigned long end, struct page *ref_page)
> > +{
> > + BUG();
> > +}
> > +
>
> I think this should be done under the unmap_hugepage_range() definition
> you removed (and change it to be a static inline function as well).
Below is what unmap_hugepage_range() looks like after all the changes. It doesn't get
used if CONFIG_HUGETLB_PAGE is not enabled, but we do call __unmap_hugepage_range() from
common code. If we get called with hugetlb not enabled, that implies a BUG().
void unmap_hugepage_range(struct vm_area_struct *vma, unsigned long start,
			  unsigned long end, struct page *ref_page)
{
	struct mm_struct *mm;
	struct mmu_gather tlb;

	mm = vma->vm_mm;
	tlb_gather_mmu(&tlb, mm, 0);
	__unmap_hugepage_range(&tlb, vma, start, end, ref_page);
	tlb_finish_mmu(&tlb, start, end);
}
>
> > #endif /* !CONFIG_HUGETLB_PAGE */
> >
> > #define HUGETLB_ANON_FILE "anon_hugepage"
> > diff --git a/mm/hugetlb.c b/mm/hugetlb.c
> > index 9b97a5c..704a269 100644
> > --- a/mm/hugetlb.c
> > +++ b/mm/hugetlb.c
> > @@ -24,8 +24,9 @@
> >
> > #include <asm/page.h>
> > #include <asm/pgtable.h>
> > -#include <linux/io.h>
> > +#include <asm/tlb.h>
> >
> > +#include <linux/io.h>
> > #include <linux/hugetlb.h>
> > #include <linux/node.h>
> > #include "internal.h"
> > @@ -2310,30 +2311,26 @@ static int is_hugetlb_entry_hwpoisoned(pte_t pte)
> > return 0;
> > }
> >
> > -void __unmap_hugepage_range(struct vm_area_struct *vma, unsigned long start,
> > - unsigned long end, struct page *ref_page)
> > +void __unmap_hugepage_range(struct mmu_gather *tlb, struct vm_area_struct *vma,
> > + unsigned long start, unsigned long end,
> > + struct page *ref_page)
> > {
> > + int force_flush = 0;
>
> Can this be bool?
>
> > struct mm_struct *mm = vma->vm_mm;
> > unsigned long address;
> > pte_t *ptep;
> > pte_t pte;
> > struct page *page;
> > - struct page *tmp;
> > struct hstate *h = hstate_vma(vma);
> > unsigned long sz = huge_page_size(h);
> >
> > - /*
> > - * A page gathering list, protected by per file i_mmap_mutex. The
> > - * lock is used to avoid list corruption from multiple unmapping
> > - * of the same page since we are using page->lru.
> > - */
> > - LIST_HEAD(page_list);
> > -
> > WARN_ON(!is_vm_hugetlb_page(vma));
> > BUG_ON(start & ~huge_page_mask(h));
> > BUG_ON(end & ~huge_page_mask(h));
> >
> > + tlb_start_vma(tlb, vma);
> > mmu_notifier_invalidate_range_start(mm, start, end);
> > +again:
> > spin_lock(&mm->page_table_lock);
> > for (address = start; address < end; address += sz) {
> > ptep = huge_pte_offset(mm, address);
> > @@ -2372,30 +2369,45 @@ void __unmap_hugepage_range(struct vm_area_struct *vma, unsigned long start,
> > }
> >
> > pte = huge_ptep_get_and_clear(mm, address, ptep);
> > + tlb_remove_tlb_entry(tlb, ptep, address);
> > if (pte_dirty(pte))
> > set_page_dirty(page);
> > - list_add(&page->lru, &page_list);
> >
> > + page_remove_rmap(page);
> > + force_flush = !__tlb_remove_page(tlb, page);
> > + if (force_flush)
> > + break;
> > /* Bail out after unmapping reference page if supplied */
> > if (ref_page)
> > break;
> > }
> > - flush_tlb_range(vma, start, end);
> > spin_unlock(&mm->page_table_lock);
> > - mmu_notifier_invalidate_range_end(mm, start, end);
> > - list_for_each_entry_safe(page, tmp, &page_list, lru) {
> > - page_remove_rmap(page);
> > - list_del(&page->lru);
> > - put_page(page);
> > + /*
> > + * mmu_gather ran out of room to batch pages, we break out of
> > + * the PTE lock to avoid doing the potential expensive TLB invalidate
> > + * and page-free while holding it.
> > + */
> > + if (force_flush) {
> > + force_flush = 0;
> > + tlb_flush_mmu(tlb);
> > + if (address < end && !ref_page)
> > + goto again;
>
> Shouldn't be copying "start" at the beginning of this function and then
> updating that copy now and use it as the loop initialization?
>
I didn't want to make larger changes here. My goal was to switch to the mmu_gather API
without changing the rest of the loop logic.
> > }
> > + mmu_notifier_invalidate_range_end(mm, start, end);
> > + tlb_end_vma(tlb, vma);
> > }
> >
> > void unmap_hugepage_range(struct vm_area_struct *vma, unsigned long start,
> > unsigned long end, struct page *ref_page)
> > {
> > - mutex_lock(&vma->vm_file->f_mapping->i_mmap_mutex);
> > - __unmap_hugepage_range(vma, start, end, ref_page);
> > - mutex_unlock(&vma->vm_file->f_mapping->i_mmap_mutex);
> > + struct mm_struct *mm;
> > + struct mmu_gather tlb;
> > +
> > + mm = vma->vm_mm;
> > +
> > + tlb_gather_mmu(&tlb, mm, 0);
> > + __unmap_hugepage_range(&tlb, vma, start, end, ref_page);
> > + tlb_finish_mmu(&tlb, start, end);
> > }
> >
> > /*
>
-aneesh
On Wed, May 30, 2012 at 09:32:25PM -0400, Konrad Rzeszutek Wilk wrote:
> On Wed, May 30, 2012 at 08:08:56PM +0530, Aneesh Kumar K.V wrote:
> > From: "Aneesh Kumar K.V" <[email protected]>
> >
> > Add the control files for hugetlb controller
> >
> > Signed-off-by: Aneesh Kumar K.V <[email protected]>
> > ---
> > include/linux/hugetlb.h | 5 ++
> > include/linux/hugetlb_cgroup.h | 6 ++
> > mm/hugetlb.c | 2 +
> > mm/hugetlb_cgroup.c | 130 ++++++++++++++++++++++++++++++++++++++++
> > 4 files changed, 143 insertions(+)
> >
> > diff --git a/include/linux/hugetlb.h b/include/linux/hugetlb.h
> > index dcd55c7..92f75a5 100644
> > --- a/include/linux/hugetlb.h
> > +++ b/include/linux/hugetlb.h
> > @@ -4,6 +4,7 @@
> > #include <linux/mm_types.h>
> > #include <linux/fs.h>
> > #include <linux/hugetlb_inline.h>
> > +#include <linux/cgroup.h>
> >
> > struct ctl_table;
> > struct user_struct;
> > @@ -221,6 +222,10 @@ struct hstate {
> > unsigned int nr_huge_pages_node[MAX_NUMNODES];
> > unsigned int free_huge_pages_node[MAX_NUMNODES];
> > unsigned int surplus_huge_pages_node[MAX_NUMNODES];
> > +#ifdef CONFIG_CGROUP_HUGETLB_RES_CTLR
> > + /* cgroup control files */
> > + struct cftype cgroup_files[5];
>
> Why five? Should there be a #define for this magic value?
>
Because we have four control files plus a NULL-terminating entry. I was not sure whether
that should be a #define, because we are not going to use it anywhere else. The same
patch indexes them in hugetlb_file_init, which is the only place the value is used.
> > +#endif
> > char name[HSTATE_NAME_LEN];
> > };
> >
> > diff --git a/include/linux/hugetlb_cgroup.h b/include/linux/hugetlb_cgroup.h
> > index 5794be4..fbf8c5f 100644
> > --- a/include/linux/hugetlb_cgroup.h
> > +++ b/include/linux/hugetlb_cgroup.h
> > @@ -42,6 +42,7 @@ extern void hugetlb_cgroup_uncharge_page(int idx, unsigned long nr_pages,
> > struct page *page);
> > extern void hugetlb_cgroup_uncharge_cgroup(int idx, unsigned long nr_pages,
> > struct hugetlb_cgroup *h_cg);
> > +extern int hugetlb_cgroup_file_init(int idx) __init;
> > #else
> > static inline bool hugetlb_cgroup_disabled(void)
> > {
> > @@ -75,5 +76,10 @@ hugetlb_cgroup_uncharge_cgroup(int idx, unsigned long nr_pages,
> > {
> > return;
> > }
> > +
> > +static inline int __init hugetlb_cgroup_file_init(int idx)
> > +{
> > + return 0;
> > +}
> > #endif /* CONFIG_MEM_RES_CTLR_HUGETLB */
> > #endif
> > diff --git a/mm/hugetlb.c b/mm/hugetlb.c
> > index 53840dd..6330de2 100644
> > --- a/mm/hugetlb.c
> > +++ b/mm/hugetlb.c
> > @@ -29,6 +29,7 @@
> > #include <linux/io.h>
> > #include <linux/hugetlb.h>
> > #include <linux/node.h>
> > +#include <linux/hugetlb_cgroup.h>
> > #include "internal.h"
> >
> > const unsigned long hugetlb_zero = 0, hugetlb_infinity = ~0UL;
> > @@ -1912,6 +1913,7 @@ void __init hugetlb_add_hstate(unsigned order)
> > h->next_nid_to_free = first_node(node_states[N_HIGH_MEMORY]);
> > snprintf(h->name, HSTATE_NAME_LEN, "hugepages-%lukB",
> > huge_page_size(h)/1024);
> > + hugetlb_cgroup_file_init(hugetlb_max_hstate - 1);
> >
> > parsed_hstate = h;
> > }
> > diff --git a/mm/hugetlb_cgroup.c b/mm/hugetlb_cgroup.c
> > index 3a288f7..49a3f20 100644
> > --- a/mm/hugetlb_cgroup.c
> > +++ b/mm/hugetlb_cgroup.c
> > @@ -19,6 +19,11 @@
> > #include <linux/page_cgroup.h>
> > #include <linux/hugetlb_cgroup.h>
> >
> > +/* lifted from mem control */
>
> Might also include the comment from said file explaining the
> purpose of these #define.
>
> > +#define MEMFILE_PRIVATE(x, val) (((x) << 16) | (val))
> > +#define MEMFILE_IDX(val) (((val) >> 16) & 0xffff)
> > +#define MEMFILE_ATTR(val) ((val) & 0xffff)
> > +
> > struct cgroup_subsys hugetlb_subsys __read_mostly;
> > struct hugetlb_cgroup *root_h_cgroup __read_mostly;
> >
> > @@ -271,6 +276,131 @@ void hugetlb_cgroup_uncharge_cgroup(int idx, unsigned long nr_pages,
> > return;
> > }
> >
> > +static ssize_t hugetlb_cgroup_read(struct cgroup *cgroup, struct cftype *cft,
> > + struct file *file, char __user *buf,
> > + size_t nbytes, loff_t *ppos)
> > +{
> > + u64 val;
> > + char str[64];
>
> I would think there would be a define for this somewhere?
Lifted from mem_cgroup_read(). The number is big enough to hold the formatted return
value for usage_in_bytes, limit_in_bytes, etc.
>
> > + int idx, name, len;
> > + struct hugetlb_cgroup *h_cg = hugetlb_cgroup_from_cgroup(cgroup);
> > +
> > + idx = MEMFILE_IDX(cft->private);
> > + name = MEMFILE_ATTR(cft->private);
> > +
> > + val = res_counter_read_u64(&h_cg->hugepage[idx], name);
> > + len = scnprintf(str, sizeof(str), "%llu\n", (unsigned long long)val);
> > + return simple_read_from_buffer(buf, nbytes, ppos, str, len);
> > +}
> > +
......
......
> > + cft->private = MEMFILE_PRIVATE(idx, RES_USAGE);
> > + cft->read = hugetlb_cgroup_read;
> > +
> > + /* Add the MAX usage file */
> > + cft = &h->cgroup_files[2];
> > + snprintf(cft->name, MAX_CFTYPE_NAME, "%s.max_usage_in_bytes", buf);
> > + cft->private = MEMFILE_PRIVATE(idx, RES_MAX_USAGE);
> > + cft->trigger = hugetlb_cgroup_reset;
> > + cft->read = hugetlb_cgroup_read;
> > +
> > + /* Add the failcntfile */
> > + cft = &h->cgroup_files[3];
> > + snprintf(cft->name, MAX_CFTYPE_NAME, "%s.failcnt", buf);
> > + cft->private = MEMFILE_PRIVATE(idx, RES_FAILCNT);
> > + cft->trigger = hugetlb_cgroup_reset;
> > + cft->read = hugetlb_cgroup_read;
> > +
> > + /* NULL terminate the last cft */
> > + cft = &h->cgroup_files[4];
> > + memset(cft, 0, sizeof(*cft));
> > +
> > + WARN_ON(cgroup_add_cftypes(&hugetlb_subsys, h->cgroup_files));
> > +
>
> Wouldn't doing:
> return cgroup_add_cftypes(&hugetlb_subsys, h->cgroup_files);
>
> be more appropiate?
cgroup wanted a WARN_ON around that, IIUC. I guess we can drop all of these later.
> > + return 0;
> > +}
> > +
> > struct cgroup_subsys hugetlb_subsys = {
> > .name = "hugetlb",
> > .create = hugetlb_cgroup_create,
> > --
> > 1.7.10
> >
-aneesh
On Wed, May 30, 2012 at 09:19:54PM -0400, Konrad Rzeszutek Wilk wrote:
> > +static inline bool hugetlb_cgroup_have_usage(struct cgroup *cg)
> > +{
> > + int idx;
> > + struct hugetlb_cgroup *h_cg = hugetlb_cgroup_from_cgroup(cg);
> > +
> > + for (idx = 0; idx < HUGE_MAX_HSTATE; idx++) {
> > + if ((res_counter_read_u64(&h_cg->hugepage[idx], RES_USAGE)) > 0)
> > + return 1;
>
> return true;
> > + }
> > + return 0;
>
> And return false here
> > +}
> > +
> > +static struct cgroup_subsys_state *hugetlb_cgroup_create(struct cgroup *cgroup)
> > +{
> > + int idx;
> > + struct cgroup *parent_cgroup;
> > + struct hugetlb_cgroup *h_cgroup, *parent_h_cgroup;
> > +
> > + h_cgroup = kzalloc(sizeof(*h_cgroup), GFP_KERNEL);
> > + if (!h_cgroup)
> > + return ERR_PTR(-ENOMEM);
> > +
>
> No need to check cgroup for NULL?
Other cgroups (memcg) don't do that. Can we really get a NULL cgroup there?
>
> > + parent_cgroup = cgroup->parent;
> > + if (parent_cgroup) {
> > + parent_h_cgroup = hugetlb_cgroup_from_cgroup(parent_cgroup);
> > + for (idx = 0; idx < HUGE_MAX_HSTATE; idx++)
> > + res_counter_init(&h_cgroup->hugepage[idx],
> > + &parent_h_cgroup->hugepage[idx]);
> > + } else {
> > + root_h_cgroup = h_cgroup;
> > + for (idx = 0; idx < HUGE_MAX_HSTATE; idx++)
> > + res_counter_init(&h_cgroup->hugepage[idx], NULL);
> > + }
> > + return &h_cgroup->css;
> > +}
> > +
> > +static int hugetlb_cgroup_move_parent(int idx, struct cgroup *cgroup,
> > + struct page *page)
> > +{
> > + int csize, ret = 0;
> > + struct page_cgroup *pc;
> > + struct res_counter *counter;
> > + struct res_counter *fail_res;
> > + struct hugetlb_cgroup *h_cg = hugetlb_cgroup_from_cgroup(cgroup);
> > + struct hugetlb_cgroup *parent = parent_hugetlb_cgroup(cgroup);
> > +
> > + if (!get_page_unless_zero(page))
> > + goto out;
>
> Hmm, so it goes to out, and does return ret. ret is zero. Is
> that correct? Should ret be set to -EBUSY or such?
>
Fixed
> > +
> > + pc = lookup_page_cgroup(page);
>
> What if pc is NULL? Or is it guaranteed that it will
> never happen so?
>
> > + lock_page_cgroup(pc);
> > + if (!PageCgroupUsed(pc) || pc->cgroup != cgroup)
> > + goto err_out;
>
> ret is still set to zero. Is that OK? Should it be -EINVAL
> or such?
>
Fixed
> > +
> > + csize = PAGE_SIZE << compound_order(page);
> > + /* If use_hierarchy == 0, we need to charge root */
> > + if (!parent) {
> > + parent = root_h_cgroup;
> > + /* root has no limit */
> > + res_counter_charge_nofail(&parent->hugepage[idx],
> > + csize, &fail_res);
> > + }
> > + counter = &h_cg->hugepage[idx];
> > + res_counter_uncharge_until(counter, counter->parent, csize);
> > +
> > + pc->cgroup = cgroup->parent;
> > +err_out:
> > + unlock_page_cgroup(pc);
> > + put_page(page);
> > +out:
> > + return ret;
> > +}
> > +
> > +/*
> > + * Force the hugetlb cgroup to empty the hugetlb resources by moving them to
> > + * the parent cgroup.
> > + */
> > +static int hugetlb_cgroup_pre_destroy(struct cgroup *cgroup)
> > +{
> > + struct hstate *h;
> > + struct page *page;
> > + int ret = 0, idx = 0;
> > +
> > + do {
> > + if (cgroup_task_count(cgroup) ||
> > + !list_empty(&cgroup->children)) {
> > + ret = -EBUSY;
> > + goto out;
> > + }
> > + /*
> > + * If the task doing the cgroup_rmdir got a signal
> > + * we don't really need to loop till the hugetlb resource
> > + * usage become zero.
>
> Why don't we need to loop? Is somebody else (and if so can you
> say who) doing the deletion?
>
No, we just bail out without completing the deletion and handle the signal.
> > + */
> > + if (signal_pending(current)) {
> > + ret = -EINTR;
> > + goto out;
> > + }
> > + for_each_hstate(h) {
> > + spin_lock(&hugetlb_lock);
> > + list_for_each_entry(page, &h->hugepage_activelist, lru) {
> > + ret = hugetlb_cgroup_move_parent(idx, cgroup, page);
> > + if (ret) {
> > + spin_unlock(&hugetlb_lock);
> > + goto out;
> > + }
> > + }
> > + spin_unlock(&hugetlb_lock);
> > + idx++;
> > + }
> > + cond_resched();
> > + } while (hugetlb_cgroup_have_usage(cgroup));
> > +out:
> > + return ret;
> > +}
> > +
> > +static void hugetlb_cgroup_destroy(struct cgroup *cgroup)
> > +{
> > + struct hugetlb_cgroup *h_cgroup;
> > +
> > + h_cgroup = hugetlb_cgroup_from_cgroup(cgroup);
> > + kfree(h_cgroup);
> > +}
> > +
> > +int hugetlb_cgroup_charge_page(int idx, unsigned long nr_pages,
> > + struct hugetlb_cgroup **ptr)
> > +{
> > + int ret = 0;
> > + struct res_counter *fail_res;
> > + struct hugetlb_cgroup *h_cg = NULL;
> > + unsigned long csize = nr_pages * PAGE_SIZE;
> > +
> > + if (hugetlb_cgroup_disabled())
> > + goto done;
> > +again:
> > + rcu_read_lock();
> > + h_cg = hugetlb_cgroup_from_task(current);
> > + if (!h_cg)
> > + h_cg = root_h_cgroup;
> > +
> > + if (!css_tryget(&h_cg->css)) {
> > + rcu_read_unlock();
> > + goto again;
>
> You don't want some form of limit on how many times you can
> loop around?
>
You mean fail the allocation after some number of retries? memcg doesn't do that either.
> > + }
> > + rcu_read_unlock();
> > +
> > + ret = res_counter_charge(&h_cg->hugepage[idx], csize, &fail_res);
> > + css_put(&h_cg->css);
> > +done:
> > + *ptr = h_cg;
> > + return ret;
> > +}
> > +
>
-aneesh
On Wed, May 30, 2012 at 06:02:59PM -0700, David Rientjes wrote:
> On Wed, 30 May 2012, Aneesh Kumar K.V wrote:
>
> > From: "Aneesh Kumar K.V" <[email protected]>
> >
> > The current use of VM_FAULT_* codes with ERR_PTR requires us to ensure
> > VM_FAULT_* values will not exceed MAX_ERRNO value. Decouple the
> > VM_FAULT_* values from MAX_ERRNO.
> >
>
> Yeah, but is there a reason for using VM_FAULT_HWPOISON_LARGE_MASK since
> that's the only VM_FAULT_* value that is greater than MAX_ERRNO? The rest
> of your patch set doesn't require this, so I think this change should just
> be dropped. (And PTR_ERR() still returns long, this wasn't fixed from my
> original review.)
>
The change was made per Andrew's request, so that we don't have such hidden
dependencies on the values of VM_FAULT_*. Yes, it can be a separate patch from
the patchset. I have changed int to long as per your review.
-aneesh
On Wed, May 30, 2012 at 08:48:40PM -0400, Konrad Rzeszutek Wilk wrote:
> On Wed, May 30, 2012 at 08:08:46PM +0530, Aneesh Kumar K.V wrote:
> > From: "Aneesh Kumar K.V" <[email protected]>
> >
> > Rename max_hstate to hugetlb_max_hstate. We will be using this from other
> > subsystems like hugetlb controller in later patches.
> >
> > Signed-off-by: Aneesh Kumar K.V <[email protected]>
> > Reviewed-by: KAMEZAWA Hiroyuki <[email protected]>
> > Acked-by: Hillf Danton <[email protected]>
> > Acked-by: Michal Hocko <[email protected]>
> > Cc: Andrea Arcangeli <[email protected]>
> > Cc: Johannes Weiner <[email protected]>
>
> Your SOB needs to be the last thing.
I started with the patches in -next, because Andrew had a few fixups on top of the last
patch series, so I ended up with this format. I have fixed them locally now.
-aneesh
On Thu, 31 May 2012, Aneesh Kumar K.V wrote:
> > Yeah, but is there a reason for using VM_FAULT_HWPOISON_LARGE_MASK since
> > that's the only VM_FAULT_* value that is greater than MAX_ERRNO? The rest
> > of your patch set doesn't require this, so I think this change should just
> > be dropped. (And PTR_ERR() still returns long, this wasn't fixed from my
> > original review.)
> >
>
> The change was made per Andrew's request, so that we don't have such hidden
> dependencies on the values of VM_FAULT_*. Yes, it can be a separate patch from
> the patchset. I have changed int to long as per your review.
>
I think it obfuscates the code. Can't we just add something like
BUILD_BUG_ON() to ensure that PTR_ERR() never uses values outside
the bounds of MAX_ERRNO, so we catch these at compile time if
mm/hugetlb.c or anything else is ever extended to use such values?
On Thu 31-05-12 11:13:16, Aneesh Kumar K.V wrote:
> On Wed, May 30, 2012 at 09:19:54PM -0400, Konrad Rzeszutek Wilk wrote:
[...]
> > > +static struct cgroup_subsys_state *hugetlb_cgroup_create(struct cgroup *cgroup)
> > > +{
> > > + int idx;
> > > + struct cgroup *parent_cgroup;
> > > + struct hugetlb_cgroup *h_cgroup, *parent_h_cgroup;
> > > +
> > > + h_cgroup = kzalloc(sizeof(*h_cgroup), GFP_KERNEL);
> > > + if (!h_cgroup)
> > > + return ERR_PTR(-ENOMEM);
> > > +
> >
> > No need to check cgroup for NULL?
>
> Other cgroups (memcg) doesn't do that. Can we really get NULL cgroup tere ?
No, we cannot. See cfa449461e67b60df986170eecb089831fa9e49a.
--
Michal Hocko
SUSE Labs
SUSE LINUX s.r.o.
Lihovarska 1060/12
190 00 Praha 9
Czech Republic
On Wed 30-05-12 20:08:55, Aneesh Kumar K.V wrote:
> From: "Aneesh Kumar K.V" <[email protected]>
>
> This patch implements a new controller that allows us to control HugeTLB
> allocations. The extension allows to limit the HugeTLB usage per control
> group and enforces the controller limit during page fault. Since HugeTLB
> doesn't support page reclaim, enforcing the limit at page fault time implies
> that, the application will get SIGBUS signal if it tries to access HugeTLB
> pages beyond its limit. This requires the application to know beforehand
> how much HugeTLB pages it would require for its use.
You forgot to mention that the tracking is based on page_cgroup, which
is essential IMO. This also means that shadow pages are allocated for
_every_ single page in the system, even though only preallocated huge
pages (their heads, to be precise) use them. Please mention that in the
Kconfig help text as well. Users should be aware of it.
The overhead is huge, but this might change in the future because there is
a tendency to merge page_cgroup with struct page.
I would also appreciate if you describe the motivation why is this a
separate controller here in the description.
You are also changing the behavior of cgroup_disable slightly. Many users of
distribution kernels are used to disabling the memory controller (which is
compiled in by default), primarily because of its memory footprint, so
they use the cgroup_disable=memory boot parameter. Things change with
this patch because that won't be enough: they have to learn about the
hugetlb controller, which has to be disabled as well (and distributions will
have to compile it in as well).
As I already mentioned earlier, I do not see any of these as a show
stopper. If people feel strongly that this should be separate, because
they need only hugetlb page tracking without memcg, then why not.
It is definitely much better than the range tracking proposed at the
beginning.
--
Michal Hocko
SUSE Labs
(2012/05/30 23:38), Aneesh Kumar K.V wrote:
> From: "Aneesh Kumar K.V"<[email protected]>
>
> We will use it later to make page_cgroup track the hugetlb cgroup information.
>
> Signed-off-by: Aneesh Kumar K.V<[email protected]>
> ---
> include/linux/mmzone.h | 2 +-
> include/linux/page_cgroup.h | 8 ++++----
> init/Kconfig | 4 ++++
> mm/Makefile | 3 ++-
> mm/memcontrol.c | 42 +++++++++++++++++++++++++-----------------
> 5 files changed, 36 insertions(+), 23 deletions(-)
>
> diff --git a/include/linux/mmzone.h b/include/linux/mmzone.h
> index 2427706..2483cc5 100644
> --- a/include/linux/mmzone.h
> +++ b/include/linux/mmzone.h
> @@ -1052,7 +1052,7 @@ struct mem_section {
>
> /* See declaration of similar field in struct zone */
> unsigned long *pageblock_flags;
> -#ifdef CONFIG_CGROUP_MEM_RES_CTLR
> +#ifdef CONFIG_PAGE_CGROUP
> /*
> * If !SPARSEMEM, pgdat doesn't have page_cgroup pointer. We use
> * section. (see memcontrol.h/page_cgroup.h about this.)
> diff --git a/include/linux/page_cgroup.h b/include/linux/page_cgroup.h
> index a88cdba..7bbfe37 100644
> --- a/include/linux/page_cgroup.h
> +++ b/include/linux/page_cgroup.h
> @@ -12,7 +12,7 @@ enum {
> #ifndef __GENERATING_BOUNDS_H
> #include<generated/bounds.h>
>
> -#ifdef CONFIG_CGROUP_MEM_RES_CTLR
> +#ifdef CONFIG_PAGE_CGROUP
> #include<linux/bit_spinlock.h>
>
> /*
> @@ -24,7 +24,7 @@ enum {
> */
> struct page_cgroup {
> unsigned long flags;
> - struct mem_cgroup *mem_cgroup;
> + struct cgroup *cgroup;
> };
>
This patch seems very bad.
- What is the performance impact on memcg? Doesn't this add extra overhead
to the memcg lookup?
- Hugetlb requires a much smaller amount of tracking information than
memcg does. I guess you can record the information in page->private
if you want.
- This may interfere with the work on reducing the size of page_cgroup.
So, strong Nack to this. I guess you can use page->private or some entries in
struct page; you have many pages per accounting unit. Please make an effort
to avoid using page_cgroup.
Thanks,
-Kame
> void __meminit pgdat_page_cgroup_init(struct pglist_data *pgdat);
> @@ -82,7 +82,7 @@ static inline void unlock_page_cgroup(struct page_cgroup *pc)
> bit_spin_unlock(PCG_LOCK,&pc->flags);
> }
>
> -#else /* CONFIG_CGROUP_MEM_RES_CTLR */
> +#else /* CONFIG_PAGE_CGROUP */
> struct page_cgroup;
>
> static inline void __meminit pgdat_page_cgroup_init(struct pglist_data *pgdat)
> @@ -102,7 +102,7 @@ static inline void __init page_cgroup_init_flatmem(void)
> {
> }
>
> -#endif /* CONFIG_CGROUP_MEM_RES_CTLR */
> +#endif /* CONFIG_PAGE_CGROUP */
>
> #include<linux/swap.h>
>
> diff --git a/init/Kconfig b/init/Kconfig
> index 81816b8..1363203 100644
> --- a/init/Kconfig
> +++ b/init/Kconfig
> @@ -687,10 +687,14 @@ config RESOURCE_COUNTERS
> This option enables controller independent resource accounting
> infrastructure that works with cgroups.
>
> +config PAGE_CGROUP
> + bool
> +
> config CGROUP_MEM_RES_CTLR
> bool "Memory Resource Controller for Control Groups"
> depends on RESOURCE_COUNTERS
> select MM_OWNER
> + select PAGE_CGROUP
> help
> Provides a memory resource controller that manages both anonymous
> memory and page cache. (See Documentation/cgroups/memory.txt)
> diff --git a/mm/Makefile b/mm/Makefile
> index a156285..a70f9a9 100644
> --- a/mm/Makefile
> +++ b/mm/Makefile
> @@ -47,7 +47,8 @@ obj-$(CONFIG_FS_XIP) += filemap_xip.o
> obj-$(CONFIG_MIGRATION) += migrate.o
> obj-$(CONFIG_QUICKLIST) += quicklist.o
> obj-$(CONFIG_TRANSPARENT_HUGEPAGE) += huge_memory.o
> -obj-$(CONFIG_CGROUP_MEM_RES_CTLR) += memcontrol.o page_cgroup.o
> +obj-$(CONFIG_CGROUP_MEM_RES_CTLR) += memcontrol.o
> +obj-$(CONFIG_PAGE_CGROUP) += page_cgroup.o
> obj-$(CONFIG_MEMORY_FAILURE) += memory-failure.o
> obj-$(CONFIG_HWPOISON_INJECT) += hwpoison-inject.o
> obj-$(CONFIG_DEBUG_KMEMLEAK) += kmemleak.o
> diff --git a/mm/memcontrol.c b/mm/memcontrol.c
> index ac35bcc..6df019b 100644
> --- a/mm/memcontrol.c
> +++ b/mm/memcontrol.c
> @@ -864,6 +864,8 @@ static void memcg_check_events(struct mem_cgroup *memcg, struct page *page)
>
> struct mem_cgroup *mem_cgroup_from_cont(struct cgroup *cont)
> {
> + if (!cont)
> + return NULL;
> return container_of(cgroup_subsys_state(cont,
> mem_cgroup_subsys_id), struct mem_cgroup,
> css);
> @@ -1097,7 +1099,7 @@ struct lruvec *mem_cgroup_page_lruvec(struct page *page, struct zone *zone)
> return&zone->lruvec;
>
> pc = lookup_page_cgroup(page);
> - memcg = pc->mem_cgroup;
> + memcg = mem_cgroup_from_cont(pc->cgroup);
>
> /*
> * Surreptitiously switch any uncharged offlist page to root:
> @@ -1108,8 +1110,10 @@ struct lruvec *mem_cgroup_page_lruvec(struct page *page, struct zone *zone)
> * under page_cgroup lock: between them, they make all uses
> * of pc->mem_cgroup safe.
> */
> - if (!PageLRU(page)&& !PageCgroupUsed(pc)&& memcg != root_mem_cgroup)
> - pc->mem_cgroup = memcg = root_mem_cgroup;
> + if (!PageLRU(page)&& !PageCgroupUsed(pc)&& memcg != root_mem_cgroup) {
> + memcg = root_mem_cgroup;
> + pc->cgroup = memcg->css.cgroup;
> + }
>
> mz = page_cgroup_zoneinfo(memcg, page);
> return&mz->lruvec;
> @@ -1889,12 +1893,14 @@ static bool mem_cgroup_handle_oom(struct mem_cgroup *memcg, gfp_t mask,
> void __mem_cgroup_begin_update_page_stat(struct page *page,
> bool *locked, unsigned long *flags)
> {
> + struct cgroup *cgroup;
> struct mem_cgroup *memcg;
> struct page_cgroup *pc;
>
> pc = lookup_page_cgroup(page);
> again:
> - memcg = pc->mem_cgroup;
> + cgroup = pc->cgroup;
> + memcg = mem_cgroup_from_cont(cgroup);
> if (unlikely(!memcg || !PageCgroupUsed(pc)))
> return;
> /*
> @@ -1907,7 +1913,7 @@ again:
> return;
>
> move_lock_mem_cgroup(memcg, flags);
> - if (memcg != pc->mem_cgroup || !PageCgroupUsed(pc)) {
> + if (cgroup != pc->cgroup || !PageCgroupUsed(pc)) {
> move_unlock_mem_cgroup(memcg, flags);
> goto again;
> }
> @@ -1923,7 +1929,7 @@ void __mem_cgroup_end_update_page_stat(struct page *page, unsigned long *flags)
> * lock is held because a routine modifies pc->mem_cgroup
> * should take move_lock_page_cgroup().
> */
> - move_unlock_mem_cgroup(pc->mem_cgroup, flags);
> + move_unlock_mem_cgroup(mem_cgroup_from_cont(pc->cgroup), flags);
> }
>
> void mem_cgroup_update_page_stat(struct page *page,
> @@ -1936,7 +1942,7 @@ void mem_cgroup_update_page_stat(struct page *page,
> if (mem_cgroup_disabled())
> return;
>
> - memcg = pc->mem_cgroup;
> + memcg = mem_cgroup_from_cont(pc->cgroup);
> if (unlikely(!memcg || !PageCgroupUsed(pc)))
> return;
>
> @@ -2444,7 +2450,7 @@ struct mem_cgroup *try_get_mem_cgroup_from_page(struct page *page)
> pc = lookup_page_cgroup(page);
> lock_page_cgroup(pc);
> if (PageCgroupUsed(pc)) {
> - memcg = pc->mem_cgroup;
> + memcg = mem_cgroup_from_cont(pc->cgroup);
> if (memcg&& !css_tryget(&memcg->css))
> memcg = NULL;
> } else if (PageSwapCache(page)) {
> @@ -2491,14 +2497,15 @@ static void __mem_cgroup_commit_charge(struct mem_cgroup *memcg,
> zone = page_zone(page);
> spin_lock_irq(&zone->lru_lock);
> if (PageLRU(page)) {
> - lruvec = mem_cgroup_zone_lruvec(zone, pc->mem_cgroup);
> + lruvec = mem_cgroup_zone_lruvec(zone,
> + mem_cgroup_from_cont(pc->cgroup));
> ClearPageLRU(page);
> del_page_from_lru_list(page, lruvec, page_lru(page));
> was_on_lru = true;
> }
> }
>
> - pc->mem_cgroup = memcg;
> + pc->cgroup = memcg->css.cgroup;
> /*
> * We access a page_cgroup asynchronously without lock_page_cgroup().
> * Especially when a page_cgroup is taken from a page, pc->mem_cgroup
> @@ -2511,7 +2518,8 @@ static void __mem_cgroup_commit_charge(struct mem_cgroup *memcg,
>
> if (lrucare) {
> if (was_on_lru) {
> - lruvec = mem_cgroup_zone_lruvec(zone, pc->mem_cgroup);
> + lruvec = mem_cgroup_zone_lruvec(zone,
> + mem_cgroup_from_cont(pc->cgroup));
> VM_BUG_ON(PageLRU(page));
> SetPageLRU(page);
> add_page_to_lru_list(page, lruvec, page_lru(page));
> @@ -2601,7 +2609,7 @@ static int mem_cgroup_move_account(struct page *page,
> lock_page_cgroup(pc);
>
> ret = -EINVAL;
> - if (!PageCgroupUsed(pc) || pc->mem_cgroup != from)
> + if (!PageCgroupUsed(pc) || pc->cgroup != from->css.cgroup)
> goto unlock;
>
> move_lock_mem_cgroup(from,&flags);
> @@ -2616,7 +2624,7 @@ static int mem_cgroup_move_account(struct page *page,
> mem_cgroup_charge_statistics(from, anon, -nr_pages);
>
> /* caller should have done css_get */
> - pc->mem_cgroup = to;
> + pc->cgroup = to->css.cgroup;
> mem_cgroup_charge_statistics(to, anon, nr_pages);
> /*
> * We charges against "to" which may not have any tasks. Then, "to"
> @@ -2937,7 +2945,7 @@ __mem_cgroup_uncharge_common(struct page *page, enum charge_type ctype)
>
> lock_page_cgroup(pc);
>
> - memcg = pc->mem_cgroup;
> + memcg = mem_cgroup_from_cont(pc->cgroup);
>
> if (!PageCgroupUsed(pc))
> goto unlock_out;
> @@ -3183,7 +3191,7 @@ int mem_cgroup_prepare_migration(struct page *page,
> pc = lookup_page_cgroup(page);
> lock_page_cgroup(pc);
> if (PageCgroupUsed(pc)) {
> - memcg = pc->mem_cgroup;
> + memcg = mem_cgroup_from_cont(pc->cgroup);
> css_get(&memcg->css);
> /*
> * At migrating an anonymous page, its mapcount goes down
> @@ -3328,7 +3336,7 @@ void mem_cgroup_replace_page_cache(struct page *oldpage,
> /* fix accounting on old pages */
> lock_page_cgroup(pc);
> if (PageCgroupUsed(pc)) {
> - memcg = pc->mem_cgroup;
> + memcg = mem_cgroup_from_cont(pc->cgroup);
> mem_cgroup_charge_statistics(memcg, false, -1);
> ClearPageCgroupUsed(pc);
> }
> @@ -5135,7 +5143,7 @@ static enum mc_target_type get_mctgt_type(struct vm_area_struct *vma,
> * mem_cgroup_move_account() checks the pc is valid or not under
> * the lock.
> */
> - if (PageCgroupUsed(pc)&& pc->mem_cgroup == mc.from) {
> + if (PageCgroupUsed(pc)&& pc->cgroup == mc.from->css.cgroup) {
> ret = MC_TARGET_PAGE;
> if (target)
> target->page = page;
Kamezawa Hiroyuki <[email protected]> writes:
> (2012/05/30 23:38), Aneesh Kumar K.V wrote:
>> From: "Aneesh Kumar K.V"<[email protected]>
>>
>> We will use it later to make page_cgroup track the hugetlb cgroup information.
>>
>> Signed-off-by: Aneesh Kumar K.V<[email protected]>
>> ---
>> include/linux/mmzone.h | 2 +-
>> include/linux/page_cgroup.h | 8 ++++----
>> init/Kconfig | 4 ++++
>> mm/Makefile | 3 ++-
>> mm/memcontrol.c | 42 +++++++++++++++++++++++++-----------------
>> 5 files changed, 36 insertions(+), 23 deletions(-)
>>
>> diff --git a/include/linux/mmzone.h b/include/linux/mmzone.h
>> index 2427706..2483cc5 100644
>> --- a/include/linux/mmzone.h
>> +++ b/include/linux/mmzone.h
>> @@ -1052,7 +1052,7 @@ struct mem_section {
>>
>> /* See declaration of similar field in struct zone */
>> unsigned long *pageblock_flags;
>> -#ifdef CONFIG_CGROUP_MEM_RES_CTLR
>> +#ifdef CONFIG_PAGE_CGROUP
>> /*
>> * If !SPARSEMEM, pgdat doesn't have page_cgroup pointer. We use
>> * section. (see memcontrol.h/page_cgroup.h about this.)
>> diff --git a/include/linux/page_cgroup.h b/include/linux/page_cgroup.h
>> index a88cdba..7bbfe37 100644
>> --- a/include/linux/page_cgroup.h
>> +++ b/include/linux/page_cgroup.h
>> @@ -12,7 +12,7 @@ enum {
>> #ifndef __GENERATING_BOUNDS_H
>> #include<generated/bounds.h>
>>
>> -#ifdef CONFIG_CGROUP_MEM_RES_CTLR
>> +#ifdef CONFIG_PAGE_CGROUP
>> #include<linux/bit_spinlock.h>
>>
>> /*
>> @@ -24,7 +24,7 @@ enum {
>> */
>> struct page_cgroup {
>> unsigned long flags;
>> - struct mem_cgroup *mem_cgroup;
>> + struct cgroup *cgroup;
>> };
>>
>
> This patch seems very bad.
I had to change that to

struct page_cgroup {
	unsigned long flags;
	struct cgroup_subsys_state *css;
};
to get memcg to work. We end up changing css.cgroup on cgroupfs mount/umount.
>
> - What is the performance impact to memcg ? Doesn't this add extra overheads
> to memcg lookup ?
Considering that we are stashing the cgroup_subsys_state, recovering the
memcg should be a simple pointer addition. I haven't measured the exact
numbers. Do you have any suggestions on tests I can run?
> - Hugetlb requires a much smaller amount of tracking information than
> memcg does. I guess you can record the information in page->private
> if you want.
So if we end up tracking the page cgroup in struct page, all this extra
overhead will go away. And in most cases we would have both memcg and
hugetlb enabled by default.
> - This may prevent us from doing the 'reducing size of page_cgroup' work
>
By reducing, do you mean moving the struct page_cgroup info into struct
page itself? If so, this should not have any impact, right? Most of
hugetlb's requirements should be similar to memcg's.
> So, strong Nack to this. I guess you can use page->private or some entries
> in struct page; you have many pages per accounting unit. Please make an
> effort to avoid using page_cgroup.
>
HugeTLB already uses page->private of the compound page head to track
the subpool pointer, so we won't be able to use page->private.
-aneesh
(2012/06/05 11:53), Aneesh Kumar K.V wrote:
> Kamezawa Hiroyuki<[email protected]> writes:
>
>> (2012/05/30 23:38), Aneesh Kumar K.V wrote:
>>> From: "Aneesh Kumar K.V" <[email protected]>
>>>
>>> We will use it later to make page_cgroup track the hugetlb cgroup information.
>>>
>>> Signed-off-by: Aneesh Kumar K.V <[email protected]>
>>> ---
>>> include/linux/mmzone.h | 2 +-
>>> include/linux/page_cgroup.h | 8 ++++----
>>> init/Kconfig | 4 ++++
>>> mm/Makefile | 3 ++-
>>> mm/memcontrol.c | 42 +++++++++++++++++++++++++-----------------
>>> 5 files changed, 36 insertions(+), 23 deletions(-)
>>>
>>> diff --git a/include/linux/mmzone.h b/include/linux/mmzone.h
>>> index 2427706..2483cc5 100644
>>> --- a/include/linux/mmzone.h
>>> +++ b/include/linux/mmzone.h
>>> @@ -1052,7 +1052,7 @@ struct mem_section {
>>>
>>> /* See declaration of similar field in struct zone */
>>> unsigned long *pageblock_flags;
>>> -#ifdef CONFIG_CGROUP_MEM_RES_CTLR
>>> +#ifdef CONFIG_PAGE_CGROUP
>>> /*
>>> * If !SPARSEMEM, pgdat doesn't have page_cgroup pointer. We use
>>> * section. (see memcontrol.h/page_cgroup.h about this.)
>>> diff --git a/include/linux/page_cgroup.h b/include/linux/page_cgroup.h
>>> index a88cdba..7bbfe37 100644
>>> --- a/include/linux/page_cgroup.h
>>> +++ b/include/linux/page_cgroup.h
>>> @@ -12,7 +12,7 @@ enum {
>>> #ifndef __GENERATING_BOUNDS_H
>>> #include <generated/bounds.h>
>>>
>>> -#ifdef CONFIG_CGROUP_MEM_RES_CTLR
>>> +#ifdef CONFIG_PAGE_CGROUP
>>> #include <linux/bit_spinlock.h>
>>>
>>> /*
>>> @@ -24,7 +24,7 @@ enum {
>>> */
>>> struct page_cgroup {
>>> unsigned long flags;
>>> - struct mem_cgroup *mem_cgroup;
>>> + struct cgroup *cgroup;
>>> };
>>>
>>
>> This patch seems very bad.
>
> I had to change that to
>
> struct page_cgroup {
> 	unsigned long flags;
> 	struct cgroup_subsys_state *css;
> };
>
> to get memcg to work. We end up changing css.cgroup on cgroupfs mount/umount.
>
Hmm, then the pointer to the memcg can be calculated from this *css.
OK to this.
>>
>> - What is the performance impact to memcg ? Doesn't this add extra overheads
>> to memcg lookup ?
>
> Considering that we are stashing the cgroup_subsys_state, recovering the
> memcg should be a simple pointer addition. I haven't measured the exact
> numbers. Do you have any suggestions on tests I can run?
>
Copy-on-write, parallel page faults, file creation/deletion, etc.
>> - Hugetlb requires a much smaller amount of tracking information than
>> memcg does. I guess you can record the information in page->private
>> if you want.
>
> So if we end up tracking the page cgroup in struct page, all this extra
> overhead will go away. And in most cases we would have both memcg and
> hugetlb enabled by default.
>
>> - This may prevent us from doing the 'reducing size of page_cgroup' work
>>
>
> By reducing, do you mean moving the struct page_cgroup info into struct
> page itself? If so, this should not have any impact, right?
I'm not sure, but doesn't this change affect the rules around
(un)lock_page_cgroup() and the pc->memcg overwriting algorithm?
Let me think... but maybe discussing this without a patch was wrong. Sorry.
> Most of hugetlb's requirements should be similar to memcg's.
>
Yes and no. hugetlb requires only 1/HUGEPAGE_SIZE of the tracking
information. So, as Michal pointed out, if the user _really_ wants to
avoid the overheads of memcg, the effect of cgroup_disable=memory should
be preserved. If you use page_cgroup, you cannot save that memory via the
boot option. This makes the point of 'creating a hugetlb-only subsys to
avoid memcg overheads' unclear. You don't need tracking information per
page, and it can be allocated dynamically. Or please use range-tracking
as Michal proposed.
>> So, strong Nack to this. I guess you can use page->private or some entries
>> in struct page; you have many pages per accounting unit. Please make an
>> effort to avoid using page_cgroup.
>>
>
> HugeTLB already uses page->private of the compound page head to track
> the subpool pointer, so we won't be able to use page->private.
>
You can use pages other than the head/tail. For example, I think you
have 512 pages per 2MB huge page.
Thanks,
-Kame
Kamezawa Hiroyuki <[email protected]> writes:
> You can use pages other than the head/tail. For example, I think you
> have 512 pages per 2MB huge page.
How about the below? This limits hugetlb cgroup usage to hugepages of
compound order 3 or higher (at least 8 normal pages). I guess that is an
acceptable limitation.
static inline struct hugetlb_cgroup *hugetlb_cgroup_from_page(struct page *page)
{
	if (!PageHuge(page))
		return NULL;
	if (compound_order(page) < 3)
		return NULL;
	return (struct hugetlb_cgroup *)page[2].lru.next;
}

static inline
int set_hugetlb_cgroup(struct page *page, struct hugetlb_cgroup *h_cg)
{
	if (!PageHuge(page))
		return -1;
	if (compound_order(page) < 3)
		return -1;
	page[2].lru.next = (void *)h_cg;
	return 0;
}
-aneesh