LinuxLists.cc - [RFC 00/10] fs/dax: Fix FS DAX page reference counts

2024-04-11 00:57:54

Subject: [RFC 00/10] fs/dax: Fix FS DAX page reference counts

FS DAX pages have always maintained their own page reference counts
without following the normal rules for page reference counting. In
particular, pages are considered free when the refcount hits one rather
than zero, and refcounts are not added when mapping the page.

Tracking this requires special PTE bits (PTE_DEVMAP) and a secondary
mechanism for allowing GUP to hold references on the page (see
get_dev_pagemap). However, there doesn't seem to be any reason why FS
DAX pages need their own reference counting scheme.

This RFC is an initial attempt at removing the special reference
counting and instead refcounting FS DAX pages the same as normal pages.

There are still a couple of rough edges. In particular, I haven't
completely removed the devmap PTE bit references from arch-specific
code, and there is probably some more cleanup of dev_pagemap reference
counting that could be done, particularly in mm/gup.c. I also haven't
yet compiled on anything other than x86_64.

Before continuing further with this clean-up, though, I would
appreciate some feedback on the viability of this approach and any
issues I may have overlooked, as I am not intimately familiar with FS
DAX code (or, for that matter, the FS layer in general).

I have, of course, run some basic testing, which didn't reveal any
problems.

Signed-off-by: Alistair Popple <[email protected]>

Alistair Popple (10):
  mm/gup.c: Remove redundant check for PCI P2PDMA page
  mm/hmm: Remove dead check for HugeTLB and FS DAX
  pci/p2pdma: Don't initialise page refcount to one
  fs/dax: Don't track page mapping/index
  fs/dax: Refactor wait for dax idle page
  fs/dax: Add dax_page_free callback
  mm: Allow compound zone device pages
  fs/dax: Properly refcount fs dax pages
  mm/khugepage.c: Warn if trying to scan devmap pmd
  mm: Remove pXX_devmap

Documentation/mm/arch_pgtable_helpers.rst | 6 +-
arch/arm64/include/asm/pgtable.h | 24 +---
arch/powerpc/include/asm/book3s/64/pgtable.h | 42 +-----
arch/powerpc/mm/book3s64/hash_pgtable.c | 3 +-
arch/powerpc/mm/book3s64/pgtable.c | 8 +-
arch/powerpc/mm/book3s64/radix_pgtable.c | 5 +-
arch/powerpc/mm/pgtable.c | 2 +-
arch/x86/include/asm/pgtable.h | 31 +---
drivers/dax/super.c | 2 +-
drivers/gpu/drm/nouveau/nouveau_dmem.c | 2 +-
drivers/nvdimm/pmem.c | 10 +-
drivers/pci/p2pdma.c | 4 +-
fs/dax.c | 158 +++++++-----------
fs/ext4/inode.c | 5 +-
fs/fuse/dax.c | 4 +-
fs/fuse/virtio_fs.c | 8 +-
fs/userfaultfd.c | 2 +-
fs/xfs/xfs_file.c | 4 +-
include/linux/dax.h | 16 ++-
include/linux/huge_mm.h | 11 +-
include/linux/memremap.h | 12 +-
include/linux/migrate.h | 2 +-
include/linux/mm.h | 41 +-----
include/linux/page-flags.h | 6 +-
include/linux/pgtable.h | 17 +--
lib/test_hmm.c | 2 +-
mm/debug_vm_pgtable.c | 51 +------
mm/gup.c | 165 +------------------
mm/hmm.c | 40 +----
mm/huge_memory.c | 180 +++++++++-----------
mm/internal.h | 2 +-
mm/khugepaged.c | 2 +-
mm/mapping_dirty_helpers.c | 4 +-
mm/memory-failure.c | 6 +-
mm/memory.c | 109 ++++++++----
mm/memremap.c | 36 +---
mm/migrate_device.c | 6 +-
mm/mm_init.c | 5 +-
mm/mprotect.c | 2 +-
mm/mremap.c | 5 +-
mm/page_vma_mapped.c | 5 +-
mm/pgtable-generic.c | 7 +-
mm/swap.c | 2 +-
mm/vmscan.c | 5 +-
44 files changed, 338 insertions(+), 721 deletions(-)

base-commit: ffc253263a1375a65fa6c9f62a893e9767fbebfa
--
git-series 0.9.1

2024-04-11 00:58:24

Subject: [RFC 02/10] mm/hmm: Remove dead check for HugeTLB and FS DAX

Subject: [RFC 03/10] pci/p2pdma: Don't initialise page refcount to one

Subject: [RFC 05/10] fs/dax: Refactor wait for dax idle page

Subject: [RFC 06/10] fs/dax: Add dax_page_free callback

Subject: [RFC 07/10] mm: Allow compound zone device pages

Subject: [RFC 08/10] fs/dax: Properly refcount fs dax pages

Subject: [RFC 09/10] mm/khugepage.c: Warn if trying to scan devmap pmd

Subject: [RFC 01/10] mm/gup.c: Remove redundant check for PCI P2PDMA page

Subject: [RFC 04/10] fs/dax: Don't track page mapping/index

Subject: [RFC 10/10] mm: Remove pXX_devmap
