From: Matthew Wilcox <[email protected]>
I've lost count of the number of times I've posted the XArray before,
so time for a new numbering scheme. Here're two earlier versions,
https://lkml.org/lkml/2017/3/17/724
https://lwn.net/Articles/715948/ (this one's more loquacious in its
description of things that are better about the radix tree API than the
XArray).
This time around, I've gone for an approach of many small changes.
Unfortunately, that means you get 62 moderate patches instead of dozens
of big ones.
Some of these can and should go in independently of whether the XArray
is a good idea. The first four fix some things in the test suite.
Patch 6 changes the API for IDR cyclic allocation. Patch 7 rewrites
the IDR 'extended' API that was recently added without my review.
Patch 15 is messy and breaks any time anybody touches the page cache.
I'd like it to go in soon. All it's doing is removing the page cache's
private spinlock and having it explicitly use the spinlock now embedded
in the xarray.
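If you want a feel for the shape of that change, a call site goes
roughly from the first form to the second (identifiers here are
illustrative rather than copied from the patch):

    /* Before: the page cache takes its own spinlock. */
    spin_lock_irq(&mapping->tree_lock);
    radix_tree_delete(&mapping->page_tree, page->index);
    spin_unlock_irq(&mapping->tree_lock);

    /* After: use the spinlock embedded in the xarray / radix tree root. */
    xa_lock_irq(&mapping->page_tree);
    radix_tree_delete(&mapping->page_tree, page->index);
    xa_unlock_irq(&mapping->page_tree);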
Patches 16-30 are all the fun infrastructure, adding pieces of the
xarray API. This is probably a good place to focus review, particularly
if you're intent on critiquing the API.
Patches 31 onwards start actually using the new API, taking advantage of
the built-in locking and memory allocation to get rid of the preload
API. There's probably the most scope for bugs here; I've tried to
understand the locking schemes used in thirty different places, and
maybe I got some of them wrong.
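To give a flavour of those conversions, here's a rough before/after of
a typical allocation site (generic names, not taken from any particular
driver): today it preloads outside its own lock and allocates with
GFP_NOWAIT inside it; with the XArray doing its own locking and
allocation, that collapses into one call.

    /* Before: preload a node, then allocate under the caller's lock. */
    idr_preload(GFP_KERNEL);
    spin_lock(&my_lock);
    id = idr_alloc(&my_idr, ptr, 0, 0, GFP_NOWAIT);
    spin_unlock(&my_lock);
    idr_preload_end();

    /* After: the IDR takes its own lock and allocates internally. */
    id = idr_alloc(&my_idr, ptr, 0, 0, GFP_KERNEL);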
There will need to be many more patches. Dozens, maybe hundreds. Once
we've converted all the calls to radix_tree_insert() over to
xa_store(), we can get rid of the GFP flags stored in the xarray head.
That'll make the mm people a little happier. Once we've got rid of
idr_preload(), we can get rid of the per-CPU cache of radix_tree_nodes.
I have a lot more enhancements on my todo list, but I'd like to get
something merged in the next cycle, and if I wait until my todo list is
empty, I will get nothing merged.
If you looked at earlier iterations, there was much more support for
huge pages. That's not in this revision; I couldn't debug the problems
I was seeing. I still have all that code, and I'm intending to put it
back in; it's just a bit down the todo list right now.
Matthew Wilcox (62):
tools: Make __test_and_clear_bit available
radix tree test suite: Remove ARRAY_SIZE
radix tree test suite: Check reclaim bit
idr test suite: Fix ida_test_random()
radix tree: Add a missing cast to gfp_t
idr: Make cursor explicit for cyclic allocation
idr: Rewrite extended IDR API
Explicitly include radix-tree.h
arm64: Turn flush_dcache_mmap_lock into a no-op
unicore32: Turn flush_dcache_mmap_lock into a no-op
Export __set_page_dirty
xfs: Rename xa_ elements to ail_
fscache: Use appropriate radix tree accessors
xarray: Add the xa_lock to the radix_tree_root
page cache: Use xa_lock
xarray: Replace exceptional entries
xarray: Change definition of sibling entries
xarray: Add definition of struct xarray
xarray: Define struct xa_node
xarray: Add xa_load
xarray: Add xa_get_tag, xa_set_tag and xa_clear_tag
xarray: Add xa_store
xarray: Add xa_cmpxchg
xarray: Add xa_for_each
xarray: Add xa_init
xarray: Add xas_for_each_tag
xarray: Add xa_get_entries and xa_get_tagged
xarray: Add xa_destroy
xarray: Add xas_prev_any
xarray: Add xas_find_any / xas_next_any
Convert IDR to use xarray
ida: Convert to using xarray
page cache: Convert page_cache_next_hole to XArray
page cache: Use xarray for adding pages
page cache: Convert page_cache_tree_delete to xarray
page cache: Convert find_get_entry to xarray
shmem: Convert replace to xarray
shmem: Convert shmem_confirm_swap to XArray
shmem: Convert find_swap_entry to XArray
shmem: Convert shmem_tag_pins to XArray
shmem: Convert shmem_wait_for_pins to XArray
vmalloc: Convert to xarray
brd: Convert to XArray
xfs: Convert m_perag_tree to XArray
xfs: Convert pag_ici_root to XArray
xfs: Convert xfs dquot to XArray
xfs: Convert mru cache to XArray
block: Remove IDR preloading
rxrpc: Remove IDR preloading
cgroup: Remove IDR wrappers
dca: Remove idr_preload calls
ipc: Remove call to idr_preload
irq: Remove call to idr_preload
scsi: Remove idr_preload in st driver
firewire: Remove call to idr_preload
drm: Remove drm_minor_lock and idr_preload
drm: Remove drm_syncobj_fd_to_handle
drm: Remove qxl driver IDR locks
drm: Replace virtio IDRs with IDAs
drm: Replace vmwgfx IDRs with IDAs
net: Redesign act_api use of IDR
mm: Convert page-writeback to XArray
Documentation/cgroup-v1/memory.txt | 2 +-
Documentation/vm/page_migration | 14 +-
arch/arm/include/asm/cacheflush.h | 6 +-
arch/arm64/include/asm/cacheflush.h | 6 +-
arch/nios2/include/asm/cacheflush.h | 6 +-
arch/parisc/include/asm/cacheflush.h | 6 +-
arch/powerpc/include/asm/book3s/64/pgtable.h | 4 +-
arch/powerpc/include/asm/nohash/64/pgtable.h | 4 +-
arch/powerpc/platforms/cell/spufs/sched.c | 2 +-
arch/unicore32/include/asm/cacheflush.h | 6 +-
block/genhd.c | 23 +-
drivers/block/brd.c | 87 +-
drivers/dca/dca-sysfs.c | 22 +-
drivers/firewire/core-cdev.c | 20 +-
drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c | 8 +-
drivers/gpu/drm/drm_dp_aux_dev.c | 5 +-
drivers/gpu/drm/drm_drv.c | 25 +-
drivers/gpu/drm/drm_gem.c | 27 +-
drivers/gpu/drm/drm_syncobj.c | 23 +-
drivers/gpu/drm/etnaviv/etnaviv_gem_submit.c | 6 +-
drivers/gpu/drm/i915/i915_debugfs.c | 4 +-
drivers/gpu/drm/i915/i915_gem.c | 17 +-
drivers/gpu/drm/msm/msm_gem_submit.c | 10 +-
drivers/gpu/drm/qxl/qxl_cmd.c | 26 +-
drivers/gpu/drm/qxl/qxl_drv.h | 2 -
drivers/gpu/drm/qxl/qxl_kms.c | 2 -
drivers/gpu/drm/qxl/qxl_release.c | 12 +-
drivers/gpu/drm/vc4/vc4_gem.c | 4 +-
drivers/gpu/drm/virtio/virtgpu_drv.h | 6 +-
drivers/gpu/drm/virtio/virtgpu_kms.c | 18 +-
drivers/gpu/drm/virtio/virtgpu_vq.c | 12 +-
drivers/gpu/drm/vmwgfx/vmwgfx_drv.c | 6 +-
drivers/gpu/drm/vmwgfx/vmwgfx_drv.h | 2 +-
drivers/gpu/drm/vmwgfx/vmwgfx_resource.c | 28 +-
drivers/infiniband/core/cm.c | 5 +-
drivers/infiniband/hw/mlx4/cm.c | 3 +-
drivers/infiniband/hw/mlx4/mlx4_ib.h | 1 +
drivers/rapidio/rio_cm.c | 3 +-
drivers/rpmsg/qcom_glink_native.c | 7 +-
drivers/scsi/st.c | 18 +-
drivers/staging/lustre/lustre/llite/glimpse.c | 2 +-
drivers/staging/lustre/lustre/mdc/mdc_request.c | 10 +-
drivers/target/target_core_device.c | 4 +-
drivers/usb/host/xhci.h | 1 +
fs/afs/write.c | 2 +-
fs/btrfs/compression.c | 4 +-
fs/btrfs/extent_io.c | 8 +-
fs/btrfs/inode.c | 4 +-
fs/buffer.c | 13 +-
fs/cifs/file.c | 2 +-
fs/dax.c | 204 ++--
fs/f2fs/data.c | 6 +-
fs/f2fs/dir.c | 6 +-
fs/f2fs/gc.c | 2 +-
fs/f2fs/inline.c | 6 +-
fs/f2fs/node.c | 8 +-
fs/fs-writeback.c | 18 +-
fs/fscache/cookie.c | 2 +-
fs/fscache/object.c | 2 +-
fs/inode.c | 11 +-
fs/kernfs/dir.c | 5 +-
fs/nfsd/nfs4state.c | 4 +-
fs/nfsd/state.h | 1 +
fs/nilfs2/btnode.c | 20 +-
fs/nilfs2/page.c | 22 +-
fs/notify/inotify/inotify_user.c | 15 +-
fs/proc/loadavg.c | 2 +-
fs/proc/task_mmu.c | 2 +-
fs/xfs/libxfs/xfs_sb.c | 11 +-
fs/xfs/libxfs/xfs_sb.h | 2 +-
fs/xfs/xfs_aops.c | 15 +-
fs/xfs/xfs_buf_item.c | 10 +-
fs/xfs/xfs_dquot.c | 37 +-
fs/xfs/xfs_dquot_item.c | 11 +-
fs/xfs/xfs_icache.c | 142 +--
fs/xfs/xfs_icache.h | 10 +-
fs/xfs/xfs_inode.c | 24 +-
fs/xfs/xfs_inode_item.c | 22 +-
fs/xfs/xfs_log.c | 6 +-
fs/xfs/xfs_log_recover.c | 80 +-
fs/xfs/xfs_mount.c | 22 +-
fs/xfs/xfs_mount.h | 6 +-
fs/xfs/xfs_mru_cache.c | 71 +-
fs/xfs/xfs_qm.c | 32 +-
fs/xfs/xfs_qm.h | 18 +-
fs/xfs/xfs_trans.c | 18 +-
fs/xfs/xfs_trans_ail.c | 152 +--
fs/xfs/xfs_trans_buf.c | 4 +-
fs/xfs/xfs_trans_priv.h | 42 +-
include/drm/drm_file.h | 7 +-
include/linux/backing-dev.h | 12 +-
include/linux/fs.h | 68 +-
include/linux/fscache.h | 1 +
include/linux/fsnotify_backend.h | 1 +
include/linux/idr.h | 204 ++--
include/linux/kernfs.h | 1 +
include/linux/mm.h | 3 +-
include/linux/pagemap.h | 4 +-
include/linux/pid_namespace.h | 1 +
include/linux/radix-tree.h | 111 +-
include/linux/swapops.h | 19 +-
include/linux/xarray.h | 717 +++++++++++
include/net/act_api.h | 27 +-
include/net/sctp/sctp.h | 1 +
ipc/util.c | 28 +-
kernel/bpf/syscall.c | 8 +-
kernel/cgroup/cgroup.c | 63 +-
kernel/irq/timings.c | 4 +-
kernel/pid.c | 6 +-
kernel/pid_namespace.c | 12 +-
lib/Makefile | 2 +-
lib/dma-debug.c | 1 +
lib/idr.c | 470 ++++----
lib/radix-tree.c | 348 ++----
lib/xarray.c | 1444 +++++++++++++++++++++++
mm/filemap.c | 306 ++---
mm/huge_memory.c | 10 +-
mm/khugepaged.c | 51 +-
mm/madvise.c | 2 +-
mm/memcontrol.c | 4 +-
mm/migrate.c | 31 +-
mm/mincore.c | 2 +-
mm/page-writeback.c | 79 +-
mm/readahead.c | 4 +-
mm/rmap.c | 4 +-
mm/shmem.c | 196 ++-
mm/swap.c | 2 +-
mm/swap_state.c | 17 +-
mm/truncate.c | 34 +-
mm/vmalloc.c | 37 +-
mm/vmscan.c | 12 +-
mm/workingset.c | 34 +-
net/rxrpc/af_rxrpc.c | 2 +-
net/rxrpc/ar-internal.h | 1 +
net/rxrpc/conn_client.c | 21 +-
net/sched/act_api.c | 135 +--
net/sched/cls_basic.c | 20 +-
net/sched/cls_bpf.c | 20 +-
net/sched/cls_flower.c | 32 +-
net/sched/cls_u32.c | 44 +-
net/sctp/associola.c | 3 +-
net/sctp/protocol.c | 1 +
tools/include/asm-generic/bitops.h | 1 +
tools/include/asm-generic/bitops/atomic.h | 9 -
tools/include/asm-generic/bitops/non-atomic.h | 109 ++
tools/include/linux/spinlock.h | 2 +
tools/testing/radix-tree/.gitignore | 2 +
tools/testing/radix-tree/Makefile | 14 +-
tools/testing/radix-tree/idr-test.c | 28 +-
tools/testing/radix-tree/linux.c | 2 +-
tools/testing/radix-tree/linux/kernel.h | 2 -
tools/testing/radix-tree/linux/rcupdate.h | 2 +
tools/testing/radix-tree/linux/xarray.h | 2 +
tools/testing/radix-tree/multiorder.c | 53 +-
tools/testing/radix-tree/test.c | 8 +-
tools/testing/radix-tree/xarray-test.c | 122 ++
156 files changed, 4178 insertions(+), 2454 deletions(-)
create mode 100644 include/linux/xarray.h
create mode 100644 lib/xarray.c
create mode 100644 tools/include/asm-generic/bitops/non-atomic.h
create mode 100644 tools/testing/radix-tree/linux/xarray.h
create mode 100644 tools/testing/radix-tree/xarray-test.c
--
2.15.0
From: Matthew Wilcox <[email protected]>
sparse complains about an invalid type assignment: root->gfp_mask is a
gfp_t (a __bitwise type), but the mask being ANDed into it is a plain
integer. Add a __force cast to silence the warning.
Signed-off-by: Matthew Wilcox <[email protected]>
---
lib/radix-tree.c | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/lib/radix-tree.c b/lib/radix-tree.c
index c8d55565fafa..f00303e0b216 100644
--- a/lib/radix-tree.c
+++ b/lib/radix-tree.c
@@ -178,7 +178,7 @@ static inline void root_tag_clear(struct radix_tree_root *root, unsigned tag)
static inline void root_tag_clear_all(struct radix_tree_root *root)
{
- root->gfp_mask &= (1 << ROOT_TAG_SHIFT) - 1;
+ root->gfp_mask &= (__force gfp_t)((1 << ROOT_TAG_SHIFT) - 1);
}
static inline int root_tag_get(const struct radix_tree_root *root, unsigned tag)
--
2.15.0
On Wed, Nov 22, 2017 at 01:06:42PM -0800, Matthew Wilcox wrote:
> From: Matthew Wilcox <[email protected]>
>
> sparse complains about an invalid type assignment.
>
> Signed-off-by: Matthew Wilcox <[email protected]>
> ---
> lib/radix-tree.c | 2 +-
> 1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/lib/radix-tree.c b/lib/radix-tree.c
> index c8d55565fafa..f00303e0b216 100644
> --- a/lib/radix-tree.c
> +++ b/lib/radix-tree.c
> @@ -178,7 +178,7 @@ static inline void root_tag_clear(struct radix_tree_root *root, unsigned tag)
>
> static inline void root_tag_clear_all(struct radix_tree_root *root)
> {
> - root->gfp_mask &= (1 << ROOT_TAG_SHIFT) - 1;
> + root->gfp_mask &= (__force gfp_t)((1 << ROOT_TAG_SHIFT) - 1);
> }
>
> static inline int root_tag_get(const struct radix_tree_root *root, unsigned tag)
> --
IMO, it would be better to define something for that in radix-tree.h,
like it has been done for ROOT_IS_IDR.
Regards,
-- Luc
From: Matthew Wilcox <[email protected]>
These three files use the radix tree, but rely on an implicit include
of radix-tree.h through fs.h.
Signed-off-by: Matthew Wilcox <[email protected]>
---
drivers/usb/host/xhci.h | 1 +
include/linux/fscache.h | 1 +
lib/dma-debug.c | 1 +
3 files changed, 3 insertions(+)
diff --git a/drivers/usb/host/xhci.h b/drivers/usb/host/xhci.h
index 99a014a920d3..054ce74524af 100644
--- a/drivers/usb/host/xhci.h
+++ b/drivers/usb/host/xhci.h
@@ -15,6 +15,7 @@
#include <linux/usb.h>
#include <linux/timer.h>
#include <linux/kernel.h>
+#include <linux/radix-tree.h>
#include <linux/usb/hcd.h>
#include <linux/io-64-nonatomic-lo-hi.h>
diff --git a/include/linux/fscache.h b/include/linux/fscache.h
index f4ff47d4a893..6a2f631a913f 100644
--- a/include/linux/fscache.h
+++ b/include/linux/fscache.h
@@ -22,6 +22,7 @@
#include <linux/list.h>
#include <linux/pagemap.h>
#include <linux/pagevec.h>
+#include <linux/radix-tree.h>
#if defined(CONFIG_FSCACHE) || defined(CONFIG_FSCACHE_MODULE)
#define fscache_available() (1)
diff --git a/lib/dma-debug.c b/lib/dma-debug.c
index 1b34d210452c..fb4af570ce04 100644
--- a/lib/dma-debug.c
+++ b/lib/dma-debug.c
@@ -22,6 +22,7 @@
#include <linux/dma-mapping.h>
#include <linux/sched/task.h>
#include <linux/stacktrace.h>
+#include <linux/radix-tree.h>
#include <linux/dma-debug.h>
#include <linux/spinlock.h>
#include <linux/vmalloc.h>
--
2.15.0
From: Matthew Wilcox <[email protected]>
This iterator allows the user to efficiently walk a range of the array,
executing the loop body once for each non-NULL entry in that range.
This commit also includes xa_find() and xa_next(), which are helper
functions for xa_for_each() but may also be useful in their own right.
In the xas family of functions, we also have xas_for_each(),
xas_find(), xas_next(), xas_pause() and xas_jump(). xas_pause() allows
an xas_for_each() iteration to be resumed later from the next element,
and xas_jump() allows an iteration to be resumed from a specified index.
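A caller of the convenience iterator ends up looking something like
this (my sketch, not taken from the patch; assume xa is an xarray
populated elsewhere and do_something() is a hypothetical callback):

    void *entry;
    unsigned long index = 0;

    /* Visit every non-NULL entry in the array, in index order. */
    xa_for_each(&xa, entry, index, ULONG_MAX)
        do_something(index, entry);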
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/xarray.h | 96 +++++++++++++++++++++++++++
lib/radix-tree.c | 4 +-
lib/xarray.c | 173 +++++++++++++++++++++++++++++++++++++++++++++++++
3 files changed, 271 insertions(+), 2 deletions(-)
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index 274dd7530e40..08ddad60a43d 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -130,6 +130,35 @@ bool xa_get_tag(struct xarray *, unsigned long index, xa_tag_t);
void *xa_set_tag(struct xarray *, unsigned long index, xa_tag_t);
void *xa_clear_tag(struct xarray *, unsigned long index, xa_tag_t);
+void *xa_find(struct xarray *xa, unsigned long *index, unsigned long max);
+void *xa_next(struct xarray *xa, unsigned long *index, unsigned long max);
+
+/**
+ * xa_for_each() - Iterate over a portion of an XArray.
+ * @xa: XArray.
+ * @entry: Entry retrieved from array.
+ * @index: Index of @entry.
+ * @max: Maximum index to retrieve from array.
+ *
+ * Initialise @index to the minimum index you want to retrieve from
+ * the array. During the iteration, @entry will have the value of the
+ * entry stored in @xa at @index. The iteration will skip all NULL
+ * entries in the array. You may modify @index during the
+ * iteration if you want to skip indices. It is safe to modify the
+ * array during the iteration. At the end of the iteration, @entry will
+ * be set to NULL and @index will have a value less than or equal to max.
+ *
+ * xa_for_each() is O(n.log(n)) while xas_for_each() is O(n). You have
+ * to handle your own locking with xas_for_each(), and if you have to unlock
+ * after each iteration, it will also end up being O(n.log(n)). xa_for_each()
+ * will spin if it hits a retry entry; if you intend to see retry entries,
+ * you should use the xas_for_each() iterator instead. The xas_for_each()
+ * iterator will expand into more inline code than xa_for_each().
+ */
+#define xa_for_each(xa, entry, index, max) \
+ for (entry = xa_find(xa, &index, max); entry; \
+ entry = xa_next(xa, &index, max))
+
#define BITS_PER_XA_VALUE (BITS_PER_LONG - 1)
/**
@@ -391,6 +420,11 @@ static inline bool xas_valid(const struct xa_state *xas)
return !xas_invalid(xas);
}
+static inline bool xas_not_node(struct xa_node *node)
+{
+ return (unsigned long)node < 4096;
+}
+
/**
* xas_retry() - Handle a retry entry.
* @xas: XArray operation state.
@@ -413,6 +447,7 @@ static inline bool xas_retry(struct xa_state *xas, void *entry)
void *xas_load(struct xarray *, struct xa_state *);
void *xas_store(struct xarray *, struct xa_state *, void *entry);
void *xas_create(struct xarray *, struct xa_state *);
+void *xas_find(struct xarray *, struct xa_state *, unsigned long max);
bool xas_get_tag(const struct xarray *, const struct xa_state *, xa_tag_t);
void xas_set_tag(struct xarray *, const struct xa_state *, xa_tag_t);
@@ -421,6 +456,7 @@ void xas_init_tags(struct xarray *, const struct xa_state *);
void xas_destroy(struct xa_state *);
bool xas_nomem(struct xa_state *, gfp_t);
+void xas_pause(struct xa_state *);
/**
* xas_reload() - Refetch an entry from the xarray.
@@ -475,4 +511,64 @@ static inline void xas_set_order(struct xa_state *xas, unsigned long index,
xas->xa_sibs = (1 << (order % XA_CHUNK_SHIFT)) - 1;
xas->xa_node = XAS_RESTART;
}
+
+/* Skip over any of these entries when iterating */
+static inline bool xa_iter_skip(void *entry)
+{
+ return unlikely(!entry ||
+ (xa_is_internal(entry) && entry < XA_RETRY_ENTRY));
+}
+
+/**
+ * xas_next() - Advance iterator to next present entry.
+ * @xa: XArray.
+ * @xas: XArray operation state.
+ * @max: Highest index to return.
+ *
+ * xas_next() is an inline function to optimise xarray traversal for speed.
+ * It is equivalent to calling xas_find(), and will call xas_find() for all
+ * the hard cases.
+ *
+ * Return: The next present entry after the one currently referred to by @xas.
+ */
+static inline void *xas_next(struct xarray *xa, struct xa_state *xas,
+ unsigned long max)
+{
+ struct xa_node *node = xas->xa_node;
+ void *entry;
+
+ if (unlikely(xas_not_node(node) || node->shift))
+ return xas_find(xa, xas, max);
+
+ do {
+ if (unlikely(xas->xa_index >= max))
+ return xas_find(xa, xas, max);
+ if (unlikely(xas->xa_offset == XA_CHUNK_MASK))
+ return xas_find(xa, xas, max);
+ xas->xa_index++;
+ xas->xa_offset++;
+ entry = xa_entry(xa, node, xas->xa_offset);
+ } while (xa_iter_skip(entry));
+
+ return entry;
+}
+
+/**
+ * xas_for_each() - Iterate over a range of an XArray
+ * @xa: XArray.
+ * @xas: XArray operation state.
+ * @entry: Entry retrieved from array.
+ * @max: Maximum index to retrieve from array.
+ *
+ * The loop body will be executed for each entry present in the xarray
+ * between the current xas position and @max. @entry will be set to
+ * the entry retrieved from the xarray. It is safe to delete entries
+ * from the array in the loop body. You should hold either the RCU lock
+ * or the xa_lock while iterating. If you need to drop the lock, call
+ * xas_pause() first.
+ */
+#define xas_for_each(xa, xas, entry, max) \
+ for (entry = xas_find(xa, xas, max); entry; \
+ entry = xas_next(xa, xas, max))
+
#endif /* _LINUX_XARRAY_H */
diff --git a/lib/radix-tree.c b/lib/radix-tree.c
index 507e1842255b..29b38d447497 100644
--- a/lib/radix-tree.c
+++ b/lib/radix-tree.c
@@ -247,7 +247,7 @@ static inline unsigned long node_maxindex(const struct radix_tree_node *node)
return shift_maxindex(node->shift);
}
-static unsigned long next_index(unsigned long index,
+static unsigned long rnext_index(unsigned long index,
const struct radix_tree_node *node,
unsigned long offset)
{
@@ -2146,7 +2146,7 @@ void __rcu **idr_get_free(struct radix_tree_root *root,
if (!rtag_get(node, IDR_FREE, offset)) {
offset = radix_tree_find_next_bit(node, IDR_FREE,
offset + 1);
- start = next_index(start, node, offset);
+ start = rnext_index(start, node, offset);
if (start > max)
return ERR_PTR(-ENOSPC);
while (offset == RADIX_TREE_MAP_SIZE) {
diff --git a/lib/xarray.c b/lib/xarray.c
index 82f39d86fc76..5409048e8b44 100644
--- a/lib/xarray.c
+++ b/lib/xarray.c
@@ -91,12 +91,24 @@ static inline bool tag_any_set(struct xa_node *node, xa_tag_t tag)
tag = (__force xa_tag_t)((__force unsigned)(tag) + 1); \
} while (0)
+/* Returns the index of the next slot in this node. @index may be unaligned. */
+static unsigned long next_index(unsigned long index, struct xa_node *node)
+{
+ return round_up(index + 1, 1UL << node->shift);
+}
+
/* extracts the offset within this node from the index */
static unsigned int get_offset(unsigned long index, struct xa_node *node)
{
return (index >> node->shift) & XA_CHUNK_MASK;
}
+static void xas_add(struct xa_state *xas, unsigned long val)
+{
+ xas->xa_index += (val << xas->xa_node->shift);
+ xas->xa_offset += val;
+}
+
/*
* Starts a walk. If the @xas is already valid, we assume that it's on
* the right path and just return where we've got to. If we're in an
@@ -759,6 +771,103 @@ void xas_init_tags(struct xarray *xa, const struct xa_state *xas)
}
EXPORT_SYMBOL_GPL(xas_init_tags);
+/**
+ * xas_pause() - Pause a walk to drop a lock.
+ * @xas: XArray operation state.
+ *
+ * Some users need to pause a walk and drop the lock they're holding in
+ * order to yield to a higher priority thread or carry out an operation
+ * on an entry. Those users should call this function before they drop
+ * the lock. It resets the @xas to be suitable for the next iteration
+ * of the loop after the user has reacquired the lock. If most entries
+ * found during a walk require you to call xas_pause(), the xa_for_each()
+ * iterator may be more appropriate.
+ *
+ * Note that xas_pause() only works for forward iteration. If a user needs
+ * to pause a reverse iteration, we will need a xas_pause_rev().
+ */
+void xas_pause(struct xa_state *xas)
+{
+ struct xa_node *node = xas->xa_node;
+
+ if (node) {
+ unsigned int offset = xas->xa_offset;
+ xas->xa_index = next_index(xas->xa_index, node);
+ while (++offset < XA_CHUNK_SIZE) {
+ if (!xa_is_sibling(xa_entry(node->root, node, offset)))
+ break;
+ xas->xa_index += 1UL << node->shift;
+ }
+ } else {
+ xas->xa_index++;
+ }
+ xas->xa_node = XAS_RESTART;
+}
+EXPORT_SYMBOL_GPL(xas_pause);
+
+/**
+ * xas_find() - Find the next present entry in the XArray.
+ * @xa: XArray.
+ * @xas: XArray operation state.
+ * @max: Highest index to return.
+ *
+ * If the xas has not yet been walked to an entry, return the entry
+ * which has an index >= xas.xa_index. If it has been walked, the entry
+ * currently being pointed at has been processed, and so we move to the
+ * next entry.
+ *
+ * If no entry is found and the array is smaller than @max, @xas is
+ * set to the restart state and xas->xa_index is set to the smallest index
+ * not yet in the array. This allows @xas to be immediately passed to
+ * xas_create().
+ *
+ * Return: The entry, if found, otherwise NULL.
+ */
+void *xas_find(struct xarray *xa, struct xa_state *xas, unsigned long max)
+{
+ void *entry;
+
+ if (xas_error(xas))
+ return NULL;
+
+ if (xas->xa_node == XAS_RESTART) {
+ entry = xas_load(xa, xas);
+ if (entry || xas_not_node(xas->xa_node))
+ return entry;
+ } else if (!xas->xa_node) {
+ xas->xa_index = 1;
+ xas->xa_node = XAS_RESTART;
+ return NULL;
+ }
+
+ xas->xa_index = next_index(xas->xa_index, xas->xa_node);
+ xas->xa_offset++;
+
+ while (xas->xa_node && (xas->xa_index <= max)) {
+ if (unlikely(xas->xa_offset == XA_CHUNK_SIZE)) {
+ xas->xa_offset = xas->xa_node->offset + 1;
+ xas->xa_node = xa_parent(xa, xas->xa_node);
+ continue;
+ }
+
+ entry = xa_entry(xa, xas->xa_node, xas->xa_offset);
+ if (xa_is_node(entry)) {
+ xas->xa_node = xa_to_node(entry);
+ xas->xa_offset = 0;
+ continue;
+ }
+ if (!xa_iter_skip(entry))
+ return entry;
+
+ xas_add(xas, 1);
+ }
+
+ if (!xas->xa_node)
+ xas->xa_node = XAS_RESTART;
+ return NULL;
+}
+EXPORT_SYMBOL_GPL(xas_find);
+
/**
* xa_load() - Load an entry from an XArray.
* @xa: XArray.
@@ -975,3 +1084,67 @@ void *xa_clear_tag(struct xarray *xa, unsigned long index, xa_tag_t tag)
return entry;
}
EXPORT_SYMBOL(xa_clear_tag);
+
+/**
+ * xa_find() - Search the XArray for a present entry.
+ * @xa: XArray.
+ * @indexp: Pointer to an index.
+ * @max: Maximum index to search to.
+ *
+ * Finds the entry in the xarray with the lowest index that is between
+ * *@indexp and max, inclusive. If an entry is found, updates @indexp to
+ * be the index of the pointer. This function is protected by the RCU read
+ * lock, so it may not find all entries if called in a loop. It will not
+ * return an %XA_RETRY_ENTRY; if you need to see retry entries, use xas_next().
+ *
+ * Return: The entry, if found, otherwise NULL.
+ */
+void *xa_find(struct xarray *xa, unsigned long *indexp, unsigned long max)
+{
+ XA_STATE(xas, *indexp);
+ void *entry;
+
+ rcu_read_lock();
+ do {
+ entry = xas_find(xa, &xas, max);
+ } while (xas_retry(&xas, entry));
+ rcu_read_unlock();
+
+ if (entry)
+ *indexp = xas.xa_index;
+ return entry;
+}
+EXPORT_SYMBOL(xa_find);
+
+/**
+ * xa_next() - Search the XArray for a present entry.
+ * @xa: XArray.
+ * @indexp: Pointer to an index.
+ * @max: Maximum index to search to.
+ *
+ * Finds the entry in @xa with the lowest index that is above *@indexp and
+ * less than or equal to @max. If an entry is found, updates @indexp to be
+ * the index of the pointer. This function is protected by the RCU read
+ * lock, so it may not find all entries if called in a loop. It will not
+ * return an %XA_RETRY_ENTRY; if you need to see retry entries, use xas_next().
+ *
+ * Return: The pointer, if found, otherwise NULL.
+ */
+void *xa_next(struct xarray *xa, unsigned long *indexp, unsigned long max)
+{
+ XA_STATE(xas, *indexp + 1);
+ void *entry;
+
+ rcu_read_lock();
+ do {
+ entry = xas_find(xa, &xas, max);
+ if (xas.xa_index <= *indexp)
+ entry = xas_next(xa, &xas, max);
+ } while (xas_retry(&xas, entry));
+ rcu_read_unlock();
+
+ if (entry)
+ *indexp = xas.xa_index;
+ return entry;
+}
+EXPORT_SYMBOL(xa_next);
--
2.15.0
From: Matthew Wilcox <[email protected]>
The one thing the IDR can do which the XArray cannot is allocate an
entry and store a NULL pointer there. The radix tree was modified to
have this ability, but it added a lot of complexity. Instead, we
introduce the 'Skip entry', which will read back from the XArray as
NULL through the normal API and will be skipped when iterating, but
will keep a slot allocated. The only way to store a skip entry in the
xarray is through the advanced API, so unaware users should be
unaffected by it.
The idr_for_each_entry_ul() iterator becomes an alias for xa_for_each(),
so we drop the idr_get_next_ul() function as it has no users.
The exported IDR API was a weird mix of GPL-only and general symbols;
I converted them all to GPL as there was no way to use the IDR API
without being GPL.
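As a sketch of the usage this preserves (my_idr and obj are
hypothetical), an ID can be reserved before the object is ready:

    /* Reserve an ID before the object is ready to be published. */
    id = idr_alloc(&my_idr, NULL, 1, 0, GFP_KERNEL);

    /* The slot holds a skip entry: lookups see NULL and iteration
     * passes over it, but the ID remains allocated.
     */
    WARN_ON(idr_find(&my_idr, id) != NULL);

    /* Publish the real pointer once the object is initialised. */
    idr_replace(&my_idr, obj, id);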
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/idr.h | 93 +++++++++++++++++---------------
include/linux/xarray.h | 18 +++++--
lib/idr.c | 143 ++++++++++++++++++++++---------------------------
lib/radix-tree.c | 75 +++++++++++++-------------
lib/xarray.c | 4 ++
5 files changed, 170 insertions(+), 163 deletions(-)
diff --git a/include/linux/idr.h b/include/linux/idr.h
index 3c3346441cbc..57945eb0792a 100644
--- a/include/linux/idr.h
+++ b/include/linux/idr.h
@@ -9,62 +9,71 @@
* tables.
*/
-#ifndef __IDR_H__
-#define __IDR_H__
+#ifndef _LINUX_IDR_H
+#define _LINUX_IDR_H
#include <linux/radix-tree.h>
#include <linux/gfp.h>
#include <linux/percpu.h>
+#include <linux/xarray.h>
struct idr {
- struct radix_tree_root idr_rt;
+ struct xarray idxa;
};
-/*
- * The IDR API does not expose the tagging functionality of the radix tree
- * to users. Use tag 0 to track whether a node has free space below it.
- */
-#define IDR_FREE 0
-
-/* Set the IDR flag and the IDR_FREE tag */
-#define IDR_RT_MARKER ((__force gfp_t)(3 << __GFP_BITS_SHIFT))
+#define IDR_INIT_FLAGS XA_FLAGS_TRACK_FREE | XA_FLAGS_TAG(0)
#define IDR_INIT(name) \
{ \
- .idr_rt = RADIX_TREE_INIT(name, IDR_RT_MARKER) \
+ .idxa = __XARRAY_INIT(name.idxa, IDR_INIT_FLAGS) \
}
#define DEFINE_IDR(name) struct idr name = IDR_INIT(name)
+static inline void idr_init(struct idr *idr)
+{
+ __xa_init(&idr->idxa, IDR_INIT_FLAGS);
+}
+
/**
* DOC: idr sync
- * idr synchronization (stolen from radix-tree.h)
+ * idr synchronization
*
- * idr_find() is able to be called locklessly, using RCU. The caller must
- * ensure calls to this function are made within rcu_read_lock() regions.
- * Other readers (lock-free or otherwise) and modifications may be running
- * concurrently.
+ * The IDR manages its own locking, using irqsafe spinlocks for operations
+ * which modify the IDR and RCU for operations which do not. The user of
+ * the IDR may choose to wrap accesses to it in a lock if it needs to
+ * guarantee the IDR does not change during a read access. The easiest way
+ * to do this is to grab the same lock the IDR uses for write accesses
+ * using one of the idr_lock() wrappers.
*
- * It is still required that the caller manage the synchronization and
- * lifetimes of the items. So if RCU lock-free lookups are used, typically
- * this would mean that the items have their own locks, or are amenable to
- * lock-free access; and that the items are freed by RCU (or only freed after
- * having been deleted from the idr tree *and* a synchronize_rcu() grace
- * period).
+ * The caller must still manage the synchronization and lifetimes of the
+ * items. So if RCU lock-free lookups are used, typically this would mean
+ * that the items have their own locks, or are amenable to lock-free access;
+ * and that the items are freed by RCU (or only freed after having been
+ * deleted from the IDR *and* a synchronize_rcu() grace period has elapsed).
*/
-void idr_preload(gfp_t gfp_mask);
+#define idr_lock(idr) xa_lock(&(idr)->idxa)
+#define idr_unlock(idr) xa_unlock(&(idr)->idxa)
+#define idr_lock_bh(idr) xa_lock_bh(&(idr)->idxa)
+#define idr_unlock_bh(idr) xa_unlock_bh(&(idr)->idxa)
+#define idr_lock_irq(idr) xa_lock_irq(&(idr)->idxa)
+#define idr_unlock_irq(idr) xa_unlock_irq(&(idr)->idxa)
+#define idr_lock_irqsave(idr, flags) \
+ xa_lock_irqsave(&(idr)->idxa, flags)
+#define idr_unlock_irqrestore(idr, flags) \
+ xa_unlock_irqrestore(&(idr)->idxa, flags)
+
+void idr_preload(gfp_t);
int idr_alloc(struct idr *, void *, int start, int end, gfp_t);
int __must_check idr_alloc_ul(struct idr *, void *, unsigned long *nextid,
unsigned long max, gfp_t);
int idr_alloc_cyclic(struct idr *, int *cursor, void *entry,
int start, int end, gfp_t);
-int idr_for_each(const struct idr *,
+int idr_for_each(struct idr *,
int (*fn)(int id, void *p, void *data), void *data);
void *idr_get_next(struct idr *, int *nextid);
-void *idr_get_next_ul(struct idr *, unsigned long *nextid);
void *idr_replace(struct idr *, void *, unsigned long id);
-void idr_destroy(struct idr *);
static inline int __must_check idr_alloc_u32(struct idr *idr, void *ptr,
u32 *nextid, unsigned long max, gfp_t gfp)
@@ -77,22 +86,21 @@ static inline int __must_check idr_alloc_u32(struct idr *idr, void *ptr,
static inline void *idr_remove(struct idr *idr, unsigned long id)
{
- return radix_tree_delete_item(&idr->idr_rt, id, NULL);
+ return xa_store(&idr->idxa, id, NULL, GFP_NOWAIT);
}
-static inline void idr_init(struct idr *idr)
+static inline bool idr_is_empty(const struct idr *idr)
{
- INIT_RADIX_TREE(&idr->idr_rt, IDR_RT_MARKER);
+ return xa_empty(&idr->idxa);
}
-static inline bool idr_is_empty(const struct idr *idr)
+static inline void idr_destroy(struct idr *idr)
{
- return radix_tree_empty(&idr->idr_rt) &&
- radix_tree_tagged(&idr->idr_rt, IDR_FREE);
+ xa_destroy(&idr->idxa);
}
/**
- * idr_preload_end - end preload section started with idr_preload()
+ * idr_preload_end() - end preload section started with idr_preload()
*
* Each idr_preload() should be matched with an invocation of this
* function. See idr_preload() for details.
@@ -103,7 +111,7 @@ static inline void idr_preload_end(void)
}
/**
- * idr_find - return pointer for given id
+ * idr_find() - return pointer for given id
* @idr: idr handle
* @id: lookup key
*
@@ -111,12 +119,11 @@ static inline void idr_preload_end(void)
* return indicates that @id is not valid or you passed %NULL in
* idr_get_new().
*
- * This function can be called under rcu_read_lock(), given that the leaf
- * pointers lifetimes are correctly managed.
+ * This function is protected by the RCU read lock.
*/
-static inline void *idr_find(const struct idr *idr, unsigned long id)
+static inline void *idr_find(struct idr *idr, unsigned long id)
{
- return radix_tree_lookup(&idr->idr_rt, id);
+ return xa_load(&idr->idxa, id);
}
/**
@@ -132,7 +139,7 @@ static inline void *idr_find(const struct idr *idr, unsigned long id)
#define idr_for_each_entry(idr, entry, id) \
for (id = 0; ((entry) = idr_get_next(idr, &(id))) != NULL; ++id)
#define idr_for_each_entry_ul(idr, entry, id) \
- for (id = 0; ((entry) = idr_get_next_ul(idr, &(id))) != NULL; ++id)
+ xa_for_each(&(idr)->idxa, entry, id, ULONG_MAX)
/**
* idr_for_each_entry_continue - continue iteration over an idr's elements of a given type
@@ -167,7 +174,7 @@ struct ida {
};
#define IDA_INIT(name) { \
- .ida_rt = RADIX_TREE_INIT(name, IDR_RT_MARKER | GFP_NOWAIT), \
+ .ida_rt = RADIX_TREE_INIT(name, IDR_INIT_FLAGS | GFP_NOWAIT), \
}
#define DEFINE_IDA(name) struct ida name = IDA_INIT(name)
@@ -182,7 +189,7 @@ void ida_simple_remove(struct ida *ida, unsigned int id);
static inline void ida_init(struct ida *ida)
{
- INIT_RADIX_TREE(&ida->ida_rt, IDR_RT_MARKER | GFP_NOWAIT);
+ INIT_RADIX_TREE(&ida->ida_rt, IDR_INIT_FLAGS | GFP_NOWAIT);
}
/**
@@ -201,4 +208,4 @@ static inline bool ida_is_empty(const struct ida *ida)
{
return radix_tree_empty(&ida->ida_rt);
}
-#endif /* __IDR_H__ */
+#endif /* _LINUX_IDR_H */
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index 8ab6c4468c21..8d7cd70beb8e 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -362,7 +362,13 @@ static inline bool xa_is_sibling(void *entry)
(entry < xa_mk_sibling(XA_CHUNK_SIZE - 1));
}
-#define XA_RETRY_ENTRY xa_mk_internal(256)
+#define XA_SKIP_ENTRY xa_mk_internal(256)
+#define XA_RETRY_ENTRY xa_mk_internal(257)
+
+static inline bool xa_is_skip(void *entry)
+{
+ return unlikely(entry == XA_SKIP_ENTRY);
+}
static inline bool xa_is_retry(void *entry)
{
@@ -444,18 +450,20 @@ static inline bool xas_not_node(struct xa_node *node)
}
/**
- * xas_retry() - Handle a retry entry.
+ * xas_retry() - Retry the operation if appropriate.
* @xas: XArray operation state.
* @entry: Entry from xarray.
*
- * An RCU-protected read may see a retry entry as a side-effect of a
- * simultaneous modification. This function sets up the @xas to retry
- * the walk from the head of the array.
+ * The advanced functions may sometimes return an internal entry, such as
+ * a retry entry or a skip entry. This function sets up the @xas to restart
+ * the walk from the head of the array if needed.
*
* Return: true if the operation needs to be retried.
*/
static inline bool xas_retry(struct xa_state *xas, void *entry)
{
+ if (xa_is_skip(entry))
+ return true;
if (!xa_is_retry(entry))
return false;
xas->xa_node = XAS_RESTART;
diff --git a/lib/idr.c b/lib/idr.c
index 50201b5c46e9..713b19e6f1b3 100644
--- a/lib/idr.c
+++ b/lib/idr.c
@@ -8,6 +8,9 @@
DEFINE_PER_CPU(struct ida_bitmap *, ida_bitmap);
static DEFINE_SPINLOCK(simple_ida_lock);
+/* In radix-tree.c temporarily */
+extern bool idr_nomem(struct xa_state *, gfp_t);
+
/**
* idr_alloc_ul() - allocate a large ID
* @idr: idr handle
@@ -29,30 +32,36 @@ static DEFINE_SPINLOCK(simple_ida_lock);
int idr_alloc_ul(struct idr *idr, void *ptr, unsigned long *nextid,
unsigned long max, gfp_t gfp)
{
- struct radix_tree_iter iter;
- void __rcu **slot;
+ XA_STATE(xas, *nextid);
+ unsigned long flags;
+ int err;
- if (WARN_ON_ONCE(radix_tree_is_internal_node(ptr)))
+ if (WARN_ON_ONCE(xa_is_internal(ptr)))
return -EINVAL;
-
- if (WARN_ON_ONCE(!(idr->idr_rt.xa_flags & ROOT_IS_IDR)))
- idr->idr_rt.xa_flags |= IDR_RT_MARKER;
-
- radix_tree_iter_init(&iter, *nextid);
- slot = idr_get_free(&idr->idr_rt, &iter, gfp, max);
- if (IS_ERR(slot))
- return PTR_ERR(slot);
-
- radix_tree_iter_replace(&idr->idr_rt, &iter, slot, ptr);
- radix_tree_iter_tag_clear(&idr->idr_rt, &iter, IDR_FREE);
-
- *nextid = iter.index;
- return 0;
+ if (!ptr)
+ ptr = XA_SKIP_ENTRY;
+
+ do {
+ xa_lock_irqsave(&idr->idxa, flags);
+ xas_find_tag(&idr->idxa, &xas, max, XA_FREE_TAG);
+ if (xas.xa_index > max)
+ xas_set_err(&xas, ENOSPC);
+ xas_store(&idr->idxa, &xas, ptr);
+ xas_clear_tag(&idr->idxa, &xas, XA_FREE_TAG);
+ xa_unlock_irqrestore(&idr->idxa, flags);
+ } while (idr_nomem(&xas, gfp));
+
+ err = xas_error(&xas);
+ if (!err)
+ *nextid = xas.xa_index;
+ xas_destroy(&xas);
+
+ return err;
}
EXPORT_SYMBOL_GPL(idr_alloc_ul);
/**
- * idr_alloc - allocate an id
+ * idr_alloc() - allocate an id
* @idr: idr handle
* @ptr: pointer to be associated with the new id
* @start: the minimum id (inclusive)
@@ -117,10 +126,10 @@ int idr_alloc_cyclic(struct idr *idr, int *cursor, void *ptr,
return id;
}
-EXPORT_SYMBOL(idr_alloc_cyclic);
+EXPORT_SYMBOL_GPL(idr_alloc_cyclic);
/**
- * idr_for_each - iterate through all stored pointers
+ * idr_for_each() - iterate through all stored pointers
* @idr: idr handle
* @fn: function to be called for each pointer
* @data: data passed to callback function
@@ -136,69 +145,41 @@ EXPORT_SYMBOL(idr_alloc_cyclic);
* seen and deleted entries may be seen, but adding and removing entries
* will not cause other entries to be skipped, nor spurious ones to be seen.
*/
-int idr_for_each(const struct idr *idr,
+int idr_for_each(struct idr *idr,
int (*fn)(int id, void *p, void *data), void *data)
{
- struct radix_tree_iter iter;
- void __rcu **slot;
+ unsigned long i = 0;
+ void *p;
- radix_tree_for_each_slot(slot, &idr->idr_rt, &iter, 0) {
- int ret = fn(iter.index, rcu_dereference_raw(*slot), data);
+ xa_for_each(&idr->idxa, p, i, INT_MAX) {
+ int ret = fn(i, p, data);
if (ret)
return ret;
}
return 0;
}
-EXPORT_SYMBOL(idr_for_each);
-
-/**
- * idr_get_next - Find next populated entry
- * @idr: idr handle
- * @nextid: Pointer to lowest possible ID to return
- *
- * Returns the next populated entry in the tree with an ID greater than
- * or equal to the value pointed to by @nextid. On exit, @nextid is updated
- * to the ID of the found value. To use in a loop, the value pointed to by
- * nextid must be incremented by the user.
- */
-void *idr_get_next(struct idr *idr, int *nextid)
-{
- struct radix_tree_iter iter;
- void __rcu **slot;
-
- slot = radix_tree_iter_find(&idr->idr_rt, &iter, *nextid);
- if (!slot)
- return NULL;
-
- *nextid = iter.index;
- return rcu_dereference_raw(*slot);
-}
-EXPORT_SYMBOL(idr_get_next);
+EXPORT_SYMBOL_GPL(idr_for_each);
/**
- * idr_get_next_ul - Find next populated entry
+ * idr_get_next() - Find next populated entry
* @idr: idr handle
- * @nextid: Pointer to lowest possible ID to return
+ * @id: Pointer to lowest possible ID to return
*
* Returns the next populated entry in the tree with an ID greater than
* or equal to the value pointed to by @nextid. On exit, @nextid is updated
* to the ID of the found value. To use in a loop, the value pointed to by
* nextid must be incremented by the user.
*/
-void *idr_get_next_ul(struct idr *idr, unsigned long *nextid)
+void *idr_get_next(struct idr *idr, int *id)
{
- struct radix_tree_iter iter;
- void __rcu **slot;
+ unsigned long index = *id;
+ void *entry = xa_find(&idr->idxa, &index, INT_MAX);
- slot = radix_tree_iter_find(&idr->idr_rt, &iter, *nextid);
- if (!slot)
- return NULL;
-
- *nextid = iter.index;
- return rcu_dereference_raw(*slot);
+ *id = index;
+ return entry;
}
-EXPORT_SYMBOL(idr_get_next_ul);
+EXPORT_SYMBOL_GPL(idr_get_next);
/**
* idr_replace - replace pointer for given id
@@ -216,22 +197,28 @@ EXPORT_SYMBOL(idr_get_next_ul);
*/
void *idr_replace(struct idr *idr, void *ptr, unsigned long id)
{
- struct radix_tree_node *node;
- void __rcu **slot = NULL;
- void *entry;
+ XA_STATE(xas, id);
+ unsigned long flags;
+ void *curr;
- if (WARN_ON_ONCE(radix_tree_is_internal_node(ptr)))
+ if (WARN_ON_ONCE(xa_is_internal(ptr)))
return ERR_PTR(-EINVAL);
-
- entry = __radix_tree_lookup(&idr->idr_rt, id, &node, &slot);
- if (!slot || radix_tree_tag_get(&idr->idr_rt, id, IDR_FREE))
- return ERR_PTR(-ENOENT);
-
- __radix_tree_replace(&idr->idr_rt, node, slot, ptr, NULL);
-
- return entry;
+ if (!ptr)
+ ptr = XA_SKIP_ENTRY;
+
+ xa_lock_irqsave(&idr->idxa, flags);
+ curr = xas_load(&idr->idxa, &xas);
+ if (curr)
+ xas_store(&idr->idxa, &xas, ptr);
+ else
+ curr = ERR_PTR(-ENOENT);
+ xa_unlock_irqrestore(&idr->idxa, flags);
+
+ if (xa_is_skip(curr))
+ return NULL;
+ return curr;
}
-EXPORT_SYMBOL(idr_replace);
+EXPORT_SYMBOL_GPL(idr_replace);
/**
* DOC: IDA description
@@ -262,7 +249,7 @@ EXPORT_SYMBOL(idr_replace);
* Developer's notes:
*
* The IDA uses the functionality provided by the IDR & radix tree to store
- * bitmaps in each entry. The IDR_FREE tag means there is at least one bit
+ * bitmaps in each entry. The XA_FREE_TAG tag means there is at least one bit
* free, unlike the IDR where it means at least one entry is free.
*
* I considered telling the radix tree that each slot is an order-10 node
@@ -368,7 +355,7 @@ int ida_get_new_above(struct ida *ida, int start, int *id)
__set_bit(bit, bitmap->bitmap);
if (bitmap_full(bitmap->bitmap, IDA_BITMAP_BITS))
radix_tree_iter_tag_clear(root, &iter,
- IDR_FREE);
+ XA_FREE_TAG);
} else {
new += bit;
if (new < 0)
@@ -424,7 +411,7 @@ void ida_remove(struct ida *ida, int id)
goto err;
__clear_bit(offset, btmp);
- radix_tree_iter_tag_set(&ida->ida_rt, &iter, IDR_FREE);
+ radix_tree_iter_tag_set(&ida->ida_rt, &iter, XA_FREE_TAG);
if (xa_is_value(bitmap)) {
if (xa_to_value(rcu_dereference_raw(*slot)) == 0)
radix_tree_iter_delete(&ida->ida_rt, &iter, slot);
diff --git a/lib/radix-tree.c b/lib/radix-tree.c
index 29b38d447497..3fbc0751b181 100644
--- a/lib/radix-tree.c
+++ b/lib/radix-tree.c
@@ -572,6 +572,28 @@ int radix_tree_maybe_preload_order(gfp_t gfp_mask, int order)
return __radix_tree_preload(gfp_mask, nr_nodes);
}
+/* Once the IDR users abandon the preload API, we can use xas_nomem */
+bool idr_nomem(struct xa_state *xas, gfp_t gfp)
+{
+ if (xas->xa_node != XAS_ERROR(ENOMEM))
+ return false;
+ xas->xa_alloc = kmem_cache_alloc(radix_tree_node_cachep,
+ gfp | __GFP_NOWARN);
+ if (!xas->xa_alloc) {
+ struct radix_tree_preload *rtp;
+
+ rtp = this_cpu_ptr(&radix_tree_preloads);
+ if (!rtp->nr)
+ return false;
+ xas->xa_alloc = rtp->nodes;
+ rtp->nodes = xas->xa_alloc->parent;
+ rtp->nr--;
+ }
+
+ xas->xa_node = XAS_RESTART;
+ return true;
+}
+
static unsigned radix_tree_load_root(const struct radix_tree_root *root,
struct radix_tree_node **nodep, unsigned long *maxindex)
{
@@ -605,7 +627,7 @@ static int radix_tree_extend(struct radix_tree_root *root, gfp_t gfp,
maxshift += RADIX_TREE_MAP_SHIFT;
entry = rcu_dereference_raw(root->xa_head);
- if (!entry && (!is_idr(root) || root_tag_get(root, IDR_FREE)))
+ if (!entry && (!is_idr(root) || root_tag_get(root, XA_FREE_TAG)))
goto out;
do {
@@ -615,10 +637,10 @@ static int radix_tree_extend(struct radix_tree_root *root, gfp_t gfp,
return -ENOMEM;
if (is_idr(root)) {
- all_tag_set(node, IDR_FREE);
- if (!root_tag_get(root, IDR_FREE)) {
- rtag_clear(node, IDR_FREE, 0);
- root_tag_set(root, IDR_FREE);
+ all_tag_set(node, XA_FREE_TAG);
+ if (!root_tag_get(root, XA_FREE_TAG)) {
+ rtag_clear(node, XA_FREE_TAG, 0);
+ root_tag_set(root, XA_FREE_TAG);
}
} else {
/* Propagate the aggregated tag info to the new child */
@@ -689,8 +711,8 @@ static inline bool radix_tree_shrink(struct radix_tree_root *root,
* one (root->xa_head) as far as dependent read barriers go.
*/
root->xa_head = (void __rcu *)child;
- if (is_idr(root) && !rtag_get(node, IDR_FREE, 0))
- root_tag_clear(root, IDR_FREE);
+ if (is_idr(root) && !rtag_get(node, XA_FREE_TAG, 0))
+ root_tag_clear(root, XA_FREE_TAG);
/*
* We have a dilemma here. The node's slot[0] must not be
@@ -1117,7 +1139,7 @@ static bool node_tag_get(const struct radix_tree_root *root,
/*
* IDR users want to be able to store NULL in the tree, so if the slot isn't
* free, don't adjust the count, even if it's transitioning between NULL and
- * non-NULL. For the IDA, we mark slots as being IDR_FREE while they still
+ * non-NULL. For the IDA, we mark slots as being XA_FREE_TAG while they still
* have empty bits, but it only stores NULL in slots when they're being
* deleted.
*/
@@ -1127,7 +1149,7 @@ static int calculate_count(struct radix_tree_root *root,
{
if (is_idr(root)) {
unsigned offset = get_slot_offset(node, slot);
- bool free = node_tag_get(root, node, IDR_FREE, offset);
+ bool free = node_tag_get(root, node, XA_FREE_TAG, offset);
if (!free)
return 0;
if (!old)
@@ -1958,7 +1980,7 @@ static bool __radix_tree_delete(struct radix_tree_root *root,
int tag;
if (is_idr(root))
- node_tag_set(root, node, IDR_FREE, offset);
+ node_tag_set(root, node, XA_FREE_TAG, offset);
else
for (tag = 0; tag < RADIX_TREE_MAX_TAGS; tag++)
node_tag_clear(root, node, tag, offset);
@@ -2006,7 +2028,7 @@ void *radix_tree_delete_item(struct radix_tree_root *root,
void *entry;
entry = __radix_tree_lookup(root, index, &node, &slot);
- if (!entry && (!is_idr(root) || node_tag_get(root, node, IDR_FREE,
+ if (!entry && (!is_idr(root) || node_tag_get(root, node, XA_FREE_TAG,
get_slot_offset(node, slot))))
return NULL;
@@ -2113,7 +2135,7 @@ void __rcu **idr_get_free(struct radix_tree_root *root,
grow:
shift = radix_tree_load_root(root, &child, &maxindex);
- if (!radix_tree_tagged(root, IDR_FREE))
+ if (!radix_tree_tagged(root, XA_FREE_TAG))
start = max(start, maxindex + 1);
if (start > max)
return ERR_PTR(-ENOSPC);
@@ -2134,7 +2156,7 @@ void __rcu **idr_get_free(struct radix_tree_root *root,
offset, 0, 0);
if (!child)
return ERR_PTR(-ENOMEM);
- all_tag_set(child, IDR_FREE);
+ all_tag_set(child, XA_FREE_TAG);
rcu_assign_pointer(*slot, node_to_entry(child));
if (node)
node->count++;
@@ -2143,8 +2165,8 @@ void __rcu **idr_get_free(struct radix_tree_root *root,
node = entry_to_node(child);
offset = radix_tree_descend(node, &child, start);
- if (!rtag_get(node, IDR_FREE, offset)) {
- offset = radix_tree_find_next_bit(node, IDR_FREE,
+ if (!rtag_get(node, XA_FREE_TAG, offset)) {
+ offset = radix_tree_find_next_bit(node, XA_FREE_TAG,
offset + 1);
start = rnext_index(start, node, offset);
if (start > max)
@@ -2168,32 +2190,11 @@ void __rcu **idr_get_free(struct radix_tree_root *root,
iter->next_index = 1;
iter->node = node;
__set_iter_shift(iter, shift);
- set_iter_tags(iter, node, offset, IDR_FREE);
+ set_iter_tags(iter, node, offset, XA_FREE_TAG);
return slot;
}
-/**
- * idr_destroy - release all internal memory from an IDR
- * @idr: idr handle
- *
- * After this function is called, the IDR is empty, and may be reused or
- * the data structure containing it may be freed.
- *
- * A typical clean-up sequence for objects stored in an idr tree will use
- * idr_for_each() to free all objects, if necessary, then idr_destroy() to
- * free the memory used to keep track of those objects.
- */
-void idr_destroy(struct idr *idr)
-{
- struct radix_tree_node *node = rcu_dereference_raw(idr->idr_rt.xa_head);
- if (radix_tree_is_internal_node(node))
- radix_tree_free_nodes(node);
- idr->idr_rt.xa_head = NULL;
- root_tag_set(&idr->idr_rt, IDR_FREE);
-}
-EXPORT_SYMBOL(idr_destroy);
-
static void
radix_tree_node_ctor(void *arg)
{
diff --git a/lib/xarray.c b/lib/xarray.c
index 38e5fe39bb97..4990ff18db9d 100644
--- a/lib/xarray.c
+++ b/lib/xarray.c
@@ -1078,6 +1078,8 @@ void *xa_load(struct xarray *xa, unsigned long index)
rcu_read_lock();
do {
entry = xas_load(xa, &xas);
+ if (xa_is_skip(entry))
+ entry = NULL;
} while (xas_retry(&xas, entry));
rcu_read_unlock();
@@ -1113,6 +1115,8 @@ void *xa_store(struct xarray *xa, unsigned long index, void *entry, gfp_t gfp)
xa_lock_irqsave(xa, flags);
curr = xas_store(xa, &xas, entry);
xa_unlock_irqrestore(xa, flags);
+ if (xa_is_skip(curr))
+ curr = NULL;
} while (xas_nomem(&xas, gfp));
xas_destroy(&xas);
--
2.15.0
From: Matthew Wilcox <[email protected]>
This first function in the XArray API brings with it a lot of support
infrastructure. The advanced API is based around the xa_state, which is
a more capable version of the radix_tree_iter.
As the test-suite demonstrates, it is possible to use the xarray and
radix tree APIs on the same data structure.
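For a feel of the two levels, here's a sketch of a caller (my_array and
index 5 are arbitrary; assume the array was populated elsewhere):

    DEFINE_XARRAY(my_array);
    XA_STATE(xas, 5);
    void *entry;

    /* Normal API: one call; RCU and retries are handled for you. */
    entry = xa_load(&my_array, 5);

    /* Advanced API: the caller supplies the xa_state and the locking;
     * this is essentially what xa_load() does internally.
     */
    rcu_read_lock();
    do {
        entry = xas_load(&my_array, &xas);
    } while (xas_retry(&xas, entry));
    rcu_read_unlock();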
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/xarray.h | 165 ++++++++++++++++++++++++++++
lib/Makefile | 2 +-
lib/xarray.c | 137 +++++++++++++++++++++++
tools/testing/radix-tree/.gitignore | 2 +
tools/testing/radix-tree/Makefile | 12 +-
tools/testing/radix-tree/linux/radix-tree.h | 1 -
tools/testing/radix-tree/linux/rcupdate.h | 1 +
tools/testing/radix-tree/linux/xarray.h | 2 +
tools/testing/radix-tree/xarray-test.c | 56 ++++++++++
9 files changed, 373 insertions(+), 5 deletions(-)
create mode 100644 lib/xarray.c
create mode 100644 tools/testing/radix-tree/linux/xarray.h
create mode 100644 tools/testing/radix-tree/xarray-test.c
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index 1513a9e85580..0e736d2db049 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -46,7 +46,10 @@
* The advanced API is more flexible but has fewer safeguards.
*/
+#include <linux/bug.h>
#include <linux/compiler.h>
+#include <linux/kernel.h>
+#include <linux/rcupdate.h>
#include <linux/spinlock.h>
#include <linux/types.h>
@@ -77,6 +80,8 @@ struct xarray {
#define DEFINE_XARRAY(name) struct xarray name = XARRAY_INIT(name)
+void *xa_load(struct xarray *, unsigned long index);
+
#define BITS_PER_XA_VALUE (BITS_PER_LONG - 1)
/**
@@ -128,6 +133,18 @@ static inline bool xa_is_value(void *entry)
spin_unlock_irqrestore(&(xa)->xa_lock, flags)
#define xa_lock_held(xa) lockdep_is_held(&(xa)->xa_lock)
+#ifdef XA_DEBUG
+void xa_dump(const struct xarray *);
+void xa_dump_node(const struct xa_node *);
+#define XA_BUG_ON(node, x) do { \
+ if ((x) && (node)) \
+ xa_dump_node(node); \
+ BUG_ON(x); \
+ } while (0)
+#else
+#define XA_BUG_ON(node, x) do { } while (0)
+#endif
+
/*
* The xarray is constructed out of a set of 'chunks' of pointers. Choosing
* the best chunk size requires some tradeoffs. A power of two recommends
@@ -168,6 +185,30 @@ struct xa_node {
unsigned long tags[XA_MAX_TAGS][XA_TAG_LONGS];
};
+static inline void *xa_head(struct xarray *xa)
+{
+ return rcu_dereference_check(xa->xa_head, xa_lock_held(xa));
+}
+
+static inline void *xa_head_locked(struct xarray *xa)
+{
+ return rcu_dereference_protected(xa->xa_head, xa_lock_held(xa));
+}
+
+static inline void *xa_entry(struct xarray *xa,
+ const struct xa_node *node, unsigned int offset)
+{
+ XA_BUG_ON(node, offset >= XA_CHUNK_SIZE);
+ return rcu_dereference_check(node->slots[offset], xa_lock_held(xa));
+}
+
+static inline void *xa_entry_locked(struct xarray *xa,
+ const struct xa_node *node, unsigned int offset)
+{
+ XA_BUG_ON(node, offset >= XA_CHUNK_SIZE);
+ return rcu_dereference_protected(node->slots[offset], xa_lock_held(xa));
+}
+
/*
* Internal entries have the bottom two bits set to the value 10b. Most
* internal entries are pointers to the next node in the tree. Since the
@@ -190,6 +231,11 @@ static inline bool xa_is_internal(void *entry)
return ((unsigned long)entry & 3) == 2;
}
+static inline struct xa_node *xa_to_node(void *entry)
+{
+ return (struct xa_node *)((unsigned long)entry & ~3UL);
+}
+
static inline bool xa_is_node(void *entry)
{
return xa_is_internal(entry) && (unsigned long)entry > 4096;
@@ -214,4 +260,123 @@ static inline bool xa_is_sibling(void *entry)
#define XA_RETRY_ENTRY xa_mk_internal(256)
+static inline bool xa_is_retry(void *entry)
+{
+ return unlikely(entry == XA_RETRY_ENTRY);
+}
+
+typedef void (*xa_update_node_t)(struct xa_node *);
+
+/*
+ * The xa_state is opaque to its users. It contains various different pieces
+ * of state involved in the current operation on the XArray. It should be
+ * declared on the stack and passed between the various internal routines.
+ * The various elements in it should not be accessed directly, but only
+ * through the provided accessor functions. The below documentation is for
+ * the benefit of those working on the code, not for users of the XArray.
+ *
+ * @xa_node usually points to the xa_node containing the slot we're operating
+ * on (and @xa_offset is the offset in the slots array). However, it can
+ * also have the value NULL if there is a single entry at index 0; it can
+ * have the value XAS_RESTART if the xa_state has not yet been walked to
+ * the correct position in the tree, and it can have an XAS_ERROR value to
+ * encode any error which may have occurred during the operation.
+ */
+struct xa_state {
+ unsigned long xa_index;
+ unsigned char xa_shift;
+ unsigned char xa_sibs;
+ unsigned char xa_offset;
+ struct xa_node *xa_node;
+ struct xa_node *xa_alloc;
+ xa_update_node_t xa_update;
+};
+
+/*
+ * We encode errnos in the xas->xa_node. If an error has happened, we need to
+ * drop the lock to fix it, and once we've done so the xa_state is invalid.
+ */
+#define XAS_ERROR(errno) ((struct xa_node *)((errno << 1) | 1))
+#define XAS_RESTART XAS_ERROR(0)
+
+#define __XA_STATE(index) (struct xa_state) { \
+ .xa_index = index, \
+ .xa_shift = 0, \
+ .xa_sibs = 0, \
+ .xa_offset = 0, \
+ .xa_node = XAS_RESTART, \
+ .xa_alloc = NULL, \
+ .xa_update = NULL \
+}
+
+#define XA_STATE(name, index) \
+ struct xa_state name = __XA_STATE(index)
+
+static inline int xas_error(const struct xa_state *xas)
+{
+ unsigned long v = (unsigned long)xas->xa_node;
+ return (v & 1) ? -(v >> 1) : 0;
+}
+
+static inline void xas_set_err(struct xa_state *xas, unsigned long err)
+{
+ XA_BUG_ON(NULL, err > MAX_ERRNO || !err);
+ xas->xa_node = XAS_ERROR(err);
+}
+
+static inline bool xas_invalid(const struct xa_state *xas)
+{
+ return (unsigned long)xas->xa_node & 1;
+}
+
+static inline bool xas_valid(const struct xa_state *xas)
+{
+ return !xas_invalid(xas);
+}
+
+/**
+ * xas_retry() - Handle a retry entry.
+ * @xas: XArray operation state.
+ * @entry: Entry from xarray.
+ *
+ * An RCU-protected read may see a retry entry as a side-effect of a
+ * simultaneous modification. This function sets up the @xas to retry
+ * the walk from the head of the array.
+ *
+ * Return: true if the operation needs to be retried.
+ */
+static inline bool xas_retry(struct xa_state *xas, void *entry)
+{
+ if (!xa_is_retry(entry))
+ return false;
+ xas->xa_node = XAS_RESTART;
+ return true;
+}
+
+void *xas_load(struct xarray *, struct xa_state *);
+
+/**
+ * xas_reload() - Refetch an entry from the xarray.
+ * @xa: XArray.
+ * @xas: XArray operation state.
+ *
+ * Use this function to check that a previously loaded entry still has
+ * the same value. This is useful for the lockless pagecache lookup where
+ * we walk the array with only the RCU lock to protect us, lock the page,
+ * then check that the page hasn't moved since we looked it up.
+ *
+ * The caller guarantees that @xas is still valid. If it may be in an
+ * error or restart state, call xas_load() instead.
+ *
+ * Return: The entry at this location in the xarray.
+ */
+static inline void *xas_reload(struct xarray *xa, struct xa_state *xas)
+{
+ struct xa_node *node = xas->xa_node;
+
+ if (node)
+ return xa_entry(xa, node, xas->xa_offset);
+ return xa_head(xa);
+}
+
#endif /* _LINUX_XARRAY_H */
diff --git a/lib/Makefile b/lib/Makefile
index d11c48ec8ffd..6aa523acc7c1 100644
--- a/lib/Makefile
+++ b/lib/Makefile
@@ -18,7 +18,7 @@ KCOV_INSTRUMENT_debugobjects.o := n
KCOV_INSTRUMENT_dynamic_debug.o := n
lib-y := ctype.o string.o vsprintf.o cmdline.o \
- rbtree.o radix-tree.o dump_stack.o timerqueue.o\
+ rbtree.o radix-tree.o dump_stack.o timerqueue.o xarray.o \
idr.o int_sqrt.o extable.o \
sha1.o chacha20.o irq_regs.o argv_split.o \
flex_proportions.o ratelimit.o show_mem.o \
diff --git a/lib/xarray.c b/lib/xarray.c
new file mode 100644
index 000000000000..1f7d30a8b61f
--- /dev/null
+++ b/lib/xarray.c
@@ -0,0 +1,137 @@
+/*
+ * XArray implementation
+ * Copyright (c) 2017 Microsoft Corporation
+ *
+ * This program is free software; you can redistribute it and/or modify it
+ * under the terms and conditions of the GNU General Public License,
+ * version 2, as published by the Free Software Foundation.
+ *
+ * This program is distributed in the hope it will be useful, but WITHOUT
+ * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or
+ * FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for
+ * more details.
+ */
+
+#include <linux/export.h>
+#include <linux/gfp.h>
+#include <linux/radix-tree.h>
+#include <linux/xarray.h>
+
+/*
+ * Coding conventions in this file:
+ *
+ * @xa is used to refer to the entire xarray.
+ * @xas is the 'xarray operation state'. It may be either a pointer to
+ * an xa_state, or an xa_state stored on the stack. This is an unfortunate
+ * ambiguity.
+ * @index is the index of the entry being operated on
+ * @tag is an xa_tag_t; a small number indicating one of the tag bits.
+ * @node refers to an xa_node; usually the primary one being operated on by
+ * this function.
+ * @offset is the index into the slots array inside an xa_node.
+ * @parent refers to the @xa_node closer to the head than @node.
+ * @entry refers to something stored in a slot in the xarray
+ */
+
+/* extracts the offset within this node from the index */
+static unsigned int get_offset(unsigned long index, struct xa_node *node)
+{
+ return (index >> node->shift) & XA_CHUNK_MASK;
+}
+
+/*
+ * Starts a walk. If the @xas is already valid, we assume that it's on
+ * the right path and just return where we've got to. If we're in an
+ * error state, return NULL. If the index is outside the current scope
+ * of the xarray, return NULL without changing @xas->xa_node. Otherwise
+ * set @xas->xa_node to NULL and return the current head of the array.
+ */
+static void *xas_start(struct xarray *xa, struct xa_state *xas)
+{
+ void *entry;
+
+ if (xas_valid(xas))
+ return xas_reload(xa, xas);
+ if (xas_error(xas))
+ return NULL;
+
+ entry = xa_head(xa);
+ if (!xa_is_node(entry)) {
+ if (xas->xa_index)
+ return NULL;
+ } else {
+ if ((xas->xa_index >> xa_to_node(entry)->shift) > XA_CHUNK_MASK)
+ return NULL;
+ }
+
+ xas->xa_node = NULL;
+ return entry;
+}
+
+static void *xas_descend(struct xarray *xa, struct xa_state *xas,
+ struct xa_node *node)
+{
+ unsigned int offset = get_offset(xas->xa_index, node);
+ void *entry = xa_entry(xa, node, offset);
+
+ if (xa_is_sibling(entry)) {
+ offset = xa_to_sibling(entry);
+ entry = xa_entry(xa, node, offset);
+ }
+
+ xas->xa_node = node;
+ xas->xa_offset = offset;
+ return entry;
+}
+
+/**
+ * xas_load() - Load an entry from the XArray (advanced).
+ * @xa: XArray.
+ * @xas: XArray operation state.
+ *
+ * Usually walks the @xas to the appropriate state to load the entry stored
+ * at xa_index. However, it will do nothing and return NULL if @xas is
+ * holding an error. If the xa_shift indicates we're operating on a
+ * multislot entry, it will terminate early and potentially return an
+ * internal entry. xas_load() will never expand the tree (see xas_create()).
+ *
+ * The caller should hold the xa_lock or the RCU lock.
+ *
+ * Return: Usually an entry in the XArray, but see description for exceptions.
+ */
+void *xas_load(struct xarray *xa, struct xa_state *xas)
+{
+ void *entry = xas_start(xa, xas);
+
+ while (xa_is_node(entry)) {
+ struct xa_node *node = xa_to_node(entry);
+
+ if (xas->xa_shift > node->shift)
+ break;
+ entry = xas_descend(xa, xas, node);
+ }
+ return entry;
+}
+EXPORT_SYMBOL_GPL(xas_load);
+
+/**
+ * xa_load() - Load an entry from an XArray.
+ * @xa: XArray.
+ * @index: index into array.
+ *
+ * Return: The entry at @index in @xa.
+ */
+void *xa_load(struct xarray *xa, unsigned long index)
+{
+ XA_STATE(xas, index);
+ void *entry;
+
+ rcu_read_lock();
+ do {
+ entry = xas_load(xa, &xas);
+ } while (xas_retry(&xas, entry));
+ rcu_read_unlock();
+
+ return entry;
+}
+EXPORT_SYMBOL(xa_load);
diff --git a/tools/testing/radix-tree/.gitignore b/tools/testing/radix-tree/.gitignore
index d4706c0ffceb..833136896b91 100644
--- a/tools/testing/radix-tree/.gitignore
+++ b/tools/testing/radix-tree/.gitignore
@@ -4,3 +4,5 @@ idr-test
main
multiorder
radix-tree.c
+xarray.c
+xarray-test
diff --git a/tools/testing/radix-tree/Makefile b/tools/testing/radix-tree/Makefile
index ebb12224e258..d0ef117941e6 100644
--- a/tools/testing/radix-tree/Makefile
+++ b/tools/testing/radix-tree/Makefile
@@ -3,10 +3,11 @@
CFLAGS += -I. -I../../include -g -O2 -Wall -D_LGPL_SOURCE -fsanitize=address
LDFLAGS += -fsanitize=address
LDLIBS+= -lpthread -lurcu
-TARGETS = main idr-test multiorder
-CORE_OFILES := radix-tree.o idr.o linux.o test.o find_bit.o
+TARGETS = main idr-test multiorder xarray-test
+CORE_OFILES := radix-tree.o idr.o xarray.o linux.o test.o find_bit.o
OFILES = main.o $(CORE_OFILES) regression1.o regression2.o regression3.o \
- tag_check.o multiorder.o idr-test.o iteration_check.o benchmark.o
+ tag_check.o multiorder.o idr-test.o iteration_check.o benchmark.o \
+ xarray-test.o
ifndef SHIFT
SHIFT=3
@@ -23,6 +24,8 @@ main: $(OFILES)
idr-test: idr-test.o $(CORE_OFILES)
+xarray-test: xarray-test.o $(CORE_OFILES)
+
multiorder: multiorder.o $(CORE_OFILES)
clean:
@@ -42,6 +45,9 @@ radix-tree.c: ../../../lib/radix-tree.c
idr.c: ../../../lib/idr.c
sed -e 's/^static //' -e 's/__always_inline //' -e 's/inline //' < $< > $@
+xarray.c: ../../../lib/xarray.c
+ sed -e 's/^static //' -e 's/__always_inline //' -e 's/inline //' < $< > $@
+
.PHONY: mapshift
mapshift:
diff --git a/tools/testing/radix-tree/linux/radix-tree.h b/tools/testing/radix-tree/linux/radix-tree.h
index de3f655caca3..24f13d27a8da 100644
--- a/tools/testing/radix-tree/linux/radix-tree.h
+++ b/tools/testing/radix-tree/linux/radix-tree.h
@@ -4,7 +4,6 @@
#include "generated/map-shift.h"
#include "../../../../include/linux/radix-tree.h"
-#include <linux/xarray.h>
extern int kmalloc_verbose;
extern int test_verbose;
diff --git a/tools/testing/radix-tree/linux/rcupdate.h b/tools/testing/radix-tree/linux/rcupdate.h
index 73ed33658203..25010bf86c1d 100644
--- a/tools/testing/radix-tree/linux/rcupdate.h
+++ b/tools/testing/radix-tree/linux/rcupdate.h
@@ -6,5 +6,6 @@
#define rcu_dereference_raw(p) rcu_dereference(p)
#define rcu_dereference_protected(p, cond) rcu_dereference(p)
+#define rcu_dereference_check(p, cond) rcu_dereference(p)
#endif
diff --git a/tools/testing/radix-tree/linux/xarray.h b/tools/testing/radix-tree/linux/xarray.h
new file mode 100644
index 000000000000..4440878501c4
--- /dev/null
+++ b/tools/testing/radix-tree/linux/xarray.h
@@ -0,0 +1,2 @@
+#include "../generated/map-shift.h"
+#include "../../../../include/linux/xarray.h"
diff --git a/tools/testing/radix-tree/xarray-test.c b/tools/testing/radix-tree/xarray-test.c
new file mode 100644
index 000000000000..3f8f19cb3739
--- /dev/null
+++ b/tools/testing/radix-tree/xarray-test.c
@@ -0,0 +1,56 @@
+/*
+ * xarray-test.c: Test the XArray API
+ * Copyright (c) 2017 Microsoft Corporation <[email protected]>
+ *
+ * This program is free software; you can redistribute it and/or modify it
+ * under the terms and conditions of the GNU General Public License,
+ * version 2, as published by the Free Software Foundation.
+ *
+ * This program is distributed in the hope it will be useful, but WITHOUT
+ * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or
+ * FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for
+ * more details.
+ */
+#include <linux/bitmap.h>
+#include <linux/xarray.h>
+#include <linux/slab.h>
+#include <linux/kernel.h>
+#include <linux/errno.h>
+
+#include "test.h"
+
+void check_xa_load(struct xarray *xa)
+{
+ unsigned long i, j;
+
+ for (i = 0; i < 1024; i++) {
+ for (j = 0; j < 1024; j++) {
+ void *entry = xa_load(xa, j);
+ if (j < i)
+ assert(xa_to_value(entry) == j);
+ else
+ assert(!entry);
+ }
+ radix_tree_insert(xa, i, xa_mk_value(i));
+ }
+}
+
+void xarray_checks(void)
+{
+ RADIX_TREE(array, GFP_KERNEL);
+
+ check_xa_load(&array);
+
+ item_kill_tree(&array);
+}
+
+int __weak main(void)
+{
+ radix_tree_init();
+ xarray_checks();
+ radix_tree_cpu_dead(1);
+ rcu_barrier();
+ if (nr_allocated)
+ printf("nr_allocated = %d\n", nr_allocated);
+ return 0;
+}
--
2.15.0
From: Matthew Wilcox <[email protected]>
This iterator operates across each tagged entry in the specified range.
We do not yet have a user for an xa_for_each_tag iterator, but it would
be straightforward to add one if needed. This commit also includes
xas_find_tag() and xas_next_tag().
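
As a rough sketch of the intended usage (not code from this patch; the
helper name is made up, and XA_FREE_TAG is simply a tag value already
used elsewhere in this series), a caller holding the RCU read lock could
walk every tagged entry below a given index like so:

	static void dump_free_entries(struct xarray *xa, unsigned long max)
	{
		XA_STATE(xas, 0);
		void *entry;

		rcu_read_lock();
		xas_for_each_tag(xa, &xas, entry, max, XA_FREE_TAG)
			pr_info("index %lu: entry %p\n", xas.xa_index, entry);
		rcu_read_unlock();
	}

The same loop can be run under the xa_lock when the loop body needs to
modify the array.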
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/xarray.h | 70 +++++++++++++++++++++++++++++++++++++++++++
lib/xarray.c | 80 ++++++++++++++++++++++++++++++++++++++++++++++++++
2 files changed, 150 insertions(+)
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index 19a3974fdc4f..427c792ddb2a 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -465,6 +465,8 @@ void *xas_find(struct xarray *, struct xa_state *, unsigned long max);
bool xas_get_tag(const struct xarray *, const struct xa_state *, xa_tag_t);
void xas_set_tag(struct xarray *, const struct xa_state *, xa_tag_t);
void xas_clear_tag(struct xarray *, const struct xa_state *, xa_tag_t);
+void *xas_find_tag(struct xarray *, struct xa_state *, unsigned long max,
+ xa_tag_t);
void xas_init_tags(struct xarray *, const struct xa_state *);
void xas_destroy(struct xa_state *);
@@ -566,6 +568,55 @@ static inline void *xas_next(struct xarray *xa, struct xa_state *xas,
return entry;
}
+static inline unsigned int xas_find_chunk(struct xa_state *xas, bool advance,
+ xa_tag_t tag)
+{
+ unsigned long *addr = xas->xa_node->tags[(__force unsigned)tag];
+ unsigned int offset = xas->xa_offset;
+
+ if (advance)
+ offset++;
+ if (XA_CHUNK_SIZE == BITS_PER_LONG) {
+ unsigned long data = *addr & (~0UL << offset);
+ if (data)
+ return __ffs(data);
+ return XA_CHUNK_SIZE;
+ }
+
+ return find_next_bit(addr, XA_CHUNK_SIZE, offset);
+}
+
+/**
+ * xas_next_tag() - Advance iterator to next tagged entry.
+ * @xa: XArray.
+ * @xas: XArray operation state.
+ * @max: Highest index to return.
+ * @tag: Tag to search for.
+ *
+ * xas_next_tag() is an inline function to optimise xarray traversal for
+ * speed. It is equivalent to calling xas_find_tag(), and will call
+ * xas_find_tag() for all the hard cases.
+ *
+ * Return: The next tagged entry after the one currently referred to by @xas.
+ */
+static inline void *xas_next_tag(struct xarray *xa, struct xa_state *xas,
+ unsigned long max, xa_tag_t tag)
+{
+ struct xa_node *node = xas->xa_node;
+ unsigned int offset;
+
+ if (unlikely(xas_not_node(node) || node->shift))
+ return xas_find_tag(xa, xas, max, tag);
+ offset = xas_find_chunk(xas, true, tag);
+ xas->xa_offset = offset;
+ xas->xa_index = (xas->xa_index & ~XA_CHUNK_MASK) + offset;
+ if (xas->xa_index > max)
+ return NULL;
+ if (offset == XA_CHUNK_SIZE)
+ return xas_find_tag(xa, xas, max, tag);
+ return xa_entry(xa, node, offset);
+}
+
/**
* xas_for_each() - Iterate over a range of an XArray
* @xa: XArray.
@@ -584,4 +635,23 @@ static inline void *xas_next(struct xarray *xa, struct xa_state *xas,
for (entry = xas_find(xa, xas, max); entry; \
entry = xas_next(xa, xas, max))
+/**
+ * xas_for_each_tag() - Iterate over a range of an XArray
+ * @xa: XArray.
+ * @xas: XArray operation state.
+ * @entry: Entry retrieved from array.
+ * @max: Maximum index to retrieve from array.
+ * @tag: Tag to search for.
+ *
+ * The loop body will be executed for each tagged entry in the xarray
+ * between the current xas position and @max. @entry will be set to
+ * the entry retrieved from the xarray. It is safe to delete entries
+ * from the array in the loop body. You should hold either the RCU lock
+ * or the xa_lock while iterating. If you need to drop the lock, call
+ * xas_pause() first.
+ */
+#define xas_for_each_tag(xa, xas, entry, max, tag) \
+ for (entry = xas_find_tag(xa, xas, max, tag); entry; \
+ entry = xas_next_tag(xa, xas, max, tag))
+
#endif /* _LINUX_XARRAY_H */
diff --git a/lib/xarray.c b/lib/xarray.c
index 59f45c07988f..ea2dbd343380 100644
--- a/lib/xarray.c
+++ b/lib/xarray.c
@@ -868,6 +868,86 @@ void *xas_find(struct xarray *xa, struct xa_state *xas, unsigned long max)
}
EXPORT_SYMBOL_GPL(xas_find);
+/**
+ * xas_find_tag() - Find the next tagged entry in the XArray.
+ * @xa: XArray.
+ * @xas: XArray operation state.
+ * @max: Highest index to return.
+ * @tag: Tag number to search for.
+ *
+ * If the xas has not yet been walked to an entry, return the tagged entry
+ * which has an index >= xas.xa_index. If it has been walked, the entry
+ * currently being pointed at has been processed, and so we move to the
+ * next tagged entry.
+ *
+ * If no tagged entry is found and the array is smaller than @max, @xas is
+ * set to the restart state and xas->xa_index is set to the smallest index
+ * not yet in the array. This allows @xas to be immediately passed to
+ * xas_create().
+ *
+ * Return: The entry, if found, otherwise NULL.
+ */
+void *xas_find_tag(struct xarray *xa, struct xa_state *xas, unsigned long max,
+ xa_tag_t tag)
+{
+ bool advance = true;
+ unsigned int offset;
+ void *entry;
+
+ if (xas_error(xas))
+ return NULL;
+
+ if (xas->xa_node == NULL) {
+ xas->xa_index = 1;
+ goto out;
+ } else if (xas->xa_node == XAS_RESTART) {
+ advance = false;
+ entry = xa_head(xa);
+ if (xas->xa_index > max_index(entry))
+ goto out;
+ if (!xa_is_node(entry)) {
+ if (xa_tagged(xa, tag)) {
+ xas->xa_node = NULL;
+ return entry;
+ }
+ xas->xa_index = 1;
+ goto out;
+ }
+ xas->xa_node = xa_to_node(entry);
+ xas->xa_offset = xas->xa_index >> xas->xa_node->shift;
+ }
+
+ while (xas->xa_index <= max) {
+ if (unlikely(xas->xa_offset == XA_CHUNK_SIZE)) {
+ xas->xa_offset = xas->xa_node->offset + 1;
+ xas->xa_node = xa_parent(xa, xas->xa_node);
+ if (!xas->xa_node)
+ break;
+ advance = false;
+ continue;
+ }
+
+ offset = xas_find_chunk(xas, advance, tag);
+ xas_add(xas, offset - xas->xa_offset);
+ if (offset == XA_CHUNK_SIZE) {
+ advance = false;
+ continue;
+ }
+
+ entry = xa_entry(xa, xas->xa_node, xas->xa_offset);
+ if (!xa_is_node(entry))
+ return entry;
+ xas->xa_node = xa_to_node(entry);
+ xas->xa_offset = get_offset(xas->xa_index, xas->xa_node);
+ }
+
+ out:
+ if (!xas->xa_node)
+ xas->xa_node = XAS_RESTART;
+ return NULL;
+}
+EXPORT_SYMBOL_GPL(xas_find_tag);
+
/**
* __xa_init() - Initialise an empty XArray
* @xa: XArray.
--
2.15.0
From: Matthew Wilcox <[email protected]>
Use the xarray infrastructure the same way we used the radix tree infrastructure.
This lets us get rid of idr_get_free() from the radix tree code.
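
Nothing changes for IDA users; only the internals move from the radix
tree to the xarray. A minimal caller-side sketch (hypothetical function
names, shown only for orientation):

	static DEFINE_IDA(my_ida);

	static int example_alloc(void)
	{
		/* Lowest free id >= 0; serialised on the ida's own xa_lock
		 * now that simple_ida_lock is gone. */
		return ida_simple_get(&my_ida, 0, 0, GFP_KERNEL);
	}

	static void example_free(int id)
	{
		ida_simple_remove(&my_ida, id);
	}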
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/idr.h | 8 +-
include/linux/radix-tree.h | 4 -
lib/idr.c | 239 +++++++++++++++++++++++----------------------
lib/radix-tree.c | 86 +---------------
4 files changed, 128 insertions(+), 209 deletions(-)
diff --git a/include/linux/idr.h b/include/linux/idr.h
index 57945eb0792a..8f62f7ba79fd 100644
--- a/include/linux/idr.h
+++ b/include/linux/idr.h
@@ -170,11 +170,11 @@ struct ida_bitmap {
DECLARE_PER_CPU(struct ida_bitmap *, ida_bitmap);
struct ida {
- struct radix_tree_root ida_rt;
+ struct xarray idxa;
};
#define IDA_INIT(name) { \
- .ida_rt = RADIX_TREE_INIT(name, IDR_INIT_FLAGS | GFP_NOWAIT), \
+ .idxa = __XARRAY_INIT(name.idxa, IDR_INIT_FLAGS) \
}
#define DEFINE_IDA(name) struct ida name = IDA_INIT(name)
@@ -189,7 +189,7 @@ void ida_simple_remove(struct ida *ida, unsigned int id);
static inline void ida_init(struct ida *ida)
{
- INIT_RADIX_TREE(&ida->ida_rt, IDR_INIT_FLAGS | GFP_NOWAIT);
+ __xa_init(&ida->idxa, IDR_INIT_FLAGS);
}
/**
@@ -206,6 +206,6 @@ static inline int ida_get_new(struct ida *ida, int *p_id)
static inline bool ida_is_empty(const struct ida *ida)
{
- return radix_tree_empty(&ida->ida_rt);
+ return xa_empty(&ida->idxa);
}
#endif /* _LINUX_IDR_H */
diff --git a/include/linux/radix-tree.h b/include/linux/radix-tree.h
index 1da1fb01e993..3e1c7ef06a0b 100644
--- a/include/linux/radix-tree.h
+++ b/include/linux/radix-tree.h
@@ -310,10 +310,6 @@ int radix_tree_split(struct radix_tree_root *, unsigned long index,
int radix_tree_join(struct radix_tree_root *, unsigned long index,
unsigned new_order, void *);
-void __rcu **idr_get_free(struct radix_tree_root *root,
- struct radix_tree_iter *iter, gfp_t gfp,
- unsigned long max);
-
enum {
RADIX_TREE_ITER_TAG_MASK = 0x0f, /* tag index in lower nybble */
RADIX_TREE_ITER_TAGGED = 0x10, /* lookup tagged slots */
diff --git a/lib/idr.c b/lib/idr.c
index 713b19e6f1b3..574dcade0c4b 100644
--- a/lib/idr.c
+++ b/lib/idr.c
@@ -6,7 +6,6 @@
#include <linux/xarray.h>
DEFINE_PER_CPU(struct ida_bitmap *, ida_bitmap);
-static DEFINE_SPINLOCK(simple_ida_lock);
/* In radix-tree.c temporarily */
extern bool idr_nomem(struct xa_state *, gfp_t);
@@ -277,104 +276,114 @@ EXPORT_SYMBOL_GPL(idr_replace);
#define IDA_MAX (0x80000000U / IDA_BITMAP_BITS - 1)
+static struct ida_bitmap *alloc_ida_bitmap(void)
+{
+ struct ida_bitmap *bitmap = this_cpu_xchg(ida_bitmap, NULL);
+ if (bitmap)
+ memset(bitmap, 0, sizeof(*bitmap));
+ return bitmap;
+}
+
+static void free_ida_bitmap(struct ida_bitmap *bitmap)
+{
+ if (this_cpu_cmpxchg(ida_bitmap, NULL, bitmap))
+ kfree(bitmap);
+}
+
/**
* ida_get_new_above - allocate new ID above or equal to a start id
* @ida: ida handle
* @start: id to start search at
* @id: pointer to the allocated handle
*
- * Allocate new ID above or equal to @start. It should be called
- * with any required locks to ensure that concurrent calls to
- * ida_get_new_above() / ida_get_new() / ida_remove() are not allowed.
- * Consider using ida_simple_get() if you do not have complex locking
- * requirements.
+ * Allocate new ID above or equal to @start. The ida has its own lock,
+ * although you may wish to provide your own locking around it.
*
* If memory is required, it will return %-EAGAIN, you should unlock
* and go back to the ida_pre_get() call. If the ida is full, it will
* return %-ENOSPC. On success, it will return 0.
*
- * @id returns a value in the range @start ... %0x7fffffff.
+ * @id returns a value in the range @start ... %INT_MAX.
*/
int ida_get_new_above(struct ida *ida, int start, int *id)
{
- struct radix_tree_root *root = &ida->ida_rt;
- void __rcu **slot;
- struct radix_tree_iter iter;
+ unsigned long flags;
+ unsigned long index = start / IDA_BITMAP_BITS;
+ unsigned int bit = start % IDA_BITMAP_BITS;
+ XA_STATE(xas, index);
struct ida_bitmap *bitmap;
- unsigned long index;
- unsigned bit;
- int new;
-
- index = start / IDA_BITMAP_BITS;
- bit = start % IDA_BITMAP_BITS;
-
- slot = radix_tree_iter_init(&iter, index);
- for (;;) {
- if (slot)
- slot = radix_tree_next_slot(slot, &iter,
- RADIX_TREE_ITER_TAGGED);
- if (!slot) {
- slot = idr_get_free(root, &iter, GFP_NOWAIT, IDA_MAX);
- if (IS_ERR(slot)) {
- if (slot == ERR_PTR(-ENOMEM))
- return -EAGAIN;
- return PTR_ERR(slot);
- }
- }
- if (iter.index > index)
- bit = 0;
- new = iter.index * IDA_BITMAP_BITS;
- bitmap = rcu_dereference_raw(*slot);
- if (xa_is_value(bitmap)) {
- unsigned long tmp = xa_to_value(bitmap);
- int vbit = find_next_zero_bit(&tmp, BITS_PER_XA_VALUE,
- bit);
- if (vbit < BITS_PER_XA_VALUE) {
- tmp |= 1UL << vbit;
- rcu_assign_pointer(*slot, xa_mk_value(tmp));
- *id = new + vbit;
- return 0;
- }
- bitmap = this_cpu_xchg(ida_bitmap, NULL);
- if (!bitmap)
- return -EAGAIN;
- memset(bitmap, 0, sizeof(*bitmap));
- bitmap->bitmap[0] = tmp;
- rcu_assign_pointer(*slot, bitmap);
- }
+ unsigned int new;
+
+ xa_lock_irqsave(&ida->idxa, flags);
+retry:
+ bitmap = xas_find_tag(&ida->idxa, &xas, IDA_MAX, XA_FREE_TAG);
+ if (xas.xa_index > IDA_MAX)
+ goto nospc;
+ if (xas.xa_index > index)
+ bit = 0;
+ new = xas.xa_index * IDA_BITMAP_BITS;
+ if (xa_is_value(bitmap)) {
+ unsigned long value = xa_to_value(bitmap);
+ if (bit < BITS_PER_XA_VALUE) {
+ unsigned long tmp = value | ((1UL << bit) - 1);
+ bit = ffz(tmp);
- if (bitmap) {
- bit = find_next_zero_bit(bitmap->bitmap,
- IDA_BITMAP_BITS, bit);
- new += bit;
- if (new < 0)
- return -ENOSPC;
- if (bit == IDA_BITMAP_BITS)
- continue;
-
- __set_bit(bit, bitmap->bitmap);
- if (bitmap_full(bitmap->bitmap, IDA_BITMAP_BITS))
- radix_tree_iter_tag_clear(root, &iter,
- XA_FREE_TAG);
- } else {
- new += bit;
- if (new < 0)
- return -ENOSPC;
if (bit < BITS_PER_XA_VALUE) {
- bitmap = xa_mk_value(1UL << bit);
- } else {
- bitmap = this_cpu_xchg(ida_bitmap, NULL);
- if (!bitmap)
- return -EAGAIN;
- memset(bitmap, 0, sizeof(*bitmap));
- __set_bit(bit, bitmap->bitmap);
+ value |= (1UL << bit);
+ xas_store(&ida->idxa, &xas, xa_mk_value(value));
+ new += bit;
+ goto unlock;
}
- radix_tree_iter_replace(root, &iter, slot, bitmap);
}
- *id = new;
- return 0;
+ bitmap = alloc_ida_bitmap();
+ if (!bitmap)
+ goto nomem;
+ bitmap->bitmap[0] = value;
+ new += bit;
+ __set_bit(bit, bitmap->bitmap);
+ xas_store(&ida->idxa, &xas, bitmap);
+ if (xas_error(&xas))
+ free_ida_bitmap(bitmap);
+ } else if (bitmap) {
+ bit = find_next_zero_bit(bitmap->bitmap, IDA_BITMAP_BITS, bit);
+ if (bit == IDA_BITMAP_BITS)
+ goto retry;
+ new += bit;
+ if (new > INT_MAX)
+ goto nospc;
+ __set_bit(bit, bitmap->bitmap);
+ if (bitmap_full(bitmap->bitmap, IDA_BITMAP_BITS))
+ xas_clear_tag(&ida->idxa, &xas, XA_FREE_TAG);
+ } else if (bit < BITS_PER_XA_VALUE) {
+ new += bit;
+ bitmap = xa_mk_value(1UL << bit);
+ xas_store(&ida->idxa, &xas, bitmap);
+ } else {
+ bitmap = alloc_ida_bitmap();
+ if (!bitmap)
+ goto nomem;
+ new += bit;
+ __set_bit(bit, bitmap->bitmap);
+ xas_store(&ida->idxa, &xas, bitmap);
+ if (xas_error(&xas))
+ free_ida_bitmap(bitmap);
}
+
+ if (idr_nomem(&xas, GFP_NOWAIT))
+ goto retry;
+unlock:
+ xa_unlock_irqrestore(&ida->idxa, flags);
+ if (xas_error(&xas) == -ENOMEM)
+ return -EAGAIN;
+ *id = new;
+ return 0;
+nospc:
+ xa_unlock_irqrestore(&ida->idxa, flags);
+ return -ENOSPC;
+nomem:
+ xa_unlock_irqrestore(&ida->idxa, flags);
+ return -EAGAIN;
}
EXPORT_SYMBOL(ida_get_new_above);
@@ -382,45 +391,44 @@ EXPORT_SYMBOL(ida_get_new_above);
* ida_remove - Free the given ID
* @ida: ida handle
* @id: ID to free
- *
- * This function should not be called at the same time as ida_get_new_above().
*/
void ida_remove(struct ida *ida, int id)
{
+ unsigned long flags;
unsigned long index = id / IDA_BITMAP_BITS;
- unsigned offset = id % IDA_BITMAP_BITS;
+ unsigned bit = id % IDA_BITMAP_BITS;
+ XA_STATE(xas, index);
struct ida_bitmap *bitmap;
- unsigned long *btmp;
- struct radix_tree_iter iter;
- void __rcu **slot;
- slot = radix_tree_iter_lookup(&ida->ida_rt, &iter, index);
- if (!slot)
+ xa_lock_irqsave(&ida->idxa, flags);
+ bitmap = xas_load(&ida->idxa, &xas);
+ if (!bitmap)
goto err;
-
- bitmap = rcu_dereference_raw(*slot);
if (xa_is_value(bitmap)) {
- btmp = (unsigned long *)slot;
- offset += 1; /* Intimate knowledge of the xa_data encoding */
- if (offset >= BITS_PER_LONG)
+ unsigned long v = xa_to_value(bitmap);
+ if (bit >= BITS_PER_XA_VALUE)
goto err;
+ if (!(v & (1UL << bit)))
+ goto err;
+ v &= ~(1UL << bit);
+ if (v)
+ bitmap = xa_mk_value(v);
+ else
+ bitmap = NULL;
+ xas_store(&ida->idxa, &xas, bitmap);
} else {
- btmp = bitmap->bitmap;
- }
- if (!test_bit(offset, btmp))
- goto err;
-
- __clear_bit(offset, btmp);
- radix_tree_iter_tag_set(&ida->ida_rt, &iter, XA_FREE_TAG);
- if (xa_is_value(bitmap)) {
- if (xa_to_value(rcu_dereference_raw(*slot)) == 0)
- radix_tree_iter_delete(&ida->ida_rt, &iter, slot);
- } else if (bitmap_empty(btmp, IDA_BITMAP_BITS)) {
- kfree(bitmap);
- radix_tree_iter_delete(&ida->ida_rt, &iter, slot);
+ if (!__test_and_clear_bit(bit, bitmap->bitmap))
+ goto err;
+ if (bitmap_empty(bitmap->bitmap, IDA_BITMAP_BITS)) {
+ kfree(bitmap);
+ xas_store(&ida->idxa, &xas, NULL);
+ }
}
+ xas_set_tag(&ida->idxa, &xas, XA_FREE_TAG);
+ xa_unlock_irqrestore(&ida->idxa, flags);
return;
err:
+ xa_unlock_irqrestore(&ida->idxa, flags);
WARN(1, "ida_remove called for id=%d which is not allocated.\n", id);
}
EXPORT_SYMBOL(ida_remove);
@@ -430,21 +438,21 @@ EXPORT_SYMBOL(ida_remove);
* @ida: ida handle
*
* Calling this function releases all resources associated with an IDA. When
- * this call returns, the IDA is empty and can be reused or freed. The caller
- * should not allow ida_remove() or ida_get_new_above() to be called at the
- * same time.
+ * this call returns, the IDA is empty and can be reused or freed.
*/
void ida_destroy(struct ida *ida)
{
- struct radix_tree_iter iter;
- void __rcu **slot;
+ XA_STATE(xas, 0);
+ unsigned long flags;
+ struct ida_bitmap *bitmap;
- radix_tree_for_each_slot(slot, &ida->ida_rt, &iter, 0) {
- struct ida_bitmap *bitmap = rcu_dereference_raw(*slot);
+ xa_lock_irqsave(&ida->idxa, flags);
+ xas_for_each(&ida->idxa, &xas, bitmap, ULONG_MAX) {
if (!xa_is_value(bitmap))
kfree(bitmap);
- radix_tree_iter_delete(&ida->ida_rt, &iter, slot);
+ xas_store(&ida->idxa, &xas, NULL);
}
+ xa_unlock_irqrestore(&ida->idxa, flags);
}
EXPORT_SYMBOL(ida_destroy);
@@ -468,7 +476,6 @@ int ida_simple_get(struct ida *ida, unsigned int start, unsigned int end,
{
int ret, id;
unsigned int max;
- unsigned long flags;
BUG_ON((int)start < 0);
BUG_ON((int)end < 0);
@@ -484,7 +491,6 @@ int ida_simple_get(struct ida *ida, unsigned int start, unsigned int end,
if (!ida_pre_get(ida, gfp_mask))
return -ENOMEM;
- spin_lock_irqsave(&simple_ida_lock, flags);
ret = ida_get_new_above(ida, start, &id);
if (!ret) {
if (id > max) {
@@ -494,7 +500,6 @@ int ida_simple_get(struct ida *ida, unsigned int start, unsigned int end,
ret = id;
}
}
- spin_unlock_irqrestore(&simple_ida_lock, flags);
if (unlikely(ret == -EAGAIN))
goto again;
@@ -515,11 +520,7 @@ EXPORT_SYMBOL(ida_simple_get);
*/
void ida_simple_remove(struct ida *ida, unsigned int id)
{
- unsigned long flags;
-
BUG_ON((int)id < 0);
- spin_lock_irqsave(&simple_ida_lock, flags);
ida_remove(ida, id);
- spin_unlock_irqrestore(&simple_ida_lock, flags);
}
EXPORT_SYMBOL(ida_simple_remove);
diff --git a/lib/radix-tree.c b/lib/radix-tree.c
index 3fbc0751b181..f261fb2a92d2 100644
--- a/lib/radix-tree.c
+++ b/lib/radix-tree.c
@@ -247,13 +247,6 @@ static inline unsigned long node_maxindex(const struct radix_tree_node *node)
return shift_maxindex(node->shift);
}
-static unsigned long rnext_index(unsigned long index,
- const struct radix_tree_node *node,
- unsigned long offset)
-{
- return (index & ~node_maxindex(node)) + (offset << node->shift);
-}
-
#ifndef __KERNEL__
static void dump_node(struct radix_tree_node *node, unsigned long index)
{
@@ -338,10 +331,10 @@ static void dump_ida_node(void *entry, unsigned long index)
static void ida_dump(struct ida *ida)
{
- struct radix_tree_root *root = &ida->ida_rt;
- pr_debug("ida: %p node %p free %d\n", ida, root->xa_head,
- root->xa_flags >> ROOT_TAG_SHIFT);
- dump_ida_node(root->xa_head, 0);
+ struct xarray *xa = &ida->idxa;
+ pr_debug("ida: %p node %p free %d\n", ida, xa->xa_head,
+ xa->xa_flags >> ROOT_TAG_SHIFT);
+ dump_ida_node(xa->xa_head, 0);
}
#endif
@@ -2124,77 +2117,6 @@ int ida_pre_get(struct ida *ida, gfp_t gfp)
}
EXPORT_SYMBOL(ida_pre_get);
-void __rcu **idr_get_free(struct radix_tree_root *root,
- struct radix_tree_iter *iter, gfp_t gfp,
- unsigned long max)
-{
- struct radix_tree_node *node = NULL, *child;
- void __rcu **slot = (void __rcu **)&root->xa_head;
- unsigned long maxindex, start = iter->next_index;
- unsigned int shift, offset = 0;
-
- grow:
- shift = radix_tree_load_root(root, &child, &maxindex);
- if (!radix_tree_tagged(root, XA_FREE_TAG))
- start = max(start, maxindex + 1);
- if (start > max)
- return ERR_PTR(-ENOSPC);
-
- if (start > maxindex) {
- int error = radix_tree_extend(root, gfp, start, shift);
- if (error < 0)
- return ERR_PTR(error);
- shift = error;
- child = rcu_dereference_raw(root->xa_head);
- }
-
- while (shift) {
- shift -= RADIX_TREE_MAP_SHIFT;
- if (child == NULL) {
- /* Have to add a child node. */
- child = radix_tree_node_alloc(gfp, node, root, shift,
- offset, 0, 0);
- if (!child)
- return ERR_PTR(-ENOMEM);
- all_tag_set(child, XA_FREE_TAG);
- rcu_assign_pointer(*slot, node_to_entry(child));
- if (node)
- node->count++;
- } else if (!radix_tree_is_internal_node(child))
- break;
-
- node = entry_to_node(child);
- offset = radix_tree_descend(node, &child, start);
- if (!rtag_get(node, XA_FREE_TAG, offset)) {
- offset = radix_tree_find_next_bit(node, XA_FREE_TAG,
- offset + 1);
- start = rnext_index(start, node, offset);
- if (start > max)
- return ERR_PTR(-ENOSPC);
- while (offset == RADIX_TREE_MAP_SIZE) {
- offset = node->offset + 1;
- node = node->parent;
- if (!node)
- goto grow;
- shift = node->shift;
- }
- child = rcu_dereference_raw(node->slots[offset]);
- }
- slot = &node->slots[offset];
- }
-
- iter->index = start;
- if (node)
- iter->next_index = 1 + min(max, (start | node_maxindex(node)));
- else
- iter->next_index = 1;
- iter->node = node;
- __set_iter_shift(iter, shift);
- set_iter_tags(iter, node, offset, XA_FREE_TAG);
-
- return slot;
-}
-
static void
radix_tree_node_ctor(void *arg)
{
--
2.15.0
From: Matthew Wilcox <[email protected]>
Instead of storing a pointer to the slot containing the canonical entry,
store the offset of the slot. Produces slightly more efficient code
(~300 bytes) and simplifies the implementation.
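
As an illustration (layout only, not code from this patch), a
multi-order entry occupying slots 4-7 of a node is now laid out as:

	slot[4] = item;             /* the canonical entry */
	slot[5] = xa_mk_sibling(4); /* i.e. (void *)((4 << 2) | 2) */
	slot[6] = xa_mk_sibling(4);
	slot[7] = xa_mk_sibling(4);

so xa_to_sibling() recovers the offset 4 directly, where the old code
had to turn a pointer into the parent's slots array back into an offset.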
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/xarray.h | 64 +++++++++++++++++++++++++++++++++++++++++++++++++
lib/radix-tree.c | 65 ++++++++++++++------------------------------------
2 files changed, 82 insertions(+), 47 deletions(-)
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index b1da8021b5fa..b9e0350b9e90 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -84,6 +84,8 @@ static inline bool xa_is_value(void *entry)
return (unsigned long)entry & 1;
}
+/* Everything below here is the Advanced API. Proceed with caution. */
+
#define xa_trylock(xa) spin_trylock(&(xa)->xa_lock)
#define xa_lock(xa) spin_lock(&(xa)->xa_lock)
#define xa_unlock(xa) spin_unlock(&(xa)->xa_lock)
@@ -97,4 +99,66 @@ static inline bool xa_is_value(void *entry)
spin_unlock_irqrestore(&(xa)->xa_lock, flags)
#define xa_lock_held(xa) lockdep_is_held(&(xa)->xa_lock)
+/*
+ * The xarray is constructed out of a set of 'chunks' of pointers. Choosing
+ * the best chunk size requires some tradeoffs. A power of two recommends
+ * itself so that we can walk the tree based purely on shifts and masks.
+ * Generally, the larger the better; as the number of slots per level of the
+ * tree increases, the less tall the tree needs to be. But that needs to be
+ * balanced against the memory consumption of each node. On a 64-bit system,
+ * xa_node is currently 576 bytes, and we get 7 of them per 4kB page. If we
+ * doubled the number of slots per node, we'd get only 3 nodes per 4kB page.
+ */
+#ifndef XA_CHUNK_SHIFT
+#define XA_CHUNK_SHIFT (CONFIG_BASE_SMALL ? 4 : 6)
+#endif
+#define XA_CHUNK_SIZE (1UL << XA_CHUNK_SHIFT)
+#define XA_CHUNK_MASK (XA_CHUNK_SIZE - 1)
+
+/*
+ * Internal entries have the bottom two bits set to the value 10b. Most
+ * internal entries are pointers to the next node in the tree. Since the
+ * kernel unmaps page 0 to trap NULL pointer dereferences, we can store up
+ * to 1024 distinct values in the tree. Values 0-62 are used for sibling
+ * entries. The retry entry is value 256.
+ */
+static inline void *xa_mk_internal(unsigned long v)
+{
+ return (void *)((v << 2) | 2);
+}
+
+static inline unsigned long xa_to_internal(void *entry)
+{
+ return (unsigned long)entry >> 2;
+}
+
+static inline bool xa_is_internal(void *entry)
+{
+ return ((unsigned long)entry & 3) == 2;
+}
+
+static inline bool xa_is_node(void *entry)
+{
+ return xa_is_internal(entry) && (unsigned long)entry > 4096;
+}
+
+static inline void *xa_mk_sibling(unsigned int offset)
+{
+ return xa_mk_internal(offset);
+}
+
+static inline unsigned long xa_to_sibling(void *entry)
+{
+ return xa_to_internal(entry);
+}
+
+static inline bool xa_is_sibling(void *entry)
+{
+ return IS_ENABLED(CONFIG_RADIX_TREE_MULTIORDER) &&
+ xa_is_internal(entry) &&
+ (entry < xa_mk_sibling(XA_CHUNK_SIZE - 1));
+}
+
+#define XA_RETRY_ENTRY xa_mk_internal(256)
+
#endif /* _LINUX_XARRAY_H */
diff --git a/lib/radix-tree.c b/lib/radix-tree.c
index 30e49b89aa3b..4a1091e31932 100644
--- a/lib/radix-tree.c
+++ b/lib/radix-tree.c
@@ -37,6 +37,7 @@
#include <linux/rcupdate.h>
#include <linux/slab.h>
#include <linux/string.h>
+#include <linux/xarray.h>
/* Number of nodes in fully populated tree of given height */
@@ -97,24 +98,7 @@ static inline void *node_to_entry(void *ptr)
return (void *)((unsigned long)ptr | RADIX_TREE_INTERNAL_NODE);
}
-#define RADIX_TREE_RETRY node_to_entry(NULL)
-
-#ifdef CONFIG_RADIX_TREE_MULTIORDER
-/* Sibling slots point directly to another slot in the same node */
-static inline
-bool is_sibling_entry(const struct radix_tree_node *parent, void *node)
-{
- void __rcu **ptr = node;
- return (parent->slots <= ptr) &&
- (ptr < parent->slots + RADIX_TREE_MAP_SIZE);
-}
-#else
-static inline
-bool is_sibling_entry(const struct radix_tree_node *parent, void *node)
-{
- return false;
-}
-#endif
+#define RADIX_TREE_RETRY XA_RETRY_ENTRY
static inline unsigned long
get_slot_offset(const struct radix_tree_node *parent, void __rcu **slot)
@@ -128,16 +112,10 @@ static unsigned int radix_tree_descend(const struct radix_tree_node *parent,
unsigned int offset = (index >> parent->shift) & RADIX_TREE_MAP_MASK;
void __rcu **entry = rcu_dereference_raw(parent->slots[offset]);
-#ifdef CONFIG_RADIX_TREE_MULTIORDER
- if (radix_tree_is_internal_node(entry)) {
- if (is_sibling_entry(parent, entry)) {
- void __rcu **sibentry;
- sibentry = (void __rcu **) entry_to_node(entry);
- offset = get_slot_offset(parent, sibentry);
- entry = rcu_dereference_raw(*sibentry);
- }
+ if (xa_is_sibling(entry)) {
+ offset = xa_to_sibling(entry);
+ entry = rcu_dereference_raw(parent->slots[offset]);
}
-#endif
*nodep = (void *)entry;
return offset;
@@ -299,10 +277,10 @@ static void dump_node(struct radix_tree_node *node, unsigned long index)
} else if (!radix_tree_is_internal_node(entry)) {
pr_debug("radix entry %p offset %ld indices %lu-%lu parent %p\n",
entry, i, first, last, node);
- } else if (is_sibling_entry(node, entry)) {
+ } else if (xa_is_sibling(entry)) {
pr_debug("radix sblng %p offset %ld indices %lu-%lu parent %p val %p\n",
entry, i, first, last, node,
- *(void **)entry_to_node(entry));
+ node->slots[xa_to_sibling(entry)]);
} else {
dump_node(entry_to_node(entry), first);
}
@@ -872,8 +850,7 @@ static void radix_tree_free_nodes(struct radix_tree_node *node)
for (;;) {
void *entry = rcu_dereference_raw(child->slots[offset]);
- if (radix_tree_is_internal_node(entry) &&
- !is_sibling_entry(child, entry)) {
+ if (xa_is_node(entry)) {
child = entry_to_node(entry);
offset = 0;
continue;
@@ -895,7 +872,7 @@ static void radix_tree_free_nodes(struct radix_tree_node *node)
static inline int insert_entries(struct radix_tree_node *node,
void __rcu **slot, void *item, unsigned order, bool replace)
{
- struct radix_tree_node *child;
+ void *sibling;
unsigned i, n, tag, offset, tags = 0;
if (node) {
@@ -913,7 +890,7 @@ static inline int insert_entries(struct radix_tree_node *node,
offset = offset & ~(n - 1);
slot = &node->slots[offset];
}
- child = node_to_entry(slot);
+ sibling = xa_mk_sibling(offset);
for (i = 0; i < n; i++) {
if (slot[i]) {
@@ -930,7 +907,7 @@ static inline int insert_entries(struct radix_tree_node *node,
for (i = 0; i < n; i++) {
struct radix_tree_node *old = rcu_dereference_raw(slot[i]);
if (i) {
- rcu_assign_pointer(slot[i], child);
+ rcu_assign_pointer(slot[i], sibling);
for (tag = 0; tag < RADIX_TREE_MAX_TAGS; tag++)
if (tags & (1 << tag))
tag_clear(node, tag, offset + i);
@@ -940,9 +917,7 @@ static inline int insert_entries(struct radix_tree_node *node,
if (tags & (1 << tag))
tag_set(node, tag, offset);
}
- if (radix_tree_is_internal_node(old) &&
- !is_sibling_entry(node, old) &&
- (old != RADIX_TREE_RETRY))
+ if (xa_is_node(old))
radix_tree_free_nodes(old);
if (xa_is_value(old))
node->exceptional--;
@@ -1101,10 +1076,10 @@ static inline void replace_sibling_entries(struct radix_tree_node *node,
void __rcu **slot, int count, int exceptional)
{
#ifdef CONFIG_RADIX_TREE_MULTIORDER
- void *ptr = node_to_entry(slot);
- unsigned offset = get_slot_offset(node, slot) + 1;
+ unsigned offset = get_slot_offset(node, slot);
+ void *ptr = xa_mk_sibling(offset);
- while (offset < RADIX_TREE_MAP_SIZE) {
+ while (++offset < RADIX_TREE_MAP_SIZE) {
if (rcu_dereference_raw(node->slots[offset]) != ptr)
break;
if (count < 0) {
@@ -1112,7 +1087,6 @@ static inline void replace_sibling_entries(struct radix_tree_node *node,
node->count--;
}
node->exceptional += exceptional;
- offset++;
}
#endif
}
@@ -1311,8 +1285,7 @@ int radix_tree_split(struct radix_tree_root *root, unsigned long index,
tags |= 1 << tag;
for (end = offset + 1; end < RADIX_TREE_MAP_SIZE; end++) {
- if (!is_sibling_entry(parent,
- rcu_dereference_raw(parent->slots[end])))
+ if (!xa_is_sibling(rcu_dereference_raw(parent->slots[end])))
break;
for (tag = 0; tag < RADIX_TREE_MAX_TAGS; tag++)
if (tags & (1 << tag))
@@ -1608,11 +1581,9 @@ static void set_iter_tags(struct radix_tree_iter *iter,
static void __rcu **skip_siblings(struct radix_tree_node **nodep,
void __rcu **slot, struct radix_tree_iter *iter)
{
- void *sib = node_to_entry(slot - 1);
-
while (iter->index < iter->next_index) {
*nodep = rcu_dereference_raw(*slot);
- if (*nodep && *nodep != sib)
+ if (*nodep && !xa_is_sibling(*nodep))
return slot;
slot++;
iter->index = __radix_tree_iter_add(iter, 1);
@@ -1763,7 +1734,7 @@ void __rcu **radix_tree_next_chunk(const struct radix_tree_root *root,
while (++offset < RADIX_TREE_MAP_SIZE) {
void *slot = rcu_dereference_raw(
node->slots[offset]);
- if (is_sibling_entry(node, slot))
+ if (xa_is_sibling(slot))
continue;
if (slot)
break;
--
2.15.0
From: Matthew Wilcox <[email protected]>
Remove the address_space ->tree_lock and use the xa_lock newly added to
the radix_tree_root. Rename the address_space ->page_tree to ->pages,
since we don't really care that it's a tree. Take the opportunity to
rearrange the elements of address_space to pack them better on 64-bit,
and make the comments more useful.
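
The bulk of the patch is the same mechanical substitution repeated in
each file; a representative before/after:

	/* before */
	spin_lock_irq(&mapping->tree_lock);
	radix_tree_tag_clear(&mapping->page_tree, index, PAGECACHE_TAG_DIRTY);
	spin_unlock_irq(&mapping->tree_lock);

	/* after: the spinlock is embedded in the struct */
	xa_lock_irq(&mapping->pages);
	radix_tree_tag_clear(&mapping->pages, index, PAGECACHE_TAG_DIRTY);
	xa_unlock_irq(&mapping->pages);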
Signed-off-by: Matthew Wilcox <[email protected]>
---
Documentation/cgroup-v1/memory.txt | 2 +-
Documentation/vm/page_migration | 14 ++--
arch/arm/include/asm/cacheflush.h | 6 +-
arch/nios2/include/asm/cacheflush.h | 6 +-
arch/parisc/include/asm/cacheflush.h | 6 +-
drivers/staging/lustre/lustre/llite/glimpse.c | 2 +-
drivers/staging/lustre/lustre/mdc/mdc_request.c | 8 +-
fs/afs/write.c | 2 +-
fs/btrfs/compression.c | 2 +-
fs/btrfs/extent_io.c | 8 +-
fs/btrfs/inode.c | 2 +-
fs/buffer.c | 10 +--
fs/cifs/file.c | 2 +-
fs/dax.c | 106 ++++++++++++------------
fs/f2fs/data.c | 6 +-
fs/f2fs/dir.c | 6 +-
fs/f2fs/inline.c | 6 +-
fs/f2fs/node.c | 8 +-
fs/fs-writeback.c | 18 ++--
fs/inode.c | 11 ++-
fs/nilfs2/btnode.c | 20 ++---
fs/nilfs2/page.c | 22 ++---
include/linux/backing-dev.h | 12 +--
include/linux/fs.h | 17 ++--
include/linux/mm.h | 2 +-
include/linux/pagemap.h | 4 +-
mm/filemap.c | 83 +++++++++----------
mm/huge_memory.c | 10 +--
mm/khugepaged.c | 49 +++++------
mm/memcontrol.c | 2 +-
mm/migrate.c | 31 ++++---
mm/page-writeback.c | 42 +++++-----
mm/readahead.c | 2 +-
mm/rmap.c | 4 +-
mm/shmem.c | 60 +++++++-------
mm/swap_state.c | 17 ++--
mm/truncate.c | 22 ++---
mm/vmscan.c | 12 +--
mm/workingset.c | 22 ++---
39 files changed, 322 insertions(+), 342 deletions(-)
diff --git a/Documentation/cgroup-v1/memory.txt b/Documentation/cgroup-v1/memory.txt
index cefb63639070..1d17fb0405ef 100644
--- a/Documentation/cgroup-v1/memory.txt
+++ b/Documentation/cgroup-v1/memory.txt
@@ -262,7 +262,7 @@ When oom event notifier is registered, event will be delivered.
2.6 Locking
lock_page_cgroup()/unlock_page_cgroup() should not be called under
- mapping->tree_lock.
+ mapping xa_lock.
Other lock order is following:
PG_locked.
diff --git a/Documentation/vm/page_migration b/Documentation/vm/page_migration
index 0478ae2ad44a..faf849596a85 100644
--- a/Documentation/vm/page_migration
+++ b/Documentation/vm/page_migration
@@ -90,7 +90,7 @@ Steps:
1. Lock the page to be migrated
-2. Insure that writeback is complete.
+2. Ensure that writeback is complete.
3. Lock the new page that we want to move to. It is locked so that accesses to
this (not yet uptodate) page immediately lock while the move is in progress.
@@ -100,8 +100,8 @@ Steps:
mapcount is not zero then we do not migrate the page. All user space
processes that attempt to access the page will now wait on the page lock.
-5. The radix tree lock is taken. This will cause all processes trying
- to access the page via the mapping to block on the radix tree spinlock.
+5. The address space xa_lock is taken. This will cause all processes trying
+ to access the page via the mapping to block on the spinlock.
6. The refcount of the page is examined and we back out if references remain
otherwise we know that we are the only one referencing this page.
@@ -114,12 +114,12 @@ Steps:
9. The radix tree is changed to point to the new page.
-10. The reference count of the old page is dropped because the radix tree
+10. The reference count of the old page is dropped because the address space
reference is gone. A reference to the new page is established because
- the new page is referenced to by the radix tree.
+ the new page is referenced by the address space.
-11. The radix tree lock is dropped. With that lookups in the mapping
- become possible again. Processes will move from spinning on the tree_lock
+11. The address space xa_lock is dropped. With that lookups in the mapping
+ become possible again. Processes will move from spinning on the xa_lock
to sleeping on the locked new page.
12. The page contents are copied to the new page.
diff --git a/arch/arm/include/asm/cacheflush.h b/arch/arm/include/asm/cacheflush.h
index 74504b154256..f4ead9a74b7d 100644
--- a/arch/arm/include/asm/cacheflush.h
+++ b/arch/arm/include/asm/cacheflush.h
@@ -318,10 +318,8 @@ static inline void flush_anon_page(struct vm_area_struct *vma,
#define ARCH_HAS_FLUSH_KERNEL_DCACHE_PAGE
extern void flush_kernel_dcache_page(struct page *);
-#define flush_dcache_mmap_lock(mapping) \
- spin_lock_irq(&(mapping)->tree_lock)
-#define flush_dcache_mmap_unlock(mapping) \
- spin_unlock_irq(&(mapping)->tree_lock)
+#define flush_dcache_mmap_lock(mapping) xa_lock_irq(&mapping->pages)
+#define flush_dcache_mmap_unlock(mapping) xa_unlock_irq(&mapping->pages)
#define flush_icache_user_range(vma,page,addr,len) \
flush_dcache_page(page)
diff --git a/arch/nios2/include/asm/cacheflush.h b/arch/nios2/include/asm/cacheflush.h
index 55e383c173f7..7a6eda381964 100644
--- a/arch/nios2/include/asm/cacheflush.h
+++ b/arch/nios2/include/asm/cacheflush.h
@@ -46,9 +46,7 @@ extern void copy_from_user_page(struct vm_area_struct *vma, struct page *page,
extern void flush_dcache_range(unsigned long start, unsigned long end);
extern void invalidate_dcache_range(unsigned long start, unsigned long end);
-#define flush_dcache_mmap_lock(mapping) \
- spin_lock_irq(&(mapping)->tree_lock)
-#define flush_dcache_mmap_unlock(mapping) \
- spin_unlock_irq(&(mapping)->tree_lock)
+#define flush_dcache_mmap_lock(mapping) xa_lock_irq(&mapping->pages)
+#define flush_dcache_mmap_unlock(mapping) xa_unlock_irq(&mapping->pages)
#endif /* _ASM_NIOS2_CACHEFLUSH_H */
diff --git a/arch/parisc/include/asm/cacheflush.h b/arch/parisc/include/asm/cacheflush.h
index 3742508cc534..b772dd320118 100644
--- a/arch/parisc/include/asm/cacheflush.h
+++ b/arch/parisc/include/asm/cacheflush.h
@@ -54,10 +54,8 @@ void invalidate_kernel_vmap_range(void *vaddr, int size);
#define ARCH_IMPLEMENTS_FLUSH_DCACHE_PAGE 1
extern void flush_dcache_page(struct page *page);
-#define flush_dcache_mmap_lock(mapping) \
- spin_lock_irq(&(mapping)->tree_lock)
-#define flush_dcache_mmap_unlock(mapping) \
- spin_unlock_irq(&(mapping)->tree_lock)
+#define flush_dcache_mmap_lock(mapping) xa_lock_irq(&mapping->pages)
+#define flush_dcache_mmap_unlock(mapping) xa_unlock_irq(&mapping->pages)
#define flush_icache_page(vma,page) do { \
flush_kernel_dcache_page(page); \
diff --git a/drivers/staging/lustre/lustre/llite/glimpse.c b/drivers/staging/lustre/lustre/llite/glimpse.c
index c43ac574274c..5f2843da911c 100644
--- a/drivers/staging/lustre/lustre/llite/glimpse.c
+++ b/drivers/staging/lustre/lustre/llite/glimpse.c
@@ -69,7 +69,7 @@ blkcnt_t dirty_cnt(struct inode *inode)
void *results[1];
if (inode->i_mapping)
- cnt += radix_tree_gang_lookup_tag(&inode->i_mapping->page_tree,
+ cnt += radix_tree_gang_lookup_tag(&inode->i_mapping->pages,
results, 0, 1,
PAGECACHE_TAG_DIRTY);
if (cnt == 0 && atomic_read(&vob->vob_mmap_cnt) > 0)
diff --git a/drivers/staging/lustre/lustre/mdc/mdc_request.c b/drivers/staging/lustre/lustre/mdc/mdc_request.c
index 03e55bca4ada..45dcf9f958d4 100644
--- a/drivers/staging/lustre/lustre/mdc/mdc_request.c
+++ b/drivers/staging/lustre/lustre/mdc/mdc_request.c
@@ -937,14 +937,14 @@ static struct page *mdc_page_locate(struct address_space *mapping, __u64 *hash,
struct page *page;
int found;
- spin_lock_irq(&mapping->tree_lock);
- found = radix_tree_gang_lookup(&mapping->page_tree,
+ xa_lock_irq(&mapping->pages);
+ found = radix_tree_gang_lookup(&mapping->pages,
(void **)&page, offset, 1);
if (found > 0 && !radix_tree_exceptional_entry(page)) {
struct lu_dirpage *dp;
get_page(page);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
/*
* In contrast to find_lock_page() we are sure that directory
* page cannot be truncated (while DLM lock is held) and,
@@ -992,7 +992,7 @@ static struct page *mdc_page_locate(struct address_space *mapping, __u64 *hash,
page = ERR_PTR(-EIO);
}
} else {
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
page = NULL;
}
return page;
diff --git a/fs/afs/write.c b/fs/afs/write.c
index 18e46e31523c..eae4c80ec09d 100644
--- a/fs/afs/write.c
+++ b/fs/afs/write.c
@@ -563,7 +563,7 @@ static int afs_writepages_region(struct address_space *mapping,
_debug("wback %lx", page->index);
- /* at this point we hold neither mapping->tree_lock nor lock on
+ /* at this point we hold neither mapping xa_lock nor lock on
* the page itself: the page may be truncated or invalidated
* (changing page->mapping to NULL), or even swizzled back from
* swapper_space to tmpfs file mapping
diff --git a/fs/btrfs/compression.c b/fs/btrfs/compression.c
index b35ce16b3df3..541bd536c815 100644
--- a/fs/btrfs/compression.c
+++ b/fs/btrfs/compression.c
@@ -449,7 +449,7 @@ static noinline int add_ra_bio_pages(struct inode *inode,
break;
rcu_read_lock();
- page = radix_tree_lookup(&mapping->page_tree, pg_index);
+ page = radix_tree_lookup(&mapping->pages, pg_index);
rcu_read_unlock();
if (page && !radix_tree_exceptional_entry(page)) {
misses++;
diff --git a/fs/btrfs/extent_io.c b/fs/btrfs/extent_io.c
index 16045ea86fc1..184096a709a9 100644
--- a/fs/btrfs/extent_io.c
+++ b/fs/btrfs/extent_io.c
@@ -3966,7 +3966,7 @@ static int extent_write_cache_pages(struct address_space *mapping,
done_index = page->index;
/*
- * At this point we hold neither mapping->tree_lock nor
+ * At this point we hold neither mapping xa_lock nor
* lock on the page itself: the page may be truncated or
* invalidated (changing page->mapping to NULL), or even
* swizzled back from swapper_space to tmpfs file
@@ -5196,13 +5196,13 @@ void clear_extent_buffer_dirty(struct extent_buffer *eb)
WARN_ON(!PagePrivate(page));
clear_page_dirty_for_io(page);
- spin_lock_irq(&page->mapping->tree_lock);
+ xa_lock_irq(&page->mapping->pages);
if (!PageDirty(page)) {
- radix_tree_tag_clear(&page->mapping->page_tree,
+ radix_tree_tag_clear(&page->mapping->pages,
page_index(page),
PAGECACHE_TAG_DIRTY);
}
- spin_unlock_irq(&page->mapping->tree_lock);
+ xa_unlock_irq(&page->mapping->pages);
ClearPageError(page);
unlock_page(page);
}
diff --git a/fs/btrfs/inode.c b/fs/btrfs/inode.c
index b93fe05a39c7..9ceee45ed513 100644
--- a/fs/btrfs/inode.c
+++ b/fs/btrfs/inode.c
@@ -7527,7 +7527,7 @@ noinline int can_nocow_extent(struct inode *inode, u64 offset, u64 *len,
bool btrfs_page_exists_in_range(struct inode *inode, loff_t start, loff_t end)
{
- struct radix_tree_root *root = &inode->i_mapping->page_tree;
+ struct radix_tree_root *root = &inode->i_mapping->pages;
bool found = false;
void **pagep = NULL;
struct page *page = NULL;
diff --git a/fs/buffer.c b/fs/buffer.c
index fd894c2ae284..33c08624d45b 100644
--- a/fs/buffer.c
+++ b/fs/buffer.c
@@ -195,7 +195,7 @@ EXPORT_SYMBOL(end_buffer_write_sync);
* Hack idea: for the blockdev mapping, i_bufferlist_lock contention
* may be quite high. This code could TryLock the page, and if that
* succeeds, there is no need to take private_lock. (But if
- * private_lock is contended then so is mapping->tree_lock).
+ * private_lock is contended then so is mapping xa_lock).
*/
static struct buffer_head *
__find_get_block_slow(struct block_device *bdev, sector_t block)
@@ -606,14 +606,14 @@ void __set_page_dirty(struct page *page, struct address_space *mapping,
{
unsigned long flags;
- spin_lock_irqsave(&mapping->tree_lock, flags);
+ xa_lock_irqsave(&mapping->pages, flags);
if (page->mapping) { /* Race with truncate? */
WARN_ON_ONCE(warn && !PageUptodate(page));
account_page_dirtied(page, mapping);
- radix_tree_tag_set(&mapping->page_tree,
+ radix_tree_tag_set(&mapping->pages,
page_index(page), PAGECACHE_TAG_DIRTY);
}
- spin_unlock_irqrestore(&mapping->tree_lock, flags);
+ xa_unlock_irqrestore(&mapping->pages, flags);
}
EXPORT_SYMBOL_GPL(__set_page_dirty);
@@ -1102,7 +1102,7 @@ __getblk_slow(struct block_device *bdev, sector_t block,
* inode list.
*
* mark_buffer_dirty() is atomic. It takes bh->b_page->mapping->private_lock,
- * mapping->tree_lock and mapping->host->i_lock.
+ * mapping xa_lock and mapping->host->i_lock.
*/
void mark_buffer_dirty(struct buffer_head *bh)
{
diff --git a/fs/cifs/file.c b/fs/cifs/file.c
index 3a85df2a9baf..ca7084825622 100644
--- a/fs/cifs/file.c
+++ b/fs/cifs/file.c
@@ -1987,7 +1987,7 @@ wdata_prepare_pages(struct cifs_writedata *wdata, unsigned int found_pages,
for (i = 0; i < found_pages; i++) {
page = wdata->pages[i];
/*
- * At this point we hold neither mapping->tree_lock nor
+ * At this point we hold neither mapping xa_lock nor
* lock on the page itself: the page may be truncated or
* invalidated (changing page->mapping to NULL), or even
* swizzled back from swapper_space to tmpfs file
diff --git a/fs/dax.c b/fs/dax.c
index 95981591977a..93e6b79687af 100644
--- a/fs/dax.c
+++ b/fs/dax.c
@@ -158,7 +158,7 @@ static int wake_exceptional_entry_func(wait_queue_entry_t *wait, unsigned int mo
}
/*
- * We do not necessarily hold the mapping->tree_lock when we call this
+ * We do not necessarily hold the mapping xa_lock when we call this
* function so it is possible that 'entry' is no longer a valid item in the
* radix tree. This is okay because all we really need to do is to find the
* correct waitqueue where tasks might be waiting for that old 'entry' and
@@ -174,7 +174,7 @@ static void dax_wake_mapping_entry_waiter(struct address_space *mapping,
/*
* Checking for locked entry and prepare_to_wait_exclusive() happens
- * under mapping->tree_lock, ditto for entry handling in our callers.
+ * under mapping xa_lock, ditto for entry handling in our callers.
* So at this point all tasks that could have seen our entry locked
* must be in the waitqueue and the following check will see them.
*/
@@ -184,40 +184,40 @@ static void dax_wake_mapping_entry_waiter(struct address_space *mapping,
/*
* Check whether the given slot is locked. The function must be called with
- * mapping->tree_lock held
+ * mapping xa_lock held
*/
static inline int slot_locked(struct address_space *mapping, void **slot)
{
unsigned long entry = (unsigned long)
- radix_tree_deref_slot_protected(slot, &mapping->tree_lock);
+ radix_tree_deref_slot_protected(slot, &mapping->pages.xa_lock);
return entry & RADIX_DAX_ENTRY_LOCK;
}
/*
* Mark the given slot is locked. The function must be called with
- * mapping->tree_lock held
+ * mapping xa_lock held
*/
static inline void *lock_slot(struct address_space *mapping, void **slot)
{
unsigned long entry = (unsigned long)
- radix_tree_deref_slot_protected(slot, &mapping->tree_lock);
+ radix_tree_deref_slot_protected(slot, &mapping->pages.xa_lock);
entry |= RADIX_DAX_ENTRY_LOCK;
- radix_tree_replace_slot(&mapping->page_tree, slot, (void *)entry);
+ radix_tree_replace_slot(&mapping->pages, slot, (void *)entry);
return (void *)entry;
}
/*
* Mark the given slot is unlocked. The function must be called with
- * mapping->tree_lock held
+ * mapping xa_lock held
*/
static inline void *unlock_slot(struct address_space *mapping, void **slot)
{
unsigned long entry = (unsigned long)
- radix_tree_deref_slot_protected(slot, &mapping->tree_lock);
+ radix_tree_deref_slot_protected(slot, &mapping->pages.xa_lock);
entry &= ~(unsigned long)RADIX_DAX_ENTRY_LOCK;
- radix_tree_replace_slot(&mapping->page_tree, slot, (void *)entry);
+ radix_tree_replace_slot(&mapping->pages, slot, (void *)entry);
return (void *)entry;
}
@@ -228,7 +228,7 @@ static inline void *unlock_slot(struct address_space *mapping, void **slot)
* put_locked_mapping_entry() when he locked the entry and now wants to
* unlock it.
*
- * The function must be called with mapping->tree_lock held.
+ * The function must be called with mapping xa_lock held.
*/
static void *get_unlocked_mapping_entry(struct address_space *mapping,
pgoff_t index, void ***slotp)
@@ -241,7 +241,7 @@ static void *get_unlocked_mapping_entry(struct address_space *mapping,
ewait.wait.func = wake_exceptional_entry_func;
for (;;) {
- entry = __radix_tree_lookup(&mapping->page_tree, index, NULL,
+ entry = __radix_tree_lookup(&mapping->pages, index, NULL,
&slot);
if (!entry ||
WARN_ON_ONCE(!radix_tree_exceptional_entry(entry)) ||
@@ -254,10 +254,10 @@ static void *get_unlocked_mapping_entry(struct address_space *mapping,
wq = dax_entry_waitqueue(mapping, index, entry, &ewait.key);
prepare_to_wait_exclusive(wq, &ewait.wait,
TASK_UNINTERRUPTIBLE);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
schedule();
finish_wait(wq, &ewait.wait);
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
}
}
@@ -266,15 +266,15 @@ static void dax_unlock_mapping_entry(struct address_space *mapping,
{
void *entry, **slot;
- spin_lock_irq(&mapping->tree_lock);
- entry = __radix_tree_lookup(&mapping->page_tree, index, NULL, &slot);
+ xa_lock_irq(&mapping->pages);
+ entry = __radix_tree_lookup(&mapping->pages, index, NULL, &slot);
if (WARN_ON_ONCE(!entry || !radix_tree_exceptional_entry(entry) ||
!slot_locked(mapping, slot))) {
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
return;
}
unlock_slot(mapping, slot);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
dax_wake_mapping_entry_waiter(mapping, index, entry, false);
}
@@ -331,7 +331,7 @@ static void *grab_mapping_entry(struct address_space *mapping, pgoff_t index,
void *entry, **slot;
restart:
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
entry = get_unlocked_mapping_entry(mapping, index, &slot);
if (WARN_ON_ONCE(entry && !radix_tree_exceptional_entry(entry))) {
@@ -363,12 +363,12 @@ static void *grab_mapping_entry(struct address_space *mapping, pgoff_t index,
if (pmd_downgrade) {
/*
* Make sure 'entry' remains valid while we drop
- * mapping->tree_lock.
+ * mapping xa_lock.
*/
entry = lock_slot(mapping, slot);
}
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
/*
* Besides huge zero pages the only other thing that gets
* downgraded are empty entries which don't need to be
@@ -385,26 +385,26 @@ static void *grab_mapping_entry(struct address_space *mapping, pgoff_t index,
put_locked_mapping_entry(mapping, index);
return ERR_PTR(err);
}
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
if (!entry) {
/*
- * We needed to drop the page_tree lock while calling
+ * We needed to drop the pages lock while calling
* radix_tree_preload() and we didn't have an entry to
* lock. See if another thread inserted an entry at
* our index during this time.
*/
- entry = __radix_tree_lookup(&mapping->page_tree, index,
+ entry = __radix_tree_lookup(&mapping->pages, index,
NULL, &slot);
if (entry) {
radix_tree_preload_end();
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
goto restart;
}
}
if (pmd_downgrade) {
- radix_tree_delete(&mapping->page_tree, index);
+ radix_tree_delete(&mapping->pages, index);
mapping->nrexceptional--;
dax_wake_mapping_entry_waiter(mapping, index, entry,
true);
@@ -412,11 +412,11 @@ static void *grab_mapping_entry(struct address_space *mapping, pgoff_t index,
entry = dax_radix_locked_entry(0, size_flag | RADIX_DAX_EMPTY);
- err = __radix_tree_insert(&mapping->page_tree, index,
+ err = __radix_tree_insert(&mapping->pages, index,
dax_radix_order(entry), entry);
radix_tree_preload_end();
if (err) {
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
/*
* Our insertion of a DAX entry failed, most likely
* because we were inserting a PMD entry and it
@@ -429,12 +429,12 @@ static void *grab_mapping_entry(struct address_space *mapping, pgoff_t index,
}
/* Good, we have inserted empty locked entry into the tree. */
mapping->nrexceptional++;
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
return entry;
}
entry = lock_slot(mapping, slot);
out_unlock:
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
return entry;
}
@@ -443,22 +443,22 @@ static int __dax_invalidate_mapping_entry(struct address_space *mapping,
{
int ret = 0;
void *entry;
- struct radix_tree_root *page_tree = &mapping->page_tree;
+ struct radix_tree_root *pages = &mapping->pages;
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
entry = get_unlocked_mapping_entry(mapping, index, NULL);
if (!entry || WARN_ON_ONCE(!radix_tree_exceptional_entry(entry)))
goto out;
if (!trunc &&
- (radix_tree_tag_get(page_tree, index, PAGECACHE_TAG_DIRTY) ||
- radix_tree_tag_get(page_tree, index, PAGECACHE_TAG_TOWRITE)))
+ (radix_tree_tag_get(pages, index, PAGECACHE_TAG_DIRTY) ||
+ radix_tree_tag_get(pages, index, PAGECACHE_TAG_TOWRITE)))
goto out;
- radix_tree_delete(page_tree, index);
+ radix_tree_delete(pages, index);
mapping->nrexceptional--;
ret = 1;
out:
put_unlocked_mapping_entry(mapping, index, entry);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
return ret;
}
/*
@@ -528,7 +528,7 @@ static void *dax_insert_mapping_entry(struct address_space *mapping,
void *entry, sector_t sector,
unsigned long flags, bool dirty)
{
- struct radix_tree_root *page_tree = &mapping->page_tree;
+ struct radix_tree_root *pages = &mapping->pages;
void *new_entry;
pgoff_t index = vmf->pgoff;
@@ -546,7 +546,7 @@ static void *dax_insert_mapping_entry(struct address_space *mapping,
PAGE_SIZE, 0);
}
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
new_entry = dax_radix_locked_entry(sector, flags);
if (dax_is_zero_entry(entry) || dax_is_empty_entry(entry)) {
@@ -562,17 +562,17 @@ static void *dax_insert_mapping_entry(struct address_space *mapping,
void **slot;
void *ret;
- ret = __radix_tree_lookup(page_tree, index, &node, &slot);
+ ret = __radix_tree_lookup(pages, index, &node, &slot);
WARN_ON_ONCE(ret != entry);
- __radix_tree_replace(page_tree, node, slot,
+ __radix_tree_replace(pages, node, slot,
new_entry, NULL);
entry = new_entry;
}
if (dirty)
- radix_tree_tag_set(page_tree, index, PAGECACHE_TAG_DIRTY);
+ radix_tree_tag_set(pages, index, PAGECACHE_TAG_DIRTY);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
return entry;
}
@@ -662,7 +662,7 @@ static int dax_writeback_one(struct block_device *bdev,
struct dax_device *dax_dev, struct address_space *mapping,
pgoff_t index, void *entry)
{
- struct radix_tree_root *page_tree = &mapping->page_tree;
+ struct radix_tree_root *pages = &mapping->pages;
void *entry2, **slot, *kaddr;
long ret = 0, id;
sector_t sector;
@@ -677,7 +677,7 @@ static int dax_writeback_one(struct block_device *bdev,
if (WARN_ON(!radix_tree_exceptional_entry(entry)))
return -EIO;
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
entry2 = get_unlocked_mapping_entry(mapping, index, &slot);
/* Entry got punched out / reallocated? */
if (!entry2 || WARN_ON_ONCE(!radix_tree_exceptional_entry(entry2)))
@@ -696,7 +696,7 @@ static int dax_writeback_one(struct block_device *bdev,
}
/* Another fsync thread may have already written back this entry */
- if (!radix_tree_tag_get(page_tree, index, PAGECACHE_TAG_TOWRITE))
+ if (!radix_tree_tag_get(pages, index, PAGECACHE_TAG_TOWRITE))
goto put_unlocked;
/* Lock the entry to serialize with page faults */
entry = lock_slot(mapping, slot);
@@ -704,11 +704,11 @@ static int dax_writeback_one(struct block_device *bdev,
* We can clear the tag now but we have to be careful so that concurrent
* dax_writeback_one() calls for the same index cannot finish before we
* actually flush the caches. This is achieved as the calls will look
- * at the entry only under tree_lock and once they do that they will
+ * at the entry only under xa_lock and once they do that they will
* see the entry locked and wait for it to unlock.
*/
- radix_tree_tag_clear(page_tree, index, PAGECACHE_TAG_TOWRITE);
- spin_unlock_irq(&mapping->tree_lock);
+ radix_tree_tag_clear(pages, index, PAGECACHE_TAG_TOWRITE);
+ xa_unlock_irq(&mapping->pages);
/*
* Even if dax_writeback_mapping_range() was given a wbc->range_start
@@ -726,7 +726,7 @@ static int dax_writeback_one(struct block_device *bdev,
goto dax_unlock;
/*
- * dax_direct_access() may sleep, so cannot hold tree_lock over
+ * dax_direct_access() may sleep, so cannot hold xa_lock over
* its invocation.
*/
ret = dax_direct_access(dax_dev, pgoff, size / PAGE_SIZE, &kaddr, &pfn);
@@ -746,9 +746,9 @@ static int dax_writeback_one(struct block_device *bdev,
* the pfn mappings are writeprotected and fault waits for mapping
* entry lock.
*/
- spin_lock_irq(&mapping->tree_lock);
- radix_tree_tag_clear(page_tree, index, PAGECACHE_TAG_DIRTY);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
+ radix_tree_tag_clear(pages, index, PAGECACHE_TAG_DIRTY);
+ xa_unlock_irq(&mapping->pages);
trace_dax_writeback_one(mapping->host, index, size >> PAGE_SHIFT);
dax_unlock:
dax_read_unlock(id);
@@ -757,7 +757,7 @@ static int dax_writeback_one(struct block_device *bdev,
put_unlocked:
put_unlocked_mapping_entry(mapping, index, entry2);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
return ret;
}
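
For reference, the xa_lock_*() calls used in these conversions are assumed
to be thin wrappers around the spinlock now embedded in the radix_tree_root
(added earlier in this series), along the lines of:

	#define xa_lock_irq(xa)		spin_lock_irq(&(xa)->xa_lock)
	#define xa_unlock_irq(xa)	spin_unlock_irq(&(xa)->xa_lock)
	#define xa_lock_irqsave(xa, flags) \
					spin_lock_irqsave(&(xa)->xa_lock, flags)
	#define xa_unlock_irqrestore(xa, flags) \
					spin_unlock_irqrestore(&(xa)->xa_lock, flags)

so each spin_lock_irq(&mapping->tree_lock) above becomes
xa_lock_irq(&mapping->pages) with unchanged locking semantics.
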
diff --git a/fs/f2fs/data.c b/fs/f2fs/data.c
index 516fa0d3ff9c..8f51ac47b77f 100644
--- a/fs/f2fs/data.c
+++ b/fs/f2fs/data.c
@@ -2172,12 +2172,12 @@ void f2fs_set_page_dirty_nobuffers(struct page *page)
SetPageDirty(page);
spin_unlock(&mapping->private_lock);
- spin_lock_irqsave(&mapping->tree_lock, flags);
+ xa_lock_irqsave(&mapping->pages, flags);
WARN_ON_ONCE(!PageUptodate(page));
account_page_dirtied(page, mapping);
- radix_tree_tag_set(&mapping->page_tree,
+ radix_tree_tag_set(&mapping->pages,
page_index(page), PAGECACHE_TAG_DIRTY);
- spin_unlock_irqrestore(&mapping->tree_lock, flags);
+ xa_unlock_irqrestore(&mapping->pages, flags);
unlock_page_memcg(page);
__mark_inode_dirty(mapping->host, I_DIRTY_PAGES);
diff --git a/fs/f2fs/dir.c b/fs/f2fs/dir.c
index 2d98d877c09d..b5515ea6bb2f 100644
--- a/fs/f2fs/dir.c
+++ b/fs/f2fs/dir.c
@@ -739,10 +739,10 @@ void f2fs_delete_entry(struct f2fs_dir_entry *dentry, struct page *page,
if (bit_pos == NR_DENTRY_IN_BLOCK &&
!truncate_hole(dir, page->index, page->index + 1)) {
- spin_lock_irqsave(&mapping->tree_lock, flags);
- radix_tree_tag_clear(&mapping->page_tree, page_index(page),
+ xa_lock_irqsave(&mapping->pages, flags);
+ radix_tree_tag_clear(&mapping->pages, page_index(page),
PAGECACHE_TAG_DIRTY);
- spin_unlock_irqrestore(&mapping->tree_lock, flags);
+ xa_unlock_irqrestore(&mapping->pages, flags);
clear_page_dirty_for_io(page);
ClearPagePrivate(page);
diff --git a/fs/f2fs/inline.c b/fs/f2fs/inline.c
index 90e38d8ea688..7858b8e15f33 100644
--- a/fs/f2fs/inline.c
+++ b/fs/f2fs/inline.c
@@ -226,10 +226,10 @@ int f2fs_write_inline_data(struct inode *inode, struct page *page)
kunmap_atomic(src_addr);
set_page_dirty(dn.inode_page);
- spin_lock_irqsave(&mapping->tree_lock, flags);
- radix_tree_tag_clear(&mapping->page_tree, page_index(page),
+ xa_lock_irqsave(&mapping->pages, flags);
+ radix_tree_tag_clear(&mapping->pages, page_index(page),
PAGECACHE_TAG_DIRTY);
- spin_unlock_irqrestore(&mapping->tree_lock, flags);
+ xa_unlock_irqrestore(&mapping->pages, flags);
set_inode_flag(inode, FI_APPEND_WRITE);
set_inode_flag(inode, FI_DATA_EXIST);
diff --git a/fs/f2fs/node.c b/fs/f2fs/node.c
index d3322752426f..6b64a3009d55 100644
--- a/fs/f2fs/node.c
+++ b/fs/f2fs/node.c
@@ -91,11 +91,11 @@ static void clear_node_page_dirty(struct page *page)
unsigned int long flags;
if (PageDirty(page)) {
- spin_lock_irqsave(&mapping->tree_lock, flags);
- radix_tree_tag_clear(&mapping->page_tree,
+ xa_lock_irqsave(&mapping->pages, flags);
+ radix_tree_tag_clear(&mapping->pages,
page_index(page),
PAGECACHE_TAG_DIRTY);
- spin_unlock_irqrestore(&mapping->tree_lock, flags);
+ xa_unlock_irqrestore(&mapping->pages, flags);
clear_page_dirty_for_io(page);
dec_page_count(F2FS_M_SB(mapping), F2FS_DIRTY_NODES);
@@ -1143,7 +1143,7 @@ void ra_node_page(struct f2fs_sb_info *sbi, nid_t nid)
f2fs_bug_on(sbi, check_nid_range(sbi, nid));
rcu_read_lock();
- apage = radix_tree_lookup(&NODE_MAPPING(sbi)->page_tree, nid);
+ apage = radix_tree_lookup(&NODE_MAPPING(sbi)->pages, nid);
rcu_read_unlock();
if (apage)
return;
diff --git a/fs/fs-writeback.c b/fs/fs-writeback.c
index 08f5debd07d1..50b94da97e4f 100644
--- a/fs/fs-writeback.c
+++ b/fs/fs-writeback.c
@@ -347,9 +347,9 @@ static void inode_switch_wbs_work_fn(struct work_struct *work)
* By the time control reaches here, RCU grace period has passed
* since I_WB_SWITCH assertion and all wb stat update transactions
* between unlocked_inode_to_wb_begin/end() are guaranteed to be
- * synchronizing against mapping->tree_lock.
+ * synchronizing against the mapping's xa_lock.
*
- * Grabbing old_wb->list_lock, inode->i_lock and mapping->tree_lock
+ * Grabbing old_wb->list_lock, inode->i_lock and the mapping's xa_lock
* gives us exclusion against all wb related operations on @inode
* including IO list manipulations and stat updates.
*/
@@ -361,7 +361,7 @@ static void inode_switch_wbs_work_fn(struct work_struct *work)
spin_lock_nested(&old_wb->list_lock, SINGLE_DEPTH_NESTING);
}
spin_lock(&inode->i_lock);
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
/*
* Once I_FREEING is visible under i_lock, the eviction path owns
@@ -375,20 +375,20 @@ static void inode_switch_wbs_work_fn(struct work_struct *work)
* to possibly dirty pages while PAGECACHE_TAG_WRITEBACK points to
* pages actually under underwriteback.
*/
- radix_tree_for_each_tagged(slot, &mapping->page_tree, &iter, 0,
+ radix_tree_for_each_tagged(slot, &mapping->pages, &iter, 0,
PAGECACHE_TAG_DIRTY) {
struct page *page = radix_tree_deref_slot_protected(slot,
- &mapping->tree_lock);
+ &mapping->pages.xa_lock);
if (likely(page) && PageDirty(page)) {
dec_wb_stat(old_wb, WB_RECLAIMABLE);
inc_wb_stat(new_wb, WB_RECLAIMABLE);
}
}
- radix_tree_for_each_tagged(slot, &mapping->page_tree, &iter, 0,
+ radix_tree_for_each_tagged(slot, &mapping->pages, &iter, 0,
PAGECACHE_TAG_WRITEBACK) {
struct page *page = radix_tree_deref_slot_protected(slot,
- &mapping->tree_lock);
+ &mapping->pages.xa_lock);
if (likely(page)) {
WARN_ON_ONCE(!PageWriteback(page));
dec_wb_stat(old_wb, WB_WRITEBACK);
@@ -430,7 +430,7 @@ static void inode_switch_wbs_work_fn(struct work_struct *work)
*/
smp_store_release(&inode->i_state, inode->i_state & ~I_WB_SWITCH);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
spin_unlock(&inode->i_lock);
spin_unlock(&new_wb->list_lock);
spin_unlock(&old_wb->list_lock);
@@ -507,7 +507,7 @@ static void inode_switch_wbs(struct inode *inode, int new_wb_id)
/*
* In addition to synchronizing among switchers, I_WB_SWITCH tells
* the RCU protected stat update paths to grab the mapping's
- * tree_lock so that stat transfer can synchronize against them.
+ * xa_lock so that stat transfer can synchronize against them.
* Let's continue after I_WB_SWITCH is guaranteed to be visible.
*/
call_rcu(&isw->rcu_head, inode_switch_wbs_rcu_fn);
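
Where a slot is dereferenced while the lock is held, the lockdep check now
names the embedded spinlock instead of tree_lock.  A minimal sketch of the
pattern (lookup_page_locked() is a made-up helper, not part of the patch):

	static struct page *lookup_page_locked(struct address_space *mapping,
					       pgoff_t index)
	{
		void **slot;
		struct page *page = NULL;

		xa_lock_irq(&mapping->pages);
		slot = radix_tree_lookup_slot(&mapping->pages, index);
		if (slot)
			page = radix_tree_deref_slot_protected(slot,
						&mapping->pages.xa_lock);
		xa_unlock_irq(&mapping->pages);
		return page;
	}
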
diff --git a/fs/inode.c b/fs/inode.c
index fd401028a309..5dff2f1004e3 100644
--- a/fs/inode.c
+++ b/fs/inode.c
@@ -348,8 +348,7 @@ EXPORT_SYMBOL(inc_nlink);
void address_space_init_once(struct address_space *mapping)
{
memset(mapping, 0, sizeof(*mapping));
- INIT_RADIX_TREE(&mapping->page_tree, GFP_ATOMIC | __GFP_ACCOUNT);
- spin_lock_init(&mapping->tree_lock);
+ INIT_RADIX_TREE(&mapping->pages, GFP_ATOMIC | __GFP_ACCOUNT);
init_rwsem(&mapping->i_mmap_rwsem);
INIT_LIST_HEAD(&mapping->private_list);
spin_lock_init(&mapping->private_lock);
@@ -499,14 +498,14 @@ void clear_inode(struct inode *inode)
{
might_sleep();
/*
- * We have to cycle tree_lock here because reclaim can be still in the
+ * We have to cycle the xa_lock here because reclaim can be in the
* process of removing the last page (in __delete_from_page_cache())
- * and we must not free mapping under it.
+ * and we must not free the mapping under it.
*/
- spin_lock_irq(&inode->i_data.tree_lock);
+ xa_lock_irq(&inode->i_data.pages);
BUG_ON(inode->i_data.nrpages);
BUG_ON(inode->i_data.nrexceptional);
- spin_unlock_irq(&inode->i_data.tree_lock);
+ xa_unlock_irq(&inode->i_data.pages);
BUG_ON(!list_empty(&inode->i_data.private_list));
BUG_ON(!(inode->i_state & I_FREEING));
BUG_ON(inode->i_state & I_CLEAR);
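
Dropping the spin_lock_init() here assumes INIT_RADIX_TREE() was taught,
earlier in the series, to initialise the embedded lock itself, roughly:

	static inline void INIT_RADIX_TREE(struct radix_tree_root *root,
					   gfp_t mask)
	{
		spin_lock_init(&root->xa_lock);
		root->gfp_mask = mask;
		root->rnode = NULL;
	}
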
diff --git a/fs/nilfs2/btnode.c b/fs/nilfs2/btnode.c
index c21e0b4454a6..9e2a00207436 100644
--- a/fs/nilfs2/btnode.c
+++ b/fs/nilfs2/btnode.c
@@ -193,9 +193,9 @@ int nilfs_btnode_prepare_change_key(struct address_space *btnc,
(unsigned long long)oldkey,
(unsigned long long)newkey);
- spin_lock_irq(&btnc->tree_lock);
- err = radix_tree_insert(&btnc->page_tree, newkey, obh->b_page);
- spin_unlock_irq(&btnc->tree_lock);
+ xa_lock_irq(&btnc->pages);
+ err = radix_tree_insert(&btnc->pages, newkey, obh->b_page);
+ xa_unlock_irq(&btnc->pages);
/*
* Note: page->index will not change to newkey until
* nilfs_btnode_commit_change_key() will be called.
@@ -251,11 +251,11 @@ void nilfs_btnode_commit_change_key(struct address_space *btnc,
(unsigned long long)newkey);
mark_buffer_dirty(obh);
- spin_lock_irq(&btnc->tree_lock);
- radix_tree_delete(&btnc->page_tree, oldkey);
- radix_tree_tag_set(&btnc->page_tree, newkey,
+ xa_lock_irq(&btnc->pages);
+ radix_tree_delete(&btnc->pages, oldkey);
+ radix_tree_tag_set(&btnc->pages, newkey,
PAGECACHE_TAG_DIRTY);
- spin_unlock_irq(&btnc->tree_lock);
+ xa_unlock_irq(&btnc->pages);
opage->index = obh->b_blocknr = newkey;
unlock_page(opage);
@@ -283,9 +283,9 @@ void nilfs_btnode_abort_change_key(struct address_space *btnc,
return;
if (nbh == NULL) { /* blocksize == pagesize */
- spin_lock_irq(&btnc->tree_lock);
- radix_tree_delete(&btnc->page_tree, newkey);
- spin_unlock_irq(&btnc->tree_lock);
+ xa_lock_irq(&btnc->pages);
+ radix_tree_delete(&btnc->pages, newkey);
+ xa_unlock_irq(&btnc->pages);
unlock_page(ctxt->bh->b_page);
} else
brelse(nbh);
diff --git a/fs/nilfs2/page.c b/fs/nilfs2/page.c
index 68241512d7c1..1c6703efde9e 100644
--- a/fs/nilfs2/page.c
+++ b/fs/nilfs2/page.c
@@ -331,15 +331,15 @@ void nilfs_copy_back_pages(struct address_space *dmap,
struct page *page2;
/* move the page to the destination cache */
- spin_lock_irq(&smap->tree_lock);
- page2 = radix_tree_delete(&smap->page_tree, offset);
+ xa_lock_irq(&smap->pages);
+ page2 = radix_tree_delete(&smap->pages, offset);
WARN_ON(page2 != page);
smap->nrpages--;
- spin_unlock_irq(&smap->tree_lock);
+ xa_unlock_irq(&smap->pages);
- spin_lock_irq(&dmap->tree_lock);
- err = radix_tree_insert(&dmap->page_tree, offset, page);
+ xa_lock_irq(&dmap->pages);
+ err = radix_tree_insert(&dmap->pages, offset, page);
if (unlikely(err < 0)) {
WARN_ON(err == -EEXIST);
page->mapping = NULL;
@@ -348,11 +348,11 @@ void nilfs_copy_back_pages(struct address_space *dmap,
page->mapping = dmap;
dmap->nrpages++;
if (PageDirty(page))
- radix_tree_tag_set(&dmap->page_tree,
+ radix_tree_tag_set(&dmap->pages,
offset,
PAGECACHE_TAG_DIRTY);
}
- spin_unlock_irq(&dmap->tree_lock);
+ xa_unlock_irq(&dmap->pages);
}
unlock_page(page);
}
@@ -474,15 +474,15 @@ int __nilfs_clear_page_dirty(struct page *page)
struct address_space *mapping = page->mapping;
if (mapping) {
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
if (test_bit(PG_dirty, &page->flags)) {
- radix_tree_tag_clear(&mapping->page_tree,
+ radix_tree_tag_clear(&mapping->pages,
page_index(page),
PAGECACHE_TAG_DIRTY);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
return clear_page_dirty_for_io(page);
}
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
return 0;
}
return TestClearPageDirty(page);
diff --git a/include/linux/backing-dev.h b/include/linux/backing-dev.h
index e54e7e0033eb..9038f6c1eeda 100644
--- a/include/linux/backing-dev.h
+++ b/include/linux/backing-dev.h
@@ -329,7 +329,7 @@ static inline bool inode_to_wb_is_valid(struct inode *inode)
* @inode: inode of interest
*
* Returns the wb @inode is currently associated with. The caller must be
- * holding either @inode->i_lock, @inode->i_mapping->tree_lock, or the
+ * holding either @inode->i_lock, @inode->i_mapping->pages.xa_lock, or the
* associated wb's list_lock.
*/
static inline struct bdi_writeback *inode_to_wb(struct inode *inode)
@@ -337,7 +337,7 @@ static inline struct bdi_writeback *inode_to_wb(struct inode *inode)
#ifdef CONFIG_LOCKDEP
WARN_ON_ONCE(debug_locks &&
(!lockdep_is_held(&inode->i_lock) &&
- !lockdep_is_held(&inode->i_mapping->tree_lock) &&
+ !xa_lock_held(&inode->i_mapping->pages) &&
!lockdep_is_held(&inode->i_wb->list_lock)));
#endif
return inode->i_wb;
@@ -349,7 +349,7 @@ static inline struct bdi_writeback *inode_to_wb(struct inode *inode)
* @lockedp: temp bool output param, to be passed to the end function
*
* The caller wants to access the wb associated with @inode but isn't
- * holding inode->i_lock, mapping->tree_lock or wb->list_lock. This
+ * holding inode->i_lock, mapping->pages.xa_lock or wb->list_lock. This
* function determines the wb associated with @inode and ensures that the
* association doesn't change until the transaction is finished with
* unlocked_inode_to_wb_end().
@@ -370,10 +370,10 @@ unlocked_inode_to_wb_begin(struct inode *inode, bool *lockedp)
*lockedp = smp_load_acquire(&inode->i_state) & I_WB_SWITCH;
if (unlikely(*lockedp))
- spin_lock_irq(&inode->i_mapping->tree_lock);
+ xa_lock_irq(&inode->i_mapping->pages);
/*
- * Protected by either !I_WB_SWITCH + rcu_read_lock() or tree_lock.
+ * Protected by either !I_WB_SWITCH + rcu_read_lock() or xa_lock.
* inode_to_wb() will bark. Deref directly.
*/
return inode->i_wb;
@@ -387,7 +387,7 @@ unlocked_inode_to_wb_begin(struct inode *inode, bool *lockedp)
static inline void unlocked_inode_to_wb_end(struct inode *inode, bool locked)
{
if (unlikely(locked))
- spin_unlock_irq(&inode->i_mapping->tree_lock);
+ xa_unlock_irq(&inode->i_mapping->pages);
rcu_read_unlock();
}
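
xa_lock_held() above is assumed to be the lockdep companion to the locking
wrappers, i.e. roughly:

	#define xa_lock_held(xa)	lockdep_is_held(&(xa)->xa_lock)
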
diff --git a/include/linux/fs.h b/include/linux/fs.h
index 2995a271ec46..265340149265 100644
--- a/include/linux/fs.h
+++ b/include/linux/fs.h
@@ -13,6 +13,7 @@
#include <linux/list_lru.h>
#include <linux/llist.h>
#include <linux/radix-tree.h>
+#include <linux/xarray.h>
#include <linux/rbtree.h>
#include <linux/init.h>
#include <linux/pid.h>
@@ -390,23 +391,21 @@ int pagecache_write_end(struct file *, struct address_space *mapping,
struct address_space {
struct inode *host; /* owner: inode, block_device */
- struct radix_tree_root page_tree; /* radix tree of all pages */
- spinlock_t tree_lock; /* and lock protecting it */
+ struct radix_tree_root pages; /* cached pages */
+ gfp_t gfp_mask; /* for allocating pages */
atomic_t i_mmap_writable;/* count VM_SHARED mappings */
struct rb_root_cached i_mmap; /* tree of private and shared mappings */
struct rw_semaphore i_mmap_rwsem; /* protect tree, count, list */
- /* Protected by tree_lock together with the radix tree */
+ /* Protected by pages.xa_lock */
unsigned long nrpages; /* number of total pages */
- /* number of shadow or DAX exceptional entries */
- unsigned long nrexceptional;
+ unsigned long nrexceptional; /* shadow or DAX entries */
pgoff_t writeback_index;/* writeback starts here */
const struct address_space_operations *a_ops; /* methods */
unsigned long flags; /* error bits */
+ errseq_t wb_err;
spinlock_t private_lock; /* for use by the address_space */
- gfp_t gfp_mask; /* implicit gfp mask for allocations */
- struct list_head private_list; /* for use by the address_space */
+ struct list_head private_list; /* ditto */
void *private_data; /* ditto */
- errseq_t wb_err;
} __attribute__((aligned(sizeof(long)))) __randomize_layout;
/*
* On most architectures that alignment is already the case; but
@@ -1977,7 +1976,7 @@ static inline void init_sync_kiocb(struct kiocb *kiocb, struct file *filp)
*
* I_WB_SWITCH Cgroup bdi_writeback switching in progress. Used to
* synchronize competing switching instances and to tell
- * wb stat updates to grab mapping->tree_lock. See
+ * wb stat updates to grab mapping->pages.xa_lock. See
* inode_switch_wb_work_fn() for details.
*
* I_OVL_INUSE Used by overlayfs to get exclusive ownership on upper
diff --git a/include/linux/mm.h b/include/linux/mm.h
index c692451245be..4a6e147dcf9d 100644
--- a/include/linux/mm.h
+++ b/include/linux/mm.h
@@ -737,7 +737,7 @@ int finish_mkwrite_fault(struct vm_fault *vmf);
* refcount. The each user mapping also has a reference to the page.
*
* The pagecache pages are stored in a per-mapping radix tree, which is
- * rooted at mapping->page_tree, and indexed by offset.
+ * rooted at mapping->pages, and indexed by offset.
* Where 2.4 and early 2.6 kernels kept dirty/clean pages in per-address_space
* lists, we instead now tag pages as dirty/writeback in the radix tree.
*
diff --git a/include/linux/pagemap.h b/include/linux/pagemap.h
index 34ce3ebf97d5..80a6149152d4 100644
--- a/include/linux/pagemap.h
+++ b/include/linux/pagemap.h
@@ -144,7 +144,7 @@ void release_pages(struct page **pages, int nr);
* 3. check the page is still in pagecache (if no, goto 1)
*
* Remove-side that cares about stability of _refcount (eg. reclaim) has the
- * following (with tree_lock held for write):
+ * following (with pages.xa_lock held):
* A. atomically check refcount is correct and set it to 0 (atomic_cmpxchg)
* B. remove page from pagecache
* C. free the page
@@ -157,7 +157,7 @@ void release_pages(struct page **pages, int nr);
*
* It is possible that between 1 and 2, the page is removed then the exact same
* page is inserted into the same position in pagecache. That's OK: the
- * old find_get_page using tree_lock could equally have run before or after
+ * old find_get_page using a lock could equally have run before or after
* such a re-insertion, depending on order that locks are granted.
*
* Lookups racing against pagecache insertion isn't a big problem: either 1
diff --git a/mm/filemap.c b/mm/filemap.c
index ee83baaf855d..5c8f22fe4e62 100644
--- a/mm/filemap.c
+++ b/mm/filemap.c
@@ -67,7 +67,7 @@
* ->i_mmap_rwsem (truncate_pagecache)
* ->private_lock (__free_pte->__set_page_dirty_buffers)
* ->swap_lock (exclusive_swap_page, others)
- * ->mapping->tree_lock
+ * ->mapping->pages.xa_lock
*
* ->i_mutex
* ->i_mmap_rwsem (truncate->unmap_mapping_range)
@@ -75,7 +75,7 @@
* ->mmap_sem
* ->i_mmap_rwsem
* ->page_table_lock or pte_lock (various, mainly in memory.c)
- * ->mapping->tree_lock (arch-dependent flush_dcache_mmap_lock)
+ * ->mapping->pages.xa_lock (arch-dependent flush_dcache_mmap_lock)
*
* ->mmap_sem
* ->lock_page (access_process_vm)
@@ -85,7 +85,7 @@
*
* bdi->wb.list_lock
* sb_lock (fs/fs-writeback.c)
- * ->mapping->tree_lock (__sync_single_inode)
+ * ->mapping->pages.xa_lock (__sync_single_inode)
*
* ->i_mmap_rwsem
* ->anon_vma.lock (vma_adjust)
@@ -96,11 +96,11 @@
* ->page_table_lock or pte_lock
* ->swap_lock (try_to_unmap_one)
* ->private_lock (try_to_unmap_one)
- * ->tree_lock (try_to_unmap_one)
+ * ->pages.xa_lock (try_to_unmap_one)
* ->zone_lru_lock(zone) (follow_page->mark_page_accessed)
* ->zone_lru_lock(zone) (check_pte_range->isolate_lru_page)
* ->private_lock (page_remove_rmap->set_page_dirty)
- * ->tree_lock (page_remove_rmap->set_page_dirty)
+ * ->pages.xa_lock (page_remove_rmap->set_page_dirty)
* bdi.wb->list_lock (page_remove_rmap->set_page_dirty)
* ->inode->i_lock (page_remove_rmap->set_page_dirty)
* ->memcg->move_lock (page_remove_rmap->lock_page_memcg)
@@ -119,14 +119,14 @@ static int page_cache_tree_insert(struct address_space *mapping,
void **slot;
int error;
- error = __radix_tree_create(&mapping->page_tree, page->index, 0,
+ error = __radix_tree_create(&mapping->pages, page->index, 0,
&node, &slot);
if (error)
return error;
if (*slot) {
void *p;
- p = radix_tree_deref_slot_protected(slot, &mapping->tree_lock);
+ p = radix_tree_deref_slot_protected(slot, &mapping->pages.xa_lock);
if (!radix_tree_exceptional_entry(p))
return -EEXIST;
@@ -134,7 +134,7 @@ static int page_cache_tree_insert(struct address_space *mapping,
if (shadowp)
*shadowp = p;
}
- __radix_tree_replace(&mapping->page_tree, node, slot, page,
+ __radix_tree_replace(&mapping->pages, node, slot, page,
workingset_lookup_update(mapping));
mapping->nrpages++;
return 0;
@@ -156,13 +156,13 @@ static void page_cache_tree_delete(struct address_space *mapping,
struct radix_tree_node *node;
void **slot;
- __radix_tree_lookup(&mapping->page_tree, page->index + i,
+ __radix_tree_lookup(&mapping->pages, page->index + i,
&node, &slot);
VM_BUG_ON_PAGE(!node && nr != 1, page);
- radix_tree_clear_tags(&mapping->page_tree, node, slot);
- __radix_tree_replace(&mapping->page_tree, node, slot, shadow,
+ radix_tree_clear_tags(&mapping->pages, node, slot);
+ __radix_tree_replace(&mapping->pages, node, slot, shadow,
workingset_lookup_update(mapping));
}
@@ -254,7 +254,7 @@ static void unaccount_page_cache_page(struct address_space *mapping,
/*
* Delete a page from the page cache and free it. Caller has to make
* sure the page is locked and that nobody else uses it - or that usage
- * is safe. The caller must hold the mapping's tree_lock.
+ * is safe. The caller must hold the mapping's xa_lock.
*/
void __delete_from_page_cache(struct page *page, void *shadow)
{
@@ -297,9 +297,9 @@ void delete_from_page_cache(struct page *page)
unsigned long flags;
BUG_ON(!PageLocked(page));
- spin_lock_irqsave(&mapping->tree_lock, flags);
+ xa_lock_irqsave(&mapping->pages, flags);
__delete_from_page_cache(page, NULL);
- spin_unlock_irqrestore(&mapping->tree_lock, flags);
+ xa_unlock_irqrestore(&mapping->pages, flags);
page_cache_free_page(mapping, page);
}
@@ -310,14 +310,14 @@ EXPORT_SYMBOL(delete_from_page_cache);
* @mapping: the mapping to which pages belong
* @pvec: pagevec with pages to delete
*
- * The function walks over mapping->page_tree and removes pages passed in @pvec
- * from the radix tree. The function expects @pvec to be sorted by page index.
- * It tolerates holes in @pvec (radix tree entries at those indices are not
+ * The function walks over mapping->pages and removes pages passed in @pvec
+ * from the mapping. The function expects @pvec to be sorted by page index.
+ * It tolerates holes in @pvec (mapping entries at those indices are not
* modified). The function expects only THP head pages to be present in the
- * @pvec and takes care to delete all corresponding tail pages from the radix
- * tree as well.
+ * @pvec and takes care to delete all corresponding tail pages from the
+ * mapping as well.
*
- * The function expects mapping->tree_lock to be held.
+ * The function expects the mapping's xa_lock to be held.
*/
static void
page_cache_tree_delete_batch(struct address_space *mapping,
@@ -331,11 +331,11 @@ page_cache_tree_delete_batch(struct address_space *mapping,
pgoff_t start;
start = pvec->pages[0]->index;
- radix_tree_for_each_slot(slot, &mapping->page_tree, &iter, start) {
+ radix_tree_for_each_slot(slot, &mapping->pages, &iter, start) {
if (i >= pagevec_count(pvec) && !tail_pages)
break;
page = radix_tree_deref_slot_protected(slot,
- &mapping->tree_lock);
+ &mapping->pages.xa_lock);
if (radix_tree_exceptional_entry(page))
continue;
if (!tail_pages) {
@@ -358,8 +358,8 @@ page_cache_tree_delete_batch(struct address_space *mapping,
} else {
tail_pages--;
}
- radix_tree_clear_tags(&mapping->page_tree, iter.node, slot);
- __radix_tree_replace(&mapping->page_tree, iter.node, slot, NULL,
+ radix_tree_clear_tags(&mapping->pages, iter.node, slot);
+ __radix_tree_replace(&mapping->pages, iter.node, slot, NULL,
workingset_lookup_update(mapping));
total_pages++;
}
@@ -375,14 +375,14 @@ void delete_from_page_cache_batch(struct address_space *mapping,
if (!pagevec_count(pvec))
return;
- spin_lock_irqsave(&mapping->tree_lock, flags);
+ xa_lock_irqsave(&mapping->pages, flags);
for (i = 0; i < pagevec_count(pvec); i++) {
trace_mm_filemap_delete_from_page_cache(pvec->pages[i]);
unaccount_page_cache_page(mapping, pvec->pages[i]);
}
page_cache_tree_delete_batch(mapping, pvec);
- spin_unlock_irqrestore(&mapping->tree_lock, flags);
+ xa_unlock_irqrestore(&mapping->pages, flags);
for (i = 0; i < pagevec_count(pvec); i++)
page_cache_free_page(mapping, pvec->pages[i]);
@@ -799,7 +799,7 @@ int replace_page_cache_page(struct page *old, struct page *new, gfp_t gfp_mask)
new->mapping = mapping;
new->index = offset;
- spin_lock_irqsave(&mapping->tree_lock, flags);
+ xa_lock_irqsave(&mapping->pages, flags);
__delete_from_page_cache(old, NULL);
error = page_cache_tree_insert(mapping, new, NULL);
BUG_ON(error);
@@ -811,7 +811,7 @@ int replace_page_cache_page(struct page *old, struct page *new, gfp_t gfp_mask)
__inc_node_page_state(new, NR_FILE_PAGES);
if (PageSwapBacked(new))
__inc_node_page_state(new, NR_SHMEM);
- spin_unlock_irqrestore(&mapping->tree_lock, flags);
+ xa_unlock_irqrestore(&mapping->pages, flags);
mem_cgroup_migrate(old, new);
radix_tree_preload_end();
if (freepage)
@@ -853,7 +853,7 @@ static int __add_to_page_cache_locked(struct page *page,
page->mapping = mapping;
page->index = offset;
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
error = page_cache_tree_insert(mapping, page, shadowp);
radix_tree_preload_end();
if (unlikely(error))
@@ -862,7 +862,7 @@ static int __add_to_page_cache_locked(struct page *page,
/* hugetlb pages do not participate in page cache accounting. */
if (!huge)
__inc_node_page_state(page, NR_FILE_PAGES);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
if (!huge)
mem_cgroup_commit_charge(page, memcg, false, false);
trace_mm_filemap_add_to_page_cache(page);
@@ -870,7 +870,7 @@ static int __add_to_page_cache_locked(struct page *page,
err_insert:
page->mapping = NULL;
/* Leave page->index set: truncation relies upon it */
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
if (!huge)
mem_cgroup_cancel_charge(page, memcg, false);
put_page(page);
@@ -1354,7 +1354,7 @@ pgoff_t page_cache_next_hole(struct address_space *mapping,
for (i = 0; i < max_scan; i++) {
struct page *page;
- page = radix_tree_lookup(&mapping->page_tree, index);
+ page = radix_tree_lookup(&mapping->pages, index);
if (!page || radix_tree_exceptional_entry(page))
break;
index++;
@@ -1395,7 +1395,7 @@ pgoff_t page_cache_prev_hole(struct address_space *mapping,
for (i = 0; i < max_scan; i++) {
struct page *page;
- page = radix_tree_lookup(&mapping->page_tree, index);
+ page = radix_tree_lookup(&mapping->pages, index);
if (!page || radix_tree_exceptional_entry(page))
break;
index--;
@@ -1428,7 +1428,7 @@ struct page *find_get_entry(struct address_space *mapping, pgoff_t offset)
rcu_read_lock();
repeat:
page = NULL;
- pagep = radix_tree_lookup_slot(&mapping->page_tree, offset);
+ pagep = radix_tree_lookup_slot(&mapping->pages, offset);
if (pagep) {
page = radix_tree_deref_slot(pagep);
if (unlikely(!page))
@@ -1634,7 +1634,7 @@ unsigned find_get_entries(struct address_space *mapping,
return 0;
rcu_read_lock();
- radix_tree_for_each_slot(slot, &mapping->page_tree, &iter, start) {
+ radix_tree_for_each_slot(slot, &mapping->pages, &iter, start) {
struct page *head, *page;
repeat:
page = radix_tree_deref_slot(slot);
@@ -1711,7 +1711,7 @@ unsigned find_get_pages_range(struct address_space *mapping, pgoff_t *start,
return 0;
rcu_read_lock();
- radix_tree_for_each_slot(slot, &mapping->page_tree, &iter, *start) {
+ radix_tree_for_each_slot(slot, &mapping->pages, &iter, *start) {
struct page *head, *page;
if (iter.index > end)
@@ -1796,7 +1796,7 @@ unsigned find_get_pages_contig(struct address_space *mapping, pgoff_t index,
return 0;
rcu_read_lock();
- radix_tree_for_each_contig(slot, &mapping->page_tree, &iter, index) {
+ radix_tree_for_each_contig(slot, &mapping->pages, &iter, index) {
struct page *head, *page;
repeat:
page = radix_tree_deref_slot(slot);
@@ -1876,8 +1876,7 @@ unsigned find_get_pages_range_tag(struct address_space *mapping, pgoff_t *index,
return 0;
rcu_read_lock();
- radix_tree_for_each_tagged(slot, &mapping->page_tree,
- &iter, *index, tag) {
+ radix_tree_for_each_tagged(slot, &mapping->pages, &iter, *index, tag) {
struct page *head, *page;
if (iter.index > end)
@@ -1970,8 +1969,7 @@ unsigned find_get_entries_tag(struct address_space *mapping, pgoff_t start,
return 0;
rcu_read_lock();
- radix_tree_for_each_tagged(slot, &mapping->page_tree,
- &iter, start, tag) {
+ radix_tree_for_each_tagged(slot, &mapping->pages, &iter, start, tag) {
struct page *head, *page;
repeat:
page = radix_tree_deref_slot(slot);
@@ -2625,8 +2623,7 @@ void filemap_map_pages(struct vm_fault *vmf,
struct page *head, *page;
rcu_read_lock();
- radix_tree_for_each_slot(slot, &mapping->page_tree, &iter,
- start_pgoff) {
+ radix_tree_for_each_slot(slot, &mapping->pages, &iter, start_pgoff) {
if (iter.index > end_pgoff)
break;
repeat:
diff --git a/mm/huge_memory.c b/mm/huge_memory.c
index 86fe697e8bfb..1fe17f3b7473 100644
--- a/mm/huge_memory.c
+++ b/mm/huge_memory.c
@@ -2468,7 +2468,7 @@ static void __split_huge_page(struct page *page, struct list_head *list,
} else {
/* Additional pin to radix tree */
page_ref_add(head, 2);
- spin_unlock(&head->mapping->tree_lock);
+ xa_unlock(&head->mapping->pages);
}
spin_unlock_irqrestore(zone_lru_lock(page_zone(head)), flags);
@@ -2676,15 +2676,15 @@ int split_huge_page_to_list(struct page *page, struct list_head *list)
if (mapping) {
void **pslot;
- spin_lock(&mapping->tree_lock);
- pslot = radix_tree_lookup_slot(&mapping->page_tree,
+ xa_lock(&mapping->pages);
+ pslot = radix_tree_lookup_slot(&mapping->pages,
page_index(head));
/*
* Check if the head page is present in radix tree.
* We assume all tail are present too, if head is there.
*/
if (radix_tree_deref_slot_protected(pslot,
- &mapping->tree_lock) != head)
+ &mapping->pages.xa_lock) != head)
goto fail;
}
@@ -2718,7 +2718,7 @@ int split_huge_page_to_list(struct page *page, struct list_head *list)
}
spin_unlock(&pgdata->split_queue_lock);
fail: if (mapping)
- spin_unlock(&mapping->tree_lock);
+ xa_unlock(&mapping->pages);
spin_unlock_irqrestore(zone_lru_lock(page_zone(head)), flags);
unfreeze_page(head);
ret = -EBUSY;
diff --git a/mm/khugepaged.c b/mm/khugepaged.c
index ea4ff259b671..cb4d199bf328 100644
--- a/mm/khugepaged.c
+++ b/mm/khugepaged.c
@@ -1339,8 +1339,8 @@ static void collapse_shmem(struct mm_struct *mm,
*/
index = start;
- spin_lock_irq(&mapping->tree_lock);
- radix_tree_for_each_slot(slot, &mapping->page_tree, &iter, start) {
+ xa_lock_irq(&mapping->pages);
+ radix_tree_for_each_slot(slot, &mapping->pages, &iter, start) {
int n = min(iter.index, end) - index;
/*
@@ -1353,7 +1353,7 @@ static void collapse_shmem(struct mm_struct *mm,
}
nr_none += n;
for (; index < min(iter.index, end); index++) {
- radix_tree_insert(&mapping->page_tree, index,
+ radix_tree_insert(&mapping->pages, index,
new_page + (index % HPAGE_PMD_NR));
}
@@ -1362,16 +1362,16 @@ static void collapse_shmem(struct mm_struct *mm,
break;
page = radix_tree_deref_slot_protected(slot,
- &mapping->tree_lock);
+ &mapping->pages.xa_lock);
if (radix_tree_exceptional_entry(page) || !PageUptodate(page)) {
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
/* swap in or instantiate fallocated page */
if (shmem_getpage(mapping->host, index, &page,
SGP_NOHUGE)) {
result = SCAN_FAIL;
goto tree_unlocked;
}
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
} else if (trylock_page(page)) {
get_page(page);
} else {
@@ -1380,7 +1380,7 @@ static void collapse_shmem(struct mm_struct *mm,
}
/*
- * The page must be locked, so we can drop the tree_lock
+ * The page must be locked, so we can drop the xa_lock
* without racing with truncate.
*/
VM_BUG_ON_PAGE(!PageLocked(page), page);
@@ -1391,7 +1391,7 @@ static void collapse_shmem(struct mm_struct *mm,
result = SCAN_TRUNCATED;
goto out_unlock;
}
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
if (isolate_lru_page(page)) {
result = SCAN_DEL_PAGE_LRU;
@@ -1402,11 +1402,11 @@ static void collapse_shmem(struct mm_struct *mm,
unmap_mapping_range(mapping, index << PAGE_SHIFT,
PAGE_SIZE, 0);
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
- slot = radix_tree_lookup_slot(&mapping->page_tree, index);
+ slot = radix_tree_lookup_slot(&mapping->pages, index);
VM_BUG_ON_PAGE(page != radix_tree_deref_slot_protected(slot,
- &mapping->tree_lock), page);
+ &mapping->pages.xa_lock), page);
VM_BUG_ON_PAGE(page_mapped(page), page);
/*
@@ -1427,14 +1427,14 @@ static void collapse_shmem(struct mm_struct *mm,
list_add_tail(&page->lru, &pagelist);
/* Finally, replace with the new page. */
- radix_tree_replace_slot(&mapping->page_tree, slot,
+ radix_tree_replace_slot(&mapping->pages, slot,
new_page + (index % HPAGE_PMD_NR));
slot = radix_tree_iter_resume(slot, &iter);
index++;
continue;
out_lru:
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
putback_lru_page(page);
out_isolate_failed:
unlock_page(page);
@@ -1460,14 +1460,14 @@ static void collapse_shmem(struct mm_struct *mm,
}
for (; index < end; index++) {
- radix_tree_insert(&mapping->page_tree, index,
+ radix_tree_insert(&mapping->pages, index,
new_page + (index % HPAGE_PMD_NR));
}
nr_none += n;
}
tree_locked:
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
tree_unlocked:
if (result == SCAN_SUCCEED) {
@@ -1516,9 +1516,8 @@ static void collapse_shmem(struct mm_struct *mm,
} else {
/* Something went wrong: rollback changes to the radix-tree */
shmem_uncharge(mapping->host, nr_none);
- spin_lock_irq(&mapping->tree_lock);
- radix_tree_for_each_slot(slot, &mapping->page_tree, &iter,
- start) {
+ xa_lock_irq(&mapping->pages);
+ radix_tree_for_each_slot(slot, &mapping->pages, &iter, start) {
if (iter.index >= end)
break;
page = list_first_entry_or_null(&pagelist,
@@ -1528,8 +1527,7 @@ static void collapse_shmem(struct mm_struct *mm,
break;
nr_none--;
/* Put holes back where they were */
- radix_tree_delete(&mapping->page_tree,
- iter.index);
+ radix_tree_delete(&mapping->pages, iter.index);
continue;
}
@@ -1538,16 +1536,15 @@ static void collapse_shmem(struct mm_struct *mm,
/* Unfreeze the page. */
list_del(&page->lru);
page_ref_unfreeze(page, 2);
- radix_tree_replace_slot(&mapping->page_tree,
- slot, page);
+ radix_tree_replace_slot(&mapping->pages, slot, page);
slot = radix_tree_iter_resume(slot, &iter);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
putback_lru_page(page);
unlock_page(page);
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
}
VM_BUG_ON(nr_none);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
/* Unfreeze new_page, caller would take care about freeing it */
page_ref_unfreeze(new_page, 1);
@@ -1575,7 +1572,7 @@ static void khugepaged_scan_shmem(struct mm_struct *mm,
swap = 0;
memset(khugepaged_node_load, 0, sizeof(khugepaged_node_load));
rcu_read_lock();
- radix_tree_for_each_slot(slot, &mapping->page_tree, &iter, start) {
+ radix_tree_for_each_slot(slot, &mapping->pages, &iter, start) {
if (iter.index >= start + HPAGE_PMD_NR)
break;
diff --git a/mm/memcontrol.c b/mm/memcontrol.c
index 50e6906314f8..953dfed9e780 100644
--- a/mm/memcontrol.c
+++ b/mm/memcontrol.c
@@ -6034,7 +6034,7 @@ void mem_cgroup_swapout(struct page *page, swp_entry_t entry)
/*
* Interrupts should be disabled here because the caller holds the
- * mapping->tree_lock lock which is taken with interrupts-off. It is
+ * mapping->pages xa_lock which is taken with interrupts-off. It is
* important here to have the interrupts disabled because it is the
* only synchronisation we have for udpating the per-CPU variables.
*/
diff --git a/mm/migrate.c b/mm/migrate.c
index 4d0be47a322a..59f18c571120 100644
--- a/mm/migrate.c
+++ b/mm/migrate.c
@@ -466,20 +466,20 @@ int migrate_page_move_mapping(struct address_space *mapping,
oldzone = page_zone(page);
newzone = page_zone(newpage);
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
- pslot = radix_tree_lookup_slot(&mapping->page_tree,
+ pslot = radix_tree_lookup_slot(&mapping->pages,
page_index(page));
expected_count += 1 + page_has_private(page);
if (page_count(page) != expected_count ||
- radix_tree_deref_slot_protected(pslot, &mapping->tree_lock) != page) {
- spin_unlock_irq(&mapping->tree_lock);
+ radix_tree_deref_slot_protected(pslot, &mapping->pages.xa_lock) != page) {
+ xa_unlock_irq(&mapping->pages);
return -EAGAIN;
}
if (!page_ref_freeze(page, expected_count)) {
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
return -EAGAIN;
}
@@ -493,7 +493,7 @@ int migrate_page_move_mapping(struct address_space *mapping,
if (mode == MIGRATE_ASYNC && head &&
!buffer_migrate_lock_buffers(head, mode)) {
page_ref_unfreeze(page, expected_count);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
return -EAGAIN;
}
@@ -521,7 +521,7 @@ int migrate_page_move_mapping(struct address_space *mapping,
SetPageDirty(newpage);
}
- radix_tree_replace_slot(&mapping->page_tree, pslot, newpage);
+ radix_tree_replace_slot(&mapping->pages, pslot, newpage);
/*
* Drop cache reference from old page by unfreezing
@@ -530,7 +530,7 @@ int migrate_page_move_mapping(struct address_space *mapping,
*/
page_ref_unfreeze(page, expected_count - 1);
- spin_unlock(&mapping->tree_lock);
+ xa_unlock(&mapping->pages);
/* Leave irq disabled to prevent preemption while updating stats */
/*
@@ -573,20 +573,19 @@ int migrate_huge_page_move_mapping(struct address_space *mapping,
int expected_count;
void **pslot;
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
- pslot = radix_tree_lookup_slot(&mapping->page_tree,
- page_index(page));
+ pslot = radix_tree_lookup_slot(&mapping->pages, page_index(page));
expected_count = 2 + page_has_private(page);
if (page_count(page) != expected_count ||
- radix_tree_deref_slot_protected(pslot, &mapping->tree_lock) != page) {
- spin_unlock_irq(&mapping->tree_lock);
+ radix_tree_deref_slot_protected(pslot, &mapping->pages.xa_lock) != page) {
+ xa_unlock_irq(&mapping->pages);
return -EAGAIN;
}
if (!page_ref_freeze(page, expected_count)) {
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
return -EAGAIN;
}
@@ -595,11 +594,11 @@ int migrate_huge_page_move_mapping(struct address_space *mapping,
get_page(newpage);
- radix_tree_replace_slot(&mapping->page_tree, pslot, newpage);
+ radix_tree_replace_slot(&mapping->pages, pslot, newpage);
page_ref_unfreeze(page, expected_count - 1);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
return MIGRATEPAGE_SUCCESS;
}
diff --git a/mm/page-writeback.c b/mm/page-writeback.c
index 8a1551154285..6a5b92629727 100644
--- a/mm/page-writeback.c
+++ b/mm/page-writeback.c
@@ -2101,7 +2101,7 @@ void __init page_writeback_init(void)
* so that it can tag pages faster than a dirtying process can create them).
*/
/*
- * We tag pages in batches of WRITEBACK_TAG_BATCH to reduce tree_lock latency.
+ * We tag pages in batches of WRITEBACK_TAG_BATCH to reduce xa_lock latency.
*/
void tag_pages_for_writeback(struct address_space *mapping,
pgoff_t start, pgoff_t end)
@@ -2111,22 +2111,22 @@ void tag_pages_for_writeback(struct address_space *mapping,
struct radix_tree_iter iter;
void **slot;
- spin_lock_irq(&mapping->tree_lock);
- radix_tree_for_each_tagged(slot, &mapping->page_tree, &iter, start,
+ xa_lock_irq(&mapping->pages);
+ radix_tree_for_each_tagged(slot, &mapping->pages, &iter, start,
PAGECACHE_TAG_DIRTY) {
if (iter.index > end)
break;
- radix_tree_iter_tag_set(&mapping->page_tree, &iter,
+ radix_tree_iter_tag_set(&mapping->pages, &iter,
PAGECACHE_TAG_TOWRITE);
tagged++;
if ((tagged % WRITEBACK_TAG_BATCH) != 0)
continue;
slot = radix_tree_iter_resume(slot, &iter);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
cond_resched();
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
}
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
}
EXPORT_SYMBOL(tag_pages_for_writeback);
@@ -2469,13 +2469,13 @@ int __set_page_dirty_nobuffers(struct page *page)
return 1;
}
- spin_lock_irqsave(&mapping->tree_lock, flags);
+ xa_lock_irqsave(&mapping->pages, flags);
BUG_ON(page_mapping(page) != mapping);
WARN_ON_ONCE(!PagePrivate(page) && !PageUptodate(page));
account_page_dirtied(page, mapping);
- radix_tree_tag_set(&mapping->page_tree, page_index(page),
+ radix_tree_tag_set(&mapping->pages, page_index(page),
PAGECACHE_TAG_DIRTY);
- spin_unlock_irqrestore(&mapping->tree_lock, flags);
+ xa_unlock_irqrestore(&mapping->pages, flags);
unlock_page_memcg(page);
if (mapping->host) {
@@ -2720,11 +2720,10 @@ int test_clear_page_writeback(struct page *page)
struct backing_dev_info *bdi = inode_to_bdi(inode);
unsigned long flags;
- spin_lock_irqsave(&mapping->tree_lock, flags);
+ xa_lock_irqsave(&mapping->pages, flags);
ret = TestClearPageWriteback(page);
if (ret) {
- radix_tree_tag_clear(&mapping->page_tree,
- page_index(page),
+ radix_tree_tag_clear(&mapping->pages, page_index(page),
PAGECACHE_TAG_WRITEBACK);
if (bdi_cap_account_writeback(bdi)) {
struct bdi_writeback *wb = inode_to_wb(inode);
@@ -2738,7 +2737,7 @@ int test_clear_page_writeback(struct page *page)
PAGECACHE_TAG_WRITEBACK))
sb_clear_inode_writeback(mapping->host);
- spin_unlock_irqrestore(&mapping->tree_lock, flags);
+ xa_unlock_irqrestore(&mapping->pages, flags);
} else {
ret = TestClearPageWriteback(page);
}
@@ -2768,7 +2767,7 @@ int __test_set_page_writeback(struct page *page, bool keep_write)
struct backing_dev_info *bdi = inode_to_bdi(inode);
unsigned long flags;
- spin_lock_irqsave(&mapping->tree_lock, flags);
+ xa_lock_irqsave(&mapping->pages, flags);
ret = TestSetPageWriteback(page);
if (!ret) {
bool on_wblist;
@@ -2776,8 +2775,7 @@ int __test_set_page_writeback(struct page *page, bool keep_write)
on_wblist = mapping_tagged(mapping,
PAGECACHE_TAG_WRITEBACK);
- radix_tree_tag_set(&mapping->page_tree,
- page_index(page),
+ radix_tree_tag_set(&mapping->pages, page_index(page),
PAGECACHE_TAG_WRITEBACK);
if (bdi_cap_account_writeback(bdi))
inc_wb_stat(inode_to_wb(inode), WB_WRITEBACK);
@@ -2791,14 +2789,12 @@ int __test_set_page_writeback(struct page *page, bool keep_write)
sb_mark_inode_writeback(mapping->host);
}
if (!PageDirty(page))
- radix_tree_tag_clear(&mapping->page_tree,
- page_index(page),
+ radix_tree_tag_clear(&mapping->pages, page_index(page),
PAGECACHE_TAG_DIRTY);
if (!keep_write)
- radix_tree_tag_clear(&mapping->page_tree,
- page_index(page),
+ radix_tree_tag_clear(&mapping->pages, page_index(page),
PAGECACHE_TAG_TOWRITE);
- spin_unlock_irqrestore(&mapping->tree_lock, flags);
+ xa_unlock_irqrestore(&mapping->pages, flags);
} else {
ret = TestSetPageWriteback(page);
}
@@ -2818,7 +2814,7 @@ EXPORT_SYMBOL(__test_set_page_writeback);
*/
int mapping_tagged(struct address_space *mapping, int tag)
{
- return radix_tree_tagged(&mapping->page_tree, tag);
+ return radix_tree_tagged(&mapping->pages, tag);
}
EXPORT_SYMBOL(mapping_tagged);
diff --git a/mm/readahead.c b/mm/readahead.c
index c4ca70239233..514188fd2489 100644
--- a/mm/readahead.c
+++ b/mm/readahead.c
@@ -175,7 +175,7 @@ int __do_page_cache_readahead(struct address_space *mapping, struct file *filp,
break;
rcu_read_lock();
- page = radix_tree_lookup(&mapping->page_tree, page_offset);
+ page = radix_tree_lookup(&mapping->pages, page_offset);
rcu_read_unlock();
if (page && !radix_tree_exceptional_entry(page))
continue;
diff --git a/mm/rmap.c b/mm/rmap.c
index 47db27f8049e..87c1ca0cf1a3 100644
--- a/mm/rmap.c
+++ b/mm/rmap.c
@@ -32,11 +32,11 @@
* mmlist_lock (in mmput, drain_mmlist and others)
* mapping->private_lock (in __set_page_dirty_buffers)
* mem_cgroup_{begin,end}_page_stat (memcg->move_lock)
- * mapping->tree_lock (widely used)
+ * mapping->pages.xa_lock (widely used)
* inode->i_lock (in set_page_dirty's __mark_inode_dirty)
* bdi.wb->list_lock (in set_page_dirty's __mark_inode_dirty)
* sb_lock (within inode_lock in fs/fs-writeback.c)
- * mapping->tree_lock (widely used, in set_page_dirty,
+ * mapping->pages.xa_lock (widely used, in set_page_dirty,
* in arch-dependent flush_dcache_mmap_lock,
* within bdi.wb->list_lock in __sync_single_inode)
*
diff --git a/mm/shmem.c b/mm/shmem.c
index 4aa9307feab0..b1396ed0c9b1 100644
--- a/mm/shmem.c
+++ b/mm/shmem.c
@@ -332,12 +332,12 @@ static int shmem_radix_tree_replace(struct address_space *mapping,
VM_BUG_ON(!expected);
VM_BUG_ON(!replacement);
- item = __radix_tree_lookup(&mapping->page_tree, index, &node, &pslot);
+ item = __radix_tree_lookup(&mapping->pages, index, &node, &pslot);
if (!item)
return -ENOENT;
if (item != expected)
return -ENOENT;
- __radix_tree_replace(&mapping->page_tree, node, pslot,
+ __radix_tree_replace(&mapping->pages, node, pslot,
replacement, NULL);
return 0;
}
@@ -355,7 +355,7 @@ static bool shmem_confirm_swap(struct address_space *mapping,
void *item;
rcu_read_lock();
- item = radix_tree_lookup(&mapping->page_tree, index);
+ item = radix_tree_lookup(&mapping->pages, index);
rcu_read_unlock();
return item == swp_to_radix_entry(swap);
}
@@ -581,14 +581,14 @@ static int shmem_add_to_page_cache(struct page *page,
page->mapping = mapping;
page->index = index;
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
if (PageTransHuge(page)) {
void __rcu **results;
pgoff_t idx;
int i;
error = 0;
- if (radix_tree_gang_lookup_slot(&mapping->page_tree,
+ if (radix_tree_gang_lookup_slot(&mapping->pages,
&results, &idx, index, 1) &&
idx < index + HPAGE_PMD_NR) {
error = -EEXIST;
@@ -596,14 +596,14 @@ static int shmem_add_to_page_cache(struct page *page,
if (!error) {
for (i = 0; i < HPAGE_PMD_NR; i++) {
- error = radix_tree_insert(&mapping->page_tree,
+ error = radix_tree_insert(&mapping->pages,
index + i, page + i);
VM_BUG_ON(error);
}
count_vm_event(THP_FILE_ALLOC);
}
} else if (!expected) {
- error = radix_tree_insert(&mapping->page_tree, index, page);
+ error = radix_tree_insert(&mapping->pages, index, page);
} else {
error = shmem_radix_tree_replace(mapping, index, expected,
page);
@@ -615,10 +615,10 @@ static int shmem_add_to_page_cache(struct page *page,
__inc_node_page_state(page, NR_SHMEM_THPS);
__mod_node_page_state(page_pgdat(page), NR_FILE_PAGES, nr);
__mod_node_page_state(page_pgdat(page), NR_SHMEM, nr);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
} else {
page->mapping = NULL;
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
page_ref_sub(page, nr);
}
return error;
@@ -634,13 +634,13 @@ static void shmem_delete_from_page_cache(struct page *page, void *radswap)
VM_BUG_ON_PAGE(PageCompound(page), page);
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
error = shmem_radix_tree_replace(mapping, page->index, page, radswap);
page->mapping = NULL;
mapping->nrpages--;
__dec_node_page_state(page, NR_FILE_PAGES);
__dec_node_page_state(page, NR_SHMEM);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
put_page(page);
BUG_ON(error);
}
@@ -653,9 +653,9 @@ static int shmem_free_swap(struct address_space *mapping,
{
void *old;
- spin_lock_irq(&mapping->tree_lock);
- old = radix_tree_delete_item(&mapping->page_tree, index, radswap);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
+ old = radix_tree_delete_item(&mapping->pages, index, radswap);
+ xa_unlock_irq(&mapping->pages);
if (old != radswap)
return -ENOENT;
free_swap_and_cache(radix_to_swp_entry(radswap));
@@ -666,7 +666,7 @@ static int shmem_free_swap(struct address_space *mapping,
* Determine (in bytes) how many of the shmem object's pages mapped by the
* given offsets are swapped out.
*
- * This is safe to call without i_mutex or mapping->tree_lock thanks to RCU,
+ * This is safe to call without i_mutex or mapping->pages.xa_lock thanks to RCU,
* as long as the inode doesn't go away and racy results are not a problem.
*/
unsigned long shmem_partial_swap_usage(struct address_space *mapping,
@@ -679,7 +679,7 @@ unsigned long shmem_partial_swap_usage(struct address_space *mapping,
rcu_read_lock();
- radix_tree_for_each_slot(slot, &mapping->page_tree, &iter, start) {
+ radix_tree_for_each_slot(slot, &mapping->pages, &iter, start) {
if (iter.index >= end)
break;
@@ -708,7 +708,7 @@ unsigned long shmem_partial_swap_usage(struct address_space *mapping,
* Determine (in bytes) how many of the shmem object's pages mapped by the
* given vma is swapped out.
*
- * This is safe to call without i_mutex or mapping->tree_lock thanks to RCU,
+ * This is safe to call without i_mutex or mapping->pages.xa_lock thanks to RCU,
* as long as the inode doesn't go away and racy results are not a problem.
*/
unsigned long shmem_swap_usage(struct vm_area_struct *vma)
@@ -1123,7 +1123,7 @@ static int shmem_unuse_inode(struct shmem_inode_info *info,
int error = 0;
radswap = swp_to_radix_entry(swap);
- index = find_swap_entry(&mapping->page_tree, radswap);
+ index = find_swap_entry(&mapping->pages, radswap);
if (index == -1)
return -EAGAIN; /* tell shmem_unuse we found nothing */
@@ -1436,7 +1436,7 @@ static struct page *shmem_alloc_hugepage(gfp_t gfp,
hindex = round_down(index, HPAGE_PMD_NR);
rcu_read_lock();
- if (radix_tree_gang_lookup_slot(&mapping->page_tree, &results, &idx,
+ if (radix_tree_gang_lookup_slot(&mapping->pages, &results, &idx,
hindex, 1) && idx < hindex + HPAGE_PMD_NR) {
rcu_read_unlock();
return NULL;
@@ -1549,14 +1549,14 @@ static int shmem_replace_page(struct page **pagep, gfp_t gfp,
* Our caller will very soon move newpage out of swapcache, but it's
* a nice clean interface for us to replace oldpage by newpage there.
*/
- spin_lock_irq(&swap_mapping->tree_lock);
+ xa_lock_irq(&swap_mapping->pages);
error = shmem_radix_tree_replace(swap_mapping, swap_index, oldpage,
newpage);
if (!error) {
__inc_node_page_state(newpage, NR_FILE_PAGES);
__dec_node_page_state(oldpage, NR_FILE_PAGES);
}
- spin_unlock_irq(&swap_mapping->tree_lock);
+ xa_unlock_irq(&swap_mapping->pages);
if (unlikely(error)) {
/*
@@ -2622,7 +2622,7 @@ static void shmem_tag_pins(struct address_space *mapping)
start = 0;
rcu_read_lock();
- radix_tree_for_each_slot(slot, &mapping->page_tree, &iter, start) {
+ radix_tree_for_each_slot(slot, &mapping->pages, &iter, start) {
page = radix_tree_deref_slot(slot);
if (!page || radix_tree_exception(page)) {
if (radix_tree_deref_retry(page)) {
@@ -2630,10 +2630,10 @@ static void shmem_tag_pins(struct address_space *mapping)
continue;
}
} else if (page_count(page) - page_mapcount(page) > 1) {
- spin_lock_irq(&mapping->tree_lock);
- radix_tree_tag_set(&mapping->page_tree, iter.index,
+ xa_lock_irq(&mapping->pages);
+ radix_tree_tag_set(&mapping->pages, iter.index,
SHMEM_TAG_PINNED);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
}
if (need_resched()) {
@@ -2665,7 +2665,7 @@ static int shmem_wait_for_pins(struct address_space *mapping)
error = 0;
for (scan = 0; scan <= LAST_SCAN; scan++) {
- if (!radix_tree_tagged(&mapping->page_tree, SHMEM_TAG_PINNED))
+ if (!radix_tree_tagged(&mapping->pages, SHMEM_TAG_PINNED))
break;
if (!scan)
@@ -2675,7 +2675,7 @@ static int shmem_wait_for_pins(struct address_space *mapping)
start = 0;
rcu_read_lock();
- radix_tree_for_each_tagged(slot, &mapping->page_tree, &iter,
+ radix_tree_for_each_tagged(slot, &mapping->pages, &iter,
start, SHMEM_TAG_PINNED) {
page = radix_tree_deref_slot(slot);
@@ -2701,10 +2701,10 @@ static int shmem_wait_for_pins(struct address_space *mapping)
error = -EBUSY;
}
- spin_lock_irq(&mapping->tree_lock);
- radix_tree_tag_clear(&mapping->page_tree,
+ xa_lock_irq(&mapping->pages);
+ radix_tree_tag_clear(&mapping->pages,
iter.index, SHMEM_TAG_PINNED);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
continue_resched:
if (need_resched()) {
slot = radix_tree_iter_resume(slot, &iter);
diff --git a/mm/swap_state.c b/mm/swap_state.c
index 39ae7cfad90f..3f95e8fc4cb2 100644
--- a/mm/swap_state.c
+++ b/mm/swap_state.c
@@ -124,10 +124,10 @@ int __add_to_swap_cache(struct page *page, swp_entry_t entry)
SetPageSwapCache(page);
address_space = swap_address_space(entry);
- spin_lock_irq(&address_space->tree_lock);
+ xa_lock_irq(&address_space->pages);
for (i = 0; i < nr; i++) {
set_page_private(page + i, entry.val + i);
- error = radix_tree_insert(&address_space->page_tree,
+ error = radix_tree_insert(&address_space->pages,
idx + i, page + i);
if (unlikely(error))
break;
@@ -145,13 +145,13 @@ int __add_to_swap_cache(struct page *page, swp_entry_t entry)
VM_BUG_ON(error == -EEXIST);
set_page_private(page + i, 0UL);
while (i--) {
- radix_tree_delete(&address_space->page_tree, idx + i);
+ radix_tree_delete(&address_space->pages, idx + i);
set_page_private(page + i, 0UL);
}
ClearPageSwapCache(page);
page_ref_sub(page, nr);
}
- spin_unlock_irq(&address_space->tree_lock);
+ xa_unlock_irq(&address_space->pages);
return error;
}
@@ -188,7 +188,7 @@ void __delete_from_swap_cache(struct page *page)
address_space = swap_address_space(entry);
idx = swp_offset(entry);
for (i = 0; i < nr; i++) {
- radix_tree_delete(&address_space->page_tree, idx + i);
+ radix_tree_delete(&address_space->pages, idx + i);
set_page_private(page + i, 0);
}
ClearPageSwapCache(page);
@@ -272,9 +272,9 @@ void delete_from_swap_cache(struct page *page)
entry.val = page_private(page);
address_space = swap_address_space(entry);
- spin_lock_irq(&address_space->tree_lock);
+ xa_lock_irq(&address_space->pages);
__delete_from_swap_cache(page);
- spin_unlock_irq(&address_space->tree_lock);
+ xa_unlock_irq(&address_space->pages);
put_swap_page(page, entry);
page_ref_sub(page, hpage_nr_pages(page));
@@ -612,12 +612,11 @@ int init_swap_address_space(unsigned int type, unsigned long nr_pages)
return -ENOMEM;
for (i = 0; i < nr; i++) {
space = spaces + i;
- INIT_RADIX_TREE(&space->page_tree, GFP_ATOMIC|__GFP_NOWARN);
+ INIT_RADIX_TREE(&space->pages, GFP_ATOMIC|__GFP_NOWARN);
atomic_set(&space->i_mmap_writable, 0);
space->a_ops = &swap_aops;
/* swap cache doesn't use writeback related tags */
mapping_set_no_writeback_tags(space);
- spin_lock_init(&space->tree_lock);
}
nr_swapper_spaces[type] = nr;
rcu_assign_pointer(swapper_spaces[type], spaces);
diff --git a/mm/truncate.c b/mm/truncate.c
index e4b4cf0f4070..094158f2e447 100644
--- a/mm/truncate.c
+++ b/mm/truncate.c
@@ -36,11 +36,11 @@ static inline void __clear_shadow_entry(struct address_space *mapping,
struct radix_tree_node *node;
void **slot;
- if (!__radix_tree_lookup(&mapping->page_tree, index, &node, &slot))
+ if (!__radix_tree_lookup(&mapping->pages, index, &node, &slot))
return;
if (*slot != entry)
return;
- __radix_tree_replace(&mapping->page_tree, node, slot, NULL,
+ __radix_tree_replace(&mapping->pages, node, slot, NULL,
workingset_update_node);
mapping->nrexceptional--;
}
@@ -48,9 +48,9 @@ static inline void __clear_shadow_entry(struct address_space *mapping,
static void clear_shadow_entry(struct address_space *mapping, pgoff_t index,
void *entry)
{
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
__clear_shadow_entry(mapping, index, entry);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
}
/*
@@ -79,7 +79,7 @@ static void truncate_exceptional_pvec_entries(struct address_space *mapping,
dax = dax_mapping(mapping);
lock = !dax && indices[j] < end;
if (lock)
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
for (i = j; i < pagevec_count(pvec); i++) {
struct page *page = pvec->pages[i];
@@ -102,7 +102,7 @@ static void truncate_exceptional_pvec_entries(struct address_space *mapping,
}
if (lock)
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
pvec->nr = j;
}
@@ -522,8 +522,8 @@ void truncate_inode_pages_final(struct address_space *mapping)
* modification that does not see AS_EXITING is
* completed before starting the final truncate.
*/
- spin_lock_irq(&mapping->tree_lock);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
+ xa_unlock_irq(&mapping->pages);
truncate_inode_pages(mapping, 0);
}
@@ -631,13 +631,13 @@ invalidate_complete_page2(struct address_space *mapping, struct page *page)
if (page_has_private(page) && !try_to_release_page(page, GFP_KERNEL))
return 0;
- spin_lock_irqsave(&mapping->tree_lock, flags);
+ xa_lock_irqsave(&mapping->pages, flags);
if (PageDirty(page))
goto failed;
BUG_ON(page_has_private(page));
__delete_from_page_cache(page, NULL);
- spin_unlock_irqrestore(&mapping->tree_lock, flags);
+ xa_unlock_irqrestore(&mapping->pages, flags);
if (mapping->a_ops->freepage)
mapping->a_ops->freepage(page);
@@ -645,7 +645,7 @@ invalidate_complete_page2(struct address_space *mapping, struct page *page)
put_page(page); /* pagecache ref */
return 1;
failed:
- spin_unlock_irqrestore(&mapping->tree_lock, flags);
+ xa_unlock_irqrestore(&mapping->pages, flags);
return 0;
}
diff --git a/mm/vmscan.c b/mm/vmscan.c
index c02c850ea349..96316bd91f91 100644
--- a/mm/vmscan.c
+++ b/mm/vmscan.c
@@ -674,7 +674,7 @@ static int __remove_mapping(struct address_space *mapping, struct page *page,
BUG_ON(!PageLocked(page));
BUG_ON(mapping != page_mapping(page));
- spin_lock_irqsave(&mapping->tree_lock, flags);
+ xa_lock_irqsave(&mapping->pages, flags);
/*
* The non racy check for a busy page.
*
@@ -698,7 +698,7 @@ static int __remove_mapping(struct address_space *mapping, struct page *page,
* load is not satisfied before that of page->_refcount.
*
* Note that if SetPageDirty is always performed via set_page_dirty,
- * and thus under tree_lock, then this ordering is not required.
+ * and thus under xa_lock, then this ordering is not required.
*/
if (unlikely(PageTransHuge(page)) && PageSwapCache(page))
refcount = 1 + HPAGE_PMD_NR;
@@ -716,7 +716,7 @@ static int __remove_mapping(struct address_space *mapping, struct page *page,
swp_entry_t swap = { .val = page_private(page) };
mem_cgroup_swapout(page, swap);
__delete_from_swap_cache(page);
- spin_unlock_irqrestore(&mapping->tree_lock, flags);
+ xa_unlock_irqrestore(&mapping->pages, flags);
put_swap_page(page, swap);
} else {
void (*freepage)(struct page *);
@@ -737,13 +737,13 @@ static int __remove_mapping(struct address_space *mapping, struct page *page,
* only page cache pages found in these are zero pages
* covering holes, and because we don't want to mix DAX
* exceptional entries and shadow exceptional entries in the
- * same page_tree.
+ * same address_space.
*/
if (reclaimed && page_is_file_cache(page) &&
!mapping_exiting(mapping) && !dax_mapping(mapping))
shadow = workingset_eviction(mapping, page);
__delete_from_page_cache(page, shadow);
- spin_unlock_irqrestore(&mapping->tree_lock, flags);
+ xa_unlock_irqrestore(&mapping->pages, flags);
if (freepage != NULL)
freepage(page);
@@ -752,7 +752,7 @@ static int __remove_mapping(struct address_space *mapping, struct page *page,
return 1;
cannot_free:
- spin_unlock_irqrestore(&mapping->tree_lock, flags);
+ xa_unlock_irqrestore(&mapping->pages, flags);
return 0;
}
diff --git a/mm/workingset.c b/mm/workingset.c
index b7d616a3bbbe..2d071f0df3af 100644
--- a/mm/workingset.c
+++ b/mm/workingset.c
@@ -202,7 +202,7 @@ static void unpack_shadow(void *shadow, int *memcgidp, pg_data_t **pgdat,
* @mapping: address space the page was backing
* @page: the page being evicted
*
- * Returns a shadow entry to be stored in @mapping->page_tree in place
+ * Returns a shadow entry to be stored in @mapping->pages in place
* of the evicted @page so that a later refault can be detected.
*/
void *workingset_eviction(struct address_space *mapping, struct page *page)
@@ -348,7 +348,7 @@ void workingset_update_node(struct radix_tree_node *node)
*
* Avoid acquiring the list_lru lock when the nodes are
* already where they should be. The list_empty() test is safe
- * as node->private_list is protected by &mapping->tree_lock.
+ * as node->private_list is protected by &mapping->pages.xa_lock.
*/
if (node->count && node->count == node->exceptional) {
if (list_empty(&node->private_list))
@@ -366,7 +366,7 @@ static unsigned long count_shadow_nodes(struct shrinker *shrinker,
unsigned long nodes;
unsigned long cache;
- /* list_lru lock nests inside IRQ-safe mapping->tree_lock */
+ /* list_lru lock nests inside IRQ-safe mapping->pages.xa_lock */
local_irq_disable();
nodes = list_lru_shrink_count(&shadow_nodes, sc);
local_irq_enable();
@@ -419,21 +419,21 @@ static enum lru_status shadow_lru_isolate(struct list_head *item,
/*
* Page cache insertions and deletions synchroneously maintain
- * the shadow node LRU under the mapping->tree_lock and the
+ * the shadow node LRU under the mapping->pages.xa_lock and the
* lru_lock. Because the page cache tree is emptied before
* the inode can be destroyed, holding the lru_lock pins any
* address_space that has radix tree nodes on the LRU.
*
- * We can then safely transition to the mapping->tree_lock to
+ * We can then safely transition to the mapping->pages.xa_lock to
* pin only the address_space of the particular node we want
* to reclaim, take the node off-LRU, and drop the lru_lock.
*/
node = container_of(item, struct radix_tree_node, private_list);
- mapping = container_of(node->root, struct address_space, page_tree);
+ mapping = container_of(node->root, struct address_space, pages);
/* Coming from the list, invert the lock order */
- if (!spin_trylock(&mapping->tree_lock)) {
+ if (!xa_trylock(&mapping->pages)) {
spin_unlock(lru_lock);
ret = LRU_RETRY;
goto out;
@@ -468,11 +468,11 @@ static enum lru_status shadow_lru_isolate(struct list_head *item,
if (WARN_ON_ONCE(node->exceptional))
goto out_invalid;
inc_lruvec_page_state(virt_to_page(node), WORKINGSET_NODERECLAIM);
- __radix_tree_delete_node(&mapping->page_tree, node,
+ __radix_tree_delete_node(&mapping->pages, node,
workingset_lookup_update(mapping));
out_invalid:
- spin_unlock(&mapping->tree_lock);
+ xa_unlock(&mapping->pages);
ret = LRU_REMOVED_RETRY;
out:
local_irq_enable();
@@ -487,7 +487,7 @@ static unsigned long scan_shadow_nodes(struct shrinker *shrinker,
{
unsigned long ret;
- /* list_lru lock nests inside IRQ-safe mapping->tree_lock */
+ /* list_lru lock nests inside IRQ-safe mapping->xa_lock */
local_irq_disable();
ret = list_lru_shrink_walk(&shadow_nodes, sc, shadow_lru_isolate, NULL);
local_irq_enable();
@@ -503,7 +503,7 @@ static struct shrinker workingset_shadow_shrinker = {
/*
* Our list_lru->lock is IRQ-safe as it nests inside the IRQ-safe
- * mapping->tree_lock.
+ * mapping->pages.xa_lock.
*/
static struct lock_class_key shadow_nodes_key;
--
2.15.0
From: Matthew Wilcox <[email protected]>
xa_store() differs from radix_tree_insert() in that it will overwrite an
existing element in the array rather than returning an error. This is
the behaviour most users want, and those who want more complex
behaviour generally want to use the xas family of routines anyway.
For memory allocation, xa_store() will first attempt to request memory
from the slab allocator; if memory is not immediately available, it will
drop the xa_lock and allocate memory, keeping a pointer in the xa_state.
It does not use the per-CPU cache, although that cache will continue to
exist until all radix tree users are converted to the XArray.
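As a rough usage sketch (the 'xa', 'index' and 'item' names here are
illustrative only, not taken from any converted caller), the simple API
looks like this:

	void *old = xa_store(&xa, index, item, GFP_KERNEL);

	if (IS_ERR(old))	/* -ENOMEM or -EINVAL */
		return PTR_ERR(old);

and the same store, open-coded with the xas family (this is essentially
what xa_store() does internally, except that it uses the irq-safe lock
variants):

	XA_STATE(xas, index);
	void *old;

	do {
		xa_lock(&xa);
		old = xas_store(&xa, &xas, item);
		xa_unlock(&xa);
	} while (xas_nomem(&xas, GFP_KERNEL));
	xas_destroy(&xas);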
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/xarray.h | 58 ++++
lib/radix-tree.c | 4 +-
lib/xarray.c | 552 ++++++++++++++++++++++++++++++
tools/testing/radix-tree/linux/rcupdate.h | 1 +
tools/testing/radix-tree/xarray-test.c | 66 ++++
5 files changed, 679 insertions(+), 2 deletions(-)
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index ab6b1f5e685a..5e975c512018 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -82,6 +82,18 @@ struct xarray {
#define DEFINE_XARRAY(name) struct xarray name = XARRAY_INIT(name)
void *xa_load(struct xarray *, unsigned long index);
+void *xa_store(struct xarray *, unsigned long index, void *entry, gfp_t);
+
+/**
+ * xa_empty() - Determine if an array has any present entries
+ * @xa: Array
+ *
+ * Return: True if the array has no entries in it.
+ */
+static inline bool xa_empty(const struct xarray *xa)
+{
+ return xa->xa_head == NULL;
+}
typedef unsigned __bitwise xa_tag_t;
#define XA_TAG_0 ((__force xa_tag_t)0U)
@@ -91,9 +103,15 @@ typedef unsigned __bitwise xa_tag_t;
#define XA_TAG_MAX XA_TAG_2
#define XA_FREE_TAG XA_TAG_0
+#define XA_FLAGS_TRACK_FREE ((__force gfp_t)(1U << __GFP_BITS_SHIFT))
#define XA_FLAGS_TAG(tag) ((__force gfp_t)((2U << __GFP_BITS_SHIFT) << \
(__force unsigned)(tag)))
+static inline bool xa_track_free(const struct xarray *xa)
+{
+ return xa->xa_flags & XA_FLAGS_TRACK_FREE;
+}
+
/**
* xa_tagged() - Inquire whether any entry in this array has a tag set
* @xa: Array
@@ -263,6 +281,11 @@ static inline bool xa_is_internal(void *entry)
return ((unsigned long)entry & 3) == 2;
}
+static inline void *xa_mk_node(struct xa_node *node)
+{
+ return (void *)((unsigned long)node | 2);
+}
+
static inline struct xa_node *xa_to_node(void *entry)
{
return (struct xa_node *)((unsigned long)entry & ~3UL);
@@ -386,10 +409,16 @@ static inline bool xas_retry(struct xa_state *xas, void *entry)
}
void *xas_load(struct xarray *, struct xa_state *);
+void *xas_store(struct xarray *, struct xa_state *, void *entry);
+void *xas_create(struct xarray *, struct xa_state *);
bool xas_get_tag(const struct xarray *, const struct xa_state *, xa_tag_t);
void xas_set_tag(struct xarray *, const struct xa_state *, xa_tag_t);
void xas_clear_tag(struct xarray *, const struct xa_state *, xa_tag_t);
+void xas_init_tags(struct xarray *, const struct xa_state *);
+
+void xas_destroy(struct xa_state *);
+bool xas_nomem(struct xa_state *, gfp_t);
/**
* xas_reload() - Refetch an entry from the xarray.
@@ -415,4 +444,33 @@ static inline void *xas_reload(struct xarray *xa, struct xa_state *xas)
return xa_head(xa);
}
+/**
+ * xas_set() - Set up XArray operation state for a different index.
+ * @xas: XArray operation state.
+ * @index: New index into the XArray.
+ *
+ * Move the operation state to refer to a different index. This will
+ * have the effect of starting a walk from the top; see xas_next()
+ * to move to an adjacent index.
+ */
+static inline void xas_set(struct xa_state *xas, unsigned long index)
+{
+ xas->xa_index = index;
+ xas->xa_node = XAS_RESTART;
+}
+
+/**
+ * xas_set_order() - Set up XArray operation state for a multislot entry.
+ * @xas: XArray operation state.
+ * @index: Target of the operation.
+ * @order: Entry occupies 2^@order indices.
+ */
+static inline void xas_set_order(struct xa_state *xas, unsigned long index,
+ unsigned int order)
+{
+ xas->xa_index = (index >> order) << order;
+ xas->xa_shift = order - (order % XA_CHUNK_SHIFT);
+ xas->xa_sibs = (1 << (order % XA_CHUNK_SHIFT)) - 1;
+ xas->xa_node = XAS_RESTART;
+}
#endif /* _LINUX_XARRAY_H */
diff --git a/lib/radix-tree.c b/lib/radix-tree.c
index 711a6d9b79fc..507e1842255b 100644
--- a/lib/radix-tree.c
+++ b/lib/radix-tree.c
@@ -46,7 +46,7 @@ static unsigned long height_to_maxnodes[RADIX_TREE_MAX_PATH + 1] __read_mostly;
/*
* Radix tree node cache.
*/
-static struct kmem_cache *radix_tree_node_cachep;
+struct kmem_cache *radix_tree_node_cachep;
/*
* The radix tree is variable-height, so an insert operation not only has
@@ -407,7 +407,7 @@ radix_tree_node_alloc(gfp_t gfp_mask, struct radix_tree_node *parent,
return ret;
}
-static void radix_tree_node_rcu_free(struct rcu_head *head)
+void radix_tree_node_rcu_free(struct rcu_head *head)
{
struct radix_tree_node *node =
container_of(head, struct radix_tree_node, rcu_head);
diff --git a/lib/xarray.c b/lib/xarray.c
index fbc7de5a224f..a3f4f4ab673f 100644
--- a/lib/xarray.c
+++ b/lib/xarray.c
@@ -12,9 +12,11 @@
* more details.
*/
+#include <linux/bitmap.h>
#include <linux/export.h>
#include <linux/gfp.h>
#include <linux/radix-tree.h>
+#include <linux/slab.h>
#include <linux/xarray.h>
/*
@@ -75,11 +77,20 @@ static inline void tag_clear(struct xa_node *node, unsigned int offset,
__clear_bit(offset, node->tags[(__force unsigned)tag]);
}
+static inline void tag_set_all(struct xa_node *node, xa_tag_t tag)
+{
+ bitmap_fill(node->tags[(__force unsigned)tag], XA_CHUNK_SIZE);
+}
+
static inline bool tag_any_set(struct xa_node *node, xa_tag_t tag)
{
return !bitmap_empty(node->tags[(__force unsigned)tag], XA_CHUNK_SIZE);
}
+#define tag_inc(tag) do { \
+ tag = (__force xa_tag_t)((__force unsigned)(tag) + 1); \
+} while (0)
+
/* extracts the offset within this node from the index */
static unsigned int get_offset(unsigned long index, struct xa_node *node)
{
@@ -161,6 +172,481 @@ void *xas_load(struct xarray *xa, struct xa_state *xas)
}
EXPORT_SYMBOL_GPL(xas_load);
+/* Move the radix tree node cache here */
+extern struct kmem_cache *radix_tree_node_cachep;
+extern void radix_tree_node_rcu_free(struct rcu_head *head);
+
+static void xa_node_free(struct xa_node *node)
+{
+ XA_BUG_ON(node, !list_empty(&node->private_list));
+ call_rcu(&node->rcu_head, radix_tree_node_rcu_free);
+}
+
+/**
+ * xas_destroy() - Free any resources allocated during the XArray operation.
+ * @xas: XArray operation state.
+ *
+ * If the operation only involved read accesses to the XArray or modifying
+ * existing data in the XArray, there is no need to call this function
+ * (eg xa_set_tag()). However, if you may have allocated memory (for
+ * example by calling xas_nomem()), then call this function.
+ *
+ * This function does not reinitialise the state, so you may continue to
+ * call xas_error(), and you would want to call xas_init() before reusing
+ * this structure. It only releases any resources.
+ */
+void xas_destroy(struct xa_state *xas)
+{
+ struct xa_node *node = xas->xa_alloc;
+
+ if (!node)
+ return;
+ XA_BUG_ON(node, !list_empty(&node->private_list));
+ kmem_cache_free(radix_tree_node_cachep, node);
+ xas->xa_alloc = NULL;
+}
+EXPORT_SYMBOL_GPL(xas_destroy);
+
+/**
+ * xas_nomem() - Allocate memory if needed.
+ * @xas: XArray operation state.
+ * @gfp: Memory allocation flags.
+ *
+ * If we need to add new nodes to the XArray, we try to allocate memory
+ * with GFP_NOWAIT while holding the lock, which will usually succeed.
+ * If it fails, @xas is flagged as needing memory to continue. The caller
+ * should drop the lock and call xas_nomem(). If xas_nomem() succeeds,
+ * the caller should retry the operation.
+ *
+ * Forward progress is guaranteed as one node is allocated here and
+ * stored in the xa_state where it will be found by xas_alloc(). More
+ * nodes will likely be found in the slab allocator, but we do not tie
+ * them up here.
+ *
+ * Return: true if memory was needed, and was successfully allocated.
+ */
+bool xas_nomem(struct xa_state *xas, gfp_t gfp)
+{
+ if (xas->xa_node != XAS_ERROR(ENOMEM))
+ return false;
+ xas->xa_alloc = kmem_cache_alloc(radix_tree_node_cachep, gfp);
+ if (!xas->xa_alloc)
+ return false;
+ XA_BUG_ON(xas->xa_alloc, !list_empty(&xas->xa_alloc->private_list));
+ xas->xa_node = XAS_RESTART;
+ return true;
+}
+EXPORT_SYMBOL_GPL(xas_nomem);
+
+static void *xas_alloc(struct xarray *xa, struct xa_state *xas,
+ unsigned int shift)
+{
+ struct xa_node *parent = xas->xa_node;
+ struct xa_node *node = xas->xa_alloc;
+
+ if (xas_invalid(xas))
+ return NULL;
+
+ if (node) {
+ xas->xa_alloc = NULL;
+ } else {
+ node = kmem_cache_alloc(radix_tree_node_cachep,
+ GFP_NOWAIT | __GFP_NOWARN);
+ if (!node) {
+ xas_set_err(xas, ENOMEM);
+ return NULL;
+ }
+ }
+
+ if (xas->xa_node) {
+ node->offset = xas->xa_offset;
+ parent->count++;
+ XA_BUG_ON(node, parent->count > XA_CHUNK_SIZE);
+ }
+ XA_BUG_ON(node, shift > BITS_PER_LONG);
+ XA_BUG_ON(node, !list_empty(&node->private_list));
+ node->shift = shift;
+ node->count = 0;
+ node->exceptional = 0;
+ RCU_INIT_POINTER(node->parent, xas->xa_node);
+ node->root = xa;
+
+ return node;
+}
+
+/*
+ * Use this to calculate the maximum index that will need to be created
+ * in order to add the entry described by @xas. Because we cannot store a
+ * multiple-slot entry at index 0, the calculation is a little more complex
+ * than you might expect.
+ */
+static unsigned long xas_max(struct xa_state *xas)
+{
+ unsigned long mask, max = xas->xa_index;
+
+ if (xas->xa_shift || xas->xa_sibs) {
+ mask = (((xas->xa_sibs + 1UL) << xas->xa_shift) - 1);
+ max |= mask;
+ if (mask == max)
+ max++;
+ }
+
+ return max;
+}
+
+/* The maximum index that can be contained in the array without expanding it */
+static unsigned long max_index(void *entry)
+{
+ if (!xa_is_node(entry))
+ return 0;
+ return (XA_CHUNK_SIZE << xa_to_node(entry)->shift) - 1;
+}
+
+static void xas_shrink(struct xarray *xa, const struct xa_state *xas)
+{
+ struct xa_node *node = xas->xa_node;
+
+ for (;;) {
+ void *entry;
+
+ XA_BUG_ON(node, node->count > XA_CHUNK_SIZE);
+ if (node->count != 1)
+ break;
+ entry = xa_entry_locked(xa, node, 0);
+ if (!entry)
+ break;
+ if (!xa_is_node(entry) && node->shift)
+ break;
+
+ RCU_INIT_POINTER(xa->xa_head, entry);
+ if (xa_track_free(xa) && !tag_get(node, 0, XA_FREE_TAG))
+ xa_tag_clear(xa, XA_FREE_TAG);
+
+ node->count = 0;
+ node->exceptional = 0;
+ if (xa_is_node(entry))
+ RCU_INIT_POINTER(node->slots[0], XA_RETRY_ENTRY);
+ XA_BUG_ON(node, !list_empty(&node->private_list));
+ xa_node_free(node);
+ if (!xa_is_node(entry))
+ break;
+ node = xa_to_node(entry);
+ if (xas->xa_update)
+ xas->xa_update(node);
+ else
+ XA_BUG_ON(node, !list_empty(&node->private_list));
+ }
+}
+
+/*
+ * xas_delete_node() - Attempt to delete an xa_node
+ * @xa: Array
+ * @xas: Array operation state.
+ *
+ * Attempts to delete the @xas->xa_node. This will fail if the node still
+ * contains entries (a non-zero count).
+ */
+static void xas_delete_node(struct xarray *xa, struct xa_state *xas)
+{
+ struct xa_node *node = xas->xa_node;
+
+ for (;;) {
+ struct xa_node *parent;
+
+ XA_BUG_ON(node, node->count > XA_CHUNK_SIZE);
+ if (node->count)
+ break;
+
+ parent = xa_parent_locked(xa, node);
+ xas->xa_node = parent;
+ xas->xa_offset = node->offset;
+ XA_BUG_ON(node, !list_empty(&node->private_list));
+ xa_node_free(node);
+
+ if (!parent) {
+ xa->xa_head = NULL;
+ xas->xa_node = XAS_RESTART;
+ return;
+ }
+
+ parent->slots[xas->xa_offset] = NULL;
+ parent->count--;
+ XA_BUG_ON(node, parent->count > XA_CHUNK_SIZE);
+ node = parent;
+ if (xas->xa_update)
+ xas->xa_update(node);
+ else
+ XA_BUG_ON(node, !list_empty(&node->private_list));
+ }
+
+ if (!node->parent)
+ xas_shrink(xa, xas);
+}
+
+/**
+ * xas_free_nodes() - Free this node and all nodes that it references
+ * @xa: Array
+ * @xas: Array operation state.
+ * @top: Node to free
+ *
+ * This node has been removed from the tree. We must now free it and all
+ * of its subnodes. There may be RCU walkers with references into the tree,
+ * so we must replace all entries with retry markers.
+ */
+static void xas_free_nodes(struct xarray *xa, struct xa_state *xas,
+ struct xa_node *top)
+{
+ unsigned int offset = 0;
+ struct xa_node *node = top;
+
+ for (;;) {
+ void *entry = xa_entry_locked(xa, node, offset);
+
+ if (xa_is_node(entry)) {
+ node = xa_to_node(entry);
+ offset = 0;
+ continue;
+ }
+ if (entry)
+ RCU_INIT_POINTER(node->slots[offset], XA_RETRY_ENTRY);
+ offset++;
+ while (offset == XA_CHUNK_SIZE) {
+ struct xa_node *parent = xa_parent_locked(xa, node);
+
+ offset = node->offset + 1;
+ node->count = 0;
+ node->exceptional = 0;
+ if (xas->xa_update)
+ xas->xa_update(node);
+ XA_BUG_ON(node, !list_empty(&node->private_list));
+ xa_node_free(node);
+ if (node == top)
+ return;
+ node = parent;
+ }
+ }
+}
+
+/*
+ * xas_expand adds nodes to the head of the tree until it has reached
+ * sufficient height to be able to contain @xas->xa_index
+ */
+static int xas_expand(struct xarray *xa, struct xa_state *xas, void *head)
+{
+ struct xa_node *node = NULL;
+ unsigned int shift = 0;
+ unsigned long max = xas_max(xas);
+
+ if (!head) {
+ if (max == 0)
+ return 0;
+ while ((max >> shift) >= XA_CHUNK_SIZE)
+ shift += XA_CHUNK_SHIFT;
+ return shift + XA_CHUNK_SHIFT;
+ } else if (xa_is_node(head)) {
+ node = xa_to_node(head);
+ shift = node->shift + XA_CHUNK_SHIFT;
+ }
+ xas->xa_node = NULL;
+
+ while (max > max_index(head)) {
+ xa_tag_t tag = 0;
+
+ XA_BUG_ON(node, shift > BITS_PER_LONG);
+ node = xas_alloc(xa, xas, shift);
+ if (!node)
+ return -ENOMEM;
+
+ node->count = 1;
+ if (xa_is_value(head))
+ node->exceptional = 1;
+ RCU_INIT_POINTER(node->slots[0], head);
+
+ /* Propagate the aggregated tag info to the new child */
+ if (xa_track_free(xa)) {
+ tag_set_all(node, XA_FREE_TAG);
+ if (!xa_tagged(xa, XA_FREE_TAG)) {
+ tag_clear(node, 0, XA_FREE_TAG);
+ xa_tag_set(xa, XA_FREE_TAG);
+ }
+ tag_inc(tag);
+ }
+ for (;;) {
+ if (xa_tagged(xa, tag))
+ tag_set(node, 0, tag);
+ if (tag == XA_TAG_MAX)
+ break;
+ tag_inc(tag);
+ }
+
+ /*
+ * Now that the new node is fully initialised, we can add
+ * it to the tree
+ */
+ if (xa_is_node(head)) {
+ xa_to_node(head)->offset = 0;
+ rcu_assign_pointer(xa_to_node(head)->parent, node);
+ }
+ head = xa_mk_node(node);
+ rcu_assign_pointer(xa->xa_head, head);
+
+ shift += XA_CHUNK_SHIFT;
+ }
+
+ xas->xa_node = node;
+ return shift;
+}
+
+/**
+ * xas_create() - Create a slot to store an entry in.
+ * @xa: XArray.
+ * @xas: XArray operation state.
+ *
+ * Most users will not need to call this function directly, as it is called
+ * by xas_store(). It is useful for doing conditional store operations
+ * (see the xa_cmpxchg() implementation for an example).
+ *
+ * Return: If the slot already existed, returns the contents of this slot.
+ * If the slot was newly created, returns NULL. If it failed to create the
+ * slot, returns NULL and indicates the error in @xas.
+ */
+void *xas_create(struct xarray *xa, struct xa_state *xas)
+{
+ void *entry;
+ void __rcu **slot;
+ struct xa_node *node = xas->xa_node;
+ int shift;
+ unsigned int order = xas->xa_shift;
+
+ if (node == XAS_RESTART) {
+ entry = xa_head_locked(xa);
+ xas->xa_node = NULL;
+ shift = xas_expand(xa, xas, entry);
+ if (shift < 0)
+ return NULL;
+ entry = xa_head_locked(xa);
+ slot = &xa->xa_head;
+ } else if (xas_error(xas)) {
+ return NULL;
+ } else if (node) {
+ unsigned int offset = xas->xa_offset;
+
+ shift = node->shift;
+ entry = xa_entry_locked(xa, node, offset);
+ slot = &node->slots[offset];
+ } else {
+ shift = 0;
+ entry = xa_head_locked(xa);
+ slot = &xa->xa_head;
+ }
+
+ while (shift > order) {
+ shift -= XA_CHUNK_SHIFT;
+ if (!entry) {
+ node = xas_alloc(xa, xas, shift);
+ if (!node)
+ break;
+ if (xa_track_free(xa))
+ tag_set_all(node, XA_FREE_TAG);
+ rcu_assign_pointer(*slot, xa_mk_node(node));
+ } else if (xa_is_node(entry)) {
+ node = xa_to_node(entry);
+ } else {
+ break;
+ }
+ entry = xas_descend(xa, xas, node);
+ slot = &node->slots[xas->xa_offset];
+ }
+
+ return entry;
+}
+EXPORT_SYMBOL_GPL(xas_create);
+
+static void store_siblings(struct xarray *xa, struct xa_state *xas,
+ void *entry, int *countp, int *valuesp)
+{
+ struct xa_node *node = xas->xa_node;
+ unsigned int sibs, offset = xas->xa_offset;
+ void *sibling = entry ? xa_mk_sibling(offset) : NULL;
+ void *real = entry;
+
+ if (!entry)
+ sibs = XA_CHUNK_SIZE;
+ else if (xas->xa_shift < node->shift)
+ sibs = 0;
+ else
+ sibs = xas->xa_sibs;
+
+ while (sibs--) {
+ void *next = xa_entry(xa, node, ++offset);
+
+ if (!xa_is_sibling(next)) {
+ if (!entry)
+ break;
+ real = next;
+ }
+ RCU_INIT_POINTER(node->slots[offset], sibling);
+ if (xa_is_node(next))
+ xas_free_nodes(xa, xas, xa_to_node(next));
+ *countp += !next - !entry;
+ *valuesp += !xa_is_value(real) - !xa_is_value(entry);
+ }
+}
+
+/**
+ * xas_store() - Store this entry in the XArray.
+ * @xa: XArray.
+ * @xas: XArray operation state.
+ * @entry: New entry.
+ *
+ * Return: The old entry at this index.
+ */
+void *xas_store(struct xarray *xa, struct xa_state *xas, void *entry)
+{
+ struct xa_node *node;
+ int count, values;
+ void *curr;
+
+ if (entry)
+ curr = xas_create(xa, xas);
+ else
+ curr = xas_load(xa, xas);
+ if (xas_invalid(xas))
+ return curr;
+ if ((curr == entry) && !xas->xa_sibs)
+ return curr;
+
+ node = xas->xa_node;
+ if (node)
+ rcu_assign_pointer(node->slots[xas->xa_offset], entry);
+ else
+ rcu_assign_pointer(xa->xa_head, entry);
+ if (!entry)
+ xas_init_tags(xa, xas);
+
+ values = !xa_is_value(curr) - !xa_is_value(entry);
+ count = !curr - !entry;
+ if (xa_is_node(curr))
+ xas_free_nodes(xa, xas, xa_to_node(curr));
+
+ if (node) {
+ store_siblings(xa, xas, entry, &count, &values);
+ node->count += count;
+ XA_BUG_ON(node, node->count > XA_CHUNK_SIZE);
+ node->exceptional += values;
+ XA_BUG_ON(node, node->exceptional > XA_CHUNK_SIZE);
+ if ((count || values) && xas->xa_update)
+ xas->xa_update(node);
+ else
+ XA_BUG_ON(node, !list_empty(&node->private_list));
+ if (count < 0)
+ xas_delete_node(xa, xas);
+ }
+
+ return curr;
+}
+EXPORT_SYMBOL_GPL(xas_store);
+
/**
* xas_get_tag() - Returns the state of this tag.
* @xa: XArray.
@@ -244,6 +730,35 @@ void xas_clear_tag(struct xarray *xa, const struct xa_state *xas, xa_tag_t tag)
}
EXPORT_SYMBOL_GPL(xas_clear_tag);
+/**
+ * xas_init_tags() - Initialise all tags for the entry
+ * @xa: Array
+ * @xas: Array operations state.
+ *
+ * Initialise all tags for the entry specified by @xas. If we're tracking
+ * free entries with a tag, we need to set it on all entries. All other
+ * tags are cleared.
+ *
+ * This implementation is not as efficient as it could be; we may walk
+ * up the tree multiple times.
+ */
+void xas_init_tags(struct xarray *xa, const struct xa_state *xas)
+{
+ xa_tag_t tag = 0;
+
+ if (xa_track_free(xa)) {
+ xas_set_tag(xa, xas, XA_FREE_TAG);
+ tag_inc(tag);
+ }
+ for (;;) {
+ xas_clear_tag(xa, xas, tag);
+ if (tag == XA_TAG_MAX)
+ break;
+ tag_inc(tag);
+ }
+}
+EXPORT_SYMBOL_GPL(xas_init_tags);
+
/**
* xa_load() - Load an entry from an XArray.
* @xa: XArray.
@@ -266,6 +781,43 @@ void *xa_load(struct xarray *xa, unsigned long index)
}
EXPORT_SYMBOL(xa_load);
+/**
+ * xa_store() - Store this entry in the XArray.
+ * @xa: XArray.
+ * @index: Index into array.
+ * @entry: New entry.
+ * @gfp: Allocation flags.
+ *
+ * Stores almost always succeed. The notable exceptions:
+ * - Attempted to store a reserved pointer entry (-EINVAL)
+ * - Ran out of memory trying to allocate new nodes (-ENOMEM)
+ *
+ * Storing into an existing multislot entry updates the entry of every index.
+ *
+ * Return: The old entry at this index or ERR_PTR() if an error happened.
+ */
+void *xa_store(struct xarray *xa, unsigned long index, void *entry, gfp_t gfp)
+{
+ XA_STATE(xas, index);
+ unsigned long flags;
+ void *curr;
+
+ if (WARN_ON_ONCE(xa_is_internal(entry)))
+ return ERR_PTR(-EINVAL);
+
+ do {
+ xa_lock_irqsave(xa, flags);
+ curr = xas_store(xa, &xas, entry);
+ xa_unlock_irqrestore(xa, flags);
+ } while (xas_nomem(&xas, gfp));
+ xas_destroy(&xas);
+
+ if (xas_error(&xas))
+ curr = ERR_PTR(xas_error(&xas));
+ return curr;
+}
+EXPORT_SYMBOL(xa_store);
+
/**
* __xa_set_tag() - Set this tag on this entry.
* @xa: XArray.
diff --git a/tools/testing/radix-tree/linux/rcupdate.h b/tools/testing/radix-tree/linux/rcupdate.h
index 25010bf86c1d..fd280b070fdb 100644
--- a/tools/testing/radix-tree/linux/rcupdate.h
+++ b/tools/testing/radix-tree/linux/rcupdate.h
@@ -7,5 +7,6 @@
#define rcu_dereference_raw(p) rcu_dereference(p)
#define rcu_dereference_protected(p, cond) rcu_dereference(p)
#define rcu_dereference_check(p, cond) rcu_dereference(p)
+#define RCU_INIT_POINTER(p, v) (p) = (v)
#endif
diff --git a/tools/testing/radix-tree/xarray-test.c b/tools/testing/radix-tree/xarray-test.c
index 3f8f19cb3739..8412d7818152 100644
--- a/tools/testing/radix-tree/xarray-test.c
+++ b/tools/testing/radix-tree/xarray-test.c
@@ -35,12 +35,78 @@ void check_xa_load(struct xarray *xa)
}
}
+static void *xa_store_order(struct xarray *xa, unsigned long index,
+ unsigned order, void *entry)
+{
+ XA_STATE(xas, 0);
+ void *curr;
+
+ xas_set_order(&xas, index, order);
+ do {
+ curr = xas_store(xa, &xas, entry);
+ } while (xas_nomem(&xas, GFP_KERNEL));
+ xas_destroy(&xas);
+
+ return curr;
+}
+
+void check_multi_store(struct xarray *xa)
+{
+ unsigned long i, j, k;
+
+ xa_store_order(xa, 0, 1, xa_mk_value(0));
+ assert(xa_load(xa, 0) == xa_mk_value(0));
+ assert(xa_load(xa, 1) == xa_mk_value(0));
+ assert(xa_load(xa, 2) == NULL);
+ assert(xa_to_node(xa_head(xa))->count == 2);
+ assert(xa_to_node(xa_head(xa))->exceptional == 2);
+
+ xa_store(xa, 3, xa, GFP_KERNEL);
+ assert(xa_load(xa, 0) == xa_mk_value(0));
+ assert(xa_load(xa, 1) == xa_mk_value(0));
+ assert(xa_load(xa, 2) == NULL);
+ assert(xa_to_node(xa_head(xa))->count == 3);
+ assert(xa_to_node(xa_head(xa))->exceptional == 2);
+
+ xa_store_order(xa, 0, 2, xa_mk_value(1));
+ assert(xa_load(xa, 0) == xa_mk_value(1));
+ assert(xa_load(xa, 1) == xa_mk_value(1));
+ assert(xa_load(xa, 2) == xa_mk_value(1));
+ assert(xa_load(xa, 3) == xa_mk_value(1));
+ assert(xa_load(xa, 4) == NULL);
+ assert(xa_to_node(xa_head(xa))->count == 4);
+ assert(xa_to_node(xa_head(xa))->exceptional == 4);
+
+ xa_store_order(xa, 0, 64, NULL);
+ assert(xa_empty(xa));
+
+ for (i = 0; i < 60; i++) {
+ for (j = 0; j < 60; j++) {
+ xa_store_order(xa, 0, i, xa_mk_value(i));
+ xa_store_order(xa, 0, j, xa_mk_value(j));
+
+ for (k = 0; k < 60; k++) {
+ void *entry = xa_load(xa, (1UL << k) - 1);
+ if ((i < k) && (j < k))
+ assert(entry == NULL);
+ else
+ assert(entry == xa_mk_value(j));
+ }
+
+ xa_store(xa, 0, NULL, GFP_KERNEL);
+ assert(xa_empty(xa));
+ }
+ }
+}
+
void xarray_checks(void)
{
RADIX_TREE(array, GFP_KERNEL);
check_xa_load(&array);
+ item_kill_tree(&array);
+ check_multi_store(&array);
item_kill_tree(&array);
}
--
2.15.0
From: Matthew Wilcox <[email protected]>
Rename the API to be 'ul' for unsigned long instead of 'ext'. This fits
better with other usage in the Linux kernel.
- idr_replace(), idr_remove() and idr_find() can simply take an unsigned
long index instead of an int.
- idr_alloc() turns back into a regular function.
- idr_alloc_ul() takes 'nextid' as an in-out parameter like idr_get_next(),
instead of having 'index' as an out-only parameter.
- idr_alloc_ul() needs a __must_check to ensure that users don't use the
  ID stored in @nextid without checking whether the allocation succeeded.
- idr_alloc_ul() takes 'max' rather than 'end'; otherwise it would be
  impossible to allocate the ULONG_MAX id. This differs from idr_alloc(),
  but so do many other things.
- We don't need separate idr_get_free() and idr_get_free_ext().
- Add kerneldoc for idr_alloc_ul().
- Add an idr_alloc_u32() helper for the majority of existing users; a
  conversion sketch follows below.
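As a sketch of what a typical conversion looks like (taken from the
cls_basic hunk below, with error handling elided), a caller that used
to write

	err = idr_alloc_ext(&head->handle_idr, fnew, &idr_index,
			    handle, handle + 1, GFP_KERNEL);

now writes

	err = idr_alloc_u32(&head->handle_idr, fnew, &handle,
			    handle, GFP_KERNEL);

with the last ID argument now being an inclusive maximum rather than an
exclusive end, and the allocated ID returned through &handle instead of
a separate 'idr_index' variable.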
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/idr.h | 80 ++++++++----------------------------
include/linux/radix-tree.h | 17 +-------
lib/idr.c | 100 +++++++++++++++++++++++++++++++++------------
lib/radix-tree.c | 2 +-
net/sched/act_api.c | 60 +++++++++++++--------------
net/sched/cls_basic.c | 20 ++++-----
net/sched/cls_bpf.c | 20 ++++-----
net/sched/cls_flower.c | 32 ++++++---------
net/sched/cls_u32.c | 44 +++++++++-----------
9 files changed, 171 insertions(+), 204 deletions(-)
diff --git a/include/linux/idr.h b/include/linux/idr.h
index 10bfe62423df..87ae635fcee6 100644
--- a/include/linux/idr.h
+++ b/include/linux/idr.h
@@ -54,73 +54,30 @@ struct idr {
void idr_preload(gfp_t gfp_mask);
-int idr_alloc_cmn(struct idr *idr, void *ptr, unsigned long *index,
- unsigned long start, unsigned long end, gfp_t gfp,
- bool ext);
-
-/**
- * idr_alloc - allocate an id
- * @idr: idr handle
- * @ptr: pointer to be associated with the new id
- * @start: the minimum id (inclusive)
- * @end: the maximum id (exclusive)
- * @gfp: memory allocation flags
- *
- * Allocates an unused ID in the range [start, end). Returns -ENOSPC
- * if there are no unused IDs in that range.
- *
- * Note that @end is treated as max when <= 0. This is to always allow
- * using @start + N as @end as long as N is inside integer range.
- *
- * Simultaneous modifications to the @idr are not allowed and should be
- * prevented by the user, usually with a lock. idr_alloc() may be called
- * concurrently with read-only accesses to the @idr, such as idr_find() and
- * idr_for_each_entry().
- */
-static inline int idr_alloc(struct idr *idr, void *ptr,
- int start, int end, gfp_t gfp)
-{
- unsigned long id;
- int ret;
-
- if (WARN_ON_ONCE(start < 0))
- return -EINVAL;
-
- ret = idr_alloc_cmn(idr, ptr, &id, start, end, gfp, false);
-
- if (ret)
- return ret;
-
- return id;
-}
-
-static inline int idr_alloc_ext(struct idr *idr, void *ptr,
- unsigned long *index,
- unsigned long start,
- unsigned long end,
- gfp_t gfp)
-{
- return idr_alloc_cmn(idr, ptr, index, start, end, gfp, true);
-}
-
+int idr_alloc(struct idr *, void *, int start, int end, gfp_t);
+int __must_check idr_alloc_ul(struct idr *, void *, unsigned long *nextid,
+ unsigned long max, gfp_t);
int idr_alloc_cyclic(struct idr *, int *cursor, void *entry,
int start, int end, gfp_t);
int idr_for_each(const struct idr *,
int (*fn)(int id, void *p, void *data), void *data);
void *idr_get_next(struct idr *, int *nextid);
-void *idr_get_next_ext(struct idr *idr, unsigned long *nextid);
-void *idr_replace(struct idr *, void *, int id);
-void *idr_replace_ext(struct idr *idr, void *ptr, unsigned long id);
+void *idr_get_next_ul(struct idr *, unsigned long *nextid);
+void *idr_replace(struct idr *, void *, unsigned long id);
void idr_destroy(struct idr *);
-static inline void *idr_remove_ext(struct idr *idr, unsigned long id)
+static inline int __must_check idr_alloc_u32(struct idr *idr, void *ptr,
+ u32 *nextid, unsigned long max, gfp_t gfp)
{
- return radix_tree_delete_item(&idr->idr_rt, id, NULL);
+ unsigned long tmp = *nextid;
+ int ret = idr_alloc_ul(idr, ptr, &tmp, max, gfp);
+ *nextid = tmp;
+ return ret;
}
-static inline void *idr_remove(struct idr *idr, int id)
+static inline void *idr_remove(struct idr *idr, unsigned long id)
{
- return idr_remove_ext(idr, id);
+ return radix_tree_delete_item(&idr->idr_rt, id, NULL);
}
static inline void idr_init(struct idr *idr)
@@ -157,16 +114,11 @@ static inline void idr_preload_end(void)
* This function can be called under rcu_read_lock(), given that the leaf
* pointers lifetimes are correctly managed.
*/
-static inline void *idr_find_ext(const struct idr *idr, unsigned long id)
+static inline void *idr_find(const struct idr *idr, unsigned long id)
{
return radix_tree_lookup(&idr->idr_rt, id);
}
-static inline void *idr_find(const struct idr *idr, int id)
-{
- return idr_find_ext(idr, id);
-}
-
/**
* idr_for_each_entry - iterate over an idr's elements of a given type
* @idr: idr handle
@@ -179,8 +131,8 @@ static inline void *idr_find(const struct idr *idr, int id)
*/
#define idr_for_each_entry(idr, entry, id) \
for (id = 0; ((entry) = idr_get_next(idr, &(id))) != NULL; ++id)
-#define idr_for_each_entry_ext(idr, entry, id) \
- for (id = 0; ((entry) = idr_get_next_ext(idr, &(id))) != NULL; ++id)
+#define idr_for_each_entry_ul(idr, entry, id) \
+ for (id = 0; ((entry) = idr_get_next_ul(idr, &(id))) != NULL; ++id)
/**
* idr_for_each_entry_continue - continue iteration over an idr's elements of a given type
diff --git a/include/linux/radix-tree.h b/include/linux/radix-tree.h
index 23a9c89c7ad9..fc55ff31eca7 100644
--- a/include/linux/radix-tree.h
+++ b/include/linux/radix-tree.h
@@ -356,24 +356,9 @@ int radix_tree_split(struct radix_tree_root *, unsigned long index,
int radix_tree_join(struct radix_tree_root *, unsigned long index,
unsigned new_order, void *);
-void __rcu **idr_get_free_cmn(struct radix_tree_root *root,
+void __rcu **idr_get_free(struct radix_tree_root *root,
struct radix_tree_iter *iter, gfp_t gfp,
unsigned long max);
-static inline void __rcu **idr_get_free(struct radix_tree_root *root,
- struct radix_tree_iter *iter,
- gfp_t gfp,
- int end)
-{
- return idr_get_free_cmn(root, iter, gfp, end > 0 ? end - 1 : INT_MAX);
-}
-
-static inline void __rcu **idr_get_free_ext(struct radix_tree_root *root,
- struct radix_tree_iter *iter,
- gfp_t gfp,
- unsigned long end)
-{
- return idr_get_free_cmn(root, iter, gfp, end - 1);
-}
enum {
RADIX_TREE_ITER_TAG_MASK = 0x0f, /* tag index in lower nybble */
diff --git a/lib/idr.c b/lib/idr.c
index 26cb99412b8f..35678388e134 100644
--- a/lib/idr.c
+++ b/lib/idr.c
@@ -7,9 +7,26 @@
DEFINE_PER_CPU(struct ida_bitmap *, ida_bitmap);
static DEFINE_SPINLOCK(simple_ida_lock);
-int idr_alloc_cmn(struct idr *idr, void *ptr, unsigned long *index,
- unsigned long start, unsigned long end, gfp_t gfp,
- bool ext)
+/**
+ * idr_alloc_ul() - allocate a large ID
+ * @idr: idr handle
+ * @ptr: pointer to be associated with the new ID
+ * @nextid: Pointer to minimum ID to allocate
+ * @max: the maximum ID (inclusive)
+ * @gfp: memory allocation flags
+ *
+ * Allocates an unused ID in the range [*nextid, @max] and stores it in
+ * @nextid. Note that @max differs from the @end parameter to idr_alloc().
+ *
+ * Simultaneous modifications to the @idr are not allowed and should be
+ * prevented by the user, usually with a lock. idr_alloc_ul() may be called
+ * concurrently with read-only accesses to the @idr, such as idr_find() and
+ * idr_for_each_entry().
+ *
+ * Return: 0 on success or a negative errno on failure (ENOMEM or ENOSPC)
+ */
+int idr_alloc_ul(struct idr *idr, void *ptr, unsigned long *nextid,
+ unsigned long max, gfp_t gfp)
{
struct radix_tree_iter iter;
void __rcu **slot;
@@ -20,22 +37,54 @@ int idr_alloc_cmn(struct idr *idr, void *ptr, unsigned long *index,
if (WARN_ON_ONCE(!(idr->idr_rt.gfp_mask & ROOT_IS_IDR)))
idr->idr_rt.gfp_mask |= IDR_RT_MARKER;
- radix_tree_iter_init(&iter, start);
- if (ext)
- slot = idr_get_free_ext(&idr->idr_rt, &iter, gfp, end);
- else
- slot = idr_get_free(&idr->idr_rt, &iter, gfp, end);
+ radix_tree_iter_init(&iter, *nextid);
+ slot = idr_get_free(&idr->idr_rt, &iter, gfp, max);
if (IS_ERR(slot))
return PTR_ERR(slot);
radix_tree_iter_replace(&idr->idr_rt, &iter, slot, ptr);
radix_tree_iter_tag_clear(&idr->idr_rt, &iter, IDR_FREE);
- if (index)
- *index = iter.index;
+ *nextid = iter.index;
return 0;
}
-EXPORT_SYMBOL_GPL(idr_alloc_cmn);
+EXPORT_SYMBOL_GPL(idr_alloc_ul);
+
+/**
+ * idr_alloc - allocate an id
+ * @idr: idr handle
+ * @ptr: pointer to be associated with the new id
+ * @start: the minimum id (inclusive)
+ * @end: the maximum id (exclusive)
+ * @gfp: memory allocation flags
+ *
+ * Allocates an unused ID in the range [start, end). Returns -ENOSPC
+ * if there are no unused IDs in that range.
+ *
+ * Note that @end is treated as max when <= 0. This is to always allow
+ * using @start + N as @end as long as N is inside integer range.
+ *
+ * Simultaneous modifications to the @idr are not allowed and should be
+ * prevented by the user, usually with a lock. idr_alloc() may be called
+ * concurrently with read-only accesses to the @idr, such as idr_find() and
+ * idr_for_each_entry().
+ */
+int idr_alloc(struct idr *idr, void *ptr, int start, int end, gfp_t gfp)
+{
+ unsigned long id = start;
+ int ret;
+
+ if (WARN_ON_ONCE(start < 0))
+ return -EINVAL;
+
+ ret = idr_alloc_ul(idr, ptr, &id, end > 0 ? end - 1 : INT_MAX, gfp);
+
+ if (ret)
+ return ret;
+
+ return id;
+}
+EXPORT_SYMBOL_GPL(idr_alloc);
/**
* idr_alloc_cyclic() - Allocate new idr entry in a cyclical fashion.
@@ -126,7 +175,17 @@ void *idr_get_next(struct idr *idr, int *nextid)
}
EXPORT_SYMBOL(idr_get_next);
-void *idr_get_next_ext(struct idr *idr, unsigned long *nextid)
+/**
+ * idr_get_next_ul - Find next populated entry
+ * @idr: idr handle
+ * @nextid: Pointer to lowest possible ID to return
+ *
+ * Returns the next populated entry in the tree with an ID greater than
+ * or equal to the value pointed to by @nextid. On exit, @nextid is updated
+ * to the ID of the found value. To use in a loop, the value pointed to by
+ * nextid must be incremented by the user.
+ */
+void *idr_get_next_ul(struct idr *idr, unsigned long *nextid)
{
struct radix_tree_iter iter;
void __rcu **slot;
@@ -138,7 +197,7 @@ void *idr_get_next_ext(struct idr *idr, unsigned long *nextid)
*nextid = iter.index;
return rcu_dereference_raw(*slot);
}
-EXPORT_SYMBOL(idr_get_next_ext);
+EXPORT_SYMBOL(idr_get_next_ul);
/**
* idr_replace - replace pointer for given id
@@ -154,16 +213,7 @@ EXPORT_SYMBOL(idr_get_next_ext);
* Returns: the old value on success. %-ENOENT indicates that @id was not
* found. %-EINVAL indicates that @id or @ptr were not valid.
*/
-void *idr_replace(struct idr *idr, void *ptr, int id)
-{
- if (id < 0)
- return ERR_PTR(-EINVAL);
-
- return idr_replace_ext(idr, ptr, id);
-}
-EXPORT_SYMBOL(idr_replace);
-
-void *idr_replace_ext(struct idr *idr, void *ptr, unsigned long id)
+void *idr_replace(struct idr *idr, void *ptr, unsigned long id)
{
struct radix_tree_node *node;
void __rcu **slot = NULL;
@@ -180,7 +230,7 @@ void *idr_replace_ext(struct idr *idr, void *ptr, unsigned long id)
return entry;
}
-EXPORT_SYMBOL(idr_replace_ext);
+EXPORT_SYMBOL(idr_replace);
/**
* DOC: IDA description
@@ -240,7 +290,7 @@ EXPORT_SYMBOL(idr_replace_ext);
* bitmap, which is excessive.
*/
-#define IDA_MAX (0x80000000U / IDA_BITMAP_BITS)
+#define IDA_MAX (0x80000000U / IDA_BITMAP_BITS - 1)
/**
* ida_get_new_above - allocate new ID above or equal to a start id
diff --git a/lib/radix-tree.c b/lib/radix-tree.c
index f00303e0b216..6d29ca4c8db0 100644
--- a/lib/radix-tree.c
+++ b/lib/radix-tree.c
@@ -2135,7 +2135,7 @@ int ida_pre_get(struct ida *ida, gfp_t gfp)
}
EXPORT_SYMBOL(ida_pre_get);
-void __rcu **idr_get_free_cmn(struct radix_tree_root *root,
+void __rcu **idr_get_free(struct radix_tree_root *root,
struct radix_tree_iter *iter, gfp_t gfp,
unsigned long max)
{
diff --git a/net/sched/act_api.c b/net/sched/act_api.c
index 4d33a50a8a6d..a6aa606b5e99 100644
--- a/net/sched/act_api.c
+++ b/net/sched/act_api.c
@@ -78,7 +78,7 @@ static void free_tcf(struct tc_action *p)
static void tcf_idr_remove(struct tcf_idrinfo *idrinfo, struct tc_action *p)
{
spin_lock_bh(&idrinfo->lock);
- idr_remove_ext(&idrinfo->action_idr, p->tcfa_index);
+ idr_remove(&idrinfo->action_idr, p->tcfa_index);
spin_unlock_bh(&idrinfo->lock);
gen_kill_estimator(&p->tcfa_rate_est);
free_tcf(p);
@@ -124,7 +124,7 @@ static int tcf_dump_walker(struct tcf_idrinfo *idrinfo, struct sk_buff *skb,
s_i = cb->args[0];
- idr_for_each_entry_ext(idr, p, id) {
+ idr_for_each_entry_ul(idr, p, id) {
index++;
if (index < s_i)
continue;
@@ -181,7 +181,7 @@ static int tcf_del_walker(struct tcf_idrinfo *idrinfo, struct sk_buff *skb,
if (nla_put_string(skb, TCA_KIND, ops->kind))
goto nla_put_failure;
- idr_for_each_entry_ext(idr, p, id) {
+ idr_for_each_entry_ul(idr, p, id) {
ret = __tcf_idr_release(p, false, true);
if (ret == ACT_P_DELETED) {
module_put(ops->owner);
@@ -219,10 +219,10 @@ EXPORT_SYMBOL(tcf_generic_walker);
static struct tc_action *tcf_idr_lookup(u32 index, struct tcf_idrinfo *idrinfo)
{
- struct tc_action *p = NULL;
+ struct tc_action *p;
spin_lock_bh(&idrinfo->lock);
- p = idr_find_ext(&idrinfo->action_idr, index);
+ p = idr_find(&idrinfo->action_idr, index);
spin_unlock_bh(&idrinfo->lock);
return p;
@@ -274,7 +274,6 @@ int tcf_idr_create(struct tc_action_net *tn, u32 index, struct nlattr *est,
struct tcf_idrinfo *idrinfo = tn->idrinfo;
struct idr *idr = &idrinfo->action_idr;
int err = -ENOMEM;
- unsigned long idr_index;
if (unlikely(!p))
return -ENOMEM;
@@ -284,45 +283,34 @@ int tcf_idr_create(struct tc_action_net *tn, u32 index, struct nlattr *est,
if (cpustats) {
p->cpu_bstats = netdev_alloc_pcpu_stats(struct gnet_stats_basic_cpu);
- if (!p->cpu_bstats) {
-err1:
- kfree(p);
- return err;
- }
- p->cpu_qstats = alloc_percpu(struct gnet_stats_queue);
- if (!p->cpu_qstats) {
-err2:
- free_percpu(p->cpu_bstats);
+ if (!p->cpu_bstats)
goto err1;
- }
+ p->cpu_qstats = alloc_percpu(struct gnet_stats_queue);
+ if (!p->cpu_qstats)
+ goto err2;
}
spin_lock_init(&p->tcfa_lock);
/* user doesn't specify an index */
if (!index) {
+ index = 1;
idr_preload(GFP_KERNEL);
spin_lock_bh(&idrinfo->lock);
- err = idr_alloc_ext(idr, NULL, &idr_index, 1, 0,
- GFP_ATOMIC);
+ err = idr_alloc_u32(idr, NULL, &index, UINT_MAX, GFP_ATOMIC);
spin_unlock_bh(&idrinfo->lock);
idr_preload_end();
- if (err) {
-err3:
- free_percpu(p->cpu_qstats);
- goto err2;
- }
- p->tcfa_index = idr_index;
+ if (err)
+ goto err3;
} else {
idr_preload(GFP_KERNEL);
spin_lock_bh(&idrinfo->lock);
- err = idr_alloc_ext(idr, NULL, NULL, index, index + 1,
- GFP_ATOMIC);
+ err = idr_alloc_u32(idr, NULL, &index, index, GFP_ATOMIC);
spin_unlock_bh(&idrinfo->lock);
idr_preload_end();
if (err)
goto err3;
- p->tcfa_index = index;
}
+ p->tcfa_index = index;
p->tcfa_tm.install = jiffies;
p->tcfa_tm.lastuse = jiffies;
p->tcfa_tm.firstuse = 0;
@@ -330,9 +318,8 @@ int tcf_idr_create(struct tc_action_net *tn, u32 index, struct nlattr *est,
err = gen_new_estimator(&p->tcfa_bstats, p->cpu_bstats,
&p->tcfa_rate_est,
&p->tcfa_lock, NULL, est);
- if (err) {
- goto err3;
- }
+ if (err)
+ goto err4;
}
p->idrinfo = idrinfo;
@@ -340,6 +327,15 @@ int tcf_idr_create(struct tc_action_net *tn, u32 index, struct nlattr *est,
INIT_LIST_HEAD(&p->list);
*a = p;
return 0;
+err4:
+ idr_remove(idr, index);
+err3:
+ free_percpu(p->cpu_qstats);
+err2:
+ free_percpu(p->cpu_bstats);
+err1:
+ kfree(p);
+ return err;
}
EXPORT_SYMBOL(tcf_idr_create);
@@ -348,7 +344,7 @@ void tcf_idr_insert(struct tc_action_net *tn, struct tc_action *a)
struct tcf_idrinfo *idrinfo = tn->idrinfo;
spin_lock_bh(&idrinfo->lock);
- idr_replace_ext(&idrinfo->action_idr, a, a->tcfa_index);
+ idr_replace(&idrinfo->action_idr, a, a->tcfa_index);
spin_unlock_bh(&idrinfo->lock);
}
EXPORT_SYMBOL(tcf_idr_insert);
@@ -361,7 +357,7 @@ void tcf_idrinfo_destroy(const struct tc_action_ops *ops,
int ret;
unsigned long id = 1;
- idr_for_each_entry_ext(idr, p, id) {
+ idr_for_each_entry_ul(idr, p, id) {
ret = __tcf_idr_release(p, false, true);
if (ret == ACT_P_DELETED)
module_put(ops->owner);
diff --git a/net/sched/cls_basic.c b/net/sched/cls_basic.c
index 5f169ded347e..c2ff835639ed 100644
--- a/net/sched/cls_basic.c
+++ b/net/sched/cls_basic.c
@@ -120,7 +120,7 @@ static void basic_destroy(struct tcf_proto *tp)
list_for_each_entry_safe(f, n, &head->flist, link) {
list_del_rcu(&f->link);
tcf_unbind_filter(tp, &f->res);
- idr_remove_ext(&head->handle_idr, f->handle);
+ idr_remove(&head->handle_idr, f->handle);
if (tcf_exts_get_net(&f->exts))
call_rcu(&f->rcu, basic_delete_filter);
else
@@ -137,7 +137,7 @@ static int basic_delete(struct tcf_proto *tp, void *arg, bool *last)
list_del_rcu(&f->link);
tcf_unbind_filter(tp, &f->res);
- idr_remove_ext(&head->handle_idr, f->handle);
+ idr_remove(&head->handle_idr, f->handle);
tcf_exts_get_net(&f->exts);
call_rcu(&f->rcu, basic_delete_filter);
*last = list_empty(&head->flist);
@@ -182,7 +182,6 @@ static int basic_change(struct net *net, struct sk_buff *in_skb,
struct nlattr *tb[TCA_BASIC_MAX + 1];
struct basic_filter *fold = (struct basic_filter *) *arg;
struct basic_filter *fnew;
- unsigned long idr_index;
if (tca[TCA_OPTIONS] == NULL)
return -EINVAL;
@@ -208,30 +207,29 @@ static int basic_change(struct net *net, struct sk_buff *in_skb,
if (handle) {
fnew->handle = handle;
if (!fold) {
- err = idr_alloc_ext(&head->handle_idr, fnew, &idr_index,
- handle, handle + 1, GFP_KERNEL);
+ err = idr_alloc_u32(&head->handle_idr, fnew, &handle,
+ handle, GFP_KERNEL);
if (err)
goto errout;
}
} else {
- err = idr_alloc_ext(&head->handle_idr, fnew, &idr_index,
- 1, 0x7FFFFFFF, GFP_KERNEL);
- if (err)
+ err = idr_alloc(&head->handle_idr, fnew, 1, 0, GFP_KERNEL);
+ if (err < 0)
goto errout;
- fnew->handle = idr_index;
+ fnew->handle = err;
}
err = basic_set_parms(net, tp, fnew, base, tb, tca[TCA_RATE], ovr);
if (err < 0) {
if (!fold)
- idr_remove_ext(&head->handle_idr, fnew->handle);
+ idr_remove(&head->handle_idr, fnew->handle);
goto errout;
}
*arg = fnew;
if (fold) {
- idr_replace_ext(&head->handle_idr, fnew, fnew->handle);
+ idr_replace(&head->handle_idr, fnew, fnew->handle);
list_replace_rcu(&fold->link, &fnew->link);
tcf_unbind_filter(tp, &fold->res);
tcf_exts_get_net(&fold->exts);
diff --git a/net/sched/cls_bpf.c b/net/sched/cls_bpf.c
index fb680dafac5a..b575d41543df 100644
--- a/net/sched/cls_bpf.c
+++ b/net/sched/cls_bpf.c
@@ -294,7 +294,7 @@ static void __cls_bpf_delete(struct tcf_proto *tp, struct cls_bpf_prog *prog)
{
struct cls_bpf_head *head = rtnl_dereference(tp->root);
- idr_remove_ext(&head->handle_idr, prog->handle);
+ idr_remove(&head->handle_idr, prog->handle);
cls_bpf_stop_offload(tp, prog);
list_del_rcu(&prog->link);
tcf_unbind_filter(tp, &prog->res);
@@ -469,7 +469,6 @@ static int cls_bpf_change(struct net *net, struct sk_buff *in_skb,
struct cls_bpf_prog *oldprog = *arg;
struct nlattr *tb[TCA_BPF_MAX + 1];
struct cls_bpf_prog *prog;
- unsigned long idr_index;
int ret;
if (tca[TCA_OPTIONS] == NULL)
@@ -496,15 +495,14 @@ static int cls_bpf_change(struct net *net, struct sk_buff *in_skb,
}
if (handle == 0) {
- ret = idr_alloc_ext(&head->handle_idr, prog, &idr_index,
- 1, 0x7FFFFFFF, GFP_KERNEL);
- if (ret)
+ ret = idr_alloc(&head->handle_idr, prog, 1, 0, GFP_KERNEL);
+ if (ret < 0)
goto errout;
- prog->handle = idr_index;
+ prog->handle = ret;
} else {
if (!oldprog) {
- ret = idr_alloc_ext(&head->handle_idr, prog, &idr_index,
- handle, handle + 1, GFP_KERNEL);
+ ret = idr_alloc_u32(&head->handle_idr, prog, &handle,
+ handle, GFP_KERNEL);
if (ret)
goto errout;
}
@@ -518,7 +516,7 @@ static int cls_bpf_change(struct net *net, struct sk_buff *in_skb,
ret = cls_bpf_offload(tp, prog, oldprog);
if (ret) {
if (!oldprog)
- idr_remove_ext(&head->handle_idr, prog->handle);
+ idr_remove(&head->handle_idr, prog->handle);
__cls_bpf_delete_prog(prog);
return ret;
}
@@ -527,7 +525,7 @@ static int cls_bpf_change(struct net *net, struct sk_buff *in_skb,
prog->gen_flags |= TCA_CLS_FLAGS_NOT_IN_HW;
if (oldprog) {
- idr_replace_ext(&head->handle_idr, prog, handle);
+ idr_replace(&head->handle_idr, prog, handle);
list_replace_rcu(&oldprog->link, &prog->link);
tcf_unbind_filter(tp, &oldprog->res);
tcf_exts_get_net(&oldprog->exts);
@@ -541,7 +539,7 @@ static int cls_bpf_change(struct net *net, struct sk_buff *in_skb,
errout_idr:
if (!oldprog)
- idr_remove_ext(&head->handle_idr, prog->handle);
+ idr_remove(&head->handle_idr, prog->handle);
errout:
tcf_exts_destroy(&prog->exts);
kfree(prog);
diff --git a/net/sched/cls_flower.c b/net/sched/cls_flower.c
index 543a3e875d05..0867fa7bef5b 100644
--- a/net/sched/cls_flower.c
+++ b/net/sched/cls_flower.c
@@ -283,7 +283,7 @@ static void __fl_delete(struct tcf_proto *tp, struct cls_fl_filter *f)
{
struct cls_fl_head *head = rtnl_dereference(tp->root);
- idr_remove_ext(&head->handle_idr, f->handle);
+ idr_remove(&head->handle_idr, f->handle);
list_del_rcu(&f->list);
if (!tc_skip_hw(f->flags))
fl_hw_destroy_filter(tp, f);
@@ -329,7 +329,7 @@ static void *fl_get(struct tcf_proto *tp, u32 handle)
{
struct cls_fl_head *head = rtnl_dereference(tp->root);
- return idr_find_ext(&head->handle_idr, handle);
+ return idr_find(&head->handle_idr, handle);
}
static const struct nla_policy fl_policy[TCA_FLOWER_MAX + 1] = {
@@ -858,7 +858,6 @@ static int fl_change(struct net *net, struct sk_buff *in_skb,
struct cls_fl_filter *fnew;
struct nlattr **tb;
struct fl_flow_mask mask = {};
- unsigned long idr_index;
int err;
if (!tca[TCA_OPTIONS])
@@ -889,21 +888,15 @@ static int fl_change(struct net *net, struct sk_buff *in_skb,
goto errout;
if (!handle) {
- err = idr_alloc_ext(&head->handle_idr, fnew, &idr_index,
- 1, 0x80000000, GFP_KERNEL);
- if (err)
- goto errout;
- fnew->handle = idr_index;
- }
-
- /* user specifies a handle and it doesn't exist */
- if (handle && !fold) {
- err = idr_alloc_ext(&head->handle_idr, fnew, &idr_index,
- handle, handle + 1, GFP_KERNEL);
- if (err)
- goto errout;
- fnew->handle = idr_index;
+ err = idr_alloc(&head->handle_idr, fnew, 1, 0, GFP_KERNEL);
+ } else if (!fold) {
+ /* user specifies a handle and it doesn't exist */
+ err = idr_alloc_u32(&head->handle_idr, fnew, &handle,
+ handle, GFP_KERNEL);
}
+ if (err < 0)
+ goto errout;
+ fnew->handle = handle;
if (tb[TCA_FLOWER_FLAGS]) {
fnew->flags = nla_get_u32(tb[TCA_FLOWER_FLAGS]);
@@ -957,8 +950,7 @@ static int fl_change(struct net *net, struct sk_buff *in_skb,
*arg = fnew;
if (fold) {
- fnew->handle = handle;
- idr_replace_ext(&head->handle_idr, fnew, fnew->handle);
+ idr_replace(&head->handle_idr, fnew, fnew->handle);
list_replace_rcu(&fold->list, &fnew->list);
tcf_unbind_filter(tp, &fold->res);
tcf_exts_get_net(&fold->exts);
@@ -972,7 +964,7 @@ static int fl_change(struct net *net, struct sk_buff *in_skb,
errout_idr:
if (fnew->handle)
- idr_remove_ext(&head->handle_idr, fnew->handle);
+ idr_remove(&head->handle_idr, fnew->handle);
errout:
tcf_exts_destroy(&fnew->exts);
kfree(fnew);
diff --git a/net/sched/cls_u32.c b/net/sched/cls_u32.c
index ac152b4f4247..9260f7d14ab9 100644
--- a/net/sched/cls_u32.c
+++ b/net/sched/cls_u32.c
@@ -316,19 +316,16 @@ static void *u32_get(struct tcf_proto *tp, u32 handle)
return u32_lookup_key(ht, handle);
}
+/*
+ * This is only used inside the rtnl lock, so it is safe to increment
+ * without read _copy_ update semantics
+ */
static u32 gen_new_htid(struct tc_u_common *tp_c, struct tc_u_hnode *ptr)
{
- unsigned long idr_index;
- int err;
-
- /* This is only used inside rtnl lock it is safe to increment
- * without read _copy_ update semantics
- */
- err = idr_alloc_ext(&tp_c->handle_idr, ptr, &idr_index,
- 1, 0x7FF, GFP_KERNEL);
- if (err)
+ int id = idr_alloc(&tp_c->handle_idr, ptr, 1, 0x7FF, GFP_KERNEL);
+ if (id < 0)
return 0;
- return (u32)(idr_index | 0x800) << 20;
+ return ((u32)id | 0x800) << 20;
}
static struct hlist_head *tc_u_common_hash;
@@ -591,7 +588,7 @@ static void u32_clear_hnode(struct tcf_proto *tp, struct tc_u_hnode *ht)
rtnl_dereference(n->next));
tcf_unbind_filter(tp, &n->res);
u32_remove_hw_knode(tp, n->handle);
- idr_remove_ext(&ht->handle_idr, n->handle);
+ idr_remove(&ht->handle_idr, n->handle);
if (tcf_exts_get_net(&n->exts))
call_rcu(&n->rcu, u32_delete_key_freepf_rcu);
else
@@ -617,7 +614,7 @@ static int u32_destroy_hnode(struct tcf_proto *tp, struct tc_u_hnode *ht)
if (phn == ht) {
u32_clear_hw_hnode(tp, ht);
idr_destroy(&ht->handle_idr);
- idr_remove_ext(&tp_c->handle_idr, ht->handle);
+ idr_remove(&tp_c->handle_idr, ht->handle);
RCU_INIT_POINTER(*hn, ht->next);
kfree_rcu(ht, rcu);
return 0;
@@ -736,15 +733,15 @@ static int u32_delete(struct tcf_proto *tp, void *arg, bool *last)
static u32 gen_new_kid(struct tc_u_hnode *ht, u32 htid)
{
- unsigned long idr_index;
u32 start = htid | 0x800;
u32 max = htid | 0xFFF;
u32 min = htid;
+ unsigned long idr_index = start;
- if (idr_alloc_ext(&ht->handle_idr, NULL, &idr_index,
- start, max + 1, GFP_KERNEL)) {
- if (idr_alloc_ext(&ht->handle_idr, NULL, &idr_index,
- min + 1, max + 1, GFP_KERNEL))
+ if (idr_alloc_ul(&ht->handle_idr, NULL, &idr_index, max, GFP_KERNEL)) {
+ idr_index = min + 1;
+ if (idr_alloc_ul(&ht->handle_idr, NULL, &idr_index, max,
+ GFP_KERNEL))
return max;
}
@@ -833,7 +830,7 @@ static void u32_replace_knode(struct tcf_proto *tp, struct tc_u_common *tp_c,
if (pins->handle == n->handle)
break;
- idr_replace_ext(&ht->handle_idr, n, n->handle);
+ idr_replace(&ht->handle_idr, n, n->handle);
RCU_INIT_POINTER(n->next, pins->next);
rcu_assign_pointer(*ins, n);
}
@@ -976,8 +973,8 @@ static int u32_change(struct net *net, struct sk_buff *in_skb,
return -ENOMEM;
}
} else {
- err = idr_alloc_ext(&tp_c->handle_idr, ht, NULL,
- handle, handle + 1, GFP_KERNEL);
+ err = idr_alloc_u32(&tp_c->handle_idr, ht, &handle,
+ handle, GFP_KERNEL);
if (err) {
kfree(ht);
return err;
@@ -992,7 +989,7 @@ static int u32_change(struct net *net, struct sk_buff *in_skb,
err = u32_replace_hw_hnode(tp, ht, flags);
if (err) {
- idr_remove_ext(&tp_c->handle_idr, handle);
+ idr_remove(&tp_c->handle_idr, handle);
kfree(ht);
return err;
}
@@ -1026,8 +1023,7 @@ static int u32_change(struct net *net, struct sk_buff *in_skb,
if (TC_U32_HTID(handle) && TC_U32_HTID(handle^htid))
return -EINVAL;
handle = htid | TC_U32_NODE(handle);
- err = idr_alloc_ext(&ht->handle_idr, NULL, NULL,
- handle, handle + 1,
+ err = idr_alloc_u32(&ht->handle_idr, NULL, &handle, handle,
GFP_KERNEL);
if (err)
return err;
@@ -1120,7 +1116,7 @@ static int u32_change(struct net *net, struct sk_buff *in_skb,
#endif
kfree(n);
erridr:
- idr_remove_ext(&ht->handle_idr, handle);
+ idr_remove(&ht->handle_idr, handle);
return err;
}
--
2.15.0
From 1584808098202755239@xxx Wed Nov 22 22:46:19 +0000 2017
From: Matthew Wilcox <[email protected]>
Adding the spinlock to struct radix_tree_root results in no change in
structure size on 64-bit x86, as it fits in the padding between the gfp_t
and the void *.
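As an illustration only (not part of this patch; the tree name, index and
entry below are hypothetical), a user of a statically defined radix tree can
now take the lock embedded in the root instead of declaring a separate
spinlock:
static RADIX_TREE(example_tree, GFP_KERNEL);	/* hypothetical tree */
static void *example_erase(unsigned long index)
{
	void *entry;
	xa_lock(&example_tree);
	entry = radix_tree_delete(&example_tree, index);
	xa_unlock(&example_tree);
	return entry;
}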
Signed-off-by: Matthew Wilcox <[email protected]>
---
fs/f2fs/gc.c | 2 +-
include/linux/idr.h | 12 ++++++------
include/linux/radix-tree.h | 7 +++++--
include/linux/xarray.h | 34 ++++++++++++++++++++++++++++++++++
kernel/pid.c | 2 +-
tools/include/linux/spinlock.h | 1 +
6 files changed, 48 insertions(+), 10 deletions(-)
create mode 100644 include/linux/xarray.h
diff --git a/fs/f2fs/gc.c b/fs/f2fs/gc.c
index 5d5bba462f26..2cf52b92db5c 100644
--- a/fs/f2fs/gc.c
+++ b/fs/f2fs/gc.c
@@ -991,7 +991,7 @@ int f2fs_gc(struct f2fs_sb_info *sbi, bool sync,
unsigned int init_segno = segno;
struct gc_inode_list gc_list = {
.ilist = LIST_HEAD_INIT(gc_list.ilist),
- .iroot = RADIX_TREE_INIT(GFP_NOFS),
+ .iroot = RADIX_TREE_INIT(gc_list.iroot, GFP_NOFS),
};
trace_f2fs_gc_begin(sbi->sb, sync, background,
diff --git a/include/linux/idr.h b/include/linux/idr.h
index 87ae635fcee6..3c3346441cbc 100644
--- a/include/linux/idr.h
+++ b/include/linux/idr.h
@@ -29,11 +29,11 @@ struct idr {
/* Set the IDR flag and the IDR_FREE tag */
#define IDR_RT_MARKER ((__force gfp_t)(3 << __GFP_BITS_SHIFT))
-#define IDR_INIT \
+#define IDR_INIT(name) \
{ \
- .idr_rt = RADIX_TREE_INIT(IDR_RT_MARKER) \
+ .idr_rt = RADIX_TREE_INIT(name, IDR_RT_MARKER) \
}
-#define DEFINE_IDR(name) struct idr name = IDR_INIT
+#define DEFINE_IDR(name) struct idr name = IDR_INIT(name)
/**
* DOC: idr sync
@@ -166,10 +166,10 @@ struct ida {
struct radix_tree_root ida_rt;
};
-#define IDA_INIT { \
- .ida_rt = RADIX_TREE_INIT(IDR_RT_MARKER | GFP_NOWAIT), \
+#define IDA_INIT(name) { \
+ .ida_rt = RADIX_TREE_INIT(name, IDR_RT_MARKER | GFP_NOWAIT), \
}
-#define DEFINE_IDA(name) struct ida name = IDA_INIT
+#define DEFINE_IDA(name) struct ida name = IDA_INIT(name)
int ida_pre_get(struct ida *ida, gfp_t gfp_mask);
int ida_get_new_above(struct ida *ida, int starting_id, int *p_id);
diff --git a/include/linux/radix-tree.h b/include/linux/radix-tree.h
index fc55ff31eca7..b4d60295decd 100644
--- a/include/linux/radix-tree.h
+++ b/include/linux/radix-tree.h
@@ -109,20 +109,23 @@ struct radix_tree_node {
#define ROOT_TAG_SHIFT (__GFP_BITS_SHIFT + 1)
struct radix_tree_root {
+ spinlock_t xa_lock;
gfp_t gfp_mask;
struct radix_tree_node __rcu *rnode;
};
-#define RADIX_TREE_INIT(mask) { \
+#define RADIX_TREE_INIT(name, mask) { \
+ .xa_lock = __SPIN_LOCK_UNLOCKED(name.xa_lock), \
.gfp_mask = (mask), \
.rnode = NULL, \
}
#define RADIX_TREE(name, mask) \
- struct radix_tree_root name = RADIX_TREE_INIT(mask)
+ struct radix_tree_root name = RADIX_TREE_INIT(name, mask)
#define INIT_RADIX_TREE(root, mask) \
do { \
+ (root)->xa_lock = __SPIN_LOCK_UNLOCKED(root.xa_lock); \
(root)->gfp_mask = (mask); \
(root)->rnode = NULL; \
} while (0)
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
new file mode 100644
index 000000000000..a5a933925b85
--- /dev/null
+++ b/include/linux/xarray.h
@@ -0,0 +1,34 @@
+#ifndef _LINUX_XARRAY_H
+#define _LINUX_XARRAY_H
+/*
+ * eXtensible Arrays
+ * Copyright (c) 2017 Microsoft Corporation
+ * Author: Matthew Wilcox <[email protected]>
+ *
+ * This program is free software; you can redistribute it and/or
+ * modify it under the terms of the GNU General Public License as
+ * published by the Free Software Foundation; either version 2 of
+ * the License, or (at your option) any later version.
+ *
+ * This program is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
+ * GNU General Public License for more details.
+ */
+
+#include <linux/spinlock.h>
+
+#define xa_trylock(xa) spin_trylock(&(xa)->xa_lock)
+#define xa_lock(xa) spin_lock(&(xa)->xa_lock)
+#define xa_unlock(xa) spin_unlock(&(xa)->xa_lock)
+#define xa_lock_bh(xa) spin_lock_bh(&(xa)->xa_lock)
+#define xa_unlock_bh(xa) spin_unlock_bh(&(xa)->xa_lock)
+#define xa_lock_irq(xa) spin_lock_irq(&(xa)->xa_lock)
+#define xa_unlock_irq(xa) spin_unlock_irq(&(xa)->xa_lock)
+#define xa_lock_irqsave(xa, flags) \
+ spin_lock_irqsave(&(xa)->xa_lock, flags)
+#define xa_unlock_irqrestore(xa, flags) \
+ spin_unlock_irqrestore(&(xa)->xa_lock, flags)
+#define xa_lock_held(xa) lockdep_is_held(&(xa)->xa_lock)
+
+#endif /* _LINUX_XARRAY_H */
diff --git a/kernel/pid.c b/kernel/pid.c
index aae5f3307c35..1df901f32b54 100644
--- a/kernel/pid.c
+++ b/kernel/pid.c
@@ -58,7 +58,7 @@ int pid_max_max = PID_MAX_LIMIT;
*/
struct pid_namespace init_pid_ns = {
.kref = KREF_INIT(2),
- .idr = IDR_INIT,
+ .idr = IDR_INIT(init_pid_ns.idr),
.pid_allocated = PIDNS_ADDING,
.level = 0,
.child_reaper = &init_task,
diff --git a/tools/include/linux/spinlock.h b/tools/include/linux/spinlock.h
index 4ed569fcb139..b21b586b9854 100644
--- a/tools/include/linux/spinlock.h
+++ b/tools/include/linux/spinlock.h
@@ -7,6 +7,7 @@
#define spinlock_t pthread_mutex_t
#define DEFINE_SPINLOCK(x) pthread_mutex_t x = PTHREAD_MUTEX_INITIALIZER;
+#define __SPIN_LOCK_UNLOCKED(x) (pthread_mutex_t)PTHREAD_MUTEX_INITIALIZER
#define spin_lock_irqsave(x, f) (void)f, pthread_mutex_lock(x)
#define spin_unlock_irqrestore(x, f) (void)f, pthread_mutex_unlock(x)
--
2.15.0
From 1584938562978172371@xxx Fri Nov 24 09:20:00 +0000 2017
From: Matthew Wilcox <[email protected]>
For initialising xarrays at run time (in code) rather than statically (in data).
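A minimal sketch (the containing structure and its names are hypothetical):
embed the XArray in a dynamically allocated object and initialise it before
first use.
struct foo {
	struct xarray entries;
};
static struct foo *foo_alloc(void)
{
	struct foo *foo = kzalloc(sizeof(*foo), GFP_KERNEL);
	if (foo)
		xa_init(&foo->entries);	/* run-time counterpart of DEFINE_XARRAY() */
	return foo;
}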
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/xarray.h | 13 +++++++++++++
lib/xarray.c | 15 +++++++++++++++
2 files changed, 28 insertions(+)
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index 08ddad60a43d..19a3974fdc4f 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -81,6 +81,19 @@ struct xarray {
#define DEFINE_XARRAY(name) struct xarray name = XARRAY_INIT(name)
+void __xa_init(struct xarray *, gfp_t flags);
+
+/**
+ * xa_init() - Initialise an empty XArray.
+ * @xa: XArray.
+ *
+ * An empty XArray is full of NULL entries.
+ */
+static inline void xa_init(struct xarray *xa)
+{
+ __xa_init(xa, 0);
+}
+
void *xa_load(struct xarray *, unsigned long index);
void *xa_store(struct xarray *, unsigned long index, void *entry, gfp_t);
void *xa_cmpxchg(struct xarray *, unsigned long index,
diff --git a/lib/xarray.c b/lib/xarray.c
index 5409048e8b44..59f45c07988f 100644
--- a/lib/xarray.c
+++ b/lib/xarray.c
@@ -868,6 +868,21 @@ void *xas_find(struct xarray *xa, struct xa_state *xas, unsigned long max)
}
EXPORT_SYMBOL_GPL(xas_find);
+/**
+ * __xa_init() - Initialise an empty XArray
+ * @xa: XArray.
+ * @flags: XA_FLAG_ values
+ *
+ * An empty XArray is full of NULL pointers.
+ */
+void __xa_init(struct xarray *xa, gfp_t flags)
+{
+ spin_lock_init(&xa->xa_lock);
+ xa->xa_flags = flags;
+ xa->xa_head = NULL;
+}
+EXPORT_SYMBOL(__xa_init);
+
/**
* xa_load() - Load an entry from an XArray.
* @xa: XArray.
--
2.15.0
From 1584803716736328415@xxx Wed Nov 22 21:36:41 +0000 2017
From: Matthew Wilcox <[email protected]>
The struct idr was 24 bytes (on 64-bit architectures) due to the idr_next
element, which was only used by the very few idr_alloc_cyclic() users.
Save 8 bytes per struct idr by making idr_alloc_cyclic() take a pointer
to a cursor instead of using idr_next. This also lets us remove the
idr_get_cursor / idr_set_cursor API as users can now manage access to
their own cursor however they like.
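With the new calling convention each user keeps the cursor next to its IDR;
here is a sketch of the common preload-plus-spinlock pattern (the idr, lock,
cursor and struct conn names are hypothetical, not taken from any converted
caller):
static DEFINE_IDR(conn_idr);
static DEFINE_SPINLOCK(conn_lock);
static int conn_cursor;		/* replaces the old idr->idr_next */
static int conn_alloc_id(struct conn *conn, gfp_t gfp)
{
	int id;
	idr_preload(gfp);
	spin_lock(&conn_lock);
	id = idr_alloc_cyclic(&conn_idr, &conn_cursor, conn, 1, 0, GFP_NOWAIT);
	spin_unlock(&conn_lock);
	idr_preload_end();
	return id;
}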
Signed-off-by: Matthew Wilcox <[email protected]>
---
arch/powerpc/platforms/cell/spufs/sched.c | 2 +-
drivers/gpu/drm/drm_dp_aux_dev.c | 5 +++--
drivers/infiniband/core/cm.c | 5 ++++-
drivers/infiniband/hw/mlx4/cm.c | 3 ++-
drivers/infiniband/hw/mlx4/mlx4_ib.h | 1 +
drivers/rapidio/rio_cm.c | 3 ++-
drivers/rpmsg/qcom_glink_native.c | 7 +++++--
drivers/target/target_core_device.c | 4 +++-
fs/kernfs/dir.c | 5 +++--
fs/nfsd/nfs4state.c | 4 +++-
fs/nfsd/state.h | 1 +
fs/notify/inotify/inotify_user.c | 15 +++++++--------
fs/proc/loadavg.c | 2 +-
include/linux/fsnotify_backend.h | 1 +
include/linux/idr.h | 31 ++-----------------------------
include/linux/kernfs.h | 1 +
include/linux/pid_namespace.h | 1 +
include/net/sctp/sctp.h | 1 +
kernel/bpf/syscall.c | 8 ++++++--
kernel/cgroup/cgroup.c | 4 +++-
kernel/pid.c | 4 ++--
kernel/pid_namespace.c | 12 +++++-------
lib/idr.c | 23 ++++++++++++++---------
net/rxrpc/af_rxrpc.c | 2 +-
net/rxrpc/ar-internal.h | 1 +
net/rxrpc/conn_client.c | 20 ++++++++------------
net/sctp/associola.c | 3 ++-
net/sctp/protocol.c | 1 +
tools/testing/radix-tree/idr-test.c | 18 +++++++++++++++---
29 files changed, 100 insertions(+), 88 deletions(-)
diff --git a/arch/powerpc/platforms/cell/spufs/sched.c b/arch/powerpc/platforms/cell/spufs/sched.c
index e47761cdcb98..d2c3078472ab 100644
--- a/arch/powerpc/platforms/cell/spufs/sched.c
+++ b/arch/powerpc/platforms/cell/spufs/sched.c
@@ -1093,7 +1093,7 @@ static int show_spu_loadavg(struct seq_file *s, void *private)
LOAD_INT(c), LOAD_FRAC(c),
count_active_contexts(),
atomic_read(&nr_spu_contexts),
- idr_get_cursor(&task_active_pid_ns(current)->idr));
+ task_active_pid_ns(current)->next_pid - 1);
return 0;
}
diff --git a/drivers/gpu/drm/drm_dp_aux_dev.c b/drivers/gpu/drm/drm_dp_aux_dev.c
index 053044201e31..3ce890908ede 100644
--- a/drivers/gpu/drm/drm_dp_aux_dev.c
+++ b/drivers/gpu/drm/drm_dp_aux_dev.c
@@ -50,6 +50,7 @@ struct drm_dp_aux_dev {
#define DRM_AUX_MINORS 256
#define AUX_MAX_OFFSET (1 << 20)
static DEFINE_IDR(aux_idr);
+static int aux_cursor;
static DEFINE_MUTEX(aux_idr_mutex);
static struct class *drm_dp_aux_dev_class;
static int drm_dev_major = -1;
@@ -80,8 +81,8 @@ static struct drm_dp_aux_dev *alloc_drm_dp_aux_dev(struct drm_dp_aux *aux)
kref_init(&aux_dev->refcount);
mutex_lock(&aux_idr_mutex);
- index = idr_alloc_cyclic(&aux_idr, aux_dev, 0, DRM_AUX_MINORS,
- GFP_KERNEL);
+ index = idr_alloc_cyclic(&aux_idr, &aux_cursor, aux_dev, 0,
+ DRM_AUX_MINORS, GFP_KERNEL);
mutex_unlock(&aux_idr_mutex);
if (index < 0) {
kfree(aux_dev);
diff --git a/drivers/infiniband/core/cm.c b/drivers/infiniband/core/cm.c
index f6b159d79977..42d7b37382c4 100644
--- a/drivers/infiniband/core/cm.c
+++ b/drivers/infiniband/core/cm.c
@@ -125,6 +125,7 @@ static struct ib_cm {
struct rb_root remote_id_table;
struct rb_root remote_sidr_table;
struct idr local_id_table;
+ int local_id_cursor;
__be32 random_id_operand;
struct list_head timewait_list;
struct workqueue_struct *wq;
@@ -519,7 +520,8 @@ static int cm_alloc_id(struct cm_id_private *cm_id_priv)
idr_preload(GFP_KERNEL);
spin_lock_irqsave(&cm.lock, flags);
- id = idr_alloc_cyclic(&cm.local_id_table, cm_id_priv, 0, 0, GFP_NOWAIT);
+ id = idr_alloc_cyclic(&cm.local_id_table, &cm.local_id_cursor,
+ cm_id_priv, 0, 0, GFP_NOWAIT);
spin_unlock_irqrestore(&cm.lock, flags);
idr_preload_end();
@@ -4322,6 +4324,7 @@ static int __init ib_cm_init(void)
cm.remote_qp_table = RB_ROOT;
cm.remote_sidr_table = RB_ROOT;
idr_init(&cm.local_id_table);
+ cm.local_id_cursor = 0;
get_random_bytes(&cm.random_id_operand, sizeof cm.random_id_operand);
INIT_LIST_HEAD(&cm.timewait_list);
diff --git a/drivers/infiniband/hw/mlx4/cm.c b/drivers/infiniband/hw/mlx4/cm.c
index fedaf8260105..288a796b9f14 100644
--- a/drivers/infiniband/hw/mlx4/cm.c
+++ b/drivers/infiniband/hw/mlx4/cm.c
@@ -259,7 +259,8 @@ id_map_alloc(struct ib_device *ibdev, int slave_id, u32 sl_cm_id)
idr_preload(GFP_KERNEL);
spin_lock(&to_mdev(ibdev)->sriov.id_map_lock);
- ret = idr_alloc_cyclic(&sriov->pv_id_table, ent, 0, 0, GFP_NOWAIT);
+ ret = idr_alloc_cyclic(&sriov->pv_id_table, &sriov->pv_id_cursor,
+ ent, 0, 0, GFP_NOWAIT);
if (ret >= 0) {
ent->pv_cm_id = (u32)ret;
sl_id_map_add(ibdev, ent);
diff --git a/drivers/infiniband/hw/mlx4/mlx4_ib.h b/drivers/infiniband/hw/mlx4/mlx4_ib.h
index e14919c15b06..dfa8727254b7 100644
--- a/drivers/infiniband/hw/mlx4/mlx4_ib.h
+++ b/drivers/infiniband/hw/mlx4/mlx4_ib.h
@@ -501,6 +501,7 @@ struct mlx4_ib_sriov {
spinlock_t id_map_lock;
struct rb_root sl_id_map;
struct idr pv_id_table;
+ int pv_id_cursor;
};
struct gid_cache_context {
diff --git a/drivers/rapidio/rio_cm.c b/drivers/rapidio/rio_cm.c
index bad0e0ea4f30..56ff8e1a0b09 100644
--- a/drivers/rapidio/rio_cm.c
+++ b/drivers/rapidio/rio_cm.c
@@ -238,6 +238,7 @@ static int riocm_ch_close(struct rio_channel *ch);
static DEFINE_SPINLOCK(idr_lock);
static DEFINE_IDR(ch_idr);
+static int ch_cursor;
static LIST_HEAD(cm_dev_list);
static DECLARE_RWSEM(rdev_sem);
@@ -1307,7 +1308,7 @@ static struct rio_channel *riocm_ch_alloc(u16 ch_num)
idr_preload(GFP_KERNEL);
spin_lock_bh(&idr_lock);
- id = idr_alloc_cyclic(&ch_idr, ch, start, end, GFP_NOWAIT);
+ id = idr_alloc_cyclic(&ch_idr, &ch_cursor, ch, start, end, GFP_NOWAIT);
spin_unlock_bh(&idr_lock);
idr_preload_end();
diff --git a/drivers/rpmsg/qcom_glink_native.c b/drivers/rpmsg/qcom_glink_native.c
index 40d76d2a5eff..bd8709b06c8a 100644
--- a/drivers/rpmsg/qcom_glink_native.c
+++ b/drivers/rpmsg/qcom_glink_native.c
@@ -116,6 +116,7 @@ struct qcom_glink {
struct mutex tx_lock;
spinlock_t idr_lock;
+ int lccursor;
struct idr lcids;
struct idr rcids;
unsigned long features;
@@ -169,6 +170,7 @@ struct glink_channel {
unsigned int rcid;
spinlock_t intent_lock;
+ int licursor;
struct idr liids;
struct idr riids;
struct work_struct intent_work;
@@ -394,7 +396,7 @@ static int qcom_glink_send_open_req(struct qcom_glink *glink,
kref_get(&channel->refcount);
spin_lock_irqsave(&glink->idr_lock, flags);
- ret = idr_alloc_cyclic(&glink->lcids, channel,
+ ret = idr_alloc_cyclic(&glink->lcids, &glink->lccursor, channel,
RPM_GLINK_CID_MIN, RPM_GLINK_CID_MAX,
GFP_ATOMIC);
spin_unlock_irqrestore(&glink->idr_lock, flags);
@@ -644,7 +646,8 @@ qcom_glink_alloc_intent(struct qcom_glink *glink,
goto free_intent;
spin_lock_irqsave(&channel->intent_lock, flags);
- ret = idr_alloc_cyclic(&channel->liids, intent, 1, -1, GFP_ATOMIC);
+ ret = idr_alloc_cyclic(&channel->liids, &channel->licursor, intent,
+ 1, -1, GFP_ATOMIC);
if (ret < 0) {
spin_unlock_irqrestore(&channel->intent_lock, flags);
goto free_data;
diff --git a/drivers/target/target_core_device.c b/drivers/target/target_core_device.c
index e8dd6da164b2..f25dcad8d4d8 100644
--- a/drivers/target/target_core_device.c
+++ b/drivers/target/target_core_device.c
@@ -52,6 +52,7 @@
static DEFINE_MUTEX(device_mutex);
static LIST_HEAD(device_list);
static DEFINE_IDR(devices_idr);
+static int devices_cursor;
static struct se_hba *lun0_hba;
/* not static, needed by tpg.c */
@@ -968,7 +969,8 @@ int target_configure_device(struct se_device *dev)
* Use cyclic to try and avoid collisions with devices
* that were recently removed.
*/
- id = idr_alloc_cyclic(&devices_idr, dev, 0, INT_MAX, GFP_KERNEL);
+ id = idr_alloc_cyclic(&devices_idr, &devices_cursor, dev, 0, INT_MAX,
+ GFP_KERNEL);
mutex_unlock(&device_mutex);
if (id < 0) {
ret = -ENOMEM;
diff --git a/fs/kernfs/dir.c b/fs/kernfs/dir.c
index 89d1dc19340b..843f93de4b88 100644
--- a/fs/kernfs/dir.c
+++ b/fs/kernfs/dir.c
@@ -636,8 +636,9 @@ static struct kernfs_node *__kernfs_new_node(struct kernfs_root *root,
idr_preload(GFP_KERNEL);
spin_lock(&kernfs_idr_lock);
- cursor = idr_get_cursor(&root->ino_idr);
- ret = idr_alloc_cyclic(&root->ino_idr, kn, 1, 0, GFP_ATOMIC);
+ cursor = root->ino_cursor;
+ ret = idr_alloc_cyclic(&root->ino_idr, &root->ino_cursor, kn,
+ 1, 0, GFP_ATOMIC);
if (ret >= 0 && ret < cursor)
root->next_generation++;
gen = root->next_generation;
diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c
index b82817767b9d..2b243f086838 100644
--- a/fs/nfsd/nfs4state.c
+++ b/fs/nfsd/nfs4state.c
@@ -645,7 +645,8 @@ struct nfs4_stid *nfs4_alloc_stid(struct nfs4_client *cl, struct kmem_cache *sla
idr_preload(GFP_KERNEL);
spin_lock(&cl->cl_lock);
- new_id = idr_alloc_cyclic(&cl->cl_stateids, stid, 0, 0, GFP_NOWAIT);
+ new_id = idr_alloc_cyclic(&cl->cl_stateids, &cl->cl_cursor, stid,
+ 0, 0, GFP_NOWAIT);
spin_unlock(&cl->cl_lock);
idr_preload_end();
if (new_id < 0)
@@ -1771,6 +1772,7 @@ static struct nfs4_client *alloc_client(struct xdr_netobj name)
clp->cl_name.len = name.len;
INIT_LIST_HEAD(&clp->cl_sessions);
idr_init(&clp->cl_stateids);
+ clp->cl_cursor = 0;
atomic_set(&clp->cl_refcount, 0);
clp->cl_cb_state = NFSD4_CB_UNKNOWN;
INIT_LIST_HEAD(&clp->cl_idhash);
diff --git a/fs/nfsd/state.h b/fs/nfsd/state.h
index f3772ea8ba0d..5525df3ef826 100644
--- a/fs/nfsd/state.h
+++ b/fs/nfsd/state.h
@@ -300,6 +300,7 @@ struct nfs4_client {
struct list_head *cl_ownerstr_hashtbl;
struct list_head cl_openowners;
struct idr cl_stateids; /* stateid lookup */
+ int cl_cursor;
struct list_head cl_delegations;
struct list_head cl_revoked; /* unacknowledged, revoked 4.1 state */
struct list_head cl_lru; /* tail queue */
diff --git a/fs/notify/inotify/inotify_user.c b/fs/notify/inotify/inotify_user.c
index d3c20e0bb046..cb84886230c2 100644
--- a/fs/notify/inotify/inotify_user.c
+++ b/fs/notify/inotify/inotify_user.c
@@ -341,22 +341,23 @@ static int inotify_find_inode(const char __user *dirname, struct path *path, uns
return error;
}
-static int inotify_add_to_idr(struct idr *idr, spinlock_t *idr_lock,
- struct inotify_inode_mark *i_mark)
+static int inotify_add_to_idr(struct inotify_group_private_data *group,
+ struct inotify_inode_mark *i_mark)
{
int ret;
idr_preload(GFP_KERNEL);
- spin_lock(idr_lock);
+ spin_lock(&group->idr_lock);
- ret = idr_alloc_cyclic(idr, i_mark, 1, 0, GFP_NOWAIT);
+ ret = idr_alloc_cyclic(&group->idr, &group->idr_cursor, i_mark, 1, 0,
+ GFP_NOWAIT);
if (ret >= 0) {
/* we added the mark to the idr, take a reference */
i_mark->wd = ret;
fsnotify_get_mark(&i_mark->fsn_mark);
}
- spin_unlock(idr_lock);
+ spin_unlock(&group->idr_lock);
idr_preload_end();
return ret < 0 ? ret : 0;
}
@@ -539,8 +540,6 @@ static int inotify_new_watch(struct fsnotify_group *group,
struct inotify_inode_mark *tmp_i_mark;
__u32 mask;
int ret;
- struct idr *idr = &group->inotify_data.idr;
- spinlock_t *idr_lock = &group->inotify_data.idr_lock;
mask = inotify_arg_to_mask(arg);
@@ -552,7 +551,7 @@ static int inotify_new_watch(struct fsnotify_group *group,
tmp_i_mark->fsn_mark.mask = mask;
tmp_i_mark->wd = -1;
- ret = inotify_add_to_idr(idr, idr_lock, tmp_i_mark);
+ ret = inotify_add_to_idr(&group->inotify_data, tmp_i_mark);
if (ret)
goto out_err;
diff --git a/fs/proc/loadavg.c b/fs/proc/loadavg.c
index a000d7547479..84b628f4056b 100644
--- a/fs/proc/loadavg.c
+++ b/fs/proc/loadavg.c
@@ -24,7 +24,7 @@ static int loadavg_proc_show(struct seq_file *m, void *v)
LOAD_INT(avnrun[1]), LOAD_FRAC(avnrun[1]),
LOAD_INT(avnrun[2]), LOAD_FRAC(avnrun[2]),
nr_running(), nr_threads,
- idr_get_cursor(&task_active_pid_ns(current)->idr));
+ task_active_pid_ns(current)->next_pid - 1);
return 0;
}
diff --git a/include/linux/fsnotify_backend.h b/include/linux/fsnotify_backend.h
index 067d52e95f02..fb97158f3b97 100644
--- a/include/linux/fsnotify_backend.h
+++ b/include/linux/fsnotify_backend.h
@@ -178,6 +178,7 @@ struct fsnotify_group {
#ifdef CONFIG_INOTIFY_USER
struct inotify_group_private_data {
spinlock_t idr_lock;
+ int idr_cursor;
struct idr idr;
struct ucounts *ucounts;
} inotify_data;
diff --git a/include/linux/idr.h b/include/linux/idr.h
index 7c3a365f7e12..10bfe62423df 100644
--- a/include/linux/idr.h
+++ b/include/linux/idr.h
@@ -18,7 +18,6 @@
struct idr {
struct radix_tree_root idr_rt;
- unsigned int idr_next;
};
/*
@@ -36,32 +35,6 @@ struct idr {
}
#define DEFINE_IDR(name) struct idr name = IDR_INIT
-/**
- * idr_get_cursor - Return the current position of the cyclic allocator
- * @idr: idr handle
- *
- * The value returned is the value that will be next returned from
- * idr_alloc_cyclic() if it is free (otherwise the search will start from
- * this position).
- */
-static inline unsigned int idr_get_cursor(const struct idr *idr)
-{
- return READ_ONCE(idr->idr_next);
-}
-
-/**
- * idr_set_cursor - Set the current position of the cyclic allocator
- * @idr: idr handle
- * @val: new position
- *
- * The next call to idr_alloc_cyclic() will return @val if it is free
- * (otherwise the search will start from this position).
- */
-static inline void idr_set_cursor(struct idr *idr, unsigned int val)
-{
- WRITE_ONCE(idr->idr_next, val);
-}
-
/**
* DOC: idr sync
* idr synchronization (stolen from radix-tree.h)
@@ -130,7 +103,8 @@ static inline int idr_alloc_ext(struct idr *idr, void *ptr,
return idr_alloc_cmn(idr, ptr, index, start, end, gfp, true);
}
-int idr_alloc_cyclic(struct idr *, void *entry, int start, int end, gfp_t);
+int idr_alloc_cyclic(struct idr *, int *cursor, void *entry,
+ int start, int end, gfp_t);
int idr_for_each(const struct idr *,
int (*fn)(int id, void *p, void *data), void *data);
void *idr_get_next(struct idr *, int *nextid);
@@ -152,7 +126,6 @@ static inline void *idr_remove(struct idr *idr, int id)
static inline void idr_init(struct idr *idr)
{
INIT_RADIX_TREE(&idr->idr_rt, IDR_RT_MARKER);
- idr->idr_next = 0;
}
static inline bool idr_is_empty(const struct idr *idr)
diff --git a/include/linux/kernfs.h b/include/linux/kernfs.h
index ab25c8b6d9e3..2e74b1b361aa 100644
--- a/include/linux/kernfs.h
+++ b/include/linux/kernfs.h
@@ -185,6 +185,7 @@ struct kernfs_root {
/* private fields, do not use outside kernfs proper */
struct idr ino_idr;
+ int ino_cursor;
u32 next_generation;
struct kernfs_syscall_ops *syscall_ops;
diff --git a/include/linux/pid_namespace.h b/include/linux/pid_namespace.h
index 49538b172483..dce073d7b2fa 100644
--- a/include/linux/pid_namespace.h
+++ b/include/linux/pid_namespace.h
@@ -25,6 +25,7 @@ struct pid_namespace {
struct kref kref;
struct idr idr;
struct rcu_head rcu;
+ int next_pid;
unsigned int pid_allocated;
struct task_struct *child_reaper;
struct kmem_cache *pid_cachep;
diff --git a/include/net/sctp/sctp.h b/include/net/sctp/sctp.h
index 749a42882437..b920f6890bf3 100644
--- a/include/net/sctp/sctp.h
+++ b/include/net/sctp/sctp.h
@@ -505,6 +505,7 @@ void sctp_put_port(struct sock *sk);
extern struct idr sctp_assocs_id;
extern spinlock_t sctp_assocs_id_lock;
+extern int sctp_assocs_cursor;
/* Static inline functions. */
diff --git a/kernel/bpf/syscall.c b/kernel/bpf/syscall.c
index 09badc37e864..9c09f042c351 100644
--- a/kernel/bpf/syscall.c
+++ b/kernel/bpf/syscall.c
@@ -38,8 +38,10 @@
DEFINE_PER_CPU(int, bpf_prog_active);
static DEFINE_IDR(prog_idr);
+static int prog_cursor;
static DEFINE_SPINLOCK(prog_idr_lock);
static DEFINE_IDR(map_idr);
+static int map_cursor;
static DEFINE_SPINLOCK(map_idr_lock);
int sysctl_unprivileged_bpf_disabled __read_mostly;
@@ -178,7 +180,8 @@ static int bpf_map_alloc_id(struct bpf_map *map)
int id;
spin_lock_bh(&map_idr_lock);
- id = idr_alloc_cyclic(&map_idr, map, 1, INT_MAX, GFP_ATOMIC);
+ id = idr_alloc_cyclic(&map_idr, &map_cursor, map, 1, INT_MAX,
+ GFP_ATOMIC);
if (id > 0)
map->id = id;
spin_unlock_bh(&map_idr_lock);
@@ -893,7 +896,8 @@ static int bpf_prog_alloc_id(struct bpf_prog *prog)
int id;
spin_lock_bh(&prog_idr_lock);
- id = idr_alloc_cyclic(&prog_idr, prog, 1, INT_MAX, GFP_ATOMIC);
+ id = idr_alloc_cyclic(&prog_idr, &prog_cursor, prog, 1, INT_MAX,
+ GFP_ATOMIC);
if (id > 0)
prog->aux->id = id;
spin_unlock_bh(&prog_idr_lock);
diff --git a/kernel/cgroup/cgroup.c b/kernel/cgroup/cgroup.c
index 0b1ffe147f24..351b355336d4 100644
--- a/kernel/cgroup/cgroup.c
+++ b/kernel/cgroup/cgroup.c
@@ -173,6 +173,7 @@ static int cgroup_root_count;
/* hierarchy ID allocation and mapping, protected by cgroup_mutex */
static DEFINE_IDR(cgroup_hierarchy_idr);
+static int cgroup_hierarchy_cursor;
/*
* Assign a monotonically increasing serial number to csses. It guarantees
@@ -1213,7 +1214,8 @@ static int cgroup_init_root_id(struct cgroup_root *root)
lockdep_assert_held(&cgroup_mutex);
- id = idr_alloc_cyclic(&cgroup_hierarchy_idr, root, 0, 0, GFP_KERNEL);
+ id = idr_alloc_cyclic(&cgroup_hierarchy_idr, &cgroup_hierarchy_cursor,
+ root, 0, 0, GFP_KERNEL);
if (id < 0)
return id;
diff --git a/kernel/pid.c b/kernel/pid.c
index b13b624e2c49..aae5f3307c35 100644
--- a/kernel/pid.c
+++ b/kernel/pid.c
@@ -170,14 +170,14 @@ struct pid *alloc_pid(struct pid_namespace *ns)
* init really needs pid 1, but after reaching the maximum
* wrap back to RESERVED_PIDS
*/
- if (idr_get_cursor(&tmp->idr) > RESERVED_PIDS)
+ if (tmp->next_pid > RESERVED_PIDS)
pid_min = RESERVED_PIDS;
/*
* Store a null pointer so find_pid_ns does not find
* a partially initialized PID (see below).
*/
- nr = idr_alloc_cyclic(&tmp->idr, NULL, pid_min,
+ nr = idr_alloc_cyclic(&tmp->idr, &tmp->next_pid, NULL, pid_min,
pid_max, GFP_ATOMIC);
spin_unlock_irq(&pidmap_lock);
idr_preload_end();
diff --git a/kernel/pid_namespace.c b/kernel/pid_namespace.c
index 0b53eef7d34b..8246d92adc56 100644
--- a/kernel/pid_namespace.c
+++ b/kernel/pid_namespace.c
@@ -287,24 +287,22 @@ static int pid_ns_ctl_handler(struct ctl_table *table, int write,
{
struct pid_namespace *pid_ns = task_active_pid_ns(current);
struct ctl_table tmp = *table;
- int ret, next;
+ int ret, prev;
if (write && !ns_capable(pid_ns->user_ns, CAP_SYS_ADMIN))
return -EPERM;
/*
* Writing directly to ns' last_pid field is OK, since this field
- * is volatile in a living namespace anyway and a code writing to
+ * is volatile in a living namespace anyway and code writing to
* it should synchronize its usage with external means.
*/
- next = idr_get_cursor(&pid_ns->idr) - 1;
-
- tmp.data = &next;
+ prev = pid_ns->next_pid - 1;
+ tmp.data = &prev;
ret = proc_dointvec_minmax(&tmp, write, buffer, lenp, ppos);
if (!ret && write)
- idr_set_cursor(&pid_ns->idr, next + 1);
-
+ pid_ns->next_pid = prev + 1;
return ret;
}
diff --git a/lib/idr.c b/lib/idr.c
index 2593ce513a18..26cb99412b8f 100644
--- a/lib/idr.c
+++ b/lib/idr.c
@@ -17,6 +17,9 @@ int idr_alloc_cmn(struct idr *idr, void *ptr, unsigned long *index,
if (WARN_ON_ONCE(radix_tree_is_internal_node(ptr)))
return -EINVAL;
+ if (WARN_ON_ONCE(!(idr->idr_rt.gfp_mask & ROOT_IS_IDR)))
+ idr->idr_rt.gfp_mask |= IDR_RT_MARKER;
+
radix_tree_iter_init(&iter, start);
if (ext)
slot = idr_get_free_ext(&idr->idr_rt, &iter, gfp, end);
@@ -35,20 +38,22 @@ int idr_alloc_cmn(struct idr *idr, void *ptr, unsigned long *index,
EXPORT_SYMBOL_GPL(idr_alloc_cmn);
/**
- * idr_alloc_cyclic - allocate new idr entry in a cyclical fashion
- * @idr: idr handle
- * @ptr: pointer to be associated with the new id
- * @start: the minimum id (inclusive)
- * @end: the maximum id (exclusive)
- * @gfp: memory allocation flags
+ * idr_alloc_cyclic() - Allocate new idr entry in a cyclical fashion.
+ * @idr: idr handle.
+ * @cursor: A pointer to the next ID to allocate.
+ * @ptr: Pointer to be associated with the new id.
+ * @start: The minimum id (inclusive).
+ * @end: The maximum id (exclusive).
+ * @gfp: Memory allocation flags.
*
* Allocates an ID larger than the last ID allocated if one is available.
* If not, it will attempt to allocate the smallest ID that is larger or
* equal to @start.
*/
-int idr_alloc_cyclic(struct idr *idr, void *ptr, int start, int end, gfp_t gfp)
+int idr_alloc_cyclic(struct idr *idr, int *cursor, void *ptr,
+ int start, int end, gfp_t gfp)
{
- int id, curr = idr->idr_next;
+ int id, curr = *cursor;
if (curr < start)
curr = start;
@@ -58,7 +63,7 @@ int idr_alloc_cyclic(struct idr *idr, void *ptr, int start, int end, gfp_t gfp)
id = idr_alloc(idr, ptr, start, curr, gfp);
if (id >= 0)
- idr->idr_next = id + 1U;
+ *cursor = id + 1U;
return id;
}
diff --git a/net/rxrpc/af_rxrpc.c b/net/rxrpc/af_rxrpc.c
index 9b5c46b052fd..e10f35c2b8c3 100644
--- a/net/rxrpc/af_rxrpc.c
+++ b/net/rxrpc/af_rxrpc.c
@@ -959,7 +959,7 @@ static int __init af_rxrpc_init(void)
tmp &= 0x3fffffff;
if (tmp == 0)
tmp = 1;
- idr_set_cursor(&rxrpc_client_conn_ids, tmp);
+ rxrpc_client_conn_cursor = tmp;
ret = -ENOMEM;
rxrpc_call_jar = kmem_cache_create(
diff --git a/net/rxrpc/ar-internal.h b/net/rxrpc/ar-internal.h
index b2151993d384..a6fb30b08a30 100644
--- a/net/rxrpc/ar-internal.h
+++ b/net/rxrpc/ar-internal.h
@@ -806,6 +806,7 @@ extern unsigned int rxrpc_reap_client_connections;
extern unsigned int rxrpc_conn_idle_client_expiry;
extern unsigned int rxrpc_conn_idle_client_fast_expiry;
extern struct idr rxrpc_client_conn_ids;
+extern int rxrpc_client_conn_cursor;
void rxrpc_destroy_client_conn_ids(void);
int rxrpc_connect_call(struct rxrpc_call *, struct rxrpc_conn_parameters *,
diff --git a/net/rxrpc/conn_client.c b/net/rxrpc/conn_client.c
index 5f9624bd311c..7e8bf10fec86 100644
--- a/net/rxrpc/conn_client.c
+++ b/net/rxrpc/conn_client.c
@@ -91,8 +91,9 @@ __read_mostly unsigned int rxrpc_conn_idle_client_fast_expiry = 2 * HZ;
/*
* We use machine-unique IDs for our client connections.
*/
-DEFINE_IDR(rxrpc_client_conn_ids);
static DEFINE_SPINLOCK(rxrpc_conn_id_lock);
+int rxrpc_client_conn_cursor;
+DEFINE_IDR(rxrpc_client_conn_ids);
static void rxrpc_cull_active_client_conns(struct rxrpc_net *);
@@ -112,14 +113,12 @@ static int rxrpc_get_client_connection_id(struct rxrpc_connection *conn,
idr_preload(gfp);
spin_lock(&rxrpc_conn_id_lock);
-
- id = idr_alloc_cyclic(&rxrpc_client_conn_ids, conn,
- 1, 0x40000000, GFP_NOWAIT);
- if (id < 0)
- goto error;
-
+ id = idr_alloc_cyclic(&rxrpc_client_conn_ids, &rxrpc_client_conn_cursor,
+ conn, 1, 0x40000000, GFP_NOWAIT);
spin_unlock(&rxrpc_conn_id_lock);
idr_preload_end();
+ if (id < 0)
+ goto error;
conn->proto.epoch = rxnet->epoch;
conn->proto.cid = id << RXRPC_CIDSHIFT;
@@ -128,8 +127,6 @@ static int rxrpc_get_client_connection_id(struct rxrpc_connection *conn,
return 0;
error:
- spin_unlock(&rxrpc_conn_id_lock);
- idr_preload_end();
_leave(" = %d", id);
return id;
}
@@ -238,7 +235,7 @@ rxrpc_alloc_client_connection(struct rxrpc_conn_parameters *cp, gfp_t gfp)
static bool rxrpc_may_reuse_conn(struct rxrpc_connection *conn)
{
struct rxrpc_net *rxnet = conn->params.local->rxnet;
- int id_cursor, id, distance, limit;
+ int id, distance, limit;
if (test_bit(RXRPC_CONN_DONT_REUSE, &conn->flags))
goto dont_reuse;
@@ -252,9 +249,8 @@ static bool rxrpc_may_reuse_conn(struct rxrpc_connection *conn)
* times the maximum number of client conns away from the current
* allocation point to try and keep the IDs concentrated.
*/
- id_cursor = idr_get_cursor(&rxrpc_client_conn_ids);
id = conn->proto.cid >> RXRPC_CIDSHIFT;
- distance = id - id_cursor;
+ distance = id - rxrpc_client_conn_cursor;
if (distance < 0)
distance = -distance;
limit = max(rxrpc_max_client_connections * 4, 1024U);
diff --git a/net/sctp/associola.c b/net/sctp/associola.c
index 69394f4d6091..77178d7456fd 100644
--- a/net/sctp/associola.c
+++ b/net/sctp/associola.c
@@ -1622,7 +1622,8 @@ int sctp_assoc_set_id(struct sctp_association *asoc, gfp_t gfp)
idr_preload(gfp);
spin_lock_bh(&sctp_assocs_id_lock);
/* 0 is not a valid assoc_id, must be >= 1 */
- ret = idr_alloc_cyclic(&sctp_assocs_id, asoc, 1, 0, GFP_NOWAIT);
+ ret = idr_alloc_cyclic(&sctp_assocs_id, &sctp_assocs_cursor,
+ asoc, 1, 0, GFP_NOWAIT);
spin_unlock_bh(&sctp_assocs_id_lock);
if (preload)
idr_preload_end();
diff --git a/net/sctp/protocol.c b/net/sctp/protocol.c
index f5172c21349b..13b2c34c9502 100644
--- a/net/sctp/protocol.c
+++ b/net/sctp/protocol.c
@@ -67,6 +67,7 @@ struct sctp_globals sctp_globals __read_mostly;
struct idr sctp_assocs_id;
DEFINE_SPINLOCK(sctp_assocs_id_lock);
+int sctp_assocs_cursor;
static struct sctp_pf *sctp_pf_inet6_specific;
static struct sctp_pf *sctp_pf_inet_specific;
diff --git a/tools/testing/radix-tree/idr-test.c b/tools/testing/radix-tree/idr-test.c
index 193450b29bf0..1dff94c15da5 100644
--- a/tools/testing/radix-tree/idr-test.c
+++ b/tools/testing/radix-tree/idr-test.c
@@ -42,9 +42,10 @@ void idr_alloc_test(void)
{
unsigned long i;
DEFINE_IDR(idr);
+ int cursor = 0;
- assert(idr_alloc_cyclic(&idr, DUMMY_PTR, 0, 0x4000, GFP_KERNEL) == 0);
- assert(idr_alloc_cyclic(&idr, DUMMY_PTR, 0x3ffd, 0x4000, GFP_KERNEL) == 0x3ffd);
+ assert(idr_alloc_cyclic(&idr, &cursor, DUMMY_PTR, 0, 0x4000, GFP_KERNEL) == 0);
+ assert(idr_alloc_cyclic(&idr, &cursor, DUMMY_PTR, 0x3ffd, 0x4000, GFP_KERNEL) == 0x3ffd);
idr_remove(&idr, 0x3ffd);
idr_remove(&idr, 0);
@@ -57,7 +58,18 @@ void idr_alloc_test(void)
else
item = item_create(i - 0x3fff, 0);
- id = idr_alloc_cyclic(&idr, item, 1, 0x4000, GFP_KERNEL);
+ id = idr_alloc_cyclic(&idr, &cursor, item, 1, 0x4000, GFP_KERNEL);
+ assert(id == item->index);
+ }
+
+ idr_for_each(&idr, item_idr_free, &idr);
+ idr_destroy(&idr);
+
+ cursor = 0x7ffffffe;
+ for (i = 0x7ffffffe; i < 0x80000003; i++) {
+ struct item *item = item_create(i & 0x7fffffff, 0);
+ int id = idr_alloc_cyclic(&idr, &cursor, item, 0, 0,
+ GFP_KERNEL);
assert(id == item->index);
}
--
2.15.0
From 1588036372577232275@xxx Thu Dec 28 13:58:22 +0000 2017
From: Matthew Wilcox <[email protected]>
This is now defined in tools/include/linux/kernel.h, so our
definition generates a warning.
Signed-off-by: Matthew Wilcox <[email protected]>
---
tools/testing/radix-tree/linux/kernel.h | 2 --
1 file changed, 2 deletions(-)
diff --git a/tools/testing/radix-tree/linux/kernel.h b/tools/testing/radix-tree/linux/kernel.h
index c3bc3f364f68..426f32f28547 100644
--- a/tools/testing/radix-tree/linux/kernel.h
+++ b/tools/testing/radix-tree/linux/kernel.h
@@ -17,6 +17,4 @@
#define pr_debug printk
#define pr_cont printk
-#define ARRAY_SIZE(arr) (sizeof(arr) / sizeof((arr)[0]))
-
#endif /* _KERNEL_H */
--
2.15.0
From 1584808961445130561@xxx Wed Nov 22 23:00:02 +0000 2017
From: Matthew Wilcox <[email protected]>
The page cache wants to search backwards to find the first hole.
Its definition of a hole doesn't make sense for the xarray, so introduce
a function called xas_prev_any() which will return any kind of entry,
including NULL or internal entries.
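For reference, the backwards scan then takes the shape below (this condenses
the mm/filemap.c hunk in this patch; mapping, index and max_scan come from
page_cache_prev_hole()):
	XA_STATE(xas, index);
	while (max_scan--) {
		void *entry = xas_prev_any(&mapping->pages, &xas);
		if (!entry || xa_is_value(entry))
			break;
	}
	/* if we stepped back past index 0, xas.xa_index is now ULONG_MAX */
	return xas.xa_index;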
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/xarray.h | 1 +
lib/xarray.c | 49 +++++++++++++++++++++++++++++++++++++++++++++++++
mm/filemap.c | 15 +++++----------
3 files changed, 55 insertions(+), 10 deletions(-)
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index 347347499652..e0cfe6944752 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -466,6 +466,7 @@ void *xas_load(struct xarray *, struct xa_state *);
void *xas_store(struct xarray *, struct xa_state *, void *entry);
void *xas_create(struct xarray *, struct xa_state *);
void *xas_find(struct xarray *, struct xa_state *, unsigned long max);
+void *xas_prev_any(struct xarray *, struct xa_state *);
bool xas_get_tag(const struct xarray *, const struct xa_state *, xa_tag_t);
void xas_set_tag(struct xarray *, const struct xa_state *, xa_tag_t);
diff --git a/lib/xarray.c b/lib/xarray.c
index 4fc1073f9454..202e5aae596d 100644
--- a/lib/xarray.c
+++ b/lib/xarray.c
@@ -948,6 +948,55 @@ void *xas_find_tag(struct xarray *xa, struct xa_state *xas, unsigned long max,
}
EXPORT_SYMBOL_GPL(xas_find_tag);
+/**
+ * xas_prev_any() - Find the previous entry in the XArray.
+ * @xa: XArray.
+ * @xas: XArray operation state.
+ *
+ * If the xas has not yet been walked to an entry, return the entry
+ * which has an index = xas.xa_index. If it has been walked, the entry
+ * currently being pointed at has been processed, and so we move to the
+ * previous entry.
+ *
+ * If asked for the previous entry of 0, this function returns NULL and
+ * sets xa_index to ULONG_MAX. The caller is responsible for detecting
+ * this situation.
+ *
+ * Return: The entry at the index, even if it is NULL.
+ */
+void *xas_prev_any(struct xarray *xa, struct xa_state *xas)
+{
+ void *entry;
+
+ if (xas_error(xas))
+ return NULL;
+
+ if (xas->xa_node == XAS_RESTART)
+ return xas_load(xa, xas);
+
+ while (xas->xa_node) {
+ if (unlikely(xas->xa_offset == 0)) {
+ xas->xa_offset = xas->xa_node->offset;
+ xas->xa_node = xa_parent(xa, xas->xa_node);
+ continue;
+ }
+
+ xas->xa_offset--;
+ xas->xa_index -= 1UL << xas->xa_node->shift;
+ entry = xa_entry(xa, xas->xa_node, xas->xa_offset);
+ if (!xa_is_node(entry))
+ return entry;
+
+ xas->xa_node = xa_to_node(entry);
+ xas->xa_offset = XA_CHUNK_MASK;
+ xas->xa_index |= XA_CHUNK_MASK << xas->xa_node->shift;
+ }
+
+ xas->xa_index = ULONG_MAX;
+ return NULL;
+}
+EXPORT_SYMBOL_GPL(xas_prev_any);
+
/**
* __xa_init() - Initialise an empty XArray
* @xa: XArray.
diff --git a/mm/filemap.c b/mm/filemap.c
index 1d012dd3629e..1c03b0ea105e 100644
--- a/mm/filemap.c
+++ b/mm/filemap.c
@@ -1390,20 +1390,15 @@ EXPORT_SYMBOL(page_cache_next_hole);
pgoff_t page_cache_prev_hole(struct address_space *mapping,
pgoff_t index, unsigned long max_scan)
{
- unsigned long i;
-
- for (i = 0; i < max_scan; i++) {
- struct page *page;
+ XA_STATE(xas, index);
- page = radix_tree_lookup(&mapping->pages, index);
- if (!page || xa_is_value(page))
- break;
- index--;
- if (index == ULONG_MAX)
+ while (max_scan--) {
+ void *entry = xas_prev_any(&mapping->pages, &xas);
+ if (!entry || xa_is_value(entry))
break;
}
- return index;
+ return xas.xa_index;
}
EXPORT_SYMBOL(page_cache_prev_hole);
--
2.15.0
From 1584801285087510596@xxx Wed Nov 22 20:58:02 +0000 2017
From: Matthew Wilcox <[email protected]>
Getting rid of the m_perag_lock lets us also get rid of the call to
radix_tree_preload(). This is a relatively naive conversion; we could
improve performance over the radix tree implementation by passing around
xa_state pointers instead of indices, possibly at the expense of extending
rcu_read_lock() periods.
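The lookup side reduces to an RCU-protected xa_load(); this is a condensed
form of the xfs_perag_get() hunk below, not new code:
	struct xfs_perag	*pag;
	rcu_read_lock();
	pag = xa_load(&mp->m_perag_xa, agno);	/* was radix_tree_lookup() */
	if (pag)
		atomic_inc(&pag->pag_ref);
	rcu_read_unlock();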
Signed-off-by: Matthew Wilcox <[email protected]>
---
fs/xfs/libxfs/xfs_sb.c | 9 ++++-----
fs/xfs/xfs_icache.c | 35 +++++++++--------------------------
fs/xfs/xfs_icache.h | 6 +++---
fs/xfs/xfs_mount.c | 19 ++++---------------
fs/xfs/xfs_mount.h | 3 +--
5 files changed, 21 insertions(+), 51 deletions(-)
diff --git a/fs/xfs/libxfs/xfs_sb.c b/fs/xfs/libxfs/xfs_sb.c
index 9b5aae2bcc0b..811fa57007c9 100644
--- a/fs/xfs/libxfs/xfs_sb.c
+++ b/fs/xfs/libxfs/xfs_sb.c
@@ -59,7 +59,7 @@ xfs_perag_get(
int ref = 0;
rcu_read_lock();
- pag = radix_tree_lookup(&mp->m_perag_tree, agno);
+ pag = xa_load(&mp->m_perag_xa, agno);
if (pag) {
ASSERT(atomic_read(&pag->pag_ref) >= 0);
ref = atomic_inc_return(&pag->pag_ref);
@@ -78,14 +78,13 @@ xfs_perag_get_tag(
xfs_agnumber_t first,
int tag)
{
+ XA_STATE(xas, first);
struct xfs_perag *pag;
- int found;
int ref;
rcu_read_lock();
- found = radix_tree_gang_lookup_tag(&mp->m_perag_tree,
- (void **)&pag, first, 1, tag);
- if (found <= 0) {
+ pag = xas_find_tag(&mp->m_perag_xa, &xas, ULONG_MAX, tag);
+ if (!pag) {
rcu_read_unlock();
return NULL;
}
diff --git a/fs/xfs/xfs_icache.c b/fs/xfs/xfs_icache.c
index 43005fbe8b1e..f56e500d89e2 100644
--- a/fs/xfs/xfs_icache.c
+++ b/fs/xfs/xfs_icache.c
@@ -156,13 +156,10 @@ static void
xfs_reclaim_work_queue(
struct xfs_mount *mp)
{
-
- rcu_read_lock();
- if (radix_tree_tagged(&mp->m_perag_tree, XFS_ICI_RECLAIM_TAG)) {
+ if (xa_tagged(&mp->m_perag_xa, XFS_ICI_RECLAIM_TAG)) {
queue_delayed_work(mp->m_reclaim_workqueue, &mp->m_reclaim_work,
msecs_to_jiffies(xfs_syncd_centisecs / 6 * 10));
}
- rcu_read_unlock();
}
/*
@@ -194,10 +191,7 @@ xfs_perag_set_reclaim_tag(
return;
/* propagate the reclaim tag up into the perag radix tree */
- spin_lock(&mp->m_perag_lock);
- radix_tree_tag_set(&mp->m_perag_tree, pag->pag_agno,
- XFS_ICI_RECLAIM_TAG);
- spin_unlock(&mp->m_perag_lock);
+ xa_set_tag(&mp->m_perag_xa, pag->pag_agno, XFS_ICI_RECLAIM_TAG);
/* schedule periodic background inode reclaim */
xfs_reclaim_work_queue(mp);
@@ -216,10 +210,7 @@ xfs_perag_clear_reclaim_tag(
return;
/* clear the reclaim tag from the perag radix tree */
- spin_lock(&mp->m_perag_lock);
- radix_tree_tag_clear(&mp->m_perag_tree, pag->pag_agno,
- XFS_ICI_RECLAIM_TAG);
- spin_unlock(&mp->m_perag_lock);
+ xa_clear_tag(&mp->m_perag_xa, pag->pag_agno, XFS_ICI_RECLAIM_TAG);
trace_xfs_perag_clear_reclaim(mp, pag->pag_agno, -1, _RET_IP_);
}
@@ -847,12 +838,10 @@ void
xfs_queue_eofblocks(
struct xfs_mount *mp)
{
- rcu_read_lock();
- if (radix_tree_tagged(&mp->m_perag_tree, XFS_ICI_EOFBLOCKS_TAG))
+ if (xa_tagged(&mp->m_perag_xa, XFS_ICI_EOFBLOCKS_TAG))
queue_delayed_work(mp->m_eofblocks_workqueue,
&mp->m_eofblocks_work,
msecs_to_jiffies(xfs_eofb_secs * 1000));
- rcu_read_unlock();
}
void
@@ -874,12 +863,10 @@ STATIC void
xfs_queue_cowblocks(
struct xfs_mount *mp)
{
- rcu_read_lock();
- if (radix_tree_tagged(&mp->m_perag_tree, XFS_ICI_COWBLOCKS_TAG))
+ if (xa_tagged(&mp->m_perag_xa, XFS_ICI_COWBLOCKS_TAG))
queue_delayed_work(mp->m_eofblocks_workqueue,
&mp->m_cowblocks_work,
msecs_to_jiffies(xfs_cowb_secs * 1000));
- rcu_read_unlock();
}
void
@@ -1542,7 +1529,7 @@ __xfs_inode_set_eofblocks_tag(
void (*execute)(struct xfs_mount *mp),
void (*set_tp)(struct xfs_mount *mp, xfs_agnumber_t agno,
int error, unsigned long caller_ip),
- int tag)
+ xa_tag_t tag)
{
struct xfs_mount *mp = ip->i_mount;
struct xfs_perag *pag;
@@ -1566,11 +1553,9 @@ __xfs_inode_set_eofblocks_tag(
XFS_INO_TO_AGINO(ip->i_mount, ip->i_ino), tag);
if (!tagged) {
/* propagate the eofblocks tag up into the perag radix tree */
- spin_lock(&ip->i_mount->m_perag_lock);
- radix_tree_tag_set(&ip->i_mount->m_perag_tree,
+ xa_set_tag(&ip->i_mount->m_perag_xa,
XFS_INO_TO_AGNO(ip->i_mount, ip->i_ino),
tag);
- spin_unlock(&ip->i_mount->m_perag_lock);
/* kick off background trimming */
execute(ip->i_mount);
@@ -1597,7 +1582,7 @@ __xfs_inode_clear_eofblocks_tag(
xfs_inode_t *ip,
void (*clear_tp)(struct xfs_mount *mp, xfs_agnumber_t agno,
int error, unsigned long caller_ip),
- int tag)
+ xa_tag_t tag)
{
struct xfs_mount *mp = ip->i_mount;
struct xfs_perag *pag;
@@ -1613,11 +1598,9 @@ __xfs_inode_clear_eofblocks_tag(
XFS_INO_TO_AGINO(ip->i_mount, ip->i_ino), tag);
if (!radix_tree_tagged(&pag->pag_ici_root, tag)) {
/* clear the eofblocks tag from the perag radix tree */
- spin_lock(&ip->i_mount->m_perag_lock);
- radix_tree_tag_clear(&ip->i_mount->m_perag_tree,
+ xa_clear_tag(&ip->i_mount->m_perag_xa,
XFS_INO_TO_AGNO(ip->i_mount, ip->i_ino),
tag);
- spin_unlock(&ip->i_mount->m_perag_lock);
clear_tp(ip->i_mount, pag->pag_agno, -1, _RET_IP_);
}
diff --git a/fs/xfs/xfs_icache.h b/fs/xfs/xfs_icache.h
index bff4d85e5498..bd04d5adadfe 100644
--- a/fs/xfs/xfs_icache.h
+++ b/fs/xfs/xfs_icache.h
@@ -37,9 +37,9 @@ struct xfs_eofblocks {
*/
#define XFS_ICI_NO_TAG (-1) /* special flag for an untagged lookup
in xfs_inode_ag_iterator */
-#define XFS_ICI_RECLAIM_TAG 0 /* inode is to be reclaimed */
-#define XFS_ICI_EOFBLOCKS_TAG 1 /* inode has blocks beyond EOF */
-#define XFS_ICI_COWBLOCKS_TAG 2 /* inode can have cow blocks to gc */
+#define XFS_ICI_RECLAIM_TAG XA_TAG_0 /* inode is to be reclaimed */
+#define XFS_ICI_EOFBLOCKS_TAG XA_TAG_1 /* inode has blocks beyond EOF */
+#define XFS_ICI_COWBLOCKS_TAG XA_TAG_2 /* inode can have cow blocks to gc */
/*
* Flags for xfs_iget()
diff --git a/fs/xfs/xfs_mount.c b/fs/xfs/xfs_mount.c
index c879b517cc94..ae90f6f8fa5c 100644
--- a/fs/xfs/xfs_mount.c
+++ b/fs/xfs/xfs_mount.c
@@ -156,9 +156,7 @@ xfs_free_perag(
struct xfs_perag *pag;
for (agno = 0; agno < mp->m_sb.sb_agcount; agno++) {
- spin_lock(&mp->m_perag_lock);
- pag = radix_tree_delete(&mp->m_perag_tree, agno);
- spin_unlock(&mp->m_perag_lock);
+ pag = xa_store(&mp->m_perag_xa, agno, NULL, GFP_NOWAIT);
ASSERT(pag);
ASSERT(atomic_read(&pag->pag_ref) == 0);
xfs_buf_hash_destroy(pag);
@@ -219,19 +217,11 @@ xfs_initialize_perag(
goto out_free_pag;
init_waitqueue_head(&pag->pagb_wait);
- if (radix_tree_preload(GFP_NOFS))
- goto out_hash_destroy;
-
- spin_lock(&mp->m_perag_lock);
- if (radix_tree_insert(&mp->m_perag_tree, index, pag)) {
+ if (xa_store(&mp->m_perag_xa, index, pag, GFP_NOFS)) {
BUG();
- spin_unlock(&mp->m_perag_lock);
- radix_tree_preload_end();
error = -EEXIST;
goto out_hash_destroy;
}
- spin_unlock(&mp->m_perag_lock);
- radix_tree_preload_end();
/* first new pag is fully initialized */
if (first_initialised == NULLAGNUMBER)
first_initialised = index;
@@ -252,7 +242,7 @@ xfs_initialize_perag(
out_unwind_new_pags:
/* unwind any prior newly initialized pags */
for (index = first_initialised; index < agcount; index++) {
- pag = radix_tree_delete(&mp->m_perag_tree, index);
+ pag = xa_store(&mp->m_perag_xa, index, NULL, GFP_NOWAIT);
if (!pag)
break;
xfs_buf_hash_destroy(pag);
@@ -816,8 +806,7 @@ xfs_mountfs(
/*
* Allocate and initialize the per-ag data.
*/
- spin_lock_init(&mp->m_perag_lock);
- INIT_RADIX_TREE(&mp->m_perag_tree, GFP_ATOMIC);
+ xa_init(&mp->m_perag_xa);
error = xfs_initialize_perag(mp, sbp->sb_agcount, &mp->m_maxagi);
if (error) {
xfs_warn(mp, "Failed per-ag init: %d", error);
diff --git a/fs/xfs/xfs_mount.h b/fs/xfs/xfs_mount.h
index e0792d036be2..6e5ad7b26f46 100644
--- a/fs/xfs/xfs_mount.h
+++ b/fs/xfs/xfs_mount.h
@@ -134,8 +134,7 @@ typedef struct xfs_mount {
xfs_extlen_t m_ag_prealloc_blocks; /* reserved ag blocks */
uint m_alloc_set_aside; /* space we can't use */
uint m_ag_max_usable; /* max space per AG */
- struct radix_tree_root m_perag_tree; /* per-ag accounting info */
- spinlock_t m_perag_lock; /* lock for m_perag_tree */
+ struct xarray m_perag_xa; /* per-ag accounting info */
struct mutex m_growlock; /* growfs mutex */
int m_fixedfsid[2]; /* unchanged for life of FS */
uint m_dmevmask; /* DMI events for this FS */
--
2.15.0
From 1584871034421347432@xxx Thu Nov 23 15:26:40 +0000 2017
From: Matthew Wilcox <[email protected]>
This function frees all the internal memory allocated to the xarray
and reinitialises it to be empty.
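A usage sketch (hypothetical caller; the XArray never frees the entries
themselves): release the entries first, then tear down the tree.
static void example_teardown(struct xarray *xa, unsigned long nr_entries)
{
	unsigned long index;
	for (index = 0; index < nr_entries; index++)
		kfree(xa_load(xa, index));	/* entries assumed to be kmalloc'ed */
	xa_destroy(xa);	/* frees internal nodes; xa is left empty and reusable */
}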
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/xarray.h | 1 +
lib/xarray.c | 26 ++++++++++++++++++++++++++
2 files changed, 27 insertions(+)
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index a48e7aa6406c..347347499652 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -98,6 +98,7 @@ void *xa_load(struct xarray *, unsigned long index);
void *xa_store(struct xarray *, unsigned long index, void *entry, gfp_t);
void *xa_cmpxchg(struct xarray *, unsigned long index,
void *old, void *entry, gfp_t);
+void xa_destroy(struct xarray *);
/**
* xa_empty() - Determine if an array has any present entries
diff --git a/lib/xarray.c b/lib/xarray.c
index 9577a70495c0..4fc1073f9454 100644
--- a/lib/xarray.c
+++ b/lib/xarray.c
@@ -1312,3 +1312,29 @@ int xa_get_tagged(struct xarray *xa, void **dst, unsigned long start,
return i;
}
EXPORT_SYMBOL(xa_get_tagged);
+
+/**
+ * xa_destroy() - Free all internal data structures.
+ * @xa: XArray.
+ *
+ * After calling this function, the XArray is empty and has freed all memory
+ * allocated for its internal data structures. You are responsible for
+ * freeing the objects referenced by the XArray.
+ */
+void xa_destroy(struct xarray *xa)
+{
+ XA_STATE(xas, 0);
+ unsigned long flags;
+ void *entry;
+
+ xas.xa_node = NULL;
+ xa_lock_irqsave(xa, flags);
+ entry = xa_head_locked(xa);
+ RCU_INIT_POINTER(xa->xa_head, NULL);
+ xas_init_tags(xa, &xas);
+ /* lockdep checks we're still holding the lock in xas_free_nodes() */
+ if (xa_is_node(entry))
+ xas_free_nodes(xa, &xas, xa_to_node(entry));
+ xa_unlock_irqrestore(xa, flags);
+}
+EXPORT_SYMBOL(xa_destroy);
--
2.15.0
From 1584802480824091776@xxx Wed Nov 22 21:17:02 +0000 2017
From: Matthew Wilcox <[email protected]>
shmem_radix_tree_replace() is renamed to shmem_xa_replace() and
converted to use the XArray API.
Signed-off-by: Matthew Wilcox <[email protected]>
---
mm/shmem.c | 22 ++++++++--------------
1 file changed, 8 insertions(+), 14 deletions(-)
diff --git a/mm/shmem.c b/mm/shmem.c
index 21bf42f14ee2..f16afa03cfb0 100644
--- a/mm/shmem.c
+++ b/mm/shmem.c
@@ -321,24 +321,20 @@ void shmem_uncharge(struct inode *inode, long pages)
}
/*
- * Replace item expected in radix tree by a new item, while holding tree lock.
+ * Replace item expected in xarray by a new item, while holding xa_lock.
*/
-static int shmem_radix_tree_replace(struct address_space *mapping,
+static int shmem_xa_replace(struct address_space *mapping,
pgoff_t index, void *expected, void *replacement)
{
- struct radix_tree_node *node;
- void **pslot;
+ XA_STATE(xas, index);
void *item;
VM_BUG_ON(!expected);
VM_BUG_ON(!replacement);
- item = __radix_tree_lookup(&mapping->pages, index, &node, &pslot);
- if (!item)
- return -ENOENT;
+ item = xas_load(&mapping->pages, &xas);
if (item != expected)
return -ENOENT;
- __radix_tree_replace(&mapping->pages, node, pslot,
- replacement, NULL);
+ xas_store(&mapping->pages, &xas, replacement);
return 0;
}
@@ -605,8 +601,7 @@ static int shmem_add_to_page_cache(struct page *page,
} else if (!expected) {
error = radix_tree_insert(&mapping->pages, index, page);
} else {
- error = shmem_radix_tree_replace(mapping, index, expected,
- page);
+ error = shmem_xa_replace(mapping, index, expected, page);
}
if (!error) {
@@ -635,7 +630,7 @@ static void shmem_delete_from_page_cache(struct page *page, void *radswap)
VM_BUG_ON_PAGE(PageCompound(page), page);
xa_lock_irq(&mapping->pages);
- error = shmem_radix_tree_replace(mapping, page->index, page, radswap);
+ error = shmem_xa_replace(mapping, page->index, page, radswap);
page->mapping = NULL;
mapping->nrpages--;
__dec_node_page_state(page, NR_FILE_PAGES);
@@ -1550,8 +1545,7 @@ static int shmem_replace_page(struct page **pagep, gfp_t gfp,
* a nice clean interface for us to replace oldpage by newpage there.
*/
xa_lock_irq(&swap_mapping->pages);
- error = shmem_radix_tree_replace(swap_mapping, swap_index, oldpage,
- newpage);
+ error = shmem_xa_replace(swap_mapping, swap_index, oldpage, newpage);
if (!error) {
__inc_node_page_state(newpage, NR_FILE_PAGES);
__dec_node_page_state(oldpage, NR_FILE_PAGES);
--
2.15.0
From 1585267402693812836@xxx Tue Nov 28 00:26:46 +0000 2017
From: Matthew Wilcox <[email protected]>
The IDR now has its own locking, so remove the driver's private lock and its
calls to idr_preload().
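For reference, the allocation collapses from the preload-plus-private-lock
pattern to a single call, and lookups that must be atomic against removal use
the IDR's own lock (both taken from the hunks below):
	/* allocation: the IDR takes its internal lock and may sleep for memory */
	error = idr_alloc(&st_index_idr, tpnt, 0, ST_MAX_TAPES + 1, GFP_KERNEL);
	/* lookup: hold the IDR lock across the find and the refcount bump */
	idr_lock(&st_index_idr);
	STp = idr_find(&st_index_idr, dev);
	if (STp)
		kref_get(&STp->kref);
	idr_unlock(&st_index_idr);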
Signed-off-by: Matthew Wilcox <[email protected]>
---
drivers/scsi/st.c | 18 +++++-------------
1 file changed, 5 insertions(+), 13 deletions(-)
diff --git a/drivers/scsi/st.c b/drivers/scsi/st.c
index b141d7641a2e..e8703905c94a 100644
--- a/drivers/scsi/st.c
+++ b/drivers/scsi/st.c
@@ -221,7 +221,6 @@ static void scsi_tape_release(struct kref *);
#define to_scsi_tape(obj) container_of(obj, struct scsi_tape, kref)
static DEFINE_MUTEX(st_ref_mutex);
-static DEFINE_SPINLOCK(st_index_lock);
static DEFINE_SPINLOCK(st_use_lock);
static DEFINE_IDR(st_index_idr);
@@ -242,10 +241,11 @@ static struct scsi_tape *scsi_tape_get(int dev)
struct scsi_tape *STp = NULL;
mutex_lock(&st_ref_mutex);
- spin_lock(&st_index_lock);
+ idr_lock(&st_index_idr);
STp = idr_find(&st_index_idr, dev);
- if (!STp) goto out;
+ if (!STp)
+ goto out;
kref_get(&STp->kref);
@@ -261,7 +261,7 @@ static struct scsi_tape *scsi_tape_get(int dev)
kref_put(&STp->kref, scsi_tape_release);
STp = NULL;
out:
- spin_unlock(&st_index_lock);
+ idr_unlock(&st_index_idr);
mutex_unlock(&st_ref_mutex);
return STp;
}
@@ -4370,11 +4370,7 @@ static int st_probe(struct device *dev)
tpnt->blksize_changed = 0;
mutex_init(&tpnt->lock);
- idr_preload(GFP_KERNEL);
- spin_lock(&st_index_lock);
- error = idr_alloc(&st_index_idr, tpnt, 0, ST_MAX_TAPES + 1, GFP_NOWAIT);
- spin_unlock(&st_index_lock);
- idr_preload_end();
+ error = idr_alloc(&st_index_idr, tpnt, 0, ST_MAX_TAPES + 1, GFP_KERNEL);
if (error < 0) {
pr_warn("st: idr allocation failed: %d\n", error);
goto out_put_queue;
@@ -4408,9 +4404,7 @@ static int st_probe(struct device *dev)
remove_cdevs(tpnt);
kfree(tpnt->stats);
out_idr_remove:
- spin_lock(&st_index_lock);
idr_remove(&st_index_idr, tpnt->index);
- spin_unlock(&st_index_lock);
out_put_queue:
blk_put_queue(disk->queue);
out_put_disk:
@@ -4435,9 +4429,7 @@ static int st_remove(struct device *dev)
mutex_lock(&st_ref_mutex);
kref_put(&tpnt->kref, scsi_tape_release);
mutex_unlock(&st_ref_mutex);
- spin_lock(&st_index_lock);
idr_remove(&st_index_idr, index);
- spin_unlock(&st_index_lock);
return 0;
}
--
2.15.0
From 1584827755787245555@xxx Thu Nov 23 03:58:46 +0000 2017
From: Matthew Wilcox <[email protected]>
These variations will find any entry (whether it be NULL or an internal
entry). The only thing they won't return is a node pointer, because they
walk down through nodes rather than returning one. xas_next_any() is an
inline version of xas_find_any() which avoids making a function call for
the common cases.
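A sketch of a forward scan for the first hole, analogous to the
xas_prev_any() usage in page_cache_prev_hole(); this is illustrative only
and not taken from any patch in this series (mapping, index and max_scan are
assumed from the page cache context):
	XA_STATE(xas, index);
	while (max_scan--) {
		void *entry = xas_next_any(&mapping->pages, &xas);
		if (!entry || xa_is_value(entry))
			break;	/* NULL or a shadow entry counts as a hole */
	}
	return xas.xa_index;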
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/xarray.h | 25 +++++++++++++++++++++++++
lib/xarray.c | 51 ++++++++++++++++++++++++++++++++++++++++++++++++++
2 files changed, 76 insertions(+)
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index e0cfe6944752..8ab6c4468c21 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -466,6 +466,7 @@ void *xas_load(struct xarray *, struct xa_state *);
void *xas_store(struct xarray *, struct xa_state *, void *entry);
void *xas_create(struct xarray *, struct xa_state *);
void *xas_find(struct xarray *, struct xa_state *, unsigned long max);
+void *xas_find_any(struct xarray *, struct xa_state *);
void *xas_prev_any(struct xarray *, struct xa_state *);
bool xas_get_tag(const struct xarray *, const struct xa_state *, xa_tag_t);
@@ -540,6 +541,30 @@ static inline bool xa_iter_skip(void *entry)
(xa_is_internal(entry) && entry < XA_RETRY_ENTRY));
}
+/**
+ * xas_next_any() - Advance iterator to next entry of any kind.
+ * @xa: XArray.
+ * @xas: XArray operation state.
+ *
+ * xas_next_any() is an inline function to optimise xarray traversal for speed.
+ * It is equivalent to calling xas_find_any(), and will call xas_find_any()
+ * for all the hard cases.
+ *
+ * Return: The next entry after the one currently referred to by @xas.
+ */
+static inline void *xas_next_any(struct xarray *xa, struct xa_state *xas)
+{
+ struct xa_node *node = xas->xa_node;
+
+ if (unlikely(xas_not_node(node) || node->shift ||
+ xas->xa_offset == XA_CHUNK_MASK))
+ return xas_find_any(xa, xas);
+
+ xas->xa_index++;
+ xas->xa_offset++;
+ return xa_entry(xa, node, xas->xa_offset);
+}
+
/**
* xas_next() - Advance iterator to next present entry.
* @xa: XArray.
diff --git a/lib/xarray.c b/lib/xarray.c
index 202e5aae596d..38e5fe39bb97 100644
--- a/lib/xarray.c
+++ b/lib/xarray.c
@@ -805,6 +805,57 @@ void xas_pause(struct xa_state *xas)
}
EXPORT_SYMBOL_GPL(xas_pause);
+/**
+ * xas_find_any() - Find the next entry in the XArray.
+ * @xa: XArray.
+ * @xas: XArray operation state.
+ *
+ * If the xas has not yet been walked to an entry, return the entry
+ * which has an index >= xas.xa_index. If it has been walked, the entry
+ * currently being pointed at has been processed, and so we move to the
+ * next entry.
+ *
+ * Return: The entry at that location.
+ */
+void *xas_find_any(struct xarray *xa, struct xa_state *xas)
+{
+ void *entry;
+
+ if (xas_error(xas))
+ return NULL;
+
+ if (xas->xa_node == XAS_RESTART) {
+ return xas_load(xa, xas);
+ } else if (!xas->xa_node) {
+ xas->xa_index = 1;
+ xas->xa_node = XAS_RESTART;
+ return NULL;
+ }
+
+ xas->xa_index = next_index(xas->xa_index, xas->xa_node);
+ xas->xa_offset++;
+
+ while (xas->xa_node) {
+ if (unlikely(xas->xa_offset == XA_CHUNK_SIZE)) {
+ xas->xa_offset = xas->xa_node->offset + 1;
+ xas->xa_node = xa_parent(xa, xas->xa_node);
+ continue;
+ }
+
+ entry = xa_entry(xa, xas->xa_node, xas->xa_offset);
+ if (!xa_is_node(entry))
+ return entry;
+
+ xas->xa_node = xa_to_node(entry);
+ xas->xa_offset = 0;
+ }
+
+ if (!xas->xa_node)
+ xas->xa_node = XAS_RESTART;
+ return NULL;
+}
+EXPORT_SYMBOL_GPL(xas_find_any);
+
/**
* xas_find() - Find the next present entry in the XArray.
* @xa: XArray.
--
2.15.0
From: Matthew Wilcox <[email protected]>
This eliminates a call to radix_tree_preload().
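The GFP_NOFS preload turns into the xas_nomem() retry pattern; condensed
from the xfs_mru_cache_insert() hunk below:
	XA_STATE(xas, key);
	do {
		xa_lock(&mru->store);
		xas_store(&mru->store, &xas, elem);
		error = xas_error(&xas);
		if (!error)
			_xfs_mru_cache_list_insert(mru, elem);
		xa_unlock(&mru->store);
	} while (xas_nomem(&xas, GFP_NOFS));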
Signed-off-by: Matthew Wilcox <[email protected]>
---
fs/xfs/xfs_mru_cache.c | 71 +++++++++++++++++++++++++-------------------------
1 file changed, 35 insertions(+), 36 deletions(-)
diff --git a/fs/xfs/xfs_mru_cache.c b/fs/xfs/xfs_mru_cache.c
index f8a674d7f092..d665ac490045 100644
--- a/fs/xfs/xfs_mru_cache.c
+++ b/fs/xfs/xfs_mru_cache.c
@@ -101,10 +101,9 @@
* an infinite loop in the code.
*/
struct xfs_mru_cache {
- struct radix_tree_root store; /* Core storage data structure. */
+ struct xarray store; /* Core storage data structure. */
struct list_head *lists; /* Array of lists, one per grp. */
struct list_head reap_list; /* Elements overdue for reaping. */
- spinlock_t lock; /* Lock to protect this struct. */
unsigned int grp_count; /* Number of discrete groups. */
unsigned int grp_time; /* Time period spanned by grps. */
unsigned int lru_grp; /* Group containing time zero. */
@@ -232,22 +231,24 @@ _xfs_mru_cache_list_insert(
* data store, removing it from the reap list, calling the client's free
* function and deleting the element from the element zone.
*
- * We get called holding the mru->lock, which we drop and then reacquire.
- * Sparse need special help with this to tell it we know what we are doing.
+ * We get called holding the mru->store lock, which we drop and then reacquire.
+ * Sparse needs special help with this to tell it we know what we are doing.
*/
STATIC void
_xfs_mru_cache_clear_reap_list(
struct xfs_mru_cache *mru)
- __releases(mru->lock) __acquires(mru->lock)
+ __releases(mru->store) __acquires(mru->store)
{
+ XA_STATE(xas, 0);
struct xfs_mru_cache_elem *elem, *next;
struct list_head tmp;
INIT_LIST_HEAD(&tmp);
list_for_each_entry_safe(elem, next, &mru->reap_list, list_node) {
+ xas_set(&xas, elem->key);
/* Remove the element from the data store. */
- radix_tree_delete(&mru->store, elem->key);
+ xas_store(&mru->store, &xas, NULL);
/*
* remove to temp list so it can be freed without
@@ -255,14 +256,14 @@ _xfs_mru_cache_clear_reap_list(
*/
list_move(&elem->list_node, &tmp);
}
- spin_unlock(&mru->lock);
+ xa_unlock(&mru->store);
list_for_each_entry_safe(elem, next, &tmp, list_node) {
list_del_init(&elem->list_node);
mru->free_func(elem);
}
- spin_lock(&mru->lock);
+ xa_lock(&mru->store);
}
/*
@@ -284,7 +285,7 @@ _xfs_mru_cache_reap(
if (!mru || !mru->lists)
return;
- spin_lock(&mru->lock);
+ xa_lock(&mru->store);
next = _xfs_mru_cache_migrate(mru, jiffies);
_xfs_mru_cache_clear_reap_list(mru);
@@ -298,7 +299,7 @@ _xfs_mru_cache_reap(
queue_delayed_work(xfs_mru_reap_wq, &mru->work, next);
}
- spin_unlock(&mru->lock);
+ xa_unlock(&mru->store);
}
int
@@ -358,13 +359,8 @@ xfs_mru_cache_create(
for (grp = 0; grp < mru->grp_count; grp++)
INIT_LIST_HEAD(mru->lists + grp);
- /*
- * We use GFP_KERNEL radix tree preload and do inserts under a
- * spinlock so GFP_ATOMIC is appropriate for the radix tree itself.
- */
- INIT_RADIX_TREE(&mru->store, GFP_ATOMIC);
+ xa_init(&mru->store);
INIT_LIST_HEAD(&mru->reap_list);
- spin_lock_init(&mru->lock);
INIT_DELAYED_WORK(&mru->work, _xfs_mru_cache_reap);
mru->grp_time = grp_time;
@@ -394,17 +390,17 @@ xfs_mru_cache_flush(
if (!mru || !mru->lists)
return;
- spin_lock(&mru->lock);
+ xa_lock(&mru->store);
if (mru->queued) {
- spin_unlock(&mru->lock);
+ xa_unlock(&mru->store);
cancel_delayed_work_sync(&mru->work);
- spin_lock(&mru->lock);
+ xa_lock(&mru->store);
}
_xfs_mru_cache_migrate(mru, jiffies + mru->grp_count * mru->grp_time);
_xfs_mru_cache_clear_reap_list(mru);
- spin_unlock(&mru->lock);
+ xa_unlock(&mru->store);
}
void
@@ -431,24 +427,25 @@ xfs_mru_cache_insert(
unsigned long key,
struct xfs_mru_cache_elem *elem)
{
+ XA_STATE(xas, key);
int error;
ASSERT(mru && mru->lists);
if (!mru || !mru->lists)
return -EINVAL;
- if (radix_tree_preload(GFP_NOFS))
- return -ENOMEM;
-
INIT_LIST_HEAD(&elem->list_node);
elem->key = key;
- spin_lock(&mru->lock);
- error = radix_tree_insert(&mru->store, key, elem);
- radix_tree_preload_end();
+retry:
+ xa_lock(&mru->store);
+ xas_store(&mru->store, &xas, elem);
+ error = xas_error(&xas);
if (!error)
_xfs_mru_cache_list_insert(mru, elem);
- spin_unlock(&mru->lock);
+ xa_unlock(&mru->store);
+ if (xas_nomem(&xas, GFP_NOFS))
+ goto retry;
return error;
}
@@ -464,17 +461,18 @@ xfs_mru_cache_remove(
struct xfs_mru_cache *mru,
unsigned long key)
{
+ XA_STATE(xas, key);
struct xfs_mru_cache_elem *elem;
ASSERT(mru && mru->lists);
if (!mru || !mru->lists)
return NULL;
- spin_lock(&mru->lock);
- elem = radix_tree_delete(&mru->store, key);
+ xa_lock(&mru->store);
+ elem = xas_store(&mru->store, &xas, NULL);
if (elem)
list_del(&elem->list_node);
- spin_unlock(&mru->lock);
+ xa_unlock(&mru->store);
return elem;
}
@@ -520,20 +518,21 @@ xfs_mru_cache_lookup(
struct xfs_mru_cache *mru,
unsigned long key)
{
+ XA_STATE(xas, key);
struct xfs_mru_cache_elem *elem;
ASSERT(mru && mru->lists);
if (!mru || !mru->lists)
return NULL;
- spin_lock(&mru->lock);
- elem = radix_tree_lookup(&mru->store, key);
+ xa_lock(&mru->store);
+ elem = xas_load(&mru->store, &xas);
if (elem) {
list_del(&elem->list_node);
_xfs_mru_cache_list_insert(mru, elem);
- __release(mru_lock); /* help sparse not be stupid */
+ __release(&mru->store); /* help sparse not be stupid */
} else
- spin_unlock(&mru->lock);
+ xa_unlock(&mru->store);
return elem;
}
@@ -546,7 +545,7 @@ xfs_mru_cache_lookup(
void
xfs_mru_cache_done(
struct xfs_mru_cache *mru)
- __releases(mru->lock)
+ __releases(mru->store)
{
- spin_unlock(&mru->lock);
+ xa_unlock(&mru->store);
}
--
2.15.0
From: Matthew Wilcox <[email protected]>
Use the XArray APIs to add and replace pages in the page cache. This
removes two uses of the radix tree preload API and makes the code
significantly shorter.
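Because storing over an already-populated slot never needs to allocate
memory, replace_page_cache_page() collapses to a single store under the
lock; condensed from the hunk below:
	XA_STATE(xas, old->index);
	xa_lock_irqsave(&mapping->pages, flags);
	xas_store(&mapping->pages, &xas, new);	/* slot exists, cannot fail */
	xa_unlock_irqrestore(&mapping->pages, flags);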
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/xarray.h | 13 +++++
mm/filemap.c | 131 +++++++++++++++++++++----------------------------
2 files changed, 69 insertions(+), 75 deletions(-)
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index 8d7cd70beb8e..1648eda4a20d 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -542,6 +542,19 @@ static inline void xas_set_order(struct xa_state *xas, unsigned long index,
xas->xa_node = XAS_RESTART;
}
+/**
+ * xas_set_update() - Set up XArray operation state for a callback.
+ * @xas: XArray operation state.
+ * @update: Function to call when updating a node.
+ *
+ * The XArray can notify a caller after it has updated an xa_node.
+ * This is advanced functionality and is only needed by the page cache.
+ */
+static inline void xas_set_update(struct xa_state *xas, xa_update_node_t update)
+{
+ xas->xa_update = update;
+}
+
/* Skip over any of these entries when iterating */
static inline bool xa_iter_skip(void *entry)
{
diff --git a/mm/filemap.c b/mm/filemap.c
index accc350f9544..1d520748789b 100644
--- a/mm/filemap.c
+++ b/mm/filemap.c
@@ -112,34 +112,6 @@
* ->tasklist_lock (memory_failure, collect_procs_ao)
*/
-static int page_cache_tree_insert(struct address_space *mapping,
- struct page *page, void **shadowp)
-{
- struct radix_tree_node *node;
- void **slot;
- int error;
-
- error = __radix_tree_create(&mapping->pages, page->index, 0,
- &node, &slot);
- if (error)
- return error;
- if (*slot) {
- void *p;
-
- p = radix_tree_deref_slot_protected(slot, &mapping->pages.xa_lock);
- if (!xa_is_value(p))
- return -EEXIST;
-
- mapping->nrexceptional--;
- if (shadowp)
- *shadowp = p;
- }
- __radix_tree_replace(&mapping->pages, node, slot, page,
- workingset_lookup_update(mapping));
- mapping->nrpages++;
- return 0;
-}
-
static void page_cache_tree_delete(struct address_space *mapping,
struct page *page, void *shadow)
{
@@ -775,51 +747,44 @@ EXPORT_SYMBOL(file_write_and_wait_range);
* locked. This function does not add the new page to the LRU, the
* caller must do that.
*
- * The remove + add is atomic. The only way this function can fail is
- * memory allocation failure.
+ * The remove + add is atomic. This function cannot fail.
*/
int replace_page_cache_page(struct page *old, struct page *new, gfp_t gfp_mask)
{
- int error;
+ XA_STATE(xas, old->index);
+ struct address_space *mapping = old->mapping;
+ void (*freepage)(struct page *) = mapping->a_ops->freepage;
+ pgoff_t offset = old->index;
+ unsigned long flags;
VM_BUG_ON_PAGE(!PageLocked(old), old);
VM_BUG_ON_PAGE(!PageLocked(new), new);
VM_BUG_ON_PAGE(new->mapping, new);
- error = radix_tree_preload(gfp_mask & ~__GFP_HIGHMEM);
- if (!error) {
- struct address_space *mapping = old->mapping;
- void (*freepage)(struct page *);
- unsigned long flags;
-
- pgoff_t offset = old->index;
- freepage = mapping->a_ops->freepage;
-
- get_page(new);
- new->mapping = mapping;
- new->index = offset;
+ get_page(new);
+ new->mapping = mapping;
+ new->index = offset;
- xa_lock_irqsave(&mapping->pages, flags);
- __delete_from_page_cache(old, NULL);
- error = page_cache_tree_insert(mapping, new, NULL);
- BUG_ON(error);
+ xa_lock_irqsave(&mapping->pages, flags);
+ xas_store(&mapping->pages, &xas, new);
- /*
- * hugetlb pages do not participate in page cache accounting.
- */
- if (!PageHuge(new))
- __inc_node_page_state(new, NR_FILE_PAGES);
- if (PageSwapBacked(new))
- __inc_node_page_state(new, NR_SHMEM);
- xa_unlock_irqrestore(&mapping->pages, flags);
- mem_cgroup_migrate(old, new);
- radix_tree_preload_end();
- if (freepage)
- freepage(old);
- put_page(old);
- }
+ old->mapping = NULL;
+ /* hugetlb pages do not participate in page cache accounting. */
+ if (!PageHuge(old))
+ __dec_node_page_state(new, NR_FILE_PAGES);
+ if (!PageHuge(new))
+ __inc_node_page_state(new, NR_FILE_PAGES);
+ if (PageSwapBacked(old))
+ __dec_node_page_state(new, NR_SHMEM);
+ if (PageSwapBacked(new))
+ __inc_node_page_state(new, NR_SHMEM);
+ xa_unlock_irqrestore(&mapping->pages, flags);
+ mem_cgroup_migrate(old, new);
+ if (freepage)
+ freepage(old);
+ put_page(old);
- return error;
+ return 0;
}
EXPORT_SYMBOL_GPL(replace_page_cache_page);
@@ -828,12 +793,15 @@ static int __add_to_page_cache_locked(struct page *page,
pgoff_t offset, gfp_t gfp_mask,
void **shadowp)
{
+ XA_STATE(xas, offset);
int huge = PageHuge(page);
struct mem_cgroup *memcg;
int error;
+ void *old;
VM_BUG_ON_PAGE(!PageLocked(page), page);
VM_BUG_ON_PAGE(PageSwapBacked(page), page);
+ xas_set_update(&xas, workingset_lookup_update(mapping));
if (!huge) {
error = mem_cgroup_try_charge(page, current->mm,
@@ -842,35 +810,48 @@ static int __add_to_page_cache_locked(struct page *page,
return error;
}
- error = radix_tree_maybe_preload(gfp_mask & ~__GFP_HIGHMEM);
- if (error) {
- if (!huge)
- mem_cgroup_cancel_charge(page, memcg, false);
- return error;
- }
-
get_page(page);
page->mapping = mapping;
page->index = offset;
+retry:
xa_lock_irq(&mapping->pages);
- error = page_cache_tree_insert(mapping, page, shadowp);
- radix_tree_preload_end();
- if (unlikely(error))
- goto err_insert;
+ old = xas_create(&mapping->pages, &xas);
+ error = xas_error(&xas);
+ if (error) {
+ xa_unlock_irq(&mapping->pages);
+ if (xas_nomem(&xas, gfp_mask & ~__GFP_HIGHMEM))
+ goto retry;
+ goto error;
+ }
+
+ if (xa_is_value(old)) {
+ mapping->nrexceptional--;
+ if (shadowp)
+ *shadowp = old;
+ } else if (old) {
+ goto exist;
+ }
+
+ xas_store(&mapping->pages, &xas, page);
+ mapping->nrpages++;
/* hugetlb pages do not participate in page cache accounting. */
if (!huge)
__inc_node_page_state(page, NR_FILE_PAGES);
xa_unlock_irq(&mapping->pages);
+ xas_destroy(&xas);
if (!huge)
mem_cgroup_commit_charge(page, memcg, false, false);
trace_mm_filemap_add_to_page_cache(page);
return 0;
-err_insert:
+exist:
+ xa_unlock_irq(&mapping->pages);
+ error = -EEXIST;
+error:
+ xas_destroy(&xas);
page->mapping = NULL;
/* Leave page->index set: truncation relies upon it */
- xa_unlock_irq(&mapping->pages);
if (!huge)
mem_cgroup_cancel_charge(page, memcg, false);
put_page(page);
--
2.15.0
From: Matthew Wilcox <[email protected]>
This is a 1:1 conversion.
Signed-off-by: Matthew Wilcox <[email protected]>
---
mm/shmem.c | 21 ++++++++++-----------
1 file changed, 10 insertions(+), 11 deletions(-)
diff --git a/mm/shmem.c b/mm/shmem.c
index 98944cff6319..b6f7fc283aad 100644
--- a/mm/shmem.c
+++ b/mm/shmem.c
@@ -1076,28 +1076,27 @@ static void shmem_evict_inode(struct inode *inode)
clear_inode(inode);
}
-static unsigned long find_swap_entry(struct radix_tree_root *root, void *item)
+static unsigned long find_swap_entry(struct xarray *xa, void *item)
{
- struct radix_tree_iter iter;
- void **slot;
- unsigned long found = -1;
+ XA_STATE(xas, 0);
unsigned int checked = 0;
+ void *entry;
rcu_read_lock();
- radix_tree_for_each_slot(slot, root, &iter, 0) {
- if (*slot == item) {
- found = iter.index;
+ xas_for_each(xa, &xas, entry, ULONG_MAX) {
+ if (xas_retry(&xas, entry))
+ continue;
+ if (entry == item)
break;
- }
checked++;
if ((checked % 4096) != 0)
continue;
- slot = radix_tree_iter_resume(slot, &iter);
+ xas_pause(&xas);
cond_resched_rcu();
}
-
rcu_read_unlock();
- return found;
+
+ return xas_invalid(&xas) ? -1 : xas.xa_index;
}
/*
--
2.15.0
From: Matthew Wilcox <[email protected]>
This includes moving mapping_tagged() to fs.h as a static inline and
changing it to return bool.
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/fs.h | 17 +++++++++------
mm/page-writeback.c | 63 +++++++++++++++++++----------------------------------
2 files changed, 33 insertions(+), 47 deletions(-)
diff --git a/include/linux/fs.h b/include/linux/fs.h
index a5c105d292a7..14b7406e03e0 100644
--- a/include/linux/fs.h
+++ b/include/linux/fs.h
@@ -470,15 +470,18 @@ struct block_device {
struct mutex bd_fsfreeze_mutex;
} __randomize_layout;
+/* XArray tags, for tagging dirty and writeback pages in the pagecache. */
+#define PAGECACHE_TAG_DIRTY XA_TAG_0
+#define PAGECACHE_TAG_WRITEBACK XA_TAG_1
+#define PAGECACHE_TAG_TOWRITE XA_TAG_2
+
/*
- * Radix-tree tags, for tagging dirty and writeback pages within the pagecache
- * radix trees
+ * Returns true if any of the pages in the mapping are marked with the tag.
*/
-#define PAGECACHE_TAG_DIRTY 0
-#define PAGECACHE_TAG_WRITEBACK 1
-#define PAGECACHE_TAG_TOWRITE 2
-
-int mapping_tagged(struct address_space *mapping, int tag);
+static inline bool mapping_tagged(struct address_space *mapping, xa_tag_t tag)
+{
+ return xa_tagged(&mapping->pages, tag);
+}
static inline void i_mmap_lock_write(struct address_space *mapping)
{
diff --git a/mm/page-writeback.c b/mm/page-writeback.c
index 6a5b92629727..28175cba7e72 100644
--- a/mm/page-writeback.c
+++ b/mm/page-writeback.c
@@ -2100,33 +2100,26 @@ void __init page_writeback_init(void)
* dirty pages in the file (thus it is important for this function to be quick
* so that it can tag pages faster than a dirtying process can create them).
*/
-/*
- * We tag pages in batches of WRITEBACK_TAG_BATCH to reduce xa_lock latency.
- */
void tag_pages_for_writeback(struct address_space *mapping,
pgoff_t start, pgoff_t end)
{
-#define WRITEBACK_TAG_BATCH 4096
- unsigned long tagged = 0;
- struct radix_tree_iter iter;
- void **slot;
+ XA_STATE(xas, start);
+ struct xarray *xa = &mapping->pages;
+ unsigned int tagged = 0;
+ void *page;
- xa_lock_irq(&mapping->pages);
- radix_tree_for_each_tagged(slot, &mapping->pages, &iter, start,
- PAGECACHE_TAG_DIRTY) {
- if (iter.index > end)
- break;
- radix_tree_iter_tag_set(&mapping->pages, &iter,
- PAGECACHE_TAG_TOWRITE);
- tagged++;
- if ((tagged % WRITEBACK_TAG_BATCH) != 0)
+ xa_lock_irq(xa);
+ xas_for_each_tag(xa, &xas, page, end, PAGECACHE_TAG_DIRTY) {
+ xas_set_tag(xa, &xas, PAGECACHE_TAG_TOWRITE);
+ if (++tagged % XA_CHECK_SCHED)
continue;
- slot = radix_tree_iter_resume(slot, &iter);
- xa_unlock_irq(&mapping->pages);
+
+ xas_pause(&xas);
+ xa_unlock_irq(xa);
cond_resched();
- xa_lock_irq(&mapping->pages);
+ xa_lock_irq(xa);
}
- xa_unlock_irq(&mapping->pages);
+ xa_unlock_irq(xa);
}
EXPORT_SYMBOL(tag_pages_for_writeback);
@@ -2166,7 +2159,7 @@ int write_cache_pages(struct address_space *mapping,
pgoff_t done_index;
int cycled;
int range_whole = 0;
- int tag;
+ xa_tag_t tag;
pagevec_init(&pvec);
if (wbc->range_cyclic) {
@@ -2447,7 +2440,7 @@ void account_page_cleaned(struct page *page, struct address_space *mapping,
/*
* For address_spaces which do not use buffers. Just tag the page as dirty in
- * its radix tree.
+ * the xarray.
*
* This is also used when a single buffer is being dirtied: we want to set the
* page dirty in that case, but not all the buffers. This is a "bottom-up"
@@ -2473,7 +2466,7 @@ int __set_page_dirty_nobuffers(struct page *page)
BUG_ON(page_mapping(page) != mapping);
WARN_ON_ONCE(!PagePrivate(page) && !PageUptodate(page));
account_page_dirtied(page, mapping);
- radix_tree_tag_set(&mapping->pages, page_index(page),
+ __xa_set_tag(&mapping->pages, page_index(page),
PAGECACHE_TAG_DIRTY);
xa_unlock_irqrestore(&mapping->pages, flags);
unlock_page_memcg(page);
@@ -2636,13 +2629,13 @@ EXPORT_SYMBOL(__cancel_dirty_page);
* Returns true if the page was previously dirty.
*
* This is for preparing to put the page under writeout. We leave the page
- * tagged as dirty in the radix tree so that a concurrent write-for-sync
+ * tagged as dirty in the xarray so that a concurrent write-for-sync
* can discover it via a PAGECACHE_TAG_DIRTY walk. The ->writepage
* implementation will run either set_page_writeback() or set_page_dirty(),
- * at which stage we bring the page's dirty flag and radix-tree dirty tag
+ * at which stage we bring the page's dirty flag and xarray dirty tag
* back into sync.
*
- * This incoherency between the page's dirty flag and radix-tree tag is
+ * This incoherency between the page's dirty flag and xarray tag is
* unfortunate, but it only exists while the page is locked.
*/
int clear_page_dirty_for_io(struct page *page)
@@ -2723,7 +2716,7 @@ int test_clear_page_writeback(struct page *page)
xa_lock_irqsave(&mapping->pages, flags);
ret = TestClearPageWriteback(page);
if (ret) {
- radix_tree_tag_clear(&mapping->pages, page_index(page),
+ __xa_clear_tag(&mapping->pages, page_index(page),
PAGECACHE_TAG_WRITEBACK);
if (bdi_cap_account_writeback(bdi)) {
struct bdi_writeback *wb = inode_to_wb(inode);
@@ -2775,7 +2768,7 @@ int __test_set_page_writeback(struct page *page, bool keep_write)
on_wblist = mapping_tagged(mapping,
PAGECACHE_TAG_WRITEBACK);
- radix_tree_tag_set(&mapping->pages, page_index(page),
+ __xa_set_tag(&mapping->pages, page_index(page),
PAGECACHE_TAG_WRITEBACK);
if (bdi_cap_account_writeback(bdi))
inc_wb_stat(inode_to_wb(inode), WB_WRITEBACK);
@@ -2789,10 +2782,10 @@ int __test_set_page_writeback(struct page *page, bool keep_write)
sb_mark_inode_writeback(mapping->host);
}
if (!PageDirty(page))
- radix_tree_tag_clear(&mapping->pages, page_index(page),
+ __xa_clear_tag(&mapping->pages, page_index(page),
PAGECACHE_TAG_DIRTY);
if (!keep_write)
- radix_tree_tag_clear(&mapping->pages, page_index(page),
+ __xa_clear_tag(&mapping->pages, page_index(page),
PAGECACHE_TAG_TOWRITE);
xa_unlock_irqrestore(&mapping->pages, flags);
} else {
@@ -2808,16 +2801,6 @@ int __test_set_page_writeback(struct page *page, bool keep_write)
}
EXPORT_SYMBOL(__test_set_page_writeback);
-/*
- * Return true if any of the pages in the mapping are marked with the
- * passed tag.
- */
-int mapping_tagged(struct address_space *mapping, int tag)
-{
- return radix_tree_tagged(&mapping->pages, tag);
-}
-EXPORT_SYMBOL(mapping_tagged);
-
/**
* wait_for_stable_page() - wait for writeback to finish, if necessary.
* @page: The page to wait on.
--
2.15.0
From: Matthew Wilcox <[email protected]>
Don't open-code accesses to data structure internals.
Signed-off-by: Matthew Wilcox <[email protected]>
---
fs/fscache/cookie.c | 2 +-
fs/fscache/object.c | 2 +-
2 files changed, 2 insertions(+), 2 deletions(-)
diff --git a/fs/fscache/cookie.c b/fs/fscache/cookie.c
index ff84258132bb..e9054e0c1a49 100644
--- a/fs/fscache/cookie.c
+++ b/fs/fscache/cookie.c
@@ -608,7 +608,7 @@ void __fscache_relinquish_cookie(struct fscache_cookie *cookie, bool retire)
/* Clear pointers back to the netfs */
cookie->netfs_data = NULL;
cookie->def = NULL;
- BUG_ON(cookie->stores.rnode);
+ BUG_ON(!radix_tree_empty(&cookie->stores));
if (cookie->parent) {
ASSERTCMP(atomic_read(&cookie->parent->usage), >, 0);
diff --git a/fs/fscache/object.c b/fs/fscache/object.c
index 7a182c87f378..aa0e71f02c33 100644
--- a/fs/fscache/object.c
+++ b/fs/fscache/object.c
@@ -956,7 +956,7 @@ static const struct fscache_state *_fscache_invalidate_object(struct fscache_obj
* retire the object instead.
*/
if (!fscache_use_cookie(object)) {
- ASSERT(object->cookie->stores.rnode == NULL);
+ ASSERT(radix_tree_empty(&object->cookie->stores));
set_bit(FSCACHE_OBJECT_RETIRED, &object->flags);
_leave(" [no cookie]");
return transit_to(KILL_OBJECT);
--
2.15.0
From: Matthew Wilcox <[email protected]>
core-cdev uses a spinlock to protect several elements, including the IDR.
Rather than reusing the IDR's internal spinlock for all of this,
preallocate the necessary memory by allocating an ID with a NULL pointer,
then replace the NULL with the resource pointer once we hold the client's
spinlock.
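Condensed, the new sequence in add_client_resource() is (the in_shutdown
handling is omitted here):
	ret = idr_alloc(&client->resource_idr, NULL, 0, 0, gfp_mask);
	if (ret < 0)
		return ret;
	spin_lock_irqsave(&client->lock, flags);
	idr_replace(&client->resource_idr, resource, ret);
	resource->handle = ret;
	spin_unlock_irqrestore(&client->lock, flags);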
Signed-off-by: Matthew Wilcox <[email protected]>
---
drivers/firewire/core-cdev.c | 20 ++++++++------------
1 file changed, 8 insertions(+), 12 deletions(-)
diff --git a/drivers/firewire/core-cdev.c b/drivers/firewire/core-cdev.c
index a301fcf46e88..52c22c39744e 100644
--- a/drivers/firewire/core-cdev.c
+++ b/drivers/firewire/core-cdev.c
@@ -486,28 +486,24 @@ static int ioctl_get_info(struct client *client, union ioctl_arg *arg)
static int add_client_resource(struct client *client,
struct client_resource *resource, gfp_t gfp_mask)
{
- bool preload = gfpflags_allow_blocking(gfp_mask);
unsigned long flags;
int ret;
- if (preload)
- idr_preload(gfp_mask);
- spin_lock_irqsave(&client->lock, flags);
+ ret = idr_alloc(&client->resource_idr, NULL, 0, 0, gfp_mask);
+ if (ret < 0)
+ return ret;
- if (client->in_shutdown)
+ spin_lock_irqsave(&client->lock, flags);
+ if (client->in_shutdown) {
+ idr_remove(&client->resource_idr, ret);
ret = -ECANCELED;
- else
- ret = idr_alloc(&client->resource_idr, resource, 0, 0,
- GFP_NOWAIT);
- if (ret >= 0) {
+ } else {
+ idr_replace(&client->resource_idr, resource, ret);
resource->handle = ret;
client_get(client);
schedule_if_iso_resource(resource);
}
-
spin_unlock_irqrestore(&client->lock, flags);
- if (preload)
- idr_preload_end();
return ret < 0 ? ret : 0;
}
--
2.15.0
From: Matthew Wilcox <[email protected]>
The IDR now handles its own locking, so we can remove rxrpc's private
connection-ID lock and, with it, the memory preloading.
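Note that the cursor is now passed explicitly to the idr_alloc_cyclic()
signature introduced earlier in this series, rather than living inside
the IDR; the allocation becomes simply:
	id = idr_alloc_cyclic(&rxrpc_client_conn_ids,
			      &rxrpc_client_conn_cursor,
			      conn, 1, 0x40000000, gfp);
	if (id < 0)
		goto error;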
Signed-off-by: Matthew Wilcox <[email protected]>
---
net/rxrpc/conn_client.c | 9 +--------
1 file changed, 1 insertion(+), 8 deletions(-)
diff --git a/net/rxrpc/conn_client.c b/net/rxrpc/conn_client.c
index 7e8bf10fec86..d61fbd359bfa 100644
--- a/net/rxrpc/conn_client.c
+++ b/net/rxrpc/conn_client.c
@@ -91,7 +91,6 @@ __read_mostly unsigned int rxrpc_conn_idle_client_fast_expiry = 2 * HZ;
/*
* We use machine-unique IDs for our client connections.
*/
-static DEFINE_SPINLOCK(rxrpc_conn_id_lock);
int rxrpc_client_conn_cursor;
DEFINE_IDR(rxrpc_client_conn_ids);
@@ -111,12 +110,8 @@ static int rxrpc_get_client_connection_id(struct rxrpc_connection *conn,
_enter("");
- idr_preload(gfp);
- spin_lock(&rxrpc_conn_id_lock);
id = idr_alloc_cyclic(&rxrpc_client_conn_ids, &rxrpc_client_conn_cursor,
- conn, 1, 0x40000000, GFP_NOWAIT);
- spin_unlock(&rxrpc_conn_id_lock);
- idr_preload_end();
+ conn, 1, 0x40000000, gfp);
if (id < 0)
goto error;
@@ -137,10 +132,8 @@ static int rxrpc_get_client_connection_id(struct rxrpc_connection *conn,
static void rxrpc_put_client_connection_id(struct rxrpc_connection *conn)
{
if (test_bit(RXRPC_CONN_HAS_IDR, &conn->flags)) {
- spin_lock(&rxrpc_conn_id_lock);
idr_remove(&rxrpc_client_conn_ids,
conn->proto.cid >> RXRPC_CIDSHIFT);
- spin_unlock(&rxrpc_conn_id_lock);
}
}
--
2.15.0
From: Matthew Wilcox <[email protected]>
This is a direct replacement for struct radix_tree_node. Use a #define
so that radix tree users continue to work without change.
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/radix-tree.h | 29 +++--------------------------
include/linux/xarray.h | 24 ++++++++++++++++++++++++
2 files changed, 27 insertions(+), 26 deletions(-)
diff --git a/include/linux/radix-tree.h b/include/linux/radix-tree.h
index 015bc1bdc3d2..1da1fb01e993 100644
--- a/include/linux/radix-tree.h
+++ b/include/linux/radix-tree.h
@@ -32,6 +32,7 @@
/* Keep unconverted code working */
#define radix_tree_root xarray
+#define radix_tree_node xa_node
/*
* The bottom two bits of the slot determine how the remaining bits in the
@@ -60,41 +61,17 @@ static inline bool radix_tree_is_internal_node(void *ptr)
/*** radix-tree API starts here ***/
-#define RADIX_TREE_MAX_TAGS 3
-
#define RADIX_TREE_MAP_SHIFT XA_CHUNK_SHIFT
#define RADIX_TREE_MAP_SIZE (1UL << RADIX_TREE_MAP_SHIFT)
#define RADIX_TREE_MAP_MASK (RADIX_TREE_MAP_SIZE-1)
-#define RADIX_TREE_TAG_LONGS \
- ((RADIX_TREE_MAP_SIZE + BITS_PER_LONG - 1) / BITS_PER_LONG)
+#define RADIX_TREE_MAX_TAGS XA_MAX_TAGS
+#define RADIX_TREE_TAG_LONGS XA_TAG_LONGS
#define RADIX_TREE_INDEX_BITS (8 /* CHAR_BIT */ * sizeof(unsigned long))
#define RADIX_TREE_MAX_PATH (DIV_ROUND_UP(RADIX_TREE_INDEX_BITS, \
RADIX_TREE_MAP_SHIFT))
-/*
- * @count is the count of every non-NULL element in the ->slots array
- * whether that is a data entry, a retry entry, a user pointer,
- * a sibling entry or a pointer to the next level of the tree.
- * @exceptional is the count of every element in ->slots which is
- * either a data entry or a sibling entry for data.
- */
-struct radix_tree_node {
- unsigned char shift; /* Bits remaining in each slot */
- unsigned char offset; /* Slot offset in parent */
- unsigned char count; /* Total entry count */
- unsigned char exceptional; /* Exceptional entry count */
- struct radix_tree_node *parent; /* Used when ascending tree */
- struct radix_tree_root *root; /* The tree we belong to */
- union {
- struct list_head private_list; /* For tree user */
- struct rcu_head rcu_head; /* Used when freeing node */
- };
- void __rcu *slots[RADIX_TREE_MAP_SIZE];
- unsigned long tags[RADIX_TREE_MAX_TAGS][RADIX_TREE_TAG_LONGS];
-};
-
/* The top bits of xa_flags are used to store the root tags and the IDR flag */
#define ROOT_IS_IDR ((__force gfp_t)(1 << __GFP_BITS_SHIFT))
#define ROOT_TAG_SHIFT (__GFP_BITS_SHIFT + 1)
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index 03d430ec3bce..1513a9e85580 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -143,6 +143,30 @@ static inline bool xa_is_value(void *entry)
#endif
#define XA_CHUNK_SIZE (1UL << XA_CHUNK_SHIFT)
#define XA_CHUNK_MASK (XA_CHUNK_SIZE - 1)
+#define XA_MAX_TAGS 3
+#define XA_TAG_LONGS DIV_ROUND_UP(XA_CHUNK_SIZE, BITS_PER_LONG)
+
+/*
+ * @count is the count of every non-NULL element in the ->slots array
+ * whether that is a data entry, a retry entry, a user pointer,
+ * a sibling entry or a pointer to the next level of the tree.
+ * @exceptional is the count of every element in ->slots which is
+ * either a data entry or a sibling entry for data.
+ */
+struct xa_node {
+ unsigned char shift; /* Bits remaining in each slot */
+ unsigned char offset; /* Slot offset in parent */
+ unsigned char count; /* Total entry count */
+ unsigned char exceptional; /* Exceptional entry count */
+ struct xa_node *parent; /* Used when ascending tree */
+ struct xarray *root; /* The tree we belong to */
+ union {
+ struct list_head private_list; /* For tree user */
+ struct rcu_head rcu_head; /* Used when freeing node */
+ };
+ void __rcu *slots[XA_CHUNK_SIZE];
+ unsigned long tags[XA_MAX_TAGS][XA_TAG_LONGS];
+};
/*
* Internal entries have the bottom two bits set to the value 10b. Most
--
2.15.0
From: Matthew Wilcox <[email protected]>
Rename pag_ici_root to pag_ici_xa and use XArray APIs instead of radix
tree APIs. This gives shorter code, typechecking on tag numbers and
better error checking in xfs_reclaim_inode(), and eliminates a call to
radix_tree_preload().
Signed-off-by: Matthew Wilcox <[email protected]>
---
fs/xfs/libxfs/xfs_sb.c | 2 +-
fs/xfs/libxfs/xfs_sb.h | 2 +-
fs/xfs/xfs_icache.c | 107 ++++++++++++++++++++-----------------------------
fs/xfs/xfs_icache.h | 4 +-
fs/xfs/xfs_inode.c | 24 ++++-------
fs/xfs/xfs_mount.c | 3 +-
fs/xfs/xfs_mount.h | 3 +-
7 files changed, 57 insertions(+), 88 deletions(-)
diff --git a/fs/xfs/libxfs/xfs_sb.c b/fs/xfs/libxfs/xfs_sb.c
index 811fa57007c9..9345fad57db8 100644
--- a/fs/xfs/libxfs/xfs_sb.c
+++ b/fs/xfs/libxfs/xfs_sb.c
@@ -76,7 +76,7 @@ struct xfs_perag *
xfs_perag_get_tag(
struct xfs_mount *mp,
xfs_agnumber_t first,
- int tag)
+ xa_tag_t tag)
{
XA_STATE(xas, first);
struct xfs_perag *pag;
diff --git a/fs/xfs/libxfs/xfs_sb.h b/fs/xfs/libxfs/xfs_sb.h
index 961e6475a309..d2de90b8f39c 100644
--- a/fs/xfs/libxfs/xfs_sb.h
+++ b/fs/xfs/libxfs/xfs_sb.h
@@ -23,7 +23,7 @@
*/
extern struct xfs_perag *xfs_perag_get(struct xfs_mount *, xfs_agnumber_t);
extern struct xfs_perag *xfs_perag_get_tag(struct xfs_mount *, xfs_agnumber_t,
- int tag);
+ xa_tag_t tag);
extern void xfs_perag_put(struct xfs_perag *pag);
extern int xfs_initialize_perag_data(struct xfs_mount *, xfs_agnumber_t);
diff --git a/fs/xfs/xfs_icache.c b/fs/xfs/xfs_icache.c
index f56e500d89e2..26f261ef708a 100644
--- a/fs/xfs/xfs_icache.c
+++ b/fs/xfs/xfs_icache.c
@@ -186,7 +186,7 @@ xfs_perag_set_reclaim_tag(
{
struct xfs_mount *mp = pag->pag_mount;
- lockdep_assert_held(&pag->pag_ici_lock);
+ lockdep_assert_held(&pag->pag_ici_xa.xa_lock);
if (pag->pag_ici_reclaimable++)
return;
@@ -205,7 +205,7 @@ xfs_perag_clear_reclaim_tag(
{
struct xfs_mount *mp = pag->pag_mount;
- lockdep_assert_held(&pag->pag_ici_lock);
+ lockdep_assert_held(&pag->pag_ici_xa.xa_lock);
if (--pag->pag_ici_reclaimable)
return;
@@ -228,16 +228,16 @@ xfs_inode_set_reclaim_tag(
struct xfs_perag *pag;
pag = xfs_perag_get(mp, XFS_INO_TO_AGNO(mp, ip->i_ino));
- spin_lock(&pag->pag_ici_lock);
+ xa_lock(&pag->pag_ici_xa);
spin_lock(&ip->i_flags_lock);
- radix_tree_tag_set(&pag->pag_ici_root, XFS_INO_TO_AGINO(mp, ip->i_ino),
+ __xa_set_tag(&pag->pag_ici_xa, XFS_INO_TO_AGINO(mp, ip->i_ino),
XFS_ICI_RECLAIM_TAG);
xfs_perag_set_reclaim_tag(pag);
__xfs_iflags_set(ip, XFS_IRECLAIMABLE);
spin_unlock(&ip->i_flags_lock);
- spin_unlock(&pag->pag_ici_lock);
+ xa_unlock(&pag->pag_ici_xa);
xfs_perag_put(pag);
}
@@ -246,7 +246,7 @@ xfs_inode_clear_reclaim_tag(
struct xfs_perag *pag,
xfs_ino_t ino)
{
- radix_tree_tag_clear(&pag->pag_ici_root,
+ __xa_clear_tag(&pag->pag_ici_xa,
XFS_INO_TO_AGINO(pag->pag_mount, ino),
XFS_ICI_RECLAIM_TAG);
xfs_perag_clear_reclaim_tag(pag);
@@ -367,8 +367,8 @@ xfs_iget_cache_hit(
/*
* We need to set XFS_IRECLAIM to prevent xfs_reclaim_inode
* from stomping over us while we recycle the inode. We can't
- * clear the radix tree reclaimable tag yet as it requires
- * pag_ici_lock to be held exclusive.
+ * clear the xarray reclaimable tag yet as it requires
+ * pag_ici_xa.xa_lock to be held exclusive.
*/
ip->i_flags |= XFS_IRECLAIM;
@@ -393,7 +393,7 @@ xfs_iget_cache_hit(
goto out_error;
}
- spin_lock(&pag->pag_ici_lock);
+ xa_lock(&pag->pag_ici_xa);
spin_lock(&ip->i_flags_lock);
/*
@@ -410,7 +410,7 @@ xfs_iget_cache_hit(
init_rwsem(&inode->i_rwsem);
spin_unlock(&ip->i_flags_lock);
- spin_unlock(&pag->pag_ici_lock);
+ xa_unlock(&pag->pag_ici_xa);
} else {
/* If the VFS inode is being torn down, pause and try again. */
if (!igrab(inode)) {
@@ -451,7 +451,7 @@ xfs_iget_cache_miss(
int flags,
int lock_flags)
{
- struct xfs_inode *ip;
+ struct xfs_inode *ip, *curr;
int error;
xfs_agino_t agino = XFS_INO_TO_AGINO(mp, ino);
int iflags;
@@ -471,17 +471,6 @@ xfs_iget_cache_miss(
goto out_destroy;
}
- /*
- * Preload the radix tree so we can insert safely under the
- * write spinlock. Note that we cannot sleep inside the preload
- * region. Since we can be called from transaction context, don't
- * recurse into the file system.
- */
- if (radix_tree_preload(GFP_NOFS)) {
- error = -EAGAIN;
- goto out_destroy;
- }
-
/*
* Because the inode hasn't been added to the radix-tree yet it can't
* be found by another thread, so we can do the non-sleeping lock here.
@@ -509,23 +498,18 @@ xfs_iget_cache_miss(
xfs_iflags_set(ip, iflags);
/* insert the new inode */
- spin_lock(&pag->pag_ici_lock);
- error = radix_tree_insert(&pag->pag_ici_root, agino, ip);
- if (unlikely(error)) {
- WARN_ON(error != -EEXIST);
+ curr = xa_cmpxchg(&pag->pag_ici_xa, agino, NULL, ip, GFP_NOFS);
+ if (unlikely(curr)) {
+ WARN_ON(IS_ERR(curr));
XFS_STATS_INC(mp, xs_ig_dup);
error = -EAGAIN;
- goto out_preload_end;
+ goto out_unlock;
}
- spin_unlock(&pag->pag_ici_lock);
- radix_tree_preload_end();
*ipp = ip;
return 0;
-out_preload_end:
- spin_unlock(&pag->pag_ici_lock);
- radix_tree_preload_end();
+out_unlock:
if (lock_flags)
xfs_iunlock(ip, lock_flags);
out_destroy:
@@ -592,7 +576,7 @@ xfs_iget(
again:
error = 0;
rcu_read_lock();
- ip = radix_tree_lookup(&pag->pag_ici_root, agino);
+ ip = xa_load(&pag->pag_ici_xa, agino);
if (ip) {
error = xfs_iget_cache_hit(pag, ip, ino, flags, lock_flags);
@@ -731,7 +715,7 @@ xfs_inode_ag_walk(
void *args),
int flags,
void *args,
- int tag,
+ xa_tag_t tag,
int iter_flags)
{
uint32_t first_index;
@@ -752,14 +736,13 @@ xfs_inode_ag_walk(
rcu_read_lock();
- if (tag == -1)
- nr_found = radix_tree_gang_lookup(&pag->pag_ici_root,
- (void **)batch, first_index,
+ if (tag == XFS_ICI_NO_TAG)
+ nr_found = xa_get_entries(&pag->pag_ici_xa,
+ (void **)batch, first_index, ULONG_MAX,
XFS_LOOKUP_BATCH);
else
- nr_found = radix_tree_gang_lookup_tag(
- &pag->pag_ici_root,
- (void **) batch, first_index,
+ nr_found = xa_get_tagged(&pag->pag_ici_xa,
+ (void **)batch, first_index, ULONG_MAX,
XFS_LOOKUP_BATCH, tag);
if (!nr_found) {
@@ -896,8 +879,8 @@ xfs_inode_ag_iterator_flags(
ag = 0;
while ((pag = xfs_perag_get(mp, ag))) {
ag = pag->pag_agno + 1;
- error = xfs_inode_ag_walk(mp, pag, execute, flags, args, -1,
- iter_flags);
+ error = xfs_inode_ag_walk(mp, pag, execute, flags, args,
+ XFS_ICI_NO_TAG, iter_flags);
xfs_perag_put(pag);
if (error) {
last_error = error;
@@ -926,7 +909,7 @@ xfs_inode_ag_iterator_tag(
void *args),
int flags,
void *args,
- int tag)
+ xa_tag_t tag)
{
struct xfs_perag *pag;
int error = 0;
@@ -1040,8 +1023,9 @@ xfs_reclaim_inode(
int sync_mode)
{
struct xfs_buf *bp = NULL;
- xfs_ino_t ino = ip->i_ino; /* for radix_tree_delete */
+ xfs_ino_t ino = ip->i_ino;
int error;
+ XA_STATE(xas, XFS_INO_TO_AGINO(ip->i_mount, ino));
restart:
error = 0;
@@ -1128,16 +1112,14 @@ xfs_reclaim_inode(
/*
* Remove the inode from the per-AG radix tree.
*
- * Because radix_tree_delete won't complain even if the item was never
- * added to the tree assert that it's been there before to catch
- * problems with the inode life time early on.
+ * Check that it was there before to catch problems with the
+ * inode life time early on.
*/
- spin_lock(&pag->pag_ici_lock);
- if (!radix_tree_delete(&pag->pag_ici_root,
- XFS_INO_TO_AGINO(ip->i_mount, ino)))
+ xa_lock(&pag->pag_ici_xa);
+ if (xas_store(&pag->pag_ici_xa, &xas, NULL) != ip)
ASSERT(0);
xfs_perag_clear_reclaim_tag(pag);
- spin_unlock(&pag->pag_ici_lock);
+ xa_unlock(&pag->pag_ici_xa);
/*
* Here we do an (almost) spurious inode lock in order to coordinate
@@ -1213,9 +1195,8 @@ xfs_reclaim_inodes_ag(
int i;
rcu_read_lock();
- nr_found = radix_tree_gang_lookup_tag(
- &pag->pag_ici_root,
- (void **)batch, first_index,
+ nr_found = xa_get_tagged(&pag->pag_ici_xa,
+ (void **)batch, first_index, ULONG_MAX,
XFS_LOOKUP_BATCH,
XFS_ICI_RECLAIM_TAG);
if (!nr_found) {
@@ -1450,7 +1431,7 @@ __xfs_icache_free_eofblocks(
struct xfs_eofblocks *eofb,
int (*execute)(struct xfs_inode *ip, int flags,
void *args),
- int tag)
+ xa_tag_t tag)
{
int flags = SYNC_TRYLOCK;
@@ -1546,10 +1527,10 @@ __xfs_inode_set_eofblocks_tag(
spin_unlock(&ip->i_flags_lock);
pag = xfs_perag_get(mp, XFS_INO_TO_AGNO(mp, ip->i_ino));
- spin_lock(&pag->pag_ici_lock);
+ xa_lock(&pag->pag_ici_xa);
- tagged = radix_tree_tagged(&pag->pag_ici_root, tag);
- radix_tree_tag_set(&pag->pag_ici_root,
+ tagged = xa_tagged(&pag->pag_ici_xa, tag);
+ __xa_set_tag(&pag->pag_ici_xa,
XFS_INO_TO_AGINO(ip->i_mount, ip->i_ino), tag);
if (!tagged) {
/* propagate the eofblocks tag up into the perag radix tree */
@@ -1563,7 +1544,7 @@ __xfs_inode_set_eofblocks_tag(
set_tp(ip->i_mount, pag->pag_agno, -1, _RET_IP_);
}
- spin_unlock(&pag->pag_ici_lock);
+ xa_unlock(&pag->pag_ici_xa);
xfs_perag_put(pag);
}
@@ -1592,11 +1573,11 @@ __xfs_inode_clear_eofblocks_tag(
spin_unlock(&ip->i_flags_lock);
pag = xfs_perag_get(mp, XFS_INO_TO_AGNO(mp, ip->i_ino));
- spin_lock(&pag->pag_ici_lock);
+ xa_lock(&pag->pag_ici_xa);
- radix_tree_tag_clear(&pag->pag_ici_root,
+ __xa_clear_tag(&pag->pag_ici_xa,
XFS_INO_TO_AGINO(ip->i_mount, ip->i_ino), tag);
- if (!radix_tree_tagged(&pag->pag_ici_root, tag)) {
+ if (!xa_tagged(&pag->pag_ici_xa, tag)) {
/* clear the eofblocks tag from the perag radix tree */
xa_clear_tag(&ip->i_mount->m_perag_xa,
XFS_INO_TO_AGNO(ip->i_mount, ip->i_ino),
@@ -1604,7 +1585,7 @@ __xfs_inode_clear_eofblocks_tag(
clear_tp(ip->i_mount, pag->pag_agno, -1, _RET_IP_);
}
- spin_unlock(&pag->pag_ici_lock);
+ xa_unlock(&pag->pag_ici_xa);
xfs_perag_put(pag);
}
diff --git a/fs/xfs/xfs_icache.h b/fs/xfs/xfs_icache.h
index bd04d5adadfe..436e7f0b1ecc 100644
--- a/fs/xfs/xfs_icache.h
+++ b/fs/xfs/xfs_icache.h
@@ -35,7 +35,7 @@ struct xfs_eofblocks {
/*
* tags for inode radix tree
*/
-#define XFS_ICI_NO_TAG (-1) /* special flag for an untagged lookup
+#define XFS_ICI_NO_TAG XA_NO_TAG /* special flag for an untagged lookup
in xfs_inode_ag_iterator */
#define XFS_ICI_RECLAIM_TAG XA_TAG_0 /* inode is to be reclaimed */
#define XFS_ICI_EOFBLOCKS_TAG XA_TAG_1 /* inode has blocks beyond EOF */
@@ -90,7 +90,7 @@ int xfs_inode_ag_iterator_flags(struct xfs_mount *mp,
int flags, void *args, int iter_flags);
int xfs_inode_ag_iterator_tag(struct xfs_mount *mp,
int (*execute)(struct xfs_inode *ip, int flags, void *args),
- int flags, void *args, int tag);
+ int flags, void *args, xa_tag_t tag);
static inline int
xfs_fs_eofblocks_from_user(
diff --git a/fs/xfs/xfs_inode.c b/fs/xfs/xfs_inode.c
index 61d1cb7dc10d..661f0ac655fa 100644
--- a/fs/xfs/xfs_inode.c
+++ b/fs/xfs/xfs_inode.c
@@ -2309,7 +2309,7 @@ xfs_ifree_cluster(
for (i = 0; i < inodes_per_cluster; i++) {
retry:
rcu_read_lock();
- ip = radix_tree_lookup(&pag->pag_ici_root,
+ ip = xa_load(&pag->pag_ici_xa,
XFS_INO_TO_AGINO(mp, (inum + i)));
/* Inode not in memory, nothing to do */
@@ -3186,7 +3186,7 @@ xfs_iflush_cluster(
{
struct xfs_mount *mp = ip->i_mount;
struct xfs_perag *pag;
- unsigned long first_index, mask;
+ unsigned long first_index, last_index, mask;
unsigned long inodes_per_cluster;
int cilist_size;
struct xfs_inode **cilist;
@@ -3204,12 +3204,12 @@ xfs_iflush_cluster(
if (!cilist)
goto out_put;
- mask = ~(((mp->m_inode_cluster_size >> mp->m_sb.sb_inodelog)) - 1);
- first_index = XFS_INO_TO_AGINO(mp, ip->i_ino) & mask;
+ mask = (((mp->m_inode_cluster_size >> mp->m_sb.sb_inodelog)) - 1);
+ first_index = XFS_INO_TO_AGINO(mp, ip->i_ino) & ~mask;
+ last_index = first_index | mask;
rcu_read_lock();
- /* really need a gang lookup range call here */
- nr_found = radix_tree_gang_lookup(&pag->pag_ici_root, (void**)cilist,
- first_index, inodes_per_cluster);
+ nr_found = xa_get_entries(&pag->pag_ici_xa, (void**)cilist, first_index,
+ last_index, inodes_per_cluster);
if (nr_found == 0)
goto out_free;
@@ -3230,16 +3230,6 @@ xfs_iflush_cluster(
spin_unlock(&cip->i_flags_lock);
continue;
}
-
- /*
- * Once we fall off the end of the cluster, no point checking
- * any more inodes in the list because they will also all be
- * outside the cluster.
- */
- if ((XFS_INO_TO_AGINO(mp, cip->i_ino) & mask) != first_index) {
- spin_unlock(&cip->i_flags_lock);
- break;
- }
spin_unlock(&cip->i_flags_lock);
/*
diff --git a/fs/xfs/xfs_mount.c b/fs/xfs/xfs_mount.c
index ae90f6f8fa5c..d5185657f030 100644
--- a/fs/xfs/xfs_mount.c
+++ b/fs/xfs/xfs_mount.c
@@ -210,9 +210,8 @@ xfs_initialize_perag(
goto out_unwind_new_pags;
pag->pag_agno = index;
pag->pag_mount = mp;
- spin_lock_init(&pag->pag_ici_lock);
mutex_init(&pag->pag_ici_reclaim_lock);
- INIT_RADIX_TREE(&pag->pag_ici_root, GFP_ATOMIC);
+ xa_init(&pag->pag_ici_xa);
if (xfs_buf_hash_init(pag))
goto out_free_pag;
init_waitqueue_head(&pag->pagb_wait);
diff --git a/fs/xfs/xfs_mount.h b/fs/xfs/xfs_mount.h
index 6e5ad7b26f46..ab0f706d2fd7 100644
--- a/fs/xfs/xfs_mount.h
+++ b/fs/xfs/xfs_mount.h
@@ -374,8 +374,7 @@ typedef struct xfs_perag {
atomic_t pagf_fstrms; /* # of filestreams active in this AG */
- spinlock_t pag_ici_lock; /* incore inode cache lock */
- struct radix_tree_root pag_ici_root; /* incore inode cache root */
+ struct xarray pag_ici_xa; /* incore inode cache */
int pag_ici_reclaimable; /* reclaimable inodes */
struct mutex pag_ici_reclaim_lock; /* serialisation point */
unsigned long pag_ici_reclaim_cursor; /* reclaim restart point */
--
2.15.0
From: Matthew Wilcox <[email protected]>
These IDRs were only being used to allocate unique numbers, not to look
up pointers, so they can use the more space-efficient IDA instead.
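For reference, a sketch of the IDA calls that replace the
idr_alloc()/idr_remove() pairs (ida_simple_get()'s end argument of 0
means there is no upper bound on the allocated ID):
	handle = ida_simple_get(&vgdev->resource_ida, 1, 0, GFP_KERNEL);
	if (handle < 0)
		return handle;
	/* and when the resource goes away */
	ida_simple_remove(&vgdev->resource_ida, handle);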
Signed-off-by: Matthew Wilcox <[email protected]>
---
drivers/gpu/drm/virtio/virtgpu_drv.h | 6 ++----
drivers/gpu/drm/virtio/virtgpu_kms.c | 18 ++++--------------
drivers/gpu/drm/virtio/virtgpu_vq.c | 12 ++----------
3 files changed, 8 insertions(+), 28 deletions(-)
diff --git a/drivers/gpu/drm/virtio/virtgpu_drv.h b/drivers/gpu/drm/virtio/virtgpu_drv.h
index da2fb585fea4..6852e8bbc1b0 100644
--- a/drivers/gpu/drm/virtio/virtgpu_drv.h
+++ b/drivers/gpu/drm/virtio/virtgpu_drv.h
@@ -181,8 +181,7 @@ struct virtio_gpu_device {
struct kmem_cache *vbufs;
bool vqs_ready;
- struct idr resource_idr;
- spinlock_t resource_idr_lock;
+ struct ida resource_ida;
wait_queue_head_t resp_wq;
/* current display info */
@@ -191,8 +190,7 @@ struct virtio_gpu_device {
struct virtio_gpu_fence_driver fence_drv;
- struct idr ctx_id_idr;
- spinlock_t ctx_id_idr_lock;
+ struct ida ctx_id_ida;
bool has_virgl_3d;
diff --git a/drivers/gpu/drm/virtio/virtgpu_kms.c b/drivers/gpu/drm/virtio/virtgpu_kms.c
index 6400506a06b0..29c08b63d8d3 100644
--- a/drivers/gpu/drm/virtio/virtgpu_kms.c
+++ b/drivers/gpu/drm/virtio/virtgpu_kms.c
@@ -55,21 +55,13 @@ static void virtio_gpu_config_changed_work_func(struct work_struct *work)
static void virtio_gpu_ctx_id_get(struct virtio_gpu_device *vgdev,
uint32_t *resid)
{
- int handle;
-
- idr_preload(GFP_KERNEL);
- spin_lock(&vgdev->ctx_id_idr_lock);
- handle = idr_alloc(&vgdev->ctx_id_idr, NULL, 1, 0, 0);
- spin_unlock(&vgdev->ctx_id_idr_lock);
- idr_preload_end();
+ int handle = ida_simple_get(&vgdev->ctx_id_ida, 1, 0, GFP_KERNEL);
*resid = handle;
}
static void virtio_gpu_ctx_id_put(struct virtio_gpu_device *vgdev, uint32_t id)
{
- spin_lock(&vgdev->ctx_id_idr_lock);
- idr_remove(&vgdev->ctx_id_idr, id);
- spin_unlock(&vgdev->ctx_id_idr_lock);
+ ida_simple_remove(&vgdev->ctx_id_ida, id);
}
static void virtio_gpu_context_create(struct virtio_gpu_device *vgdev,
@@ -151,10 +143,8 @@ int virtio_gpu_driver_load(struct drm_device *dev, unsigned long flags)
vgdev->dev = dev->dev;
spin_lock_init(&vgdev->display_info_lock);
- spin_lock_init(&vgdev->ctx_id_idr_lock);
- idr_init(&vgdev->ctx_id_idr);
- spin_lock_init(&vgdev->resource_idr_lock);
- idr_init(&vgdev->resource_idr);
+ ida_init(&vgdev->ctx_id_ida);
+ ida_init(&vgdev->resource_ida);
init_waitqueue_head(&vgdev->resp_wq);
virtio_gpu_init_vq(&vgdev->ctrlq, virtio_gpu_dequeue_ctrl_func);
virtio_gpu_init_vq(&vgdev->cursorq, virtio_gpu_dequeue_cursor_func);
diff --git a/drivers/gpu/drm/virtio/virtgpu_vq.c b/drivers/gpu/drm/virtio/virtgpu_vq.c
index 9eb96fb2c147..b35b5f2c056c 100644
--- a/drivers/gpu/drm/virtio/virtgpu_vq.c
+++ b/drivers/gpu/drm/virtio/virtgpu_vq.c
@@ -41,21 +41,13 @@
void virtio_gpu_resource_id_get(struct virtio_gpu_device *vgdev,
uint32_t *resid)
{
- int handle;
-
- idr_preload(GFP_KERNEL);
- spin_lock(&vgdev->resource_idr_lock);
- handle = idr_alloc(&vgdev->resource_idr, NULL, 1, 0, GFP_NOWAIT);
- spin_unlock(&vgdev->resource_idr_lock);
- idr_preload_end();
+ int handle = ida_simple_get(&vgdev->resource_ida, 1, 0, GFP_KERNEL);
*resid = handle;
}
void virtio_gpu_resource_id_put(struct virtio_gpu_device *vgdev, uint32_t id)
{
- spin_lock(&vgdev->resource_idr_lock);
- idr_remove(&vgdev->resource_idr, id);
- spin_unlock(&vgdev->resource_idr_lock);
+ ida_simple_remove(&vgdev->resource_ida, id);
}
void virtio_gpu_ctrl_ack(struct virtqueue *vq)
--
2.15.0
From: Matthew Wilcox <[email protected]>
It's not entirely clear why the preload was there with no locking;
perhaps there was once a lock and the preload was never removed when it
went away.
Signed-off-by: Matthew Wilcox <[email protected]>
---
kernel/irq/timings.c | 4 +---
1 file changed, 1 insertion(+), 3 deletions(-)
diff --git a/kernel/irq/timings.c b/kernel/irq/timings.c
index e0923fa4927a..e9f16527656c 100644
--- a/kernel/irq/timings.c
+++ b/kernel/irq/timings.c
@@ -356,9 +356,7 @@ int irq_timings_alloc(int irq)
if (!s)
return -ENOMEM;
- idr_preload(GFP_KERNEL);
- id = idr_alloc(&irqt_stats, s, irq, irq + 1, GFP_NOWAIT);
- idr_preload_end();
+ id = idr_alloc(&irqt_stats, s, irq, irq + 1, GFP_KERNEL);
if (id < 0) {
free_percpu(s);
--
2.15.0
From: Matthew Wilcox <[email protected]>
In order to test the memory allocation failure paths, the radix tree
test suite fails allocations if __GFP_NOWARN is set. That happens to work
for the radix tree implementation, but the semantics we really want are
to fail any allocation which is not GFP_KERNEL. Do this by failing
allocations which don't have the __GFP_DIRECT_RECLAIM bit set.
Signed-off-by: Matthew Wilcox <[email protected]>
---
tools/testing/radix-tree/linux.c | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/tools/testing/radix-tree/linux.c b/tools/testing/radix-tree/linux.c
index 6903ccf35595..f7f3caed3650 100644
--- a/tools/testing/radix-tree/linux.c
+++ b/tools/testing/radix-tree/linux.c
@@ -29,7 +29,7 @@ void *kmem_cache_alloc(struct kmem_cache *cachep, int flags)
{
struct radix_tree_node *node;
- if (flags & __GFP_NOWARN)
+ if (!(flags & __GFP_DIRECT_RECLAIM))
return NULL;
pthread_mutex_lock(&cachep->lock);
--
2.15.0
From: Matthew Wilcox <[email protected]>
The IDR now has its own internal locking, so remove the idrinfo->lock.
Use the IDR's lock to protect the walks that were formerly protected by
our own lock. Remove the preloading as it is no longer necessary with
the internal locking. Then embed the action_idr in struct tc_action_net
instead of allocating it separately, and adjust various function
signatures to match.
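Condensed from tcf_idr_create() below, index allocation now uses the
u32-cursor form of the rewritten extended IDR API directly:
	u32 index = 1;	/* or the user-requested index */
	err = idr_alloc_u32(idr, NULL, &index, UINT_MAX, GFP_KERNEL);
	if (err)
		goto free_qstats;
	p->tcfa_index = index;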
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/net/act_api.h | 27 ++-----------
net/sched/act_api.c | 105 ++++++++++++++++++--------------------------------
2 files changed, 41 insertions(+), 91 deletions(-)
diff --git a/include/net/act_api.h b/include/net/act_api.h
index fd08df74c466..a353e7a3fb22 100644
--- a/include/net/act_api.h
+++ b/include/net/act_api.h
@@ -11,11 +11,6 @@
#include <net/net_namespace.h>
#include <net/netns/generic.h>
-struct tcf_idrinfo {
- spinlock_t lock;
- struct idr action_idr;
-};
-
struct tc_action_ops;
struct tc_action {
@@ -23,7 +18,7 @@ struct tc_action {
__u32 type; /* for backward compat(TCA_OLD_COMPAT) */
__u32 order;
struct list_head list;
- struct tcf_idrinfo *idrinfo;
+ struct idr *action_idr;
u32 tcfa_index;
int tcfa_refcnt;
@@ -98,7 +93,7 @@ struct tc_action_ops {
};
struct tc_action_net {
- struct tcf_idrinfo *idrinfo;
+ struct idr action_idr;
const struct tc_action_ops *ops;
};
@@ -108,25 +103,11 @@ int tc_action_net_init(struct tc_action_net *tn,
{
int err = 0;
- tn->idrinfo = kmalloc(sizeof(*tn->idrinfo), GFP_KERNEL);
- if (!tn->idrinfo)
- return -ENOMEM;
tn->ops = ops;
- spin_lock_init(&tn->idrinfo->lock);
- idr_init(&tn->idrinfo->action_idr);
+ idr_init(&tn->action_idr);
return err;
}
-
-void tcf_idrinfo_destroy(const struct tc_action_ops *ops,
- struct tcf_idrinfo *idrinfo);
-
-static inline void tc_action_net_exit(struct tc_action_net *tn)
-{
- rtnl_lock();
- tcf_idrinfo_destroy(tn->ops, tn->idrinfo);
- rtnl_unlock();
- kfree(tn->idrinfo);
-}
+void tc_action_net_exit(struct tc_action_net *tn);
int tcf_generic_walker(struct tc_action_net *tn, struct sk_buff *skb,
struct netlink_callback *cb, int type,
diff --git a/net/sched/act_api.c b/net/sched/act_api.c
index a6aa606b5e99..058e78088569 100644
--- a/net/sched/act_api.c
+++ b/net/sched/act_api.c
@@ -75,11 +75,9 @@ static void free_tcf(struct tc_action *p)
kfree(p);
}
-static void tcf_idr_remove(struct tcf_idrinfo *idrinfo, struct tc_action *p)
+static void tcf_idr_remove(struct idr *idr, struct tc_action *p)
{
- spin_lock_bh(&idrinfo->lock);
- idr_remove(&idrinfo->action_idr, p->tcfa_index);
- spin_unlock_bh(&idrinfo->lock);
+ idr_remove(idr, p->tcfa_index);
gen_kill_estimator(&p->tcfa_rate_est);
free_tcf(p);
}
@@ -100,7 +98,7 @@ int __tcf_idr_release(struct tc_action *p, bool bind, bool strict)
if (p->tcfa_bindcnt <= 0 && p->tcfa_refcnt <= 0) {
if (p->ops->cleanup)
p->ops->cleanup(p, bind);
- tcf_idr_remove(p->idrinfo, p);
+ tcf_idr_remove(p->action_idr, p);
ret = ACT_P_DELETED;
}
}
@@ -109,18 +107,18 @@ int __tcf_idr_release(struct tc_action *p, bool bind, bool strict)
}
EXPORT_SYMBOL(__tcf_idr_release);
-static int tcf_dump_walker(struct tcf_idrinfo *idrinfo, struct sk_buff *skb,
+static int tcf_dump_walker(struct tc_action_net *tn, struct sk_buff *skb,
struct netlink_callback *cb)
{
int err = 0, index = -1, s_i = 0, n_i = 0;
u32 act_flags = cb->args[2];
unsigned long jiffy_since = cb->args[3];
struct nlattr *nest;
- struct idr *idr = &idrinfo->action_idr;
+ struct idr *idr = &tn->action_idr;
struct tc_action *p;
unsigned long id = 1;
- spin_lock_bh(&idrinfo->lock);
+ idr_lock_bh(idr);
s_i = cb->args[0];
@@ -153,7 +151,7 @@ static int tcf_dump_walker(struct tcf_idrinfo *idrinfo, struct sk_buff *skb,
if (index >= 0)
cb->args[0] = index + 1;
- spin_unlock_bh(&idrinfo->lock);
+ idr_unlock_bh(idr);
if (n_i) {
if (act_flags & TCA_FLAG_LARGE_DUMP_ON)
cb->args[1] = n_i;
@@ -165,13 +163,13 @@ static int tcf_dump_walker(struct tcf_idrinfo *idrinfo, struct sk_buff *skb,
goto done;
}
-static int tcf_del_walker(struct tcf_idrinfo *idrinfo, struct sk_buff *skb,
+static int tcf_del_walker(struct tc_action_net *tn, struct sk_buff *skb,
const struct tc_action_ops *ops)
{
struct nlattr *nest;
int n_i = 0;
int ret = -EINVAL;
- struct idr *idr = &idrinfo->action_idr;
+ struct idr *idr = &tn->action_idr;
struct tc_action *p;
unsigned long id = 1;
@@ -204,12 +202,10 @@ int tcf_generic_walker(struct tc_action_net *tn, struct sk_buff *skb,
struct netlink_callback *cb, int type,
const struct tc_action_ops *ops)
{
- struct tcf_idrinfo *idrinfo = tn->idrinfo;
-
if (type == RTM_DELACTION) {
- return tcf_del_walker(idrinfo, skb, ops);
+ return tcf_del_walker(tn, skb, ops);
} else if (type == RTM_GETACTION) {
- return tcf_dump_walker(idrinfo, skb, cb);
+ return tcf_dump_walker(tn, skb, cb);
} else {
WARN(1, "tcf_generic_walker: unknown action %d\n", type);
return -EINVAL;
@@ -217,21 +213,9 @@ int tcf_generic_walker(struct tc_action_net *tn, struct sk_buff *skb,
}
EXPORT_SYMBOL(tcf_generic_walker);
-static struct tc_action *tcf_idr_lookup(u32 index, struct tcf_idrinfo *idrinfo)
-{
- struct tc_action *p;
-
- spin_lock_bh(&idrinfo->lock);
- p = idr_find(&idrinfo->action_idr, index);
- spin_unlock_bh(&idrinfo->lock);
-
- return p;
-}
-
int tcf_idr_search(struct tc_action_net *tn, struct tc_action **a, u32 index)
{
- struct tcf_idrinfo *idrinfo = tn->idrinfo;
- struct tc_action *p = tcf_idr_lookup(index, idrinfo);
+ struct tc_action *p = idr_find(&tn->action_idr, index);
if (p) {
*a = p;
@@ -244,8 +228,7 @@ EXPORT_SYMBOL(tcf_idr_search);
bool tcf_idr_check(struct tc_action_net *tn, u32 index, struct tc_action **a,
int bind)
{
- struct tcf_idrinfo *idrinfo = tn->idrinfo;
- struct tc_action *p = tcf_idr_lookup(index, idrinfo);
+ struct tc_action *p = idr_find(&tn->action_idr, index);
if (index && p) {
if (bind)
@@ -271,12 +254,11 @@ int tcf_idr_create(struct tc_action_net *tn, u32 index, struct nlattr *est,
int bind, bool cpustats)
{
struct tc_action *p = kzalloc(ops->size, GFP_KERNEL);
- struct tcf_idrinfo *idrinfo = tn->idrinfo;
- struct idr *idr = &idrinfo->action_idr;
+ struct idr *idr = &tn->action_idr;
int err = -ENOMEM;
if (unlikely(!p))
- return -ENOMEM;
+ goto free;
p->tcfa_refcnt = 1;
if (bind)
p->tcfa_bindcnt = 1;
@@ -284,31 +266,21 @@ int tcf_idr_create(struct tc_action_net *tn, u32 index, struct nlattr *est,
if (cpustats) {
p->cpu_bstats = netdev_alloc_pcpu_stats(struct gnet_stats_basic_cpu);
if (!p->cpu_bstats)
- goto err1;
+ goto free;
p->cpu_qstats = alloc_percpu(struct gnet_stats_queue);
if (!p->cpu_qstats)
- goto err2;
+ goto free_bstats;
}
spin_lock_init(&p->tcfa_lock);
/* user doesn't specify an index */
if (!index) {
index = 1;
- idr_preload(GFP_KERNEL);
- spin_lock_bh(&idrinfo->lock);
- err = idr_alloc_u32(idr, NULL, &index, UINT_MAX, GFP_ATOMIC);
- spin_unlock_bh(&idrinfo->lock);
- idr_preload_end();
- if (err)
- goto err3;
+ err = idr_alloc_u32(idr, NULL, &index, UINT_MAX, GFP_KERNEL);
} else {
- idr_preload(GFP_KERNEL);
- spin_lock_bh(&idrinfo->lock);
- err = idr_alloc_u32(idr, NULL, &index, index, GFP_ATOMIC);
- spin_unlock_bh(&idrinfo->lock);
- idr_preload_end();
- if (err)
- goto err3;
+ err = idr_alloc_u32(idr, NULL, &index, index, GFP_KERNEL);
}
+ if (err)
+ goto free_qstats;
p->tcfa_index = index;
p->tcfa_tm.install = jiffies;
@@ -319,21 +291,22 @@ int tcf_idr_create(struct tc_action_net *tn, u32 index, struct nlattr *est,
&p->tcfa_rate_est,
&p->tcfa_lock, NULL, est);
if (err)
- goto err4;
+ goto free_idr;
}
- p->idrinfo = idrinfo;
+ p->action_idr = idr;
p->ops = ops;
INIT_LIST_HEAD(&p->list);
*a = p;
return 0;
-err4:
+
+free_idr:
idr_remove(idr, index);
-err3:
+free_qstats:
free_percpu(p->cpu_qstats);
-err2:
+free_bstats:
free_percpu(p->cpu_bstats);
-err1:
+free:
kfree(p);
return err;
}
@@ -341,32 +314,28 @@ EXPORT_SYMBOL(tcf_idr_create);
void tcf_idr_insert(struct tc_action_net *tn, struct tc_action *a)
{
- struct tcf_idrinfo *idrinfo = tn->idrinfo;
-
- spin_lock_bh(&idrinfo->lock);
- idr_replace(&idrinfo->action_idr, a, a->tcfa_index);
- spin_unlock_bh(&idrinfo->lock);
+ idr_replace(&tn->action_idr, a, a->tcfa_index);
}
EXPORT_SYMBOL(tcf_idr_insert);
-void tcf_idrinfo_destroy(const struct tc_action_ops *ops,
- struct tcf_idrinfo *idrinfo)
+void tc_action_net_exit(struct tc_action_net *tn)
{
- struct idr *idr = &idrinfo->action_idr;
+ struct idr *idr = &tn->action_idr;
struct tc_action *p;
int ret;
unsigned long id = 1;
+ rtnl_lock();
idr_for_each_entry_ul(idr, p, id) {
ret = __tcf_idr_release(p, false, true);
if (ret == ACT_P_DELETED)
- module_put(ops->owner);
- else if (ret < 0)
- return;
+ module_put(tn->ops->owner);
+ WARN_ON(ret < 0);
}
- idr_destroy(&idrinfo->action_idr);
+ idr_destroy(idr);
+ rtnl_unlock();
}
-EXPORT_SYMBOL(tcf_idrinfo_destroy);
+EXPORT_SYMBOL(tc_action_net_exit);
static LIST_HEAD(act_base);
static DEFINE_RWLOCK(act_mod_lock);
--
2.15.0
From: Matthew Wilcox <[email protected]>
By switching to the internal IDR lock, we can get rid of the idr_preload
calls.
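The pattern being removed looks roughly like this; my_idr, my_lock and ptr
are placeholder names, not the actual qxl identifiers:

	/* old: preload outside the lock, then a non-sleeping allocation */
	idr_preload(GFP_KERNEL);
	spin_lock(&my_lock);
	id = idr_alloc(&my_idr, ptr, 1, 0, GFP_NOWAIT);
	spin_unlock(&my_lock);
	idr_preload_end();

	/* new: the IDR's embedded lock means idr_alloc() can take GFP_KERNEL
	 * directly and allocate any nodes it needs itself
	 */
	id = idr_alloc(&my_idr, ptr, 1, 0, GFP_KERNEL);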
Signed-off-by: Matthew Wilcox <[email protected]>
---
drivers/gpu/drm/qxl/qxl_cmd.c | 26 +++++++-------------------
drivers/gpu/drm/qxl/qxl_drv.h | 2 --
drivers/gpu/drm/qxl/qxl_kms.c | 2 --
drivers/gpu/drm/qxl/qxl_release.c | 12 +++---------
4 files changed, 10 insertions(+), 32 deletions(-)
diff --git a/drivers/gpu/drm/qxl/qxl_cmd.c b/drivers/gpu/drm/qxl/qxl_cmd.c
index c0fb52c6d4ca..b1cecc38ef5f 100644
--- a/drivers/gpu/drm/qxl/qxl_cmd.c
+++ b/drivers/gpu/drm/qxl/qxl_cmd.c
@@ -451,37 +451,29 @@ int qxl_surface_id_alloc(struct qxl_device *qdev,
int idr_ret;
int count = 0;
again:
- idr_preload(GFP_ATOMIC);
- spin_lock(&qdev->surf_id_idr_lock);
- idr_ret = idr_alloc(&qdev->surf_id_idr, NULL, 1, 0, GFP_NOWAIT);
- spin_unlock(&qdev->surf_id_idr_lock);
- idr_preload_end();
+ idr_ret = idr_alloc(&qdev->surf_id_idr, NULL, 1, 0, GFP_ATOMIC);
if (idr_ret < 0)
return idr_ret;
handle = idr_ret;
if (handle >= qdev->rom->n_surfaces) {
count++;
- spin_lock(&qdev->surf_id_idr_lock);
idr_remove(&qdev->surf_id_idr, handle);
- spin_unlock(&qdev->surf_id_idr_lock);
qxl_reap_surface_id(qdev, 2);
goto again;
}
surf->surface_id = handle;
- spin_lock(&qdev->surf_id_idr_lock);
+ idr_lock(&qdev->surf_id_idr);
qdev->last_alloced_surf_id = handle;
- spin_unlock(&qdev->surf_id_idr_lock);
+ idr_unlock(&qdev->surf_id_idr);
return 0;
}
void qxl_surface_id_dealloc(struct qxl_device *qdev,
uint32_t surface_id)
{
- spin_lock(&qdev->surf_id_idr_lock);
idr_remove(&qdev->surf_id_idr, surface_id);
- spin_unlock(&qdev->surf_id_idr_lock);
}
int qxl_hw_surface_alloc(struct qxl_device *qdev,
@@ -534,9 +526,7 @@ int qxl_hw_surface_alloc(struct qxl_device *qdev,
qxl_release_fence_buffer_objects(release);
surf->hw_surf_alloc = true;
- spin_lock(&qdev->surf_id_idr_lock);
idr_replace(&qdev->surf_id_idr, surf, surf->surface_id);
- spin_unlock(&qdev->surf_id_idr_lock);
return 0;
}
@@ -559,9 +549,7 @@ int qxl_hw_surface_dealloc(struct qxl_device *qdev,
surf->surf_create = NULL;
/* remove the surface from the idr, but not the surface id yet */
- spin_lock(&qdev->surf_id_idr_lock);
idr_replace(&qdev->surf_id_idr, NULL, surf->surface_id);
- spin_unlock(&qdev->surf_id_idr_lock);
surf->hw_surf_alloc = false;
id = surf->surface_id;
@@ -650,9 +638,9 @@ static int qxl_reap_surface_id(struct qxl_device *qdev, int max_to_reap)
mutex_lock(&qdev->surf_evict_mutex);
again:
- spin_lock(&qdev->surf_id_idr_lock);
+ idr_lock(&qdev->surf_id_idr);
start = qdev->last_alloced_surf_id + 1;
- spin_unlock(&qdev->surf_id_idr_lock);
+ idr_unlock(&qdev->surf_id_idr);
for (i = start; i < start + qdev->rom->n_surfaces; i++) {
void *objptr;
@@ -661,9 +649,9 @@ static int qxl_reap_surface_id(struct qxl_device *qdev, int max_to_reap)
/* this avoids the case where the objects is in the
idr but has been evicted half way - its makes
the idr lookup atomic with the eviction */
- spin_lock(&qdev->surf_id_idr_lock);
+ idr_lock(&qdev->surf_id_idr);
objptr = idr_find(&qdev->surf_id_idr, surfid);
- spin_unlock(&qdev->surf_id_idr_lock);
+ idr_unlock(&qdev->surf_id_idr);
if (!objptr)
continue;
diff --git a/drivers/gpu/drm/qxl/qxl_drv.h b/drivers/gpu/drm/qxl/qxl_drv.h
index 08752c0ffb35..07378adbf4e8 100644
--- a/drivers/gpu/drm/qxl/qxl_drv.h
+++ b/drivers/gpu/drm/qxl/qxl_drv.h
@@ -255,7 +255,6 @@ struct qxl_device {
spinlock_t release_lock;
struct idr release_idr;
uint32_t release_seqno;
- spinlock_t release_idr_lock;
struct mutex async_io_mutex;
unsigned int last_sent_io_cmd;
@@ -277,7 +276,6 @@ struct qxl_device {
struct mutex update_area_mutex;
struct idr surf_id_idr;
- spinlock_t surf_id_idr_lock;
int last_alloced_surf_id;
struct mutex surf_evict_mutex;
diff --git a/drivers/gpu/drm/qxl/qxl_kms.c b/drivers/gpu/drm/qxl/qxl_kms.c
index c5716a0ca3b8..276a5b25ae8e 100644
--- a/drivers/gpu/drm/qxl/qxl_kms.c
+++ b/drivers/gpu/drm/qxl/qxl_kms.c
@@ -204,11 +204,9 @@ int qxl_device_init(struct qxl_device *qdev,
GFP_KERNEL);
idr_init(&qdev->release_idr);
- spin_lock_init(&qdev->release_idr_lock);
spin_lock_init(&qdev->release_lock);
idr_init(&qdev->surf_id_idr);
- spin_lock_init(&qdev->surf_id_idr_lock);
mutex_init(&qdev->async_io_mutex);
diff --git a/drivers/gpu/drm/qxl/qxl_release.c b/drivers/gpu/drm/qxl/qxl_release.c
index a6da6fa6ad58..c327b3441b63 100644
--- a/drivers/gpu/drm/qxl/qxl_release.c
+++ b/drivers/gpu/drm/qxl/qxl_release.c
@@ -142,12 +142,10 @@ qxl_release_alloc(struct qxl_device *qdev, int type,
release->surface_release_id = 0;
INIT_LIST_HEAD(&release->bos);
- idr_preload(GFP_KERNEL);
- spin_lock(&qdev->release_idr_lock);
- handle = idr_alloc(&qdev->release_idr, release, 1, 0, GFP_NOWAIT);
+ handle = idr_alloc(&qdev->release_idr, release, 1, 0, GFP_KERNEL);
+ idr_lock(&qdev->release_idr);
release->base.seqno = ++qdev->release_seqno;
- spin_unlock(&qdev->release_idr_lock);
- idr_preload_end();
+ idr_unlock(&qdev->release_idr);
if (handle < 0) {
kfree(release);
*ret = NULL;
@@ -184,9 +182,7 @@ qxl_release_free(struct qxl_device *qdev,
if (release->surface_release_id)
qxl_surface_id_dealloc(qdev, release->surface_release_id);
- spin_lock(&qdev->release_idr_lock);
idr_remove(&qdev->release_idr, release->id);
- spin_unlock(&qdev->release_idr_lock);
if (release->base.ops) {
WARN_ON(list_empty(&release->bos));
@@ -392,9 +388,7 @@ struct qxl_release *qxl_release_from_id_locked(struct qxl_device *qdev,
{
struct qxl_release *release;
- spin_lock(&qdev->release_idr_lock);
release = idr_find(&qdev->release_idr, id);
- spin_unlock(&qdev->release_idr_lock);
if (!release) {
DRM_ERROR("failed to find id in release_idr\n");
return NULL;
--
2.15.0
From: Matthew Wilcox <[email protected]>
Convert find_get_entry() to the XArray. Slightly simpler and shorter code.
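The core of the new lookup is a cursor walk under RCU; this is a simplified
sketch only (it skips the compound-page handling the real code below keeps):

	struct page *page;
	XA_STATE(xas, offset);

	rcu_read_lock();
	page = xas_load(&mapping->pages, &xas);
	if (page && !xa_is_value(page) && page_cache_get_speculative(page)) {
		/* did the slot change while we took the reference? */
		if (page != xas_reload(&mapping->pages, &xas)) {
			put_page(page);
			page = NULL;	/* the real code retries here */
		}
	}
	rcu_read_unlock();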
Signed-off-by: Matthew Wilcox <[email protected]>
---
mm/filemap.c | 56 +++++++++++++++++++++-----------------------------------
1 file changed, 21 insertions(+), 35 deletions(-)
diff --git a/mm/filemap.c b/mm/filemap.c
index f60f22867a1a..2172c9f9efb9 100644
--- a/mm/filemap.c
+++ b/mm/filemap.c
@@ -1391,51 +1391,37 @@ EXPORT_SYMBOL(page_cache_prev_hole);
*/
struct page *find_get_entry(struct address_space *mapping, pgoff_t offset)
{
- void **pagep;
+ XA_STATE(xas, offset);
struct page *head, *page;
rcu_read_lock();
repeat:
- page = NULL;
- pagep = radix_tree_lookup_slot(&mapping->pages, offset);
- if (pagep) {
- page = radix_tree_deref_slot(pagep);
- if (unlikely(!page))
- goto out;
- if (radix_tree_exception(page)) {
- if (radix_tree_deref_retry(page))
- goto repeat;
- /*
- * A shadow entry of a recently evicted page,
- * or a swap entry from shmem/tmpfs. Return
- * it without attempting to raise page count.
- */
- goto out;
- }
+ page = xas_load(&mapping->pages, &xas);
+ if (!page || xa_is_value(page))
+ goto out;
- head = compound_head(page);
- if (!page_cache_get_speculative(head))
- goto repeat;
+ head = compound_head(page);
+ if (!page_cache_get_speculative(head))
+ goto repeat;
- /* The page was split under us? */
- if (compound_head(page) != head) {
- put_page(head);
- goto repeat;
- }
+ /* The page was split under us? */
+ if (compound_head(page) != head) {
+ put_page(head);
+ goto repeat;
+ }
- /*
- * Has the page moved?
- * This is part of the lockless pagecache protocol. See
- * include/linux/pagemap.h for details.
- */
- if (unlikely(page != *pagep)) {
- put_page(head);
- goto repeat;
- }
+ /*
+ * Has the page moved?
+ * This is part of the lockless pagecache protocol. See
+ * include/linux/pagemap.h for details.
+ */
+ if (unlikely(page != xas_reload(&mapping->pages, &xas))) {
+ put_page(head);
+ goto repeat;
}
+
out:
rcu_read_unlock();
-
return page;
}
EXPORT_SYMBOL(find_get_entry);
--
2.15.0
From: Matthew Wilcox <[email protected]>
xa_load() has its own RCU locking, so we can drop the explicit
rcu_read_lock()/rcu_read_unlock() pair in shmem_confirm_swap().
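Conceptually, xa_load() behaves like the sketch below (not the exact
implementation), which is why the explicit RCU bracketing in the caller
is redundant:

	void *xa_load(struct xarray *xa, unsigned long index)
	{
		XA_STATE(xas, index);
		void *entry;

		rcu_read_lock();
		entry = xas_load(xa, &xas);
		rcu_read_unlock();

		return entry;
	}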
Signed-off-by: Matthew Wilcox <[email protected]>
---
mm/shmem.c | 7 +------
1 file changed, 1 insertion(+), 6 deletions(-)
diff --git a/mm/shmem.c b/mm/shmem.c
index f16afa03cfb0..98944cff6319 100644
--- a/mm/shmem.c
+++ b/mm/shmem.c
@@ -348,12 +348,7 @@ static int shmem_xa_replace(struct address_space *mapping,
static bool shmem_confirm_swap(struct address_space *mapping,
pgoff_t index, swp_entry_t swap)
{
- void *item;
-
- rcu_read_lock();
- item = radix_tree_lookup(&mapping->pages, index);
- rcu_read_unlock();
- return item == swp_to_radix_entry(swap);
+ return xa_load(&mapping->pages, index) == swp_to_radix_entry(swap);
}
/*
--
2.15.0
From: Matthew Wilcox <[email protected]>
XFS currently contains a copy-and-paste of __set_page_dirty(). Export
it from buffer.c instead.
Signed-off-by: Matthew Wilcox <[email protected]>
---
fs/buffer.c | 3 ++-
fs/xfs/xfs_aops.c | 15 ++-------------
include/linux/mm.h | 1 +
3 files changed, 5 insertions(+), 14 deletions(-)
diff --git a/fs/buffer.c b/fs/buffer.c
index 0736a6a2e2f0..fd894c2ae284 100644
--- a/fs/buffer.c
+++ b/fs/buffer.c
@@ -601,7 +601,7 @@ EXPORT_SYMBOL(mark_buffer_dirty_inode);
*
* The caller must hold lock_page_memcg().
*/
-static void __set_page_dirty(struct page *page, struct address_space *mapping,
+void __set_page_dirty(struct page *page, struct address_space *mapping,
int warn)
{
unsigned long flags;
@@ -615,6 +615,7 @@ static void __set_page_dirty(struct page *page, struct address_space *mapping,
}
spin_unlock_irqrestore(&mapping->tree_lock, flags);
}
+EXPORT_SYMBOL_GPL(__set_page_dirty);
/*
* Add a page to the dirty page list.
diff --git a/fs/xfs/xfs_aops.c b/fs/xfs/xfs_aops.c
index a3eeaba156c5..48b17d522fa1 100644
--- a/fs/xfs/xfs_aops.c
+++ b/fs/xfs/xfs_aops.c
@@ -1459,19 +1459,8 @@ xfs_vm_set_page_dirty(
newly_dirty = !TestSetPageDirty(page);
spin_unlock(&mapping->private_lock);
- if (newly_dirty) {
- /* sigh - __set_page_dirty() is static, so copy it here, too */
- unsigned long flags;
-
- spin_lock_irqsave(&mapping->tree_lock, flags);
- if (page->mapping) { /* Race with truncate? */
- WARN_ON_ONCE(!PageUptodate(page));
- account_page_dirtied(page, mapping);
- radix_tree_tag_set(&mapping->page_tree,
- page_index(page), PAGECACHE_TAG_DIRTY);
- }
- spin_unlock_irqrestore(&mapping->tree_lock, flags);
- }
+ if (newly_dirty)
+ __set_page_dirty(page, mapping, 1);
unlock_page_memcg(page);
if (newly_dirty)
__mark_inode_dirty(mapping->host, I_DIRTY_PAGES);
diff --git a/include/linux/mm.h b/include/linux/mm.h
index ee073146aaa7..c692451245be 100644
--- a/include/linux/mm.h
+++ b/include/linux/mm.h
@@ -1434,6 +1434,7 @@ extern int try_to_release_page(struct page * page, gfp_t gfp_mask);
extern void do_invalidatepage(struct page *page, unsigned int offset,
unsigned int length);
+void __set_page_dirty(struct page *, struct address_space *, int warn);
int __set_page_dirty_nobuffers(struct page *page);
int __set_page_dirty_no_writeback(struct page *page);
int redirty_page_for_writepage(struct writeback_control *wbc,
--
2.15.0
From: Matthew Wilcox <[email protected]>
Convert brd_pages from a radix tree to an XArray. Simpler and smaller
code; in particular another user of radix_tree_preload is eliminated.
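The key idiom, pulled out of the diff below for readability: xa_cmpxchg()
stores the new page only if the slot is empty and returns whatever was
already there, so insertion races resolve without an external lock:

	curr = xa_cmpxchg(&brd->brd_pages, idx, NULL, page, GFP_NOIO);
	if (curr) {
		/* somebody beat us to it; free ours and use theirs */
		__free_page(page);
		page = curr;
	}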
Signed-off-by: Matthew Wilcox <[email protected]>
---
drivers/block/brd.c | 87 ++++++++++++++---------------------------------------
1 file changed, 23 insertions(+), 64 deletions(-)
diff --git a/drivers/block/brd.c b/drivers/block/brd.c
index 8028a3a7e7fd..c9fda03810ea 100644
--- a/drivers/block/brd.c
+++ b/drivers/block/brd.c
@@ -17,7 +17,7 @@
#include <linux/bio.h>
#include <linux/highmem.h>
#include <linux/mutex.h>
-#include <linux/radix-tree.h>
+#include <linux/xarray.h>
#include <linux/fs.h>
#include <linux/slab.h>
#include <linux/backing-dev.h>
@@ -29,9 +29,9 @@
#define PAGE_SECTORS (1 << PAGE_SECTORS_SHIFT)
/*
- * Each block ramdisk device has a radix_tree brd_pages of pages that stores
- * the pages containing the block device's contents. A brd page's ->index is
- * its offset in PAGE_SIZE units. This is similar to, but in no way connected
+ * Each block ramdisk device has an xarray brd_pages that stores the pages
+ * containing the block device's contents. A brd page's ->index is its
+ * offset in PAGE_SIZE units. This is similar to, but in no way connected
* with, the kernel's pagecache or buffer cache (which sit above our block
* device).
*/
@@ -41,13 +41,7 @@ struct brd_device {
struct request_queue *brd_queue;
struct gendisk *brd_disk;
struct list_head brd_list;
-
- /*
- * Backing store of pages and lock to protect it. This is the contents
- * of the block device.
- */
- spinlock_t brd_lock;
- struct radix_tree_root brd_pages;
+ struct xarray brd_pages;
};
/*
@@ -62,17 +56,9 @@ static struct page *brd_lookup_page(struct brd_device *brd, sector_t sector)
* The page lifetime is protected by the fact that we have opened the
* device node -- brd pages will never be deleted under us, so we
* don't need any further locking or refcounting.
- *
- * This is strictly true for the radix-tree nodes as well (ie. we
- * don't actually need the rcu_read_lock()), however that is not a
- * documented feature of the radix-tree API so it is better to be
- * safe here (we don't have total exclusion from radix tree updates
- * here, only deletes).
*/
- rcu_read_lock();
idx = sector >> PAGE_SECTORS_SHIFT; /* sector to page index */
- page = radix_tree_lookup(&brd->brd_pages, idx);
- rcu_read_unlock();
+ page = xa_load(&brd->brd_pages, idx);
BUG_ON(page && page->index != idx);
@@ -87,7 +73,7 @@ static struct page *brd_lookup_page(struct brd_device *brd, sector_t sector)
static struct page *brd_insert_page(struct brd_device *brd, sector_t sector)
{
pgoff_t idx;
- struct page *page;
+ struct page *curr, *page;
gfp_t gfp_flags;
page = brd_lookup_page(brd, sector);
@@ -108,62 +94,36 @@ static struct page *brd_insert_page(struct brd_device *brd, sector_t sector)
if (!page)
return NULL;
- if (radix_tree_preload(GFP_NOIO)) {
- __free_page(page);
- return NULL;
- }
-
- spin_lock(&brd->brd_lock);
idx = sector >> PAGE_SECTORS_SHIFT;
page->index = idx;
- if (radix_tree_insert(&brd->brd_pages, idx, page)) {
+ curr = xa_cmpxchg(&brd->brd_pages, idx, NULL, page, GFP_NOIO);
+ if (curr) {
__free_page(page);
- page = radix_tree_lookup(&brd->brd_pages, idx);
+ page = curr;
BUG_ON(!page);
BUG_ON(page->index != idx);
}
- spin_unlock(&brd->brd_lock);
-
- radix_tree_preload_end();
return page;
}
/*
- * Free all backing store pages and radix tree. This must only be called when
+ * Free all backing store pages and xarray. This must only be called when
* there are no other users of the device.
*/
-#define FREE_BATCH 16
static void brd_free_pages(struct brd_device *brd)
{
- unsigned long pos = 0;
- struct page *pages[FREE_BATCH];
- int nr_pages;
-
- do {
- int i;
-
- nr_pages = radix_tree_gang_lookup(&brd->brd_pages,
- (void **)pages, pos, FREE_BATCH);
-
- for (i = 0; i < nr_pages; i++) {
- void *ret;
-
- BUG_ON(pages[i]->index < pos);
- pos = pages[i]->index;
- ret = radix_tree_delete(&brd->brd_pages, pos);
- BUG_ON(!ret || ret != pages[i]);
- __free_page(pages[i]);
- }
-
- pos++;
-
- /*
- * This assumes radix_tree_gang_lookup always returns as
- * many pages as possible. If the radix-tree code changes,
- * so will this have to.
- */
- } while (nr_pages == FREE_BATCH);
+ XA_STATE(xas, 0);
+ struct page *page;
+
+ /* lockdep can't know there are no other users */
+ xa_lock(&brd->brd_pages);
+ xas_for_each(&brd->brd_pages, &xas, page, ULONG_MAX) {
+ BUG_ON(page->index != xas.xa_index);
+ __free_page(page);
+ xas_store(&brd->brd_pages, &xas, NULL);
+ }
+ xa_unlock(&brd->brd_pages);
}
/*
@@ -373,8 +333,7 @@ static struct brd_device *brd_alloc(int i)
if (!brd)
goto out;
brd->brd_number = i;
- spin_lock_init(&brd->brd_lock);
- INIT_RADIX_TREE(&brd->brd_pages, GFP_ATOMIC);
+ xa_init(&brd->brd_pages);
brd->brd_queue = blk_alloc_queue(GFP_KERNEL);
if (!brd->brd_queue)
--
2.15.0
From: Matthew Wilcox <[email protected]>
These functions allow a range of xarray entries to be extracted into a
compact normal array.
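A hypothetical caller (the array size and index range here are purely
illustrative):

	void *entries[16];
	int n;

	/* copy up to 16 entries whose indices fall in [0, ULONG_MAX] */
	n = xa_get_entries(&xa, entries, 0, ULONG_MAX, 16);

xa_get_tagged() is used the same way, with the tag to match as the final
argument.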
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/xarray.h | 4 +++
lib/xarray.c | 69 ++++++++++++++++++++++++++++++++++++++++++++++++++
2 files changed, 73 insertions(+)
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index 427c792ddb2a..a48e7aa6406c 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -145,6 +145,10 @@ void *xa_clear_tag(struct xarray *, unsigned long index, xa_tag_t);
void *xa_find(struct xarray *xa, unsigned long *index, unsigned long max);
void *xa_next(struct xarray *xa, unsigned long *index, unsigned long max);
+int xa_get_entries(struct xarray *, void **dst, unsigned long start,
+ unsigned long max, unsigned int n);
+int xa_get_tagged(struct xarray *, void **dst, unsigned long start,
+ unsigned long max, unsigned int n, xa_tag_t);
/**
* xa_for_each() - Iterate over a portion of an XArray.
diff --git a/lib/xarray.c b/lib/xarray.c
index ea2dbd343380..9577a70495c0 100644
--- a/lib/xarray.c
+++ b/lib/xarray.c
@@ -1243,3 +1243,72 @@ void *xa_next(struct xarray *xa, unsigned long *indexp, unsigned long max)
return entry;
}
EXPORT_SYMBOL(xa_next);
+
+/**
+ * xa_get_entries() - Copy entries from the xarray into a normal array
+ * @xa: The source XArray to copy from
+ * @dst: The buffer to copy pointers into
+ * @start: The first index in the XArray eligible to be copied from
+ * @max: The last index in the XArray eligible to be copied from
+ * @n: The maximum number of entries to copy
+ *
+ * Return: The number of entries copied.
+ */
+int xa_get_entries(struct xarray *xa, void **dst, unsigned long start,
+ unsigned long max, unsigned int n)
+{
+ XA_STATE(xas, start);
+ void *entry;
+ unsigned int i = 0;
+
+ if (!n)
+ return 0;
+
+ rcu_read_lock();
+ xas_for_each(xa, &xas, entry, max) {
+ if (xas_retry(&xas, entry))
+ continue;
+ dst[i++] = entry;
+ if (i == n)
+ break;
+ }
+ rcu_read_unlock();
+
+ return i;
+}
+EXPORT_SYMBOL(xa_get_entries);
+
+/**
+ * xa_get_tagged() - Copy tagged entries from the xarray into a normal array.
+ * @xa: The source XArray to copy from.
+ * @dst: The buffer to copy pointers into.
+ * @start: The first index in the XArray eligible to be copied from.
+ * @max: The last index in the XArray eligible to be copied from
+ * @n: The maximum number of entries to copy.
+ * @tag: Tag number.
+ *
+ * Return: The number of entries copied.
+ */
+int xa_get_tagged(struct xarray *xa, void **dst, unsigned long start,
+ unsigned long max, unsigned int n, xa_tag_t tag)
+{
+ XA_STATE(xas, start);
+ void *entry;
+ unsigned int i = 0;
+
+ if (!n)
+ return 0;
+
+ rcu_read_lock();
+ xas_for_each_tag(xa, &xas, entry, max, tag) {
+ if (xas_retry(&xas, entry))
+ continue;
+ dst[i++] = entry;
+ if (i == n)
+ break;
+ }
+ rcu_read_unlock();
+
+ return i;
+}
+EXPORT_SYMBOL(xa_get_tagged);
--
2.15.0
From: Matthew Wilcox <[email protected]>
Copy over non-atomic.h and delete the duplicate definitions which had
been pasted into atomic.h.
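A minimal usage sketch of the helpers this makes available to the tools
build (assert() is used purely for illustration):

	unsigned long map[1] = { 0 };

	__set_bit(3, map);
	assert(test_bit(3, map));
	assert(__test_and_clear_bit(3, map));
	assert(!test_bit(3, map));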
Signed-off-by: Matthew Wilcox <[email protected]>
---
tools/include/asm-generic/bitops.h | 1 +
tools/include/asm-generic/bitops/atomic.h | 9 ---
tools/include/asm-generic/bitops/non-atomic.h | 109 ++++++++++++++++++++++++++
3 files changed, 110 insertions(+), 9 deletions(-)
create mode 100644 tools/include/asm-generic/bitops/non-atomic.h
diff --git a/tools/include/asm-generic/bitops.h b/tools/include/asm-generic/bitops.h
index 9bce3b56b5e7..5d2ab38965cc 100644
--- a/tools/include/asm-generic/bitops.h
+++ b/tools/include/asm-generic/bitops.h
@@ -27,5 +27,6 @@
#include <asm-generic/bitops/hweight.h>
#include <asm-generic/bitops/atomic.h>
+#include <asm-generic/bitops/non-atomic.h>
#endif /* __TOOLS_ASM_GENERIC_BITOPS_H */
diff --git a/tools/include/asm-generic/bitops/atomic.h b/tools/include/asm-generic/bitops/atomic.h
index 21c41ccd1266..2f6ea28764a7 100644
--- a/tools/include/asm-generic/bitops/atomic.h
+++ b/tools/include/asm-generic/bitops/atomic.h
@@ -15,13 +15,4 @@ static inline void clear_bit(int nr, unsigned long *addr)
addr[nr / __BITS_PER_LONG] &= ~(1UL << (nr % __BITS_PER_LONG));
}
-static __always_inline int test_bit(unsigned int nr, const unsigned long *addr)
-{
- return ((1UL << (nr % __BITS_PER_LONG)) &
- (((unsigned long *)addr)[nr / __BITS_PER_LONG])) != 0;
-}
-
-#define __set_bit(nr, addr) set_bit(nr, addr)
-#define __clear_bit(nr, addr) clear_bit(nr, addr)
-
#endif /* _TOOLS_LINUX_ASM_GENERIC_BITOPS_ATOMIC_H_ */
diff --git a/tools/include/asm-generic/bitops/non-atomic.h b/tools/include/asm-generic/bitops/non-atomic.h
new file mode 100644
index 000000000000..7e10c4b50c5d
--- /dev/null
+++ b/tools/include/asm-generic/bitops/non-atomic.h
@@ -0,0 +1,109 @@
+/* SPDX-License-Identifier: GPL-2.0 */
+#ifndef _ASM_GENERIC_BITOPS_NON_ATOMIC_H_
+#define _ASM_GENERIC_BITOPS_NON_ATOMIC_H_
+
+#include <asm/types.h>
+
+/**
+ * __set_bit - Set a bit in memory
+ * @nr: the bit to set
+ * @addr: the address to start counting from
+ *
+ * Unlike set_bit(), this function is non-atomic and may be reordered.
+ * If it's called on the same region of memory simultaneously, the effect
+ * may be that only one operation succeeds.
+ */
+static inline void __set_bit(int nr, volatile unsigned long *addr)
+{
+ unsigned long mask = BIT_MASK(nr);
+ unsigned long *p = ((unsigned long *)addr) + BIT_WORD(nr);
+
+ *p |= mask;
+}
+
+static inline void __clear_bit(int nr, volatile unsigned long *addr)
+{
+ unsigned long mask = BIT_MASK(nr);
+ unsigned long *p = ((unsigned long *)addr) + BIT_WORD(nr);
+
+ *p &= ~mask;
+}
+
+/**
+ * __change_bit - Toggle a bit in memory
+ * @nr: the bit to change
+ * @addr: the address to start counting from
+ *
+ * Unlike change_bit(), this function is non-atomic and may be reordered.
+ * If it's called on the same region of memory simultaneously, the effect
+ * may be that only one operation succeeds.
+ */
+static inline void __change_bit(int nr, volatile unsigned long *addr)
+{
+ unsigned long mask = BIT_MASK(nr);
+ unsigned long *p = ((unsigned long *)addr) + BIT_WORD(nr);
+
+ *p ^= mask;
+}
+
+/**
+ * __test_and_set_bit - Set a bit and return its old value
+ * @nr: Bit to set
+ * @addr: Address to count from
+ *
+ * This operation is non-atomic and can be reordered.
+ * If two examples of this operation race, one can appear to succeed
+ * but actually fail. You must protect multiple accesses with a lock.
+ */
+static inline int __test_and_set_bit(int nr, volatile unsigned long *addr)
+{
+ unsigned long mask = BIT_MASK(nr);
+ unsigned long *p = ((unsigned long *)addr) + BIT_WORD(nr);
+ unsigned long old = *p;
+
+ *p = old | mask;
+ return (old & mask) != 0;
+}
+
+/**
+ * __test_and_clear_bit - Clear a bit and return its old value
+ * @nr: Bit to clear
+ * @addr: Address to count from
+ *
+ * This operation is non-atomic and can be reordered.
+ * If two examples of this operation race, one can appear to succeed
+ * but actually fail. You must protect multiple accesses with a lock.
+ */
+static inline int __test_and_clear_bit(int nr, volatile unsigned long *addr)
+{
+ unsigned long mask = BIT_MASK(nr);
+ unsigned long *p = ((unsigned long *)addr) + BIT_WORD(nr);
+ unsigned long old = *p;
+
+ *p = old & ~mask;
+ return (old & mask) != 0;
+}
+
+/* WARNING: non atomic and it can be reordered! */
+static inline int __test_and_change_bit(int nr,
+ volatile unsigned long *addr)
+{
+ unsigned long mask = BIT_MASK(nr);
+ unsigned long *p = ((unsigned long *)addr) + BIT_WORD(nr);
+ unsigned long old = *p;
+
+ *p = old ^ mask;
+ return (old & mask) != 0;
+}
+
+/**
+ * test_bit - Determine whether a bit is set
+ * @nr: bit number to test
+ * @addr: Address to start counting from
+ */
+static inline int test_bit(int nr, const volatile unsigned long *addr)
+{
+ return 1UL & (addr[BIT_WORD(nr)] >> (nr & (BITS_PER_LONG-1)));
+}
+
+#endif /* _ASM_GENERIC_BITOPS_NON_ATOMIC_H_ */
--
2.15.0
From: Matthew Wilcox <[email protected]>
The IDR itself now behaves the way the cgroup wrappers made it behave,
so the wrappers are no longer needed.
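The shape of the change, using the allocation wrapper as the example; the
removed wrapper body is shown first, followed by the direct call that
replaces it:

	/* old: the wrapper combined preload, an external spinlock and a
	 * non-sleeping allocation
	 */
	idr_preload(gfp_mask);
	spin_lock_bh(&cgroup_idr_lock);
	ret = idr_alloc(idr, ptr, start, end, gfp_mask & ~__GFP_DIRECT_RECLAIM);
	spin_unlock_bh(&cgroup_idr_lock);
	idr_preload_end();

	/* new: a plain idr_alloc() call gives the same guarantees */
	ret = idr_alloc(idr, ptr, start, end, gfp_mask);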
Signed-off-by: Matthew Wilcox <[email protected]>
---
kernel/cgroup/cgroup.c | 59 ++++++++++----------------------------------------
1 file changed, 11 insertions(+), 48 deletions(-)
diff --git a/kernel/cgroup/cgroup.c b/kernel/cgroup/cgroup.c
index 351b355336d4..67b4c2aa69a9 100644
--- a/kernel/cgroup/cgroup.c
+++ b/kernel/cgroup/cgroup.c
@@ -80,12 +80,6 @@ EXPORT_SYMBOL_GPL(cgroup_mutex);
EXPORT_SYMBOL_GPL(css_set_lock);
#endif
-/*
- * Protects cgroup_idr and css_idr so that IDs can be released without
- * grabbing cgroup_mutex.
- */
-static DEFINE_SPINLOCK(cgroup_idr_lock);
-
/*
* Protects cgroup_file->kn for !self csses. It synchronizes notifications
* against file removal/re-creation across css hiding.
@@ -291,37 +285,6 @@ bool cgroup_on_dfl(const struct cgroup *cgrp)
return cgrp->root == &cgrp_dfl_root;
}
-/* IDR wrappers which synchronize using cgroup_idr_lock */
-static int cgroup_idr_alloc(struct idr *idr, void *ptr, int start, int end,
- gfp_t gfp_mask)
-{
- int ret;
-
- idr_preload(gfp_mask);
- spin_lock_bh(&cgroup_idr_lock);
- ret = idr_alloc(idr, ptr, start, end, gfp_mask & ~__GFP_DIRECT_RECLAIM);
- spin_unlock_bh(&cgroup_idr_lock);
- idr_preload_end();
- return ret;
-}
-
-static void *cgroup_idr_replace(struct idr *idr, void *ptr, int id)
-{
- void *ret;
-
- spin_lock_bh(&cgroup_idr_lock);
- ret = idr_replace(idr, ptr, id);
- spin_unlock_bh(&cgroup_idr_lock);
- return ret;
-}
-
-static void cgroup_idr_remove(struct idr *idr, int id)
-{
- spin_lock_bh(&cgroup_idr_lock);
- idr_remove(idr, id);
- spin_unlock_bh(&cgroup_idr_lock);
-}
-
static bool cgroup_has_tasks(struct cgroup *cgrp)
{
return cgrp->nr_populated_csets;
@@ -1883,7 +1846,7 @@ int cgroup_setup_root(struct cgroup_root *root, u16 ss_mask, int ref_flags)
lockdep_assert_held(&cgroup_mutex);
- ret = cgroup_idr_alloc(&root->cgroup_idr, root_cgrp, 1, 2, GFP_KERNEL);
+ ret = idr_alloc(&root->cgroup_idr, root_cgrp, 1, 2, GFP_KERNEL);
if (ret < 0)
goto out;
root_cgrp->id = ret;
@@ -4532,7 +4495,7 @@ static void css_free_work_fn(struct work_struct *work)
int id = css->id;
ss->css_free(css);
- cgroup_idr_remove(&ss->css_idr, id);
+ idr_remove(&ss->css_idr, id);
cgroup_put(cgrp);
if (parent)
@@ -4589,7 +4552,7 @@ static void css_release_work_fn(struct work_struct *work)
if (ss) {
/* css release path */
- cgroup_idr_replace(&ss->css_idr, NULL, css->id);
+ idr_replace(&ss->css_idr, NULL, css->id);
if (ss->css_released)
ss->css_released(css);
} else {
@@ -4605,7 +4568,7 @@ static void css_release_work_fn(struct work_struct *work)
tcgrp = cgroup_parent(tcgrp))
tcgrp->nr_dying_descendants--;
- cgroup_idr_remove(&cgrp->root->cgroup_idr, cgrp->id);
+ idr_remove(&cgrp->root->cgroup_idr, cgrp->id);
cgrp->id = -1;
/*
@@ -4731,14 +4694,14 @@ static struct cgroup_subsys_state *css_create(struct cgroup *cgrp,
if (err)
goto err_free_css;
- err = cgroup_idr_alloc(&ss->css_idr, NULL, 2, 0, GFP_KERNEL);
+ err = idr_alloc(&ss->css_idr, NULL, 2, 0, GFP_KERNEL);
if (err < 0)
goto err_free_css;
css->id = err;
/* @css is ready to be brought online now, make it visible */
list_add_tail_rcu(&css->sibling, &parent_css->children);
- cgroup_idr_replace(&ss->css_idr, css, css->id);
+ idr_replace(&ss->css_idr, css, css->id);
err = online_css(css);
if (err)
@@ -4794,7 +4757,7 @@ static struct cgroup *cgroup_create(struct cgroup *parent)
* Temporarily set the pointer to NULL, so idr_find() won't return
* a half-baked cgroup.
*/
- cgrp->id = cgroup_idr_alloc(&root->cgroup_idr, NULL, 2, 0, GFP_KERNEL);
+ cgrp->id = idr_alloc(&root->cgroup_idr, NULL, 2, 0, GFP_KERNEL);
if (cgrp->id < 0) {
ret = -ENOMEM;
goto out_stat_exit;
@@ -4833,7 +4796,7 @@ static struct cgroup *cgroup_create(struct cgroup *parent)
* @cgrp is now fully operational. If something fails after this
* point, it'll be released via the normal destruction path.
*/
- cgroup_idr_replace(&root->cgroup_idr, cgrp, cgrp->id);
+ idr_replace(&root->cgroup_idr, cgrp, cgrp->id);
/*
* On the default hierarchy, a child doesn't automatically inherit
@@ -4847,7 +4810,7 @@ static struct cgroup *cgroup_create(struct cgroup *parent)
return cgrp;
out_idr_free:
- cgroup_idr_remove(&root->cgroup_idr, cgrp->id);
+ idr_remove(&root->cgroup_idr, cgrp->id);
out_stat_exit:
if (cgroup_on_dfl(parent))
cgroup_stat_exit(cgrp);
@@ -5166,7 +5129,7 @@ static void __init cgroup_init_subsys(struct cgroup_subsys *ss, bool early)
/* allocation can't be done safely during early init */
css->id = 1;
} else {
- css->id = cgroup_idr_alloc(&ss->css_idr, css, 1, 2, GFP_KERNEL);
+ css->id = idr_alloc(&ss->css_idr, css, 1, 2, GFP_KERNEL);
BUG_ON(css->id < 0);
}
@@ -5273,7 +5236,7 @@ int __init cgroup_init(void)
struct cgroup_subsys_state *css =
init_css_set.subsys[ss->id];
- css->id = cgroup_idr_alloc(&ss->css_idr, css, 1, 2,
+ css->id = idr_alloc(&ss->css_idr, css, 1, 2,
GFP_KERNEL);
BUG_ON(css->id < 0);
} else {
--
2.15.0
From: Matthew Wilcox <[email protected]>
This is a simple rename of the struct xfs_ail members from the xa_ prefix
to ail_, except that xa_ail becomes ail_head.
Signed-off-by: Matthew Wilcox <[email protected]>
---
fs/xfs/xfs_buf_item.c | 10 ++--
fs/xfs/xfs_dquot.c | 4 +-
fs/xfs/xfs_dquot_item.c | 11 ++--
fs/xfs/xfs_inode_item.c | 22 +++----
fs/xfs/xfs_log.c | 6 +-
fs/xfs/xfs_log_recover.c | 80 ++++++++++++-------------
fs/xfs/xfs_trans.c | 18 +++---
fs/xfs/xfs_trans_ail.c | 152 +++++++++++++++++++++++------------------------
fs/xfs/xfs_trans_buf.c | 4 +-
fs/xfs/xfs_trans_priv.h | 42 ++++++-------
10 files changed, 175 insertions(+), 174 deletions(-)
diff --git a/fs/xfs/xfs_buf_item.c b/fs/xfs/xfs_buf_item.c
index e0a0af0946f2..6c5035544a93 100644
--- a/fs/xfs/xfs_buf_item.c
+++ b/fs/xfs/xfs_buf_item.c
@@ -459,7 +459,7 @@ xfs_buf_item_unpin(
bp->b_fspriv = NULL;
bp->b_iodone = NULL;
} else {
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
xfs_trans_ail_delete(ailp, lip, SHUTDOWN_LOG_IO_ERROR);
xfs_buf_item_relse(bp);
ASSERT(bp->b_fspriv == NULL);
@@ -1056,13 +1056,13 @@ xfs_buf_do_callbacks_fail(
struct xfs_log_item *lip = bp->b_fspriv;
struct xfs_ail *ailp = lip->li_ailp;
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
for (; lip; lip = next) {
next = lip->li_bio_list;
if (lip->li_ops->iop_error)
lip->li_ops->iop_error(lip, bp);
}
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
}
static bool
@@ -1215,7 +1215,7 @@ xfs_buf_iodone(
*
* Either way, AIL is useless if we're forcing a shutdown.
*/
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
xfs_trans_ail_delete(ailp, lip, SHUTDOWN_CORRUPT_INCORE);
xfs_buf_item_free(BUF_ITEM(lip));
}
@@ -1236,7 +1236,7 @@ xfs_buf_resubmit_failed_buffers(
/*
* Clear XFS_LI_FAILED flag from all items before resubmit
*
- * XFS_LI_FAILED set/clear is protected by xa_lock, caller this
+ * XFS_LI_FAILED set/clear is protected by ail_lock, caller this
* function already have it acquired
*/
for (; lip; lip = next) {
diff --git a/fs/xfs/xfs_dquot.c b/fs/xfs/xfs_dquot.c
index d57c2db64e59..fbf37b326050 100644
--- a/fs/xfs/xfs_dquot.c
+++ b/fs/xfs/xfs_dquot.c
@@ -973,11 +973,11 @@ xfs_qm_dqflush_done(
lip->li_lsn == qip->qli_flush_lsn) {
/* xfs_trans_ail_delete() drops the AIL lock. */
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
if (lip->li_lsn == qip->qli_flush_lsn)
xfs_trans_ail_delete(ailp, lip, SHUTDOWN_CORRUPT_INCORE);
else
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
}
/*
diff --git a/fs/xfs/xfs_dquot_item.c b/fs/xfs/xfs_dquot_item.c
index 2c7a1629e064..14951799d920 100644
--- a/fs/xfs/xfs_dquot_item.c
+++ b/fs/xfs/xfs_dquot_item.c
@@ -140,8 +140,9 @@ xfs_qm_dqunpin_wait(
STATIC uint
xfs_qm_dquot_logitem_push(
struct xfs_log_item *lip,
- struct list_head *buffer_list) __releases(&lip->li_ailp->xa_lock)
- __acquires(&lip->li_ailp->xa_lock)
+ struct list_head *buffer_list)
+ __releases(&lip->li_ailp->ail_lock)
+ __acquires(&lip->li_ailp->ail_lock)
{
struct xfs_dquot *dqp = DQUOT_ITEM(lip)->qli_dquot;
struct xfs_buf *bp = NULL;
@@ -173,7 +174,7 @@ xfs_qm_dquot_logitem_push(
goto out_unlock;
}
- spin_unlock(&lip->li_ailp->xa_lock);
+ spin_unlock(&lip->li_ailp->ail_lock);
error = xfs_qm_dqflush(dqp, &bp);
if (error) {
@@ -185,7 +186,7 @@ xfs_qm_dquot_logitem_push(
xfs_buf_relse(bp);
}
- spin_lock(&lip->li_ailp->xa_lock);
+ spin_lock(&lip->li_ailp->ail_lock);
out_unlock:
xfs_dqunlock(dqp);
return rval;
@@ -367,7 +368,7 @@ xfs_qm_qoffend_logitem_committed(
* Delete the qoff-start logitem from the AIL.
* xfs_trans_ail_delete() drops the AIL lock.
*/
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
xfs_trans_ail_delete(ailp, &qfs->qql_item, SHUTDOWN_LOG_IO_ERROR);
kmem_free(qfs->qql_item.li_lv_shadow);
diff --git a/fs/xfs/xfs_inode_item.c b/fs/xfs/xfs_inode_item.c
index 6ee5c3bf19ad..071acd4249a0 100644
--- a/fs/xfs/xfs_inode_item.c
+++ b/fs/xfs/xfs_inode_item.c
@@ -501,8 +501,8 @@ STATIC uint
xfs_inode_item_push(
struct xfs_log_item *lip,
struct list_head *buffer_list)
- __releases(&lip->li_ailp->xa_lock)
- __acquires(&lip->li_ailp->xa_lock)
+ __releases(&lip->li_ailp->ail_lock)
+ __acquires(&lip->li_ailp->ail_lock)
{
struct xfs_inode_log_item *iip = INODE_ITEM(lip);
struct xfs_inode *ip = iip->ili_inode;
@@ -561,7 +561,7 @@ xfs_inode_item_push(
ASSERT(iip->ili_fields != 0 || XFS_FORCED_SHUTDOWN(ip->i_mount));
ASSERT(iip->ili_logged == 0 || XFS_FORCED_SHUTDOWN(ip->i_mount));
- spin_unlock(&lip->li_ailp->xa_lock);
+ spin_unlock(&lip->li_ailp->ail_lock);
error = xfs_iflush(ip, &bp);
if (!error) {
@@ -570,7 +570,7 @@ xfs_inode_item_push(
xfs_buf_relse(bp);
}
- spin_lock(&lip->li_ailp->xa_lock);
+ spin_lock(&lip->li_ailp->ail_lock);
out_unlock:
xfs_iunlock(ip, XFS_ILOCK_SHARED);
return rval;
@@ -774,7 +774,7 @@ xfs_iflush_done(
bool mlip_changed = false;
/* this is an opencoded batch version of xfs_trans_ail_delete */
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
for (blip = lip; blip; blip = blip->li_bio_list) {
if (INODE_ITEM(blip)->ili_logged &&
blip->li_lsn == INODE_ITEM(blip)->ili_flush_lsn)
@@ -785,15 +785,15 @@ xfs_iflush_done(
}
if (mlip_changed) {
- if (!XFS_FORCED_SHUTDOWN(ailp->xa_mount))
- xlog_assign_tail_lsn_locked(ailp->xa_mount);
- if (list_empty(&ailp->xa_ail))
- wake_up_all(&ailp->xa_empty);
+ if (!XFS_FORCED_SHUTDOWN(ailp->ail_mount))
+ xlog_assign_tail_lsn_locked(ailp->ail_mount);
+ if (list_empty(&ailp->ail_head))
+ wake_up_all(&ailp->ail_empty);
}
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
if (mlip_changed)
- xfs_log_space_wake(ailp->xa_mount);
+ xfs_log_space_wake(ailp->ail_mount);
}
/*
diff --git a/fs/xfs/xfs_log.c b/fs/xfs/xfs_log.c
index 38d4227895ae..c3f549e6ad23 100644
--- a/fs/xfs/xfs_log.c
+++ b/fs/xfs/xfs_log.c
@@ -1148,7 +1148,7 @@ xlog_assign_tail_lsn_locked(
struct xfs_log_item *lip;
xfs_lsn_t tail_lsn;
- assert_spin_locked(&mp->m_ail->xa_lock);
+ assert_spin_locked(&mp->m_ail->ail_lock);
/*
* To make sure we always have a valid LSN for the log tail we keep
@@ -1171,9 +1171,9 @@ xlog_assign_tail_lsn(
{
xfs_lsn_t tail_lsn;
- spin_lock(&mp->m_ail->xa_lock);
+ spin_lock(&mp->m_ail->ail_lock);
tail_lsn = xlog_assign_tail_lsn_locked(mp);
- spin_unlock(&mp->m_ail->xa_lock);
+ spin_unlock(&mp->m_ail->ail_lock);
return tail_lsn;
}
diff --git a/fs/xfs/xfs_log_recover.c b/fs/xfs/xfs_log_recover.c
index 87b1c331f9eb..0183355cdeaf 100644
--- a/fs/xfs/xfs_log_recover.c
+++ b/fs/xfs/xfs_log_recover.c
@@ -3423,7 +3423,7 @@ xlog_recover_efi_pass2(
}
atomic_set(&efip->efi_next_extent, efi_formatp->efi_nextents);
- spin_lock(&log->l_ailp->xa_lock);
+ spin_lock(&log->l_ailp->ail_lock);
/*
* The EFI has two references. One for the EFD and one for EFI to ensure
* it makes it into the AIL. Insert the EFI into the AIL directly and
@@ -3466,7 +3466,7 @@ xlog_recover_efd_pass2(
* Search for the EFI with the id in the EFD format structure in the
* AIL.
*/
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
lip = xfs_trans_ail_cursor_first(ailp, &cur, 0);
while (lip != NULL) {
if (lip->li_type == XFS_LI_EFI) {
@@ -3476,9 +3476,9 @@ xlog_recover_efd_pass2(
* Drop the EFD reference to the EFI. This
* removes the EFI from the AIL and frees it.
*/
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
xfs_efi_release(efip);
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
break;
}
}
@@ -3486,7 +3486,7 @@ xlog_recover_efd_pass2(
}
xfs_trans_ail_cursor_done(&cur);
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
return 0;
}
@@ -3519,7 +3519,7 @@ xlog_recover_rui_pass2(
}
atomic_set(&ruip->rui_next_extent, rui_formatp->rui_nextents);
- spin_lock(&log->l_ailp->xa_lock);
+ spin_lock(&log->l_ailp->ail_lock);
/*
* The RUI has two references. One for the RUD and one for RUI to ensure
* it makes it into the AIL. Insert the RUI into the AIL directly and
@@ -3559,7 +3559,7 @@ xlog_recover_rud_pass2(
* Search for the RUI with the id in the RUD format structure in the
* AIL.
*/
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
lip = xfs_trans_ail_cursor_first(ailp, &cur, 0);
while (lip != NULL) {
if (lip->li_type == XFS_LI_RUI) {
@@ -3569,9 +3569,9 @@ xlog_recover_rud_pass2(
* Drop the RUD reference to the RUI. This
* removes the RUI from the AIL and frees it.
*/
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
xfs_rui_release(ruip);
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
break;
}
}
@@ -3579,7 +3579,7 @@ xlog_recover_rud_pass2(
}
xfs_trans_ail_cursor_done(&cur);
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
return 0;
}
@@ -3635,7 +3635,7 @@ xlog_recover_cui_pass2(
}
atomic_set(&cuip->cui_next_extent, cui_formatp->cui_nextents);
- spin_lock(&log->l_ailp->xa_lock);
+ spin_lock(&log->l_ailp->ail_lock);
/*
* The CUI has two references. One for the CUD and one for CUI to ensure
* it makes it into the AIL. Insert the CUI into the AIL directly and
@@ -3676,7 +3676,7 @@ xlog_recover_cud_pass2(
* Search for the CUI with the id in the CUD format structure in the
* AIL.
*/
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
lip = xfs_trans_ail_cursor_first(ailp, &cur, 0);
while (lip != NULL) {
if (lip->li_type == XFS_LI_CUI) {
@@ -3686,9 +3686,9 @@ xlog_recover_cud_pass2(
* Drop the CUD reference to the CUI. This
* removes the CUI from the AIL and frees it.
*/
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
xfs_cui_release(cuip);
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
break;
}
}
@@ -3696,7 +3696,7 @@ xlog_recover_cud_pass2(
}
xfs_trans_ail_cursor_done(&cur);
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
return 0;
}
@@ -3754,7 +3754,7 @@ xlog_recover_bui_pass2(
}
atomic_set(&buip->bui_next_extent, bui_formatp->bui_nextents);
- spin_lock(&log->l_ailp->xa_lock);
+ spin_lock(&log->l_ailp->ail_lock);
/*
* The RUI has two references. One for the RUD and one for RUI to ensure
* it makes it into the AIL. Insert the RUI into the AIL directly and
@@ -3795,7 +3795,7 @@ xlog_recover_bud_pass2(
* Search for the BUI with the id in the BUD format structure in the
* AIL.
*/
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
lip = xfs_trans_ail_cursor_first(ailp, &cur, 0);
while (lip != NULL) {
if (lip->li_type == XFS_LI_BUI) {
@@ -3805,9 +3805,9 @@ xlog_recover_bud_pass2(
* Drop the BUD reference to the BUI. This
* removes the BUI from the AIL and frees it.
*/
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
xfs_bui_release(buip);
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
break;
}
}
@@ -3815,7 +3815,7 @@ xlog_recover_bud_pass2(
}
xfs_trans_ail_cursor_done(&cur);
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
return 0;
}
@@ -4648,9 +4648,9 @@ xlog_recover_process_efi(
if (test_bit(XFS_EFI_RECOVERED, &efip->efi_flags))
return 0;
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
error = xfs_efi_recover(mp, efip);
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
return error;
}
@@ -4666,9 +4666,9 @@ xlog_recover_cancel_efi(
efip = container_of(lip, struct xfs_efi_log_item, efi_item);
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
xfs_efi_release(efip);
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
}
/* Recover the RUI if necessary. */
@@ -4688,9 +4688,9 @@ xlog_recover_process_rui(
if (test_bit(XFS_RUI_RECOVERED, &ruip->rui_flags))
return 0;
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
error = xfs_rui_recover(mp, ruip);
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
return error;
}
@@ -4706,9 +4706,9 @@ xlog_recover_cancel_rui(
ruip = container_of(lip, struct xfs_rui_log_item, rui_item);
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
xfs_rui_release(ruip);
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
}
/* Recover the CUI if necessary. */
@@ -4728,9 +4728,9 @@ xlog_recover_process_cui(
if (test_bit(XFS_CUI_RECOVERED, &cuip->cui_flags))
return 0;
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
error = xfs_cui_recover(mp, cuip);
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
return error;
}
@@ -4746,9 +4746,9 @@ xlog_recover_cancel_cui(
cuip = container_of(lip, struct xfs_cui_log_item, cui_item);
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
xfs_cui_release(cuip);
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
}
/* Recover the BUI if necessary. */
@@ -4768,9 +4768,9 @@ xlog_recover_process_bui(
if (test_bit(XFS_BUI_RECOVERED, &buip->bui_flags))
return 0;
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
error = xfs_bui_recover(mp, buip);
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
return error;
}
@@ -4786,9 +4786,9 @@ xlog_recover_cancel_bui(
buip = container_of(lip, struct xfs_bui_log_item, bui_item);
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
xfs_bui_release(buip);
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
}
/* Is this log item a deferred action intent? */
@@ -4834,7 +4834,7 @@ xlog_recover_process_intents(
#endif
ailp = log->l_ailp;
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
lip = xfs_trans_ail_cursor_first(ailp, &cur, 0);
#if defined(DEBUG) || defined(XFS_WARN)
last_lsn = xlog_assign_lsn(log->l_curr_cycle, log->l_curr_block);
@@ -4879,7 +4879,7 @@ xlog_recover_process_intents(
}
out:
xfs_trans_ail_cursor_done(&cur);
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
return error;
}
@@ -4897,7 +4897,7 @@ xlog_recover_cancel_intents(
struct xfs_ail *ailp;
ailp = log->l_ailp;
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
lip = xfs_trans_ail_cursor_first(ailp, &cur, 0);
while (lip != NULL) {
/*
@@ -4931,7 +4931,7 @@ xlog_recover_cancel_intents(
}
xfs_trans_ail_cursor_done(&cur);
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
return error;
}
diff --git a/fs/xfs/xfs_trans.c b/fs/xfs/xfs_trans.c
index a87f657f59c9..756e01999c24 100644
--- a/fs/xfs/xfs_trans.c
+++ b/fs/xfs/xfs_trans.c
@@ -781,8 +781,8 @@ xfs_log_item_batch_insert(
{
int i;
- spin_lock(&ailp->xa_lock);
- /* xfs_trans_ail_update_bulk drops ailp->xa_lock */
+ spin_lock(&ailp->ail_lock);
+ /* xfs_trans_ail_update_bulk drops ailp->ail_lock */
xfs_trans_ail_update_bulk(ailp, cur, log_items, nr_items, commit_lsn);
for (i = 0; i < nr_items; i++) {
@@ -825,9 +825,9 @@ xfs_trans_committed_bulk(
struct xfs_ail_cursor cur;
int i = 0;
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
xfs_trans_ail_cursor_last(ailp, &cur, commit_lsn);
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
/* unpin all the log items */
for (lv = log_vector; lv; lv = lv->lv_next ) {
@@ -847,7 +847,7 @@ xfs_trans_committed_bulk(
* object into the AIL as we are in a shutdown situation.
*/
if (aborted) {
- ASSERT(XFS_FORCED_SHUTDOWN(ailp->xa_mount));
+ ASSERT(XFS_FORCED_SHUTDOWN(ailp->ail_mount));
lip->li_ops->iop_unpin(lip, 1);
continue;
}
@@ -861,11 +861,11 @@ xfs_trans_committed_bulk(
* not affect the AIL cursor the bulk insert path is
* using.
*/
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
if (XFS_LSN_CMP(item_lsn, lip->li_lsn) > 0)
xfs_trans_ail_update(ailp, lip, item_lsn);
else
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
lip->li_ops->iop_unpin(lip, 0);
continue;
}
@@ -883,9 +883,9 @@ xfs_trans_committed_bulk(
if (i)
xfs_log_item_batch_insert(ailp, &cur, log_items, i, commit_lsn);
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
xfs_trans_ail_cursor_done(&cur);
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
}
/*
diff --git a/fs/xfs/xfs_trans_ail.c b/fs/xfs/xfs_trans_ail.c
index cef89f7127d3..d4a2445215e6 100644
--- a/fs/xfs/xfs_trans_ail.c
+++ b/fs/xfs/xfs_trans_ail.c
@@ -40,7 +40,7 @@ xfs_ail_check(
{
xfs_log_item_t *prev_lip;
- if (list_empty(&ailp->xa_ail))
+ if (list_empty(&ailp->ail_head))
return;
/*
@@ -48,11 +48,11 @@ xfs_ail_check(
*/
ASSERT((lip->li_flags & XFS_LI_IN_AIL) != 0);
prev_lip = list_entry(lip->li_ail.prev, xfs_log_item_t, li_ail);
- if (&prev_lip->li_ail != &ailp->xa_ail)
+ if (&prev_lip->li_ail != &ailp->ail_head)
ASSERT(XFS_LSN_CMP(prev_lip->li_lsn, lip->li_lsn) <= 0);
prev_lip = list_entry(lip->li_ail.next, xfs_log_item_t, li_ail);
- if (&prev_lip->li_ail != &ailp->xa_ail)
+ if (&prev_lip->li_ail != &ailp->ail_head)
ASSERT(XFS_LSN_CMP(prev_lip->li_lsn, lip->li_lsn) >= 0);
@@ -69,10 +69,10 @@ static xfs_log_item_t *
xfs_ail_max(
struct xfs_ail *ailp)
{
- if (list_empty(&ailp->xa_ail))
+ if (list_empty(&ailp->ail_head))
return NULL;
- return list_entry(ailp->xa_ail.prev, xfs_log_item_t, li_ail);
+ return list_entry(ailp->ail_head.prev, xfs_log_item_t, li_ail);
}
/*
@@ -84,7 +84,7 @@ xfs_ail_next(
struct xfs_ail *ailp,
xfs_log_item_t *lip)
{
- if (lip->li_ail.next == &ailp->xa_ail)
+ if (lip->li_ail.next == &ailp->ail_head)
return NULL;
return list_first_entry(&lip->li_ail, xfs_log_item_t, li_ail);
@@ -105,11 +105,11 @@ xfs_ail_min_lsn(
xfs_lsn_t lsn = 0;
xfs_log_item_t *lip;
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
lip = xfs_ail_min(ailp);
if (lip)
lsn = lip->li_lsn;
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
return lsn;
}
@@ -124,11 +124,11 @@ xfs_ail_max_lsn(
xfs_lsn_t lsn = 0;
xfs_log_item_t *lip;
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
lip = xfs_ail_max(ailp);
if (lip)
lsn = lip->li_lsn;
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
return lsn;
}
@@ -146,7 +146,7 @@ xfs_trans_ail_cursor_init(
struct xfs_ail_cursor *cur)
{
cur->item = NULL;
- list_add_tail(&cur->list, &ailp->xa_cursors);
+ list_add_tail(&cur->list, &ailp->ail_cursors);
}
/*
@@ -194,7 +194,7 @@ xfs_trans_ail_cursor_clear(
{
struct xfs_ail_cursor *cur;
- list_for_each_entry(cur, &ailp->xa_cursors, list) {
+ list_for_each_entry(cur, &ailp->ail_cursors, list) {
if (cur->item == lip)
cur->item = (struct xfs_log_item *)
((uintptr_t)cur->item | 1);
@@ -222,7 +222,7 @@ xfs_trans_ail_cursor_first(
goto out;
}
- list_for_each_entry(lip, &ailp->xa_ail, li_ail) {
+ list_for_each_entry(lip, &ailp->ail_head, li_ail) {
if (XFS_LSN_CMP(lip->li_lsn, lsn) >= 0)
goto out;
}
@@ -241,7 +241,7 @@ __xfs_trans_ail_cursor_last(
{
xfs_log_item_t *lip;
- list_for_each_entry_reverse(lip, &ailp->xa_ail, li_ail) {
+ list_for_each_entry_reverse(lip, &ailp->ail_head, li_ail) {
if (XFS_LSN_CMP(lip->li_lsn, lsn) <= 0)
return lip;
}
@@ -310,7 +310,7 @@ xfs_ail_splice(
if (lip)
list_splice(list, &lip->li_ail);
else
- list_splice(list, &ailp->xa_ail);
+ list_splice(list, &ailp->ail_head);
}
/*
@@ -335,17 +335,17 @@ xfsaild_push_item(
* If log item pinning is enabled, skip the push and track the item as
* pinned. This can help induce head-behind-tail conditions.
*/
- if (XFS_TEST_ERROR(false, ailp->xa_mount, XFS_ERRTAG_LOG_ITEM_PIN))
+ if (XFS_TEST_ERROR(false, ailp->ail_mount, XFS_ERRTAG_LOG_ITEM_PIN))
return XFS_ITEM_PINNED;
- return lip->li_ops->iop_push(lip, &ailp->xa_buf_list);
+ return lip->li_ops->iop_push(lip, &ailp->ail_buf_list);
}
static long
xfsaild_push(
struct xfs_ail *ailp)
{
- xfs_mount_t *mp = ailp->xa_mount;
+ xfs_mount_t *mp = ailp->ail_mount;
struct xfs_ail_cursor cur;
xfs_log_item_t *lip;
xfs_lsn_t lsn;
@@ -360,30 +360,30 @@ xfsaild_push(
* buffers the last time we ran, force the log first and wait for it
* before pushing again.
*/
- if (ailp->xa_log_flush && ailp->xa_last_pushed_lsn == 0 &&
- (!list_empty_careful(&ailp->xa_buf_list) ||
+ if (ailp->ail_log_flush && ailp->ail_last_pushed_lsn == 0 &&
+ (!list_empty_careful(&ailp->ail_buf_list) ||
xfs_ail_min_lsn(ailp))) {
- ailp->xa_log_flush = 0;
+ ailp->ail_log_flush = 0;
XFS_STATS_INC(mp, xs_push_ail_flush);
xfs_log_force(mp, XFS_LOG_SYNC);
}
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
- /* barrier matches the xa_target update in xfs_ail_push() */
+ /* barrier matches the ail_target update in xfs_ail_push() */
smp_rmb();
- target = ailp->xa_target;
- ailp->xa_target_prev = target;
+ target = ailp->ail_target;
+ ailp->ail_target_prev = target;
- lip = xfs_trans_ail_cursor_first(ailp, &cur, ailp->xa_last_pushed_lsn);
+ lip = xfs_trans_ail_cursor_first(ailp, &cur, ailp->ail_last_pushed_lsn);
if (!lip) {
/*
* If the AIL is empty or our push has reached the end we are
* done now.
*/
xfs_trans_ail_cursor_done(&cur);
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
goto out_done;
}
@@ -404,7 +404,7 @@ xfsaild_push(
XFS_STATS_INC(mp, xs_push_ail_success);
trace_xfs_ail_push(lip);
- ailp->xa_last_pushed_lsn = lsn;
+ ailp->ail_last_pushed_lsn = lsn;
break;
case XFS_ITEM_FLUSHING:
@@ -423,7 +423,7 @@ xfsaild_push(
trace_xfs_ail_flushing(lip);
flushing++;
- ailp->xa_last_pushed_lsn = lsn;
+ ailp->ail_last_pushed_lsn = lsn;
break;
case XFS_ITEM_PINNED:
@@ -431,7 +431,7 @@ xfsaild_push(
trace_xfs_ail_pinned(lip);
stuck++;
- ailp->xa_log_flush++;
+ ailp->ail_log_flush++;
break;
case XFS_ITEM_LOCKED:
XFS_STATS_INC(mp, xs_push_ail_locked);
@@ -468,10 +468,10 @@ xfsaild_push(
lsn = lip->li_lsn;
}
xfs_trans_ail_cursor_done(&cur);
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
- if (xfs_buf_delwri_submit_nowait(&ailp->xa_buf_list))
- ailp->xa_log_flush++;
+ if (xfs_buf_delwri_submit_nowait(&ailp->ail_buf_list))
+ ailp->ail_log_flush++;
if (!count || XFS_LSN_CMP(lsn, target) >= 0) {
out_done:
@@ -481,7 +481,7 @@ xfsaild_push(
* AIL before we start the next scan from the start of the AIL.
*/
tout = 50;
- ailp->xa_last_pushed_lsn = 0;
+ ailp->ail_last_pushed_lsn = 0;
} else if (((stuck + flushing) * 100) / count > 90) {
/*
* Either there is a lot of contention on the AIL or we are
@@ -494,7 +494,7 @@ xfsaild_push(
* the restart to issue a log force to unpin the stuck items.
*/
tout = 20;
- ailp->xa_last_pushed_lsn = 0;
+ ailp->ail_last_pushed_lsn = 0;
} else {
/*
* Assume we have more work to do in a short while.
@@ -536,26 +536,26 @@ xfsaild(
break;
}
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
/*
* Idle if the AIL is empty and we are not racing with a target
* update. We check the AIL after we set the task to a sleep
- * state to guarantee that we either catch an xa_target update
+ * state to guarantee that we either catch an ail_target update
* or that a wake_up resets the state to TASK_RUNNING.
* Otherwise, we run the risk of sleeping indefinitely.
*
- * The barrier matches the xa_target update in xfs_ail_push().
+ * The barrier matches the ail_target update in xfs_ail_push().
*/
smp_rmb();
if (!xfs_ail_min(ailp) &&
- ailp->xa_target == ailp->xa_target_prev) {
- spin_unlock(&ailp->xa_lock);
+ ailp->ail_target == ailp->ail_target_prev) {
+ spin_unlock(&ailp->ail_lock);
freezable_schedule();
tout = 0;
continue;
}
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
if (tout)
freezable_schedule_timeout(msecs_to_jiffies(tout));
@@ -592,8 +592,8 @@ xfs_ail_push(
xfs_log_item_t *lip;
lip = xfs_ail_min(ailp);
- if (!lip || XFS_FORCED_SHUTDOWN(ailp->xa_mount) ||
- XFS_LSN_CMP(threshold_lsn, ailp->xa_target) <= 0)
+ if (!lip || XFS_FORCED_SHUTDOWN(ailp->ail_mount) ||
+ XFS_LSN_CMP(threshold_lsn, ailp->ail_target) <= 0)
return;
/*
@@ -601,10 +601,10 @@ xfs_ail_push(
* the XFS_AIL_PUSHING_BIT.
*/
smp_wmb();
- xfs_trans_ail_copy_lsn(ailp, &ailp->xa_target, &threshold_lsn);
+ xfs_trans_ail_copy_lsn(ailp, &ailp->ail_target, &threshold_lsn);
smp_wmb();
- wake_up_process(ailp->xa_task);
+ wake_up_process(ailp->ail_task);
}
/*
@@ -630,18 +630,18 @@ xfs_ail_push_all_sync(
struct xfs_log_item *lip;
DEFINE_WAIT(wait);
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
while ((lip = xfs_ail_max(ailp)) != NULL) {
- prepare_to_wait(&ailp->xa_empty, &wait, TASK_UNINTERRUPTIBLE);
- ailp->xa_target = lip->li_lsn;
- wake_up_process(ailp->xa_task);
- spin_unlock(&ailp->xa_lock);
+ prepare_to_wait(&ailp->ail_empty, &wait, TASK_UNINTERRUPTIBLE);
+ ailp->ail_target = lip->li_lsn;
+ wake_up_process(ailp->ail_task);
+ spin_unlock(&ailp->ail_lock);
schedule();
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
}
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
- finish_wait(&ailp->xa_empty, &wait);
+ finish_wait(&ailp->ail_empty, &wait);
}
/*
@@ -672,7 +672,7 @@ xfs_trans_ail_update_bulk(
struct xfs_ail_cursor *cur,
struct xfs_log_item **log_items,
int nr_items,
- xfs_lsn_t lsn) __releases(ailp->xa_lock)
+ xfs_lsn_t lsn) __releases(ailp->ail_lock)
{
xfs_log_item_t *mlip;
int mlip_changed = 0;
@@ -705,13 +705,13 @@ xfs_trans_ail_update_bulk(
xfs_ail_splice(ailp, cur, &tmp, lsn);
if (mlip_changed) {
- if (!XFS_FORCED_SHUTDOWN(ailp->xa_mount))
- xlog_assign_tail_lsn_locked(ailp->xa_mount);
- spin_unlock(&ailp->xa_lock);
+ if (!XFS_FORCED_SHUTDOWN(ailp->ail_mount))
+ xlog_assign_tail_lsn_locked(ailp->ail_mount);
+ spin_unlock(&ailp->ail_lock);
- xfs_log_space_wake(ailp->xa_mount);
+ xfs_log_space_wake(ailp->ail_mount);
} else {
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
}
}
@@ -756,13 +756,13 @@ void
xfs_trans_ail_delete(
struct xfs_ail *ailp,
struct xfs_log_item *lip,
- int shutdown_type) __releases(ailp->xa_lock)
+ int shutdown_type) __releases(ailp->ail_lock)
{
- struct xfs_mount *mp = ailp->xa_mount;
+ struct xfs_mount *mp = ailp->ail_mount;
bool mlip_changed;
if (!(lip->li_flags & XFS_LI_IN_AIL)) {
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
if (!XFS_FORCED_SHUTDOWN(mp)) {
xfs_alert_tag(mp, XFS_PTAG_AILDELETE,
"%s: attempting to delete a log item that is not in the AIL",
@@ -776,13 +776,13 @@ xfs_trans_ail_delete(
if (mlip_changed) {
if (!XFS_FORCED_SHUTDOWN(mp))
xlog_assign_tail_lsn_locked(mp);
- if (list_empty(&ailp->xa_ail))
- wake_up_all(&ailp->xa_empty);
+ if (list_empty(&ailp->ail_head))
+ wake_up_all(&ailp->ail_empty);
}
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
if (mlip_changed)
- xfs_log_space_wake(ailp->xa_mount);
+ xfs_log_space_wake(ailp->ail_mount);
}
int
@@ -795,16 +795,16 @@ xfs_trans_ail_init(
if (!ailp)
return -ENOMEM;
- ailp->xa_mount = mp;
- INIT_LIST_HEAD(&ailp->xa_ail);
- INIT_LIST_HEAD(&ailp->xa_cursors);
- spin_lock_init(&ailp->xa_lock);
- INIT_LIST_HEAD(&ailp->xa_buf_list);
- init_waitqueue_head(&ailp->xa_empty);
+ ailp->ail_mount = mp;
+ INIT_LIST_HEAD(&ailp->ail_head);
+ INIT_LIST_HEAD(&ailp->ail_cursors);
+ spin_lock_init(&ailp->ail_lock);
+ INIT_LIST_HEAD(&ailp->ail_buf_list);
+ init_waitqueue_head(&ailp->ail_empty);
- ailp->xa_task = kthread_run(xfsaild, ailp, "xfsaild/%s",
- ailp->xa_mount->m_fsname);
- if (IS_ERR(ailp->xa_task))
+ ailp->ail_task = kthread_run(xfsaild, ailp, "xfsaild/%s",
+ ailp->ail_mount->m_fsname);
+ if (IS_ERR(ailp->ail_task))
goto out_free_ailp;
mp->m_ail = ailp;
@@ -821,6 +821,6 @@ xfs_trans_ail_destroy(
{
struct xfs_ail *ailp = mp->m_ail;
- kthread_stop(ailp->xa_task);
+ kthread_stop(ailp->ail_task);
kmem_free(ailp);
}
diff --git a/fs/xfs/xfs_trans_buf.c b/fs/xfs/xfs_trans_buf.c
index 3ba7a96a8abd..b8871bcfe00b 100644
--- a/fs/xfs/xfs_trans_buf.c
+++ b/fs/xfs/xfs_trans_buf.c
@@ -429,8 +429,8 @@ xfs_trans_brelse(xfs_trans_t *tp,
* If the fs has shutdown and we dropped the last reference, it may fall
* on us to release a (possibly dirty) bli if it never made it to the
* AIL (e.g., the aborted unpin already happened and didn't release it
- * due to our reference). Since we're already shutdown and need xa_lock,
- * just force remove from the AIL and release the bli here.
+ * due to our reference). Since we're already shutdown and need
+ * ail_lock, just force remove from the AIL and release the bli here.
*/
if (XFS_FORCED_SHUTDOWN(tp->t_mountp) && freed) {
xfs_trans_ail_remove(&bip->bli_item, SHUTDOWN_LOG_IO_ERROR);
diff --git a/fs/xfs/xfs_trans_priv.h b/fs/xfs/xfs_trans_priv.h
index b317a3644c00..be24b0c8a332 100644
--- a/fs/xfs/xfs_trans_priv.h
+++ b/fs/xfs/xfs_trans_priv.h
@@ -65,17 +65,17 @@ struct xfs_ail_cursor {
* Eventually we need to drive the locking in here as well.
*/
struct xfs_ail {
- struct xfs_mount *xa_mount;
- struct task_struct *xa_task;
- struct list_head xa_ail;
- xfs_lsn_t xa_target;
- xfs_lsn_t xa_target_prev;
- struct list_head xa_cursors;
- spinlock_t xa_lock;
- xfs_lsn_t xa_last_pushed_lsn;
- int xa_log_flush;
- struct list_head xa_buf_list;
- wait_queue_head_t xa_empty;
+ struct xfs_mount *ail_mount;
+ struct task_struct *ail_task;
+ struct list_head ail_head;
+ xfs_lsn_t ail_target;
+ xfs_lsn_t ail_target_prev;
+ struct list_head ail_cursors;
+ spinlock_t ail_lock;
+ xfs_lsn_t ail_last_pushed_lsn;
+ int ail_log_flush;
+ struct list_head ail_buf_list;
+ wait_queue_head_t ail_empty;
};
/*
@@ -84,7 +84,7 @@ struct xfs_ail {
void xfs_trans_ail_update_bulk(struct xfs_ail *ailp,
struct xfs_ail_cursor *cur,
struct xfs_log_item **log_items, int nr_items,
- xfs_lsn_t lsn) __releases(ailp->xa_lock);
+ xfs_lsn_t lsn) __releases(ailp->ail_lock);
/*
* Return a pointer to the first item in the AIL. If the AIL is empty, then
* return NULL.
@@ -93,7 +93,7 @@ static inline struct xfs_log_item *
xfs_ail_min(
struct xfs_ail *ailp)
{
- return list_first_entry_or_null(&ailp->xa_ail, struct xfs_log_item,
+ return list_first_entry_or_null(&ailp->ail_head, struct xfs_log_item,
li_ail);
}
@@ -101,14 +101,14 @@ static inline void
xfs_trans_ail_update(
struct xfs_ail *ailp,
struct xfs_log_item *lip,
- xfs_lsn_t lsn) __releases(ailp->xa_lock)
+ xfs_lsn_t lsn) __releases(ailp->ail_lock)
{
xfs_trans_ail_update_bulk(ailp, NULL, &lip, 1, lsn);
}
bool xfs_ail_delete_one(struct xfs_ail *ailp, struct xfs_log_item *lip);
void xfs_trans_ail_delete(struct xfs_ail *ailp, struct xfs_log_item *lip,
- int shutdown_type) __releases(ailp->xa_lock);
+ int shutdown_type) __releases(ailp->ail_lock);
static inline void
xfs_trans_ail_remove(
@@ -117,12 +117,12 @@ xfs_trans_ail_remove(
{
struct xfs_ail *ailp = lip->li_ailp;
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
/* xfs_trans_ail_delete() drops the AIL lock */
if (lip->li_flags & XFS_LI_IN_AIL)
xfs_trans_ail_delete(ailp, lip, shutdown_type);
else
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
}
void xfs_ail_push(struct xfs_ail *, xfs_lsn_t);
@@ -149,9 +149,9 @@ xfs_trans_ail_copy_lsn(
xfs_lsn_t *src)
{
ASSERT(sizeof(xfs_lsn_t) == 8); /* don't lock if it shrinks */
- spin_lock(&ailp->xa_lock);
+ spin_lock(&ailp->ail_lock);
*dst = *src;
- spin_unlock(&ailp->xa_lock);
+ spin_unlock(&ailp->ail_lock);
}
#else
static inline void
@@ -172,7 +172,7 @@ xfs_clear_li_failed(
struct xfs_buf *bp = lip->li_buf;
ASSERT(lip->li_flags & XFS_LI_IN_AIL);
- lockdep_assert_held(&lip->li_ailp->xa_lock);
+ lockdep_assert_held(&lip->li_ailp->ail_lock);
if (lip->li_flags & XFS_LI_FAILED) {
lip->li_flags &= ~XFS_LI_FAILED;
@@ -186,7 +186,7 @@ xfs_set_li_failed(
struct xfs_log_item *lip,
struct xfs_buf *bp)
{
- lockdep_assert_held(&lip->li_ailp->xa_lock);
+ lockdep_assert_held(&lip->li_ailp->ail_lock);
if (!(lip->li_flags & XFS_LI_FAILED)) {
xfs_buf_hold(bp);
--
2.15.0
From: Matthew Wilcox <[email protected]>
These IDRs were only being used to allocate unique numbers, not to look
up pointers, so they can use the more space-efficient IDA instead.
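To illustrate the pattern (with made-up names; the real changes are in
the diff below), an IDA user looks roughly like this:

        /* A minimal IDA user: allocate and free bare IDs, no pointers. */
        static DEFINE_IDA(example_ida);

        static int example_alloc_id(void)
        {
                /* Returns an unused ID >= 1, or a negative errno. */
                return ida_simple_get(&example_ida, 1, 0, GFP_KERNEL);
        }

        static void example_free_id(int id)
        {
                ida_simple_remove(&example_ida, id);
        }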
Signed-off-by: Matthew Wilcox <[email protected]>
---
drivers/gpu/drm/vmwgfx/vmwgfx_drv.c | 6 +++---
drivers/gpu/drm/vmwgfx/vmwgfx_drv.h | 2 +-
drivers/gpu/drm/vmwgfx/vmwgfx_resource.c | 28 ++++++++++------------------
3 files changed, 14 insertions(+), 22 deletions(-)
diff --git a/drivers/gpu/drm/vmwgfx/vmwgfx_drv.c b/drivers/gpu/drm/vmwgfx/vmwgfx_drv.c
index 184340d486c3..fc6e04cf071e 100644
--- a/drivers/gpu/drm/vmwgfx/vmwgfx_drv.c
+++ b/drivers/gpu/drm/vmwgfx/vmwgfx_drv.c
@@ -652,7 +652,7 @@ static int vmw_driver_load(struct drm_device *dev, unsigned long chipset)
spin_lock_init(&dev_priv->cursor_lock);
for (i = vmw_res_context; i < vmw_res_max; ++i) {
- idr_init(&dev_priv->res_idr[i]);
+ ida_init(&dev_priv->res_ida[i]);
INIT_LIST_HEAD(&dev_priv->res_lru[i]);
}
@@ -950,7 +950,7 @@ static int vmw_driver_load(struct drm_device *dev, unsigned long chipset)
vmw_ttm_global_release(dev_priv);
out_err0:
for (i = vmw_res_context; i < vmw_res_max; ++i)
- idr_destroy(&dev_priv->res_idr[i]);
+ ida_destroy(&dev_priv->res_ida[i]);
if (dev_priv->ctx.staged_bindings)
vmw_binding_state_free(dev_priv->ctx.staged_bindings);
@@ -1002,7 +1002,7 @@ static void vmw_driver_unload(struct drm_device *dev)
vmw_ttm_global_release(dev_priv);
for (i = vmw_res_context; i < vmw_res_max; ++i)
- idr_destroy(&dev_priv->res_idr[i]);
+ ida_destroy(&dev_priv->res_ida[i]);
kfree(dev_priv);
}
diff --git a/drivers/gpu/drm/vmwgfx/vmwgfx_drv.h b/drivers/gpu/drm/vmwgfx/vmwgfx_drv.h
index 7e5f30e234b1..96866a1e3547 100644
--- a/drivers/gpu/drm/vmwgfx/vmwgfx_drv.h
+++ b/drivers/gpu/drm/vmwgfx/vmwgfx_drv.h
@@ -429,7 +429,7 @@ struct vmw_private {
*/
rwlock_t resource_lock;
- struct idr res_idr[vmw_res_max];
+ struct ida res_ida[vmw_res_max];
/*
* Block lastclose from racing with firstopen.
*/
diff --git a/drivers/gpu/drm/vmwgfx/vmwgfx_resource.c b/drivers/gpu/drm/vmwgfx/vmwgfx_resource.c
index a96f90f017d1..ca0e2a1fd0c5 100644
--- a/drivers/gpu/drm/vmwgfx/vmwgfx_resource.c
+++ b/drivers/gpu/drm/vmwgfx/vmwgfx_resource.c
@@ -80,13 +80,11 @@ vmw_resource_reference_unless_doomed(struct vmw_resource *res)
void vmw_resource_release_id(struct vmw_resource *res)
{
struct vmw_private *dev_priv = res->dev_priv;
- struct idr *idr = &dev_priv->res_idr[res->func->res_type];
+ struct ida *ida = &dev_priv->res_ida[res->func->res_type];
- write_lock(&dev_priv->resource_lock);
if (res->id != -1)
- idr_remove(idr, res->id);
+ ida_simple_remove(ida, res->id);
res->id = -1;
- write_unlock(&dev_priv->resource_lock);
}
static void vmw_resource_release(struct kref *kref)
@@ -95,7 +93,7 @@ static void vmw_resource_release(struct kref *kref)
container_of(kref, struct vmw_resource, kref);
struct vmw_private *dev_priv = res->dev_priv;
int id;
- struct idr *idr = &dev_priv->res_idr[res->func->res_type];
+ struct ida *ida = &dev_priv->res_ida[res->func->res_type];
write_lock(&dev_priv->resource_lock);
res->avail = false;
@@ -132,10 +130,8 @@ static void vmw_resource_release(struct kref *kref)
else
kfree(res);
- write_lock(&dev_priv->resource_lock);
if (id != -1)
- idr_remove(idr, id);
- write_unlock(&dev_priv->resource_lock);
+ ida_simple_remove(ida, id);
}
void vmw_resource_unreference(struct vmw_resource **p_res)
@@ -159,20 +155,16 @@ int vmw_resource_alloc_id(struct vmw_resource *res)
{
struct vmw_private *dev_priv = res->dev_priv;
int ret;
- struct idr *idr = &dev_priv->res_idr[res->func->res_type];
+ struct ida *ida = &dev_priv->res_ida[res->func->res_type];
BUG_ON(res->id != -1);
- idr_preload(GFP_KERNEL);
- write_lock(&dev_priv->resource_lock);
-
- ret = idr_alloc(idr, res, 1, 0, GFP_NOWAIT);
- if (ret >= 0)
- res->id = ret;
+ ret = ida_simple_get(ida, 1, 0, GFP_KERNEL);
+ if (ret < 0)
+ return ret;
- write_unlock(&dev_priv->resource_lock);
- idr_preload_end();
- return ret < 0 ? ret : 0;
+ res->id = ret;
+ return 0;
}
/**
--
2.15.0
From: Matthew Wilcox <[email protected]>
As with shmem_tag_pins(), hold the lock around the entire loop instead
of acquiring & dropping it for each entry we're going to untag.
Signed-off-by: Matthew Wilcox <[email protected]>
---
mm/shmem.c | 59 +++++++++++++++++++++++++----------------------------------
1 file changed, 25 insertions(+), 34 deletions(-)
diff --git a/mm/shmem.c b/mm/shmem.c
index 302fcb62ba1f..e4d28743a666 100644
--- a/mm/shmem.c
+++ b/mm/shmem.c
@@ -2636,9 +2636,7 @@ static void shmem_tag_pins(struct address_space *mapping)
*/
static int shmem_wait_for_pins(struct address_space *mapping)
{
- struct radix_tree_iter iter;
- void **slot;
- pgoff_t start;
+ XA_STATE(xas, 0);
struct page *page;
int error, scan;
@@ -2646,7 +2644,9 @@ static int shmem_wait_for_pins(struct address_space *mapping)
error = 0;
for (scan = 0; scan <= LAST_SCAN; scan++) {
- if (!radix_tree_tagged(&mapping->pages, SHMEM_TAG_PINNED))
+ unsigned int tagged = 0;
+
+ if (!xa_tagged(&mapping->pages, SHMEM_TAG_PINNED))
break;
if (!scan)
@@ -2654,45 +2654,36 @@ static int shmem_wait_for_pins(struct address_space *mapping)
else if (schedule_timeout_killable((HZ << scan) / 200))
scan = LAST_SCAN;
- start = 0;
- rcu_read_lock();
- radix_tree_for_each_tagged(slot, &mapping->pages, &iter,
- start, SHMEM_TAG_PINNED) {
-
- page = radix_tree_deref_slot(slot);
- if (radix_tree_exception(page)) {
- if (radix_tree_deref_retry(page)) {
- slot = radix_tree_iter_retry(&iter);
- continue;
- }
-
- page = NULL;
- }
-
- if (page &&
- page_count(page) - page_mapcount(page) != 1) {
- if (scan < LAST_SCAN)
- goto continue_resched;
-
+ xas_set(&xas, 0);
+ xa_lock_irq(&mapping->pages);
+ xas_for_each_tag(&mapping->pages, &xas, page, ULONG_MAX,
+ SHMEM_TAG_PINNED) {
+ bool clear = true;
+ if (xa_is_value(page))
+ continue;
+ if (page_count(page) - page_mapcount(page) != 1) {
/*
* On the last scan, we clean up all those tags
* we inserted; but make a note that we still
* found pages pinned.
*/
- error = -EBUSY;
+ if (scan == LAST_SCAN)
+ error = -EBUSY;
+ else
+ clear = false;
}
+ if (clear)
+ xas_clear_tag(&mapping->pages, &xas,
+ SHMEM_TAG_PINNED);
+ if (++tagged % XA_CHECK_SCHED)
+ continue;
- xa_lock_irq(&mapping->pages);
- radix_tree_tag_clear(&mapping->pages,
- iter.index, SHMEM_TAG_PINNED);
+ xas_pause(&xas);
xa_unlock_irq(&mapping->pages);
-continue_resched:
- if (need_resched()) {
- slot = radix_tree_iter_resume(slot, &iter);
- cond_resched_rcu();
- }
+ cond_resched();
+ xa_lock_irq(&mapping->pages);
}
- rcu_read_unlock();
+ xa_unlock_irq(&mapping->pages);
}
return error;
--
2.15.0
From: Matthew Wilcox <[email protected]>
The IDR now has its own lock; use it to protect the lookup too.
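Roughly, a lookup under the IDR's own lock looks like this (idr_lock()
and idr_unlock() are the helpers added earlier in this series; my_obj
and my_obj_get() are made-up names):

        static struct my_obj *my_lookup(struct idr *idr, u32 handle)
        {
                struct my_obj *obj;

                idr_lock(idr);
                obj = idr_find(idr, handle);
                if (obj)
                        my_obj_get(obj);        /* take a reference before dropping the lock */
                idr_unlock(idr);

                return obj;
        }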
Signed-off-by: Matthew Wilcox <[email protected]>
---
drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c | 8 ++++----
drivers/gpu/drm/drm_drv.c | 25 +++----------------------
drivers/gpu/drm/drm_gem.c | 27 ++++-----------------------
drivers/gpu/drm/etnaviv/etnaviv_gem_submit.c | 6 +++---
drivers/gpu/drm/i915/i915_debugfs.c | 4 ++--
drivers/gpu/drm/msm/msm_gem_submit.c | 10 +++++-----
drivers/gpu/drm/vc4/vc4_gem.c | 4 ++--
include/drm/drm_file.h | 5 +----
8 files changed, 24 insertions(+), 65 deletions(-)
diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
index a418df1b9422..25ee4eef3279 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
@@ -89,13 +89,13 @@ void amdgpu_gem_force_release(struct amdgpu_device *adev)
int handle;
WARN_ONCE(1, "Still active user space clients!\n");
- spin_lock(&file->table_lock);
+ idr_lock(&file->object_idr);
idr_for_each_entry(&file->object_idr, gobj, handle) {
WARN_ONCE(1, "And also active allocations!\n");
drm_gem_object_put_unlocked(gobj);
}
+ idr_unlock(&file->object_idr);
idr_destroy(&file->object_idr);
- spin_unlock(&file->table_lock);
}
mutex_unlock(&ddev->filelist_mutex);
@@ -824,9 +824,9 @@ static int amdgpu_debugfs_gem_info(struct seq_file *m, void *data)
task ? task->comm : "<unknown>");
rcu_read_unlock();
- spin_lock(&file->table_lock);
+ idr_lock(&file->object_idr);
idr_for_each(&file->object_idr, amdgpu_debugfs_gem_bo_info, m);
- spin_unlock(&file->table_lock);
+ idr_unlock(&file->object_idr);
}
mutex_unlock(&dev->filelist_mutex);
diff --git a/drivers/gpu/drm/drm_drv.c b/drivers/gpu/drm/drm_drv.c
index a934fd5e7e55..df51db70ea39 100644
--- a/drivers/gpu/drm/drm_drv.c
+++ b/drivers/gpu/drm/drm_drv.c
@@ -61,7 +61,6 @@ MODULE_PARM_DESC(debug, "Enable debug output, where each bit enables a debug cat
"\t\tBit 7 (0x80) will enable LEASE messages (leasing code)");
module_param_named(debug, drm_debug, int, 0600);
-static DEFINE_SPINLOCK(drm_minor_lock);
static struct idr drm_minors_idr;
/*
@@ -153,7 +152,6 @@ static struct drm_minor **drm_minor_get_slot(struct drm_device *dev,
static int drm_minor_alloc(struct drm_device *dev, unsigned int type)
{
struct drm_minor *minor;
- unsigned long flags;
int r;
minor = kzalloc(sizeof(*minor), GFP_KERNEL);
@@ -163,15 +161,11 @@ static int drm_minor_alloc(struct drm_device *dev, unsigned int type)
minor->type = type;
minor->dev = dev;
- idr_preload(GFP_KERNEL);
- spin_lock_irqsave(&drm_minor_lock, flags);
r = idr_alloc(&drm_minors_idr,
NULL,
64 * type,
64 * (type + 1),
- GFP_NOWAIT);
- spin_unlock_irqrestore(&drm_minor_lock, flags);
- idr_preload_end();
+ GFP_KERNEL);
if (r < 0)
goto err_free;
@@ -188,9 +182,7 @@ static int drm_minor_alloc(struct drm_device *dev, unsigned int type)
return 0;
err_index:
- spin_lock_irqsave(&drm_minor_lock, flags);
idr_remove(&drm_minors_idr, minor->index);
- spin_unlock_irqrestore(&drm_minor_lock, flags);
err_free:
kfree(minor);
return r;
@@ -199,7 +191,6 @@ static int drm_minor_alloc(struct drm_device *dev, unsigned int type)
static void drm_minor_free(struct drm_device *dev, unsigned int type)
{
struct drm_minor **slot, *minor;
- unsigned long flags;
slot = drm_minor_get_slot(dev, type);
minor = *slot;
@@ -207,11 +198,7 @@ static void drm_minor_free(struct drm_device *dev, unsigned int type)
return;
put_device(minor->kdev);
-
- spin_lock_irqsave(&drm_minor_lock, flags);
idr_remove(&drm_minors_idr, minor->index);
- spin_unlock_irqrestore(&drm_minor_lock, flags);
-
kfree(minor);
*slot = NULL;
}
@@ -219,7 +206,6 @@ static void drm_minor_free(struct drm_device *dev, unsigned int type)
static int drm_minor_register(struct drm_device *dev, unsigned int type)
{
struct drm_minor *minor;
- unsigned long flags;
int ret;
DRM_DEBUG("\n");
@@ -239,9 +225,7 @@ static int drm_minor_register(struct drm_device *dev, unsigned int type)
goto err_debugfs;
/* replace NULL with @minor so lookups will succeed from now on */
- spin_lock_irqsave(&drm_minor_lock, flags);
idr_replace(&drm_minors_idr, minor, minor->index);
- spin_unlock_irqrestore(&drm_minor_lock, flags);
DRM_DEBUG("new minor registered %d\n", minor->index);
return 0;
@@ -254,16 +238,13 @@ static int drm_minor_register(struct drm_device *dev, unsigned int type)
static void drm_minor_unregister(struct drm_device *dev, unsigned int type)
{
struct drm_minor *minor;
- unsigned long flags;
minor = *drm_minor_get_slot(dev, type);
if (!minor || !device_is_registered(minor->kdev))
return;
/* replace @minor with NULL so lookups will fail from now on */
- spin_lock_irqsave(&drm_minor_lock, flags);
idr_replace(&drm_minors_idr, NULL, minor->index);
- spin_unlock_irqrestore(&drm_minor_lock, flags);
device_del(minor->kdev);
dev_set_drvdata(minor->kdev, NULL); /* safety belt */
@@ -284,11 +265,11 @@ struct drm_minor *drm_minor_acquire(unsigned int minor_id)
struct drm_minor *minor;
unsigned long flags;
- spin_lock_irqsave(&drm_minor_lock, flags);
+ idr_lock_irqsave(&drm_minors_idr, flags);
minor = idr_find(&drm_minors_idr, minor_id);
if (minor)
drm_dev_get(minor->dev);
- spin_unlock_irqrestore(&drm_minor_lock, flags);
+ idr_unlock_irqrestore(&drm_minors_idr, flags);
if (!minor) {
return ERR_PTR(-ENODEV);
diff --git a/drivers/gpu/drm/drm_gem.c b/drivers/gpu/drm/drm_gem.c
index 55d6182555c7..7d45b3ef4dd6 100644
--- a/drivers/gpu/drm/drm_gem.c
+++ b/drivers/gpu/drm/drm_gem.c
@@ -282,11 +282,8 @@ drm_gem_handle_delete(struct drm_file *filp, u32 handle)
{
struct drm_gem_object *obj;
- spin_lock(&filp->table_lock);
-
/* Check if we currently have a reference on the object */
obj = idr_replace(&filp->object_idr, NULL, handle);
- spin_unlock(&filp->table_lock);
if (IS_ERR_OR_NULL(obj))
return -EINVAL;
@@ -294,9 +291,7 @@ drm_gem_handle_delete(struct drm_file *filp, u32 handle)
drm_gem_object_release_handle(handle, obj, filp);
/* And finally make the handle available for future allocations. */
- spin_lock(&filp->table_lock);
idr_remove(&filp->object_idr, handle);
- spin_unlock(&filp->table_lock);
return 0;
}
@@ -387,17 +382,8 @@ drm_gem_handle_create_tail(struct drm_file *file_priv,
if (obj->handle_count++ == 0)
drm_gem_object_get(obj);
- /*
- * Get the user-visible handle using idr. Preload and perform
- * allocation under our spinlock.
- */
- idr_preload(GFP_KERNEL);
- spin_lock(&file_priv->table_lock);
-
- ret = idr_alloc(&file_priv->object_idr, obj, 1, 0, GFP_NOWAIT);
-
- spin_unlock(&file_priv->table_lock);
- idr_preload_end();
+ /* Get the user-visible handle using idr. */
+ ret = idr_alloc(&file_priv->object_idr, obj, 1, 0, GFP_KERNEL);
mutex_unlock(&dev->object_name_lock);
if (ret < 0)
@@ -421,9 +407,7 @@ drm_gem_handle_create_tail(struct drm_file *file_priv,
err_revoke:
drm_vma_node_revoke(&obj->vma_node, file_priv);
err_remove:
- spin_lock(&file_priv->table_lock);
idr_remove(&file_priv->object_idr, handle);
- spin_unlock(&file_priv->table_lock);
err_unref:
drm_gem_object_handle_put_unlocked(obj);
return ret;
@@ -634,14 +618,12 @@ drm_gem_object_lookup(struct drm_file *filp, u32 handle)
{
struct drm_gem_object *obj;
- spin_lock(&filp->table_lock);
-
/* Check if we currently have a reference on the object */
+ idr_lock(&filp->object_idr);
obj = idr_find(&filp->object_idr, handle);
if (obj)
drm_gem_object_get(obj);
-
- spin_unlock(&filp->table_lock);
+ idr_unlock(&filp->object_idr);
return obj;
}
@@ -776,7 +758,6 @@ void
drm_gem_open(struct drm_device *dev, struct drm_file *file_private)
{
idr_init(&file_private->object_idr);
- spin_lock_init(&file_private->table_lock);
}
/**
diff --git a/drivers/gpu/drm/etnaviv/etnaviv_gem_submit.c b/drivers/gpu/drm/etnaviv/etnaviv_gem_submit.c
index ff911541a190..5298d9e78523 100644
--- a/drivers/gpu/drm/etnaviv/etnaviv_gem_submit.c
+++ b/drivers/gpu/drm/etnaviv/etnaviv_gem_submit.c
@@ -61,7 +61,7 @@ static int submit_lookup_objects(struct etnaviv_gem_submit *submit,
unsigned i;
int ret = 0;
- spin_lock(&file->table_lock);
+ idr_lock(&file->object_idr);
for (i = 0, bo = submit_bos; i < nr_bos; i++, bo++) {
struct drm_gem_object *obj;
@@ -75,7 +75,7 @@ static int submit_lookup_objects(struct etnaviv_gem_submit *submit,
submit->bos[i].flags = bo->flags;
/* normally use drm_gem_object_lookup(), but for bulk lookup
- * all under single table_lock just hit object_idr directly:
+ * all under single lock just hit object_idr directly:
*/
obj = idr_find(&file->object_idr, bo->handle);
if (!obj) {
@@ -96,7 +96,7 @@ static int submit_lookup_objects(struct etnaviv_gem_submit *submit,
out_unlock:
submit->nr_bos = i;
- spin_unlock(&file->table_lock);
+ idr_unlock(&file->object_idr);
return ret;
}
diff --git a/drivers/gpu/drm/i915/i915_debugfs.c b/drivers/gpu/drm/i915/i915_debugfs.c
index c65e381b85f3..1d88bd0852c3 100644
--- a/drivers/gpu/drm/i915/i915_debugfs.c
+++ b/drivers/gpu/drm/i915/i915_debugfs.c
@@ -544,9 +544,9 @@ static int i915_gem_object_info(struct seq_file *m, void *data)
memset(&stats, 0, sizeof(stats));
stats.file_priv = file->driver_priv;
- spin_lock(&file->table_lock);
+ idr_lock(&file->object_idr);
idr_for_each(&file->object_idr, per_file_stats, &stats);
- spin_unlock(&file->table_lock);
+ idr_unlock(&file->object_idr);
/*
* Although we have a valid reference on file->pid, that does
* not guarantee that the task_struct who called get_pid() is
diff --git a/drivers/gpu/drm/msm/msm_gem_submit.c b/drivers/gpu/drm/msm/msm_gem_submit.c
index b8dc8f96caf2..aa8d0d6e40cc 100644
--- a/drivers/gpu/drm/msm/msm_gem_submit.c
+++ b/drivers/gpu/drm/msm/msm_gem_submit.c
@@ -88,7 +88,7 @@ static int submit_lookup_objects(struct msm_gem_submit *submit,
unsigned i;
int ret = 0;
- spin_lock(&file->table_lock);
+ idr_lock(&file->object_idr);
pagefault_disable();
for (i = 0; i < args->nr_bos; i++) {
@@ -105,12 +105,12 @@ static int submit_lookup_objects(struct msm_gem_submit *submit,
if (copy_from_user_inatomic(&submit_bo, userptr, sizeof(submit_bo))) {
pagefault_enable();
- spin_unlock(&file->table_lock);
+ idr_unlock(&file->object_idr);
if (copy_from_user(&submit_bo, userptr, sizeof(submit_bo))) {
ret = -EFAULT;
goto out;
}
- spin_lock(&file->table_lock);
+ idr_lock(&file->object_idr);
pagefault_disable();
}
@@ -126,7 +126,7 @@ static int submit_lookup_objects(struct msm_gem_submit *submit,
submit->bos[i].iova = submit_bo.presumed;
/* normally use drm_gem_object_lookup(), but for bulk lookup
- * all under single table_lock just hit object_idr directly:
+ * all under single lock just hit object_idr directly:
*/
obj = idr_find(&file->object_idr, submit_bo.handle);
if (!obj) {
@@ -153,7 +153,7 @@ static int submit_lookup_objects(struct msm_gem_submit *submit,
out_unlock:
pagefault_enable();
- spin_unlock(&file->table_lock);
+ idr_unlock(&file->object_idr);
out:
submit->nr_bos = i;
diff --git a/drivers/gpu/drm/vc4/vc4_gem.c b/drivers/gpu/drm/vc4/vc4_gem.c
index e00ac2f3a264..3dfd4fa103c3 100644
--- a/drivers/gpu/drm/vc4/vc4_gem.c
+++ b/drivers/gpu/drm/vc4/vc4_gem.c
@@ -713,7 +713,7 @@ vc4_cl_lookup_bos(struct drm_device *dev,
goto fail;
}
- spin_lock(&file_priv->table_lock);
+ idr_lock(&file_priv->object_idr);
for (i = 0; i < exec->bo_count; i++) {
struct drm_gem_object *bo = idr_find(&file_priv->object_idr,
handles[i]);
@@ -727,7 +727,7 @@ vc4_cl_lookup_bos(struct drm_device *dev,
drm_gem_object_get(bo);
exec->bo[i] = (struct drm_gem_cma_object *)bo;
}
- spin_unlock(&file_priv->table_lock);
+ idr_unlock(&file_priv->object_idr);
if (ret)
goto fail_put_bo;
diff --git a/include/drm/drm_file.h b/include/drm/drm_file.h
index 0e0c868451a5..dd3520d8ce60 100644
--- a/include/drm/drm_file.h
+++ b/include/drm/drm_file.h
@@ -225,13 +225,10 @@ struct drm_file {
* @object_idr:
*
* Mapping of mm object handles to object pointers. Used by the GEM
- * subsystem. Protected by @table_lock.
+ * subsystem.
*/
struct idr object_idr;
- /** @table_lock: Protects @object_idr. */
- spinlock_t table_lock;
-
/** @syncobj_idr: Mapping of sync object handles to object pointers. */
struct idr syncobj_idr;
/** @syncobj_table_lock: Protects @syncobj_idr. */
--
2.15.0
From: Matthew Wilcox <[email protected]>
This works like doing cmpxchg() on an array entry. Code which wants
the radix_tree_insert() semantic of not overwriting an existing entry
can cmpxchg() with NULL and get the action it wants. Plus, instead of
having an error returned, they get the value currently stored in the
array, which often saves them a subsequent lookup.
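For example, a caller that wants insert-only semantics could do
something like this (a sketch, not code from any converted user;
my_insert() is a made-up name):

        static int my_insert(struct xarray *xa, unsigned long index, void *item)
        {
                void *curr = xa_cmpxchg(xa, index, NULL, item, GFP_KERNEL);

                if (IS_ERR(curr))
                        return PTR_ERR(curr);   /* e.g. -ENOMEM */
                if (curr)
                        return -EEXIST;         /* slot occupied; curr is the existing entry */
                return 0;                       /* item is now stored at index */
        }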
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/xarray.h | 2 ++
lib/xarray.c | 38 ++++++++++++++++++++++++++++++++++++++
2 files changed, 40 insertions(+)
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index 5e975c512018..274dd7530e40 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -83,6 +83,8 @@ struct xarray {
void *xa_load(struct xarray *, unsigned long index);
void *xa_store(struct xarray *, unsigned long index, void *entry, gfp_t);
+void *xa_cmpxchg(struct xarray *, unsigned long index,
+ void *old, void *entry, gfp_t);
/**
* xa_empty() - Determine if an array has any present entries
diff --git a/lib/xarray.c b/lib/xarray.c
index a3f4f4ab673f..82f39d86fc76 100644
--- a/lib/xarray.c
+++ b/lib/xarray.c
@@ -818,6 +818,44 @@ void *xa_store(struct xarray *xa, unsigned long index, void *entry, gfp_t gfp)
}
EXPORT_SYMBOL(xa_store);
+/**
+ * xa_cmpxchg() - Conditionally replace an entry in the XArray.
+ * @xa: XArray.
+ * @index: Index into array.
+ * @old: Old value to test against.
+ * @entry: New value to place in array.
+ * @gfp: Allocation flags.
+ *
+ * If the entry at @index is the same as @old, replace it with @entry.
+ * If the return value is equal to @old, then the exchange was successful.
+ *
+ * Return: The old value at this index or ERR_PTR() if an error happened.
+ */
+void *xa_cmpxchg(struct xarray *xa, unsigned long index,
+ void *old, void *entry, gfp_t gfp)
+{
+ XA_STATE(xas, index);
+ unsigned long flags;
+ void *curr;
+
+ if (WARN_ON_ONCE(xa_is_internal(entry)))
+ return ERR_PTR(-EINVAL);
+
+ do {
+ xa_lock_irqsave(xa, flags);
+ curr = xas_create(xa, &xas);
+ if (curr == old)
+ xas_store(xa, &xas, entry);
+ xa_unlock_irqrestore(xa, flags);
+ } while (xas_nomem(&xas, gfp));
+ xas_destroy(&xas);
+
+ if (xas_error(&xas))
+ curr = ERR_PTR(xas_error(&xas));
+ return curr;
+}
+EXPORT_SYMBOL(xa_cmpxchg);
+
/**
* __xa_set_tag() - Set this tag on this entry.
* @xa: XArray.
--
2.15.0
From: Matthew Wilcox <[email protected]>
Simplify the locking by holding the spinlock across the whole tree
walk, on the assumption that repeatedly acquiring and releasing the
lock costs more than holding it for a (potentially) long time.
We could replicate the same locking behaviour with the xarray, but would
have to be careful that the xa_node wasn't RCU-freed under us before we
took the lock.
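Stripped of the shmem details, the shape of the walk is roughly the
following (want_tag() stands in for the page_count()/page_mapcount()
test; the hold time is bounded by pausing every XA_CHECK_SCHED entries):

        static void tag_matching_entries(struct address_space *mapping)
        {
                XA_STATE(xas, 0);
                struct page *page;
                unsigned int tagged = 0;

                xa_lock_irq(&mapping->pages);
                xas_for_each(&mapping->pages, &xas, page, ULONG_MAX) {
                        if (xa_is_value(page))
                                continue;
                        if (want_tag(page))
                                xas_set_tag(&mapping->pages, &xas, SHMEM_TAG_PINNED);
                        if (++tagged % XA_CHECK_SCHED)
                                continue;
                        /* Remember our place, drop the lock and let others run. */
                        xas_pause(&xas);
                        xa_unlock_irq(&mapping->pages);
                        cond_resched();
                        xa_lock_irq(&mapping->pages);
                }
                xa_unlock_irq(&mapping->pages);
        }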
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/xarray.h | 8 ++++++++
mm/shmem.c | 39 ++++++++++++++++-----------------------
2 files changed, 24 insertions(+), 23 deletions(-)
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index 1648eda4a20d..dda61b5c8e49 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -669,6 +669,14 @@ static inline void *xas_next_tag(struct xarray *xa, struct xa_state *xas,
return xa_entry(xa, node, offset);
}
+/*
+ * If iterating while holding a lock, drop the lock and reschedule
+ * every %XA_CHECK_SCHED loops.
+ */
+enum {
+ XA_CHECK_SCHED = 4096,
+};
+
/**
* xas_for_each() - Iterate over a range of an XArray
* @xa: XArray.
diff --git a/mm/shmem.c b/mm/shmem.c
index b6f7fc283aad..302fcb62ba1f 100644
--- a/mm/shmem.c
+++ b/mm/shmem.c
@@ -2601,35 +2601,28 @@ static loff_t shmem_file_llseek(struct file *file, loff_t offset, int whence)
static void shmem_tag_pins(struct address_space *mapping)
{
- struct radix_tree_iter iter;
- void **slot;
- pgoff_t start;
+ XA_STATE(xas, 0);
struct page *page;
+ unsigned int tagged = 0;
lru_add_drain();
- start = 0;
- rcu_read_lock();
- radix_tree_for_each_slot(slot, &mapping->pages, &iter, start) {
- page = radix_tree_deref_slot(slot);
- if (!page || radix_tree_exception(page)) {
- if (radix_tree_deref_retry(page)) {
- slot = radix_tree_iter_retry(&iter);
- continue;
- }
- } else if (page_count(page) - page_mapcount(page) > 1) {
- xa_lock_irq(&mapping->pages);
- radix_tree_tag_set(&mapping->pages, iter.index,
- SHMEM_TAG_PINNED);
- xa_unlock_irq(&mapping->pages);
- }
+ xa_lock_irq(&mapping->pages);
+ xas_for_each(&mapping->pages, &xas, page, ULONG_MAX) {
+ if (xa_is_value(page))
+ continue;
+ if (page_count(page) - page_mapcount(page) > 1)
+ xas_set_tag(&mapping->pages, &xas, SHMEM_TAG_PINNED);
- if (need_resched()) {
- slot = radix_tree_iter_resume(slot, &iter);
- cond_resched_rcu();
- }
+ if (++tagged % XA_CHECK_SCHED)
+ continue;
+
+ xas_pause(&xas);
+ xa_unlock_irq(&mapping->pages);
+ cond_resched();
+ xa_lock_irq(&mapping->pages);
}
- rcu_read_unlock();
+ xa_unlock_irq(&mapping->pages);
}
/*
--
2.15.0
From: Matthew Wilcox <[email protected]>
The ipc code follows a different pattern to most IDR preload users;
the lock that it is holding is embedded in the object, not protecting
the IDR (the IDR is protected by an rwsem). Instead of dropping the
lock, allocating memory and retrying, we reserve a slot in the tree
before grabbing the lock, then merely replace the NULL entry with the
initialised object.
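In outline, the new flow is (error handling and the rhashtable insert
omitted; ipc_idr_alloc() is the helper redefined below to store NULL):

        id = ipc_idr_alloc(ids);                /* reserve a slot, stored as NULL */
        if (id < 0)
                return id;

        spin_lock_init(&new->lock);
        spin_lock(&new->lock);                  /* object lock, taken before publication */
        /* ... finish initialising 'new' ... */

        idr_replace(&ids->ipcs_idr, new, id);   /* publish the initialised object */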
Signed-off-by: Matthew Wilcox <[email protected]>
---
ipc/util.c | 28 ++++++++++++----------------
1 file changed, 12 insertions(+), 16 deletions(-)
diff --git a/ipc/util.c b/ipc/util.c
index ff045fec8d83..9a20354ce4fb 100644
--- a/ipc/util.c
+++ b/ipc/util.c
@@ -193,10 +193,10 @@ static struct kern_ipc_perm *ipc_findkey(struct ipc_ids *ids, key_t key)
/*
* Specify desired id for next allocated IPC object.
*/
-#define ipc_idr_alloc(ids, new) \
- idr_alloc(&(ids)->ipcs_idr, (new), \
+#define ipc_idr_alloc(ids) \
+ idr_alloc(&(ids)->ipcs_idr, NULL, \
(ids)->next_id < 0 ? 0 : ipcid_to_idx((ids)->next_id),\
- 0, GFP_NOWAIT)
+ 0, GFP_KERNEL)
static inline int ipc_buildid(int id, struct ipc_ids *ids,
struct kern_ipc_perm *new)
@@ -214,8 +214,8 @@ static inline int ipc_buildid(int id, struct ipc_ids *ids,
}
#else
-#define ipc_idr_alloc(ids, new) \
- idr_alloc(&(ids)->ipcs_idr, (new), 0, 0, GFP_NOWAIT)
+#define ipc_idr_alloc(ids) \
+ idr_alloc(&(ids)->ipcs_idr, NULL, 0, 0, GFP_KERNEL)
static inline int ipc_buildid(int id, struct ipc_ids *ids,
struct kern_ipc_perm *new)
@@ -254,34 +254,30 @@ int ipc_addid(struct ipc_ids *ids, struct kern_ipc_perm *new, int limit)
if (!ids->tables_initialized || ids->in_use >= limit)
return -ENOSPC;
- idr_preload(GFP_KERNEL);
+ id = ipc_idr_alloc(ids);
+ if (id < 0)
+ return id;
refcount_set(&new->refcount, 1);
spin_lock_init(&new->lock);
new->deleted = false;
- rcu_read_lock();
spin_lock(&new->lock);
current_euid_egid(&euid, &egid);
new->cuid = new->uid = euid;
new->gid = new->cgid = egid;
- id = ipc_idr_alloc(ids, new);
- idr_preload_end();
+ idr_replace(&ids->ipcs_idr, new, id);
- if (id >= 0 && new->key != IPC_PRIVATE) {
+ if (new->key != IPC_PRIVATE) {
err = rhashtable_insert_fast(&ids->key_ht, &new->khtnode,
ipc_kht_params);
if (err < 0) {
idr_remove(&ids->ipcs_idr, id);
- id = err;
+ spin_unlock(&new->lock);
+ return err;
}
}
- if (id < 0) {
- spin_unlock(&new->lock);
- rcu_read_unlock();
- return id;
- }
ids->in_use++;
if (id > ids->max_id)
--
2.15.0
From: Matthew Wilcox <[email protected]>
The radix tree of vmap blocks is simpler to express as an XArray.
Saves a couple of hundred bytes of text and eliminates a user of the
radix tree preload API.
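Distilled, the three operations the conversion relies on look like this
('blocks' stands in for vmap_block_tree; the real code is in the diff
below):

        static DEFINE_XARRAY(blocks);

        static int block_insert(unsigned long idx, struct vmap_block *vb)
        {
                /* xa_store() allocates its own nodes: no preload, no external lock. */
                void *old = xa_store(&blocks, idx, vb, GFP_KERNEL);

                return IS_ERR(old) ? PTR_ERR(old) : 0;
        }

        static struct vmap_block *block_lookup(unsigned long idx)
        {
                return xa_load(&blocks, idx);   /* no explicit rcu_read_lock() needed */
        }

        static void block_erase(unsigned long idx)
        {
                xa_store(&blocks, idx, NULL, GFP_NOWAIT);       /* storing NULL removes the entry */
        }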
Signed-off-by: Matthew Wilcox <[email protected]>
---
mm/vmalloc.c | 37 ++++++++++++-------------------------
1 file changed, 12 insertions(+), 25 deletions(-)
diff --git a/mm/vmalloc.c b/mm/vmalloc.c
index 673942094328..cc5de1ee6520 100644
--- a/mm/vmalloc.c
+++ b/mm/vmalloc.c
@@ -821,12 +821,11 @@ struct vmap_block {
static DEFINE_PER_CPU(struct vmap_block_queue, vmap_block_queue);
/*
- * Radix tree of vmap blocks, indexed by address, to quickly find a vmap block
+ * XArray of vmap blocks, indexed by address, to quickly find a vmap block
* in the free path. Could get rid of this if we change the API to return a
* "cookie" from alloc, to be passed to free. But no big deal yet.
*/
-static DEFINE_SPINLOCK(vmap_block_tree_lock);
-static RADIX_TREE(vmap_block_tree, GFP_ATOMIC);
+static DEFINE_XARRAY(vmap_block_tree);
/*
* We should probably have a fallback mechanism to allocate virtual memory
@@ -865,8 +864,8 @@ static void *new_vmap_block(unsigned int order, gfp_t gfp_mask)
struct vmap_block *vb;
struct vmap_area *va;
unsigned long vb_idx;
- int node, err;
- void *vaddr;
+ int node;
+ void *ret, *vaddr;
node = numa_node_id();
@@ -883,13 +882,6 @@ static void *new_vmap_block(unsigned int order, gfp_t gfp_mask)
return ERR_CAST(va);
}
- err = radix_tree_preload(gfp_mask);
- if (unlikely(err)) {
- kfree(vb);
- free_vmap_area(va);
- return ERR_PTR(err);
- }
-
vaddr = vmap_block_vaddr(va->va_start, 0);
spin_lock_init(&vb->lock);
vb->va = va;
@@ -902,11 +894,12 @@ static void *new_vmap_block(unsigned int order, gfp_t gfp_mask)
INIT_LIST_HEAD(&vb->free_list);
vb_idx = addr_to_vb_idx(va->va_start);
- spin_lock(&vmap_block_tree_lock);
- err = radix_tree_insert(&vmap_block_tree, vb_idx, vb);
- spin_unlock(&vmap_block_tree_lock);
- BUG_ON(err);
- radix_tree_preload_end();
+ ret = xa_store(&vmap_block_tree, vb_idx, vb, gfp_mask);
+ if (IS_ERR(ret)) {
+ kfree(vb);
+ free_vmap_area(va);
+ return ret;
+ }
vbq = &get_cpu_var(vmap_block_queue);
spin_lock(&vbq->lock);
@@ -923,9 +916,7 @@ static void free_vmap_block(struct vmap_block *vb)
unsigned long vb_idx;
vb_idx = addr_to_vb_idx(vb->va->va_start);
- spin_lock(&vmap_block_tree_lock);
- tmp = radix_tree_delete(&vmap_block_tree, vb_idx);
- spin_unlock(&vmap_block_tree_lock);
+ tmp = xa_store(&vmap_block_tree, vb_idx, NULL, GFP_NOWAIT);
BUG_ON(tmp != vb);
free_vmap_area_noflush(vb->va);
@@ -1031,7 +1022,6 @@ static void *vb_alloc(unsigned long size, gfp_t gfp_mask)
static void vb_free(const void *addr, unsigned long size)
{
unsigned long offset;
- unsigned long vb_idx;
unsigned int order;
struct vmap_block *vb;
@@ -1045,10 +1035,7 @@ static void vb_free(const void *addr, unsigned long size)
offset = (unsigned long)addr & (VMAP_BLOCK_SIZE - 1);
offset >>= PAGE_SHIFT;
- vb_idx = addr_to_vb_idx((unsigned long)addr);
- rcu_read_lock();
- vb = radix_tree_lookup(&vmap_block_tree, vb_idx);
- rcu_read_unlock();
+ vb = xa_load(&vmap_block_tree, addr_to_vb_idx((unsigned long)addr));
BUG_ON(!vb);
vunmap_page_range((unsigned long)addr, (unsigned long)addr + size);
--
2.15.0
From: Matthew Wilcox <[email protected]>
The IDR now has its own locking, so remove the dca lock and calls to
idr_preload. Also, there is no need to call idr_destroy on a freshly
initialised IDR.
Signed-off-by: Matthew Wilcox <[email protected]>
---
drivers/dca/dca-sysfs.c | 22 ++++------------------
1 file changed, 4 insertions(+), 18 deletions(-)
diff --git a/drivers/dca/dca-sysfs.c b/drivers/dca/dca-sysfs.c
index 126cf295b198..afa1d5c0f55e 100644
--- a/drivers/dca/dca-sysfs.c
+++ b/drivers/dca/dca-sysfs.c
@@ -31,7 +31,6 @@
static struct class *dca_class;
static struct idr dca_idr;
-static spinlock_t dca_idr_lock;
int dca_sysfs_add_req(struct dca_provider *dca, struct device *dev, int slot)
{
@@ -55,23 +54,14 @@ int dca_sysfs_add_provider(struct dca_provider *dca, struct device *dev)
struct device *cd;
int ret;
- idr_preload(GFP_KERNEL);
- spin_lock(&dca_idr_lock);
-
- ret = idr_alloc(&dca_idr, dca, 0, 0, GFP_NOWAIT);
- if (ret >= 0)
- dca->id = ret;
-
- spin_unlock(&dca_idr_lock);
- idr_preload_end();
+ ret = idr_alloc(&dca_idr, dca, 0, 0, GFP_KERNEL);
if (ret < 0)
return ret;
+ dca->id = ret;
cd = device_create(dca_class, dev, MKDEV(0, 0), NULL, "dca%d", dca->id);
if (IS_ERR(cd)) {
- spin_lock(&dca_idr_lock);
idr_remove(&dca_idr, dca->id);
- spin_unlock(&dca_idr_lock);
return PTR_ERR(cd);
}
dca->cd = cd;
@@ -82,21 +72,17 @@ void dca_sysfs_remove_provider(struct dca_provider *dca)
{
device_unregister(dca->cd);
dca->cd = NULL;
- spin_lock(&dca_idr_lock);
idr_remove(&dca_idr, dca->id);
- spin_unlock(&dca_idr_lock);
}
int __init dca_sysfs_init(void)
{
idr_init(&dca_idr);
- spin_lock_init(&dca_idr_lock);
dca_class = class_create(THIS_MODULE, "dca");
- if (IS_ERR(dca_class)) {
- idr_destroy(&dca_idr);
+ if (IS_ERR(dca_class))
return PTR_ERR(dca_class);
- }
+
return 0;
}
--
2.15.0
From: Matthew Wilcox <[email protected]>
ARM64 doesn't walk the VMA tree in its flush_dcache_page()
implementation, so has no need to take the tree_lock.
Signed-off-by: Matthew Wilcox <mawilcox@microsoft.com>
Reviewed-by: Will Deacon <[email protected]>
---
arch/arm64/include/asm/cacheflush.h | 6 ++----
1 file changed, 2 insertions(+), 4 deletions(-)
diff --git a/arch/arm64/include/asm/cacheflush.h b/arch/arm64/include/asm/cacheflush.h
index 76d1cc85d5b1..568f83533f12 100644
--- a/arch/arm64/include/asm/cacheflush.h
+++ b/arch/arm64/include/asm/cacheflush.h
@@ -130,10 +130,8 @@ static inline void __flush_icache_all(void)
dsb(ish);
}
-#define flush_dcache_mmap_lock(mapping) \
- spin_lock_irq(&(mapping)->tree_lock)
-#define flush_dcache_mmap_unlock(mapping) \
- spin_unlock_irq(&(mapping)->tree_lock)
+#define flush_dcache_mmap_lock(mapping) do { } while (0)
+#define flush_dcache_mmap_unlock(mapping) do { } while (0)
/*
* We don't appear to need to do anything here. In fact, if we did, we'd
--
2.15.0
From: Matthew Wilcox <[email protected]>
Unicore doesn't walk the VMA tree in its flush_dcache_page()
implementation, so has no need to take the tree_lock.
Signed-off-by: Matthew Wilcox <[email protected]>
---
arch/unicore32/include/asm/cacheflush.h | 6 ++----
1 file changed, 2 insertions(+), 4 deletions(-)
diff --git a/arch/unicore32/include/asm/cacheflush.h b/arch/unicore32/include/asm/cacheflush.h
index a5e08e2d5d6d..1d9132b66039 100644
--- a/arch/unicore32/include/asm/cacheflush.h
+++ b/arch/unicore32/include/asm/cacheflush.h
@@ -170,10 +170,8 @@ extern void flush_cache_page(struct vm_area_struct *vma,
#define ARCH_IMPLEMENTS_FLUSH_DCACHE_PAGE 1
extern void flush_dcache_page(struct page *);
-#define flush_dcache_mmap_lock(mapping) \
- spin_lock_irq(&(mapping)->tree_lock)
-#define flush_dcache_mmap_unlock(mapping) \
- spin_unlock_irq(&(mapping)->tree_lock)
+#define flush_dcache_mmap_lock(mapping) do { } while (0)
+#define flush_dcache_mmap_unlock(mapping) do { } while (0)
#define flush_icache_user_range(vma, page, addr, len) \
flush_dcache_page(page)
--
2.15.0
From: Matthew Wilcox <[email protected]>
Converting page_cache_tree_delete() to the XArray makes the code
slightly shorter and simpler.
Signed-off-by: Matthew Wilcox <[email protected]>
---
mm/filemap.c | 26 ++++++++++++--------------
1 file changed, 12 insertions(+), 14 deletions(-)
diff --git a/mm/filemap.c b/mm/filemap.c
index 1d520748789b..f60f22867a1a 100644
--- a/mm/filemap.c
+++ b/mm/filemap.c
@@ -115,27 +115,25 @@
static void page_cache_tree_delete(struct address_space *mapping,
struct page *page, void *shadow)
{
- int i, nr;
+ XA_STATE(xas, page->index);
+ unsigned int i, nr;
- /* hugetlb pages are represented by one entry in the radix tree */
+ xas_set_update(&xas, workingset_lookup_update(mapping));
+
+ /* hugetlb pages are represented by a single entry in the xarray */
nr = PageHuge(page) ? 1 : hpage_nr_pages(page);
VM_BUG_ON_PAGE(!PageLocked(page), page);
VM_BUG_ON_PAGE(PageTail(page), page);
VM_BUG_ON_PAGE(nr != 1 && shadow, page);
- for (i = 0; i < nr; i++) {
- struct radix_tree_node *node;
- void **slot;
-
- __radix_tree_lookup(&mapping->pages, page->index + i,
- &node, &slot);
-
- VM_BUG_ON_PAGE(!node && nr != 1, page);
-
- radix_tree_clear_tags(&mapping->pages, node, slot);
- __radix_tree_replace(&mapping->pages, node, slot, shadow,
- workingset_lookup_update(mapping));
+ i = nr;
+repeat:
+ xas_store(&mapping->pages, &xas, shadow);
+ xas_init_tags(&mapping->pages, &xas);
+ if (--i) {
+ xas_next_any(&mapping->pages, &xas);
+ goto repeat;
}
page->mapping = NULL;
--
2.15.0
From: Matthew Wilcox <[email protected]>
Now that the IDR handles its own locking, we can remove our own lock
and, with it, the idr_preload calls.
Also add a missing put call in a failure path.
Signed-off-by: Matthew Wilcox <[email protected]>
---
drivers/gpu/drm/drm_syncobj.c | 23 +++++------------------
include/drm/drm_file.h | 2 --
2 files changed, 5 insertions(+), 20 deletions(-)
diff --git a/drivers/gpu/drm/drm_syncobj.c b/drivers/gpu/drm/drm_syncobj.c
index f776fc1cc543..fec56b26a0b6 100644
--- a/drivers/gpu/drm/drm_syncobj.c
+++ b/drivers/gpu/drm/drm_syncobj.c
@@ -68,14 +68,14 @@ struct drm_syncobj *drm_syncobj_find(struct drm_file *file_private,
{
struct drm_syncobj *syncobj;
- spin_lock(&file_private->syncobj_table_lock);
+ idr_lock(&file_private->syncobj_idr);
/* Check if we currently have a reference on the object */
syncobj = idr_find(&file_private->syncobj_idr, handle);
if (syncobj)
drm_syncobj_get(syncobj);
- spin_unlock(&file_private->syncobj_table_lock);
+ idr_unlock(&file_private->syncobj_idr);
return syncobj;
}
@@ -309,13 +309,7 @@ int drm_syncobj_get_handle(struct drm_file *file_private,
/* take a reference to put in the idr */
drm_syncobj_get(syncobj);
- idr_preload(GFP_KERNEL);
- spin_lock(&file_private->syncobj_table_lock);
- ret = idr_alloc(&file_private->syncobj_idr, syncobj, 1, 0, GFP_NOWAIT);
- spin_unlock(&file_private->syncobj_table_lock);
-
- idr_preload_end();
-
+ ret = idr_alloc(&file_private->syncobj_idr, syncobj, 1, 0, GFP_KERNEL);
if (ret < 0) {
drm_syncobj_put(syncobj);
return ret;
@@ -346,9 +340,7 @@ static int drm_syncobj_destroy(struct drm_file *file_private,
{
struct drm_syncobj *syncobj;
- spin_lock(&file_private->syncobj_table_lock);
syncobj = idr_remove(&file_private->syncobj_idr, handle);
- spin_unlock(&file_private->syncobj_table_lock);
if (!syncobj)
return -EINVAL;
@@ -449,14 +441,10 @@ static int drm_syncobj_fd_to_handle(struct drm_file *file_private,
/* take a reference to put in the idr */
drm_syncobj_get(syncobj);
- idr_preload(GFP_KERNEL);
- spin_lock(&file_private->syncobj_table_lock);
- ret = idr_alloc(&file_private->syncobj_idr, syncobj, 1, 0, GFP_NOWAIT);
- spin_unlock(&file_private->syncobj_table_lock);
- idr_preload_end();
-
+ ret = idr_alloc(&file_private->syncobj_idr, syncobj, 1, 0, GFP_KERNEL);
if (ret < 0) {
fput(syncobj->file);
+ drm_syncobj_put(syncobj);
return ret;
}
*handle = ret;
@@ -527,7 +515,6 @@ void
drm_syncobj_open(struct drm_file *file_private)
{
idr_init(&file_private->syncobj_idr);
- spin_lock_init(&file_private->syncobj_table_lock);
}
static int
diff --git a/include/drm/drm_file.h b/include/drm/drm_file.h
index dd3520d8ce60..dfabc20a5934 100644
--- a/include/drm/drm_file.h
+++ b/include/drm/drm_file.h
@@ -231,8 +231,6 @@ struct drm_file {
/** @syncobj_idr: Mapping of sync object handles to object pointers. */
struct idr syncobj_idr;
- /** @syncobj_table_lock: Protects @syncobj_idr. */
- spinlock_t syncobj_table_lock;
/** @filp: Pointer to the core file structure. */
struct file *filp;
--
2.15.0
From: Matthew Wilcox <[email protected]>
This is a pretty straightforward conversion.
Signed-off-by: Matthew Wilcox <[email protected]>
---
fs/xfs/xfs_dquot.c | 33 +++++++++++++++++----------------
fs/xfs/xfs_qm.c | 32 ++++++++++++++++----------------
fs/xfs/xfs_qm.h | 18 +++++++++---------
3 files changed, 42 insertions(+), 41 deletions(-)
diff --git a/fs/xfs/xfs_dquot.c b/fs/xfs/xfs_dquot.c
index fbf37b326050..89f5250de8b2 100644
--- a/fs/xfs/xfs_dquot.c
+++ b/fs/xfs/xfs_dquot.c
@@ -44,7 +44,7 @@
* Lock order:
*
* ip->i_lock
- * qi->qi_tree_lock
+ * qi->qi_xa_lock
* dquot->q_qlock (xfs_dqlock() and friends)
* dquot->q_flush (xfs_dqflock() and friends)
* qi->qi_lru_lock
@@ -752,8 +752,8 @@ xfs_qm_dqget(
xfs_dquot_t **O_dqpp) /* OUT : locked incore dquot */
{
struct xfs_quotainfo *qi = mp->m_quotainfo;
- struct radix_tree_root *tree = xfs_dquot_tree(qi, type);
- struct xfs_dquot *dqp;
+ struct xarray *xa = xfs_dquot_xa(qi, type);
+ struct xfs_dquot *dqp, *curr;
int error;
ASSERT(XFS_IS_QUOTA_RUNNING(mp));
@@ -772,13 +772,14 @@ xfs_qm_dqget(
}
restart:
- mutex_lock(&qi->qi_tree_lock);
- dqp = radix_tree_lookup(tree, id);
+ mutex_lock(&qi->qi_xa_lock);
+ dqp = xa_load(xa, id);
+found:
if (dqp) {
xfs_dqlock(dqp);
if (dqp->dq_flags & XFS_DQ_FREEING) {
xfs_dqunlock(dqp);
- mutex_unlock(&qi->qi_tree_lock);
+ mutex_unlock(&qi->qi_xa_lock);
trace_xfs_dqget_freeing(dqp);
delay(1);
goto restart;
@@ -788,7 +789,7 @@ xfs_qm_dqget(
if (flags & XFS_QMOPT_DQNEXT) {
if (XFS_IS_DQUOT_UNINITIALIZED(dqp)) {
xfs_dqunlock(dqp);
- mutex_unlock(&qi->qi_tree_lock);
+ mutex_unlock(&qi->qi_xa_lock);
error = xfs_dq_get_next_id(mp, type, &id);
if (error)
return error;
@@ -797,14 +798,14 @@ xfs_qm_dqget(
}
dqp->q_nrefs++;
- mutex_unlock(&qi->qi_tree_lock);
+ mutex_unlock(&qi->qi_xa_lock);
trace_xfs_dqget_hit(dqp);
XFS_STATS_INC(mp, xs_qm_dqcachehits);
*O_dqpp = dqp;
return 0;
}
- mutex_unlock(&qi->qi_tree_lock);
+ mutex_unlock(&qi->qi_xa_lock);
XFS_STATS_INC(mp, xs_qm_dqcachemisses);
/*
@@ -854,20 +855,20 @@ xfs_qm_dqget(
}
}
- mutex_lock(&qi->qi_tree_lock);
- error = radix_tree_insert(tree, id, dqp);
- if (unlikely(error)) {
- WARN_ON(error != -EEXIST);
+ mutex_lock(&qi->qi_xa_lock);
+ curr = xa_cmpxchg(xa, id, NULL, dqp, GFP_NOFS);
+ if (unlikely(curr)) {
+ WARN_ON(IS_ERR(curr));
/*
* Duplicate found. Just throw away the new dquot and start
* over.
*/
- mutex_unlock(&qi->qi_tree_lock);
trace_xfs_dqget_dup(dqp);
xfs_qm_dqdestroy(dqp);
XFS_STATS_INC(mp, xs_qm_dquot_dups);
- goto restart;
+ dqp = curr;
+ goto found;
}
/*
@@ -877,7 +878,7 @@ xfs_qm_dqget(
dqp->q_nrefs = 1;
qi->qi_dquots++;
- mutex_unlock(&qi->qi_tree_lock);
+ mutex_unlock(&qi->qi_xa_lock);
/* If we are asked to find next active id, keep looking */
if (flags & XFS_QMOPT_DQNEXT) {
diff --git a/fs/xfs/xfs_qm.c b/fs/xfs/xfs_qm.c
index 010a13a201aa..5a75836faf92 100644
--- a/fs/xfs/xfs_qm.c
+++ b/fs/xfs/xfs_qm.c
@@ -67,7 +67,7 @@ xfs_qm_dquot_walk(
void *data)
{
struct xfs_quotainfo *qi = mp->m_quotainfo;
- struct radix_tree_root *tree = xfs_dquot_tree(qi, type);
+ struct xarray *xa = xfs_dquot_xa(qi, type);
uint32_t next_index;
int last_error = 0;
int skipped;
@@ -83,11 +83,11 @@ xfs_qm_dquot_walk(
int error = 0;
int i;
- mutex_lock(&qi->qi_tree_lock);
- nr_found = radix_tree_gang_lookup(tree, (void **)batch,
- next_index, XFS_DQ_LOOKUP_BATCH);
+ mutex_lock(&qi->qi_xa_lock);
+ nr_found = xa_get_entries(xa, (void **)batch, next_index,
+ ULONG_MAX, XFS_DQ_LOOKUP_BATCH);
if (!nr_found) {
- mutex_unlock(&qi->qi_tree_lock);
+ mutex_unlock(&qi->qi_xa_lock);
break;
}
@@ -105,7 +105,7 @@ xfs_qm_dquot_walk(
last_error = error;
}
- mutex_unlock(&qi->qi_tree_lock);
+ mutex_unlock(&qi->qi_xa_lock);
/* bail out if the filesystem is corrupted. */
if (last_error == -EFSCORRUPTED) {
@@ -178,8 +178,8 @@ xfs_qm_dqpurge(
xfs_dqfunlock(dqp);
xfs_dqunlock(dqp);
- radix_tree_delete(xfs_dquot_tree(qi, dqp->q_core.d_flags),
- be32_to_cpu(dqp->q_core.d_id));
+ xa_store(xfs_dquot_xa(qi, dqp->q_core.d_flags),
+ be32_to_cpu(dqp->q_core.d_id), NULL, GFP_NOWAIT);
qi->qi_dquots--;
/*
@@ -623,10 +623,10 @@ xfs_qm_init_quotainfo(
if (error)
goto out_free_lru;
- INIT_RADIX_TREE(&qinf->qi_uquota_tree, GFP_NOFS);
- INIT_RADIX_TREE(&qinf->qi_gquota_tree, GFP_NOFS);
- INIT_RADIX_TREE(&qinf->qi_pquota_tree, GFP_NOFS);
- mutex_init(&qinf->qi_tree_lock);
+ xa_init(&qinf->qi_uquota_xa);
+ xa_init(&qinf->qi_gquota_xa);
+ xa_init(&qinf->qi_pquota_xa);
+ mutex_init(&qinf->qi_xa_lock);
/* mutex used to serialize quotaoffs */
mutex_init(&qinf->qi_quotaofflock);
@@ -1606,12 +1606,12 @@ xfs_qm_dqfree_one(
struct xfs_mount *mp = dqp->q_mount;
struct xfs_quotainfo *qi = mp->m_quotainfo;
- mutex_lock(&qi->qi_tree_lock);
- radix_tree_delete(xfs_dquot_tree(qi, dqp->q_core.d_flags),
- be32_to_cpu(dqp->q_core.d_id));
+ mutex_lock(&qi->qi_xa_lock);
+ xa_store(xfs_dquot_xa(qi, dqp->q_core.d_flags),
+ be32_to_cpu(dqp->q_core.d_id), NULL, GFP_NOWAIT);
qi->qi_dquots--;
- mutex_unlock(&qi->qi_tree_lock);
+ mutex_unlock(&qi->qi_xa_lock);
xfs_qm_dqdestroy(dqp);
}
diff --git a/fs/xfs/xfs_qm.h b/fs/xfs/xfs_qm.h
index 2975a822e9f0..946f929f7bfb 100644
--- a/fs/xfs/xfs_qm.h
+++ b/fs/xfs/xfs_qm.h
@@ -67,10 +67,10 @@ struct xfs_def_quota {
* The mount structure keeps a pointer to this.
*/
typedef struct xfs_quotainfo {
- struct radix_tree_root qi_uquota_tree;
- struct radix_tree_root qi_gquota_tree;
- struct radix_tree_root qi_pquota_tree;
- struct mutex qi_tree_lock;
+ struct xarray qi_uquota_xa;
+ struct xarray qi_gquota_xa;
+ struct xarray qi_pquota_xa;
+ struct mutex qi_xa_lock;
struct xfs_inode *qi_uquotaip; /* user quota inode */
struct xfs_inode *qi_gquotaip; /* group quota inode */
struct xfs_inode *qi_pquotaip; /* project quota inode */
@@ -91,18 +91,18 @@ typedef struct xfs_quotainfo {
struct shrinker qi_shrinker;
} xfs_quotainfo_t;
-static inline struct radix_tree_root *
-xfs_dquot_tree(
+static inline struct xarray *
+xfs_dquot_xa(
struct xfs_quotainfo *qi,
int type)
{
switch (type) {
case XFS_DQ_USER:
- return &qi->qi_uquota_tree;
+ return &qi->qi_uquota_xa;
case XFS_DQ_GROUP:
- return &qi->qi_gquota_tree;
+ return &qi->qi_gquota_xa;
case XFS_DQ_PROJ:
- return &qi->qi_pquota_tree;
+ return &qi->qi_pquota_xa;
default:
ASSERT(0);
}
--
2.15.0
From: Matthew Wilcox <[email protected]>
Introduce xarray value entries to replace the radix tree exceptional
entry code. This is a slight change in encoding to allow the use of an
extra bit (we can now store BITS_PER_LONG - 1 bits in a value entry).
It is also a change in emphasis; exceptional entries are intimidating
and different. As the comment explains, you can choose to store values
or pointers in the xarray and they are both first-class citizens.
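As a trivial illustration of the new helpers (the encoding itself is in
xarray.h below):

        /* Any unsigned long that fits in BITS_PER_LONG - 1 bits can be stored. */
        void *entry = xa_mk_value(42);

        if (xa_is_value(entry))                         /* distinguishes values from pointers */
                pr_info("%lu\n", xa_to_value(entry));   /* prints 42 */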
Signed-off-by: Matthew Wilcox <[email protected]>
---
arch/powerpc/include/asm/book3s/64/pgtable.h | 4 +-
arch/powerpc/include/asm/nohash/64/pgtable.h | 4 +-
drivers/gpu/drm/i915/i915_gem.c | 17 ++--
drivers/staging/lustre/lustre/mdc/mdc_request.c | 2 +-
fs/btrfs/compression.c | 2 +-
fs/btrfs/inode.c | 2 +-
fs/dax.c | 108 ++++++++++++------------
fs/proc/task_mmu.c | 2 +-
include/linux/fs.h | 48 +++++++----
include/linux/radix-tree.h | 36 ++------
include/linux/swapops.h | 19 ++---
include/linux/xarray.h | 66 +++++++++++++++
lib/idr.c | 63 ++++++--------
lib/radix-tree.c | 21 ++---
mm/filemap.c | 10 +--
mm/khugepaged.c | 2 +-
mm/madvise.c | 2 +-
mm/memcontrol.c | 2 +-
mm/mincore.c | 2 +-
mm/readahead.c | 2 +-
mm/shmem.c | 10 +--
mm/swap.c | 2 +-
mm/truncate.c | 12 +--
mm/workingset.c | 12 ++-
tools/testing/radix-tree/idr-test.c | 6 +-
tools/testing/radix-tree/linux/radix-tree.h | 1 +
tools/testing/radix-tree/multiorder.c | 47 +++++------
tools/testing/radix-tree/test.c | 2 +-
28 files changed, 270 insertions(+), 236 deletions(-)
diff --git a/arch/powerpc/include/asm/book3s/64/pgtable.h b/arch/powerpc/include/asm/book3s/64/pgtable.h
index 9a677cd5997f..ff96663d8455 100644
--- a/arch/powerpc/include/asm/book3s/64/pgtable.h
+++ b/arch/powerpc/include/asm/book3s/64/pgtable.h
@@ -649,9 +649,7 @@ static inline bool pte_user(pte_t pte)
BUILD_BUG_ON(_PAGE_HPTEFLAGS & (0x1f << _PAGE_BIT_SWAP_TYPE)); \
BUILD_BUG_ON(_PAGE_HPTEFLAGS & _PAGE_SWP_SOFT_DIRTY); \
} while (0)
-/*
- * on pte we don't need handle RADIX_TREE_EXCEPTIONAL_SHIFT;
- */
+
#define SWP_TYPE_BITS 5
#define __swp_type(x) (((x).val >> _PAGE_BIT_SWAP_TYPE) \
& ((1UL << SWP_TYPE_BITS) - 1))
diff --git a/arch/powerpc/include/asm/nohash/64/pgtable.h b/arch/powerpc/include/asm/nohash/64/pgtable.h
index abddf5830ad5..f711773568d7 100644
--- a/arch/powerpc/include/asm/nohash/64/pgtable.h
+++ b/arch/powerpc/include/asm/nohash/64/pgtable.h
@@ -329,9 +329,7 @@ static inline void __ptep_set_access_flags(struct mm_struct *mm,
*/ \
BUILD_BUG_ON(_PAGE_HPTEFLAGS & (0x1f << _PAGE_BIT_SWAP_TYPE)); \
} while (0)
-/*
- * on pte we don't need handle RADIX_TREE_EXCEPTIONAL_SHIFT;
- */
+
#define SWP_TYPE_BITS 5
#define __swp_type(x) (((x).val >> _PAGE_BIT_SWAP_TYPE) \
& ((1UL << SWP_TYPE_BITS) - 1))
diff --git a/drivers/gpu/drm/i915/i915_gem.c b/drivers/gpu/drm/i915/i915_gem.c
index 3a140eedfc83..7b10ec3694a8 100644
--- a/drivers/gpu/drm/i915/i915_gem.c
+++ b/drivers/gpu/drm/i915/i915_gem.c
@@ -5375,7 +5375,8 @@ i915_gem_object_get_sg(struct drm_i915_gem_object *obj,
count = __sg_page_count(sg);
while (idx + count <= n) {
- unsigned long exception, i;
+ void *entry;
+ unsigned long i;
int ret;
/* If we cannot allocate and insert this entry, or the
@@ -5390,12 +5391,9 @@ i915_gem_object_get_sg(struct drm_i915_gem_object *obj,
if (ret && ret != -EEXIST)
goto scan;
- exception =
- RADIX_TREE_EXCEPTIONAL_ENTRY |
- idx << RADIX_TREE_EXCEPTIONAL_SHIFT;
+ entry = xa_mk_value(idx);
for (i = 1; i < count; i++) {
- ret = radix_tree_insert(&iter->radix, idx + i,
- (void *)exception);
+ ret = radix_tree_insert(&iter->radix, idx + i, entry);
if (ret && ret != -EEXIST)
goto scan;
}
@@ -5433,15 +5431,14 @@ i915_gem_object_get_sg(struct drm_i915_gem_object *obj,
GEM_BUG_ON(!sg);
/* If this index is in the middle of multi-page sg entry,
- * the radixtree will contain an exceptional entry that points
+ * the radixtree will contain a data entry that points
* to the start of that range. We will return the pointer to
* the base page and the offset of this page within the
* sg entry's range.
*/
*offset = 0;
- if (unlikely(radix_tree_exception(sg))) {
- unsigned long base =
- (unsigned long)sg >> RADIX_TREE_EXCEPTIONAL_SHIFT;
+ if (unlikely(xa_is_value(sg))) {
+ unsigned long base = xa_to_value(sg);
sg = radix_tree_lookup(&iter->radix, base);
GEM_BUG_ON(!sg);
diff --git a/drivers/staging/lustre/lustre/mdc/mdc_request.c b/drivers/staging/lustre/lustre/mdc/mdc_request.c
index 45dcf9f958d4..2ec79a6b17da 100644
--- a/drivers/staging/lustre/lustre/mdc/mdc_request.c
+++ b/drivers/staging/lustre/lustre/mdc/mdc_request.c
@@ -940,7 +940,7 @@ static struct page *mdc_page_locate(struct address_space *mapping, __u64 *hash,
xa_lock_irq(&mapping->pages);
found = radix_tree_gang_lookup(&mapping->pages,
(void **)&page, offset, 1);
- if (found > 0 && !radix_tree_exceptional_entry(page)) {
+ if (found > 0 && !xa_is_value(page)) {
struct lu_dirpage *dp;
get_page(page);
diff --git a/fs/btrfs/compression.c b/fs/btrfs/compression.c
index 541bd536c815..9446c58fa0ec 100644
--- a/fs/btrfs/compression.c
+++ b/fs/btrfs/compression.c
@@ -451,7 +451,7 @@ static noinline int add_ra_bio_pages(struct inode *inode,
rcu_read_lock();
page = radix_tree_lookup(&mapping->pages, pg_index);
rcu_read_unlock();
- if (page && !radix_tree_exceptional_entry(page)) {
+ if (page && !xa_is_value(page)) {
misses++;
if (misses > 4)
break;
diff --git a/fs/btrfs/inode.c b/fs/btrfs/inode.c
index 9ceee45ed513..dcad35f7c699 100644
--- a/fs/btrfs/inode.c
+++ b/fs/btrfs/inode.c
@@ -7562,7 +7562,7 @@ bool btrfs_page_exists_in_range(struct inode *inode, loff_t start, loff_t end)
}
/*
* Otherwise, shmem/tmpfs must be storing a swap entry
- * here as an exceptional entry: so return it without
+ * here as a data entry: so return it without
* attempting to raise page count.
*/
page = NULL;
diff --git a/fs/dax.c b/fs/dax.c
index 93e6b79687af..e6ed216d3760 100644
--- a/fs/dax.c
+++ b/fs/dax.c
@@ -58,7 +58,7 @@ static int __init init_dax_wait_table(void)
fs_initcall(init_dax_wait_table);
/*
- * We use lowest available bit in exceptional entry for locking, one bit for
+ * We use the lowest available bit in a data entry for locking, one bit for
* the entry size (PMD) and two more to tell us if the entry is a zero page or
* an empty entry that is just used for locking. In total four special bits.
*
@@ -66,49 +66,48 @@ fs_initcall(init_dax_wait_table);
* and EMPTY bits aren't set the entry is a normal DAX entry with a filesystem
* block allocation.
*/
-#define RADIX_DAX_SHIFT (RADIX_TREE_EXCEPTIONAL_SHIFT + 4)
-#define RADIX_DAX_ENTRY_LOCK (1 << RADIX_TREE_EXCEPTIONAL_SHIFT)
-#define RADIX_DAX_PMD (1 << (RADIX_TREE_EXCEPTIONAL_SHIFT + 1))
-#define RADIX_DAX_ZERO_PAGE (1 << (RADIX_TREE_EXCEPTIONAL_SHIFT + 2))
-#define RADIX_DAX_EMPTY (1 << (RADIX_TREE_EXCEPTIONAL_SHIFT + 3))
+#define DAX_SHIFT (4)
+#define DAX_ENTRY_LOCK (1UL << 0)
+#define DAX_PMD (1UL << 1)
+#define DAX_ZERO_PAGE (1UL << 2)
+#define DAX_EMPTY (1UL << 3)
static unsigned long dax_radix_sector(void *entry)
{
- return (unsigned long)entry >> RADIX_DAX_SHIFT;
+ return xa_to_value(entry) >> DAX_SHIFT;
}
static void *dax_radix_locked_entry(sector_t sector, unsigned long flags)
{
- return (void *)(RADIX_TREE_EXCEPTIONAL_ENTRY | flags |
- ((unsigned long)sector << RADIX_DAX_SHIFT) |
- RADIX_DAX_ENTRY_LOCK);
+ return xa_mk_value(flags | ((unsigned long)sector << DAX_SHIFT) |
+ DAX_ENTRY_LOCK);
}
static unsigned int dax_radix_order(void *entry)
{
- if ((unsigned long)entry & RADIX_DAX_PMD)
+ if (xa_to_value(entry) & DAX_PMD)
return PMD_SHIFT - PAGE_SHIFT;
return 0;
}
static int dax_is_pmd_entry(void *entry)
{
- return (unsigned long)entry & RADIX_DAX_PMD;
+ return xa_to_value(entry) & DAX_PMD;
}
static int dax_is_pte_entry(void *entry)
{
- return !((unsigned long)entry & RADIX_DAX_PMD);
+ return !(xa_to_value(entry) & DAX_PMD);
}
static int dax_is_zero_entry(void *entry)
{
- return (unsigned long)entry & RADIX_DAX_ZERO_PAGE;
+ return xa_to_value(entry) & DAX_ZERO_PAGE;
}
static int dax_is_empty_entry(void *entry)
{
- return (unsigned long)entry & RADIX_DAX_EMPTY;
+ return xa_to_value(entry) & DAX_EMPTY;
}
/*
@@ -188,9 +187,9 @@ static void dax_wake_mapping_entry_waiter(struct address_space *mapping,
*/
static inline int slot_locked(struct address_space *mapping, void **slot)
{
- unsigned long entry = (unsigned long)
- radix_tree_deref_slot_protected(slot, &mapping->pages.xa_lock);
- return entry & RADIX_DAX_ENTRY_LOCK;
+ unsigned long entry = xa_to_value(
+ radix_tree_deref_slot_protected(slot, &mapping->pages.xa_lock));
+ return entry & DAX_ENTRY_LOCK;
}
/*
@@ -199,12 +198,11 @@ static inline int slot_locked(struct address_space *mapping, void **slot)
*/
static inline void *lock_slot(struct address_space *mapping, void **slot)
{
- unsigned long entry = (unsigned long)
- radix_tree_deref_slot_protected(slot, &mapping->pages.xa_lock);
-
- entry |= RADIX_DAX_ENTRY_LOCK;
- radix_tree_replace_slot(&mapping->pages, slot, (void *)entry);
- return (void *)entry;
+ unsigned long v = xa_to_value(
+ radix_tree_deref_slot_protected(slot, &mapping->pages.xa_lock));
+ void *entry = xa_mk_value(v | DAX_ENTRY_LOCK);
+ radix_tree_replace_slot(&mapping->pages, slot, entry);
+ return entry;
}
/*
@@ -213,17 +211,16 @@ static inline void *lock_slot(struct address_space *mapping, void **slot)
*/
static inline void *unlock_slot(struct address_space *mapping, void **slot)
{
- unsigned long entry = (unsigned long)
- radix_tree_deref_slot_protected(slot, &mapping->pages.xa_lock);
-
- entry &= ~(unsigned long)RADIX_DAX_ENTRY_LOCK;
- radix_tree_replace_slot(&mapping->pages, slot, (void *)entry);
- return (void *)entry;
+ unsigned long v = xa_to_value(
+ radix_tree_deref_slot_protected(slot, &mapping->pages.xa_lock));
+ void *entry = xa_mk_value(v & ~DAX_ENTRY_LOCK);
+ radix_tree_replace_slot(&mapping->pages, slot, entry);
+ return entry;
}
/*
* Lookup entry in radix tree, wait for it to become unlocked if it is
- * exceptional entry and return it. The caller must call
+ * a data entry and return it. The caller must call
* put_unlocked_mapping_entry() when he decided not to lock the entry or
* put_locked_mapping_entry() when he locked the entry and now wants to
* unlock it.
@@ -244,7 +241,7 @@ static void *get_unlocked_mapping_entry(struct address_space *mapping,
entry = __radix_tree_lookup(&mapping->pages, index, NULL,
&slot);
if (!entry ||
- WARN_ON_ONCE(!radix_tree_exceptional_entry(entry)) ||
+ WARN_ON_ONCE(!xa_is_value(entry)) ||
!slot_locked(mapping, slot)) {
if (slotp)
*slotp = slot;
@@ -268,7 +265,7 @@ static void dax_unlock_mapping_entry(struct address_space *mapping,
xa_lock_irq(&mapping->pages);
entry = __radix_tree_lookup(&mapping->pages, index, NULL, &slot);
- if (WARN_ON_ONCE(!entry || !radix_tree_exceptional_entry(entry) ||
+ if (WARN_ON_ONCE(!entry || !xa_is_value(entry) ||
!slot_locked(mapping, slot))) {
xa_unlock_irq(&mapping->pages);
return;
@@ -299,12 +296,11 @@ static void put_unlocked_mapping_entry(struct address_space *mapping,
}
/*
- * Find radix tree entry at given index. If it points to an exceptional entry,
- * return it with the radix tree entry locked. If the radix tree doesn't
- * contain given index, create an empty exceptional entry for the index and
- * return with it locked.
+ * Find radix tree entry at given index. If it is a data value, return it
+ * with the radix tree entry locked. If the radix tree doesn't contain the
+ * given index, create an empty value for the index and return with it locked.
*
- * When requesting an entry with size RADIX_DAX_PMD, grab_mapping_entry() will
+ * When requesting an entry with size DAX_PMD, grab_mapping_entry() will
* either return that locked entry or will return an error. This error will
* happen if there are any 4k entries within the 2MiB range that we are
* requesting.
@@ -334,13 +330,13 @@ static void *grab_mapping_entry(struct address_space *mapping, pgoff_t index,
xa_lock_irq(&mapping->pages);
entry = get_unlocked_mapping_entry(mapping, index, &slot);
- if (WARN_ON_ONCE(entry && !radix_tree_exceptional_entry(entry))) {
+ if (WARN_ON_ONCE(entry && !xa_is_value(entry))) {
entry = ERR_PTR(-EIO);
goto out_unlock;
}
if (entry) {
- if (size_flag & RADIX_DAX_PMD) {
+ if (size_flag & DAX_PMD) {
if (dax_is_pte_entry(entry)) {
put_unlocked_mapping_entry(mapping, index,
entry);
@@ -410,7 +406,7 @@ static void *grab_mapping_entry(struct address_space *mapping, pgoff_t index,
true);
}
- entry = dax_radix_locked_entry(0, size_flag | RADIX_DAX_EMPTY);
+ entry = dax_radix_locked_entry(0, size_flag | DAX_EMPTY);
err = __radix_tree_insert(&mapping->pages, index,
dax_radix_order(entry), entry);
@@ -447,7 +443,7 @@ static int __dax_invalidate_mapping_entry(struct address_space *mapping,
xa_lock_irq(&mapping->pages);
entry = get_unlocked_mapping_entry(mapping, index, NULL);
- if (!entry || WARN_ON_ONCE(!radix_tree_exceptional_entry(entry)))
+ if (!entry || WARN_ON_ONCE(!xa_is_value(entry)))
goto out;
if (!trunc &&
(radix_tree_tag_get(pages, index, PAGECACHE_TAG_DIRTY) ||
@@ -462,7 +458,7 @@ static int __dax_invalidate_mapping_entry(struct address_space *mapping,
return ret;
}
/*
- * Delete exceptional DAX entry at @index from @mapping. Wait for radix tree
+ * Delete DAX data entry at @index from @mapping. Wait for radix tree
* entry to get unlocked before deleting it.
*/
int dax_delete_mapping_entry(struct address_space *mapping, pgoff_t index)
@@ -473,7 +469,7 @@ int dax_delete_mapping_entry(struct address_space *mapping, pgoff_t index)
* This gets called from truncate / punch_hole path. As such, the caller
* must hold locks protecting against concurrent modifications of the
* radix tree (usually fs-private i_mmap_sem for writing). Since the
- * caller has seen exceptional entry for this index, we better find it
+ * caller has seen a data entry for this index, we better find it
* at that index as well...
*/
WARN_ON_ONCE(!ret);
@@ -481,7 +477,7 @@ int dax_delete_mapping_entry(struct address_space *mapping, pgoff_t index)
}
/*
- * Invalidate exceptional DAX entry if it is clean.
+ * Invalidate DAX data entry if it is clean.
*/
int dax_invalidate_mapping_entry_sync(struct address_space *mapping,
pgoff_t index)
@@ -535,7 +531,7 @@ static void *dax_insert_mapping_entry(struct address_space *mapping,
if (dirty)
__mark_inode_dirty(mapping->host, I_DIRTY_PAGES);
- if (dax_is_zero_entry(entry) && !(flags & RADIX_DAX_ZERO_PAGE)) {
+ if (dax_is_zero_entry(entry) && !(flags & DAX_ZERO_PAGE)) {
/* we are replacing a zero page with block mapping */
if (dax_is_pmd_entry(entry))
unmap_mapping_range(mapping,
@@ -674,13 +670,13 @@ static int dax_writeback_one(struct block_device *bdev,
* A page got tagged dirty in DAX mapping? Something is seriously
* wrong.
*/
- if (WARN_ON(!radix_tree_exceptional_entry(entry)))
+ if (WARN_ON(!xa_is_value(entry)))
return -EIO;
xa_lock_irq(&mapping->pages);
entry2 = get_unlocked_mapping_entry(mapping, index, &slot);
/* Entry got punched out / reallocated? */
- if (!entry2 || WARN_ON_ONCE(!radix_tree_exceptional_entry(entry2)))
+ if (!entry2 || WARN_ON_ONCE(!xa_is_value(entry2)))
goto put_unlocked;
/*
* Entry got reallocated elsewhere? No need to writeback. We have to
@@ -886,7 +882,7 @@ static int dax_load_hole(struct address_space *mapping, void *entry,
}
entry2 = dax_insert_mapping_entry(mapping, vmf, entry, 0,
- RADIX_DAX_ZERO_PAGE, false);
+ DAX_ZERO_PAGE, false);
if (IS_ERR(entry2)) {
ret = VM_FAULT_SIGBUS;
goto out;
@@ -1292,7 +1288,7 @@ static int dax_pmd_load_hole(struct vm_fault *vmf, struct iomap *iomap,
goto fallback;
ret = dax_insert_mapping_entry(mapping, vmf, entry, 0,
- RADIX_DAX_PMD | RADIX_DAX_ZERO_PAGE, false);
+ DAX_PMD | DAX_ZERO_PAGE, false);
if (IS_ERR(ret))
goto fallback;
@@ -1377,7 +1373,7 @@ static int dax_iomap_pmd_fault(struct vm_fault *vmf, pfn_t *pfnp,
* is already in the tree, for instance), it will return -EEXIST and
* we just fall back to 4k entries.
*/
- entry = grab_mapping_entry(mapping, pgoff, RADIX_DAX_PMD);
+ entry = grab_mapping_entry(mapping, pgoff, DAX_PMD);
if (IS_ERR(entry))
goto fallback;
@@ -1416,7 +1412,7 @@ static int dax_iomap_pmd_fault(struct vm_fault *vmf, pfn_t *pfnp,
entry = dax_insert_mapping_entry(mapping, vmf, entry,
dax_iomap_sector(&iomap, pos),
- RADIX_DAX_PMD, write && !sync);
+ DAX_PMD, write && !sync);
if (IS_ERR(entry))
goto finish_iomap;
@@ -1528,21 +1524,21 @@ static int dax_insert_pfn_mkwrite(struct vm_fault *vmf,
pgoff_t index = vmf->pgoff;
int vmf_ret, error;
- spin_lock_irq(&mapping->tree_lock);
+ xa_lock_irq(&mapping->pages);
entry = get_unlocked_mapping_entry(mapping, index, &slot);
/* Did we race with someone splitting entry or so? */
if (!entry ||
(pe_size == PE_SIZE_PTE && !dax_is_pte_entry(entry)) ||
(pe_size == PE_SIZE_PMD && !dax_is_pmd_entry(entry))) {
put_unlocked_mapping_entry(mapping, index, entry);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
trace_dax_insert_pfn_mkwrite_no_entry(mapping->host, vmf,
VM_FAULT_NOPAGE);
return VM_FAULT_NOPAGE;
}
- radix_tree_tag_set(&mapping->page_tree, index, PAGECACHE_TAG_DIRTY);
+ radix_tree_tag_set(&mapping->pages, index, PAGECACHE_TAG_DIRTY);
entry = lock_slot(mapping, slot);
- spin_unlock_irq(&mapping->tree_lock);
+ xa_unlock_irq(&mapping->pages);
switch (pe_size) {
case PE_SIZE_PTE:
error = vm_insert_mixed_mkwrite(vmf->vma, vmf->address, pfn);
diff --git a/fs/proc/task_mmu.c b/fs/proc/task_mmu.c
index 339e4c1c044d..fadc6dbe17d6 100644
--- a/fs/proc/task_mmu.c
+++ b/fs/proc/task_mmu.c
@@ -553,7 +553,7 @@ static void smaps_pte_entry(pte_t *pte, unsigned long addr,
if (!page)
return;
- if (radix_tree_exceptional_entry(page))
+ if (xa_is_value(page))
mss->swap += PAGE_SIZE;
else
put_page(page);
diff --git a/include/linux/fs.h b/include/linux/fs.h
index 265340149265..a5c105d292a7 100644
--- a/include/linux/fs.h
+++ b/include/linux/fs.h
@@ -389,23 +389,41 @@ int pagecache_write_end(struct file *, struct address_space *mapping,
loff_t pos, unsigned len, unsigned copied,
struct page *page, void *fsdata);
+/**
+ * struct address_space - Contents of a cachable, mappable object
+ *
+ * @host: Owner, either the inode or the block_device
+ * @pages: Cached pages
+ * @gfp_mask: Memory allocation flags to use for allocating pages
+ * @i_mmap_writable: count VM_SHARED mappings
+ * @i_mmap: tree of private and shared mappings
+ * @i_mmap_rwsem: Protects @i_mmap and @i_mmap_writable
+ * @nrpages: Number of total pages, protected by pages.xa_lock
+ * @nrexceptional: Shadow or DAX entries, protected by pages.xa_lock
+ * @writeback_index: writeback starts here
+ * @a_ops: methods
+ * @flags: Error bits and flags (AS_*)
+ * @wb_err: The most recent error which has occurred
+ * @private_lock: For use by the owner of the address_space
+ * @private_list: For use by the owner of the address space
+ * @private_data: For use by the owner of the address space
+ */
struct address_space {
- struct inode *host; /* owner: inode, block_device */
- struct radix_tree_root pages; /* cached pages */
- gfp_t gfp_mask; /* for allocating pages */
- atomic_t i_mmap_writable;/* count VM_SHARED mappings */
- struct rb_root_cached i_mmap; /* tree of private and shared mappings */
- struct rw_semaphore i_mmap_rwsem; /* protect tree, count, list */
- /* Protected by pages.xa_lock */
- unsigned long nrpages; /* number of total pages */
- unsigned long nrexceptional; /* shadow or DAX entries */
- pgoff_t writeback_index;/* writeback starts here */
- const struct address_space_operations *a_ops; /* methods */
- unsigned long flags; /* error bits */
+ struct inode *host;
+ struct radix_tree_root pages;
+ gfp_t gfp_mask;
+ atomic_t i_mmap_writable;
+ struct rb_root_cached i_mmap;
+ struct rw_semaphore i_mmap_rwsem;
+ unsigned long nrpages;
+ unsigned long nrexceptional;
+ pgoff_t writeback_index;
+ const struct address_space_operations *a_ops;
+ unsigned long flags;
errseq_t wb_err;
- spinlock_t private_lock; /* for use by the address_space */
- struct list_head private_list; /* ditto */
- void *private_data; /* ditto */
+ spinlock_t private_lock;
+ struct list_head private_list;
+ void *private_data;
} __attribute__((aligned(sizeof(long)))) __randomize_layout;
/*
* On most architectures that alignment is already the case; but
diff --git a/include/linux/radix-tree.h b/include/linux/radix-tree.h
index b4d60295decd..ee9aad11d472 100644
--- a/include/linux/radix-tree.h
+++ b/include/linux/radix-tree.h
@@ -28,34 +28,26 @@
#include <linux/rcupdate.h>
#include <linux/spinlock.h>
#include <linux/types.h>
+#include <linux/xarray.h>
/*
* The bottom two bits of the slot determine how the remaining bits in the
* slot are interpreted:
*
* 00 - data pointer
- * 01 - internal entry
- * 10 - exceptional entry
- * 11 - this bit combination is currently unused/reserved
+ * 10 - internal entry
+ * x1 - inline data entry
*
* The internal entry may be a pointer to the next level in the tree, a
* sibling entry, or an indicator that the entry in this slot has been moved
* to another location in the tree and the lookup should be restarted. While
* NULL fits the 'data pointer' pattern, it means that there is no entry in
* the tree for this index (no matter what level of the tree it is found at).
- * This means that you cannot store NULL in the tree as a value for the index.
+ * This means that storing NULL in the tree as a value for the index is the
+ * same as deleting the index from the tree.
*/
#define RADIX_TREE_ENTRY_MASK 3UL
-#define RADIX_TREE_INTERNAL_NODE 1UL
-
-/*
- * Most users of the radix tree store pointers but shmem/tmpfs stores swap
- * entries in the same tree. They are marked as exceptional entries to
- * distinguish them from pointers to struct page.
- * EXCEPTIONAL_ENTRY tests the bit, EXCEPTIONAL_SHIFT shifts content past it.
- */
-#define RADIX_TREE_EXCEPTIONAL_ENTRY 2
-#define RADIX_TREE_EXCEPTIONAL_SHIFT 2
+#define RADIX_TREE_INTERNAL_NODE 2UL
static inline bool radix_tree_is_internal_node(void *ptr)
{
@@ -83,11 +75,10 @@ static inline bool radix_tree_is_internal_node(void *ptr)
/*
* @count is the count of every non-NULL element in the ->slots array
- * whether that is an exceptional entry, a retry entry, a user pointer,
+ * whether that is a data entry, a retry entry, a user pointer,
* a sibling entry or a pointer to the next level of the tree.
* @exceptional is the count of every element in ->slots which is
- * either radix_tree_exceptional_entry() or is a sibling entry for an
- * exceptional entry.
+ * either a data entry or a sibling entry for data.
*/
struct radix_tree_node {
unsigned char shift; /* Bits remaining in each slot */
@@ -267,17 +258,6 @@ static inline int radix_tree_deref_retry(void *arg)
return unlikely(radix_tree_is_internal_node(arg));
}
-/**
- * radix_tree_exceptional_entry - radix_tree_deref_slot gave exceptional entry?
- * @arg: value returned by radix_tree_deref_slot
- * Returns: 0 if well-aligned pointer, non-0 if exceptional entry.
- */
-static inline int radix_tree_exceptional_entry(void *arg)
-{
- /* Not unlikely because radix_tree_exception often tested first */
- return (unsigned long)arg & RADIX_TREE_EXCEPTIONAL_ENTRY;
-}
-
/**
* radix_tree_exception - radix_tree_deref_slot returned either exception?
* @arg: value returned by radix_tree_deref_slot
diff --git a/include/linux/swapops.h b/include/linux/swapops.h
index 9c5a2628d6ce..5e93c7b500da 100644
--- a/include/linux/swapops.h
+++ b/include/linux/swapops.h
@@ -17,9 +17,8 @@
*
* swp_entry_t's are *never* stored anywhere in their arch-dependent format.
*/
-#define SWP_TYPE_SHIFT(e) ((sizeof(e.val) * 8) - \
- (MAX_SWAPFILES_SHIFT + RADIX_TREE_EXCEPTIONAL_SHIFT))
-#define SWP_OFFSET_MASK(e) ((1UL << SWP_TYPE_SHIFT(e)) - 1)
+#define SWP_TYPE_SHIFT (BITS_PER_XA_VALUE - MAX_SWAPFILES_SHIFT)
+#define SWP_OFFSET_MASK ((1UL << SWP_TYPE_SHIFT) - 1)
/*
* Store a type+offset into a swp_entry_t in an arch-independent format
@@ -28,8 +27,7 @@ static inline swp_entry_t swp_entry(unsigned long type, pgoff_t offset)
{
swp_entry_t ret;
- ret.val = (type << SWP_TYPE_SHIFT(ret)) |
- (offset & SWP_OFFSET_MASK(ret));
+ ret.val = (type << SWP_TYPE_SHIFT) | (offset & SWP_OFFSET_MASK);
return ret;
}
@@ -39,7 +37,7 @@ static inline swp_entry_t swp_entry(unsigned long type, pgoff_t offset)
*/
static inline unsigned swp_type(swp_entry_t entry)
{
- return (entry.val >> SWP_TYPE_SHIFT(entry));
+ return (entry.val >> SWP_TYPE_SHIFT);
}
/*
@@ -48,7 +46,7 @@ static inline unsigned swp_type(swp_entry_t entry)
*/
static inline pgoff_t swp_offset(swp_entry_t entry)
{
- return entry.val & SWP_OFFSET_MASK(entry);
+ return entry.val & SWP_OFFSET_MASK;
}
#ifdef CONFIG_MMU
@@ -89,16 +87,13 @@ static inline swp_entry_t radix_to_swp_entry(void *arg)
{
swp_entry_t entry;
- entry.val = (unsigned long)arg >> RADIX_TREE_EXCEPTIONAL_SHIFT;
+ entry.val = xa_to_value(arg);
return entry;
}
static inline void *swp_to_radix_entry(swp_entry_t entry)
{
- unsigned long value;
-
- value = entry.val << RADIX_TREE_EXCEPTIONAL_SHIFT;
- return (void *)(value | RADIX_TREE_EXCEPTIONAL_ENTRY);
+ return xa_mk_value(entry.val);
}
#if IS_ENABLED(CONFIG_DEVICE_PRIVATE)
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index a5a933925b85..b1da8021b5fa 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -16,7 +16,73 @@
* GNU General Public License for more details.
*/
+/**
+ * An eXtensible Array is an array of entries indexed by an unsigned
+ * long. The XArray takes care of its own locking, using an irqsafe
+ * spinlock for operations that modify the XArray and RCU for operations
+ * which only read from the XArray.
+ *
+ * The XArray can store pointers which are aligned to a multiple of 4.
+ * This includes all objects allocated via SLAB, but you cannot store
+ * pointers to an arbitrary offset within an object. The XArray can also
+ * store data values between 0 and LONG_MAX by converting them into entries
+ * using xa_mk_value(). The XArray does not support storing IS_ERR()
+ * pointers; some conflict with data values and others conflict with
+ * entries the XArray uses for its own purposes.
+ *
+ * A freshly initialised XArray is full of NULL pointers. You can set the
+ * entry at any index by calling xa_store(), and get the value by calling
+ * xa_load(). There is no difference between an entry which has never been
+ * stored to and an entry which has most recently had NULL stored to it. You
+ * can conditionally update the value of an entry by calling xa_cmpxchg().
+ * Each entry which isn't NULL can be tagged with up to three bits of extra
+ * information, accessed through xa_get_tag(), xa_set_tag() and
+ * xa_clear_tag(). You can copy batches of entries out of the array by
+ * calling xa_get_entries() or xa_get_tagged(). You can iterate over
+ * non-NULL entries in the array by calling xa_find(), xa_next() or
+ * using the xa_for_each() iterator.
+ *
+ * There are two levels of API provided. Normal and Advanced.
+ * The advanced API is more flexible but has fewer safeguards.
+ */
#include <linux/spinlock.h>
+#include <linux/types.h>
+
+#define BITS_PER_XA_VALUE (BITS_PER_LONG - 1)
+
+/**
+ * xa_mk_value() - Create an XArray entry from a data value.
+ * @v: Value to store in XArray.
+ *
+ * Return: An entry suitable for storing in the XArray.
+ */
+static inline void *xa_mk_value(unsigned long v)
+{
+ WARN_ON((long)v < 0);
+ return (void *)((v << 1) | 1);
+}
+
+/**
+ * xa_to_value() - Get value stored in an XArray entry.
+ * @entry: XArray entry.
+ *
+ * Return: The value stored in the XArray entry.
+ */
+static inline unsigned long xa_to_value(void *entry)
+{
+ return (unsigned long)entry >> 1;
+}
+
+/**
+ * xa_is_value() - Determine if an entry is a data value.
+ * @entry: XArray entry.
+ *
+ * Return: True if the entry is a data value, false if it is a pointer.
+ */
+static inline bool xa_is_value(void *entry)
+{
+ return (unsigned long)entry & 1;
+}
#define xa_trylock(xa) spin_trylock(&(xa)->xa_lock)
#define xa_lock(xa) spin_lock(&(xa)->xa_lock)
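For reference, a minimal userspace-style sketch of the value encoding added
above; the helpers are re-implemented locally so the snippet stands alone, and
the main() harness is illustrative only, not part of the patch:

#include <assert.h>
#include <stdio.h>

/* Same scheme as xa_mk_value()/xa_to_value()/xa_is_value(): the value lives
 * in the upper BITS_PER_LONG - 1 bits and bit 0 marks "this is a value, not
 * a pointer".  Pointers aligned to 4 bytes always have bit 0 clear. */
static void *xa_mk_value(unsigned long v)     { return (void *)((v << 1) | 1); }
static unsigned long xa_to_value(void *entry) { return (unsigned long)entry >> 1; }
static int xa_is_value(void *entry)           { return (unsigned long)entry & 1; }

int main(void)
{
	void *entry = xa_mk_value(42);

	assert(xa_is_value(entry));
	assert(xa_to_value(entry) == 42);
	printf("entry %p encodes the value %lu\n", entry, xa_to_value(entry));
	return 0;
}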
diff --git a/lib/idr.c b/lib/idr.c
index 35678388e134..afcdff365037 100644
--- a/lib/idr.c
+++ b/lib/idr.c
@@ -3,6 +3,7 @@
#include <linux/idr.h>
#include <linux/slab.h>
#include <linux/spinlock.h>
+#include <linux/xarray.h>
DEFINE_PER_CPU(struct ida_bitmap *, ida_bitmap);
static DEFINE_SPINLOCK(simple_ida_lock);
@@ -271,11 +272,8 @@ EXPORT_SYMBOL(idr_replace);
* by the number of bits in the leaf bitmap before doing a radix tree lookup.
*
* As an optimisation, if there are only a few low bits set in any given
- * leaf, instead of allocating a 128-byte bitmap, we use the 'exceptional
- * entry' functionality of the radix tree to store BITS_PER_LONG - 2 bits
- * directly in the entry. By being really tricksy, we could store
- * BITS_PER_LONG - 1 bits, but there're diminishing returns after optimising
- * for 0-3 allocated IDs.
+ * leaf, instead of allocating a 128-byte bitmap, we store the data
+ * directly in the entry.
*
* We allow the radix tree 'exceptional' count to get out of date. Nothing
* in the IDA nor the radix tree code checks it. If it becomes important
@@ -317,12 +315,11 @@ int ida_get_new_above(struct ida *ida, int start, int *id)
struct radix_tree_iter iter;
struct ida_bitmap *bitmap;
unsigned long index;
- unsigned bit, ebit;
+ unsigned bit;
int new;
index = start / IDA_BITMAP_BITS;
bit = start % IDA_BITMAP_BITS;
- ebit = bit + RADIX_TREE_EXCEPTIONAL_SHIFT;
slot = radix_tree_iter_init(&iter, index);
for (;;) {
@@ -337,26 +334,25 @@ int ida_get_new_above(struct ida *ida, int start, int *id)
return PTR_ERR(slot);
}
}
- if (iter.index > index) {
+ if (iter.index > index)
bit = 0;
- ebit = RADIX_TREE_EXCEPTIONAL_SHIFT;
- }
new = iter.index * IDA_BITMAP_BITS;
bitmap = rcu_dereference_raw(*slot);
- if (radix_tree_exception(bitmap)) {
- unsigned long tmp = (unsigned long)bitmap;
- ebit = find_next_zero_bit(&tmp, BITS_PER_LONG, ebit);
- if (ebit < BITS_PER_LONG) {
- tmp |= 1UL << ebit;
- rcu_assign_pointer(*slot, (void *)tmp);
- *id = new + ebit - RADIX_TREE_EXCEPTIONAL_SHIFT;
+ if (xa_is_value(bitmap)) {
+ unsigned long tmp = xa_to_value(bitmap);
+ int vbit = find_next_zero_bit(&tmp, BITS_PER_XA_VALUE,
+ bit);
+ if (vbit < BITS_PER_XA_VALUE) {
+ tmp |= 1UL << vbit;
+ rcu_assign_pointer(*slot, xa_mk_value(tmp));
+ *id = new + vbit;
return 0;
}
bitmap = this_cpu_xchg(ida_bitmap, NULL);
if (!bitmap)
return -EAGAIN;
memset(bitmap, 0, sizeof(*bitmap));
- bitmap->bitmap[0] = tmp >> RADIX_TREE_EXCEPTIONAL_SHIFT;
+ bitmap->bitmap[0] = tmp;
rcu_assign_pointer(*slot, bitmap);
}
@@ -377,19 +373,15 @@ int ida_get_new_above(struct ida *ida, int start, int *id)
new += bit;
if (new < 0)
return -ENOSPC;
- if (ebit < BITS_PER_LONG) {
- bitmap = (void *)((1UL << ebit) |
- RADIX_TREE_EXCEPTIONAL_ENTRY);
- radix_tree_iter_replace(root, &iter, slot,
- bitmap);
- *id = new;
- return 0;
+ if (bit < BITS_PER_XA_VALUE) {
+ bitmap = xa_mk_value(1UL << bit);
+ } else {
+ bitmap = this_cpu_xchg(ida_bitmap, NULL);
+ if (!bitmap)
+ return -EAGAIN;
+ memset(bitmap, 0, sizeof(*bitmap));
+ __set_bit(bit, bitmap->bitmap);
}
- bitmap = this_cpu_xchg(ida_bitmap, NULL);
- if (!bitmap)
- return -EAGAIN;
- memset(bitmap, 0, sizeof(*bitmap));
- __set_bit(bit, bitmap->bitmap);
radix_tree_iter_replace(root, &iter, slot, bitmap);
}
@@ -420,9 +412,9 @@ void ida_remove(struct ida *ida, int id)
goto err;
bitmap = rcu_dereference_raw(*slot);
- if (radix_tree_exception(bitmap)) {
+ if (xa_is_value(bitmap)) {
btmp = (unsigned long *)slot;
- offset += RADIX_TREE_EXCEPTIONAL_SHIFT;
+ offset += 1; /* Intimate knowledge of the xa_mk_value() encoding */
if (offset >= BITS_PER_LONG)
goto err;
} else {
@@ -433,9 +425,8 @@ void ida_remove(struct ida *ida, int id)
__clear_bit(offset, btmp);
radix_tree_iter_tag_set(&ida->ida_rt, &iter, IDR_FREE);
- if (radix_tree_exception(bitmap)) {
- if (rcu_dereference_raw(*slot) ==
- (void *)RADIX_TREE_EXCEPTIONAL_ENTRY)
+ if (xa_is_value(bitmap)) {
+ if (xa_to_value(rcu_dereference_raw(*slot)) == 0)
radix_tree_iter_delete(&ida->ida_rt, &iter, slot);
} else if (bitmap_empty(btmp, IDA_BITMAP_BITS)) {
kfree(bitmap);
@@ -463,7 +454,7 @@ void ida_destroy(struct ida *ida)
radix_tree_for_each_slot(slot, &ida->ida_rt, &iter, 0) {
struct ida_bitmap *bitmap = rcu_dereference_raw(*slot);
- if (!radix_tree_exception(bitmap))
+ if (!xa_is_value(bitmap))
kfree(bitmap);
radix_tree_iter_delete(&ida->ida_rt, &iter, slot);
}
diff --git a/lib/radix-tree.c b/lib/radix-tree.c
index 6d29ca4c8db0..30e49b89aa3b 100644
--- a/lib/radix-tree.c
+++ b/lib/radix-tree.c
@@ -339,14 +339,12 @@ static void dump_ida_node(void *entry, unsigned long index)
for (i = 0; i < RADIX_TREE_MAP_SIZE; i++)
dump_ida_node(node->slots[i],
index | (i << node->shift));
- } else if (radix_tree_exceptional_entry(entry)) {
+ } else if (xa_is_value(entry)) {
pr_debug("ida excp: %p offset %d indices %lu-%lu data %lx\n",
entry, (int)(index & RADIX_TREE_MAP_MASK),
index * IDA_BITMAP_BITS,
- index * IDA_BITMAP_BITS + BITS_PER_LONG -
- RADIX_TREE_EXCEPTIONAL_SHIFT,
- (unsigned long)entry >>
- RADIX_TREE_EXCEPTIONAL_SHIFT);
+ index * IDA_BITMAP_BITS + BITS_PER_XA_VALUE,
+ xa_to_value(entry));
} else {
struct ida_bitmap *bitmap = entry;
@@ -655,7 +653,7 @@ static int radix_tree_extend(struct radix_tree_root *root, gfp_t gfp,
BUG_ON(shift > BITS_PER_LONG);
if (radix_tree_is_internal_node(entry)) {
entry_to_node(entry)->parent = node;
- } else if (radix_tree_exceptional_entry(entry)) {
+ } else if (xa_is_value(entry)) {
/* Moving an exceptional root->rnode to a node */
node->exceptional = 1;
}
@@ -946,12 +944,12 @@ static inline int insert_entries(struct radix_tree_node *node,
!is_sibling_entry(node, old) &&
(old != RADIX_TREE_RETRY))
radix_tree_free_nodes(old);
- if (radix_tree_exceptional_entry(old))
+ if (xa_is_value(old))
node->exceptional--;
}
if (node) {
node->count += n;
- if (radix_tree_exceptional_entry(item))
+ if (xa_is_value(item))
node->exceptional += n;
}
return n;
@@ -965,7 +963,7 @@ static inline int insert_entries(struct radix_tree_node *node,
rcu_assign_pointer(*slot, item);
if (node) {
node->count++;
- if (radix_tree_exceptional_entry(item))
+ if (xa_is_value(item))
node->exceptional++;
}
return 1;
@@ -1182,8 +1180,7 @@ void __radix_tree_replace(struct radix_tree_root *root,
radix_tree_update_node_t update_node)
{
void *old = rcu_dereference_raw(*slot);
- int exceptional = !!radix_tree_exceptional_entry(item) -
- !!radix_tree_exceptional_entry(old);
+ int exceptional = !!xa_is_value(item) - !!xa_is_value(old);
int count = calculate_count(root, node, slot, item, old);
/*
@@ -1986,7 +1983,7 @@ static bool __radix_tree_delete(struct radix_tree_root *root,
struct radix_tree_node *node, void __rcu **slot)
{
void *old = rcu_dereference_raw(*slot);
- int exceptional = radix_tree_exceptional_entry(old) ? -1 : 0;
+ int exceptional = xa_is_value(old) ? -1 : 0;
unsigned offset = get_slot_offset(node, slot);
int tag;
diff --git a/mm/filemap.c b/mm/filemap.c
index 5c8f22fe4e62..1d012dd3629e 100644
--- a/mm/filemap.c
+++ b/mm/filemap.c
@@ -127,7 +127,7 @@ static int page_cache_tree_insert(struct address_space *mapping,
void *p;
p = radix_tree_deref_slot_protected(slot, &mapping->pages.xa_lock);
- if (!radix_tree_exceptional_entry(p))
+ if (!xa_is_value(p))
return -EEXIST;
mapping->nrexceptional--;
@@ -336,7 +336,7 @@ page_cache_tree_delete_batch(struct address_space *mapping,
break;
page = radix_tree_deref_slot_protected(slot,
&mapping->pages.xa_lock);
- if (radix_tree_exceptional_entry(page))
+ if (xa_is_value(page))
continue;
if (!tail_pages) {
/*
@@ -1355,7 +1355,7 @@ pgoff_t page_cache_next_hole(struct address_space *mapping,
struct page *page;
page = radix_tree_lookup(&mapping->pages, index);
- if (!page || radix_tree_exceptional_entry(page))
+ if (!page || xa_is_value(page))
break;
index++;
if (index == 0)
@@ -1396,7 +1396,7 @@ pgoff_t page_cache_prev_hole(struct address_space *mapping,
struct page *page;
page = radix_tree_lookup(&mapping->pages, index);
- if (!page || radix_tree_exceptional_entry(page))
+ if (!page || xa_is_value(page))
break;
index--;
if (index == ULONG_MAX)
@@ -1539,7 +1539,7 @@ struct page *pagecache_get_page(struct address_space *mapping, pgoff_t offset,
repeat:
page = find_get_entry(mapping, offset);
- if (radix_tree_exceptional_entry(page))
+ if (xa_is_value(page))
page = NULL;
if (!page)
goto no_page;
diff --git a/mm/khugepaged.c b/mm/khugepaged.c
index cb4d199bf328..55ade70c33bb 100644
--- a/mm/khugepaged.c
+++ b/mm/khugepaged.c
@@ -1363,7 +1363,7 @@ static void collapse_shmem(struct mm_struct *mm,
page = radix_tree_deref_slot_protected(slot,
&mapping->pages.xa_lock);
- if (radix_tree_exceptional_entry(page) || !PageUptodate(page)) {
+ if (xa_is_value(page) || !PageUptodate(page)) {
xa_unlock_irq(&mapping->pages);
/* swap in or instantiate fallocated page */
if (shmem_getpage(mapping->host, index, &page,
diff --git a/mm/madvise.c b/mm/madvise.c
index 375cf32087e4..85f5d4f66cdd 100644
--- a/mm/madvise.c
+++ b/mm/madvise.c
@@ -251,7 +251,7 @@ static void force_shm_swapin_readahead(struct vm_area_struct *vma,
index = ((start - vma->vm_start) >> PAGE_SHIFT) + vma->vm_pgoff;
page = find_get_entry(mapping, index);
- if (!radix_tree_exceptional_entry(page)) {
+ if (!xa_is_value(page)) {
if (page)
put_page(page);
continue;
diff --git a/mm/memcontrol.c b/mm/memcontrol.c
index 953dfed9e780..1b8dc671e32c 100644
--- a/mm/memcontrol.c
+++ b/mm/memcontrol.c
@@ -4526,7 +4526,7 @@ static struct page *mc_handle_file_pte(struct vm_area_struct *vma,
/* shmem/tmpfs may report page out on swap: account for that too. */
if (shmem_mapping(mapping)) {
page = find_get_entry(mapping, pgoff);
- if (radix_tree_exceptional_entry(page)) {
+ if (xa_is_value(page)) {
swp_entry_t swp = radix_to_swp_entry(page);
if (do_memsw_account())
*entry = swp;
diff --git a/mm/mincore.c b/mm/mincore.c
index fc37afe226e6..4985965aa20a 100644
--- a/mm/mincore.c
+++ b/mm/mincore.c
@@ -66,7 +66,7 @@ static unsigned char mincore_page(struct address_space *mapping, pgoff_t pgoff)
* shmem/tmpfs may return swap: account for swapcache
* page too.
*/
- if (radix_tree_exceptional_entry(page)) {
+ if (xa_is_value(page)) {
swp_entry_t swp = radix_to_swp_entry(page);
page = find_get_page(swap_address_space(swp),
swp_offset(swp));
diff --git a/mm/readahead.c b/mm/readahead.c
index 514188fd2489..4851f002605f 100644
--- a/mm/readahead.c
+++ b/mm/readahead.c
@@ -177,7 +177,7 @@ int __do_page_cache_readahead(struct address_space *mapping, struct file *filp,
rcu_read_lock();
page = radix_tree_lookup(&mapping->pages, page_offset);
rcu_read_unlock();
- if (page && !radix_tree_exceptional_entry(page))
+ if (page && !xa_is_value(page))
continue;
page = __page_cache_alloc(gfp_mask);
diff --git a/mm/shmem.c b/mm/shmem.c
index b1396ed0c9b1..21bf42f14ee2 100644
--- a/mm/shmem.c
+++ b/mm/shmem.c
@@ -690,7 +690,7 @@ unsigned long shmem_partial_swap_usage(struct address_space *mapping,
continue;
}
- if (radix_tree_exceptional_entry(page))
+ if (xa_is_value(page))
swapped++;
if (need_resched()) {
@@ -805,7 +805,7 @@ static void shmem_undo_range(struct inode *inode, loff_t lstart, loff_t lend,
if (index >= end)
break;
- if (radix_tree_exceptional_entry(page)) {
+ if (xa_is_value(page)) {
if (unfalloc)
continue;
nr_swaps_freed += !shmem_free_swap(mapping,
@@ -902,7 +902,7 @@ static void shmem_undo_range(struct inode *inode, loff_t lstart, loff_t lend,
if (index >= end)
break;
- if (radix_tree_exceptional_entry(page)) {
+ if (xa_is_value(page)) {
if (unfalloc)
continue;
if (shmem_free_swap(mapping, index, page)) {
@@ -1614,7 +1614,7 @@ static int shmem_getpage_gfp(struct inode *inode, pgoff_t index,
repeat:
swap.val = 0;
page = find_lock_entry(mapping, index);
- if (radix_tree_exceptional_entry(page)) {
+ if (xa_is_value(page)) {
swap = radix_to_swp_entry(page);
page = NULL;
}
@@ -2547,7 +2547,7 @@ static pgoff_t shmem_seek_hole_data(struct address_space *mapping,
index = indices[i];
}
page = pvec.pages[i];
- if (page && !radix_tree_exceptional_entry(page)) {
+ if (page && !xa_is_value(page)) {
if (!PageUptodate(page))
page = NULL;
}
diff --git a/mm/swap.c b/mm/swap.c
index 38e1b6374a97..8d7773cb2c3f 100644
--- a/mm/swap.c
+++ b/mm/swap.c
@@ -953,7 +953,7 @@ void pagevec_remove_exceptionals(struct pagevec *pvec)
for (i = 0, j = 0; i < pagevec_count(pvec); i++) {
struct page *page = pvec->pages[i];
- if (!radix_tree_exceptional_entry(page))
+ if (!xa_is_value(page))
pvec->pages[j++] = page;
}
pvec->nr = j;
diff --git a/mm/truncate.c b/mm/truncate.c
index 094158f2e447..69bb743dd7e5 100644
--- a/mm/truncate.c
+++ b/mm/truncate.c
@@ -70,7 +70,7 @@ static void truncate_exceptional_pvec_entries(struct address_space *mapping,
return;
for (j = 0; j < pagevec_count(pvec); j++)
- if (radix_tree_exceptional_entry(pvec->pages[j]))
+ if (xa_is_value(pvec->pages[j]))
break;
if (j == pagevec_count(pvec))
@@ -85,7 +85,7 @@ static void truncate_exceptional_pvec_entries(struct address_space *mapping,
struct page *page = pvec->pages[i];
pgoff_t index = indices[i];
- if (!radix_tree_exceptional_entry(page)) {
+ if (!xa_is_value(page)) {
pvec->pages[j++] = page;
continue;
}
@@ -351,7 +351,7 @@ void truncate_inode_pages_range(struct address_space *mapping,
if (index >= end)
break;
- if (radix_tree_exceptional_entry(page))
+ if (xa_is_value(page))
continue;
if (!trylock_page(page))
@@ -446,7 +446,7 @@ void truncate_inode_pages_range(struct address_space *mapping,
break;
}
- if (radix_tree_exceptional_entry(page))
+ if (xa_is_value(page))
continue;
lock_page(page);
@@ -565,7 +565,7 @@ unsigned long invalidate_mapping_pages(struct address_space *mapping,
if (index > end)
break;
- if (radix_tree_exceptional_entry(page)) {
+ if (xa_is_value(page)) {
invalidate_exceptional_entry(mapping, index,
page);
continue;
@@ -696,7 +696,7 @@ int invalidate_inode_pages2_range(struct address_space *mapping,
if (index > end)
break;
- if (radix_tree_exceptional_entry(page)) {
+ if (xa_is_value(page)) {
if (!invalidate_exceptional_entry2(mapping,
index, page))
ret = -EBUSY;
diff --git a/mm/workingset.c b/mm/workingset.c
index 2d071f0df3af..0a3465700d5f 100644
--- a/mm/workingset.c
+++ b/mm/workingset.c
@@ -155,8 +155,8 @@
* refault distance will immediately activate the refaulting page.
*/
-#define EVICTION_SHIFT (RADIX_TREE_EXCEPTIONAL_ENTRY + \
- NODES_SHIFT + \
+#define EVICTION_SHIFT ((BITS_PER_LONG - BITS_PER_XA_VALUE) + \
+ NODES_SHIFT + \
MEM_CGROUP_ID_SHIFT)
#define EVICTION_MASK (~0UL >> EVICTION_SHIFT)
@@ -175,18 +175,16 @@ static void *pack_shadow(int memcgid, pg_data_t *pgdat, unsigned long eviction)
eviction >>= bucket_order;
eviction = (eviction << MEM_CGROUP_ID_SHIFT) | memcgid;
eviction = (eviction << NODES_SHIFT) | pgdat->node_id;
- eviction = (eviction << RADIX_TREE_EXCEPTIONAL_SHIFT);
- return (void *)(eviction | RADIX_TREE_EXCEPTIONAL_ENTRY);
+ return xa_mk_value(eviction);
}
static void unpack_shadow(void *shadow, int *memcgidp, pg_data_t **pgdat,
unsigned long *evictionp)
{
- unsigned long entry = (unsigned long)shadow;
+ unsigned long entry = xa_to_value(shadow);
int memcgid, nid;
- entry >>= RADIX_TREE_EXCEPTIONAL_SHIFT;
nid = entry & ((1UL << NODES_SHIFT) - 1);
entry >>= NODES_SHIFT;
memcgid = entry & ((1UL << MEM_CGROUP_ID_SHIFT) - 1);
@@ -453,7 +451,7 @@ static enum lru_status shadow_lru_isolate(struct list_head *item,
goto out_invalid;
for (i = 0; i < RADIX_TREE_MAP_SIZE; i++) {
if (node->slots[i]) {
- if (WARN_ON_ONCE(!radix_tree_exceptional_entry(node->slots[i])))
+ if (WARN_ON_ONCE(!xa_is_value(node->slots[i])))
goto out_invalid;
if (WARN_ON_ONCE(!node->exceptional))
goto out_invalid;
diff --git a/tools/testing/radix-tree/idr-test.c b/tools/testing/radix-tree/idr-test.c
index 1dff94c15da5..1720bd90ece0 100644
--- a/tools/testing/radix-tree/idr-test.c
+++ b/tools/testing/radix-tree/idr-test.c
@@ -19,7 +19,7 @@
#include "test.h"
-#define DUMMY_PTR ((void *)0x12)
+#define DUMMY_PTR ((void *)0x10)
int item_idr_free(int id, void *p, void *data)
{
@@ -332,11 +332,11 @@ void ida_check_conv(void)
for (i = 0; i < 1000000; i++) {
int err = ida_get_new(&ida, &id);
if (err == -EAGAIN) {
- assert((i % IDA_BITMAP_BITS) == (BITS_PER_LONG - 2));
+ assert((i % IDA_BITMAP_BITS) == (BITS_PER_LONG - 1));
assert(ida_pre_get(&ida, GFP_KERNEL));
err = ida_get_new(&ida, &id);
} else {
- assert((i % IDA_BITMAP_BITS) != (BITS_PER_LONG - 2));
+ assert((i % IDA_BITMAP_BITS) != (BITS_PER_LONG - 1));
}
assert(!err);
assert(id == i);
diff --git a/tools/testing/radix-tree/linux/radix-tree.h b/tools/testing/radix-tree/linux/radix-tree.h
index 24f13d27a8da..de3f655caca3 100644
--- a/tools/testing/radix-tree/linux/radix-tree.h
+++ b/tools/testing/radix-tree/linux/radix-tree.h
@@ -4,6 +4,7 @@
#include "generated/map-shift.h"
#include "../../../../include/linux/radix-tree.h"
+#include <linux/xarray.h>
extern int kmalloc_verbose;
extern int test_verbose;
diff --git a/tools/testing/radix-tree/multiorder.c b/tools/testing/radix-tree/multiorder.c
index 59245b3d587c..684e76f79f4a 100644
--- a/tools/testing/radix-tree/multiorder.c
+++ b/tools/testing/radix-tree/multiorder.c
@@ -38,12 +38,11 @@ static void __multiorder_tag_test(int index, int order)
/*
* Verify we get collisions for covered indices. We try and fail to
- * insert an exceptional entry so we don't leak memory via
+ * insert a data entry so we don't leak memory via
* item_insert_order().
*/
for_each_index(i, base, order) {
- err = __radix_tree_insert(&tree, i, order,
- (void *)(0xA0 | RADIX_TREE_EXCEPTIONAL_ENTRY));
+ err = __radix_tree_insert(&tree, i, order, xa_mk_value(0xA0));
assert(err == -EEXIST);
}
@@ -379,8 +378,8 @@ static void multiorder_join1(unsigned long index,
}
/*
- * Check that the accounting of exceptional entries is handled correctly
- * by joining an exceptional entry to a normal pointer.
+ * Check that the accounting of inline data entries is handled correctly
+ * by joining a data entry to a normal pointer.
*/
static void multiorder_join2(unsigned order1, unsigned order2)
{
@@ -390,9 +389,9 @@ static void multiorder_join2(unsigned order1, unsigned order2)
void *item2;
item_insert_order(&tree, 0, order2);
- radix_tree_insert(&tree, 1 << order2, (void *)0x12UL);
+ radix_tree_insert(&tree, 1 << order2, xa_mk_value(5));
item2 = __radix_tree_lookup(&tree, 1 << order2, &node, NULL);
- assert(item2 == (void *)0x12UL);
+ assert(item2 == xa_mk_value(5));
assert(node->exceptional == 1);
item2 = radix_tree_lookup(&tree, 0);
@@ -406,7 +405,7 @@ static void multiorder_join2(unsigned order1, unsigned order2)
}
/*
- * This test revealed an accounting bug for exceptional entries at one point.
+ * This test revealed an accounting bug for inline data entries at one point.
* Nodes were being freed back into the pool with an elevated exception count
* by radix_tree_join() and then radix_tree_split() was failing to zero the
* count of exceptional entries.
@@ -420,16 +419,16 @@ static void multiorder_join3(unsigned int order)
unsigned long i;
for (i = 0; i < (1 << order); i++) {
- radix_tree_insert(&tree, i, (void *)0x12UL);
+ radix_tree_insert(&tree, i, xa_mk_value(5));
}
- radix_tree_join(&tree, 0, order, (void *)0x16UL);
+ radix_tree_join(&tree, 0, order, xa_mk_value(7));
rcu_barrier();
radix_tree_split(&tree, 0, 0);
radix_tree_for_each_slot(slot, &tree, &iter, 0) {
- radix_tree_iter_replace(&tree, &iter, slot, (void *)0x12UL);
+ radix_tree_iter_replace(&tree, &iter, slot, xa_mk_value(5));
}
__radix_tree_lookup(&tree, 0, &node, NULL);
@@ -516,10 +515,10 @@ static void __multiorder_split2(int old_order, int new_order)
struct radix_tree_node *node;
void *item;
- __radix_tree_insert(&tree, 0, old_order, (void *)0x12);
+ __radix_tree_insert(&tree, 0, old_order, xa_mk_value(5));
item = __radix_tree_lookup(&tree, 0, &node, NULL);
- assert(item == (void *)0x12);
+ assert(item == xa_mk_value(5));
assert(node->exceptional > 0);
radix_tree_split(&tree, 0, new_order);
@@ -529,7 +528,7 @@ static void __multiorder_split2(int old_order, int new_order)
}
item = __radix_tree_lookup(&tree, 0, &node, NULL);
- assert(item != (void *)0x12);
+ assert(item != xa_mk_value(5));
assert(node->exceptional == 0);
item_kill_tree(&tree);
@@ -543,40 +542,40 @@ static void __multiorder_split3(int old_order, int new_order)
struct radix_tree_node *node;
void *item;
- __radix_tree_insert(&tree, 0, old_order, (void *)0x12);
+ __radix_tree_insert(&tree, 0, old_order, xa_mk_value(5));
item = __radix_tree_lookup(&tree, 0, &node, NULL);
- assert(item == (void *)0x12);
+ assert(item == xa_mk_value(5));
assert(node->exceptional > 0);
radix_tree_split(&tree, 0, new_order);
radix_tree_for_each_slot(slot, &tree, &iter, 0) {
- radix_tree_iter_replace(&tree, &iter, slot, (void *)0x16);
+ radix_tree_iter_replace(&tree, &iter, slot, xa_mk_value(7));
}
item = __radix_tree_lookup(&tree, 0, &node, NULL);
- assert(item == (void *)0x16);
+ assert(item == xa_mk_value(7));
assert(node->exceptional > 0);
item_kill_tree(&tree);
- __radix_tree_insert(&tree, 0, old_order, (void *)0x12);
+ __radix_tree_insert(&tree, 0, old_order, xa_mk_value(5));
item = __radix_tree_lookup(&tree, 0, &node, NULL);
- assert(item == (void *)0x12);
+ assert(item == xa_mk_value(5));
assert(node->exceptional > 0);
radix_tree_split(&tree, 0, new_order);
radix_tree_for_each_slot(slot, &tree, &iter, 0) {
if (iter.index == (1 << new_order))
radix_tree_iter_replace(&tree, &iter, slot,
- (void *)0x16);
+ xa_mk_value(7));
else
radix_tree_iter_replace(&tree, &iter, slot, NULL);
}
item = __radix_tree_lookup(&tree, 1 << new_order, &node, NULL);
- assert(item == (void *)0x16);
+ assert(item == xa_mk_value(7));
assert(node->count == node->exceptional);
do {
node = node->parent;
@@ -609,13 +608,13 @@ static void multiorder_account(void)
item_insert_order(&tree, 0, 5);
- __radix_tree_insert(&tree, 1 << 5, 5, (void *)0x12);
+ __radix_tree_insert(&tree, 1 << 5, 5, xa_mk_value(5));
__radix_tree_lookup(&tree, 0, &node, NULL);
assert(node->count == node->exceptional * 2);
radix_tree_delete(&tree, 1 << 5);
assert(node->exceptional == 0);
- __radix_tree_insert(&tree, 1 << 5, 5, (void *)0x12);
+ __radix_tree_insert(&tree, 1 << 5, 5, xa_mk_value(5));
__radix_tree_lookup(&tree, 1 << 5, &node, &slot);
assert(node->count == node->exceptional * 2);
__radix_tree_replace(&tree, node, slot, NULL, NULL);
diff --git a/tools/testing/radix-tree/test.c b/tools/testing/radix-tree/test.c
index 5978ab1f403d..0d69c49177c6 100644
--- a/tools/testing/radix-tree/test.c
+++ b/tools/testing/radix-tree/test.c
@@ -276,7 +276,7 @@ void item_kill_tree(struct radix_tree_root *root)
int nfound;
radix_tree_for_each_slot(slot, root, &iter, 0) {
- if (radix_tree_exceptional_entry(*slot))
+ if (xa_is_value(*slot))
radix_tree_delete(root, iter.index);
}
--
2.15.0
From: Matthew Wilcox <[email protected]>
XArray tags are slightly more strongly typed than the radix tree tags,
but occupy the same bits. This commit also adds the xas_ family of tag
operations, for cases where the caller is already holding the lock, and
xa_tagged() to ask whether any array member has a particular tag set.
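As a rough usage sketch of the normal tag API (hypothetical caller; xa_store()
comes from an earlier patch in the series and its exact signature, including
the GFP argument, is assumed here):

#include <linux/printk.h>
#include <linux/xarray.h>

static void tag_example(struct xarray *xa, void *item)
{
	xa_store(xa, 5, item, GFP_KERNEL);	/* tags only apply to non-NULL entries */
	xa_set_tag(xa, 5, XA_TAG_0);		/* also tags the ancestor nodes */

	if (xa_get_tag(xa, 5, XA_TAG_0))
		pr_debug("index 5 carries XA_TAG_0\n");

	if (xa_tagged(xa, XA_TAG_0))		/* is the tag set anywhere in the array? */
		xa_clear_tag(xa, 5, XA_TAG_0);
}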
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/xarray.h | 38 +++++++-
lib/radix-tree.c | 52 +++++-----
lib/xarray.c | 250 +++++++++++++++++++++++++++++++++++++++++++++++++
3 files changed, 313 insertions(+), 27 deletions(-)
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index 0e736d2db049..ab6b1f5e685a 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -48,6 +48,7 @@
#include <linux/bug.h>
#include <linux/compiler.h>
+#include <linux/gfp.h>
#include <linux/kernel.h>
#include <linux/rcupdate.h>
#include <linux/spinlock.h>
@@ -82,6 +83,33 @@ struct xarray {
void *xa_load(struct xarray *, unsigned long index);
+typedef unsigned __bitwise xa_tag_t;
+#define XA_TAG_0 ((__force xa_tag_t)0U)
+#define XA_TAG_1 ((__force xa_tag_t)1U)
+#define XA_TAG_2 ((__force xa_tag_t)2U)
+#define XA_NO_TAG ((__force xa_tag_t)4U)
+
+#define XA_TAG_MAX XA_TAG_2
+#define XA_FREE_TAG XA_TAG_0
+#define XA_FLAGS_TAG(tag) ((__force gfp_t)((2U << __GFP_BITS_SHIFT) << \
+ (__force unsigned)(tag)))
+
+/**
+ * xa_tagged() - Inquire whether any entry in this array has a tag set
+ * @xa: Array
+ * @tag: Tag value
+ *
+ * Return: True if any entry has this tag set, false if no entry does.
+ */
+static inline bool xa_tagged(const struct xarray *xa, xa_tag_t tag)
+{
+ return xa->xa_flags & XA_FLAGS_TAG(tag);
+}
+
+bool xa_get_tag(struct xarray *, unsigned long index, xa_tag_t);
+void *xa_set_tag(struct xarray *, unsigned long index, xa_tag_t);
+void *xa_clear_tag(struct xarray *, unsigned long index, xa_tag_t);
+
#define BITS_PER_XA_VALUE (BITS_PER_LONG - 1)
/**
@@ -133,6 +161,10 @@ static inline bool xa_is_value(void *entry)
spin_unlock_irqrestore(&(xa)->xa_lock, flags)
#define xa_lock_held(xa) lockdep_is_held(&(xa)->xa_lock)
+/* Versions of the normal API which require the caller to hold the xa_lock */
+void *__xa_set_tag(struct xarray *, unsigned long index, xa_tag_t);
+void *__xa_clear_tag(struct xarray *, unsigned long index, xa_tag_t);
+
#ifdef XA_DEBUG
void xa_dump(const struct xarray *);
void xa_dump_node(const struct xa_node *);
@@ -175,7 +207,7 @@ struct xa_node {
unsigned char offset; /* Slot offset in parent */
unsigned char count; /* Total entry count */
unsigned char exceptional; /* Exceptional entry count */
- struct xa_node *parent; /* Used when ascending tree */
+ struct xa_node __rcu *parent; /* Used when ascending tree */
struct xarray * root; /* The tree we belong to */
union {
struct list_head private_list; /* For tree user */
@@ -355,6 +387,10 @@ static inline bool xas_retry(struct xa_state *xas, void *entry)
void *xas_load(struct xarray *, struct xa_state *);
+bool xas_get_tag(const struct xarray *, const struct xa_state *, xa_tag_t);
+void xas_set_tag(struct xarray *, const struct xa_state *, xa_tag_t);
+void xas_clear_tag(struct xarray *, const struct xa_state *, xa_tag_t);
+
/**
* xas_reload() - Refetch an entry from the xarray.
* @xa: XArray.
diff --git a/lib/radix-tree.c b/lib/radix-tree.c
index 930eb7d298d7..711a6d9b79fc 100644
--- a/lib/radix-tree.c
+++ b/lib/radix-tree.c
@@ -126,19 +126,19 @@ static inline gfp_t root_gfp_mask(const struct radix_tree_root *root)
return root->xa_flags & __GFP_BITS_MASK;
}
-static inline void tag_set(struct radix_tree_node *node, unsigned int tag,
+static inline void rtag_set(struct radix_tree_node *node, unsigned int tag,
int offset)
{
__set_bit(offset, node->tags[tag]);
}
-static inline void tag_clear(struct radix_tree_node *node, unsigned int tag,
+static inline void rtag_clear(struct radix_tree_node *node, unsigned int tag,
int offset)
{
__clear_bit(offset, node->tags[tag]);
}
-static inline int tag_get(const struct radix_tree_node *node, unsigned int tag,
+static inline int rtag_get(const struct radix_tree_node *node, unsigned int tag,
int offset)
{
return test_bit(offset, node->tags[tag]);
@@ -617,14 +617,14 @@ static int radix_tree_extend(struct radix_tree_root *root, gfp_t gfp,
if (is_idr(root)) {
all_tag_set(node, IDR_FREE);
if (!root_tag_get(root, IDR_FREE)) {
- tag_clear(node, IDR_FREE, 0);
+ rtag_clear(node, IDR_FREE, 0);
root_tag_set(root, IDR_FREE);
}
} else {
/* Propagate the aggregated tag info to the new child */
for (tag = 0; tag < RADIX_TREE_MAX_TAGS; tag++) {
if (root_tag_get(root, tag))
- tag_set(node, tag, 0);
+ rtag_set(node, tag, 0);
}
}
@@ -689,7 +689,7 @@ static inline bool radix_tree_shrink(struct radix_tree_root *root,
* one (root->xa_head) as far as dependent read barriers go.
*/
root->xa_head = (void __rcu *)child;
- if (is_idr(root) && !tag_get(node, IDR_FREE, 0))
+ if (is_idr(root) && !rtag_get(node, IDR_FREE, 0))
root_tag_clear(root, IDR_FREE);
/*
@@ -896,7 +896,7 @@ static inline int insert_entries(struct radix_tree_node *node,
if (replace) {
node->count--;
for (tag = 0; tag < RADIX_TREE_MAX_TAGS; tag++)
- if (tag_get(node, tag, offset + i))
+ if (rtag_get(node, tag, offset + i))
tags |= 1 << tag;
} else
return -EEXIST;
@@ -909,12 +909,12 @@ static inline int insert_entries(struct radix_tree_node *node,
rcu_assign_pointer(slot[i], sibling);
for (tag = 0; tag < RADIX_TREE_MAX_TAGS; tag++)
if (tags & (1 << tag))
- tag_clear(node, tag, offset + i);
+ rtag_clear(node, tag, offset + i);
} else {
rcu_assign_pointer(slot[i], item);
for (tag = 0; tag < RADIX_TREE_MAX_TAGS; tag++)
if (tags & (1 << tag))
- tag_set(node, tag, offset);
+ rtag_set(node, tag, offset);
}
if (xa_is_node(old))
radix_tree_free_nodes(old);
@@ -972,9 +972,9 @@ int __radix_tree_insert(struct radix_tree_root *root, unsigned long index,
if (node) {
unsigned offset = get_slot_offset(node, slot);
- BUG_ON(tag_get(node, 0, offset));
- BUG_ON(tag_get(node, 1, offset));
- BUG_ON(tag_get(node, 2, offset));
+ BUG_ON(rtag_get(node, 0, offset));
+ BUG_ON(rtag_get(node, 1, offset));
+ BUG_ON(rtag_get(node, 2, offset));
} else {
BUG_ON(root_tags_get(root));
}
@@ -1110,7 +1110,7 @@ static bool node_tag_get(const struct radix_tree_root *root,
unsigned int tag, unsigned int offset)
{
if (node)
- return tag_get(node, tag, offset);
+ return rtag_get(node, tag, offset);
return root_tag_get(root, tag);
}
@@ -1280,7 +1280,7 @@ int radix_tree_split(struct radix_tree_root *root, unsigned long index,
offset = get_slot_offset(parent, slot);
for (tag = 0; tag < RADIX_TREE_MAX_TAGS; tag++)
- if (tag_get(parent, tag, offset))
+ if (rtag_get(parent, tag, offset))
tags |= 1 << tag;
for (end = offset + 1; end < RADIX_TREE_MAP_SIZE; end++) {
@@ -1288,7 +1288,7 @@ int radix_tree_split(struct radix_tree_root *root, unsigned long index,
break;
for (tag = 0; tag < RADIX_TREE_MAX_TAGS; tag++)
if (tags & (1 << tag))
- tag_set(parent, tag, end);
+ rtag_set(parent, tag, end);
/* rcu_assign_pointer ensures tags are set before RETRY */
rcu_assign_pointer(parent->slots[end], RADIX_TREE_RETRY);
}
@@ -1319,7 +1319,7 @@ int radix_tree_split(struct radix_tree_root *root, unsigned long index,
node_to_entry(child));
for (tag = 0; tag < RADIX_TREE_MAX_TAGS; tag++)
if (tags & (1 << tag))
- tag_set(node, tag, offset);
+ rtag_set(node, tag, offset);
}
node = child;
@@ -1333,7 +1333,7 @@ int radix_tree_split(struct radix_tree_root *root, unsigned long index,
for (tag = 0; tag < RADIX_TREE_MAX_TAGS; tag++)
if (tags & (1 << tag))
- tag_set(node, tag, offset);
+ rtag_set(node, tag, offset);
offset += n;
while (offset == RADIX_TREE_MAP_SIZE) {
@@ -1363,9 +1363,9 @@ static void node_tag_set(struct radix_tree_root *root,
unsigned int tag, unsigned int offset)
{
while (node) {
- if (tag_get(node, tag, offset))
+ if (rtag_get(node, tag, offset))
return;
- tag_set(node, tag, offset);
+ rtag_set(node, tag, offset);
offset = node->offset;
node = node->parent;
}
@@ -1403,8 +1403,8 @@ void *radix_tree_tag_set(struct radix_tree_root *root,
offset = radix_tree_descend(parent, &node, index);
BUG_ON(!node);
- if (!tag_get(parent, tag, offset))
- tag_set(parent, tag, offset);
+ if (!rtag_get(parent, tag, offset))
+ rtag_set(parent, tag, offset);
}
/* set the root's tag bit */
@@ -1432,9 +1432,9 @@ static void node_tag_clear(struct radix_tree_root *root,
unsigned int tag, unsigned int offset)
{
while (node) {
- if (!tag_get(node, tag, offset))
+ if (!rtag_get(node, tag, offset))
return;
- tag_clear(node, tag, offset);
+ rtag_clear(node, tag, offset);
if (any_tag_set(node, tag))
return;
@@ -1532,7 +1532,7 @@ int radix_tree_tag_get(const struct radix_tree_root *root,
parent = entry_to_node(node);
offset = radix_tree_descend(parent, &node, index);
- if (!tag_get(parent, tag, offset))
+ if (!rtag_get(parent, tag, offset))
return 0;
if (node == RADIX_TREE_RETRY)
break;
@@ -1721,7 +1721,7 @@ void __rcu **radix_tree_next_chunk(const struct radix_tree_root *root,
offset = radix_tree_descend(node, &child, index);
if ((flags & RADIX_TREE_ITER_TAGGED) ?
- !tag_get(node, tag, offset) : !child) {
+ !rtag_get(node, tag, offset) : !child) {
/* Hole detected */
if (flags & RADIX_TREE_ITER_CONTIG)
return NULL;
@@ -2143,7 +2143,7 @@ void __rcu **idr_get_free(struct radix_tree_root *root,
node = entry_to_node(child);
offset = radix_tree_descend(node, &child, start);
- if (!tag_get(node, IDR_FREE, offset)) {
+ if (!rtag_get(node, IDR_FREE, offset)) {
offset = radix_tree_find_next_bit(node, IDR_FREE,
offset + 1);
start = next_index(start, node, offset);
diff --git a/lib/xarray.c b/lib/xarray.c
index 1f7d30a8b61f..fbc7de5a224f 100644
--- a/lib/xarray.c
+++ b/lib/xarray.c
@@ -33,6 +33,53 @@
* @entry refers to something stored in a slot in the xarray
*/
+static inline struct xa_node *xa_parent(struct xarray *xa,
+ const struct xa_node *node)
+{
+ return rcu_dereference_check(node->parent, xa_lock_held(xa));
+}
+
+static inline struct xa_node *xa_parent_locked(struct xarray *xa,
+ const struct xa_node *node)
+{
+ return rcu_dereference_protected(node->parent, xa_lock_held(xa));
+}
+
+static inline void xa_tag_set(struct xarray *xa, xa_tag_t tag)
+{
+ if (!(xa->xa_flags & XA_FLAGS_TAG(tag)))
+ xa->xa_flags |= XA_FLAGS_TAG(tag);
+}
+
+static inline void xa_tag_clear(struct xarray *xa, xa_tag_t tag)
+{
+ if (xa->xa_flags & XA_FLAGS_TAG(tag))
+ xa->xa_flags &= ~(XA_FLAGS_TAG(tag));
+}
+
+static inline bool tag_get(const struct xa_node *node, unsigned int offset,
+ xa_tag_t tag)
+{
+ return test_bit(offset, node->tags[(__force unsigned)tag]);
+}
+
+static inline void tag_set(struct xa_node *node, unsigned int offset,
+ xa_tag_t tag)
+{
+ __set_bit(offset, node->tags[(__force unsigned)tag]);
+}
+
+static inline void tag_clear(struct xa_node *node, unsigned int offset,
+ xa_tag_t tag)
+{
+ __clear_bit(offset, node->tags[(__force unsigned)tag]);
+}
+
+static inline bool tag_any_set(struct xa_node *node, xa_tag_t tag)
+{
+ return !bitmap_empty(node->tags[(__force unsigned)tag], XA_CHUNK_SIZE);
+}
+
/* extracts the offset within this node from the index */
static unsigned int get_offset(unsigned long index, struct xa_node *node)
{
@@ -114,6 +161,89 @@ void *xas_load(struct xarray *xa, struct xa_state *xas)
}
EXPORT_SYMBOL_GPL(xas_load);
+/**
+ * xas_get_tag() - Returns the state of this tag.
+ * @xa: XArray.
+ * @xas: XArray operation state.
+ * @tag: Tag number.
+ *
+ * Return: true if the tag is set, false if the tag is clear or @xas
+ * is in an error state.
+ */
+bool xas_get_tag(const struct xarray *xa, const struct xa_state *xas,
+ xa_tag_t tag)
+{
+ if (xas_invalid(xas))
+ return false;
+ if (!xas->xa_node)
+ return xa_tagged(xa, tag);
+ return tag_get(xas->xa_node, xas->xa_offset, tag);
+}
+EXPORT_SYMBOL_GPL(xas_get_tag);
+
+/**
+ * xas_set_tag() - Sets the tag on this entry and its parents.
+ * @xa: XArray.
+ * @xas: XArray operation state.
+ * @tag: Tag number.
+ *
+ * Sets the specified tag on this entry, and walks up the tree setting it
+ * on all the ancestor entries. Does nothing if @xas has not been walked to
+ * an entry, or is in an error state.
+ */
+void xas_set_tag(struct xarray *xa, const struct xa_state *xas, xa_tag_t tag)
+{
+ struct xa_node *node = xas->xa_node;
+ unsigned int offset = xas->xa_offset;
+
+ if (xas_invalid(xas))
+ return;
+
+ while (node) {
+ if (tag_get(node, offset, tag))
+ return;
+ tag_set(node, offset, tag);
+ offset = node->offset;
+ node = xa_parent_locked(xa, node);
+ }
+
+ if (!xa_tagged(xa, tag))
+ xa_tag_set(xa, tag);
+}
+EXPORT_SYMBOL_GPL(xas_set_tag);
+
+/**
+ * xas_clear_tag() - Clears the tag on this entry and its parents.
+ * @xa: XArray.
+ * @xas: XArray operation state.
+ * @tag: Tag number.
+ *
+ * Clears the specified tag on this entry, and walks back to the head
+ * attempting to clear it on all the ancestor entries. Does nothing if
+ * @xas has not been walked to an entry, or is in an error state.
+ */
+void xas_clear_tag(struct xarray *xa, const struct xa_state *xas, xa_tag_t tag)
+{
+ struct xa_node *node = xas->xa_node;
+ unsigned int offset = xas->xa_offset;
+
+ if (xas_invalid(xas))
+ return;
+
+ while (node) {
+ tag_clear(node, offset, tag);
+ if (tag_any_set(node, tag))
+ return;
+
+ offset = node->offset;
+ node = xa_parent_locked(xa, node);
+ }
+
+ if (xa_tagged(xa, tag))
+ xa_tag_clear(xa, tag);
+}
+EXPORT_SYMBOL_GPL(xas_clear_tag);
+
/**
* xa_load() - Load an entry from an XArray.
* @xa: XArray.
@@ -135,3 +265,123 @@ void *xa_load(struct xarray *xa, unsigned long index)
return entry;
}
EXPORT_SYMBOL(xa_load);
+
+/**
+ * __xa_set_tag() - Set this tag on this entry.
+ * @xa: XArray.
+ * @index: Index of entry.
+ * @tag: Tag number.
+ *
+ * Attempting to set a tag on a NULL entry does not succeed.
+ * This function expects the xa_lock to be held on entry.
+ *
+ * Return: The entry at this index.
+ */
+void *__xa_set_tag(struct xarray *xa, unsigned long index, xa_tag_t tag)
+{
+ XA_STATE(xas, index);
+ void *entry = xas_load(xa, &xas);
+
+ if (entry)
+ xas_set_tag(xa, &xas, tag);
+
+ return entry;
+}
+EXPORT_SYMBOL_GPL(__xa_set_tag);
+
+/**
+ * __xa_clear_tag() - Clear this tag on this entry.
+ * @xa: XArray.
+ * @index: Index of entry.
+ * @tag: Tag number.
+ *
+ * This function expects the xa_lock to be held on entry.
+ *
+ * Return: The entry at this index.
+ */
+void *__xa_clear_tag(struct xarray *xa, unsigned long index, xa_tag_t tag)
+{
+ XA_STATE(xas, index);
+ void *entry = xas_load(xa, &xas);
+
+ if (entry)
+ xas_clear_tag(xa, &xas, tag);
+
+ return entry;
+}
+EXPORT_SYMBOL_GPL(__xa_clear_tag);
+
+/**
+ * xa_get_tag() - Inquire whether this tag is set on this entry.
+ * @xa: XArray.
+ * @index: Index of entry.
+ * @tag: Tag number.
+ *
+ * This function uses the RCU read lock, so the result may be out of date
+ * by the time it returns. If you need the result to be stable, use a lock.
+ *
+ * Return: True if the entry at @index has this tag set, false if it doesn't.
+ */
+bool xa_get_tag(struct xarray *xa, unsigned long index, xa_tag_t tag)
+{
+ XA_STATE(xas, index);
+ void *entry;
+
+ rcu_read_lock();
+ entry = xas_start(xa, &xas);
+ while (xas_get_tag(xa, &xas, tag)) {
+ if (!xa_is_node(entry))
+ goto found;
+ entry = xas_descend(xa, &xas, xa_to_node(entry));
+ }
+ rcu_read_unlock();
+ return false;
+ found:
+ rcu_read_unlock();
+ return true;
+}
+EXPORT_SYMBOL(xa_get_tag);
+
+/**
+ * xa_set_tag() - Set this tag on this entry.
+ * @xa: XArray.
+ * @index: Index of entry.
+ * @tag: Tag number.
+ *
+ * Attempting to set a tag on a NULL entry does not succeed.
+ *
+ * Return: The entry at this index.
+ */
+void *xa_set_tag(struct xarray *xa, unsigned long index, xa_tag_t tag)
+{
+ unsigned long flags;
+ void *entry;
+
+ xa_lock_irqsave(xa, flags);
+ entry = __xa_set_tag(xa, index, tag);
+ xa_unlock_irqrestore(xa, flags);
+
+ return entry;
+}
+EXPORT_SYMBOL(xa_set_tag);
+
+/**
+ * xa_clear_tag() - Clear this tag on this entry.
+ * @xa: XArray.
+ * @index: Index of entry.
+ * @tag: Tag number.
+ *
+ * Return: The entry at this index.
+ */
+void *xa_clear_tag(struct xarray *xa, unsigned long index, xa_tag_t tag)
+{
+ unsigned long flags;
+ void *entry;
+
+ xa_lock_irqsave(xa, flags);
+ entry = __xa_clear_tag(xa, index, tag);
+ xa_unlock_irqrestore(xa, flags);
+
+ return entry;
+}
+EXPORT_SYMBOL(xa_clear_tag);
--
2.15.0
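[Editor's note: a minimal usage sketch of the tag API added in the patch
above, not part of the patch itself. It assumes XA_TAG_0 is one of the
xa_tag_t values defined by an earlier patch in the series, and that the
xa_lock()/xa_unlock() helpers from the xa_lock patch are available; the
normal-API calls take the lock internally, while the __xa_* variants
expect the caller to already hold it.]

	/* Sketch only: tag the entry at @index and test the tag afterwards. */
	static bool tag_and_check(struct xarray *xa, unsigned long index)
	{
		void *entry;

		entry = xa_set_tag(xa, index, XA_TAG_0);	/* takes xa_lock */
		if (!entry)
			return false;	/* no entry at @index, nothing was tagged */

		/* RCU-only read; the result may already be stale. */
		return xa_get_tag(xa, index, XA_TAG_0);
	}

	/* With the xa_lock already held, the __ variants avoid retaking it. */
	static void tag_locked(struct xarray *xa, unsigned long index)
	{
		xa_lock(xa);
		__xa_set_tag(xa, index, XA_TAG_0);
		__xa_clear_tag(xa, index, XA_TAG_0);
		xa_unlock(xa);
	}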
From: Matthew Wilcox <[email protected]>
The test was checking for the wrong errno: on memory allocation failure,
ida_get_new_above() returns -EAGAIN, not -ENOMEM. Double the number of
threads to increase the chance that we actually exercise this path
during the test suite (it was hit only sporadically before).
Signed-off-by: Matthew Wilcox <[email protected]>
---
tools/testing/radix-tree/idr-test.c | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)
diff --git a/tools/testing/radix-tree/idr-test.c b/tools/testing/radix-tree/idr-test.c
index 30cd0b296f1a..193450b29bf0 100644
--- a/tools/testing/radix-tree/idr-test.c
+++ b/tools/testing/radix-tree/idr-test.c
@@ -380,7 +380,7 @@ void ida_check_random(void)
do {
ida_pre_get(&ida, GFP_KERNEL);
err = ida_get_new_above(&ida, bit, &id);
- } while (err == -ENOMEM);
+ } while (err == -EAGAIN);
assert(!err);
assert(id == bit);
}
@@ -489,7 +489,7 @@ static void *ida_random_fn(void *arg)
void ida_thread_tests(void)
{
- pthread_t threads[10];
+ pthread_t threads[20];
int i;
for (i = 0; i < ARRAY_SIZE(threads); i++)
--
2.15.0
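[Editor's note: for context, the allocation loop this test exercises looks
roughly like the sketch below; the classic preload-and-retry pattern around
ida_get_new_above(). The helper name is made up for illustration.]

	static int alloc_above(struct ida *ida, int start, int *id)
	{
		int err;

		do {
			/* ida_pre_get() returns 0 if it could not preallocate. */
			if (!ida_pre_get(ida, GFP_KERNEL))
				return -ENOMEM;
			err = ida_get_new_above(ida, start, id);
		} while (err == -EAGAIN);	/* preallocation consumed; retry */

		return err;
	}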
From: Matthew Wilcox <[email protected]>
Use xas_find_any() to scan the entries instead of restarting the lookup
from the top of the tree for every index.
Signed-off-by: Matthew Wilcox <[email protected]>
---
mm/filemap.c | 15 +++++----------
1 file changed, 5 insertions(+), 10 deletions(-)
diff --git a/mm/filemap.c b/mm/filemap.c
index 1c03b0ea105e..accc350f9544 100644
--- a/mm/filemap.c
+++ b/mm/filemap.c
@@ -1349,20 +1349,15 @@ int __lock_page_or_retry(struct page *page, struct mm_struct *mm,
pgoff_t page_cache_next_hole(struct address_space *mapping,
pgoff_t index, unsigned long max_scan)
{
- unsigned long i;
-
- for (i = 0; i < max_scan; i++) {
- struct page *page;
+ XA_STATE(xas, index);
- page = radix_tree_lookup(&mapping->pages, index);
- if (!page || xa_is_value(page))
- break;
- index++;
- if (index == 0)
+ while (max_scan--) {
+ void *entry = xas_find_any(&mapping->pages, &xas);
+ if (!entry || xa_is_value(entry))
break;
}
- return index;
+ return xas.xa_index;
}
EXPORT_SYMBOL(page_cache_next_hole);
--
2.15.0
From: Matthew Wilcox <[email protected]>
This is a direct replacement for struct radix_tree_root. Some of the
struct members have been renamed; convert those uses, and add a #define so
that unconverted radix_tree users continue to work without change.
Signed-off-by: Matthew Wilcox <[email protected]>
---
include/linux/radix-tree.h | 26 +++++-------
include/linux/xarray.h | 29 ++++++++++++++
lib/idr.c | 4 +-
lib/radix-tree.c | 75 +++++++++++++++++------------------
tools/include/linux/spinlock.h | 1 +
tools/testing/radix-tree/Makefile | 2 +-
tools/testing/radix-tree/multiorder.c | 6 +--
tools/testing/radix-tree/test.c | 6 +--
8 files changed, 86 insertions(+), 63 deletions(-)
diff --git a/include/linux/radix-tree.h b/include/linux/radix-tree.h
index ee9aad11d472..015bc1bdc3d2 100644
--- a/include/linux/radix-tree.h
+++ b/include/linux/radix-tree.h
@@ -30,6 +30,9 @@
#include <linux/types.h>
#include <linux/xarray.h>
+/* Keep unconverted code working */
+#define radix_tree_root xarray
+
/*
* The bottom two bits of the slot determine how the remaining bits in the
* slot are interpreted:
@@ -59,10 +62,7 @@ static inline bool radix_tree_is_internal_node(void *ptr)
#define RADIX_TREE_MAX_TAGS 3
-#ifndef RADIX_TREE_MAP_SHIFT
-#define RADIX_TREE_MAP_SHIFT (CONFIG_BASE_SMALL ? 4 : 6)
-#endif
-
+#define RADIX_TREE_MAP_SHIFT XA_CHUNK_SHIFT
#define RADIX_TREE_MAP_SIZE (1UL << RADIX_TREE_MAP_SHIFT)
#define RADIX_TREE_MAP_MASK (RADIX_TREE_MAP_SIZE-1)
@@ -95,20 +95,14 @@ struct radix_tree_node {
unsigned long tags[RADIX_TREE_MAX_TAGS][RADIX_TREE_TAG_LONGS];
};
-/* The top bits of gfp_mask are used to store the root tags and the IDR flag */
+/* The top bits of xa_flags are used to store the root tags and the IDR flag */
#define ROOT_IS_IDR ((__force gfp_t)(1 << __GFP_BITS_SHIFT))
#define ROOT_TAG_SHIFT (__GFP_BITS_SHIFT + 1)
-struct radix_tree_root {
- spinlock_t xa_lock;
- gfp_t gfp_mask;
- struct radix_tree_node __rcu *rnode;
-};
-
#define RADIX_TREE_INIT(name, mask) { \
.xa_lock = __SPIN_LOCK_UNLOCKED(name.xa_lock), \
- .gfp_mask = (mask), \
- .rnode = NULL, \
+ .xa_flags = (mask), \
+ .xa_head = NULL, \
}
#define RADIX_TREE(name, mask) \
@@ -117,13 +111,13 @@ struct radix_tree_root {
#define INIT_RADIX_TREE(root, mask) \
do { \
(root)->xa_lock = __SPIN_LOCK_UNLOCKED(root.xa_lock); \
- (root)->gfp_mask = (mask); \
- (root)->rnode = NULL; \
+ (root)->xa_flags = (mask); \
+ (root)->xa_head = NULL; \
} while (0)
static inline bool radix_tree_empty(const struct radix_tree_root *root)
{
- return root->rnode == NULL;
+ return root->xa_head == NULL;
}
/**
diff --git a/include/linux/xarray.h b/include/linux/xarray.h
index b9e0350b9e90..03d430ec3bce 100644
--- a/include/linux/xarray.h
+++ b/include/linux/xarray.h
@@ -45,9 +45,38 @@
* There are two levels of API provided. Normal and Advanced.
* The advanced API is more flexible but has fewer safeguards.
*/
+
+#include <linux/compiler.h>
#include <linux/spinlock.h>
#include <linux/types.h>
+/**
+ * struct xarray - The anchor of the XArray
+ * @xa_lock: Lock protecting writes to the array.
+ * @xa_flags: Internal XArray flags.
+ * @xa_head: The first pointer in the array.
+ *
+ * If all of the pointers in the array are NULL, @xa_head is a NULL pointer.
+ * If the only non-NULL pointer in the array is at index 0, @xa_head is that
+ * pointer. If any other pointer in the array is non-NULL, @xa_head points
+ * to an @xa_node.
+ */
+struct xarray {
+ spinlock_t xa_lock;
+ gfp_t xa_flags;
+ void __rcu *xa_head;
+};
+
+#define __XARRAY_INIT(name, flags) { \
+ .xa_lock = __SPIN_LOCK_UNLOCKED(name.xa_lock), \
+ .xa_flags = flags, \
+ .xa_head = NULL, \
+}
+
+#define XARRAY_INIT(name) __XARRAY_INIT(name, 0)
+
+#define DEFINE_XARRAY(name) struct xarray name = XARRAY_INIT(name)
+
#define BITS_PER_XA_VALUE (BITS_PER_LONG - 1)
/**
diff --git a/lib/idr.c b/lib/idr.c
index afcdff365037..50201b5c46e9 100644
--- a/lib/idr.c
+++ b/lib/idr.c
@@ -35,8 +35,8 @@ int idr_alloc_ul(struct idr *idr, void *ptr, unsigned long *nextid,
if (WARN_ON_ONCE(radix_tree_is_internal_node(ptr)))
return -EINVAL;
- if (WARN_ON_ONCE(!(idr->idr_rt.gfp_mask & ROOT_IS_IDR)))
- idr->idr_rt.gfp_mask |= IDR_RT_MARKER;
+ if (WARN_ON_ONCE(!(idr->idr_rt.xa_flags & ROOT_IS_IDR)))
+ idr->idr_rt.xa_flags |= IDR_RT_MARKER;
radix_tree_iter_init(&iter, *nextid);
slot = idr_get_free(&idr->idr_rt, &iter, gfp, max);
diff --git a/lib/radix-tree.c b/lib/radix-tree.c
index 4a1091e31932..930eb7d298d7 100644
--- a/lib/radix-tree.c
+++ b/lib/radix-tree.c
@@ -123,7 +123,7 @@ static unsigned int radix_tree_descend(const struct radix_tree_node *parent,
static inline gfp_t root_gfp_mask(const struct radix_tree_root *root)
{
- return root->gfp_mask & __GFP_BITS_MASK;
+ return root->xa_flags & __GFP_BITS_MASK;
}
static inline void tag_set(struct radix_tree_node *node, unsigned int tag,
@@ -146,32 +146,32 @@ static inline int tag_get(const struct radix_tree_node *node, unsigned int tag,
static inline void root_tag_set(struct radix_tree_root *root, unsigned tag)
{
- root->gfp_mask |= (__force gfp_t)(1 << (tag + ROOT_TAG_SHIFT));
+ root->xa_flags |= (__force gfp_t)(1 << (tag + ROOT_TAG_SHIFT));
}
static inline void root_tag_clear(struct radix_tree_root *root, unsigned tag)
{
- root->gfp_mask &= (__force gfp_t)~(1 << (tag + ROOT_TAG_SHIFT));
+ root->xa_flags &= (__force gfp_t)~(1 << (tag + ROOT_TAG_SHIFT));
}
static inline void root_tag_clear_all(struct radix_tree_root *root)
{
- root->gfp_mask &= (__force gfp_t)((1 << ROOT_TAG_SHIFT) - 1);
+ root->xa_flags &= (__force gfp_t)((1 << ROOT_TAG_SHIFT) - 1);
}
static inline int root_tag_get(const struct radix_tree_root *root, unsigned tag)
{
- return (__force int)root->gfp_mask & (1 << (tag + ROOT_TAG_SHIFT));
+ return (__force int)root->xa_flags & (1 << (tag + ROOT_TAG_SHIFT));
}
static inline unsigned root_tags_get(const struct radix_tree_root *root)
{
- return (__force unsigned)root->gfp_mask >> ROOT_TAG_SHIFT;
+ return (__force unsigned)root->xa_flags >> ROOT_TAG_SHIFT;
}
static inline bool is_idr(const struct radix_tree_root *root)
{
- return !!(root->gfp_mask & ROOT_IS_IDR);
+ return !!(root->xa_flags & ROOT_IS_IDR);
}
/*
@@ -290,12 +290,12 @@ static void dump_node(struct radix_tree_node *node, unsigned long index)
/* For debug */
static void radix_tree_dump(struct radix_tree_root *root)
{
- pr_debug("radix root: %p rnode %p tags %x\n",
- root, root->rnode,
- root->gfp_mask >> ROOT_TAG_SHIFT);
- if (!radix_tree_is_internal_node(root->rnode))
+ pr_debug("radix root: %p xa_head %p tags %x\n",
+ root, root->xa_head,
+ root->xa_flags >> ROOT_TAG_SHIFT);
+ if (!radix_tree_is_internal_node(root->xa_head))
return;
- dump_node(entry_to_node(root->rnode), 0);
+ dump_node(entry_to_node(root->xa_head), 0);
}
static void dump_ida_node(void *entry, unsigned long index)
@@ -339,9 +339,9 @@ static void dump_ida_node(void *entry, unsigned long index)
static void ida_dump(struct ida *ida)
{
struct radix_tree_root *root = &ida->ida_rt;
- pr_debug("ida: %p node %p free %d\n", ida, root->rnode,
- root->gfp_mask >> ROOT_TAG_SHIFT);
- dump_ida_node(root->rnode, 0);
+ pr_debug("ida: %p node %p free %d\n", ida, root->xa_head,
+ root->xa_flags >> ROOT_TAG_SHIFT);
+ dump_ida_node(root->xa_head, 0);
}
#endif
@@ -575,7 +575,7 @@ int radix_tree_maybe_preload_order(gfp_t gfp_mask, int order)
static unsigned radix_tree_load_root(const struct radix_tree_root *root,
struct radix_tree_node **nodep, unsigned long *maxindex)
{
- struct radix_tree_node *node = rcu_dereference_raw(root->rnode);
+ struct radix_tree_node *node = rcu_dereference_raw(root->xa_head);
*nodep = node;
@@ -604,7 +604,7 @@ static int radix_tree_extend(struct radix_tree_root *root, gfp_t gfp,
while (index > shift_maxindex(maxshift))
maxshift += RADIX_TREE_MAP_SHIFT;
- entry = rcu_dereference_raw(root->rnode);
+ entry = rcu_dereference_raw(root->xa_head);
if (!entry && (!is_idr(root) || root_tag_get(root, IDR_FREE)))
goto out;
@@ -632,7 +632,7 @@ static int radix_tree_extend(struct radix_tree_root *root, gfp_t gfp,
if (radix_tree_is_internal_node(entry)) {
entry_to_node(entry)->parent = node;
} else if (xa_is_value(entry)) {
- /* Moving an exceptional root->rnode to a node */
+ /* Moving an exceptional root->xa_head to a node */
node->exceptional = 1;
}
/*
@@ -641,7 +641,7 @@ static int radix_tree_extend(struct radix_tree_root *root, gfp_t gfp,
*/
node->slots[0] = (void __rcu *)entry;
entry = node_to_entry(node);
- rcu_assign_pointer(root->rnode, entry);
+ rcu_assign_pointer(root->xa_head, entry);
shift += RADIX_TREE_MAP_SHIFT;
} while (shift <= maxshift);
out:
@@ -658,7 +658,7 @@ static inline bool radix_tree_shrink(struct radix_tree_root *root,
bool shrunk = false;
for (;;) {
- struct radix_tree_node *node = rcu_dereference_raw(root->rnode);
+ struct radix_tree_node *node = rcu_dereference_raw(root->xa_head);
struct radix_tree_node *child;
if (!radix_tree_is_internal_node(node))
@@ -686,9 +686,9 @@ static inline bool radix_tree_shrink(struct radix_tree_root *root,
* moving the node from one part of the tree to another: if it
* was safe to dereference the old pointer to it
* (node->slots[0]), it will be safe to dereference the new
- * one (root->rnode) as far as dependent read barriers go.
+ * one (root->xa_head) as far as dependent read barriers go.
*/
- root->rnode = (void __rcu *)child;
+ root->xa_head = (void __rcu *)child;
if (is_idr(root) && !tag_get(node, IDR_FREE, 0))
root_tag_clear(root, IDR_FREE);
@@ -736,9 +736,8 @@ static bool delete_node(struct radix_tree_root *root,
if (node->count) {
if (node_to_entry(node) ==
- rcu_dereference_raw(root->rnode))
- deleted |= radix_tree_shrink(root,
- update_node);
+ rcu_dereference_raw(root->xa_head))
+ deleted |= radix_tree_shrink(root, update_node);
return deleted;
}
@@ -753,7 +752,7 @@ static bool delete_node(struct radix_tree_root *root,
*/
if (!is_idr(root))
root_tag_clear_all(root);
- root->rnode = NULL;
+ root->xa_head = NULL;
}
WARN_ON_ONCE(!list_empty(&node->private_list));
@@ -778,7 +777,7 @@ static bool delete_node(struct radix_tree_root *root,
* at position @index in the radix tree @root.
*
* Until there is more than one item in the tree, no nodes are
- * allocated and @root->rnode is used as a direct slot instead of
+ * allocated and @root->xa_head is used as a direct slot instead of
* pointing to a node, in which case *@nodep will be NULL.
*
* Returns -ENOMEM, or 0 for success.
@@ -788,7 +787,7 @@ int __radix_tree_create(struct radix_tree_root *root, unsigned long index,
void __rcu ***slotp)
{
struct radix_tree_node *node = NULL, *child;
- void __rcu **slot = (void __rcu **)&root->rnode;
+ void __rcu **slot = (void __rcu **)&root->xa_head;
unsigned long maxindex;
unsigned int shift, offset = 0;
unsigned long max = index | ((1UL << order) - 1);
@@ -804,7 +803,7 @@ int __radix_tree_create(struct radix_tree_root *root, unsigned long index,
if (error < 0)
return error;
shift = error;
- child = rcu_dereference_raw(root->rnode);
+ child = rcu_dereference_raw(root->xa_head);
}
while (shift > order) {
@@ -995,7 +994,7 @@ EXPORT_SYMBOL(__radix_tree_insert);
* tree @root.
*
* Until there is more than one item in the tree, no nodes are
- * allocated and @root->rnode is used as a direct slot instead of
+ * allocated and @root->xa_head is used as a direct slot instead of
* pointing to a node, in which case *@nodep will be NULL.
*/
void *__radix_tree_lookup(const struct radix_tree_root *root,
@@ -1008,7 +1007,7 @@ void *__radix_tree_lookup(const struct radix_tree_root *root,
restart:
parent = NULL;
- slot = (void __rcu **)&root->rnode;
+ slot = (void __rcu **)&root->xa_head;
radix_tree_load_root(root, &node, &maxindex);
if (index > maxindex)
return NULL;
@@ -1160,9 +1159,9 @@ void __radix_tree_replace(struct radix_tree_root *root,
/*
* This function supports replacing exceptional entries and
* deleting entries, but that needs accounting against the
- * node unless the slot is root->rnode.
+ * node unless the slot is root->xa_head.
*/
- WARN_ON_ONCE(!node && (slot != (void __rcu **)&root->rnode) &&
+ WARN_ON_ONCE(!node && (slot != (void __rcu **)&root->xa_head) &&
(count || exceptional));
replace_slot(slot, item, node, count, exceptional);
@@ -1714,7 +1713,7 @@ void __rcu **radix_tree_next_chunk(const struct radix_tree_root *root,
iter->tags = 1;
iter->node = NULL;
__set_iter_shift(iter, 0);
- return (void __rcu **)&root->rnode;
+ return (void __rcu **)&root->xa_head;
}
do {
@@ -2108,7 +2107,7 @@ void __rcu **idr_get_free(struct radix_tree_root *root,
unsigned long max)
{
struct radix_tree_node *node = NULL, *child;
- void __rcu **slot = (void __rcu **)&root->rnode;
+ void __rcu **slot = (void __rcu **)&root->xa_head;
unsigned long maxindex, start = iter->next_index;
unsigned int shift, offset = 0;
@@ -2124,7 +2123,7 @@ void __rcu **idr_get_free(struct radix_tree_root *root,
if (error < 0)
return ERR_PTR(error);
shift = error;
- child = rcu_dereference_raw(root->rnode);
+ child = rcu_dereference_raw(root->xa_head);
}
while (shift) {
@@ -2187,10 +2186,10 @@ void __rcu **idr_get_free(struct radix_tree_root *root,
*/
void idr_destroy(struct idr *idr)
{
- struct radix_tree_node *node = rcu_dereference_raw(idr->idr_rt.rnode);
+ struct radix_tree_node *node = rcu_dereference_raw(idr->idr_rt.xa_head);
if (radix_tree_is_internal_node(node))
radix_tree_free_nodes(node);
- idr->idr_rt.rnode = NULL;
+ idr->idr_rt.xa_head = NULL;
root_tag_set(&idr->idr_rt, IDR_FREE);
}
EXPORT_SYMBOL(idr_destroy);
diff --git a/tools/include/linux/spinlock.h b/tools/include/linux/spinlock.h
index b21b586b9854..34fed5c38da2 100644
--- a/tools/include/linux/spinlock.h
+++ b/tools/include/linux/spinlock.h
@@ -8,6 +8,7 @@
#define spinlock_t pthread_mutex_t
#define DEFINE_SPINLOCK(x) pthread_mutex_t x = PTHREAD_MUTEX_INITIALIZER;
#define __SPIN_LOCK_UNLOCKED(x) (pthread_mutex_t)PTHREAD_MUTEX_INITIALIZER
+#define spin_lock_init(x) pthread_mutex_init(x, NULL);
#define spin_lock_irqsave(x, f) (void)f, pthread_mutex_lock(x)
#define spin_unlock_irqrestore(x, f) (void)f, pthread_mutex_unlock(x)
diff --git a/tools/testing/radix-tree/Makefile b/tools/testing/radix-tree/Makefile
index fa7ee369b3c9..ebb12224e258 100644
--- a/tools/testing/radix-tree/Makefile
+++ b/tools/testing/radix-tree/Makefile
@@ -46,6 +46,6 @@ idr.c: ../../../lib/idr.c
mapshift:
@if ! grep -qws $(SHIFT) generated/map-shift.h; then \
- echo "#define RADIX_TREE_MAP_SHIFT $(SHIFT)" > \
+ echo "#define XA_CHUNK_SHIFT $(SHIFT)" > \
generated/map-shift.h; \
fi
diff --git a/tools/testing/radix-tree/multiorder.c b/tools/testing/radix-tree/multiorder.c
index 684e76f79f4a..24293a2fd82d 100644
--- a/tools/testing/radix-tree/multiorder.c
+++ b/tools/testing/radix-tree/multiorder.c
@@ -191,13 +191,13 @@ static void multiorder_shrink(unsigned long index, int order)
assert(item_insert_order(&tree, 0, order) == 0);
- node = tree.rnode;
+ node = tree.xa_head;
assert(item_insert(&tree, index) == 0);
- assert(node != tree.rnode);
+ assert(node != tree.xa_head);
assert(item_delete(&tree, index) != 0);
- assert(node == tree.rnode);
+ assert(node == tree.xa_head);
for (i = 0; i < max; i++) {
struct item *item = item_lookup(&tree, i);
diff --git a/tools/testing/radix-tree/test.c b/tools/testing/radix-tree/test.c
index 0d69c49177c6..6e1cc2040817 100644
--- a/tools/testing/radix-tree/test.c
+++ b/tools/testing/radix-tree/test.c
@@ -262,7 +262,7 @@ static int verify_node(struct radix_tree_node *slot, unsigned int tag,
void verify_tag_consistency(struct radix_tree_root *root, unsigned int tag)
{
- struct radix_tree_node *node = root->rnode;
+ struct radix_tree_node *node = root->xa_head;
if (!radix_tree_is_internal_node(node))
return;
verify_node(node, tag, !!root_tag_get(root, tag));
@@ -292,13 +292,13 @@ void item_kill_tree(struct radix_tree_root *root)
}
}
assert(radix_tree_gang_lookup(root, (void **)items, 0, 32) == 0);
- assert(root->rnode == NULL);
+ assert(root->xa_head == NULL);
}
void tree_verify_min_height(struct radix_tree_root *root, int maxindex)
{
unsigned shift;
- struct radix_tree_node *node = root->rnode;
+ struct radix_tree_node *node = root->xa_head;
if (!radix_tree_is_internal_node(node)) {
assert(maxindex == 0);
return;
--
2.15.0
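[Editor's note: as a usage sketch (not part of the patch), both the new and
the old definition macros shown above end up initialising the same three
fields, so old-style and new-style definitions can coexist during the
conversion.]

	/* New-style static definition using the macro added above. */
	static DEFINE_XARRAY(new_style);

	/*
	 * Unconverted code keeps working: radix_tree_root is now #defined to
	 * xarray, and RADIX_TREE_INIT() fills in xa_flags/xa_head instead of
	 * the old gfp_mask/rnode members.
	 */
	static RADIX_TREE(old_style, GFP_KERNEL);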
From: Matthew Wilcox <[email protected]>
The IDR now handles its own locking, so once the locking in genhd is
removed, the memory preloading can go too. genhd still needs to protect
the object retrieved from the IDR against removal until its refcount has
been elevated, so hold the IDR's internal lock across the lookup.
Signed-off-by: Matthew Wilcox <[email protected]>
---
block/genhd.c | 23 +++++------------------
1 file changed, 5 insertions(+), 18 deletions(-)
diff --git a/block/genhd.c b/block/genhd.c
index c2223f12a805..d50bd99e8ce2 100644
--- a/block/genhd.c
+++ b/block/genhd.c
@@ -30,10 +30,7 @@ struct kobject *block_depr;
/* for extended dynamic devt allocation, currently only one major is used */
#define NR_EXT_DEVT (1 << MINORBITS)
-/* For extended devt allocation. ext_devt_lock prevents look up
- * results from going away underneath its user.
- */
-static DEFINE_SPINLOCK(ext_devt_lock);
+/* For extended devt allocation. */
static DEFINE_IDR(ext_devt_idr);
static const struct device_type disk_type;
@@ -467,14 +464,7 @@ int blk_alloc_devt(struct hd_struct *part, dev_t *devt)
return 0;
}
- /* allocate ext devt */
- idr_preload(GFP_KERNEL);
-
- spin_lock_bh(&ext_devt_lock);
- idx = idr_alloc(&ext_devt_idr, part, 0, NR_EXT_DEVT, GFP_NOWAIT);
- spin_unlock_bh(&ext_devt_lock);
-
- idr_preload_end();
+ idx = idr_alloc(&ext_devt_idr, part, 0, NR_EXT_DEVT, GFP_KERNEL);
if (idx < 0)
return idx == -ENOSPC ? -EBUSY : idx;
@@ -496,11 +486,8 @@ void blk_free_devt(dev_t devt)
if (devt == MKDEV(0, 0))
return;
- if (MAJOR(devt) == BLOCK_EXT_MAJOR) {
- spin_lock_bh(&ext_devt_lock);
+ if (MAJOR(devt) == BLOCK_EXT_MAJOR)
idr_remove(&ext_devt_idr, blk_mangle_minor(MINOR(devt)));
- spin_unlock_bh(&ext_devt_lock);
- }
}
static char *bdevt_str(dev_t devt, char *buf)
@@ -789,13 +776,13 @@ struct gendisk *get_gendisk(dev_t devt, int *partno)
} else {
struct hd_struct *part;
- spin_lock_bh(&ext_devt_lock);
+ idr_lock_bh(&ext_devt_idr);
part = idr_find(&ext_devt_idr, blk_mangle_minor(MINOR(devt)));
if (part && get_disk(part_to_disk(part))) {
*partno = part->partno;
disk = part_to_disk(part);
}
- spin_unlock_bh(&ext_devt_lock);
+ idr_unlock_bh(&ext_devt_idr);
}
if (disk && unlikely(disk->flags & GENHD_FL_HIDDEN)) {
--
2.15.0
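[Editor's note: the lookup pattern used above generalises to any IDR whose
objects are refcounted. A sketch, assuming the idr_lock_bh()/idr_unlock_bh()
helpers added earlier in this series and a hypothetical refcounted
struct foo.]

	static struct foo *foo_lookup(struct idr *idr, int id)
	{
		struct foo *foo;

		idr_lock_bh(idr);
		foo = idr_find(idr, id);
		/* The IDR lock keeps @foo alive until we hold a reference. */
		if (foo && !refcount_inc_not_zero(&foo->refs))
			foo = NULL;
		idr_unlock_bh(idr);

		return foo;
	}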