Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933526AbbHLKsj (ORCPT ); Wed, 12 Aug 2015 06:48:39 -0400 Received: from outbound-smtp01.blacknight.com ([81.17.249.7]:55366 "EHLO outbound-smtp01.blacknight.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S933202AbbHLKpi (ORCPT ); Wed, 12 Aug 2015 06:45:38 -0400 From: Mel Gorman To: Linux-MM Cc: Johannes Weiner , Rik van Riel , Vlastimil Babka , David Rientjes , Joonsoo Kim , Michal Hocko , LKML , Mel Gorman Subject: [PATCH 00/10] Remove zonelist cache and high-order watermark checking v2 Date: Wed, 12 Aug 2015 11:45:25 +0100 Message-Id: <1439376335-17895-1-git-send-email-mgorman@techsingularity.net> X-Mailer: git-send-email 2.4.6 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 7868 Lines: 136 Changelog since V1 o Rebase to 4.2-rc5 o Distinguish between high priority callers and callers that avoid sleep o Remove jump label related damage patches Overall, the intent of this series is to remove the zonelist cache which was introduced to avoid high overhead in the page allocator. Once this is done, it is necessary to reduce the cost of watermark checks. The zonelist cache has been around for a long time but it is of dubious merit with a lot of complexity. Some issues are explained in the first patch but the most important is that a failed THP allocation can cause a zone to be treated as "full". This potentially causes unnecessary stalls, reclaim activity or remote fallbacks. The issues could be fixed but it's not worth it. The series places a small number of other micro-optimisations on top before examining GFP flags watermarks. GFP flags specify the requirements of the caller. __GFP_WAIT historically identified callers that could not sleep and could access reserves. This was later abused to identify callers that simply prefer to avoid sleeping and have other options. A patch is added to distinguish between atomic callers, high-priority callers and those that simply wish to avoid sleep. High-order watermarks enforcement can cause high-order allocations to fail even though pages are free. The watermark checks both protect high-order atomic allocations and make kswapd aware of high-order pages but there is a much better way that can be handled using migrate types. This series uses page grouping by mobility to reserve pageblocks for high-order allocations with the size of the reservation depending on demand. kswapd awareness is maintained by examining the free lists. By patch 10 in this series, there are no high-order watermark checks while preserving the properties that motivated the introduction of the watermark checks. Documentation/vm/balance | 14 +- arch/arm/mm/dma-mapping.c | 4 +- arch/arm64/mm/dma-mapping.c | 4 +- arch/x86/kernel/pci-dma.c | 2 +- block/bio.c | 26 +- block/blk-core.c | 16 +- block/blk-ioc.c | 2 +- block/blk-mq-tag.c | 2 +- block/blk-mq.c | 8 +- block/cfq-iosched.c | 4 +- block/scsi_ioctl.c | 6 +- drivers/block/drbd/drbd_bitmap.c | 2 +- drivers/block/drbd/drbd_receiver.c | 2 +- drivers/block/mtip32xx/mtip32xx.c | 2 +- drivers/block/nvme-core.c | 4 +- drivers/block/osdblk.c | 2 +- drivers/block/paride/pd.c | 2 +- drivers/block/pktcdvd.c | 4 +- drivers/connector/connector.c | 3 +- drivers/firewire/core-cdev.c | 2 +- drivers/gpu/drm/i915/i915_gem.c | 4 +- drivers/ide/ide-atapi.c | 2 +- drivers/ide/ide-cd.c | 2 +- drivers/ide/ide-cd_ioctl.c | 2 +- drivers/ide/ide-devsets.c | 2 +- drivers/ide/ide-disk.c | 2 +- drivers/ide/ide-ioctls.c | 4 +- drivers/ide/ide-park.c | 2 +- drivers/ide/ide-pm.c | 4 +- drivers/ide/ide-tape.c | 4 +- drivers/ide/ide-taskfile.c | 4 +- drivers/infiniband/core/sa_query.c | 2 +- drivers/infiniband/hw/ipath/ipath_file_ops.c | 2 +- drivers/infiniband/hw/qib/qib_init.c | 2 +- drivers/iommu/amd_iommu.c | 2 +- drivers/iommu/intel-iommu.c | 2 +- drivers/md/dm-crypt.c | 6 +- drivers/misc/vmw_balloon.c | 2 +- drivers/mtd/mtdcore.c | 3 +- drivers/net/ethernet/broadcom/bnx2x/bnx2x_cmn.c | 2 +- drivers/scsi/scsi_error.c | 2 +- drivers/scsi/scsi_lib.c | 4 +- drivers/staging/android/ion/ion_system_heap.c | 2 +- .../lustre/include/linux/libcfs/libcfs_private.h | 2 +- drivers/usb/host/u132-hcd.c | 2 +- fs/btrfs/disk-io.c | 2 +- fs/btrfs/extent_io.c | 14 +- fs/btrfs/volumes.c | 4 +- fs/cachefiles/internal.h | 2 +- fs/direct-io.c | 2 +- fs/ext3/super.c | 2 +- fs/ext4/super.c | 2 +- fs/fscache/cookie.c | 2 +- fs/fscache/page.c | 6 +- fs/jbd/transaction.c | 4 +- fs/jbd2/transaction.c | 4 +- fs/nfs/file.c | 6 +- fs/nilfs2/mdt.h | 2 +- fs/xfs/xfs_qm.c | 2 +- include/linux/cpuset.h | 6 + include/linux/gfp.h | 68 ++- include/linux/mmzone.h | 85 +-- include/linux/skbuff.h | 6 +- include/net/sock.h | 2 +- include/trace/events/gfpflags.h | 5 +- kernel/audit.c | 6 +- kernel/locking/lockdep.c | 2 +- kernel/power/swap.c | 14 +- kernel/smp.c | 2 +- lib/idr.c | 4 +- lib/percpu_ida.c | 2 +- lib/radix-tree.c | 10 +- mm/backing-dev.c | 2 +- mm/dmapool.c | 2 +- mm/failslab.c | 8 +- mm/filemap.c | 2 +- mm/huge_memory.c | 4 +- mm/internal.h | 1 + mm/memcontrol.c | 8 +- mm/mempool.c | 10 +- mm/migrate.c | 2 +- mm/page_alloc.c | 569 +++++++-------------- mm/slab.c | 18 +- mm/slub.c | 6 +- mm/vmalloc.c | 2 +- mm/vmscan.c | 6 +- mm/vmstat.c | 2 +- net/core/skbuff.c | 8 +- net/core/sock.c | 6 +- net/netlink/af_netlink.c | 2 +- net/rxrpc/ar-connection.c | 2 +- net/sctp/associola.c | 2 +- security/integrity/ima/ima_crypto.c | 2 +- 93 files changed, 424 insertions(+), 684 deletions(-) -- 2.4.6 -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/