From: riel@redhat.com
To: linux-kernel@vger.kernel.org
Cc: peterz@infradead.org, mgorman@suse.de, mingo@kernel.org,
        jhladky@redhat.com, lvenanci@redhat.com
Subject: [PATCH 0/2] numa,sched: improve performance for multi-threaded workloads
Date: Mon, 31 Jul 2017 15:28:45 -0400
Message-Id: <20170731192847.23050-1-riel@redhat.com>
Sender: linux-kernel-owner@vger.kernel.org

The NUMA balancing code spends way too much CPU time scanning and
faulting when running multi-threaded workloads.

This patch set slows down NUMA PTE scanning in two cases: when there
are lots of shared faults, and when a large NUMA group has a large
fraction of shared faults.
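
As a rough illustration of the first case only (this is a sketch, not
the code in the patches), the idea of backing off the scan rate when
shared faults dominate could look like the following. The function
name, the 70% threshold, the period bounds, and the doubling/decay
steps are all hypothetical values picked for the example.

```c
/* Hypothetical tunables for this sketch; not the kernel's values. */
#define SCAN_PERIOD_MIN_MS    1000u
#define SCAN_PERIOD_MAX_MS   60000u
#define SHARED_THRESHOLD_PCT    70u	/* cutoff for "lots of shared faults" */

/* Keep the scan period inside the [min, max] window. */
static unsigned int clamp_period(unsigned long long ms)
{
	if (ms < SCAN_PERIOD_MIN_MS)
		return SCAN_PERIOD_MIN_MS;
	if (ms > SCAN_PERIOD_MAX_MS)
		return SCAN_PERIOD_MAX_MS;
	return (unsigned int)ms;
}

/*
 * Pick the next PTE scan period from the private/shared fault counts
 * seen during the last scan window (assumed inputs for this sketch).
 * A longer period means slower scanning.
 */
unsigned int next_scan_period_ms(unsigned int cur_ms,
				 unsigned long long private_faults,
				 unsigned long long shared_faults)
{
	unsigned long long total = private_faults + shared_faults;
	unsigned long long next = cur_ms;

	/* No faults recorded: nothing to learn from, keep the period. */
	if (total == 0)
		return clamp_period(next);

	if (shared_faults * 100 / total >= SHARED_THRESHOLD_PCT)
		next *= 2;		/* shared faults dominate: back off */
	else
		next -= next / 4;	/* mostly private: scan a bit faster */

	return clamp_period(next);
}
```

With these made-up numbers, a task at a 1000 ms period that saw 10
private and 90 shared faults would move to 2000 ms, while one at
4000 ms with 90 private and 10 shared faults would drop to 3000 ms.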

Some results from Jirka's half-week performance run on a 4-node
system:
- NAS benchmarks: improvements in the range of 10-30%
  (mostly the ft and lu subtests)
- SPECjbb2005, single instance mode: improvements in the range of 5-10%
- SPECjvm2008: performance very similar to before, with some small
  improvements for the scimark* subtests