From: "Liang, Kan"
To: Linus Torvalds, Tim Chen
Cc: Mel Gorman, "Kirill A. Shutemov", Peter Zijlstra, Ingo Molnar, Andi Kleen, Andrew Morton, Johannes Weiner, Jan Kara, linux-mm, Linux Kernel Mailing List
Subject: RE: [PATCH 1/2] sched/wait: Break up long wake list walk
Date: Wed, 23 Aug 2017 20:55:15 +0000
Message-ID: <37D7C6CF3E00A74B8858931C1DB2F0775378EC56@shsmsx102.ccr.corp.intel.com>

> On Wed, Aug 23, 2017 at 8:58 AM, Tim Chen wrote:
> >
> > Will you still consider the original patch as a fail safe mechanism?
>
> I don't think we have much choice, although I would *really* want to get
> this root-caused rather than just papering over the symptoms.
>
> Maybe still worth testing that "sched/numa: Scale scan period with tasks
> in group and shared/private" patch that Mel mentioned.

The patch doesn’t help on our load.

Thanks,
Kan

> In fact, looking at that patch description, it does seem to match this
> particular load a lot. Quoting from the commit message:
>
>     "Running 80 tasks in the same group, or as threads of the same process,
>      results in the memory getting scanned 80x as fast as it would be if a
>      single task was using the memory.
>
>      This really hurts some workloads"
>
> So if 80 threads causes 80x as much scanning, a few thousand threads
> might indeed be really really bad.
>
> So once more unto the breach, dear friends, once more.
>
> Please.
>
> The patch got applied to -tip as commit b5dd77c8bdad, and can be
> downloaded here:
>
>   https://git.kernel.org/pub/scm/linux/kernel/git/tip/tip.git/commit/?id=b5dd77c8bdada7b6262d0cba02a6ed525bf4e6e1
>
> (Hmm. It says it's cc'd to me, but I never noticed that patch simply
> because it was in a big group of other -tip commits.. Oh well).
>
> Linus