Date: Tue, 22 Aug 2017 12:37:14 -0700
From: Andi Kleen
To: Linus Torvalds
Cc: Peter Zijlstra, "Liang, Kan", Mel Gorman, Mel Gorman,
	"Kirill A. Shutemov", Tim Chen, Ingo Molnar, Andrew Morton,
	Johannes Weiner, Jan Kara, linux-mm, Linux Kernel Mailing List
Subject: Re: [PATCH 1/2] sched/wait: Break up long wake list walk
Message-ID: <20170822193714.GZ28715@tassilo.jf.intel.com>
References: <37D7C6CF3E00A74B8858931C1DB2F07753787AE4@SHSMSX103.ccr.corp.intel.com>
	<20170818185455.qol3st2nynfa47yc@techsingularity.net>
	<20170821183234.kzennaaw2zt2rbwz@techsingularity.net>
	<37D7C6CF3E00A74B8858931C1DB2F07753788B58@SHSMSX103.ccr.corp.intel.com>
	<37D7C6CF3E00A74B8858931C1DB2F0775378A24A@SHSMSX103.ccr.corp.intel.com>
	<20170822190828.GO32112@worktop.programming.kicks-ass.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.8.3 (2017-05-23)
Sender: linux-kernel-owner@vger.kernel.org
X-Mailing-List: linux-kernel@vger.kernel.org

> > Still, generating such a migration storm would be fairly tricky I think.
>
> Well, Mel seems to have been unable to generate a load that reproduces
> the long page waitqueues. And I don't think we've had any other
> reports of this either.

It could be that it requires a fairly large system. On large systems
under load a lot of things take much longer, so what's a tiny window on
Mel's system may suddenly be very large, and with many more threads
there is a higher chance of bad interactions anyway.

We only see it on 4S+ (four-socket and larger) systems today. But
systems are always getting larger, so what's a large system today will
be a normal medium-scale system tomorrow.

BTW we also collected PT traces for the long hang cases, but it was
hard to find a consistent pattern in them.

-Andi