Subject: Re: [PATCH] vmscan: limit concurrent reclaimers in shrink_zone
From: Larry Woodman
To: Christoph Hellwig
Cc: Rik van Riel, kosaki.motohiro@jp.fujitsu.com, akpm@linux-foundation.org,
    linux-kernel@vger.kernel.org, linux-mm@kvack.org, aarcange@redhat.com
Date: Mon, 14 Dec 2009 09:22:16 -0500
Message-Id: <1260800536.6666.2.camel@dhcp-100-19-198.bos.redhat.com>
In-Reply-To: <20091214131444.GA8990@infradead.org>
References: <20091210185626.26f9828a@cuia.bos.redhat.com> <20091214131444.GA8990@infradead.org>

On Mon, 2009-12-14 at 08:14 -0500, Christoph Hellwig wrote:
> On Thu, Dec 10, 2009 at 06:56:26PM -0500, Rik van Riel wrote:
> > Under very heavy multi-process workloads, like AIM7, the VM can
> > get into trouble in a variety of ways.  The trouble starts when
> > there are hundreds, or even thousands of processes active in the
> > page reclaim code.
> >
> > Not only can the system suffer enormous slowdowns because of
> > lock contention (and conditional reschedules) between thousands
> > of processes in the page reclaim code, but each process will try
> > to free up to SWAP_CLUSTER_MAX pages, even when the system already
> > has lots of memory free.  In Larry's case, this resulted in over
> > 6000 processes fighting over locks in the page reclaim code, even
> > though the system already had 1.5GB of free memory.
> >
> > It should be possible to avoid both of those issues at once, by
> > simply limiting how many processes are active in the page reclaim
> > code simultaneously.
> >
>
> This sounds like a very good argument against using direct reclaim at
> all.  It reminds me a bit of the issue we had in XFS with lots of
> processes pushing the AIL and causing massive slowdowns due to lock
> contention and cacheline bouncing.  Moving all the AIL pushing into a
> dedicated thread solved that nicely.  In the VM we already have that
> dedicated per-node kswapd thread, so moving as much work as possible
> to it should be equivalent.

Some of the new systems have 16 CPUs per-node.

>
> Of course any of this kind of tuning really requires a lot of testing
> and benchmarking to verify those assumptions.
>
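
For illustration only, the idea quoted above (cap how many tasks are inside
page reclaim at once, and skip reclaim when plenty of memory is already
free) can be modelled in user space roughly as follows.  This is a minimal
sketch using C11 atomics, not the code from the patch: struct zone_model,
MAX_RECLAIMERS, zone_init(), reclaim_try_enter(), reclaim_exit() and
direct_reclaim() are all hypothetical names, and the limit of 8 is an
arbitrary assumption.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical cap on concurrent reclaimers; not a value from the patch. */
#define MAX_RECLAIMERS 8

/* Toy stand-in for a memory zone; the field names are assumptions. */
struct zone_model {
	atomic_int  reclaimers;     /* tasks currently inside reclaim */
	atomic_long free_pages;     /* pages currently free */
	long        high_watermark; /* above this, reclaim is pointless */
};

static void zone_init(struct zone_model *z, long free_pages, long high)
{
	atomic_init(&z->reclaimers, 0);
	atomic_init(&z->free_pages, free_pages);
	z->high_watermark = high;
}

/* Claim a reclaim slot; returns false if the cap is already reached. */
static bool reclaim_try_enter(struct zone_model *z)
{
	/* Optimistically take a slot, then back out if we overshot. */
	if (atomic_fetch_add(&z->reclaimers, 1) >= MAX_RECLAIMERS) {
		atomic_fetch_sub(&z->reclaimers, 1);
		return false;
	}
	return true;
}

/* Give the slot back once this task has finished reclaiming. */
static void reclaim_exit(struct zone_model *z)
{
	int prev = atomic_fetch_sub(&z->reclaimers, 1);

	assert(prev > 0); /* exit without a matching enter is a bug */
	(void)prev;       /* keep NDEBUG builds warning-free */
}

/*
 * Model of one direct reclaim attempt.  Returns the number of pages
 * "freed", or 0 if the attempt was skipped.
 */
static long direct_reclaim(struct zone_model *z, long nr_to_reclaim)
{
	/* Enough memory is free already: do not enter reclaim at all. */
	if (atomic_load(&z->free_pages) >= z->high_watermark)
		return 0;

	/* Too many reclaimers: a real caller would wait and retry here. */
	if (!reclaim_try_enter(z))
		return 0;

	/* Stand-in for the real work of scanning and freeing pages. */
	atomic_fetch_add(&z->free_pages, nr_to_reclaim);

	reclaim_exit(z);
	return nr_to_reclaim;
}
```

The sketch simply returns 0 when the cap is hit; whether a real
implementation should sleep, retry, or leave the work to kswapd is exactly
the kind of choice that would need the testing and benchmarking mentioned
above.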