Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1759936Ab0GBTww (ORCPT ); Fri, 2 Jul 2010 15:52:52 -0400 Received: from smtp1.linux-foundation.org ([140.211.169.13]:35062 "EHLO smtp1.linux-foundation.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1757401Ab0GBTwt (ORCPT ); Fri, 2 Jul 2010 15:52:49 -0400 Date: Fri, 2 Jul 2010 12:51:55 -0700 From: Andrew Morton To: Mel Gorman Cc: linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org, linux-mm@kvack.org, Dave Chinner , Chris Mason , Nick Piggin , Rik van Riel , Johannes Weiner , Christoph Hellwig , KAMEZAWA Hiroyuki , KOSAKI Motohiro , Andrea Arcangeli Subject: Re: [PATCH 12/14] vmscan: Do not writeback pages in direct reclaim Message-Id: <20100702125155.69c02f85.akpm@linux-foundation.org> In-Reply-To: <1277811288-5195-13-git-send-email-mel@csn.ul.ie> References: <1277811288-5195-1-git-send-email-mel@csn.ul.ie> <1277811288-5195-13-git-send-email-mel@csn.ul.ie> X-Mailer: Sylpheed 2.4.8 (GTK+ 2.12.9; x86_64-pc-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1946 Lines: 37 On Tue, 29 Jun 2010 12:34:46 +0100 Mel Gorman wrote: > When memory is under enough pressure, a process may enter direct > reclaim to free pages in the same manner kswapd does. If a dirty page is > encountered during the scan, this page is written to backing storage using > mapping->writepage. This can result in very deep call stacks, particularly > if the target storage or filesystem are complex. It has already been observed > on XFS that the stack overflows but the problem is not XFS-specific. > > This patch prevents direct reclaim writing back pages by not setting > may_writepage in scan_control. Instead, dirty pages are placed back on the > LRU lists for either background writing by the BDI threads or kswapd. If > in direct lumpy reclaim and dirty pages are encountered, the process will > stall for the background flusher before trying to reclaim the pages again. > > Memory control groups do not have a kswapd-like thread nor do pages get > direct reclaimed from the page allocator. Instead, memory control group > pages are reclaimed when the quota is being exceeded or the group is being > shrunk. As it is not expected that the entry points into page reclaim are > deep call chains memcg is still allowed to writeback dirty pages. I already had "[PATCH 01/14] vmscan: Fix mapping use after free" and I'll send that in for 2.6.35. I grabbed [02/14] up to [11/14]. Including "[PATCH 06/14] vmscan: kill prev_priority completely", grumpyouallsuck. I wimped out at this, "Do not writeback pages in direct reclaim". It really is a profound change and needs a bit more thought, discussion and if possible testing which is designed to explore possible pathologies. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/