Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752627AbYKNOhX (ORCPT ); Fri, 14 Nov 2008 09:37:23 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751259AbYKNOhJ (ORCPT ); Fri, 14 Nov 2008 09:37:09 -0500 Received: from mx2.redhat.com ([66.187.237.31]:55200 "EHLO mx2.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751256AbYKNOhI (ORCPT ); Fri, 14 Nov 2008 09:37:08 -0500 Message-ID: <491D8CEC.5050106@redhat.com> Date: Fri, 14 Nov 2008 09:36:28 -0500 From: Rik van Riel Organization: Red Hat, Inc User-Agent: Thunderbird 2.0.0.17 (X11/20080915) MIME-Version: 1.0 To: Andrew Morton CC: linux-kernel@vger.kernel.org, linux-mm@kvack.org Subject: Re: [PATCH -mm] vmscan: bail out of page reclaim after swap_cluster_max pages References: <20081113171208.6985638e@bree.surriel.com> <20081113192729.7d8eb133.akpm@linux-foundation.org> In-Reply-To: <20081113192729.7d8eb133.akpm@linux-foundation.org> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2144 Lines: 50 Andrew Morton wrote: > On Thu, 13 Nov 2008 17:12:08 -0500 Rik van Riel wrote: > >> Sometimes the VM spends the first few priority rounds rotating back >> referenced pages and submitting IO. Once we get to a lower priority, >> sometimes the VM ends up freeing way too many pages. >> >> The fix is relatively simple: in shrink_zone() we can check how many >> pages we have already freed and break out of the loop. >> >> However, in order to do this we do need to know how many pages we already >> freed, so move nr_reclaimed into scan_control. > > There was a reason for not doing this, but I forget what it was. It might require > some changelog archeology. iirc it was to do with balancing scanning rates > between the various things which we scan. I've seen worse symptoms without this code, though. Pretty much all 2.6 kernels show bad behaviour occasionally. Sometimes the VM gets in such a state where multiple processes cannot find anything readily evictable, and they all end up at a lower priority level. This can cause them to evict more than half of everything from memory, before breaking out of the pageout loop and swapping things back in. On my 2GB desktop, I've seen as much as 1200MB memory free due to such a swapout storm. It is possible more is free at the top of the cycle, but X and gnome-terminal and top and everything else is stuck, so that's not actually visible :) I am not convinced that a scanning imbalance is more serious. Of course, one thing we could do is exempt kswapd from this check. During light reclaim, kswapd does most of the eviction so scanning should remain balanced. Having one process fall down to a lower priority level is also not a big problem. As long as the direct reclaim processes do not also fall into the same trap, the situation should be manageable. Does that sound reasonable to you? -- All rights reversed. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/