Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752856Ab2KTTEw (ORCPT ); Tue, 20 Nov 2012 14:04:52 -0500 Received: from zene.cmpxchg.org ([85.214.230.12]:34644 "EHLO zene.cmpxchg.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751856Ab2KTTEv (ORCPT ); Tue, 20 Nov 2012 14:04:51 -0500 Date: Tue, 20 Nov 2012 14:04:41 -0500 From: Johannes Weiner To: Rik van Riel , Mel Gorman Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org Subject: kswapd endless loop for compaction Message-ID: <20121120190440.GA24381@cmpxchg.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1498 Lines: 35 Hi guys, while testing a 3.7-rc5ish kernel, I noticed that kswapd can drop into a busy spin state without doing reclaim. printk-style debugging told me that this happens when the distance between a zone's high watermark and its low watermark is less than two huge pages (DMA zone). 1. The first loop in balance_pgdat() over the zones finds all zones to be above their high watermark and only does goto out (all_zones_ok). 2. pgdat_balanced() at the out: label also just checks the high watermark, so the node is considered balanced and the order is not reduced. 3. In the `if (order)' block after it, compaction_suitable() checks if the zone's low watermark + twice the huge page size is okay, which it's not necessarily in a small zone, and so COMPACT_SKIPPED makes it it go back to loop_again:. This will go on until somebody else allocates and breaches the high watermark and then hopefully goes on to reclaim the zone above low watermark + 2 * THP. I'm not really sure what the correct solution is. Should we modify the zone_watermark_ok() checks in balance_pgdat() to take into account the higher watermark requirements for reclaim on behalf of compaction? Change the check in compaction_suitable() / not use it directly? Thanks, Johannes -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/