Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1758886Ab0FVCYf (ORCPT ); Mon, 21 Jun 2010 22:24:35 -0400 Received: from fgwmail5.fujitsu.co.jp ([192.51.44.35]:34956 "EHLO fgwmail5.fujitsu.co.jp" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752899Ab0FVCYd (ORCPT ); Mon, 21 Jun 2010 22:24:33 -0400 X-SecurityPolicyCheck-FJ: OK by FujitsuOutboundMailChecker v1.3.1 From: KOSAKI Motohiro To: Minchan Kim Subject: Re: [Patch] Call cond_resched() at bottom of main look in balance_pgdat() Cc: kosaki.motohiro@jp.fujitsu.com, Larry Woodman , linux-kernel@vger.kernel.org, linux-mm@kvack.org In-Reply-To: <20100621141315.GB2456@barrios-desktop> References: <20100618093954.FBE7.A69D9226@jp.fujitsu.com> <20100621141315.GB2456@barrios-desktop> Message-Id: <20100622112416.B554.A69D9226@jp.fujitsu.com> MIME-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit X-Mailer: Becky! ver. 2.50.07 [ja] Date: Tue, 22 Jun 2010 11:24:24 +0900 (JST) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2288 Lines: 62 > > ============================================================= > > Subject: [PATCH] Call cond_resched() at bottom of main look in balance_pgdat() > > From: Larry Woodman > > > > We are seeing a problem where kswapd gets stuck and hogs the CPU on a > > small single CPU system when an OOM kill should occur. When this > > happens swap space has been exhausted and the pagecache has been shrunk > > to zero. Once kswapd gets the CPU it never gives it up because at least > > one zone is below high. Adding a single cond_resched() at the end of > > the main loop in balance_pgdat() fixes the problem by allowing the > > watchdog and tasks to run and eventually do an OOM kill which frees up > > the resources. > > > > kosaki note: This seems regression caused by commit bb3ab59683 > > (vmscan: stop kswapd waiting on congestion when the min watermark is > > not being met) > > > > Signed-off-by: Larry Woodman > > Reviewed-by: KOSAKI Motohiro > > --- > > mm/vmscan.c | 1 + > > 1 files changed, 1 insertions(+), 0 deletions(-) > > > > diff --git a/mm/vmscan.c b/mm/vmscan.c > > index 9c7e57c..c5c46b7 100644 > > --- a/mm/vmscan.c > > +++ b/mm/vmscan.c > > @@ -2182,6 +2182,7 @@ loop_again: > > */ > > if (sc.nr_reclaimed >= SWAP_CLUSTER_MAX) > > break; > > + cond_resched(); > > } > > out: > > /* > > -- > > 1.6.5.2 > > Kosaki's patch's goal is that kswap doesn't yield cpu if the zone doesn't meet its > min watermark to avoid failing atomic allocation. > But this patch could yield kswapd's time slice at any time. > Doesn't the patch break your goal in bb3ab59683? No. it don't break. Typically, kswapd periodically call shrink_page_list() and it call cond_resched() even if bb3ab59683 case. Larry observed very exceptional situation. his system don't have reclaimable pages at all, then eventually shrink_page_list() was not called very long time. His patch only change such very rare situation, I think it's safe. Thanks. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/