Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756274Ab2FPRsL (ORCPT ); Sat, 16 Jun 2012 13:48:11 -0400 Received: from mail-lb0-f174.google.com ([209.85.217.174]:49794 "EHLO mail-lb0-f174.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754375Ab2FPRsJ (ORCPT ); Sat, 16 Jun 2012 13:48:09 -0400 MIME-Version: 1.0 In-Reply-To: <4FDAE3CC.60801@kernel.org> References: <1339661592-3915-1-git-send-email-kosaki.motohiro@gmail.com> <20120614145716.GA2097@barrios> <4FDAE3CC.60801@kernel.org> Date: Sat, 16 Jun 2012 23:18:07 +0530 Message-ID: Subject: Re: [resend][PATCH] mm, vmscan: fix do_try_to_free_pages() livelock From: Aaditya Kumar To: Minchan Kim Cc: KOSAKI Motohiro , linux-kernel@vger.kernel.org, linux-mm@kvack.org, akpm@linux-foundation.org, Nick Piggin , Michal Hocko , Johannes Weiner , Mel Gorman , KAMEZAWA Hiroyuki , Minchan Kim , frank.rowand@am.sony.com, tim.bird@am.sony.com, takuzo.ohara@ap.sony.com, kan.iibuchi@jp.sony.com Content-Type: text/plain; charset=ISO-8859-1 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2077 Lines: 54 On Fri, Jun 15, 2012 at 12:57 PM, Minchan Kim wrote: >> >> pgdat_balanced() doesn't recognized zone. Therefore kswapd may sleep >> if node has multiple zones. Hm ok, I realized my descriptions was >> slightly misleading. priority 0 is not needed. bakance_pddat() calls >> pgdat_balanced() >> every priority. Most easy case is, movable zone has a lot of free pages and >> normal zone has no reclaimable page. >> >> btw, current pgdat_balanced() logic seems not correct. kswapd should >> sleep only if every zones have much free pages than high water mark >> _and_ 25% of present pages in node are free. >> > > > Sorry. I can't understand your point. > Current kswapd doesn't sleep if relevant zones don't have free pages above high watermark. > It seems I am missing your point. > Please anybody correct me. Since currently direct reclaim is given up based on zone->all_unreclaimable flag, so for e.g in one of the scenarios: Lets say system has one node with two zones (NORMAL and MOVABLE) and we hot-remove the all the pages of the MOVABLE zone. While migrating pages during memory hot-unplugging, the allocation function (for new page to which the page in MOVABLE zone would be moved) can end up looping in direct reclaim path for ever. This is so because when most of the pages in the MOVABLE zone have been migrated, the zone now contains lots of free memory (basically above low watermark) BUT all are in MIGRATE_ISOLATE list of the buddy list. So kswapd() would not balance this zone as free pages are above low watermark (but all are in isolate list). So zone->all_unreclaimable flag would never be set for this zone and allocation function would end up looping forever. (assuming the zone NORMAL is left with no reclaimable memory) Regards, Aaditya Kumar Sony India Software Centre, Bangalore. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/