Date: Mon, 27 Apr 2009 21:08:56 +0900
From: Daisuke Nishimura
To: KAMEZAWA Hiroyuki
Cc: "linux-mm@kvack.org", "balbir@linux.vnet.ibm.com", "nishimura@mxp.nes.nec.co.jp", "hugh@veritas.com", "akpm@linux-foundation.org", "linux-kernel@vger.kernel.org"
Subject: Re: [PATCH] fix leak of swap accounting as stale swap cache under memcg
Message-Id: <20090427210856.d5f4109e.d-nishimura@mtf.biglobe.ne.jp>
In-Reply-To: <20090427181259.6efec90b.kamezawa.hiroyu@jp.fujitsu.com>
References: <20090427181259.6efec90b.kamezawa.hiroyu@jp.fujitsu.com>
Reply-To: nishimura@mxp.nes.nec.co.jp

> Index: mmotm-2.6.30-Apr24/mm/vmscan.c
> ===================================================================
> --- mmotm-2.6.30-Apr24.orig/mm/vmscan.c
> +++ mmotm-2.6.30-Apr24/mm/vmscan.c
> @@ -661,6 +661,9 @@ static unsigned long shrink_page_list(st
>  		if (PageAnon(page) && !PageSwapCache(page)) {
>  			if (!(sc->gfp_mask & __GFP_IO))
>  				goto keep_locked;
> +			/* avoid making more stale swap caches */
> +			if (memcg_stale_swap_congestion())
> +				goto keep_locked;
>  			if (!add_to_swap(page))
>  				goto activate_locked;
>  			may_enter_fs = 1;
>

Well, as I mentioned before (http://marc.info/?l=linux-kernel&m=124066623510867&w=2), this cannot avoid
type-2 (the page is set to !PageCgroupUsed by the owner process via page_remove_rmap()->mem_cgroup_uncharge_page() before being added to the swap cache). If these swap caches go through shrink_page_list() without being freed for some reason, they don't go back to memcg's LRU.

Type-2 doesn't put pressure on memsw.usage, but you can see it by plotting "grep SwapCached /proc/meminfo".

And I don't think it's a good idea to add memcg_stale_swap_congestion() here. It means fewer chances to reclaim pages.

Do you dislike the patch I attached in the above mail? If not, please merge it (I tested your previous version with some fixes and my patch, and it worked well). Or shall I send it as a separate patch to fix type-2 after your patch (yours looks good to me for type-1)?
(To tell the truth, I want to reuse memcg_free_unused_swapcache() in another patch.)

Thanks,
Daisuke Nishimura.