Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751391AbZFEEH6 (ORCPT ); Fri, 5 Jun 2009 00:07:58 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1750739AbZFEEHu (ORCPT ); Fri, 5 Jun 2009 00:07:50 -0400 Received: from TYO201.gate.nec.co.jp ([202.32.8.193]:49672 "EHLO tyo201.gate.nec.co.jp" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750706AbZFEEHu (ORCPT ); Fri, 5 Jun 2009 00:07:50 -0400 Date: Fri, 5 Jun 2009 13:04:45 +0900 From: Daisuke Nishimura To: balbir@linux.vnet.ibm.com Cc: nishimura@mxp.nes.nec.co.jp, KAMEZAWA Hiroyuki , "linux-mm@kvack.org" , "linux-kernel@vger.kernel.org" Subject: Re: [PATCH] remove memory.limit v.s. memsw.limit comparison. Message-Id: <20090605130445.447ca03a.nishimura@mxp.nes.nec.co.jp> In-Reply-To: <20090605011019.GG7504@balbir.in.ibm.com> References: <20090604141043.9a1064fd.kamezawa.hiroyu@jp.fujitsu.com> <20090604123625.GE7504@balbir.in.ibm.com> <0921392c77890fc84fa69653ae4f31d9.squirrel@webmail-b.css.fujitsu.com> <20090605011019.GG7504@balbir.in.ibm.com> Organization: NEC Soft, Ltd. X-Mailer: Sylpheed 2.6.0 (GTK+ 2.10.14; i686-pc-mingw32) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3331 Lines: 80 On Fri, 5 Jun 2009 09:10:19 +0800, Balbir Singh wrote: > * KAMEZAWA Hiroyuki [2009-06-05 00:45:03]: > > > Balbir Singh wrote: > > > * KAMEZAWA Hiroyuki [2009-06-04 > > > 14:10:43]: > > > > > >> From: KAMEZAWA Hiroyuki > > >> > > >> Removes memory.limit < memsw.limit at setting limit check completely. > > >> > > >> The limitation "memory.limit <= memsw.limit" was added just because > > >> it seems sane ...if memory.limit > memsw.limit, only memsw.limit works. > > >> > > >> But To implement this limitation, we needed to use private mutex and > > >> make > > >> the code a bit complated. > > >> As Nishimura pointed out, in real world, there are people who only want > > >> to use memsw.limit. > > >> > > >> Then, this patch removes the check. user-land library or middleware can > > >> check > > >> this in userland easily if this really concerns. > > >> > > >> And this is a good change to charge-and-reclaim. > > >> > > >> Now, memory.limit is always checked before memsw.limit > > >> and it may do swap-out. But, if memory.limit == memsw.limit, swap-out is > > >> finally no help and hits memsw.limit again. So, let's allow the > > >> condition > > >> memory.limit > memsw.limit. Then we can skip unnecesary swap-out. > > >> > > >> Signed-off-by: KAMEZAWA Hiroyuki > > >> --- > > > > > > There is one other option, we could set memory.limit_in_bytes == > > > memory.memsw.limit_in_bytes provided it is set to LONG_LONG_MAX. I am > > > not convinced that we should allow memsw.limit_in_bytes to be less > > > that limit_in_bytes, it will create confusion and the API is already > > > exposed. > > > > > Ahhhh, I get your point. > > if memory.memsw.limit_in_bytes < memory.limit_in_bytes, no swap will > > be used bacause currnet try_to_free_pages() for memcg skips swap-out. > > Then, only global-LRU will use swap. > > This behavior is not easy to understand. > > > > Sorry, I don't push this patch as this is. But adding documentation about > > "What happens when you set memory.limit == memsw.limit" will be necessary. > > > > ...maybe give all jobs to user-land and keep the kernel as it is now > > is a good choice. > > Yes, probably and with libcgroup and configuration, defaults should > not be hard to setup. Worst case we can use a script to setup both the > values. > > > > > BTW, I'd like to avoid useless swap-out in memory.limit == memsw.limit case. > > If someone has good idea, please :( > > > > Are you seeing swap even with memory.limit == memory.memsw.limit? Only > global pressure should cause swapout, no? > I don't remember in detail, but I could see swap usage by "free" command even when I set "memory.limit == memory.memsw.limit" long ago, the usage was small though. It cannot be helped in current implementation, because mem.limit is checked before memsw.limit, and it is a waste of cpu time to scan anon-lru when "memory.limit == memory.memsw.limit". Thanks, Daisuke Nishimura. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/