Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752556Ab1C2AhI (ORCPT ); Mon, 28 Mar 2011 20:37:08 -0400 Received: from smtp-out.google.com ([74.125.121.67]:42457 "EHLO smtp-out.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752141Ab1C2AhG (ORCPT ); Mon, 28 Mar 2011 20:37:06 -0400 DomainKey-Signature: a=rsa-sha1; c=nofws; d=google.com; s=beta; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; b=LHBvIk3lt1ntjfi7BjOOkAqBPv7zqibq9kZaGFKX+iWK5d9mQSWkAZ31C+6BfYLYxG s6DGGb2GlKatfhGq7izg== MIME-Version: 1.0 In-Reply-To: <20110329091254.20c7cfcb.kamezawa.hiroyu@jp.fujitsu.com> References: <20110328093957.089007035@suse.cz> <20110329091254.20c7cfcb.kamezawa.hiroyu@jp.fujitsu.com> Date: Mon, 28 Mar 2011 17:37:02 -0700 Message-ID: Subject: Re: [RFC 0/3] Implementation of cgroup isolation From: Ying Han To: KAMEZAWA Hiroyuki Cc: Michal Hocko , linux-mm@kvack.org, linux-kernel@vger.kernel.org, Hugh Dickins , Suleiman Souhlal Content-Type: text/plain; charset=ISO-8859-1 X-System-Of-Record: true Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2772 Lines: 69 On Mon, Mar 28, 2011 at 5:12 PM, KAMEZAWA Hiroyuki wrote: > On Mon, 28 Mar 2011 11:01:18 -0700 > Ying Han wrote: > >> On Mon, Mar 28, 2011 at 2:39 AM, Michal Hocko wrote: >> > Hi all, >> > >> > Memory cgroups can be currently used to throttle memory usage of a group of >> > processes. It, however, cannot be used for an isolation of processes from >> > the rest of the system because all the pages that belong to the group are >> > also placed on the global LRU lists and so they are eligible for the global >> > memory reclaim. >> > >> > This patchset aims at providing an opt-in memory cgroup isolation. This >> > means that a cgroup can be configured to be isolated from the rest of the >> > system by means of cgroup virtual filesystem (/dev/memctl/group/memory.isolated). >> >> Thank you Hugh pointing me to the thread. We are working on similar >> problem in memcg currently >> >> Here is the problem we see: >> 1. In memcg, a page is both on per-memcg-per-zone lru and global-lru. >> 2. Global memory reclaim will throw page away regardless of cgroup. >> 3. The zone->lru_lock is shared between per-memcg-per-zone lru and global-lru. >> >> And we know: >> 1. We shouldn't do global reclaim since it breaks memory isolation. >> 2. There is no need for a page to be on both LRU list, especially >> after having per-memcg background reclaim. >> >> So our approach is to take off page from global lru after it is >> charged to a memcg. Only pages allocated at root cgroup remains in >> global LRU, and each memcg reclaims pages on its isolated LRU. >> > > Why you don't use cpuset and virtual nodes ? It's what you want. We've been running cpuset + fakenuma nodes configuration in google to provide memory isolation. The configuration of having the virtual box is complex which user needs to know great details of the which node to assign to which cgroup. That is one of the motivations for us moving towards to memory controller which simply do memory accounting no matter where pages are allocated. By saying that, memcg simplified the memory accounting per-cgroup but the memory isolation is broken. This is one of examples where pages are shared between global LRU and per-memcg LRU. It is easy to get cgroup-A's page evicted by adding memory pressure to cgroup-B. The approach we are thinking to make the page->lru exclusive solve the problem. and also we should be able to break the zone->lru_lock sharing. --Ying > > Thanks, > -Kame > > -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/