Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932079AbWC1QTB (ORCPT ); Tue, 28 Mar 2006 11:19:01 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S932105AbWC1QTB (ORCPT ); Tue, 28 Mar 2006 11:19:01 -0500 Received: from ebiederm.dsl.xmission.com ([166.70.28.69]:17819 "EHLO ebiederm.dsl.xmission.com") by vger.kernel.org with ESMTP id S932079AbWC1QTA (ORCPT ); Tue, 28 Mar 2006 11:19:00 -0500 To: Nick Piggin Cc: Kirill Korotaev , haveblue@us.ibm.com, linux-kernel@vger.kernel.org, herbert@13thfloor.at, devel@openvz.org, serue@us.ibm.com, akpm@osdl.org, sam@vilain.net, Alexey Kuznetsov , Pavel Emelianov , Stanislav Protassov Subject: Re: [RFC] Virtualization steps References: <44242A3F.1010307@sw.ru> <44242D4D.40702@yahoo.com.au> <4428FB90.5000601@sw.ru> <4428FEA5.9020808@yahoo.com.au> From: ebiederm@xmission.com (Eric W. Biederman) Date: Tue, 28 Mar 2006 09:15:56 -0700 In-Reply-To: <4428FEA5.9020808@yahoo.com.au> (Nick Piggin's message of "Tue, 28 Mar 2006 19:15:17 +1000") Message-ID: User-Agent: Gnus/5.1007 (Gnus v5.10.7) Emacs/21.4 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3670 Lines: 88 Nick Piggin writes: > Kirill Korotaev wrote: >> Nick, will be glad to shed some light on it. >> > > Thanks very much Kirill. > > I don't think I'm qualified to make any decisions about this, > so I don't want to detract from the real discussions, but I > just had a couple more questions: > >> First of all, what it does which low level virtualization can't: >> - it allows to run 100 containers on 1GB RAM >> (it is called containers, VE - Virtual Environments, >> VPS - Virtual Private Servers). >> - it has no much overhead (<1-2%), which is unavoidable with hardware >> virtualization. For example, Xen has >20% overhead on disk I/O. > > Are any future hardware solutions likely to improve these problems? This isn't a direct competition, both solutions coincide nicely. The major efficiency differences are fundamental to the approaches and can only be solved in software and not hardware. The fundamental efficiency limits of low level virtualization are not sharing resources between instances well (think how hard memory hotplug is to solve), the fact that running a kernel takes at least 1MB for just the kernel, the fact that no matter how good your hypervisor is there will be some hardware interface it doesn't virtualize. Whereas what we are aiming at are just enough modifications to the kernel to allow multiple instances of user space. We aren't virtualizing anything that isn't already virtualized in the kernel. >> OS kernel virtualization >> ~~~~~~~~~~~~~~~~~~~~~~~~ > > Is this considered secure enough that multiple untrusted VEs are run > on production systems? Kirill or Herbert can give a better answer but that is of the major points of BSD Jails and their kin is it not? > What kind of users want this, who can't use alternatives like real > VMs? Well that question assumes a lot. The answer that assumes a lot in the other direction is that adding an additional unnecessary layers just complicates the problem and slows things down for no reason while making it so you can't assume the solution is always present. In addition to doing it in a non-portable way so it is only available on a few platforms. I can't even think of a straight answer to the users question. My users are in the high performance computing realm, and for that subset it is easy. Xen and it's kin don't virtualize the high bandwidth low latency communication hardware that is used, and that may not even be possible. Using a hypervisor in a situation like that certainly isn't general or easily maintainable. (Think about what a challenge it has been to get usable infiniband drivers merged). >> Summary of previous discussions on LKML >> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ > > Have their been any discussions between the groups pushing this > virtualization, and important kernel developers who are not part of > a virtualization effort? Ie. is there any consensus about the > future of these patches? Yes, but just enough to give us hope :) Unless you count the mount namespace as part of this in which case pieces are already merged. The challenging is that writing kernel code that does this is easy. Writing kernel code that is mergeable and that the different groups all agree meets their requirements is much harder. It has taken us until now to have a basic approach that we all agree on. Now we get to beat each other up over the technical details :) Eric - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/