Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755946Ab1BWVVk (ORCPT ); Wed, 23 Feb 2011 16:21:40 -0500 Received: from out02.mta.xmission.com ([166.70.13.232]:52280 "EHLO out02.mta.xmission.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753150Ab1BWVVh (ORCPT ); Wed, 23 Feb 2011 16:21:37 -0500 From: ebiederm@xmission.com (Eric W. Biederman) To: David Howells Cc: "Serge E. Hallyn" , LSM , Andrew Morton , James Morris , Kees Cook , containers@lists.linux-foundation.org, kernel list , Alexey Dobriyan , Michael Kerrisk , xemul@parallels.com References: <20110217150257.GA26395@mail.hallyn.com> <20110217150224.GA26334@mail.hallyn.com> <3139.1298481393@redhat.com> Date: Wed, 23 Feb 2011 13:21:28 -0800 In-Reply-To: <3139.1298481393@redhat.com> (David Howells's message of "Wed, 23 Feb 2011 17:16:33 +0000") Message-ID: User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.1 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-XM-SPF: eid=;;;mid=;;;hst=in02.mta.xmission.com;;;ip=98.207.153.68;;;frm=ebiederm@xmission.com;;;spf=neutral X-XM-AID: U2FsdGVkX18k3RmqBY1nl8bH/dw9Dw7CQSrLQoUj0SU= X-SA-Exim-Connect-IP: 98.207.153.68 X-SA-Exim-Mail-From: ebiederm@xmission.com X-Spam-Report: * -1.0 ALL_TRUSTED Passed through trusted hosts only via SMTP * 1.5 XMNoVowels Alpha-numberic number with no vowels * 0.0 T_TM2_M_HEADER_IN_MSG BODY: T_TM2_M_HEADER_IN_MSG * -3.0 BAYES_00 BODY: Bayes spam probability is 0 to 1% * [score: 0.0000] * -0.0 DCC_CHECK_NEGATIVE Not listed in DCC * [sa07 1397; Body=1 Fuz1=1 Fuz2=1] * 0.0 T_TooManySym_01 4+ unique symbols in subject * 0.1 XMSolicitRefs_0 Weightloss drug * 0.4 UNTRUSTED_Relay Comes from a non-trusted relay X-Spam-DCC: XMission; sa07 1397; Body=1 Fuz1=1 Fuz2=1 X-Spam-Combo: ;David Howells X-Spam-Relay-Country: Subject: Re: [PATCH 1/9] Add a user_namespace as creator/owner of uts_namespace X-Spam-Flag: No X-SA-Exim-Version: 4.2.1 (built Fri, 06 Aug 2010 16:31:04 -0600) X-SA-Exim-Scanned: Yes (on in02.mta.xmission.com) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 5497 Lines: 162 David Howells writes: > Serge E. Hallyn wrote: > >> struct uts_namespace { >> struct kref kref; >> struct new_utsname name; >> + struct user_namespace *user_ns; >> }; > > If a uts_namespace belongs to a user_namespace, should CLONE_NEWUSER then > imply CLONE_NEWUTS? > > Or is uts_namespace::user_ns more an implication that the set of users in > user_namespace are the only ones authorised to alter the uts data. The later. > I presume that the uts_namespace of a process must be owned by one of the > user_namespaces in the alternating inheritance chain of namespaces and their > creators leading from current_user_ns() to init_user_ns. > > With that in mind, looking at patch 3: > > - if (!capable(CAP_SYS_ADMIN)) > + if (!ns_capable(current->nsproxy->uts_ns->user_ns, CAP_SYS_ADMIN)) > > what is it you're actually asking? I presume it's 'does this user have > CAP_SYS_ADMIN capability over objects belonging to the uts_namespace's > user_namespace?' Yes. > So, to look at the important bit of patch 2: > > -int cap_capable(struct task_struct *tsk, const struct cred *cred, int cap, > - int audit) > +int cap_capable(struct task_struct *tsk, const struct cred *cred, > + struct user_namespace *targ_ns, int cap, int audit) > { > - return cap_raised(cred->cap_effective, cap) ? 0 : -EPERM; > + for (;;) { > + /* The creator of the user namespace has all caps. */ > + if (targ_ns != &init_user_ns && targ_ns->creator == cred->user) > + return 0; > + > + /* Do we have the necessary capabilities? */ > + if (targ_ns == cred->user->user_ns) > + return cap_raised(cred->cap_effective, cap) ? 0 : -EPERM; > + > + /* Have we tried all of the parent namespaces? */ > + if (targ_ns == &init_user_ns) > + return -EPERM; > + > + /* If you have the capability in a parent user ns you have it > + * in the over all children user namespaces as well, so see > + * if this process has the capability in the parent user > + * namespace. > + */ > + targ_ns = targ_ns->creator->user_ns; > + } > + > + /* We never get here */ > + return -EPERM; > } > > On entry, as we're called from ns_capable(), cred->user is the user that the > current process is running as, and, as such, may be in a separate namespace > from uts_namespace - which may itself be in a separate namespace from > init_user_ns. > > So, assume for the sake of argument that there are three user_namespaces along > the chain from the calling process to the root, and that the uts_namespace > belongs to the middle one. So we have the nested stack of: user_ns3->creator->user_ns == user_ns2 user_ns2->creator->user_ns == &init_user_ns uts_ns2->user_ns == user_ns2 > > if (targ_ns != &init_user_ns && targ_ns->creator == cred->user) > return 0; > > Can never match because targ_ns->creator cannot be cred->user; even if the > uts_namespace belongs to our namespace, given that the creator lies outside > our namespace. Initially we come in with targ_ns == user_ns2 and cred->user->user_ns in one of (user_ns3, user_ns2, or &init_user_ns). targ_ns takes on values user_ns2 and &init_user_ns. So when targ_ns becomes &init_user_ns. If the user in question is uts_ns2->user_ns->creator. This check will indeed match. > if (targ_ns == cred->user->user_ns) > return cap_raised(cred->cap_effective, cap) ? 0 : -EPERM; > > Can only match if we are in the target user_namespace (ie. the one to which > uts_namespace belongs), whether or not we have CAP_SYS_ADMIN. As before targ_ns takes on values of user_ns2 and &init_user_ns. Which means this check will match if we have CAP_SYS_ADMIN in &init_user_ns or in user_ns2. > Which means that unless the uts_namespace belongs to our user_namespace, we > cannot change it. Is that correct? No. If you are root in a parent namespace you can also change it. > So ns_capable() restricts you to only doing interesting things to objects that > belong to a user_namespace if they are in your own user_namespace. Is that > correct? No. Root outside your user namespace is also allowed to do interesting things. > If that is so, is the loop required for ns_capable()? Yes. > Looking further at patch 2: > > #define nsown_capable(cap) (ns_capable(current_user_ns(), (cap))) > > Given what I've said above, I presume the loop isn't necessary here either. > > > I think you're using ns_capable() in two different ways: > > (1) You're using it to see if a process has power over its descendents in a > user_namespace that can be traced back to a clone() that it did with > CLONE_NEWUSER. > > For example, automatically granting a process permission to kill > descendents in a namespace it created. > > (2) You're using it to see if a process can access objects that might be > outside its own user_namespace. > > For example, setting the hostname. > > Is it worth giving two different interfaces to make this clearer (even if they > actually do the same thing)? > > > Sorry if this seems rambly, but I'm trying to get my head round your > code. I am all for making that loop a little clearer, because it is something you have to pause and think about to understand but so far I don't think the loop is wrong, and it is simple. Eric -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/