Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755779AbaJNWPC (ORCPT ); Tue, 14 Oct 2014 18:15:02 -0400 Received: from mail-lb0-f180.google.com ([209.85.217.180]:45442 "EHLO mail-lb0-f180.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753413AbaJNWPA (ORCPT ); Tue, 14 Oct 2014 18:15:00 -0400 MIME-Version: 1.0 In-Reply-To: <20141014221219.GA12338@mail.hallyn.com> References: <87tx36l3gp.fsf@x220.int.ebiederm.org> <20141014221219.GA12338@mail.hallyn.com> From: Andy Lutomirski Date: Tue, 14 Oct 2014 15:14:38 -0700 Message-ID: Subject: Re: [PATCH] fs: Treat non-ancestor-namespace mounts as MNT_NOSUID To: "Serge E. Hallyn" Cc: "Eric W. Biederman" , Linux FS Devel , "linux-kernel@vger.kernel.org" , Michael j Theall , fuse-devel@lists.sourceforge.net, Miklos Szeredi , "Serge H. Hallyn" , Seth Forshee Content-Type: text/plain; charset=UTF-8 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Oct 14, 2014 at 3:12 PM, Serge E. Hallyn wrote: > Quoting Eric W. Biederman (ebiederm@xmission.com): >> Andy Lutomirski writes: >> >> > If a process gets access to a mount from a descendent or unrelated >> > user namespace, that process should not be able to take advantage of >> > setuid files or selinux entrypoints from that filesystem. >> > >> > This will make it safer to allow more complex filesystems to be >> > mounted in non-root user namespaces. >> > >> > This does not remove the need for MNT_LOCK_NOSUID. The setuid, >> > setgid, and file capability bits can no longer be abused if code in >> > a user namespace were to clear nosuid on an untrusted filesystem, >> > but this patch, by itself, is insufficient to protect the system >> > from abuse of files that, when execed, would increase MAC privilege. >> > >> > As a more concrete explanation, any task that can manipulate a >> > vfsmount associated with a given user namespace already has >> > capabilities in that namespace and all of its descendents. If they >> > can cause a malicious setuid, setgid, or file-caps executable to >> > appear in that mount, then that executable will only allow them to >> > elevate privileges in exactly the set of namespaces in which they >> > are already privileges. >> > >> > On the other hand, if they can cause a malicious executable to >> > appear with a dangerous MAC label, running it could change the >> > caller's security context in a way that should not have been >> > possible, even inside the namespace in which the task is confined. >> >> As presented this is complete and total nonsense. Mount propgation >> strongly weakens if not completely breaks the assumptions you are making >> in this code. >> >> To write any generic code that knows anything we need to capture a user >> namespace on struct super. >> >> Further I think all we really want is to filter out security labels from >> unprivileged mounts. uids/gids and the like should be completely fine >> because of the uid mappings. >> >> Having been down the route of comparing uids as userns uid tuples I am >> convinced that anything requires us to take the user namespace into >> account on a routine basis in the core will simply be broken for someone >> forgetting somewhere. This looks like a design that has that kind of >> susceptibility. > > The above paragraph is very compelling. However Andy's patch is a step > in the right direction from what we've got. I think given what you say > below and given Andy's rationale above, simply tweaking his patch to > ignore the parent-userns loop, and return false if current_user_ns() != > mount_userns, should be right? It'll prevent a child userns from > setting a selinux/apparmor entrypoint or POSIX file capabilities on a > file and having the parent userns trip over those. I'm a bit confused. How does removing that loop help? Shouldn't all usernses trust labels from the root userns? Admittedly, they should really only be execing things that propagated in instead of using /proc or fchdir, so I don't think we lose much by dropping the loop. --Andy -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/