Received: by 2002:a25:1506:0:0:0:0:0 with SMTP id 6csp6381940ybv; Wed, 12 Feb 2020 11:07:08 -0800 (PST) X-Google-Smtp-Source: APXvYqyMJvpgM+uOsuCQ/PE6oTQQQWv9c7bgVQxbZX9Lqr31buMHl5p/744bwRGFHGhizHkWd+LJ X-Received: by 2002:a9d:48d:: with SMTP id 13mr9755915otm.249.1581534428224; Wed, 12 Feb 2020 11:07:08 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1581534428; cv=none; d=google.com; s=arc-20160816; b=ZZuwjQzxye8EneBgxH3uwGJLSS601nch7ecPOkg1rVg9u8qJErtSBVHyx7L/GmT8Wt 0R0OsGfVemIpDSNME3mSknqdWmhURkmQFbQVdjXOlDU9NJk+uFCUO9QR7ghjW4Xxej18 hJKNVZLEpXXai6kO7kYqRhn/Juy5Yxg1kOGiI3TqWBWqYD0O0dbX8vlpN1eMy1bxkd3d LP3dOL0pgGXgdX9rtZxBgQxBnU3SWSsWcMu+1APCfjf4RKmqD3odphUDiKs1F7Ezhsrq kDBs7U9/AMzGJbTLBImNCC80uiVxooMC1dqC50Y7Dj/A8Vt4C26T8AsN+iQ7C/yNKwem V1ew== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=foWRYWO78ULZBWItaixgqqE1dZLXg8h/ArW04ZpV+3U=; b=Pzw0tNsF+mKk/mmjKh7ZFm9+ulcR4TZAZPbAH77SHozc7albXPAFHlhaOWe70gycRQ oOdLhdfLKqh2D/uM1kWmwVuqlp4kMqa+cba1wvYw1WrH1MLGpikWcHG/sBBAkllVPOk8 EP07jPvrhLwJMO+PXajwMm2VuJ5IqdWvV8LJ8T0m+ielqLOzxko+JAwGCEQLbZ536xgx oPJsJPqLZ5ehCgIdR/3IdiydH10CJerSYK7zU6SPLZHsmN5MlnYGPbQwBT0HbMeWqs8D NSRgitbDTaUgGI4HIteksY9HUk+jjgMaRtuaoVKbchywb+DXB3ZwFsCcUNIxkl61HEGX ssyA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=aV9YD0PI; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id n11si621383otq.112.2020.02.12.11.06.49; Wed, 12 Feb 2020 11:07:08 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=aV9YD0PI; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728866AbgBLTFd (ORCPT + 99 others); Wed, 12 Feb 2020 14:05:33 -0500 Received: from mail-ot1-f67.google.com ([209.85.210.67]:38320 "EHLO mail-ot1-f67.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727439AbgBLTFd (ORCPT ); Wed, 12 Feb 2020 14:05:33 -0500 Received: by mail-ot1-f67.google.com with SMTP id z9so3038415oth.5 for ; Wed, 12 Feb 2020 11:05:32 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=foWRYWO78ULZBWItaixgqqE1dZLXg8h/ArW04ZpV+3U=; b=aV9YD0PIqGsFxeLHhOyB8Ic2JiX3zbqDjHrTiODkiO0DoQ18h/n+sV2sdQkaEqbuuP buYE1urSPPsSwhLczKzrO7hU4fml11RrGw+fG6rdYBVA4IwNLn18yZOQw0GncgXlkBaE +a2FGKnnD/Kq+QmgJe+kwhuP1xaIigldCfY/DvmtZnTiA2LJjpsXhiKGWDM2hKDQ0UW0 weucAY2C1KlByL01wxA4iblHegp2xD2Zfu/wz/Q1ZpdE5+XjraGgBA2jDnLVTBjYxqHq MWJ8vk1fcDqo5XTfJr1HCSb8HJNlr+VV/T8mVGVjlpM+eK7SKWzwkJEqrXC0vwZ3nBPT KLMw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=foWRYWO78ULZBWItaixgqqE1dZLXg8h/ArW04ZpV+3U=; b=Gr1w0k5Phh77oSknK4u5Olg1VKfrjY6kZAfCzCjxMeE2Wwc5Q3R3XhGh4sIBpF2bK3 N/aMD+4l0l0nF+pPxDJ3ndGJ+4gbkdLVxfWRYDEhrxnWU7mMCHX7lH+62NFrc+fR4NsE tj5PmurjYFTy55ojr3DwCfyqXxVdB/YRoJwTGlhZrrrbkAS7UzrG5uq18icc61YKpFMn gMYQ9S75/w+uhRXBG8+YAF8A+9cDnSoKHNJJwF3xuiHpSOXuJL0lZb4TyUsq7bylzxdF bMvuq2uQ/vCfynft/2gQ9pxU72MIA5/56rKKIxNCHKrET4C3/lVaViNv2b6R9aJ05icM u+ZQ== X-Gm-Message-State: APjAAAVbdzIH3mw1ENSlmqO7CMw0om1dxyzMlnT4/jkvn04HtMlL8RRP YSI6yDla7BLo8FOK47ioXM82VMzpFkIrfzAKpWZ+GQ== X-Received: by 2002:a9d:7586:: with SMTP id s6mr10267881otk.342.1581534331557; Wed, 12 Feb 2020 11:05:31 -0800 (PST) MIME-Version: 1.0 References: <20200211225547.235083-1-dancol@google.com> <20200211225547.235083-4-dancol@google.com> <69f4ccce-18b2-42c1-71ac-3fe9caf2dfb6@tycho.nsa.gov> In-Reply-To: <69f4ccce-18b2-42c1-71ac-3fe9caf2dfb6@tycho.nsa.gov> From: Daniel Colascione Date: Wed, 12 Feb 2020 11:04:54 -0800 Message-ID: Subject: Re: [PATCH v2 3/6] Teach SELinux about a new userfaultfd class To: Stephen Smalley Cc: Tim Murray , Nosh Minwalla , Nick Kralevich , Lokesh Gidra , linux-kernel , Linux API , selinux@vger.kernel.org Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Feb 12, 2020 at 10:59 AM Stephen Smalley wrote: > > On 2/12/20 1:04 PM, Stephen Smalley wrote: > > On 2/12/20 12:19 PM, Daniel Colascione wrote: > >> Thanks for taking a look. > >> > >> On Wed, Feb 12, 2020 at 9:04 AM Stephen Smalley > >> wrote: > >>> > >>> On 2/11/20 5:55 PM, Daniel Colascione wrote: > >>>> Use the secure anonymous inode LSM hook we just added to let SELinux > >>>> policy place restrictions on userfaultfd use. The create operation > >>>> applies to processes creating new instances of these file objects; > >>>> transfer between processes is covered by restrictions on read, write, > >>>> and ioctl access already checked inside selinux_file_receive. > >>>> > >>>> Signed-off-by: Daniel Colascione > >>> > >>> (please add linux-fsdevel and viro to the cc for future versions of this > >>> patch since it changes the VFS) > >>> > >>>> --- > >>>> diff --git a/security/selinux/hooks.c b/security/selinux/hooks.c > >>>> index 1659b59fb5d7..e178f6f40e93 100644 > >>>> --- a/security/selinux/hooks.c > >>>> +++ b/security/selinux/hooks.c > >>>> @@ -2915,6 +2919,69 @@ static int selinux_inode_init_security(struct > >>>> inode *inode, struct inode *dir, > >>>> + > >>>> + /* > >>>> + * We shouldn't be creating secure anonymous inodes before LSM > >>>> + * initialization completes. > >>>> + */ > >>>> + if (unlikely(!selinux_state.initialized)) > >>>> + return -EBUSY; > >>> > >>> I don't think this is viable; any arbitrary actions are possible before > >>> policy is loaded, and a Linux distro can be brought up fully with > >>> SELinux enabled and no policy loaded. You'll just need to have a > >>> default behavior prior to initialization. > >> > >> We'd have to fail open then, I think, and return an S_PRIVATE inode > >> (the regular anon inode). > > > > Not sure why. You aren't doing anything in the hook that actually > > relies on selinux_state.initialized being set (i.e. nothing requires a > > policy). The avc_has_perm() call will just succeed until a policy is > > loaded. So if these inodes are created prior to policy load, they will > > get assigned the task SID (which would be the kernel SID prior to policy > > load or first exec or write to /proc/self/attr/current afterward) and > > UFFD class (in your current code), be permitted, and then once policy is > > loaded any further access will get checked against the kernel SID. > > > >>>> + /* > >>>> + * We only get here once per ephemeral inode. The inode has > >>>> + * been initialized via inode_alloc_security but is otherwise > >>>> + * untouched, so check that the state is as > >>>> + * inode_alloc_security left it. > >>>> + */ > >>>> + BUG_ON(isec->initialized != LABEL_INVALID); > >>>> + BUG_ON(isec->sclass != SECCLASS_FILE); > >>> > >>> I think the kernel discourages overuse of BUG_ON/BUG/... > >> > >> I'm not sure what counts as overuse. > > > > Me either (not my rule) but I'm pretty sure this counts or you'd see a > > lot more of these kinds of BUG_ON() checks throughout. Try to reserve > > them for really critical cases. > > > >>>> + > >>>> +#ifdef CONFIG_USERFAULTFD > >>>> + if (fops == &userfaultfd_fops) > >>>> + isec->sclass = SECCLASS_UFFD; > >>>> +#endif > >>> > >>> Not sure we want or need to introduce a new security class for each user > >>> of anonymous inodes since the permissions should be the same as for > >>> file. > >> > >> The purpose of this change is to apply special policy to userfaultfd > >> FDs in particular. Isn't having a UFFD security class the best way to > >> go about that? (There's no path.) Am I missing something? > > > > It is probably the simplest approach; it just doesn't generalize to all > > users of anonymous inodes. We can distinguish them in one of two ways: > > use a different class like you did (requires a code change every time we > > add a new one and yet another duplicate of the file class) or use a > > different SID/context/type. The latter could be achieved by calling > > security_transition_sid() with the provided name wrapped in a qstr and > > specifying type_transition rules on the name. Then policy could define > > derived types for each domain, ala > > type_transition init self:file "[userfaultfd]" init_userfaultfd; > > type_transition untrusted_app self:file "[userfaultfd]" > > untrusted_app_userfaultfd; > > ... > > > >>> Also not sure we want to be testing fops for each such case. > >> > >> I was also thinking of just providing some kind of context string > >> (maybe the name), which might be friendlier to modules, but the loose > >> coupling kind of scares me, and for this particular application, since > >> UFFD is always in the core and never in a module, checking the fops > >> seems a bit more robust and doesn't hurt anything. > > > > Yes, not sure how the vfs folks feel about either coupling (the > > name-based one or the fops-based one). Neither seems great. > > > >>> We > >>> were looking at possibly leveraging the name as a key and using > >>> security_transition_sid() to generate a distinct SID/context/type for > >>> the inode via type_transition rules in policy. We have some WIP along > >>> those lines. > >> > >> Where? Any chance it would be ready soon? I'd rather not hold up this > >> work for a more general mechanism. > > > > Hopefully will have a patch available soon. But not saying this > > necessarily has to wait either. > > > >>>> + /* > >>>> + * Always give secure anonymous inodes the sid of the > >>>> + * creating task. > >>>> + */ > >>>> + > >>>> + isec->sid = tsec->sid; > >>> > >>> This doesn't generalize for other users of anonymous inodes, e.g. the > >>> /dev/kvm case where we'd rather inherit the SID and class from the > >>> original /dev/kvm inode itself. > >> > >> I think someone mentioned on the first version of this patch that we > >> could make it more flexible if the need arose. If we do want to do it > >> now, we could have the anon_inode security hook accept a "parent" or > >> "context" inode that modules could inspect for the purposes of forming > >> the new inode's SID. Does that make sense to you? > > > > Yes, that's the approach in our current WIP, except we call it a > > "related" inode since it isn't necessarily connected to the anon inode > > in any vfs sense. > > The other key difference in our WIP approach is that we assumed that we > couldn't mandate allocating a separate anon inode for each of these fds > and we wanted to cover all anonymous inodes (not opt-in), so we are > storing the SID/class pair as additional fields in the > file_security_struct and have modified file_has_perm() and others to > look there for anonymous inodes. A separate inode seems like the simpler approach for now, because it means that we have fewer places to check for security information --- and it's not as if an inode is particularly expensive. We can always switch later.