From: Daniel Colascione
Date: Wed, 12 Feb 2020 11:13:16 -0800
Subject: Re: [PATCH v2 3/6] Teach SELinux about a new userfaultfd class
To: Stephen Smalley
Cc: Tim Murray, Nosh Minwalla, Nick Kralevich, Lokesh Gidra, linux-kernel, Linux API, selinux@vger.kernel.org
References: <20200211225547.235083-1-dancol@google.com> <20200211225547.235083-4-dancol@google.com> <69f4ccce-18b2-42c1-71ac-3fe9caf2dfb6@tycho.nsa.gov> <626a2302-5b5f-d7c1-fdef-51094bb1fe0d@tycho.nsa.gov>
In-Reply-To: <626a2302-5b5f-d7c1-fdef-51094bb1fe0d@tycho.nsa.gov>
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed, Feb 12, 2020 at 11:10 AM Stephen Smalley wrote:
>
> On 2/12/20 2:04 PM, Daniel Colascione wrote:
> > On Wed, Feb 12, 2020 at 10:59 AM Stephen Smalley wrote:
> >>
> >> On 2/12/20 1:04 PM, Stephen Smalley wrote:
> >>> On 2/12/20 12:19 PM, Daniel Colascione wrote:
> >>>> Thanks for taking a look.
> >>>>
> >>>> On Wed, Feb 12, 2020 at 9:04 AM Stephen Smalley wrote:
> >>>>>
> >>>>> On 2/11/20 5:55 PM, Daniel Colascione wrote:
> >>>>>> Use the secure anonymous inode LSM hook we just added to let SELinux
> >>>>>> policy place restrictions on userfaultfd use.
> >>>>>> The create operation
> >>>>>> applies to processes creating new instances of these file objects;
> >>>>>> transfer between processes is covered by restrictions on read, write,
> >>>>>> and ioctl access already checked inside selinux_file_receive.
> >>>>>>
> >>>>>> Signed-off-by: Daniel Colascione
> >>>>>
> >>>>> (please add linux-fsdevel and viro to the cc for future versions of this
> >>>>> patch since it changes the VFS)
> >>>>>
> >>>>>> ---
> >>>>>> diff --git a/security/selinux/hooks.c b/security/selinux/hooks.c
> >>>>>> index 1659b59fb5d7..e178f6f40e93 100644
> >>>>>> --- a/security/selinux/hooks.c
> >>>>>> +++ b/security/selinux/hooks.c
> >>>>>> @@ -2915,6 +2919,69 @@ static int selinux_inode_init_security(struct
> >>>>>> inode *inode, struct inode *dir,
> >>>>>> +
> >>>>>> +       /*
> >>>>>> +        * We shouldn't be creating secure anonymous inodes before LSM
> >>>>>> +        * initialization completes.
> >>>>>> +        */
> >>>>>> +       if (unlikely(!selinux_state.initialized))
> >>>>>> +               return -EBUSY;
> >>>>>
> >>>>> I don't think this is viable; any arbitrary actions are possible before
> >>>>> policy is loaded, and a Linux distro can be brought up fully with
> >>>>> SELinux enabled and no policy loaded.  You'll just need to have a
> >>>>> default behavior prior to initialization.
> >>>>
> >>>> We'd have to fail open then, I think, and return an S_PRIVATE inode
> >>>> (the regular anon inode).
> >>>
> >>> Not sure why.  You aren't doing anything in the hook that actually
> >>> relies on selinux_state.initialized being set (i.e. nothing requires a
> >>> policy).  The avc_has_perm() call will just succeed until a policy is
> >>> loaded.
> >>> So if these inodes are created prior to policy load, they will
> >>> get assigned the task SID (which would be the kernel SID prior to policy
> >>> load or first exec or write to /proc/self/attr/current afterward) and
> >>> UFFD class (in your current code), be permitted, and then once policy is
> >>> loaded any further access will get checked against the kernel SID.
> >>>
> >>>>>> +       /*
> >>>>>> +        * We only get here once per ephemeral inode.  The inode has
> >>>>>> +        * been initialized via inode_alloc_security but is otherwise
> >>>>>> +        * untouched, so check that the state is as
> >>>>>> +        * inode_alloc_security left it.
> >>>>>> +        */
> >>>>>> +       BUG_ON(isec->initialized != LABEL_INVALID);
> >>>>>> +       BUG_ON(isec->sclass != SECCLASS_FILE);
> >>>>>
> >>>>> I think the kernel discourages overuse of BUG_ON/BUG/...
> >>>>
> >>>> I'm not sure what counts as overuse.
> >>>
> >>> Me either (not my rule) but I'm pretty sure this counts or you'd see a
> >>> lot more of these kinds of BUG_ON() checks throughout.  Try to reserve
> >>> them for really critical cases.
> >>>
> >>>>>> +
> >>>>>> +#ifdef CONFIG_USERFAULTFD
> >>>>>> +       if (fops == &userfaultfd_fops)
> >>>>>> +               isec->sclass = SECCLASS_UFFD;
> >>>>>> +#endif
> >>>>>
> >>>>> Not sure we want or need to introduce a new security class for each user
> >>>>> of anonymous inodes since the permissions should be the same as for
> >>>>> file.
> >>>>
> >>>> The purpose of this change is to apply special policy to userfaultfd
> >>>> FDs in particular.  Isn't having a UFFD security class the best way to
> >>>> go about that?  (There's no path.)  Am I missing something?
> >>>
> >>> It is probably the simplest approach; it just doesn't generalize to all
> >>> users of anonymous inodes.  We can distinguish them in one of two ways:
> >>> use a different class like you did (requires a code change every time we
> >>> add a new one and yet another duplicate of the file class) or use a
> >>> different SID/context/type.
> >>> The latter could be achieved by calling
> >>> security_transition_sid() with the provided name wrapped in a qstr and
> >>> specifying type_transition rules on the name.  Then policy could define
> >>> derived types for each domain, ala
> >>> type_transition init self:file "[userfaultfd]" init_userfaultfd;
> >>> type_transition untrusted_app self:file "[userfaultfd]"
> >>> untrusted_app_userfaultfd;
> >>> ...
> >>>
> >>>>> Also not sure we want to be testing fops for each such case.
> >>>>
> >>>> I was also thinking of just providing some kind of context string
> >>>> (maybe the name), which might be friendlier to modules, but the loose
> >>>> coupling kind of scares me, and for this particular application, since
> >>>> UFFD is always in the core and never in a module, checking the fops
> >>>> seems a bit more robust and doesn't hurt anything.
> >>>
> >>> Yes, not sure how the vfs folks feel about either coupling (the
> >>> name-based one or the fops-based one).  Neither seems great.
> >>>
> >>>>> We
> >>>>> were looking at possibly leveraging the name as a key and using
> >>>>> security_transition_sid() to generate a distinct SID/context/type for
> >>>>> the inode via type_transition rules in policy.  We have some WIP along
> >>>>> those lines.
> >>>>
> >>>> Where?  Any chance it would be ready soon?  I'd rather not hold up this
> >>>> work for a more general mechanism.
> >>>
> >>> Hopefully will have a patch available soon.  But not saying this
> >>> necessarily has to wait either.
> >>>
> >>>>>> +       /*
> >>>>>> +        * Always give secure anonymous inodes the sid of the
> >>>>>> +        * creating task.
> >>>>>> +        */
> >>>>>> +
> >>>>>> +       isec->sid = tsec->sid;
> >>>>>
> >>>>> This doesn't generalize for other users of anonymous inodes, e.g. the
> >>>>> /dev/kvm case where we'd rather inherit the SID and class from the
> >>>>> original /dev/kvm inode itself.
> >>>>
> >>>> I think someone mentioned on the first version of this patch that we
> >>>> could make it more flexible if the need arose.  If we do want to do it
> >>>> now, we could have the anon_inode security hook accept a "parent" or
> >>>> "context" inode that modules could inspect for the purposes of forming
> >>>> the new inode's SID.  Does that make sense to you?
> >>>
> >>> Yes, that's the approach in our current WIP, except we call it a
> >>> "related" inode since it isn't necessarily connected to the anon inode
> >>> in any vfs sense.
> >>
> >> The other key difference in our WIP approach is that we assumed that we
> >> couldn't mandate allocating a separate anon inode for each of these fds
> >> and we wanted to cover all anonymous inodes (not opt-in), so we are
> >> storing the SID/class pair as additional fields in the
> >> file_security_struct and have modified file_has_perm() and others to
> >> look there for anonymous inodes.
> >
> > A separate inode seems like the simpler approach for now, because it
> > means that we have fewer places to check for security information ---
> > and it's not as if an inode is particularly expensive.  We can always
> > switch later.
>
> We'd prefer having a separate inode if possible but didn't think that
> would fly with the vfs folks,

Let's ask them.

> especially if we try to apply this to all
> anonymous inodes.

For the moment, we're not.

> It might be ok for userfaultfd usage as a specific
> case but there is a reason why anonymous inodes were introduced and
> creating a separate inode each time defeats that purpose IIUC.  It will
> be interesting to see how they respond.

Sort of. Anonymous inodes also free other parts of the kernel from
having to deal with special-purpose filesystems (like pipefs) on which
to hang custom inodes. It's just a generic "just give me an inode and
I don't care about the filesystem" feature, and if we actually get a
new inode each time, we still do the job.
Pipe seems to be good with creating inodes each time.
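
The shared-versus-separate inode question above can be looked at from user space. What follows is a minimal sketch, assuming Linux and plain POSIX fstat(2); it is not kernel code and not part of the patch. The helper names fds_share_inode and pipe_inode_demo are made up for this illustration. The helper compares st_dev and st_ino to tell whether two descriptors sit on the same inode.

```c
#include <assert.h>
#include <sys/stat.h>
#include <sys/types.h>
#include <unistd.h>

/*
 * Hypothetical helper (not a kernel or libc API): returns 1 if both
 * descriptors refer to the same inode (same device and inode number),
 * 0 if they differ, and -1 if fstat() fails on either one.
 */
int fds_share_inode(int fd_a, int fd_b)
{
    struct stat a, b;

    if (fstat(fd_a, &a) != 0 || fstat(fd_b, &b) != 0)
        return -1;
    return a.st_dev == b.st_dev && a.st_ino == b.st_ino;
}

/*
 * Hypothetical self-check: returns 1 when the two ends of one pipe share
 * an inode and two independent pipes do not, 0 when that does not hold,
 * and -1 if a pipe could not be created.
 */
int pipe_inode_demo(void)
{
    int p[2], q[2];
    int same_pipe, other_pipe;

    if (pipe(p) != 0)
        return -1;
    if (pipe(q) != 0) {
        close(p[0]);
        close(p[1]);
        return -1;
    }

    same_pipe = fds_share_inode(p[0], p[1]);   /* read and write end of p */
    other_pipe = fds_share_inode(p[0], q[0]);  /* read ends of p and q */

    close(p[0]);
    close(p[1]);
    close(q[0]);
    close(q[1]);

    return same_pipe == 1 && other_pipe == 0;
}
```

Calling fds_share_inode() on two descriptors returned by eventfd(2) or userfaultfd(2) is one way to see what a particular kernel does for anonymous inodes. The answer depends on the kernel version and on whether a patch like this one is applied, so no result is claimed here.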