Received: by 2002:a05:6a10:9848:0:0:0:0 with SMTP id x8csp4465799pxf; Tue, 16 Mar 2021 14:22:35 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxyTmPOtWqWfdrtzdkufiOYz94Udq1YURSNzpMpTEvjAbphAs9R/yp7MCoLiptuxCgp3+Wn X-Received: by 2002:a05:6402:3096:: with SMTP id de22mr38384473edb.141.1615929755329; Tue, 16 Mar 2021 14:22:35 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1615929755; cv=none; d=google.com; s=arc-20160816; b=OJ7lVA+BWALKyEaoQ+xe62LR1eaSgyoEbL0LxEfgoevUHrB8WfIgWWF6Ie5LV6piq4 wmeZNbh6++sL4KxdqANOl9VY9HW0CHY/mImiVYaMf/tPdNmCHO+nT4E/t3vgr0Zio37X UAV4IbFprtaHO47uKoRGTbg20GNeZ3QZ3SokH5qpbocofb+8T5ZClfYXGbcIc+Wv0Dyl KayVCzImDTwK9hDhLtrpRfxgkdl8ELdWKXgeUusa0ndqBXr4xBhjuo1+N7JFK09tZnb4 PxfXeQUfYNS04EBO6NzyART5yH5oCbdjNHDUJE+Vj+0hAshzBZkONwj62fzev5L2hmTP xoDQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject; bh=I1u7wrHWz33hW6RRZtMzV6+uupp82dD7a6rFtf3+8HA=; b=kdF6cBq5hRIM6yflxV1Qz7fNHcXPQh5u3gRHwASnAdwqUlhIeNJeKpRMKvQSqG81dj LJDG4gQktuB9y/MKv3WTuI/JH3x47yDkpBnA4E7OP9ETXiI5vd2xDq4yOUvNMRY7Oac9 cf9QLMS9+VuZH6453btPOREoFuS/dq3TxlPDJXFfcn6nncAegAz/Ma59st8xacY04jrD 2uVJR9rDOvC+UKyIPhyA3qp8MwuAMDKnLrhu1QYdPsR9DpHteJCbGeUHzjlnKu8zslwL 0aEIIGVW9QBC3HeCdQPDIjsR3u08ZRzHMjUuycobRBzt54c0zWk2s6zC5EB11MLx52HR dxMQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id q11si14323831edn.333.2021.03.16.14.22.13; Tue, 16 Mar 2021 14:22:35 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S240412AbhCPT0B (ORCPT + 99 others); Tue, 16 Mar 2021 15:26:01 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:59770 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S240421AbhCPTZq (ORCPT ); Tue, 16 Mar 2021 15:25:46 -0400 Received: from smtp-190f.mail.infomaniak.ch (smtp-190f.mail.infomaniak.ch [IPv6:2001:1600:3:17::190f]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 5AC4EC06174A for ; Tue, 16 Mar 2021 12:25:45 -0700 (PDT) Received: from smtp-3-0001.mail.infomaniak.ch (unknown [10.4.36.108]) by smtp-2-3000.mail.infomaniak.ch (Postfix) with ESMTPS id 4F0NYL3skZzMppFJ; Tue, 16 Mar 2021 20:25:42 +0100 (CET) Received: from ns3096276.ip-94-23-54.eu (unknown [23.97.221.149]) by smtp-3-0001.mail.infomaniak.ch (Postfix) with ESMTPA id 4F0NYD4j9Szlh8TJ; Tue, 16 Mar 2021 20:25:36 +0100 (CET) Subject: Re: [PATCH v4 1/1] fs: Allow no_new_privs tasks to call chroot(2) To: Kees Cook , Jann Horn Cc: Al Viro , James Morris , Serge Hallyn , Andy Lutomirski , Casey Schaufler , Christian Brauner , Christoph Hellwig , David Howells , Dominik Brodowski , "Eric W . Biederman" , John Johansen , Kentaro Takeda , Tetsuo Handa , Kernel Hardening , linux-fsdevel , kernel list , linux-security-module , =?UTF-8?Q?Micka=c3=abl_Sala=c3=bcn?= References: <20210316170135.226381-1-mic@digikod.net> <20210316170135.226381-2-mic@digikod.net> <202103161221.8291CC3E6@keescook> From: =?UTF-8?Q?Micka=c3=abl_Sala=c3=bcn?= Message-ID: Date: Tue, 16 Mar 2021 20:25:38 +0100 User-Agent: MIME-Version: 1.0 In-Reply-To: <202103161221.8291CC3E6@keescook> Content-Type: text/plain; charset=iso-8859-15 Content-Language: en-US Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 16/03/2021 20:24, Kees Cook wrote: > On Tue, Mar 16, 2021 at 08:04:09PM +0100, Jann Horn wrote: >> On Tue, Mar 16, 2021 at 6:02 PM Micka?l Sala?n wrote: >>> One could argue that chroot(2) is useless without a properly populated >>> root hierarchy (i.e. without /dev and /proc). However, there are >>> multiple use cases that don't require the chrooting process to create >>> file hierarchies with special files nor mount points, e.g.: >>> * A process sandboxing itself, once all its libraries are loaded, may >>> not need files other than regular files, or even no file at all. >>> * Some pre-populated root hierarchies could be used to chroot into, >>> provided for instance by development environments or tailored >>> distributions. >>> * Processes executed in a chroot may not require access to these special >>> files (e.g. with minimal runtimes, or by emulating some special files >>> with a LD_PRELOADed library or seccomp). >>> >>> Unprivileged chroot is especially interesting for userspace developers >>> wishing to harden their applications. For instance, chroot(2) and Yama >>> enable to build a capability-based security (i.e. remove filesystem >>> ambient accesses) by calling chroot/chdir with an empty directory and >>> accessing data through dedicated file descriptors obtained with >>> openat2(2) and RESOLVE_BENEATH/RESOLVE_IN_ROOT/RESOLVE_NO_MAGICLINKS. >> >> I don't entirely understand. Are you writing this with the assumption >> that a future change will make it possible to set these RESOLVE flags >> process-wide, or something like that? > > I thought it meant "open all out-of-chroot dirs as fds using RESOLVE_... > flags then chroot". As in, there's no way to then escape "up" for the > old opens, and the new opens stay in the chroot. Yes, that was the idea. > >> [...] >>> diff --git a/fs/open.c b/fs/open.c >> [...] >>> +static inline int current_chroot_allowed(void) >>> +{ >>> + /* >>> + * Changing the root directory for the calling task (and its future >>> + * children) requires that this task has CAP_SYS_CHROOT in its >>> + * namespace, or be running with no_new_privs and not sharing its >>> + * fs_struct and not escaping its current root (cf. create_user_ns()). >>> + * As for seccomp, checking no_new_privs avoids scenarios where >>> + * unprivileged tasks can affect the behavior of privileged children. >>> + */ >>> + if (task_no_new_privs(current) && current->fs->users == 1 && >> >> this read of current->fs->users should be using READ_ONCE() > > Ah yeah, good call. I should remember this when I think "can this race?" > :P >