Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756573AbaKER2j (ORCPT ); Wed, 5 Nov 2014 12:28:39 -0500 Received: from mail-la0-f41.google.com ([209.85.215.41]:59434 "EHLO mail-la0-f41.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1756562AbaKER2e (ORCPT ); Wed, 5 Nov 2014 12:28:34 -0500 MIME-Version: 1.0 In-Reply-To: References: <1415015305-15494-1-git-send-email-drysdale@google.com> <1415015305-15494-2-git-send-email-drysdale@google.com> <20141103152036.GA7996@ZenIV.linux.org.uk> From: Andy Lutomirski Date: Wed, 5 Nov 2014 09:28:10 -0800 Message-ID: Subject: Re: [PATCH 1/3] fs: add O_BENEATH flag to openat(2) To: David Drysdale Cc: "Eric W.Biederman" , Al Viro , "linux-kernel@vger.kernel.org" , Kees Cook , Greg Kroah-Hartman , Meredydd Luff , Will Drewry , Jorge Lucangeli Obes , Ricky Zhou , Lee Campbell , Julien Tinnes , Mike Depinet , James Morris , Paolo Bonzini , Paul Moore , Christoph Hellwig , Linux API , LSM List Content-Type: text/plain; charset=UTF-8 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Nov 5, 2014 at 9:21 AM, David Drysdale wrote: > On Tue, Nov 4, 2014 at 9:40 AM, David Drysdale wrote: >> On Mon, Nov 3, 2014 at 5:22 PM, Eric W.Biederman wrote: >>> On November 3, 2014 7:42:58 AM PST, Andy Lutomirski wrote: >>>>On Mon, Nov 3, 2014 at 7:20 AM, Al Viro >>>>wrote: >>>>> On Mon, Nov 03, 2014 at 11:48:23AM +0000, David Drysdale wrote: >>>>>> Add a new O_BENEATH flag for openat(2) which restricts the >>>>>> provided path, rejecting (with -EACCES) paths that are not beneath >>>>>> the provided dfd. In particular, reject: >>>>>> - paths that contain .. components >>>>>> - paths that begin with / >>>>>> - symlinks that have paths as above. >>>>> >>>>> Yecch... The degree of usefulness aside (and I'm not convinced that >>>>it >>>>> is non-zero), >>>> >>>>This is extremely useful in conjunction with seccomp. >>>> >>>>> WTF pass one bit out of nameidata->flags in a separate argument? >>>>> Through the mutual recursion, no less... And then you are not even >>>>attempting >>>>> to detect symlinks that are not followed by interpretation of _any_ >>>>pathname. >>>> >>>>How many symlinks like that are there? Is there anything except >>>>nd_jump_link users? All of those are in /proc. Arguably O_BENEATH >>>>should prevent traversal of all of those links. >>> >>> Not commenting on the sanity of this one way or another, and I haven't read the patch. There is an absolutely trivial implementation of this. >>> >>> After the path is resolved, walk backwards along d_parent and the mount tree, and see if you come to the file or directory dfd refers to. >>> >>> That can handle magic proc symlinks, and does not need to disallow .. or / explicitly so it should be much simpler code. >>> >>> My gut says that if Al says blech when looking at your code it is too complex to give you a security guarantee. >>> >>> Eric >> >> Well, the 'yecch' was deserved for the unnecessary duplication of the >> flags. Without that, the patch looks much simpler -- I'll send out a v2 >> with those changes for discussion, and think about your alternative >> implementation suggestion (thanks!) separately. > > One concern with the "walk upwards and see if you get back where you > started" approach -- it will allow use of a symlink that lives outside the > original directory, but which points back inside it. That's going to be > slightly surprising behaviour for users, and I worry that there's the > potential for unexpected information leakage from it. > > (BTW, size-wise my initial naive implementation of the walk-upward > approach is only marginally smaller than the v2 patch.) It has another problem. Since we still haven't fixed the eternal /proc/PID/fd-doesn't-respect-file-mode issue, you can have a read-only fd somewhere and reopen it read-write using O_BENEATH on a different fd. For example: fd 3 points to /sandbox /sandbox/blocked has mode 0700 and isn't owned by us /sandbox/blocked/foo has mode 0666 fd 4 points to /sandbox/blocked/foo, O_RDONLY openat(3, "/proc/self/fd/4", O_RDWR | O_BENEATH) will get a read-write descriptor pointing at /sandbox/blocked/foo, which should have been impossible. Also, I really don't like the information leak. The result of asking a server for "/home/victim/compromising-directory/../../www/index.html" should not reveal whether compromising-directory exists. --Andy -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/