Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753798Ab2BPV3Q (ORCPT ); Thu, 16 Feb 2012 16:29:16 -0500 Received: from mail-pw0-f46.google.com ([209.85.160.46]:62153 "EHLO mail-pw0-f46.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753369Ab2BPV3N convert rfc822-to-8bit (ORCPT ); Thu, 16 Feb 2012 16:29:13 -0500 MIME-Version: 1.0 In-Reply-To: <4F3D7250.6040504@zytor.com> References: <1329422549-16407-1-git-send-email-wad@chromium.org> <1329422549-16407-3-git-send-email-wad@chromium.org> <4F3D61CB.2000301@zytor.com> <4F3D7250.6040504@zytor.com> From: Markus Gutschke Date: Thu, 16 Feb 2012 13:28:49 -0800 X-Google-Sender-Auth: 5VYwc35XjnR2ouPealOKy0wwjcs Message-ID: Subject: Re: [PATCH v8 3/8] seccomp: add system call filtering using BPF To: "H. Peter Anvin" Cc: Will Drewry , linux-kernel@vger.kernel.org, linux-arch@vger.kernel.org, linux-doc@vger.kernel.org, kernel-hardening@lists.openwall.com, netdev@vger.kernel.org, x86@kernel.org, arnd@arndb.de, davem@davemloft.net, mingo@redhat.com, oleg@redhat.com, peterz@infradead.org, rdunlap@xenotime.net, mcgrathr@chromium.org, tglx@linutronix.de, luto@mit.edu, eparis@redhat.com, serge.hallyn@canonical.com, djm@mindrot.org, scarybeasts@gmail.com, indan@nul.nu, pmoore@redhat.com, akpm@linux-foundation.org, corbet@lwn.net, eric.dumazet@gmail.com, keescook@chromium.org Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8BIT X-System-Of-Record: true Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1342 Lines: 27 On Thu, Feb 16, 2012 at 13:17, H. Peter Anvin wrote: > The other thing that you really need in addition to system call number is > ABI identifier, since a syscall number may mean different things for > different entry points.  For example, on x86-64 system call number 4 is > write() if called via int $0x80 but stat() if called via syscall64. This is > a local property of the system call, not a global per process. I think, the documentation said that as soon as prctl() is used to set a bpf filter for system calls, it automatically disallows system calls using an entry point other than the one used by this particular prctl(). I was trying to come up with scenarios where this particular approach causes problem, but I can't think of any off the top of my head. So, it might actually turn out to be a very elegant way to reduce the attack surface of the kernel. If we are really worried about userspace compatibility, we could make the kernel send a signal instead of terminating the program, if the wrong entry point was used; not sure if that is needed, though. Markus -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/