Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp3450059imu; Fri, 30 Nov 2018 00:01:15 -0800 (PST) X-Google-Smtp-Source: AFSGD/U0YfSZlGye4YkZ0vRlUYMC+Yop9zgWQE9C8QM5sxc46LvsE6mrgLXQGFMI2gfYg7fzAWIt X-Received: by 2002:a65:5088:: with SMTP id r8mr3876199pgp.15.1543564875359; Fri, 30 Nov 2018 00:01:15 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1543564875; cv=none; d=google.com; s=arc-20160816; b=DvWAfbLTBumkYlG/K1VnXR4IFIBm0N66/GrHr+TjUASiEmqbzia6lKQS3y6woG5HrS CIWzQjr8gqS9qw1Ptikptn8hIscRQW+QqFCF2wTsKrymh79kgUwW6aGj6yuHRtdjW/fa 1o3eimRErjtUoboUTHdS22YX1lK47NRiwf8IimWTG1ujCXRKSPpMKxfSCRCn3f1H+MpV JGWxM5US8ZwQ2Kh4z1ajfdaLMbVJ7zeBucFIvd2T4DKyfMiDKq+m5k0aFPEr2GuErKN+ rbByVkCMFERN/7pqNM8FDXA2CxiGsJVlmZAGgcQ7CP4WIM/4yrQfTlXdCyEG8I/QxrH/ 2qzQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from; bh=F0uW0UA+WdCHV/yyx+buo2WpKpO+0LKSm85TTQE7DFo=; b=aMfGA7Dg5CJyXXHeEEZQqUnxj2RtG8MsBmDQfe7F0mCef304QyR2vFd1pqRUaGaDYR SPzGer9vR2tLDbjNGF2HrLauXNXs0MunEGuI/ialBxD6Nbt7zgRExrtZsaSOLfKvgTKC 0Z+JUgJOrUT0dvbcAC/jVvqOY+9Da2sfbuQK9AJzT8GpNvKSDBSe4nCjJADiVNKrFIDj HAYPKWmEsW9Hysglfh8hVoakpshMlqbK8li1kTHxi9bG4jPmLKF0Awbhx5VHkwnkRPds Hjdyh8gF9WnDIiKsarR2OsL5pLEw9tuzaKIEIxIwnxzvFfCpkgrWJfyQ8mgIkWui5Gs2 aK5A== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r197si5550129pfr.192.2018.11.30.00.01.00; Fri, 30 Nov 2018 00:01:15 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727044AbeK3TIw (ORCPT + 99 others); Fri, 30 Nov 2018 14:08:52 -0500 Received: from nov-007-i650.relay.mailchannels.net ([46.232.183.204]:19247 "EHLO nov-007-i650.relay.mailchannels.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726648AbeK3TIw (ORCPT ); Fri, 30 Nov 2018 14:08:52 -0500 X-Sender-Id: novatrend|x-authuser|juerg@bitron.ch Received: from relay.mailchannels.net (localhost [127.0.0.1]) by relay.mailchannels.net (Postfix) with ESMTP id 904AB2EC0022; Fri, 30 Nov 2018 08:00:21 +0000 (UTC) Received: from srv17.tophost.ch (localhost [127.0.0.1]) by relay.mailchannels.net (Postfix) with ESMTPA id C9DA72EC003A; Fri, 30 Nov 2018 08:00:17 +0000 (UTC) X-Sender-Id: novatrend|x-authuser|juerg@bitron.ch Received: from srv17.tophost.ch (srv17.tophost.ch [193.33.128.141]) (using TLSv1.2 with cipher DHE-RSA-AES256-GCM-SHA384) by 0.0.0.0:2500 (trex/5.15.2); Fri, 30 Nov 2018 08:00:21 +0000 X-MC-Relay: Neutral X-MailChannels-SenderId: novatrend|x-authuser|juerg@bitron.ch X-MailChannels-Auth-Id: novatrend X-Chief-Eight: 29e765bf66472fcb_1543564821115_2806408014 X-MC-Loop-Signature: 1543564821115:3107605271 X-MC-Ingress-Time: 1543564821115 Received: from [88.98.246.21] (port=60558 helo=jx1y.mynet) by srv17.tophost.ch with esmtpsa (TLSv1.2:ECDHE-RSA-AES128-SHA256:128) (Exim 4.91) (envelope-from ) id 1gSdih-002eST-Ln; Fri, 30 Nov 2018 09:00:15 +0100 From: =?UTF-8?q?J=C3=BCrg=20Billeter?= To: Andrew Morton Cc: Oleg Nesterov , Thomas Gleixner , Eric Biederman , Kees Cook , Andy Lutomirski , linux-api@vger.kernel.org, linux-kernel@vger.kernel.org, =?UTF-8?q?J=C3=BCrg=20Billeter?= Subject: [PATCH v2 1/1] prctl: add PR_{GET,SET}_KILL_DESCENDANTS_ON_EXIT Date: Fri, 30 Nov 2018 08:00:04 +0000 Message-Id: <20181130080004.23635-2-j@bitron.ch> X-Mailer: git-send-email 2.19.2 In-Reply-To: <20181130080004.23635-1-j@bitron.ch> References: <20181127225408.7553-2-j@bitron.ch> <20181130080004.23635-1-j@bitron.ch> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit X-AuthUser: juerg@bitron.ch Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org This introduces a new thread group flag that can be set by calling prctl(PR_SET_KILL_DESCENDANTS_ON_EXIT, 1, 0, 0, 0) When a thread group exits with this flag set, it will send SIGKILL to all descendant processes. This can be used to prevent stray child processes. This flag is cleared on privilege gaining execve(2) to ensure an unprivileged process cannot get a privileged process to send SIGKILL. Descendants that are orphaned and reparented to an ancestor of the current process before the current process exits, will not be killed. PR_SET_CHILD_SUBREAPER can be used to contain orphaned processes. If a descendant gained privileges, the current process may not be allowed to kill it, and the descendant process will survive. PR_SET_NO_NEW_PRIVS can be used to prevent descendant processes from gaining privileges. Suggested-by: Oleg Nesterov Signed-off-by: Jürg Billeter --- fs/exec.c | 6 ++++++ include/linux/sched/signal.h | 3 +++ include/uapi/linux/prctl.h | 4 ++++ kernel/exit.c | 12 ++++++++++++ kernel/sys.c | 11 +++++++++++ security/apparmor/lsm.c | 1 + security/selinux/hooks.c | 3 +++ 7 files changed, 40 insertions(+) diff --git a/fs/exec.c b/fs/exec.c index 044e296f2381..1c9520d83d6b 100644 --- a/fs/exec.c +++ b/fs/exec.c @@ -1343,6 +1343,12 @@ void setup_new_exec(struct linux_binprm * bprm) /* Make sure parent cannot signal privileged process. */ current->pdeath_signal = 0; + /* + * Do not send SIGKILL from privileged process as it may + * have been requested by an unprivileged process. + */ + current->signal->kill_descendants_on_exit = false; + /* * For secureexec, reset the stack limit to sane default to * avoid bad behavior from the prior rlimits. This has to diff --git a/include/linux/sched/signal.h b/include/linux/sched/signal.h index 0c3e396dca04..91ed7f480b60 100644 --- a/include/linux/sched/signal.h +++ b/include/linux/sched/signal.h @@ -124,6 +124,9 @@ struct signal_struct { unsigned int is_child_subreaper:1; unsigned int has_child_subreaper:1; + /* Send SIGKILL to descendant processes on exit */ + bool kill_descendants_on_exit; + #ifdef CONFIG_POSIX_TIMERS /* POSIX.1b Interval Timers */ diff --git a/include/uapi/linux/prctl.h b/include/uapi/linux/prctl.h index b17201edfa09..a31141236064 100644 --- a/include/uapi/linux/prctl.h +++ b/include/uapi/linux/prctl.h @@ -198,6 +198,10 @@ struct prctl_mm_map { # define PR_CAP_AMBIENT_LOWER 3 # define PR_CAP_AMBIENT_CLEAR_ALL 4 +/* Send SIGKILL to descendant processes on exit */ +#define PR_SET_KILL_DESCENDANTS_ON_EXIT 48 +#define PR_GET_KILL_DESCENDANTS_ON_EXIT 49 + /* arm64 Scalable Vector Extension controls */ /* Flag values must be kept in sync with ptrace NT_ARM_SVE interface */ #define PR_SVE_SET_VL 50 /* set task vector length */ diff --git a/kernel/exit.c b/kernel/exit.c index 0e21e6d21f35..7fe0c694685a 100644 --- a/kernel/exit.c +++ b/kernel/exit.c @@ -694,6 +694,15 @@ static void forget_original_parent(struct task_struct *father, list_splice_tail_init(&father->children, &reaper->children); } +static int kill_descendant_visitor(struct task_struct *p, void *data) +{ + /* This may fail, e.g., when a descendant process gained privileges. */ + group_send_sig_info(SIGKILL, SEND_SIG_NOINFO, p, PIDTYPE_TGID); + + /* Always continue walking the process tree. */ + return 1; +} + /* * Send signals to all our closest relatives so that they know * to properly mourn us.. @@ -704,6 +713,9 @@ static void exit_notify(struct task_struct *tsk, int group_dead) struct task_struct *p, *n; LIST_HEAD(dead); + if (group_dead && tsk->signal->kill_descendants_on_exit) + walk_process_tree(tsk, kill_descendant_visitor, NULL); + write_lock_irq(&tasklist_lock); forget_original_parent(tsk, &dead); diff --git a/kernel/sys.c b/kernel/sys.c index 123bd73046ec..8d9af81da093 100644 --- a/kernel/sys.c +++ b/kernel/sys.c @@ -2476,6 +2476,17 @@ SYSCALL_DEFINE5(prctl, int, option, unsigned long, arg2, unsigned long, arg3, return -EINVAL; error = arch_prctl_spec_ctrl_set(me, arg2, arg3); break; + case PR_SET_KILL_DESCENDANTS_ON_EXIT: + if (arg3 || arg4 || arg5) + return -EINVAL; + me->signal->kill_descendants_on_exit = !!arg2; + break; + case PR_GET_KILL_DESCENDANTS_ON_EXIT: + if (arg3 || arg4 || arg5) + return -EINVAL; + error = put_user(me->signal->kill_descendants_on_exit, + (int __user *)arg2); + break; default: error = -EINVAL; break; diff --git a/security/apparmor/lsm.c b/security/apparmor/lsm.c index 7027c7aee00d..5de086d7546c 100644 --- a/security/apparmor/lsm.c +++ b/security/apparmor/lsm.c @@ -798,6 +798,7 @@ static void apparmor_bprm_committing_creds(struct linux_binprm *bprm) aa_inherit_files(bprm->cred, current->files); current->pdeath_signal = 0; + current->signal->kill_descendants_on_exit = false; /* reset soft limits and set hard limits for the new label */ __aa_transition_rlimits(label, new_label); diff --git a/security/selinux/hooks.c b/security/selinux/hooks.c index 67f1344e728a..4c041212702c 100644 --- a/security/selinux/hooks.c +++ b/security/selinux/hooks.c @@ -2657,6 +2657,9 @@ static void selinux_bprm_committing_creds(struct linux_binprm *bprm) /* Always clear parent death signal on SID transitions. */ current->pdeath_signal = 0; + /* Disable SIGKILL requested from before the SID transition. */ + current->signal->kill_descendants_on_exit = false; + /* Check whether the new SID can inherit resource limits from the old * SID. If not, reset all soft limits to the lower of the current * task's hard limit and the init task's soft limit. -- 2.19.2