Received: by 2002:a25:6193:0:0:0:0:0 with SMTP id v141csp1495054ybb; Thu, 2 Apr 2020 01:46:09 -0700 (PDT) X-Google-Smtp-Source: APiQypIxDz5PEIgdhdbjR4n25LYw+AEtw69204Vnzj5IZ+4zKz5eVgdUlnbtxfFgBUlAm8yq0fui X-Received: by 2002:aca:4d47:: with SMTP id a68mr1531807oib.56.1585817168870; Thu, 02 Apr 2020 01:46:08 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1585817168; cv=none; d=google.com; s=arc-20160816; b=NrRe62tQd/x1vMh2x/3Icv4EOO8UeJjwl5nLb9hFO4w1nv07lwgwfDULNJP8XeURs/ KtAheQRrjppgco0i7wy8CL2/vusfXcS6mW8aVeYQ2lAKzDAiKkMPB4kvqdWj9LfsaI7p 1RiKrlThrDbITS2ke/q+oFcWGm8P3c14w3x0QPnf/iw/s3jYCd2Jl09h8//VbluATS22 GNvErA0clbJl2R8m2Uy2/9ao7bKngQY7SfmX6nH+lCeMU9SoaiyJrvHXtCAZSdi7raBU 6iiWR/dAqAygIYN67w5qngqLWGSGKTZmrwwu/CuNWJiw7QG/K69Ct33MuVyxcOe07GOe TGoQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:organization:references:cc:to:from:subject:ironport-sdr :ironport-sdr; bh=CtKiSdrQHttpkqg5C8W9nr6fjs4F/Adx5/RTd0F3c9Q=; b=D/4MgJT1csGa5PwnZ4DYzIO083H+W2FVURMZmjPqJ9g60tGaiyfdZeu1fj+tTcgkSf ZZ09183zQ1uwQ7cpCDv63JrHquG1avc7nknuUiArYbvsTYCATeid43ZHfsiIM0P5malK x/JktFH8tgahAQeiWjBxRS0AFa8LPsyyt/lIIwTEXxGIHq3jukV+4x6LkPE/BOPUGReH qNwOXfumLXHwdOic5sACt4YaQHfs5kV/KHq3cHKzQeKL5j7oDCIlI7Y+lPunSBLWtTsX OhxG3UH9kFmDsxSE+AhVlXM6DjeEc3wWyS1jEnA/Mqbr0jjXcEnqcnOdzZ05yDeRig95 gnMg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id o63si2156300oih.74.2020.04.02.01.45.56; Thu, 02 Apr 2020 01:46:08 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2387747AbgDBIpi (ORCPT + 99 others); Thu, 2 Apr 2020 04:45:38 -0400 Received: from mga17.intel.com ([192.55.52.151]:5049 "EHLO mga17.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728803AbgDBIpi (ORCPT ); Thu, 2 Apr 2020 04:45:38 -0400 IronPort-SDR: 3BOyG8uhC4lGhfMipk8jnHEnXxd6lhgQGbFTcdy+YKtI5UJ5D4/xq9NoARiKvleWksHfRc5rLn WnC4vd6IL0MA== X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from fmsmga008.fm.intel.com ([10.253.24.58]) by fmsmga107.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 02 Apr 2020 01:45:37 -0700 IronPort-SDR: MymKeAEXMtI4cGrvDtyJS3oy7l5fgtIwtPVceurrcTJQfIe14IJglDtbi1hu9SdYJ8Ydhw1Oq7 mxOEqdNgJ4Fg== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.72,335,1580803200"; d="scan'208";a="242464844" Received: from linux.intel.com ([10.54.29.200]) by fmsmga008.fm.intel.com with ESMTP; 02 Apr 2020 01:45:37 -0700 Received: from [10.249.226.252] (abudanko-mobl.ccr.corp.intel.com [10.249.226.252]) by linux.intel.com (Postfix) with ESMTP id 4C361580781; Thu, 2 Apr 2020 01:45:32 -0700 (PDT) Subject: [PATCH v8 01/12] capabilities: introduce CAP_PERFMON to kernel and user space From: Alexey Budankov To: Peter Zijlstra , Arnaldo Carvalho de Melo , Alexei Starovoitov , Ingo Molnar , James Morris , Namhyung Kim Cc: Serge Hallyn , Jiri Olsa , Song Liu , Andi Kleen , Stephane Eranian , Igor Lubashev , Thomas Gleixner , linux-kernel , "linux-security-module@vger.kernel.org" , "selinux@vger.kernel.org" , "intel-gfx@lists.freedesktop.org" , "linux-doc@vger.kernel.org" , linux-man@vger.kernel.org References: Organization: Intel Corp. Message-ID: <5590d543-82c6-490a-6544-08e6a5517db0@linux.intel.com> Date: Thu, 2 Apr 2020 11:45:31 +0300 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101 Thunderbird/68.6.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Introduce CAP_PERFMON capability designed to secure system performance monitoring and observability operations so that CAP_PERFMON would assist CAP_SYS_ADMIN capability in its governing role for performance monitoring and observability subsystems. CAP_PERFMON hardens system security and integrity during performance monitoring and observability operations by decreasing attack surface that is available to a CAP_SYS_ADMIN privileged process [2]. Providing the access to system performance monitoring and observability operations under CAP_PERFMON capability singly, without the rest of CAP_SYS_ADMIN credentials, excludes chances to misuse the credentials and makes the operation more secure. Thus, CAP_PERFMON implements the principal of least privilege for performance monitoring and observability operations (POSIX IEEE 1003.1e: 2.2.2.39 principle of least privilege: A security design principle that states that a process or program be granted only those privileges (e.g., capabilities) necessary to accomplish its legitimate function, and only for the time that such privileges are actually required) CAP_PERFMON meets the demand to secure system performance monitoring and observability operations for adoption in security sensitive, restricted, multiuser production environments (e.g. HPC clusters, cloud and virtual compute environments), where root or CAP_SYS_ADMIN credentials are not available to mass users of a system, and securely unblocks applicability and scalability of system performance monitoring and observability operations beyond root and CAP_SYS_ADMIN use cases. CAP_PERFMON takes over CAP_SYS_ADMIN credentials related to system performance monitoring and observability operations and balances amount of CAP_SYS_ADMIN credentials following the recommendations in the capabilities man page [1] for CAP_SYS_ADMIN: "Note: this capability is overloaded; see Notes to kernel developers, below." For backward compatibility reasons access to system performance monitoring and observability subsystems of the kernel remains open for CAP_SYS_ADMIN privileged processes but CAP_SYS_ADMIN capability usage for secure system performance monitoring and observability operations is discouraged with respect to the designed CAP_PERFMON capability. Although the software running under CAP_PERFMON can not ensure avoidance of related hardware issues, the software can still mitigate these issues following the official hardware issues mitigation procedure [2]. The bugs in the software itself can be fixed following the standard kernel development process [3] to maintain and harden security of system performance monitoring and observability operations. [1] http://man7.org/linux/man-pages/man7/capabilities.7.html [2] https://www.kernel.org/doc/html/latest/process/embargoed-hardware-issues.html [3] https://www.kernel.org/doc/html/latest/admin-guide/security-bugs.html Signed-off-by: Alexey Budankov Acked-by: Song Liu Acked-by: Stephen Smalley Acked-by: James Morris Acked-by: Serge E. Hallyn --- include/linux/capability.h | 4 ++++ include/uapi/linux/capability.h | 8 +++++++- security/selinux/include/classmap.h | 4 ++-- 3 files changed, 13 insertions(+), 3 deletions(-) diff --git a/include/linux/capability.h b/include/linux/capability.h index ecce0f43c73a..027d7e4a853b 100644 --- a/include/linux/capability.h +++ b/include/linux/capability.h @@ -251,6 +251,10 @@ extern bool privileged_wrt_inode_uidgid(struct user_namespace *ns, const struct extern bool capable_wrt_inode_uidgid(const struct inode *inode, int cap); extern bool file_ns_capable(const struct file *file, struct user_namespace *ns, int cap); extern bool ptracer_capable(struct task_struct *tsk, struct user_namespace *ns); +static inline bool perfmon_capable(void) +{ + return capable(CAP_PERFMON) || capable(CAP_SYS_ADMIN); +} /* audit system wants to get cap info from files as well */ extern int get_vfs_caps_from_disk(const struct dentry *dentry, struct cpu_vfs_cap_data *cpu_caps); diff --git a/include/uapi/linux/capability.h b/include/uapi/linux/capability.h index 272dc69fa080..e58c9636741b 100644 --- a/include/uapi/linux/capability.h +++ b/include/uapi/linux/capability.h @@ -367,8 +367,14 @@ struct vfs_ns_cap_data { #define CAP_AUDIT_READ 37 +/* + * Allow system performance and observability privileged operations + * using perf_events, i915_perf and other kernel subsystems + */ + +#define CAP_PERFMON 38 -#define CAP_LAST_CAP CAP_AUDIT_READ +#define CAP_LAST_CAP CAP_PERFMON #define cap_valid(x) ((x) >= 0 && (x) <= CAP_LAST_CAP) diff --git a/security/selinux/include/classmap.h b/security/selinux/include/classmap.h index 986f3ac14282..d233ab3f1533 100644 --- a/security/selinux/include/classmap.h +++ b/security/selinux/include/classmap.h @@ -27,9 +27,9 @@ "audit_control", "setfcap" #define COMMON_CAP2_PERMS "mac_override", "mac_admin", "syslog", \ - "wake_alarm", "block_suspend", "audit_read" + "wake_alarm", "block_suspend", "audit_read", "perfmon" -#if CAP_LAST_CAP > CAP_AUDIT_READ +#if CAP_LAST_CAP > CAP_PERFMON #error New capability defined, please update COMMON_CAP2_PERMS. #endif -- 2.24.1