Received: by 2002:ac0:a594:0:0:0:0:0 with SMTP id m20-v6csp557685imm; Fri, 11 May 2018 02:47:36 -0700 (PDT) X-Google-Smtp-Source: AB8JxZpuhFxjfSVFaSy++eh/BoUs9lRV7+anKo5LSDlEu41WNlIRtAVMg4kwouNi94cZ3wHLpp05 X-Received: by 2002:a63:2ace:: with SMTP id q197-v6mr3946931pgq.60.1526032056343; Fri, 11 May 2018 02:47:36 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1526032056; cv=none; d=google.com; s=arc-20160816; b=MEaMp14Zv/1EAUgnr2Rrbfxclrf44/Fd9pKKJxwWVD6Ngub6gBvi6jnGryEqdxLHnC sFX7olx43T+R0smPcBxlfacQQK8IRfsUklxybF1+b4APVVrpMtNbFD7UGpL3A083Pm+y Pjz84QaW4r5f7dtUvnPwBy4Zp5R8wcE16QsHstVPqH0jDrTicYQrNmT+KXN8MZ6qMZiB A1ddSMgpmNciGtv8/ToGX15QRa0FlRzqcuDQaFVUGFkmJvUPDMtOU1RSnvJf3R8DWkQU ejmJpRZ+7r0Dle2w2NCoJ+ve7IU1/SSbyLqQ5Ei9KnHYog0o4jpwQs4JmKDAdRYcJ6Pt GvsA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-disposition:mime-version :message-id:subject:cc:to:from:date:arc-authentication-results; bh=uDFBzXNfutadFONYghTX3AG4c3h6HKmgtG+K5QFwAEA=; b=GMvTe9qaaYLOHNX9hXCaqyIo9qbKwZRP4aBfQMfUlYcHTnp2LmyK5PK8p+Zm7GTFEA eP3Zp6MfNJPD1eurIWjQF8+HvjpEx2OLNXgGDnpporPVU7Qi0xxahgOZ0bpsWtp2Rs3c Pe5osl6SERPJSA6yJsUkyxf4c01KsVSw1s5Hp8sCK/FjgjKXIew8A90UvOnjPnWkWyK8 hjBY/I5lm0d1IY1Es0V+yw8f/vbaoPqdvXGzOvdgj7cad1CZfepzGTLHS5d0BK2e0aQF 1g0m3Vtd5wbSkg0G8TwyQwo5Et1yIXs07tlhl/7oYxqVNWi36o1Dgivy5TvuqRjNX94T 6eDg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id h8-v6si2759620pfi.115.2018.05.11.02.47.21; Fri, 11 May 2018 02:47:36 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752899AbeEKJqq (ORCPT + 99 others); Fri, 11 May 2018 05:46:46 -0400 Received: from monster.unsafe.ru ([5.9.28.80]:55718 "EHLO mail.unsafe.ru" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752362AbeEKJqo (ORCPT ); Fri, 11 May 2018 05:46:44 -0400 Received: from comp-core-i7-2640m-0182e6 (nat-pool-brq-t.redhat.com [213.175.37.10]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.unsafe.ru (Postfix) with ESMTPSA id 71952C61A9E; Fri, 11 May 2018 09:46:42 +0000 (UTC) Date: Fri, 11 May 2018 11:36:45 +0200 From: Alexey Gladkov To: Kees Cook , Andy Lutomirski , Andrew Morton , linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, kernel-hardening@lists.openwall.com, linux-security-module@vger.kernel.org, linux-api@vger.kernel.org Cc: Greg Kroah-Hartman , Alexander Viro , Akinobu Mita , Oleg Nesterov , Jeff Layton , Ingo Molnar , Alexey Dobriyan , "Eric W. Biederman" , Linus Torvalds , aniel Micay , Jonathan Corbet , bfields@fieldses.org, Stephen Rothwell , solar@openwall.com, "Dmitry V. Levin" , Djalal Harouni Subject: [PATCH v5 6/7] proc: flush task dcache entries from all procfs instances Message-ID: <20180511093645.GA1365@comp-core-i7-2640m-0182e6> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Djalal Harouni This allows to flush dcache entries of a task on multiple procfs mounts per pid namespace. Cc: Kees Cook Cc: Andy Lutomirski Signed-off-by: Djalal Harouni --- fs/proc/base.c | 27 ++++++++++++++++++++++----- fs/proc/inode.c | 9 ++++++++- fs/proc/root.c | 10 ++++++++++ include/linux/pid_namespace.h | 42 ++++++++++++++++++++++++++++++++++++++++++ include/linux/proc_fs.h | 2 ++ 5 files changed, 84 insertions(+), 6 deletions(-) diff --git a/fs/proc/base.c b/fs/proc/base.c index 31baeef..987d702 100644 --- a/fs/proc/base.c +++ b/fs/proc/base.c @@ -3068,7 +3068,8 @@ static const struct inode_operations proc_tgid_base_inode_operations = { .permission = proc_pid_permission, }; -static void proc_flush_task_mnt(struct vfsmount *mnt, pid_t pid, pid_t tgid) +static void proc_flush_task_mnt_root(struct dentry *mnt_root, + pid_t pid, pid_t tgid) { struct dentry *dentry, *leader, *dir; char buf[10 + 1]; @@ -3077,7 +3078,7 @@ static void proc_flush_task_mnt(struct vfsmount *mnt, pid_t pid, pid_t tgid) name.name = buf; name.len = snprintf(buf, sizeof(buf), "%u", pid); /* no ->d_hash() rejects on procfs */ - dentry = d_hash_and_lookup(mnt->mnt_root, &name); + dentry = d_hash_and_lookup(mnt_root, &name); if (dentry) { d_invalidate(dentry); dput(dentry); @@ -3088,7 +3089,7 @@ static void proc_flush_task_mnt(struct vfsmount *mnt, pid_t pid, pid_t tgid) name.name = buf; name.len = snprintf(buf, sizeof(buf), "%u", tgid); - leader = d_hash_and_lookup(mnt->mnt_root, &name); + leader = d_hash_and_lookup(mnt_root, &name); if (!leader) goto out; @@ -3143,14 +3144,30 @@ void proc_flush_task(struct task_struct *task) int i; struct pid *pid, *tgid; struct upid *upid; + struct proc_fs_info *fs_info_entry; + struct pid_namespace *pid_ns; + struct dentry *mnt_root; pid = task_pid(task); tgid = task_tgid(task); for (i = 0; i <= pid->level; i++) { upid = &pid->numbers[i]; - proc_flush_task_mnt(upid->ns->proc_mnt, upid->nr, - tgid->numbers[i].nr); + pid_ns = upid->ns; + + pidns_proc_lock_shared(pid_ns); + list_for_each_entry(fs_info_entry, &pid_ns->procfs_mounts, + pidns_entry) { + if (proc_fs_newinstance(fs_info_entry)) { + mnt_root = fs_info_entry->sb->s_root; + proc_flush_task_mnt_root(mnt_root, upid->nr, + tgid->numbers[i].nr); + } + } + pidns_proc_unlock_shared(pid_ns); + + mnt_root = pid_ns->proc_mnt->mnt_root; + proc_flush_task_mnt_root(mnt_root, upid->nr, tgid->numbers[i].nr); } } diff --git a/fs/proc/inode.c b/fs/proc/inode.c index 985df4b..5e62598 100644 --- a/fs/proc/inode.c +++ b/fs/proc/inode.c @@ -498,10 +498,17 @@ struct inode *proc_get_inode(struct super_block *sb, struct proc_dir_entry *de) int proc_fill_super(struct super_block *s, void *data, int silent) { struct proc_fs_info *fs_info = proc_sb(s); + struct pid_namespace *ns = get_pid_ns(fs_info->pid_ns); struct inode *root_inode; int ret; - get_pid_ns(fs_info->pid_ns); + fs_info->sb = s; + + if (proc_fs_newinstance(fs_info)) { + pidns_proc_lock(ns); + list_add_tail(&fs_info->pidns_entry, &ns->procfs_mounts); + pidns_proc_unlock(ns); + } if (!proc_parse_options(data, fs_info)) return -EINVAL; diff --git a/fs/proc/root.c b/fs/proc/root.c index c5ce241..c72d22c 100644 --- a/fs/proc/root.c +++ b/fs/proc/root.c @@ -249,6 +249,13 @@ static void proc_kill_sb(struct super_block *sb) dput(fs_info->proc_self); if (fs_info->proc_thread_self) dput(fs_info->proc_thread_self); + + if (proc_fs_newinstance(fs_info)) { + pidns_proc_lock(ns); + list_del(&fs_info->pidns_entry); + pidns_proc_unlock(ns); + } + kill_anon_super(sb); put_pid_ns(ns); kfree(fs_info); @@ -357,6 +364,9 @@ int pid_ns_prepare_proc(struct pid_namespace *ns) return PTR_ERR(mnt); ns->proc_mnt = mnt; + init_rwsem(&ns->rw_procfs_mnts); + INIT_LIST_HEAD(&ns->procfs_mounts); + return 0; } diff --git a/include/linux/pid_namespace.h b/include/linux/pid_namespace.h index f91a8bf..767fc96 100644 --- a/include/linux/pid_namespace.h +++ b/include/linux/pid_namespace.h @@ -32,6 +32,8 @@ struct pid_namespace { struct pid_namespace *parent; #ifdef CONFIG_PROC_FS struct vfsmount *proc_mnt; /* Internal proc mounted during each new pidns */ + struct rw_semaphore rw_procfs_mnts; + struct list_head procfs_mounts; /* list of separated procfs mounts */ #endif #ifdef CONFIG_BSD_PROCESS_ACCT struct fs_pin *bacct; @@ -98,4 +100,44 @@ extern struct pid_namespace *task_active_pid_ns(struct task_struct *tsk); void pidhash_init(void); void pid_idr_init(void); +#ifdef CONFIG_PROC_FS +static inline void pidns_proc_lock(struct pid_namespace *pid_ns) +{ + down_write(&pid_ns->rw_procfs_mnts); +} + +static inline void pidns_proc_unlock(struct pid_namespace *pid_ns) +{ + up_write(&pid_ns->rw_procfs_mnts); +} + +static inline void pidns_proc_lock_shared(struct pid_namespace *pid_ns) +{ + down_read(&pid_ns->rw_procfs_mnts); +} + +static inline void pidns_proc_unlock_shared(struct pid_namespace *pid_ns) +{ + up_read(&pid_ns->rw_procfs_mnts); +} +#else /* !CONFIG_PROC_FS */ + +static inline void pidns_proc_lock(struct pid_namespace *pid_ns) +{ +} + +static inline void pidns_proc_unlock(struct pid_namespace *pid_ns) +{ +} + +static inline void pidns_proc_lock_shared(struct pid_namespace *pid_ns) +{ +} + +static inline void pidns_proc_unlock_shared(struct pid_namespace *pid_ns) +{ +} + +#endif /* CONFIG_PROC_FS */ + #endif /* _LINUX_PID_NS_H */ diff --git a/include/linux/proc_fs.h b/include/linux/proc_fs.h index 2d16d0e..70e8b10 100644 --- a/include/linux/proc_fs.h +++ b/include/linux/proc_fs.h @@ -19,7 +19,9 @@ enum { /* definitions for proc mount option limit_pids */ }; struct proc_fs_info { + struct super_block *sb; struct pid_namespace *pid_ns; + struct list_head pidns_entry; /* Node in procfs_mounts of a pidns */ struct dentry *proc_self; /* For /proc/self */ struct dentry *proc_thread_self; /* For /proc/thread-self/ */ bool newinstance; /* Private flag for new separated instances */ -- 2.10.5