Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp3077324imu; Fri, 18 Jan 2019 04:30:57 -0800 (PST) X-Google-Smtp-Source: ALg8bN6Ht3nuyikjbjY/5BZTvKpuojHhuk/h2AcKO0fX+axmPhW8pLHxlh/4VioEkHiCO9Ys0ZGQ X-Received: by 2002:a63:5922:: with SMTP id n34mr17392428pgb.435.1547814657513; Fri, 18 Jan 2019 04:30:57 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1547814657; cv=none; d=google.com; s=arc-20160816; b=zIcwlLebo6YDPNc8JK6a33MQpDlQfuFmVy40/lhQhJMBhzj4uWv9Oqvjr0SWe99daZ vHb18zYzRVQmUEtx8B0zCoJy4WJ7SfLJWX8/ydsTMHBCkCEUsHMP4XpiomngLXMY7IQ8 ROxAolNH4jZ5ROW7DUCTCBL+ZA69iWe8vfJtbfBIOQ34cCx/XZNHTbuVmgYhWC0pudGx VjgFOao52Rg1Wzczrwu9aOFx8tf8aAyUW1irA4vR2zUfMp01fDQZoNqJobnvYBXpmTD1 hexane7BJJcM2QUMa4m5QogcI+aG+dh5l8BdVoqpav6doH0mZN8yNm9oVQLPus3tQjbc qGnQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:references:in-reply-to:message-id:date :subject:cc:to:from; bh=hZTRUGGGpZ5mGvk7SJEdrcwZsirejLZC4JEUhdSgKbY=; b=n3pYJNisNJxDTy3CDiQi0o6B+mo0j9P25H20davXro1HLFnEawv86D1F6LJs25Md24 bAFivb8qp5YlYH2nItxdqx0Ba5t0V0hatTzdy5e8dOw74yBQehU/9HKCzPHAr6/qZJUE BBCFz8FGl6LirkfmicoFFIshT3/IweE1L8LvX8QyFSw1TXW+QW/c9/YTb/INpyijANQX a4GBNb7WszhCndRrVeqHkFER9iqgutCq79v/EEo0yfNEj5u2NU71SQAE/RQXqkDU/6fe vNX2xv0OCiUDAJSkja7bB/K7K5yd+BwJnbLc2psaDgs0Bfd2hcvVamNCjdlG8Bt2cyIm 0RCg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id v11si4333223plp.85.2019.01.18.04.30.42; Fri, 18 Jan 2019 04:30:57 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727740AbfARM16 (ORCPT + 99 others); Fri, 18 Jan 2019 07:27:58 -0500 Received: from mga05.intel.com ([192.55.52.43]:2175 "EHLO mga05.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727679AbfARM15 (ORCPT ); Fri, 18 Jan 2019 07:27:57 -0500 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga004.jf.intel.com ([10.7.209.38]) by fmsmga105.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 18 Jan 2019 04:27:56 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.56,491,1539673200"; d="scan'208";a="268136120" Received: from elena-thinkpad-x230.fi.intel.com ([10.237.68.34]) by orsmga004.jf.intel.com with ESMTP; 18 Jan 2019 04:27:53 -0800 From: Elena Reshetova To: mingo@redhat.com Cc: peterz@infradead.org, linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org, viro@zeniv.linux.org.uk, akpm@linux-foundation.org, keescook@chromium.org, tglx@linutronix.de, Elena Reshetova Subject: [PATCH 4/5] sched/task_struct: convert task_struct.usage to refcount_t Date: Fri, 18 Jan 2019 14:27:29 +0200 Message-Id: <1547814450-18902-5-git-send-email-elena.reshetova@intel.com> X-Mailer: git-send-email 2.7.4 In-Reply-To: <1547814450-18902-1-git-send-email-elena.reshetova@intel.com> References: <1547814450-18902-1-git-send-email-elena.reshetova@intel.com> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org atomic_t variables are currently used to implement reference counters with the following properties: - counter is initialized to 1 using atomic_set() - a resource is freed upon counter reaching zero - once counter reaches zero, its further increments aren't allowed - counter schema uses basic atomic operations (set, inc, inc_not_zero, dec_and_test, etc.) Such atomic variables should be converted to a newly provided refcount_t type and API that prevents accidental counter overflows and underflows. This is important since overflows and underflows can lead to use-after-free situation and be exploitable. The variable task_struct.usage is used as pure reference counter. Convert it to refcount_t and fix up the operations. **Important note for maintainers: Some functions from refcount_t API defined in lib/refcount.c have different memory ordering guarantees than their atomic counterparts. The full comparison can be seen in https://lkml.org/lkml/2017/11/15/57 and it is hopefully soon in state to be merged to the documentation tree. Normally the differences should not matter since refcount_t provides enough guarantees to satisfy the refcounting use cases, but in some rare cases it might matter. Please double check that you don't have some undocumented memory guarantees for this variable usage. For the task_struct.usage it might make a difference in following places: - put_task_struct(): decrement in refcount_dec_and_test() only provides RELEASE ordering and control dependency on success vs. fully ordered atomic counterpart Suggested-by: Kees Cook Reviewed-by: David Windsor Reviewed-by: Hans Liljestrand Signed-off-by: Elena Reshetova --- include/linux/sched.h | 3 ++- include/linux/sched/task.h | 4 ++-- init/init_task.c | 2 +- kernel/fork.c | 4 ++-- 4 files changed, 7 insertions(+), 6 deletions(-) diff --git a/include/linux/sched.h b/include/linux/sched.h index 5e66900..bead0c7 100644 --- a/include/linux/sched.h +++ b/include/linux/sched.h @@ -21,6 +21,7 @@ #include #include #include +#include #include #include #include @@ -608,7 +609,7 @@ struct task_struct { randomized_struct_fields_start void *stack; - atomic_t usage; + refcount_t usage; /* Per task flags (PF_*), defined further below: */ unsigned int flags; unsigned int ptrace; diff --git a/include/linux/sched/task.h b/include/linux/sched/task.h index 44c6f15..2e97a22 100644 --- a/include/linux/sched/task.h +++ b/include/linux/sched/task.h @@ -88,13 +88,13 @@ extern void sched_exec(void); #define sched_exec() {} #endif -#define get_task_struct(tsk) do { atomic_inc(&(tsk)->usage); } while(0) +#define get_task_struct(tsk) do { refcount_inc(&(tsk)->usage); } while(0) extern void __put_task_struct(struct task_struct *t); static inline void put_task_struct(struct task_struct *t) { - if (atomic_dec_and_test(&t->usage)) + if (refcount_dec_and_test(&t->usage)) __put_task_struct(t); } diff --git a/init/init_task.c b/init/init_task.c index 756be60..0ad9772 100644 --- a/init/init_task.c +++ b/init/init_task.c @@ -66,7 +66,7 @@ struct task_struct init_task #endif .state = 0, .stack = init_stack, - .usage = ATOMIC_INIT(2), + .usage = REFCOUNT_INIT(2), .flags = PF_KTHREAD, .prio = MAX_PRIO - 20, .static_prio = MAX_PRIO - 20, diff --git a/kernel/fork.c b/kernel/fork.c index 504324c..a4ac220 100644 --- a/kernel/fork.c +++ b/kernel/fork.c @@ -716,7 +716,7 @@ static inline void put_signal_struct(struct signal_struct *sig) void __put_task_struct(struct task_struct *tsk) { WARN_ON(!tsk->exit_state); - WARN_ON(atomic_read(&tsk->usage)); + WARN_ON(refcount_read(&tsk->usage)); WARN_ON(tsk == current); cgroup_free(tsk); @@ -895,7 +895,7 @@ static struct task_struct *dup_task_struct(struct task_struct *orig, int node) * One for us, one for whoever does the "release_task()" (usually * parent) */ - atomic_set(&tsk->usage, 2); + refcount_set(&tsk->usage, 2); #ifdef CONFIG_BLK_DEV_IO_TRACE tsk->btrace_seq = 0; #endif -- 2.7.4