Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp3137845imu; Fri, 18 Jan 2019 05:31:39 -0800 (PST) X-Google-Smtp-Source: ALg8bN4oweD8AG7tXMRE12G3LUDJOdlOOYbAEbo9hF4BW9rmbLcRyWJ9nblf1CjekZlMOgJChr4j X-Received: by 2002:a63:e20a:: with SMTP id q10mr17193167pgh.206.1547818299790; Fri, 18 Jan 2019 05:31:39 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1547818299; cv=none; d=google.com; s=arc-20160816; b=dROObHM+gjzQ894cNP3WutGxnD7V+bb9AsM4j+ed5jkmJgtKRVArw3IfUy0CsZ+dks pE/jLgbFEIgQusqFxplTd0NUVX/NWQGTj5Tq9svxF1eJYY+eSJyW5ERM4RqFe11Su3iX UoQUHrZxy/Tgq220jRtXT6hMcLK1LhkQXPqXykCX9lT46YX7xFNm0NGiVUyyas/IlaD/ AlvGdVZ0mnTTuHD700w4bprvN7hR6QUTgqbO3+gqyslEeXnX9N3xmhAxLeAIoDyoBKu4 rUie0VAi+uRDWlrc0vulzyTK1pRrel45bMa2jwGdMHqqoZuU2wU3cicepTI+laiKUjIE 6iDA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:references:in-reply-to:message-id:date :subject:cc:to:from; bh=3rTfCGxBn9+p8RsUjy3E5VZMle8lWfcbau0oZjX3llc=; b=fyzatZiKJolh8mTuJRVAKZ+VnFidTT80bQWwviLC+Y0Byfu7S3RNnasGuEEW8Jbs/A kwzUrvAjo86KcGnOCprL4V2UA/eTJelsta5dIeuuBANWcRcTDkN6NpWw1a+0wlNQabwD E5TbK2Qq5pS7YOGO2NiWaLmawNufkDPL8s51qXqQ5DTNmmAB+SD4XS09Oxrje35woISY qgwRLXq6lKx6+l9KxCPDsjkeWIkebuzxM30pfydd/55ozaimTr2azJLfSvqre7Mic2Ad 7VLcW7I/9lDNtH0JGv4kNfeZ6TOa9Iwe5XF7BklplNFeqm57DxEr/0DlBWnqyPpSV9sJ UHrg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id cb1si4610830plb.37.2019.01.18.05.31.20; Fri, 18 Jan 2019 05:31:39 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727690AbfARM15 (ORCPT + 99 others); Fri, 18 Jan 2019 07:27:57 -0500 Received: from mga05.intel.com ([192.55.52.43]:2175 "EHLO mga05.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727655AbfARM1y (ORCPT ); Fri, 18 Jan 2019 07:27:54 -0500 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga004.jf.intel.com ([10.7.209.38]) by fmsmga105.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 18 Jan 2019 04:27:53 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.56,491,1539673200"; d="scan'208";a="268136115" Received: from elena-thinkpad-x230.fi.intel.com ([10.237.68.34]) by orsmga004.jf.intel.com with ESMTP; 18 Jan 2019 04:27:50 -0800 From: Elena Reshetova To: mingo@redhat.com Cc: peterz@infradead.org, linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org, viro@zeniv.linux.org.uk, akpm@linux-foundation.org, keescook@chromium.org, tglx@linutronix.de, Elena Reshetova Subject: [PATCH 3/5] sched: convert numa_group.refcount to refcount_t Date: Fri, 18 Jan 2019 14:27:28 +0200 Message-Id: <1547814450-18902-4-git-send-email-elena.reshetova@intel.com> X-Mailer: git-send-email 2.7.4 In-Reply-To: <1547814450-18902-1-git-send-email-elena.reshetova@intel.com> References: <1547814450-18902-1-git-send-email-elena.reshetova@intel.com> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org atomic_t variables are currently used to implement reference counters with the following properties: - counter is initialized to 1 using atomic_set() - a resource is freed upon counter reaching zero - once counter reaches zero, its further increments aren't allowed - counter schema uses basic atomic operations (set, inc, inc_not_zero, dec_and_test, etc.) Such atomic variables should be converted to a newly provided refcount_t type and API that prevents accidental counter overflows and underflows. This is important since overflows and underflows can lead to use-after-free situation and be exploitable. The variable numa_group.refcount is used as pure reference counter. Convert it to refcount_t and fix up the operations. **Important note for maintainers: Some functions from refcount_t API defined in lib/refcount.c have different memory ordering guarantees than their atomic counterparts. The full comparison can be seen in https://lkml.org/lkml/2017/11/15/57 and it is hopefully soon in state to be merged to the documentation tree. Normally the differences should not matter since refcount_t provides enough guarantees to satisfy the refcounting use cases, but in some rare cases it might matter. Please double check that you don't have some undocumented memory guarantees for this variable usage. For the numa_group.refcount it might make a difference in following places: - get_numa_group(): increment in refcount_inc_not_zero() only guarantees control dependency on success vs. fully ordered atomic counterpart - put_numa_group(): decrement in refcount_dec_and_test() only provides RELEASE ordering and control dependency on success vs. fully ordered atomic counterpart Suggested-by: Kees Cook Reviewed-by: David Windsor Reviewed-by: Hans Liljestrand Signed-off-by: Elena Reshetova --- kernel/sched/fair.c | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c index 7303e0b..32a2382 100644 --- a/kernel/sched/fair.c +++ b/kernel/sched/fair.c @@ -1035,7 +1035,7 @@ unsigned int sysctl_numa_balancing_scan_size = 256; unsigned int sysctl_numa_balancing_scan_delay = 1000; struct numa_group { - atomic_t refcount; + refcount_t refcount; spinlock_t lock; /* nr_tasks, tasks */ int nr_tasks; @@ -1104,7 +1104,7 @@ static unsigned int task_scan_start(struct task_struct *p) unsigned long shared = group_faults_shared(ng); unsigned long private = group_faults_priv(ng); - period *= atomic_read(&ng->refcount); + period *= refcount_read(&ng->refcount); period *= shared + 1; period /= private + shared + 1; } @@ -1127,7 +1127,7 @@ static unsigned int task_scan_max(struct task_struct *p) unsigned long private = group_faults_priv(ng); unsigned long period = smax; - period *= atomic_read(&ng->refcount); + period *= refcount_read(&ng->refcount); period *= shared + 1; period /= private + shared + 1; @@ -2203,12 +2203,12 @@ static void task_numa_placement(struct task_struct *p) static inline int get_numa_group(struct numa_group *grp) { - return atomic_inc_not_zero(&grp->refcount); + return refcount_inc_not_zero(&grp->refcount); } static inline void put_numa_group(struct numa_group *grp) { - if (atomic_dec_and_test(&grp->refcount)) + if (refcount_dec_and_test(&grp->refcount)) kfree_rcu(grp, rcu); } @@ -2229,7 +2229,7 @@ static void task_numa_group(struct task_struct *p, int cpupid, int flags, if (!grp) return; - atomic_set(&grp->refcount, 1); + refcount_set(&grp->refcount, 1); grp->active_nodes = 1; grp->max_faults_cpu = 0; spin_lock_init(&grp->lock); -- 2.7.4