From: Greg Kroah-Hartman
To: linux-kernel@vger.kernel.org
Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Mike Kravetz , Guillaume Morin , Andrew Morton , Linus Torvalds
Subject: [PATCH 5.10 287/306] hugetlb: fix hugetlb cgroup refcounting during vma split
Date: Thu, 16 Sep 2021 18:00:32 +0200
Message-Id: <20210916155803.886798245@linuxfoundation.org>
X-Mailer: git-send-email 2.33.0
In-Reply-To: <20210916155753.903069397@linuxfoundation.org>
References: <20210916155753.903069397@linuxfoundation.org>
User-Agent: quilt/0.66
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Precedence: bulk
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

From: Mike Kravetz

commit 09a26e832705fdb7a9484495b71a05e0bbc65207 upstream.

Guillaume Morin reported hitting the following WARNING followed by GPF or
NULL pointer dereference either in cgroups_destroy or in the kill_css path:

    percpu ref (css_release) <= 0 (-1) after switching to atomic
    WARNING: CPU: 23 PID: 130 at lib/percpu-refcount.c:196 percpu_ref_switch_to_atomic_rcu+0x127/0x130
    CPU: 23 PID: 130 Comm: ksoftirqd/23 Kdump: loaded Tainted: G O 5.10.60 #1
    RIP: 0010:percpu_ref_switch_to_atomic_rcu+0x127/0x130
    Call Trace:
       rcu_core+0x30f/0x530
       rcu_core_si+0xe/0x10
       __do_softirq+0x103/0x2a2
       run_ksoftirqd+0x2b/0x40
       smpboot_thread_fn+0x11a/0x170
       kthread+0x10a/0x140
       ret_from_fork+0x22/0x30

Upon further examination, it was discovered that the css structure was
associated with hugetlb reservations.

For private hugetlb mappings the vma points to a reserve map that
contains a pointer to the css.  At mmap time, reservations are set up
and a reference to the css is taken.  This reference is dropped in the
vma close operation, hugetlb_vm_op_close.  However, if a vma is split,
no additional reference to the css is taken, yet hugetlb_vm_op_close will
be called twice for the split vma, resulting in an underflow.

Fix by taking another reference in hugetlb_vm_op_open.  Note that the
reference is only taken for the owner of the reserve map.  In the more
common fork case, the pointer to the reserve map is cleared for
non-owning vmas.

Link: https://lkml.kernel.org/r/20210830215015.155224-1-mike.kravetz@oracle.com
Fixes: e9fe92ae0cd2 ("hugetlb_cgroup: add reservation accounting for private mappings")
Signed-off-by: Mike Kravetz
Reported-by: Guillaume Morin
Suggested-by: Guillaume Morin
Tested-by: Guillaume Morin
Cc:
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds
Signed-off-by: Greg Kroah-Hartman
---
 include/linux/hugetlb_cgroup.h |   12 ++++++++++++
 mm/hugetlb.c                   |    4 +++-
 2 files changed, 15 insertions(+), 1 deletion(-)

--- a/include/linux/hugetlb_cgroup.h
+++ b/include/linux/hugetlb_cgroup.h
@@ -118,6 +118,13 @@ static inline void hugetlb_cgroup_put_rs
 	css_put(&h_cg->css);
 }
 
+static inline void resv_map_dup_hugetlb_cgroup_uncharge_info(
+						struct resv_map *resv_map)
+{
+	if (resv_map->css)
+		css_get(resv_map->css);
+}
+
 extern int hugetlb_cgroup_charge_cgroup(int idx, unsigned long nr_pages,
 					struct hugetlb_cgroup **ptr);
 extern int hugetlb_cgroup_charge_cgroup_rsvd(int idx, unsigned long nr_pages,
@@ -196,6 +203,11 @@ static inline void hugetlb_cgroup_put_rs
 {
 }
 
+static inline void resv_map_dup_hugetlb_cgroup_uncharge_info(
+						struct resv_map *resv_map)
+{
+}
+
 static inline int hugetlb_cgroup_charge_cgroup(int idx, unsigned long nr_pages,
 					       struct hugetlb_cgroup **ptr)
 {
--- a/mm/hugetlb.c
+++ b/mm/hugetlb.c
@@ -3659,8 +3659,10 @@ static void hugetlb_vm_op_open(struct vm
 	 * after this open call completes.  It is therefore safe to take a
 	 * new reference here without additional locking.
 	 */
-	if (resv && is_vma_resv_set(vma, HPAGE_RESV_OWNER))
+	if (resv && is_vma_resv_set(vma, HPAGE_RESV_OWNER)) {
+		resv_map_dup_hugetlb_cgroup_uncharge_info(resv);
 		kref_get(&resv->refs);
+	}
 }
 
 static void hugetlb_vm_op_close(struct vm_area_struct *vma)