Received: by 2002:ab2:6857:0:b0:1ef:ffd0:ce49 with SMTP id l23csp2606253lqp; Mon, 25 Mar 2024 04:20:16 -0700 (PDT) X-Forwarded-Encrypted: i=3; AJvYcCUF0ILChLKNuL5t6mHhqsVtVeryaurhxwMwmrLbdGuCzC0KI/4vSNwkncfg0C4Cy5ua9/nNRlOOVesGrFuqEiCdVUohE8cvJExY4ntjTw== X-Google-Smtp-Source: AGHT+IEjYSCFexL7PpsvEVShxIDUf0uUg6WrhfL/jqSdkzPLfyHE+KZj2jKUzrB/oMG04i3Bkcuj X-Received: by 2002:a05:6a00:b43:b0:6e7:48e8:6aa4 with SMTP id p3-20020a056a000b4300b006e748e86aa4mr9410941pfo.15.1711365615792; Mon, 25 Mar 2024 04:20:15 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1711365615; cv=pass; d=google.com; s=arc-20160816; b=nHzE35sFJ6eIIVZ4poEwNa19ZozH+kscZZMc8DyiQF76anEuzfi2K0hysAf6R+1xOh MfTX5vqlAJt4lpt16QYW2nFnCTCcMEmuOxiOGvslWLgT0Fwjn6nrNOCX0SuLSw5dizoY IBds3UXixnL+XjQ8ifs6PAqIcFO0jza3emvtDimkR0kB72Z+7TMi2S1eJ3O1zNWl20Jq DF1yp+l6eKbgWA86nVfTRBVBdkib8PWknKVFmnt0jwu2gMzkczKdB6HmZQUGwEnTyL5z 3ioR/2sny/4J2BDc0c4xPvo///Qq5jpRN+LhOu0z8bRmLkIGDi3x4TPe48M/Vw1n1vPm VD6Q== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:mime-version:list-unsubscribe :list-subscribe:list-id:precedence:references:in-reply-to:message-id :date:subject:cc:to:from:dkim-signature; bh=j/IUMxTEtKPOIUKE7hHrUEClGm6e6TZOhmhGGGlnHtg=; fh=KQYY3sTgolpGhQjVKI0cNcbPkFtYN1xfnB7YTzFbe8I=; b=aaelpD175iyDuMMmb4zx9TbcAAMgaggvvJNj5VFPXxiGv/LZdfYq3z2Poe395QCl8e ILiSME9ZeTWk9pdUDYuDgSaXXJpLePHSvxjHWeNEXMcDOP/OXWq/DAz0COr2DFIyqiCv CxE/qsO/IHZNoeIOYbtqGbPJ6bf2miMHnsKRuDNRYd2moLZG6Drr7FlKLiNSyQRl0Zhg pU2nR7vifX3Vkp+d9EiRHyv4FTwoch5zUxiMfkdhrHcrlgoGVOLBKYcSCPJApsrMPSre XsYPUEUb+h+zzZDg6s3EFC5jtCaqDD2ay7gsw2yeiN8lI5IodjP9HTitzeAXdKi2en0j 1/6g==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b="KQpV/RaO"; arc=pass (i=1 dkim=pass dkdomain=kernel.org); spf=pass (google.com: domain of linux-kernel+bounces-115522-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45e3:2400::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-115522-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from sv.mirrors.kernel.org (sv.mirrors.kernel.org. [2604:1380:45e3:2400::1]) by mx.google.com with ESMTPS id fd24-20020a056a002e9800b006eaada3b713si1571928pfb.147.2024.03.25.04.20.15 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 25 Mar 2024 04:20:15 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel+bounces-115522-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45e3:2400::1 as permitted sender) client-ip=2604:1380:45e3:2400::1; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b="KQpV/RaO"; arc=pass (i=1 dkim=pass dkdomain=kernel.org); spf=pass (google.com: domain of linux-kernel+bounces-115522-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45e3:2400::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-115522-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by sv.mirrors.kernel.org (Postfix) with ESMTPS id 0110B2C5AA7 for ; Mon, 25 Mar 2024 11:12:22 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id D717323E43C; Mon, 25 Mar 2024 02:42:15 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b="KQpV/RaO" Received: from smtp.kernel.org (aws-us-west-2-korg-mail-1.web.codeaurora.org [10.30.226.201]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 88FA31494BA; Sun, 24 Mar 2024 22:56:12 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=10.30.226.201 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1711320972; cv=none; b=bWnGLl4njuxTQGAt50zYA/tnkr2KK/NSN3PmZL+jiKoe1GQWHIS7H3QZe37bl6OD3AJGVIYZJNholm/seuKtwrHbQeu9mHfRr76ekYXs/vtfdWXFubMYwvFHDXORyFTC/1zLdmBOUbHRiJmew8LOjGmg/LyddtUnzszEN4OxH1E= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1711320972; c=relaxed/simple; bh=VRyHqkwnrWOxKYONL7l3ggEL9OY/Mc3b+ogGjCd+mZA=; h=From:To:Cc:Subject:Date:Message-ID:In-Reply-To:References: MIME-Version; b=FHMqOkPH/Zl4k9jZTdmIqVKh+vbfcVTtiu+hmttWQsMi8D1XR94tXPvDtYbfR3XmCgsuMes66NNWlTtK8lI+I0ojqOrSClDKkzcK2BSiVdMNczdYL3r4Dj6f476QkbaonJPhHloxt3uuzSDrlXRxeysYlLVnsYS9gko+C5dQ1X0= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b=KQpV/RaO; arc=none smtp.client-ip=10.30.226.201 Received: by smtp.kernel.org (Postfix) with ESMTPSA id 81F7EC43390; Sun, 24 Mar 2024 22:56:11 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1711320972; bh=VRyHqkwnrWOxKYONL7l3ggEL9OY/Mc3b+ogGjCd+mZA=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=KQpV/RaO8NG08Ge6hV3cnQj5Xgo7Em8cjZdXUvj1U4S8IUM/78h2jOmVVEY3+eMcN mBRywQMMJgqs/qt6XYU/yB6zRgmeqIiEuNHD3XS+yySNVb4Ru/nIbJV7zRY+oRWlkJ /OtuGSKzKdeh2pO2ey2zeZ/qvOYa+DF6/e4IB0WYXvhnjUifUzH0LObsgu/TSqGslB UqIZpq+aToWbdMnjrHEnYazjyn9vhjeOkc3YFyGztnEq+zUgyxKiuRQPXsiEGJN0bQ dFuSW6ni4bnGBGJe7m43qLs2uaBr8ABoMJnPzsUgHpEOHskU3YwynpE2rNU/C4TL/S 95kbq2jT8CmZQ== From: Sasha Levin To: linux-kernel@vger.kernel.org, stable@vger.kernel.org Cc: Changbin Du , Eric Chanudet , Luis Chamberlain , Xiaoyi Su , Andrew Morton , Sasha Levin Subject: [PATCH 6.7 535/713] modules: wait do_free_init correctly Date: Sun, 24 Mar 2024 18:44:21 -0400 Message-ID: <20240324224720.1345309-536-sashal@kernel.org> X-Mailer: git-send-email 2.43.0 In-Reply-To: <20240324224720.1345309-1-sashal@kernel.org> References: <20240324224720.1345309-1-sashal@kernel.org> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 X-stable: review X-Patchwork-Hint: Ignore Content-Transfer-Encoding: 8bit From: Changbin Du [ Upstream commit 8f8cd6c0a43ed637e620bbe45a8d0e0c2f4d5130 ] The synchronization here is to ensure the ordering of freeing of a module init so that it happens before W+X checking. It is worth noting it is not that the freeing was not happening, it is just that our sanity checkers raced against the permission checkers which assume init memory is already gone. Commit 1a7b7d922081 ("modules: Use vmalloc special flag") moved calling do_free_init() into a global workqueue instead of relying on it being called through call_rcu(..., do_free_init), which used to allowed us call do_free_init() asynchronously after the end of a subsequent grace period. The move to a global workqueue broke the gaurantees for code which needed to be sure the do_free_init() would complete with rcu_barrier(). To fix this callers which used to rely on rcu_barrier() must now instead use flush_work(&init_free_wq). Without this fix, we still could encounter false positive reports in W+X checking since the rcu_barrier() here can not ensure the ordering now. Even worse, the rcu_barrier() can introduce significant delay. Eric Chanudet reported that the rcu_barrier introduces ~0.1s delay on a PREEMPT_RT kernel. [ 0.291444] Freeing unused kernel memory: 5568K [ 0.402442] Run /sbin/init as init process With this fix, the above delay can be eliminated. Link: https://lkml.kernel.org/r/20240227023546.2490667-1-changbin.du@huawei.com Fixes: 1a7b7d922081 ("modules: Use vmalloc special flag") Signed-off-by: Changbin Du Tested-by: Eric Chanudet Acked-by: Luis Chamberlain Cc: Xiaoyi Su Signed-off-by: Andrew Morton Signed-off-by: Sasha Levin --- include/linux/moduleloader.h | 8 ++++++++ init/main.c | 5 +++-- kernel/module/main.c | 9 +++++++-- 3 files changed, 18 insertions(+), 4 deletions(-) diff --git a/include/linux/moduleloader.h b/include/linux/moduleloader.h index 001b2ce83832e..89b1e0ed98114 100644 --- a/include/linux/moduleloader.h +++ b/include/linux/moduleloader.h @@ -115,6 +115,14 @@ int module_finalize(const Elf_Ehdr *hdr, const Elf_Shdr *sechdrs, struct module *mod); +#ifdef CONFIG_MODULES +void flush_module_init_free_work(void); +#else +static inline void flush_module_init_free_work(void) +{ +} +#endif + /* Any cleanup needed when module leaves. */ void module_arch_cleanup(struct module *mod); diff --git a/init/main.c b/init/main.c index e24b0780fdff7..9e6ab6d593bd8 100644 --- a/init/main.c +++ b/init/main.c @@ -88,6 +88,7 @@ #include #include #include +#include #include #include #include @@ -1402,11 +1403,11 @@ static void mark_readonly(void) if (rodata_enabled) { /* * load_module() results in W+X mappings, which are cleaned - * up with call_rcu(). Let's make sure that queued work is + * up with init_free_wq. Let's make sure that queued work is * flushed so that we don't hit false positives looking for * insecure pages which are W+X. */ - rcu_barrier(); + flush_module_init_free_work(); mark_rodata_ro(); rodata_test(); } else diff --git a/kernel/module/main.c b/kernel/module/main.c index 98fedfdb8db52..34d9e718c2c7d 100644 --- a/kernel/module/main.c +++ b/kernel/module/main.c @@ -2486,6 +2486,11 @@ static void do_free_init(struct work_struct *w) } } +void flush_module_init_free_work(void) +{ + flush_work(&init_free_wq); +} + #undef MODULE_PARAM_PREFIX #define MODULE_PARAM_PREFIX "module." /* Default value for module->async_probe_requested */ @@ -2590,8 +2595,8 @@ static noinline int do_init_module(struct module *mod) * Note that module_alloc() on most architectures creates W+X page * mappings which won't be cleaned up until do_free_init() runs. Any * code such as mark_rodata_ro() which depends on those mappings to - * be cleaned up needs to sync with the queued work - ie - * rcu_barrier() + * be cleaned up needs to sync with the queued work by invoking + * flush_module_init_free_work(). */ if (llist_add(&freeinit->node, &init_free_list)) schedule_work(&init_free_wq); -- 2.43.0