Received: by 2002:a05:6358:489b:b0:bb:da1:e618 with SMTP id x27csp1448843rwn; Thu, 8 Sep 2022 20:58:42 -0700 (PDT) X-Google-Smtp-Source: AA6agR7wQlc/zATFbA9IcfoEvpgJGGkiHfKbhVmC3Y7OL/yhujSiKl1M1enrfx4MNbfiK1y+lTmo X-Received: by 2002:a17:903:2582:b0:176:9ba8:3c9f with SMTP id jb2-20020a170903258200b001769ba83c9fmr11944051plb.87.1662695922434; Thu, 08 Sep 2022 20:58:42 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1662695922; cv=none; d=google.com; s=arc-20160816; b=Ls8Fonw4xrtFKUJSkCbAoyYxquTYhbBkTBIZBcRMPP0YtZMkm+46arZ9hV+En6AlVL 6diNtPkfEykaqo55RH4F2N2Pq+gGIjqWFmejsiwQAXqi1WEbQEc/80yXAKJ7j5tTCD3Y 6q/ZA1Ni0LaVF8DlVzUxpP5oyiMdQ65DX4Qphn8OQW9MrUS1WC73cvrpEj9QCEd3dt/s GgUNTZlVsNAftI7jC2sjCoYX2oI88EDRmEBnAIgg1Zv5yd6qn1pa+BltPnTDL63Bys5V sp5o2PCO7IJHXTHEy3Lqd8UUGbtn6QFMmxJ3oLrwOiBhA+tFzFBaJV8nL9KX3dIxwNLh DifQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=qODERuM3tR58z1JvV5JOgknVDgt/9+j8/Xvp72hmD1U=; b=xaijUGEgbJ3t5UuXcqhdSjXorQgHLAbkL8Gmd1R0lWiL3Ct852ZNSiGFLWIDPzIjRl k2ChjDEe/2txMvFmZluMCtkLCdTCoFfvguES9dKnS4K8KROfYw8kxVMWZAQmfZEmKBEZ bwFCLrbNferLMoRqWkMsauO+hos2KFje3SxilP6ZSSostPerHoYm+bmSIORgWiuD7dE3 Y+pY41MmvbnPaRMJv19hmX8CHmtJv4X1vXInUIOP2XjOKyXBwAszkmfS1CmVaE1xxhs1 +RV7eysaDTh4I+JiSy4WBkTkVwG5iixJCS5Y6M8obZ3pwk1xfN5Hn2v5uRxGxdEuu4Fd up6A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=grqr98PF; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id y20-20020a056a00191400b00540e1e14574si975140pfi.282.2022.09.08.20.58.31; Thu, 08 Sep 2022 20:58:42 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=grqr98PF; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230216AbiIIDsX (ORCPT + 99 others); Thu, 8 Sep 2022 23:48:23 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:58394 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229905AbiIIDsV (ORCPT ); Thu, 8 Sep 2022 23:48:21 -0400 Received: from ams.source.kernel.org (ams.source.kernel.org [IPv6:2604:1380:4601:e00::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id F047E1203DB for ; Thu, 8 Sep 2022 20:48:19 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ams.source.kernel.org (Postfix) with ESMTPS id 5F17AB8206F for ; Fri, 9 Sep 2022 03:48:18 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 0D581C433D7 for ; Fri, 9 Sep 2022 03:48:17 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1662695297; bh=PX3ihAO3NKjkC40uj7ai/McUKZqNLhwG6Yyy59W8/fI=; h=References:In-Reply-To:From:Date:Subject:To:Cc:From; b=grqr98PFbFMNPRXQBg2JfLMLtHMNDaLnPd95myEtneiWyON4sYMv/BBcD+Egf2jWK 9Wp9LSG0Ar1pTRlHdDJykgr3XmTJKyoV7Kg7zFV8rkoF7N8/v/C3DaWYDyrxpVgCYF JXyzWB8p2nG2F3R0b7SE6CR77QP0F8FfNoTbw9OUfiWueAK9vNILgOBg34T0uQljSw 6ealkbqy+E/zUCxWFo52wPUKaKoYP0KGqTA2hqLrrIkaW4Yz0eftsrPiSovcUjqCvf LF6kF7BlO03XVnhDpFmrUJ20AlOqw+LiCKtoZVJkMd9liBo+I9EnGFdCv8TGN+EdGk HEYRUKg5zHbCQ== Received: by mail-oa1-f47.google.com with SMTP id 586e51a60fabf-1278624b7c4so896593fac.5 for ; Thu, 08 Sep 2022 20:48:17 -0700 (PDT) X-Gm-Message-State: ACgBeo3SLRLoPoKHhGhVWg2lqyDtV0fOoNV19tqSnE/kWtmFBv6VIXxy 81/8JRSAyV8g0jTSmHBY3LL4zTRubl+GTllTsGs= X-Received: by 2002:a05:6808:2028:b0:344:246d:2bed with SMTP id q40-20020a056808202800b00344246d2bedmr2706690oiw.19.1662695296022; Thu, 08 Sep 2022 20:48:16 -0700 (PDT) MIME-Version: 1.0 References: <20220908150421.323-1-jszhang@kernel.org> In-Reply-To: <20220908150421.323-1-jszhang@kernel.org> From: Guo Ren Date: Fri, 9 Sep 2022 11:48:03 +0800 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: [PATCH v2] riscv: vdso: fix NULL deference in vdso_join_timens() when vfork To: Jisheng Zhang Cc: Paul Walmsley , Palmer Dabbelt , Albert Ou , linux-riscv@lists.infradead.org, linux-kernel@vger.kernel.org Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-7.1 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_HI, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Great job! I've tested it on rv32-rootfs, and everything seems to be working. Tested-by: Guo Ren On Thu, Sep 8, 2022 at 11:13 PM Jisheng Zhang wrote: > > Testing tools/testing/selftests/timens/vfork_exec.c got below > kernel log: > > [ 6.838454] Unable to handle kernel access to user memory without uaccess routines at virtual address 0000000000000020 > [ 6.842255] Oops [#1] > [ 6.842871] Modules linked in: > [ 6.844249] CPU: 1 PID: 64 Comm: vfork_exec Not tainted 6.0.0-rc3-rt15+ #8 > [ 6.845861] Hardware name: riscv-virtio,qemu (DT) > [ 6.848009] epc : vdso_join_timens+0xd2/0x110 > [ 6.850097] ra : vdso_join_timens+0xd2/0x110 > [ 6.851164] epc : ffffffff8000635c ra : ffffffff8000635c sp : ff6000000181fbf0 > [ 6.852562] gp : ffffffff80cff648 tp : ff60000000fdb700 t0 : 3030303030303030 > [ 6.853852] t1 : 0000000000000030 t2 : 3030303030303030 s0 : ff6000000181fc40 > [ 6.854984] s1 : ff60000001e6c000 a0 : 0000000000000010 a1 : ffffffff8005654c > [ 6.856221] a2 : 00000000ffffefff a3 : 0000000000000000 a4 : 0000000000000000 > [ 6.858114] a5 : 0000000000000000 a6 : 0000000000000008 a7 : 0000000000000038 > [ 6.859484] s2 : ff60000001e6c068 s3 : ff6000000108abb0 s4 : 0000000000000000 > [ 6.860751] s5 : 0000000000001000 s6 : ffffffff8089dc40 s7 : ffffffff8089dc38 > [ 6.862029] s8 : ffffffff8089dc30 s9 : ff60000000fdbe38 s10: 000000000000005e > [ 6.863304] s11: ffffffff80cc3510 t3 : ffffffff80d1112f t4 : ffffffff80d1112f > [ 6.864565] t5 : ffffffff80d11130 t6 : ff6000000181fa00 > [ 6.865561] status: 0000000000000120 badaddr: 0000000000000020 cause: 000000000000000d > [ 6.868046] [] timens_commit+0x38/0x11a > [ 6.869089] [] timens_on_fork+0x72/0xb4 > [ 6.870055] [] begin_new_exec+0x3c6/0x9f0 > [ 6.871231] [] load_elf_binary+0x628/0x1214 > [ 6.872304] [] bprm_execve+0x1f2/0x4e4 > [ 6.873243] [] do_execveat_common+0x16e/0x1ee > [ 6.874258] [] sys_execve+0x3c/0x48 > [ 6.875162] [] ret_from_syscall+0x0/0x2 > [ 6.877484] ---[ end trace 0000000000000000 ]--- > > This is because the mm->context.vdso_info is NULL in vfork case. From > another side, mm->context.vdso_info either points to vdso info > for RV64 or vdso info for compat, there's no need to bloat riscv's > mm_context_t, we can handle the difference when setup the additional > page for vdso. > > Signed-off-by: Jisheng Zhang > Fixes: 3092eb456375 ("riscv: compat: vdso: Add setup additional pages implementation") > --- > > since v1: > - add "Fixes" tag > - fix build error when CONFIG_COMPAT is enabled. > > arch/riscv/include/asm/mmu.h | 1 - > arch/riscv/kernel/vdso.c | 178 +++++++++++++++++++---------------- > 2 files changed, 95 insertions(+), 84 deletions(-) > > diff --git a/arch/riscv/include/asm/mmu.h b/arch/riscv/include/asm/mmu.h > index cedcf8ea3c76..0099dc116168 100644 > --- a/arch/riscv/include/asm/mmu.h > +++ b/arch/riscv/include/asm/mmu.h > @@ -16,7 +16,6 @@ typedef struct { > atomic_long_t id; > #endif > void *vdso; > - void *vdso_info; > #ifdef CONFIG_SMP > /* A local icache flush is needed before user execution can resume. */ > cpumask_t icache_stale_mask; > diff --git a/arch/riscv/kernel/vdso.c b/arch/riscv/kernel/vdso.c > index 69b05b6c181b..8ca3b821e1b5 100644 > --- a/arch/riscv/kernel/vdso.c > +++ b/arch/riscv/kernel/vdso.c > @@ -27,6 +27,11 @@ extern char vdso_start[], vdso_end[]; > extern char compat_vdso_start[], compat_vdso_end[]; > #endif > > +enum vdso_abi { > + VDSO_ABI_RV64, > + VDSO_ABI_RV32, > +}; > + > enum vvar_pages { > VVAR_DATA_PAGE_OFFSET, > VVAR_TIMENS_PAGE_OFFSET, > @@ -68,67 +73,7 @@ static int vdso_mremap(const struct vm_special_mapping *sm, > return 0; > } > > -static void __init __vdso_init(struct __vdso_info *vdso_info) > -{ > - unsigned int i; > - struct page **vdso_pagelist; > - unsigned long pfn; > - > - if (memcmp(vdso_info->vdso_code_start, "\177ELF", 4)) > - panic("vDSO is not a valid ELF object!\n"); > - > - vdso_info->vdso_pages = ( > - vdso_info->vdso_code_end - > - vdso_info->vdso_code_start) >> > - PAGE_SHIFT; > - > - vdso_pagelist = kcalloc(vdso_info->vdso_pages, > - sizeof(struct page *), > - GFP_KERNEL); > - if (vdso_pagelist == NULL) > - panic("vDSO kcalloc failed!\n"); > - > - /* Grab the vDSO code pages. */ > - pfn = sym_to_pfn(vdso_info->vdso_code_start); > - > - for (i = 0; i < vdso_info->vdso_pages; i++) > - vdso_pagelist[i] = pfn_to_page(pfn + i); > - > - vdso_info->cm->pages = vdso_pagelist; > -} > - > #ifdef CONFIG_TIME_NS > -struct vdso_data *arch_get_vdso_data(void *vvar_page) > -{ > - return (struct vdso_data *)(vvar_page); > -} > - > -/* > - * The vvar mapping contains data for a specific time namespace, so when a task > - * changes namespace we must unmap its vvar data for the old namespace. > - * Subsequent faults will map in data for the new namespace. > - * > - * For more details see timens_setup_vdso_data(). > - */ > -int vdso_join_timens(struct task_struct *task, struct time_namespace *ns) > -{ > - struct mm_struct *mm = task->mm; > - struct vm_area_struct *vma; > - struct __vdso_info *vdso_info = mm->context.vdso_info; > - > - mmap_read_lock(mm); > - > - for (vma = mm->mmap; vma; vma = vma->vm_next) { > - unsigned long size = vma->vm_end - vma->vm_start; > - > - if (vma_is_special_mapping(vma, vdso_info->dm)) > - zap_page_range(vma, vma->vm_start, size); > - } > - > - mmap_read_unlock(mm); > - return 0; > -} > - > static struct page *find_timens_vvar_page(struct vm_area_struct *vma) > { > if (likely(vma->vm_mm == current->mm)) > @@ -197,13 +142,6 @@ static struct vm_special_mapping rv_vdso_maps[] __ro_after_init = { > }, > }; > > -static struct __vdso_info vdso_info __ro_after_init = { > - .name = "vdso", > - .vdso_code_start = vdso_start, > - .vdso_code_end = vdso_end, > - .dm = &rv_vdso_maps[RV_VDSO_MAP_VVAR], > - .cm = &rv_vdso_maps[RV_VDSO_MAP_VDSO], > -}; > > #ifdef CONFIG_COMPAT > static struct vm_special_mapping rv_compat_vdso_maps[] __ro_after_init = { > @@ -216,21 +154,97 @@ static struct vm_special_mapping rv_compat_vdso_maps[] __ro_after_init = { > .mremap = vdso_mremap, > }, > }; > +#endif > > -static struct __vdso_info compat_vdso_info __ro_after_init = { > - .name = "compat_vdso", > - .vdso_code_start = compat_vdso_start, > - .vdso_code_end = compat_vdso_end, > - .dm = &rv_compat_vdso_maps[RV_VDSO_MAP_VVAR], > - .cm = &rv_compat_vdso_maps[RV_VDSO_MAP_VDSO], > +static struct __vdso_info vdso_info[] __ro_after_init = { > + [VDSO_ABI_RV64] = { > + .name = "vdso", > + .vdso_code_start = vdso_start, > + .vdso_code_end = vdso_end, > + .dm = &rv_vdso_maps[RV_VDSO_MAP_VVAR], > + .cm = &rv_vdso_maps[RV_VDSO_MAP_VDSO], > + }, > +#ifdef CONFIG_COMPAT > + [VDSO_ABI_RV32] = { > + .name = "compat_vdso", > + .vdso_code_start = compat_vdso_start, > + .vdso_code_end = compat_vdso_end, > + .dm = &rv_compat_vdso_maps[RV_VDSO_MAP_VVAR], > + .cm = &rv_compat_vdso_maps[RV_VDSO_MAP_VDSO], > + }, > +#endif > }; > + > +static void __init __vdso_init(enum vdso_abi abi) > +{ > + unsigned int i; > + struct page **vdso_pagelist; > + unsigned long pfn; > + > + if (memcmp(vdso_info[abi].vdso_code_start, "\177ELF", 4)) > + panic("vDSO is not a valid ELF object!\n"); > + > + vdso_info[abi].vdso_pages = ( > + vdso_info[abi].vdso_code_end - > + vdso_info[abi].vdso_code_start) >> > + PAGE_SHIFT; > + > + vdso_pagelist = kcalloc(vdso_info[abi].vdso_pages, > + sizeof(struct page *), > + GFP_KERNEL); > + if (vdso_pagelist == NULL) > + panic("vDSO kcalloc failed!\n"); > + > + /* Grab the vDSO code pages. */ > + pfn = sym_to_pfn(vdso_info[abi].vdso_code_start); > + > + for (i = 0; i < vdso_info[abi].vdso_pages; i++) > + vdso_pagelist[i] = pfn_to_page(pfn + i); > + > + vdso_info[abi].cm->pages = vdso_pagelist; > +} > + > +#ifdef CONFIG_TIME_NS > +struct vdso_data *arch_get_vdso_data(void *vvar_page) > +{ > + return (struct vdso_data *)(vvar_page); > +} > + > +/* > + * The vvar mapping contains data for a specific time namespace, so when a task > + * changes namespace we must unmap its vvar data for the old namespace. > + * Subsequent faults will map in data for the new namespace. > + * > + * For more details see timens_setup_vdso_data(). > + */ > +int vdso_join_timens(struct task_struct *task, struct time_namespace *ns) > +{ > + struct mm_struct *mm = task->mm; > + struct vm_area_struct *vma; > + > + mmap_read_lock(mm); > + > + for (vma = mm->mmap; vma; vma = vma->vm_next) { > + unsigned long size = vma->vm_end - vma->vm_start; > + > + if (vma_is_special_mapping(vma, vdso_info[VDSO_ABI_RV64].dm)) > + zap_page_range(vma, vma->vm_start, size); > +#ifdef CONFIG_COMPAT > + if (vma_is_special_mapping(vma, vdso_info[VDSO_ABI_RV32].dm)) > + zap_page_range(vma, vma->vm_start, size); > +#endif > + } > + > + mmap_read_unlock(mm); > + return 0; > +} > #endif > > static int __init vdso_init(void) > { > - __vdso_init(&vdso_info); > + __vdso_init(VDSO_ABI_RV64); > #ifdef CONFIG_COMPAT > - __vdso_init(&compat_vdso_info); > + __vdso_init(VDSO_ABI_RV32); > #endif > > return 0; > @@ -240,14 +254,14 @@ arch_initcall(vdso_init); > static int __setup_additional_pages(struct mm_struct *mm, > struct linux_binprm *bprm, > int uses_interp, > - struct __vdso_info *vdso_info) > + enum vdso_abi abi) > { > unsigned long vdso_base, vdso_text_len, vdso_mapping_len; > void *ret; > > BUILD_BUG_ON(VVAR_NR_PAGES != __VVAR_PAGES); > > - vdso_text_len = vdso_info->vdso_pages << PAGE_SHIFT; > + vdso_text_len = vdso_info[abi].vdso_pages << PAGE_SHIFT; > /* Be sure to map the data page */ > vdso_mapping_len = vdso_text_len + VVAR_SIZE; > > @@ -258,18 +272,17 @@ static int __setup_additional_pages(struct mm_struct *mm, > } > > ret = _install_special_mapping(mm, vdso_base, VVAR_SIZE, > - (VM_READ | VM_MAYREAD | VM_PFNMAP), vdso_info->dm); > + (VM_READ | VM_MAYREAD | VM_PFNMAP), vdso_info[abi].dm); > if (IS_ERR(ret)) > goto up_fail; > > vdso_base += VVAR_SIZE; > mm->context.vdso = (void *)vdso_base; > - mm->context.vdso_info = (void *)vdso_info; > > ret = > _install_special_mapping(mm, vdso_base, vdso_text_len, > (VM_READ | VM_EXEC | VM_MAYREAD | VM_MAYWRITE | VM_MAYEXEC), > - vdso_info->cm); > + vdso_info[abi].cm); > > if (IS_ERR(ret)) > goto up_fail; > @@ -291,8 +304,7 @@ int compat_arch_setup_additional_pages(struct linux_binprm *bprm, > if (mmap_write_lock_killable(mm)) > return -EINTR; > > - ret = __setup_additional_pages(mm, bprm, uses_interp, > - &compat_vdso_info); > + ret = __setup_additional_pages(mm, bprm, uses_interp, VDSO_ABI_RV32); > mmap_write_unlock(mm); > > return ret; > @@ -307,7 +319,7 @@ int arch_setup_additional_pages(struct linux_binprm *bprm, int uses_interp) > if (mmap_write_lock_killable(mm)) > return -EINTR; > > - ret = __setup_additional_pages(mm, bprm, uses_interp, &vdso_info); > + ret = __setup_additional_pages(mm, bprm, uses_interp, VDSO_ABI_RV64); > mmap_write_unlock(mm); > > return ret; > -- > 2.34.1 > -- Best Regards Guo Ren