Received: by 2002:a05:6358:111d:b0:dc:6189:e246 with SMTP id f29csp3383413rwi; Tue, 1 Nov 2022 20:43:07 -0700 (PDT) X-Google-Smtp-Source: AMsMyM7Ggx7PPdAuqf3Gz9SzO/enREREalK7BXWZfzOzdNg5FhpKvj7RZKPYWGYIPh1XIU10ZX5u X-Received: by 2002:a17:90b:4c87:b0:213:5fe0:6700 with SMTP id my7-20020a17090b4c8700b002135fe06700mr38112959pjb.128.1667360587195; Tue, 01 Nov 2022 20:43:07 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1667360587; cv=none; d=google.com; s=arc-20160816; b=ysDSj8gscDBbHVJtOC7pxWlBnkhUurtEPfUgLhhXX37J/s7hh9SuOUcmP0UGn2QxNl 7K0N+A39nadRkFQ0XXih1D/jE0RmH+AVY760oVk2EqGEW9EaJWoTcctEXY254yjfNX25 EErQeebciHtt6DM9svoCTxkirbZ7mtH9QdZXMwH8B1dsk7YxZGcDqCZZqqNcmzlIXf7n 3NN/2p1JAy8CkK7boL6sfHlfIrGoQBq72S1Az5ePkN7zRQR+Z+fZq/EcCMP56iYOsI9j y8qsKqZByxJZZiXlJhJgbT802jyuefI62WHmtUwOBqWWOo56BIEFCaha+4L2owvAe06D Hr6Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject; bh=4zER4Oj1ONkrlFJzSJllJB+BxuMsG/xuWHZ6+x7ci1w=; b=aSmphHstgoeZYFCyINY+4NyhaPBDciboEXiQBKJ3CluQp7WFbYWpR5EVCMDGFsLs4o RZ0v2k/BV2uJoyyPqn4BPh/mUK7WVAjwsQAdysNvfJGj0zzCcdVwGuRg4x/rp0o+Au03 zlH6+vGKZcZsKbk/iPGdytf1GBgpdjRTqLIsbNKKfZ9qcJq3lkplLN7Hey5Z3JTHdPtQ E8t8nsWQRR7aPigC7W13cF3AvU8SyAsXPqQzayDl12Esm0Dz39Gdggy38dKGm0SuvYDc 9pmsTOrmUOAYeFegrfDDRF2lc/iVmaLrX5uV6S5kndxWae3t/jZhA8W8MtuToMCd/0B4 Bm3Q== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id z8-20020a170902d54800b0016cca53817dsi14657001plf.341.2022.11.01.20.42.54; Tue, 01 Nov 2022 20:43:07 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231143AbiKBCxX (ORCPT + 97 others); Tue, 1 Nov 2022 22:53:23 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:42690 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231158AbiKBCxR (ORCPT ); Tue, 1 Nov 2022 22:53:17 -0400 Received: from szxga02-in.huawei.com (szxga02-in.huawei.com [45.249.212.188]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 63141220FF; Tue, 1 Nov 2022 19:53:12 -0700 (PDT) Received: from dggemv711-chm.china.huawei.com (unknown [172.30.72.57]) by szxga02-in.huawei.com (SkyGuard) with ESMTP id 4N2BJh1hHTzHvVj; Wed, 2 Nov 2022 10:52:52 +0800 (CST) Received: from kwepemm600010.china.huawei.com (7.193.23.86) by dggemv711-chm.china.huawei.com (10.1.198.66) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2375.31; Wed, 2 Nov 2022 10:53:09 +0800 Received: from [10.67.110.237] (10.67.110.237) by kwepemm600010.china.huawei.com (7.193.23.86) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2375.31; Wed, 2 Nov 2022 10:53:08 +0800 Subject: Re: [PATCH] ftrace: Fix use-after-free for dynamic ftrace_ops To: "Masami Hiramatsu (Google)" CC: , , , References: <20221101064146.69551-1-lihuafei1@huawei.com> <20221101230716.22c7b52728990f6b192795c0@kernel.org> From: Li Huafei Message-ID: <2398e4aa-303f-9e58-d710-39c9d5d6fe6b@huawei.com> Date: Wed, 2 Nov 2022 10:53:08 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.9.0 MIME-Version: 1.0 In-Reply-To: <20221101230716.22c7b52728990f6b192795c0@kernel.org> Content-Type: text/plain; charset="utf-8" Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.67.110.237] X-ClientProxiedBy: dggems704-chm.china.huawei.com (10.3.19.181) To kwepemm600010.china.huawei.com (7.193.23.86) X-CFilter-Loop: Reflected X-Spam-Status: No, score=-4.2 required=5.0 tests=BAYES_00,NICE_REPLY_A, RCVD_IN_DNSWL_MED,SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2022/11/1 22:07, Masami Hiramatsu (Google) wrote: > On Tue, 1 Nov 2022 14:41:46 +0800 > Li Huafei wrote: > >> KASAN reported a use-after-free with ftrace ops [1]. It was found from >> vmcore that perf had registered two ops with the same content >> successively, both dynamic. After unregistering the second ops, a >> use-after-free occurred. >> >> In ftrace_shutdown(), when the second ops is unregistered, the >> FTRACE_UPDATE_CALLS command is not set because there is another enabled >> ops with the same content. Also, both ops are dynamic and the ftrace >> callback function is ftrace_ops_list_func, so the >> FTRACE_UPDATE_TRACE_FUNC command will not be set. Eventually the value >> of 'command' will be 0 and ftrace_shutdown() will skip the rcu >> synchronization. >> >> However, ftrace may be activated. When the ops is released, another CPU >> may be accessing the ops. Add the missing synchronization to fix this >> problem. >> >> [1] >> BUG: KASAN: use-after-free in __ftrace_ops_list_func kernel/trace/ftrace.c:7020 [inline] >> BUG: KASAN: use-after-free in ftrace_ops_list_func+0x2b0/0x31c kernel/trace/ftrace.c:7049 >> Read of size 8 at addr ffff56551965bbc8 by task syz-executor.2/14468 >> >> CPU: 1 PID: 14468 Comm: syz-executor.2 Not tainted 5.10.0 #7 >> Hardware name: linux,dummy-virt (DT) >> Call trace: >> dump_backtrace+0x0/0x40c arch/arm64/kernel/stacktrace.c:132 >> show_stack+0x30/0x40 arch/arm64/kernel/stacktrace.c:196 >> __dump_stack lib/dump_stack.c:77 [inline] >> dump_stack+0x1b4/0x248 lib/dump_stack.c:118 >> print_address_description.constprop.0+0x28/0x48c mm/kasan/report.c:387 >> __kasan_report mm/kasan/report.c:547 [inline] >> kasan_report+0x118/0x210 mm/kasan/report.c:564 >> check_memory_region_inline mm/kasan/generic.c:187 [inline] >> __asan_load8+0x98/0xc0 mm/kasan/generic.c:253 >> __ftrace_ops_list_func kernel/trace/ftrace.c:7020 [inline] >> ftrace_ops_list_func+0x2b0/0x31c kernel/trace/ftrace.c:7049 >> ftrace_graph_call+0x0/0x4 >> __might_sleep+0x8/0x100 include/linux/perf_event.h:1170 >> __might_fault mm/memory.c:5183 [inline] >> __might_fault+0x58/0x70 mm/memory.c:5171 >> do_strncpy_from_user lib/strncpy_from_user.c:41 [inline] >> strncpy_from_user+0x1f4/0x4b0 lib/strncpy_from_user.c:139 >> getname_flags+0xb0/0x31c fs/namei.c:149 >> getname+0x2c/0x40 fs/namei.c:209 >> [...] >> >> Allocated by task 14445: >> kasan_save_stack+0x24/0x50 mm/kasan/common.c:48 >> kasan_set_track mm/kasan/common.c:56 [inline] >> __kasan_kmalloc mm/kasan/common.c:479 [inline] >> __kasan_kmalloc.constprop.0+0x110/0x13c mm/kasan/common.c:449 >> kasan_kmalloc+0xc/0x14 mm/kasan/common.c:493 >> kmem_cache_alloc_trace+0x440/0x924 mm/slub.c:2950 >> kmalloc include/linux/slab.h:563 [inline] >> kzalloc include/linux/slab.h:675 [inline] >> perf_event_alloc.part.0+0xb4/0x1350 kernel/events/core.c:11230 >> perf_event_alloc kernel/events/core.c:11733 [inline] >> __do_sys_perf_event_open kernel/events/core.c:11831 [inline] >> __se_sys_perf_event_open+0x550/0x15f4 kernel/events/core.c:11723 >> __arm64_sys_perf_event_open+0x6c/0x80 kernel/events/core.c:11723 >> [...] >> >> Freed by task 14445: >> kasan_save_stack+0x24/0x50 mm/kasan/common.c:48 >> kasan_set_track+0x24/0x34 mm/kasan/common.c:56 >> kasan_set_free_info+0x20/0x40 mm/kasan/generic.c:358 >> __kasan_slab_free.part.0+0x11c/0x1b0 mm/kasan/common.c:437 >> __kasan_slab_free mm/kasan/common.c:445 [inline] >> kasan_slab_free+0x2c/0x40 mm/kasan/common.c:446 >> slab_free_hook mm/slub.c:1569 [inline] >> slab_free_freelist_hook mm/slub.c:1608 [inline] >> slab_free mm/slub.c:3179 [inline] >> kfree+0x12c/0xc10 mm/slub.c:4176 >> perf_event_alloc.part.0+0xa0c/0x1350 kernel/events/core.c:11434 >> perf_event_alloc kernel/events/core.c:11733 [inline] >> __do_sys_perf_event_open kernel/events/core.c:11831 [inline] >> __se_sys_perf_event_open+0x550/0x15f4 kernel/events/core.c:11723 >> [...] >> > > Good catch! This should go stable tree too. > > Cc: stable@vger.kernel.org > Reviewed-by: Masami Hiramatsu (Google) Thanks! > > But I'm not sure what commit this is fixed. Maybe commit a4c35ed24112 > ("ftrace: Fix synchronization location disabling and freeing ftrace_ops"). I also can't find out the exact commit that was fixed. Hopefully Steve can give one. > Steve, can you add Fixed: ? > > Also, I found a typo below. > >> Signed-off-by: Li Huafei >> --- >> kernel/trace/ftrace.c | 14 +++++++++----- >> 1 file changed, 9 insertions(+), 5 deletions(-) >> >> diff --git a/kernel/trace/ftrace.c b/kernel/trace/ftrace.c >> index fbf2543111c0..4219cc2a04a6 100644 >> --- a/kernel/trace/ftrace.c >> +++ b/kernel/trace/ftrace.c >> @@ -3030,13 +3030,16 @@ int ftrace_shutdown(struct ftrace_ops *ops, int command) >> >> if (!command || !ftrace_enabled) { >> /* >> - * If these are dynamic or per_cpu ops, they still >> - * need their data freed. Since, function tracing is >> - * not currently active, we can just free them >> - * without synchronizing all CPUs. >> + * If these are dynamic, they still need their data freed. If >> + * function tracing is currently active, we neet to synchronize > ^need > Yes. Do I need to send a v2 patch? > Thank you! > >> + * all CPUs before we can release them. >> */ >> - if (ops->flags & FTRACE_OPS_FL_DYNAMIC) >> + if (ops->flags & FTRACE_OPS_FL_DYNAMIC) { >> + if (ftrace_enabled) >> + goto sync_rcu; >> + >> goto free_ops; >> + } >> >> return 0; >> } >> @@ -3083,6 +3086,7 @@ int ftrace_shutdown(struct ftrace_ops *ops, int command) >> * ops. >> */ >> if (ops->flags & FTRACE_OPS_FL_DYNAMIC) { >> + sync_rcu: >> /* >> * We need to do a hard force of sched synchronization. >> * This is because we use preempt_disable() to do RCU, but >> -- >> 2.17.1 >> > >