Received: by 2002:a05:6a10:9848:0:0:0:0 with SMTP id x8csp971397pxf; Wed, 7 Apr 2021 16:29:10 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyi7+NOdtWfR6/hoev696N172rAC0Q9HIq/M5qSoXHXbMCkzmtdxl9n/k2DbtVbXVI1JbE5 X-Received: by 2002:aa7:c907:: with SMTP id b7mr7647989edt.37.1617838149903; Wed, 07 Apr 2021 16:29:09 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1617838149; cv=none; d=google.com; s=arc-20160816; b=OWqfIp+hn0OZhkdogH+5kUwh0uWmyedLn2YOgcR9Wegn/+gePO5dlBRiD0YbHeFazs /GylTBvneVQfbuZ5Rxwe9mUTiVI636A+qOZ8vCeCmBxWXhqbE884k2k8wboU0PdQs1GC zZjRScYrg+xbn6yRoeMPsCvPhG5U//zKL6VbQ2UoL23nCE9rrewc0hqP0wyhv/ZHNksY K4+KpQa79/9kx+u6CumcR78BGhEo8SdQ52jd7IHoi9db3mG8AnKuE5fQTNE+kBx4/vpS br+xgEENLQDDCn+LQNq+OB/Z50Rn1l0PCeeX51Yil5A9Sp2hTVWUjdLLtbDC85jFTsRJ zMmg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=hOjU73yvB4ojZ+sZPOFXGqV9d/iCRTNwxzy+KbuVvK8=; b=PufnL45HoT1zqmzbRYGAgyVbFBoxPa8u9V+3Ezz4XrHtwfdNrEqjDQdM/njWt+kpwb gbO02O0vxck2W70Vn9+LlM6pw2i/UkEWYDehu7OZpkTMZB+YSQAxf/CI/wORhEcLpKKy H+bgC8ALM+4bCf/0or3oXmMEHW0lXA+jnHOdRihTGo5FUImyms6gxuRoLEznc3UBxtWI zoqy5I8W2SGC+fgcoLcgUzlaCbGbm0Xt/ITPUh0uUqROx/0IyMFz8M7KVzX/ncZnGITo ahGttbpOVvQ21aIGGx5eB23wPvfYwcvwA8wfn+JKzD+7WvBYGF9hCsSigaGsdJFdEtdn OSKQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=dXKcc6q8; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id n13si4015202ejz.175.2021.04.07.16.28.44; Wed, 07 Apr 2021 16:29:09 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=dXKcc6q8; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229825AbhDGXZD (ORCPT + 99 others); Wed, 7 Apr 2021 19:25:03 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:52972 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229607AbhDGXZC (ORCPT ); Wed, 7 Apr 2021 19:25:02 -0400 Received: from mail-pj1-x1031.google.com (mail-pj1-x1031.google.com [IPv6:2607:f8b0:4864:20::1031]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 67E77C061760; Wed, 7 Apr 2021 16:24:51 -0700 (PDT) Received: by mail-pj1-x1031.google.com with SMTP id lr1-20020a17090b4b81b02900ea0a3f38c1so3832517pjb.0; Wed, 07 Apr 2021 16:24:51 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to; bh=hOjU73yvB4ojZ+sZPOFXGqV9d/iCRTNwxzy+KbuVvK8=; b=dXKcc6q8GjUkbkJ/95poYFIObjLfRnwF6XNFwVLYjt+18e1joftKiBSSUAdHDppQao HqIsB8VSne3oSENxvv2Gr9iJSrKytvDtDerLRfeV6rqHSAj4qB6xEswq8Bf3OCSdiyR9 9rjhNTa+bMYIC8vyLQGRyr9kiLwOKV4t6/uhIQACtWP3ZoEdA3SlJveZJBqGQgrU8AQw x9tqqjUQWZkKjuBAUbs2Bq5fFaTL3qav/DybkVuPyApeq3dNBfiRYhpwLo5Knt4/DDdW l3Sboc/FDp4lzND3O5PtN5uKRlHBtg8NmW3PGeTwR+l7Er0Ye8oPkvhdxBeOc1aioaaI nRsg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=hOjU73yvB4ojZ+sZPOFXGqV9d/iCRTNwxzy+KbuVvK8=; b=JKVLu5ccrwXSyfFZcEjD/lsKfEzQUxXo+bI8sYgBVIbWm2kQAxOHr9kJFaQaPXgjz4 Dzu3rmZSSEpNwiRWmLtnBuVLljBDWEUV6BmM8exqUdBtN/itAnLCfOwyUc2kqULb842N mM53cFc4LNulCvt9sl7Lhk2WVTJhtM7m7pmvFCGy26h7uu5CUf3B2nCl4gkPgUxSrPx2 y+lrINc2OPFccFeoFq6A1tb20zP+hc9VhhCZ/7PkpIYo4NFJLZod4gREt9zD6tLOGfuM mQBUYw9NMzG2I6rGFGzO6Si6nJgG/X5v+B2WFuFLvrpXUaR7Vdsba5pvj7pSUwYFOZGQ 3eaA== X-Gm-Message-State: AOAM532nTq0ZbXEHkXOc+xqwyTUaicb6B4Lri/HLjDbxIE6f16Mc6lAG Gs69bVrOvr6w6OEKP9FBJ9I= X-Received: by 2002:a17:90a:73c2:: with SMTP id n2mr5514651pjk.170.1617837890735; Wed, 07 Apr 2021 16:24:50 -0700 (PDT) Received: from nuc10 (104.36.148.139.aurocloud.com. [104.36.148.139]) by smtp.gmail.com with ESMTPSA id mu6sm6123970pjb.35.2021.04.07.16.24.46 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 07 Apr 2021 16:24:50 -0700 (PDT) Date: Wed, 7 Apr 2021 16:24:44 -0700 From: Rustam Kovhaev To: Dmitry Vyukov , andrii@kernel.org Cc: syzbot , andrii@kernel.org, Alexei Starovoitov , bpf , Daniel Borkmann , John Fastabend , Martin KaFai Lau , KP Singh , LKML , netdev , Song Liu , syzkaller-bugs , Yonghong Song , Greg Kroah-Hartman Subject: Re: memory leak in bpf Message-ID: References: <000000000000911d3905b459824c@google.com> <000000000000e56a2605b616b2d9@google.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Mar 01, 2021 at 09:43:00PM +0100, Dmitry Vyukov wrote: > On Mon, Mar 1, 2021 at 9:39 PM Rustam Kovhaev wrote: > > > > On Mon, Mar 01, 2021 at 08:05:42PM +0100, Dmitry Vyukov wrote: > > > On Mon, Mar 1, 2021 at 5:21 PM Rustam Kovhaev wrote: > > > > > > > > On Wed, Dec 09, 2020 at 10:58:10PM -0800, syzbot wrote: > > > > > syzbot has found a reproducer for the following issue on: > > > > > > > > > > HEAD commit: a68a0262 mm/madvise: remove racy mm ownership check > > > > > git tree: upstream > > > > > console output: https://syzkaller.appspot.com/x/log.txt?x=11facf17500000 > > > > > kernel config: https://syzkaller.appspot.com/x/.config?x=4305fa9ea70c7a9f > > > > > dashboard link: https://syzkaller.appspot.com/bug?extid=f3694595248708227d35 > > > > > compiler: gcc (GCC) 10.1.0-syz 20200507 > > > > > syz repro: https://syzkaller.appspot.com/x/repro.syz?x=159a9613500000 > > > > > C reproducer: https://syzkaller.appspot.com/x/repro.c?x=11bf7123500000 > > > > > > > > > > IMPORTANT: if you fix the issue, please add the following tag to the commit: > > > > > Reported-by: syzbot+f3694595248708227d35@syzkaller.appspotmail.com > > > > > > > > > > Debian GNU/Linux 9 syzkaller ttyS0 > > > > > Warning: Permanently added '10.128.0.9' (ECDSA) to the list of known hosts. > > > > > executing program > > > > > executing program > > > > > executing program > > > > > BUG: memory leak > > > > > unreferenced object 0xffff88810efccc80 (size 64): > > > > > comm "syz-executor334", pid 8460, jiffies 4294945724 (age 13.850s) > > > > > hex dump (first 32 bytes): > > > > > c0 cb 14 04 00 ea ff ff c0 c2 11 04 00 ea ff ff ................ > > > > > c0 56 3f 04 00 ea ff ff 40 18 38 04 00 ea ff ff .V?.....@.8..... > > > > > backtrace: > > > > > [<0000000036ae98a7>] kmalloc_node include/linux/slab.h:575 [inline] > > > > > [<0000000036ae98a7>] bpf_ringbuf_area_alloc kernel/bpf/ringbuf.c:94 [inline] > > > > > [<0000000036ae98a7>] bpf_ringbuf_alloc kernel/bpf/ringbuf.c:135 [inline] > > > > > [<0000000036ae98a7>] ringbuf_map_alloc kernel/bpf/ringbuf.c:183 [inline] > > > > > [<0000000036ae98a7>] ringbuf_map_alloc+0x1be/0x410 kernel/bpf/ringbuf.c:150 > > > > > [<00000000d2cb93ae>] find_and_alloc_map kernel/bpf/syscall.c:122 [inline] > > > > > [<00000000d2cb93ae>] map_create kernel/bpf/syscall.c:825 [inline] > > > > > [<00000000d2cb93ae>] __do_sys_bpf+0x7d0/0x30a0 kernel/bpf/syscall.c:4381 > > > > > [<000000008feaf393>] do_syscall_64+0x2d/0x70 arch/x86/entry/common.c:46 > > > > > [<00000000e1f53cfd>] entry_SYSCALL_64_after_hwframe+0x44/0xa9 > > > > > > > > > > > > > > > > > > i am pretty sure that this one is a false positive > > > > the problem with reproducer is that it does not terminate all of the > > > > child processes that it spawns > > > > > > > > i confirmed that it is a false positive by tracing __fput() and > > > > bpf_map_release(), i ran reproducer, got kmemleak report, then i > > > > manually killed those running leftover processes from reproducer and > > > > then both functions were executed and memory was freed > > > > > > > > i am marking this one as: > > > > #syz invalid > > > > > > Hi Rustam, > > > > > > Thanks for looking into this. > > > > > > I wonder how/where are these objects referenced? If they are not > > > leaked and referenced somewhere, KMEMLEAK should not report them as > > > leaks. > > > So even if this is a false positive for BPF, this is a true positive > > > bug and something to fix for KMEMLEAK ;) > > > And syzbot will probably re-create this bug report soon as this still > > > happens and is not a one-off thing. > > > > hi Dmitry, i haven't thought of it this way, but i guess you are right, > > it is a kmemleak bug, ideally kmemleak should be aware that there are > > still running processes holding references to bpf fd/anonymous inodes > > which in their turn hold references to allocated bpf maps > > KMEMLEAK scans whole memory, so if there are pointers to the object > anywhere in memory, KMEMLEAK should not report them as leaked. Running > processes have no direct effect on KMEMLEAK logic. > So the question is: where are these pointers to these objects? If we > answer this, we can check how/why KMEMLEAK misses them. Are they > mangled in some way? thank you for your comments, they make sense, and indeed, the pointer gets vmaped. i should have looked into this sooner, becaused syzbot did trigger the issue again, and Andrii had to look into the same bug, sorry about that. if i am understanding this correctly here is what the fix should be: --- kernel/bpf/ringbuf.c | 2 ++ 1 file changed, 2 insertions(+) diff --git a/kernel/bpf/ringbuf.c b/kernel/bpf/ringbuf.c index f25b719ac786..30400e74abe2 100644 --- a/kernel/bpf/ringbuf.c +++ b/kernel/bpf/ringbuf.c @@ -8,6 +8,7 @@ #include #include #include +#include #include #define RINGBUF_CREATE_FLAG_MASK (BPF_F_NUMA_NODE) @@ -105,6 +106,7 @@ static struct bpf_ringbuf *bpf_ringbuf_area_alloc(size_t data_sz, int numa_node) rb = vmap(pages, nr_meta_pages + 2 * nr_data_pages, VM_ALLOC | VM_USERMAP, PAGE_KERNEL); if (rb) { + kmemleak_not_leak((void *) pages); rb->pages = pages; rb->nr_pages = nr_pages; return rb; -- 2.30.2