Received: by 2002:a6b:fb09:0:0:0:0:0 with SMTP id h9csp3205110iog; Mon, 27 Jun 2022 11:22:18 -0700 (PDT) X-Google-Smtp-Source: AGRyM1t04Nu88mlkk3YmxeDhZfHZkYmHlgodQEpbzSHbgYUnNm1BgjpF555TIRAagFjnrLl1Rwtj X-Received: by 2002:a17:906:4ccc:b0:6fe:9155:47ae with SMTP id q12-20020a1709064ccc00b006fe915547aemr13574924ejt.246.1656354138343; Mon, 27 Jun 2022 11:22:18 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1656354138; cv=none; d=google.com; s=arc-20160816; b=mAgokqWS8GL5DNCyAPOFjrZwGrrdiEQxUtHjBR4Co1V5STObJ6+2zhHAjOjqJsqGs8 DbsE/9+7nhoeXN9S7+87v4dHbOEqug5IP6NHbbEgPd60GvO5nuEnoHyCp4oZ+VGxOBwK uGqKogRfIeWjw5o+Owy4/nD9HL4iHnHQzL436cQZk9gcxXUQgP4XrMd4uxSHRfUTND2a ggGu5di2E53YapU/lGG9S2uQ2FMeJbDzwPGqgeL32mQ9f3aGx+QrU957eI7H5sOInLRH v2AZ0ZNoBAyqhUQFzlt7xT8BsSqaNfzeQ+vvvezjE2X7zHG+/2sK4f3WRya/CCdQTxBw twNg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=Ps6VaNQ8G9LWdGlOdA6zKBiegja0RukMzGRSe19NTKE=; b=UsF0O2V7r1f9GAD11BJqHBRzDSDYFOu3e8X4xPSPL9SqaNbKRu9BDknhurkmNfmGck lH+wZhL5+yMxK5+jO2E5mSRYYFBdgHamBrRd67SgOQHsUraaNVGnDMSVyuvwIrYpRUNG +E/eZTrvemIKK2V8vSf0gFrFOsPfbT0HVF9V48oPVt9v1vV+ZrcSTD9GKbQnItjaUk8U KRsRNcpnJjebrUn4XCudSGXTB1BUP5/dq+HOzYs87+Z7i5uHfMwjg3UlNqM7wOgPvjhz JPKV2SIPFAWJYLQStgpx42pNKTxnpKG72rtmmNLJJ1HU5Caqv6Aykd0QrTreU15Yaa6w 2S0w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=c18TZ+SI; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id e5-20020a17090658c500b0070b657ba50dsi13538700ejs.469.2022.06.27.11.21.53; Mon, 27 Jun 2022 11:22:18 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=c18TZ+SI; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S237239AbiF0SOM (ORCPT + 99 others); Mon, 27 Jun 2022 14:14:12 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:37220 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234890AbiF0SOL (ORCPT ); Mon, 27 Jun 2022 14:14:11 -0400 Received: from mail-pj1-x102c.google.com (mail-pj1-x102c.google.com [IPv6:2607:f8b0:4864:20::102c]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 05F38DF28 for ; Mon, 27 Jun 2022 11:14:10 -0700 (PDT) Received: by mail-pj1-x102c.google.com with SMTP id m14-20020a17090a668e00b001ee6ece8368so4006073pjj.3 for ; Mon, 27 Jun 2022 11:14:10 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=Ps6VaNQ8G9LWdGlOdA6zKBiegja0RukMzGRSe19NTKE=; b=c18TZ+SI4n0fPfnMLPBCE04N8KTcy4fSaI6exSXLoDMEHHFejyhSLrTYT/ITDeYefl DhMpvgQT5ZObcZ7x8r4WLAA+pl++t5sOrt+aYMeO4JlePpfI8+/wAydmviILMahkeM8W rq862g1nWuO2Ev0pD1tzeuPpSpuGVLulets+zexuHzg0jkWmX2xPWpWUUD4648OyxwFw JgpZzORtjlYwOrOdxrg9LQAd8KjPCV++xWkzFXGHFNCLi91eqWi6ISi2zt37Ucgnmmi5 OROjxhctvBCXCVywMfb479m4OtLa4/AnzegxSx4j2q85EiAYY+y7+q6BSMss62qXInkc mnZA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=Ps6VaNQ8G9LWdGlOdA6zKBiegja0RukMzGRSe19NTKE=; b=2/WxRjFWAGdYKlsCUn+iLQOdfOy+bFz1SdRFeSHkYdVUf47ZXduJlUDfvL7L5goI9H SUl2RAxfE1s9Ac2t8lKx1ttgZAq+CyrPlLWhF79XbWa9FAFUbslPkwxceVXHvNi/vtBC dMUXY+sU7OsGN6941sqA0tdqy/gd9+KBM/ZzSnp5IaAvLwywLT/u1KfeHZ4Dj9cNFZai yykC8kvse1svcacsvlSUON/wuLgHM8CmIWb6zKLaXOY4SfWKooZ47kchLwP4r/e/3IP/ hH5pEkAmv/vTMXSg+z69iB7R5ma1atywgmsM3g0Iy2REPY42owubgm+h+wB8MJSp2gNW xfSg== X-Gm-Message-State: AJIora8Mky9smML41GGiKTDB15hKH936NWr/zIUcZgqCkp037fouEDPP HaV0Or4xd/GWia8t1N+AHw2RYrddGJhoY53ByZI= X-Received: by 2002:a17:90b:4c49:b0:1ec:e8a2:b5f0 with SMTP id np9-20020a17090b4c4900b001ece8a2b5f0mr21806353pjb.21.1656353649412; Mon, 27 Jun 2022 11:14:09 -0700 (PDT) MIME-Version: 1.0 References: <0000000000003189f305e19f5d3e@google.com> <20220617165256.b5b9b257b30bc1dbb21fc8c6@linux-foundation.org> In-Reply-To: From: Yang Shi Date: Mon, 27 Jun 2022 11:13:57 -0700 Message-ID: Subject: Re: [syzbot] KASAN: null-ptr-deref Read in hugepage_vma_check To: "Zach O'Keefe" Cc: Matthew Wilcox , Andrew Morton , syzbot , Linux Kernel Mailing List , Linux MM , syzkaller-bugs Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,FREEMAIL_ENVFROM_END_DIGIT, FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Jun 27, 2022 at 9:11 AM Zach O'Keefe wrote: > > On 17 Jun 17:33, Yang Shi wrote: > > On Fri, Jun 17, 2022 at 5:14 PM Matthew Wilcox wrote: > > > > > > On Fri, Jun 17, 2022 at 04:52:56PM -0700, Andrew Morton wrote: > > > > On Fri, 17 Jun 2022 00:04:23 -0700 syzbot wrote: > > > > > > > > > Hello, > > > > > > > > > > syzbot found the following issue on: > > > > > > > > > > HEAD commit: c6d7e3b385f1 Add linux-next specific files for 20220616 > > > > > git tree: linux-next > > > > > console output: https://syzkaller.appspot.com/x/log.txt?x=10d9fb1bf00000 > > > > > kernel config: https://syzkaller.appspot.com/x/.config?x=9d495405e4a98620 > > > > > dashboard link: https://syzkaller.appspot.com/bug?extid=4d875b4d2e2b60bae9b4 > > > > > compiler: gcc (Debian 10.2.1-6) 10.2.1 20210110, GNU ld (GNU Binutils for Debian) 2.35.2 > > > > > > > > > > Unfortunately, I don't have any reproducer for this issue yet. > > > > > > > > > > IMPORTANT: if you fix the issue, please add the following tag to the commit: > > > > > Reported-by: syzbot+4d875b4d2e2b60bae9b4@syzkaller.appspotmail.com > > > > > > > > > > ================================================================== > > > > > BUG: KASAN: null-ptr-deref in instrument_atomic_read include/linux/instrumented.h:71 [inline] > > > > > BUG: KASAN: null-ptr-deref in test_bit include/asm-generic/bitops/instrumented-non-atomic.h:134 [inline] > > > > > BUG: KASAN: null-ptr-deref in hugepage_vma_check+0x8e/0x750 mm/huge_memory.c:82 > > > > > Read of size 8 at addr 00000000000005a8 by task syz-executor.5/21978 > > > > > > > > vma->vm_mm is NULL in show_smap->hugepage_vma_check(). > > > > > > > > Perhaps m_start() should be checking the mm after taking > > > > mmap_read_lock() instead of before? > > > > > > > > Matthew, you mucked with it last ;) Can you please take a look? > > > > > > I think this is Yang Shi's fault. The last patch in that series which > > > touches this area does: > > > > > > seq_printf(m, "THPeligible: %d\n", > > > - hugepage_vma_check(vma, vma->vm_flags, true)); > > > + hugepage_vma_check(vma, vma->vm_flags, true, false)); > > > > > > and hugepage_vma_check() does: > > > > > > if ((vm_flags & VM_NOHUGEPAGE) || > > > test_bit(MMF_DISABLE_THP, &vma->vm_mm->flags)) > > > return false; > > > > > > Before those patches, it calls transparent_hugepage_active() which > > > does not (afaict) touch vma->vm_mm. > > > > No, transparent_hugepage_active() also could touch vma->vm_mm. The > > above code was transhuge_vma_enabled() called by > > transparent_hugepage_active() before my patch, my patch just > > open-coded it. > > > > But hugepage_vma_check() does check vma in different order from > > transparent_hugepage_active(), so the old code may not reach it at > > all. > > > > And I don't think the mm could go away since it is pinned, but the > > returned vma may not belong to this mm, for example, gate vma which > > doesn't have a mm. Its size is PAGE_SIZE, so the old code returns > > earlier before touching mm. > > > > One possible solution is to move vma size check before touching mm, like: > > > > diff --git a/mm/huge_memory.c b/mm/huge_memory.c > > index 2e2a8b5bc567..1a9f15a89091 100644 > > --- a/mm/huge_memory.c > > +++ b/mm/huge_memory.c > > @@ -73,6 +73,16 @@ bool hugepage_vma_check(struct vm_area_struct *vma, > > unsigned long vm_flags, > > bool smaps, bool in_pf) > > { > > + /* > > + * Check alignment for file vma and size for both file and anon vma. > > + * > > + * Skip the check for page fault. Huge fault does the check in fault > > + * handlers. And this check is not suitable for huge PUD fault. > > + */ > > + if (!in_pf && > > + !transhuge_vma_suitable(vma, (vma->vm_end - HPAGE_PMD_SIZE))) > > + return false; > > + > > /* > > * Explicitly disabled through madvise or prctl, or some > > * architectures may disable THP for some mappings, for > > @@ -99,16 +109,6 @@ bool hugepage_vma_check(struct vm_area_struct *vma, > > if (vm_flags & VM_NO_KHUGEPAGED) > > return false; > > > > - /* > > - * Check alignment for file vma and size for both file and anon vma. > > - * > > - * Skip the check for page fault. Huge fault does the check in fault > > - * handlers. And this check is not suitable for huge PUD fault. > > - */ > > - if (!in_pf && > > - !transhuge_vma_suitable(vma, (vma->vm_end - HPAGE_PMD_SIZE))) > > - return false; > > - > > /* > > * Enabled via shmem mount options or sysfs settings. > > * Must be done before hugepage flags check since shmem has its > > > > Or we may have to check gate vma explicitly. > > > > I think this is already fixed already with Andrew's fix in mm-unstable: > > diff --git a/mm/huge_memory.c b/mm/huge_memory.c > index df04c0057271..fbe02815c41a 100644 > --- a/mm/huge_memory.c > +++ b/mm/huge_memory.c > @@ -73,6 +73,9 @@ bool hugepage_vma_check(struct vm_area_struct *vma, > unsigned long vm_flags, > bool smaps) > { > + if (!vma->vm_mm) > + return false; > + > if (!transhuge_vma_enabled(vma, vm_flags)) > return false; > > Which isn't included in the tree testing here. IMHO this is cleaner than messing > with an already subtle ordering. Yes, we just need to keep one. And I agree Andrew's fix is much simpler. > > > > > > > > I haven't been paying attention to that patch series, so I don't have > > > a constructive suggestion for how to fix. > > > > > > > > CPU: 0 PID: 21978 Comm: syz-executor.5 Not tainted 5.19.0-rc2-next-20220616-syzkaller #0 > > > > > Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011 > > > > > Call Trace: > > > > > > > > > > __dump_stack lib/dump_stack.c:88 [inline] > > > > > dump_stack_lvl+0xcd/0x134 lib/dump_stack.c:106 > > > > > kasan_report+0xbe/0x1f0 mm/kasan/report.c:495 > > > > > check_region_inline mm/kasan/generic.c:183 [inline] > > > > > kasan_check_range+0x13d/0x180 mm/kasan/generic.c:189 > > > > > instrument_atomic_read include/linux/instrumented.h:71 [inline] > > > > > test_bit include/asm-generic/bitops/instrumented-non-atomic.h:134 [inline] > > > > > hugepage_vma_check+0x8e/0x750 mm/huge_memory.c:82 > > > > > show_smap+0x1c6/0x470 fs/proc/task_mmu.c:866 > > > > > traverse.part.0+0xcf/0x5f0 fs/seq_file.c:111 > > > > > traverse fs/seq_file.c:101 [inline] > > > > > seq_read_iter+0x90f/0x1280 fs/seq_file.c:195 > > > > > seq_read+0x337/0x4b0 fs/seq_file.c:162 > > > > > do_loop_readv_writev fs/read_write.c:763 [inline] > > > > > do_loop_readv_writev fs/read_write.c:750 [inline] > > > > > do_iter_read+0x4f8/0x750 fs/read_write.c:805 > > > > > vfs_readv+0xe5/0x150 fs/read_write.c:923 > > > > > do_preadv fs/read_write.c:1015 [inline] > > > > > __do_sys_preadv fs/read_write.c:1065 [inline] > > > > > __se_sys_preadv fs/read_write.c:1060 [inline] > > > > > __x64_sys_preadv+0x22b/0x310 fs/read_write.c:1060 > > > > > do_syscall_x64 arch/x86/entry/common.c:50 [inline] > > > > > do_syscall_64+0x35/0xb0 arch/x86/entry/common.c:80 > > > > > entry_SYSCALL_64_after_hwframe+0x46/0xb0 > > > > > RIP: 0033:0x7f5c1d889109 > > > > > Code: ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 b8 ff ff ff f7 d8 64 89 01 48 > > > > > RSP: 002b:00007f5c1ea69168 EFLAGS: 00000246 ORIG_RAX: 0000000000000127 > > > > > RAX: ffffffffffffffda RBX: 00007f5c1d99bf60 RCX: 00007f5c1d889109 > > > > > RDX: 0000000000000001 RSI: 00000000200006c0 RDI: 0000000000000005 > > > > > RBP: 00007f5c1d8e305d R08: 0000000000000000 R09: 0000000000000000 > > > > > R10: 00000000fffffffe R11: 0000000000000246 R12: 0000000000000000 > > > > > R13: 00007ffd8514b43f R14: 00007f5c1ea69300 R15: 0000000000022000 > > > > > > > > > > ================================================================== > > > > > > > > > > > > > > > --- > > > > > This report is generated by a bot. It may contain errors. > > > > > See https://goo.gl/tpsmEJ for more information about syzbot. > > > > > syzbot engineers can be reached at syzkaller@googlegroups.com. > > > > > > > > > > syzbot will keep track of this issue. See: > > > > > https://goo.gl/tpsmEJ#status for how to communicate with syzbot. > >