Date: Mon, 20 Nov 2023 18:36:59 -0500
From: Kent Overstreet
To: Peter Zijlstra, Ingo Molnar, Will Deacon, Waiman Long, Boqun Feng
Cc: linux-kernel@vger.kernel.org
Subject: lockdep + kasan bug?
Message-ID: <20231120233659.e36txv3fedbjn4sx@moria.home.lan>

I've been seeing a lot of reports like the following in my lockdep + kasan
tests.

Some lockdep patches are in my tree; they don't touch this code path (except
that I do have to increase MAX_LOCK_DEPTH from 48 to 63; perhaps that has
unintended side effects?):

https://evilpiepirate.org/git/bcachefs.git/log/?id=2f42f415f7573001b4f4887b785d8a8747b3757f

bcachefs does take a _large_ number of locks for lockdep to track, which is
also possibly relevant.

I have not dug into the lockdep code for the hash table of outstanding locks
yet, but I'm happy to test patches...

04752 ========= TEST tiering_variable_buckets_replicas
04752
04752 WATCHDOG 3600
04753 bcachefs (ea667958-8bbd-451b-9043-9132a2fd2fa4): mounting version 1.3: rebalance_work opts=metadata_replicas=2,data_replicas=2,foreground_target=ssd,background_target=hdd,promote_target=ssd,fsck
04753 bcachefs (ea667958-8bbd-451b-9043-9132a2fd2fa4): initializing new filesystem
04753 bcachefs (ea667958-8bbd-451b-9043-9132a2fd2fa4): going read-write
04753 bcachefs (ea667958-8bbd-451b-9043-9132a2fd2fa4): marking superblocks
04753 bcachefs (ea667958-8bbd-451b-9043-9132a2fd2fa4): initializing freespace
04753 bcachefs (ea667958-8bbd-451b-9043-9132a2fd2fa4): done initializing freespace
04753 bcachefs (ea667958-8bbd-451b-9043-9132a2fd2fa4): reading snapshots table
04753 bcachefs (ea667958-8bbd-451b-9043-9132a2fd2fa4): reading snapshots done
04753 WATCHDOG 3600
04753 randrw: (g=0): rw=randrw, bs=(R) 4096B-1024KiB, (W) 4096B-1024KiB, (T) 4096B-1024KiB, ioengine=libaio, iodepth=64
04753 fio-3.33
04753 Starting 1 process
04753 randrw: Laying out IO file (1 file / 3500MiB)
05117 Jobs: 1 (f=1)
05117 BUG: KASAN: global-out-of-bounds in add_chain_block+0x44/0x288
05117 Read of size 4 at addr ffffffc081b7a8bc by task fio/120528
05117
05117 CPU: 11 PID: 120528 Comm: fio Tainted: G L 6.6.0-ktest-gc18b7260ddd3 #8209
05117 Hardware name: linux,dummy-virt (DT)
05117 Call trace:
05117  dump_backtrace+0xa8/0xe8
05117  show_stack+0x1c/0x30
05117  dump_stack_lvl+0x5c/0xa0
05117  print_report+0x1e4/0x5a0
05117  kasan_report+0x80/0xc0
05117  __asan_load4+0x90/0xb0
05117  add_chain_block+0x44/0x288
05117  __lock_acquire+0x1104/0x24f8
05117  lock_acquire+0x1e0/0x470
05117  _raw_spin_lock_nested+0x54/0x78
05117  raw_spin_rq_lock_nested+0x30/0x50
05117  try_to_wake_up+0x3b4/0x1050
05117  wake_up_process+0x1c/0x30
05117  kick_pool+0x104/0x1b0
05117  __queue_work+0x350/0xa58
05117  queue_work_on+0x98/0xd0
05117  __bch2_btree_node_write+0xec0/0x10a0
05117  bch2_btree_node_write+0x88/0x138
05117  btree_split+0x744/0x14a0
05117  bch2_btree_split_leaf+0x94/0x258
05117  bch2_trans_commit_error.isra.0+0x234/0x7d0
05117  __bch2_trans_commit+0x1128/0x3010
05117  bch2_extent_update+0x410/0x570
05117  bch2_write_index_default+0x404/0x598
05117  __bch2_write_index+0xb0/0x3b0
05117  __bch2_write+0x6f0/0x928
05117  bch2_write+0x368/0x8e0
05117  bch2_direct_write+0xaa8/0x12c0
05117  bch2_write_iter+0x2e4/0x1050
05117  aio_write.constprop.0+0x19c/0x420
05117  io_submit_one.constprop.0+0xf30/0x17a0
05117  __arm64_sys_io_submit+0x244/0x388
05117  invoke_syscall.constprop.0+0x64/0x138
05117  do_el0_svc+0x7c/0x120
05117  el0_svc+0x34/0x80
05117  el0t_64_sync_handler+0xb8/0xc0
05117  el0t_64_sync+0x14c/0x150
05117
05117 The buggy address belongs to the variable:
05117  nr_large_chain_blocks+0x3c/0x40
05117
05117 The buggy address belongs to the virtual mapping at
05117  [ffffffc081710000, ffffffc088861000) created by:
05117  paging_init+0x260/0x820
05117
05117 The buggy address belongs to the physical page:
05117 page:00000000ce625900 refcount:1 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x41d7a
05117 flags: 0x4000(reserved|zone=0)
05117 page_type: 0xffffffff()
05117 raw: 0000000000004000 fffffffe00075e88 fffffffe00075e88 0000000000000000
05117 raw: 0000000000000000 0000000000000000 00000001ffffffff 0000000000000000
05117 page dumped because: kasan: bad access detected
05117
05117 Memory state around the buggy address:
05117  ffffffc081b7a780: 00 f9 f9 f9 f9 f9 f9 f9 00 f9 f9 f9 f9 f9 f9 f9
05117  ffffffc081b7a800: 00 f9 f9 f9 f9 f9 f9 f9 04 f9 f9 f9 f9 f9 f9 f9
05117 >ffffffc081b7a880: 04 f9 f9 f9 f9 f9 f9 f9 00 00 00 00 00 00 00 00
05117                                         ^
05117  ffffffc081b7a900: f9 f9 f9 f9 04 f9 f9 f9 f9 f9 f9 f9 04 f9 f9 f9
05117  ffffffc081b7a980: f9 f9 f9 f9 04 f9 f9 f9 f9 f9 f9 f9 00 f9 f9 f9
05117 ==================================================================
05117 Kernel panic - not syncing: kasan.fault=panic set ...
05117 CPU: 11 PID: 120528 Comm: fio Tainted: G L 6.6.0-ktest-gc18b7260ddd3 #8209
05117 Hardware name: linux,dummy-virt (DT)
05117 Call trace:
05117  dump_backtrace+0xa8/0xe8
05117  show_stack+0x1c/0x30
05117  dump_stack_lvl+0x5c/0xa0
05117  dump_stack+0x18/0x20
05117  panic+0x3ac/0x408
05117  kasan_report_invalid_free+0x0/0x90
05117  kasan_report+0x90/0xc0
05117  __asan_load4+0x90/0xb0
05117  add_chain_block+0x44/0x288
05117  __lock_acquire+0x1104/0x24f8
05117  lock_acquire+0x1e0/0x470
05117  _raw_spin_lock_nested+0x54/0x78
05117  raw_spin_rq_lock_nested+0x30/0x50
05117  try_to_wake_up+0x3b4/0x1050
05117  wake_up_process+0x1c/0x30
05117  kick_pool+0x104/0x1b0
05117  __queue_work+0x350/0xa58
05117  queue_work_on+0x98/0xd0
05117  __bch2_btree_node_write+0xec0/0x10a0
05117  bch2_btree_node_write+0x88/0x138
05117  btree_split+0x744/0x14a0
05117  bch2_btree_split_leaf+0x94/0x258
05117  bch2_trans_commit_error.isra.0+0x234/0x7d0
05117  __bch2_trans_commit+0x1128/0x3010
05117  bch2_extent_update+0x410/0x570
05117  bch2_write_index_default+0x404/0x598
05117  __bch2_write_index+0xb0/0x3b0
05117  __bch2_write+0x6f0/0x928
05117  bch2_write+0x368/0x8e0
05117  bch2_direct_write+0xaa8/0x12c0
05117  bch2_write_iter+0x2e4/0x1050
05117  aio_write.constprop.0+0x19c/0x420
05117  io_submit_one.constprop.0+0xf30/0x17a0
05117  __arm64_sys_io_submit+0x244/0x388
05117  invoke_syscall.constprop.0+0x64/0x138
05117  do_el0_svc+0x7c/0x120
05117  el0_svc+0x34/0x80
05117  el0t_64_sync_handler+0xb8/0xc0
05117  el0t_64_sync+0x14c/0x150
05117 SMP: stopping secondary CPUs
05117 Kernel Offset: disabled
05117 CPU features: 0x0,00000000,70000001,1040500b
05117 Memory Limit: none
05117 ---[ end Kernel panic - not syncing: kasan.fault=panic set ... ]---
05122 ========= FAILED TIMEOUT tiering_variable_buckets_replicas in 3600s