2023-11-03 19:19:02

by Mikhail Gavrilov

[permalink] [raw]
Subject: 6.7/kasan/regression/bisected: mt7921_regd_notifier+0x3e2

Hi,
another release cycle, and another regression.
Yesterday after another kernel update in Fedora Rawhide system stopped
entering in graphic mode.
In kernel log appears such backtrace:
[ 19.431838] ==================================================================
[ 19.431843] BUG: KASAN: null-ptr-deref in
amdgpu_ras_reset_error_count+0x2d6/0x3e0 [amdgpu]
[ 19.432274] Read of size 4 at addr 0000000000000178 by task (udev-worker)/501

[ 19.432283] CPU: 8 PID: 501 Comm: (udev-worker) Tainted: G W
L ------- ---
6.7.0-0.rc0.20231102git21e80f3841c0.4.fc40.x86_64+debug #1
[ 19.432292] Hardware name: ASUSTeK COMPUTER INC. ROG Strix
G513QY_G513QY/G513QY, BIOS G513QY.331 02/24/2023
[ 19.432298] Call Trace:
[ 19.432302] <TASK>
[ 19.432305] dump_stack_lvl+0x76/0xd0
[ 19.432313] kasan_report+0xa6/0xe0
[ 19.432320] ? amdgpu_ras_reset_error_count+0x2d6/0x3e0 [amdgpu]
[ 19.432443] kasan_check_range+0x105/0x1b0
[ 19.432443] amdgpu_ras_reset_error_count+0x2d6/0x3e0 [amdgpu]
[ 19.432443] gmc_v9_0_late_init+0xcf/0x1b0 [amdgpu]
[ 19.432443] amdgpu_device_ip_late_init+0x103/0x7b0 [amdgpu]
[ 19.432443] amdgpu_device_init+0x7b33/0x8a90 [amdgpu]
[ 19.432443] ? __pfx_amdgpu_device_init+0x10/0x10 [amdgpu]
[ 19.432443] ? __pfx_pci_bus_read_config_word+0x10/0x10
[ 19.432443] ? do_pci_enable_device+0x22d/0x2a0
[ 19.432443] ? pci_update_current_state+0x1/0x1f0
[ 19.432443] ? _raw_spin_unlock_irqrestore+0x66/0x80
[ 19.432443] ? lockdep_hardirqs_on+0x81/0x110
[ 19.432443] ? __kasan_check_byte+0x13/0x50
[ 19.432443] amdgpu_driver_load_kms+0x1d/0x4b0 [amdgpu]
[ 19.432443] amdgpu_pci_probe+0x282/0xac0 [amdgpu]
[ 19.432443] ? __pfx_amdgpu_pci_probe+0x10/0x10 [amdgpu]
[ 19.432443] local_pci_probe+0xdd/0x190
[ 19.432443] pci_device_probe+0x23a/0x780
[ 19.432443] ? kernfs_add_one+0x326/0x490
[ 19.432443] ? kernfs_get.part.0+0x4c/0x70
[ 19.432443] ? __pfx_pci_device_probe+0x10/0x10
[ 19.432443] ? kernfs_create_link+0x16b/0x230
[ 19.432443] ? kernfs_put+0x1c/0x40
[ 19.432443] ? sysfs_do_create_link_sd+0x8e/0x100
[ 19.432443] really_probe+0x3e2/0xb80
[ 19.432443] __driver_probe_device+0x18c/0x450
[ 19.432443] driver_probe_device+0x4a/0x120
[ 19.432443] __driver_attach+0x1e5/0x4a0
[ 19.432443] ? __pfx___driver_attach+0x10/0x10
[ 19.432443] bus_for_each_dev+0x109/0x190
[ 19.432443] ? __pfx_bus_for_each_dev+0x10/0x10
[ 19.432443] bus_add_driver+0x2a1/0x570
[ 19.432443] driver_register+0x134/0x460
[ 19.432443] ? __pfx_amdgpu_init+0x10/0x10 [amdgpu]
[ 19.432443] do_one_initcall+0xd6/0x430
[ 19.432443] ? __pfx_do_one_initcall+0x10/0x10
[ 19.432443] ? kasan_unpoison+0x44/0x70
[ 19.432443] do_init_module+0x238/0x770
[ 19.432443] load_module+0x5581/0x6f10
[ 19.432443] ? __pfx_load_module+0x10/0x10
[ 19.432443] ? local_clock_noinstr+0x45/0xc0
[ 19.432443] ? __might_fault+0xc6/0x180
[ 19.432443] ? __pfx___might_resched+0x10/0x10
[ 19.432443] ? __do_sys_init_module+0x1f2/0x220
[ 19.432443] __do_sys_init_module+0x1f2/0x220
[ 19.432443] ? __pfx___do_sys_init_module+0x10/0x10
[ 19.432443] do_syscall_64+0x64/0xe0
[ 19.432443] ? asm_exc_page_fault+0x26/0x30
[ 19.432443] ? lockdep_hardirqs_on+0x81/0x110
[ 19.432443] entry_SYSCALL_64_after_hwframe+0x6e/0x76
[ 19.432443] RIP: 0033:0x7f9aac99f22e
[ 19.432443] Code: 48 8b 0d 05 1c 0c 00 f7 d8 64 89 01 48 83 c8 ff
c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 49 89 ca b8 af 00 00
00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d d2 1b 0c 00 f7 d8 64 89
01 48
[ 19.432443] RSP: 002b:00007ffca4d0f2e8 EFLAGS: 00000246 ORIG_RAX:
00000000000000af
[ 19.432443] RAX: ffffffffffffffda RBX: 0000561575417f90 RCX: 00007f9aac99f22e
[ 19.432443] RDX: 0000561575423840 RSI: 00000000041c7e9e RDI: 00007f9aa65d3010
[ 19.432443] RBP: 00007ffca4d0f3a0 R08: 00005615753e5010 R09: 0000000000000007
[ 19.432443] R10: 0000000000000002 R11: 0000000000000246 R12: 0000561575423840
[ 19.432443] R13: 0000000000020000 R14: 00005615753e9190 R15: 0000561575406f40
[ 19.432443] </TASK>
[ 19.432443] ==================================================================
[ 19.435775] Disabling lock debugging due to kernel taint
[ 19.435787] general protection fault, probably for non-canonical
address 0xdffffc000000002f: 0000 [#1] PREEMPT SMP KASAN NOPTI
[ 19.435794] KASAN: null-ptr-deref in range
[0x0000000000000178-0x000000000000017f]
[ 19.435799] CPU: 8 PID: 501 Comm: (udev-worker) Tainted: G B W
L ------- ---
6.7.0-0.rc0.20231102git21e80f3841c0.4.fc40.x86_64+debug #1
[ 19.435807] Hardware name: ASUSTeK COMPUTER INC. ROG Strix
G513QY_G513QY/G513QY, BIOS G513QY.331 02/24/2023
[ 19.435813] RIP: 0010:amdgpu_ras_reset_error_count+0x2ec/0x3e0 [amdgpu]
[ 19.436132] Code: 00 00 00 48 8d b8 78 01 00 00 48 89 7c 24 08 e8
9a e3 26 c5 48 8b 7c 24 08 48 b8 00 00 00 00 00 fc ff df 48 89 f9 48
c1 e9 03 <0f> b6 0c 01 48 89 f8 83 e0 07 83 c0 03 38 c8 7c 04 84 c9 75
24 48
[ 19.436142] RSP: 0018:ffffc9000312f360 EFLAGS: 00010212
[ 19.436147] RAX: dffffc0000000000 RBX: ffff8881c6000000 RCX: 000000000000002f
[ 19.436152] RDX: fffffbfff1743de9 RSI: 0000000000000008 RDI: 0000000000000178
[ 19.436156] RBP: ffffffffc1f9e840 R08: 0000000000000001 R09: fffffbfff1743de8
[ 19.436160] R10: ffffffff8ba1ef47 R11: 0000000000000000 R12: 0000000000000006
[ 19.436165] R13: ffffffffc1f9e890 R14: ffff8881c604ea88 R15: 0000000000000000
[ 19.436169] FS: 00007f9aabfb0980(0000) GS:ffff888f8e200000(0000)
knlGS:0000000000000000
[ 19.436175] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 19.436179] CR2: 00007f73247fc010 CR3: 0000000168d50000 CR4: 0000000000f50ef0
[ 19.436184] PKRU: 55555554
[ 19.436186] Call Trace:
[ 19.436189] <TASK>
[ 19.436192] ? die_addr+0x40/0xa0
[ 19.436198] ? exc_general_protection+0x15c/0x240
[ 19.436204] ? asm_exc_general_protection+0x26/0x30
[ 19.436211] ? amdgpu_ras_reset_error_count+0x2ec/0x3e0 [amdgpu]
[ 19.436527] gmc_v9_0_late_init+0xcf/0x1b0 [amdgpu]
[ 19.436771] amdgpu_device_ip_late_init+0x103/0x7b0 [amdgpu]
[ 19.436771] amdgpu_device_init+0x7b33/0x8a90 [amdgpu]
[ 19.436771] ? __pfx_amdgpu_device_init+0x10/0x10 [amdgpu]
[ 19.436771] ? __pfx_pci_bus_read_config_word+0x10/0x10
[ 19.436771] ? do_pci_enable_device+0x22d/0x2a0
[ 19.436771] ? pci_update_current_state+0x1/0x1f0
[ 19.436771] ? _raw_spin_unlock_irqrestore+0x66/0x80
[ 19.436771] ? lockdep_hardirqs_on+0x81/0x110
[ 19.436771] ? __kasan_check_byte+0x13/0x50
[ 19.436771] amdgpu_driver_load_kms+0x1d/0x4b0 [amdgpu]
[ 19.436771] amdgpu_pci_probe+0x282/0xac0 [amdgpu]
[ 19.436771] ? __pfx_amdgpu_pci_probe+0x10/0x10 [amdgpu]
[ 19.436771] local_pci_probe+0xdd/0x190
[ 19.436771] pci_device_probe+0x23a/0x780
[ 19.436771] ? kernfs_add_one+0x326/0x490
[ 19.436771] ? kernfs_get.part.0+0x4c/0x70
[ 19.436771] ? __pfx_pci_device_probe+0x10/0x10
[ 19.436771] ? kernfs_create_link+0x16b/0x230
[ 19.436771] ? kernfs_put+0x1c/0x40
[ 19.436771] ? sysfs_do_create_link_sd+0x8e/0x100
[ 19.436771] really_probe+0x3e2/0xb80
[ 19.436771] __driver_probe_device+0x18c/0x450
[ 19.436771] driver_probe_device+0x4a/0x120
[ 19.436771] __driver_attach+0x1e5/0x4a0
[ 19.436771] ? __pfx___driver_attach+0x10/0x10
[ 19.436771] bus_for_each_dev+0x109/0x190
[ 19.436771] ? __pfx_bus_for_each_dev+0x10/0x10
[ 19.436771] bus_add_driver+0x2a1/0x570
[ 19.436771] driver_register+0x134/0x460
[ 19.436771] ? __pfx_amdgpu_init+0x10/0x10 [amdgpu]
[ 19.436771] do_one_initcall+0xd6/0x430
[ 19.436771] ? __pfx_do_one_initcall+0x10/0x10
[ 19.436771] ? kasan_unpoison+0x44/0x70
[ 19.436771] do_init_module+0x238/0x770
[ 19.436771] load_module+0x5581/0x6f10
[ 19.436771] ? __pfx_load_module+0x10/0x10
[ 19.436771] ? local_clock_noinstr+0x45/0xc0
[ 19.436771] ? __might_fault+0xc6/0x180
[ 19.436771] ? __pfx___might_resched+0x10/0x10
[ 19.436771] ? __do_sys_init_module+0x1f2/0x220
[ 19.436771] __do_sys_init_module+0x1f2/0x220
[ 19.436771] ? __pfx___do_sys_init_module+0x10/0x10
[ 19.436771] do_syscall_64+0x64/0xe0
[ 19.436771] ? asm_exc_page_fault+0x26/0x30
[ 19.436771] ? lockdep_hardirqs_on+0x81/0x110
[ 19.436771] entry_SYSCALL_64_after_hwframe+0x6e/0x76
[ 19.436771] RIP: 0033:0x7f9aac99f22e
[ 19.436771] Code: 48 8b 0d 05 1c 0c 00 f7 d8 64 89 01 48 83 c8 ff
c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 49 89 ca b8 af 00 00
00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d d2 1b 0c 00 f7 d8 64 89
01 48
[ 19.436771] RSP: 002b:00007ffca4d0f2e8 EFLAGS: 00000246 ORIG_RAX:
00000000000000af
[ 19.436771] RAX: ffffffffffffffda RBX: 0000561575417f90 RCX: 00007f9aac99f22e
[ 19.436771] RDX: 0000561575423840 RSI: 00000000041c7e9e RDI: 00007f9aa65d3010
[ 19.436771] RBP: 00007ffca4d0f3a0 R08: 00005615753e5010 R09: 0000000000000007
[ 19.436771] R10: 0000000000000002 R11: 0000000000000246 R12: 0000561575423840
[ 19.436771] R13: 0000000000020000 R14: 00005615753e9190 R15: 0000561575406f40
[ 19.436771] </TASK>
[ 19.436771] Modules linked in: amdgpu(+) hid_asus asus_wmi
ledtrig_audio sparse_keymap platform_profile amdxcp i2c_algo_bit
drm_ttm_helper ttm crct10dif_pclmul drm_exec crc32_pclmul crc32c_intel
gpu_sched polyval_clmulni rfkill drm_suballoc_helper polyval_generic
hid_multitouch ucsi_acpi drm_buddy typec_ucsi nvme video
ghash_clmulni_intel drm_display_helper sha512_ssse3 nvme_core
serio_raw ccp r8169 cec typec sp5100_tco i2c_hid_acpi i2c_hid wmi
ip6_tables ip_tables fuse
[ 19.439296] ---[ end trace 0000000000000000 ]---
[ 19.439301] RIP: 0010:amdgpu_ras_reset_error_count+0x2ec/0x3e0 [amdgpu]
[ 19.439659] Code: 00 00 00 48 8d b8 78 01 00 00 48 89 7c 24 08 e8
9a e3 26 c5 48 8b 7c 24 08 48 b8 00 00 00 00 00 fc ff df 48 89 f9 48
c1 e9 03 <0f> b6 0c 01 48 89 f8 83 e0 07 83 c0 03 38 c8 7c 04 84 c9 75
24 48
[ 19.439669] RSP: 0018:ffffc9000312f360 EFLAGS: 00010212
[ 19.439674] RAX: dffffc0000000000 RBX: ffff8881c6000000 RCX: 000000000000002f
[ 19.439678] RDX: fffffbfff1743de9 RSI: 0000000000000008 RDI: 0000000000000178
[ 19.439683] RBP: ffffffffc1f9e840 R08: 0000000000000001 R09: fffffbfff1743de8
[ 19.439688] R10: ffffffff8ba1ef47 R11: 0000000000000000 R12: 0000000000000006
[ 19.439693] R13: ffffffffc1f9e890 R14: ffff8881c604ea88 R15: 0000000000000000
[ 19.439698] FS: 00007f9aabfb0980(0000) GS:ffff888f8e200000(0000)
knlGS:0000000000000000
[ 19.439703] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 19.439707] CR2: 00007f73247fc010 CR3: 0000000168d50000 CR4: 0000000000f50ef0
[ 19.439712] PKRU: 55555554
[ 19.462897] (udev-worker) (501) used greatest stack depth: 23216 bytes left
[ 130.872611] BTRFS info (device nvme1n1p3): using crc32c
(crc32c-intel) checksum algorithm
[ 130.872621] BTRFS info (device nvme1n1p3): using free space tree
[ 131.307530] BTRFS info (device nvme1n1p3): enabling ssd optimizations
[ 131.307535] BTRFS info (device nvme1n1p3): auto enabling async discard

Today I tried bisect it, but look like I faced another issue which
caused boot blocker:
[ 30.814125] mikhail-laptop kernel: general protection fault,
probably for non-canonical address 0xdffffc0000000002: 0000 [#1]
PREEMPT SMP KASAN NOPTI
[ 30.814129] mikhail-laptop kernel: KASAN: null-ptr-deref in range
[0x0000000000000010-0x0000000000000017]
[ 30.814132] mikhail-laptop kernel: CPU: 12 PID: 136 Comm:
kworker/12:1 Tainted: G W L
6.6.0-rc6-01-daee7aaba8491e64911438696c5f3f7cb77edf5e+ #127
[ 30.814134] mikhail-laptop kernel: Hardware name: ASUSTeK COMPUTER
INC. ROG Strix G513QY_G513QY/G513QY, BIOS G513QY.331 02/24/2023
[ 30.814136] mikhail-laptop kernel: Workqueue: events
mt7921_init_work [mt7921_common]
[ 30.814145] mikhail-laptop kernel: RIP:
0010:mt7921_regd_notifier+0x3e2/0x7d0 [mt7921_common]
[ 30.814151] mikhail-laptop kernel: Code: c1 ea 03 80 3c 02 00 0f 85
ec 03 00 00 4d 8b b4 24 d0 01 00 00 48 b8 00 00 00 00 00 fc ff df 49
8d 7e 14 48 89 fa 48 c1 ea 03 <0f> b6 14 02 48 89 f8 83 e0 07 83 c0 03
38 d0 7c 08 84 d2 0f 85 f9
[ 30.814153] mikhail-laptop kernel: RSP: 0018:ffffc900013577c8
EFLAGS: 00010213
[ 30.814155] mikhail-laptop kernel: RAX: dffffc0000000000 RBX:
ffff8881a61326e8 RCX: ffff8882398632c0
[ 30.814156] mikhail-laptop kernel: RDX: 0000000000000002 RSI:
ffffed104730c696 RDI: 0000000000000014
[ 30.814157] mikhail-laptop kernel: RBP: 000000000000001c R08:
ffff88823986c411 R09: 1ffff1104730d882
[ 30.814158] mikhail-laptop kernel: R10: ffffc90001357717 R11:
0000000000000001 R12: ffff8882398607a0
[ 30.814159] mikhail-laptop kernel: R13: 0000000000000000 R14:
0000000000000000 R15: dffffc0000000000
[ 30.814160] mikhail-laptop kernel: FS: 0000000000000000(0000)
GS:ffff888f8f200000(0000) knlGS:0000000000000000
[ 30.814161] mikhail-laptop kernel: CS: 0010 DS: 0000 ES: 0000 CR0:
0000000080050033
[ 30.814162] mikhail-laptop kernel: CR2: 00007f6bad1f74c0 CR3:
0000000130a70000 CR4: 0000000000f50ee0
[ 30.814164] mikhail-laptop kernel: PKRU: 55555554
[ 30.814164] mikhail-laptop kernel: Call Trace:
[ 30.814165] mikhail-laptop kernel: <TASK>
[ 30.814167] mikhail-laptop kernel: ? die_addr+0x40/0xa0
[ 30.814171] mikhail-laptop kernel: ? exc_general_protection+0x15c/0x240
[ 30.814177] mikhail-laptop kernel: ? asm_exc_general_protection+0x26/0x30
[ 30.814182] mikhail-laptop kernel: ?
mt7921_regd_notifier+0x3e2/0x7d0 [mt7921_common]
[ 30.814187] mikhail-laptop kernel: ?
mt7921_regd_notifier+0x215/0x7d0 [mt7921_common]
[ 30.814193] mikhail-laptop kernel: ? freq_reg_info+0xb7/0x150 [cfg80211]
[ 30.814237] mikhail-laptop kernel:
wiphy_update_regulatory+0xd99/0x2fa0 [cfg80211]
[ 30.814276] mikhail-laptop kernel: ?
nl80211_notify_wiphy+0x17e/0x210 [cfg80211]
[ 30.814317] mikhail-laptop kernel: ? __pfx___mutex_unlock_slowpath+0x10/0x10
[ 30.814321] mikhail-laptop kernel: ?
__pfx_wiphy_update_regulatory+0x10/0x10 [cfg80211]
[ 30.814360] mikhail-laptop kernel:
wiphy_regulatory_register+0x87/0x190 [cfg80211]
[ 30.814399] mikhail-laptop kernel: wiphy_register+0x1a14/0x2a90 [cfg80211]
[ 30.814437] mikhail-laptop kernel: ? netdev_run_todo+0x2b4/0xe20
[ 30.814442] mikhail-laptop kernel: ?
__pfx_wiphy_register+0x10/0x10 [cfg80211]
[ 30.814477] mikhail-laptop kernel: ? __kmalloc_large_node+0xe0/0x170
[ 30.814483] mikhail-laptop kernel:
ieee80211_register_hw+0x1f1e/0x3f70 [mac80211]
[ 30.814506] mikhail-laptop kernel: ?
__pfx_ieee80211_register_hw+0x10/0x10 [mac80211]
[ 30.814506] mikhail-laptop kernel: ? mt76_init_stream_cap+0x203/0x300 [mt76]
[ 30.814506] mikhail-laptop kernel: ? mt76_init_sband+0x29b/0x3e0 [mt76]
[ 30.814506] mikhail-laptop kernel: mt76_register_device+0x477/0x8e0 [mt76]
[ 30.814506] mikhail-laptop kernel: mt7921_init_work+0x144/0x4c0
[mt7921_common]
[ 30.814506] mikhail-laptop kernel: process_one_work+0x789/0x12a0
[ 30.814506] mikhail-laptop kernel: ? worker_thread+0x2a6/0x1300
[ 30.814506] mikhail-laptop kernel: ? __pfx_process_one_work+0x10/0x10
[ 30.814506] mikhail-laptop kernel: ? assign_work+0x16c/0x240
[ 30.814506] mikhail-laptop kernel: worker_thread+0x727/0x1300
[ 30.814506] mikhail-laptop kernel: ? __pfx_worker_thread+0x10/0x10
[ 30.814506] mikhail-laptop kernel: kthread+0x2f5/0x3d0
[ 30.814506] mikhail-laptop kernel: ? _raw_spin_unlock_irq+0x28/0x60
[ 30.814506] mikhail-laptop kernel: ? __pfx_kthread+0x10/0x10
[ 30.814506] mikhail-laptop kernel: ret_from_fork+0x34/0x70
[ 30.814506] mikhail-laptop kernel: ? __pfx_kthread+0x10/0x10
[ 30.814506] mikhail-laptop kernel: ret_from_fork_asm+0x1b/0x30
[ 30.814506] mikhail-laptop kernel: </TASK>
[ 30.814506] mikhail-laptop kernel: Modules linked in: sunrpc
snd_hda_codec_realtek binfmt_misc snd_hda_codec_generic
snd_hda_codec_hdmi intel_rapl_msr intel_rapl_common
snd_sof_amd_vangogh snd_sof_amd_rembrandt snd_sof_amd_renoir mt7921e
snd_sof_amd_acp mt7921_common snd_sof_pci snd_sof_xtensa_dsp
mt792x_lib snd_sof mt76_connac_lib mt76 edac_mce_amd snd_sof_utils
snd_soc_core kvm_amd mac80211 btusb snd_hda_intel snd_intel_dspcfg
btrtl snd_intel_sdw_acpi btintel snd_compress btbcm ac97_bus
snd_hda_codec snd_pcm_dmaengine btmtk kvm snd_pci_ps snd_hda_core vfat
snd_rpl_pci_acp6x snd_pci_acp6x fat snd_hwdep bluetooth snd_seq
snd_seq_device libarc4 snd_pcm snd_pci_acp5x irqbypass
snd_rn_pci_acp3x cfg80211 snd_timer rapl asus_nb_wmi snd_acp_config
wmi_bmof snd_soc_acpi snd pcspkr k10temp snd_pci_acp3x i2c_piix4
soundcore amd_pmc asus_wireless joydev loop zram amdgpu hid_asus
asus_wmi ledtrig_audio sparse_keymap platform_profile i2c_algo_bit
drm_ttm_helper ttm crct10dif_pclmul crc32_pclmul rfkill crc32c_intel
drm_exec drm_suballoc_helper
[ 30.814506] mikhail-laptop kernel: polyval_clmulni amdxcp
polyval_generic ucsi_acpi drm_buddy hid_multitouch typec_ucsi video
gpu_sched nvme ghash_clmulni_intel sha512_ssse3 nvme_core
drm_display_helper serio_raw ccp r8169 i2c_hid_acpi sp5100_tco cec
nvme_common typec wmi i2c_hid ip6_tables ip_tables fuse
[ 30.814727] mikhail-laptop kernel: ---[ end trace 0000000000000000 ]---
[ 30.814728] mikhail-laptop kernel: RIP:
0010:mt7921_regd_notifier+0x3e2/0x7d0 [mt7921_common]
[ 30.814734] mikhail-laptop kernel: Code: c1 ea 03 80 3c 02 00 0f 85
ec 03 00 00 4d 8b b4 24 d0 01 00 00 48 b8 00 00 00 00 00 fc ff df 49
8d 7e 14 48 89 fa 48 c1 ea 03 <0f> b6 14 02 48 89 f8 83 e0 07 83 c0 03
38 d0 7c 08 84 d2 0f 85 f9
[ 30.814735] mikhail-laptop kernel: RSP: 0018:ffffc900013577c8
EFLAGS: 00010213
[ 30.814737] mikhail-laptop kernel: RAX: dffffc0000000000 RBX:
ffff8881a61326e8 RCX: ffff8882398632c0
[ 30.814738] mikhail-laptop kernel: RDX: 0000000000000002 RSI:
ffffed104730c696 RDI: 0000000000000014
[ 30.814739] mikhail-laptop kernel: RBP: 000000000000001c R08:
ffff88823986c411 R09: 1ffff1104730d882
[ 30.814740] mikhail-laptop kernel: R10: ffffc90001357717 R11:
0000000000000001 R12: ffff8882398607a0
[ 30.814741] mikhail-laptop kernel: R13: 0000000000000000 R14:
0000000000000000 R15: dffffc0000000000
[ 30.814742] mikhail-laptop kernel: FS: 0000000000000000(0000)
GS:ffff888f8f200000(0000) knlGS:0000000000000000
[ 30.814743] mikhail-laptop kernel: CS: 0010 DS: 0000 ES: 0000 CR0:
0000000080050033
[ 30.814744] mikhail-laptop kernel: CR2: 00007f6bad1f74c0 CR3:
0000000130a70000 CR4: 0000000000f50ee0
[ 30.814745] mikhail-laptop kernel: PKRU: 55555554
[ 30.850952] mikhail-laptop kernel:
==================================================================
[ 30.909383] mikhail-laptop kernel: BUG: KASAN: slab-use-after-free
in mutex_can_spin_on_owner+0x191/0x1c0
[ 30.909383] mikhail-laptop kernel: Read of size 4 at addr
ffff88810c44b834 by task iw/1099
[ 30.909383] mikhail-laptop kernel:
[ 30.909383] mikhail-laptop kernel: CPU: 13 PID: 1099 Comm: iw
Tainted: G D W L
6.6.0-rc6-01-daee7aaba8491e64911438696c5f3f7cb77edf5e+ #127
[ 30.909383] mikhail-laptop kernel: Hardware name: ASUSTeK COMPUTER
INC. ROG Strix G513QY_G513QY/G513QY, BIOS G513QY.331 02/24/2023
[ 30.909383] mikhail-laptop kernel: Call Trace:
[ 30.909383] mikhail-laptop kernel: <TASK>
[ 30.914381] mikhail-laptop kernel: dump_stack_lvl+0x76/0xd0
[ 30.914381] mikhail-laptop kernel: print_report+0xcf/0x670
[ 30.914381] mikhail-laptop kernel: ? mutex_can_spin_on_owner+0x191/0x1c0
[ 30.914381] mikhail-laptop kernel: kasan_report+0xa6/0xe0
[ 30.914381] mikhail-laptop kernel: ? mutex_can_spin_on_owner+0x191/0x1c0
[ 30.914381] mikhail-laptop kernel: mutex_can_spin_on_owner+0x191/0x1c0
[ 30.914381] mikhail-laptop kernel: __mutex_lock+0x26a/0x18b0
[ 30.914381] mikhail-laptop kernel: ? nl80211_pre_doit+0x92/0x750 [cfg80211]
[ 30.925389] mikhail-laptop kernel: ? __nla_validate_parse+0xeb5/0x2430
[ 30.925389] mikhail-laptop kernel: ? __pfx___mutex_lock+0x10/0x10
[ 30.925389] mikhail-laptop kernel: ? __pfx___nla_validate_parse+0x10/0x10
[ 30.925389] mikhail-laptop kernel: ? kasan_set_track+0x25/0x30
[ 30.925389] mikhail-laptop kernel: ? nl80211_pre_doit+0x92/0x750 [cfg80211]
[ 30.931386] mikhail-laptop kernel: ?
genl_family_rcv_msg_attrs_parse.isra.0+0x150/0x230
[ 30.931386] mikhail-laptop kernel: nl80211_pre_doit+0x92/0x750 [cfg80211]
[ 30.934928] mikhail-laptop kernel: genl_family_rcv_msg_doit+0x1b1/0x2c0
[ 30.934928] mikhail-laptop kernel: ?
__pfx_genl_family_rcv_msg_doit+0x10/0x10
[ 30.934928] mikhail-laptop kernel: ? __pfx_bpf_lsm_capable+0x10/0x10
[ 30.934928] mikhail-laptop kernel: ? security_capable+0x74/0xb0
[ 30.938381] mikhail-laptop kernel: genl_rcv_msg+0x434/0x700
[ 30.938381] mikhail-laptop kernel: ? __pfx_genl_rcv_msg+0x10/0x10
[ 30.938381] mikhail-laptop kernel: ?
__pfx_nl80211_pre_doit+0x10/0x10 [cfg80211]
[ 30.938381] mikhail-laptop kernel: ?
__pfx_nl80211_req_set_reg+0x10/0x10 [cfg80211]
[ 30.938381] mikhail-laptop kernel: ?
__pfx_nl80211_post_doit+0x10/0x10 [cfg80211]
[ 30.938381] mikhail-laptop kernel: netlink_rcv_skb+0x140/0x3b0
[ 30.938381] mikhail-laptop kernel: ? __pfx_genl_rcv_msg+0x10/0x10
[ 30.938381] mikhail-laptop kernel: ? __pfx_netlink_rcv_skb+0x10/0x10
[ 30.938381] mikhail-laptop kernel: ? __pfx_lock_acquired+0x10/0x10
[ 30.938381] mikhail-laptop kernel: ? __pfx_down_read+0x10/0x10
[ 30.938381] mikhail-laptop kernel: ? netlink_deliver_tap+0xd0/0xaf0
[ 30.938381] mikhail-laptop kernel: ? netlink_deliver_tap+0x13d/0xaf0
[ 30.938381] mikhail-laptop kernel: genl_rcv+0x28/0x40
[ 30.938381] mikhail-laptop kernel: netlink_unicast+0x42f/0x730
[ 30.938381] mikhail-laptop kernel: ? __pfx_netlink_unicast+0x10/0x10
[ 30.938381] mikhail-laptop kernel: netlink_sendmsg+0x7ce/0xca0
[ 30.938381] mikhail-laptop kernel: ? __pfx_netlink_sendmsg+0x10/0x10
[ 30.938381] mikhail-laptop kernel: ? __pfx_netlink_sendmsg+0x10/0x10
[ 30.938381] mikhail-laptop kernel: ____sys_sendmsg+0x985/0xc50
[ 30.938381] mikhail-laptop kernel: ? __pfx_____sys_sendmsg+0x10/0x10
[ 30.938381] mikhail-laptop kernel: ? __pfx_copy_msghdr_from_user+0x10/0x10
[ 30.938381] mikhail-laptop kernel: ___sys_sendmsg+0x105/0x190
[ 30.938381] mikhail-laptop kernel: ? __pfx____sys_sendmsg+0x10/0x10
[ 30.938381] mikhail-laptop kernel: ? rcu_is_watching+0x15/0xb0
[ 30.938381] mikhail-laptop kernel: ? __pfx_lock_release+0x10/0x10
[ 30.938381] mikhail-laptop kernel: ? __fget_light+0x51/0x220
[ 30.938381] mikhail-laptop kernel: __sys_sendmsg+0xeb/0x190
[ 30.938381] mikhail-laptop kernel: ? __pfx___sys_sendmsg+0x10/0x10
[ 30.938381] mikhail-laptop kernel: ? __pfx___seccomp_filter+0x10/0x10
[ 30.938381] mikhail-laptop kernel: do_syscall_64+0x60/0x90
[ 30.938381] mikhail-laptop kernel: ? irqentry_exit_to_user_mode+0xe/0x40
[ 30.938381] mikhail-laptop kernel: ? rcu_is_watching+0x15/0xb0
[ 30.938381] mikhail-laptop kernel: ? irqentry_exit_to_user_mode+0xe/0x40
[ 30.938381] mikhail-laptop kernel: ? trace_hardirqs_on_prepare+0xe3/0x100
[ 30.938381] mikhail-laptop kernel: entry_SYSCALL_64_after_hwframe+0x6e/0xd8
[ 30.938381] mikhail-laptop kernel: RIP: 0033:0x7fc76d7b5414
[ 30.938381] mikhail-laptop kernel: Code: 15 21 0a 0c 00 f7 d8 64 89
02 b8 ff ff ff ff eb bf 0f 1f 44 00 00 f3 0f 1e fa 80 3d 55 8f 0c 00
00 74 13 b8 2e 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 4c c3 0f 1f 00 55
48 89 e5 48 83 ec 20 89 55
[ 30.938381] mikhail-laptop kernel: RSP: 002b:00007ffe62882bf8
EFLAGS: 00000202 ORIG_RAX: 000000000000002e
[ 30.938381] mikhail-laptop kernel: RAX: ffffffffffffffda RBX:
0000559cfddbf390 RCX: 00007fc76d7b5414
[ 30.938381] mikhail-laptop kernel: RDX: 0000000000000000 RSI:
00007ffe62882c30 RDI: 0000000000000003
[ 30.938381] mikhail-laptop kernel: RBP: 00007ffe62882c20 R08:
0000559cfddbf010 R09: 0000000000000007
[ 30.938381] mikhail-laptop kernel: R10: 0000559cfddbf2a0 R11:
0000000000000202 R12: 0000559cfddc4780
[ 30.938381] mikhail-laptop kernel: R13: 0000559cfddc48c0 R14:
00007ffe62882c30 R15: 0000559cfddc48c0
[ 30.938381] mikhail-laptop kernel: </TASK>
[ 30.938381] mikhail-laptop kernel:
[ 30.938381] mikhail-laptop kernel: Allocated by task 2:
[ 30.938381] mikhail-laptop kernel: kasan_save_stack+0x33/0x60
[ 30.938381] mikhail-laptop kernel: kasan_set_track+0x25/0x30
[ 30.938381] mikhail-laptop kernel: __kasan_slab_alloc+0x6e/0x70
[ 30.938381] mikhail-laptop kernel: kmem_cache_alloc_node+0x18d/0x420
[ 30.938381] mikhail-laptop kernel: copy_process+0x3be/0x6910
[ 30.938381] mikhail-laptop kernel: kernel_clone+0xc8/0x710
[ 30.938381] mikhail-laptop kernel: kernel_thread+0xb4/0xf0
[ 30.938381] mikhail-laptop kernel: kthreadd+0x9c7/0xe00
[ 30.938381] mikhail-laptop kernel: ret_from_fork+0x34/0x70
[ 30.938381] mikhail-laptop kernel: ret_from_fork_asm+0x1b/0x30
[ 30.938381] mikhail-laptop kernel:
[ 30.938381] mikhail-laptop kernel: Freed by task 54:
[ 30.938381] mikhail-laptop kernel: kasan_save_stack+0x33/0x60
[ 30.938381] mikhail-laptop kernel: kasan_set_track+0x25/0x30
[ 30.938381] mikhail-laptop kernel: kasan_save_free_info+0x2b/0x50
[ 30.938381] mikhail-laptop kernel: __kasan_slab_free+0x10b/0x1a0
[ 30.938381] mikhail-laptop kernel: slab_free_freelist_hook+0x12b/0x1e0
[ 30.938381] mikhail-laptop kernel: kmem_cache_free+0x174/0x480
[ 30.938381] mikhail-laptop kernel: delayed_put_task_struct+0x162/0x1c0
[ 30.938381] mikhail-laptop kernel: rcu_do_batch+0x448/0x1700
[ 30.938381] mikhail-laptop kernel: rcu_core+0x880/0xdb0
[ 30.938381] mikhail-laptop kernel: __do_softirq+0x21b/0x8bb
[ 30.938381] mikhail-laptop kernel:
[ 30.938381] mikhail-laptop kernel: Last potentially related work creation:
[ 30.938381] mikhail-laptop kernel: kasan_save_stack+0x33/0x60
[ 30.938381] mikhail-laptop kernel: __kasan_record_aux_stack+0x94/0xa0
[ 30.938381] mikhail-laptop kernel: __call_rcu_common.constprop.0+0xf8/0x1af0
[ 30.938381] mikhail-laptop kernel: __schedule+0x10b4/0x5e90
[ 30.938381] mikhail-laptop kernel: schedule_idle+0x60/0x90
[ 30.938381] mikhail-laptop kernel: do_idle+0x294/0x450
[ 30.938381] mikhail-laptop kernel: cpu_startup_entry+0x55/0x60
[ 30.938381] mikhail-laptop kernel: start_secondary+0x215/0x290
[ 30.938381] mikhail-laptop kernel:
secondary_startup_64_no_verify+0x17d/0x18b
[ 30.938381] mikhail-laptop kernel:
[ 30.938381] mikhail-laptop kernel: The buggy address belongs to the
object at ffff88810c44b800
which belongs to the cache
task_struct of size 14040
[ 30.938381] mikhail-laptop kernel: The buggy address is located 52
bytes inside of
freed 14040-byte region
[ffff88810c44b800, ffff88810c44eed8)
[ 30.938381] mikhail-laptop kernel:
[ 30.938381] mikhail-laptop kernel: The buggy address belongs to the
physical page:
[ 30.938381] mikhail-laptop kernel: page:00000000c8764bb1 refcount:1
mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x10c448
[ 30.938381] mikhail-laptop kernel: head:00000000c8764bb1 order:3
entire_mapcount:0 nr_pages_mapped:0 pincount:0
[ 30.938381] mikhail-laptop kernel: flags:
0x17ffffc0000840(slab|head|node=0|zone=2|lastcpupid=0x1fffff)
[ 30.938381] mikhail-laptop kernel: page_type: 0xffffffff()
[ 30.938381] mikhail-laptop kernel: raw: 0017ffffc0000840
ffff8881081d5cc0 dead000000000122 0000000000000000
[ 30.938381] mikhail-laptop kernel: raw: 0000000000000000
0000000000020002 00000001ffffffff 0000000000000000
[ 30.938381] mikhail-laptop kernel: page dumped because: kasan: bad
access detected
[ 30.938381] mikhail-laptop kernel:
[ 30.938381] mikhail-laptop kernel: Memory state around the buggy address:
[ 30.938381] mikhail-laptop kernel: ffff88810c44b700: fc fc fc fc
fc fc fc fc fc fc fc fc fc fc fc fc
[ 30.938381] mikhail-laptop kernel: ffff88810c44b780: fc fc fc fc
fc fc fc fc fc fc fc fc fc fc fc fc
[ 30.938381] mikhail-laptop kernel: >ffff88810c44b800: fa fb fb fb
fb fb fb fb fb fb fb fb fb fb fb fb
[ 30.938381] mikhail-laptop kernel: ^
[ 30.938381] mikhail-laptop kernel: ffff88810c44b880: fb fb fb fb
fb fb fb fb fb fb fb fb fb fb fb fb
[ 30.938381] mikhail-laptop kernel: ffff88810c44b900: fb fb fb fb
fb fb fb fb fb fb fb fb fb fb fb fb
[ 30.938381] mikhail-laptop kernel:
==================================================================

And bisect says that this commit blame in inability to boot system:
❯ git bisect good
09382d8f8641bc12fffc41a93eb9b37be0e653c0 is the first bad commit
commit 09382d8f8641bc12fffc41a93eb9b37be0e653c0
Author: Ming Yen Hsieh <[email protected]>
Date: Sat Sep 30 10:25:09 2023 +0800

wifi: mt76: mt7921: update the channel usage when the regd domain changed

The 5.9/6GHz channel license of a certain platform device has been
regulated in various countries. That may be difference with standard
Liunx regulatory domain settings. In this case, when .reg_notifier()
called for regulatory change, mt792x chipset should update the channel
usage based on clc or dts configurations.

Channel would be disabled by following cases.
* clc report the particular UNII-x is disabled.
* dts enabled and the channel is not configured.

Signed-off-by: Ming Yen Hsieh <[email protected]>
Co-developed-by: Deren Wu <[email protected]>
Signed-off-by: Deren Wu <[email protected]>
Signed-off-by: Felix Fietkau <[email protected]>

drivers/net/wireless/mediatek/mt76/eeprom.c | 7 +++-
drivers/net/wireless/mediatek/mt76/mt76.h | 5 +++
drivers/net/wireless/mediatek/mt76/mt7921/init.c | 51 ++++++++++++++++++++++++
drivers/net/wireless/mediatek/mt76/mt7921/mcu.c | 3 ++
4 files changed, 64 insertions(+), 2 deletions(-)

I will be grateful if anyone can tell me which commit fix it and I can
continue bisect the original problem.
Hardware specs: https://linux-hardware.org/?probe=85a38e7906

--
Best Regards,
Mike Gavrilov.


Attachments:
dmesg-from-all-bisect-steps.zip (536.30 kB)

2023-11-03 22:36:59

by Deren Wu

[permalink] [raw]
Subject: Re: 6.7/kasan/regression/bisected: mt7921_regd_notifier+0x3e2

Hi Mike,

On Sat, 2023-11-04 at 00:18 +0500, Mikhail Gavrilov wrote:
>
> External email : Please do not click links or open attachments until
> you have verified the sender or the content.
> Hi,
> another release cycle, and another regression.
> Yesterday after another kernel update in Fedora Rawhide system
> stopped
> entering in graphic mode.
> In kernel log appears such backtrace:
> [ 19.431838]
> ==================================================================
> [ 19.431843] BUG: KASAN: null-ptr-deref in
> amdgpu_ras_reset_error_count+0x2d6/0x3e0 [amdgpu]
> [ 19.432274] Read of size 4 at addr 0000000000000178 by task (udev-
> worker)/501
>
> [ 19.432283] CPU: 8 PID: 501 Comm: (udev-worker) Tainted:
> G W
> L ------- ---
> 6.7.0-0.rc0.20231102git21e80f3841c0.4.fc40.x86_64+debug #1
> [ 19.432292] Hardware name: ASUSTeK COMPUTER INC. ROG Strix
> G513QY_G513QY/G513QY, BIOS G513QY.331 02/24/2023
> [ 19.432298] Call Trace:
> [ 19.432302] <TASK>
> [ 19.432305] dump_stack_lvl+0x76/0xd0
> [ 19.432313] kasan_report+0xa6/0xe0
> [ 19.432320] ? amdgpu_ras_reset_error_count+0x2d6/0x3e0 [amdgpu]
> [ 19.432443] kasan_check_range+0x105/0x1b0
> [ 19.432443] amdgpu_ras_reset_error_count+0x2d6/0x3e0 [amdgpu]
> [ 19.432443] gmc_v9_0_late_init+0xcf/0x1b0 [amdgpu]
> [ 19.432443] amdgpu_device_ip_late_init+0x103/0x7b0 [amdgpu]
> [ 19.432443] amdgpu_device_init+0x7b33/0x8a90 [amdgpu]
> [ 19.432443] ? __pfx_amdgpu_device_init+0x10/0x10 [amdgpu]
> [ 19.432443] ? __pfx_pci_bus_read_config_word+0x10/0x10
> [ 19.432443] ? do_pci_enable_device+0x22d/0x2a0
> [ 19.432443] ? pci_update_current_state+0x1/0x1f0
> [ 19.432443] ? _raw_spin_unlock_irqrestore+0x66/0x80
> [ 19.432443] ? lockdep_hardirqs_on+0x81/0x110
> [ 19.432443] ? __kasan_check_byte+0x13/0x50
> [ 19.432443] amdgpu_driver_load_kms+0x1d/0x4b0 [amdgpu]
> [ 19.432443] amdgpu_pci_probe+0x282/0xac0 [amdgpu]
> [ 19.432443] ? __pfx_amdgpu_pci_probe+0x10/0x10 [amdgpu]
> [ 19.432443] local_pci_probe+0xdd/0x190
> [ 19.432443] pci_device_probe+0x23a/0x780
> [ 19.432443] ? kernfs_add_one+0x326/0x490
> [ 19.432443] ? kernfs_get.part.0+0x4c/0x70
> [ 19.432443] ? __pfx_pci_device_probe+0x10/0x10
> [ 19.432443] ? kernfs_create_link+0x16b/0x230
> [ 19.432443] ? kernfs_put+0x1c/0x40
> [ 19.432443] ? sysfs_do_create_link_sd+0x8e/0x100
> [ 19.432443] really_probe+0x3e2/0xb80
> [ 19.432443] __driver_probe_device+0x18c/0x450
> [ 19.432443] driver_probe_device+0x4a/0x120
> [ 19.432443] __driver_attach+0x1e5/0x4a0
> [ 19.432443] ? __pfx___driver_attach+0x10/0x10
> [ 19.432443] bus_for_each_dev+0x109/0x190
> [ 19.432443] ? __pfx_bus_for_each_dev+0x10/0x10
> [ 19.432443] bus_add_driver+0x2a1/0x570
> [ 19.432443] driver_register+0x134/0x460
> [ 19.432443] ? __pfx_amdgpu_init+0x10/0x10 [amdgpu]
> [ 19.432443] do_one_initcall+0xd6/0x430
> [ 19.432443] ? __pfx_do_one_initcall+0x10/0x10
> [ 19.432443] ? kasan_unpoison+0x44/0x70
> [ 19.432443] do_init_module+0x238/0x770
> [ 19.432443] load_module+0x5581/0x6f10
> [ 19.432443] ? __pfx_load_module+0x10/0x10
> [ 19.432443] ? local_clock_noinstr+0x45/0xc0
> [ 19.432443] ? __might_fault+0xc6/0x180
> [ 19.432443] ? __pfx___might_resched+0x10/0x10
> [ 19.432443] ? __do_sys_init_module+0x1f2/0x220
> [ 19.432443] __do_sys_init_module+0x1f2/0x220
> [ 19.432443] ? __pfx___do_sys_init_module+0x10/0x10
> [ 19.432443] do_syscall_64+0x64/0xe0
> [ 19.432443] ? asm_exc_page_fault+0x26/0x30
> [ 19.432443] ? lockdep_hardirqs_on+0x81/0x110
> [ 19.432443] entry_SYSCALL_64_after_hwframe+0x6e/0x76
> [ 19.432443] RIP: 0033:0x7f9aac99f22e
> [ 19.432443] Code: 48 8b 0d 05 1c 0c 00 f7 d8 64 89 01 48 83 c8 ff
> c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 49 89 ca b8 af 00 00
> 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d d2 1b 0c 00 f7 d8 64
> 89
> 01 48
> [ 19.432443] RSP: 002b:00007ffca4d0f2e8 EFLAGS: 00000246 ORIG_RAX:
> 00000000000000af
> [ 19.432443] RAX: ffffffffffffffda RBX: 0000561575417f90 RCX:
> 00007f9aac99f22e
> [ 19.432443] RDX: 0000561575423840 RSI: 00000000041c7e9e RDI:
> 00007f9aa65d3010
> [ 19.432443] RBP: 00007ffca4d0f3a0 R08: 00005615753e5010 R09:
> 0000000000000007
> [ 19.432443] R10: 0000000000000002 R11: 0000000000000246 R12:
> 0000561575423840
> [ 19.432443] R13: 0000000000020000 R14: 00005615753e9190 R15:
> 0000561575406f40
> [ 19.432443] </TASK>
> [ 19.432443]
> ==================================================================
> [ 19.435775] Disabling lock debugging due to kernel taint
> [ 19.435787] general protection fault, probably for non-canonical
> address 0xdffffc000000002f: 0000 [#1] PREEMPT SMP KASAN NOPTI
> [ 19.435794] KASAN: null-ptr-deref in range
> [0x0000000000000178-0x000000000000017f]
> [ 19.435799] CPU: 8 PID: 501 Comm: (udev-worker) Tainted:
> G B W
> L ------- ---
> 6.7.0-0.rc0.20231102git21e80f3841c0.4.fc40.x86_64+debug #1
> [ 19.435807] Hardware name: ASUSTeK COMPUTER INC. ROG Strix
> G513QY_G513QY/G513QY, BIOS G513QY.331 02/24/2023
> [ 19.435813] RIP: 0010:amdgpu_ras_reset_error_count+0x2ec/0x3e0
> [amdgpu]
> [ 19.436132] Code: 00 00 00 48 8d b8 78 01 00 00 48 89 7c 24 08 e8
> 9a e3 26 c5 48 8b 7c 24 08 48 b8 00 00 00 00 00 fc ff df 48 89 f9 48
> c1 e9 03 <0f> b6 0c 01 48 89 f8 83 e0 07 83 c0 03 38 c8 7c 04 84 c9
> 75
> 24 48
> [ 19.436142] RSP: 0018:ffffc9000312f360 EFLAGS: 00010212
> [ 19.436147] RAX: dffffc0000000000 RBX: ffff8881c6000000 RCX:
> 000000000000002f
> [ 19.436152] RDX: fffffbfff1743de9 RSI: 0000000000000008 RDI:
> 0000000000000178
> [ 19.436156] RBP: ffffffffc1f9e840 R08: 0000000000000001 R09:
> fffffbfff1743de8
> [ 19.436160] R10: ffffffff8ba1ef47 R11: 0000000000000000 R12:
> 0000000000000006
> [ 19.436165] R13: ffffffffc1f9e890 R14: ffff8881c604ea88 R15:
> 0000000000000000
> [ 19.436169] FS: 00007f9aabfb0980(0000) GS:ffff888f8e200000(0000)
> knlGS:0000000000000000
> [ 19.436175] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [ 19.436179] CR2: 00007f73247fc010 CR3: 0000000168d50000 CR4:
> 0000000000f50ef0
> [ 19.436184] PKRU: 55555554
> [ 19.436186] Call Trace:
> [ 19.436189] <TASK>
> [ 19.436192] ? die_addr+0x40/0xa0
> [ 19.436198] ? exc_general_protection+0x15c/0x240
> [ 19.436204] ? asm_exc_general_protection+0x26/0x30
> [ 19.436211] ? amdgpu_ras_reset_error_count+0x2ec/0x3e0 [amdgpu]
> [ 19.436527] gmc_v9_0_late_init+0xcf/0x1b0 [amdgpu]
> [ 19.436771] amdgpu_device_ip_late_init+0x103/0x7b0 [amdgpu]
> [ 19.436771] amdgpu_device_init+0x7b33/0x8a90 [amdgpu]
> [ 19.436771] ? __pfx_amdgpu_device_init+0x10/0x10 [amdgpu]
> [ 19.436771] ? __pfx_pci_bus_read_config_word+0x10/0x10
> [ 19.436771] ? do_pci_enable_device+0x22d/0x2a0
> [ 19.436771] ? pci_update_current_state+0x1/0x1f0
> [ 19.436771] ? _raw_spin_unlock_irqrestore+0x66/0x80
> [ 19.436771] ? lockdep_hardirqs_on+0x81/0x110
> [ 19.436771] ? __kasan_check_byte+0x13/0x50
> [ 19.436771] amdgpu_driver_load_kms+0x1d/0x4b0 [amdgpu]
> [ 19.436771] amdgpu_pci_probe+0x282/0xac0 [amdgpu]
> [ 19.436771] ? __pfx_amdgpu_pci_probe+0x10/0x10 [amdgpu]
> [ 19.436771] local_pci_probe+0xdd/0x190
> [ 19.436771] pci_device_probe+0x23a/0x780
> [ 19.436771] ? kernfs_add_one+0x326/0x490
> [ 19.436771] ? kernfs_get.part.0+0x4c/0x70
> [ 19.436771] ? __pfx_pci_device_probe+0x10/0x10
> [ 19.436771] ? kernfs_create_link+0x16b/0x230
> [ 19.436771] ? kernfs_put+0x1c/0x40
> [ 19.436771] ? sysfs_do_create_link_sd+0x8e/0x100
> [ 19.436771] really_probe+0x3e2/0xb80
> [ 19.436771] __driver_probe_device+0x18c/0x450
> [ 19.436771] driver_probe_device+0x4a/0x120
> [ 19.436771] __driver_attach+0x1e5/0x4a0
> [ 19.436771] ? __pfx___driver_attach+0x10/0x10
> [ 19.436771] bus_for_each_dev+0x109/0x190
> [ 19.436771] ? __pfx_bus_for_each_dev+0x10/0x10
> [ 19.436771] bus_add_driver+0x2a1/0x570
> [ 19.436771] driver_register+0x134/0x460
> [ 19.436771] ? __pfx_amdgpu_init+0x10/0x10 [amdgpu]
> [ 19.436771] do_one_initcall+0xd6/0x430
> [ 19.436771] ? __pfx_do_one_initcall+0x10/0x10
> [ 19.436771] ? kasan_unpoison+0x44/0x70
> [ 19.436771] do_init_module+0x238/0x770
> [ 19.436771] load_module+0x5581/0x6f10
> [ 19.436771] ? __pfx_load_module+0x10/0x10
> [ 19.436771] ? local_clock_noinstr+0x45/0xc0
> [ 19.436771] ? __might_fault+0xc6/0x180
> [ 19.436771] ? __pfx___might_resched+0x10/0x10
> [ 19.436771] ? __do_sys_init_module+0x1f2/0x220
> [ 19.436771] __do_sys_init_module+0x1f2/0x220
> [ 19.436771] ? __pfx___do_sys_init_module+0x10/0x10
> [ 19.436771] do_syscall_64+0x64/0xe0
> [ 19.436771] ? asm_exc_page_fault+0x26/0x30
> [ 19.436771] ? lockdep_hardirqs_on+0x81/0x110
> [ 19.436771] entry_SYSCALL_64_after_hwframe+0x6e/0x76
> [ 19.436771] RIP: 0033:0x7f9aac99f22e
> [ 19.436771] Code: 48 8b 0d 05 1c 0c 00 f7 d8 64 89 01 48 83 c8 ff
> c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 49 89 ca b8 af 00 00
> 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d d2 1b 0c 00 f7 d8 64
> 89
> 01 48
> [ 19.436771] RSP: 002b:00007ffca4d0f2e8 EFLAGS: 00000246 ORIG_RAX:
> 00000000000000af
> [ 19.436771] RAX: ffffffffffffffda RBX: 0000561575417f90 RCX:
> 00007f9aac99f22e
> [ 19.436771] RDX: 0000561575423840 RSI: 00000000041c7e9e RDI:
> 00007f9aa65d3010
> [ 19.436771] RBP: 00007ffca4d0f3a0 R08: 00005615753e5010 R09:
> 0000000000000007
> [ 19.436771] R10: 0000000000000002 R11: 0000000000000246 R12:
> 0000561575423840
> [ 19.436771] R13: 0000000000020000 R14: 00005615753e9190 R15:
> 0000561575406f40
> [ 19.436771] </TASK>
> [ 19.436771] Modules linked in: amdgpu(+) hid_asus asus_wmi
> ledtrig_audio sparse_keymap platform_profile amdxcp i2c_algo_bit
> drm_ttm_helper ttm crct10dif_pclmul drm_exec crc32_pclmul
> crc32c_intel
> gpu_sched polyval_clmulni rfkill drm_suballoc_helper polyval_generic
> hid_multitouch ucsi_acpi drm_buddy typec_ucsi nvme video
> ghash_clmulni_intel drm_display_helper sha512_ssse3 nvme_core
> serio_raw ccp r8169 cec typec sp5100_tco i2c_hid_acpi i2c_hid wmi
> ip6_tables ip_tables fuse
> [ 19.439296] ---[ end trace 0000000000000000 ]---
> [ 19.439301] RIP: 0010:amdgpu_ras_reset_error_count+0x2ec/0x3e0
> [amdgpu]
> [ 19.439659] Code: 00 00 00 48 8d b8 78 01 00 00 48 89 7c 24 08 e8
> 9a e3 26 c5 48 8b 7c 24 08 48 b8 00 00 00 00 00 fc ff df 48 89 f9 48
> c1 e9 03 <0f> b6 0c 01 48 89 f8 83 e0 07 83 c0 03 38 c8 7c 04 84 c9
> 75
> 24 48
> [ 19.439669] RSP: 0018:ffffc9000312f360 EFLAGS: 00010212
> [ 19.439674] RAX: dffffc0000000000 RBX: ffff8881c6000000 RCX:
> 000000000000002f
> [ 19.439678] RDX: fffffbfff1743de9 RSI: 0000000000000008 RDI:
> 0000000000000178
> [ 19.439683] RBP: ffffffffc1f9e840 R08: 0000000000000001 R09:
> fffffbfff1743de8
> [ 19.439688] R10: ffffffff8ba1ef47 R11: 0000000000000000 R12:
> 0000000000000006
> [ 19.439693] R13: ffffffffc1f9e890 R14: ffff8881c604ea88 R15:
> 0000000000000000
> [ 19.439698] FS: 00007f9aabfb0980(0000) GS:ffff888f8e200000(0000)
> knlGS:0000000000000000
> [ 19.439703] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [ 19.439707] CR2: 00007f73247fc010 CR3: 0000000168d50000 CR4:
> 0000000000f50ef0
> [ 19.439712] PKRU: 55555554
> [ 19.462897] (udev-worker) (501) used greatest stack depth: 23216
> bytes left
> [ 130.872611] BTRFS info (device nvme1n1p3): using crc32c
> (crc32c-intel) checksum algorithm
> [ 130.872621] BTRFS info (device nvme1n1p3): using free space tree
> [ 131.307530] BTRFS info (device nvme1n1p3): enabling ssd
> optimizations
> [ 131.307535] BTRFS info (device nvme1n1p3): auto enabling async
> discard
>
> Today I tried bisect it, but look like I faced another issue which
> caused boot blocker:
> [ 30.814125] mikhail-laptop kernel: general protection fault,
> probably for non-canonical address 0xdffffc0000000002: 0000 [#1]
> PREEMPT SMP KASAN NOPTI
> [ 30.814129] mikhail-laptop kernel: KASAN: null-ptr-deref in range
> [0x0000000000000010-0x0000000000000017]
> [ 30.814132] mikhail-laptop kernel: CPU: 12 PID: 136 Comm:
> kworker/12:1 Tainted: G W L
> 6.6.0-rc6-01-daee7aaba8491e64911438696c5f3f7cb77edf5e+ #127
> [ 30.814134] mikhail-laptop kernel: Hardware name: ASUSTeK COMPUTER
> INC. ROG Strix G513QY_G513QY/G513QY, BIOS G513QY.331 02/24/2023
> [ 30.814136] mikhail-laptop kernel: Workqueue: events
> mt7921_init_work [mt7921_common]
> [ 30.814145] mikhail-laptop kernel: RIP:
> 0010:mt7921_regd_notifier+0x3e2/0x7d0 [mt7921_common]
> [ 30.814151] mikhail-laptop kernel: Code: c1 ea 03 80 3c 02 00 0f
> 85
> ec 03 00 00 4d 8b b4 24 d0 01 00 00 48 b8 00 00 00 00 00 fc ff df 49
> 8d 7e 14 48 89 fa 48 c1 ea 03 <0f> b6 14 02 48 89 f8 83 e0 07 83 c0
> 03
> 38 d0 7c 08 84 d2 0f 85 f9
> [ 30.814153] mikhail-laptop kernel: RSP: 0018:ffffc900013577c8
> EFLAGS: 00010213
> [ 30.814155] mikhail-laptop kernel: RAX: dffffc0000000000 RBX:
> ffff8881a61326e8 RCX: ffff8882398632c0
> [ 30.814156] mikhail-laptop kernel: RDX: 0000000000000002 RSI:
> ffffed104730c696 RDI: 0000000000000014
> [ 30.814157] mikhail-laptop kernel: RBP: 000000000000001c R08:
> ffff88823986c411 R09: 1ffff1104730d882
> [ 30.814158] mikhail-laptop kernel: R10: ffffc90001357717 R11:
> 0000000000000001 R12: ffff8882398607a0
> [ 30.814159] mikhail-laptop kernel: R13: 0000000000000000 R14:
> 0000000000000000 R15: dffffc0000000000
> [ 30.814160] mikhail-laptop kernel: FS: 0000000000000000(0000)
> GS:ffff888f8f200000(0000) knlGS:0000000000000000
> [ 30.814161] mikhail-laptop kernel: CS: 0010 DS: 0000 ES: 0000
> CR0:
> 0000000080050033
> [ 30.814162] mikhail-laptop kernel: CR2: 00007f6bad1f74c0 CR3:
> 0000000130a70000 CR4: 0000000000f50ee0
> [ 30.814164] mikhail-laptop kernel: PKRU: 55555554
> [ 30.814164] mikhail-laptop kernel: Call Trace:
> [ 30.814165] mikhail-laptop kernel: <TASK>
> [ 30.814167] mikhail-laptop kernel: ? die_addr+0x40/0xa0
> [ 30.814171] mikhail-laptop kernel: ?
> exc_general_protection+0x15c/0x240
> [ 30.814177] mikhail-laptop kernel: ?
> asm_exc_general_protection+0x26/0x30
> [ 30.814182] mikhail-laptop kernel: ?
> mt7921_regd_notifier+0x3e2/0x7d0 [mt7921_common]
> [ 30.814187] mikhail-laptop kernel: ?
> mt7921_regd_notifier+0x215/0x7d0 [mt7921_common]

Based on this stacktrace, I think we have the patch here.

https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/drivers/net/wireless/mediatek/mt76?id=169b7acb847e8dc656cd2289a91ff668f72405a0

Please help to verify your issue with this patch.

Regards,
Deren


> [ 30.814193] mikhail-laptop kernel: ? freq_reg_info+0xb7/0x150
> [cfg80211]
> [ 30.814237] mikhail-laptop kernel:
> wiphy_update_regulatory+0xd99/0x2fa0 [cfg80211]
> [ 30.814276] mikhail-laptop kernel: ?
> nl80211_notify_wiphy+0x17e/0x210 [cfg80211]
> [ 30.814317] mikhail-laptop kernel: ?
> __pfx___mutex_unlock_slowpath+0x10/0x10
> [ 30.814321] mikhail-laptop kernel: ?
> __pfx_wiphy_update_regulatory+0x10/0x10 [cfg80211]
> [ 30.814360] mikhail-laptop kernel:
> wiphy_regulatory_register+0x87/0x190 [cfg80211]
> [ 30.814399] mikhail-laptop kernel: wiphy_register+0x1a14/0x2a90
> [cfg80211]
> [ 30.814437] mikhail-laptop kernel: ? netdev_run_todo+0x2b4/0xe20
> [ 30.814442] mikhail-laptop kernel: ?
> __pfx_wiphy_register+0x10/0x10 [cfg80211]
> [ 30.814477] mikhail-laptop kernel: ?
> __kmalloc_large_node+0xe0/0x170
> [ 30.814483] mikhail-laptop kernel:
> ieee80211_register_hw+0x1f1e/0x3f70 [mac80211]
> [ 30.814506] mikhail-laptop kernel: ?
> __pfx_ieee80211_register_hw+0x10/0x10 [mac80211]
> [ 30.814506] mikhail-laptop kernel: ?
> mt76_init_stream_cap+0x203/0x300 [mt76]
> [ 30.814506] mikhail-laptop kernel: ? mt76_init_sband+0x29b/0x3e0
> [mt76]
> [ 30.814506] mikhail-laptop
> kernel: mt76_register_device+0x477/0x8e0 [mt76]
> [ 30.814506] mikhail-laptop kernel: mt7921_init_work+0x144/0x4c0
> [mt7921_common]
> [ 30.814506] mikhail-laptop kernel: process_one_work+0x789/0x12a0
> [ 30.814506] mikhail-laptop kernel: ? worker_thread+0x2a6/0x1300
> [ 30.814506] mikhail-laptop kernel: ?
> __pfx_process_one_work+0x10/0x10
> [ 30.814506] mikhail-laptop kernel: ? assign_work+0x16c/0x240
> [ 30.814506] mikhail-laptop kernel: worker_thread+0x727/0x1300
> [ 30.814506] mikhail-laptop kernel: ?
> __pfx_worker_thread+0x10/0x10
> [ 30.814506] mikhail-laptop kernel: kthread+0x2f5/0x3d0
> [ 30.814506] mikhail-laptop kernel: ?
> _raw_spin_unlock_irq+0x28/0x60
> [ 30.814506] mikhail-laptop kernel: ? __pfx_kthread+0x10/0x10
> [ 30.814506] mikhail-laptop kernel: ret_from_fork+0x34/0x70
> [ 30.814506] mikhail-laptop kernel: ? __pfx_kthread+0x10/0x10
> [ 30.814506] mikhail-laptop kernel: ret_from_fork_asm+0x1b/0x30
> [ 30.814506] mikhail-laptop kernel: </TASK>
> [ 30.814506] mikhail-laptop kernel: Modules linked in: sunrpc
> snd_hda_codec_realtek binfmt_misc snd_hda_codec_generic
> snd_hda_codec_hdmi intel_rapl_msr intel_rapl_common
> snd_sof_amd_vangogh snd_sof_amd_rembrandt snd_sof_amd_renoir mt7921e
> snd_sof_amd_acp mt7921_common snd_sof_pci snd_sof_xtensa_dsp
> mt792x_lib snd_sof mt76_connac_lib mt76 edac_mce_amd snd_sof_utils
> snd_soc_core kvm_amd mac80211 btusb snd_hda_intel snd_intel_dspcfg
> btrtl snd_intel_sdw_acpi btintel snd_compress btbcm ac97_bus
> snd_hda_codec snd_pcm_dmaengine btmtk kvm snd_pci_ps snd_hda_core
> vfat
> snd_rpl_pci_acp6x snd_pci_acp6x fat snd_hwdep bluetooth snd_seq
> snd_seq_device libarc4 snd_pcm snd_pci_acp5x irqbypass
> snd_rn_pci_acp3x cfg80211 snd_timer rapl asus_nb_wmi snd_acp_config
> wmi_bmof snd_soc_acpi snd pcspkr k10temp snd_pci_acp3x i2c_piix4
> soundcore amd_pmc asus_wireless joydev loop zram amdgpu hid_asus
> asus_wmi ledtrig_audio sparse_keymap platform_profile i2c_algo_bit
> drm_ttm_helper ttm crct10dif_pclmul crc32_pclmul rfkill crc32c_intel
> drm_exec drm_suballoc_helper
> [ 30.814506] mikhail-laptop kernel: polyval_clmulni amdxcp
> polyval_generic ucsi_acpi drm_buddy hid_multitouch typec_ucsi video
> gpu_sched nvme ghash_clmulni_intel sha512_ssse3 nvme_core
> drm_display_helper serio_raw ccp r8169 i2c_hid_acpi sp5100_tco cec
> nvme_common typec wmi i2c_hid ip6_tables ip_tables fuse
> [ 30.814727] mikhail-laptop kernel: ---[ end trace 0000000000000000
> ]---
> [ 30.814728] mikhail-laptop kernel: RIP:
> 0010:mt7921_regd_notifier+0x3e2/0x7d0 [mt7921_common]
> [ 30.814734] mikhail-laptop kernel: Code: c1 ea 03 80 3c 02 00 0f
> 85
> ec 03 00 00 4d 8b b4 24 d0 01 00 00 48 b8 00 00 00 00 00 fc ff df 49
> 8d 7e 14 48 89 fa 48 c1 ea 03 <0f> b6 14 02 48 89 f8 83 e0 07 83 c0
> 03
> 38 d0 7c 08 84 d2 0f 85 f9
> [ 30.814735] mikhail-laptop kernel: RSP: 0018:ffffc900013577c8
> EFLAGS: 00010213
> [ 30.814737] mikhail-laptop kernel: RAX: dffffc0000000000 RBX:
> ffff8881a61326e8 RCX: ffff8882398632c0
> [ 30.814738] mikhail-laptop kernel: RDX: 0000000000000002 RSI:
> ffffed104730c696 RDI: 0000000000000014
> [ 30.814739] mikhail-laptop kernel: RBP: 000000000000001c R08:
> ffff88823986c411 R09: 1ffff1104730d882
> [ 30.814740] mikhail-laptop kernel: R10: ffffc90001357717 R11:
> 0000000000000001 R12: ffff8882398607a0
> [ 30.814741] mikhail-laptop kernel: R13: 0000000000000000 R14:
> 0000000000000000 R15: dffffc0000000000
> [ 30.814742] mikhail-laptop kernel: FS: 0000000000000000(0000)
> GS:ffff888f8f200000(0000) knlGS:0000000000000000
> [ 30.814743] mikhail-laptop kernel: CS: 0010 DS: 0000 ES: 0000
> CR0:
> 0000000080050033
> [ 30.814744] mikhail-laptop kernel: CR2: 00007f6bad1f74c0 CR3:
> 0000000130a70000 CR4: 0000000000f50ee0
> [ 30.814745] mikhail-laptop kernel: PKRU: 55555554
> [ 30.850952] mikhail-laptop kernel:
> ==================================================================
> [ 30.909383] mikhail-laptop kernel: BUG: KASAN: slab-use-after-free
> in mutex_can_spin_on_owner+0x191/0x1c0
> [ 30.909383] mikhail-laptop kernel: Read of size 4 at addr
> ffff88810c44b834 by task iw/1099
> [ 30.909383] mikhail-laptop kernel:
> [ 30.909383] mikhail-laptop kernel: CPU: 13 PID: 1099 Comm: iw
> Tainted: G D W L
> 6.6.0-rc6-01-daee7aaba8491e64911438696c5f3f7cb77edf5e+ #127
> [ 30.909383] mikhail-laptop kernel: Hardware name: ASUSTeK COMPUTER
> INC. ROG Strix G513QY_G513QY/G513QY, BIOS G513QY.331 02/24/2023
> [ 30.909383] mikhail-laptop kernel: Call Trace:
> [ 30.909383] mikhail-laptop kernel: <TASK>
> [ 30.914381] mikhail-laptop kernel: dump_stack_lvl+0x76/0xd0
> [ 30.914381] mikhail-laptop kernel: print_report+0xcf/0x670
> [ 30.914381] mikhail-laptop kernel: ?
> mutex_can_spin_on_owner+0x191/0x1c0
> [ 30.914381] mikhail-laptop kernel: kasan_report+0xa6/0xe0
> [ 30.914381] mikhail-laptop kernel: ?
> mutex_can_spin_on_owner+0x191/0x1c0
> [ 30.914381] mikhail-laptop
> kernel: mutex_can_spin_on_owner+0x191/0x1c0
> [ 30.914381] mikhail-laptop kernel: __mutex_lock+0x26a/0x18b0
> [ 30.914381] mikhail-laptop kernel: ? nl80211_pre_doit+0x92/0x750
> [cfg80211]
> [ 30.925389] mikhail-laptop kernel: ?
> __nla_validate_parse+0xeb5/0x2430
> [ 30.925389] mikhail-laptop kernel: ? __pfx___mutex_lock+0x10/0x10
> [ 30.925389] mikhail-laptop kernel: ?
> __pfx___nla_validate_parse+0x10/0x10
> [ 30.925389] mikhail-laptop kernel: ? kasan_set_track+0x25/0x30
> [ 30.925389] mikhail-laptop kernel: ? nl80211_pre_doit+0x92/0x750
> [cfg80211]
> [ 30.931386] mikhail-laptop kernel: ?
> genl_family_rcv_msg_attrs_parse.isra.0+0x150/0x230
> [ 30.931386] mikhail-laptop kernel: nl80211_pre_doit+0x92/0x750
> [cfg80211]
> [ 30.934928] mikhail-laptop
> kernel: genl_family_rcv_msg_doit+0x1b1/0x2c0
> [ 30.934928] mikhail-laptop kernel: ?
> __pfx_genl_family_rcv_msg_doit+0x10/0x10
> [ 30.934928] mikhail-laptop kernel: ?
> __pfx_bpf_lsm_capable+0x10/0x10
> [ 30.934928] mikhail-laptop kernel: ? security_capable+0x74/0xb0
> [ 30.938381] mikhail-laptop kernel: genl_rcv_msg+0x434/0x700
> [ 30.938381] mikhail-laptop kernel: ? __pfx_genl_rcv_msg+0x10/0x10
> [ 30.938381] mikhail-laptop kernel: ?
> __pfx_nl80211_pre_doit+0x10/0x10 [cfg80211]
> [ 30.938381] mikhail-laptop kernel: ?
> __pfx_nl80211_req_set_reg+0x10/0x10 [cfg80211]
> [ 30.938381] mikhail-laptop kernel: ?
> __pfx_nl80211_post_doit+0x10/0x10 [cfg80211]
> [ 30.938381] mikhail-laptop kernel: netlink_rcv_skb+0x140/0x3b0
> [ 30.938381] mikhail-laptop kernel: ? __pfx_genl_rcv_msg+0x10/0x10
> [ 30.938381] mikhail-laptop kernel: ?
> __pfx_netlink_rcv_skb+0x10/0x10
> [ 30.938381] mikhail-laptop kernel: ?
> __pfx_lock_acquired+0x10/0x10
> [ 30.938381] mikhail-laptop kernel: ? __pfx_down_read+0x10/0x10
> [ 30.938381] mikhail-laptop kernel: ?
> netlink_deliver_tap+0xd0/0xaf0
> [ 30.938381] mikhail-laptop kernel: ?
> netlink_deliver_tap+0x13d/0xaf0
> [ 30.938381] mikhail-laptop kernel: genl_rcv+0x28/0x40
> [ 30.938381] mikhail-laptop kernel: netlink_unicast+0x42f/0x730
> [ 30.938381] mikhail-laptop kernel: ?
> __pfx_netlink_unicast+0x10/0x10
> [ 30.938381] mikhail-laptop kernel: netlink_sendmsg+0x7ce/0xca0
> [ 30.938381] mikhail-laptop kernel: ?
> __pfx_netlink_sendmsg+0x10/0x10
> [ 30.938381] mikhail-laptop kernel: ?
> __pfx_netlink_sendmsg+0x10/0x10
> [ 30.938381] mikhail-laptop kernel: ____sys_sendmsg+0x985/0xc50
> [ 30.938381] mikhail-laptop kernel: ?
> __pfx_____sys_sendmsg+0x10/0x10
> [ 30.938381] mikhail-laptop kernel: ?
> __pfx_copy_msghdr_from_user+0x10/0x10
> [ 30.938381] mikhail-laptop kernel: ___sys_sendmsg+0x105/0x190
> [ 30.938381] mikhail-laptop kernel: ?
> __pfx____sys_sendmsg+0x10/0x10
> [ 30.938381] mikhail-laptop kernel: ? rcu_is_watching+0x15/0xb0
> [ 30.938381] mikhail-laptop kernel: ? __pfx_lock_release+0x10/0x10
> [ 30.938381] mikhail-laptop kernel: ? __fget_light+0x51/0x220
> [ 30.938381] mikhail-laptop kernel: __sys_sendmsg+0xeb/0x190
> [ 30.938381] mikhail-laptop kernel: ?
> __pfx___sys_sendmsg+0x10/0x10
> [ 30.938381] mikhail-laptop kernel: ?
> __pfx___seccomp_filter+0x10/0x10
> [ 30.938381] mikhail-laptop kernel: do_syscall_64+0x60/0x90
> [ 30.938381] mikhail-laptop kernel: ?
> irqentry_exit_to_user_mode+0xe/0x40
> [ 30.938381] mikhail-laptop kernel: ? rcu_is_watching+0x15/0xb0
> [ 30.938381] mikhail-laptop kernel: ?
> irqentry_exit_to_user_mode+0xe/0x40
> [ 30.938381] mikhail-laptop kernel: ?
> trace_hardirqs_on_prepare+0xe3/0x100
> [ 30.938381] mikhail-laptop
> kernel: entry_SYSCALL_64_after_hwframe+0x6e/0xd8
> [ 30.938381] mikhail-laptop kernel: RIP: 0033:0x7fc76d7b5414
> [ 30.938381] mikhail-laptop kernel: Code: 15 21 0a 0c 00 f7 d8 64
> 89
> 02 b8 ff ff ff ff eb bf 0f 1f 44 00 00 f3 0f 1e fa 80 3d 55 8f 0c 00
> 00 74 13 b8 2e 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 4c c3 0f 1f 00
> 55
> 48 89 e5 48 83 ec 20 89 55
> [ 30.938381] mikhail-laptop kernel: RSP: 002b:00007ffe62882bf8
> EFLAGS: 00000202 ORIG_RAX: 000000000000002e
> [ 30.938381] mikhail-laptop kernel: RAX: ffffffffffffffda RBX:
> 0000559cfddbf390 RCX: 00007fc76d7b5414
> [ 30.938381] mikhail-laptop kernel: RDX: 0000000000000000 RSI:
> 00007ffe62882c30 RDI: 0000000000000003
> [ 30.938381] mikhail-laptop kernel: RBP: 00007ffe62882c20 R08:
> 0000559cfddbf010 R09: 0000000000000007
> [ 30.938381] mikhail-laptop kernel: R10: 0000559cfddbf2a0 R11:
> 0000000000000202 R12: 0000559cfddc4780
> [ 30.938381] mikhail-laptop kernel: R13: 0000559cfddc48c0 R14:
> 00007ffe62882c30 R15: 0000559cfddc48c0
> [ 30.938381] mikhail-laptop kernel: </TASK>
> [ 30.938381] mikhail-laptop kernel:
> [ 30.938381] mikhail-laptop kernel: Allocated by task 2:
> [ 30.938381] mikhail-laptop kernel: kasan_save_stack+0x33/0x60
> [ 30.938381] mikhail-laptop kernel: kasan_set_track+0x25/0x30
> [ 30.938381] mikhail-laptop kernel: __kasan_slab_alloc+0x6e/0x70
> [ 30.938381] mikhail-laptop
> kernel: kmem_cache_alloc_node+0x18d/0x420
> [ 30.938381] mikhail-laptop kernel: copy_process+0x3be/0x6910
> [ 30.938381] mikhail-laptop kernel: kernel_clone+0xc8/0x710
> [ 30.938381] mikhail-laptop kernel: kernel_thread+0xb4/0xf0
> [ 30.938381] mikhail-laptop kernel: kthreadd+0x9c7/0xe00
> [ 30.938381] mikhail-laptop kernel: ret_from_fork+0x34/0x70
> [ 30.938381] mikhail-laptop kernel: ret_from_fork_asm+0x1b/0x30
> [ 30.938381] mikhail-laptop kernel:
> [ 30.938381] mikhail-laptop kernel: Freed by task 54:
> [ 30.938381] mikhail-laptop kernel: kasan_save_stack+0x33/0x60
> [ 30.938381] mikhail-laptop kernel: kasan_set_track+0x25/0x30
> [ 30.938381] mikhail-laptop kernel: kasan_save_free_info+0x2b/0x50
> [ 30.938381] mikhail-laptop kernel: __kasan_slab_free+0x10b/0x1a0
> [ 30.938381] mikhail-laptop
> kernel: slab_free_freelist_hook+0x12b/0x1e0
> [ 30.938381] mikhail-laptop kernel: kmem_cache_free+0x174/0x480
> [ 30.938381] mikhail-laptop
> kernel: delayed_put_task_struct+0x162/0x1c0
> [ 30.938381] mikhail-laptop kernel: rcu_do_batch+0x448/0x1700
> [ 30.938381] mikhail-laptop kernel: rcu_core+0x880/0xdb0
> [ 30.938381] mikhail-laptop kernel: __do_softirq+0x21b/0x8bb
> [ 30.938381] mikhail-laptop kernel:
> [ 30.938381] mikhail-laptop kernel: Last potentially related work
> creation:
> [ 30.938381] mikhail-laptop kernel: kasan_save_stack+0x33/0x60
> [ 30.938381] mikhail-laptop
> kernel: __kasan_record_aux_stack+0x94/0xa0
> [ 30.938381] mikhail-laptop
> kernel: __call_rcu_common.constprop.0+0xf8/0x1af0
> [ 30.938381] mikhail-laptop kernel: __schedule+0x10b4/0x5e90
> [ 30.938381] mikhail-laptop kernel: schedule_idle+0x60/0x90
> [ 30.938381] mikhail-laptop kernel: do_idle+0x294/0x450
> [ 30.938381] mikhail-laptop kernel: cpu_startup_entry+0x55/0x60
> [ 30.938381] mikhail-laptop kernel: start_secondary+0x215/0x290
> [ 30.938381] mikhail-laptop kernel:
> secondary_startup_64_no_verify+0x17d/0x18b
> [ 30.938381] mikhail-laptop kernel:
> [ 30.938381] mikhail-laptop kernel: The buggy address belongs to
> the
> object at ffff88810c44b800
> which belongs to the cache
> task_struct of size 14040
> [ 30.938381] mikhail-laptop kernel: The buggy address is located 52
> bytes inside of
> freed 14040-byte region
> [ffff88810c44b800, ffff88810c44eed8)
> [ 30.938381] mikhail-laptop kernel:
> [ 30.938381] mikhail-laptop kernel: The buggy address belongs to
> the
> physical page:
> [ 30.938381] mikhail-laptop kernel: page:00000000c8764bb1
> refcount:1
> mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x10c448
> [ 30.938381] mikhail-laptop kernel: head:00000000c8764bb1 order:3
> entire_mapcount:0 nr_pages_mapped:0 pincount:0
> [ 30.938381] mikhail-laptop kernel: flags:
> 0x17ffffc0000840(slab|head|node=0|zone=2|lastcpupid=0x1fffff)
> [ 30.938381] mikhail-laptop kernel: page_type: 0xffffffff()
> [ 30.938381] mikhail-laptop kernel: raw: 0017ffffc0000840
> ffff8881081d5cc0 dead000000000122 0000000000000000
> [ 30.938381] mikhail-laptop kernel: raw: 0000000000000000
> 0000000000020002 00000001ffffffff 0000000000000000
> [ 30.938381] mikhail-laptop kernel: page dumped because: kasan: bad
> access detected
> [ 30.938381] mikhail-laptop kernel:
> [ 30.938381] mikhail-laptop kernel: Memory state around the buggy
> address:
> [ 30.938381] mikhail-laptop kernel: ffff88810c44b700: fc fc fc fc
> fc fc fc fc fc fc fc fc fc fc fc fc
> [ 30.938381] mikhail-laptop kernel: ffff88810c44b780: fc fc fc fc
> fc fc fc fc fc fc fc fc fc fc fc fc
> [ 30.938381] mikhail-laptop kernel: >ffff88810c44b800: fa fb fb fb
> fb fb fb fb fb fb fb fb fb fb fb fb
> [ 30.938381] mikhail-laptop
> kernel: ^
> [ 30.938381] mikhail-laptop kernel: ffff88810c44b880: fb fb fb fb
> fb fb fb fb fb fb fb fb fb fb fb fb
> [ 30.938381] mikhail-laptop kernel: ffff88810c44b900: fb fb fb fb
> fb fb fb fb fb fb fb fb fb fb fb fb
> [ 30.938381] mikhail-laptop kernel:
> ==================================================================
>
> And bisect says that this commit blame in inability to boot system:
> ❯ git bisect good
> 09382d8f8641bc12fffc41a93eb9b37be0e653c0 is the first bad commit
> commit 09382d8f8641bc12fffc41a93eb9b37be0e653c0
> Author: Ming Yen Hsieh <[email protected]>
> Date: Sat Sep 30 10:25:09 2023 +0800
>
> wifi: mt76: mt7921: update the channel usage when the regd domain
> changed
>
> The 5.9/6GHz channel license of a certain platform device has
> been
> regulated in various countries. That may be difference with
> standard
> Liunx regulatory domain settings. In this case, when
> .reg_notifier()
> called for regulatory change, mt792x chipset should update the
> channel
> usage based on clc or dts configurations.
>
> Channel would be disabled by following cases.
> * clc report the particular UNII-x is disabled.
> * dts enabled and the channel is not configured.
>
> Signed-off-by: Ming Yen Hsieh <[email protected]>
> Co-developed-by: Deren Wu <[email protected]>
> Signed-off-by: Deren Wu <[email protected]>
> Signed-off-by: Felix Fietkau <[email protected]>
>
> drivers/net/wireless/mediatek/mt76/eeprom.c | 7 +++-
> drivers/net/wireless/mediatek/mt76/mt76.h | 5 +++
> drivers/net/wireless/mediatek/mt76/mt7921/init.c | 51
> ++++++++++++++++++++++++
> drivers/net/wireless/mediatek/mt76/mt7921/mcu.c | 3 ++
> 4 files changed, 64 insertions(+), 2 deletions(-)
>
> I will be grateful if anyone can tell me which commit fix it and I
> can
> continue bisect the original problem.
> Hardware specs: https://linux-hardware.org/?probe=85a38e7906
>
> --
> Best Regards,
> Mike Gavrilov.
>

2023-11-06 13:40:48

by Mikhail Gavrilov

[permalink] [raw]
Subject: Re: 6.7/kasan/regression/bisected: mt7921_regd_notifier+0x3e2

On Sat, Nov 4, 2023 at 3:37 AM Deren Wu (武德仁) <[email protected]> wrote:
>
>
> Based on this stacktrace, I think we have the patch here.
>
> https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/drivers/net/wireless/mediatek/mt76?id=169b7acb847e8dc656cd2289a91ff668f72405a0
>
> Please help to verify your issue with this patch.

Thanks, I can confirm that this fixes KASAN: null-ptr-deref in range
[0x0000000000000178-0x000000000000017f] at mt7921_regd_notifier+0x3e2

Tested-by: Mikhail Gavrilov <[email protected]>

--
Best Regards,
Mike Gavrilov.