2022-03-11 21:43:22

by kernel test robot

[permalink] [raw]
Subject: [bpf] 4d06f388e1: WARNING:at_kernel/bpf/core.c:#bpf_prog_pack_free



Greeting,

FYI, we noticed the following commit (built with gcc-9):

commit: 4d06f388e14c69d938cbc1e4081029c14d8bc654 ("[PATCH v3 bpf-next] bpf: select proper size for bpf_prog_pack")
url: https://github.com/0day-ci/linux/commits/Song-Liu/bpf-select-proper-size-for-bpf_prog_pack/20220310-023737
base: https://git.kernel.org/cgit/linux/kernel/git/bpf/bpf-next.git master
patch link: https://lore.kernel.org/bpf/[email protected]

in testcase: stress-ng
version: stress-ng-x86_64-0.11-06_20220309
with following parameters:

nr_threads: 100%
testtime: 60s
class: memory
test: stream
cpufreq_governor: performance
ucode: 0xd000331



on test machine: 128 threads 2 sockets Intel(R) Xeon(R) Platinum 8358 CPU @ 2.60GHz with 128G memory

caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):



If you fix the issue, kindly add following tag
Reported-by: kernel test robot <[email protected]>


[ 18.435242][ T767] WARNING: CPU: 2 PID: 767 at kernel/bpf/core.c:946 bpf_prog_pack_free (kernel/bpf/core.c:946 (discriminator 3))
[ 18.444373][ T767] Modules linked in: ip_tables
[ 18.444375][ T767] CPU: 2 PID: 767 Comm: kworker/2:1 Not tainted 5.17.0-rc6-01953-g4d06f388e14c #1
[ 18.444378][ T767] Workqueue: events bpf_prog_free_deferred
[ 18.444380][ T767] RIP: 0010:bpf_prog_pack_free (kernel/bpf/core.c:946 (discriminator 3))
[ 18.444382][ T767] Code: ed 61 f6 ff 48 89 ef e8 e5 21 11 00 eb 9f 80 3d 7a dd b5 01 00 75 96 48 c7 c7 22 e1 56 82 c6 05 6a dd b5 01 01 e8 86 31 a4 00 <0f> 0b e9 7c ff ff ff 0f 0b 0f 0b b9 ff ff ff ff e9 59 ff ff ff 66
All code
========
0: ed in (%dx),%eax
1: 61 (bad)
2: f6 ff idiv %bh
4: 48 89 ef mov %rbp,%rdi
7: e8 e5 21 11 00 callq 0x1121f1
c: eb 9f jmp 0xffffffffffffffad
e: 80 3d 7a dd b5 01 00 cmpb $0x0,0x1b5dd7a(%rip) # 0x1b5dd8f
15: 75 96 jne 0xffffffffffffffad
17: 48 c7 c7 22 e1 56 82 mov $0xffffffff8256e122,%rdi
1e: c6 05 6a dd b5 01 01 movb $0x1,0x1b5dd6a(%rip) # 0x1b5dd8f
25: e8 86 31 a4 00 callq 0xa431b0
2a:* 0f 0b ud2 <-- trapping instruction
2c: e9 7c ff ff ff jmpq 0xffffffffffffffad
31: 0f 0b ud2
33: 0f 0b ud2
35: b9 ff ff ff ff mov $0xffffffff,%ecx
3a: e9 59 ff ff ff jmpq 0xffffffffffffff98
3f: 66 data16

Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: e9 7c ff ff ff jmpq 0xffffffffffffff83
7: 0f 0b ud2
9: 0f 0b ud2
b: b9 ff ff ff ff mov $0xffffffff,%ecx
10: e9 59 ff ff ff jmpq 0xffffffffffffff6e
15: 66 data16
[ 18.444384][ T767] RSP: 0000:ffa00000086a7e30 EFLAGS: 00010286
[ 18.444386][ T767] RAX: 0000000000000000 RBX: ffffffffc0400000 RCX: c0000000ffff7fff
[ 18.444387][ T767] RDX: ffa00000086a7c58 RSI: 00000000ffff7fff RDI: 0000000000000000
[ 18.444388][ T767] RBP: ffffffff82c4c5f0 R08: 0000000000000000 R09: ffa00000086a7c50
[ 18.444389][ T767] R10: 0000000000000001 R11: 0000000000000001 R12: ffffffffc0600640
[ 18.444389][ T767] R13: ff1100103f2af400 R14: 0000000000000000 R15: ff1100103f2af405
[ 18.444390][ T767] FS: 0000000000000000(0000) GS:ff1100103f280000(0000) knlGS:0000000000000000
[ 18.444391][ T767] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 18.444392][ T767] CR2: 000055c0f768b248 CR3: 000000010005e002 CR4: 0000000000771ee0
[ 18.444393][ T767] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 18.444393][ T767] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 18.444394][ T767] PKRU: 55555554
[ 18.444395][ T767] Call Trace:
[ 18.444397][ T767] <TASK>
[ 18.444398][ T767] bpf_jit_binary_pack_free (kernel/bpf/core.c:1151)
[ 18.444400][ T767] bpf_jit_free+0x6c/0x80
[ 18.444401][ T767] process_one_work (kernel/workqueue.c:2307)
[ 18.444405][ T767] worker_thread (include/linux/list.h:292 kernel/workqueue.c:2455)
[ 18.444407][ T767] ? process_one_work (kernel/workqueue.c:2397)
[ 18.444409][ T767] kthread (kernel/kthread.c:377)
[ 18.444410][ T767] ? kthread_complete_and_exit (kernel/kthread.c:332)
[ 18.444412][ T767] ret_from_fork (arch/x86/entry/entry_64.S:295)
[ 18.444417][ T767] </TASK>
[ 18.444417][ T767] ---[ end trace 0000000000000000 ]---
[ 18.789890][ T1318] IPMI message handler: version 39.2
9mHuge Pages Fil[ 18.835704][ T1345] ioatdma: Intel(R) QuickData Technology Driver 5.00
[ 18.841494][ T1324] ipmi_si IPI0001:00: ipmi_platform: probing via ACPI
[[ 18.855954][ T1324] ipmi_si IPI0001:00: ipmi_platform: [io 0x0ca2-0x0ca3] regsize 1 spacing 1 irq 0
and Kernel File[ 18.875833][ T1323] ACPI: bus type drm_connector registered
[ 18.883069][ T1324] ipmi_si: Adding ACPI-specified kcs state machine
[ 18.890060][ T1324] ipmi_si: Trying ACPI-specified kcs state machine at i/o address 0xca2, slave address 0x0, irq 0
[ 18.892421][ T1317] power_meter ACPI000D:00: Found ACPI power meter.
[ 18.906871][ T1317] power_meter ACPI000D:00: Ignoring unsafe software power cap!
Starting[ 18.914268][ T1317] power_meter ACPI000D:00: hwmon_device_register() is deprecated. Please convert the driver to use hwmon_device_register_with_info().
[[ 18.936700][ T1174] ahci 0000:00:17.0: version 3.0
[ 18.952686][ T1174] ahci 0000:00:17.0: flags: 64bit ncq sntf led clo only pio slum part ems deso sadm sds apst
[ 18.953557][ T1369] RAPL PMU: API unit is 2^-32 Joules, 2 fixed counters, 655360 ms ovfl timer
[ 18.961014][ T1324] ipmi_si IPI0001:00: IPMI message handler: Found new BMC (man_id: 0x000157, prod_id: 0x0098, dev_id: 0x22)
[ 18.982661][ T1369] RAPL PMU: hw unit of domain package 2^-14 Joules
[ 18.989691][ T1174] scsi host0: ahci
;39mLoad Kernel [ 19.000736][ T1174] scsi host1: ahci
[ 19.005180][ T1174] scsi host2: ahci
Starting [ 19.010631][ T1174] scsi host3: ahci
[ 19.019744][ T1324] ipmi_si IPI0001:00: IPMI kcs interface initialized
[0m...
[ 19.031746][ T1174] scsi host6: ahci
[ 19.041630][ T1174] ata1: SATA max UDMA/133 abar m524288@0x93100000 port 0x93100100 irq 405
[ 19.049974][ T1174] ata2: SATA max UDMA/133 abar m524288@0x93100000 port 0x93100180 irq 405
[ 19.058319][ T1174] ata3: SATA max UDMA/133 abar m524288@0x93100000 port 0x93100200 irq 405
[ 19.066665][ T1174] ata4: SATA max UDMA/133 abar m524288@0x93100000 port 0x93100280 irq 405
[ 19.075012][ T1174] ata5: SATA max UDMA/133 abar m524288@0x93100000 port 0x93100300 irq 405
[ 19.083358][ T1174] ata6: SATA max UDMA/133 abar m524288@0x93100000 port 0x93100380 irq 405
[ 19.091702][ T1174] ata7: SATA max UDMA/133 abar m524288@0x93100000 port 0x93100400 irq 405
[ 19.100048][ T1174] ata8: SATA max UDMA/133 abar m524288@0x93100000 port 0x93100480 irq 405
39mKernel Configuration File Sys[ 19.109981][ T1174] mei_me 0000:00:16.0: Device doesn't have valid ME Interface
[ 19.261650][ T17] intel_rapl_common: Found RAPL domain dram
[ 19.267390][ T17] intel_rapl_common: DRAM domain energy unit 15300pj
[ 19.273915][ T17] intel_rapl_common: RAPL package-0 domain package locked by BIOS
[0m.
[ 19.284649][ T173] intel_rapl_common: Found RAPL domain package
[ 19.290669][ T173] intel_rapl_common: Found RAPL domain dram
[ 19.290671][ T173] intel_rapl_common: DRAM domain energy unit 15300pj
[ 19.290673][ T173] intel_rapl_common: RAPL package-1 domain package locked by BIOS
[ 19.424065][ T1493] ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[ 19.430950][ T1507] ata8: SATA link down (SStatus 0 SControl 300)
[ 19.437060][ T1503] ata6: SATA link down (SStatus 0 SControl 300)
[ 19.443169][ T1505] ata7: SATA link down (SStatus 0 SControl 300)
[ 19.449282][ T1495] ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[ 19.456170][ T1497] ata3: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[ 19.463064][ T1499] ata4: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[ 19.469951][ T1493] ata1.00: ATA-10: INTEL SSDSC2KB240G8, XCV10120, max UDMA/133
[ 19.477346][ T1493] ata1.00: 468862128 sectors, multi 1: LBA48 NCQ (depth 32)
[ 19.484506][ T1499] ata4.00: ATA-10: WDC WD20SPZX-08UA7, 02.01A02, max UDMA/133
[ 19.491816][ T1497] ata3.00: ATA-9: INTEL SSDSC2BB800G4, D2010370, max UDMA/133
[ 19.499128][ T1497] ata3.00: 1562824368 sectors, multi 1: LBA48 NCQ (depth 32)
[ 19.506351][ T1495] ata2.00: ATA-9: INTEL SSDSC2BA400G4, G2010150, max UDMA/133
[ 19.513659][ T1495] ata2.00: 781422768 sectors, multi 1: LBA48 NCQ (depth 32)
[ 19.520805][ T1493] ata1.00: configured for UDMA/133
[0m.
[ 19.526310][ T721] scsi 0:0:0:0: Direct-Access ATA INTEL SSDSC2KB24 0120 PQ: 0 ANSI: 5
[ 19.526457][ T1495] ata2.00: configured for UDMA/133
[ 19.540115][ T1497] ata3.00: configured for UDMA/133
[ 19.540121][ T1499] ata4.00: 3907029168 sectors, multi 16: LBA48 NCQ (depth 32), AA
[ 19.540124][ T1499] ata4.00: Features: NCQ-prio
[ 19.540446][ T656] scsi 1:0:0:0: Direct-Access ATA INTEL SSDSC2BA40 0150 PQ: 0 ANSI: 5
1;39mNetwork Tim[ 19.583118][ T682] scsi 3:0:0:0: Direct-Access ATA WDC WD20SPZX-08U 1A02 PQ: 0 ANSI: 5
[ 19.607486][ T1263] scsi 0:0:0:0: Attached scsi generic sg0 type 0
[ 19.613842][ T1263] scsi 1:0:0:0: Attached scsi generic sg1 type 0
[ 19.659635][ T740] ata2.00: Enabling discard_zeroes_data
[ 19.665037][ T677] ata3.00: Enabling discard_zeroes_data
[ 19.670457][ T684] sd 3:0:0:0: [sdd] 3907029168 512-byte logical blocks: (2.00 TB/1.82 TiB)
[ 19.670457][ T740] sd 1:0:0:0: [sdb] 781422768 512-byte logical blocks: (400 GB/373 GiB)
[ 19.670457][ T677] sd 2:0:0:0: [sdc] 1562824368 512-byte logical blocks: (800 GB/745 GiB)
[ 19.670460][ T740] sd 1:0:0:0: [sdb] 4096-byte physical blocks
[ 19.670460][ T677] sd 2:0:0:0: [sdc] 4096-byte physical blocks
[ 19.670464][ T677] sd 2:0:0:0: [sdc] Write Protect is off
[ 19.670465][ T677] sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00
[ 19.670472][ T677] sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[ 19.671106][ T677] ata3.00: Enabling discard_zeroes_data
[ 19.671113][ T679] sd 0:0:0:0: [sda] 468862128 512-byte logical blocks: (240 GB/224 GiB)
[ 19.671115][ T679] sd 0:0:0:0: [sda] 4096-byte physical blocks
[ 19.671119][ T679] sd 0:0:0:0: [sda] Write Protect is off
[ 19.671120][ T679] sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
[ 19.671125][ T679] sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[ 19.671893][ T679] ata1.00: Enabling discard_zeroes_data
[ 19.674306][ T679] sda: sda1 sda2
[ 19.674488][ T679] ata1.00: Enabling discard_zeroes_data
[ 19.674517][ T679] sd 0:0:0:0: [sda] Attached SCSI disk
[ 19.677790][ T677] sdc: sdc1 sdc2 sdc3
[ 19.677979][ T677] ata3.00: Enabling discard_zeroes_data
[ 19.678012][ T677] sd 2:0:0:0: [sdc] Attached SCSI disk
[ 19.678886][ T684] sd 3:0:0:0: [sdd] 4096-byte physical blocks
[ 19.678893][ T684] sd 3:0:0:0: [sdd] Write Protect is off
[ 19.687063][ T740] sd 1:0:0:0: [sdb] Write Protect is off
[ 19.695318][ T684] sd 3:0:0:0: [sdd] Mode Sense: 00 3a 00 00
[ 19.701237][ T740] sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00


To reproduce:

git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file

# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.



---
0-DAY CI Kernel Test Service
https://lists.01.org/hyperkitty/list/[email protected]

Thanks,
Oliver Sang


Attachments:
(No filename) (11.97 kB)
config-5.17.0-rc6-01953-g4d06f388e14c (164.77 kB)
job-script (8.13 kB)
dmesg.xz (36.93 kB)
stress-ng (14.44 kB)
job.yaml (5.51 kB)
reproduce (350.00 B)
Download all attachments