2021-05-21 20:09:08

by kernel test robot

Subject: [clocksource] 8901ecc231: stress-ng.lockbus.ops_per_sec -9.5% regression



Greetings,

FYI, we noticed a -9.5% regression of stress-ng.lockbus.ops_per_sec due to commit:


commit: 8901ecc2315b850f35a7b8c1b73b12388b72aa78 ("clocksource: Retry clock read if long delays detected")
https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master


in testcase: stress-ng
on test machine: 96 threads 2 sockets Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz with 192G memory
with following parameters:

nr_threads: 100%
disk: 1HDD
testtime: 60s
class: memory
test: lockbus
cpufreq_governor: performance
ucode: 0x5003006


please note the following in dmesg.xz (attached):
[ 28.110351]
[ 28.302357] hrtimer: interrupt took 1878423 ns
[ 29.690760] clocksource: timekeeping watchdog on CPU53: hpet read-back delay of 169583ns, attempt 4, marking unstable
[ 29.860306] tsc: Marking TSC unstable due to clocksource watchdog
[ 30.559390] TSC found unstable after boot, most likely due to broken BIOS. Use 'tsc=unstable'.
[ 30.726282] sched_clock: Marking unstable (30052964508, 499342225)<-(30915547410, -363240730)
[ 31.620401] clocksource: Switched to clocksource hpet



If you fix the issue, kindly add the following tag:
Reported-by: kernel test robot <[email protected]>


Details are as below:
-------------------------------------------------------------------------------------------------->


To reproduce:

git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
bin/lkp run generated-yaml-file

=========================================================================================
class/compiler/cpufreq_governor/disk/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime/ucode:
memory/gcc-9/performance/1HDD/x86_64-rhel-8.3/100%/debian-10.4-x86_64-20200603.cgz/lkp-csl-2sp5/lockbus/stress-ng/60s/0x5003006

commit:
v5.13-rc1
8901ecc231 ("clocksource: Retry clock read if long delays detected")

v5.13-rc1 8901ecc2315b850f35a7b8c1b73
---------------- ---------------------------
%stddev %change %stddev
\ | \
248755 -9.5% 225157 stress-ng.lockbus.ops
4136 -9.5% 3742 stress-ng.lockbus.ops_per_sec
6580 ? 5% +28.7% 8470 stress-ng.time.percent_of_cpu_this_job_got
200.39 ? 7% +61.5% 323.71 ? 7% stress-ng.time.system_time
3892 ? 5% +27.1% 4946 stress-ng.time.user_time
13.38 -6.7% 12.48 ? 5% boot-time.dhcp
208168 ? 14% +676.6% 1616642 ? 24% cpuidle.POLL.time
674.50 ? 12% -17.4% 557.33 ? 10% interrupts.CPU95.CAL:Function_call_interrupts
170270 ? 7% +22.1% 207860 ? 6% softirqs.RCU
225.06 +1.3% 227.89 turbostat.PkgWatt
61.83 ? 6% +28.0% 79.17 vmstat.cpu.us
28.52 ? 13% -61.1% 11.10 ? 6% iostat.cpu.system
63.82 ? 5% +27.6% 81.41 iostat.cpu.user
16.30 ? 21% -15.5 0.83 ? 42% mpstat.cpu.all.irq%
57.65 ? 7% +17.1 74.73 mpstat.cpu.all.usr%
0.00 ? 75% +286.7% 0.01 ? 35% perf-sched.sch_delay.avg.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.01 ? 14% +140.0% 0.02 ? 13% perf-sched.sch_delay.avg.ms.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0.do_select
0.00 ? 72% +255.0% 0.01 ? 5% perf-sched.sch_delay.max.ms.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.01 ? 6% +180.0% 0.02 ? 60% perf-sched.sch_delay.max.ms.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0.do_select
9.00 ? 30% -48.1% 4.67 ? 45% perf-sched.wait_and_delay.count.do_task_dead.do_exit.do_group_exit.__x64_sys_exit_group.do_syscall_64
550561 +5.5% 580618 proc-vmstat.nr_file_pages
307417 +10.0% 338160 ? 2% proc-vmstat.nr_mapped
2993 +8.4% 3246 proc-vmstat.nr_page_table_pages
303607 +9.9% 333614 ? 3% proc-vmstat.nr_shmem
694035 +3.0% 714653 proc-vmstat.numa_hit
607455 +3.4% 628114 proc-vmstat.numa_local
513499 +5.4% 541036 proc-vmstat.pgfault
4823 ? 4% +47.7% 7124 ? 4% proc-vmstat.pgreuse
839921 ? 8% +13.1% 950002 ? 4% numa-meminfo.node0.Inactive
839800 ? 8% +13.1% 949852 ? 4% numa-meminfo.node0.Inactive(anon)
53189 ? 8% +9.2% 58066 ? 6% numa-meminfo.node0.KReclaimable
629217 ? 3% +12.2% 705752 ? 2% numa-meminfo.node0.Mapped
7941 ? 24% +21.9% 9680 ? 3% numa-meminfo.node0.PageTables
53189 ? 8% +9.2% 58066 ? 6% numa-meminfo.node0.SReclaimable
599179 ? 4% +12.7% 675027 ? 3% numa-meminfo.node0.Shmem
605295 ? 3% +12.5% 680806 ? 2% numa-meminfo.node1.Mapped
622631 ? 3% +11.1% 691846 ? 2% numa-meminfo.node1.Shmem
193696 ? 8% +19.0% 230483 ? 3% numa-vmstat.node0.nr_inactive_anon
142337 ? 2% +19.5% 170125 ? 2% numa-vmstat.node0.nr_mapped
1868 ? 25% +25.7% 2348 ? 3% numa-vmstat.node0.nr_page_table_pages
136818 ? 2% +18.9% 162611 ? 2% numa-vmstat.node0.nr_shmem
13219 ? 8% +9.5% 14473 ? 6% numa-vmstat.node0.nr_slab_reclaimable
193666 ? 8% +19.0% 230467 ? 3% numa-vmstat.node0.nr_zone_inactive_anon
1132156 ? 12% +11.2% 1258864 ? 5% numa-vmstat.node0.numa_hit
143857 +15.9% 166725 numa-vmstat.node1.nr_file_pages
135723 ? 3% +20.1% 163039 numa-vmstat.node1.nr_mapped
142136 +17.3% 166704 numa-vmstat.node1.nr_shmem
5.12 ? 20% -68.1% 1.63 ? 24% perf-stat.i.MPKI
7.441e+08 ? 11% +61.5% 1.202e+09 ? 8% perf-stat.i.branch-instructions
2.27 ? 7% -1.3 0.94 ? 8% perf-stat.i.branch-miss-rate%
26460369 ? 13% -46.8% 14082434 ? 6% perf-stat.i.branch-misses
2567959 ? 10% -27.7% 1857035 ? 5% perf-stat.i.cache-misses
2820 ? 7% -14.3% 2416 ? 2% perf-stat.i.context-switches
303.17 ? 9% -83.0% 51.67 ? 11% perf-stat.i.cpi
129563 ? 5% -16.9% 107705 perf-stat.i.cpu-clock
190.21 ? 4% -18.4% 155.23 ? 2% perf-stat.i.cpu-migrations
298667 ? 8% -29.0% 212034 ? 4% perf-stat.i.cycles-between-cache-misses
0.20 ? 9% +0.1 0.26 ? 11% perf-stat.i.dTLB-store-miss-rate%
3.276e+08 ? 11% -41.6% 1.913e+08 ? 3% perf-stat.i.dTLB-stores
77.27 ? 3% +7.3 84.53 perf-stat.i.iTLB-load-miss-rate%
560579 ? 19% -46.9% 297416 ? 10% perf-stat.i.iTLB-loads
3.622e+09 ? 11% +52.4% 5.52e+09 ? 8% perf-stat.i.instructions
3174 ? 8% +134.9% 7456 ? 4% perf-stat.i.instructions-per-iTLB-miss
0.26 ? 11% -49.4% 0.13 ? 7% perf-stat.i.ipc
89.57 ? 12% -53.7% 41.43 ? 7% perf-stat.i.major-faults
1.77 ? 6% +32.7% 2.35 perf-stat.i.metric.GHz
14.33 ? 11% +45.7% 20.88 ? 8% perf-stat.i.metric.M/sec
5011 ? 6% -34.8% 3265 ? 4% perf-stat.i.minor-faults
73.21 ? 3% +9.3 82.54 perf-stat.i.node-load-miss-rate%
121230 ? 8% -34.7% 79105 ? 3% perf-stat.i.node-loads
50.05 ? 11% +25.9 75.94 ? 3% perf-stat.i.node-store-miss-rate%
365780 ? 12% -50.4% 181434 ? 5% perf-stat.i.node-stores
5100 ? 6% -35.2% 3306 ? 4% perf-stat.i.page-faults
129586 ? 5% -16.9% 107712 perf-stat.i.task-clock
4.80 ? 24% -72.4% 1.33 ? 20% perf-stat.overall.MPKI
2.74 ? 8% -2.2 0.57 ? 8% perf-stat.overall.branch-miss-rate%
213.44 ? 11% -75.3% 52.81 ? 12% perf-stat.overall.cpi
259389 ? 11% -21.7% 203189 ? 3% perf-stat.overall.cycles-between-cache-misses
80.09 ? 2% +5.0 85.10 perf-stat.overall.iTLB-load-miss-rate%
2124 ? 7% +238.2% 7184 ? 4% perf-stat.overall.instructions-per-iTLB-miss
0.00 ? 11% +304.5% 0.02 ? 11% perf-stat.overall.ipc
72.33 ? 3% +10.3 82.64 perf-stat.overall.node-load-miss-rate%
35.64 ? 16% +29.0 64.68 ? 2% perf-stat.overall.node-store-miss-rate%
1.982e+08 ? 11% +393.9% 9.789e+08 ? 10% perf-stat.ps.branch-instructions
783739 ? 6% +45.5% 1140674 ? 6% perf-stat.ps.cache-misses
2.27e+08 ? 10% +170.4% 6.138e+08 ? 9% perf-stat.ps.dTLB-loads
451286 ? 10% +36.6% 616231 ? 8% perf-stat.ps.iTLB-load-misses
9.571e+08 ? 11% +363.0% 4.432e+09 ? 10% perf-stat.ps.instructions
17.47 ? 14% -29.4% 12.33 perf-stat.ps.major-faults
1452 ? 5% +24.8% 1812 ? 5% perf-stat.ps.minor-faults
94781 ? 7% +130.4% 218412 ? 7% perf-stat.ps.node-load-misses
36077 ? 5% +26.9% 45791 ? 5% perf-stat.ps.node-loads
45982 ? 21% +163.9% 121369 ? 7% perf-stat.ps.node-store-misses
82536 ? 12% -20.0% 66048 perf-stat.ps.node-stores
1469 ? 5% +24.2% 1824 ? 5% perf-stat.ps.page-faults
4.04e+10 ? 8% +567.9% 2.698e+11 ? 9% perf-stat.total.instructions
39.23 ? 6% -23.6 15.65 ? 27% perf-profile.calltrace.cycles-pp.tick_sched_handle.tick_sched_timer.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt
38.89 ? 6% -23.3 15.56 ? 27% perf-profile.calltrace.cycles-pp.update_process_times.tick_sched_handle.tick_sched_timer.__hrtimer_run_queues.hrtimer_interrupt
40.52 ? 6% -20.9 19.65 ? 26% perf-profile.calltrace.cycles-pp.tick_sched_timer.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt
47.38 ? 4% -17.4 29.94 ? 9% perf-profile.calltrace.cycles-pp.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
30.70 ? 7% -16.7 14.02 ? 27% perf-profile.calltrace.cycles-pp.scheduler_tick.update_process_times.tick_sched_handle.tick_sched_timer.__hrtimer_run_queues
21.38 ? 7% -14.2 7.16 ? 30% perf-profile.calltrace.cycles-pp.task_tick_fair.scheduler_tick.update_process_times.tick_sched_handle.tick_sched_timer
5.98 ? 28% -5.3 0.63 ? 12% perf-profile.calltrace.cycles-pp.perf_event_task_tick.scheduler_tick.update_process_times.tick_sched_handle.tick_sched_timer
6.22 ? 12% -3.9 2.27 ? 31% perf-profile.calltrace.cycles-pp.update_load_avg.task_tick_fair.scheduler_tick.update_process_times.tick_sched_handle
5.58 ? 14% -3.9 1.71 ? 28% perf-profile.calltrace.cycles-pp.update_curr.task_tick_fair.scheduler_tick.update_process_times.tick_sched_handle
4.50 ? 10% -3.0 1.51 ? 31% perf-profile.calltrace.cycles-pp.hrtimer_active.task_tick_fair.scheduler_tick.update_process_times.tick_sched_handle
2.50 ? 14% -2.2 0.31 ?100% perf-profile.calltrace.cycles-pp.trigger_load_balance.update_process_times.tick_sched_handle.tick_sched_timer.__hrtimer_run_queues
2.43 ? 8% -2.1 0.35 ?107% perf-profile.calltrace.cycles-pp.rcu_sched_clock_irq.update_process_times.tick_sched_handle.tick_sched_timer.__hrtimer_run_queues
1.97 ? 8% -1.4 0.59 ? 54% perf-profile.calltrace.cycles-pp.update_cfs_group.task_tick_fair.scheduler_tick.update_process_times.tick_sched_handle
1.78 ? 13% -1.3 0.47 ? 80% perf-profile.calltrace.cycles-pp.__update_load_avg_cfs_rq.update_load_avg.task_tick_fair.scheduler_tick.update_process_times
1.77 ? 15% -1.1 0.69 ? 55% perf-profile.calltrace.cycles-pp.__update_load_avg_se.update_load_avg.task_tick_fair.scheduler_tick.update_process_times
2.21 ? 12% -0.9 1.33 ? 14% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe
2.21 ? 12% -0.9 1.33 ? 14% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe
2.85 ? 7% -0.8 2.00 ? 10% perf-profile.calltrace.cycles-pp.load_balance.rebalance_domains.__softirqentry_text_start.irq_exit_rcu.sysvec_apic_timer_interrupt
3.54 ? 6% -0.8 2.70 ? 9% perf-profile.calltrace.cycles-pp.rebalance_domains.__softirqentry_text_start.irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
1.60 ? 10% -0.6 0.96 ? 12% perf-profile.calltrace.cycles-pp.execve
1.59 ? 10% -0.6 0.96 ? 12% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.execve
1.59 ? 10% -0.6 0.96 ? 12% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.execve
1.95 ? 8% -0.6 1.31 ? 9% perf-profile.calltrace.cycles-pp.find_busiest_group.load_balance.rebalance_domains.__softirqentry_text_start.irq_exit_rcu
1.59 ? 11% -0.6 0.96 ? 12% perf-profile.calltrace.cycles-pp.do_execveat_common.__x64_sys_execve.do_syscall_64.entry_SYSCALL_64_after_hwframe.execve
1.59 ? 10% -0.6 0.96 ? 12% perf-profile.calltrace.cycles-pp.__x64_sys_execve.do_syscall_64.entry_SYSCALL_64_after_hwframe.execve
1.63 ? 13% -0.6 1.02 ? 16% perf-profile.calltrace.cycles-pp.update_sd_lb_stats.find_busiest_group.load_balance.rebalance_domains.__softirqentry_text_start
1.31 ? 10% -0.5 0.79 ? 13% perf-profile.calltrace.cycles-pp.bprm_execve.do_execveat_common.__x64_sys_execve.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.81 ? 24% -0.5 0.29 ?100% perf-profile.calltrace.cycles-pp.worker_thread.kthread.ret_from_fork
1.19 ? 10% -0.5 0.70 ? 13% perf-profile.calltrace.cycles-pp.exec_binprm.bprm_execve.do_execveat_common.__x64_sys_execve.do_syscall_64
1.15 ? 10% -0.5 0.67 ? 13% perf-profile.calltrace.cycles-pp.load_elf_binary.exec_binprm.bprm_execve.do_execveat_common.__x64_sys_execve
0.86 ? 9% -0.4 0.41 ? 71% perf-profile.calltrace.cycles-pp.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
1.13 ? 8% -0.4 0.77 ? 12% perf-profile.calltrace.cycles-pp.asm_exc_page_fault
1.20 ? 5% -0.4 0.85 ? 16% perf-profile.calltrace.cycles-pp.smpboot_thread_fn.kthread.ret_from_fork
0.74 ? 8% -0.3 0.39 ? 72% perf-profile.calltrace.cycles-pp.__x64_sys_exit_group.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.74 ? 8% -0.3 0.39 ? 72% perf-profile.calltrace.cycles-pp.do_group_exit.__x64_sys_exit_group.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.74 ? 8% -0.3 0.39 ? 72% perf-profile.calltrace.cycles-pp.do_exit.do_group_exit.__x64_sys_exit_group.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.08 ? 8% -0.3 0.74 ? 13% perf-profile.calltrace.cycles-pp.exc_page_fault.asm_exc_page_fault
1.07 ? 9% -0.3 0.73 ? 13% perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
0.91 ? 9% -0.3 0.61 ? 14% perf-profile.calltrace.cycles-pp.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
0.92 ? 11% -0.3 0.62 ? 13% perf-profile.calltrace.cycles-pp.__libc_fork
0.78 ? 16% -0.3 0.51 ? 46% perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.update_sd_lb_stats.find_busiest_group.load_balance.rebalance_domains
1.11 ? 15% +0.5 1.64 ? 11% perf-profile.calltrace.cycles-pp.read
1.09 ? 15% +0.5 1.62 ? 11% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.read
1.09 ? 15% +0.5 1.62 ? 11% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
1.07 ? 15% +0.5 1.60 ? 11% perf-profile.calltrace.cycles-pp.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
1.08 ? 15% +0.5 1.61 ? 11% perf-profile.calltrace.cycles-pp.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
0.97 ? 18% +0.6 1.54 ? 12% perf-profile.calltrace.cycles-pp.new_sync_read.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.17 ?141% +0.6 0.77 ? 21% perf-profile.calltrace.cycles-pp.__irqentry_text_end
0.59 ? 45% +0.7 1.33 ? 11% perf-profile.calltrace.cycles-pp.seq_read_iter.new_sync_read.vfs_read.ksys_read.do_syscall_64
0.50 ? 45% +0.8 1.25 ? 12% perf-profile.calltrace.cycles-pp.show_stat.seq_read_iter.new_sync_read.vfs_read.ksys_read
0.77 ? 19% +0.8 1.56 ? 31% perf-profile.calltrace.cycles-pp.rcu_core.__softirqentry_text_start.irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
0.00 +3.4 3.37 ? 20% perf-profile.calltrace.cycles-pp.read_hpet.ktime_get.tick_sched_timer.__hrtimer_run_queues.hrtimer_interrupt
0.00 +3.7 3.67 ? 21% perf-profile.calltrace.cycles-pp.ktime_get.tick_sched_timer.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt
76.25 ? 2% +4.6 80.88 ? 2% perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt
0.00 +4.7 4.65 ? 23% perf-profile.calltrace.cycles-pp.read_hpet.ktime_get.sched_clock_tick.scheduler_tick.update_process_times
0.00 +5.2 5.18 ? 25% perf-profile.calltrace.cycles-pp.ktime_get.sched_clock_tick.scheduler_tick.update_process_times.tick_sched_handle
0.00 +5.3 5.29 ? 25% perf-profile.calltrace.cycles-pp.sched_clock_tick.scheduler_tick.update_process_times.tick_sched_handle.tick_sched_timer
3.85 ? 19% +5.4 9.24 ? 26% perf-profile.calltrace.cycles-pp.perf_mux_hrtimer_handler.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt
1.11 ? 11% +7.7 8.77 ? 25% perf-profile.calltrace.cycles-pp.ktime_get.perf_mux_hrtimer_handler.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt
0.00 +8.2 8.16 ? 25% perf-profile.calltrace.cycles-pp.read_hpet.ktime_get.perf_mux_hrtimer_handler.__hrtimer_run_queues.hrtimer_interrupt
66.60 ? 3% +11.5 78.13 ? 3% perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
3.32 ? 8% +12.4 15.75 ? 20% perf-profile.calltrace.cycles-pp.ktime_get_update_offsets_now.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
59.52 ? 3% +12.8 72.32 ? 5% perf-profile.calltrace.cycles-pp.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
59.20 ? 3% +13.0 72.23 ? 5% perf-profile.calltrace.cycles-pp.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
0.00 +14.6 14.64 ? 21% perf-profile.calltrace.cycles-pp.read_hpet.ktime_get_update_offsets_now.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt
6.36 ? 12% +19.2 25.58 ? 6% perf-profile.calltrace.cycles-pp.clockevents_program_event.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
0.00 +20.6 20.55 ? 6% perf-profile.calltrace.cycles-pp.read_hpet.ktime_get.clockevents_program_event.hrtimer_interrupt.__sysvec_apic_timer_interrupt
1.27 ? 25% +21.2 22.46 ? 5% perf-profile.calltrace.cycles-pp.ktime_get.clockevents_program_event.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt
44.12 ? 5% -26.6 17.57 ? 25% perf-profile.children.cycles-pp.tick_sched_handle
43.72 ? 5% -26.3 17.47 ? 25% perf-profile.children.cycles-pp.update_process_times
45.65 ? 6% -23.5 22.13 ? 23% perf-profile.children.cycles-pp.tick_sched_timer
53.67 ? 3% -20.2 33.48 ? 8% perf-profile.children.cycles-pp.__hrtimer_run_queues
34.46 ? 7% -18.8 15.70 ? 25% perf-profile.children.cycles-pp.scheduler_tick
23.76 ? 6% -15.8 7.95 ? 28% perf-profile.children.cycles-pp.task_tick_fair
11.74 ? 42% -7.5 4.26 ? 77% perf-profile.children.cycles-pp.asm_exc_nmi
6.25 ? 21% -5.5 0.72 ? 11% perf-profile.children.cycles-pp.perf_event_task_tick
7.19 ? 11% -4.4 2.77 ? 25% perf-profile.children.cycles-pp.update_load_avg
6.06 ? 9% -4.0 2.11 ? 23% perf-profile.children.cycles-pp.update_curr
4.70 ? 16% -3.8 0.86 ? 44% perf-profile.children.cycles-pp.ghes_notify_nmi
4.97 ? 9% -3.3 1.66 ? 29% perf-profile.children.cycles-pp.hrtimer_active
3.57 ? 21% -3.3 0.28 ? 32% perf-profile.children.cycles-pp.__intel_pmu_enable_all
2.84 ? 14% -2.3 0.58 ? 20% perf-profile.children.cycles-pp.trigger_load_balance
2.79 ? 8% -2.1 0.65 ? 34% perf-profile.children.cycles-pp.rcu_sched_clock_irq
3.22 ? 7% -2.1 1.12 ? 50% perf-profile.children.cycles-pp.native_sched_clock
8.93 ? 7% -1.9 7.03 ? 11% perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
8.92 ? 7% -1.9 7.02 ? 11% perf-profile.children.cycles-pp.do_syscall_64
2.35 ? 8% -1.7 0.69 ? 21% perf-profile.children.cycles-pp.irqtime_account_irq
2.19 ? 8% -1.5 0.73 ? 31% perf-profile.children.cycles-pp.update_cfs_group
1.78 ? 6% -1.3 0.44 ? 9% perf-profile.children.cycles-pp._raw_spin_lock
1.59 ? 15% -1.3 0.28 ? 17% perf-profile.children.cycles-pp.update_rq_clock
2.09 ? 12% -1.3 0.82 ? 25% perf-profile.children.cycles-pp.__update_load_avg_cfs_rq
1.35 ? 25% -1.3 0.09 ? 73% perf-profile.children.cycles-pp.nmi_handle
4.19 ? 5% -1.2 3.02 ? 9% perf-profile.children.cycles-pp.rebalance_domains
2.08 ? 14% -1.1 0.95 ? 26% perf-profile.children.cycles-pp.__update_load_avg_se
3.36 ? 7% -1.1 2.27 ? 9% perf-profile.children.cycles-pp.load_balance
1.61 ? 8% -1.0 0.56 ? 50% perf-profile.children.cycles-pp.native_flush_tlb_one_user
1.26 ? 23% -1.0 0.24 ? 30% perf-profile.children.cycles-pp.arch_scale_freq_tick
1.36 ? 41% -1.0 0.40 ? 67% perf-profile.children.cycles-pp.paranoid_entry
1.18 ? 10% -0.9 0.25 ? 17% perf-profile.children.cycles-pp.sched_clock_cpu
1.07 ? 10% -0.9 0.19 ? 60% perf-profile.children.cycles-pp.exc_nmi
1.13 ? 34% -0.8 0.29 ? 70% perf-profile.children.cycles-pp.repeat_nmi
2.29 ? 8% -0.8 1.47 ? 10% perf-profile.children.cycles-pp.find_busiest_group
2.16 ? 9% -0.8 1.37 ? 9% perf-profile.children.cycles-pp.update_sd_lb_stats
0.92 ? 24% -0.7 0.18 ? 30% perf-profile.children.cycles-pp.sync_regs
1.09 ? 10% -0.7 0.35 ? 7% perf-profile.children.cycles-pp.hrtimer_update_next_event
1.70 ? 11% -0.7 0.97 ? 12% perf-profile.children.cycles-pp.__x64_sys_execve
1.70 ? 11% -0.7 0.97 ? 12% perf-profile.children.cycles-pp.do_execveat_common
0.93 ? 7% -0.7 0.26 ? 8% perf-profile.children.cycles-pp.__remove_hrtimer
1.60 ? 10% -0.6 0.96 ? 12% perf-profile.children.cycles-pp.execve
0.77 ? 15% -0.6 0.15 ? 30% perf-profile.children.cycles-pp.update_irq_load_avg
0.93 ? 11% -0.6 0.32 ? 31% perf-profile.children.cycles-pp.acpi_os_read_memory
1.40 ? 10% -0.6 0.80 ? 13% perf-profile.children.cycles-pp.bprm_execve
2.13 ? 4% -0.6 1.57 ? 10% perf-profile.children.cycles-pp.asm_exc_page_fault
1.27 ? 10% -0.6 0.71 ? 13% perf-profile.children.cycles-pp.exec_binprm
2.00 ? 4% -0.5 1.46 ? 10% perf-profile.children.cycles-pp.exc_page_fault
1.22 ? 10% -0.5 0.68 ? 13% perf-profile.children.cycles-pp.load_elf_binary
2.00 ? 4% -0.5 1.45 ? 9% perf-profile.children.cycles-pp.do_user_addr_fault
1.72 ? 4% -0.5 1.20 ? 10% perf-profile.children.cycles-pp.handle_mm_fault
0.87 ? 12% -0.5 0.36 ? 31% perf-profile.children.cycles-pp.reweight_entity
1.62 ? 4% -0.5 1.12 ? 10% perf-profile.children.cycles-pp.__handle_mm_fault
0.69 ? 29% -0.5 0.19 ? 10% perf-profile.children.cycles-pp.native_irq_return_iret
0.79 ? 8% -0.5 0.29 ? 10% perf-profile.children.cycles-pp.enqueue_hrtimer
0.74 ? 13% -0.5 0.25 ? 31% perf-profile.children.cycles-pp.update_min_vruntime
0.68 ? 8% -0.5 0.19 ? 6% perf-profile.children.cycles-pp.timerqueue_del
0.83 ? 7% -0.5 0.38 ? 16% perf-profile.children.cycles-pp._raw_spin_lock_irq
0.71 ? 43% -0.4 0.27 ? 76% perf-profile.children.cycles-pp.first_nmi
1.17 ? 9% -0.4 0.73 ? 14% perf-profile.children.cycles-pp.exit_mmap
1.17 ? 9% -0.4 0.74 ? 15% perf-profile.children.cycles-pp.mmput
0.68 ? 9% -0.4 0.24 ? 9% perf-profile.children.cycles-pp.timerqueue_add
0.55 ? 17% -0.4 0.14 ? 28% perf-profile.children.cycles-pp.account_user_time
1.05 ? 4% -0.4 0.67 ? 19% perf-profile.children.cycles-pp._raw_spin_lock_irqsave
0.83 ? 6% -0.4 0.48 ? 11% perf-profile.children.cycles-pp.do_filp_open
0.56 ? 11% -0.4 0.20 ? 8% perf-profile.children.cycles-pp.__hrtimer_next_event_base
0.82 ? 6% -0.3 0.47 ? 10% perf-profile.children.cycles-pp.path_openat
1.20 ? 4% -0.3 0.85 ? 16% perf-profile.children.cycles-pp.smpboot_thread_fn
0.84 ? 5% -0.3 0.50 ? 11% perf-profile.children.cycles-pp.do_sys_open
0.83 ? 5% -0.3 0.49 ? 11% perf-profile.children.cycles-pp.do_sys_openat2
0.49 ? 12% -0.3 0.18 ? 8% perf-profile.children.cycles-pp.__hrtimer_get_next_event
0.92 ? 10% -0.3 0.63 ? 14% perf-profile.children.cycles-pp.__libc_fork
1.08 ? 7% -0.3 0.78 ? 12% perf-profile.children.cycles-pp.finish_task_switch
0.65 ? 12% -0.3 0.35 ? 12% perf-profile.children.cycles-pp.begin_new_exec
0.81 ? 23% -0.3 0.52 ? 12% perf-profile.children.cycles-pp.worker_thread
0.37 ? 9% -0.3 0.09 ? 49% perf-profile.children.cycles-pp.default_do_nmi
0.59 ? 14% -0.3 0.31 ? 12% perf-profile.children.cycles-pp.vm_mmap_pgoff
0.56 ? 15% -0.3 0.29 ? 12% perf-profile.children.cycles-pp.do_mmap
0.68 ? 11% -0.3 0.43 ? 16% perf-profile.children.cycles-pp.copy_process
0.33 ? 15% -0.3 0.08 ? 38% perf-profile.children.cycles-pp.ghes_copy_tofrom_phys
0.71 ? 26% -0.3 0.46 ? 13% perf-profile.children.cycles-pp.process_one_work
0.71 ? 11% -0.2 0.46 ? 15% perf-profile.children.cycles-pp.__do_sys_clone
0.71 ? 11% -0.2 0.46 ? 15% perf-profile.children.cycles-pp.kernel_clone
0.83 ? 5% -0.2 0.59 ? 10% perf-profile.children.cycles-pp.do_fault
0.64 ? 10% -0.2 0.40 ? 20% perf-profile.children.cycles-pp.try_to_wake_up
0.28 ? 8% -0.2 0.03 ?100% perf-profile.children.cycles-pp.rb_erase
0.50 ? 15% -0.2 0.27 ? 13% perf-profile.children.cycles-pp.mmap_region
0.30 ? 10% -0.2 0.07 ? 23% perf-profile.children.cycles-pp.run_posix_cpu_timers
0.58 ? 9% -0.2 0.36 ? 15% perf-profile.children.cycles-pp.unmap_vmas
0.30 ? 8% -0.2 0.08 ? 16% perf-profile.children.cycles-pp.tick_program_event
0.33 ? 6% -0.2 0.12 ? 16% perf-profile.children.cycles-pp.hrtimer_forward
0.25 ? 20% -0.2 0.03 ?100% perf-profile.children.cycles-pp.account_process_tick
0.46 ? 15% -0.2 0.24 ? 7% perf-profile.children.cycles-pp.ksys_mmap_pgoff
0.46 ? 8% -0.2 0.25 ? 12% perf-profile.children.cycles-pp.kmem_cache_alloc
0.75 ? 8% -0.2 0.55 ? 16% perf-profile.children.cycles-pp.__x64_sys_exit_group
0.75 ? 8% -0.2 0.55 ? 16% perf-profile.children.cycles-pp.do_group_exit
0.75 ? 8% -0.2 0.55 ? 16% perf-profile.children.cycles-pp.do_exit
0.33 ? 16% -0.2 0.14 ? 66% perf-profile.children.cycles-pp.intel_pmu_handle_irq
0.28 ? 7% -0.2 0.09 ? 51% perf-profile.children.cycles-pp.__native_set_fixmap
0.49 ? 9% -0.2 0.29 ? 8% perf-profile.children.cycles-pp.__alloc_pages
0.47 ? 10% -0.2 0.28 ? 20% perf-profile.children.cycles-pp.dup_mm
0.42 ? 9% -0.2 0.23 ? 20% perf-profile.children.cycles-pp.walk_component
0.51 ? 10% -0.2 0.33 ? 15% perf-profile.children.cycles-pp.unmap_page_range
0.27 ? 12% -0.2 0.09 ? 54% perf-profile.children.cycles-pp.calc_global_load_tick
0.44 ? 11% -0.2 0.26 ? 20% perf-profile.children.cycles-pp.dup_mmap
0.41 ? 9% -0.2 0.24 ? 20% perf-profile.children.cycles-pp.link_path_walk
0.23 ? 38% -0.2 0.06 ?118% perf-profile.children.cycles-pp.intel_bts_disable_local
0.61 ? 6% -0.2 0.45 ? 16% perf-profile.children.cycles-pp.run_ksoftirqd
0.47 ? 9% -0.2 0.30 ? 15% perf-profile.children.cycles-pp.zap_pte_range
0.33 ? 14% -0.2 0.16 ? 22% perf-profile.children.cycles-pp.__calc_delta
0.22 ? 36% -0.2 0.06 ?119% perf-profile.children.cycles-pp.__intel_pmu_disable_all
0.55 ? 6% -0.2 0.39 ? 13% perf-profile.children.cycles-pp.filemap_map_pages
0.31 ? 12% -0.2 0.15 ? 28% perf-profile.children.cycles-pp.__accumulate_pelt_segments
0.40 ? 9% -0.1 0.26 ? 12% perf-profile.children.cycles-pp.setlocale
0.20 ? 16% -0.1 0.06 ? 46% perf-profile.children.cycles-pp.sched_slice
0.17 ? 7% -0.1 0.04 ? 75% perf-profile.children.cycles-pp.perf_trace_sched_stat_runtime
0.33 ? 7% -0.1 0.20 ? 11% perf-profile.children.cycles-pp.get_page_from_freelist
0.16 ? 16% -0.1 0.04 ?104% perf-profile.children.cycles-pp.native_set_fixmap
0.33 ? 14% -0.1 0.20 ? 15% perf-profile.children.cycles-pp.free_pgtables
0.16 ? 21% -0.1 0.04 ? 72% perf-profile.children.cycles-pp.schedule_idle
0.19 ? 38% -0.1 0.07 ? 18% perf-profile.children.cycles-pp.__poll
0.32 ? 16% -0.1 0.20 ? 15% perf-profile.children.cycles-pp.syscall_exit_to_user_mode
0.30 ? 15% -0.1 0.19 ? 19% perf-profile.children.cycles-pp.call_timer_fn
0.19 ? 9% -0.1 0.08 ? 6% perf-profile.children.cycles-pp.rb_insert_color
0.29 ? 9% -0.1 0.18 ? 21% perf-profile.children.cycles-pp.alloc_pages_vma
0.22 ? 6% -0.1 0.11 ? 27% perf-profile.children.cycles-pp.filename_lookup
0.24 ? 19% -0.1 0.13 ? 10% perf-profile.children.cycles-pp.__do_munmap
0.27 ? 10% -0.1 0.17 ? 11% perf-profile.children.cycles-pp.__open64_nocancel
0.14 ? 34% -0.1 0.03 ?108% perf-profile.children.cycles-pp.__evlist__enable
0.21 ? 7% -0.1 0.11 ? 24% perf-profile.children.cycles-pp.path_lookupat
0.30 ? 4% -0.1 0.21 ? 12% perf-profile.children.cycles-pp.next_uptodate_page
0.18 ? 7% -0.1 0.08 ? 24% perf-profile.children.cycles-pp.__do_sys_newstat
0.19 ? 4% -0.1 0.10 ? 16% perf-profile.children.cycles-pp.step_into
0.31 ? 4% -0.1 0.21 ? 19% perf-profile.children.cycles-pp.wp_page_copy
0.17 ? 6% -0.1 0.08 ? 24% perf-profile.children.cycles-pp.vfs_statx
0.19 ? 12% -0.1 0.10 ? 18% perf-profile.children.cycles-pp.elf_map
0.14 ? 34% -0.1 0.06 ? 46% perf-profile.children.cycles-pp.do_sys_poll
0.17 ? 8% -0.1 0.09 ? 12% perf-profile.children.cycles-pp.dput
0.20 ? 15% -0.1 0.12 ? 11% perf-profile.children.cycles-pp.__split_vma
0.14 ? 34% -0.1 0.06 ? 47% perf-profile.children.cycles-pp.__x64_sys_poll
0.15 ? 13% -0.1 0.07 ? 19% perf-profile.children.cycles-pp.vma_interval_tree_insert
0.17 ? 20% -0.1 0.09 ? 31% perf-profile.children.cycles-pp.enqueue_task_fair
0.17 ? 23% -0.1 0.09 ? 31% perf-profile.children.cycles-pp.ttwu_do_activate
0.17 ? 30% -0.1 0.09 ? 15% perf-profile.children.cycles-pp.__pagevec_lru_add
0.15 ? 20% -0.1 0.07 ? 21% perf-profile.children.cycles-pp.vma_link
0.21 ? 15% -0.1 0.13 ? 14% perf-profile.children.cycles-pp.do_anonymous_page
0.18 ? 23% -0.1 0.10 ? 25% perf-profile.children.cycles-pp.proc_reg_read_iter
0.17 ? 13% -0.1 0.09 ? 15% perf-profile.children.cycles-pp.__alloc_file
0.17 ? 11% -0.1 0.10 ? 15% perf-profile.children.cycles-pp.alloc_empty_file
0.20 ? 12% -0.1 0.13 ? 16% perf-profile.children.cycles-pp.tlb_finish_mmu
0.19 ? 12% -0.1 0.12 ? 17% perf-profile.children.cycles-pp.tlb_flush_mmu
0.17 ? 14% -0.1 0.09 ? 15% perf-profile.children.cycles-pp.__mmap
0.26 ? 8% -0.1 0.19 ? 18% perf-profile.children.cycles-pp.write
0.32 ? 8% -0.1 0.24 ? 14% perf-profile.children.cycles-pp.kmem_cache_free
0.20 ? 22% -0.1 0.13 ? 12% perf-profile.children.cycles-pp.release_pages
0.16 ? 17% -0.1 0.09 ? 16% perf-profile.children.cycles-pp.copy_strings
0.17 ? 14% -0.1 0.09 ? 14% perf-profile.children.cycles-pp.__vma_adjust
0.31 ? 10% -0.1 0.24 ? 16% perf-profile.children.cycles-pp.cpumask_next_and
0.16 ? 15% -0.1 0.09 ? 22% perf-profile.children.cycles-pp.do_mprotect_pkey
0.10 ? 9% -0.1 0.03 ?101% perf-profile.children.cycles-pp.get_obj_cgroup_from_current
0.09 ? 14% -0.1 0.03 ?100% perf-profile.children.cycles-pp.__xstat64
0.17 ? 13% -0.1 0.10 ? 21% perf-profile.children.cycles-pp.__x64_sys_mprotect
0.13 ? 10% -0.1 0.07 ? 18% perf-profile.children.cycles-pp.rmqueue
0.14 ? 11% -0.1 0.08 ? 16% perf-profile.children.cycles-pp.do_open
0.16 ? 6% -0.1 0.10 ? 23% perf-profile.children.cycles-pp.__lookup_slow
0.16 ? 7% -0.1 0.10 ? 11% perf-profile.children.cycles-pp.prep_new_page
0.15 ? 13% -0.1 0.09 ? 15% perf-profile.children.cycles-pp.unlink_anon_vmas
0.14 ? 12% -0.1 0.09 ? 14% perf-profile.children.cycles-pp.lookup_fast
0.15 ? 15% -0.1 0.09 ? 21% perf-profile.children.cycles-pp.mprotect_fixup
0.11 ? 19% -0.1 0.06 ? 50% perf-profile.children.cycles-pp.__get_user_pages_remote
0.10 ? 11% -0.1 0.04 ? 75% perf-profile.children.cycles-pp.anon_vma_fork
0.09 ? 9% -0.1 0.03 ? 70% perf-profile.children.cycles-pp.__get_free_pages
0.08 ? 31% -0.1 0.03 ? 99% perf-profile.children.cycles-pp.strnlen_user
0.08 ? 12% -0.1 0.03 ? 99% perf-profile.children.cycles-pp.page_counter_try_charge
0.14 ? 21% -0.1 0.08 ? 21% perf-profile.children.cycles-pp.unlink_file_vma
0.11 ? 17% -0.1 0.06 ? 50% perf-profile.children.cycles-pp.__get_user_pages
0.17 ? 7% -0.1 0.11 ? 12% perf-profile.children.cycles-pp.___might_sleep
0.10 ? 15% -0.1 0.05 ? 46% perf-profile.children.cycles-pp.remove_vma
0.09 ? 18% -0.1 0.04 ? 71% perf-profile.children.cycles-pp.mutex_lock
0.18 ? 11% -0.1 0.12 ? 15% perf-profile.children.cycles-pp.page_remove_rmap
0.17 ? 10% -0.1 0.12 ? 13% perf-profile.children.cycles-pp.select_task_rq_fair
0.08 ? 14% -0.1 0.03 ?100% perf-profile.children.cycles-pp.__check_object_size
0.10 ? 22% -0.0 0.05 ? 46% perf-profile.children.cycles-pp.unmap_region
0.09 ? 5% -0.0 0.04 ? 71% perf-profile.children.cycles-pp.getname_flags
0.13 ? 5% -0.0 0.08 ? 20% perf-profile.children.cycles-pp.d_alloc_parallel
0.10 ? 14% -0.0 0.06 ? 19% perf-profile.children.cycles-pp.do_open_execat
0.09 ? 8% -0.0 0.04 ? 73% perf-profile.children.cycles-pp.__pte_alloc
0.15 ? 12% -0.0 0.10 ? 19% perf-profile.children.cycles-pp.vmstat_update
0.10 ? 10% -0.0 0.05 ? 46% perf-profile.children.cycles-pp.do_dentry_open
0.13 ? 14% -0.0 0.08 ? 17% perf-profile.children.cycles-pp.select_idle_sibling
0.12 ? 22% -0.0 0.08 ? 21% perf-profile.children.cycles-pp.__cgroup_account_cputime_field
0.08 ? 8% -0.0 0.04 ? 71% perf-profile.children.cycles-pp.__dentry_kill
0.07 ? 12% -0.0 0.03 ?100% perf-profile.children.cycles-pp.__memcg_kmem_charge_page
0.10 ? 17% -0.0 0.05 ? 48% perf-profile.children.cycles-pp.lru_add_drain
0.11 ? 14% -0.0 0.07 ? 14% perf-profile.children.cycles-pp.open64
0.08 ? 14% -0.0 0.04 ? 71% perf-profile.children.cycles-pp.__mod_memcg_lruvec_state
0.13 ? 8% -0.0 0.09 ? 10% perf-profile.children.cycles-pp.clear_page_erms
0.11 ? 13% -0.0 0.07 ? 21% perf-profile.children.cycles-pp.get_arg_page
0.13 ? 8% -0.0 0.08 ? 24% perf-profile.children.cycles-pp.__wake_up_common_lock
0.08 ? 13% -0.0 0.04 ? 72% perf-profile.children.cycles-pp.lru_add_drain_cpu
0.12 ? 12% -0.0 0.08 ? 14% perf-profile.children.cycles-pp.__fput
0.11 ? 15% -0.0 0.07 ? 16% perf-profile.children.cycles-pp.__clear_user
0.07 ? 10% -0.0 0.03 ?100% perf-profile.children.cycles-pp.__d_alloc
0.09 ? 12% -0.0 0.05 ? 47% perf-profile.children.cycles-pp.obj_cgroup_charge_pages
0.13 ? 11% -0.0 0.09 ? 20% perf-profile.children.cycles-pp._dl_addr
0.10 ? 7% -0.0 0.06 ? 17% perf-profile.children.cycles-pp.__mod_lruvec_page_state
0.13 ? 16% -0.0 0.09 ? 22% perf-profile.children.cycles-pp.perf_event_mmap
0.09 ? 6% -0.0 0.05 ? 50% perf-profile.children.cycles-pp.___perf_sw_event
0.10 ? 9% -0.0 0.07 ? 14% perf-profile.children.cycles-pp.pte_alloc_one
0.09 ? 27% -0.0 0.05 ? 50% perf-profile.children.cycles-pp.vm_area_dup
0.09 ? 20% -0.0 0.06 ? 13% perf-profile.children.cycles-pp.down_write
0.09 ? 14% -0.0 0.06 ? 18% perf-profile.children.cycles-pp.malloc
0.09 ? 10% -0.0 0.06 ? 9% perf-profile.children.cycles-pp.sum_zone_numa_state
0.10 ? 13% -0.0 0.07 ? 11% perf-profile.children.cycles-pp.sysfs_kf_seq_show
0.10 ? 15% -0.0 0.07 ? 11% perf-profile.children.cycles-pp.dev_attr_show
0.08 ? 10% -0.0 0.06 ? 8% perf-profile.children.cycles-pp.__might_sleep
0.09 ? 11% -0.0 0.06 ? 17% perf-profile.children.cycles-pp.__perf_sw_event
0.07 ? 45% +0.0 0.11 ? 13% perf-profile.children.cycles-pp.seq_put_decimal_ull_width
0.04 ? 71% +0.1 0.09 ? 25% perf-profile.children.cycles-pp.ksoftirqd_running
0.00 +0.1 0.06 ? 13% perf-profile.children.cycles-pp.num_to_str
0.48 ? 8% +0.3 0.77 ? 21% perf-profile.children.cycles-pp.__irqentry_text_end
0.07 ? 14% +0.3 0.39 ? 10% perf-profile.children.cycles-pp.get_cpu_idle_time_us
0.07 ? 14% +0.3 0.39 ? 11% perf-profile.children.cycles-pp.get_idle_time
0.00 +0.4 0.42 ? 11% perf-profile.children.cycles-pp.get_cpu_iowait_time_us
0.00 +0.4 0.42 ? 11% perf-profile.children.cycles-pp.get_iowait_time
0.00 +0.5 0.47 ? 21% perf-profile.children.cycles-pp.sched_clock_local
1.39 ? 17% +0.5 1.87 ? 11% perf-profile.children.cycles-pp.ksys_read
1.33 ? 16% +0.5 1.82 ? 11% perf-profile.children.cycles-pp.vfs_read
1.11 ? 15% +0.5 1.64 ? 11% perf-profile.children.cycles-pp.read
1.03 ? 18% +0.5 1.57 ? 12% perf-profile.children.cycles-pp.new_sync_read
0.93 ? 17% +0.6 1.49 ? 11% perf-profile.children.cycles-pp.seq_read_iter
0.55 ? 23% +0.7 1.26 ? 11% perf-profile.children.cycles-pp.show_stat
1.00 ? 16% +0.9 1.86 ? 29% perf-profile.children.cycles-pp.rcu_core
4.49 ? 21% +5.7 10.15 ? 26% perf-profile.children.cycles-pp.perf_mux_hrtimer_handler
0.00 +6.0 5.98 ? 22% perf-profile.children.cycles-pp.sched_clock_tick
85.42 ? 2% +6.4 91.85 ? 3% perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
75.45 ? 2% +11.3 86.74 ? 2% perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt
67.36 ? 2% +13.1 80.42 ? 4% perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt
67.01 ? 2% +13.3 80.31 ? 4% perf-profile.children.cycles-pp.hrtimer_interrupt
3.66 ? 8% +13.8 17.51 ? 19% perf-profile.children.cycles-pp.ktime_get_update_offsets_now
6.67 ? 10% +21.5 28.15 ? 6% perf-profile.children.cycles-pp.clockevents_program_event
3.16 ? 11% +42.0 45.11 ? 3% perf-profile.children.cycles-pp.ktime_get
0.00 +57.3 57.25 ? 5% perf-profile.children.cycles-pp.read_hpet
11.64 ? 42% -7.4 4.24 ? 77% perf-profile.self.cycles-pp.asm_exc_nmi
4.70 ? 16% -3.8 0.86 ? 44% perf-profile.self.cycles-pp.ghes_notify_nmi
3.68 ? 14% -3.1 0.57 ? 8% perf-profile.self.cycles-pp.perf_event_task_tick
2.82 ? 22% -2.6 0.25 ? 27% perf-profile.self.cycles-pp.__intel_pmu_enable_all
4.08 ? 16% -2.5 1.55 ? 30% perf-profile.self.cycles-pp.hrtimer_active
3.57 ? 9% -2.2 1.41 ? 24% perf-profile.self.cycles-pp.update_curr
3.10 ? 8% -1.9 1.16 ? 19% perf-profile.self.cycles-pp.ktime_get_update_offsets_now
2.93 ? 5% -1.8 1.09 ? 49% perf-profile.self.cycles-pp.native_sched_clock
2.35 ? 13% -1.6 0.71 ? 30% perf-profile.self.cycles-pp.task_tick_fair
2.12 ? 18% -1.6 0.52 ? 21% perf-profile.self.cycles-pp.trigger_load_balance
2.16 ? 11% -1.6 0.59 ? 33% perf-profile.self.cycles-pp.rcu_sched_clock_irq
1.71 ? 21% -1.5 0.21 ? 22% perf-profile.self.cycles-pp.update_process_times
2.31 ? 10% -1.3 0.98 ? 23% perf-profile.self.cycles-pp.update_load_avg
1.35 ? 25% -1.3 0.09 ? 73% perf-profile.self.cycles-pp.nmi_handle
1.75 ? 15% -1.1 0.68 ? 33% perf-profile.self.cycles-pp.update_cfs_group
1.61 ? 8% -1.0 0.56 ? 50% perf-profile.self.cycles-pp.native_flush_tlb_one_user
1.36 ? 41% -1.0 0.40 ? 67% perf-profile.self.cycles-pp.paranoid_entry
1.05 ? 10% -0.9 0.19 ? 60% perf-profile.self.cycles-pp.exc_nmi
1.18 ? 8% -0.8 0.34 ? 11% perf-profile.self.cycles-pp._raw_spin_lock
1.13 ? 34% -0.8 0.29 ? 70% perf-profile.self.cycles-pp.repeat_nmi
1.25 ? 14% -0.8 0.49 ? 16% perf-profile.self.cycles-pp.irqtime_account_irq
0.89 ? 25% -0.7 0.18 ? 29% perf-profile.self.cycles-pp.sync_regs
1.38 ? 14% -0.7 0.68 ? 29% perf-profile.self.cycles-pp.__update_load_avg_cfs_rq
0.83 ? 20% -0.6 0.21 ? 31% perf-profile.self.cycles-pp.arch_scale_freq_tick
0.93 ? 11% -0.6 0.32 ? 31% perf-profile.self.cycles-pp.acpi_os_read_memory
1.34 ? 15% -0.6 0.77 ? 29% perf-profile.self.cycles-pp.__update_load_avg_se
0.68 ? 29% -0.5 0.15 ? 10% perf-profile.self.cycles-pp.native_irq_return_iret
0.71 ? 43% -0.4 0.27 ? 76% perf-profile.self.cycles-pp.first_nmi
0.58 ? 8% -0.4 0.15 ? 21% perf-profile.self.cycles-pp.scheduler_tick
0.62 ? 15% -0.4 0.23 ? 10% perf-profile.self.cycles-pp.__hrtimer_run_queues
0.51 ? 16% -0.4 0.12 ? 12% perf-profile.self.cycles-pp.update_irq_load_avg
0.56 ? 13% -0.3 0.22 ? 32% perf-profile.self.cycles-pp.update_min_vruntime
0.68 ? 21% -0.3 0.37 ? 8% perf-profile.self.cycles-pp._raw_spin_lock_irqsave
0.37 ? 9% -0.3 0.09 ? 49% perf-profile.self.cycles-pp.default_do_nmi
0.89 ? 8% -0.3 0.61 ? 8% perf-profile.self.cycles-pp.update_sd_lb_stats
0.36 ? 15% -0.3 0.10 ? 21% perf-profile.self.cycles-pp.tick_sched_timer
0.33 ? 15% -0.3 0.08 ? 38% perf-profile.self.cycles-pp.ghes_copy_tofrom_phys
0.53 ? 11% -0.2 0.28 ? 34% perf-profile.self.cycles-pp.reweight_entity
0.54 ? 9% -0.2 0.32 ? 6% perf-profile.self.cycles-pp._raw_spin_lock_irq
0.26 ? 18% -0.2 0.05 ? 76% perf-profile.self.cycles-pp.account_user_time
0.33 ? 15% -0.2 0.12 ? 21% perf-profile.self.cycles-pp.perf_mux_hrtimer_handler
0.33 ? 16% -0.2 0.14 ? 66% perf-profile.self.cycles-pp.intel_pmu_handle_irq
0.28 ? 7% -0.2 0.09 ? 51% perf-profile.self.cycles-pp.__native_set_fixmap
0.36 ? 13% -0.2 0.17 ? 7% perf-profile.self.cycles-pp.__hrtimer_next_event_base
0.33 ? 9% -0.2 0.15 ? 4% perf-profile.self.cycles-pp.timerqueue_add
0.23 ? 38% -0.2 0.06 ?118% perf-profile.self.cycles-pp.intel_bts_disable_local
0.26 ? 23% -0.2 0.09 ? 11% perf-profile.self.cycles-pp.__sysvec_apic_timer_interrupt
0.19 ? 21% -0.2 0.03 ?101% perf-profile.self.cycles-pp.account_process_tick
0.21 ? 35% -0.2 0.06 ?119% perf-profile.self.cycles-pp.__intel_pmu_disable_all
0.22 ? 12% -0.1 0.09 ? 55% perf-profile.self.cycles-pp.calc_global_load_tick
0.23 ? 10% -0.1 0.10 ? 7% perf-profile.self.cycles-pp.hrtimer_forward
0.16 ? 16% -0.1 0.04 ?104% perf-profile.self.cycles-pp.native_set_fixmap
0.19 ? 14% -0.1 0.07 ? 21% perf-profile.self.cycles-pp.tick_program_event
0.18 ? 8% -0.1 0.07 ? 25% perf-profile.self.cycles-pp.run_posix_cpu_timers
0.18 ? 13% -0.1 0.07 ? 15% perf-profile.self.cycles-pp.__remove_hrtimer
0.20 ? 15% -0.1 0.09 ? 13% perf-profile.self.cycles-pp.__hrtimer_get_next_event
0.26 ? 13% -0.1 0.16 ? 22% perf-profile.self.cycles-pp.__calc_delta
0.13 ? 15% -0.1 0.03 ?100% perf-profile.self.cycles-pp.sched_slice
0.21 ? 13% -0.1 0.12 ? 5% perf-profile.self.cycles-pp._raw_spin_unlock_irqrestore
0.12 ? 16% -0.1 0.04 ? 75% perf-profile.self.cycles-pp.perf_trace_sched_stat_runtime
0.17 ? 9% -0.1 0.10 ? 11% perf-profile.self.cycles-pp.timerqueue_del
0.19 ? 14% -0.1 0.13 ? 32% perf-profile.self.cycles-pp.__accumulate_pelt_segments
0.12 ? 15% -0.1 0.07 ? 11% perf-profile.self.cycles-pp.rb_insert_color
0.09 ? 11% -0.0 0.05 ? 47% perf-profile.self.cycles-pp.kmem_cache_alloc
0.04 ? 71% +0.0 0.07 ? 10% perf-profile.self.cycles-pp.rcu_nocb_flush_deferred_wakeup
0.03 ? 70% +0.0 0.07 ? 26% perf-profile.self.cycles-pp.update_group_capacity
0.00 +0.1 0.07 ? 18% perf-profile.self.cycles-pp.ksoftirqd_running
0.14 ? 13% +0.1 0.22 ? 10% perf-profile.self.cycles-pp.exit_to_user_mode_prepare
0.05 ? 45% +0.1 0.14 ? 9% perf-profile.self.cycles-pp.run_rebalance_domains
0.17 ? 19% +0.1 0.29 ? 10% perf-profile.self.cycles-pp.irq_exit_rcu
0.02 ?141% +0.2 0.17 ? 32% perf-profile.self.cycles-pp.rcu_core
0.24 ? 20% +0.2 0.45 ? 14% perf-profile.self.cycles-pp.idle_cpu
0.00 +0.2 0.25 ? 21% perf-profile.self.cycles-pp.sched_clock_local
0.32 ? 7% +0.3 0.63 ? 10% perf-profile.self.cycles-pp.__irqentry_text_end
0.14 ? 16% +0.3 0.49 ? 14% perf-profile.self.cycles-pp.__softirqentry_text_start
2.15 ? 12% +1.5 3.64 ? 5% perf-profile.self.cycles-pp.ktime_get
0.00 +54.2 54.23 ? 8% perf-profile.self.cycles-pp.read_hpet



stress-ng.time.user_time

5200 +--------------------------------------------------------------------+
| OO O O O O O O O |
5000 |-+OO OO OO O O O O O O OO O OO O O O O OO O |
4800 |O+ O O O O O O |
| |
4600 |-+ |
4400 |-+ |
| |
4200 |-+ + + + + + + +. + |
4000 |-+ +. + ++ :: :+ :: : + + :.+ : : + : |
| + +. : + : : : :: : : :: : :+ ++ + : : : : : : : |
3800 |:.+ + :.+ + .+ : : :: : + :: : + :: + :.+ :: :.+ ::|
3600 |++ + + + +.+ + + : + : + : + :|
| + + + +|
3400 +--------------------------------------------------------------------+


stress-ng.time.percent_of_cpu_this_job_got

9000 +--------------------------------------------------------------------+
| |
8500 |-+ OO O O OOO OO O O O OO O O OO O |
| OO OO O O OO O O OO O O O OO O |
|O O O |
8000 |-+ |
| |
7500 |-+ |
| |
7000 |-+ + + + + + + + |
| +. + :: :: :: : .+ : :+ : |
| + +. : + : ++ :: : + :: : :.+++ +.+ : : : + : : |
6500 |:.+ + :.+ + + : ::+ + + :: + + :: + :.+ :: :.+ ::|
|+ + + :+ +.: + + : + : + : + :|
6000 +--------------------------------------------------------------------+


stress-ng.lockbus.ops

250000 +------------------------------------------------------------------+
| ++ +.++.+++.++.+ +++ + ++.+ ++ ++ + +.++ + ++.+|
245000 |-+ |
| |
| |
240000 |-+ |
| |
235000 |-+ |
| O |
230000 |-+ O O O O O |
|O O O O O O O O O |
| O O O O O |
225000 |-+O OO O O O OO O O O O OO |
| O O O O O O |
220000 +------------------------------------------------------------------+


stress-ng.lockbus.ops_per_sec

4150 +--------------------------------------------------------------------+
4100 |-+ +.+ ++ ++ + +.+ +.+ +.+ |
| |
4050 |-+ |
4000 |-+ |
| |
3950 |-+ |
3900 |-+ |
3850 |-+ O O |
|O O O O O O |
3800 |-+ O O O O O O O |
3750 |-+ O OO O O O |
| O OO O O O O O OO O O |
3700 |-+ OO O O O O O |
3650 +--------------------------------------------------------------------+


[*] bisect-good sample
[O] bisect-bad sample



Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


---
0DAY/LKP+ Test Infrastructure Open Source Technology Center
https://lists.01.org/hyperkitty/list/[email protected] Intel Corporation

Thanks,
Oliver Sang


Attachments:
(No filename) (56.49 kB)
config-5.13.0-rc1-00001-g8901ecc2315b (176.78 kB)
job-script (8.28 kB)
job.yaml (5.74 kB)
reproduce (350.00 B)
dmesg.xz (27.83 kB)

2021-05-26 11:25:38

by Feng Tang

Subject: Re: [clocksource] 8901ecc231: stress-ng.lockbus.ops_per_sec -9.5% regression

On Sat, May 22, 2021 at 09:08:27AM -0700, Paul E. McKenney wrote:
> On Fri, May 21, 2021 at 06:56:17AM -0700, Paul E. McKenney wrote:
> > On Fri, May 21, 2021 at 04:33:22PM +0800, kernel test robot wrote:
> > >
> > >
> > > Greeting,
> > >
> > > FYI, we noticed a -9.5% regression of stress-ng.lockbus.ops_per_sec due to commit:
> > >
> > >
> > > commit: 8901ecc2315b850f35a7b8c1b73b12388b72aa78 ("clocksource: Retry clock read if long delays detected")
> > > https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master
> > >
> > >
> > > in testcase: stress-ng
> > > on test machine: 96 threads 2 sockets Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz with 192G memory
> > > with following parameters:
> > >
> > > nr_threads: 100%
> > > disk: 1HDD
> > > testtime: 60s
> > > class: memory
> > > test: lockbus
> > > cpufreq_governor: performance
> > > ucode: 0x5003006
> > >
> > >
> > > please be noted below in dmesg.xz (attached)
> > > [ 28.110351]
> > > [ 28.302357] hrtimer: interrupt took 1878423 ns
> > > [ 29.690760] clocksource: timekeeping watchdog on CPU53: hpet read-back delay of 169583ns, attempt 4, marking unstable
> > > [ 29.860306] tsc: Marking TSC unstable due to clocksource watchdog
> > > [ 30.559390] TSC found unstable after boot, most likely due to broken BIOS. Use 'tsc=unstable'.
> > > [ 30.726282] sched_clock: Marking unstable (30052964508, 499342225)<-(30915547410, -363240730)
> > > [ 31.620401] clocksource: Switched to clocksource hpet
> >
> > If I am reading the dmesg correctly, there were many interrupts that
> > prevented a good clock read. This sound to me like a bug that the
> > clocksource watchdog located, but please let me know if this is not
> > the case.
> >
> > There are also the later "perf: interrupt took too long" messages.
>
> And of course, increasing the clocksource.max_cswd_read_retries module
> boot parameter (or clocksource.max_read_retries in the earlier commits,
> which I will fix) can work around short bursts of NMIs. Or long bursts
> of NMIs, if you set this kernel boot parameter large enough.

I reproduced it on a borrowed bare-metal 4-node, 96C/192T Xeon, with
the latest stress-ng code from https://github.com/ColinIanKing/stress-ng.git.
(A 2-socket box should also be able to reproduce it.)

This 'lockbus' sub-testcase seems to be an extreme stress case: it sits
in a loop doing "lock" operations:

c8: f0 83 00 01 lock addl $0x1,(%rax)
cc: f0 83 40 04 01 lock addl $0x1,0x4(%rax)
d1: f0 83 40 08 01 lock addl $0x1,0x8(%rax)
d6: f0 83 40 0c 01 lock addl $0x1,0xc(%rax)
db: f0 83 40 10 01 lock addl $0x1,0x10(%rax)
e0: f0 83 40 14 01 lock addl $0x1,0x14(%rax)
e5: f0 83 40 18 01 lock addl $0x1,0x18(%rax)
ea: f0 83 40 1c 01 lock addl $0x1,0x1c(%rax)
ef: f0 83 01 00 lock addl $0x0,(%rcx)
f3: f0 83 01 00 lock addl $0x0,(%rcx)
f7: f0 83 01 00 lock addl $0x0,(%rcx)
fb: f0 83 01 00 lock addl $0x0,(%rcx)
ff: f0 83 01 00 lock addl $0x0,(%rcx)
103: f0 83 01 00 lock addl $0x0,(%rcx)
107: f0 83 01 00 lock addl $0x0,(%rcx)
...

(The C source file and objdump output are attached FYI.)

So the watchdog read (read_hpet() here) sometimes does take a very
long time (hundreds of microseconds), which trips this sanity
read check and causes the TSC to be marked 'unstable'.
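
To make the mechanism concrete, below is a minimal userspace sketch of
that read-back check. This is only an illustrative approximation, not
the kernel's clocksource code: the names (now_ns, watchdog_read,
WD_READ_DELAY_NS, MAX_READ_RETRIES), the 100us threshold, and the use of
clock_gettime() in place of the HPET and TSC reads are all stand-ins,
and the retry bound merely plays the role of the
clocksource.max_cswd_read_retries parameter Paul mentions above.

/* wd-sketch.c: build with gcc -O2 -o wd-sketch wd-sketch.c */
#include <stdint.h>
#include <stdio.h>
#include <time.h>

/* Illustrative values only. */
#define WD_READ_DELAY_NS	100000ULL
#define MAX_READ_RETRIES	4

static uint64_t now_ns(clockid_t id)
{
	struct timespec ts;

	clock_gettime(id, &ts);
	return (uint64_t)ts.tv_sec * 1000000000ULL + (uint64_t)ts.tv_nsec;
}

/*
 * Read the "watchdog" clock, then the clock under test, then the
 * "watchdog" again.  If the two watchdog reads are too far apart, the
 * middle read cannot be trusted, so retry; after too many retries,
 * give up and report the clock as unstable.  Here CLOCK_MONOTONIC
 * stands in for the HPET watchdog and CLOCK_REALTIME for the TSC.
 */
static int watchdog_read(uint64_t *csnow, uint64_t *wdnow)
{
	for (int attempt = 1; attempt <= MAX_READ_RETRIES; attempt++) {
		uint64_t wd_start = now_ns(CLOCK_MONOTONIC);	/* watchdog read */
		*csnow = now_ns(CLOCK_REALTIME);		/* clock under test */
		*wdnow = now_ns(CLOCK_MONOTONIC);		/* watchdog read-back */

		if (*wdnow - wd_start < WD_READ_DELAY_NS)
			return 1;	/* reads were close together; trust them */

		fprintf(stderr, "read-back delay of %lluns, attempt %d\n",
			(unsigned long long)(*wdnow - wd_start), attempt);
	}
	return 0;	/* persistent long delays: caller marks the clock unstable */
}

int main(void)
{
	uint64_t cs, wd;

	if (watchdog_read(&cs, &wd))
		printf("good read: cs=%llu wd=%llu\n",
		       (unsigned long long)cs, (unsigned long long)wd);
	else
		fprintf(stderr, "marking clocksource unstable\n");
	return 0;
}

When every attempt exceeds the threshold, as happens under the lockbus
load, the loop runs out of retries and the clock is declared unstable,
which matches the "attempt 4, marking unstable" line in the dmesg above;
raising clocksource.max_cswd_read_retries as Paul suggests only widens
that retry budget.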

As the man page of stress-ng puts it:
"Use stress-ng with caution as some of the tests can make a system
run hot on poorly designed hardware and also can cause excessive
system thrashing which may be difficult to stop"

I don't think this 'lockbus' is close to any real-world usage.

Thanks,
Feng


Attachments:
(No filename) (3.67 kB)
stress-lockbus.c (5.05 kB)
objdump_lockbus.log (11.87 kB)

2021-05-26 13:57:37

by Paul E. McKenney

Subject: Re: [clocksource] 8901ecc231: stress-ng.lockbus.ops_per_sec -9.5% regression

On Wed, May 26, 2021 at 02:49:22PM +0800, Feng Tang wrote:
> On Sat, May 22, 2021 at 09:08:27AM -0700, Paul E. McKenney wrote:
> > On Fri, May 21, 2021 at 06:56:17AM -0700, Paul E. McKenney wrote:
> > > On Fri, May 21, 2021 at 04:33:22PM +0800, kernel test robot wrote:
> > > >
> > > >
> > > > Greeting,
> > > >
> > > > FYI, we noticed a -9.5% regression of stress-ng.lockbus.ops_per_sec due to commit:
> > > >
> > > >
> > > > commit: 8901ecc2315b850f35a7b8c1b73b12388b72aa78 ("clocksource: Retry clock read if long delays detected")
> > > > https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master
> > > >
> > > >
> > > > in testcase: stress-ng
> > > > on test machine: 96 threads 2 sockets Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz with 192G memory
> > > > with following parameters:
> > > >
> > > > nr_threads: 100%
> > > > disk: 1HDD
> > > > testtime: 60s
> > > > class: memory
> > > > test: lockbus
> > > > cpufreq_governor: performance
> > > > ucode: 0x5003006
> > > >
> > > >
> > > > please be noted below in dmesg.xz (attached)
> > > > [ 28.110351]
> > > > [ 28.302357] hrtimer: interrupt took 1878423 ns
> > > > [ 29.690760] clocksource: timekeeping watchdog on CPU53: hpet read-back delay of 169583ns, attempt 4, marking unstable
> > > > [ 29.860306] tsc: Marking TSC unstable due to clocksource watchdog
> > > > [ 30.559390] TSC found unstable after boot, most likely due to broken BIOS. Use 'tsc=unstable'.
> > > > [ 30.726282] sched_clock: Marking unstable (30052964508, 499342225)<-(30915547410, -363240730)
> > > > [ 31.620401] clocksource: Switched to clocksource hpet
> > >
> > > If I am reading the dmesg correctly, there were many interrupts that
> > > prevented a good clock read. This sound to me like a bug that the
> > > clocksource watchdog located, but please let me know if this is not
> > > the case.
> > >
> > > There are also the later "perf: interrupt took too long" messages.
> >
> > And of course, increasing the clocksource.max_cswd_read_retries module
> > boot parameter (or clocksource.max_read_retries in the earlier commits,
> > which I will fix) can work around short bursts of NMIs. Or long bursts
> > of NMIs, if you set this kernel boot parameter large enough.
>
> I reproduced it on a borrowed baremetal 4 nodes, 96C/192T Xeon, with
> latest stress-ng code https://github.com/ColinIanKing/stress-ng.git.
> (2 sockets box should also be able to reproduce it)
>
> Seems this sub testcase 'lockbus' is a extreme stress case, by loop
> doing "lock" operation:
>
> c8: f0 83 00 01 lock addl $0x1,(%rax)
> cc: f0 83 40 04 01 lock addl $0x1,0x4(%rax)
> d1: f0 83 40 08 01 lock addl $0x1,0x8(%rax)
> d6: f0 83 40 0c 01 lock addl $0x1,0xc(%rax)
> db: f0 83 40 10 01 lock addl $0x1,0x10(%rax)
> e0: f0 83 40 14 01 lock addl $0x1,0x14(%rax)
> e5: f0 83 40 18 01 lock addl $0x1,0x18(%rax)
> ea: f0 83 40 1c 01 lock addl $0x1,0x1c(%rax)
> ef: f0 83 01 00 lock addl $0x0,(%rcx)
> f3: f0 83 01 00 lock addl $0x0,(%rcx)
> f7: f0 83 01 00 lock addl $0x0,(%rcx)
> fb: f0 83 01 00 lock addl $0x0,(%rcx)
> ff: f0 83 01 00 lock addl $0x0,(%rcx)
> 103: f0 83 01 00 lock addl $0x0,(%rcx)
> 107: f0 83 01 00 lock addl $0x0,(%rcx)
> ...
>
> (The source c file and objdump are attached fyi)
>
> So the watchdog read (read_hpet() here) sometimes does take very
> long time (hundreds of microseconds) which breaks this sanity
> read check, and cause 'unstable' tsc.
>
> As from the man page of stress-ng:
> "Use stress-ng with caution as some of the tests can make a system
> run hot on poorly designed hardware and also can cause excessive
> system thrashing which may be difficult to stop"
>
> I don't think this 'lockbus' is close to any real-world usage.

Heh! In the past, I have had to adjust Linux-kernel RCU in order to
avoid having too many locked operations. So, yes, I agree that this
test result should not require a change to the clocksource watchdog.

I just rebased to eliminate the pointless name change in the middle
of the series from max_read_retries to max_cswd_read_retries, and will
repost later today or tomorrow.

Thanx, Paul

> Thanks,
> Feng

> /*
> * Copyright (C) 2013-2021 Canonical, Ltd.
> *
> * This program is free software; you can redistribute it and/or
> * modify it under the terms of the GNU General Public License
> * as published by the Free Software Foundation; either version 2
> * of the License, or (at your option) any later version.
> *
> * This program is distributed in the hope that it will be useful,
> * but WITHOUT ANY WARRANTY; without even the implied warranty of
> * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
> * GNU General Public License for more details.
> *
> * You should have received a copy of the GNU General Public License
> * along with this program; if not, write to the Free Software
> * Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301, USA.
> *
> * This code is a complete clean re-write of the stress tool by
> * Colin Ian King <[email protected]> and attempts to be
> * backwardly compatible with the stress tool by Amos Waterland
> * <[email protected]> but has more stress tests and more
> * functionality.
> *
> */
> #include "stress-ng.h"
>
> static const stress_help_t help[] = {
>         { NULL, "lockbus N",     "start N workers locking a memory increment" },
>         { NULL, "lockbus-ops N", "stop after N lockbus bogo operations" },
>         { NULL, NULL,            NULL }
> };
>
> #if (((defined(__GNUC__) || defined(__clang__)) && \
>        defined(STRESS_ARCH_X86)) || \
>       (defined(__GNUC__) && \
>        defined(HAVE_ATOMIC_ADD_FETCH) && \
>        defined(__ATOMIC_SEQ_CST) && \
>        NEED_GNUC(4,7,0) && \
>        defined(STRESS_ARCH_ARM)))
>
> #if defined(HAVE_ATOMIC_ADD_FETCH)
> #define MEM_LOCK(ptr, inc)                                      \
>         do {                                                    \
>                 __atomic_add_fetch(ptr, inc, __ATOMIC_SEQ_CST); \
>         } while (0)
> #else
> #define MEM_LOCK(ptr, inc)                                      \
>         do {                                                    \
>                 asm volatile("lock addl %1,%0" :                \
>                              "+m" (*ptr) :                      \
>                              "ir" (inc));                       \
>         } while (0)
> #endif
>
> #define BUFFER_SIZE (1024 * 1024 * 16)
> #define CHUNK_SIZE (64 * 4)
>
> #define MEM_LOCK_AND_INC(ptr, inc)              \
>         do {                                    \
>                 MEM_LOCK(ptr, inc);             \
>                 ptr++;                          \
>         } while (0)
>
> #define MEM_LOCK_AND_INCx8(ptr, inc)            \
>         do {                                    \
>                 MEM_LOCK_AND_INC(ptr, inc);     \
>                 MEM_LOCK_AND_INC(ptr, inc);     \
>                 MEM_LOCK_AND_INC(ptr, inc);     \
>                 MEM_LOCK_AND_INC(ptr, inc);     \
>                 MEM_LOCK_AND_INC(ptr, inc);     \
>                 MEM_LOCK_AND_INC(ptr, inc);     \
>                 MEM_LOCK_AND_INC(ptr, inc);     \
>                 MEM_LOCK_AND_INC(ptr, inc);     \
>         } while (0)
>
> #define MEM_LOCKx8(ptr)                         \
>         do {                                    \
>                 MEM_LOCK(ptr, 0);               \
>                 MEM_LOCK(ptr, 0);               \
>                 MEM_LOCK(ptr, 0);               \
>                 MEM_LOCK(ptr, 0);               \
>                 MEM_LOCK(ptr, 0);               \
>                 MEM_LOCK(ptr, 0);               \
>                 MEM_LOCK(ptr, 0);               \
>                 MEM_LOCK(ptr, 0);               \
>         } while (0)
>
> #if defined(STRESS_ARCH_X86)
> static sigjmp_buf jmp_env;
> static bool do_splitlock;
>
> static void NORETURN MLOCKED_TEXT stress_sigbus_handler(int signum)
> {
>         (void)signum;
>
>         do_splitlock = false;
>
>         siglongjmp(jmp_env, 1);
> }
> #endif
>
> /*
> * stress_lockbus()
> * stress memory with lock and increment
> */
> static int stress_lockbus(const stress_args_t *args)
> {
>         uint32_t *buffer;
>         int flags = MAP_ANONYMOUS | MAP_SHARED;
> #if defined(STRESS_ARCH_X86)
>         uint32_t *splitlock_ptr1, *splitlock_ptr2;
>
>         if (stress_sighandler(args->name, SIGBUS, stress_sigbus_handler, NULL) < 0)
>                 return EXIT_FAILURE;
> #endif
>
> #if defined(MAP_POPULATE)
>         flags |= MAP_POPULATE;
> #endif
>         buffer = (uint32_t*)mmap(NULL, BUFFER_SIZE, PROT_READ | PROT_WRITE, flags, -1, 0);
>         if (buffer == MAP_FAILED) {
>                 int rc = exit_status(errno);
>
>                 pr_err("%s: mmap failed\n", args->name);
>                 return rc;
>         }
>
> #if defined(STRESS_ARCH_X86)
>         /* Split lock on a page boundary */
>         splitlock_ptr1 = (uint32_t *)(((uint8_t *)buffer) + args->page_size - (sizeof(uint32_t) >> 1));
>         /* Split lock on a cache boundary */
>         splitlock_ptr2 = (uint32_t *)(((uint8_t *)buffer) + 64 - (sizeof(uint32_t) >> 1));
>         do_splitlock = true;
>         if (sigsetjmp(jmp_env, 1) && !keep_stressing(args))
>                 goto done;
> #endif
>         stress_set_proc_state(args->name, STRESS_STATE_RUN);
>
>         do {
>                 uint32_t *ptr0 = buffer + ((stress_mwc32() % (BUFFER_SIZE - CHUNK_SIZE)) >> 2);
> #if defined(STRESS_ARCH_X86)
>                 uint32_t *ptr1 = do_splitlock ? splitlock_ptr1 : ptr0;
>                 uint32_t *ptr2 = do_splitlock ? splitlock_ptr2 : ptr0;
> #else
>                 uint32_t *ptr1 = ptr0;
>                 uint32_t *ptr2 = ptr0;
> #endif
>                 const uint32_t inc = 1;
>
>                 MEM_LOCK_AND_INCx8(ptr0, inc);
>                 MEM_LOCKx8(ptr1);
>                 MEM_LOCKx8(ptr2);
>                 MEM_LOCK_AND_INCx8(ptr0, inc);
>                 MEM_LOCKx8(ptr1);
>                 MEM_LOCKx8(ptr2);
>                 MEM_LOCK_AND_INCx8(ptr0, inc);
>                 MEM_LOCKx8(ptr1);
>                 MEM_LOCKx8(ptr2);
>                 MEM_LOCK_AND_INCx8(ptr0, inc);
>                 MEM_LOCKx8(ptr1);
>                 MEM_LOCKx8(ptr2);
>
>                 inc_counter(args);
>         } while (keep_stressing(args));
>
> #if defined(STRESS_ARCH_X86)
> done:
> #endif
>         stress_set_proc_state(args->name, STRESS_STATE_DEINIT);
>
>         (void)munmap((void *)buffer, BUFFER_SIZE);
>
>         return EXIT_SUCCESS;
> }
>
> stressor_info_t stress_lockbus_info = {
>         .stressor = stress_lockbus,
>         .class = CLASS_CPU_CACHE | CLASS_MEMORY,
>         .help = help
> };
> #else
> stressor_info_t stress_lockbus_info = {
>         .stressor = stress_not_implemented,
>         .class = CLASS_CPU_CACHE | CLASS_MEMORY,
>         .help = help
> };
> #endif

>
> stress-lockbus.o: file format elf64-x86-64
>
>
> Disassembly of section .text:
>
> 0000000000000000 <stress_lockbus>:
> 0: 53 push %rbx
> 1: 48 8d 15 00 00 00 00 lea 0x0(%rip),%rdx # 8 <stress_lockbus+0x8>
> 8: 31 c9 xor %ecx,%ecx
> a: be 07 00 00 00 mov $0x7,%esi
> f: bb 01 00 00 00 mov $0x1,%ebx
> 14: 48 83 ec 20 sub $0x20,%rsp
> 18: 48 89 3c 24 mov %rdi,(%rsp)
> 1c: 48 8b 7f 10 mov 0x10(%rdi),%rdi
> 20: e8 00 00 00 00 callq 25 <stress_lockbus+0x25>
> 25: 85 c0 test %eax,%eax
> 27: 0f 88 e9 02 00 00 js 316 <stress_lockbus+0x316>
> 2d: 45 31 c9 xor %r9d,%r9d
> 30: 31 ff xor %edi,%edi
> 32: 41 b8 ff ff ff ff mov $0xffffffff,%r8d
> 38: b9 21 80 00 00 mov $0x8021,%ecx
> 3d: ba 03 00 00 00 mov $0x3,%edx
> 42: be 00 00 00 01 mov $0x1000000,%esi
> 47: e8 00 00 00 00 callq 4c <stress_lockbus+0x4c>
> 4c: 48 83 f8 ff cmp $0xffffffffffffffff,%rax
> 50: 48 89 44 24 08 mov %rax,0x8(%rsp)
> 55: 0f 84 85 02 00 00 je 2e0 <stress_lockbus+0x2e0>
> 5b: 48 8b 04 24 mov (%rsp),%rax
> 5f: 48 8b 5c 24 08 mov 0x8(%rsp),%rbx
> 64: 48 8d 3d 00 00 00 00 lea 0x0(%rip),%rdi # 6b <stress_lockbus+0x6b>
> 6b: be 01 00 00 00 mov $0x1,%esi
> 70: c6 05 00 00 00 00 01 movb $0x1,0x0(%rip) # 77 <stress_lockbus+0x77>
> 77: 48 8b 40 30 mov 0x30(%rax),%rax
> 7b: 48 8d 44 03 fe lea -0x2(%rbx,%rax,1),%rax
> 80: 48 89 44 24 18 mov %rax,0x18(%rsp)
> 85: 48 8d 43 3e lea 0x3e(%rbx),%rax
> 89: 48 89 44 24 10 mov %rax,0x10(%rsp)
> 8e: e8 00 00 00 00 callq 93 <stress_lockbus+0x93>
> 93: 85 c0 test %eax,%eax
> 95: 0f 85 85 02 00 00 jne 320 <stress_lockbus+0x320>
> 9b: 48 8b 04 24 mov (%rsp),%rax
> 9f: be 02 00 00 00 mov $0x2,%esi
> a4: 48 8b 78 10 mov 0x10(%rax),%rdi
> a8: e8 00 00 00 00 callq ad <stress_lockbus+0xad>
> ad: 48 8b 5c 24 18 mov 0x18(%rsp),%rbx
> b2: e9 da 01 00 00 jmpq 291 <stress_lockbus+0x291>
> b7: 66 0f 1f 84 00 00 00 nopw 0x0(%rax,%rax,1)
> be: 00 00
> c0: 48 8b 54 24 10 mov 0x10(%rsp),%rdx
> c5: 48 89 d9 mov %rbx,%rcx
> c8: f0 83 00 01 lock addl $0x1,(%rax)
> cc: f0 83 40 04 01 lock addl $0x1,0x4(%rax)
> d1: f0 83 40 08 01 lock addl $0x1,0x8(%rax)
> d6: f0 83 40 0c 01 lock addl $0x1,0xc(%rax)
> db: f0 83 40 10 01 lock addl $0x1,0x10(%rax)
> e0: f0 83 40 14 01 lock addl $0x1,0x14(%rax)
> e5: f0 83 40 18 01 lock addl $0x1,0x18(%rax)
> ea: f0 83 40 1c 01 lock addl $0x1,0x1c(%rax)
> ef: f0 83 01 00 lock addl $0x0,(%rcx)
> f3: f0 83 01 00 lock addl $0x0,(%rcx)
> f7: f0 83 01 00 lock addl $0x0,(%rcx)
> fb: f0 83 01 00 lock addl $0x0,(%rcx)
> ff: f0 83 01 00 lock addl $0x0,(%rcx)
> 103: f0 83 01 00 lock addl $0x0,(%rcx)
> 107: f0 83 01 00 lock addl $0x0,(%rcx)
> 10b: f0 83 01 00 lock addl $0x0,(%rcx)
> 10f: f0 83 02 00 lock addl $0x0,(%rdx)
> 113: f0 83 02 00 lock addl $0x0,(%rdx)
> 117: f0 83 02 00 lock addl $0x0,(%rdx)
> 11b: f0 83 02 00 lock addl $0x0,(%rdx)
> 11f: f0 83 02 00 lock addl $0x0,(%rdx)
> 123: f0 83 02 00 lock addl $0x0,(%rdx)
> 127: f0 83 02 00 lock addl $0x0,(%rdx)
> 12b: f0 83 02 00 lock addl $0x0,(%rdx)
> 12f: f0 83 40 20 01 lock addl $0x1,0x20(%rax)
> 134: f0 83 40 24 01 lock addl $0x1,0x24(%rax)
> 139: f0 83 40 28 01 lock addl $0x1,0x28(%rax)
> 13e: f0 83 40 2c 01 lock addl $0x1,0x2c(%rax)
> 143: f0 83 40 30 01 lock addl $0x1,0x30(%rax)
> 148: f0 83 40 34 01 lock addl $0x1,0x34(%rax)
> 14d: f0 83 40 38 01 lock addl $0x1,0x38(%rax)
> 152: f0 83 40 3c 01 lock addl $0x1,0x3c(%rax)
> 157: f0 83 01 00 lock addl $0x0,(%rcx)
> 15b: f0 83 01 00 lock addl $0x0,(%rcx)
> 15f: f0 83 01 00 lock addl $0x0,(%rcx)
> 163: f0 83 01 00 lock addl $0x0,(%rcx)
> 167: f0 83 01 00 lock addl $0x0,(%rcx)
> 16b: f0 83 01 00 lock addl $0x0,(%rcx)
> 16f: f0 83 01 00 lock addl $0x0,(%rcx)
> 173: f0 83 01 00 lock addl $0x0,(%rcx)
> 177: f0 83 02 00 lock addl $0x0,(%rdx)
> 17b: f0 83 02 00 lock addl $0x0,(%rdx)
> 17f: f0 83 02 00 lock addl $0x0,(%rdx)
> 183: f0 83 02 00 lock addl $0x0,(%rdx)
> 187: f0 83 02 00 lock addl $0x0,(%rdx)
> 18b: f0 83 02 00 lock addl $0x0,(%rdx)
> 18f: f0 83 02 00 lock addl $0x0,(%rdx)
> 193: f0 83 02 00 lock addl $0x0,(%rdx)
> 197: f0 83 40 40 01 lock addl $0x1,0x40(%rax)
> 19c: f0 83 40 44 01 lock addl $0x1,0x44(%rax)
> 1a1: f0 83 40 48 01 lock addl $0x1,0x48(%rax)
> 1a6: f0 83 40 4c 01 lock addl $0x1,0x4c(%rax)
> 1ab: f0 83 40 50 01 lock addl $0x1,0x50(%rax)
> 1b0: f0 83 40 54 01 lock addl $0x1,0x54(%rax)
> 1b5: f0 83 40 58 01 lock addl $0x1,0x58(%rax)
> 1ba: f0 83 40 5c 01 lock addl $0x1,0x5c(%rax)
> 1bf: f0 83 01 00 lock addl $0x0,(%rcx)
> 1c3: f0 83 01 00 lock addl $0x0,(%rcx)
> 1c7: f0 83 01 00 lock addl $0x0,(%rcx)
> 1cb: f0 83 01 00 lock addl $0x0,(%rcx)
> 1cf: f0 83 01 00 lock addl $0x0,(%rcx)
> 1d3: f0 83 01 00 lock addl $0x0,(%rcx)
> 1d7: f0 83 01 00 lock addl $0x0,(%rcx)
> 1db: f0 83 01 00 lock addl $0x0,(%rcx)
> 1df: f0 83 02 00 lock addl $0x0,(%rdx)
> 1e3: f0 83 02 00 lock addl $0x0,(%rdx)
> 1e7: f0 83 02 00 lock addl $0x0,(%rdx)
> 1eb: f0 83 02 00 lock addl $0x0,(%rdx)
> 1ef: f0 83 02 00 lock addl $0x0,(%rdx)
> 1f3: f0 83 02 00 lock addl $0x0,(%rdx)
> 1f7: f0 83 02 00 lock addl $0x0,(%rdx)
> 1fb: f0 83 02 00 lock addl $0x0,(%rdx)
> 1ff: f0 83 40 60 01 lock addl $0x1,0x60(%rax)
> 204: f0 83 40 64 01 lock addl $0x1,0x64(%rax)
> 209: f0 83 40 68 01 lock addl $0x1,0x68(%rax)
> 20e: f0 83 40 6c 01 lock addl $0x1,0x6c(%rax)
> 213: f0 83 40 70 01 lock addl $0x1,0x70(%rax)
> 218: f0 83 40 74 01 lock addl $0x1,0x74(%rax)
> 21d: f0 83 40 78 01 lock addl $0x1,0x78(%rax)
> 222: f0 83 40 7c 01 lock addl $0x1,0x7c(%rax)
> 227: f0 83 01 00 lock addl $0x0,(%rcx)
> 22b: f0 83 01 00 lock addl $0x0,(%rcx)
> 22f: f0 83 01 00 lock addl $0x0,(%rcx)
> 233: f0 83 01 00 lock addl $0x0,(%rcx)
> 237: f0 83 01 00 lock addl $0x0,(%rcx)
> 23b: f0 83 01 00 lock addl $0x0,(%rcx)
> 23f: f0 83 01 00 lock addl $0x0,(%rcx)
> 243: f0 83 01 00 lock addl $0x0,(%rcx)
> 247: f0 83 02 00 lock addl $0x0,(%rdx)
> 24b: f0 83 02 00 lock addl $0x0,(%rdx)
> 24f: f0 83 02 00 lock addl $0x0,(%rdx)
> 253: f0 83 02 00 lock addl $0x0,(%rdx)
> 257: f0 83 02 00 lock addl $0x0,(%rdx)
> 25b: f0 83 02 00 lock addl $0x0,(%rdx)
> 25f: f0 83 02 00 lock addl $0x0,(%rdx)
> 263: f0 83 02 00 lock addl $0x0,(%rdx)
> 267: 48 8b 3c 24 mov (%rsp),%rdi
> 26b: 48 8b 47 08 mov 0x8(%rdi),%rax
> 26f: c6 00 00 movb $0x0,(%rax)
> 272: 48 8b 07 mov (%rdi),%rax
> 275: 48 83 00 01 addq $0x1,(%rax)
> 279: 48 8b 47 08 mov 0x8(%rdi),%rax
> 27d: c6 00 01 movb $0x1,(%rax)
> 280: 48 8d 77 18 lea 0x18(%rdi),%rsi
> 284: e8 00 00 00 00 callq 289 <stress_lockbus+0x289>
> 289: 84 c0 test %al,%al
> 28b: 0f 84 a4 00 00 00 je 335 <stress_lockbus+0x335>
> 291: e8 00 00 00 00 callq 296 <stress_lockbus+0x296>
> 296: 89 c1 mov %eax,%ecx
> 298: 48 89 ca mov %rcx,%rdx
> 29b: 48 c1 e2 10 shl $0x10,%rdx
> 29f: 48 01 ca add %rcx,%rdx
> 2a2: 48 c1 e2 0f shl $0xf,%rdx
> 2a6: 48 01 ca add %rcx,%rdx
> 2a9: 48 c1 ea 37 shr $0x37,%rdx
> 2ad: 69 d2 00 ff ff 00 imul $0xffff00,%edx,%edx
> 2b3: 48 29 d0 sub %rdx,%rax
> 2b6: 83 e0 fc and $0xfffffffc,%eax
> 2b9: 48 03 44 24 08 add 0x8(%rsp),%rax
> 2be: 80 3d 00 00 00 00 00 cmpb $0x0,0x0(%rip) # 2c5 <stress_lockbus+0x2c5>
> 2c5: 0f 85 f5 fd ff ff jne c0 <stress_lockbus+0xc0>
> 2cb: 48 89 c1 mov %rax,%rcx
> 2ce: 48 89 c2 mov %rax,%rdx
> 2d1: e9 f2 fd ff ff jmpq c8 <stress_lockbus+0xc8>
> 2d6: 66 2e 0f 1f 84 00 00 nopw %cs:0x0(%rax,%rax,1)
> 2dd: 00 00 00
> 2e0: e8 00 00 00 00 callq 2e5 <stress_lockbus+0x2e5>
> 2e5: 8b 00 mov (%rax),%eax
> 2e7: 83 f8 1c cmp $0x1c,%eax
> 2ea: 74 74 je 360 <stress_lockbus+0x360>
> 2ec: 83 f8 26 cmp $0x26,%eax
> 2ef: bb 04 00 00 00 mov $0x4,%ebx
> 2f4: 74 0a je 300 <stress_lockbus+0x300>
> 2f6: 83 f8 0c cmp $0xc,%eax
> 2f9: 74 65 je 360 <stress_lockbus+0x360>
> 2fb: bb 01 00 00 00 mov $0x1,%ebx
> 300: 48 8b 04 24 mov (%rsp),%rax
> 304: 48 8d 3d 00 00 00 00 lea 0x0(%rip),%rdi # 30b <stress_lockbus+0x30b>
> 30b: 48 8b 70 10 mov 0x10(%rax),%rsi
> 30f: 31 c0 xor %eax,%eax
> 311: e8 00 00 00 00 callq 316 <stress_lockbus+0x316>
> 316: 48 83 c4 20 add $0x20,%rsp
> 31a: 89 d8 mov %ebx,%eax
> 31c: 5b pop %rbx
> 31d: c3 retq
> 31e: 66 90 xchg %ax,%ax
> 320: 48 8b 3c 24 mov (%rsp),%rdi
> 324: 48 8d 77 18 lea 0x18(%rdi),%rsi
> 328: e8 00 00 00 00 callq 32d <stress_lockbus+0x32d>
> 32d: 84 c0 test %al,%al
> 32f: 0f 85 66 fd ff ff jne 9b <stress_lockbus+0x9b>
> 335: 48 8b 04 24 mov (%rsp),%rax
> 339: be 03 00 00 00 mov $0x3,%esi
> 33e: 31 db xor %ebx,%ebx
> 340: 48 8b 78 10 mov 0x10(%rax),%rdi
> 344: e8 00 00 00 00 callq 349 <stress_lockbus+0x349>
> 349: 48 8b 7c 24 08 mov 0x8(%rsp),%rdi
> 34e: be 00 00 00 01 mov $0x1000000,%esi
> 353: e8 00 00 00 00 callq 358 <stress_lockbus+0x358>
> 358: 48 83 c4 20 add $0x20,%rsp
> 35c: 89 d8 mov %ebx,%eax
> 35e: 5b pop %rbx
> 35f: c3 retq
> 360: bb 03 00 00 00 mov $0x3,%ebx
> 365: eb 99 jmp 300 <stress_lockbus+0x300>
>
> Disassembly of section mlocked_text:
>
> 0000000000000000 <stress_sigbus_handler>:
> 0: 48 8d 3d 00 00 00 00 lea 0x0(%rip),%rdi # 7 <stress_sigbus_handler+0x7>
> 7: 48 83 ec 08 sub $0x8,%rsp
> b: be 01 00 00 00 mov $0x1,%esi
> 10: c6 05 00 00 00 00 00 movb $0x0,0x0(%rip) # 17 <stress_sigbus_handler+0x17>
> 17: e8 00 00 00 00 callq 1c <stress_sigbus_handler+0x1c>
>
> Disassembly of section .text.hot:
>
> 0000000000000000 <keep_stressing.isra.0>:
> 0: 0f b6 05 00 00 00 00 movzbl 0x0(%rip),%eax # 7 <keep_stressing.isra.0+0x7>
> 7: 84 c0 test %al,%al
> 9: 74 08 je 13 <keep_stressing.isra.0+0x13>
> b: 48 8b 16 mov (%rsi),%rdx
> e: 48 85 d2 test %rdx,%rdx
> 11: 75 05 jne 18 <keep_stressing.isra.0+0x18>
> 13: f3 c3 repz retq
> 15: 0f 1f 00 nopl (%rax)
> 18: 48 8b 07 mov (%rdi),%rax
> 1b: 48 3b 10 cmp (%rax),%rdx
> 1e: 0f 97 c0 seta %al
> 21: c3 retq

2021-05-27 23:51:19

by Paul E. McKenney

[permalink] [raw]
Subject: Re: [clocksource] 8901ecc231: stress-ng.lockbus.ops_per_sec -9.5% regression

On Wed, May 26, 2021 at 06:49:11AM -0700, Paul E. McKenney wrote:
> On Wed, May 26, 2021 at 02:49:22PM +0800, Feng Tang wrote:
> > On Sat, May 22, 2021 at 09:08:27AM -0700, Paul E. McKenney wrote:
> > > On Fri, May 21, 2021 at 06:56:17AM -0700, Paul E. McKenney wrote:
> > > > On Fri, May 21, 2021 at 04:33:22PM +0800, kernel test robot wrote:
> > > > >
> > > > >
> > > > > Greeting,
> > > > >
> > > > > FYI, we noticed a -9.5% regression of stress-ng.lockbus.ops_per_sec due to commit:
> > > > >
> > > > >
> > > > > commit: 8901ecc2315b850f35a7b8c1b73b12388b72aa78 ("clocksource: Retry clock read if long delays detected")
> > > > > https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master
> > > > >
> > > > >
> > > > > in testcase: stress-ng
> > > > > on test machine: 96 threads 2 sockets Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz with 192G memory
> > > > > with following parameters:
> > > > >
> > > > > nr_threads: 100%
> > > > > disk: 1HDD
> > > > > testtime: 60s
> > > > > class: memory
> > > > > test: lockbus
> > > > > cpufreq_governor: performance
> > > > > ucode: 0x5003006
> > > > >
> > > > >
> > > > > please be noted below in dmesg.xz (attached)
> > > > > [ 28.110351]
> > > > > [ 28.302357] hrtimer: interrupt took 1878423 ns
> > > > > [ 29.690760] clocksource: timekeeping watchdog on CPU53: hpet read-back delay of 169583ns, attempt 4, marking unstable
> > > > > [ 29.860306] tsc: Marking TSC unstable due to clocksource watchdog
> > > > > [ 30.559390] TSC found unstable after boot, most likely due to broken BIOS. Use 'tsc=unstable'.
> > > > > [ 30.726282] sched_clock: Marking unstable (30052964508, 499342225)<-(30915547410, -363240730)
> > > > > [ 31.620401] clocksource: Switched to clocksource hpet
> > > >
> > > > If I am reading the dmesg correctly, there were many interrupts that
> > > > prevented a good clock read. This sound to me like a bug that the
> > > > clocksource watchdog located, but please let me know if this is not
> > > > the case.
> > > >
> > > > There are also the later "perf: interrupt took too long" messages.
> > >
> > > And of course, increasing the clocksource.max_cswd_read_retries module
> > > boot parameter (or clocksource.max_read_retries in the earlier commits,
> > > which I will fix) can work around short bursts of NMIs. Or long bursts
> > > of NMIs, if you set this kernel boot parameter large enough.
> >
> > I reproduced it on a borrowed bare-metal 4-node, 96C/192T Xeon, with the
> > latest stress-ng code from https://github.com/ColinIanKing/stress-ng.git.
> > (A 2-socket box should also be able to reproduce it.)
> >
> > Seems this sub testcase 'lockbus' is an extreme stress case, doing
> > "lock" operations in a loop:
> >
> > c8: f0 83 00 01 lock addl $0x1,(%rax)
> > cc: f0 83 40 04 01 lock addl $0x1,0x4(%rax)
> > d1: f0 83 40 08 01 lock addl $0x1,0x8(%rax)
> > d6: f0 83 40 0c 01 lock addl $0x1,0xc(%rax)
> > db: f0 83 40 10 01 lock addl $0x1,0x10(%rax)
> > e0: f0 83 40 14 01 lock addl $0x1,0x14(%rax)
> > e5: f0 83 40 18 01 lock addl $0x1,0x18(%rax)
> > ea: f0 83 40 1c 01 lock addl $0x1,0x1c(%rax)
> > ef: f0 83 01 00 lock addl $0x0,(%rcx)
> > f3: f0 83 01 00 lock addl $0x0,(%rcx)
> > f7: f0 83 01 00 lock addl $0x0,(%rcx)
> > fb: f0 83 01 00 lock addl $0x0,(%rcx)
> > ff: f0 83 01 00 lock addl $0x0,(%rcx)
> > 103: f0 83 01 00 lock addl $0x0,(%rcx)
> > 107: f0 83 01 00 lock addl $0x0,(%rcx)
> > ...
> >
> > (The source c file and objdump are attached fyi)
> >
> > So the watchdog read (read_hpet() here) sometimes takes a very long
> > time (hundreds of microseconds), which breaks this sanity read check
> > and causes the tsc to be marked 'unstable'.
> >
> > As from the man page of stress-ng:
> > "Use stress-ng with caution as some of the tests can make a system
> > run hot on poorly designed hardware and also can cause excessive
> > system thrashing which may be difficult to stop"
> >
> > I don't think this 'lockbus' is close to any real-world usage.
>
> Heh! In the past, I have had to adjust Linux-kernel RCU in order to
> avoid having too many locked operations. So, yes, I agree that this
> test result should not require a change to the clocksource watchdog.
>
> I just rebased to eliminate the pointless name change in the middle
> of the series from max_read_retries to max_cswd_read_retries, and will
> repost later today or tomorrow.

In addition, please see below for a just-in-case out-of-tree patch that
takes a different approach in response to persistent long-latency reads.
It falls back to using the old 62.5-millisecond skew threshold and also
marks the offending clocksource for reinitialization.

Again, I believe that the current less-subtle approach will serve us well,
especially during hardware bringup, but just in case...

Thanx, Paul

------------------------------------------------------------------------

commit 48ebcfbfd877f5d9cddcc03c91352a8ca7b190af
Author: Paul E. McKenney <[email protected]>
Date: Thu May 27 11:03:28 2021 -0700

clocksource: Forgive repeated long-latency watchdog clocksource reads

Currently, the clocksource watchdog reacts to repeated long-latency
clocksource reads by marking that clocksource unstable on the theory that
these long-latency reads are a sign of a serious problem. And this theory
does in fact have real-world support in the form of firmware issues [1].

However, it is also possible to trigger this using stress-ng on what
the stress-ng man page terms "poorly designed hardware" [2]. And it
is not necessarily a bad thing for the kernel to diagnose cases where
high-stress workloads are being run on hardware that is not designed
for this sort of use.

Nevertheless, it is quite possible that real-world use will result in
some situation requiring that high-stress workloads run on hardware
not designed to accommodate them, and also requiring that the kernel
refrain from marking clocksources unstable.

Therefore, provide an out-of-tree patch that reacts to this situation
by leaving the clocksource alone, but using the old 62.5-millisecond
skew-detection threshold in response to persistent long-latency reads.
In addition, the offending clocksource is marked for re-initialization
in this case, which both restarts that clocksource with a clean bill of
health and avoids false-positive skew reports on later watchdog checks.

Link: https://lore.kernel.org/lkml/20210513155515.GB23902@xsang-OptiPlex-9020/ # [1]
Link: https://lore.kernel.org/lkml/20210521083322.GG25531@xsang-OptiPlex-9020/ # [2]
Link: https://lore.kernel.org/lkml/20210521084405.GH25531@xsang-OptiPlex-9020/
Link: https://lore.kernel.org/lkml/20210511233403.GA2896757@paulmck-ThinkPad-P17-Gen-1/
Signed-off-by: Paul E. McKenney <[email protected]>

diff --git a/kernel/time/clocksource-wdtest.c b/kernel/time/clocksource-wdtest.c
index 01df12395c0e..b72a969f7b93 100644
--- a/kernel/time/clocksource-wdtest.c
+++ b/kernel/time/clocksource-wdtest.c
@@ -146,13 +146,12 @@ static int wdtest_func(void *arg)
else if (i <= max_cswd_read_retries)
s = ", expect message";
else
- s = ", expect clock skew";
+ s = ", expect coarse-grained clock skew check and re-initialization";
pr_info("--- Watchdog with %dx error injection, %lu retries%s.\n", i, max_cswd_read_retries, s);
WRITE_ONCE(wdtest_ktime_read_ndelays, i);
schedule_timeout_uninterruptible(2 * HZ);
WARN_ON_ONCE(READ_ONCE(wdtest_ktime_read_ndelays));
- WARN_ON_ONCE((i <= max_cswd_read_retries) !=
- !(clocksource_wdtest_ktime.flags & CLOCK_SOURCE_UNSTABLE));
+ WARN_ON_ONCE(clocksource_wdtest_ktime.flags & CLOCK_SOURCE_UNSTABLE);
wdtest_ktime_clocksource_reset();
}

diff --git a/kernel/time/clocksource.c b/kernel/time/clocksource.c
index 4485635b69f5..6c0820779bd3 100644
--- a/kernel/time/clocksource.c
+++ b/kernel/time/clocksource.c
@@ -225,13 +225,13 @@ static bool cs_watchdog_read(struct clocksource *cs, u64 *csnow, u64 *wdnow)
pr_warn("timekeeping watchdog on CPU%d: %s retried %d times before success\n",
smp_processor_id(), watchdog->name, nretries);
}
- return true;
+ return false;
}
}

- pr_warn("timekeeping watchdog on CPU%d: %s read-back delay of %lldns, attempt %d, marking unstable\n",
+ pr_warn("timekeeping watchdog on CPU%d: %s read-back delay of %lldns, attempt %d, coarse-grained skew check followed by re-initialization\n",
smp_processor_id(), watchdog->name, wd_delay, nretries);
- return false;
+ return true;
}

static u64 csnow_mid;
@@ -355,6 +355,7 @@ static void clocksource_watchdog(struct timer_list *unused)
int next_cpu, reset_pending;
int64_t wd_nsec, cs_nsec;
struct clocksource *cs;
+ bool coarse;
u32 md;

spin_lock(&watchdog_lock);
@@ -372,11 +373,7 @@ static void clocksource_watchdog(struct timer_list *unused)
continue;
}

- if (!cs_watchdog_read(cs, &csnow, &wdnow)) {
- /* Clock readout unreliable, so give it up. */
- __clocksource_unstable(cs);
- continue;
- }
+ coarse = cs_watchdog_read(cs, &csnow, &wdnow);

/* Clocksource initialized ? */
if (!(cs->flags & CLOCK_SOURCE_WATCHDOG) ||
@@ -402,7 +399,13 @@ static void clocksource_watchdog(struct timer_list *unused)
continue;

/* Check the deviation from the watchdog clocksource. */
- md = cs->uncertainty_margin + watchdog->uncertainty_margin;
+ if (coarse) {
+ md = 62500 * NSEC_PER_USEC;
+ cs->flags &= ~CLOCK_SOURCE_WATCHDOG;
+ pr_warn("timekeeping watchdog on CPU%d: %s coarse-grained %lu.%03lu ms clock-skew check followed by re-initialization\n", smp_processor_id(), watchdog->name, md / NSEC_PER_MSEC, md % NSEC_PER_MSEC / NSEC_PER_USEC);
+ } else {
+ md = cs->uncertainty_margin + watchdog->uncertainty_margin;
+ }
if (abs(cs_nsec - wd_nsec) > md) {
pr_warn("timekeeping watchdog on CPU%d: Marking clocksource '%s' as unstable because the skew is too large:\n",
smp_processor_id(), cs->name);

2021-05-27 23:55:28

by Andi Kleen

[permalink] [raw]
Subject: Re: [clocksource] 8901ecc231: stress-ng.lockbus.ops_per_sec -9.5% regression


>
> Nevertheless, it is quite possible that real-world use will result in
> some situation requiring that high-stress workloads run on hardware
> not designed to accommodate them, and also requiring that the kernel
> refrain from marking clocksources unstable.
>
> Therefore, provide an out-of-tree patch that reacts to this situation


out-of-tree means it will not be submitted?


I think it would make sense upstream, but perhaps guarded with some option.

2021-08-02 06:13:46

by Chao Gao

[permalink] [raw]
Subject: Re: [clocksource] 8901ecc231: stress-ng.lockbus.ops_per_sec -9.5% regression

[snip]
>commit 48ebcfbfd877f5d9cddcc03c91352a8ca7b190af
>Author: Paul E. McKenney <[email protected]>
>Date: Thu May 27 11:03:28 2021 -0700
>
> clocksource: Forgive repeated long-latency watchdog clocksource reads
>
> Currently, the clocksource watchdog reacts to repeated long-latency
> clocksource reads by marking that clocksource unstable on the theory that
> these long-latency reads are a sign of a serious problem. And this theory
> does in fact have real-world support in the form of firmware issues [1].
>
> However, it is also possible to trigger this using stress-ng on what
> the stress-ng man page terms "poorly designed hardware" [2]. And it
> is not necessarily a bad thing for the kernel to diagnose cases where
> high-stress workloads are being run on hardware that is not designed
> for this sort of use.
>
> Nevertheless, it is quite possible that real-world use will result in
> some situation requiring that high-stress workloads run on hardware
> not designed to accommodate them, and also requiring that the kernel
> refrain from marking clocksources unstable.
>
> Therefore, provide an out-of-tree patch that reacts to this situation
> by leaving the clocksource alone, but using the old 62.5-millisecond
> skew-detection threshold in response persistent long-latency reads.
> In addition, the offending clocksource is marked for re-initialization
> in this case, which both restarts that clocksource with a clean bill of
> health and avoids false-positive skew reports on later watchdog checks.

Hi Paul,

Sorry to dig out this old thread.

I am testing with this patch in a VM, but I find that sometimes the
re-initialization after a coarse-grained skew check doesn't happen as
expected because ...

>
> Link: https://lore.kernel.org/lkml/20210513155515.GB23902@xsang-OptiPlex-9020/ # [1]
> Link: https://lore.kernel.org/lkml/20210521083322.GG25531@xsang-OptiPlex-9020/ # [2]
> Link: https://lore.kernel.org/lkml/20210521084405.GH25531@xsang-OptiPlex-9020/
> Link: https://lore.kernel.org/lkml/20210511233403.GA2896757@paulmck-ThinkPad-P17-Gen-1/
> Signed-off-by: Paul E. McKenney <[email protected]>
>
>diff --git a/kernel/time/clocksource-wdtest.c b/kernel/time/clocksource-wdtest.c
>index 01df12395c0e..b72a969f7b93 100644
>--- a/kernel/time/clocksource-wdtest.c
>+++ b/kernel/time/clocksource-wdtest.c
>@@ -146,13 +146,12 @@ static int wdtest_func(void *arg)
> else if (i <= max_cswd_read_retries)
> s = ", expect message";
> else
>- s = ", expect clock skew";
>+ s = ", expect coarse-grained clock skew check and re-initialization";
> pr_info("--- Watchdog with %dx error injection, %lu retries%s.\n", i, max_cswd_read_retries, s);
> WRITE_ONCE(wdtest_ktime_read_ndelays, i);
> schedule_timeout_uninterruptible(2 * HZ);
> WARN_ON_ONCE(READ_ONCE(wdtest_ktime_read_ndelays));
>- WARN_ON_ONCE((i <= max_cswd_read_retries) !=
>- !(clocksource_wdtest_ktime.flags & CLOCK_SOURCE_UNSTABLE));
>+ WARN_ON_ONCE(clocksource_wdtest_ktime.flags & CLOCK_SOURCE_UNSTABLE);
> wdtest_ktime_clocksource_reset();
> }
>
>diff --git a/kernel/time/clocksource.c b/kernel/time/clocksource.c
>index 4485635b69f5..6c0820779bd3 100644
>--- a/kernel/time/clocksource.c
>+++ b/kernel/time/clocksource.c
>@@ -225,13 +225,13 @@ static bool cs_watchdog_read(struct clocksource *cs, u64 *csnow, u64 *wdnow)
> pr_warn("timekeeping watchdog on CPU%d: %s retried %d times before success\n",
> smp_processor_id(), watchdog->name, nretries);
> }
>- return true;
>+ return false;
> }
> }
>
>- pr_warn("timekeeping watchdog on CPU%d: %s read-back delay of %lldns, attempt %d, marking unstable\n",
>+ pr_warn("timekeeping watchdog on CPU%d: %s read-back delay of %lldns, attempt %d, coarse-grained skew check followed by re-initialization\n",
> smp_processor_id(), watchdog->name, wd_delay, nretries);
>- return false;
>+ return true;
> }
>
> static u64 csnow_mid;
>@@ -355,6 +355,7 @@ static void clocksource_watchdog(struct timer_list *unused)
> int next_cpu, reset_pending;
> int64_t wd_nsec, cs_nsec;
> struct clocksource *cs;
>+ bool coarse;
> u32 md;
>
> spin_lock(&watchdog_lock);
>@@ -372,11 +373,7 @@ static void clocksource_watchdog(struct timer_list *unused)
> continue;
> }
>
>- if (!cs_watchdog_read(cs, &csnow, &wdnow)) {
>- /* Clock readout unreliable, so give it up. */
>- __clocksource_unstable(cs);
>- continue;
>- }
>+ coarse = cs_watchdog_read(cs, &csnow, &wdnow);
>
> /* Clocksource initialized ? */
> if (!(cs->flags & CLOCK_SOURCE_WATCHDOG) ||
>@@ -402,7 +399,13 @@ static void clocksource_watchdog(struct timer_list *unused)
> continue;
>
> /* Check the deviation from the watchdog clocksource. */
>- md = cs->uncertainty_margin + watchdog->uncertainty_margin;
>+ if (coarse) {
>+ md = 62500 * NSEC_PER_USEC;
>+ cs->flags &= ~CLOCK_SOURCE_WATCHDOG;
>+ pr_warn("timekeeping watchdog on CPU%d: %s coarse-grained %lu.%03lu ms clock-skew check followed by re-initialization\n", smp_processor_id(), watchdog->name, md / NSEC_PER_MSEC, md % NSEC_PER_MSEC / NSEC_PER_USEC);

... this message on CPU5 doesn't show up in the kernel logs below.
Do you think it is a bug? If so, any idea how to resolve it?

[ 498.571086] clocksource: timekeeping watchdog on CPU1: hpet read-back delay of 432490ns, attempt 4, coarse-grained skew check followed by re-initialization
[ 498.572867] clocksource: timekeeping watchdog on CPU1: hpet coarse-grained 62.500 ms clock-skew check followed by re-initialization
[ 504.071959] clocksource: timekeeping watchdog on CPU4: hpet read-back delay of 1679880ns, attempt 4, coarse-grained skew check followed by re-initialization
[ 504.073817] clocksource: timekeeping watchdog on CPU4: hpet coarse-grained 62.500 ms clock-skew check followed by re-initialization
[ 504.568821] clocksource: timekeeping watchdog on CPU5: hpet read-back delay of 554880ns, attempt 4, coarse-grained skew check followed by re-initialization
[ 505.067666] clocksource: timekeeping watchdog on CPU6: hpet retried 3 times before success
[ 505.068593] clocksource: timekeeping watchdog on CPU6: Marking clocksource 'tsc' as unstable because the skew is too large:
[ 505.069596] clocksource: 'hpet' wd_nsec: 499376790 wd_now: be2f200d wd_last: bb3522fe mask: ffffffff
[ 505.071131] clocksource: 'tsc' cs_nsec: 498867307 cs_now: 103895c060a cs_last: 1034aea96ea mask: ffffffffffffffff
[ 505.072994] clocksource: 'tsc' is current clocksource.
[ 505.074748] tsc: Marking TSC unstable due to clocksource watchdog

Thanks
-Chao

>+ } else {
>+ md = cs->uncertainty_margin + watchdog->uncertainty_margin;
>+ }
> if (abs(cs_nsec - wd_nsec) > md) {
> pr_warn("timekeeping watchdog on CPU%d: Marking clocksource '%s' as unstable because the skew is too large:\n",
> smp_processor_id(), cs->name);

2021-08-02 17:04:38

by Paul E. McKenney

[permalink] [raw]
Subject: Re: [clocksource] 8901ecc231: stress-ng.lockbus.ops_per_sec -9.5% regression

On Mon, Aug 02, 2021 at 02:20:09PM +0800, Chao Gao wrote:
> [snip]
> >commit 48ebcfbfd877f5d9cddcc03c91352a8ca7b190af
> >Author: Paul E. McKenney <[email protected]>
> >Date: Thu May 27 11:03:28 2021 -0700
> >
> > clocksource: Forgive repeated long-latency watchdog clocksource reads
> >
> > Currently, the clocksource watchdog reacts to repeated long-latency
> > clocksource reads by marking that clocksource unstable on the theory that
> > these long-latency reads are a sign of a serious problem. And this theory
> > does in fact have real-world support in the form of firmware issues [1].
> >
> > However, it is also possible to trigger this using stress-ng on what
> > the stress-ng man page terms "poorly designed hardware" [2]. And it
> > is not necessarily a bad thing for the kernel to diagnose cases where
> > high-stress workloads are being run on hardware that is not designed
> > for this sort of use.
> >
> > Nevertheless, it is quite possible that real-world use will result in
> > some situation requiring that high-stress workloads run on hardware
> > not designed to accommodate them, and also requiring that the kernel
> > refrain from marking clocksources unstable.
> >
> > Therefore, provide an out-of-tree patch that reacts to this situation
> > by leaving the clocksource alone, but using the old 62.5-millisecond
> > skew-detection threshold in response persistent long-latency reads.
> > In addition, the offending clocksource is marked for re-initialization
> > in this case, which both restarts that clocksource with a clean bill of
> > health and avoids false-positive skew reports on later watchdog checks.
>
> Hi Paul,
>
> Sorry to dig out this old thread.

Not a problem, especially given that this is still an experimental patch
(marked with "EXP" in -rcu). So one remaining question is "what is this
patch really supposed to do, if anything?"

> I am testing with this patch in a VM, but I find that sometimes the
> re-initialization after a coarse-grained skew check doesn't happen as
> expected because ...
>
> >
> > Link: https://lore.kernel.org/lkml/20210513155515.GB23902@xsang-OptiPlex-9020/ # [1]
> > Link: https://lore.kernel.org/lkml/20210521083322.GG25531@xsang-OptiPlex-9020/ # [2]
> > Link: https://lore.kernel.org/lkml/20210521084405.GH25531@xsang-OptiPlex-9020/
> > Link: https://lore.kernel.org/lkml/20210511233403.GA2896757@paulmck-ThinkPad-P17-Gen-1/
> > Signed-off-by: Paul E. McKenney <[email protected]>
> >
> >diff --git a/kernel/time/clocksource-wdtest.c b/kernel/time/clocksource-wdtest.c
> >index 01df12395c0e..b72a969f7b93 100644
> >--- a/kernel/time/clocksource-wdtest.c
> >+++ b/kernel/time/clocksource-wdtest.c
> >@@ -146,13 +146,12 @@ static int wdtest_func(void *arg)
> > else if (i <= max_cswd_read_retries)
> > s = ", expect message";
> > else
> >- s = ", expect clock skew";
> >+ s = ", expect coarse-grained clock skew check and re-initialization";
> > pr_info("--- Watchdog with %dx error injection, %lu retries%s.\n", i, max_cswd_read_retries, s);
> > WRITE_ONCE(wdtest_ktime_read_ndelays, i);
> > schedule_timeout_uninterruptible(2 * HZ);
> > WARN_ON_ONCE(READ_ONCE(wdtest_ktime_read_ndelays));
> >- WARN_ON_ONCE((i <= max_cswd_read_retries) !=
> >- !(clocksource_wdtest_ktime.flags & CLOCK_SOURCE_UNSTABLE));
> >+ WARN_ON_ONCE(clocksource_wdtest_ktime.flags & CLOCK_SOURCE_UNSTABLE);
> > wdtest_ktime_clocksource_reset();
> > }
> >
> >diff --git a/kernel/time/clocksource.c b/kernel/time/clocksource.c
> >index 4485635b69f5..6c0820779bd3 100644
> >--- a/kernel/time/clocksource.c
> >+++ b/kernel/time/clocksource.c
> >@@ -225,13 +225,13 @@ static bool cs_watchdog_read(struct clocksource *cs, u64 *csnow, u64 *wdnow)
> > pr_warn("timekeeping watchdog on CPU%d: %s retried %d times before success\n",
> > smp_processor_id(), watchdog->name, nretries);
> > }
> >- return true;
> >+ return false;
> > }
> > }
> >
> >- pr_warn("timekeeping watchdog on CPU%d: %s read-back delay of %lldns, attempt %d, marking unstable\n",
> >+ pr_warn("timekeeping watchdog on CPU%d: %s read-back delay of %lldns, attempt %d, coarse-grained skew check followed by re-initialization\n",
> > smp_processor_id(), watchdog->name, wd_delay, nretries);
> >- return false;
> >+ return true;
> > }
> >
> > static u64 csnow_mid;
> >@@ -355,6 +355,7 @@ static void clocksource_watchdog(struct timer_list *unused)
> > int next_cpu, reset_pending;
> > int64_t wd_nsec, cs_nsec;
> > struct clocksource *cs;
> >+ bool coarse;
> > u32 md;
> >
> > spin_lock(&watchdog_lock);
> >@@ -372,11 +373,7 @@ static void clocksource_watchdog(struct timer_list *unused)
> > continue;
> > }
> >
> >- if (!cs_watchdog_read(cs, &csnow, &wdnow)) {
> >- /* Clock readout unreliable, so give it up. */
> >- __clocksource_unstable(cs);
> >- continue;
> >- }
> >+ coarse = cs_watchdog_read(cs, &csnow, &wdnow);
> >
> > /* Clocksource initialized ? */
> > if (!(cs->flags & CLOCK_SOURCE_WATCHDOG) ||
> >@@ -402,7 +399,13 @@ static void clocksource_watchdog(struct timer_list *unused)
> > continue;
> >
> > /* Check the deviation from the watchdog clocksource. */
> >- md = cs->uncertainty_margin + watchdog->uncertainty_margin;
> >+ if (coarse) {
> >+ md = 62500 * NSEC_PER_USEC;
> >+ cs->flags &= ~CLOCK_SOURCE_WATCHDOG;
> >+ pr_warn("timekeeping watchdog on CPU%d: %s coarse-grained %lu.%03lu ms clock-skew check followed by re-initialization\n", smp_processor_id(), watchdog->name, md / NSEC_PER_MSEC, md % NSEC_PER_MSEC / NSEC_PER_USEC);
>
> ... this message on CPU5 doesn't show up in the kernel logs below.
> Do you think it is a bug? If so, any idea how to resolve it?
>
> [ 498.571086] clocksource: timekeeping watchdog on CPU1: hpet read-back delay of 432490ns, attempt 4, coarse-grained skew check followed by re-initialization
> [ 498.572867] clocksource: timekeeping watchdog on CPU1: hpet coarse-grained 62.500 ms clock-skew check followed by re-initialization
> [ 504.071959] clocksource: timekeeping watchdog on CPU4: hpet read-back delay of 1679880ns, attempt 4, coarse-grained skew check followed by re-initialization
> [ 504.073817] clocksource: timekeeping watchdog on CPU4: hpet coarse-grained 62.500 ms clock-skew check followed by re-initialization
> [ 504.568821] clocksource: timekeeping watchdog on CPU5: hpet read-back delay of 554880ns, attempt 4, coarse-grained skew check followed by re-initialization

Up to this point, the clocksource passed the coarse-grained checks.
So at the very least, the "followed by re-initialization" is misleading.
I will change this message.

And yes, I would have expected the additional "62.500 ms clock-skew check"
message from CPU5, like we see from CPU1 and CPU4 above. However, this
message will be omitted if there is a watchdog reset pending or if the
clocksource has not yet been initialized. Which could well have happened
in this case.

> [ 505.067666] clocksource: timekeeping watchdog on CPU6: hpet retried 3 times before success
> [ 505.068593] clocksource: timekeeping watchdog on CPU6: Marking clocksource 'tsc' as unstable because the skew is too large:
> [ 505.069596] clocksource: 'hpet' wd_nsec: 499376790 wd_now: be2f200d wd_last: bb3522fe mask: ffffffff
> [ 505.071131] clocksource: 'tsc' cs_nsec: 498867307 cs_now: 103895c060a cs_last: 1034aea96ea mask: ffffffffffffffff
> [ 505.072994] clocksource: 'tsc' is current clocksource.
> [ 505.074748] tsc: Marking TSC unstable due to clocksource watchdog

And here the clocksource failed the coarse-grained check and marked
the clocksource as unstable. Perhaps because the previous read
forced a coarse-grained check. Except that this should have forced
a reinitialization. Ah, it looks like I need to suppress setting
CLOCK_SOURCE_WATCHDOG if coarse-grained checks have been enabled.
That could cause false-positive failure for the next check, after all.

And perhaps make cs_watchdog_read() modify its print if there is
a watchdog reset pending or if the current clocksource has the
CLOCK_SOURCE_WATCHDOG flag cleared.

Perhaps as shown in the additional patch below, to be folded into the
original?

Thanx, Paul

------------------------------------------------------------------------

diff --git a/kernel/time/clocksource.c b/kernel/time/clocksource.c
index cfa992250c388..62da2485fd574 100644
--- a/kernel/time/clocksource.c
+++ b/kernel/time/clocksource.c
@@ -230,8 +230,13 @@ static bool cs_watchdog_read(struct clocksource *cs, u64 *csnow, u64 *wdnow)
}
}

- pr_warn("timekeeping watchdog on CPU%d: %s read-back delay of %lldns, attempt %d, coarse-grained skew check followed by re-initialization\n",
- smp_processor_id(), watchdog->name, wd_delay, nretries);
+ if ((cs->flags & CLOCK_SOURCE_WATCHDOG) && !atomic_read(&watchdog_reset_pending)) {
+ pr_warn("timekeeping watchdog on CPU%d: %s read-back delay of %lldns, attempt %d, coarse-grained skew check followed by re-initialization\n",
+ smp_processor_id(), watchdog->name, wd_delay, nretries);
+ } else {
+ pr_warn("timekeeping watchdog on CPU%d: %s read-back delay of %lldns, attempt %d, awaiting re-initialization\n",
+ smp_processor_id(), watchdog->name, wd_delay, nretries);
+ }
return true;
}

@@ -379,7 +384,8 @@ static void clocksource_watchdog(struct timer_list *unused)
/* Clocksource initialized ? */
if (!(cs->flags & CLOCK_SOURCE_WATCHDOG) ||
atomic_read(&watchdog_reset_pending)) {
- cs->flags |= CLOCK_SOURCE_WATCHDOG;
+ if (!coarse)
+ cs->flags |= CLOCK_SOURCE_WATCHDOG;
cs->wd_last = wdnow;
cs->cs_last = csnow;
continue;

2021-08-03 08:51:30

by Chao Gao

[permalink] [raw]
Subject: Re: [clocksource] 8901ecc231: stress-ng.lockbus.ops_per_sec -9.5% regression

On Mon, Aug 02, 2021 at 10:02:57AM -0700, Paul E. McKenney wrote:
>On Mon, Aug 02, 2021 at 02:20:09PM +0800, Chao Gao wrote:
>> [snip]
>> >commit 48ebcfbfd877f5d9cddcc03c91352a8ca7b190af
>> >Author: Paul E. McKenney <[email protected]>
>> >Date: Thu May 27 11:03:28 2021 -0700
>> >
>> > clocksource: Forgive repeated long-latency watchdog clocksource reads
>> >
>> > Currently, the clocksource watchdog reacts to repeated long-latency
>> > clocksource reads by marking that clocksource unstable on the theory that
>> > these long-latency reads are a sign of a serious problem. And this theory
>> > does in fact have real-world support in the form of firmware issues [1].
>> >
>> > However, it is also possible to trigger this using stress-ng on what
>> > the stress-ng man page terms "poorly designed hardware" [2]. And it
>> > is not necessarily a bad thing for the kernel to diagnose cases where
>> > high-stress workloads are being run on hardware that is not designed
>> > for this sort of use.
>> >
>> > Nevertheless, it is quite possible that real-world use will result in
>> > some situation requiring that high-stress workloads run on hardware
>> > not designed to accommodate them, and also requiring that the kernel
>> > refrain from marking clocksources unstable.
>> >
>> > Therefore, provide an out-of-tree patch that reacts to this situation
>> > by leaving the clocksource alone, but using the old 62.5-millisecond
>> > skew-detection threshold in response persistent long-latency reads.
>> > In addition, the offending clocksource is marked for re-initialization
>> > in this case, which both restarts that clocksource with a clean bill of
>> > health and avoids false-positive skew reports on later watchdog checks.
>>
>> Hi Paul,
>>
>> Sorry to dig out this old thread.
>
>Not a problem, especially given that this is still an experimental patch
>(marked with "EXP" in -rcu). So one remaining question is "what is this
>patch really supposed to do, if anything?".

We are testing with TDX [1] and analyzing why the kernel in a TD, or Trust
Domain, sometimes spots a large TSC skew. We have inspected the TSC
hardware/ucode/TDX module to rule out a hardware issue, and also ported
tsc_sync.c to a userspace tool so that it can constantly check whether the
TSC stays synchronized while some workload is running. In the end, we
believe that the large TSC skew spotted by the TD kernel is a false
positive.

Your patches (the ones already merged) have improved the clocksource
watchdog a lot and reduced false positives. But due to the nature of TDX,
switching between the TD and the host takes more time. The time window
between the two reads of the watchdog clocksource in cs_watchdog_read()
therefore increases, and so does the probability of those two reads being
interrupted by whatever is running on the host. Then, sometimes, especially
when there are heavy workloads in both the host and the TD, the maximum
number of retries in cs_watchdog_read() is exceeded and the TSC is marked
unstable.

Then we applied this out-of-tree patch, and it helps to further reduce
false positives. But the TD kernel still observes TSC skew in some cases.
After a close look at the kernel logs, we found a pattern in those cases:
an expected re-initialization somehow doesn't happen. That's why we are
raising this issue and asking for your advice.

[1]: https://software.intel.com/content/www/us/en/develop/articles/intel-trust-domain-extensions.html
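
As a back-of-the-envelope illustration (this is only a toy userspace model,
not the kernel code, and the probabilities are made up): if each attempt in
cs_watchdog_read() is disturbed with some independent probability p, the
check degrades only when every one of the (retries + 1) attempts is
disturbed, so the chance of that shrinks geometrically as the retry count
grows:

#include <math.h>
#include <stdio.h>

/*
 * Toy model only: assume each of the (retries + 1) read attempts in
 * cs_watchdog_read() is independently disturbed with probability p.
 * The whole check degrades only if every attempt is disturbed.
 */
int main(void)
{
	const double p[] = { 0.01, 0.10, 0.30 };   /* per-attempt disturbance probability */
	const int retries[] = { 3, 5, 10 };        /* clocksource.max_cswd_read_retries */

	for (unsigned int i = 0; i < sizeof(p) / sizeof(p[0]); i++)
		for (unsigned int j = 0; j < sizeof(retries) / sizeof(retries[0]); j++)
			printf("p=%.2f retries=%2d -> all attempts disturbed: %.2e\n",
			       p[i], retries[j], pow(p[i], retries[j] + 1));
	return 0;
}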

>And here the clocksource failed the coarse-grained check and marked
>the clocksource as unstable. Perhaps because the previous read
>forced a coarse-grained check. Except that this should have forced
>a reinitialization. Ah, it looks like I need to suppress setting
>CLOCK_SOURCE_WATCHDOG if coarse-grained checks have been enabled.
>That could cause false-positive failure for the next check, after all.
>
>And perhaps make cs_watchdog_read() modify its print if there is
>a watchdog reset pending or if the current clocksource has the
>CLOCK_SOURCE_WATCHDOG flag cleared.
>
>Perhaps as shown in the additional patch below, to be folded into the
>original?

Thanks. Will test with below patch applied.

Thanks
Chao
>
> Thanx, Paul
>
>------------------------------------------------------------------------
>
>diff --git a/kernel/time/clocksource.c b/kernel/time/clocksource.c
>index cfa992250c388..62da2485fd574 100644
>--- a/kernel/time/clocksource.c
>+++ b/kernel/time/clocksource.c
>@@ -230,8 +230,13 @@ static bool cs_watchdog_read(struct clocksource *cs, u64 *csnow, u64 *wdnow)
> }
> }
>
>- pr_warn("timekeeping watchdog on CPU%d: %s read-back delay of %lldns, attempt %d, coarse-grained skew check followed by re-initialization\n",
>- smp_processor_id(), watchdog->name, wd_delay, nretries);
>+ if ((cs->flags & CLOCK_SOURCE_WATCHDOG) && !atomic_read(&watchdog_reset_pending)) {
>+ pr_warn("timekeeping watchdog on CPU%d: %s read-back delay of %lldns, attempt %d, coarse-grained skew check followed by re-initialization\n",
>+ smp_processor_id(), watchdog->name, wd_delay, nretries);
>+ } else {
>+ pr_warn("timekeeping watchdog on CPU%d: %s read-back delay of %lldns, attempt %d, awaiting re-initialization\n",
>+ smp_processor_id(), watchdog->name, wd_delay, nretries);
>+ }
> return true;
> }
>
>@@ -379,7 +384,8 @@ static void clocksource_watchdog(struct timer_list *unused)
> /* Clocksource initialized ? */
> if (!(cs->flags & CLOCK_SOURCE_WATCHDOG) ||
> atomic_read(&watchdog_reset_pending)) {
>- cs->flags |= CLOCK_SOURCE_WATCHDOG;
>+ if (!coarse)
>+ cs->flags |= CLOCK_SOURCE_WATCHDOG;
> cs->wd_last = wdnow;
> cs->cs_last = csnow;
> continue;

2021-08-03 13:52:46

by Paul E. McKenney

[permalink] [raw]
Subject: Re: [clocksource] 8901ecc231: stress-ng.lockbus.ops_per_sec -9.5% regression

On Tue, Aug 03, 2021 at 04:58:00PM +0800, Chao Gao wrote:
> On Mon, Aug 02, 2021 at 10:02:57AM -0700, Paul E. McKenney wrote:
> >On Mon, Aug 02, 2021 at 02:20:09PM +0800, Chao Gao wrote:
> >> [snip]
> >> >commit 48ebcfbfd877f5d9cddcc03c91352a8ca7b190af
> >> >Author: Paul E. McKenney <[email protected]>
> >> >Date: Thu May 27 11:03:28 2021 -0700
> >> >
> >> > clocksource: Forgive repeated long-latency watchdog clocksource reads
> >> >
> >> > Currently, the clocksource watchdog reacts to repeated long-latency
> >> > clocksource reads by marking that clocksource unstable on the theory that
> >> > these long-latency reads are a sign of a serious problem. And this theory
> >> > does in fact have real-world support in the form of firmware issues [1].
> >> >
> >> > However, it is also possible to trigger this using stress-ng on what
> >> > the stress-ng man page terms "poorly designed hardware" [2]. And it
> >> > is not necessarily a bad thing for the kernel to diagnose cases where
> >> > high-stress workloads are being run on hardware that is not designed
> >> > for this sort of use.
> >> >
> >> > Nevertheless, it is quite possible that real-world use will result in
> >> > some situation requiring that high-stress workloads run on hardware
> >> > not designed to accommodate them, and also requiring that the kernel
> >> > refrain from marking clocksources unstable.
> >> >
> >> > Therefore, provide an out-of-tree patch that reacts to this situation
> >> > by leaving the clocksource alone, but using the old 62.5-millisecond
> >> > skew-detection threshold in response persistent long-latency reads.
> >> > In addition, the offending clocksource is marked for re-initialization
> >> > in this case, which both restarts that clocksource with a clean bill of
> >> > health and avoids false-positive skew reports on later watchdog checks.
> >>
> >> Hi Paul,
> >>
> >> Sorry to dig out this old thread.
> >
> >Not a problem, especially given that this is still an experimental patch
> >(marked with "EXP" in -rcu). So one remaining question is "what is this
> >patch really supposed to do, if anything?".
>
> We are testing with TDX [1] and analyzing why kernel in a TD, or Trust Domain,
> sometimes spots a large TSC skew. We have inspected tsc hardware/ucode/tdx
> module to ensure no hardware issue, and also ported tsc_sync.c to a userspace
> tool such that this tool can help to constantly check if tsc is synchronized
> when some workload is running. Finally, we believe that the large TSC skew
> spotted by TD kernel is a false positive.
>
> Your patches (those are merged) have improved clocksource watchdog a lot to
> reduce false-positives. But due to the nature of TDX, switching between TD
> and host takes more time. Then, the time window between two reads from
> watchdog clocksource in cs_watchdog_read() increases, so does the
> probability of the two reads being interrupted by whatever on host. Then,
> sometimes, especially when there are heavy workloads in both host and TD,
> the maximum number of retries in cs_watchdog_read() is exceeded and tsc is
> marked unstable.
>
> Then we apply this out-of-tree patch, it helps to further reduce
> false-positives. But TD kernel still observes TSC skew in some cases. After
> a close look into kernel logs, we find patterns in those cases: an expected
> re-initialization somehow doesn't happen. That's why we raise this issue
> and ask for your advice.

I am glad that the patch at least helps. ;-)

> [1]: https://software.intel.com/content/www/us/en/develop/articles/intel-trust-domain-extensions.html
>
> >And here the clocksource failed the coarse-grained check and marked
> >the clocksource as unstable. Perhaps because the previous read
> >forced a coarse-grained check. Except that this should have forced
> >a reinitialization. Ah, it looks like I need to suppress setting
> >CLOCK_SOURCE_WATCHDOG if coarse-grained checks have been enabled.
> >That could cause false-positive failure for the next check, after all.
> >
> >And perhaps make cs_watchdog_read() modify its print if there is
> >a watchdog reset pending or if the current clocksource has the
> >CLOCK_SOURCE_WATCHDOG flag cleared.
> >
> >Perhaps as shown in the additional patch below, to be folded into the
> >original?
>
> Thanks. Will test with below patch applied.

If this patch helps, but problems remain, another thing to try is to
increase the clocksource.max_cswd_read_retries kernel boot parameter
above its default value of 3. Maybe to 5 or 10?
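For example, booting with clocksource.max_cswd_read_retries=10 on the
kernel command line would allow up to ten retries per watchdog check
before falling back.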

If this patch does not help, please let me know. In that case, there
are probably more fixes required.

Thanx, Paul

> Thanks
> Chao
> >
> > Thanx, Paul
> >
> >------------------------------------------------------------------------
> >
> >diff --git a/kernel/time/clocksource.c b/kernel/time/clocksource.c
> >index cfa992250c388..62da2485fd574 100644
> >--- a/kernel/time/clocksource.c
> >+++ b/kernel/time/clocksource.c
> >@@ -230,8 +230,13 @@ static bool cs_watchdog_read(struct clocksource *cs, u64 *csnow, u64 *wdnow)
> > }
> > }
> >
> >- pr_warn("timekeeping watchdog on CPU%d: %s read-back delay of %lldns, attempt %d, coarse-grained skew check followed by re-initialization\n",
> >- smp_processor_id(), watchdog->name, wd_delay, nretries);
> >+ if ((cs->flags & CLOCK_SOURCE_WATCHDOG) && !atomic_read(&watchdog_reset_pending)) {
> >+ pr_warn("timekeeping watchdog on CPU%d: %s read-back delay of %lldns, attempt %d, coarse-grained skew check followed by re-initialization\n",
> >+ smp_processor_id(), watchdog->name, wd_delay, nretries);
> >+ } else {
> >+ pr_warn("timekeeping watchdog on CPU%d: %s read-back delay of %lldns, attempt %d, awaiting re-initialization\n",
> >+ smp_processor_id(), watchdog->name, wd_delay, nretries);
> >+ }
> > return true;
> > }
> >
> >@@ -379,7 +384,8 @@ static void clocksource_watchdog(struct timer_list *unused)
> > /* Clocksource initialized ? */
> > if (!(cs->flags & CLOCK_SOURCE_WATCHDOG) ||
> > atomic_read(&watchdog_reset_pending)) {
> >- cs->flags |= CLOCK_SOURCE_WATCHDOG;
> >+ if (!coarse)
> >+ cs->flags |= CLOCK_SOURCE_WATCHDOG;
> > cs->wd_last = wdnow;
> > cs->cs_last = csnow;
> > continue;

2021-08-05 04:53:45

by Chao Gao

[permalink] [raw]
Subject: Re: [clocksource] 8901ecc231: stress-ng.lockbus.ops_per_sec -9.5% regression

On Tue, Aug 03, 2021 at 06:48:16AM -0700, Paul E. McKenney wrote:
>On Tue, Aug 03, 2021 at 04:58:00PM +0800, Chao Gao wrote:
>> On Mon, Aug 02, 2021 at 10:02:57AM -0700, Paul E. McKenney wrote:
>> >On Mon, Aug 02, 2021 at 02:20:09PM +0800, Chao Gao wrote:
>> >> [snip]
>> >> >commit 48ebcfbfd877f5d9cddcc03c91352a8ca7b190af
>> >> >Author: Paul E. McKenney <[email protected]>
>> >> >Date: Thu May 27 11:03:28 2021 -0700
>> >> >
>> >> > clocksource: Forgive repeated long-latency watchdog clocksource reads
>> >> >
>> >> > Currently, the clocksource watchdog reacts to repeated long-latency
>> >> > clocksource reads by marking that clocksource unstable on the theory that
>> >> > these long-latency reads are a sign of a serious problem. And this theory
>> >> > does in fact have real-world support in the form of firmware issues [1].
>> >> >
>> >> > However, it is also possible to trigger this using stress-ng on what
>> >> > the stress-ng man page terms "poorly designed hardware" [2]. And it
>> >> > is not necessarily a bad thing for the kernel to diagnose cases where
>> >> > high-stress workloads are being run on hardware that is not designed
>> >> > for this sort of use.
>> >> >
>> >> > Nevertheless, it is quite possible that real-world use will result in
>> >> > some situation requiring that high-stress workloads run on hardware
>> >> > not designed to accommodate them, and also requiring that the kernel
>> >> > refrain from marking clocksources unstable.
>> >> >
>> >> > Therefore, provide an out-of-tree patch that reacts to this situation
>> >> > by leaving the clocksource alone, but using the old 62.5-millisecond
>> >> > skew-detection threshold in response persistent long-latency reads.
>> >> > In addition, the offending clocksource is marked for re-initialization
>> >> > in this case, which both restarts that clocksource with a clean bill of
>> >> > health and avoids false-positive skew reports on later watchdog checks.
>> >>
>> >> Hi Paul,
>> >>
>> >> Sorry to dig out this old thread.
>> >
>> >Not a problem, especially given that this is still an experimental patch
>> >(marked with "EXP" in -rcu). So one remaining question is "what is this
>> >patch really supposed to do, if anything?".
>>
>> We are testing with TDX [1] and analyzing why kernel in a TD, or Trust Domain,
>> sometimes spots a large TSC skew. We have inspected tsc hardware/ucode/tdx
>> module to ensure no hardware issue, and also ported tsc_sync.c to a userspace
>> tool such that this tool can help to constantly check if tsc is synchronized
>> when some workload is running. Finally, we believe that the large TSC skew
>> spotted by TD kernel is a false positive.
>>
>> Your patches (those are merged) have improved clocksource watchdog a lot to
>> reduce false-positives. But due to the nature of TDX, switching between TD
>> and host takes more time. Then, the time window between two reads from
>> watchdog clocksource in cs_watchdog_read() increases, so does the
>> probability of the two reads being interrupted by whatever on host. Then,
>> sometimes, especially when there are heavy workloads in both host and TD,
>> the maximum number of retries in cs_watchdog_read() is exceeded and tsc is
>> marked unstable.
>>
>> Then we apply this out-of-tree patch, it helps to further reduce
>> false-positives. But TD kernel still observes TSC skew in some cases. After
>> a close look into kernel logs, we find patterns in those cases: an expected
>> re-initialization somehow doesn't happen. That's why we raise this issue
>> and ask for your advice.
>
>I am glad that the patch at least helps. ;-)
>
>> [1]: https://software.intel.com/content/www/us/en/develop/articles/intel-trust-domain-extensions.html
>>
>> >And here the clocksource failed the coarse-grained check and marked
>> >the clocksource as unstable. Perhaps because the previous read
>> >forced a coarse-grained check. Except that this should have forced
>> >a reinitialization. Ah, it looks like I need to suppress setting
>> >CLOCK_SOURCE_WATCHDOG if coarse-grained checks have been enabled.
>> >That could cause false-positive failure for the next check, after all.
>> >
>> >And perhaps make cs_watchdog_read() modify its print if there is
>> >a watchdog reset pending or if the current clocksource has the
>> >CLOCK_SOURCE_WATCHDOG flag cleared.
>> >
>> >Perhaps as shown in the additional patch below, to be folded into the
>> >original?
>>
>> Thanks. Will test with below patch applied.
>
>If this patch helps, but problems remain, another thing to try is to
>increase the clocksource.max_cswd_read_retries kernel boot parameter
>above its default value of 3. Maybe to 5 or 10?
>
>If this patch does not help, please let me know. In that case, there
>are probably more fixes required.

This patch works well; no false-positive (marking TSC unstable) in a
10hr stress test.

Thanks
Chao

2021-08-05 05:15:14

by Paul E. McKenney

[permalink] [raw]
Subject: Re: [clocksource] 8901ecc231: stress-ng.lockbus.ops_per_sec -9.5% regression

On Thu, Aug 05, 2021 at 10:16:48AM +0800, Chao Gao wrote:
> On Tue, Aug 03, 2021 at 06:48:16AM -0700, Paul E. McKenney wrote:
> >On Tue, Aug 03, 2021 at 04:58:00PM +0800, Chao Gao wrote:
> >> On Mon, Aug 02, 2021 at 10:02:57AM -0700, Paul E. McKenney wrote:
> >> >On Mon, Aug 02, 2021 at 02:20:09PM +0800, Chao Gao wrote:
> >> >> [snip]
> >> >> >commit 48ebcfbfd877f5d9cddcc03c91352a8ca7b190af
> >> >> >Author: Paul E. McKenney <[email protected]>
> >> >> >Date: Thu May 27 11:03:28 2021 -0700
> >> >> >
> >> >> > clocksource: Forgive repeated long-latency watchdog clocksource reads
> >> >> >
> >> >> > Currently, the clocksource watchdog reacts to repeated long-latency
> >> >> > clocksource reads by marking that clocksource unstable on the theory that
> >> >> > these long-latency reads are a sign of a serious problem. And this theory
> >> >> > does in fact have real-world support in the form of firmware issues [1].
> >> >> >
> >> >> > However, it is also possible to trigger this using stress-ng on what
> >> >> > the stress-ng man page terms "poorly designed hardware" [2]. And it
> >> >> > is not necessarily a bad thing for the kernel to diagnose cases where
> >> >> > high-stress workloads are being run on hardware that is not designed
> >> >> > for this sort of use.
> >> >> >
> >> >> > Nevertheless, it is quite possible that real-world use will result in
> >> >> > some situation requiring that high-stress workloads run on hardware
> >> >> > not designed to accommodate them, and also requiring that the kernel
> >> >> > refrain from marking clocksources unstable.
> >> >> >
> >> >> > Therefore, provide an out-of-tree patch that reacts to this situation
> >> >> > by leaving the clocksource alone, but using the old 62.5-millisecond
> >> >> > skew-detection threshold in response persistent long-latency reads.
> >> >> > In addition, the offending clocksource is marked for re-initialization
> >> >> > in this case, which both restarts that clocksource with a clean bill of
> >> >> > health and avoids false-positive skew reports on later watchdog checks.
> >> >>
> >> >> Hi Paul,
> >> >>
> >> >> Sorry to dig out this old thread.
> >> >
> >> >Not a problem, especially given that this is still an experimental patch
> >> >(marked with "EXP" in -rcu). So one remaining question is "what is this
> >> >patch really supposed to do, if anything?".
> >>
> >> We are testing with TDX [1] and analyzing why kernel in a TD, or Trust Domain,
> >> sometimes spots a large TSC skew. We have inspected tsc hardware/ucode/tdx
> >> module to ensure no hardware issue, and also ported tsc_sync.c to a userspace
> >> tool such that this tool can help to constantly check if tsc is synchronized
> >> when some workload is running. Finally, we believe that the large TSC skew
> >> spotted by TD kernel is a false positive.
> >>
> >> Your patches (those are merged) have improved clocksource watchdog a lot to
> >> reduce false-positives. But due to the nature of TDX, switching between TD
> >> and host takes more time. Then, the time window between two reads from
> >> watchdog clocksource in cs_watchdog_read() increases, so does the
> >> probability of the two reads being interrupted by whatever on host. Then,
> >> sometimes, especially when there are heavy workloads in both host and TD,
> >> the maximum number of retries in cs_watchdog_read() is exceeded and tsc is
> >> marked unstable.
> >>
> >> Then we apply this out-of-tree patch, it helps to further reduce
> >> false-positives. But TD kernel still observes TSC skew in some cases. After
> >> a close look into kernel logs, we find patterns in those cases: an expected
> >> re-initialization somehow doesn't happen. That's why we raise this issue
> >> and ask for your advice.
> >
> >I am glad that the patch at least helps. ;-)
> >
> >> [1]: https://software.intel.com/content/www/us/en/develop/articles/intel-trust-domain-extensions.html
> >>
> >> >And here the clocksource failed the coarse-grained check and was marked
> >> >unstable. Perhaps because the previous read
> >> >forced a coarse-grained check. Except that this should have forced
> >> >a reinitialization. Ah, it looks like I need to suppress setting
> >> >CLOCK_SOURCE_WATCHDOG if coarse-grained checks have been enabled.
> >> >That could cause false-positive failure for the next check, after all.
> >> >
> >> >And perhaps make cs_watchdog_read() modify its print if there is
> >> >a watchdog reset pending or if the current clocksource has the
> >> >CLOCK_SOURCE_WATCHDOG flag cleared.
> >> >
> >> >Perhaps as shown in the additional patch below, to be folded into the
> >> >original?
> >>
> >> Thanks. Will test with below patch applied.
> >
> >If this patch helps, but problems remain, another thing to try is to
> >increase the clocksource.max_cswd_read_retries kernel boot parameter
> >above its default value of 3. Maybe to 5 or 10?
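[For example, assuming the parameter is passed on the kernel command line in the
usual way, that would be an extra boot-line entry such as:

        clocksource.max_cswd_read_retries=10
]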
> >
> >If this patch does not help, please let me know. In that case, there
> >are probably more fixes required.
>
> This patch works well; no false-positive (marking TSC unstable) in a
> 10hr stress test.

Very good, thank you! May I add your Tested-by?

I expect that I will need to modify the patch a bit more to check for
a system where it is -never- able to get a good fine-grained read from
the clock. And it might be that your test run ended up in that state.

My current thought is that if more than (say) 100 consecutive attempts
to read the clocksource get hit with excessive delays, it is time to at
least do a WARN_ON(), and maybe also time to disable the clocksource
due to skew. The reason is that if reading the clocksource -always-
sees excessive delays, perhaps the clock driver or hardware is to blame.

Thoughts?

Thanx, Paul
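[Sketched in userspace terms, the proposal above amounts to something like the
following; the names, and fprintf() standing in for WARN_ON(), are placeholders
for illustration only:

/*
 * Track consecutive delayed (i.e. coarse-grained) watchdog passes and
 * complain once the run gets suspiciously long.
 */
#include <stdio.h>
#include <stdbool.h>

#define MAX_CONSECUTIVE_COARSE  100     /* the "(say) 100" above */

static unsigned int consecutive_coarse;

static void account_watchdog_pass(bool read_was_delayed)
{
        if (!read_was_delayed) {
                consecutive_coarse = 0;         /* a good fine-grained read */
                return;
        }

        if (++consecutive_coarse == MAX_CONSECUTIVE_COARSE) {
                /* The kernel might WARN_ON() and/or disable the clocksource. */
                fprintf(stderr,
                        "%u consecutive delayed reads; suspect the clock driver or hardware\n",
                        consecutive_coarse);
        }
}

int main(void)
{
        /* Simulate 150 delayed reads in a row: the complaint fires once, at 100. */
        for (int i = 0; i < 150; i++)
                account_watchdog_pass(true);
        return 0;
}
]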

2021-08-05 05:17:17

by Andi Kleen

[permalink] [raw]
Subject: Re: [clocksource] 8901ecc231: stress-ng.lockbus.ops_per_sec -9.5% regression


> My current thought is that if more than (say) 100 consecutive attempts
> to read the clocksource get hit with excessive delays, it is time to at
> least do a WARN_ON(), and maybe also time to disable the clocksource
> due to skew. The reason is that if reading the clocksource -always-
> sees excessive delays, perhaps the clock driver or hardware is to blame.
>
> Thoughts?

On TDX this would be fatal because we don't have a usable fallback source
(just jiffies). Better to try as hard as possible.


-Andi

2021-08-05 07:47:44

by Chao Gao

[permalink] [raw]
Subject: Re: [clocksource] 8901ecc231: stress-ng.lockbus.ops_per_sec -9.5% regression

[snip]
>> This patch works well; no false-positive (marking TSC unstable) in a
>> 10hr stress test.
>
>Very good, thank you! May I add your Tested-by?

sure.
Tested-by: Chao Gao <[email protected]>

>
>I expect that I will need to modify the patch a bit more to check for
>a system where it is -never- able to get a good fine-grained read from
>the clock.

Agreed.

>And it might be that your test run ended up in that state.

Not the case, judging from the kernel logs. The coarse-grained check happened 6475
times in 43k seconds (counted by grepping "coarse-grained skew check" in the kernel
logs). So many checks were still fine-grained.

>
>My current thought is that if more than (say) 100 consecutive attempts
>to read the clocksource get hit with excessive delays, it is time to at
>least do a WARN_ON(), and maybe also time to disable the clocksource
>due to skew. The reason is that if reading the clocksource -always-
>sees excessive delays, perhaps the clock driver or hardware is to blame.
>
>Thoughts?

It makes sense to me.

Thanks
Chao

2021-08-05 15:35:55

by Paul E. McKenney

[permalink] [raw]
Subject: Re: [clocksource] 8901ecc231: stress-ng.lockbus.ops_per_sec -9.5% regression

On Wed, Aug 04, 2021 at 09:34:13PM -0700, Andi Kleen wrote:
>
> > My current thought is that if more than (say) 100 consecutive attempts
> > to read the clocksource get hit with excessive delays, it is time to at
> > least do a WARN_ON(), and maybe also time to disable the clocksource
> > due to skew. The reason is that if reading the clocksource -always-
> > sees excessive delays, perhaps the clock driver or hardware is to blame.
> >
> > Thoughts?
>
> On TDX this would be fatal because we don't have a usable fallback source
>
> (just jiffies). Better try as hard as possible.

At some point, won't the system's suffering in silence become quite the
disservice to its users?

One alternative would be to give a warning splat, but avoid reporting
skew. Unless there is the traditional 62.5ms of skew, of course.

Thanx, Paul

2021-08-05 22:12:30

by Paul E. McKenney

[permalink] [raw]
Subject: Re: [clocksource] 8901ecc231: stress-ng.lockbus.ops_per_sec -9.5% regression

On Thu, Aug 05, 2021 at 01:39:40PM +0800, Chao Gao wrote:
> [snip]
> >> This patch works well; no false-positive (marking TSC unstable) in a
> >> 10hr stress test.
> >
> >Very good, thank you! May I add your Tested-by?
>
> sure.
> Tested-by: Chao Gao <[email protected]>

Very good, thank you! I will apply this on the next rebase.

> >I expect that I will need to modify the patch a bit more to check for
> >a system where it is -never- able to get a good fine-grained read from
> >the clock.
>
> Agreed.
>
> >And it might be that your test run ended up in that state.
>
> Not that case judging from kernel logs. Coarse-grained check happened 6475
> times in 43k seconds (by grep "coarse-grained skew check" in kernel logs).
> So, still many checks were fine-grained.

Whew! ;-)

So about once per 13 clocksource watchdog checks.
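[That ratio follows if one assumes the watchdog's default half-second check
interval: 43,000 s is roughly 86,000 checks, and 86,000 / 6,475 is about 13.3.]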

To Andi's point, do you have enough information in your console log to
work out the longest run of coarse-grained clocksource checks?

> >My current thought is that if more than (say) 100 consecutive attempts
> >to read the clocksource get hit with excessive delays, it is time to at
> >least do a WARN_ON(), and maybe also time to disable the clocksource
> >due to skew. The reason is that if reading the clocksource -always-
> >sees excessive delays, perhaps the clock driver or hardware is to blame.
> >
> >Thoughts?
>
> It makes sense to me.

Sounds good!

Thanx, Paul

2021-08-06 07:42:32

by Paul E. McKenney

[permalink] [raw]
Subject: Re: [clocksource] 8901ecc231: stress-ng.lockbus.ops_per_sec -9.5% regression

On Fri, Aug 06, 2021 at 10:10:00AM +0800, Chao Gao wrote:
> On Thu, Aug 05, 2021 at 08:37:27AM -0700, Paul E. McKenney wrote:
> >On Thu, Aug 05, 2021 at 01:39:40PM +0800, Chao Gao wrote:
> >> [snip]
> >> >> This patch works well; no false-positive (marking TSC unstable) in a
> >> >> 10hr stress test.
> >> >
> >> >Very good, thank you! May I add your Tested-by?
> >>
> >> sure.
> >> Tested-by: Chao Gao <[email protected]>
> >
> >Very good, thank you! I will apply this on the next rebase.
> >
> >> >I expect that I will need to modify the patch a bit more to check for
> >> >a system where it is -never- able to get a good fine-grained read from
> >> >the clock.
> >>
> >> Agreed.
> >>
> >> >And it might be that your test run ended up in that state.
> >>
> >> Not that case judging from kernel logs. Coarse-grained check happened 6475
> >> times in 43k seconds (by grep "coarse-grained skew check" in kernel logs).
> >> So, still many checks were fine-grained.
> >
> >Whew! ;-)
> >
> >So about once per 13 clocksource watchdog checks.
> >
> >To Andi's point, do you have enough information in your console log to
> >work out the longest run of coarse-grained clocksource checks?
>
> Yes: 5 consecutive coarse-grained clocksource checks. Note that,
> considering the reinitialization after a coarse-grained check, in my
> calculation two coarse-grained checks are considered consecutive if
> they happen within 1s (+/- 0.3s).

Very good, thank you!

So it seems eminently reasonable to have the clocksource watchdog complain
bitterly for more than (say) 100 consecutive coarse-grained checks.

I am thinking in terms of a separate patch for this purpose.

Thoughts?

Thanx, Paul

2021-08-06 11:02:46

by Chao Gao

[permalink] [raw]
Subject: Re: [clocksource] 8901ecc231: stress-ng.lockbus.ops_per_sec -9.5% regression

On Thu, Aug 05, 2021 at 08:37:27AM -0700, Paul E. McKenney wrote:
>On Thu, Aug 05, 2021 at 01:39:40PM +0800, Chao Gao wrote:
>> [snip]
>> >> This patch works well; no false-positive (marking TSC unstable) in a
>> >> 10hr stress test.
>> >
>> >Very good, thank you! May I add your Tested-by?
>>
>> sure.
>> Tested-by: Chao Gao <[email protected]>
>
>Very good, thank you! I will apply this on the next rebase.
>
>> >I expect that I will need to modify the patch a bit more to check for
>> >a system where it is -never- able to get a good fine-grained read from
>> >the clock.
>>
>> Agreed.
>>
>> >And it might be that your test run ended up in that state.
>>
>> Not that case judging from kernel logs. Coarse-grained check happened 6475
>> times in 43k seconds (by grep "coarse-grained skew check" in kernel logs).
>> So, still many checks were fine-grained.
>
>Whew! ;-)
>
>So about once per 13 clocksource watchdog checks.
>
>To Andi's point, do you have enough information in your console log to
>work out the longest run of coarse-grained clocksource checks?

Yes: 5 consecutive coarse-grained clocksource checks. Note that,
considering the reinitialization after a coarse-grained check, in my
calculation two coarse-grained checks are considered consecutive if
they happen within 1s (+/- 0.3s).

Thanks
Chao
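
[For reference, the run-length rule described in the last message (two coarse-grained
checks count as consecutive if they land about 1 s apart, +/- 0.3 s) can be evaluated
with a small standalone helper; it is not part of the original mails and assumes the
dmesg timestamps of the matching "coarse-grained skew check" lines have already been
extracted, one seconds value per line on stdin:

/*
 * Read one timestamp in seconds per line and report the longest run of
 * checks that arrive roughly 1 s apart (1 s +/- 0.3 s).
 */
#include <stdio.h>

int main(void)
{
        double t, prev = 0.0;
        int have_prev = 0, run = 1, longest = 0;

        while (scanf("%lf", &t) == 1) {
                double gap = t - prev;

                if (have_prev && gap >= 0.7 && gap <= 1.3)
                        run++;          /* consecutive coarse-grained check */
                else
                        run = 1;        /* first sample, or the run was broken */
                if (run > longest)
                        longest = run;
                prev = t;
                have_prev = 1;
        }

        printf("longest run of consecutive coarse-grained checks: %d\n", longest);
        return 0;
}
]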