Received: by 10.223.164.202 with SMTP id h10csp481196wrb; Wed, 15 Nov 2017 03:06:48 -0800 (PST) X-Google-Smtp-Source: AGs4zMaPdHVCJ4Qm0I9UwGu9Y8PLXAqksnKLheCJOHi8dxszEpOpstT3wfpyu3+bdRSinXr1IGmy X-Received: by 10.84.169.1 with SMTP id g1mr15894519plb.349.1510744008469; Wed, 15 Nov 2017 03:06:48 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1510744008; cv=none; d=google.com; s=arc-20160816; b=lC1l16VjGlvXqljDPZIhGcea43c6nS4+0IHl1JRglaLH1TqkHDo20BaCY/4dHWq+J0 WN0G4CpyUGvUZcEt+XRhZf0PKRyPRLpajFPT+gOmF2YGxHKhE4WPpu8YrKrPRaGftF+p itKFZb3Y/qHt/0w/+uC4H4cSN7kQgS5ZKLXqBmM66P04F25fz93rhvfPtirjtjXmcZWE iwiDAaexNC/Rmx8x1FwFMwUKYtrZX+saH6AE1gAfh3eCTaEi0MwDtRQorOmDYxAlERxY BiSTHPFtLSVuL/NQj91yci5YBac57tu1O7Aql9x52KBEq2dXRrrYohpiQ8tiajdrW2Wd 5XPw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=pUrkvEc4jh2l/jqONNWqs8tUQ+sUt2o3OY39jFsenlc=; b=KJDt4mhp9xs2FahHNLVA/1PazLuzUMFpW7O1TFNHFlaWO2xk4ljOefZzbsjJnHfTeD 6DjR1XSC2ApWUYH5ZCnB56lluDf0KOpi2KdGEpGQsCNuWfmqJL7n/OB4YHGtKXKp6yZR LGZJNF1pl1STneBsRPJ2slmKR7DvWuyhlT6G1VJiOSIHdaP1gqH6hj/mXlJEQKSy7fE/ wpQRr1wPcGeQWLmZ9dFz5US/H96gKn99/SYvER//or6E61wCLbNxTIf6pCbWAxwwKTtj vN3TU8l6ul0zzzNoZksbhxC+pToeKht5dHxi8gQCGW8pZahaPLchvXNvyxz9LqrtIQUq HiPg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ucloud.cn Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id v187si13938814pfv.227.2017.11.15.03.06.34; Wed, 15 Nov 2017 03:06:48 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ucloud.cn Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932219AbdKOJeh (ORCPT + 88 others); Wed, 15 Nov 2017 04:34:37 -0500 Received: from m97139.mail.qiye.163.com ([220.181.97.139]:54023 "EHLO m97139.mail.qiye.163.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1756000AbdKOJeb (ORCPT ); Wed, 15 Nov 2017 04:34:31 -0500 Received: from localhost (unknown [106.38.57.250]) by smtp10 (Coremail) with SMTP id vOCowAA3zHH7CQxacG0ZAA--.2S3; Wed, 15 Nov 2017 17:33:47 +0800 (CST) Date: Wed, 15 Nov 2017 17:33:47 +0800 From: WANG Chao To: "Rafael J. Wysocki" Cc: Linux Kernel Mailing List , Linus Torvalds , Thomas Gleixner , Ingo Molnar , "H. Peter Anvin" , Vikas Shivappa , Kate Stewart , Len Brown , Greg Kroah-Hartman , Philippe Ombredanne , Mathias Krause , the arch/x86 maintainers , Linux PM , "Rafael J. Wysocki" Subject: Re: [PATCH] x86 / CPU: Always show current CPU frequency in /proc/cpuinfo Message-ID: <20171115093347.GA28750@WANG-Chaos-MacBook-Pro.local> References: <20171109103814.70688-1-chao.wang@ucloud.cn> <2067351.8xeh6jAbVr@aspire.rjw.lan> <50366815.86mgx8Isne@aspire.rjw.lan> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <50366815.86mgx8Isne@aspire.rjw.lan> User-Agent: Mutt/1.9.1 (2017-09-22) X-CM-TRANSID: vOCowAA3zHH7CQxacG0ZAA--.2S3 X-Coremail-Antispam: 1Uf129KBjvJXoW3JF13KFWxAr4xJw18try5urg_yoWfGr4rpF ZIkryxtr4rXryDAw15AF48Ww15Zrs7Xr4jq34fKFWrAw1Yvr18X3WDKry5Ar17CrWDC3Wx trW5JFZIgrykJrDanT9S1TB71UUUUUUqnTZGkaVYY2UrUUUUjbIjqfuFe4nvWSU5nxnvy2 9KBjDUYxBIdaVFxhVjvjDU0xZFpf9x0Jb_6pPUUUUU= X-Originating-IP: [106.38.57.250] X-CM-SenderInfo: pfkd0hpzdqwq5xfo03fgof0/1tbiaxuRVFlZtS4akAAAsk Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 11/15/17 at 02:13P, Rafael J. Wysocki wrote: > From: Rafael J. Wysocki > > After commit 890da9cf0983 (Revert "x86: do not use cpufreq_quick_get() > for /proc/cpuinfo "cpu MHz"") the "cpu MHz" number in /proc/cpuinfo > on x86 can be either the nominal CPU frequency (which is constant) > or the frequency most recently requested by a scaling governor in > cpufreq, depending on the cpufreq configuration. That is somewhat > inconsistent and is different from what it was before 4.13, so in > order to restore the previous behavior, make it report the current > CPU frequency like the scaling_cur_freq sysfs file in cpufreq. > > To that end, modify the /proc/cpuinfo implementation on x86 to use > aperfmperf_snapshot_khz() to snapshot the APERF and MPERF feedback > registers, if available, and use their values to compute the CPU > frequency to be reported as "cpu MHz". > > However, do that carefully enough to avoid accumulating delays that > lead to unacceptable access times for /proc/cpuinfo on systems with > many CPUs. Run aperfmperf_snapshot_khz() once on all CPUs > asynchronously at the /proc/cpuinfo open time, add a single delay > upfront (if necessary) at that point and simply compute the current > frequency while running show_cpuinfo() for each individual CPU. Hi, Rafael I tested your patch. It's much faster. But from what I got, calling aperfmperf_snapshot_khz() asynchronously with 10ms sleep takes much longer than calling aperfmperf_snapshot_khz() synchronously. Here's my result on 64 CPUs: - async aperfmperf_snapshot_khz() w/ 10ms sleep: # time cat /proc/cpuinfo > /dev/null real 0m0.014s user 0m0.000s sys 0m0.002s - sync aperfmperf_snapshot_khz() w/o any sleep: # time cat /proc/cpuinfo > /dev/null real 0m0.002s user 0m0.000s sys 0m0.002s Thanks, WANG Chao > > Also, to avoid slowing down /proc/cpuinfo accesses too much, reduce > the default delay between consecutive APERF and MPERF reads to 10 ms, > which should be sufficient to get large enough numbers for the > frequency computation in all cases. > > Fixes: 890da9cf0983 (Revert "x86: do not use cpufreq_quick_get() for /proc/cpuinfo "cpu MHz"") > Signed-off-by: Rafael J. Wysocki > --- > > Resent with a changelog & tags. > > I'm going to route it via the linux-pm tree. > > --- > arch/x86/kernel/cpu/aperfmperf.c | 74 +++++++++++++++++++++++++++------------ > arch/x86/kernel/cpu/cpu.h | 3 + > arch/x86/kernel/cpu/proc.c | 6 ++- > fs/proc/cpuinfo.c | 6 +++ > include/linux/cpufreq.h | 1 > 5 files changed, 67 insertions(+), 23 deletions(-) > > Index: linux-pm/arch/x86/kernel/cpu/proc.c > =================================================================== > --- linux-pm.orig/arch/x86/kernel/cpu/proc.c > +++ linux-pm/arch/x86/kernel/cpu/proc.c > @@ -5,6 +5,8 @@ > #include > #include > > +#include "cpu.h" > + > /* > * Get CPU information for use by the procfs. > */ > @@ -78,9 +80,11 @@ static int show_cpuinfo(struct seq_file > seq_printf(m, "microcode\t: 0x%x\n", c->microcode); > > if (cpu_has(c, X86_FEATURE_TSC)) { > - unsigned int freq = cpufreq_quick_get(cpu); > + unsigned int freq = aperfmperf_get_khz(cpu); > > if (!freq) > + freq = cpufreq_quick_get(cpu); > + if (!freq) > freq = cpu_khz; > seq_printf(m, "cpu MHz\t\t: %u.%03u\n", > freq / 1000, (freq % 1000)); > Index: linux-pm/arch/x86/kernel/cpu/aperfmperf.c > =================================================================== > --- linux-pm.orig/arch/x86/kernel/cpu/aperfmperf.c > +++ linux-pm/arch/x86/kernel/cpu/aperfmperf.c > @@ -14,6 +14,8 @@ > #include > #include > > +#include "cpu.h" > + > struct aperfmperf_sample { > unsigned int khz; > ktime_t time; > @@ -24,7 +26,7 @@ struct aperfmperf_sample { > static DEFINE_PER_CPU(struct aperfmperf_sample, samples); > > #define APERFMPERF_CACHE_THRESHOLD_MS 10 > -#define APERFMPERF_REFRESH_DELAY_MS 20 > +#define APERFMPERF_REFRESH_DELAY_MS 10 > #define APERFMPERF_STALE_THRESHOLD_MS 1000 > > /* > @@ -38,8 +40,6 @@ static void aperfmperf_snapshot_khz(void > u64 aperf, aperf_delta; > u64 mperf, mperf_delta; > struct aperfmperf_sample *s = this_cpu_ptr(&samples); > - ktime_t now = ktime_get(); > - s64 time_delta = ktime_ms_delta(now, s->time); > unsigned long flags; > > local_irq_save(flags); > @@ -57,38 +57,68 @@ static void aperfmperf_snapshot_khz(void > if (mperf_delta == 0) > return; > > - s->time = now; > + s->time = ktime_get(); > s->aperf = aperf; > s->mperf = mperf; > - > - /* If the previous iteration was too long ago, discard it. */ > - if (time_delta > APERFMPERF_STALE_THRESHOLD_MS) > - s->khz = 0; > - else > - s->khz = div64_u64((cpu_khz * aperf_delta), mperf_delta); > + s->khz = div64_u64((cpu_khz * aperf_delta), mperf_delta); > } > > -unsigned int arch_freq_get_on_cpu(int cpu) > +static bool aperfmperf_snapshot_cpu(int cpu, ktime_t now, bool wait) > { > - s64 time_delta; > - unsigned int khz; > + s64 time_delta = ktime_ms_delta(now, per_cpu(samples.time, cpu)); > + > + /* Don't bother re-computing within the cache threshold time. */ > + if (time_delta < APERFMPERF_CACHE_THRESHOLD_MS) > + return true; > + > + smp_call_function_single(cpu, aperfmperf_snapshot_khz, NULL, wait); > + > + /* Return false if the previous iteration was too long ago. */ > + return time_delta <= APERFMPERF_STALE_THRESHOLD_MS; > +} > > +unsigned int aperfmperf_get_khz(int cpu) > +{ > if (!cpu_khz) > return 0; > > if (!static_cpu_has(X86_FEATURE_APERFMPERF)) > return 0; > > - /* Don't bother re-computing within the cache threshold time. */ > - time_delta = ktime_ms_delta(ktime_get(), per_cpu(samples.time, cpu)); > - khz = per_cpu(samples.khz, cpu); > - if (khz && time_delta < APERFMPERF_CACHE_THRESHOLD_MS) > - return khz; > + aperfmperf_snapshot_cpu(cpu, ktime_get(), true); > + return per_cpu(samples.khz, cpu); > +} > > - smp_call_function_single(cpu, aperfmperf_snapshot_khz, NULL, 1); > - khz = per_cpu(samples.khz, cpu); > - if (khz) > - return khz; > +void arch_freq_prepare_all(void) > +{ > + ktime_t now = ktime_get(); > + bool wait = false; > + int cpu; > + > + if (!cpu_khz) > + return; > + > + if (!static_cpu_has(X86_FEATURE_APERFMPERF)) > + return; > + > + for_each_online_cpu(cpu) > + if (!aperfmperf_snapshot_cpu(cpu, now, false)) > + wait = true; > + > + if (wait) > + msleep(APERFMPERF_REFRESH_DELAY_MS); > +} > + > +unsigned int arch_freq_get_on_cpu(int cpu) > +{ > + if (!cpu_khz) > + return 0; > + > + if (!static_cpu_has(X86_FEATURE_APERFMPERF)) > + return 0; > + > + if (aperfmperf_snapshot_cpu(cpu, ktime_get(), true)) > + return per_cpu(samples.khz, cpu); > > msleep(APERFMPERF_REFRESH_DELAY_MS); > smp_call_function_single(cpu, aperfmperf_snapshot_khz, NULL, 1); > Index: linux-pm/arch/x86/kernel/cpu/cpu.h > =================================================================== > --- linux-pm.orig/arch/x86/kernel/cpu/cpu.h > +++ linux-pm/arch/x86/kernel/cpu/cpu.h > @@ -47,4 +47,7 @@ extern const struct cpu_dev *const __x86 > > extern void get_cpu_cap(struct cpuinfo_x86 *c); > extern void cpu_detect_cache_sizes(struct cpuinfo_x86 *c); > + > +unsigned int aperfmperf_get_khz(int cpu); > + > #endif /* ARCH_X86_CPU_H */ > Index: linux-pm/fs/proc/cpuinfo.c > =================================================================== > --- linux-pm.orig/fs/proc/cpuinfo.c > +++ linux-pm/fs/proc/cpuinfo.c > @@ -1,12 +1,18 @@ > // SPDX-License-Identifier: GPL-2.0 > +#include > #include > #include > #include > #include > > +__weak void arch_freq_prepare_all(void) > +{ > +} > + > extern const struct seq_operations cpuinfo_op; > static int cpuinfo_open(struct inode *inode, struct file *file) > { > + arch_freq_prepare_all(); > return seq_open(file, &cpuinfo_op); > } > > Index: linux-pm/include/linux/cpufreq.h > =================================================================== > --- linux-pm.orig/include/linux/cpufreq.h > +++ linux-pm/include/linux/cpufreq.h > @@ -917,6 +917,7 @@ static inline bool policy_has_boost_freq > } > #endif > > +extern void arch_freq_prepare_all(void); > extern unsigned int arch_freq_get_on_cpu(int cpu); > > extern void arch_set_freq_scale(struct cpumask *cpus, unsigned long cur_freq, > From 1584121286665402741@xxx Wed Nov 15 08:49:45 +0000 2017 X-GM-THRID: 1584093074206377441 X-Gmail-Labels: Inbox,Category Forums,HistoricalUnread