Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754927AbaFNPpm (ORCPT ); Sat, 14 Jun 2014 11:45:42 -0400 Received: from cmta8.telus.net ([209.171.16.81]:34258 "EHLO cmta8.telus.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754797AbaFNPpi convert rfc822-to-8bit (ORCPT ); Sat, 14 Jun 2014 11:45:38 -0400 X-Authority-Analysis: v=2.0 cv=ffdzPTsF c=1 sm=2 a=zJWegnE7BH9C0Gl4FFgQyA==:17 a=DRQr04eNgRgA:10 a=LGgl8L9ij00A:10 a=IkcTkHD0fZMA:10 a=aatUQebYAAAA:8 a=Pyq9K9CWowscuQLKlpiwfMBGOR0=:19 a=t_AnKmSXQ3x9DNeFk34A:9 a=QEXdDO2ut3YA:10 a=zJWegnE7BH9C0Gl4FFgQyA==:117 X-Telus-Outbound-IP: 173.180.45.4 From: "Doug Smythies" To: "'Stratos Karafotis'" , "'Dirk Brandewie'" , "'Rafael J. Wysocki'" , "'Viresh Kumar'" , "'Dirk Brandewie'" Cc: , "'LKML'" References: <53962067.4060504@semaphore.gr> <53972CE1.1060209@gmail.com> <539731D1.6040504@semaphore.gr> In-Reply-To: <539731D1.6040504@semaphore.gr> Subject: RE: [PATCH 2/7] cpufreq: intel_pstate: Avoid duplicate call of intel_pstate_get_scaled_busy Date: Sat, 14 Jun 2014 08:45:36 -0700 Message-ID: <000b01cf87e7$b06afb30$1140f190$@net> MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 8BIT X-Mailer: Microsoft Office Outlook 12.0 Thread-Index: Ac+EyM8aC1SS74MbTiy5CXgedwI9EgDG2LdA Content-Language: en-ca Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org I am sorry to be late chiming in on this one. On 2014.06.10 09:27 Stratos Karafotis wrote: > On 10/06/2014 07:05 μμ, Dirk Brandewie wrote: > On 06/09/2014 02:00 PM, Stratos Karafotis wrote: >> Store busy_scaled value to avoid to duplicate call of >> intel_pstate_get_scaled_busy on every sampling interval. >> >> >> The second call *only* happens if the tracepoint is being used otherwise >> the whole function call to trace_pstate_sample() is a noop. > Yes, I'm sorry, I forgot to add this in my changelog. I have written this > in cover letter. > I made this change mostly to support patch 3/7. >> This makes the code less readable IMHO the reader is left wondering >> how cpu->sample.busy_scaled was set in intel_pstate_adjust_busy_pstate() >> > I agree that the the original code is more readable. If we don't care > about the small overhead when tracing is on and forget patch 3/7, > of course the original code is by far better. Actually, when reading the code, I found it odd to call the function twice. However by far the much more important issue here, in my opinion, is that if one is using the tracepoint stuff, then the second call to intel_pstate_get_scaled_busy can give a different result than the first call. Why? Because "cpu->pstate.current_pstate" may have changed between the two calls. In the end the user (me in this case) of the tracepoint stuff can end up pulling (what's left of) their hair out and going around in circles attempting to figure out why doing the so simple math by hand doesn't seem to agree with the tracepoint data. As a side note: I am now pulling the tracepoint data into a spreadsheet and calculating what "scaled" should be myself. >>> Also, rename the function to intel_pstate_calc_scaled_busy. >>> >>> Signed-off-by: Stratos Karafotis >>> --- >>> drivers/cpufreq/intel_pstate.c | 12 ++++++------ >>> 1 file changed, 6 insertions(+), 6 deletions(-) >>> >>> diff --git a/drivers/cpufreq/intel_pstate.c b/drivers/cpufreq/intel_pstate.c >>> index 4e7f492..31e2ae5 100644 >>> --- a/drivers/cpufreq/intel_pstate.c >>> +++ b/drivers/cpufreq/intel_pstate.c >>> @@ -55,6 +55,7 @@ static inline int32_t div_fp(int32_t x, int32_t y) >>> >>> struct sample { >>> int32_t core_pct_busy; >>> + int32_t busy_scaled; >>> u64 aperf; >>> u64 mperf; >>> int freq; >>> @@ -604,7 +605,7 @@ static inline void intel_pstate_set_sample_time(struct cpudata *cpu) >>> mod_timer_pinned(&cpu->timer, jiffies + delay); >>> } >>> >>> -static inline int32_t intel_pstate_get_scaled_busy(struct cpudata *cpu) >>> +static inline void intel_pstate_calc_scaled_busy(struct cpudata *cpu) >>> { >>> int32_t core_busy, max_pstate, current_pstate, sample_ratio; >>> u32 duration_us; >>> @@ -624,20 +625,19 @@ static inline int32_t intel_pstate_get_scaled_busy(struct cpudata *cpu) >>> core_busy = mul_fp(core_busy, sample_ratio); >>> } >>> >>> - return core_busy; >>> + cpu->sample.busy_scaled = core_busy; >>> } >>> >>> static inline void intel_pstate_adjust_busy_pstate(struct cpudata *cpu) >>> { >>> - int32_t busy_scaled; >>> struct _pid *pid; >>> signed int ctl = 0; >>> int steps; >>> >>> pid = &cpu->pid; >>> - busy_scaled = intel_pstate_get_scaled_busy(cpu); >>> + intel_pstate_calc_scaled_busy(cpu); >>> >>> - ctl = pid_calc(pid, busy_scaled); >>> + ctl = pid_calc(pid, cpu->sample.busy_scaled); >>> >>> steps = abs(ctl); >>> >>> @@ -659,7 +659,7 @@ static void intel_pstate_timer_func(unsigned long __data) >>> intel_pstate_adjust_busy_pstate(cpu); >>> >>> trace_pstate_sample(fp_toint(sample->core_pct_busy), >>> - fp_toint(intel_pstate_get_scaled_busy(cpu)), >>> + fp_toint(sample->busy_scaled), >>> cpu->pstate.current_pstate, >>> sample->mperf, >>> sample->aperf, >>> >> >> -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/