Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755230Ab1FOWQp (ORCPT ); Wed, 15 Jun 2011 18:16:45 -0400 Received: from mail.skyhub.de ([78.46.96.112]:34437 "EHLO mail.skyhub.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751573Ab1FOWQl (ORCPT ); Wed, 15 Jun 2011 18:16:41 -0400 Date: Thu, 16 Jun 2011 00:16:36 +0200 From: Borislav Petkov To: Konrad Rzeszutek Wilk Cc: linux-kernel@vger.kernel.org, davej@redhat.com, tglx@linutronix.de, mingo@redhat.com, hpa@zytor.com, x86@kernel.org, cpufreq@vger.kernel.org, andre.przywara@amd.com, Mark.Langsdorf@amd.com Subject: Re: [PATCH 1/2] [CPUFREQ] powernow-k8: Don't notify of successful transition if we failed. Message-ID: <20110615221636.GB9725@liondog.tnic> Mail-Followup-To: Borislav Petkov , Konrad Rzeszutek Wilk , linux-kernel@vger.kernel.org, davej@redhat.com, tglx@linutronix.de, mingo@redhat.com, hpa@zytor.com, x86@kernel.org, cpufreq@vger.kernel.org, andre.przywara@amd.com, Mark.Langsdorf@amd.com References: <1308164520-14145-1-git-send-email-konrad.wilk@oracle.com> <1308164520-14145-2-git-send-email-konrad.wilk@oracle.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <1308164520-14145-2-git-send-email-konrad.wilk@oracle.com> User-Agent: Mutt/1.5.20 (2009-06-14) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3940 Lines: 94 On Wed, Jun 15, 2011 at 03:01:59PM -0400, Konrad Rzeszutek Wilk wrote: > Before this patch if we failed the transition (either p-state or > voltage) we would still try to submit the "new" frequencies to cpufreq. > That is incorrect - also we could submit a non-existing frequency value > which would cause cpufreq to crash. The ultimate fix is in cpufreq > to deal with incorrect values, but this patch improves the error > recovery in the AMD powernowk8 driver. > > The failure that was reported was as follow: > > powernow-k8: Found 1 AMD Athlon(tm) 64 Processor 3700+ (1 cpu cores) (version 2.20.00) > powernow-k8: fid 0x2 (1000 MHz), vid 0x12 > powernow-k8: fid 0xa (1800 MHz), vid 0xa > powernow-k8: fid 0xc (2000 MHz), vid 0x8 > powernow-k8: fid 0xe (2200 MHz), vid 0x8 > Marking TSC unstable due to cpufreq changes > powernow-k8: fid trans failed, fid 0x2, curr 0x0 > BUG: unable to handle kernel paging request at ffff880807e07b78 > IP: [] cpufreq_stats_update+0x46/0x5b > ... > > And transition fails and data->currfid ends up with 0. Since > the machine does not support 800Mhz value when the calculation is > done ('find_khz_freq_from_fid(data->currfid);') it reports the > new frequency as 800000 which is bogus. This patch fixes > the issue during target setting. > > The patch however does not fix the issue in 'powernowk8_cpu_init' > where the pol->cur can also be set with the 800000 value: > > pol->cur = find_khz_freq_from_fid(data->currfid); > dprintk("policy current frequency %d kHz\n", pol->cur); > > /* min/max the cpu is capable of */ > if (cpufreq_frequency_table_cpuinfo(pol, data->powernow_table)) { > > The fix for that looks to update cpufreq_frequency_table_cpuinfo to > check pol->cur.... but that would cause an regression in how the > acpi-cpufreq driver works (it sets cpu->cur after calling > cpufreq_frequency_table_cpuinfo). Instead the fix will be to let > cpufreq gracefully handle bogus data. > > Reported-by: Tobias Diedrich > Tested-by: Tobias Diedrich > Signed-off-by: Konrad Rzeszutek Wilk > --- > arch/x86/kernel/cpu/cpufreq/powernow-k8.c | 7 ++++++- > 1 files changed, 6 insertions(+), 1 deletions(-) > > diff --git a/arch/x86/kernel/cpu/cpufreq/powernow-k8.c b/arch/x86/kernel/cpu/cpufreq/powernow-k8.c > index 2368e38..7e4a664 100644 > --- a/arch/x86/kernel/cpu/cpufreq/powernow-k8.c > +++ b/arch/x86/kernel/cpu/cpufreq/powernow-k8.c > @@ -1079,8 +1079,10 @@ static int transition_frequency_fidvid(struct powernow_k8_data *data, > } > > res = transition_fid_vid(data, fid, vid); > - freqs.new = find_khz_freq_from_fid(data->currfid); > + if (res) > + return res; > > + freqs.new = find_khz_freq_from_fid(data->currfid); > for_each_cpu(i, data->available_cores) { > freqs.cpu = i; > cpufreq_notify_transition(&freqs, CPUFREQ_POSTCHANGE); > @@ -1112,6 +1114,9 @@ static int transition_frequency_pstate(struct powernow_k8_data *data, > } > > res = transition_pstate(data, pstate); > + if (res) > + return res; That's wrong because transition_pstate() returns 0 unconditionally (at least it does so on 3.0-rc3). But this change accidentally fixes a different bug because res is used uninitialized, containing stack garbage otherwise. A proper fix should be to check against data->max_hw_pstate and check whether the entry is not CPUFREQ_ENTRY_INVALID (look at fill_powernow_table_pstate() for example). I'm guessing this oops happens when powernow-k8 is loaded in the guest and that the actual power management is done in the hypervisor? Adding some more people to CC. -- Regards/Gruss, Boris. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/