Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1758986Ab1EMAEw (ORCPT ); Thu, 12 May 2011 20:04:52 -0400 Received: from mx1.redhat.com ([209.132.183.28]:49393 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1758930Ab1EMAEv (ORCPT ); Thu, 12 May 2011 20:04:51 -0400 Date: Thu, 12 May 2011 19:59:38 -0400 From: Chuck Ebbert To: linux-kernel@vger.kernel.org Cc: Hans Rosenfeld , Boris Ostrovsky , Borislav Petkov Subject: [PATCH] cpu, AMD: Fix another bug in the new errata checking code Message-ID: <20110512195938.1728ab52@katamari> Organization: Red Hat, Inc. Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2795 Lines: 79 Fix a bug that causes CPU hangs due to missing timer interrupts, introduced by these three patches: (1) commit d78d671db478eb8b14c78501c0cee1cc7baf6967 "x86, cpu: AMD errata checking framework" (2) commit 9d8888c2a214aece2494a49e699a097c2ba9498b "x86, cpu: Clean up AMD erratum 400 workaround" (3) commit b87cf80af3ba4b4c008b4face3c68d604e1715c6 "x86, AMD: Set ARAT feature on AMD processors" Patch (1) introduced a new framework that allowed checking for errata using AMD's OSVW (OS visible workaround) feature combined with explicit lists of models. It checked OSVW first, and completely relied on that if it was present and usable. Patch (2) switched the checking for erratum 400 to use the new framework. But the original code checked for an explicit model range first, then used OSVW if the CPU was not within that range. Patch (2) also inexplicably added a second model range (for Family 10h) that was never in the original code. Then patch (3) used the new erratum 400 checks to decide whether to enable the ARAT feature (always running APIC timer.) However, this causes notebooks using the Sempron processor (Family 10h Model 6 Stepping 2) to enable ARAT when they shouldn't because the explicit check for that model gets skipped. The fix is to check the model list first, then use OSVW if the CPU is not in that list. Signed-off-by: Chuck Ebbert --- NOTE: Untested, but this looks like the obvious fix. --- a/arch/x86/kernel/cpu/amd.c +++ b/arch/x86/kernel/cpu/amd.c @@ -723,6 +723,17 @@ bool cpu_has_amd_erratum(const int *erra if (cpu->x86_vendor != X86_VENDOR_AMD) return false; + /* + * Must match family-model-stepping range first so that the + * range checks will override OSVW checking. + */ + ms = (cpu->x86_model << 4) | cpu->x86_mask; + while ((range = *erratum++)) + if ((cpu->x86 == AMD_MODEL_RANGE_FAMILY(range)) && + (ms >= AMD_MODEL_RANGE_START(range)) && + (ms <= AMD_MODEL_RANGE_END(range))) + return true; + if (osvw_id >= 0 && osvw_id < 65536 && cpu_has(cpu, X86_FEATURE_OSVW)) { u64 osvw_len; @@ -737,15 +748,6 @@ bool cpu_has_amd_erratum(const int *erra } } - /* OSVW unavailable or ID unknown, match family-model-stepping range */ - ms = (cpu->x86_model << 4) | cpu->x86_mask; - while ((range = *erratum++)) - if ((cpu->x86 == AMD_MODEL_RANGE_FAMILY(range)) && - (ms >= AMD_MODEL_RANGE_START(range)) && - (ms <= AMD_MODEL_RANGE_END(range))) - return true; - return false; } - EXPORT_SYMBOL_GPL(cpu_has_amd_erratum); _ -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/