Message-ID: <4FD4F45D.5050103@pr.hu>
Date: Sun, 10 Jun 2012 21:24:13 +0200
From: Boszormenyi Zoltan
To: linux-kernel@vger.kernel.org
Subject: AMD FX CPU bug, not fixed by latest microcode?

Hi,

I have an AMD FX-8120 boxed CPU in an ASUS M5A99X-EVO mainboard with 32GB DDR3/1600 memory, running Fedora 17, upgraded from 16. memtest86+ shows no problems. Still, I get occasional crashes and signal 11 during kernel compilation, even with single-job make. Sometimes the compiler bails out with a strange error message, like "stray \NNN character in the source". When re-running make, the error doesn't happen in the same file, and the source file doesn't contain the character being complained about when inspected with an editor or hexdump.

Now, a few minutes ago I was able to catch this bug when I copied the kernel GIT tree to apply a patch manually and did "git commit -a". Strangely, the commit contained one extra file that I didn't touch.
git diff showed this for the extra file:

==============================
--- a/drivers/usb/gadget/fsl_usb2_udc.h
+++ b/drivers/usb/gadget/fsl_usb2_udc.h
@@ -427,7 +427,7 @@ struct ep_td_struct {
 #define DTD_ADDR_MASK 0xFFFFFFE0
 #define DTD_PACKET_SIZE 0x7FFF0000
 #define DTD_LENGTH_BIT_POS 16
-#define DTD_ERROR_MASK (DTD_STATUS_HALTED | \
+#define DTD_ERROR_MASK (DTD_STATUS_HALTED | ^Z
 DTD_STATUS_DATA_BUFF_ERR | \
 DTD_STATUS_TRANSACTION_ERR)
 /* Alignment requirements; must be a power of two */
==============================

The "^Z" is a 0-character in the file and is not present in the original source tree, only in the copy. Similar errors happened while copying large files on the same machine, but it seems that reading a large enough total amount of data is sufficient to trigger it.

The mainboard has the latest (UEFI) firmware flashed, which contains the latest AMD microcode, so microcode_ctl doesn't need to apply it anymore. Previously, I used amd-ucode-2012-01-17.tar from www.amd64.org/support/microcode.html, which is now part of microcode_ctl in Fedora.

Since the error happens while compiling a source file and not only while copying, the bug seems to happen while *reading* data.

Does anyone know whether this is a known problem in AMD FX CPUs? Does AMD have a newer microcode to fix this bug, or should I apply for warranty?

Thanks in advance,
Zoltán Böszörményi
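
One way to probe the read-side hypothesis described above is to hash the same file several times and, separately, to locate the first byte at which a copy differs from its original. The following is only a minimal sketch under assumptions: the helper names read_digests and first_difference are hypothetical and not part of any tool mentioned in this message, and repeated reads of a file that fits in RAM are likely served from the page cache, so the check exercises the memory and CPU path rather than the disk.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of the file at `path`, read in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def read_digests(path, passes=5):
    """Hash `path` `passes` times and return the set of digests seen.

    On a healthy machine the set has exactly one element; more than one
    means two reads of the same unchanged file returned different bytes.
    """
    return {sha256_of(path) for _ in range(passes)}


def first_difference(path_a, path_b, chunk_size=1 << 20):
    """Return the byte offset of the first difference, or None if identical."""
    offset = 0
    with open(path_a, "rb") as fa, open(path_b, "rb") as fb:
        while True:
            a = fa.read(chunk_size)
            b = fb.read(chunk_size)
            if a != b:
                for i, (x, y) in enumerate(zip(a, b)):
                    if x != y:
                        return offset + i
                # No differing byte in the common part: one file is shorter.
                return offset + min(len(a), len(b))
            if not a:  # both reads hit end of file with no difference
                return None
            offset += len(a)
```

Calling read_digests on a large, unchanged file and getting more than one digest would point at corruption while reading; first_difference on an original file and its copy gives the offset to inspect with hexdump.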