Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756323AbZICUL7 (ORCPT ); Thu, 3 Sep 2009 16:11:59 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1754053AbZICUL6 (ORCPT ); Thu, 3 Sep 2009 16:11:58 -0400 Received: from idcmail-mo2no.shaw.ca ([64.59.134.9]:56443 "EHLO idcmail-mo2no.shaw.ca" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752916AbZICUL6 (ORCPT ); Thu, 3 Sep 2009 16:11:58 -0400 X-Cloudmark-SP-Filtered: true X-Cloudmark-SP-Result: v=1.0 c=1 a=35ISBW3HG1kA:10 a=w4iE+TBsmj5y1WloLYF40w==:17 a=BUpYkHguLb7oZ9ECNr4A:9 a=SJ63gP8kBidTtP8gFCcA:7 a=ejDZuhq_z0llZKWsgPW3H4ZOxtcA:4 a=vkV03TReNj7VkvCi:21 a=-KxbjhXvOYv0Dapi:21 From: Thomas Fjellstrom Reply-To: tfjellstrom@shaw.ca To: linux-kernel@vger.kernel.org Subject: Re: BIOS update == more errors (was Re: sata exception frozen timeout?) Date: Thu, 3 Sep 2009 14:11:59 -0600 User-Agent: KMail/1.12.90 (Linux/2.6.30-1-amd64; KDE/4.3.64; x86_64; svn-1004284; 2009-07-29) References: <200909010316.58715.tfjellstrom@shaw.ca> <200909031338.39785.tfjellstrom@shaw.ca> In-Reply-To: <200909031338.39785.tfjellstrom@shaw.ca> MIME-Version: 1.0 Content-Type: Text/Plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Message-Id: <200909031411.59191.tfjellstrom@shaw.ca> Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 11071 Lines: 212 On Thu September 3 2009, Thomas Fjellstrom wrote: > I've updated my bios to try and see if it would help at all (it did seem to > fix other issues). > > But now I'm getting the following warnings and errors from dmesg on boot: > (debian sid 2.6.30, with "noapic" to see if the original problem was an > interupt issue, as everyone seems to have hinted at). > > [ 1.024337] ------------[ cut here ]------------ > [ 1.024408] WARNING: at /build/buildd-linux-2.6_2.6.30-6-amd64- > s9DPiZ/linux-2.6-2.6.30/debian/build/source_amd64_none/drivers/ata/libata- > core.c:6174 ata_host_activate+0x47/0xe0 [libata]() > [ 1.024492] Hardware name: GA-MA790FXT-UD5P > [ 1.024546] Modules linked in: crc_itu_t atiixp(+) ide_core pata_jmicron > ahci(+) ehci_hcd(+) libata scsi_mod r8169 mii thermal fan thermal_sys > [ 1.025196] Pid: 796, comm: work_for_cpu Not tainted 2.6.30-1-amd64 #1 > [ 1.025252] Call Trace: > [ 1.025318] [] ? ata_host_activate+0x47/0xe0 [libata] > [ 1.025385] [] ? ata_host_activate+0x47/0xe0 [libata] > [ 1.025445] [] ? warn_slowpath_common+0x77/0xa3 > [ 1.025505] [] ? ahci_interrupt+0x0/0x454 [ahci] > [ 1.025572] [] ? ata_host_activate+0x47/0xe0 [libata] > [ 1.025632] [] ? ahci_init_one+0xbb4/0xbd4 [ahci] > [ 1.025691] [] ? do_work_for_cpu+0x0/0x1b > [ 1.025749] [] ? local_pci_probe+0x12/0x16 > [ 1.025806] [] ? do_work_for_cpu+0xb/0x1b > [ 1.025862] [] ? kthread+0x54/0x80 > [ 1.025918] [] ? child_rip+0xa/0x20 > [ 1.025975] [] ? kthread+0x0/0x80 > [ 1.026030] [] ? child_rip+0x0/0x20 > [ 1.026085] ---[ end trace 54d3fd405814ad85 ]--- > ... > [ 14.872233] ata10.15: qc timeout (cmd 0xe4) > [ 14.872925] ata10.15: failed to read PMP GSCR[0] (Emask=0x4) > [ 14.873256] ata9.15: qc timeout (cmd 0xe4) > [ 14.873344] ata10: limiting SATA link speed to 1.5 Gbps > [ 14.874008] ata9.15: failed to read PMP GSCR[0] (Emask=0x4) > [ 14.874271] ata9: limiting SATA link speed to 1.5 Gbps > [ 15.548002] irq 7: nobody cared (try booting with the "irqpoll" option) > [ 15.548121] Pid: 0, comm: swapper Tainted: G W 2.6.30-1-amd64 #1 > [ 15.548240] Call Trace: > [ 15.548346] [] ? __report_bad_irq+0x30/0x7d > [ 15.548569] [] ? note_interrupt+0x105/0x170 > [ 15.548689] [] ? handle_level_irq+0x7c/0xaf > [ 15.548809] [] ? handle_irq+0x17/0x1d > [ 15.548929] [] ? do_IRQ+0x57/0xbf > [ 15.549044] [] ? ret_from_intr+0x0/0x11 > [ 15.549161] [] ? early_idt_handler+0x0/0x71 > [ 15.549379] [] ? native_safe_halt+0x2/0x3 > [ 15.549497] [] ? default_idle+0x40/0x68 > [ 15.549612] [] ? clockevents_notify+0x2b/0x75 > [ 15.549730] [] ? c1e_idle+0xe5/0x10d > [ 15.549848] [] ? cpu_idle+0x50/0x91 > [ 15.549963] [] ? start_kernel+0x37a/0x386 > [ 15.550081] [] ? x86_64_start_kernel+0xf9/0x106 > [ 15.550199] handlers: > [ 15.550311] [] (usb_hcd_irq+0x0/0x7e) > [ 15.550630] Disabling IRQ #7 > [ 15.636304] irq 11: nobody cared (try booting with the "irqpoll" option) > [ 15.636377] Pid: 0, comm: swapper Tainted: G W 2.6.30-1-amd64 #1 > [ 15.636434] Call Trace: > [ 15.636486] [] ? __report_bad_irq+0x30/0x7d > [ 15.636586] [] ? note_interrupt+0x105/0x170 > [ 15.636643] [] ? handle_level_irq+0x7c/0xaf > [ 15.636699] [] ? handle_irq+0x17/0x1d > [ 15.636755] [] ? do_IRQ+0x57/0xbf > [ 15.636811] [] ? ret_from_intr+0x0/0x11 > [ 15.636866] [] ? early_idt_handler+0x0/0x71 > [ 15.636966] [] ? native_safe_halt+0x2/0x3 > [ 15.637022] [] ? default_idle+0x40/0x68 > [ 15.637078] [] ? clockevents_notify+0x2b/0x75 > [ 15.637135] [] ? c1e_idle+0xe5/0x10d > [ 15.637191] [] ? cpu_idle+0x50/0x91 > [ 15.637247] [] ? start_kernel+0x37a/0x386 > [ 15.637304] [] ? x86_64_start_kernel+0xf9/0x106 > [ 15.637359] handlers: > [ 15.637411] [] (usb_hcd_irq+0x0/0x7e) > [ 15.637549] [] (usb_hcd_irq+0x0/0x7e) > [ 15.637687] [] (usb_hcd_irq+0x0/0x7e) > [ 15.637825] Disabling IRQ #11 > [ 17.536129] usb-storage: device scan complete > ... > [ 21.388126] ata9.15: qc timeout (cmd 0xe4) > [ 21.388194] ata9.15: failed to read PMP GSCR[0] (Emask=0x4) > [ 21.588135] ata10.15: qc timeout (cmd 0xe4) > [ 21.588204] ata10.15: failed to read PMP GSCR[0] (Emask=0x4) > [ 24.832135] ata9: SATA link up 1.5 Gbps (SStatus 113 SControl 310) > [ 25.632183] ata10: SATA link up 1.5 Gbps (SStatus 113 SControl 310) > ... > [ 198.322072] ata8.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 > frozen > [ 198.322080] ata8.00: cmd b0/da:00:00:4f:c2/00:00:00:00:00/00 tag 0 > [ 198.322081] res 40/00:00:00:4f:c2/00:00:00:00:00/00 Emask 0x4 > (timeout) > [ 198.322083] ata8.00: status: { DRDY } > [ 198.322088] ata8: hard resetting link > [ 198.916035] ata8: softreset failed (device not ready) > [ 198.916039] ata8: failed due to HW bug, retry pmp=0 > [ 199.080024] ata8: SATA link up 3.0 Gbps (SStatus 123 SControl 300) > [ 199.092059] ata8.00: configured for UDMA/133 > [ 199.092072] ata8: EH complete > [ 227.900583] ata8.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 > frozen > [ 227.900590] ata8.00: cmd b0/d8:00:01:4f:c2/00:00:00:00:00/00 tag 0 > [ 227.900591] res 40/00:00:af:88:e0/00:00:e8:00:00/e0 Emask 0x4 > (timeout) > [ 227.900594] ata8.00: status: { DRDY } > [ 227.900598] ata8: hard resetting link > [ 228.384016] ata8: softreset failed (device not ready) > [ 228.384020] ata8: failed due to HW bug, retry pmp=0 > [ 228.548024] ata8: SATA link up 3.0 Gbps (SStatus 123 SControl 300) > [ 228.560372] ata8.00: configured for UDMA/133 > [ 228.560385] ata8: EH complete > [ 238.805198] ata8.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 > frozen > [ 238.805218] ata8.00: cmd b0/d8:00:00:4f:c2/00:00:00:00:00/00 tag 0 > [ 238.805221] res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 > (timeout) > [ 238.805229] ata8.00: status: { DRDY } > [ 238.805241] ata8: hard resetting link > [ 239.404154] ata8: softreset failed (device not ready) > [ 239.404163] ata8: failed due to HW bug, retry pmp=0 > [ 239.568186] ata8: SATA link up 3.0 Gbps (SStatus 123 SControl 300) > [ 239.580343] ata8.00: configured for UDMA/133 > [ 239.580374] ata8: EH complete > [ 246.808086] ata8.00: NCQ disabled due to excessive errors > [ 246.808099] ata8.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 > frozen > [ 246.808115] ata8.00: cmd b0/d8:00:01:4f:c2/00:00:00:00:00/00 tag 0 > [ 246.808119] res 40/00:00:af:88:e0/00:00:e8:00:00/e0 Emask 0x4 > (timeout) > [ 246.808126] ata8.00: status: { DRDY } > [ 246.808138] ata8: hard resetting link > [ 247.292158] ata8: softreset failed (device not ready) > [ 247.292167] ata8: failed due to HW bug, retry pmp=0 > [ 247.456174] ata8: SATA link up 3.0 Gbps (SStatus 123 SControl 300) > [ 247.468955] ata8.00: configured for UDMA/133 > [ 247.468984] ata8: EH complete > [ 272.804207] ata8.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 > frozen > [ 272.804227] ata8.00: cmd b0/da:00:00:4f:c2/00:00:00:00:00/00 tag 0 > [ 272.804231] res 40/00:00:af:88:e0/00:00:e8:00:00/e0 Emask 0x4 > (timeout) > [ 272.804238] ata8.00: status: { DRDY } > [ 272.804250] ata8: hard resetting link > [ 273.292161] ata8: softreset failed (device not ready) > [ 273.292169] ata8: failed due to HW bug, retry pmp=0 > [ 273.456173] ata8: SATA link up 3.0 Gbps (SStatus 123 SControl 300) > [ 273.468892] ata8.00: configured for UDMA/133 > [ 273.468916] ata8: EH complete > > On boot, it seemed to hang the disk up for a good few minutes, even though > nothing is using it at the moment (I have to manually bring up the mdraid0 > array, so it can't possibly be mounted), and smartctl was erroring out for > a while, but now its fine, and smart shows no issues. > > I'm going to try without noapic on 2.6.30, and 2.6.31-rc5 and see what > happens. back with 2.6.30 apic enabled, all the traces are gone, but I still get the SATA errors, and a new message: ata8: SError: { HostInt } [ 415.781659] ata8.00: exception Emask 0x40 SAct 0x0 SErr 0x800 action 0x6 frozen [ 415.781672] ata8: SError: { HostInt } [ 415.781687] ata8.00: cmd b0/d8:00:00:4f:c2/00:00:00:00:00/00 tag 0 [ 415.781690] res 40/00:00:00:4f:c2/00:00:00:00:00/00 Emask 0x44 (timeout) [ 415.781697] ata8.00: status: { DRDY } [ 415.781708] ata8: hard resetting link [ 416.264190] ata8: softreset failed (device not ready) [ 416.264199] ata8: failed due to HW bug, retry pmp=0 [ 416.428196] ata8: SATA link up 3.0 Gbps (SStatus 123 SControl 300) [ 416.440517] ata8.00: configured for UDMA/133 [ 416.440544] ata8: EH complete [ 424.781778] ata8.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen [ 424.781798] ata8.00: cmd b0/d8:00:00:4f:c2/00:00:00:00:00/00 tag 0 [ 424.781801] res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) [ 424.781809] ata8.00: status: { DRDY } [ 424.781820] ata8: hard resetting link [ 425.265677] ata8: softreset failed (device not ready) [ 425.265686] ata8: failed due to HW bug, retry pmp=0 [ 425.429546] ata8: SATA link up 3.0 Gbps (SStatus 123 SControl 300) [ 425.442153] ata8.00: configured for UDMA/133 [ 425.442179] ata8: EH complete [ 458.482002] CE: hpet increasing min_delta_ns to 15000 nsec [ 499.780213] ata8.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen [ 499.780233] ata8.00: cmd b0/d8:00:00:4f:c2/00:00:00:00:00/00 tag 0 [ 499.780237] res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) [ 499.780244] ata8.00: status: { DRDY } [ 499.780256] ata8: hard resetting link [ 500.320191] ata8: softreset failed (device not ready) [ 500.320200] ata8: failed due to HW bug, retry pmp=0 [ 500.485084] ata8: SATA link up 3.0 Gbps (SStatus 123 SControl 300) [ 500.497235] ata8.00: configured for UDMA/133 [ 500.497256] ata8: EH complete And now with 2.6.31-rc5, instant ata exceptions, same as before (just no SError line this time). -- Thomas Fjellstrom tfjellstrom@shaw.ca -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/