Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
        id S1754776AbbGXRin (ORCPT ); Fri, 24 Jul 2015 13:38:43 -0400
Received: from mail-yk0-f175.google.com ([209.85.160.175]:34020 "EHLO
        mail-yk0-f175.google.com" rhost-flags-OK-OK-OK-OK)
        by vger.kernel.org with ESMTP id S1753314AbbGXRil (ORCPT );
        Fri, 24 Jul 2015 13:38:41 -0400
MIME-Version: 1.0
In-Reply-To:
References: <20150723183548.GS15934@mtj.duckdns.org>
Date: Fri, 24 Jul 2015 13:38:40 -0400
Message-ID:
Subject: Re: ATA failure regression in kernel 4.2
From: Alex Deucher
To: Tejun Heo, Jiang Liu
Cc: LKML
Content-Type: text/plain; charset=UTF-8
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org
Content-Length: 9412
Lines: 201

On Thu, Jul 23, 2015 at 2:44 PM, Alex Deucher wrote:
> On Thu, Jul 23, 2015 at 2:35 PM, Tejun Heo wrote:
>> Hello,
>>
>> On Thu, Jul 23, 2015 at 01:48:24PM -0400, Alex Deucher wrote:
>>> Something new in kernel 4.2 seems to have broken one of my hard drives
>>> (ssd) in kernel 4.2. 4.1 and older kernels work fine. Here are the
>>> relevant logs.
>>>
>> ...
>>> [ 6.547628] ata2.00: qc timeout (cmd 0xec)
>>> [ 6.547721] ata2.00: failed to IDENTIFY (I/O error, err_mask=0x4)
>>> [ 7.007213] ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
>>> [ 16.997819] ata2.00: qc timeout (cmd 0xec)
>>> [ 16.997910] ata2.00: failed to IDENTIFY (I/O error, err_mask=0x4)
>>> [ 16.997995] ata2: limiting SATA link speed to 3.0 Gbps
>>> [ 17.457400] ata2: SATA link up 3.0 Gbps (SStatus 123 SControl 320)
>>> [ 47.429257] ata2.00: qc timeout (cmd 0xec)
>>> [ 47.429349] ata2.00: failed to IDENTIFY (I/O error, err_mask=0x4)
>>> [ 47.888822] ata2: SATA link up 3.0 Gbps (SStatus 123 SControl 320)
>>
>> Nothing really rings a bell. Timeouts on IDENTIFY. Could be IRQ
>> related. Which controller is it (lspci -nn)? Also, can you try to
>> bisect the issue?
>
> 00:11.0 SATA controller [0106]: Advanced Micro Devices, Inc. [AMD] FCH
> SATA Controller [AHCI mode] [1022:7801] (rev 40)
> 00:14.1 IDE interface [0101]: Advanced Micro Devices, Inc. [AMD] FCH
> IDE Controller [1022:780c]
>
> I can take a look at bisecting later this week.

You were right about the interrupts. This is an AMD Kaveri APU system.

52f518a3a7c2f80551a38d38be28bc9f335e713c is the first bad commit
commit 52f518a3a7c2f80551a38d38be28bc9f335e713c
Author: Jiang Liu
Date: Mon Apr 13 14:11:35 2015 +0800

    x86/MSI: Use hierarchical irqdomains to manage MSI interrupts

    Enhance MSI code to support hierarchical irqdomains, it helps to make
    the architecture more clear.

    Signed-off-by: Jiang Liu
    Cc: Konrad Rzeszutek Wilk
    Cc: David Cohen
    Cc: Sander Eikelenboom
    Cc: David Vrabel
    Cc: Tony Luck
    Cc: Greg Kroah-Hartman
    Cc: iommu@lists.linux-foundation.org
    Cc: Joerg Roedel
    Cc: Bjorn Helgaas
    Cc: Benjamin Herrenschmidt
    Cc: Rafael J. Wysocki
    Cc: Randy Dunlap
    Cc: Yinghai Lu
    Cc: Borislav Petkov
    Cc: Dimitri Sivanich
    Cc: Joerg Roedel
    Link: http://lkml.kernel.org/r/1428905519-23704-14-git-send-email-jiang.liu@linux.intel.com
    Signed-off-by: Thomas Gleixner

:040000 040000 786bcad9a3fad413e0b744e2cfa20da7ff402db6 22618cac66dee85a7752bb3af81169fff3a242d8 M arch
:040000 040000 acee54015803d4cd52d582a9e5e93aa56ad08482 40d2c7a02c0f8677e596c98c936404b2211336a3 M drivers

git bisect start
# bad: [3aa20508a6fe386c2a893027ef4c4ef78ee4eac2] Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/jmorris/linux-security
git bisect bad 3aa20508a6fe386c2a893027ef4c4ef78ee4eac2
# good: [b953c0d234bc72e8489d3bf51a276c5c4ec85345] Linux 4.1
git bisect good b953c0d234bc72e8489d3bf51a276c5c4ec85345
# bad: [0db9723cacf4d62bc3685fb15179b39ee4e17679] Merge branch 'next' of git://git.kernel.org/pub/scm/linux/kernel/git/rzhang/linux
git bisect bad 0db9723cacf4d62bc3685fb15179b39ee4e17679
# bad: [e3d8238d7f5c3f539a29f5ac596cd342d847e099] Merge tag 'arm64-upstream' of git://git.kernel.org/pub/scm/linux/kernel/git/arm64/linux
git bisect bad e3d8238d7f5c3f539a29f5ac596cd342d847e099
# bad: [c70c5fb2b96dae0996fb0877d996458d3ca57eda] Merge tag 'pwm/for-4.2-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/thierry.reding/linux-pwm
git bisect bad c70c5fb2b96dae0996fb0877d996458d3ca57eda
# good: [e75c73ad64478c12b3a44b86a3e7f62a4f65b93e] Merge branch 'x86-fpu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip
git bisect good e75c73ad64478c12b3a44b86a3e7f62a4f65b93e
# bad: [43224b96af3154cedd7220f7b90094905f07ac78] Merge branch 'timers-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip
git bisect bad 43224b96af3154cedd7220f7b90094905f07ac78
# bad: [9dda1658a9bd450d65da5153a2427955785d17c2] Merge branch 'x86/asm' into x86/core, to prepare for new patch
git bisect bad 9dda1658a9bd450d65da5153a2427955785d17c2
# bad: [191a66353b22fad8ac89404ab4c929cbe7b0afb2] Merge branch 'x86/asm' into x86/apic, to resolve a conflict
git bisect bad 191a66353b22fad8ac89404ab4c929cbe7b0afb2
# bad: [099c5c03487f6bca30c628e14e666788dd61fb33] irq_remapping/vt-d: Move struct irq_2_iommu into intel_irq_remapping.c
git bisect bad 099c5c03487f6bca30c628e14e666788dd61fb33
# bad: [90d84fe95dd6b418383aa0e0e5cace8f1b1e7e30] x86/MSI: Replace msi_update_msg() with irq_chip_compose_msi_msg()
git bisect bad 90d84fe95dd6b418383aa0e0e5cace8f1b1e7e30
# good: [3cb96f0c97330834929abe9bd2ca3c252a83def0] x86/hpet: Enhance HPET IRQ to support hierarchical irqdomains
git bisect good 3cb96f0c97330834929abe9bd2ca3c252a83def0
# bad: [b1855c752e67d1125d41fadb499014b49a245db8] x86/MSI: Clean up unused MSI related code and interfaces
git bisect bad b1855c752e67d1125d41fadb499014b49a245db8
# bad: [2b43817e481da9f5118adb56aef46b3f0298c685] irq_remapping/vt-d: Clean up unused MSI related code
git bisect bad 2b43817e481da9f5118adb56aef46b3f0298c685
# bad: [80aa283364a17998dceb577bd185e3380b927544] x86/irq: Directly call native_compose_msi_msg() for DMAR IRQ
git bisect bad 80aa283364a17998dceb577bd185e3380b927544
# bad: [52f518a3a7c2f80551a38d38be28bc9f335e713c] x86/MSI: Use hierarchical irqdomains to manage MSI interrupts
git bisect bad 52f518a3a7c2f80551a38d38be28bc9f335e713c
# first bad commit: [52f518a3a7c2f80551a38d38be28bc9f335e713c] x86/MSI: Use hierarchical irqdomains to manage MSI interrupts

lspci:
00:00.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:1422]
00:00.2 IOMMU [0806]: Advanced Micro Devices, Inc. [AMD] Device [1022:1423]
00:01.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Kaveri [Radeon R7 200 Series] [1002:130f]
00:01.1 Audio device [0403]: Advanced Micro Devices, Inc. [AMD/ATI] Device [1002:1308]
00:02.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:1424]
00:03.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:1424]
00:04.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:1424]
00:10.0 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD] FCH USB XHCI Controller [1022:7814] (rev 09)
00:10.1 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD] FCH USB XHCI Controller [1022:7814] (rev 09)
00:11.0 SATA controller [0106]: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] [1022:7801] (rev 40)
00:12.0 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD] FCH USB OHCI Controller [1022:7807] (rev 11)
00:12.2 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD] FCH USB EHCI Controller [1022:7808] (rev 11)
00:13.0 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD] FCH USB OHCI Controller [1022:7807] (rev 11)
00:13.2 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD] FCH USB EHCI Controller [1022:7808] (rev 11)
00:14.0 SMBus [0c05]: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller [1022:780b] (rev 16)
00:14.1 IDE interface [0101]: Advanced Micro Devices, Inc. [AMD] FCH IDE Controller [1022:780c]
00:14.2 Audio device [0403]: Advanced Micro Devices, Inc. [AMD] FCH Azalia Controller [1022:780d] (rev 01)
00:14.3 ISA bridge [0601]: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge [1022:780e] (rev 11)
00:14.4 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] FCH PCI Bridge [1022:780f] (rev 40)
00:14.5 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD] FCH USB OHCI Controller [1022:7809] (rev 11)
00:15.0 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Hudson PCI to PCI bridge (PCIE port 0) [1022:43a0]
00:15.2 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Hudson PCI to PCI bridge (PCIE port 2) [1022:43a2]
00:15.3 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Hudson PCI to PCI bridge (PCIE port 3) [1022:43a3]
00:18.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:141a]
00:18.1 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:141b]
00:18.2 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:141c]
00:18.3 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:141d]
00:18.4 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:141e]
00:18.5 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Device [1022:141f]
03:00.0 USB controller [0c03]: ASMedia Technology Inc. ASM1042 SuperSpeed USB Host Controller [1b21:1042]
04:00.0 Ethernet controller [0200]: Qualcomm Atheros QCA8171 Gigabit Ethernet [1969:10a1] (rev 10)

>
> Alex
>
>>
>> Thanks.
>>
>> --
>> tejun
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/