Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756179Ab2K2AJD (ORCPT ); Wed, 28 Nov 2012 19:09:03 -0500 Received: from mail-lb0-f174.google.com ([209.85.217.174]:61398 "EHLO mail-lb0-f174.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754772Ab2K2AJB convert rfc822-to-8bit (ORCPT ); Wed, 28 Nov 2012 19:09:01 -0500 MIME-Version: 1.0 In-Reply-To: <00cc01cdccac$7c273d40$7475b7c0$@lucidpixels.com> References: <097501cdca7b$7e303890$7a90a9b0$@lucidpixels.com> <20121126224245.248f0901@neptune.home> <004201cdcc3a$a7538310$f5fa8930$@lucidpixels.com> <00a501cdcca3$c2638510$472a8f30$@lucidpixels.com> <00b501cdcca6$03f88af0$0be9a0d0$@lucidpixels.com> <00b701cdcca6$ff097620$fd1c6260$@lucidpixels.com> <00cc01cdccac$7c273d40$7475b7c0$@lucidpixels.com> From: Bjorn Helgaas Date: Wed, 28 Nov 2012 17:08:38 -0700 Message-ID: Subject: Re: Supermicro X9SRL-F - channel enumeration error & ACPI/firmware bug question To: Justin Piszcz Cc: =?ISO-8859-1?Q?Bruno_Pr=E9mont?= , support@supermicro.com, linux-kernel@vger.kernel.org, Dan Williams Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 8BIT Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 5171 Lines: 118 On Tue, Nov 27, 2012 at 7:35 AM, Justin Piszcz wrote: > > > -----Original Message----- > From: Justin Piszcz [mailto:jpiszcz@lucidpixels.com] > Sent: Tuesday, November 27, 2012 8:56 AM > To: 'Bjorn Helgaas' > Cc: 'Bruno Pr?mont'; support@supermicro.com; linux-kernel@vger.kernel.org; > 'Dan Williams' > Subject: RE: Supermicro X9SRL-F - channel enumeration error & ACPI/firmware > bug question > > >> It is _not_ working on the: > >> 2) Supermicro X8DTH-F (the boot drive in this system is running off a > PCI-e >> card, could the IRQ for the I/O controller be getting re-mapped and > fail?)-- >> worse case I can move the SSD from the 6.0gbpa SATA card to the > motherboard >> and see if that works, but that kind of defeats the purpose of a 6.0gbps >> SATA SSD. > > When I removed the Highpoint 2-port SATA card and plugged it into the > motherboard, the system boots (plugged the SSD into the motherboard). > So if you use a HIGHPOINT 2-PORT SATA 6.0gbps card, do NOT enable IOMMU or > it will fail to initialize the Highpoint 2-port SATA controller card! > I also tried upgrading the BIOS (of the mobo, no diff) > I also tried just leaving the SATA card in and plugging it into the > motherboard (no diff) > Removed the Highpoint 2-port SATA card and then success, it would be nice to > use that card with IOMMU support though, is it just not compatible > (marvell-problem?) or is a driver bug? Based on the pictures/etc sent > earlier? I would guess this is a core bug, but it's hard to tell without more information. If you boot with "intel_iommu=off", I would guess the Highpoint card would work (this should have the same effect as turning off CONFIG_INTEL_IOMMU). I'd like to compare the complete dmesg log for that boot with the one that fails. It sounds like it might be hard to collect the log for the failing case -- you said the boot fails when the Highpoint card is in the system even if the SSD is connected to the motherboard instead of the Highpoint card. The panic in the photo2 image looks like it's just a failure to mount the root filesystem, which is what I'd expect if we can't find the SSD. It seems like we ought to be able to *boot* with the SSD connected to the motherboard, even if the Highpoint card doesn't work. But worst-case, a video of the failing boot might be enough, especially if you can slow it down with "boot_delay=" > $ dmesg|grep -i iommu > [ 0.055134] dmar: IOMMU 0: reg_base_addr cfdfe000 ver 1:0 cap > c90780106f0462 ecap f020f6 > [ 0.055396] dmar: IOMMU 1: reg_base_addr fecfe000 ver 1:0 cap > c90780106f0462 ecap f020f6 > [ 0.760665] IOMMU 0 0xcfdfe000: using Queued invalidation > [ 0.760803] IOMMU 1 0xfecfe000: using Queued invalidation > [ 0.760937] IOMMU: Setting RMRR: > [ 0.761102] IOMMU: Setting identity map for device 0000:00:1d.0 > [0xbf7ec000 - 0xbf7fffff] > [ 0.761329] IOMMU: Setting identity map for device 0000:00:1d.1 > [0xbf7ec000 - 0xbf7fffff] > [ 0.761542] IOMMU: Setting identity map for device 0000:00:1d.2 > [0xbf7ec000 - 0xbf7fffff] > [ 0.761758] IOMMU: Setting identity map for device 0000:00:1d.7 > [0xbf7ec000 - 0xbf7fffff] > [ 0.761974] IOMMU: Setting identity map for device 0000:00:1a.0 > [0xbf7ec000 - 0xbf7fffff] > [ 0.762190] IOMMU: Setting identity map for device 0000:00:1a.1 > [0xbf7ec000 - 0xbf7fffff] > [ 0.762407] IOMMU: Setting identity map for device 0000:00:1a.2 > [0xbf7ec000 - 0xbf7fffff] > [ 0.762620] IOMMU: Setting identity map for device 0000:00:1a.7 > [0xbf7ec000 - 0xbf7fffff] > [ 0.762816] IOMMU: Setting identity map for device 0000:00:1d.0 [0xec000 > - 0xeffff] > [ 0.763010] IOMMU: Setting identity map for device 0000:00:1d.1 [0xec000 > - 0xeffff] > [ 0.763197] IOMMU: Setting identity map for device 0000:00:1d.2 [0xec000 > - 0xeffff] > [ 0.763382] IOMMU: Setting identity map for device 0000:00:1d.7 [0xec000 > - 0xeffff] > [ 0.763567] IOMMU: Setting identity map for device 0000:00:1a.0 [0xec000 > - 0xeffff] > [ 0.763749] IOMMU: Setting identity map for device 0000:00:1a.1 [0xec000 > - 0xeffff] > [ 0.763934] IOMMU: Setting identity map for device 0000:00:1a.2 [0xec000 > - 0xeffff] > [ 0.764127] IOMMU: Setting identity map for device 0000:00:1a.7 [0xec000 > - 0xeffff] > [ 0.764311] IOMMU: Prepare 0-16MiB unity mapping for LPC > [ 0.764465] IOMMU: Setting identity map for device 0000:00:1f.0 [0x0 - > 0xffffff] > > -- > > > ==> Further issues with the X9SRL-F -- does this board support ASPM or is > this a Linux/ASPM implementation issue? > [ 0.632170] pci0000:ff: ACPI _OSC support notification failed, disabling > PCIe ASPM > [ 0.632239] pci0000:ff: Unable to request _OSC control (_OSC support > mask: 0x08) I'm going to ignore this issue for the time being. I know we complain about this on many machines, and I don't know whether it's a real problem or just an overly alarming message. Bjorn -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/