2008-08-11 20:12:50

by Maciej Rutecki

[permalink] [raw]
Subject: [2.6.26.*] boot problem (ahci/irq related?)

A have problem with booting 2.6.26* and 2.6.27-rc* kernels. Very often
booting stops on this message:

[...]
Uniform Multi-Platform E-IDE driver
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
ICH7: IDE controller (0x8086:0x27df rev 0x01) at PCI slot 0000:00:1f.1
ACPI: PCI Interrupt 0000:00:1f.1[A] -> GSI 16 (level, low) -> IRQ 16
ICH7: not 100% native mode: will probe irqs later
ICH7: IDE port disabled
ide0: BM-DMA at 0x60a0-0x60a7

afer should be:
hda: MATSHITADVD-RAM UJ-840S, ATAPI CD/DVD-ROM drive
hda: host max PIO4 wanted PIO255(auto-tune) selected PIO4
hda: MWDMA2 mode selected
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
[...]

Initially I was think, that is hardware problem. But XP and 2.6.25.8
work with _no_ problem (booting from xp, linux, knoppix, linux
firmware development kit - about 50 reboots).

So I try without CONFIG_IDE option and try libata.noacpi=1 option and
without it. It doesn't help. Next messages (from dmesg) below are
with libata.noacpi=1 option enabled and CONFIG_IDE=n (libata only use
- 2.6.26.2 kernel). Firstly i try 2.6.26.2 and very often boot stops
on this message:

http://unixy.pl/maciek/download/kernel/dupa/img_0001.jpg

I observe few times this event:
1. boot to 2.6.26.2 fails
2. power on by power button and boot XP (works OK, clean logs)
3. back to 2.6.26.2 (fail)
4. power on by power button and boot XP (works OK)
5. back to 2.6.26.2 (fail or sometimes boot normally)

(The same sequence when XP change to 2.6.25.8)



Also (1-2 times) I observed this:
ahci 0000:00:1f.2: version 3.0
ACPI: PCI Interrupt 0000:00:1f.2[B] -> GSI 17 (level, low) -> IRQ 17
ahci 0000:00:1f.2: AHCI 0001.0100 32 slots 4 ports 1.5 Gbps 0x1 impl SATA mode
ahci 0000:00:1f.2: flags: 64bit ncq ilck stag pm led clo pmp pio slum part
PCI: Setting latency timer of device 0000:00:1f.2 to 64
scsi0 : ahci
scsi1 : ahci
scsi2 : ahci
scsi3 : ahci
ata1: SATA max UDMA/133 irq_stat 0x00000040, connection status changed irq 220
ata2: DUMMY
ata3: DUMMY
ata4: DUMMY
ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ata1.00: ACPI cmd f5/00:00:00:00:00:a0 filtered out
ata1.00: ACPI cmd b1/c1:00:00:00:00:a0 filtered out
ata1.00: ACPI cmd c6/00:10:00:00:00:a0 succeeded
ata1.00: ACPI cmd ef/10:03:00:00:00:a0 filtered out
ata1.00: ATA-7: ST9120822AS, 3.ALC, max UDMA/133
ata1.00: 234441648 sectors, multi 16: LBA48 NCQ (depth 31/32)
ata1.00: ACPI cmd f5/00:00:00:00:00:a0 filtered out
ata1.00: ACPI cmd b1/c1:00:00:00:00:a0 filtered out
ata1.00: ACPI cmd c6/00:10:00:00:00:a0 succeeded
ata1.00: ACPI cmd ef/10:03:00:00:00:a0 filtered out
ata1.00: configured for UDMA/133
ata1.00: configured for UDMA/133
ata1: EH complete
scsi 0:0:0:0: Direct-Access ATA ST9120822AS 3.AL PQ: 0 ANSI: 5
sd 0:0:0:0: [sda] 234441648 512-byte hardware sectors (120034 MB)
sd 0:0:0:0: [sda] Write Protect is off
sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't
support DPO or FUA
sd 0:0:0:0: [sda] 234441648 512-byte hardware sectors (120034 MB)
sd 0:0:0:0: [sda] Write Protect is off
sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't
support DPO or FUA
sda: sda1 sda2 sda3 sda4 < sda5 sda6 sda7 >
sd 0:0:0:0: [sda] Attached SCSI disk
ata_piix 0000:00:1f.1: version 2.12
ACPI: PCI Interrupt 0000:00:1f.1[A] -> GSI 16 (level, low) -> IRQ 16
PCI: Setting latency timer of device 0000:00:1f.1 to 64
scsi4 : ata_piix
scsi5 : ata_piix
ata5: PATA max UDMA/100 cmd 0x1f0 ctl 0x3f6 bmdma 0x60a0 irq 14
ata6: PATA max UDMA/100 cmd 0x170 ctl 0x376 bmdma 0x60a8 irq 15
ata5.00: ATAPI: MATSHITADVD-RAM UJ-840S, 1.11, max MWDMA2
ata5.00: configured for MWDMA2
ata6: port disabled. ignoring.
ata5.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
ata5.00: cmd a0/01:00:00:60:00/00:00:00:00:00/a0 tag 0 dma 96 in
cdb 12 00 00 00 60 00 00 00 00 00 00 00 00 00 00 00
res 40/00:02:00:24:00/00:00:00:00:00/a0 Emask 0x4 (timeout)
ata5.00: status: { DRDY }
ata5: soft resetting link
ata5.00: configured for MWDMA2
ata5: EH complete
ata5.00: limiting speed to MWDMA1:PIO4
ata5.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
ata5.00: cmd a0/01:00:00:60:00/00:00:00:00:00/a0 tag 0 dma 96 in
cdb 12 00 00 00 60 00 00 00 00 00 00 00 00 00 00 00
res 40/00:02:00:24:00/00:00:00:00:00/a0 Emask 0x4 (timeout)
ata5.00: status: { DRDY }
ata5: soft resetting link
ata5.00: configured for MWDMA1
ata5: EH complete
ata5.00: limiting speed to PIO4
ata5.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
ata5.00: cmd a0/01:00:00:60:00/00:00:00:00:00/a0 tag 0 dma 96 in
cdb 12 00 00 00 60 00 00 00 00 00 00 00 00 00 00 00
res 40/00:02:00:24:00/00:00:00:00:00/a0 Emask 0x4 (timeout)
ata5.00: status: { DRDY }
ata5: soft resetting link
ata5.00: configured for PIO4
ata5: EH complete
scsi 4:0:0:0: CD-ROM MATSHITA DVD-RAM UJ-840S 1.11 PQ: 0 ANSI: 5

and then booting normally.


2.6.26.2:
lspci -vvv: http://unixy.pl/maciek/download/kernel/dupa/lspci.txt
interrupts: http://unixy.pl/maciek/download/kernel/dupa/przerwania.txt
config: http://unixy.pl/maciek/download/kernel/dupa/config-2.6.26.2
dmesg (normal booting):
http://unixy.pl/maciek/download/kernel/dupa/dmesg-2.6.26.2.txt

2.6.25.8:
config: http://unixy.pl/maciek/download/kernel/dupa/config-2.6.25.8
dmesg: http://unixy.pl/maciek/download/kernel/dupa/dmesg-2.6.25.8.txt

I also try update BIOS - it doesn't help.

--
Maciej Rutecki
http://www.maciek.unixy.pl


2008-08-11 20:28:46

by Ingo Molnar

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)


* Maciej Rutecki <[email protected]> wrote:

> A have problem with booting 2.6.26* and 2.6.27-rc* kernels. Very often
> booting stops on this message:
>
> [...]
> Uniform Multi-Platform E-IDE driver
> ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
> ICH7: IDE controller (0x8086:0x27df rev 0x01) at PCI slot 0000:00:1f.1
> ACPI: PCI Interrupt 0000:00:1f.1[A] -> GSI 16 (level, low) -> IRQ 16
> ICH7: not 100% native mode: will probe irqs later
> ICH7: IDE port disabled
> ide0: BM-DMA at 0x60a0-0x60a7
>
> afer should be:
> hda: MATSHITADVD-RAM UJ-840S, ATAPI CD/DVD-ROM drive
> hda: host max PIO4 wanted PIO255(auto-tune) selected PIO4
> hda: MWDMA2 mode selected
> ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
> [...]
>
> Initially I was think, that is hardware problem. But XP and 2.6.25.8
> work with _no_ problem (booting from xp, linux, knoppix, linux
> firmware development kit - about 50 reboots).

> http://unixy.pl/maciek/download/kernel/dupa/dmesg-2.6.26.2.txt
> dmesg: http://unixy.pl/maciek/download/kernel/dupa/dmesg-2.6.25.8.txt

i had a look at diff -up dmesg-2.6.25.8.txt dmesg-2.6.26.2.txt, and
these changes stood out:

ACPI: bus type pci registered
-PCI: BIOS Bug: MCFG area at f8000000 is not E820-reserved
+PCI: MCFG configuration 0: base f8000000 segment 0 buses 0 - 63
PCI: Not using MMCONFIG.

ACPI: Using IOAPIC for interrupt routing
+PCI: MCFG configuration 0: base f8000000 segment 0 buses 0 - 63
+PCI: MCFG area at f8000000 reserved in ACPI motherboard resources
+PCI: Using MMCONFIG for extended config space
ACPI: EC: GPE = 0x16, I/O: command/status = 0x66, data = 0x62

IO window: 0x00007400-0x000074ff
- PREFETCH window: 0x60000000-0x63ffffff
- MEM window: 0x64000000-0x67ffffff
+ MEM window: 0x60000000-0x63ffffff
PCI: Bridge: 0000:00:1e.0
IO window: 7000-7fff
MEM window: 0xe8100000-0xe83fffff
- PREFETCH window: 0x0000000060000000-0x0000000063ffffff
+ PREFETCH window: disabled.

so a quirk that triggered previously does not seem to trigger anymore.

find below some of the other deltas. I edited out the dmesg deltas that
i considered non-interesting - this should make it shorter/easier for
others to review the changes.

Ingo

--- dmesg-2.6.25.8.txt 2008-08-10 21:49:49.000000000 +0200
+++ dmesg-2.6.26.2.txt 2008-08-10 21:49:50.000000000 +0200
@@ -112,13 +111,14 @@ CPU: Processor Core ID: 0
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#0.
using mwait in idle threads.
-Compat vDSO mapped to ffffe000.
Checking 'hlt' instruction... OK.
-ACPI: Core revision 20070126
+ACPI: Core revision 20080321
+ENABLING IO-APIC IRQs
+..TIMER: vector=0x31 apic1=0 pin1=2 apic2=-1 pin2=-1
CPU0: Intel Genuine Intel(R) CPU T2050 @ 1.60GHz stepping 08
@@ -126,32 +126,33 @@ CPU: Processor Core ID: 1
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#1.
CPU1: Intel Genuine Intel(R) CPU T2050 @ 1.60GHz stepping 08
-Total of 2 processors activated (6387.08 BogoMIPS).
-ENABLING IO-APIC IRQs
-..TIMER: vector=0x31 apic1=0 pin1=2 apic2=-1 pin2=-1
checking TSC synchronization [CPU#0 -> CPU#1]: passed.
Brought up 2 CPUs
No dock devices found.
ACPI: bus type pci registered
-PCI: BIOS Bug: MCFG area at f8000000 is not E820-reserved
+PCI: MCFG configuration 0: base f8000000 segment 0 buses 0 - 63
PCI: Not using MMCONFIG.
PCI: PCI BIOS revision 2.10 entry at 0xf0322, last bus=32
-PCI: Using configuration type 1
+PCI: Using configuration type 1 for base access
Setting up standard PCI resources
ACPI: EC: Look up EC in DSDT
ACPI: EC: non-query interrupt received, switching to interrupt mode
ACPI: Interpreter enabled
ACPI: (supports S0 S3 S4 S5)
ACPI: Using IOAPIC for interrupt routing
+PCI: MCFG configuration 0: base f8000000 segment 0 buses 0 - 63
+PCI: MCFG area at f8000000 reserved in ACPI motherboard resources
+PCI: Using MMCONFIG for extended config space
ACPI: EC: GPE = 0x16, I/O: command/status = 0x66, data = 0x62
ACPI: EC: driver started in interrupt mode
ACPI: PCI Root Bridge [C002] (0000:00)
@@ -170,14 +171,13 @@ ACPI: PCI Interrupt Link [C111] (IRQs *1
ACPI: PCI Interrupt Link [C12A] (IRQs *10 11)
ACPI: PCI Interrupt Link [C12B] (IRQs 10 *11)
ACPI: PCI Interrupt Link [C12C] (IRQs 10 11) *0, disabled.
-ACPI Exception (pci_link-0184): AE_NOT_FOUND, Evaluating _PRS [20070126]
+ACPI Exception (pci_link-0184): AE_NOT_FOUND, Evaluating _PRS [20080321]
ACPI: Power Resource [C21A] (on)
ACPI: Power Resource [C222] (off)
ACPI: Power Resource [C321] (off)
ACPI: Power Resource [C322] (off)
ACPI: Power Resource [C323] (off)
ACPI: Power Resource [C324] (off)
-ACPI: WMI: Mapper loaded
Linux Plug and Play Support v0.97 (c) Adam Belay
pnp: PnP ACPI init
ACPI: bus type pnp registered
@@ -186,7 +186,6 @@ ACPI: ACPI bus type pnp unregistered
SCSI subsystem initialized
libata version 3.00 loaded.
PCI: Using ACPI for IRQ routing
-PCI: If a device doesn't work, try "pci=routeirq". If it helps, post a report
hpet0: at MMIO 0xfed00000, IRQs 2, 8, 0
hpet0: 3 64-bit timers, 14318180 Hz
ACPI: RTC can wake from S4
@@ -208,6 +207,7 @@ system 00:0a: iomem range 0xfed45000-0xf
system 00:0a: iomem range 0xfed90000-0xfed9afff could not be reserved
system 00:0b: iomem range 0xfeda0000-0xfedbffff could not be reserved
system 00:0b: iomem range 0xfee00000-0xfee00fff could not be reserved
+PCI: region 0000:02:06.0/9 too large: 0x0000000000000000-0x0000000003ffffff
PCI: Bridge: 0000:00:1c.0
IO window: disabled.
MEM window: 0xe8000000-0xe80fffff
@@ -223,12 +223,11 @@ PCI: Bridge: 0000:00:1c.3
PCI: Bus 3, cardbus bridge: 0000:02:06.0
IO window: 0x00007000-0x000070ff
IO window: 0x00007400-0x000074ff
- PREFETCH window: 0x60000000-0x63ffffff
- MEM window: 0x64000000-0x67ffffff
+ MEM window: 0x60000000-0x63ffffff
PCI: Bridge: 0000:00:1e.0
IO window: 7000-7fff
MEM window: 0xe8100000-0xe83fffff
- PREFETCH window: 0x0000000060000000-0x0000000063ffffff
+ PREFETCH window: disabled.
ACPI: PCI Interrupt 0000:00:1c.0[A] -> GSI 16 (level, low) -> IRQ 16
PCI: Setting latency timer of device 0000:00:1c.0 to 64
ACPI: PCI Interrupt 0000:00:1c.2[C] -> GSI 18 (level, low) -> IRQ 18
@@ -243,9 +242,11 @@ TCP established hash table entries: 1310
TCP bind hash table entries: 65536 (order: 7, 524288 bytes)
TCP: Hash tables configured (established 131072 bind 65536)
TCP reno registered
+NET: Registered protocol family 1
audit: initializing netlink socket (disabled)
-type=2000 audit(1218404726.442:1): initialized
+type=2000 audit(1218404863.342:1): initialized
highmem bounce pool size: 64 pages
+msgmni has been set to 1757
io scheduler noop registered
io scheduler anticipatory registered
io scheduler deadline registered
@@ -255,21 +256,22 @@ PCI: Setting latency timer of device 000
assign_interrupt_mode Found MSI capability
Allocate Port Service[0000:00:1c.0:pcie00]
Allocate Port Service[0000:00:1c.0:pcie02]
+Allocate Port Service[0000:00:1c.0:pcie03]
PCI: Setting latency timer of device 0000:00:1c.2 to 64
assign_interrupt_mode Found MSI capability
Allocate Port Service[0000:00:1c.2:pcie00]
Allocate Port Service[0000:00:1c.2:pcie02]
+Allocate Port Service[0000:00:1c.2:pcie03]
PCI: Setting latency timer of device 0000:00:1c.3 to 64
assign_interrupt_mode Found MSI capability
Allocate Port Service[0000:00:1c.3:pcie00]
Allocate Port Service[0000:00:1c.3:pcie02]
+Allocate Port Service[0000:00:1c.3:pcie03]
vesafb: framebuffer at 0xd0000000, mapped to 0xf8880000, using 6144k, total 7872k
vesafb: mode is 1024x768x32, linelength=4096, pages=1
vesafb: scrolling: redraw
vesafb: Truecolor: size=8:8:8:8, shift=24:16:8:0
Console: switching to colour frame buffer device 128x48
@@ -297,21 +301,8 @@ hpet_resources: 0xfed00000 is busy
intel_rng: FWH not detected
Serial: 8250/16550 driver $Revision: 1.90 $ 4 ports, IRQ sharing enabled
brd: module loaded
-Uniform Multi-Platform E-IDE driver
-ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
-ICH7: IDE controller (0x8086:0x27df rev 0x01) at PCI slot 0000:00:1f.1
-ACPI: PCI Interrupt 0000:00:1f.1[A] -> GSI 16 (level, low) -> IRQ 16
-ICH7: not 100% native mode: will probe irqs later
-ICH7: IDE port disabled
- ide0: BM-DMA at 0x60a0-0x60a7, BIOS settings: hda:DMA, hdb:PIO
-Probing IDE interface ide0...
-hda: MATSHITADVD-RAM UJ-840S, ATAPI CD/DVD-ROM drive
-hda: host max PIO4 wanted PIO255(auto-tune) selected PIO4
-hda: MWDMA2 mode selected
-ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
-hda: ATAPI 24X DVD-ROM DVD-R-RAM CD-R/RW drive, 2048kB Cache
-Uniform CD-ROM driver Revision: 3.20
Driver 'sd' needs updating - please use bus_type methods
+Driver 'sr' needs updating - please use bus_type methods
ahci 0000:00:1f.2: version 3.0
ACPI: PCI Interrupt 0000:00:1f.2[B] -> GSI 17 (level, low) -> IRQ 17
ahci 0000:00:1f.2: AHCI 0001.0100 32 slots 4 ports 1.5 Gbps 0x1 impl SATA mode
@@ -329,13 +320,13 @@ ata1: SATA link up 1.5 Gbps (SStatus 113
ata1.00: ACPI cmd f5/00:00:00:00:00:a0 filtered out
ata1.00: ACPI cmd b1/c1:00:00:00:00:a0 filtered out
ata1.00: ACPI cmd c6/00:10:00:00:00:a0 succeeded
-ata1.00: ACPI cmd ef/10:03:00:00:00:a0 succeeded
+ata1.00: ACPI cmd ef/10:03:00:00:00:a0 filtered out
ata1.00: ATA-7: ST9120822AS, 3.ALC, max UDMA/133
ata1.00: 234441648 sectors, multi 16: LBA48 NCQ (depth 31/32)
ata1.00: ACPI cmd f5/00:00:00:00:00:a0 filtered out
ata1.00: ACPI cmd b1/c1:00:00:00:00:a0 filtered out
ata1.00: ACPI cmd c6/00:10:00:00:00:a0 succeeded
-ata1.00: ACPI cmd ef/10:03:00:00:00:a0 succeeded
+ata1.00: ACPI cmd ef/10:03:00:00:00:a0 filtered out
ata1.00: configured for UDMA/133
ata1.00: configured for UDMA/133
ata1: EH complete
@@ -350,6 +341,20 @@ sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 0
sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
sda: sda1 sda2 sda3 sda4 < sda5 sda6 sda7 >
sd 0:0:0:0: [sda] Attached SCSI disk
+ata_piix 0000:00:1f.1: version 2.12
+ACPI: PCI Interrupt 0000:00:1f.1[A] -> GSI 16 (level, low) -> IRQ 16
+PCI: Setting latency timer of device 0000:00:1f.1 to 64
+scsi4 : ata_piix
+scsi5 : ata_piix
+ata5: PATA max UDMA/100 cmd 0x1f0 ctl 0x3f6 bmdma 0x60a0 irq 14
+ata6: PATA max UDMA/100 cmd 0x170 ctl 0x376 bmdma 0x60a8 irq 15
+ata5.00: ATAPI: MATSHITADVD-RAM UJ-840S, 1.11, max MWDMA2
+ata5.00: configured for MWDMA2
+ata6: port disabled. ignoring.
+scsi 4:0:0:0: CD-ROM MATSHITA DVD-RAM UJ-840S 1.11 PQ: 0 ANSI: 5
+sr0: scsi3-mmc drive: 24x/24x writer dvd-ram cd/rw xa/form2 cdda tray
+Uniform CD-ROM driver Revision: 3.20
+sr 4:0:0:0: Attached scsi CD-ROM sr0
PNP: PS/2 Controller [PNP0303:C217,PNP0f13:C218] at 0x60,0x64 irq 1,12
i8042.c: Detected active multiplexing controller, rev 1.1.
serio: i8042 KBD port at 0x60,0x64 irq 1

2008-08-11 21:00:52

by Andi Kleen

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)

On Mon, Aug 11, 2008 at 10:27:31PM +0200, Ingo Molnar wrote:
>
> * Maciej Rutecki <[email protected]> wrote:
>
> > A have problem with booting 2.6.26* and 2.6.27-rc* kernels. Very often
> > booting stops on this message:
> >
> > [...]
> > Uniform Multi-Platform E-IDE driver
> > ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
> > ICH7: IDE controller (0x8086:0x27df rev 0x01) at PCI slot 0000:00:1f.1
> > ACPI: PCI Interrupt 0000:00:1f.1[A] -> GSI 16 (level, low) -> IRQ 16
> > ICH7: not 100% native mode: will probe irqs later
> > ICH7: IDE port disabled
> > ide0: BM-DMA at 0x60a0-0x60a7
> >
> > afer should be:
> > hda: MATSHITADVD-RAM UJ-840S, ATAPI CD/DVD-ROM drive
> > hda: host max PIO4 wanted PIO255(auto-tune) selected PIO4
> > hda: MWDMA2 mode selected
> > ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
> > [...]
> >
> > Initially I was think, that is hardware problem. But XP and 2.6.25.8
> > work with _no_ problem (booting from xp, linux, knoppix, linux
> > firmware development kit - about 50 reboots).
>
> > http://unixy.pl/maciek/download/kernel/dupa/dmesg-2.6.26.2.txt
> > dmesg: http://unixy.pl/maciek/download/kernel/dupa/dmesg-2.6.25.8.txt
>
> i had a look at diff -up dmesg-2.6.25.8.txt dmesg-2.6.26.2.txt, and
> these changes stood out:

Yes post 2.6.25 changed to trust the ACPI motherboard resources
to signify working mmcfg, previously it would use the e820 table
which excluded near all motherboards which were not white listed
by PCI-ID because BIOS programmers were generally not aware
they had to reserve mcfg in e820.

You can verify this by booting with pci=nommconf. If it works
then it's likely that.

The new heuristic is apparently the same as what Vista does.

Robert Hancock (cc'ed, fullquote) is mr. mmconfig quirk.

-Andi

>
> ACPI: bus type pci registered
> -PCI: BIOS Bug: MCFG area at f8000000 is not E820-reserved
> +PCI: MCFG configuration 0: base f8000000 segment 0 buses 0 - 63
> PCI: Not using MMCONFIG.
>
> ACPI: Using IOAPIC for interrupt routing
> +PCI: MCFG configuration 0: base f8000000 segment 0 buses 0 - 63
> +PCI: MCFG area at f8000000 reserved in ACPI motherboard resources
> +PCI: Using MMCONFIG for extended config space
> ACPI: EC: GPE = 0x16, I/O: command/status = 0x66, data = 0x62
>
> IO window: 0x00007400-0x000074ff
> - PREFETCH window: 0x60000000-0x63ffffff
> - MEM window: 0x64000000-0x67ffffff
> + MEM window: 0x60000000-0x63ffffff
> PCI: Bridge: 0000:00:1e.0
> IO window: 7000-7fff
> MEM window: 0xe8100000-0xe83fffff
> - PREFETCH window: 0x0000000060000000-0x0000000063ffffff
> + PREFETCH window: disabled.
>
> so a quirk that triggered previously does not seem to trigger anymore.
>
> find below some of the other deltas. I edited out the dmesg deltas that
> i considered non-interesting - this should make it shorter/easier for
> others to review the changes.
>
> Ingo
>
> --- dmesg-2.6.25.8.txt 2008-08-10 21:49:49.000000000 +0200
> +++ dmesg-2.6.26.2.txt 2008-08-10 21:49:50.000000000 +0200
> @@ -112,13 +111,14 @@ CPU: Processor Core ID: 0
> Intel machine check architecture supported.
> Intel machine check reporting enabled on CPU#0.
> using mwait in idle threads.
> -Compat vDSO mapped to ffffe000.
> Checking 'hlt' instruction... OK.
> -ACPI: Core revision 20070126
> +ACPI: Core revision 20080321
> +ENABLING IO-APIC IRQs
> +..TIMER: vector=0x31 apic1=0 pin1=2 apic2=-1 pin2=-1
> CPU0: Intel Genuine Intel(R) CPU T2050 @ 1.60GHz stepping 08
> @@ -126,32 +126,33 @@ CPU: Processor Core ID: 1
> Intel machine check architecture supported.
> Intel machine check reporting enabled on CPU#1.
> CPU1: Intel Genuine Intel(R) CPU T2050 @ 1.60GHz stepping 08
> -Total of 2 processors activated (6387.08 BogoMIPS).
> -ENABLING IO-APIC IRQs
> -..TIMER: vector=0x31 apic1=0 pin1=2 apic2=-1 pin2=-1
> checking TSC synchronization [CPU#0 -> CPU#1]: passed.
> Brought up 2 CPUs
> No dock devices found.
> ACPI: bus type pci registered
> -PCI: BIOS Bug: MCFG area at f8000000 is not E820-reserved
> +PCI: MCFG configuration 0: base f8000000 segment 0 buses 0 - 63
> PCI: Not using MMCONFIG.
> PCI: PCI BIOS revision 2.10 entry at 0xf0322, last bus=32
> -PCI: Using configuration type 1
> +PCI: Using configuration type 1 for base access
> Setting up standard PCI resources
> ACPI: EC: Look up EC in DSDT
> ACPI: EC: non-query interrupt received, switching to interrupt mode
> ACPI: Interpreter enabled
> ACPI: (supports S0 S3 S4 S5)
> ACPI: Using IOAPIC for interrupt routing
> +PCI: MCFG configuration 0: base f8000000 segment 0 buses 0 - 63
> +PCI: MCFG area at f8000000 reserved in ACPI motherboard resources
> +PCI: Using MMCONFIG for extended config space
> ACPI: EC: GPE = 0x16, I/O: command/status = 0x66, data = 0x62
> ACPI: EC: driver started in interrupt mode
> ACPI: PCI Root Bridge [C002] (0000:00)
> @@ -170,14 +171,13 @@ ACPI: PCI Interrupt Link [C111] (IRQs *1
> ACPI: PCI Interrupt Link [C12A] (IRQs *10 11)
> ACPI: PCI Interrupt Link [C12B] (IRQs 10 *11)
> ACPI: PCI Interrupt Link [C12C] (IRQs 10 11) *0, disabled.
> -ACPI Exception (pci_link-0184): AE_NOT_FOUND, Evaluating _PRS [20070126]
> +ACPI Exception (pci_link-0184): AE_NOT_FOUND, Evaluating _PRS [20080321]
> ACPI: Power Resource [C21A] (on)
> ACPI: Power Resource [C222] (off)
> ACPI: Power Resource [C321] (off)
> ACPI: Power Resource [C322] (off)
> ACPI: Power Resource [C323] (off)
> ACPI: Power Resource [C324] (off)
> -ACPI: WMI: Mapper loaded
> Linux Plug and Play Support v0.97 (c) Adam Belay
> pnp: PnP ACPI init
> ACPI: bus type pnp registered
> @@ -186,7 +186,6 @@ ACPI: ACPI bus type pnp unregistered
> SCSI subsystem initialized
> libata version 3.00 loaded.
> PCI: Using ACPI for IRQ routing
> -PCI: If a device doesn't work, try "pci=routeirq". If it helps, post a report
> hpet0: at MMIO 0xfed00000, IRQs 2, 8, 0
> hpet0: 3 64-bit timers, 14318180 Hz
> ACPI: RTC can wake from S4
> @@ -208,6 +207,7 @@ system 00:0a: iomem range 0xfed45000-0xf
> system 00:0a: iomem range 0xfed90000-0xfed9afff could not be reserved
> system 00:0b: iomem range 0xfeda0000-0xfedbffff could not be reserved
> system 00:0b: iomem range 0xfee00000-0xfee00fff could not be reserved
> +PCI: region 0000:02:06.0/9 too large: 0x0000000000000000-0x0000000003ffffff
> PCI: Bridge: 0000:00:1c.0
> IO window: disabled.
> MEM window: 0xe8000000-0xe80fffff
> @@ -223,12 +223,11 @@ PCI: Bridge: 0000:00:1c.3
> PCI: Bus 3, cardbus bridge: 0000:02:06.0
> IO window: 0x00007000-0x000070ff
> IO window: 0x00007400-0x000074ff
> - PREFETCH window: 0x60000000-0x63ffffff
> - MEM window: 0x64000000-0x67ffffff
> + MEM window: 0x60000000-0x63ffffff
> PCI: Bridge: 0000:00:1e.0
> IO window: 7000-7fff
> MEM window: 0xe8100000-0xe83fffff
> - PREFETCH window: 0x0000000060000000-0x0000000063ffffff
> + PREFETCH window: disabled.
> ACPI: PCI Interrupt 0000:00:1c.0[A] -> GSI 16 (level, low) -> IRQ 16
> PCI: Setting latency timer of device 0000:00:1c.0 to 64
> ACPI: PCI Interrupt 0000:00:1c.2[C] -> GSI 18 (level, low) -> IRQ 18
> @@ -243,9 +242,11 @@ TCP established hash table entries: 1310
> TCP bind hash table entries: 65536 (order: 7, 524288 bytes)
> TCP: Hash tables configured (established 131072 bind 65536)
> TCP reno registered
> +NET: Registered protocol family 1
> audit: initializing netlink socket (disabled)
> -type=2000 audit(1218404726.442:1): initialized
> +type=2000 audit(1218404863.342:1): initialized
> highmem bounce pool size: 64 pages
> +msgmni has been set to 1757
> io scheduler noop registered
> io scheduler anticipatory registered
> io scheduler deadline registered
> @@ -255,21 +256,22 @@ PCI: Setting latency timer of device 000
> assign_interrupt_mode Found MSI capability
> Allocate Port Service[0000:00:1c.0:pcie00]
> Allocate Port Service[0000:00:1c.0:pcie02]
> +Allocate Port Service[0000:00:1c.0:pcie03]
> PCI: Setting latency timer of device 0000:00:1c.2 to 64
> assign_interrupt_mode Found MSI capability
> Allocate Port Service[0000:00:1c.2:pcie00]
> Allocate Port Service[0000:00:1c.2:pcie02]
> +Allocate Port Service[0000:00:1c.2:pcie03]
> PCI: Setting latency timer of device 0000:00:1c.3 to 64
> assign_interrupt_mode Found MSI capability
> Allocate Port Service[0000:00:1c.3:pcie00]
> Allocate Port Service[0000:00:1c.3:pcie02]
> +Allocate Port Service[0000:00:1c.3:pcie03]
> vesafb: framebuffer at 0xd0000000, mapped to 0xf8880000, using 6144k, total 7872k
> vesafb: mode is 1024x768x32, linelength=4096, pages=1
> vesafb: scrolling: redraw
> vesafb: Truecolor: size=8:8:8:8, shift=24:16:8:0
> Console: switching to colour frame buffer device 128x48
> @@ -297,21 +301,8 @@ hpet_resources: 0xfed00000 is busy
> intel_rng: FWH not detected
> Serial: 8250/16550 driver $Revision: 1.90 $ 4 ports, IRQ sharing enabled
> brd: module loaded
> -Uniform Multi-Platform E-IDE driver
> -ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
> -ICH7: IDE controller (0x8086:0x27df rev 0x01) at PCI slot 0000:00:1f.1
> -ACPI: PCI Interrupt 0000:00:1f.1[A] -> GSI 16 (level, low) -> IRQ 16
> -ICH7: not 100% native mode: will probe irqs later
> -ICH7: IDE port disabled
> - ide0: BM-DMA at 0x60a0-0x60a7, BIOS settings: hda:DMA, hdb:PIO
> -Probing IDE interface ide0...
> -hda: MATSHITADVD-RAM UJ-840S, ATAPI CD/DVD-ROM drive
> -hda: host max PIO4 wanted PIO255(auto-tune) selected PIO4
> -hda: MWDMA2 mode selected
> -ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
> -hda: ATAPI 24X DVD-ROM DVD-R-RAM CD-R/RW drive, 2048kB Cache
> -Uniform CD-ROM driver Revision: 3.20
> Driver 'sd' needs updating - please use bus_type methods
> +Driver 'sr' needs updating - please use bus_type methods
> ahci 0000:00:1f.2: version 3.0
> ACPI: PCI Interrupt 0000:00:1f.2[B] -> GSI 17 (level, low) -> IRQ 17
> ahci 0000:00:1f.2: AHCI 0001.0100 32 slots 4 ports 1.5 Gbps 0x1 impl SATA mode
> @@ -329,13 +320,13 @@ ata1: SATA link up 1.5 Gbps (SStatus 113
> ata1.00: ACPI cmd f5/00:00:00:00:00:a0 filtered out
> ata1.00: ACPI cmd b1/c1:00:00:00:00:a0 filtered out
> ata1.00: ACPI cmd c6/00:10:00:00:00:a0 succeeded
> -ata1.00: ACPI cmd ef/10:03:00:00:00:a0 succeeded
> +ata1.00: ACPI cmd ef/10:03:00:00:00:a0 filtered out
> ata1.00: ATA-7: ST9120822AS, 3.ALC, max UDMA/133
> ata1.00: 234441648 sectors, multi 16: LBA48 NCQ (depth 31/32)
> ata1.00: ACPI cmd f5/00:00:00:00:00:a0 filtered out
> ata1.00: ACPI cmd b1/c1:00:00:00:00:a0 filtered out
> ata1.00: ACPI cmd c6/00:10:00:00:00:a0 succeeded
> -ata1.00: ACPI cmd ef/10:03:00:00:00:a0 succeeded
> +ata1.00: ACPI cmd ef/10:03:00:00:00:a0 filtered out
> ata1.00: configured for UDMA/133
> ata1.00: configured for UDMA/133
> ata1: EH complete
> @@ -350,6 +341,20 @@ sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 0
> sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
> sda: sda1 sda2 sda3 sda4 < sda5 sda6 sda7 >
> sd 0:0:0:0: [sda] Attached SCSI disk
> +ata_piix 0000:00:1f.1: version 2.12
> +ACPI: PCI Interrupt 0000:00:1f.1[A] -> GSI 16 (level, low) -> IRQ 16
> +PCI: Setting latency timer of device 0000:00:1f.1 to 64
> +scsi4 : ata_piix
> +scsi5 : ata_piix
> +ata5: PATA max UDMA/100 cmd 0x1f0 ctl 0x3f6 bmdma 0x60a0 irq 14
> +ata6: PATA max UDMA/100 cmd 0x170 ctl 0x376 bmdma 0x60a8 irq 15
> +ata5.00: ATAPI: MATSHITADVD-RAM UJ-840S, 1.11, max MWDMA2
> +ata5.00: configured for MWDMA2
> +ata6: port disabled. ignoring.
> +scsi 4:0:0:0: CD-ROM MATSHITA DVD-RAM UJ-840S 1.11 PQ: 0 ANSI: 5
> +sr0: scsi3-mmc drive: 24x/24x writer dvd-ram cd/rw xa/form2 cdda tray
> +Uniform CD-ROM driver Revision: 3.20
> +sr 4:0:0:0: Attached scsi CD-ROM sr0
> PNP: PS/2 Controller [PNP0303:C217,PNP0f13:C218] at 0x60,0x64 irq 1,12
> i8042.c: Detected active multiplexing controller, rev 1.1.
> serio: i8042 KBD port at 0x60,0x64 irq 1
>

2008-08-11 23:49:15

by Robert Hancock

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)

Andi Kleen wrote:
> On Mon, Aug 11, 2008 at 10:27:31PM +0200, Ingo Molnar wrote:
>> * Maciej Rutecki <[email protected]> wrote:
>>
>>> A have problem with booting 2.6.26* and 2.6.27-rc* kernels. Very often
>>> booting stops on this message:
>>>
>>> [...]
>>> Uniform Multi-Platform E-IDE driver
>>> ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
>>> ICH7: IDE controller (0x8086:0x27df rev 0x01) at PCI slot 0000:00:1f.1
>>> ACPI: PCI Interrupt 0000:00:1f.1[A] -> GSI 16 (level, low) -> IRQ 16
>>> ICH7: not 100% native mode: will probe irqs later
>>> ICH7: IDE port disabled
>>> ide0: BM-DMA at 0x60a0-0x60a7
>>>
>>> afer should be:
>>> hda: MATSHITADVD-RAM UJ-840S, ATAPI CD/DVD-ROM drive
>>> hda: host max PIO4 wanted PIO255(auto-tune) selected PIO4
>>> hda: MWDMA2 mode selected
>>> ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
>>> [...]
>>>
>>> Initially I was think, that is hardware problem. But XP and 2.6.25.8
>>> work with _no_ problem (booting from xp, linux, knoppix, linux
>>> firmware development kit - about 50 reboots).
>>> http://unixy.pl/maciek/download/kernel/dupa/dmesg-2.6.26.2.txt
>>> dmesg: http://unixy.pl/maciek/download/kernel/dupa/dmesg-2.6.25.8.txt
>> i had a look at diff -up dmesg-2.6.25.8.txt dmesg-2.6.26.2.txt, and
>> these changes stood out:
>
> Yes post 2.6.25 changed to trust the ACPI motherboard resources
> to signify working mmcfg, previously it would use the e820 table
> which excluded near all motherboards which were not white listed
> by PCI-ID because BIOS programmers were generally not aware
> they had to reserve mcfg in e820.
>
> You can verify this by booting with pci=nommconf. If it works
> then it's likely that.
>
> The new heuristic is apparently the same as what Vista does.
>
> Robert Hancock (cc'ed, fullquote) is mr. mmconfig quirk.
>
> -Andi

Wouldn't hurt to try it, but I'm kind of doubtful it's that - MMCONFIG
failures don't normally get that far into bootup, and I believe we're
not using it for non-extended config space anyways - I doubt IDE is
doing extended config space access..

2008-08-12 05:50:26

by Maciej Rutecki

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)

2008/8/11 Andi Kleen <[email protected]>:
[...]
>
> You can verify this by booting with pci=nommconf. If it works
> then it's likely that.
>
[...]

It doesn't help
--
Maciej Rutecki
http://www.maciek.unixy.pl

2008-08-12 06:52:58

by Yinghai Lu

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)

On Mon, Aug 11, 2008 at 1:27 PM, Ingo Molnar <[email protected]> wrote:
>
> * Maciej Rutecki <[email protected]> wrote:
>
>> A have problem with booting 2.6.26* and 2.6.27-rc* kernels. Very often
>> booting stops on this message:
>>
>> [...]
>> Uniform Multi-Platform E-IDE driver
>> ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
>> ICH7: IDE controller (0x8086:0x27df rev 0x01) at PCI slot 0000:00:1f.1
>> ACPI: PCI Interrupt 0000:00:1f.1[A] -> GSI 16 (level, low) -> IRQ 16
>> ICH7: not 100% native mode: will probe irqs later
>> ICH7: IDE port disabled
>> ide0: BM-DMA at 0x60a0-0x60a7
>>
>> afer should be:
>> hda: MATSHITADVD-RAM UJ-840S, ATAPI CD/DVD-ROM drive
>> hda: host max PIO4 wanted PIO255(auto-tune) selected PIO4
>> hda: MWDMA2 mode selected
>> ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
>> [...]
>>
>> Initially I was think, that is hardware problem. But XP and 2.6.25.8
>> work with _no_ problem (booting from xp, linux, knoppix, linux
>> firmware development kit - about 50 reboots).
>
>> http://unixy.pl/maciek/download/kernel/dupa/dmesg-2.6.26.2.txt
>> dmesg: http://unixy.pl/maciek/download/kernel/dupa/dmesg-2.6.25.8.txt
>
> i had a look at diff -up dmesg-2.6.25.8.txt dmesg-2.6.26.2.txt, and
> these changes stood out:

> -Uniform Multi-Platform E-IDE driver
> -ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
> -ICH7: IDE controller (0x8086:0x27df rev 0x01) at PCI slot 0000:00:1f.1
> -ACPI: PCI Interrupt 0000:00:1f.1[A] -> GSI 16 (level, low) -> IRQ 16
> -ICH7: not 100% native mode: will probe irqs later
> -ICH7: IDE port disabled
> - ide0: BM-DMA at 0x60a0-0x60a7, BIOS settings: hda:DMA, hdb:PIO
> -Probing IDE interface ide0...
> -hda: MATSHITADVD-RAM UJ-840S, ATAPI CD/DVD-ROM drive
> -hda: host max PIO4 wanted PIO255(auto-tune) selected PIO4
> -hda: MWDMA2 mode selected
> -ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
> -hda: ATAPI 24X DVD-ROM DVD-R-RAM CD-R/RW drive, 2048kB Cache
> -Uniform CD-ROM driver Revision: 3.20
> Driver 'sd' needs updating - please use bus_type methods
> +Driver 'sr' needs updating - please use bus_type methods
> ahci 0000:00:1f.2: version 3.0
> ACPI: PCI Interrupt 0000:00:1f.2[B] -> GSI 17 (level, low) -> IRQ 17
> ahci 0000:00:1f.2: AHCI 0001.0100 32 slots 4 ports 1.5 Gbps 0x1 impl SATA mode

> @@ -329,13 +320,13 @@ ata1: SATA link up 1.5 Gbps (SStatus 113
> ata1.00: ACPI cmd f5/00:00:00:00:00:a0 filtered out
> ata1.00: ACPI cmd b1/c1:00:00:00:00:a0 filtered out
> ata1.00: ACPI cmd c6/00:10:00:00:00:a0 succeeded
> -ata1.00: ACPI cmd ef/10:03:00:00:00:a0 succeeded
> +ata1.00: ACPI cmd ef/10:03:00:00:00:a0 filtered out
> ata1.00: ATA-7: ST9120822AS, 3.ALC, max UDMA/133
> ata1.00: 234441648 sectors, multi 16: LBA48 NCQ (depth 31/32)
> ata1.00: ACPI cmd f5/00:00:00:00:00:a0 filtered out
> ata1.00: ACPI cmd b1/c1:00:00:00:00:a0 filtered out
> ata1.00: ACPI cmd c6/00:10:00:00:00:a0 succeeded
> -ata1.00: ACPI cmd ef/10:03:00:00:00:a0 succeeded
> +ata1.00: ACPI cmd ef/10:03:00:00:00:a0 filtered out
> ata1.00: configured for UDMA/133
> ata1.00: configured for UDMA/133
> ata1: EH complete
> @@ -350,6 +341,20 @@ sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 0
> sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
> sda: sda1 sda2 sda3 sda4 < sda5 sda6 sda7 >
> sd 0:0:0:0: [sda] Attached SCSI disk
> +ata_piix 0000:00:1f.1: version 2.12
> +ACPI: PCI Interrupt 0000:00:1f.1[A] -> GSI 16 (level, low) -> IRQ 16
> +PCI: Setting latency timer of device 0000:00:1f.1 to 64
> +scsi4 : ata_piix
> +scsi5 : ata_piix
> +ata5: PATA max UDMA/100 cmd 0x1f0 ctl 0x3f6 bmdma 0x60a0 irq 14
> +ata6: PATA max UDMA/100 cmd 0x170 ctl 0x376 bmdma 0x60a8 irq 15
> +ata5.00: ATAPI: MATSHITADVD-RAM UJ-840S, 1.11, max MWDMA2
> +ata5.00: configured for MWDMA2
> +ata6: port disabled. ignoring.
> +scsi 4:0:0:0: CD-ROM MATSHITA DVD-RAM UJ-840S 1.11 PQ: 0 ANSI: 5
> +sr0: scsi3-mmc drive: 24x/24x writer dvd-ram cd/rw xa/form2 cdda tray
> +Uniform CD-ROM driver Revision: 3.20
> +sr 4:0:0:0: Attached scsi CD-ROM sr0

looks like you are using ata driver for pata device instead of ide driver.

so could be caused by bug in ata_piix...

YH

2008-08-12 07:04:17

by Andi Kleen

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)

On Tue, Aug 12, 2008 at 07:50:09AM +0200, Maciej Rutecki wrote:
> 2008/8/11 Andi Kleen <[email protected]>:
> [...]
> >
> > You can verify this by booting with pci=nommconf. If it works
> > then it's likely that.
> >
> [...]
>
> It doesn't help

Then mmconfig is not the problem. Must be something else.

In the worst case you can always bisect I guess.

-Andi

2008-08-12 07:22:19

by Yinghai Lu

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)

On Mon, Aug 11, 2008 at 11:52 PM, Yinghai Lu <[email protected]> wrote:
> On Mon, Aug 11, 2008 at 1:27 PM, Ingo Molnar <[email protected]> wrote:
>>
>> * Maciej Rutecki <[email protected]> wrote:
>>
>>> A have problem with booting 2.6.26* and 2.6.27-rc* kernels. Very often
>>> booting stops on this message:
>>>
>>> [...]
>>> Uniform Multi-Platform E-IDE driver
>>> ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
>>> ICH7: IDE controller (0x8086:0x27df rev 0x01) at PCI slot 0000:00:1f.1
>>> ACPI: PCI Interrupt 0000:00:1f.1[A] -> GSI 16 (level, low) -> IRQ 16
>>> ICH7: not 100% native mode: will probe irqs later
>>> ICH7: IDE port disabled
>>> ide0: BM-DMA at 0x60a0-0x60a7
>>>
>>> afer should be:
>>> hda: MATSHITADVD-RAM UJ-840S, ATAPI CD/DVD-ROM drive
>>> hda: host max PIO4 wanted PIO255(auto-tune) selected PIO4
>>> hda: MWDMA2 mode selected
>>> ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
>>> [...]
>>>
>>> Initially I was think, that is hardware problem. But XP and 2.6.25.8
>>> work with _no_ problem (booting from xp, linux, knoppix, linux
>>> firmware development kit - about 50 reboots).
>>
>>> http://unixy.pl/maciek/download/kernel/dupa/dmesg-2.6.26.2.txt
>>> dmesg: http://unixy.pl/maciek/download/kernel/dupa/dmesg-2.6.25.8.txt
>>
>> i had a look at diff -up dmesg-2.6.25.8.txt dmesg-2.6.26.2.txt, and
>> these changes stood out:
>
>> -Uniform Multi-Platform E-IDE driver
>> -ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
>> -ICH7: IDE controller (0x8086:0x27df rev 0x01) at PCI slot 0000:00:1f.1
>> -ACPI: PCI Interrupt 0000:00:1f.1[A] -> GSI 16 (level, low) -> IRQ 16
>> -ICH7: not 100% native mode: will probe irqs later
>> -ICH7: IDE port disabled
>> - ide0: BM-DMA at 0x60a0-0x60a7, BIOS settings: hda:DMA, hdb:PIO
>> -Probing IDE interface ide0...
>> -hda: MATSHITADVD-RAM UJ-840S, ATAPI CD/DVD-ROM drive
>> -hda: host max PIO4 wanted PIO255(auto-tune) selected PIO4
>> -hda: MWDMA2 mode selected
>> -ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
>> -hda: ATAPI 24X DVD-ROM DVD-R-RAM CD-R/RW drive, 2048kB Cache
>> -Uniform CD-ROM driver Revision: 3.20
>> Driver 'sd' needs updating - please use bus_type methods
>> +Driver 'sr' needs updating - please use bus_type methods
>> ahci 0000:00:1f.2: version 3.0
>> ACPI: PCI Interrupt 0000:00:1f.2[B] -> GSI 17 (level, low) -> IRQ 17
>> ahci 0000:00:1f.2: AHCI 0001.0100 32 slots 4 ports 1.5 Gbps 0x1 impl SATA mode
>
>> @@ -329,13 +320,13 @@ ata1: SATA link up 1.5 Gbps (SStatus 113
>> ata1.00: ACPI cmd f5/00:00:00:00:00:a0 filtered out
>> ata1.00: ACPI cmd b1/c1:00:00:00:00:a0 filtered out
>> ata1.00: ACPI cmd c6/00:10:00:00:00:a0 succeeded
>> -ata1.00: ACPI cmd ef/10:03:00:00:00:a0 succeeded
>> +ata1.00: ACPI cmd ef/10:03:00:00:00:a0 filtered out
>> ata1.00: ATA-7: ST9120822AS, 3.ALC, max UDMA/133
>> ata1.00: 234441648 sectors, multi 16: LBA48 NCQ (depth 31/32)
>> ata1.00: ACPI cmd f5/00:00:00:00:00:a0 filtered out
>> ata1.00: ACPI cmd b1/c1:00:00:00:00:a0 filtered out
>> ata1.00: ACPI cmd c6/00:10:00:00:00:a0 succeeded
>> -ata1.00: ACPI cmd ef/10:03:00:00:00:a0 succeeded
>> +ata1.00: ACPI cmd ef/10:03:00:00:00:a0 filtered out
>> ata1.00: configured for UDMA/133
>> ata1.00: configured for UDMA/133
>> ata1: EH complete
>> @@ -350,6 +341,20 @@ sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 0
>> sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
>> sda: sda1 sda2 sda3 sda4 < sda5 sda6 sda7 >
>> sd 0:0:0:0: [sda] Attached SCSI disk
>> +ata_piix 0000:00:1f.1: version 2.12
>> +ACPI: PCI Interrupt 0000:00:1f.1[A] -> GSI 16 (level, low) -> IRQ 16
>> +PCI: Setting latency timer of device 0000:00:1f.1 to 64
>> +scsi4 : ata_piix
>> +scsi5 : ata_piix
>> +ata5: PATA max UDMA/100 cmd 0x1f0 ctl 0x3f6 bmdma 0x60a0 irq 14
>> +ata6: PATA max UDMA/100 cmd 0x170 ctl 0x376 bmdma 0x60a8 irq 15
>> +ata5.00: ATAPI: MATSHITADVD-RAM UJ-840S, 1.11, max MWDMA2
>> +ata5.00: configured for MWDMA2
>> +ata6: port disabled. ignoring.
>> +scsi 4:0:0:0: CD-ROM MATSHITA DVD-RAM UJ-840S 1.11 PQ: 0 ANSI: 5
>> +sr0: scsi3-mmc drive: 24x/24x writer dvd-ram cd/rw xa/form2 cdda tray
>> +Uniform CD-ROM driver Revision: 3.20
>> +sr 4:0:0:0: Attached scsi CD-ROM sr0
>
> looks like you are using ata driver for pata device instead of ide driver.
>
> so could be caused by bug in ata_piix...

can you enable
CONFIG_IDE and CONFIG_BLK_DEV_PIIX

like 2.6.25

YH

2008-08-12 07:35:28

by Alan

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)

> > +sr 4:0:0:0: Attached scsi CD-ROM sr0
>
> looks like you are using ata driver for pata device instead of ide driver.
>
> so could be caused by bug in ata_piix...

See the original bug report - he tried both and got the same kind of
behaviour.

2008-08-12 08:08:13

by Maciej Rutecki

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)

2008/8/12 Alan Cox <[email protected]>:
>> > +sr 4:0:0:0: Attached scsi CD-ROM sr0
>>
>> looks like you are using ata driver for pata device instead of ide driver.
>>
>> so could be caused by bug in ata_piix...
>
> See the original bug report - he tried both and got the same kind of
> behaviour.
>
>

Yes I have similar behaviour, very often kernel stops on this message:
http://unixy.pl/maciek/download/kernel/dupa/2.6.26.2_config_ide/img_0001.jpg

I added CONFIG_IDE, difference between pervious config and current:
http://unixy.pl/maciek/download/kernel/dupa/2.6.26.2_config_ide/config.diff

Full .config:
http://unixy.pl/maciek/download/kernel/dupa/2.6.26.2_config_ide/config-2.6.26.2

Dmesg (1 time works ok):
http://unixy.pl/maciek/download/kernel/dupa/2.6.26.2_config_ide/dmesg.txt

--
Maciej Rutecki
http://www.maciek.unixy.pl

2008-08-12 14:43:33

by Rafael J. Wysocki

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)

On Tuesday, 12 of August 2008, Maciej Rutecki wrote:
> 2008/8/12 Alan Cox <[email protected]>:
> >> > +sr 4:0:0:0: Attached scsi CD-ROM sr0
> >>
> >> looks like you are using ata driver for pata device instead of ide driver.
> >>
> >> so could be caused by bug in ata_piix...
> >
> > See the original bug report - he tried both and got the same kind of
> > behaviour.
> >
> >
>
> Yes I have similar behaviour, very often kernel stops on this message:
> http://unixy.pl/maciek/download/kernel/dupa/2.6.26.2_config_ide/img_0001.jpg

In both cases one of the last messages is

ACPI: PCI Interrupt 0000:00:1f.2[B] -> GSI 17 (level, low) -> IRQ 17

and that comes from AHCI, or rather from pci_enable_device() invoked by
AHCI. Then, there are no more messages from AHCI, although there should
be at least

ahci 0000:00:1f.2: AHCI 0001.0100 32 slots 4 ports 1.5 Gbps 0x1 impl SATA mode
ahci 0000:00:1f.2: flags: 64bit ncq ilck stag pm led clo pmp pio slum part
PCI: Setting latency timer of device 0000:00:1f.2 to 64

So, it seems like the enabling of AHCI interrupt causes the failure and since
that's not 100% reproducible, it indicates a timing issue.

Thanks,
Rafael

2008-08-13 07:40:41

by Maciej Rutecki

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)

I remove this options (2.6.27-rc2:
CONFIG_IDE, CONFIG_ATA_PIIX, CONFIG_ATA_SFF

from libata only I have CONFIG_ATA_ACPI and CONFIG_SATA_AHCI

boot dies on:

[...]
ACPI: PCI Interrupt 0000:00:1f.2[B] -> GSI 17 (level, low) -> IRQ 17
ahci 0000:00:1f.2: AHCI 0001.0100 32 slots 4 ports 1.5 Gbps 0x1 impl SATA mode
ahci 0000:00:1f.2: flags: 64bit ncq ilck stag pm led clo pmp pio slum part
PCI: Setting latency timer of device 0000:00:1f.2 to 64
scsi0 : ahci
scsi1 : ahci
scsi2 : ahci
scsi3 : ahci
ata1: SATA max UDMA/133 irq_stat 0x00000040, connection status changed irq 220
ata2: DUMMY
ata3: DUMMY
ata4: DUMMY

after should be:
ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ata1.00: ACPI cmd f5/00:00:00:00:00:a0 filtered out
ata1.00: ACPI cmd b1/c1:00:00:00:00:a0 filtered out
ata1.00: ACPI cmd c6/00:10:00:00:00:a0 succeeded
ata1.00: ACPI cmd ef/10:03:00:00:00:a0 filtered out
[...]

--
Maciek

2008-08-16 09:36:24

by Maciej Rutecki

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)

I test few versions of linux kernel and 2.6.26-rc3 hangs during boot,
2.6.26-rc2 (propably) is good. I wrote "propably" because bug is not
100% reproducible and must have more time to be sure. After I will try
bisect. Any suggestions?

--
Maciej Rutecki
http://www.maciek.unixy.pl

2008-08-20 18:37:22

by Maciej Rutecki

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)

2008/8/16 Maciej Rutecki <[email protected]>:
> I test few versions of linux kernel and 2.6.26-rc3 hangs during boot,
> 2.6.26-rc2 (propably) is good. I wrote "propably" because bug is not
> 100% reproducible and must have more time to be sure. After I will try
> bisect. Any suggestions?
>
> --
> Maciej Rutecki
> http://www.maciek.unixy.pl
>

I try bisect (no results), and one time I boot kernel with
initcall_debug option. Kernel stops on this:
http://unixy.pl/maciek/download/kernel/dupa/initcall_debug.jpg

I found this threads:
http://www.uwsg.indiana.edu/hypermail/linux/kernel/0805.2/1082.html
http://marc.info/?l=linux-netdev&m=121920792922949&w=2

Seems be similar (maybe I'm wrong).

--
Maciej Rutecki
http://www.maciek.unixy.pl

2008-08-21 12:42:38

by Rafael J. Wysocki

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)

On Wednesday, 20 of August 2008, Maciej Rutecki wrote:
> 2008/8/16 Maciej Rutecki <[email protected]>:
> > I test few versions of linux kernel and 2.6.26-rc3 hangs during boot,
> > 2.6.26-rc2 (propably) is good. I wrote "propably" because bug is not
> > 100% reproducible and must have more time to be sure. After I will try
> > bisect. Any suggestions?
> >
> > --
> > Maciej Rutecki
> > http://www.maciek.unixy.pl
> >
>
> I try bisect (no results), and one time I boot kernel with
> initcall_debug option. Kernel stops on this:
> http://unixy.pl/maciek/download/kernel/dupa/initcall_debug.jpg

Do you have a log where it doesn't hang?

Rafael

2008-08-22 08:58:36

by Maciej Rutecki

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)

2008/8/21 Rafael J. Wysocki <[email protected]>:

>
> Do you have a log where it doesn't hang?
>
> Rafael
>

http://unixy.pl/maciek/download/kernel/2.6.27-rc3/dmesg-2.6.27-rc3.txt

--
Maciej Rutecki
http://www.maciek.unixy.pl

2008-08-22 10:07:48

by Rafael J. Wysocki

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)

On Friday, 22 of August 2008, Maciej Rutecki wrote:
> 2008/8/21 Rafael J. Wysocki <[email protected]>:
>
> >
> > Do you have a log where it doesn't hang?
> >
> > Rafael
> >
>
> http://unixy.pl/maciek/download/kernel/2.6.27-rc3/dmesg-2.6.27-rc3.txt

This doesn't look like the same configuration as in the picture you
sent previously. In particular, the ICH7 lines are missing.

Thanks,
Rafael

2008-08-22 10:50:29

by Maciej Rutecki

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)

2008/8/22 Rafael J. Wysocki <[email protected]>:

>> http://unixy.pl/maciek/download/kernel/2.6.27-rc3/dmesg-2.6.27-rc3.txt
>
> This doesn't look like the same configuration as in the picture you
> sent previously. In particular, the ICH7 lines are missing.
>
> Thanks,
> Rafael
>

I'm sorry, I boot wrong kernel. This is correct:
http://unixy.pl/maciek/download/kernel/dupa/dmesg-2.6.26-rc2-initcall_debug.txt


--
Maciej Rutecki
http://www.maciek.unixy.pl

2008-08-24 21:23:59

by Rafael J. Wysocki

[permalink] [raw]
Subject: Re: [2.6.26.*] boot problem (ahci/irq related?)

On Friday, 22 of August 2008, Maciej Rutecki wrote:
> 2008/8/22 Rafael J. Wysocki <[email protected]>:
>
> >> http://unixy.pl/maciek/download/kernel/2.6.27-rc3/dmesg-2.6.27-rc3.txt
> >
> > This doesn't look like the same configuration as in the picture you
> > sent previously. In particular, the ICH7 lines are missing.
> >
> > Thanks,
> > Rafael
> >
>
> I'm sorry, I boot wrong kernel. This is correct:
> http://unixy.pl/maciek/download/kernel/dupa/dmesg-2.6.26-rc2-initcall_debug.txt

Thanks.

This shows that the crash happens most probably in ide_scan_pcibus(),
before the probing of interfaces.

Rafael