Subject: PCIE ASPM support hangs my laptop pretty often

I've patched my kernel with the PCIe ASPM and after setting
echo powersave > /sys/module/pcie_aspm/parameters/policy

I started to experience random hangs of my laptop.
Hardware info:
Thinkpad x60s 1704-5UG
also tested on a firends X60s 1702-F6U

Kernel is 2.6.24 + these patches:
tuxonice 3.0-rc5
thinkpad_acpi v0.19-20080107
tp_smapi 0.36


--
Damjan Georgievski


2008-02-05 18:46:56

by Arjan van de Ven

[permalink] [raw]
Subject: Re: PCIE ASPM support hangs my laptop pretty often

On Tue, 5 Feb 2008 18:40:04 +0100
Дамјан Георгиевски <[email protected]> wrote:

> I've patched my kernel with the PCIe ASPM and after setting
> echo powersave > /sys/module/pcie_aspm/parameters/policy
>
> I started to experience random hangs of my laptop.
> Hardware info:
> Thinkpad x60s 1704-5UG

the x60's chipset doesn't support ASPM properly afaik... bad idea.
>


--
If you want to reach me at my work email, use [email protected]
For development, discussion and tips for power savings,
visit http://www.lesswatts.org

2008-02-05 18:55:53

by Greg KH

[permalink] [raw]
Subject: Re: PCIE ASPM support hangs my laptop pretty often

On Tue, Feb 05, 2008 at 10:46:23AM -0800, Arjan van de Ven wrote:
> On Tue, 5 Feb 2008 18:40:04 +0100
> ???????????? ?????????????????????? <[email protected]> wrote:
>
> > I've patched my kernel with the PCIe ASPM and after setting
> > echo powersave > /sys/module/pcie_aspm/parameters/policy
> >
> > I started to experience random hangs of my laptop.
> > Hardware info:
> > Thinkpad x60s 1704-5UG
>
> the x60's chipset doesn't support ASPM properly afaik... bad idea.

Well, the code shouldn't then cause a crash of the machine :)

thanks,

greg k-h

2008-02-05 20:59:52

by Kok, Auke

[permalink] [raw]
Subject: Re: PCIE ASPM support hangs my laptop pretty often

Greg KH wrote:
> On Tue, Feb 05, 2008 at 10:46:23AM -0800, Arjan van de Ven wrote:
>> On Tue, 5 Feb 2008 18:40:04 +0100
>> ???????????? ?????????????????????? <[email protected]> wrote:
>>
>>> I've patched my kernel with the PCIe ASPM and after setting
>>> echo powersave > /sys/module/pcie_aspm/parameters/policy
>>>
>>> I started to experience random hangs of my laptop.
>>> Hardware info:
>>> Thinkpad x60s 1704-5UG
>> the x60's chipset doesn't support ASPM properly afaik... bad idea.
>
> Well, the code shouldn't then cause a crash of the machine :)

The user enabled it specifically (where it is disabled by default)

ASPM has been crashing e1000(e), which is why I've recently merged a patch to
disable L1 ASPM for the onboard 82573 nic on those platforms.

this new infrastructure should work in the default configuration - enabling ASPM
where this system leaves it disabled is expected to give problems unless you know
what you are doing.

Auke

Subject: Re: PCIE ASPM support hangs my laptop pretty often

> >>> I've patched my kernel with the PCIe ASPM and after setting
> >>> echo powersave > /sys/module/pcie_aspm/parameters/policy
> >>>
> >>> I started to experience random hangs of my laptop.
> >>> Hardware info:
> >>> Thinkpad x60s 1704-5UG
> >>
> >> the x60's chipset doesn't support ASPM properly afaik... bad idea.
> >
> > Well, the code shouldn't then cause a crash of the machine :)
>
> The user enabled it specifically (where it is disabled by default)
>
> ASPM has been crashing e1000(e), which is why I've recently merged a patch
> to disable L1 ASPM for the onboard 82573 nic on those platforms.
>
> this new infrastructure should work in the default configuration - enabling
> ASPM where this system leaves it disabled is expected to give problems
> unless you know what you are doing.

In my defense, the patch documentation didn't say it doesn't work with my
hardware, nor that it hangs the chipset :) and the promised 1.3w surelly
looked nice.

So, are there any benefits of ASPM if I have it in the kernel but it's set to
default? I got the impression that "default" means not much power savings?


--
Damjan Georgievski
Free Software Macedonia

2008-02-06 00:26:38

by Kok, Auke

[permalink] [raw]
Subject: Re: PCIE ASPM support hangs my laptop pretty often

?????? ??????????? wrote:
>>>>> I've patched my kernel with the PCIe ASPM and after setting
>>>>> echo powersave > /sys/module/pcie_aspm/parameters/policy
>>>>>
>>>>> I started to experience random hangs of my laptop.
>>>>> Hardware info:
>>>>> Thinkpad x60s 1704-5UG
>>>> the x60's chipset doesn't support ASPM properly afaik... bad idea.
>>> Well, the code shouldn't then cause a crash of the machine :)
>> The user enabled it specifically (where it is disabled by default)
>>
>> ASPM has been crashing e1000(e), which is why I've recently merged a patch
>> to disable L1 ASPM for the onboard 82573 nic on those platforms.
>>
>> this new infrastructure should work in the default configuration - enabling
>> ASPM where this system leaves it disabled is expected to give problems
>> unless you know what you are doing.
>
> In my defense, the patch documentation didn't say it doesn't work with my
> hardware, nor that it hangs the chipset :) and the promised 1.3w surelly
> looked nice.
>
> So, are there any benefits of ASPM if I have it in the kernel but it's set to
> default? I got the impression that "default" means not much power savings?

did the Kconfig not come with a big fat (EXPERIMENTAL) ?

it actually depends for each device on the PCI-Express bus. Most PCI-E ports
support it but the device has the option of advertising enablement of that
capability or not.

both platform and each device on the pci-e bus are involved. some sata chipsets
work great with it, some that might not even advertise the capability... but it's
really hit and miss.

Your report is great of course, no doubt about it. I hope that people understand
that this feature can seriously break things at the bus level. It makes me feel a
lot better about the issues we had with some of our network cards and ASPM :)

once we get some feeling about how good ASPM works in the field for people we
might have to blacklist certain platforms or devices.

you could (for instance) try to see which device on your busses support ASPM and
work on per-device ASPM parameters (which is one of the things I suggested before)
so that we get an idea of which device is badly behaving with ASPM on your system.

Cheers,

Auke

2008-02-06 14:11:08

by Pavel Machek

[permalink] [raw]
Subject: Re: PCIE ASPM support hangs my laptop pretty often

On Tue 2008-02-05 16:22:55, Kok, Auke wrote:
> ?????? ??????????? wrote:
> >>>>> I've patched my kernel with the PCIe ASPM and after setting
> >>>>> echo powersave > /sys/module/pcie_aspm/parameters/policy
> >>>>>
> >>>>> I started to experience random hangs of my laptop.
> >>>>> Hardware info:
> >>>>> Thinkpad x60s 1704-5UG
> >>>> the x60's chipset doesn't support ASPM properly afaik... bad idea.
> >>> Well, the code shouldn't then cause a crash of the machine :)
> >> The user enabled it specifically (where it is disabled by default)
> >>
> >> ASPM has been crashing e1000(e), which is why I've recently merged a patch
> >> to disable L1 ASPM for the onboard 82573 nic on those platforms.
> >>
> >> this new infrastructure should work in the default configuration - enabling
> >> ASPM where this system leaves it disabled is expected to give problems
> >> unless you know what you are doing.
> >
> > In my defense, the patch documentation didn't say it doesn't work with my
> > hardware, nor that it hangs the chipset :) and the promised 1.3w surelly
> > looked nice.
> >
> > So, are there any benefits of ASPM if I have it in the kernel but it's set to
> > default? I got the impression that "default" means not much power savings?
>
> did the Kconfig not come with a big fat (EXPERIMENTAL) ?

(EXPERIMENTAL) is something different from (KNOWN BROKEN).

If we know about broken setups, we should probably be blacklisting
them.

--
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html

2008-02-06 17:27:26

by Rafael J. Wysocki

[permalink] [raw]
Subject: Re: PCIE ASPM support hangs my laptop pretty often

On Wednesday, 6 of February 2008, Pavel Machek wrote:
> On Tue 2008-02-05 16:22:55, Kok, Auke wrote:
> > ?????? ??????????? wrote:
> > >>>>> I've patched my kernel with the PCIe ASPM and after setting
> > >>>>> echo powersave > /sys/module/pcie_aspm/parameters/policy
> > >>>>>
> > >>>>> I started to experience random hangs of my laptop.
> > >>>>> Hardware info:
> > >>>>> Thinkpad x60s 1704-5UG
> > >>>> the x60's chipset doesn't support ASPM properly afaik... bad idea.
> > >>> Well, the code shouldn't then cause a crash of the machine :)
> > >> The user enabled it specifically (where it is disabled by default)
> > >>
> > >> ASPM has been crashing e1000(e), which is why I've recently merged a patch
> > >> to disable L1 ASPM for the onboard 82573 nic on those platforms.
> > >>
> > >> this new infrastructure should work in the default configuration - enabling
> > >> ASPM where this system leaves it disabled is expected to give problems
> > >> unless you know what you are doing.
> > >
> > > In my defense, the patch documentation didn't say it doesn't work with my
> > > hardware, nor that it hangs the chipset :) and the promised 1.3w surelly
> > > looked nice.
> > >
> > > So, are there any benefits of ASPM if I have it in the kernel but it's set to
> > > default? I got the impression that "default" means not much power savings?
> >
> > did the Kconfig not come with a big fat (EXPERIMENTAL) ?
>
> (EXPERIMENTAL) is something different from (KNOWN BROKEN).
>
> If we know about broken setups, we should probably be blacklisting
> them.

Well, the ASPM thing seems to break every single setup I've tested. So,
perhaps we should whitelist the working ones?

Rafael

2008-02-06 21:48:49

by Kok, Auke

[permalink] [raw]
Subject: Re: PCIE ASPM support hangs my laptop pretty often

Rafael J. Wysocki wrote:
> On Wednesday, 6 of February 2008, Pavel Machek wrote:
>> On Tue 2008-02-05 16:22:55, Kok, Auke wrote:
>>> ?????? ??????????? wrote:
>>>>>>>> I've patched my kernel with the PCIe ASPM and after setting
>>>>>>>> echo powersave > /sys/module/pcie_aspm/parameters/policy
>>>>>>>>
>>>>>>>> I started to experience random hangs of my laptop.
>>>>>>>> Hardware info:
>>>>>>>> Thinkpad x60s 1704-5UG
>>>>>>> the x60's chipset doesn't support ASPM properly afaik... bad idea.
>>>>>> Well, the code shouldn't then cause a crash of the machine :)
>>>>> The user enabled it specifically (where it is disabled by default)
>>>>>
>>>>> ASPM has been crashing e1000(e), which is why I've recently merged a patch
>>>>> to disable L1 ASPM for the onboard 82573 nic on those platforms.
>>>>>
>>>>> this new infrastructure should work in the default configuration - enabling
>>>>> ASPM where this system leaves it disabled is expected to give problems
>>>>> unless you know what you are doing.
>>>> In my defense, the patch documentation didn't say it doesn't work with my
>>>> hardware, nor that it hangs the chipset :) and the promised 1.3w surelly
>>>> looked nice.
>>>>
>>>> So, are there any benefits of ASPM if I have it in the kernel but it's set to
>>>> default? I got the impression that "default" means not much power savings?
>>> did the Kconfig not come with a big fat (EXPERIMENTAL) ?
>> (EXPERIMENTAL) is something different from (KNOWN BROKEN).
>>
>> If we know about broken setups, we should probably be blacklisting
>> them.
>
> Well, the ASPM thing seems to break every single setup I've tested. So,
> perhaps we should whitelist the working ones?

greg KH is reverting this patch alltogether in mainline, maybe the original writer
can accomodate some of the comments in the rewrite.

Auke

2008-02-06 22:00:26

by Greg KH

[permalink] [raw]
Subject: Re: PCIE ASPM support hangs my laptop pretty often

On Wed, Feb 06, 2008 at 01:46:22PM -0800, Kok, Auke wrote:
> Rafael J. Wysocki wrote:
> > On Wednesday, 6 of February 2008, Pavel Machek wrote:
> >> On Tue 2008-02-05 16:22:55, Kok, Auke wrote:
> >>> ?????? ??????????? wrote:
> >>>>>>>> I've patched my kernel with the PCIe ASPM and after setting
> >>>>>>>> echo powersave > /sys/module/pcie_aspm/parameters/policy
> >>>>>>>>
> >>>>>>>> I started to experience random hangs of my laptop.
> >>>>>>>> Hardware info:
> >>>>>>>> Thinkpad x60s 1704-5UG
> >>>>>>> the x60's chipset doesn't support ASPM properly afaik... bad idea.
> >>>>>> Well, the code shouldn't then cause a crash of the machine :)
> >>>>> The user enabled it specifically (where it is disabled by default)
> >>>>>
> >>>>> ASPM has been crashing e1000(e), which is why I've recently merged a patch
> >>>>> to disable L1 ASPM for the onboard 82573 nic on those platforms.
> >>>>>
> >>>>> this new infrastructure should work in the default configuration - enabling
> >>>>> ASPM where this system leaves it disabled is expected to give problems
> >>>>> unless you know what you are doing.
> >>>> In my defense, the patch documentation didn't say it doesn't work with my
> >>>> hardware, nor that it hangs the chipset :) and the promised 1.3w surelly
> >>>> looked nice.
> >>>>
> >>>> So, are there any benefits of ASPM if I have it in the kernel but it's set to
> >>>> default? I got the impression that "default" means not much power savings?
> >>> did the Kconfig not come with a big fat (EXPERIMENTAL) ?
> >> (EXPERIMENTAL) is something different from (KNOWN BROKEN).
> >>
> >> If we know about broken setups, we should probably be blacklisting
> >> them.
> >
> > Well, the ASPM thing seems to break every single setup I've tested. So,
> > perhaps we should whitelist the working ones?
>
> greg KH is reverting this patch alltogether in mainline, maybe the original writer
> can accomodate some of the comments in the rewrite.

It's already reverted.

thanks,

greg k-h

2008-02-19 06:05:17

by Shaohua Li

[permalink] [raw]
Subject: Re: PCIE ASPM support hangs my laptop pretty often


On Wed, 2008-02-06 at 01:40 +0800, Дамјан Георгиевски wrote:
> I've patched my kernel with the PCIe ASPM and after setting
> echo powersave > /sys/module/pcie_aspm/parameters/policy
>
> I started to experience random hangs of my laptop.
> Hardware info:
> Thinkpad x60s 1704-5UG
> also tested on a firends X60s 1702-F6U
>
> Kernel is 2.6.24 + these patches:
> tuxonice 3.0-rc5
> thinkpad_acpi v0.19-20080107
> tp_smapi 0.36
Hi,
Sorry for the long delay, I'm just back from vocation. Some devices or
chipsets don't work well with ASPM. This is one of the reason why the
default policy of the patch is per BIOS setting. Ideally drivers should
disable ASPM for specific devices, the patch provides an API
(pci_disable_link_state) for this too. As Auke suggested, you can use
the per-device interface to control separate links to see which device
is broken. If you found one, please report to driver maintainer and me,
we can disable ASPM in the driver.

Thanks,
Shaohua