2008-08-20 14:02:37

by Arkadiusz Miśkiewicz

Subject: resume from ram hangs on 2.6.26+ kernels on thinkpad z60m


I'm seeing problems with resume (using s2ram) on a ThinkPad Z60m notebook with
a very fresh git kernel (built yesterday). It looks like this: the screen comes
back with X running on it and I can see the mouse pointer, but the pointer
doesn't react to movement and the SysRq keys don't respond either (I tried to
umount+sync+reboot via SysRq).

The hang doesn't always happen, but it happens quite often (often enough to
get kmail trashed several times).

The problem also happens on 2.6.26 kernels (I didn't test .26 much).

Now I'm back on 2.6.25 where the problem doesn't occur.

I didn't see any reports related to this on lkml, which is a bad thing 8)

Any ideas?

linux git config:
http://carme.pld-linux.org/~arekm/suspend-config-problematic

2.6.25.15 config:
http://carme.pld-linux.org/~arekm/suspend-config-working

--
Arkadiusz Miśkiewicz PLD/Linux Team
arekm / maven.pl http://ftp.pld-linux.org/


2008-08-21 12:40:19

by Rafael J. Wysocki

Subject: Re: resume from ram hangs on 2.6.26+ kernels on thinkpad z60m

On Wednesday, 20 of August 2008, Arkadiusz Miskiewicz wrote:
>
> I'm seeing problems with resume (using s2ram) on a ThinkPad Z60m notebook with
> a very fresh git kernel (built yesterday). It looks like this: the screen comes
> back with X running on it and I can see the mouse pointer, but the pointer
> doesn't react to movement and the SysRq keys don't respond either (I tried to
> umount+sync+reboot via SysRq).
>
> The hang doesn't always happen, but it happens quite often (often enough to
> get kmail trashed several times).
>
> The problem also happens on 2.6.26 kernels (I didn't test .26 much).
>
> Now I'm back on 2.6.25 where the problem doesn't occur.
>
> I didn't see any reports related to this on lkml, which is a bad thing 8)
>
> Any ideas?

Not really. We must have broken something in 2.6.26, but there were not too
many suspend-specific patches in there vs .25.

What s2ram options are used?

What hardware is there in the box (chipset, graphics)?

Thanks,
Rafael

2008-08-24 14:31:42

by Arkadiusz Miśkiewicz

Subject: Re: resume from ram hangs on 2.6.26+ kernels on thinkpad z60m

On Thursday 21 August 2008, Rafael J. Wysocki wrote:
> On Wednesday, 20 of August 2008, Arkadiusz Miskiewicz wrote:
> > I'm seeing problems with resume (using s2ram) on a ThinkPad Z60m notebook
> > with a very fresh git kernel (built yesterday). It looks like this: the
> > screen comes back with X running on it and I can see the mouse pointer,
> > but the pointer doesn't react to movement and the SysRq keys don't respond
> > either (I tried to umount+sync+reboot via SysRq).
> >
> > The hang doesn't always happen, but it happens quite often (often enough
> > to get kmail trashed several times).
> >
> > The problem also happens on 2.6.26 kernels (I didn't test .26 much).
> >
> > Now I'm back on 2.6.25 where the problem doesn't occur.
> >
> > I didn't see any reports related to this on lkml, which is a bad thing 8)
> >
> > Any ideas?
>
> Not really. We must have broken something in 2.6.26, but there were not
> too many suspend-specific patches in there vs .25.

OK, I'll get back to the problem when I notice something that can give us a
clue.

> What s2ram options are used?

None.

>
> What hardware is there in the box (chipset, graphics)?

[arekm@tarm ~]$ lspci
00:00.0 Host bridge: Intel Corporation Mobile 915GM/PM/GMS/910GML Express Processor to DRAM Controller (rev 03)
00:01.0 PCI bridge: Intel Corporation Mobile 915GM/PM Express PCI Express Root Port (rev 03)
00:1b.0 Audio device: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) High Definition Audio Controller (rev 03)
00:1c.0 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 1 (rev 03)
00:1c.1 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 2 (rev 03)
00:1c.2 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 3 (rev 03)
00:1c.3 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 4 (rev 03)
00:1d.0 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #1 (rev 03)
00:1d.1 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #2 (rev 03)
00:1d.2 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #3 (rev 03)
00:1d.3 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #4 (rev 03)
00:1d.7 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB2 EHCI Controller (rev 03)
00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev d3)
00:1f.0 ISA bridge: Intel Corporation 82801FBM (ICH6M) LPC Interface Bridge (rev 03)
00:1f.2 IDE interface: Intel Corporation 82801FBM (ICH6M) SATA Controller (rev 03)
00:1f.3 SMBus: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) SMBus Controller (rev 03)
01:00.0 VGA compatible controller: ATI Technologies Inc M24 1P [Radeon Mobility X600]
02:00.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5751M Gigabit Ethernet PCI Express (rev 11)
14:00.0 CardBus bridge: Ricoh Co Ltd RL5c476 II (rev b3)
14:00.1 FireWire (IEEE 1394): Ricoh Co Ltd R5C552 IEEE 1394 Controller (rev 08)
14:00.2 SD Host controller: Ricoh Co Ltd R5C822 SD/SDIO/MMC/MS/MSPro Host Adapter (rev 17)
14:00.3 System peripheral: Ricoh Co Ltd R5C592 Memory Stick Bus Host Adapter (rev 08)
14:02.0 Network controller: Intel Corporation PRO/Wireless 2915ABG Network Connection (rev 05)

[arekm@tarm ~]$ lsusb
Bus 005 Device 001: ID 1d6b:0001
Bus 004 Device 002: ID 0483:2016 SGS Thomson Microelectronics Fingerprint Reader
Bus 004 Device 001: ID 1d6b:0001
Bus 003 Device 002: ID 046d:c404 Logitech, Inc. TrackMan Wheel
Bus 003 Device 001: ID 1d6b:0001
Bus 002 Device 001: ID 1d6b:0001
Bus 001 Device 001: ID 1d6b:0002


> Thanks,
> Rafael

--
Arkadiusz Miśkiewicz PLD/Linux Team
arekm / maven.pl http://ftp.pld-linux.org/

2008-08-28 16:46:21

by Arkadiusz Miśkiewicz

Subject: Re: resume from ram hangs on 2.6.26+ kernels on thinkpad z60m

On Sunday 24 August 2008, Arkadiusz Miskiewicz wrote:

> OK, I'll get back to the problem when I notice something that can give us a
> clue.

So here I am. This is not directly the resume hang, but it may be related. The
git kernel is from ~2 days ago.

Resume succeeded, but dmesg showed tons of:

[51055.352019] Freezing of tasks failed after 20.00 seconds (1 tasks refusing to freeze):
[51055.352025] task PC stack pid father
[51055.352031] init D 97af68e4 0 1 0
[51055.352038] f7851e9c 00000086 f7848000 97af68e4 00002e6a c8559005 f7851e80 f7848274
[51055.352050] 97c8187b 00002e6a f7851e8c f72a6000 00000000 00000000 00000000 00000000
[51055.352061] f7851f10 f7851ea8 c0138788 0000000b f7851ed4 c0125912 c8559000 00000000
[51055.352073] Call Trace:
[51055.352079] [<c0138788>] refrigerator+0x5e/0x66
[51055.352092] [<c0125912>] get_signal_to_deliver+0x3e/0x2ac
[51055.352102] [<c0102b04>] do_notify_resume+0x69/0x6bc
[51055.352111] [<c01c6c88>] ? copy_to_user+0x2c/0xfc
[51055.352122] [<c0172a5a>] ? sys_select+0x124/0x147
[51055.352131] [<c010380a>] work_notifysig+0x13/0x19
[51055.352161] =======================
[51055.352165] kthreadd S 3ae87c78 0 2 0
[51055.352172] f7853fc4 00000046 f7848c60 3ae87c78 00000998 00000000 00000003 f7848ed4
[51055.352183] 3ae888d4 00000998 00001f47 e8a65540 f7853fc4 c011588a e83efd70 00001f47
[51055.352194] c034b1a8 f7853fe0 c012a543 00000000 00000001 c012a4e6 00000000 00000000
[51055.352205] Call Trace:
[51055.352209] [<c011588a>] ? complete+0x1f/0x28
[51055.352219] [<c012a543>] kthreadd+0x5d/0xee
[51055.352226] [<c012a4e6>] ? kthreadd+0x0/0xee
[51055.352233] [<c010425f>] kernel_thread_helper+0x7/0x10
[51055.352241] =======================
[51055.352244] ksoftirqd/0 S 978f04aa 0 3 2
[51055.352251] f7859fc4 00000046 f78498c0 978f04aa 00002e6a f2fe4a40 0000000a f7849b34
[51055.352262] 97a1861e 00002e6a c0120150 e78cca80 c012017f 00000000 00000000 c012017f
[51055.352273] 00000000 f7859fd0 c01201a0 00000000 f7859fe0 c012a60d c012a5d4 00000000


config: http://carme.pld-linux.org/~arekm/suspend-config-20080828
full dmesg: http://carme.pld-linux.org/~arekm/dmesg-suspend-20080828

--
Arkadiusz Miśkiewicz PLD/Linux Team
arekm / maven.pl http://ftp.pld-linux.org/

2008-08-28 19:18:48

by Rafael J. Wysocki

Subject: Re: resume from ram hangs on 2.6.26+ kernels on thinkpad z60m

On Thursday, 28 of August 2008, Arkadiusz Miskiewicz wrote:
> On Sunday 24 August 2008, Arkadiusz Miskiewicz wrote:
>
> > OK, I'll get back to the problem when I notice something that can give us a
> > clue.
>
> So here I am. This is not directly the resume hang, but it may be related. The
> git kernel is from ~2 days ago.
>
> Resume succeeded, but dmesg showed tons of:
>
> [51055.352019] Freezing of tasks failed after 20.00 seconds (1 tasks refusing
> to freeze):
> [51055.352025] task PC stack pid father
> [51055.352031] init D 97af68e4 0 1 0
> [51055.352038] f7851e9c 00000086 f7848000 97af68e4 00002e6a c8559005
> f7851e80 f7848274
> [51055.352050] 97c8187b 00002e6a f7851e8c f72a6000 00000000 00000000
> 00000000 00000000
> [51055.352061] f7851f10 f7851ea8 c0138788 0000000b f7851ed4 c0125912
> c8559000 00000000
> [51055.352073] Call Trace:
> [51055.352079] [<c0138788>] refrigerator+0x5e/0x66
> [51055.352092] [<c0125912>] get_signal_to_deliver+0x3e/0x2ac
> [51055.352102] [<c0102b04>] do_notify_resume+0x69/0x6bc
> [51055.352111] [<c01c6c88>] ? copy_to_user+0x2c/0xfc
> [51055.352122] [<c0172a5a>] ? sys_select+0x124/0x147
> [51055.352131] [<c010380a>] work_notifysig+0x13/0x19
> [51055.352161] =======================
> [51055.352165] kthreadd S 3ae87c78 0 2 0
> [51055.352172] f7853fc4 00000046 f7848c60 3ae87c78 00000998 00000000
> 00000003 f7848ed4
> [51055.352183] 3ae888d4 00000998 00001f47 e8a65540 f7853fc4 c011588a
> e83efd70 00001f47
> [51055.352194] c034b1a8 f7853fe0 c012a543 00000000 00000001 c012a4e6
> 00000000 00000000
> [51055.352205] Call Trace:
> [51055.352209] [<c011588a>] ? complete+0x1f/0x28
> [51055.352219] [<c012a543>] kthreadd+0x5d/0xee
> [51055.352226] [<c012a4e6>] ? kthreadd+0x0/0xee
> [51055.352233] [<c010425f>] kernel_thread_helper+0x7/0x10
> [51055.352241] =======================
> [51055.352244] ksoftirqd/0 S 978f04aa 0 3 2
> [51055.352251] f7859fc4 00000046 f78498c0 978f04aa 00002e6a f2fe4a40
> 0000000a f7849b34
> [51055.352262] 97a1861e 00002e6a c0120150 e78cca80 c012017f 00000000
> 00000000 c012017f
> [51055.352273] 00000000 f7859fd0 c01201a0 00000000 f7859fe0 c012a60d
> c012a5d4 00000000
>
>
> config: http://carme.pld-linux.org/~arekm/suspend-config-20080828
> full dmesg: http://carme.pld-linux.org/~arekm/dmesg-suspend-20080828

Hm, cifsd. It is kind of known to have problems.

How reproducible is this?

Rafael
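
For anyone triaging a similar log, here is a minimal sketch (not part of the
original thread) of one way to pull the freezer failures and the task-state
headers out of a saved dmesg. The function name summarize and both regular
expressions are assumptions written against the excerpt quoted above; other
kernel versions may print these lines differently.

```python
import re

# Freezer failure line. \s+ between "refusing" and "to freeze" tolerates
# the mail-wrapped form, where the message is split across two lines.
FAIL_RE = re.compile(
    r"Freezing of tasks failed after (?P<secs>\d+\.\d+) seconds "
    r"\((?P<count>\d+) tasks refusing\s+to freeze\)"
)

# Task header as it appears in the excerpt:
#   [timestamp] name state pc stack pid parent
# Hex-dump and call-trace lines do not match because their second field
# is not a single state letter.
TASK_RE = re.compile(
    r"^\[[ \t]*\d+\.\d+\][ \t]+(?P<name>\S+)[ \t]+(?P<state>[RSDTZX])[ \t]+"
    r"[0-9a-f]{8}[ \t]+\d+[ \t]+(?P<pid>\d+)[ \t]+\d+[ \t]*$",
    re.MULTILINE,
)


def summarize(log):
    """Return (failures, tasks) found in a dmesg text.

    failures: list of (seconds, number_of_tasks_refusing_to_freeze)
    tasks:    list of (name, state, pid) from the task dump headers
    """
    failures = [(float(m["secs"]), int(m["count"])) for m in FAIL_RE.finditer(log)]
    tasks = [(m["name"], m["state"], int(m["pid"])) for m in TASK_RE.finditer(log)]
    return failures, tasks
```

Calling summarize() on the text of the full dmesg gives the number of failed
freeze attempts and a list of tasks with their states. It does not by itself
identify which task refused to freeze; the call traces still have to be read
for that.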