2009-11-16 22:47:35

by Rafael J. Wysocki

Subject: 2.6.32-rc7-git1: Reported regressions from 2.6.31

This message contains a list of some regressions from 2.6.31, for which there
are no fixes in the mainline I know of. If any of them have been fixed already,
please let me know.

If you know of any other unresolved regressions from 2.6.31, please let me know
as well and I'll add them to the list. Also, please let me know if any of the
entries below are invalid.

Each entry on the list will also be sent in an automatic reply to this
message, with CCs to the people involved in reporting and handling the
issue.


Listed regressions statistics:

Date          Total  Pending  Unresolved
----------------------------------------
2009-11-16       84       46          41
2009-10-26       66       42          37
2009-10-12       48       31          27
2009-10-02       22       15           9


Unresolved regressions
----------------------

Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14629
Subject : Oops on i915 on 8086:a011 pine trail
Submitter : Luis R. Rodriguez <[email protected]>
Date : 2009-11-10 23:27 (7 days old)
References : http://marc.info/?l=linux-kernel&m=125789570519147&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14628
Subject : drm/ksm -> s2disk -> resume -> [drm:r100_ring_test] *ERROR* radeon: ring test failed
Submitter : Christian Hartmann <[email protected]>
Date : 2009-11-06 15:46 (11 days old)
References : http://marc.info/?l=linux-kernel&m=125752241331067&w=4
Handled-By : Jerome Glisse <[email protected]>


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14627
Subject : i915: *ERROR* Execbuf while wedged
Submitter : Michael <[email protected]>
Date : 2009-11-15 10:48 (2 days old)
References : http://lkml.org/lkml/2009/11/15/40


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14626
Subject : oops on boot starting udev
Submitter : Soeren Sonnenburg <[email protected]>
Date : 2009-11-14 10:16 (3 days old)
References : http://marc.info/?l=linux-kernel&m=125819380206800&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14625
Subject : Commit d451564 breaks ARM
Submitter : Russell King <[email protected]>
Date : 2009-11-13 15:11 (4 days old)
First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=d4515646699b6ad7b1a98ceb871296b957f3ef47
References : http://marc.info/?l=linux-kernel&m=125812520315835&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14624
Subject : ath9k: BUG kmalloc-8192: Poison overwritten
Submitter : Miles Lane <[email protected]>
Date : 2009-11-12 4:58 (5 days old)
References : http://marc.info/?l=linux-kernel&m=125800196520396&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14622
Subject : Second IDE device not found
Submitter : Zeno Davatz <[email protected]>
Date : 2009-11-11 17:31 (6 days old)
References : http://marc.info/?l=linux-kernel&m=125796105822353&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14621
Subject : specjbb2005 and aim7 regression with 2.6.32-rc kernels
Submitter : Zhang, Yanmin <[email protected]>
Date : 2009-11-06 7:38 (11 days old)
References : http://marc.info/?l=linux-kernel&m=125749310413174&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14620
Subject : WARNING: at mm/page_alloc.c:1805 __alloc_pages_nodemask
Submitter : Rogério Brito <[email protected]>
Date : 2009-11-06 23:10 (11 days old)
References : http://marc.info/?l=linux-kernel&m=125754907413892&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14619
Subject : ext3/jbd oops in journal_start
Submitter : Sage Weil <[email protected]>
Date : 2009-10-31 6:14 (17 days old)
References : http://marc.info/?l=linux-kernel&m=125696970418300&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14618
Subject : OOM killer, page fault
Submitter : Norbert Preining <[email protected]>
Date : 2009-10-30 6:32 (18 days old)
References : http://marc.info/?l=linux-kernel&m=125688434909582&w=4
Handled-By : Minchan Kim <[email protected]>


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14616
Subject : [2.6.32 regression] sata_nv: commit 6489e3262e6b188a1a009b65e8a94b7aa17645b7 slows down system boot
Submitter : Artem S. Tashkinov <[email protected]>
Date : 2009-11-16 19:49 (1 days old)


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14577
Subject : Data Corruption with Adaptec 52445, Firmware 5.2-0 (17380)
Submitter : <[email protected]>
Date : 2009-11-10 13:31 (7 days old)


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14538
Subject : Unable to associate with AP after resume since 2.6.32-rc6
Submitter : Christian Casteyde <[email protected]>
Date : 2009-11-03 22:07 (14 days old)


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14504
Subject : intermittent hibernation problem again
Submitter : Ferenc Wágner <[email protected]>
Date : 2009-10-28 23:49 (20 days old)


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14487
Subject : PANIC: early exception 08 rip 246:10 error ffffffff810251b5 cr2 0
Submitter : Justin P. Mattock <[email protected]>
Date : 2009-10-23 16:45 (25 days old)
References : http://lkml.org/lkml/2009/10/23/252


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14485
Subject : System lockup running "cat /sys/kernel/debug/dri/0/i915_regs"
Submitter : Miles Lane <[email protected]>
Date : 2009-10-26 4:00 (22 days old)
References : http://marc.info/?l=linux-kernel&m=125652968117713&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14483
Subject : Interrupts enabled after irqrouter_resume - iMac9,1
Submitter : Justin Mattock <[email protected]>
Date : 2009-10-25 19:58 (23 days old)
References : http://marc.info/?l=linux-kernel&m=125650070420168&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14482
Subject : kernel BUG at fs/dcache.c:670 +lvm +md +ext3
Submitter : Alexander Clouter <[email protected]>
Date : 2009-10-23 10:30 (25 days old)
References : http://lkml.org/lkml/2009/10/23/50


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14481
Subject : umount blocked for more than 120 seconds after USB drive removal
Submitter : Robert Hancock <[email protected]>
Date : 2009-10-21 5:26 (27 days old)
References : http://marc.info/?l=linux-kernel&m=125610280532245&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14472
Subject : EXT4 corruption
Submitter : Shawn Starr <[email protected]>
Date : 2009-10-13 2:07 (35 days old)
References : http://marc.info/?l=linux-kernel&m=125539997508256&w=4
Handled-By : Theodore Ts'o <[email protected]>


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14467
Subject : Linker errors on ia64 with NR_CPUS=4096
Submitter : Jeff Mahoney <[email protected]>
Date : 2009-10-18 22:28 (30 days old)
First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=34d76c41554a05425613d16efebb3069c4c545f0
References : http://marc.info/?l=linux-kernel&m=125590493116720&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14442
Subject : resume after hibernate: /dev/sdb drops and returns as /dev/sde
Submitter : Duncan <[email protected]>
Date : 2009-10-20 01:52 (28 days old)


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14436
Subject : Computer becomes unusable without any apparent reason
Submitter : Pitxyoki <[email protected]>
Date : 2009-10-18 18:32 (30 days old)


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14430
Subject : sync() hangs in bdi_sched_wait
Submitter : Petr Vandrovec <[email protected]>
Date : 2009-10-17 19:14 (31 days old)


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14426
Subject : CE: hpet increasing min_delta_ns flood
Submitter : Thibault Mondary <[email protected]>
Date : 2009-10-17 09:29 (31 days old)


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14415
Subject : Reboot on kernel load
Submitter : Brian Beardall <[email protected]>
Date : 2009-10-15 23:57 (33 days old)


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14406
Subject : uvcvideo stopped work on Toshiba
Submitter : okias <[email protected]>
Date : 2009-10-14 19:08 (34 days old)


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14384
Subject : tbench regression with 2.6.32-rc1
Submitter : Zhang, Yanmin <[email protected]>
Date : 2009-10-09 9:51 (39 days old)
First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=59abf02644c45f1591e1374ee7bb45dc757fcb88
References : http://marc.info/?l=linux-kernel&m=125508216713138&w=4
Handled-By : Peter Zijlstra <[email protected]>


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14383
Subject : hackbench regression with kernel 2.6.32-rc1
Submitter : Zhang, Yanmin <[email protected]>
Date : 2009-10-09 9:19 (39 days old)
First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=29cd8bae396583a2ee9a3340db8c5102acf9f6fd
References : http://marc.info/?l=linux-kernel&m=125508007510274&w=4
Handled-By : Peter Zijlstra <[email protected]>


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14376
Subject : Kernel NULL pointer dereference/ kvm subsystem
Submitter : Don Dupuis <[email protected]>
Date : 2009-10-06 14:38 (42 days old)
References : http://marc.info/?l=linux-kernel&m=125484025021737&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14373
Subject : Task blocked for more than 120 seconds
Submitter : Zeno Davatz <[email protected]>
Date : 2009-10-02 10:16 (46 days old)
References : http://marc.info/?l=linux-kernel&m=125447858618412&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14372
Subject : ath5k wireless not working after suspend-resume - eeepc
Submitter : Fabio Comolli <[email protected]>
Date : 2009-10-03 15:36 (45 days old)
References : http://lkml.org/lkml/2009/10/3/91


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14355
Subject : USB serial regression after 2.6.31.1 with Huawei E169 GSM modem
Submitter : Benjamin Herrenschmidt <[email protected]>
Date : 2009-10-10 03:07 (38 days old)
References : http://marc.info/?l=linux-kernel&m=125513456327542&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14354
Subject : Bad corruption with 2.6.32-rc1 and upwards
Submitter : Holger Freyther <[email protected]>
Date : 2009-10-09 15:42 (39 days old)


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14352
Subject : WARNING: at net/mac80211/scan.c:267
Submitter : Maciej Rutecki <[email protected]>
Date : 2009-10-08 00:30 (40 days old)
References : http://bugzilla.intellinuxwireless.org/show_bug.cgi?id=2089#c7


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14331
Subject : Radeon XPRESS 200M: System hang with radeon DRI and Fedora 10 userspace unless DRI=off
Submitter : Alex Villacis Lasso <[email protected]>
Date : 2009-10-06 00:29 (42 days old)


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14298
Subject : warning at manage.c:361 (set_irq_wake), matrix-keypad related?
Submitter : Pavel Machek <[email protected]>
Date : 2009-09-30 20:07 (48 days old)
References : http://marc.info/?l=linux-kernel&m=125434130703538&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14297
Subject : console resume broken since ba15ab0e8d
Submitter : Sascha Hauer <[email protected]>
Date : 2009-09-30 15:11 (48 days old)
References : http://marc.info/?l=linux-kernel&m=125432349404060&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14296
Subject : spitz boots but suspend/resume is broken
Submitter : Pavel Machek <[email protected]>
Date : 2009-09-30 12:06 (48 days old)
References : http://marc.info/?l=linux-kernel&m=125431244516449&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14277
Subject : Caught 8-bit read from freed memory in b43 driver at association
Submitter : Christian Casteyde <[email protected]>
Date : 2009-09-30 18:06 (48 days old)


Regressions with patches
------------------------

Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14630
Subject : sched_rt_periodic_timer vs cpu hotplug
Submitter : Heiko Carstens <[email protected]>
Date : 2009-11-11 10:18 (6 days old)
References : http://marc.info/?l=linux-kernel&m=125793470309588&w=4
Handled-By : Peter Zijlstra <[email protected]>
Patch : http://patchwork.kernel.org/patch/60250/


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14597
Subject : thinkpad-acpi: driver fails to load on old BIOS for the A31, T23-T30, X30-X31
Submitter : Henrique de Moraes Holschuh <[email protected]>
Date : 2009-11-13 20:45 (4 days old)
Handled-By : Henrique de Moraes Holschuh <[email protected]>
Patch : http://bugzilla.kernel.org/attachment.cgi?id=23770


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14480
Subject : 2 locks held by cat -- running "find /sys | head -c 4" --> system hang
Submitter : Miles Lane <[email protected]>
Date : 2009-10-20 16:11 (28 days old)
References : http://marc.info/?l=linux-kernel&m=125605511728088&w=4
Handled-By : Chris Wilson <[email protected]>
Patch : http://patchwork.kernel.org/patch/54974/


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14380
Subject : Video tearing/glitching with T400 laptops
Submitter : Theodore Ts'o <[email protected]>
Date : 2009-10-02 22:40 (46 days old)
References : http://marc.info/?l=linux-kernel&m=125452324520623&w=4
Handled-By : Jesse Barnes <[email protected]>
Patch : http://marc.info/?l=linux-kernel&m=125591495325000&w=4


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14379
Subject : ACPI Warning for _SB_.BAT0._BIF: Converted Buffer to expected String
Submitter : Justin Mattock <[email protected]>
Date : 2009-10-08 21:46 (40 days old)
First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=d9adc2e031bd22d5d9607a53a8d3b30e0b675f39
References : http://marc.info/?l=linux-kernel&m=125504031328941&w=4
Handled-By : Alexey Starikovskiy <[email protected]>
Patch : http://bugzilla.kernel.org/attachment.cgi?id=23347


For details, please visit the bug entries and follow the links given in
references.

As you can see, there is a Bugzilla entry for each of the listed regressions.
There also is a Bugzilla entry used for tracking the regressions from 2.6.31,
unresolved as well as resolved, at:

http://bugzilla.kernel.org/show_bug.cgi?id=14230

Please let me know if there are any Bugzilla entries that should be added to
that list.

Thanks,
Rafael
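
The entries in the report above follow a regular "Key : value" layout, grouped
under underlined section titles, so they can be tallied mechanically. The
following is only a minimal sketch of that idea, not the script used to
generate the report: the names `parse_report`, `FIELD` and `SAMPLE` are
hypothetical, the sample is two entries copied from the list (with the
Submitter lines left out for brevity), and it assumes that each entry is a
contiguous block of lines ending at a blank line.

```python
import re
from collections import Counter

# Two entries copied from the report; Submitter lines omitted for brevity.
SAMPLE = """\
Unresolved regressions
----------------------

Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14625
Subject : Commit d451564 breaks ARM
Date : 2009-11-13 15:11 (4 days old)
First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=d4515646699b6ad7b1a98ceb871296b957f3ef47
References : http://marc.info/?l=linux-kernel&m=125812520315835&w=4


Regressions with patches
------------------------

Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14630
Subject : sched_rt_periodic_timer vs cpu hotplug
Date : 2009-11-11 10:18 (6 days old)
References : http://marc.info/?l=linux-kernel&m=125793470309588&w=4
Patch : http://patchwork.kernel.org/patch/60250/
"""

# Field names that appear in the report entries.
FIELD = re.compile(
    r"^(Bug-Entry|Subject|Submitter|Date|First-Bad-Commit|"
    r"References|Handled-By|Patch)\s*:\s*(.*)$"
)


def parse_report(text):
    """Return one dict per Bug-Entry block, tagged with its section title."""
    entries = []
    section = None
    current = None
    lines = text.splitlines()
    for i, line in enumerate(lines):
        if not line.strip():
            # A blank line ends the current entry (assumption of this sketch).
            current = None
            continue
        nxt = lines[i + 1].strip() if i + 1 < len(lines) else ""
        if nxt and set(nxt) == {"-"}:
            # A line underlined with dashes is treated as a section title.
            section = line.strip()
            continue
        match = FIELD.match(line)
        if not match:
            continue
        key, value = match.group(1), match.group(2).strip()
        if key == "Bug-Entry":
            # The Bugzilla id is whatever follows the last "=" in the URL.
            current = {"section": section, "id": int(value.rsplit("=", 1)[1])}
            entries.append(current)
        if current is not None:
            current[key] = value
    return entries


entries = parse_report(SAMPLE)
per_section = Counter(entry["section"] for entry in entries)
for title, count in per_section.items():
    print(f"{title}: {count}")
```

Run over the full report body, the same tally should give 41 entries under
"Unresolved regressions" and 5 under "Regressions with patches". That the two
add up to the 46 in the Pending column for 2009-11-16 is an observation about
this report, not a rule stated in it.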


2009-11-16 22:47:54

by Rafael J. Wysocki

Subject: [Bug #14277] Caught 8-bit read from freed memory in b43 driver at association

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14277
Subject : Caught 8-bit read from freed memory in b43 driver at association
Submitter : Christian Casteyde <[email protected]>
Date : 2009-09-30 18:06 (48 days old)

2009-11-16 22:51:53

by Rafael J. Wysocki

Subject: [Bug #14296] spitz boots but suspend/resume is broken

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14296
Subject : spitz boots but suspend/resume is broken
Submitter : Pavel Machek <[email protected]>
Date : 2009-09-30 12:06 (48 days old)
References : http://marc.info/?l=linux-kernel&m=125431244516449&w=4

2009-11-16 22:52:07

by Rafael J. Wysocki

Subject: [Bug #14298] warning at manage.c:361 (set_irq_wake), matrix-keypad related?

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14298
Subject : warning at manage.c:361 (set_irq_wake), matrix-keypad related?
Submitter : Pavel Machek <[email protected]>
Date : 2009-09-30 20:07 (48 days old)
References : http://marc.info/?l=linux-kernel&m=125434130703538&w=4

2009-11-16 22:52:20

by Rafael J. Wysocki

Subject: [Bug #14297] console resume broken since ba15ab0e8d

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14297
Subject : console resume broken since ba15ab0e8d
Submitter : Sascha Hauer <[email protected]>
Date : 2009-09-30 15:11 (48 days old)
References : http://marc.info/?l=linux-kernel&m=125432349404060&w=4

2009-11-16 22:52:04

by Rafael J. Wysocki

Subject: [Bug #14352] WARNING: at net/mac80211/scan.c:267

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14352
Subject : WARNING: at net/mac80211/scan.c:267
Submitter : Maciej Rutecki <[email protected]>
Date : 2009-10-08 00:30 (40 days old)
References : http://bugzilla.intellinuxwireless.org/show_bug.cgi?id=2089#c7

2009-11-16 23:02:56

by Rafael J. Wysocki

Subject: [Bug #14331] Radeon XPRESS 200M: System hang with radeon DRI and Fedora 10 userspace unless DRI=off

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14331
Subject : Radeon XPRESS 200M: System hang with radeon DRI and Fedora 10 userspace unless DRI=off
Submitter : Alex Villacis Lasso <[email protected]>
Date : 2009-10-06 00:29 (42 days old)

2009-11-16 22:52:40

by Rafael J. Wysocki

Subject: [Bug #14355] USB serial regression after 2.6.31.1 with Huawei E169 GSM modem

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14355
Subject : USB serial regression after 2.6.31.1 with Huawei E169 GSM modem
Submitter : Benjamin Herrenschmidt <[email protected]>
Date : 2009-10-10 03:07 (38 days old)
References : http://marc.info/?l=linux-kernel&m=125513456327542&w=4

2009-11-16 22:53:32

by Rafael J. Wysocki

Subject: [Bug #14372] ath5k wireless not working after suspend-resume - eeepc

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14372
Subject : ath5k wireless not working after suspend-resume - eeepc
Submitter : Fabio Comolli <[email protected]>
Date : 2009-10-03 15:36 (45 days old)
References : http://lkml.org/lkml/2009/10/3/91

2009-11-16 23:02:47

by Rafael J. Wysocki

Subject: [Bug #14354] Bad corruption with 2.6.32-rc1 and upwards

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14354
Subject : Bad corruption with 2.6.32-rc1 and upwards
Submitter : Holger Freyther <[email protected]>
Date : 2009-10-09 15:42 (39 days old)

2009-11-16 23:02:08

by Rafael J. Wysocki

Subject: [Bug #14373] Task blocked for more than 120 seconds

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14373
Subject : Task blocked for more than 120 seconds
Submitter : Zeno Davatz <[email protected]>
Date : 2009-10-02 10:16 (46 days old)
References : http://marc.info/?l=linux-kernel&m=125447858618412&w=4

2009-11-16 22:52:41

by Rafael J. Wysocki

Subject: [Bug #14379] ACPI Warning for _SB_.BAT0._BIF: Converted Buffer to expected String

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14379
Subject : ACPI Warning for _SB_.BAT0._BIF: Converted Buffer to expected String
Submitter : Justin Mattock <[email protected]>
Date : 2009-10-08 21:46 (40 days old)
First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=d9adc2e031bd22d5d9607a53a8d3b30e0b675f39
References : http://marc.info/?l=linux-kernel&m=125504031328941&w=4
Handled-By : Alexey Starikovskiy <[email protected]>
Patch : http://bugzilla.kernel.org/attachment.cgi?id=23347

2009-11-16 22:52:51

by Rafael J. Wysocki

Subject: [Bug #14380] Video tearing/glitching with T400 laptops

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14380
Subject : Video tearing/glitching with T400 laptops
Submitter : Theodore Ts'o <[email protected]>
Date : 2009-10-02 22:40 (46 days old)
References : http://marc.info/?l=linux-kernel&m=125452324520623&w=4
Handled-By : Jesse Barnes <[email protected]>
Patch : http://marc.info/?l=linux-kernel&m=125591495325000&w=4

2009-11-16 23:00:49

by Rafael J. Wysocki

Subject: [Bug #14383] hackbench regression with kernel 2.6.32-rc1

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14383
Subject : hackbench regression with kernel 2.6.32-rc1
Submitter : Zhang, Yanmin <[email protected]>
Date : 2009-10-09 9:19 (39 days old)
First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=29cd8bae396583a2ee9a3340db8c5102acf9f6fd
References : http://marc.info/?l=linux-kernel&m=125508007510274&w=4
Handled-By : Peter Zijlstra <[email protected]>

2009-11-16 23:02:11

by Rafael J. Wysocki

Subject: [Bug #14376] Kernel NULL pointer dereference/ kvm subsystem

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14376
Subject : Kernel NULL pointer dereference/ kvm subsystem
Submitter : Don Dupuis <[email protected]>
Date : 2009-10-06 14:38 (42 days old)
References : http://marc.info/?l=linux-kernel&m=125484025021737&w=4

2009-11-16 22:53:13

by Rafael J. Wysocki

Subject: [Bug #14406] uvcvideo stopped work on Toshiba

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14406
Subject : uvcvideo stopped work on Toshiba
Submitter : okias <[email protected]>
Date : 2009-10-14 19:08 (34 days old)

2009-11-16 22:53:35

by Rafael J. Wysocki

Subject: [Bug #14415] Reboot on kernel load

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14415
Subject : Reboot on kernel load
Submitter : Brian Beardall <[email protected]>
Date : 2009-10-15 23:57 (33 days old)

2009-11-16 23:02:24

by Rafael J. Wysocki

Subject: [Bug #14384] tbench regression with 2.6.32-rc1

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14384
Subject : tbench regression with 2.6.32-rc1
Submitter : Zhang, Yanmin <[email protected]>
Date : 2009-10-09 9:51 (39 days old)
First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=59abf02644c45f1591e1374ee7bb45dc757fcb88
References : http://marc.info/?l=linux-kernel&m=125508216713138&w=4
Handled-By : Peter Zijlstra <[email protected]>

2009-11-16 22:53:46

by Rafael J. Wysocki

Subject: [Bug #14442] resume after hibernate: /dev/sdb drops and returns as /dev/sde

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14442
Subject : resume after hibernate: /dev/sdb drops and returns as /dev/sde
Submitter : Duncan <[email protected]>
Date : 2009-10-20 01:52 (28 days old)

2009-11-16 22:53:44

by Rafael J. Wysocki

Subject: [Bug #14467] Linker errors on ia64 with NR_CPUS=4096

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14467
Subject : Linker errors on ia64 with NR_CPUS=4096
Submitter : Jeff Mahoney <[email protected]>
Date : 2009-10-18 22:28 (30 days old)
First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=34d76c41554a05425613d16efebb3069c4c545f0
References : http://marc.info/?l=linux-kernel&m=125590493116720&w=4

2009-11-16 23:01:21

by Rafael J. Wysocki

Subject: [Bug #14436] Computer becomes unusable without any apparent reason

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14436
Subject : Computer becomes unusable without any apparent reason
Submitter : Pitxyoki <[email protected]>
Date : 2009-10-18 18:32 (30 days old)

2009-11-16 22:53:31

by Rafael J. Wysocki

Subject: [Bug #14426] CE: hpet increasing min_delta_ns flood

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14426
Subject : CE: hpet increasing min_delta_ns flood
Submitter : Thibault Mondary <[email protected]>
Date : 2009-10-17 09:29 (31 days old)

2009-11-16 23:01:56

by Rafael J. Wysocki

Subject: [Bug #14430] sync() hangs in bdi_sched_wait

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14430
Subject : sync() hangs in bdi_sched_wait
Submitter : Petr Vandrovec <[email protected]>
Date : 2009-10-17 19:14 (31 days old)

2009-11-16 22:53:47

by Rafael J. Wysocki

Subject: [Bug #14481] umount blocked for more than 120 seconds after USB drive removal

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14481
Subject : umount blocked for more than 120 seconds after USB drive removal
Submitter : Robert Hancock <[email protected]>
Date : 2009-10-21 5:26 (27 days old)
References : http://marc.info/?l=linux-kernel&m=125610280532245&w=4

2009-11-16 22:53:45

by Rafael J. Wysocki

Subject: [Bug #14472] EXT4 corruption

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14472
Subject : EXT4 corruption
Submitter : Shawn Starr <[email protected]>
Date : 2009-10-13 2:07 (35 days old)
References : http://marc.info/?l=linux-kernel&m=125539997508256&w=4
Handled-By : Theodore Ts'o <[email protected]>

2009-11-16 22:59:54

by Rafael J. Wysocki

Subject: [Bug #14480] 2 locks held by cat -- running "find /sys | head -c 4" --> system hang

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14480
Subject : 2 locks held by cat -- running "find /sys | head -c 4" --> system hang
Submitter : Miles Lane <[email protected]>
Date : 2009-10-20 16:11 (28 days old)
References : http://marc.info/?l=linux-kernel&m=125605511728088&w=4
Handled-By : Chris Wilson <[email protected]>
Patch : http://patchwork.kernel.org/patch/54974/

2009-11-16 23:01:42

by Rafael J. Wysocki

Subject: [Bug #14482] kernel BUG at fs/dcache.c:670 +lvm +md +ext3

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14482
Subject : kernel BUG at fs/dcache.c:670 +lvm +md +ext3
Submitter : Alexander Clouter <[email protected]>
Date : 2009-10-23 10:30 (25 days old)
References : http://lkml.org/lkml/2009/10/23/50

2009-11-16 22:53:52

by Rafael J. Wysocki

Subject: [Bug #14485] System lockup running "cat /sys/kernel/debug/dri/0/i915_regs"

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14485
Subject : System lockup running "cat /sys/kernel/debug/dri/0/i915_regs"
Submitter : Miles Lane <[email protected]>
Date : 2009-10-26 4:00 (22 days old)
References : http://marc.info/?l=linux-kernel&m=125652968117713&w=4

2009-11-16 22:58:01

by Rafael J. Wysocki

Subject: [Bug #14504] intermittent hibernation problem again

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14504
Subject : intermittent hibernation problem again
Submitter : Ferenc Wágner <[email protected]>
Date : 2009-10-28 23:49 (20 days old)

2009-11-16 22:58:54

by Rafael J. Wysocki

Subject: [Bug #14487] PANIC: early exception 08 rip 246:10 error ffffffff810251b5 cr2 0

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14487
Subject : PANIC: early exception 08 rip 246:10 error ffffffff810251b5 cr2 0
Submitter : Justin P. Mattock <[email protected]>
Date : 2009-10-23 16:45 (25 days old)
References : http://lkml.org/lkml/2009/10/23/252

2009-11-16 22:59:10

by Rafael J. Wysocki

Subject: [Bug #14483] Interrupts enabled after irqrouter_resume - iMac9,1

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14483
Subject : Interrupts enabled after irqrouter_resume - iMac9,1
Submitter : Justin Mattock <[email protected]>
Date : 2009-10-25 19:58 (23 days old)
References : http://marc.info/?l=linux-kernel&m=125650070420168&w=4

2009-11-16 22:54:05

by Rafael J. Wysocki

Subject: [Bug #14619] ext3/jbd oops in journal_start

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14619
Subject : ext3/jbd oops in journal_start
Submitter : Sage Weil <[email protected]>
Date : 2009-10-31 6:14 (17 days old)
References : http://marc.info/?l=linux-kernel&m=125696970418300&w=4

2009-11-16 22:54:04

by Rafael J. Wysocki

Subject: [Bug #14577] Data Corruption with Adaptec 52445, Firmware 5.2-0 (17380)

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14577
Subject : Data Corruption with Adaptec 52445, Firmware 5.2-0 (17380)
Submitter : <[email protected]>
Date : 2009-11-10 13:31 (7 days old)

2009-11-16 22:54:01

by Rafael J. Wysocki

Subject: [Bug #14616] [2.6.32 regression] sata_nv: commit 6489e3262e6b188a1a009b65e8a94b7aa17645b7 slows down system boot

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14616
Subject : [2.6.32 regression] sata_nv: commit 6489e3262e6b188a1a009b65e8a94b7aa17645b7 slows down system boot
Submitter : Artem S. Tashkinov <[email protected]>
Date : 2009-11-16 19:49 (1 days old)

2009-11-16 22:57:37

by Rafael J. Wysocki

Subject: [Bug #14618] OOM killer, page fault

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14618
Subject : OOM killer, page fault
Submitter : Norbert Preining <[email protected]>
Date : 2009-10-30 6:32 (18 days old)
References : http://marc.info/?l=linux-kernel&m=125688434909582&w=4
Handled-By : Minchan Kim <[email protected]>

2009-11-16 22:57:37

by Rafael J. Wysocki

Subject: [Bug #14597] thinkpad-acpi: driver fails to load on old BIOS for the A31, T23-T30, X30-X31

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14597
Subject : thinkpad-acpi: driver fails to load on old BIOS for the A31, T23-T30, X30-X31
Submitter : Henrique de Moraes Holschuh <[email protected]>
Date : 2009-11-13 20:45 (4 days old)
Handled-By : Henrique de Moraes Holschuh <[email protected]>
Patch : http://bugzilla.kernel.org/attachment.cgi?id=23770

2009-11-16 22:58:03

by Rafael J. Wysocki

Subject: [Bug #14538] Unable to associate with AP after resume since 2.6.32-rc6

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14538
Subject : Unable to associate with AP after resume since 2.6.32-rc6
Submitter : Christian Casteyde <[email protected]>
Date : 2009-11-03 22:07 (14 days old)

2009-11-16 22:54:30

by Rafael J. Wysocki

Subject: [Bug #14621] specjbb2005 and aim7 regression with 2.6.32-rc kernels

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14621
Subject : specjbb2005 and aim7 regression with 2.6.32-rc kernels
Submitter : Zhang, Yanmin <[email protected]>
Date : 2009-11-06 7:38 (11 days old)
References : http://marc.info/?l=linux-kernel&m=125749310413174&w=4

2009-11-16 22:56:11

by Rafael J. Wysocki

Subject: [Bug #14622] Second IDE device not found

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14622
Subject : Second IDE device not found
Submitter : Zeno Davatz <[email protected]>
Date : 2009-11-11 17:31 (6 days old)
References : http://marc.info/?l=linux-kernel&m=125796105822353&w=4

2009-11-16 22:56:08

by Rafael J. Wysocki

Subject: [Bug #14620] WARNING: at mm/page_alloc.c:1805 __alloc_pages_nodemask

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14620
Subject : WARNING: at mm/page_alloc.c:1805 __alloc_pages_nodemask
Submitter : Rogério Brito <[email protected]>
Date : 2009-11-06 23:10 (11 days old)
References : http://marc.info/?l=linux-kernel&m=125754907413892&w=4

2009-11-16 22:54:37

by Rafael J. Wysocki

Subject: [Bug #14625] Commit d451564 breaks ARM

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14625
Subject : Commit d451564 breaks ARM
Submitter : Russell King <[email protected]>
Date : 2009-11-13 15:11 (4 days old)
First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=d4515646699b6ad7b1a98ceb871296b957f3ef47
References : http://marc.info/?l=linux-kernel&m=125812520315835&w=4

2009-11-16 22:55:17

by Rafael J. Wysocki

Subject: [Bug #14627] i915: *ERROR* Execbuf while wedged

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14627
Subject : i915: *ERROR* Execbuf while wedged
Submitter : Michael <[email protected]>
Date : 2009-11-15 10:48 (2 days old)
References : http://lkml.org/lkml/2009/11/15/40

2009-11-16 22:56:09

by Rafael J. Wysocki

Subject: [Bug #14626] oops on boot starting udev

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14626
Subject : oops on boot starting udev
Submitter : Soeren Sonnenburg <[email protected]>
Date : 2009-11-14 10:16 (3 days old)
References : http://marc.info/?l=linux-kernel&m=125819380206800&w=4

2009-11-16 22:56:35

by Rafael J. Wysocki

Subject: [Bug #14624] ath9k: BUG kmalloc-8192: Poison overwritten

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14624
Subject : ath9k: BUG kmalloc-8192: Poison overwritten
Submitter : Miles Lane <[email protected]>
Date : 2009-11-12 4:58 (5 days old)
References : http://marc.info/?l=linux-kernel&m=125800196520396&w=4

2009-11-16 22:54:34

by Rafael J. Wysocki

Subject: [Bug #14628] drm/ksm -> s2disk -> resume -> [drm:r100_ring_test] *ERROR* radeon: ring test failed

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14628
Subject : drm/ksm -> s2disk -> resume -> [drm:r100_ring_test] *ERROR* radeon: ring test failed
Submitter : Christian Hartmann <[email protected]>
Date : 2009-11-06 15:46 (11 days old)
References : http://marc.info/?l=linux-kernel&m=125752241331067&w=4
Handled-By : Jerome Glisse <[email protected]>

2009-11-16 22:54:51

by Rafael J. Wysocki

Subject: [Bug #14629] Oops on i915 on 8086:a011 pine trail

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14629
Subject : Oops on i915 on 8086:a011 pine trail
Submitter : Luis R. Rodriguez <[email protected]>
Date : 2009-11-10 23:27 (7 days old)
References : http://marc.info/?l=linux-kernel&m=125789570519147&w=4

2009-11-16 22:55:52

by Rafael J. Wysocki

Subject: [Bug #14630] sched_rt_periodic_timer vs cpu hotplug

This message has been generated automatically as a part of a report
of recent regressions.

The following bug entry is on the current list of known regressions
from 2.6.31. Please verify if it still should be listed and let me know
(either way).


Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14630
Subject : sched_rt_periodic_timer vs cpu hotplug
Submitter : Heiko Carstens <[email protected]>
Date : 2009-11-11 10:18 (6 days old)
References : http://marc.info/?l=linux-kernel&m=125793470309588&w=4
Handled-By : Peter Zijlstra <[email protected]>
Patch : http://patchwork.kernel.org/patch/60250/

2009-11-16 22:55:23

by Jiri Kosina

Subject: Re: [Bug #14467] Linker errors on ia64 with NR_CPUS=4096

On Mon, 16 Nov 2009, Rafael J. Wysocki wrote:

> This message has been generated automatically as a part of a report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.31. Please verify if it still should be listed and let me know
> (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14467
> Subject : Linker errors on ia64 with NR_CPUS=4096
> Submitter : Jeff Mahoney <[email protected]>
> Date : 2009-10-18 22:28 (30 days old)
> First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=34d76c41554a05425613d16efebb3069c4c545f0
> References : http://marc.info/?l=linux-kernel&m=125590493116720&w=4

Fixed by the following two commits


commit 4a6cc4bd32e580722882115d4c8b964d732c11e4
Author: Jiri Kosina <[email protected]>
Date: Thu Oct 29 00:26:00 2009 +0900

sched: move rq_weight data array out of .percpu

commit 403a91b1659cb149dbddc5885f892734ae4542d8
Author: Jiri Kosina <[email protected]>
Date: Thu Oct 29 00:25:59 2009 +0900

percpu: allow pcpu_alloc() to be called with IRQs off

--
Jiri Kosina
SUSE Labs, Novell Inc.

2009-11-16 23:05:26

by Oliver Neukum

Subject: Re: [Bug #14355] USB serial regression after 2.6.31.1 with Huawei E169 GSM modem

On Monday, 16 November 2009 23:37:39, Rafael J. Wysocki wrote:
> This message has been generated automatically as a part of a report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.31. Please verify if it still should be listed and let me know
> (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14355
> Subject : USB serial regression after 2.6.31.1 with Huawei E169 GSM modem
> Submitter : Benjamin Herrenschmidt <[email protected]>
> Date : 2009-10-10 03:07 (38 days old)
> References : http://marc.info/?l=linux-kernel&m=125513456327542&w=4

Benjamin has fixed this bug.

Regards
Oliver

2009-11-16 23:09:30

by Rafael J. Wysocki

Subject: Re: [Bug #14467] Linker errors on ia64 with NR_CPUS=4096

On Monday 16 November 2009, Jiri Kosina wrote:
> On Mon, 16 Nov 2009, Rafael J. Wysocki wrote:
>
> > This message has been generated automatically as a part of a report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.31. Please verify if it still should be listed and let me know
> > (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14467
> > Subject : Linker errors on ia64 with NR_CPUS=4096
> > Submitter : Jeff Mahoney <[email protected]>
> > Date : 2009-10-18 22:28 (30 days old)
> > First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=34d76c41554a05425613d16efebb3069c4c545f0
> > References : http://marc.info/?l=linux-kernel&m=125590493116720&w=4
>
> Fixed by the following two commits
>
>
> commit 4a6cc4bd32e580722882115d4c8b964d732c11e4
> Author: Jiri Kosina <[email protected]>
> Date: Thu Oct 29 00:26:00 2009 +0900
>
> sched: move rq_weight data array out of .percpu
>
> commit 403a91b1659cb149dbddc5885f892734ae4542d8
> Author: Jiri Kosina <[email protected]>
> Date: Thu Oct 29 00:25:59 2009 +0900
>
> percpu: allow pcpu_alloc() to be called with IRQs off

Thanks, closed.

Rafael

2009-11-16 23:10:53

by Rafael J. Wysocki

Subject: Re: [Bug #14355] USB serial regression after 2.6.31.1 with Huawei E169 GSM modem

On Tuesday 17 November 2009, Oliver Neukum wrote:
> On Monday, 16 November 2009 23:37:39, Rafael J. Wysocki wrote:
> > This message has been generated automatically as a part of a report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.31. Please verify if it still should be listed and let me know
> > (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14355
> > Subject : USB serial regression after 2.6.31.1 with Huawei E169 GSM modem
> > Submitter : Benjamin Herrenschmidt <[email protected]>
> > Date : 2009-10-10 03:07 (38 days old)
> > References : http://marc.info/?l=linux-kernel&m=125513456327542&w=4
>
> Benjamin has fixed this bug.

Thanks, closed.

Rafael

2009-11-16 23:30:54

by Andy Lutomirski

Subject: Re: [Bug #14472] EXT4 corruption

I think this was the journal checksumming bug, which is fixed.

-Andy



On Nov 16, 2009, at 5:37 PM, "Rafael J. Wysocki" <[email protected]> wrote:

> This message has been generated automatically as a part of a report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.31. Please verify if it still should be listed and let me
> know
> (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14472
> Subject : EXT4 corruption
> Submitter : Shawn Starr <[email protected]>
> Date : 2009-10-13 2:07 (35 days old)
> References : http://marc.info/?l=linux-kernel&m=125539997508256&w=4
> Handled-By : Theodore Tso <[email protected]>
>
>

2009-11-17 00:19:07

by Justin P. Mattock

Subject: Re: [Bug #14379] ACPI Warning for _SB_.BAT0._BIF: Converted Buffer to expected String

Rafael J. Wysocki wrote:
> This message has been generated automatically as a part of a report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.31. Please verify if it still should be listed and let me know
> (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14379
> Subject : ACPI Warning for _SB_.BAT0._BIF: Converted Buffer to expected String
> Submitter : Justin Mattock<[email protected]>
> Date : 2009-10-08 21:46 (40 days old)
> First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=d9adc2e031bd22d5d9607a53a8d3b30e0b675f39
> References : http://marc.info/?l=linux-kernel&m=125504031328941&w=4
> Handled-By : Alexey Starikovskiy<[email protected]>
> Patch : http://bugzilla.kernel.org/attachment.cgi?id=23347
>
>
>
>
OK, I just pulled the latest tree to check, and
the warning message is still there.
So yes, this bug report should stay open
until this is fixed.

Justin P. Mattock

2009-11-17 00:20:57

by Justin P. Mattock

Subject: Re: [Bug #14483] Interrupts enabled after irqrouter_resume - iMac9,1

Rafael J. Wysocki wrote:
> This message has been generated automatically as a part of a report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.31. Please verify if it still should be listed and let me know
> (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14483
> Subject : Interrupts enabled after irqrouter_resume - iMac9,1
> Submitter : Justin Mattock<[email protected]>
> Date : 2009-10-25 19:58 (23 days old)
> References : http://marc.info/?l=linux-kernel&m=125650070420168&w=4
>
>
>
>
I have to say that doing a bisect on this
was more work than I had anticipated.
(I still need to go through and revert commits
to see if I can find exactly where it broke.)

So yes this should still be open.

Justin P. Mattock

2009-11-17 00:39:40

by Justin P. Mattock

Subject: Re: [Bug #14487] PANIC: early exception 08 rip 246:10 error ffffffff810251b5 cr2 0

Rafael J. Wysocki wrote:
> This message has been generated automatically as a part of a report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.31. Please verify if it still should be listed and let me know
> (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14487
> Subject : PANIC: early exception 08 rip 246:10 error ffffffff810251b5 cr2 0
> Submitter : Justin P. Mattock<[email protected]>
> Date : 2009-10-23 16:45 (25 days old)
> References : http://lkml.org/lkml/2009/10/23/252
>
>
>
>
This one has me a bit dazed. After looking into the issue
I did find a workaround (keep in mind it's not pretty):
commenting out set_fixmap_nocache and
init_ohci1394_reset_and_init_dma.
(By doing so I was able to boot both machines and
run early debugging in case a problem occurs.)

As to what might be happening: after going through as
much as I can comprehend, the only thing that came to mind was
from reading fixmap.h. The comments there state that vsyscalls
only cover 32-bit, and that there needs to be another set
for 64-bit, leading me to believe that this is what I might be hitting.
(My system is pure 64-bit, with no 32-bit support at all.)

At this point I think I need somebody to give me some info on this.
If the 64-bit issue mentioned above is the case, then we can probably
close this and leave it up to the x86_64 builders to create a 64-bit
call for this whenever they get to it. (The main thing is that I'm able to
run DMA early in case of an emergency.)

Justin P. Mattock

2009-11-17 01:18:03

by Greg KH

Subject: Re: [Bug #14626] oops on boot starting udev

On Mon, Nov 16, 2009 at 11:37:48PM +0100, Rafael J. Wysocki wrote:
> This message has been generated automatically as a part of a report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.31. Please verify if it still should be listed and let me know
> (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14626
> Subject : oops on boot starting udev
> Submitter : Soeren Sonnenburg <[email protected]>
> Date : 2009-11-14 10:16 (3 days old)
> References : http://marc.info/?l=linux-kernel&m=125819380206800&w=4

This looks like an input core problem, as the evdev module was just
loaded and died.

Any input developers have any ideas?

thanks,

greg k-h

2009-11-17 02:06:24

by Theodore Ts'o

Subject: Re: [Bug #14354] Bad corruption with 2.6.32-rc1 and upwards

On Mon, Nov 16, 2009 at 11:37:39PM +0100, Rafael J. Wysocki wrote:
> This message has been generated automatically as a part of a report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.31. Please verify if it still should be listed and let me know
> (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14354
> Subject : Bad corruption with 2.6.32-rc1 and upwards
> Submitter : Holger Freyther <[email protected]>
> Date : 2009-10-09 15:42 (39 days old)

Um, this was marked as resolved, until you reopened it and then reset
the state to New. Why did you do this?

It's fixed in mainline as of commit d4da6c9 when Linus reverted commit
d0646f7. Users could still see it if they mount a file system with -o
journal_checksum, but (a) it's no longer the default, and (b)
corruption if you use the non-default journal_checksum mount option is
not a regression.

We have fixes to make journal_checksum safe queued for 2.6.33, but the
revert fixes the regression problem.

- Ted
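
A minimal sketch, not part of the original message: one way to check whether any ext4 file system is mounted with the non-default journal_checksum option mentioned above. It assumes the option appears in the options column of /proc/mounts-style text; the function name and the sample data are made up for illustration.

```python
def ext4_mounts_with_journal_checksum(mounts_text):
    """Return mount points of ext4 file systems mounted with journal_checksum."""
    hits = []
    for line in mounts_text.splitlines():
        fields = line.split()
        # Assumed layout: device, mount point, fs type, options, dump, pass
        if len(fields) < 4:
            continue
        _device, mount_point, fs_type, options = fields[:4]
        # Compare whole option names so that e.g. journal_async_commit
        # is not mistaken for journal_checksum.
        if fs_type == "ext4" and "journal_checksum" in options.split(","):
            hits.append(mount_point)
    return hits


# Illustrative sample only; on a live system one would read /proc/mounts.
SAMPLE = """\
/dev/sda1 / ext4 rw,relatime,barrier=1,data=ordered 0 0
/dev/sdb1 /data ext4 rw,relatime,journal_checksum 0 0
/dev/sdc1 /boot ext3 rw,relatime 0 0
"""

if __name__ == "__main__":
    print(ext4_mounts_with_journal_checksum(SAMPLE))
```

On a real machine the same function could be fed the contents of /proc/mounts; an empty result would suggest the default (no journal checksumming) is in effect.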

2009-11-17 02:04:57

by Dmitry Torokhov

Subject: Re: [Bug #14626] oops on boot starting udev

On Mon, Nov 16, 2009 at 05:14:55PM -0800, Greg KH wrote:
> On Mon, Nov 16, 2009 at 11:37:48PM +0100, Rafael J. Wysocki wrote:
> > This message has been generated automatically as a part of a report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.31. Please verify if it still should be listed and let me know
> > (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14626
> > Subject : oops on boot starting udev
> > Submitter : Soeren Sonnenburg <[email protected]>
> > Date : 2009-11-14 10:16 (3 days old)
> > References : http://marc.info/?l=linux-kernel&m=125819380206800&w=4
>
> This looks like an input core problem, as the evdev module was just
> loaded and died.
>
> Any input developers have any ideas?
>


Hmm, evdev does:

dev_set_name(&evdev->dev, "event%d", minor);

Not sure how it can go wrong...

--
Dmitry
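
For context, dev_set_name() takes a printf-style format string, so the call quoted above names the device "event" followed by the minor number. A rough userspace illustration in Python (this is not kernel code, and the helper name is hypothetical):

```python
def evdev_device_name(minor):
    """Mimic the printf-style "event%d" format passed to dev_set_name()."""
    return "event%d" % minor


if __name__ == "__main__":
    # Show the names produced for the first few minor numbers.
    for minor in range(3):
        print(evdev_device_name(minor))
```

The formatting itself leaves little room for error, which fits the remark that it is hard to see how this call could go wrong on its own.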

2009-11-17 02:47:05

by Theodore Ts'o

Subject: Re: [Bug #14620] WARNING: at mm/page_alloc.c:1805 __alloc_pages_nodemask

On Mon, Nov 16, 2009 at 11:37:47PM +0100, Rafael J. Wysocki wrote:
> This message has been generated automatically as a part of a report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.31. Please verify if it still should be listed and let me know
> (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14620
> Subject : WARNING: at mm/page_alloc.c:1805 __alloc_pages_nodemask
> Submitter : Rogério Brito <[email protected]>
> Date : 2009-11-06 23:10 (11 days old)
> References : http://marc.info/?l=linux-kernel&m=125754907413892&w=4

This isn't technically a regression, since the warning is simply
complaining about something that apparently ext4 has been doing for a
long time, which is that it allocates some very large order data
buffers. So the change referenced simply printed a warning message
that complained about the fact.

The actual problem will be fixed in 2.6.32, as we no longer allocate
the big data buffers at mount time.

- Ted
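
For readers unfamiliar with the term, the "order" of a page allocation is the base-2 logarithm of the number of contiguous pages requested, rounded up. A rough sketch, assuming 4 KiB pages (the helper name is hypothetical, and the buffer sizes ext4 actually used are not given in this thread):

```python
PAGE_SIZE = 4096  # assumption: 4 KiB pages


def allocation_order(size_bytes, page_size=PAGE_SIZE):
    """Return the smallest order n such that page_size * 2**n >= size_bytes."""
    if size_bytes <= 0:
        raise ValueError("size must be positive")
    pages = -(-size_bytes // page_size)  # ceiling division
    order = 0
    while (1 << order) < pages:
        order += 1
    return order


if __name__ == "__main__":
    for size in (4096, 8192, 65536):
        print(size, allocation_order(size))
```

Each step up in order doubles the amount of physically contiguous memory needed, which is why high-order requests are more likely to fail or trigger warnings on a fragmented system.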

2009-11-17 02:59:06

by Soeren Sonnenburg

Subject: Re: [Bug #14626] oops on boot starting udev

On Mon, 2009-11-16 at 18:04 -0800, Dmitry Torokhov wrote:
> On Mon, Nov 16, 2009 at 05:14:55PM -0800, Greg KH wrote:
> > On Mon, Nov 16, 2009 at 11:37:48PM +0100, Rafael J. Wysocki wrote:
> > > This message has been generated automatically as a part of a report
> > > of recent regressions.
> > >
> > > The following bug entry is on the current list of known regressions
> > > from 2.6.31. Please verify if it still should be listed and let me know
> > > (either way).
> > >
> > >
> > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14626
> > > Subject : oops on boot starting udev
> > > Submitter : Soeren Sonnenburg <[email protected]>
> > > Date : 2009-11-14 10:16 (3 days old)
> > > References : http://marc.info/?l=linux-kernel&m=125819380206800&w=4
> >
> > This looks like an input core problem, as the evdev module was just
> > loaded and died.
> >
> > Any input developers have any ideas?
> >
>
>
> Hmm, evdev does:
>
> dev_set_name(&evdev->dev, "event%d", minor);
>
> Not sure how it can go wrong...

Anything I should/could do to narrow it down a bit (apart from
bisecting?).

Soeren
--
For the one fact about the future of which we can be certain is that it
will be utterly fantastic. -- Arthur C. Clarke, 1962

2009-11-17 04:01:09

by Dmitry Torokhov

Subject: Re: [Bug #14626] oops on boot starting udev

On Tue, Nov 17, 2009 at 03:59:03AM +0100, Soeren Sonnenburg wrote:
> On Mon, 2009-11-16 at 18:04 -0800, Dmitry Torokhov wrote:
> > On Mon, Nov 16, 2009 at 05:14:55PM -0800, Greg KH wrote:
> > > On Mon, Nov 16, 2009 at 11:37:48PM +0100, Rafael J. Wysocki wrote:
> > > > This message has been generated automatically as a part of a report
> > > > of recent regressions.
> > > >
> > > > The following bug entry is on the current list of known regressions
> > > > from 2.6.31. Please verify if it still should be listed and let me know
> > > > (either way).
> > > >
> > > >
> > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14626
> > > > Subject : oops on boot starting udev
> > > > Submitter : Soeren Sonnenburg <[email protected]>
> > > > Date : 2009-11-14 10:16 (3 days old)
> > > > References : http://marc.info/?l=linux-kernel&m=125819380206800&w=4
> > >
> > > This looks like an input core problem, as the evdev module was just
> > > loaded and died.
> > >
> > > Any input developers have any ideas?
> > >
> >
> >
> > Hmm, evdev does:
> >
> > dev_set_name(&evdev->dev, "event%d", minor);
> >
> > Not sure how it can go wrong...
>
> Anything I should/could do to narrow it down a bit (apart from
> bisecting?).
>

Umm, I looked through the changes between -rc6 and 7 but nothing jumped
out at me... You don't happen to have any local changes in your tree?

--
Dmitry

2009-11-17 04:06:45

by Soeren Sonnenburg

Subject: Re: [Bug #14626] oops on boot starting udev

On Mon, 2009-11-16 at 20:01 -0800, Dmitry Torokhov wrote:
> On Tue, Nov 17, 2009 at 03:59:03AM +0100, Soeren Sonnenburg wrote:
> > On Mon, 2009-11-16 at 18:04 -0800, Dmitry Torokhov wrote:
> > > On Mon, Nov 16, 2009 at 05:14:55PM -0800, Greg KH wrote:
> > > > On Mon, Nov 16, 2009 at 11:37:48PM +0100, Rafael J. Wysocki wrote:
> > > > > This message has been generated automatically as a part of a report
> > > > > of recent regressions.
> > > > >
> > > > > The following bug entry is on the current list of known regressions
> > > > > from 2.6.31. Please verify if it still should be listed and let me know
> > > > > (either way).
> > > > >
> > > > >
> > > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14626
> > > > > Subject : oops on boot starting udev
> > > > > Submitter : Soeren Sonnenburg <[email protected]>
> > > > > Date : 2009-11-14 10:16 (3 days old)
> > > > > References : http://marc.info/?l=linux-kernel&m=125819380206800&w=4
> > > >
> > > > This looks like an input core problem, as the evdev module was just
> > > > loaded and died.
> > > >
> > > > Any input developers have any ideas?
> > > >
> > >
> > >
> > > Hmm, evdev does:
> > >
> > > dev_set_name(&evdev->dev, "event%d", minor);
> > >
> > > Not sure how it can go wrong...
> >
> > Anything I should/could do to narrow it down a bit (apart from
> > bisecting?).
> >
>
> Umm, I looked through the changes between -rc6 and 7 but nothing jumped
> out at me... You don't happen to have any local changes in your tree?

Well only the mouse button #1 emulation - though I don't see what could
go wrong there.

Soeren
--
For the one fact about the future of which we can be certain is that it
will be utterly fantastic. -- Arthur C. Clarke, 1962



2009-11-17 06:04:20

by Maciej Rutecki

Subject: Re: [Bug #14352] WARNING: at net/mac80211/scan.c:267

2009/11/16 Rafael J. Wysocki <[email protected]>:
> This message has been generated automatically as a part of a report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.31. Please verify if it still should be listed and let me know
> (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14352
> Subject : WARNING: at net/mac80211/scan.c:267
> Submitter : Maciej Rutecki <[email protected]>
> Date : 2009-10-08 00:30 (40 days old)
> References : http://bugzilla.intellinuxwireless.org/show_bug.cgi?id=2089#c7
>
>

In 2.6.32-rc7 the problem seems to be fixed.

Regards
--
Maciej Rutecki
http://www.maciek.unixy.pl

2009-11-17 06:06:06

by Ingo Molnar

Subject: Re: [Bug #14483] Interrupts enabled after irqrouter_resume - iMac9,1


* Rafael J. Wysocki <[email protected]> wrote:

> This message has been generated automatically as a part of a report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.31. Please verify if it still should be listed and let me know
> (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14483
> Subject : Interrupts enabled after irqrouter_resume - iMac9,1
> Submitter : Justin Mattock <[email protected]>
> Date : 2009-10-25 19:58 (23 days old)
> References : http://marc.info/?l=linux-kernel&m=125650070420168&w=4

Looks like a suspend bug, not an irq bug. The new warnings in the
suspend/resume code might have triggered an old bug in that particular
driver.

Ingo

2009-11-17 08:05:27

by Fabio Comolli

Subject: Re: [Bug #14372] ath5k wireless not working after suspend-resume - eeepc

The offending commit got reverted in -rc7, therefore this regression is solved.

On Mon, Nov 16, 2009 at 11:37 PM, Rafael J. Wysocki <[email protected]> wrote:
> This message has been generated automatically as a part of a report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.31.  Please verify if it still should be listed and let me know
> (either way).
>
>
> Bug-Entry       : http://bugzilla.kernel.org/show_bug.cgi?id=14372
> Subject         : ath5k wireless not working after suspend-resume - eeepc
> Submitter       : Fabio Comolli <[email protected]>
> Date            : 2009-10-03 15:36 (45 days old)
> References      : http://lkml.org/lkml/2009/10/3/91
>
>
>

2009-11-17 08:40:21

by Minchan Kim

Subject: Re: [Bug #14618] OOM killer, page fault

I think we can ignore this bug.
We can see mm_fault_error in the stack trace.
I guess it comes from some driver returning VM_FAULT_OOM.

As far as I know, it happens very rarely. Norbert, right?

In addition, I haven't seen any similar reports.

So I think it would be better to ignore this bug
until we see a similar report again.


On Tue, Nov 17, 2009 at 7:37 AM, Rafael J. Wysocki <[email protected]> wrote:
> This message has been generated automatically as a part of a report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.31.  Please verify if it still should be listed and let me know
> (either way).
>
>
> Bug-Entry       : http://bugzilla.kernel.org/show_bug.cgi?id=14618
> Subject         : OOM killer, page fault
> Submitter       : Norbert Preining <[email protected]>
> Date            : 2009-10-30 6:32 (18 days old)
> References      : http://marc.info/?l=linux-kernel&m=125688434909582&w=4
> Handled-By      : Minchan Kim <[email protected]>
>
>
>



--
Kind regards,
Minchan Kim

2009-11-17 09:03:35

by Benjamin Herrenschmidt

Subject: Re: [Bug #14355] USB serial regression after 2.6.31.1 with Huawei E169 GSM modem

On Tue, 2009-11-17 at 00:12 +0100, Rafael J. Wysocki wrote:
> On Tuesday 17 November 2009, Oliver Neukum wrote:
> > Am Montag, 16. November 2009 23:37:39 schrieb Rafael J. Wysocki:
> > > This message has been generated automatically as a part of a report
> > > of recent regressions.
> > >
> > > The following bug entry is on the current list of known regressions
> > > from 2.6.31. Please verify if it still should be listed and let me know
> > > (either way).
> > >
> > >
> > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14355
> > > Subject : USB serial regression after 2.6.31.1 with Huawei E169 GSM modem
> > > Submitter : Benjamin Herrenschmidt <[email protected]>
> > > Date : 2009-10-10 03:07 (38 days old)
> > > References : http://marc.info/?l=linux-kernel&m=125513456327542&w=4
> >
> > Benjamin has fixed this bug.
>
> Thanks, closed.
>

I asked Greg to re-open it for a while to track other issues with
various Huawei modems. However, it appears that most of them would
be tricky to work around in the kernel and are fixed by FW updates
(though updating the FW on those modems can be non-trivial).

I'll leave it open for a little while to see how things go on the
corresponding Ubuntu bug, and if things settle, we can then close it.
In the meantime, you can remove it from your list of regressions.

Cheers,
Ben.

2009-11-17 09:19:24

by Norbert Preining

Subject: Re: [Bug #14618] OOM killer, page fault

On Tue, 17 Nov 2009, Minchan Kim wrote:
> I think we can ignore this bug.

Agreed. It is so hard to reproduce, and there might be some other reason for
it.

> As far as I know, it happens very rarely. Norbert, right?

Yes definitely.

> So I think it would be better to ignore this bug
> until we see a similar report again.

Agreed, please close or whatever.

In case I find a method to reproduce it more easily I will come back to
you.

Best wishes

Norbert

-------------------------------------------------------------------------------
Dr. Norbert Preining Associate Professor
JAIST Japan Advanced Institute of Science and Technology [email protected]
Vienna University of Technology [email protected]
Debian Developer (Debian TeX Task Force) [email protected]
gpg DSA: 0x09C5B094 fp: 14DF 2E6C 0307 BE6D AD76 A9C0 D2BF 4AA3 09C5 B094
-------------------------------------------------------------------------------
SMEARISARY (n.)
The correct name for a junior apprentice greengrocer whose main duty
is to arrange the fruit so that the bad side is underneath. From the
name of a character not in Dickens.
--- Douglas Adams, The Meaning of Liff

2009-11-17 12:35:56

by Lukas Kolbe

Subject: Re: [Bug #14577] Data Corruption with Adaptec 52445, Firmware 5.2-0 (17380)

Rafael J. Wysocki wrote:

>This message has been generated automatically as a part of a report
>of recent regressions.
>
>The following bug entry is on the current list of known regressions
>from 2.6.31. Please verify if it still should be listed and let me know
>(either way).

It is still valid. We haven't yet been able to verify if it is either a
hardware problem (working with the adaptec folks to sort that out) or a
kernel problem (working with you to find that out ;). Kernel 2.6.30, as
already said, seems to think everything is fine, so it really might be a
regression.

>Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14577
>Subject : Data Corruption with Adaptec 52445, Firmware 5.2-0 (17380)
>Submitter : <[email protected]>
>Date : 2009-11-10 13:31 (7 days old)

--
Lukas Kolbe

2009-11-17 12:52:19

by Arkadiusz Miskiewicz

Subject: Re: [Bug #14380] Video tearing/glitching with T400 laptops

On Monday 16 of November 2009, Rafael J. Wysocki wrote:
> This message has been generated automatically as a part of a report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.31. Please verify if it still should be listed and let me know
> (either way).

It's mostly gone. It still happens for about 1/4 s once every 6 hours, but jbarnes
asked me to file a separate bug for that and gave me a patch for testing (see below).

So far I haven't seen the 1/4 s problem with this patch applied.

diff --git a/drivers/gpu/drm/i915/intel_display.c b/drivers/gpu/drm/i915/intel_display.c
index 3ba6546..b2cbf7f 100644
--- a/drivers/gpu/drm/i915/intel_display.c
+++ b/drivers/gpu/drm/i915/intel_display.c
@@ -2491,6 +2491,8 @@ static void g4x_update_wm(struct drm_device *dev, int planea_clock,
 		/* Use ns/us then divide to preserve precision */
 		sr_entries = (((sr_latency_ns / line_time_us) + 1) *
 			      pixel_size * sr_hdisplay) / 1000;
+		if (sr_entries > G4X_FIFO_SIZE)
+			sr_entries = G4X_FIFO_SIZE;
 		sr_entries = roundup(sr_entries / cacheline_size, 1);
 		DRM_DEBUG("self-refresh entries: %d\n", sr_entries);
 		I915_WRITE(FW_BLC_SELF, FW_BLC_SELF_EN);
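
For anyone who wants to see the effect of the clamp outside the driver, here is a
minimal stand-alone sketch of the same arithmetic. It is only an illustration:
sr_entries_sketch() and its fifo_size parameter are made-up names (fifo_size stands
in for G4X_FIFO_SIZE), and any input values used with it are assumptions, not
numbers taken from real hardware.

```c
#include <assert.h>

/*
 * Hypothetical user-space model of the self-refresh entry calculation
 * from the hunk above. Parameter names mirror the driver variables;
 * fifo_size stands in for G4X_FIFO_SIZE. Not the driver's actual code.
 */
static int sr_entries_sketch(int sr_latency_ns, int line_time_us,
                             int pixel_size, int sr_hdisplay,
                             int fifo_size, int cacheline_size)
{
    int sr_entries;

    /* Both values are used as divisors below. */
    assert(line_time_us > 0 && cacheline_size > 0);

    /* Use ns/us then divide to preserve precision (as in the hunk). */
    sr_entries = (((sr_latency_ns / line_time_us) + 1) *
                  pixel_size * sr_hdisplay) / 1000;

    /* The two added lines: never ask for more than the FIFO can hold. */
    if (sr_entries > fifo_size)
        sr_entries = fifo_size;

    /* roundup(x, 1) in the hunk leaves x unchanged, so only divide here. */
    return sr_entries / cacheline_size;
}
```

With a small mode the result is unchanged by the clamp; with a large mode the
intermediate value would exceed the FIFO size, and the clamp caps it before the
division by the cacheline size.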

>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14380
> Subject : Video tearing/glitching with T400 laptops
> Submitter : Theodore Ts'o <[email protected]>
> Date : 2009-10-02 22:40 (46 days old)
> References : http://marc.info/?l=linux-kernel&m=125452324520623&w=4
> Handled-By : Jesse Barnes <[email protected]>
> Patch : http://marc.info/?l=linux-kernel&m=125591495325000&w=4
>


--
Arkadiusz Miśkiewicz PLD/Linux Team
arekm / maven.pl http://ftp.pld-linux.org/

2009-11-17 22:18:59

by Rafael J. Wysocki

Subject: Re: [Bug #14352] WARNING: at net/mac80211/scan.c:267

On Tuesday 17 November 2009, Maciej Rutecki wrote:
> 2009/11/16 Rafael J. Wysocki <[email protected]>:
> > This message has been generated automatically as a part of a report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.31. Please verify if it still should be listed and let me know
> > (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14352
> > Subject : WARNING: at net/mac80211/scan.c:267
> > Submitter : Maciej Rutecki <[email protected]>
> > Date : 2009-10-08 00:30 (40 days old)
> > References : http://bugzilla.intellinuxwireless.org/show_bug.cgi?id=2089#c7
> >
> >
>
> In 2.6.32-rc7 the problem seems to be fixed.

Thanks, closing.

Rafael

2009-11-17 22:21:50

by Rafael J. Wysocki

Subject: Re: [Bug #14354] Bad corruption with 2.6.32-rc1 and upwards

On Tuesday 17 November 2009, Theodore Tso wrote:
> On Mon, Nov 16, 2009 at 11:37:39PM +0100, Rafael J. Wysocki wrote:
> > This message has been generated automatically as a part of a report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.31. Please verify if it still should be listed and let me know
> > (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14354
> > Subject : Bad corruption with 2.6.32-rc1 and upwards
> > Submitter : Holger Freyther <[email protected]>
> > Date : 2009-10-09 15:42 (39 days old)
>
> Um, this was marked as resolved, until you reopened it and then reset
> the state to New. Why did you do this?

I wasn't quite sure what the status was, because there was some activity in the
bug entry after it had been marked as resolved.

> It's fixed in mainline as of commit d4da6c9 when Linus reverted commit
> d0646f7. Users could still see it if they mount a file system with -o
> journal_checksum, but (a) it's no longer the default, and (b)
> corruption if you use the non-default journal_checksum mount option is
> not a regression.
>
> We have fixes to make journal_checksum safe queued for 2.6.33, but the
> revert fixes the regression problem.

OK, great, thanks for the confirmation.

I'll close it now.

Best,
Rafael

2009-11-17 22:23:12

by Rafael J. Wysocki

Subject: Re: [Bug #14355] USB serial regression after 2.6.31.1 with Huawei E169 GSM modem

On Tuesday 17 November 2009, Benjamin Herrenschmidt wrote:
> On Tue, 2009-11-17 at 00:12 +0100, Rafael J. Wysocki wrote:
> > On Tuesday 17 November 2009, Oliver Neukum wrote:
> > > Am Montag, 16. November 2009 23:37:39 schrieb Rafael J. Wysocki:
> > > > This message has been generated automatically as a part of a report
> > > > of recent regressions.
> > > >
> > > > The following bug entry is on the current list of known regressions
> > > > from 2.6.31. Please verify if it still should be listed and let me know
> > > > (either way).
> > > >
> > > >
> > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14355
> > > > Subject : USB serial regression after 2.6.31.1 with Huawei E169 GSM modem
> > > > Submitter : Benjamin Herrenschmidt <[email protected]>
> > > > Date : 2009-10-10 03:07 (38 days old)
> > > > References : http://marc.info/?l=linux-kernel&m=125513456327542&w=4
> > >
> > > Benjamin has fixed this bug.
> >
> > Thanks, closed.
> >
>
> I asked Greg to re-open it for a while to track other issues with
> various Huawei modems. However, it appears that most of them would
> be tricky to work around in the kernel and are fixed by FW updates
> (though updating the FW on those modems can be non-trivial).
>
> I'll leave it open for a little while to see how things go on the
> corresponding Ubuntu bug, and if things settle, we can then close it.
> In the meantime, you can remove it from your list of regressions.

OK, I will.

Thanks,
Rafael

2009-11-17 22:24:47

by Rafael J. Wysocki

Subject: Re: [Bug #14372] ath5k wireless not working after suspend-resume - eeepc

On Tuesday 17 November 2009, Fabio Comolli wrote:
> The offending commit got reverted in -rc7, therefore this regression is solved.
>
> On Mon, Nov 16, 2009 at 11:37 PM, Rafael J. Wysocki <[email protected]> wrote:
> > This message has been generated automatically as a part of a report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.31. Please verify if it still should be listed and let me know
> > (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14372
> > Subject : ath5k wireless not working after suspend-resume - eeepc
> > Submitter : Fabio Comolli <[email protected]>
> > Date : 2009-10-03 15:36 (45 days old)
> > References : http://lkml.org/lkml/2009/10/3/91

Thanks, closed.

Rafael

2009-11-17 22:27:10

by Rafael J. Wysocki

Subject: Re: [Bug #14379] ACPI Warning for _SB_.BAT0._BIF: Converted Buffer to expected String

On Tuesday 17 November 2009, Justin P. Mattock wrote:
> Rafael J. Wysocki wrote:
> > This message has been generated automatically as a part of a report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.31. Please verify if it still should be listed and let me know
> > (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14379
> > Subject : ACPI Warning for _SB_.BAT0._BIF: Converted Buffer to expected String
> > Submitter : Justin Mattock<[email protected]>
> > Date : 2009-10-08 21:46 (40 days old)
> > First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=d9adc2e031bd22d5d9607a53a8d3b30e0b675f39
> > References : http://marc.info/?l=linux-kernel&m=125504031328941&w=4
> > Handled-By : Alexey Starikovskiy<[email protected]>
> > Patch : http://bugzilla.kernel.org/attachment.cgi?id=23347
> >
> >
> >
> >
> O.K., I just pulled the latest to check, and
> the warning message is there.
> So yes, this bug report should stay open
> until this is fixed.

Thanks for the update.

Rafael

2009-11-17 22:29:13

by Rafael J. Wysocki

Subject: Re: [Bug #14380] Video tearing/glitching with T400 laptops

On Tuesday 17 November 2009, Arkadiusz Miskiewicz wrote:
> On Monday 16 of November 2009, Rafael J. Wysocki wrote:
> > This message has been generated automatically as a part of a report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.31. Please verify if it still should be listed and let me know
> > (either way).
>
> It's mostly gone. It still happens for about 1/4 s once every 6 hours, but jbarnes
> asked me to file a separate bug for that and gave me a patch for testing (see below).
>
> So far I haven't seen the 1/4 s problem with this patch applied.

Great.

> diff --git a/drivers/gpu/drm/i915/intel_display.c b/drivers/gpu/drm/i915/intel_display.c
> index 3ba6546..b2cbf7f 100644
> --- a/drivers/gpu/drm/i915/intel_display.c
> +++ b/drivers/gpu/drm/i915/intel_display.c
> @@ -2491,6 +2491,8 @@ static void g4x_update_wm(struct drm_device *dev, int planea_clock,
>  		/* Use ns/us then divide to preserve precision */
>  		sr_entries = (((sr_latency_ns / line_time_us) + 1) *
>  			      pixel_size * sr_hdisplay) / 1000;
> +		if (sr_entries > G4X_FIFO_SIZE)
> +			sr_entries = G4X_FIFO_SIZE;
>  		sr_entries = roundup(sr_entries / cacheline_size, 1);
>  		DRM_DEBUG("self-refresh entries: %d\n", sr_entries);
>  		I915_WRITE(FW_BLC_SELF, FW_BLC_SELF_EN);
>
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14380
> > Subject : Video tearing/glitching with T400 laptops
> > Submitter : Theodore Ts'o <[email protected]>
> > Date : 2009-10-02 22:40 (46 days old)
> > References : http://marc.info/?l=linux-kernel&m=125452324520623&w=4
> > Handled-By : Jesse Barnes <[email protected]>
> > Patch : http://marc.info/?l=linux-kernel&m=125591495325000&w=4

Thanks for the update.

Rafael

2009-11-17 22:31:35

by Rafael J. Wysocki

Subject: Re: [Bug #14472] EXT4 corruption

On Tuesday 17 November 2009, Andy Lutomirski wrote:
> I think this was the journal checksumming bug, which is fixed.

Thanks for the update.


> On Nov 16, 2009, at 5:37 PM, "Rafael J. Wysocki" <[email protected]> wrote:
>
> > This message has been generated automatically as a part of a report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.31. Please verify if it still should be listed and let me
> > know
> > (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14472
> > Subject : EXT4 corruption
> > Submitter : Shawn Starr <[email protected]>
> > Date : 2009-10-13 2:07 (35 days old)
> > References : http://marc.info/?l=linux-kernel&m=125539997508256&w=4
> > Handled-By : Theodore Tso <[email protected]>

I'm going to close the bug.

Rafael

2009-11-17 22:40:46

by Rafael J. Wysocki

Subject: Re: [Bug #14483] Interrupts enabled after irqrouter_resume - iMac9,1

On Tuesday 17 November 2009, Ingo Molnar wrote:
>
> * Rafael J. Wysocki <[email protected]> wrote:
>
> > This message has been generated automatically as a part of a report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.31. Please verify if it still should be listed and let me know
> > (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14483
> > Subject : Interrupts enabled after irqrouter_resume - iMac9,1
> > Submitter : Justin Mattock <[email protected]>
> > Date : 2009-10-25 19:58 (23 days old)
> > References : http://marc.info/?l=linux-kernel&m=125650070420168&w=4
>
> Looks like a suspend bug, not an irq bug. The new warnings in the
> suspend/resume code might have triggered an old bug in that particular
> driver.

That's quite possible, although that's core code rather than a driver.

Anyway, I haven't been able to find the bug in there so far.

Thanks,
Rafael

2009-11-17 22:43:15

by Rafael J. Wysocki

Subject: Re: [Bug #14487] PANIC: early exception 08 rip 246:10 error ffffffff810251b5 cr2 0

On Tuesday 17 November 2009, Justin P. Mattock wrote:
> Rafael J. Wysocki wrote:
> > This message has been generated automatically as a part of a report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.31. Please verify if it still should be listed and let me know
> > (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14487
> > Subject : PANIC: early exception 08 rip 246:10 error ffffffff810251b5 cr2 0
> > Submitter : Justin P. Mattock<[email protected]>
> > Date : 2009-10-23 16:45 (25 days old)
> > References : http://lkml.org/lkml/2009/10/23/252
> >
> >
> >
> >
> This one has me a bit dazed. After looking into the issue
> I did find a workaround (keep in mind it's not pretty):
> commenting out set_fixmap_nocache and
> init_ohci1394_reset_and_init_dma.
> (By doing so I was able to load both machines and
> execute early debugging in case a problem occurs.)
>
> Now as to what might be happening: after going through as
> much as I can comprehend, the only thing that comes to mind is that,
> reading fixmap.h, the comments state that vsyscalls
> only cover 32-bit, and that there needs to be another set
> for 64-bit, leading me to believe that this is what I might be hitting.
> (My system is pure 64-bit, with no 32-bit at all.)
>
> At this point I think I need somebody to give me some info on this,
> and if the 64-bit issue mentioned above is the case, then we can probably
> close this and leave it up to the x86_64 builders to create a 64-bit
> call for this whenever they get to it. (The main thing is I'm able to
> run DMA early in case of an emergency.)

Thanks for the update.

Rafael

2009-11-17 22:44:13

by Rafael J. Wysocki

Subject: Re: [Bug #14577] Data Corruption with Adaptec 52445, Firmware 5.2-0 (17380)

On Tuesday 17 November 2009, Lukas Kolbe wrote:
> Rafael J. Wysocki wrote:
>
> >This message has been generated automatically as a part of a report
> >of recent regressions.
> >
> >The following bug entry is on the current list of known regressions
> >from 2.6.31. Please verify if it still should be listed and let me know
> >(either way).
>
> It is still valid. We haven't yet been able to verify if it is either a
> hardware problem (working with the adaptec folks to sort that out) or a
> kernel problem (working with you to find that out ;). Kernel 2.6.30, as
> already said, seems to think everything is fine, so it really might be a
> regression.
>
> >Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14577
> >Subject : Data Corruption with Adaptec 52445, Firmware 5.2-0 (17380)
> >Submitter : <[email protected]>
> >Date : 2009-11-10 13:31 (7 days old)

Thanks for the update.

Rafael

2009-11-17 22:49:24

by Rafael J. Wysocki

Subject: Re: [Bug #14618] OOM killer, page fault

On Tuesday 17 November 2009, Norbert Preining wrote:
> On Tue, 17 Nov 2009, Minchan Kim wrote:
> > I think we can ignore this bug.
>
> Agreed. It is so hard to reproduce, and there might be some other reason for
> it.
>
> > As far as I know, it happens very rarely. Norbert, right?
>
> Yes definitely.
>
> > So I think it would be better to ignore this bug
> > until we see a similar report again.
>
> Agreed, please close or whatever.

Thanks for the update, I'm going to close it.

Rafael

2009-11-17 22:52:31

by Rafael J. Wysocki

Subject: Re: [Bug #14620] WARNING: at mm/page_alloc.c:1805 __alloc_pages_nodemask

On Tuesday 17 November 2009, Theodore Tso wrote:
> On Mon, Nov 16, 2009 at 11:37:47PM +0100, Rafael J. Wysocki wrote:
> > This message has been generated automatically as a part of a report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.31. Please verify if it still should be listed and let me know
> > (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14620
> > Subject : WARNING: at mm/page_alloc.c:1805 __alloc_pages_nodemask
> > Submitter : Rogério Brito <[email protected]>
> > Date : 2009-11-06 23:10 (11 days old)
> > References : http://marc.info/?l=linux-kernel&m=125754907413892&w=4
>
> This isn't technically a regression, since the warning is simply
> complaining about something that apparently ext4 has been doing for a
> long time, which is that it allocates some very large order data
> buffers. So the change referenced simply printed a warning message
> that complained about the fact.
>
> The actual problem will be fixed in 2.6.32, as we no longer allocate
> the big data buffers at mount time.

Thanks, I'm going to close the bug.

Rafael

2009-11-17 23:08:14

by Thomas Gleixner

Subject: Re: [Bug #14483] Interrupts enabled after irqrouter_resume - iMac9,1

On Tue, 17 Nov 2009, Rafael J. Wysocki wrote:

ACPI folks Cc'ed

> On Tuesday 17 November 2009, Ingo Molnar wrote:
> >
> > * Rafael J. Wysocki <[email protected]> wrote:
> >
> > > This message has been generated automatically as a part of a report
> > > of recent regressions.
> > >
> > > The following bug entry is on the current list of known regressions
> > > from 2.6.31. Please verify if it still should be listed and let me know
> > > (either way).
> > >
> > >
> > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14483
> > > Subject : Interrupts enabled after irqrouter_resume - iMac9,1
> > > Submitter : Justin Mattock <[email protected]>
> > > Date : 2009-10-25 19:58 (23 days old)
> > > References : http://marc.info/?l=linux-kernel&m=125650070420168&w=4
> >
> > Looks like a suspend bug, not an irq bug. The new warnings in the
> > suspend/resume code might have triggered an old bug in that particular
> > driver.
>
> That's quite possible, although that's core code rather than a driver.
>
> Anyway, I haven't been able to find the bug in there so far.

irqrouter_resume() seems to be solely ACPI code. I have not seen where
it might reenable interrupts, but ACPI folks might shed some light on
that.
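
For illustration, the kind of check behind the "Interrupts enabled after
irqrouter_resume" message can be modelled in user space. This is only a sketch
under assumptions: model_irqs_enabled, run_resume_checked() and the two
callbacks are made-up names, not kernel APIs, and the real suspend/resume code
is not reproduced here.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical stand-in for the CPU interrupt flag; not a kernel API. */
static bool model_irqs_enabled;

typedef int (*resume_fn)(void);

/* A well-behaved callback: leaves the interrupt state alone. */
static int good_resume(void)
{
    return 0;
}

/* A buggy callback: turns interrupts back on as a side effect. */
static int buggy_resume(void)
{
    model_irqs_enabled = true;
    return 0;
}

/*
 * Run one resume callback with interrupts modelled as disabled.
 * Returns true if the callback left them disabled, false (plus a
 * warning on stderr) if it re-enabled them.
 */
static bool run_resume_checked(const char *name, resume_fn fn)
{
    assert(fn != NULL);

    model_irqs_enabled = false;
    fn();

    if (model_irqs_enabled) {
        fprintf(stderr, "Interrupts enabled after %s\n", name);
        return false;
    }
    return true;
}
```

The point of such a check is that it names the callback that ran last, which
is not necessarily where the interrupts were actually switched back on; the
culprit can be anything that callback calls into.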

Thanks,

tglx

2009-11-18 00:11:46

by Theodore Ts'o

Subject: Re: [Bug #14354] Bad corruption with 2.6.32-rc1 and upwards

On Tue, Nov 17, 2009 at 11:23:11PM +0100, Rafael J. Wysocki wrote:
>
> I wasn't quite sure what the status was, because there was some
> activity in the bug entry after it had been marked as resolved.

Yeah, the actual regression had been resolved (by changing the
default), but the root cause was due to the fact that not enough
people had done proper power-fail testing for the journal_checksum,
even though the code had been in the kernel for a very long time. The
discussion afterwards was focused around fixing those problems so we
could make journal_checksum be the default at some point in the future.

- Ted

2009-11-18 09:19:45

by Pavel Machek

Subject: Re: [Bug #14296] spitz boots but suspend/resume is broken

> This message has been generated automatically as a part of a report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.31. Please verify if it still should be listed and let me know
> (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14296
> Subject : spitz boots but suspend/resume is broken
> Submitter : Pavel Machek <[email protected]>
> Date : 2009-09-30 12:06 (48 days old)
> References : http://marc.info/?l=linux-kernel&m=125431244516449&w=4

This one was fixed by a generic pxa fix.

--
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html

2009-11-18 11:09:47

by Peter Zijlstra

Subject: Re: [Bug #14383] hackbench regression with kernel 2.6.32-rc1

On Mon, 2009-11-16 at 23:37 +0100, Rafael J. Wysocki wrote:

> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14383
> Subject : hackbench regression with kernel 2.6.32-rc1
> Submitter : Zhang, Yanmin <[email protected]>
> Date : 2009-10-09 9:19 (39 days old)
> First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=29cd8bae396583a2ee9a3340db8c5102acf9f6fd
> References : http://marc.info/?l=linux-kernel&m=125508007510274&w=4
> Handled-By : Peter Zijlstra <[email protected]>


> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14384
> Subject : tbench regression with 2.6.32-rc1
> Submitter : Zhang, Yanmin <[email protected]>
> Date : 2009-10-09 9:51 (39 days old)
> First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=59abf02644c45f1591e1374ee7bb45dc757fcb88
> References : http://marc.info/?l=linux-kernel&m=125508216713138&w=4
> Handled-By : Peter Zijlstra <[email protected]>



> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14621
> Subject : specjbb2005 and aim7 regression with 2.6.32-rc kernels
> Submitter : Zhang, Yanmin <[email protected]>
> Date : 2009-11-06 7:38 (11 days old)
> References : http://marc.info/?l=linux-kernel&m=125749310413174&w=4


Yanmin, could you please update me on the status of these regressions?

Mike seems to have done a lot to address issues while I was out, and
while I (hopefully) did read all the resulting email, I must admit to
losing track of where we stand.


2009-11-18 22:18:41

by Rafael J. Wysocki

Subject: Re: [Bug #14296] spitz boots but suspend/resume is broken

On Wednesday 18 November 2009, Pavel Machek wrote:
> > This message has been generated automatically as a part of a report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.31. Please verify if it still should be listed and let me know
> > (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14296
> > Subject : spitz boots but suspend/resume is broken
> > Submitter : Pavel Machek <[email protected]>
> > Date : 2009-09-30 12:06 (48 days old)
> > References : http://marc.info/?l=linux-kernel&m=125431244516449&w=4
>
> This one was fixed by a generic pxa fix.

Thanks, closing.

Rafael

2009-11-19 02:59:39

by Dmitry Torokhov

Subject: Re: [Bug #14626] oops on boot starting udev

On Tue, Nov 17, 2009 at 05:06:47AM +0100, Soeren Sonnenburg wrote:
> On Mon, 2009-11-16 at 20:01 -0800, Dmitry Torokhov wrote:
> > On Tue, Nov 17, 2009 at 03:59:03AM +0100, Soeren Sonnenburg wrote:
> > > On Mon, 2009-11-16 at 18:04 -0800, Dmitry Torokhov wrote:
> > > > On Mon, Nov 16, 2009 at 05:14:55PM -0800, Greg KH wrote:
> > > > > On Mon, Nov 16, 2009 at 11:37:48PM +0100, Rafael J. Wysocki wrote:
> > > > > > This message has been generated automatically as a part of a report
> > > > > > of recent regressions.
> > > > > >
> > > > > > The following bug entry is on the current list of known regressions
> > > > > > from 2.6.31. Please verify if it still should be listed and let me know
> > > > > > (either way).
> > > > > >
> > > > > >
> > > > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14626
> > > > > > Subject : oops on boot starting udev
> > > > > > Submitter : Soeren Sonnenburg <[email protected]>
> > > > > > Date : 2009-11-14 10:16 (3 days old)
> > > > > > References : http://marc.info/?l=linux-kernel&m=125819380206800&w=4
> > > > >
> > > > > This looks like an input core problem, as the evdev module was just
> > > > > loaded and died.
> > > > >
> > > > > Any input developers have any ideas?
> > > > >
> > > >
> > > >
> > > > Hmm, evdev does:
> > > >
> > > > dev_set_name(&evdev->dev, "event%d", minor);
> > > >
> > > > Not sure how it can go wrong...
> > >
> > > Anything I should/could do to narrow it down a bit (apart from
> > > bisecting?).
> > >
> >
> > Umm, I looked through the changes between -rc6 and 7 but nothing jumped
> > out at me... You don't happen to have any local changes in your tree?
>
> Well only the mouse button #1 emulation - though I don't see what could
> go wrong there.
>

I have been looking through the changes and I really don't see anything
suspicious. I am also not hitting this oops on any of my boxes. Any
chance you could bisect?

Thanks.

--
Dmitry

2009-11-19 20:05:36

by David Miller

Subject: Re: [Bug #14622] Second IDE device not found

From: "Rafael J. Wysocki" <[email protected]>
Date: Mon, 16 Nov 2009 23:37:47 +0100 (CET)

> This message has been generated automatically as a part of a report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.31. Please verify if it still should be listed and let me know
> (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14622
> Subject : Second IDE device not found
> Submitter : Zeno Davatz <[email protected]>
> Date : 2009-11-11 17:31 (6 days old)
> References : http://marc.info/?l=linux-kernel&m=125796105822353&w=4

We're going to need more information to diagnose this.

And linux-ide should have been at least CC:'d from the very
beginning.

From what I can discern, the problem was introduced somewhere between
2.6.27 and 2.6.31, you have a Serverworks CSB5, and the main issue
is that attaching or detaching your CD-ROM drive influences whether
both of your disks are properly detected. Correct?

You seem to have played around with using the IDE layer vs. the
ATA layer. Can you see any difference in behavior if you try
using just the IDE layer vs. just the ATA layer with the 2.6.31
kernel?

Thanks.

2009-11-20 05:38:31

by Yanmin Zhang

Subject: Re: [Bug #14383] hackbench regression with kernel 2.6.32-rc1

On Wed, 2009-11-18 at 12:09 +0100, Peter Zijlstra wrote:
> On Mon, 2009-11-16 at 23:37 +0100, Rafael J. Wysocki wrote:

Sorry for the late reply. There was a severe power failure in my lab.

Below are updates against the 2.6.32-rc7 kernel.

>
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14383
> > Subject : hackbench regression with kernel 2.6.32-rc1
> > Submitter : Zhang, Yanmin <[email protected]>
> > Date : 2009-10-09 9:19 (39 days old)
> > First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=29cd8bae396583a2ee9a3340db8c5102acf9f6fd
> > References : http://marc.info/?l=linux-kernel&m=125508007510274&w=4
> > Handled-By : Peter Zijlstra <[email protected]>

On Core 2 machines, the hackbench regression has disappeared; there is now
a significant improvement instead of a regression.
On Nehalem machines, there is no big change compared with 2.6.31.

On Itanium machines (2 sockets or 4 sockets), the regression is now
about 20%. Originally it was 70%.


>
>
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14384
> > Subject : tbench regression with 2.6.32-rc1
> > Submitter : Zhang, Yanmin <[email protected]>
> > Date : 2009-10-09 9:51 (39 days old)
> > First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=59abf02644c45f1591e1374ee7bb45dc757fcb88
> > References : http://marc.info/?l=linux-kernel&m=125508216713138&w=4
> > Handled-By : Peter Zijlstra <[email protected]>

On Core 2 machines, the tbench regression is now about 4%. Originally, the
regression was about 33%.

On Nehalem, the tbench regression is about 4%. Originally it was 7%.

On Itanium, the tbench regression is about 16%. Originally it was 26%.


>
>
>
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14621
> > Subject : specjbb2005 and aim7 regression with 2.6.32-rc kernels
> > Submitter : Zhang, Yanmin <[email protected]>
> > Date : 2009-11-06 7:38 (11 days old)
> > References : http://marc.info/?l=linux-kernel&m=125749310413174&w=4

The specjbb2005 and aim7 results show almost no variation.

>
>
> Yanmin, could you please update me on the status of these regressions?
>
> Mike seems to have done a lot to address issues while I was out, and
> while I (hopefully) did read all resulting email, I must admit to
> losing track of where we stand.

Mike's patch 1b9508f6831e10 could improve netperf loopback results.
It has not been merged into the latest upstream yet.


2009-11-20 06:52:29

by Mike Galbraith

Subject: Re: [Bug #14383] hackbench regression with kernel 2.6.32-rc1

On Fri, 2009-11-20 at 13:40 +0800, Zhang, Yanmin wrote:

> Mike's patch 1b9508f6831e10 could improve netperf loopback results.
> It has not been merged into the latest upstream yet.

The kinda ugly thing below gives me around a 4% boost for pinned tasks.
Looking around is expensive for fast movers; some of that cost can be avoided.

---
kernel/sched_fair.c | 40 +++++++++++++++++++++++++++++-----------
1 file changed, 29 insertions(+), 11 deletions(-)

Index: linux-2.6/kernel/sched_fair.c
===================================================================
--- linux-2.6.orig/kernel/sched_fair.c
+++ linux-2.6/kernel/sched_fair.c
@@ -1396,26 +1396,36 @@ static int select_task_rq_fair(struct ta
{
struct sched_domain *tmp, *affine_sd = NULL, *sd = NULL;
int cpu = smp_processor_id();
- int prev_cpu = task_cpu(p);
- int new_cpu = cpu;
- int want_affine = 0;
- int want_sd = 1;
+ int new_cpu, prev_cpu = task_cpu(p);
+ int pinned, want_sd, want_affine = 0;
int sync = wake_flags & WF_SYNC;

- if (sd_flag & SD_BALANCE_WAKE) {
- if (sched_feat(AFFINE_WAKEUPS) &&
- cpumask_test_cpu(cpu, &p->cpus_allowed))
- want_affine = 1;
+ rcu_read_lock();
+ pinned = !(cpumask_weight(&p->cpus_allowed) > 1);
+ new_cpu = pinned ? prev_cpu : cpu;
+ want_sd = !pinned;
+
+#ifndef CONFIG_FAIR_GROUP_SCHED
+ /*
+ * If we don't need to balance shares, we can skip
+ * everything below, and save some time.
+ */
+ if (pinned)
+ goto out;
+#endif
+
+ if ((sd_flag & SD_BALANCE_WAKE) && sched_feat(AFFINE_WAKEUPS) &&
+ cpumask_test_cpu(cpu, &p->cpus_allowed)) {
+ want_affine = 1;
new_cpu = prev_cpu;
}

- rcu_read_lock();
for_each_domain(cpu, tmp) {
/*
* If power savings logic is enabled for a domain, see if we
* are not overloaded, if so, don't balance wider.
*/
- if (tmp->flags & (SD_POWERSAVINGS_BALANCE|SD_PREFER_LOCAL)) {
+ if (want_sd && tmp->flags & (SD_POWERSAVINGS_BALANCE|SD_PREFER_LOCAL)) {
unsigned long power = 0;
unsigned long nr_running = 0;
unsigned long capacity;
@@ -1454,7 +1464,7 @@ static int select_task_rq_fair(struct ta
* If there's an idle sibling in this domain, make that
* the wake_affine target instead of the current cpu.
*/
- if (tmp->flags & SD_PREFER_SIBLING)
+ if (!pinned && tmp->flags & SD_PREFER_SIBLING)
target = select_idle_sibling(p, tmp, target);

if (target >= 0) {
@@ -1476,6 +1486,7 @@ static int select_task_rq_fair(struct ta
sd = tmp;
}

+#ifdef CONFIG_FAIR_GROUP_SCHED
if (sched_feat(LB_SHARES_UPDATE)) {
/*
* Pick the largest domain to update shares over
@@ -1490,6 +1501,13 @@ static int select_task_rq_fair(struct ta
update_shares(tmp);
}

+ /*
+ * Balance shares, but don't waste time.
+ */
+ if (pinned)
+ goto out;
+#endif
+
if (affine_sd && wake_affine(affine_sd, p, sync)) {
new_cpu = cpu;
goto out;
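
The up-front decision the patch makes for a waking task can be modelled outside the kernel. The sketch below is only an illustration in Python, not kernel code: the function name `initial_wake_choice` and the use of a plain set for the allowed-CPU mask are assumptions made for the example. It covers just the first step (the `pinned`, `new_cpu` and `want_sd` assignments), not the later affine-wakeup or shares-balancing logic.

```python
def initial_wake_choice(cpus_allowed, prev_cpu, cpu):
    """Model the patch's first step for a waking task (illustrative only).

    cpus_allowed: set of CPU ids the task may run on (stands in for p->cpus_allowed)
    prev_cpu:     CPU the task last ran on (task_cpu(p) in the patch)
    cpu:          CPU performing the wakeup (smp_processor_id() in the patch)
    """
    # Mirrors: pinned = !(cpumask_weight(&p->cpus_allowed) > 1)
    pinned = not (len(cpus_allowed) > 1)
    # A pinned task can only go back to its previous CPU;
    # an unpinned task starts from the waking CPU.
    new_cpu = prev_cpu if pinned else cpu
    # Only unpinned tasks need the wider sched-domain search.
    want_sd = not pinned
    return pinned, new_cpu, want_sd
```

For a task restricted to one CPU the model reports it as pinned and keeps it on its previous CPU, which is the case where the patch skips the domain walk when it can.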

2009-11-20 07:59:54

by Zeno Davatz

Subject: Re: [Bug #14622] Second IDE device not found

On Thu, Nov 19, 2009 at 9:05 PM, David Miller <[email protected]> wrote:
> From: "Rafael J. Wysocki" <[email protected]>
> Date: Mon, 16 Nov 2009 23:37:47 +0100 (CET)
>
>> This message has been generated automatically as a part of a report
>> of recent regressions.
>>
>> The following bug entry is on the current list of known regressions
>> from 2.6.31. Please verify if it still should be listed and let me know
>> (either way).
>>
>>
>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14622
>> Subject : Second IDE device not found
>> Submitter : Zeno Davatz <[email protected]>
>> Date : 2009-11-11 17:31 (6 days old)
>> References : http://marc.info/?l=linux-kernel&m=125796105822353&w=4
>
> We're going to need more information to diagnose this.
>
> And linux-ide should have been at least CC:'d from the very
> beginning.
>
> From what I can discern the problem is introduced somewhere between
> 2.6.27 and 2.6.31, you have a Serverworks CSB5 and primarily the issue
> is that attaching or detaching your CD-ROM driver influences whether
> both of your disks are properly detected. Correct?
>
> You seem to have played around with using the IDE layer vs. the
> ATA layer. Can you see any difference in behavior if you try
> using just the IDE layer vs. just the ATA layer with the 2.6.31
> kernel?

Please see:

http://www.flickr.com/photos/zrr/4118682747/

and

http://www.flickr.com/photos/zrr/4119453092/

It makes no difference whether I choose ATA or only IDE.

I am also attaching my kernel .config.

I could also test with 2.6.32-rc8 (torvalds-git).

I suggest you send me two .config files, and I can test both with 2.6.31.6.

Thank you for your feedback.

Best
Zeno


Attachments:
.config (64.49 kB)

2009-11-20 08:41:47

by Jeff Garzik

Subject: Re: [Bug #14622] Second IDE device not found

On 11/20/2009 02:59 AM, Zeno Davatz wrote:
> On Thu, Nov 19, 2009 at 9:05 PM, David Miller<[email protected]> wrote:
>> From: "Rafael J. Wysocki"<[email protected]>
>> Date: Mon, 16 Nov 2009 23:37:47 +0100 (CET)
>>
>>> This message has been generated automatically as a part of a report
>>> of recent regressions.
>>>
>>> The following bug entry is on the current list of known regressions
>>> from 2.6.31. Please verify if it still should be listed and let me know
>>> (either way).
>>>
>>>
>>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14622
>>> Subject : Second IDE device not found
>>> Submitter : Zeno Davatz<[email protected]>
>>> Date : 2009-11-11 17:31 (6 days old)
>>> References : http://marc.info/?l=linux-kernel&m=125796105822353&w=4
>>
>> We're going to need more information to diagnose this.
>>
>> And linux-ide should have been at least CC:'d from the very
>> beginning.
>>
>> From what I can discern the problem is introduced somewhere between
>> 2.6.27 and 2.6.31, you have a Serverworks CSB5 and primarily the issue
>> is that attaching or detaching your CD-ROM driver influences whether
>> both of your disks are properly detected. Correct?
>>
>> You seem to have played around with using the IDE layer vs. the
>> ATA layer. Can you see any difference in behavior if you try
>> using just the IDE layer vs. just the ATA layer with the 2.6.31
>> kernel?
>
> Please see:
>
> http://www.flickr.com/photos/zrr/4118682747/
>
> and
>
> http://www.flickr.com/photos/zrr/4119453092/

Unfortunately, both of these photos only show that MD (block major 9)
device could not be found for root.

For either ATA or IDE, we would need to see full dmesg somehow --
perhaps capturing serial console output? (Documentation/serial-console.txt)


> It makes no difference whether I choose ATA or only IDE.
>
> I am also attaching my kernel .config.
>
> I could also test with 2.6.32-rc8 (torvalds-git).
>
> I suggest you send me two .config files, and I can test both with 2.6.31.6.

I bet libata fails because CONFIG_BLK_DEV_SD is not enabled. Probably
want to enable CONFIG_BLK_DEV_SR too, for libata CD-ROM support. I
would be interested to see your failing ATA config, with IDE disabled.

Jeff

2009-11-20 09:29:04

by Zeno Davatz

Subject: Re: [Bug #14622] Second IDE device not found

On Fri, Nov 20, 2009 at 9:41 AM, Jeff Garzik <[email protected]> wrote:
> On 11/20/2009 02:59 AM, Zeno Davatz wrote:
>>
>> On Thu, Nov 19, 2009 at 9:05 PM, David Miller<[email protected]> wrote:
>>>
>>> From: "Rafael J. Wysocki"<[email protected]>
>>> Date: Mon, 16 Nov 2009 23:37:47 +0100 (CET)
>>>
>>>> This message has been generated automatically as a part of a report
>>>> of recent regressions.
>>>>
>>>> The following bug entry is on the current list of known regressions
>>>> from 2.6.31. Please verify if it still should be listed and let me know
>>>> (either way).
>>>>
>>>>
>>>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14622
>>>> Subject : Second IDE device not found
>>>> Submitter : Zeno Davatz<[email protected]>
>>>> Date : 2009-11-11 17:31 (6 days old)
>>>> References : http://marc.info/?l=linux-kernel&m=125796105822353&w=4
>>>
>>> We're going to need more information to diagnose this.
>>>
>>> And linux-ide should have been at least CC:'d from the very
>>> beginning.
>>>
>>> From what I can discern the problem is introduced somewhere between
>>> 2.6.27 and 2.6.31, you have a Serverworks CSB5 and primarily the issue
>>> is that attaching or detaching your CD-ROM driver influences whether
>>> both of your disks are properly detected. Correct?
>>>
>>> You seem to have played around with using the IDE layer vs. the
>>> ATA layer. Can you see any difference in behavior if you try
>>> using just the IDE layer vs. just the ATA layer with the 2.6.31
>>> kernel?
>>
>> Please see:
>>
>> http://www.flickr.com/photos/zrr/4118682747/
>>
>> and
>>
>> http://www.flickr.com/photos/zrr/4119453092/
>
> Unfortunately, both of these photos only show that MD (block major 9) device
> could not be found for root.
>
> For either ATA or IDE, we would need to see full dmesg somehow -- perhaps
> capturing serial console output? (Documentation/serial-console.txt)

Ok, I appended

console=ttyS1,9600 console=tty0

to the kernel command line at lilo after choosing the kernel image and
I get a ton of output. But how do I save the output to a file?

>> It makes no difference whether I choose ATA or only IDE.
>>
>> I am also attaching my kernel .config.
>>
>> I could also test with 2.6.32-rc8 (torvalds-git).
>>
>> I suggest you send me two .config files, and I can test both with 2.6.31.6.
>
> I bet libata fails because CONFIG_BLK_DEV_SD is not enabled. Probably want
> to enable CONFIG_BLK_DEV_SR too, for libata CD-ROM support. I would be
> interested to see your failing ATA config, with IDE disabled.

I enabled CONFIG_BLK_DEV_SR. Do you want the above test with or without
CONFIG_BLK_DEV_SVWKS (OSB4/CSB5) enabled?

Best
Zeno

2009-11-20 11:31:43

by Jeff Garzik

Subject: Re: [Bug #14622] Second IDE device not found

On 11/20/2009 04:29 AM, Zeno Davatz wrote:
> On Fri, Nov 20, 2009 at 9:41 AM, Jeff Garzik<[email protected]> wrote:
>> On 11/20/2009 02:59 AM, Zeno Davatz wrote:
>>>
>>> On Thu, Nov 19, 2009 at 9:05 PM, David Miller<[email protected]> wrote:
>>>>
>>>> From: "Rafael J. Wysocki"<[email protected]>
>>>> Date: Mon, 16 Nov 2009 23:37:47 +0100 (CET)
>>>>
>>>>> This message has been generated automatically as a part of a report
>>>>> of recent regressions.
>>>>>
>>>>> The following bug entry is on the current list of known regressions
>>>>> from 2.6.31. Please verify if it still should be listed and let me know
>>>>> (either way).
>>>>>
>>>>>
>>>>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14622
>>>>> Subject : Second IDE device not found
>>>>> Submitter : Zeno Davatz<[email protected]>
>>>>> Date : 2009-11-11 17:31 (6 days old)
>>>>> References : http://marc.info/?l=linux-kernel&m=125796105822353&w=4
>>>>
>>>> We're going to need more information to diagnose this.
>>>>
>>>> And linux-ide should have been at least CC:'d from the very
>>>> beginning.
>>>>
>>>> From what I can discern the problem is introduced somewhere between
>>>> 2.6.27 and 2.6.31, you have a Serverworks CSB5 and primarily the issue
>>>> is that attaching or detaching your CD-ROM driver influences whether
>>>> both of your disks are properly detected. Correct?
>>>>
>>>> You seem to have played around with using the IDE layer vs. the
>>>> ATA layer. Can you see any difference in behavior if you try
>>>> using just the IDE layer vs. just the ATA layer with the 2.6.31
>>>> kernel?
>>>
>>> Please see:
>>>
>>> http://www.flickr.com/photos/zrr/4118682747/
>>>
>>> and
>>>
>>> http://www.flickr.com/photos/zrr/4119453092/
>>
>> Unfortunately, both of these photos only show that MD (block major 9) device
>> could not be found for root.
>>
>> For either ATA or IDE, we would need to see full dmesg somehow -- perhaps
>> capturing serial console output? (Documentation/serial-console.txt)
>
> Ok, I appended
>
> console=ttyS1,9600 console=tty0
>
> to the kernel command line at lilo after choosing the kernel image and
> I get a ton of output. But how do I save the output to a file?

I use minicom and a null modem serial cable. One of minicom's commands
will capture everything sent to the serial port, to a file. Other
options are available. Googling for "linux serial console" found
several useful starting-point links.

The basic idea is to send console output to a serial port, and then have
some method of reading and capturing the serial port's data.

Another alternative is netconsole (google for "linux netconsole"), which
permits output over the network.
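
As a rough illustration of the "read and capture" half of that idea, here is a minimal Python sketch that could run on a second machine connected through the null modem cable. It is not what minicom does internally; the device path `/dev/ttyS0`, the helper names `open_serial` and `capture`, and the log file name are assumptions for the example. The 9600 baud default matches the `console=ttyS1,9600` setting used on the machine under test.

```python
import os
import termios
import tty


def open_serial(path="/dev/ttyS0", speed=termios.B9600):
    """Open a serial device read-only, in raw mode, at the given speed."""
    fd = os.open(path, os.O_RDONLY | os.O_NOCTTY)
    tty.setraw(fd)                    # no echo, no line editing
    attrs = termios.tcgetattr(fd)
    attrs[4] = attrs[5] = speed       # input speed and output speed
    termios.tcsetattr(fd, termios.TCSANOW, attrs)
    return os.fdopen(fd, "rb", buffering=0)


def capture(stream, log_path, chunk_size=4096):
    """Copy everything readable from `stream` into `log_path`.

    Returns the number of bytes written. A real serial port normally
    blocks instead of reaching end-of-file, so stop it with Ctrl-C.
    """
    total = 0
    with open(log_path, "wb") as log:
        while True:
            chunk = stream.read(chunk_size)
            if not chunk:
                break
            log.write(chunk)
            log.flush()               # keep the log usable if the boot hangs
            total += len(chunk)
    return total
```

A typical use would be `capture(open_serial(), "boot.log")` started on the capturing machine before the test machine boots.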


>>> It makes no difference whether I choose ATA or only IDE.
>>>
>>> I am also attaching my kernel .config.
>>>
>>> I could also test with 2.6.32-rc8 (torvalds-git).
>>>
>>> I suggest you send me two .config files, and I can test both with 2.6.31.6.
>>
>> I bet libata fails because CONFIG_BLK_DEV_SD is not enabled. Probably want
>> to enable CONFIG_BLK_DEV_SR too, for libata CD-ROM support. I would be
>> interested to see your failing ATA config, with IDE disabled.
>
> I enabled CONFIG_BLK_DEV_SR. Do you want the above test with or without
> CONFIG_BLK_DEV_SVWKS (OSB4/CSB5) enabled?

Well, this goes back to David's basic request: IDE-only or ATA-only.

You really, really, really should not enable both at the same time.

If you are choosing ATA-only (libata), then you should disable
CONFIG_IDE and everything associated with CONFIG_IDE, including
CONFIG_BLK_DEV_SVWKS.

libata will want something like
CONFIG_ATA
CONFIG_ATA_VERBOSE_ERROR <-- optional, but helpful
CONFIG_PATA_SERVERWORKS

CONFIG_SCSI
CONFIG_SCSI_LOGGING <-- ditto
CONFIG_SCSI_CONSTANTS <-- ditto
CONFIG_BLK_DEV_SD
CONFIG_BLK_DEV_SR <-- required only for CD-ROM support
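
The list above can be checked against a .config mechanically. The following Python sketch is one hedged way to do that; the helper names `enabled_options` and `check_libata_config` are made up for the example, and treating `=m` the same as `=y` is an assumption (a modular disk driver would additionally need to be available at boot to mount root). The optional logging options are left out of the required list.

```python
import re

# Required options from the list above; CONFIG_BLK_DEV_SR only matters
# when CD-ROM support is wanted.
LIBATA_REQUIRED = [
    "CONFIG_ATA",
    "CONFIG_PATA_SERVERWORKS",
    "CONFIG_SCSI",
    "CONFIG_BLK_DEV_SD",
    "CONFIG_BLK_DEV_SR",
]
# Options that should be off for an ATA-only (libata) kernel.
LIBATA_CONFLICTS = ["CONFIG_IDE", "CONFIG_BLK_DEV_SVWKS"]

_ENABLED = re.compile(r"^(CONFIG_[A-Za-z0-9_]+)=(y|m)$")


def enabled_options(config_text):
    """Return the set of options set to y or m in a kernel .config."""
    found = set()
    for line in config_text.splitlines():
        match = _ENABLED.match(line.strip())
        if match:
            found.add(match.group(1))
    return found


def check_libata_config(config_text):
    """Return (missing, conflicting) option lists for an ATA-only setup."""
    enabled = enabled_options(config_text)
    missing = [opt for opt in LIBATA_REQUIRED if opt not in enabled]
    conflicting = [opt for opt in LIBATA_CONFLICTS if opt in enabled]
    return missing, conflicting
```

Calling `check_libata_config(open(".config").read())` from the kernel source directory returns two lists; both should be empty for the ATA-only setup described above.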

Regards,

Jeff



2009-11-20 13:35:06

by Zeno Davatz

Subject: Re: [Bug #14622] Second IDE device not found

On Fri, Nov 20, 2009 at 12:31 PM, Jeff Garzik <[email protected]> wrote:

> I use minicom and a null modem serial cable. One of minicom's commands will
> capture everything sent to the serial port, to a file. Other options are
> available. Googling for "linux serial console" found several useful
> starting-point links.
>
> The basic idea is to send console output to a serial port, and then have
> some method of reading and capturing the serial port's data.
>
> Another alternative is netconsole (google for "linux netconsole"), which
> permits output over the network.

Ok, I will try to look into this.

> Well, this goes back to David's basic request: IDE-only or ATA-only.

I find it a bit irritating that CONFIG_IDE is listed as ATA in
make menuconfig. But I'm trying to understand your point.

> You really, really, really should not enable both at the same time.

I recompiled the kernel with all of the settings below enabled.

> If you are choosing ATA-only (libata), then you should disable CONFIG_IDE
> and everything associated with CONFIG_IDE, including CONFIG_BLK_DEV_SVWKS.

I disabled CONFIG_IDE this time.

> libata will want something like
> CONFIG_ATA
> CONFIG_ATA_VERBOSE_ERROR <-- optional, but helpful
> CONFIG_PATA_SERVERWORKS
>
> CONFIG_SCSI
> CONFIG_SCSI_LOGGING <-- ditto
> CONFIG_SCSI_CONSTANTS <-- ditto
> CONFIG_BLK_DEV_SD
> CONFIG_BLK_DEV_SR <-- required only for CD-ROM support

Recompiled with above enabled. And indeed: It worked. Booting as normal.

Thank you for the detailed instructions.

Best
Zeno

2009-11-20 15:06:48

by Theodore Ts'o

Subject: Re: [Bug #14619] ext3/jbd oops in journal_start

On Mon, Nov 16, 2009 at 11:37:46PM +0100, Rafael J. Wysocki wrote:
> This message has been generated automatically as a part of a report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.31. Please verify if it still should be listed and let me know
> (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14619
> Subject : ext3/jbd oops in journal_start
> Submitter : Sage Weil <[email protected]>
> Date : 2009-10-31 6:14 (17 days old)
> References : http://marc.info/?l=linux-kernel&m=125696970418300&w=4
>

Sage, any updates on this? What was the last kernel version where you
weren't having this problem? Sounds like you can't mount any ext3
file systems at all?

- Ted

2009-11-20 15:19:04

by Chris Mason

Subject: Re: [Bug #14619] ext3/jbd oops in journal_start

On Fri, Nov 20, 2009 at 10:06:48AM -0500, [email protected] wrote:
> On Mon, Nov 16, 2009 at 11:37:46PM +0100, Rafael J. Wysocki wrote:
> > This message has been generated automatically as a part of a report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.31. Please verify if it still should be listed and let me know
> > (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14619
> > Subject : ext3/jbd oops in journal_start
> > Submitter : Sage Weil <[email protected]>
> > Date : 2009-10-31 6:14 (17 days old)
> > References : http://marc.info/?l=linux-kernel&m=125696970418300&w=4
> >
>
> Sage, any updates on this? What was the last kernel version where you
> weren't having this problem? Sounds like you can't mount any ext3
> file systems at all?

This is a btrfs bug and is fixed. I've updated the bugzilla.

-chris

2009-11-20 15:32:33

by Theodore Ts'o

Subject: Re: [Bug #14619] ext3/jbd oops in journal_start

On Fri, Nov 20, 2009 at 10:18:22AM -0500, Chris Mason wrote:
> > Sage, any updates on this? What was the last kernel version where you
> > weren't having this problem? Sounds like you can't mount any ext3
> > file systems at all?
>
> This is a btrfs bug and is fixed. I've updated the bugzilla.

Thanks! I remember seeing that but I didn't connect it to this
bugzilla entry.

Just trying to do my part to clear out open regressions after Linus
sent out his pre-Thanksgiving grump. :-)

- Ted

2009-11-20 17:45:25

by David Miller

Subject: Re: [Bug #14622] Second IDE device not found

From: Zeno Davatz <[email protected]>
Date: Fri, 20 Nov 2009 14:35:07 +0100

> Recompiled with above enabled. And indeed: It worked. Booting as normal.
>
> Thank you for the detailed instructions.

Rafael, I think we can close this entry now.

2009-11-20 20:40:59

by Rafael J. Wysocki

Subject: Re: [Bug #14622] Second IDE device not found

On Friday 20 November 2009, David Miller wrote:
> From: Zeno Davatz <[email protected]>
> Date: Fri, 20 Nov 2009 14:35:07 +0100
>
> > Recompiled with above enabled. And indeed: It worked. Booting as normal.
> >
> > Thank you for the detailed instructions.
>
> Rafael, I think we can close this entry now.

Yup, closed.

Thanks,
Rafael

2009-11-21 06:21:48

by Soeren Sonnenburg

Subject: Re: [Bug #14626] oops on boot starting udev

On Wed, 2009-11-18 at 18:59 -0800, Dmitry Torokhov wrote:
> On Tue, Nov 17, 2009 at 05:06:47AM +0100, Soeren Sonnenburg wrote:
> > On Mon, 2009-11-16 at 20:01 -0800, Dmitry Torokhov wrote:
> > > On Tue, Nov 17, 2009 at 03:59:03AM +0100, Soeren Sonnenburg wrote:
> > > > On Mon, 2009-11-16 at 18:04 -0800, Dmitry Torokhov wrote:
> > > > > On Mon, Nov 16, 2009 at 05:14:55PM -0800, Greg KH wrote:
> > > > > > On Mon, Nov 16, 2009 at 11:37:48PM +0100, Rafael J. Wysocki wrote:
> > > > > > > This message has been generated automatically as a part of a report
> > > > > > > of recent regressions.
> > > > > > >
> > > > > > > The following bug entry is on the current list of known regressions
> > > > > > > from 2.6.31. Please verify if it still should be listed and let me know
> > > > > > > (either way).
> > > > > > >
> > > > > > >
> > > > > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14626
> > > > > > > Subject : oops on boot starting udev
> > > > > > > Submitter : Soeren Sonnenburg <[email protected]>
> > > > > > > Date : 2009-11-14 10:16 (3 days old)
> > > > > > > References : http://marc.info/?l=linux-kernel&m=125819380206800&w=4
> > > > > >
> > > > > > This looks like an input core problem, as the evdev module was just
> > > > > > loaded and died.
> > > > > >
> > > > > > Any input developers have any ideas?
> > > > > >
> > > > >
> > > > >
> > > > > Hmm, evdev does:
> > > > >
> > > > > dev_set_name(&evdev->dev, "event%d", minor);
> > > > >
> > > > > Not sure how it can go wrong...
> > > >
> > > > Anything I should/could do to narrow it down a bit (apart from
> > > > bisecting?).
> > > >
> > >
> > > Umm, I looked through the changes between -rc6 and 7 but nothing jumped
> > > out at me... You don't happen to have any local changes in your tree?
> >
> > Well only the mouse button #1 emulation - though I don't see what could
> > go wrong there.
> >
>
> I have been looking through the changes and I really don't see anything
> suspicious. I am also not hitting this oops on any of my boxes. Any
> chance you could bisect?

I cannot promise that I will find the time to do this :/ One thing I
noticed is that applesmc seems to freak out every now and then on boot
(after the oopses). Only on this MacBook Pro.

Soeren

PS: I don't have this oops on a desktop machine either.
--
For the one fact about the future of which we can be certain is that it
will be utterly fantastic. -- Arthur C. Clarke, 1962

2009-11-21 08:56:54

by Soeren Sonnenburg

Subject: Re: [Bug #14626] oops on boot starting udev

On Wed, 2009-11-18 at 18:59 -0800, Dmitry Torokhov wrote:
> On Tue, Nov 17, 2009 at 05:06:47AM +0100, Soeren Sonnenburg wrote:
> > On Mon, 2009-11-16 at 20:01 -0800, Dmitry Torokhov wrote:
> > > On Tue, Nov 17, 2009 at 03:59:03AM +0100, Soeren Sonnenburg wrote:
> > > > On Mon, 2009-11-16 at 18:04 -0800, Dmitry Torokhov wrote:
> > > > > On Mon, Nov 16, 2009 at 05:14:55PM -0800, Greg KH wrote:
> > > > > > On Mon, Nov 16, 2009 at 11:37:48PM +0100, Rafael J. Wysocki wrote:
> > > > > > > This message has been generated automatically as a part of a report
> > > > > > > of recent regressions.
> > > > > > >
> > > > > > > The following bug entry is on the current list of known regressions
> > > > > > > from 2.6.31. Please verify if it still should be listed and let me know
> > > > > > > (either way).
> > > > > > >
> > > > > > >
> > > > > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14626
> > > > > > > Subject : oops on boot starting udev
> > > > > > > Submitter : Soeren Sonnenburg <[email protected]>
> > > > > > > Date : 2009-11-14 10:16 (3 days old)
> > > > > > > References : http://marc.info/?l=linux-kernel&m=125819380206800&w=4
> > > > > >
> > > > > > This looks like an input core problem, as the evdev module was just
> > > > > > loaded and died.
> > > > > >
> > > > > > Any input developers have any ideas?
> > > > > >
> > > > >
> > > > >
> > > > > Hmm, evdev does:
> > > > >
> > > > > dev_set_name(&evdev->dev, "event%d", minor);
> > > > >
> > > > > Not sure how it can go wrong...
> > > >
> > > > Anything I should/could do to narrow it down a bit (apart from
> > > > bisecting?).
> > > >
> > >
> > > Umm, I looked through the changes between -rc6 and 7 but nothing jumped
> > > out at me... You don't happen to have any local changes in your tree?
> >
> > Well only the mouse button #1 emulation - though I don't see what could
> > go wrong there.
> >
>
> I have been looking through the changes and I really don't see anything
> suspicious. I am also not hitting this oops on any of my boxes. Any
> chance you could bisect?
>
> Thanks.

Alright, so I tried to do a bisect when I noticed that building a
known-to-work -rc5 no longer worked either. I thought it might be a gcc
problem (gcc-4.3 here), so I upgraded to 4.4: same thing.
Then I realized that it crashes on loading basically *any* module;
I tried tun and applesmc. Attaching the crashes...

I am starting to run out of ideas...

Soeren
--
For the one fact about the future of which we can be certain is that it
will be utterly fantastic. -- Arthur C. Clarke, 1962


Attachments:
oops-applesmc (52.07 kB)
oops-tun (51.23 kB)

2009-11-21 09:29:34

by Justin P. Mattock

Subject: Re: [Bug #14626] oops on boot starting udev

Soeren Sonnenburg wrote:
> On Wed, 2009-11-18 at 18:59 -0800, Dmitry Torokhov wrote:
>
>> On Tue, Nov 17, 2009 at 05:06:47AM +0100, Soeren Sonnenburg wrote:
>>
>>> On Mon, 2009-11-16 at 20:01 -0800, Dmitry Torokhov wrote:
>>>
>>>> On Tue, Nov 17, 2009 at 03:59:03AM +0100, Soeren Sonnenburg wrote:
>>>>
>>>>> On Mon, 2009-11-16 at 18:04 -0800, Dmitry Torokhov wrote:
>>>>>
>>>>>> On Mon, Nov 16, 2009 at 05:14:55PM -0800, Greg KH wrote:
>>>>>>
>>>>>>> On Mon, Nov 16, 2009 at 11:37:48PM +0100, Rafael J. Wysocki wrote:
>>>>>>>
>>>>>>>> This message has been generated automatically as a part of a report
>>>>>>>> of recent regressions.
>>>>>>>>
>>>>>>>> The following bug entry is on the current list of known regressions
>>>>>>>> from 2.6.31. Please verify if it still should be listed and let me know
>>>>>>>> (either way).
>>>>>>>>
>>>>>>>>
>>>>>>>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14626
>>>>>>>> Subject : oops on boot starting udev
>>>>>>>> Submitter : Soeren Sonnenburg<[email protected]>
>>>>>>>> Date : 2009-11-14 10:16 (3 days old)
>>>>>>>> References : http://marc.info/?l=linux-kernel&m=125819380206800&w=4
>>>>>>>>
>>>>>>> This looks like an input core problem, as the evdev module was just
>>>>>>> loaded and died.
>>>>>>>
>>>>>>> Any input developers have any ideas?
>>>>>>>
>>>>>>>
>>>>>> Hmm, evdev does:
>>>>>>
>>>>>> dev_set_name(&evdev->dev, "event%d", minor);
>>>>>>
>>>>>> Not sure how it can go wrong...
>>>>>>
>>>>> Anything I should/could do to narrow it down a bit (apart from
>>>>> bisecting?).
>>>>>
>>>>>
>>>> Umm, I looked through the changes between -rc6 and 7 but nothing jumped
>>>> out at me... You don't happen to have any local changes in your tree?
>>>>
>>> Well only the mouse button #1 emulation - though I don't see what could
>>> go wrong there.
>>>
>>>
>> I have been looking through the changes and I really don't see anything
>> suspicious. I am also not hitting this oops on any of my boxes. Any
>> chance you could bisect?
>>
>> Thanks.
>>
>
> Alright, so I tried to do a bisect when I noticed that building a
> known-to-work -rc5 no longer worked either. I thought it might be a gcc
> problem (gcc-4.3 here), so I upgraded to 4.4: same thing.
> Then I realized that it crashes on loading basically *any* module;
> I tried tun and applesmc. Attaching the crashes...
>
> I am starting to run out of ideas...
>
> Soeren
>

From what I remember, the "wait status failed" debug message was
removed from the kernel (but I could be wrong).
Could you maybe have some type of userspace thing going on
causing this? I.e., I'm running the latest git (on a MacBook)
with a from-scratch system and see nothing of that sort,
or at least can't reproduce your error.

Justin P. Mattock

2009-11-21 09:35:09

by Soeren Sonnenburg

Subject: [SOLVED] kernel module loading does not work with binutils-gold (was Re: [Bug #14626] oops on boot starting udev)

On Sat, 2009-11-21 at 09:56 +0100, Soeren Sonnenburg wrote:
> On Wed, 2009-11-18 at 18:59 -0800, Dmitry Torokhov wrote:
> > On Tue, Nov 17, 2009 at 05:06:47AM +0100, Soeren Sonnenburg wrote:
> > > On Mon, 2009-11-16 at 20:01 -0800, Dmitry Torokhov wrote:
> > > > On Tue, Nov 17, 2009 at 03:59:03AM +0100, Soeren Sonnenburg wrote:
> > > > > On Mon, 2009-11-16 at 18:04 -0800, Dmitry Torokhov wrote:
> > > > > > On Mon, Nov 16, 2009 at 05:14:55PM -0800, Greg KH wrote:
> > > > > > > On Mon, Nov 16, 2009 at 11:37:48PM +0100, Rafael J. Wysocki wrote:
> > > > > > > > This message has been generated automatically as a part of a report
> > > > > > > > of recent regressions.
> > > > > > > >
> > > > > > > > The following bug entry is on the current list of known regressions
> > > > > > > > from 2.6.31. Please verify if it still should be listed and let me know
> > > > > > > > (either way).
> > > > > > > >
> > > > > > > >
> > > > > > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14626
> > > > > > > > Subject : oops on boot starting udev
> > > > > > > > Submitter : Soeren Sonnenburg <[email protected]>
> > > > > > > > Date : 2009-11-14 10:16 (3 days old)
> > > > > > > > References : http://marc.info/?l=linux-kernel&m=125819380206800&w=4
> > > > > > >
> > > > > > > This looks like an input core problem, as the evdev module was just
> > > > > > > loaded and died.
> > > > > > >
> > > > > > > Any input developers have any ideas?
> > > > > > >
> > > > > >
> > > > > >
> > > > > > Hmm, evdev does:
> > > > > >
> > > > > > dev_set_name(&evdev->dev, "event%d", minor);
> > > > > >
> > > > > > Not sure how it can go wrong...
> > > > >
> > > > > Anything I should/could do to narrow it down a bit (apart from
> > > > > bisecting?).
> > > > >
> > > >
> > > > Umm, I looked through the changes between -rc6 and 7 but nothing jumped
> > > > out at me... You don't happen to have any local changes in your tree?
> > >
> > > Well only the mouse button #1 emulation - though I don't see what could
> > > go wrong there.
> > >
> >
> > I have been looking through the changes and I really don't see anything
> > suspicious. I am also not hitting this oops on any of my boxes. Any
> > chance you could bisect?
> >
> > Thanks.
>
> Alright, so I tried to do a bisect when I noticed that building a
> known-to-work -rc5 no longer worked either. I thought it might be a gcc
> problem (gcc-4.3 here), so I upgraded to 4.4: same thing.
> Then I realized that it crashes on loading basically *any* module;
> I tried tun and applesmc. Attaching the crashes...
>
> I am starting to run out of ideas...

OK, I've found the culprit: binutils-gold

I built all kernels up to and including -rc6 with the old binutils and
have since upgraded to binutils-gold 2.20-4 which, in contrast to
the old binutils, uses --no-add-needed by default.

So I suspect it triggers an error(?) in the way the kernel links
modules: it is now required to provide all needed libraries to the
linker when building the modules. I guess this problem could be worked
around by adding --add-needed to LDFLAGS_MODULE ...

Soeren
--
For the one fact about the future of which we can be certain is that it
will be utterly fantastic. -- Arthur C. Clarke, 1962
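
A quick way to tell which linker a build host is using is to look at the first line of `ld --version`. The following is only a minimal sketch: the helper names are hypothetical, and the sample banner strings in the usage note are illustrative rather than copied from this thread. It assumes gold identifies itself with a banner starting with "GNU gold" and the traditional linker with "GNU ld".

```python
import subprocess


def linker_flavor(version_text: str) -> str:
    """Classify the output of `ld --version` as 'gold', 'bfd' or 'unknown'."""
    lines = version_text.strip().splitlines()
    first_line = lines[0] if lines else ""
    if first_line.startswith("GNU gold"):
        return "gold"
    if first_line.startswith("GNU ld"):
        return "bfd"
    return "unknown"


def installed_linker(ld: str = "ld") -> str:
    """Run the given linker binary and classify it (requires binutils on PATH)."""
    result = subprocess.run([ld, "--version"], capture_output=True,
                            text=True, check=True)
    return linker_flavor(result.stdout)
```

For example, `linker_flavor("GNU gold (GNU Binutils 2.20) 1.9")` returns `"gold"`, while calling `installed_linker()` on the build machine reports what `ld` currently resolves to there.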

2009-11-21 10:04:02

by Justin P. Mattock

Subject: Re: [SOLVED] kernel module loading does not work with binutils-gold (was Re: [Bug #14626] oops on boot starting udev)

Soeren Sonnenburg wrote:
> On Sat, 2009-11-21 at 09:56 +0100, Soeren Sonnenburg wrote:
>
>> On Wed, 2009-11-18 at 18:59 -0800, Dmitry Torokhov wrote:
>>
>>> On Tue, Nov 17, 2009 at 05:06:47AM +0100, Soeren Sonnenburg wrote:
>>>
>>>> On Mon, 2009-11-16 at 20:01 -0800, Dmitry Torokhov wrote:
>>>>
>>>>> On Tue, Nov 17, 2009 at 03:59:03AM +0100, Soeren Sonnenburg wrote:
>>>>>
>>>>>> On Mon, 2009-11-16 at 18:04 -0800, Dmitry Torokhov wrote:
>>>>>>
>>>>>>> On Mon, Nov 16, 2009 at 05:14:55PM -0800, Greg KH wrote:
>>>>>>>
>>>>>>>> On Mon, Nov 16, 2009 at 11:37:48PM +0100, Rafael J. Wysocki wrote:
>>>>>>>>
>>>>>>>>> This message has been generated automatically as a part of a report
>>>>>>>>> of recent regressions.
>>>>>>>>>
>>>>>>>>> The following bug entry is on the current list of known regressions
>>>>>>>>> from 2.6.31. Please verify if it still should be listed and let me know
>>>>>>>>> (either way).
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14626
>>>>>>>>> Subject : oops on boot starting udev
>>>>>>>>> Submitter : Soeren Sonnenburg<[email protected]>
>>>>>>>>> Date : 2009-11-14 10:16 (3 days old)
>>>>>>>>> References : http://marc.info/?l=linux-kernel&m=125819380206800&w=4
>>>>>>>>>
>>>>>>>> This looks like an input core problem, as the evdev module was just
>>>>>>>> loaded and died.
>>>>>>>>
>>>>>>>> Any input developers have any ideas?
>>>>>>>>
>>>>>>>>
>>>>>>> Hmm, evdev does:
>>>>>>>
>>>>>>> dev_set_name(&evdev->dev, "event%d", minor);
>>>>>>>
>>>>>>> Not sure how it can go wrong...
>>>>>>>
>>>>>> Anything I should/could do to narrow it down a bit (apart from
>>>>>> bisecting?).
>>>>>>
>>>>>>
>>>>> Umm, I looked through the changes between -rc6 and 7 but nothing jumped
>>>>> out at me... You don't happen to have any local changes in your tree?
>>>>>
>>>> Well only the mouse button #1 emulation - though I don't see what could
>>>> go wrong there.
>>>>
>>>>
>>> I have been looking through the changes and I really don't see anything
>>> suspicious. I am also not hitting this oops on any of my boxes. Any
>>> chance you could bisect?
>>>
>>> Thanks.
>>>
>> Alright, so I tried to do a bisect when I noticed that building a
>> known-to-work -rc5 no longer worked either. Thought it might be a gcc
>> problem (gcc-4.3 here) so upgraded to 4.4 - same thing.
>> Then I realized that it crashes on loading basically *any* module,
>> tried tun and applesmc. Attaching the crashes...
>>
>> I am starting to run out of ideas...
>>
>
> OK, I've found the culprit: binutils-gold
>
> I built all kernels up to and including -rc6 with the old binutils and
> have since upgraded to binutils-gold 2.20-4 which, in contrast to
> the old binutils, uses --no-add-needed by default.
>
> So I suspect it triggers an error(?) in the way the kernel links
> modules: it is now required to provide all needed libraries to the
> linker when building the modules. I guess this problem could be worked
> around by adding --add-needed to the LDFLAGS_MODULE ...
>
> Soeren
>
Tough to say... somehow you're hitting
__wait_status during your initial boot.

Looking at the comment (in applesmc.c):
__wait_status - Wait up to 32ms for the status port to get a certain value
* (masked with 0x0f), returning zero if the value is obtained.

Maybe you're hitting a different value because of binutils.
(Keep in mind I have the latest binutils running on the MacBook,
but nothing switched to gold at compile time.)

Justin P. Mattock


2009-11-21 10:08:31

by Soeren Sonnenburg

Subject: Re: [SOLVED] kernel module loading does not work with binutils-gold (was Re: [Bug #14626] oops on boot starting udev)

On Sat, 2009-11-21 at 01:58 -0800, Justin P. Mattock wrote:
> Soeren Sonnenburg wrote:
> > On Sat, 2009-11-21 at 09:56 +0100, Soeren Sonnenburg wrote:
> >
> >> On Wed, 2009-11-18 at 18:59 -0800, Dmitry Torokhov wrote:
> >>
> >>> On Tue, Nov 17, 2009 at 05:06:47AM +0100, Soeren Sonnenburg wrote:
> >>>
> >>>> On Mon, 2009-11-16 at 20:01 -0800, Dmitry Torokhov wrote:
> >>>>
> >>>>> On Tue, Nov 17, 2009 at 03:59:03AM +0100, Soeren Sonnenburg wrote:
> >>>>>
> >>>>>> On Mon, 2009-11-16 at 18:04 -0800, Dmitry Torokhov wrote:
> >>>>>>
> >>>>>>> On Mon, Nov 16, 2009 at 05:14:55PM -0800, Greg KH wrote:
> >>>>>>>
> >>>>>>>> On Mon, Nov 16, 2009 at 11:37:48PM +0100, Rafael J. Wysocki wrote:
> >>>>>>>>
> >>>>>>>>> This message has been generated automatically as a part of a report
> >>>>>>>>> of recent regressions.
> >>>>>>>>>
> >>>>>>>>> The following bug entry is on the current list of known regressions
> >>>>>>>>> from 2.6.31. Please verify if it still should be listed and let me know
> >>>>>>>>> (either way).
> >>>>>>>>>
> >>>>>>>>>
> >>>>>>>>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14626
> >>>>>>>>> Subject : oops on boot starting udev
> >>>>>>>>> Submitter : Soeren Sonnenburg<[email protected]>
> >>>>>>>>> Date : 2009-11-14 10:16 (3 days old)
> >>>>>>>>> References : http://marc.info/?l=linux-kernel&m=125819380206800&w=4
> >>>>>>>>>
> >>>>>>>> This looks like an input core problem, as the evdev module was just
> >>>>>>>> loaded and died.
> >>>>>>>>
> >>>>>>>> Any input developers have any ideas?
> >>>>>>>>
> >>>>>>>>
> >>>>>>> Hmm, evdev does:
> >>>>>>>
> >>>>>>> dev_set_name(&evdev->dev, "event%d", minor);
> >>>>>>>
> >>>>>>> Not sure how it can go wrong...
> >>>>>>>
> >>>>>> Anything I should/could do to narrow it down a bit (apart from
> >>>>>> bisecting?).
> >>>>>>
> >>>>>>
> >>>>> Umm, I looked through the changes between -rc6 and 7 but nothing jumped
> >>>>> out at me... You don't happen to have any local changes in your tree?
> >>>>>
> >>>> Well only the mouse button #1 emulation - though I don't see what could
> >>>> go wrong there.
> >>>>
> >>>>
> >>> I have been looking through the changes and I really don't see anything
> >>> suspicious. I am also not hitting this oops on any of my boxes. Any
> >>> chance you could bisect?
> >>>
> >>> Thanks.
> >>>
> >> Alright, so I tried to do a bisect when I noticed that building a
> >> known-to-work -rc5 no longer worked either. Thought it might be a gcc
> >> problem (gcc-4.3 here) so upgraded to 4.4 - same thing.
> >> Then I realized that it crashes on loading basically *any* module,
> >> tried tun and applesmc. Attaching the crashes...
> >>
> >> I am starting to run out of ideas...
> >>
> >
> > OK, I've found the culprit: binutils-gold
> >
> > I built all kernels up to and including -rc6 with the old binutils and
> > have since upgraded to binutils-gold 2.20-4 which, in contrast to
> > the old binutils, uses --no-add-needed by default.
> >
> > So I suspect it triggers an error(?) in the way the kernel links
> > modules: it is now required to provide all needed libraries to the
> > linker when building the modules. I guess this problem could be worked
> > around by adding --add-needed to the LDFLAGS_MODULE ...
> >
> > Soeren
> >
> Tough to say... somehow you're hitting
> __wait_status during your initial boot.
>
> Looking at the comment (in applesmc.c):
> __wait_status - Wait up to 32ms for the status port to get a certain value
> * (masked with 0x0f), returning zero if the value is obtained.
>
> Maybe you're hitting a different value because of binutils.

It could be anything missing...

> (Keep in mind I have the latest binutils running on the MacBook,
> but nothing switched to gold at compile time.)

Note that everything works fine with the old binutils here too. You will
need binutils-gold to see the problem, which is also described here:

http://wiki.debian.org/qa.debian.org/FTBFS#A2009-11-02Packagesfailingbecausebinutils-gold.2BAC8-indirectlinking

Soeren
--
For the one fact about the future of which we can be certain is that it
will be utterly fantastic. -- Arthur C. Clarke, 1962



2009-11-21 10:28:21

by Justin P. Mattock

Subject: Re: [SOLVED] kernel module loading does not work with binutils-gold (was Re: [Bug #14626] oops on boot starting udev)

Soeren Sonnenburg wrote:
> On Sat, 2009-11-21 at 01:58 -0800, Justin P. Mattock wrote:
>
>> Soeren Sonnenburg wrote:
>>
>>> On Sat, 2009-11-21 at 09:56 +0100, Soeren Sonnenburg wrote:
>>>
>>>
>>>> On Wed, 2009-11-18 at 18:59 -0800, Dmitry Torokhov wrote:
>>>>
>>>>
>>>>> On Tue, Nov 17, 2009 at 05:06:47AM +0100, Soeren Sonnenburg wrote:
>>>>>
>>>>>
>>>>>> On Mon, 2009-11-16 at 20:01 -0800, Dmitry Torokhov wrote:
>>>>>>
>>>>>>
>>>>>>> On Tue, Nov 17, 2009 at 03:59:03AM +0100, Soeren Sonnenburg wrote:
>>>>>>>
>>>>>>>
>>>>>>>> On Mon, 2009-11-16 at 18:04 -0800, Dmitry Torokhov wrote:
>>>>>>>>
>>>>>>>>
>>>>>>>>> On Mon, Nov 16, 2009 at 05:14:55PM -0800, Greg KH wrote:
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>> On Mon, Nov 16, 2009 at 11:37:48PM +0100, Rafael J. Wysocki wrote:
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>> This message has been generated automatically as a part of a report
>>>>>>>>>>> of recent regressions.
>>>>>>>>>>>
>>>>>>>>>>> The following bug entry is on the current list of known regressions
>>>>>>>>>>> from 2.6.31. Please verify if it still should be listed and let me know
>>>>>>>>>>> (either way).
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14626
>>>>>>>>>>> Subject : oops on boot starting udev
>>>>>>>>>>> Submitter : Soeren Sonnenburg<[email protected]>
>>>>>>>>>>> Date : 2009-11-14 10:16 (3 days old)
>>>>>>>>>>> References : http://marc.info/?l=linux-kernel&m=125819380206800&w=4
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>> This looks like an input core problem, as the evdev module was just
>>>>>>>>>> loaded and died.
>>>>>>>>>>
>>>>>>>>>> Any input developers have any ideas?
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>> Hmm, evdev does:
>>>>>>>>>
>>>>>>>>> dev_set_name(&evdev->dev, "event%d", minor);
>>>>>>>>>
>>>>>>>>> Not sure how it can go wrong...
>>>>>>>>>
>>>>>>>>>
>>>>>>>> Anything I should/could do to narrow it down a bit (apart from
>>>>>>>> bisecting?).
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>> Umm, I looked through the changes between -rc6 and 7 but nothing jumped
>>>>>>> out at me... You don't happen to have any local changes in your tree?
>>>>>>>
>>>>>>>
>>>>>> Well only the mouse button #1 emulation - though I don't see what could
>>>>>> go wrong there.
>>>>>>
>>>>>>
>>>>>>
>>>>> I have been looking through the changes and I really don't see anything
>>>>> suspicious. I am also not hitting this oops on any of my boxes. Any
>>>>> chance you could bisect?
>>>>>
>>>>> Thanks.
>>>>>
>>>>>
>>>> Alright, so I tried to do a bisect when I noticed that building a
>>>> known-to-work -rc5 no longer worked either. Thought it might be a gcc
>>>> problem (gcc-4.3 here) so upgraded to 4.4 - same thing.
>>>> Then I realized that it crashes on loading basically *any* module,
>>>> tried tun and applesmc. Attaching the crashes...
>>>>
>>>> I am starting to run out of ideas...
>>>>
>>>>
>>> OK, I've found the culprit: binutils-gold
>>>
>>> I built all kernels up to and including -rc6 with the old binutils and
>>> have since upgraded to binutils-gold 2.20-4 which, in contrast to
>>> the old binutils, uses --no-add-needed by default.
>>>
>>> So I suspect it triggers an error(?) in the way the kernel links
>>> modules: it is now required to provide all needed libraries to the
>>> linker when building the modules. I guess this problem could be worked
>>> around by adding --add-needed to the LDFLAGS_MODULE ...
>>>
>>> Soeren
>>>
>>>
>> Tough to say... somehow you're hitting
>> __wait_status during your initial boot.
>>
>> Looking at the comment (in applesmc.c):
>> __wait_status - Wait up to 32ms for the status port to get a certain value
>> * (masked with 0x0f), returning zero if the value is obtained.
>>
>> Maybe you're hitting a different value because of binutils.
>>
>
> It could be anything missing...
>
>
>> (Keep in mind I have the latest binutils running on the MacBook,
>> but nothing switched to gold at compile time.)
>>
>
> Note that everything works fine with the old binutils here too. You will
> need binutils-gold to see the problem, which is also described here:
>
> http://wiki.debian.org/qa.debian.org/FTBFS#A2009-11-02Packagesfailingbecausebinutils-gold.2BAC8-indirectlinking
>
> Soeren
>
Well, I'd like to try building
gcc with the --disable-multilib switch,
but I can't because of the whole "gold factor".

Maybe somebody else who knows gold, gcc, etc.
can assist you, because I have no knowledge of that.
(I'll have to try building a system this way one day.)

Justin P. Mattock

2009-11-21 10:40:09

by Norbert Preining

Subject: Re: [Bug #14618] OOM killer, page fault

Hi everyone,

(as usual, please cc, thanks)

not that I ask for reopen ..

On Mon, 16 Nov 2009, Rafael J. Wysocki wrote:
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14618
> Subject : OOM killer, page fault

Well, the OOM killer hit although I had 60+% in cache.

I captured some more output:
/proc/meminfo, vmstat, zoneinfo plus the dmesg log

I have no idea how to read all that, but it looks like there should be
enough mem free. In fact I only had firefox and some gnome terminals
running.
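
For readers who want to check a figure like "60+% in cache" against a saved /proc/meminfo dump, a rough sketch follows. The function names and the sample numbers are made up for illustration, and treating Cached plus Buffers as "cache" is an assumption; it does not by itself say how much of that memory the kernel could actually reclaim.

```python
def parse_meminfo(text):
    """Parse /proc/meminfo-style text into a dict of integer values (kB or counts)."""
    values = {}
    for line in text.splitlines():
        key, sep, rest = line.partition(":")
        if not sep:
            continue  # skip lines that are not "Key: value" pairs
        fields = rest.split()
        if fields and fields[0].isdigit():
            values[key.strip()] = int(fields[0])
    return values


def cache_fraction(info):
    """Fraction of total memory held in page cache and buffers (assumed definition)."""
    return (info.get("Cached", 0) + info.get("Buffers", 0)) / info["MemTotal"]


# Made-up sample values, not taken from the report above.
SAMPLE = """\
MemTotal:        2000000 kB
MemFree:          100000 kB
Buffers:          200000 kB
Cached:          1000000 kB
"""

info = parse_meminfo(SAMPLE)
print(f"cache share: {cache_fraction(info):.0%}")
```

On a live system the same functions can be fed `open("/proc/meminfo").read()` instead of the sample string.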

Well, hope you see something in that.

BTW, that is 2.6.32-rc8 with the patches for showing some more info
in /proc:
0001-Add-recent-rotated-scanned-info-to-proc-zoneinfo.patch
proc-filecache-v2.patch


Best wishes

Norbert

-------------------------------------------------------------------------------
Dr. Norbert Preining Associate Professor
JAIST Japan Advanced Institute of Science and Technology [email protected]
Vienna University of Technology [email protected]
Debian Developer (Debian TeX Task Force) [email protected]
gpg DSA: 0x09C5B094 fp: 14DF 2E6C 0307 BE6D AD76 A9C0 D2BF 4AA3 09C5 B094
-------------------------------------------------------------------------------
GLEMENUILT (n.)
The kind of guilt which you'd completely forgotten about which comes
roaring back on discovering an old letter in a cupboard.
--- Douglas Adams, The Meaning of Liff

2009-11-27 13:46:17

by Sebastian Ott

Subject: Re: [Bug #14352] WARNING: at net/mac80211/scan.c:267


On Tue, 17 Nov 2009, Rafael J. Wysocki wrote:

> On Tuesday 17 November 2009, Maciej Rutecki wrote:
> > 2009/11/16 Rafael J. Wysocki <[email protected]>:
> > > This message has been generated automatically as a part of a report
> > > of recent regressions.
> > >
> > > The following bug entry is on the current list of known regressions
> > > from 2.6.31. Please verify if it still should be listed and let me know
> > > (either way).
> > >
> > >
> > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=14352
> > > Subject : WARNING: at net/mac80211/scan.c:267
> > > Submitter : Maciej Rutecki <[email protected]>
> > > Date : 2009-10-08 00:30 (40 days old)
> > > References : http://bugzilla.intellinuxwireless.org/show_bug.cgi?id=2089#c7
> > >
> > >
> >
> > In 2.6.32-rc7 the problem seems to be fixed.
>
> Thanks, closing.

looks like this one is still present in rc8:

[ 5724.754068] iwl3945 0000:03:00.0: Error sending REPLY_SCAN_CMD: time out after 500ms.
[ 5725.886986] ------------[ cut here ]------------
[ 5725.887021] WARNING: at net/mac80211/scan.c:267 ieee80211_scan_completed+0x48/0x198 [mac80211]()
[ 5725.887051] Hardware name: 8741J3G
[ 5725.887054] Modules linked in: fuse ipt_MASQUERADE iptable_nat nf_nat rfcomm sco bridge stp llc bnep l2cap nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc xt_physdev ip6t_REJECT nf_conntrack_ipv6 ip6table_filter ip6_tables ipv6 cpufreq_ondemand acpi_cpufreq dm_multipath kvm uinput snd_hda_codec_analog snd_hda_intel snd_hda_codec arc4 snd_hwdep snd_seq ecb iwl3945 snd_seq_device iwlcore snd_pcm snd_timer nsc_ircc mac80211 snd irda btusb iTCO_wdt soundcore iTCO_vendor_support bluetooth thinkpad_acpi video snd_page_alloc pcspkr output i2c_i801 joydev cfg80211 rfkill crc_ccitt yenta_socket rsrc_nonstatic e1000e radeon ttm drm_kms_helper drm i2c_algo_bit i2c_core [last unloaded: microcode]
[ 5725.887263] Pid: 436, comm: iwl3945 Not tainted 2.6.32-rc8 #6
[ 5725.887269] Call Trace:
[ 5725.887283] [<c0436c53>] warn_slowpath_common+0x6a/0x81
[ 5725.887308] [<f8bd0e8f>] ? ieee80211_scan_completed+0x48/0x198 [mac80211]
[ 5725.887318] [<c0436c7c>] warn_slowpath_null+0x12/0x15
[ 5725.887340] [<f8bd0e8f>] ieee80211_scan_completed+0x48/0x198 [mac80211]
[ 5725.887370] [<f8c97b81>] iwl_bg_scan_completed+0x97/0xcf [iwlcore]
[ 5725.887382] [<c044ad6b>] worker_thread+0x13f/0x1b7
[ 5725.887409] [<f8c97aea>] ? iwl_bg_scan_completed+0x0/0xcf [iwlcore]
[ 5725.887421] [<c044e535>] ? autoremove_wake_function+0x0/0x34
[ 5725.887431] [<c044ac2c>] ? worker_thread+0x0/0x1b7
[ 5725.887441] [<c044e2f7>] kthread+0x64/0x69
[ 5725.887450] [<c044e293>] ? kthread+0x0/0x69
[ 5725.887461] [<c040400f>] kernel_thread_helper+0x7/0x10
[ 5725.887469] ---[ end trace f4077df61007acfa ]---

regards, sebastian

2009-11-27 14:10:42

by Johannes Berg

Subject: Re: [Bug #14352] WARNING: at net/mac80211/scan.c:267

On Fri, 2009-11-27 at 14:46 +0100, Sebastian Ott wrote:

> [ 5724.754068] iwl3945 0000:03:00.0: Error sending REPLY_SCAN_CMD: time out after 500ms.

That tells you there's something wrong with the device -- and maybe the
driver is misbehaving by telling mac80211 that it couldn't scan but then
still calling scan_completed().

In any case, driver bug as far as I can tell.
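
The failure mode described above can be illustrated with a toy model. This is not mac80211 or iwl3945 code: the class, its methods, and the error value are hypothetical, and the assumption that the warning fires when a completion arrives while no scan is recorded as in progress is a simplification made for the sketch.

```python
class ToyScanState:
    """Toy stand-in for the stack's scan bookkeeping; not real mac80211 code."""

    def __init__(self):
        self.scanning = False
        self.warnings = []

    def start_scan(self, driver_hw_scan):
        """Ask the driver to scan; forget the scan again if the driver fails."""
        self.scanning = True
        err = driver_hw_scan()
        if err:
            # The driver said it could not scan, so nothing is in progress.
            self.scanning = False
        return err

    def scan_completed(self):
        """Completion callback from the driver."""
        if not self.scanning:
            # Assumed analogue of the WARN_ON() being hit in the reports.
            self.warnings.append("scan_completed() with no scan in progress")
            return
        self.scanning = False


def failing_driver_scan():
    return -5  # hypothetical error code, e.g. the scan command timed out


def working_driver_scan():
    return 0


buggy = ToyScanState()
buggy.start_scan(failing_driver_scan)
buggy.scan_completed()  # misbehaving driver still reports completion

good = ToyScanState()
good.start_scan(working_driver_scan)
good.scan_completed()

print(len(buggy.warnings), len(good.warnings))
```

In this model only the sequence "report failure, then still signal completion" records a warning; a scan that starts and completes normally does not.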

johannes



2009-11-27 14:11:54

by Johannes Berg

Subject: Re: [Bug #14352] WARNING: at net/mac80211/scan.c:267

On Fri, 2009-11-27 at 15:10 +0100, Johannes Berg wrote:
> On Fri, 2009-11-27 at 14:46 +0100, Sebastian Ott wrote:
>
> > [ 5724.754068] iwl3945 0000:03:00.0: Error sending REPLY_SCAN_CMD: time out after 500ms.
>
> That tells you there's something wrong with the device -- and maybe the
> driver is misbehaving by telling mac80211 that it couldn't scan but then
> still calling scan_completed().
>
> In any case, driver bug as far as I can tell.

Oh and it's also not /this/ bug for sure, it just happens to hit the
same WARN_ON(), but the cause is very different.

johannes



2009-11-27 20:22:40

by Maciej Rutecki

Subject: Re: [Bug #14352] WARNING: at net/mac80211/scan.c:267

2009/11/27 Sebastian Ott <[email protected]>:
>
> On Tue, 17 Nov 2009, Rafael J. Wysocki wrote:
>
>> On Tuesday 17 November 2009, Maciej Rutecki wrote:
>> > 2009/11/16 Rafael J. Wysocki <[email protected]>:
>> > > This message has been generated automatically as a part of a report
>> > > of recent regressions.
>> > >
>> > > The following bug entry is on the current list of known regressions
>> > > from 2.6.31.  Please verify if it still should be listed and let me know
>> > > (either way).
>> > >
>> > >
>> > > Bug-Entry       : http://bugzilla.kernel.org/show_bug.cgi?id=14352
>> > > Subject         : WARNING: at net/mac80211/scan.c:267
>> > > Submitter       : Maciej Rutecki <[email protected]>
>> > > Date            : 2009-10-08 00:30 (40 days old)
>> > > References      : http://bugzilla.intellinuxwireless.org/show_bug.cgi?id=2089#c7
>> > >
>> > >
>> >
>> > In 2.6.32-rc7 the problem seems to be fixed.
>>
>> Thanks, closing.
>
> looks like this one is still present in rc8:
>

I also observed this warning once again in -rc8. From syslog:
Nov 26 19:04:21 gumis kernel: [43497.192073] ------------[ cut here ]------------
Nov 26 19:04:21 gumis kernel: [43497.192102] WARNING: at net/mac80211/scan.c:267 ieee80211_scan_completed+0x299/0x2b0 [mac80211]()
Nov 26 19:04:21 gumis kernel: [43497.192110] Hardware name: HP Compaq nx6310 (EY501ES#AKD)
Nov 26 19:04:21 gumis kernel: [43497.192115] Modules linked in: xt_tcpudp xt_limit xt_state iptable_filter nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 ip_tables x_tables i915 drm_kms_helper drm i2c_algo_bit i2c_core sco bnep rfcomm l2cap crc16 vboxnetadp vboxnetflt vboxdrv fuse hp_wmi sbp2 loop aes_i586 aes_generic cbc dm_crypt dm_mod snd_hda_codec_si3054 snd_hda_codec_analog arc4 ecb snd_hda_intel snd_hda_codec snd_pcm_oss snd_mixer_oss snd_pcm snd_seq_dummy snd_seq_oss snd_seq_midi snd_rawmidi snd_seq_midi_event snd_seq iwl3945(-) snd_timer iwlcore firmware_class snd_seq_device mac80211 btusb led_class snd pcmcia b44 bluetooth soundcore video cfg80211 yenta_socket intel_agp ssb backlight rtc_cmos rsrc_nonstatic ohci1394 uhci_hcd psmouse ehci_hcd rtc_core snd_page_alloc rfkill serio_raw evdev ieee1394 agpgart rtc_lib mii pcmcia_core usbcore sg output ac battery fan button
Nov 26 19:04:21 gumis kernel: [43497.192284] Pid: 4153, comm: rmmod Tainted: G W 2.6.32-rc8 #1
Nov 26 19:04:21 gumis kernel: [43497.192291] Call Trace:
Nov 26 19:04:21 gumis kernel: [43497.192305] [<c03ec85c>] ? printk+0x1d/0x21
Nov 26 19:04:21 gumis kernel: [43497.192323] [<f8645a69>] ? ieee80211_scan_completed+0x299/0x2b0 [mac80211]
Nov 26 19:04:21 gumis kernel: [43497.192335] [<c013c521>] warn_slowpath_common+0x71/0xc0
Nov 26 19:04:21 gumis kernel: [43497.192353] [<f8645a69>] ? ieee80211_scan_completed+0x299/0x2b0 [mac80211]
Nov 26 19:04:21 gumis kernel: [43497.192363] [<c013c58a>] warn_slowpath_null+0x1a/0x20
Nov 26 19:04:21 gumis kernel: [43497.192380] [<f8645a69>] ieee80211_scan_completed+0x299/0x2b0 [mac80211]
Nov 26 19:04:21 gumis kernel: [43497.192396] [<f8645ac9>] ieee80211_scan_cancel+0x49/0x80 [mac80211]
Nov 26 19:04:21 gumis kernel: [43497.192414] [<f864ddc7>] ieee80211_stop+0x587/0x590 [mac80211]
Nov 26 19:04:21 gumis kernel: [43497.192425] [<c03ef0d2>] ? _spin_unlock_bh+0x12/0x20
Nov 26 19:04:21 gumis kernel: [43497.192439] [<c0378f0b>] dev_close+0x6b/0xc0
Nov 26 19:04:21 gumis kernel: [43497.192449] [<c0378fa6>] rollback_registered+0x46/0x290
Nov 26 19:04:21 gumis kernel: [43497.192458] [<c01d13d1>] ? add_partial+0x21/0x70
Nov 26 19:04:21 gumis kernel: [43497.192468] [<c037920e>] unregister_netdevice+0x1e/0x70
Nov 26 19:04:21 gumis kernel: [43497.192477] [<c03ee0e9>] ? mutex_lock+0x19/0x40
Nov 26 19:04:21 gumis kernel: [43497.192496] [<f864d384>] ieee80211_remove_interfaces+0x74/0xb0 [mac80211]
Nov 26 19:04:21 gumis kernel: [43497.192506] [<c03ee0e9>] ? mutex_lock+0x19/0x40
Nov 26 19:04:21 gumis kernel: [43497.192523] [<f8642090>] ieee80211_unregister_hw+0x40/0xe0 [mac80211]
Nov 26 19:04:21 gumis kernel: [43497.192537] [<f8689540>] iwl3945_pci_remove+0x70/0x5dd [iwl3945]
Nov 26 19:04:21 gumis kernel: [43497.192549] [<c015bb05>] ? notifier_call_chain+0x35/0x70
Nov 26 19:04:21 gumis kernel: [43497.192563] [<c029edfe>] pci_device_remove+0x1e/0x40
Nov 26 19:04:21 gumis kernel: [43497.192573] [<c0312166>] __device_release_driver+0x56/0xa0
Nov 26 19:04:21 gumis kernel: [43497.192581] [<c0312237>] driver_detach+0x87/0x90
Nov 26 19:04:21 gumis kernel: [43497.192594] [<c03113a3>] bus_remove_driver+0x63/0xb0
Nov 26 19:04:21 gumis kernel: [43497.192604] [<c03127b9>] driver_unregister+0x49/0x80
Nov 26 19:04:21 gumis kernel: [43497.192614] [<c02253d2>] ? sysfs_remove_file+0x12/0x20
Nov 26 19:04:21 gumis kernel: [43497.192625] [<c029f045>] pci_unregister_driver+0x35/0xa0
Nov 26 19:04:21 gumis kernel: [43497.192640] [<f8689abf>] iwl3945_exit+0x12/0x19 [iwl3945]
Nov 26 19:04:21 gumis kernel: [43497.192651] [<c016e952>] sys_delete_module+0x162/0x210
Nov 26 19:04:21 gumis kernel: [43497.192661] [<c01c573c>] ? do_munmap+0x21c/0x270
Nov 26 19:04:21 gumis kernel: [43497.192671] [<c0122176>] ? do_page_fault+0x176/0x2f0
Nov 26 19:04:21 gumis kernel: [43497.192682] [<c0102f04>] sysenter_do_call+0x12/0x22
Nov 26 19:04:21 gumis kernel: [43497.192689] ---[ end trace 3daa09d98d92597c ]---
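
Since the two -rc8 traces in this thread hit the same WARNING line, it can help to compare who called ieee80211_scan_completed() in each. The sketch below is a hypothetical helper, not an existing tool: it pulls the reliable frames (those without a leading "?") out of a call trace and reports the frame that follows a given function. The excerpts are shortened copies of lines from the two reports.

```python
import re

# Matches "[<addr>] func+0xoff/0xsize", with an optional "? " marking an
# unreliable frame. Whitespace may include a line break from mail wrapping.
FRAME_RE = re.compile(
    r"\[<[0-9a-f]+>\]\s+(\?\s+)?([A-Za-z_][\w.]*)\+0x[0-9a-f]+/0x[0-9a-f]+"
)


def reliable_frames(log_text):
    """Return function names of frames not marked with '?', in trace order."""
    return [m.group(2) for m in FRAME_RE.finditer(log_text)
            if m.group(1) is None]


def caller_of(frames, function):
    """Return the frame listed right after `function`, or None if absent."""
    for i, name in enumerate(frames[:-1]):
        if name == function:
            return frames[i + 1]
    return None


SEBASTIAN_EXCERPT = """\
[<f8bd0e8f>] ? ieee80211_scan_completed+0x48/0x198 [mac80211]
[<f8bd0e8f>] ieee80211_scan_completed+0x48/0x198 [mac80211]
[<f8c97b81>] iwl_bg_scan_completed+0x97/0xcf [iwlcore]
[<c044ad6b>] worker_thread+0x13f/0x1b7
"""

MACIEJ_EXCERPT = """\
[<f8645a69>] ieee80211_scan_completed+0x299/0x2b0 [mac80211]
[<f8645ac9>] ieee80211_scan_cancel+0x49/0x80 [mac80211]
[<f864ddc7>] ieee80211_stop+0x587/0x590 [mac80211]
"""

for label, text in (("Sebastian", SEBASTIAN_EXCERPT), ("Maciej", MACIEJ_EXCERPT)):
    print(label, caller_of(reliable_frames(text), "ieee80211_scan_completed"))
```

Different callers only show that the warning was reached along different paths (a driver work item in one trace, interface teardown during rmmod in the other); they do not by themselves establish the root cause of either report.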


Regards
--
Maciej Rutecki
http://www.maciek.unixy.pl