2007-12-16 00:06:22

by Volker Armin Hemmann

Subject: almost daily Kernel oops with 2.6.23.9

Hi,

for some time now I have been getting kernel oopses almost daily.

At first I suspected the nvidia driver, but today it happened without the
driver loaded (kernel module removed from the hard disk, nv set in the config).
This is a pretty much 'vanilla' kernel.org kernel. The only patch added is for
reiser4 support.

When this happens, ps aux hangs. The process named in the oops hangs too.
This happens completely independently of the load or temperature of the
system.

It is a little bit hard to reproduce. Some days I don't get an oops, on other
days I get two. The only things I can say for sure: with 2.6.21 this did not
happen, and I cannot remember any problems with 2.6.22.
When the oops happens, the system has usually been running for several hours.
Sometimes it is kmail that triggers it, sometimes kio_http, emerge... The
kernel was freshly rebuilt to add CONFIG_KALLSYMS, which I usually have
disabled.


[ 8763.521595] Unable to handle kernel paging request at ffff910073e22db0 RIP:
[ 8763.521600] [<ffffffff8044246a>] _spin_lock+0x0/0xf
[ 8763.521606] PGD 0
[ 8763.521608] Oops: 0002 [1] SMP
[ 8763.521610] CPU 1
[ 8763.521611] Modules linked in: isofs snd_usb_audio snd_usb_lib k8temp w83627ehf hwmon_vid hwmon i2c_core snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul snd_pcm_oss snd_mixer_oss snd_seq_oss snd_seq_midi_event snd_seq snd_emu10k1 snd_rawmidi snd_ac97_codec ac97_bus snd_pcm snd_seq_device snd_timer snd_page_alloc snd_util_mem snd_hwdep snd r8169
[ 8763.521626] Pid: 10497, comm: emerge Not tainted 2.6.23.9reiser4 #4
[ 8763.521628] RIP: 0010:[<ffffffff8044246a>] [<ffffffff8044246a>] _spin_lock+0x0/0xf
[ 8763.521631] RSP: 0018:ffff81005e185e10 EFLAGS: 00010282
[ 8763.521632] RAX: ffff81005e485f00 RBX: ffff910073e22db0 RCX: 0000000000000000
[ 8763.521634] RDX: 0000000000000000 RSI: ffff810040aeb3d8 RDI: ffff910073e22db0
[ 8763.521635] RBP: ffff81005e485f00 R08: 800000007f3c2005 R09: 000000000003ccd0
[ 8763.521637] R10: ffff81000250d820 R11: 0000000000000001 R12: ffff81003a3b3820
[ 8763.521638] R13: ffff81007e610f00 R14: ffff810040aeb330 R15: ffff81007e610f00
[ 8763.521640] FS: 00002b2dce7a9140(0000) GS:ffff81007ff3bec0(0000) knlGS:00000000f7b83b80
[ 8763.521642] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[ 8763.521644] CR2: ffff910073e22db0 CR3: 000000003ba69000 CR4: 00000000000006e0
[ 8763.521645] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 8763.521647] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 8763.521649] Process emerge (pid: 10497, threadinfo ffff81005e184000, task ffff8100580bb080)
[ 8763.521650] Stack: ffffffff8026b048 0000000000000001 ffff81005e485f00 0000000000000001
[ 8763.521654] ffffffff80230d0c 00002b2dce7a91d0 0000000000000000 ffff81005e185f58
[ 8763.521656] 00007fffdc32be90 0000000001200011 0000000000000000 ffff81003a3b3820
[ 8763.521658] Call Trace:
[ 8763.521662] [<ffffffff8026b048>] anon_vma_link+0x1a/0x40
[ 8763.521666] [<ffffffff80230d0c>] copy_process+0xb03/0x1301
[ 8763.521671] [<ffffffff80231670>] do_fork+0xb1/0x1fc
[ 8763.521675] [<ffffffff8020b66e>] system_call+0x7e/0x83
[ 8763.521677] [<ffffffff8020b987>] ptregscall_common+0x67/0xb0
[ 8763.521681]
[ 8763.521682]
[ 8763.521682] Code: f0 ff 0f 79 09 f3 90 83 3f 00 7e f9 eb f2 c3 f0 81 2f 00 00
[ 8763.521688] RIP [<ffffffff8044246a>] _spin_lock+0x0/0xf
[ 8763.521690] RSP <ffff81005e185e10>
[ 8763.521691] CR2: ffff910073e22db0
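
A note on the faulting address: the other kernel pointers in the register dump
(RAX, RSI, RBP, R10, R12 to R15) all start with ffff8100, while the faulting
address in CR2/RDI/RBX starts with ffff9100. The Python sketch below is a
hypothetical helper, not part of the original report. It checks whether
flipping a single bit of the faulting address would move it into the kernel's
direct mapping of RAM. It assumes that the direct mapping on x86_64 kernels of
this era starts at 0xffff810000000000, and it takes the RAM size from the end
of the last usable BIOS-e820 range in the dmesg output of this report. The
names PAGE_OFFSET, RAM_BYTES, in_direct_map and single_bit_repairs are chosen
for illustration only.

```python
# Hypothetical helper for illustration; not part of the original report.
# Assumption: the kernel direct mapping starts at 0xffff810000000000
# (x86_64, 2.6.23 era). RAM_BYTES is the end of the last usable BIOS-e820
# range reported in the dmesg (0x7ffb0000).
PAGE_OFFSET = 0xFFFF810000000000
RAM_BYTES = 0x7FFB0000


def in_direct_map(addr, ram_bytes=RAM_BYTES):
    """Return True if addr lies inside the direct mapping of ram_bytes of RAM."""
    return PAGE_OFFSET <= addr < PAGE_OFFSET + ram_bytes


def single_bit_repairs(addr, ram_bytes=RAM_BYTES):
    """Return [(bit, repaired_addr)] for each single-bit flip of addr
    that yields an address inside the direct mapping."""
    repairs = []
    for bit in range(64):
        candidate = addr ^ (1 << bit)
        if in_direct_map(candidate, ram_bytes):
            repairs.append((bit, candidate))
    return repairs


if __name__ == "__main__":
    fault = 0xFFFF910073E22DB0  # CR2 / RDI / RBX in the oops above
    print("fault inside direct map:", in_direct_map(fault))
    for bit, fixed in single_bit_repairs(fault):
        print(f"flip bit {bit}: {fixed:#018x}")
```

Under these assumptions the only single-bit flip that produces a direct-map
address is bit 44, giving ffff810073e22db0. That is consistent with a pointer
that had one bit corrupted, but it does not prove it, and it says nothing
about whether hardware or software would be responsible.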

dmesg:
[ 0.000000] Linux version 2.6.23.9reiser4 (root@energy) (gcc version 4.2.2
(Gentoo 4.2.2 p1.0)) #4 SMP Sat Dec 15 20:06:33 CET 2007
[ 0.000000] Command line: root=/dev/sda3 nmi_watchdog=0
[ 0.000000] BIOS-provided physical RAM map:
[ 0.000000] BIOS-e820: 0000000000000000 - 000000000009fc00 (usable)
[ 0.000000] BIOS-e820: 000000000009fc00 - 00000000000a0000 (reserved)
[ 0.000000] BIOS-e820: 00000000000e6000 - 0000000000100000 (reserved)
[ 0.000000] BIOS-e820: 0000000000100000 - 000000007ffb0000 (usable)
[ 0.000000] BIOS-e820: 000000007ffb0000 - 000000007ffc0000 (ACPI data)
[ 0.000000] BIOS-e820: 000000007ffc0000 - 000000007fff0000 (ACPI NVS)
[ 0.000000] BIOS-e820: 000000007fff0000 - 0000000080000000 (reserved)
[ 0.000000] BIOS-e820: 00000000fec00000 - 00000000fec01000 (reserved)
[ 0.000000] BIOS-e820: 00000000fee00000 - 00000000fef00000 (reserved)
[ 0.000000] BIOS-e820: 00000000ff380000 - 0000000100000000 (reserved)
[ 0.000000] Entering add_active_range(0, 0, 159) 0 entries of 256 used
[ 0.000000] Entering add_active_range(0, 256, 524208) 1 entries of 256 used
[ 0.000000] end_pfn_map = 1048576
[ 0.000000] DMI present.
[ 0.000000] ACPI: RSDP 000FA530, 0014 (r0 ACPIAM)
[ 0.000000] ACPI: RSDT 7FFB0000, 003C (r1 A M I OEMRSDT 9000720 MSFT
97)
[ 0.000000] ACPI: FACP 7FFB0200, 0084 (r2 A_M_I OEMFACP 12000601 MSFT
97)
[ 0.000000] ACPI: DSDT 7FFB0450, 4FFA (r1 ASR19 ASR1915A 15A INTL
20051117)
[ 0.000000] ACPI: FACS 7FFC0000, 0040
[ 0.000000] ACPI: APIC 7FFB0390, 0080 (r1 A M I OEMAPIC 9000720 MSFT
97)
[ 0.000000] ACPI: MCFG 7FFB0410, 003C (r1 A M I OEMMCFG 9000720 MSFT
97)
[ 0.000000] ACPI: OEMB 7FFC0040, 0060 (r1 A M I AMI_OEM 9000720 MSFT
97)
[ 0.000000] ACPI: HPET 7FFB5450, 0038 (r1 A M I OEMHPET0 9000720 MSFT
97)
[ 0.000000] ACPI: SSDT 7FFB5490, 030E (r1 A M I POWERNOW 1 AMD
1)
[ 0.000000] Entering add_active_range(0, 0, 159) 0 entries of 256 used
[ 0.000000] Entering add_active_range(0, 256, 524208) 1 entries of 256 used
[ 0.000000] No mptable found.
[ 0.000000] Zone PFN ranges:
[ 0.000000] DMA 0 -> 4096
[ 0.000000] DMA32 4096 -> 1048576
[ 0.000000] Normal 1048576 -> 1048576
[ 0.000000] Movable zone start PFN for each node
[ 0.000000] early_node_map[2] active PFN ranges
[ 0.000000] 0: 0 -> 159
[ 0.000000] 0: 256 -> 524208
[ 0.000000] On node 0 totalpages: 524111
[ 0.000000] DMA zone: 56 pages used for memmap
[ 0.000000] DMA zone: 1038 pages reserved
[ 0.000000] DMA zone: 2905 pages, LIFO batch:0
[ 0.000000] DMA32 zone: 7110 pages used for memmap
[ 0.000000] DMA32 zone: 513002 pages, LIFO batch:31
[ 0.000000] Normal zone: 0 pages used for memmap
[ 0.000000] Movable zone: 0 pages used for memmap
[ 0.000000] ACPI: PM-Timer IO Port: 0x2008
[ 0.000000] ACPI: Local APIC address 0xfee00000
[ 0.000000] ACPI: LAPIC (acpi_id[0x01] lapic_id[0x00] enabled)
[ 0.000000] Processor #0 (Bootup-CPU)
[ 0.000000] ACPI: LAPIC (acpi_id[0x02] lapic_id[0x01] enabled)
[ 0.000000] Processor #1
[ 0.000000] ACPI: LAPIC (acpi_id[0x03] lapic_id[0x82] disabled)
[ 0.000000] ACPI: LAPIC (acpi_id[0x04] lapic_id[0x83] disabled)
[ 0.000000] ACPI: IOAPIC (id[0x02] address[0xfec00000] gsi_base[0])
[ 0.000000] IOAPIC[0]: apic_id 2, address 0xfec00000, GSI 0-23
[ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
[ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
[ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 14 global_irq 14 high edge)
[ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 15 global_irq 15 high edge)
[ 0.000000] ACPI: IRQ0 used by override.
[ 0.000000] ACPI: IRQ2 used by override.
[ 0.000000] ACPI: IRQ9 used by override.
[ 0.000000] ACPI: IRQ14 used by override.
[ 0.000000] ACPI: IRQ15 used by override.
[ 0.000000] Setting APIC routing to flat
[ 0.000000] ACPI: HPET id: 0x10de8201 base: 0xfed00000
[ 0.000000] Using ACPI (MADT) for SMP configuration information
[ 0.000000] swsusp: Registered nosave memory region: 000000000009f000 -
00000000000a0000
[ 0.000000] swsusp: Registered nosave memory region: 00000000000a0000 -
00000000000e6000
[ 0.000000] swsusp: Registered nosave memory region: 00000000000e6000 -
0000000000100000
[ 0.000000] Allocating PCI resources starting at 88000000 (gap:
80000000:7ec00000)
[ 0.000000] SMP: Allowing 2 CPUs, 0 hotplug CPUs
[ 0.000000] PERCPU: Allocating 29608 bytes of per cpu data
[ 0.000000] Built 1 zonelists in Zone order. Total pages: 515907
[ 0.000000] Kernel command line: root=/dev/sda3 nmi_watchdog=0
[ 0.000000] Initializing CPU#0
[ 0.000000] PID hash table entries: 4096 (order: 12, 32768 bytes)
[ 0.000000] Marking TSC unstable due to TSCs unsynchronized
[ 30.860942] time.c: Detected 3013.887 MHz processor.
[ 30.865131] Console: colour VGA+ 80x25
[ 30.865134] console [tty0] enabled
[ 30.876134] Dentry cache hash table entries: 262144 (order: 9, 2097152
bytes)
[ 30.877449] Inode-cache hash table entries: 131072 (order: 8, 1048576
bytes)
[ 30.877821] Checking aperture...
[ 30.877980] CPU 0: aperture @ e888000000 size 32 MB
[ 30.878137] Aperture too small (32 MB)
[ 30.883953] No AGP bridge found
[ 30.895855] Memory: 2059832k/2096832k available (2324k kernel code, 36320k
reserved, 950k data, 180k init)
[ 30.977505] Calibrating delay using timer specific routine.. 6034.61
BogoMIPS (lpj=10053777)
[ 30.977857] Mount-cache hash table entries: 256
[ 30.978095] CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64
bytes/line)
[ 30.978256] CPU: L2 Cache: 1024K (64 bytes/line)
[ 30.978415] CPU: Physical Processor ID: 0
[ 30.978573] CPU: Processor Core ID: 0
[ 30.978747] SMP alternatives: switching to UP code
[ 30.979293] ACPI: Core revision 20070126
[ 31.014735] Using local APIC timer interrupts.
[ 31.048000] result 12557871
[ 31.048155] Detected 12.557 MHz APIC timer.
[ 31.050745] SMP alternatives: switching to SMP code
[ 31.051103] Booting processor 1/2 APIC 0x1
[ 31.061408] Initializing CPU#1
[ 31.143888] Calibrating delay using timer specific routine.. 6030.02
BogoMIPS (lpj=10046434)
[ 31.143893] CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64
bytes/line)
[ 31.143894] CPU: L2 Cache: 1024K (64 bytes/line)
[ 31.143896] CPU: Physical Processor ID: 0
[ 31.143897] CPU: Processor Core ID: 1
[ 31.143979] AMD Athlon(tm) 64 X2 Dual Core Processor 6000+ stepping 03
[ 31.147126] Brought up 2 CPUs
[ 31.148720] NET: Registered protocol family 16
[ 31.148962] ACPI: bus type pci registered
[ 31.149170] PCI: BIOS Bug: MCFG area at e0000000 is not E820-reserved
[ 31.149331] PCI: Not using MMCONFIG.
[ 31.149488] PCI: Using configuration type 1
[ 31.152045] ACPI: EC: Look up EC in DSDT
[ 31.155660] ACPI: Interpreter enabled
[ 31.155818] ACPI: (supports S0 S1 S3 S4 S5)
[ 31.156106] ACPI: Using IOAPIC for interrupt routing
[ 31.156447] Error attaching device data
[ 31.156608] Error attaching device data
[ 31.156768] Error attaching device data
[ 31.156929] Error attaching device data
[ 31.162533] ACPI: PCI Root Bridge [PCI0] (0000:00)
[ 31.163163] PCI: Transparent bridge - 0000:00:08.0
[ 31.163591] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0._PRT]
[ 31.163732] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P1._PRT]
[ 31.163838] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.BR10._PRT]
[ 31.163898] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.BR11._PRT]
[ 31.163957] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.BR13._PRT]
[ 31.164016] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.BR12._PRT]
[ 31.168099] ACPI: PCI Interrupt Link [LNKA] (IRQs 16 17 18 19) *5
[ 31.168556] ACPI: PCI Interrupt Link [LNKB] (IRQs 16 17 18 19) *10
[ 31.169011] ACPI: PCI Interrupt Link [LNKC] (IRQs 16 17 18 19) *0,
disabled.
[ 31.169491] ACPI: PCI Interrupt Link [LNKD] (IRQs 16 17 18 19) *11
[ 31.169946] ACPI: PCI Interrupt Link [LNEA] (IRQs 16 17 18 19) *0,
disabled.
[ 31.170427] ACPI: PCI Interrupt Link [LNEB] (IRQs 16 17 18 19) *10
[ 31.170883] ACPI: PCI Interrupt Link [LNEC] (IRQs 16 17 18 19) *0,
disabled.
[ 31.171363] ACPI: PCI Interrupt Link [LNED] (IRQs 16 17 18 19) *10
[ 31.171819] ACPI: PCI Interrupt Link [LUB0] (IRQs 20 21 22 23) *5
[ 31.172274] ACPI: PCI Interrupt Link [LMAD] (IRQs 20 21 22 23) *0,
disabled.
[ 31.172755] ACPI: PCI Interrupt Link [LUB2] (IRQs 20 21 22 23) *7
[ 31.173210] ACPI: PCI Interrupt Link [LMAC] (IRQs 20 21 22 23) *0,
disabled.
[ 31.173691] ACPI: PCI Interrupt Link [LAZA] (IRQs 20 21 22 23) *0,
disabled.
[ 31.174186] ACPI: PCI Interrupt Link [LSMB] (IRQs 20 21 22 23) *11
[ 31.174642] ACPI: PCI Interrupt Link [LPMU] (IRQs 20 21 22 23) *0,
disabled.
[ 31.175122] ACPI: PCI Interrupt Link [LSA0] (IRQs 20 21 22 23) *15
[ 31.175577] ACPI: PCI Interrupt Link [LSA1] (IRQs 20 21 22 23) *0,
disabled.
[ 31.176082] ACPI: PCI Interrupt Link [LATA] (IRQs 20 21 22 23) *0,
disabled.
[ 31.176563] ACPI: PCI Interrupt Link [UB11] (IRQs 20 21 22 23) *0,
disabled.
[ 31.177044] ACPI: PCI Interrupt Link [UB12] (IRQs 20 21 22 23) *0,
disabled.
[ 31.177501] Linux Plug and Play Support v0.97 (c) Adam Belay
[ 31.177674] pnp: PnP ACPI init
[ 31.177835] ACPI: bus type pnp registered
[ 31.180623] pnp: PnP ACPI: found 13 devices
[ 31.180782] ACPI: ACPI bus type pnp unregistered
[ 31.181044] SCSI subsystem initialized
[ 31.181223] libata version 2.21 loaded.
[ 31.181275] usbcore: registered new interface driver usbfs
[ 31.181453] usbcore: registered new interface driver hub
[ 31.181631] usbcore: registered new device driver usb
[ 31.181825] PCI: Using ACPI for IRQ routing
[ 31.181984] PCI: If a device doesn't work, try "pci=routeirq". If it
helps, post a report
[ 31.193725] hpet0: at MMIO 0xfed00000, IRQs 2, 8, 31
[ 31.193982] hpet0: 3 32-bit timers, 25000000 Hz
[ 31.197104] Time: hpet clocksource has been installed.
[ 31.203675] pnp: 00:05: iomem range 0x0-0x0 could not be reserved
[ 31.203837] pnp: 00:05: iomem range 0xfee01000-0xfeefffff has been reserved
[ 31.204001] pnp: 00:07: iomem range 0xfec00000-0xfec00fff could not be
reserved
[ 31.204291] pnp: 00:07: iomem range 0xfee00000-0xfee00fff could not be
reserved
[ 31.204585] pnp: 00:0a: ioport range 0x290-0x29f has been reserved
[ 31.204747] pnp: 00:0b: iomem range 0xe0000000-0xefffffff has been reserved
[ 31.204909] pnp: 00:0c: iomem range 0x0-0x9ffff could not be reserved
[ 31.205069] pnp: 00:0c: iomem range 0xc0000-0xcffff has been reserved
[ 31.205230] pnp: 00:0c: iomem range 0xe0000-0xfffff could not be reserved
[ 31.205391] pnp: 00:0c: iomem range 0x100000-0x7fffffff could not be
reserved
[ 31.205724] PCI: Bridge: 0000:00:08.0
[ 31.205882] IO window: d000-dfff
[ 31.206041] MEM window: f9f00000-f9ffffff
[ 31.206200] PREFETCH window: 88000000-880fffff
[ 31.206359] PCI: Bridge: 0000:00:0b.0
[ 31.206517] IO window: c000-cfff
[ 31.206675] MEM window: f9e00000-f9efffff
[ 31.206833] PREFETCH window: disabled.
[ 31.206993] PCI: Bridge: 0000:00:0c.0
[ 31.207150] IO window: disabled.
[ 31.207308] MEM window: disabled.
[ 31.207466] PREFETCH window: disabled.
[ 31.207624] PCI: Bridge: 0000:00:0d.0
[ 31.207782] IO window: e000-efff
[ 31.207939] MEM window: fa000000-febfffff
[ 31.208097] PREFETCH window: d0000000-dfffffff
[ 31.208255] PCI: Bridge: 0000:00:0e.0
[ 31.208413] IO window: disabled.
[ 31.208570] MEM window: disabled.
[ 31.208728] PREFETCH window: disabled.
[ 31.208891] PCI: Setting latency timer of device 0000:00:08.0 to 64
[ 31.208897] PCI: Setting latency timer of device 0000:00:0b.0 to 64
[ 31.208902] PCI: Setting latency timer of device 0000:00:0c.0 to 64
[ 31.208907] PCI: Setting latency timer of device 0000:00:0d.0 to 64
[ 31.208911] PCI: Setting latency timer of device 0000:00:0e.0 to 64
[ 31.208930] NET: Registered protocol family 2
[ 31.246914] IP route cache hash table entries: 65536 (order: 7, 524288
bytes)
[ 31.247469] TCP established hash table entries: 131072 (order: 9, 3145728
bytes)
[ 31.248924] TCP bind hash table entries: 65536 (order: 8, 1048576 bytes)
[ 31.252802] TCP: Hash tables configured (established 131072 bind 65536)
[ 31.252963] TCP reno registered
[ 31.264399] Total HugeTLB memory allocated, 0
[ 31.264639] Loading Reiser4. See http://www.namesys.com for a description of
Reiser4.
[ 31.264998] io scheduler noop registered
[ 31.265165] io scheduler cfq registered (default)
[ 31.266123] Boot video device is 0000:04:00.0
[ 31.266243] PCI: Setting latency timer of device 0000:00:0b.0 to 64
[ 31.266259] assign_interrupt_mode Found MSI capability
[ 31.266434] Allocate Port Service[0000:00:0b.0:pcie00]
[ 31.266477] PCI: Setting latency timer of device 0000:00:0c.0 to 64
[ 31.266492] assign_interrupt_mode Found MSI capability
[ 31.266660] Allocate Port Service[0000:00:0c.0:pcie00]
[ 31.266698] PCI: Setting latency timer of device 0000:00:0d.0 to 64
[ 31.266712] assign_interrupt_mode Found MSI capability
[ 31.266880] Allocate Port Service[0000:00:0d.0:pcie00]
[ 31.266917] PCI: Setting latency timer of device 0000:00:0e.0 to 64
[ 31.266932] assign_interrupt_mode Found MSI capability
[ 31.267106] Allocate Port Service[0000:00:0e.0:pcie00]
[ 31.267273] input: Power Button (FF) as /class/input/input0
[ 31.267433] ACPI: Power Button (FF) [PWRF]
[ 31.267644] input: Power Button (CM) as /class/input/input1
[ 31.267804] ACPI: Power Button (CM) [PWRB]
[ 31.268072] ACPI Exception (processor_core-0818): AE_NOT_FOUND, Processor
Device is not present [20070126]
[ 31.268419] ACPI Exception (processor_core-0818): AE_NOT_FOUND, Processor
Device is not present [20070126]
[ 31.272588] Real Time Clock Driver v1.12ac
[ 31.272827] hpet_resources: 0xfed00000 is busy
[ 31.272838] Linux agpgart interface v0.102
[ 31.272997] Hangcheck: starting hangcheck timer 0.9.0 (tick is 180 seconds,
margin is 60 seconds).
[ 31.273288] Hangcheck: Using get_cycles().
[ 31.273505] tun: Universal TUN/TAP device driver, 1.6
[ 31.273667] tun: (C) 1999-2004 Max Krasnyansky <[email protected]>
[ 31.273847] Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2
[ 31.274008] ide: Assuming 33MHz system bus speed for PIO modes; override
with idebus=xx
[ 31.274322] NFORCE-MCP65: IDE controller at PCI slot 0000:00:09.0
[ 31.274490] NFORCE-MCP65: chipset revision 161
[ 31.274647] NFORCE-MCP65: not 100% native mode: will probe irqs later
[ 31.274810] NFORCE-MCP65: 0000:00:09.0 (rev a1) UDMA133 controller
[ 31.274973] ide0: BM-DMA at 0xffa0-0xffa7, BIOS settings: hda:DMA,
hdb:pio
[ 31.275318] Probing IDE interface ide0...
[ 31.563038] hda: SAMSUNG SP1213N, ATA DISK drive
[ 32.204893] hda: selected mode 0x45
[ 32.204959] ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
[ 32.207566] hda: max request size: 512KiB
[ 32.207776] hda: 234493056 sectors (120060 MB) w/8192KiB Cache,
CHS=16383/255/63, UDMA(100)
[ 32.208548] hda: cache flushes supported
[ 32.208724] hda: hda1 hda2 hda3 < hda5 hda6 hda7 >
[ 32.252230] ahci 0000:00:0a.0: version 2.3
[ 32.252405] ACPI: PCI Interrupt Link [LSA0] enabled at IRQ 23
[ 32.252569] ACPI: PCI Interrupt 0000:00:0a.0[A] -> Link [LSA0] -> GSI 23
(level, low) -> IRQ 23
[ 33.255804] ahci 0000:00:0a.0: AHCI 0001.0100 32 slots 4 ports 3 Gbps 0xf
impl IDE mode
[ 33.256100] ahci 0000:00:0a.0: flags: 64bit sntf led clo pmp pio
[ 33.256261] PCI: Setting latency timer of device 0000:00:0a.0 to 64
[ 33.256475] scsi0 : ahci
[ 33.256700] scsi1 : ahci
[ 33.256891] scsi2 : ahci
[ 33.257083] scsi3 : ahci
[ 33.257265] ata1: SATA max UDMA/133 cmd 0xffffc20000020100 ctl
0x0000000000000000 bmdma 0x0000000000000000 irq 315
[ 33.257558] ata2: SATA max UDMA/133 cmd 0xffffc20000020180 ctl
0x0000000000000000 bmdma 0x0000000000000000 irq 315
[ 33.257851] ata3: SATA max UDMA/133 cmd 0xffffc20000020200 ctl
0x0000000000000000 bmdma 0x0000000000000000 irq 315
[ 33.258144] ata4: SATA max UDMA/133 cmd 0xffffc20000020280 ctl
0x0000000000000000 bmdma 0x0000000000000000 irq 315
[ 38.733347] ata1: port is slow to respond, please be patient (Status 0xd0)
[ 43.579086] ata1: softreset failed (device not ready)
[ 44.054683] ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
[ 44.055482] ata1.00: ATA-7: WDC WD1600JS-00MHB1, 10.02E01, max UDMA/133
[ 44.055643] ata1.00: 312581808 sectors, multi 16: LBA48
[ 44.056388] ata1.00: configured for UDMA/133
[ 44.370635] ata2: SATA link down (SStatus 0 SControl 300)
[ 44.686590] ata3: SATA link down (SStatus 0 SControl 300)
[ 45.002543] ata4: SATA link down (SStatus 0 SControl 300)
[ 45.002855] scsi 0:0:0:0: Direct-Access ATA WDC WD1600JS-00M 10.0
PQ: 0 ANSI: 5
[ 45.003212] sd 0:0:0:0: [sda] 312581808 512-byte hardware sectors (160042
MB)
[ 45.003378] sd 0:0:0:0: [sda] Write Protect is off
[ 45.003537] sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
[ 45.003546] sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled,
doesn't support DPO or FUA
[ 45.003860] sd 0:0:0:0: [sda] 312581808 512-byte hardware sectors (160042
MB)
[ 45.004025] sd 0:0:0:0: [sda] Write Protect is off
[ 45.004183] sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
[ 45.004191] sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled,
doesn't support DPO or FUA
[ 45.004484] sda: sda1 sda2 sda3 sda4 < sda5 sda6 >
[ 45.046817] sd 0:0:0:0: [sda] Attached SCSI disk
[ 45.047020] sd 0:0:0:0: Attached scsi generic sg0 type 0
[ 45.047202] usbmon: debugfs is not available
[ 45.047600] ACPI: PCI Interrupt Link [LUB2] enabled at IRQ 22
[ 45.047765] ACPI: PCI Interrupt 0000:00:02.1[B] -> Link [LUB2] -> GSI 22
(level, low) -> IRQ 22
[ 45.048343] PCI: Setting latency timer of device 0000:00:02.1 to 64
[ 45.048347] ehci_hcd 0000:00:02.1: EHCI Host Controller
[ 45.048599] ehci_hcd 0000:00:02.1: new USB bus registered, assigned bus
number 1
[ 45.048910] ehci_hcd 0000:00:02.1: debug port 1
[ 45.049070] PCI: cache line size of 64 is not supported by device
0000:00:02.1
[ 45.049078] ehci_hcd 0000:00:02.1: irq 22, io mem 0xf9dfec00
[ 45.049249] ehci_hcd 0000:00:02.1: USB 2.0 started, EHCI 1.00, driver 10
Dec 2004
[ 45.049647] usb usb1: configuration #1 chosen from 1 choice
[ 45.049834] hub 1-0:1.0: USB hub found
[ 45.049995] hub 1-0:1.0: 10 ports detected
[ 45.155677] ohci_hcd: 2006 August 04 USB 1.1 'Open' Host Controller (OHCI)
Driver
[ 45.155904] ACPI: PCI Interrupt Link [LUB0] enabled at IRQ 21
[ 45.156070] ACPI: PCI Interrupt 0000:00:02.0[A] -> Link [LUB0] -> GSI 21
(level, low) -> IRQ 21
[ 45.156551] PCI: Setting latency timer of device 0000:00:02.0 to 64
[ 45.156555] ohci_hcd 0000:00:02.0: OHCI Host Controller
[ 45.156796] ohci_hcd 0000:00:02.0: new USB bus registered, assigned bus
number 2
[ 45.157101] ohci_hcd 0000:00:02.0: irq 21, io mem 0xf9dff000
[ 45.214253] usb usb2: configuration #1 chosen from 1 choice
[ 45.214440] hub 2-0:1.0: USB hub found
[ 45.214602] hub 2-0:1.0: 10 ports detected
[ 45.318648] Initializing USB Mass Storage driver...
[ 45.385017] usb 1-3: new high speed USB device using ehci_hcd and address 2
[ 45.519386] usb 1-3: configuration #1 chosen from 1 choice
[ 45.519849] hub 1-3:1.0: USB hub found
[ 45.520326] hub 1-3:1.0: 4 ports detected
[ 46.083539] usb 1-5: new high speed USB device using ehci_hcd and address 4
[ 46.219320] usb 1-5: configuration #1 chosen from 1 choice
[ 46.459366] usb 2-4: new full speed USB device using ohci_hcd and address 2
[ 46.616086] usb 2-4: configuration #1 chosen from 1 choice
[ 46.622040] hub 2-4:1.0: USB hub found
[ 46.625010] hub 2-4:1.0: 4 ports detected
[ 46.966245] usb 2-4.1: new full speed USB device using ohci_hcd and address
3
[ 47.101005] usb 2-4.1: configuration #1 chosen from 1 choice
[ 47.325442] usb 2-4.4: new low speed USB device using ohci_hcd and address
4
[ 47.439245] usb 2-4.4: configuration #1 chosen from 1 choice
[ 47.446182] usbcore: registered new interface driver usb-storage
[ 47.446342] USB Mass Storage support registered.
[ 47.446545] PNP: PS/2 Controller [PNP0303:PS2K,PNP0f03:PS2M] at 0x60,0x64
irq 1,12
[ 47.449605] serio: i8042 KBD port at 0x60,0x64 irq 1
[ 47.449766] serio: i8042 AUX port at 0x60,0x64 irq 12
[ 47.450011] mice: PS/2 mouse device common for all mice
[ 47.457273] input: Logitech USB-PS/2 Optical Mouse as /class/input/input2
[ 47.457473] input: USB HID v1.10 Mouse [Logitech USB-PS/2 Optical Mouse] on
usb-0000:00:02.0-4.4
[ 47.457820] usbcore: registered new interface driver usbhid
[ 47.457979] drivers/hid/usbhid/hid-core.c: v2.6:USB HID core driver
[ 47.458164] TCP cubic registered
[ 47.458329] NET: Registered protocol family 1
[ 47.458488] NET: Registered protocol family 17
[ 47.458661] powernow-k8: Found 1 AMD Athlon(tm) 64 X2 Dual Core Processor
6000+ processors (2 cpu cores) (version 2.00.00)
[ 47.458896] powernow-k8: 0 : fid 0x16 (3000 MHz), vid 0x6
[ 47.459057] powernow-k8: 1 : fid 0x14 (2800 MHz), vid 0x8
[ 47.459217] powernow-k8: 2 : fid 0x12 (2600 MHz), vid 0xa
[ 47.459385] powernow-k8: 3 : fid 0x10 (2400 MHz), vid 0xc
[ 47.459545] powernow-k8: 4 : fid 0xe (2200 MHz), vid 0xe
[ 47.459705] powernow-k8: 5 : fid 0xc (2000 MHz), vid 0x10
[ 47.459864] powernow-k8: 6 : fid 0xa (1800 MHz), vid 0x10
[ 47.460023] powernow-k8: 7 : fid 0x2 (1000 MHz), vid 0x12
[ 47.473477] input: AT Translated Set 2 keyboard as /class/input/input3
[ 47.585999] ReiserFS: sda3: found reiserfs format "3.6" with standard
journal
[ 47.586163] ReiserFS: sda3: using ordered data mode
[ 47.595573] ReiserFS: sda3: journal params: device sda3, size 8192, journal
first block 18, max trans len 1024, max batch 900, max commit age 30, max
trans age 30
[ 47.597024] ReiserFS: sda3: checking transaction log (sda3)
[ 47.639879] ReiserFS: sda3: Using r5 hash to sort names
[ 47.640052] VFS: Mounted root (reiserfs filesystem) readonly.
[ 47.640224] Freeing unused kernel memory: 180k freed
[ 52.296004] ReiserFS: sda3: switching to journaled data mode
[ 52.661837] r8169 Gigabit Ethernet driver 2.2LK loaded
[ 52.662119] ACPI: PCI Interrupt Link [LNEB] enabled at IRQ 19
[ 52.662127] ACPI: PCI Interrupt 0000:01:00.0[A] -> Link [LNEB] -> GSI 19
(level, low) -> IRQ 19
[ 52.662143] PCI: Setting latency timer of device 0000:01:00.0 to 64
[ 52.662318] eth0: RTL8168b/8111b at 0xffffc20000176000, 00:19:66:47:f3:cb,
XID 38000000 IRQ 19
[ 53.079405] reiser4: sda5: found disk format 4.0.0.
[ 58.429182] reiser4: sda6: found disk format 4.0.0.
[ 59.790443] reiser4: hda7: found disk format 4.0.0.
[ 60.459639] Adding 1951888k swap on /dev/sda2. Priority:-1 extents:1
across:1951888k
[ 63.394904] ACPI: PCI Interrupt Link [LNKB] enabled at IRQ 18
[ 63.394914] ACPI: PCI Interrupt 0000:02:09.0[A] -> Link [LNKB] -> GSI 18
(level, low) -> IRQ 18
[ 63.397618] Audigy2 value: Special config.
[ 64.938749] hda: selected mode 0x45
[ 65.739980] w83627ehf: Found W83627EHG chip at 0x290
[ 66.375065] r8169: eth0: link up
[ 66.375207] r8169: eth0: link up
[ 115.015710] kio_http_cache_[4201] general protection rip:3919ff1fe9
rsp:7fffa32dd7f0 error:0
[ 555.532998] usb 1-3.2: new full speed USB device using ehci_hcd and address
5
[ 555.601070] usb 1-3.2: configuration #1 chosen from 2 choices
[ 555.603656] scsi4 : SCSI emulation for USB Mass Storage devices
[ 555.603308] usb-storage: device found at 5
[ 555.603312] usb-storage: waiting for device to settle before scanning
[ 559.174754] scsi 4:0:0:0: Direct-Access Motorola Motorola Phone 2.31
PQ: 0 ANSI: 2
[ 559.190122] sd 4:0:0:0: [sdb] 979456 512-byte hardware sectors (501 MB)
[ 559.198183] sd 4:0:0:0: [sdb] Write Protect is off
[ 559.198186] sd 4:0:0:0: [sdb] Mode Sense: 0b 00 00 08
[ 559.198187] sd 4:0:0:0: [sdb] Assuming drive cache: write through
[ 559.228047] sd 4:0:0:0: [sdb] 979456 512-byte hardware sectors (501 MB)
[ 559.235932] sd 4:0:0:0: [sdb] Write Protect is off
[ 559.235935] sd 4:0:0:0: [sdb] Mode Sense: 0b 00 00 08
[ 559.235936] sd 4:0:0:0: [sdb] Assuming drive cache: write through
[ 559.235945] sdb: sdb1
[ 559.257659] sd 4:0:0:0: [sdb] Attached SCSI removable disk
[ 559.257695] sd 4:0:0:0: Attached scsi generic sg1 type 0
[ 559.257897] usb-storage: device scan complete
[ 3348.892721] kio_http_cache_[5911] general protection rip:3919ff1fe9
rsp:7fffab797ca0 error:0
[ 4584.408120] kio_http_cache_[7020] general protection rip:3919ff1fe9
rsp:7ffff44b69c0 error:0
[ 5799.546999] kio_http_cache_[7826] general protection rip:3919ff1fe9
rsp:7fff0551ca30 error:0
[ 6372.520336] usb 1-5: USB disconnect, address 4
[ 6372.689352] usb 1-5: new high speed USB device using ehci_hcd and address 6
[ 6372.775552] usb 1-5: configuration #1 chosen from 1 choice
[ 6372.836210] usbcore: registered new interface driver snd-usb-audio
[ 6373.438681] usb 1-5: USB disconnect, address 6
[ 6373.617312] usb 1-5: new high speed USB device using ehci_hcd and address 7
[ 6373.695645] usb 1-5: configuration #1 chosen from 1 choice
[ 7005.779226] kio_http_cache_[8829] general protection rip:3919ff1fe9
rsp:7fffa7ca71b0 error:0
[ 8257.197220] kio_http_cache_[10032] general protection rip:3919ff1fe9
rsp:7fff3d69dbb0 error:0
[ 8540.395721] usb 1-3.4: new high speed USB device using ehci_hcd and address
8
[ 8540.457758] usb 1-3.4: configuration #1 chosen from 1 choice
[ 8540.458096] scsi5 : SCSI emulation for USB Mass Storage devices
[ 8540.458304] usb-storage: device found at 8
[ 8540.458306] usb-storage: waiting for device to settle before scanning
[ 8543.784201] scsi 5:0:0:0: CD-ROM _NEC DVD_RW ND-3500AG 2.19
PQ: 0 ANSI: 0
[ 8543.802196] sr0: scsi3-mmc drive: 48x/48x writer cd/rw xa/form2 cdda tray
[ 8543.802199] Uniform CD-ROM driver Revision: 3.20
[ 8543.793153] sr 5:0:0:0: Attached scsi CD-ROM sr0
[ 8543.793206] sr 5:0:0:0: Attached scsi generic sg2 type 5
[ 8543.793374] usb-storage: device scan complete
[ 8572.123753] ISO 9660 Extensions: RRIP_1991A
[ 8605.798506] ISO 9660 Extensions: RRIP_1991A
[ 8763.521595] Unable to handle kernel paging request at ffff910073e22db0 RIP:
[ 8763.521600] [<ffffffff8044246a>] _spin_lock+0x0/0xf
[ 8763.521606] PGD 0
[ 8763.521608] Oops: 0002 [1] SMP
[ 8763.521610] CPU 1
[ 8763.521611] Modules linked in: isofs snd_usb_audio snd_usb_lib k8temp w83627ehf hwmon_vid hwmon i2c_core snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul snd_pcm_oss snd_mixer_oss snd_seq_oss snd_seq_midi_event snd_seq snd_emu10k1 snd_rawmidi snd_ac97_codec ac97_bus snd_pcm snd_seq_device snd_timer snd_page_alloc snd_util_mem snd_hwdep snd r8169
[ 8763.521626] Pid: 10497, comm: emerge Not tainted 2.6.23.9reiser4 #4
[ 8763.521628] RIP: 0010:[<ffffffff8044246a>] [<ffffffff8044246a>] _spin_lock+0x0/0xf
[ 8763.521631] RSP: 0018:ffff81005e185e10 EFLAGS: 00010282
[ 8763.521632] RAX: ffff81005e485f00 RBX: ffff910073e22db0 RCX: 0000000000000000
[ 8763.521634] RDX: 0000000000000000 RSI: ffff810040aeb3d8 RDI: ffff910073e22db0
[ 8763.521635] RBP: ffff81005e485f00 R08: 800000007f3c2005 R09: 000000000003ccd0
[ 8763.521637] R10: ffff81000250d820 R11: 0000000000000001 R12: ffff81003a3b3820
[ 8763.521638] R13: ffff81007e610f00 R14: ffff810040aeb330 R15: ffff81007e610f00
[ 8763.521640] FS: 00002b2dce7a9140(0000) GS:ffff81007ff3bec0(0000) knlGS:00000000f7b83b80
[ 8763.521642] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[ 8763.521644] CR2: ffff910073e22db0 CR3: 000000003ba69000 CR4: 00000000000006e0
[ 8763.521645] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 8763.521647] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 8763.521649] Process emerge (pid: 10497, threadinfo ffff81005e184000, task ffff8100580bb080)
[ 8763.521650] Stack: ffffffff8026b048 0000000000000001 ffff81005e485f00 0000000000000001
[ 8763.521654] ffffffff80230d0c 00002b2dce7a91d0 0000000000000000 ffff81005e185f58
[ 8763.521656] 00007fffdc32be90 0000000001200011 0000000000000000 ffff81003a3b3820
[ 8763.521658] Call Trace:
[ 8763.521662] [<ffffffff8026b048>] anon_vma_link+0x1a/0x40
[ 8763.521666] [<ffffffff80230d0c>] copy_process+0xb03/0x1301
[ 8763.521671] [<ffffffff80231670>] do_fork+0xb1/0x1fc
[ 8763.521675] [<ffffffff8020b66e>] system_call+0x7e/0x83
[ 8763.521677] [<ffffffff8020b987>] ptregscall_common+0x67/0xb0
[ 8763.521681]
[ 8763.521682]
[ 8763.521682] Code: f0 ff 0f 79 09 f3 90 83 3f 00 7e f9 eb f2 c3 f0 81 2f 00 00
[ 8763.521688] RIP [<ffffffff8044246a>] _spin_lock+0x0/0xf
[ 8763.521690] RSP <ffff81005e185e10>
[ 8763.521691] CR2: ffff910073e22db0

sh ver_linux
If some fields are empty or look unusual you may have an old version.
Compare to the current minimal requirements in Documentation/Changes.

Linux energy 2.6.23.9reiser4 #4 SMP Sat Dec 15 20:06:33 CET 2007 x86_64 AMD Athlon(tm) 64 X2 Dual Core Processor 6000+ AuthenticAMD GNU/Linux

Gnu C 4.2.2
Gnu make 3.81
binutils Binutils
util-linux 2.13
mount 2.13
module-init-tools 3.4
e2fsprogs 1.40.3
reiserfsprogs 3.6.19
reiser4progs 1.0.6
PPP 2.4.4
Linux C Library 2.7
Dynamic linker (ldd) 2.7
Procps 3.2.7
Net-tools 1.60
Kbd 1.13
Sh-utils 6.9
udev 117
Modules Loaded isofs snd_usb_audio snd_usb_lib k8temp w83627ehf hwmon_vid hwmon i2c_core snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul snd_pcm_oss snd_mixer_oss snd_seq_oss snd_seq_midi_event snd_seq snd_emu10k1 snd_rawmidi snd_ac97_codec ac97_bus snd_pcm snd_seq_device snd_timer snd_page_alloc snd_util_mem snd_hwdep snd r8169

If you need more info, please let me know. The config is attached because of
its size.

Glück Auf,
Volker


Attachments:
(No filename) (33.00 kB)
.config (38.34 kB)

2007-12-17 14:59:41

by Volker Armin Hemmann

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well

Hi.

I got another crash, now with 2.6.23.11, on logout from KDE (two differences:
new kernel, and 4 GB of RAM instead of 2 GB):

[ 1771.063731] Unable to handle kernel paging request at 0000040000000000 RIP:
[ 1771.063735] [<ffffffff8044256a>] _spin_lock+0x0/0xf
[ 1771.063740] PGD 0
[ 1771.063741] Oops: 0002 [1] SMP
[ 1771.063743] CPU 0
[ 1771.063744] Modules linked in: k8temp w83627ehf hwmon_vid hwmon i2c_core
snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi
snd_seq_midi_emul snd_pcm_oss snd_mixer_oss snd_seq_oss snd_seq_midi_event
snd_seq snd_emu10k1 snd_rawmidi snd_ac97_codec ac97_bus snd_pcm
snd_seq_device snd_timer snd_page_alloc snd_util_mem snd_hwdep snd r8169
[ 1771.063756] Pid: 4418, comm: kdm Not tainted 2.6.23.11reiser4 #1
[ 1771.063758] RIP: 0010:[<ffffffff8044256a>] [<ffffffff8044256a>]
_spin_lock+0x0/0xf
[ 1771.063760] RSP: 0018:ffff81012937de10 EFLAGS: 00010206
[ 1771.063762] RAX: ffff81012bd78870 RBX: 0000040000000000 RCX:
0000000000000000
[ 1771.063764] RDX: 0000000000000000 RSI: ffff81012c549e58 RDI:
0000040000000000
[ 1771.063765] RBP: ffff81012bd78870 R08: 800000012c52c045 R09:
0000000000000005
[ 1771.063767] R10: ffff8100050df9f8 R11: 0000000000000002 R12:
ffff8101280c3760
[ 1771.063768] R13: ffff81012f05fac0 R14: ffff81012c549db0 R15:
ffff81012f05fac0
[ 1771.063770] FS: 00002b438009bb40(0000) GS:ffffffff80533000(0000)
knlGS:0000000000000000
[ 1771.063772] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[ 1771.063773] CR2: 0000040000000000 CR3: 000000012d689000 CR4:
00000000000006e0
[ 1771.063775] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
0000000000000000
[ 1771.063776] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7:
0000000000000400
[ 1771.063778] Process kdm (pid: 4418, threadinfo ffff81012937c000, task
ffff81012fca7860)
[ 1771.063779] Stack: ffffffff8026b084 ffff81012ba17138 ffff81012bd78870
0000000000000000
[ 1771.063782] ffffffff80230d0c 00002b438009bbd0 0000000000000000
ffff81012937df58
[ 1771.063785] 00007fff2aa3d9d0 0000000001200011 0000000000000000
ffff8101280c3760
[ 1771.063787] Call Trace:
[ 1771.063790] [<ffffffff8026b084>] anon_vma_link+0x1a/0x40
[ 1771.063793] [<ffffffff80230d0c>] copy_process+0xb03/0x1301
[ 1771.063798] [<ffffffff80231670>] do_fork+0xb1/0x1fc
[ 1771.063802] [<ffffffff8023aa56>] recalc_sigpending+0xe/0x25
[ 1771.063804] [<ffffffff8020b66e>] system_call+0x7e/0x83
[ 1771.063806] [<ffffffff8020b987>] ptregscall_common+0x67/0xb0
[ 1771.063810]
[ 1771.063811]
[ 1771.063811] Code: f0 ff 0f 79 09 f3 90 83 3f 00 7e f9 eb f2 c3 f0 81 2f 00
00
[ 1771.063816] RIP [<ffffffff8044256a>] _spin_lock+0x0/0xf
[ 1771.063819] RSP <ffff81012937de10>
[ 1771.063820] CR2: 0000040000000000

Also, I got some strange messages yesterday, before increasing the RAM size:
[19546.639528] swap_free: Bad swap offset entry 04000000
[19733.026587] kio_http_cache_[9814] general protection rip:3919ff1fe9
rsp:7fff7e1b59f0 error:0

I did swapoff -a, mkswap /dev/sda2, swapon -a:
[20105.297668] Adding 1951888k swap on /dev/sda2. Priority:-2 extents:1
across:1951888k
[21013.797335] kio_http_cache_[10921] general protection rip:3919ff1fe9
rsp:7fff39a6d2a0 error:0
[22381.409172] kio_http_cache_[11459] general protection rip:3919ff1fe9
rsp:7fffd84c4d00 error:0
[23877.759927] kio_http_cache_[11959] general protection rip:3919ff1fe9
rsp:7fff9895c190 error:0
[25080.581142] kio_http_cache_[13146] general protection rip:3919ff1fe9
rsp:7fff790e0920 error:0
[26483.315522] kio_http_cache_[13746] general protection rip:3919ff1fe9
rsp:7fff51933170 error:0
[27696.301584] kio_http_cache_[14417] general protection rip:3919ff1fe9
rsp:7fff8f38abc0 error:0
[27999.370777] swap_free: Bad swap offset entry 04000000
[27999.434282] swap_free: Bad swap offset entry 04000000
[27999.466035] swap_free: Bad swap offset entry 04000000
[27999.521132] swap_free: Bad swap offset entry 04000000
[27999.561621] VM: killing process ld-linux-x86-64
[27999.561719] swap_free: Bad swap offset entry 04000000
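
A side observation on the numbers above, sketched in Python: the second CR2 value and the bad swap entry each have exactly one bit set, and the first CR2 value differs by one bit from an address with the ffff81 prefix that every other kernel pointer in the register dumps carries. The "expected" address for the first oops is an assumption made for illustration, since the trace does not say what the pointer should have been. This is only arithmetic on the logged values, not a diagnosis.

```python
def flipped_bits(observed, expected=0):
    """Return the positions of the bits in which two 64-bit values differ."""
    diff = observed ^ expected
    return [i for i in range(64) if (diff >> i) & 1]

# Values copied from the logs above.
cr2_first = 0xFFFF910073E22DB0   # CR2 of the 2.6.23.9 oops
cr2_second = 0x0000040000000000  # CR2 of the 2.6.23.11 oops
swap_entry = 0x04000000          # "Bad swap offset entry"

# ASSUMPTION: the pointer in the first oops should have carried the same
# ffff81 prefix as the other kernel pointers in that register dump.
assumed_first = 0xFFFF810073E22DB0

print(flipped_bits(cr2_first, assumed_first))
print(flipped_bits(cr2_second))
print(flipped_bits(swap_entry))
```

If the assumption holds, each of the three values is a single bit away from something harmless (a valid pointer, or zero).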

complete dmesg:
[ 0.000000] Linux version 2.6.23.11reiser4 (root@energy) (gcc version 4.2.2
(Gentoo 4.2.2 p1.0)) #1 SMP Sun Dec 16 05:14:21 CET 2007
[ 0.000000] Command line: root=/dev/sda3 nmi_watchdog=0
[ 0.000000] BIOS-provided physical RAM map:
[ 0.000000] BIOS-e820: 0000000000000000 - 000000000009fc00 (usable)
[ 0.000000] BIOS-e820: 000000000009fc00 - 00000000000a0000 (reserved)
[ 0.000000] BIOS-e820: 00000000000e6000 - 0000000000100000 (reserved)
[ 0.000000] BIOS-e820: 0000000000100000 - 00000000cffb0000 (usable)
[ 0.000000] BIOS-e820: 00000000cffb0000 - 00000000cffc0000 (ACPI data)
[ 0.000000] BIOS-e820: 00000000cffc0000 - 00000000cfff0000 (ACPI NVS)
[ 0.000000] BIOS-e820: 00000000cfff0000 - 00000000d0000000 (reserved)
[ 0.000000] BIOS-e820: 00000000fec00000 - 00000000fec01000 (reserved)
[ 0.000000] BIOS-e820: 00000000fee00000 - 00000000fef00000 (reserved)
[ 0.000000] BIOS-e820: 00000000ff380000 - 0000000100000000 (reserved)
[ 0.000000] BIOS-e820: 0000000100000000 - 0000000130000000 (usable)
[ 0.000000] Entering add_active_range(0, 0, 159) 0 entries of 256 used
[ 0.000000] Entering add_active_range(0, 256, 851888) 1 entries of 256 used
[ 0.000000] Entering add_active_range(0, 1048576, 1245184) 2 entries of 256
used
[ 0.000000] end_pfn_map = 1245184
[ 0.000000] DMI present.
[ 0.000000] ACPI: RSDP 000FA530, 0014 (r0 ACPIAM)
[ 0.000000] ACPI: RSDT CFFB0000, 003C (r1 A M I OEMRSDT 9000720 MSFT
97)
[ 0.000000] ACPI: FACP CFFB0200, 0084 (r2 A_M_I OEMFACP 12000601 MSFT
97)
[ 0.000000] ACPI: DSDT CFFB0450, 4FFA (r1 ASR19 ASR1915A 15A INTL
20051117)
[ 0.000000] ACPI: FACS CFFC0000, 0040
[ 0.000000] ACPI: APIC CFFB0390, 0080 (r1 A M I OEMAPIC 9000720 MSFT
97)
[ 0.000000] ACPI: MCFG CFFB0410, 003C (r1 A M I OEMMCFG 9000720 MSFT
97)
[ 0.000000] ACPI: OEMB CFFC0040, 0060 (r1 A M I AMI_OEM 9000720 MSFT
97)
[ 0.000000] ACPI: HPET CFFB5450, 0038 (r1 A M I OEMHPET0 9000720 MSFT
97)
[ 0.000000] ACPI: SSDT CFFB5490, 030E (r1 A M I POWERNOW 1 AMD
1)
[ 0.000000] Entering add_active_range(0, 0, 159) 0 entries of 256 used
[ 0.000000] Entering add_active_range(0, 256, 851888) 1 entries of 256 used
[ 0.000000] Entering add_active_range(0, 1048576, 1245184) 2 entries of 256
used
[ 0.000000] No mptable found.
[ 0.000000] Zone PFN ranges:
[ 0.000000] DMA 0 -> 4096
[ 0.000000] DMA32 4096 -> 1048576
[ 0.000000] Normal 1048576 -> 1245184
[ 0.000000] Movable zone start PFN for each node
[ 0.000000] early_node_map[3] active PFN ranges
[ 0.000000] 0: 0 -> 159
[ 0.000000] 0: 256 -> 851888
[ 0.000000] 0: 1048576 -> 1245184
[ 0.000000] On node 0 totalpages: 1048399
[ 0.000000] DMA zone: 56 pages used for memmap
[ 0.000000] DMA zone: 1040 pages reserved
[ 0.000000] DMA zone: 2903 pages, LIFO batch:0
[ 0.000000] DMA32 zone: 14280 pages used for memmap
[ 0.000000] DMA32 zone: 833512 pages, LIFO batch:31
[ 0.000000] Normal zone: 2688 pages used for memmap
[ 0.000000] Normal zone: 193920 pages, LIFO batch:31
[ 0.000000] Movable zone: 0 pages used for memmap
[ 0.000000] ACPI: PM-Timer IO Port: 0x2008
[ 0.000000] ACPI: Local APIC address 0xfee00000
[ 0.000000] ACPI: LAPIC (acpi_id[0x01] lapic_id[0x00] enabled)
[ 0.000000] Processor #0 (Bootup-CPU)
[ 0.000000] ACPI: LAPIC (acpi_id[0x02] lapic_id[0x01] enabled)
[ 0.000000] Processor #1
[ 0.000000] ACPI: LAPIC (acpi_id[0x03] lapic_id[0x82] disabled)
[ 0.000000] ACPI: LAPIC (acpi_id[0x04] lapic_id[0x83] disabled)
[ 0.000000] ACPI: IOAPIC (id[0x02] address[0xfec00000] gsi_base[0])
[ 0.000000] IOAPIC[0]: apic_id 2, address 0xfec00000, GSI 0-23
[ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
[ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
[ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 14 global_irq 14 high edge)
[ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 15 global_irq 15 high edge)
[ 0.000000] ACPI: IRQ0 used by override.
[ 0.000000] ACPI: IRQ2 used by override.
[ 0.000000] ACPI: IRQ9 used by override.
[ 0.000000] ACPI: IRQ14 used by override.
[ 0.000000] ACPI: IRQ15 used by override.
[ 0.000000] Setting APIC routing to flat
[ 0.000000] ACPI: HPET id: 0x10de8201 base: 0xfed00000
[ 0.000000] Using ACPI (MADT) for SMP configuration information
[ 0.000000] swsusp: Registered nosave memory region: 000000000009f000 -
00000000000a0000
[ 0.000000] swsusp: Registered nosave memory region: 00000000000a0000 -
00000000000e6000
[ 0.000000] swsusp: Registered nosave memory region: 00000000000e6000 -
0000000000100000
[ 0.000000] swsusp: Registered nosave memory region: 00000000cffb0000 -
00000000cffc0000
[ 0.000000] swsusp: Registered nosave memory region: 00000000cffc0000 -
00000000cfff0000
[ 0.000000] swsusp: Registered nosave memory region: 00000000cfff0000 -
00000000d0000000
[ 0.000000] swsusp: Registered nosave memory region: 00000000d0000000 -
00000000fec00000
[ 0.000000] swsusp: Registered nosave memory region: 00000000fec00000 -
00000000fec01000
[ 0.000000] swsusp: Registered nosave memory region: 00000000fec01000 -
00000000fee00000
[ 0.000000] swsusp: Registered nosave memory region: 00000000fee00000 -
00000000fef00000
[ 0.000000] swsusp: Registered nosave memory region: 00000000fef00000 -
00000000ff380000
[ 0.000000] swsusp: Registered nosave memory region: 00000000ff380000 -
0000000100000000
[ 0.000000] Allocating PCI resources starting at d4000000 (gap:
d0000000:2ec00000)
[ 0.000000] SMP: Allowing 2 CPUs, 0 hotplug CPUs
[ 0.000000] PERCPU: Allocating 29608 bytes of per cpu data
[ 0.000000] Built 1 zonelists in Zone order. Total pages: 1030335
[ 0.000000] Kernel command line: root=/dev/sda3 nmi_watchdog=0
[ 0.000000] Initializing CPU#0
[ 0.000000] PID hash table entries: 4096 (order: 12, 32768 bytes)
[ 0.000000] Marking TSC unstable due to TSCs unsynchronized
[ 90.500818] time.c: Detected 3013.888 MHz processor.
[ 90.504978] Console: colour VGA+ 80x25
[ 90.504980] console [tty0] enabled
[ 90.519605] Dentry cache hash table entries: 524288 (order: 10, 4194304
bytes)
[ 90.522165] Inode-cache hash table entries: 262144 (order: 9, 2097152
bytes)
[ 90.523063] Checking aperture...
[ 90.523222] CPU 0: aperture @ e8c8000000 size 32 MB
[ 90.523380] Aperture too small (32 MB)
[ 90.529205] No AGP bridge found
[ 90.529362] Your BIOS doesn't leave a aperture memory hole
[ 90.529521] Please enable the IOMMU option in the BIOS setup
[ 90.529681] This costs you 64 MB of RAM
[ 90.555047] Mapping aperture over 65536 KB of RAM @ 8000000
[ 90.578803] Memory: 4048944k/4980736k available (2324k kernel code, 144432k
reserved, 950k data, 180k init)
[ 90.660617] Calibrating delay using timer specific routine.. 6034.59
BogoMIPS (lpj=10053750)
[ 90.660968] Mount-cache hash table entries: 256
[ 90.661195] CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64
bytes/line)
[ 90.661357] CPU: L2 Cache: 1024K (64 bytes/line)
[ 90.661516] CPU: Physical Processor ID: 0
[ 90.661674] CPU: Processor Core ID: 0
[ 90.661847] SMP alternatives: switching to UP code
[ 90.662343] ACPI: Core revision 20070126
[ 90.697647] Using local APIC timer interrupts.
[ 90.730912] result 12557873
[ 90.731067] Detected 12.557 MHz APIC timer.
[ 90.733849] SMP alternatives: switching to SMP code
[ 90.734208] Booting processor 1/2 APIC 0x1
[ 90.744513] Initializing CPU#1
[ 90.827002] Calibrating delay using timer specific routine.. 6030.06
BogoMIPS (lpj=10046502)
[ 90.827006] CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64
bytes/line)
[ 90.827008] CPU: L2 Cache: 1024K (64 bytes/line)
[ 90.827010] CPU: Physical Processor ID: 0
[ 90.827010] CPU: Processor Core ID: 1
[ 90.827091] AMD Athlon(tm) 64 X2 Dual Core Processor 6000+ stepping 03
[ 90.830237] Brought up 2 CPUs
[ 90.831824] NET: Registered protocol family 16
[ 90.832058] ACPI: bus type pci registered
[ 90.832262] PCI: BIOS Bug: MCFG area at e0000000 is not E820-reserved
[ 90.832422] PCI: Not using MMCONFIG.
[ 90.832580] PCI: Using configuration type 1
[ 90.835108] ACPI: EC: Look up EC in DSDT
[ 90.838603] ACPI: Interpreter enabled
[ 90.838762] ACPI: (supports S0 S1 S3 S4 S5)
[ 90.839049] ACPI: Using IOAPIC for interrupt routing
[ 90.839385] Error attaching device data
[ 90.839546] Error attaching device data
[ 90.839706] Error attaching device data
[ 90.839866] Error attaching device data
[ 90.845388] ACPI: PCI Root Bridge [PCI0] (0000:00)
[ 90.846012] PCI: Transparent bridge - 0000:00:08.0
[ 90.846432] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0._PRT]
[ 90.846567] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P1._PRT]
[ 90.846669] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.BR10._PRT]
[ 90.846726] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.BR11._PRT]
[ 90.846784] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.BR13._PRT]
[ 90.846842] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.BR12._PRT]
[ 90.850940] ACPI: PCI Interrupt Link [LNKA] (IRQs 16 17 18 19) *5
[ 90.851393] ACPI: PCI Interrupt Link [LNKB] (IRQs 16 17 18 19) *10
[ 90.851844] ACPI: PCI Interrupt Link [LNKC] (IRQs 16 17 18 19) *0,
disabled.
[ 90.852320] ACPI: PCI Interrupt Link [LNKD] (IRQs 16 17 18 19) *11
[ 90.852772] ACPI: PCI Interrupt Link [LNEA] (IRQs 16 17 18 19) *0,
disabled.
[ 90.853248] ACPI: PCI Interrupt Link [LNEB] (IRQs 16 17 18 19) *10
[ 90.853714] ACPI: PCI Interrupt Link [LNEC] (IRQs 16 17 18 19) *0,
disabled.
[ 90.854191] ACPI: PCI Interrupt Link [LNED] (IRQs 16 17 18 19) *10
[ 90.854643] ACPI: PCI Interrupt Link [LUB0] (IRQs 20 21 22 23) *5
[ 90.855094] ACPI: PCI Interrupt Link [LMAD] (IRQs 20 21 22 23) *0,
disabled.
[ 90.855570] ACPI: PCI Interrupt Link [LUB2] (IRQs 20 21 22 23) *7
[ 90.856022] ACPI: PCI Interrupt Link [LMAC] (IRQs 20 21 22 23) *0,
disabled.
[ 90.856498] ACPI: PCI Interrupt Link [LAZA] (IRQs 20 21 22 23) *0,
disabled.
[ 90.856986] ACPI: PCI Interrupt Link [LSMB] (IRQs 20 21 22 23) *11
[ 90.857438] ACPI: PCI Interrupt Link [LPMU] (IRQs 20 21 22 23) *0,
disabled.
[ 90.857915] ACPI: PCI Interrupt Link [LSA0] (IRQs 20 21 22 23) *15
[ 90.858367] ACPI: PCI Interrupt Link [LSA1] (IRQs 20 21 22 23) *0,
disabled.
[ 90.858867] ACPI: PCI Interrupt Link [LATA] (IRQs 20 21 22 23) *0,
disabled.
[ 90.859344] ACPI: PCI Interrupt Link [UB11] (IRQs 20 21 22 23) *0,
disabled.
[ 90.859821] ACPI: PCI Interrupt Link [UB12] (IRQs 20 21 22 23) *0,
disabled.
[ 90.860253] Linux Plug and Play Support v0.97 (c) Adam Belay
[ 90.860425] pnp: PnP ACPI init
[ 90.860586] ACPI: bus type pnp registered
[ 90.863337] pnp: PnP ACPI: found 13 devices
[ 90.863497] ACPI: ACPI bus type pnp unregistered
[ 90.863761] SCSI subsystem initialized
[ 90.863940] libata version 2.21 loaded.
[ 90.863990] usbcore: registered new interface driver usbfs
[ 90.864167] usbcore: registered new interface driver hub
[ 90.864345] usbcore: registered new device driver usb
[ 90.864536] PCI: Using ACPI for IRQ routing
[ 90.864695] PCI: If a device doesn't work, try "pci=routeirq". If it
helps, post a report
[ 90.876832] PCI-DMA: Disabling AGP.
[ 90.877685] PCI-DMA: aperture base @ 8000000 size 65536 KB
[ 90.877849] PCI-DMA: using GART IOMMU.
[ 90.878010] PCI-DMA: Reserving 64MB of IOMMU area in the AGP aperture
[ 90.878290] hpet0: at MMIO 0xfed00000, IRQs 2, 8, 31
[ 90.878547] hpet0: 3 32-bit timers, 25000000 Hz
[ 90.880123] Time: hpet clocksource has been installed.
[ 90.890149] pnp: 00:05: ioport range 0x4d0-0x4d1 has been reserved
[ 90.890309] pnp: 00:05: ioport range 0x800-0x80f has been reserved
[ 90.890470] pnp: 00:05: ioport range 0x2000-0x207f has been reserved
[ 90.890630] pnp: 00:05: ioport range 0x2080-0x20ff has been reserved
[ 90.890791] pnp: 00:05: ioport range 0x2400-0x247f has been reserved
[ 90.890952] pnp: 00:05: ioport range 0x2480-0x24ff has been reserved
[ 90.891113] pnp: 00:05: ioport range 0x2800-0x287f has been reserved
[ 90.891274] pnp: 00:05: ioport range 0x2880-0x28ff has been reserved
[ 90.894675] pnp: 00:05: ioport range 0x2c00-0x2c7f has been reserved
[ 90.894836] pnp: 00:05: ioport range 0x2c80-0x2cff has been reserved
[ 90.894997] pnp: 00:05: iomem range 0x0-0x0 could not be reserved
[ 90.895158] pnp: 00:05: iomem range 0xfee01000-0xfeefffff has been reserved
[ 90.895323] pnp: 00:07: iomem range 0xfec00000-0xfec00fff could not be
reserved
[ 90.895614] pnp: 00:07: iomem range 0xfee00000-0xfee00fff could not be
reserved
[ 90.895908] pnp: 00:0a: ioport range 0x290-0x29f has been reserved
[ 90.896071] pnp: 00:0b: iomem range 0xe0000000-0xefffffff has been reserved
[ 90.896235] pnp: 00:0c: iomem range 0x0-0x9ffff could not be reserved
[ 90.896396] pnp: 00:0c: iomem range 0xc0000-0xcffff has been reserved
[ 90.896557] pnp: 00:0c: iomem range 0xe0000-0xfffff could not be reserved
[ 90.896718] pnp: 00:0c: iomem range 0x100000-0xcfffffff could not be
reserved
[ 90.896882] pnp: 00:0c: iomem range 0xff380000-0xffffffff could not be
reserved
[ 90.897365] PCI: Bridge: 0000:00:08.0
[ 90.897523] IO window: d000-dfff
[ 90.897682] MEM window: f9f00000-f9ffffff
[ 90.897841] PREFETCH window: f0000000-f00fffff
[ 90.898000] PCI: Bridge: 0000:00:0b.0
[ 90.898158] IO window: c000-cfff
[ 90.898316] MEM window: f9e00000-f9efffff
[ 90.898475] PREFETCH window: disabled.
[ 90.898633] PCI: Bridge: 0000:00:0c.0
[ 90.898790] IO window: disabled.
[ 90.898948] MEM window: disabled.
[ 90.899105] PREFETCH window: disabled.
[ 90.899264] PCI: Bridge: 0000:00:0d.0
[ 90.899422] IO window: e000-efff
[ 90.899580] MEM window: fa000000-febfffff
[ 90.899738] PREFETCH window: d0000000-dfffffff
[ 90.899897] PCI: Bridge: 0000:00:0e.0
[ 90.900054] IO window: disabled.
[ 90.900214] MEM window: disabled.
[ 90.900371] PREFETCH window: disabled.
[ 90.900535] PCI: Setting latency timer of device 0000:00:08.0 to 64
[ 90.900543] PCI: Setting latency timer of device 0000:00:0b.0 to 64
[ 90.900549] PCI: Setting latency timer of device 0000:00:0c.0 to 64
[ 90.900553] PCI: Setting latency timer of device 0000:00:0d.0 to 64
[ 90.900558] PCI: Setting latency timer of device 0000:00:0e.0 to 64
[ 90.900578] NET: Registered protocol family 2
[ 90.933353] IP route cache hash table entries: 131072 (order: 8, 1048576
bytes)
[ 90.934123] TCP established hash table entries: 131072 (order: 9, 3145728
bytes)
[ 90.935536] TCP bind hash table entries: 65536 (order: 8, 1048576 bytes)
[ 90.936120] TCP: Hash tables configured (established 131072 bind 65536)
[ 90.936281] TCP reno registered
[ 90.947470] Total HugeTLB memory allocated, 0
[ 90.947710] Loading Reiser4. See http://www.namesys.com for a description of
Reiser4.
[ 90.948068] io scheduler noop registered
[ 90.948235] io scheduler cfq registered (default)
[ 90.949046] Boot video device is 0000:04:00.0
[ 90.949162] PCI: Setting latency timer of device 0000:00:0b.0 to 64
[ 90.949177] assign_interrupt_mode Found MSI capability
[ 90.949352] Allocate Port Service[0000:00:0b.0:pcie00]
[ 90.949395] PCI: Setting latency timer of device 0000:00:0c.0 to 64
[ 90.949409] assign_interrupt_mode Found MSI capability
[ 90.949577] Allocate Port Service[0000:00:0c.0:pcie00]
[ 90.949614] PCI: Setting latency timer of device 0000:00:0d.0 to 64
[ 90.949628] assign_interrupt_mode Found MSI capability
[ 90.949796] Allocate Port Service[0000:00:0d.0:pcie00]
[ 90.949832] PCI: Setting latency timer of device 0000:00:0e.0 to 64
[ 90.949846] assign_interrupt_mode Found MSI capability
[ 90.950014] Allocate Port Service[0000:00:0e.0:pcie00]
[ 90.950188] input: Power Button (FF) as /class/input/input0
[ 90.950348] ACPI: Power Button (FF) [PWRF]
[ 90.950560] input: Power Button (CM) as /class/input/input1
[ 90.950720] ACPI: Power Button (CM) [PWRB]
[ 90.950985] ACPI Exception (processor_core-0818): AE_NOT_FOUND, Processor
Device is not present [20070126]
[ 90.951333] ACPI Exception (processor_core-0818): AE_NOT_FOUND, Processor
Device is not present [20070126]
[ 90.955438] Real Time Clock Driver v1.12ac
[ 90.955675] hpet_resources: 0xfed00000 is busy
[ 90.955686] Linux agpgart interface v0.102
[ 90.955845] Hangcheck: starting hangcheck timer 0.9.0 (tick is 180 seconds,
margin is 60 seconds).
[ 90.956137] Hangcheck: Using get_cycles().
[ 90.956352] tun: Universal TUN/TAP device driver, 1.6
[ 90.956511] tun: (C) 1999-2004 Max Krasnyansky <maxk@qualcomm.com>
[ 90.956690] Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2
[ 90.956853] ide: Assuming 33MHz system bus speed for PIO modes; override
with idebus=xx
[ 90.957166] NFORCE-MCP65: IDE controller at PCI slot 0000:00:09.0
[ 90.957335] NFORCE-MCP65: chipset revision 161
[ 90.957494] NFORCE-MCP65: not 100% native mode: will probe irqs later
[ 90.957658] NFORCE-MCP65: 0000:00:09.0 (rev a1) UDMA133 controller
[ 90.957820] ide0: BM-DMA at 0xffa0-0xffa7, BIOS settings: hda:DMA,
hdb:pio
[ 90.958166] Probing IDE interface ide0...
[ 91.246150] hda: SAMSUNG SP1213N, ATA DISK drive
[ 91.888005] hda: selected mode 0x45
[ 91.888070] ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
[ 91.890652] hda: max request size: 512KiB
[ 91.890861] hda: 234493056 sectors (120060 MB) w/8192KiB Cache,
CHS=16383/255/63, UDMA(100)
[ 91.891601] hda: cache flushes supported
[ 91.891777] hda: hda1 hda2 hda3 < hda5 hda6 hda7 >
[ 91.931121] ahci 0000:00:0a.0: version 2.3
[ 91.931299] ACPI: PCI Interrupt Link [LSA0] enabled at IRQ 23
[ 91.931465] ACPI: PCI Interrupt 0000:00:0a.0[A] -> Link [LSA0] -> GSI 23
(level, low) -> IRQ 23
[ 92.935591] ahci 0000:00:0a.0: AHCI 0001.0100 32 slots 4 ports 3 Gbps 0xf
impl IDE mode
[ 92.935888] ahci 0000:00:0a.0: flags: 64bit sntf led clo pmp pio
[ 92.936049] PCI: Setting latency timer of device 0000:00:0a.0 to 64
[ 92.936263] scsi0 : ahci
[ 92.936486] scsi1 : ahci
[ 92.936677] scsi2 : ahci
[ 92.936867] scsi3 : ahci
[ 92.937048] ata1: SATA max UDMA/133 cmd 0xffffc20000020100 ctl
0x0000000000000000 bmdma 0x0000000000000000 irq 315
[ 92.937342] ata2: SATA max UDMA/133 cmd 0xffffc20000020180 ctl
0x0000000000000000 bmdma 0x0000000000000000 irq 315
[ 92.937636] ata3: SATA max UDMA/133 cmd 0xffffc20000020200 ctl
0x0000000000000000 bmdma 0x0000000000000000 irq 315
[ 92.937930] ata4: SATA max UDMA/133 cmd 0xffffc20000020280 ctl
0x0000000000000000 bmdma 0x0000000000000000 irq 315
[ 93.411080] ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
[ 93.411883] ata1.00: ATA-7: WDC WD1600JS-00MHB1, 10.02E01, max UDMA/133
[ 93.412045] ata1.00: 312581808 sectors, multi 16: LBA48
[ 93.412797] ata1.00: configured for UDMA/133
[ 93.727032] ata2: SATA link down (SStatus 0 SControl 300)
[ 94.042987] ata3: SATA link down (SStatus 0 SControl 300)
[ 94.358941] ata4: SATA link down (SStatus 0 SControl 300)
[ 94.359249] scsi 0:0:0:0: Direct-Access ATA WDC WD1600JS-00M 10.0
PQ: 0 ANSI: 5
[ 94.359599] sd 0:0:0:0: [sda] 312581808 512-byte hardware sectors (160042
MB)
[ 94.359765] sd 0:0:0:0: [sda] Write Protect is off
[ 94.359925] sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
[ 94.359933] sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled,
doesn't support DPO or FUA
[ 94.360249] sd 0:0:0:0: [sda] 312581808 512-byte hardware sectors (160042
MB)
[ 94.360414] sd 0:0:0:0: [sda] Write Protect is off
[ 94.360573] sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
[ 94.360581] sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled,
doesn't support DPO or FUA
[ 94.360873] sda: sda1 sda2 sda3 sda4 < sda5 sda6 >
[ 94.402906] sd 0:0:0:0: [sda] Attached SCSI disk
[ 94.403107] sd 0:0:0:0: Attached scsi generic sg0 type 0
[ 94.403289] usbmon: debugfs is not available
[ 94.403675] ACPI: PCI Interrupt Link [LUB2] enabled at IRQ 22
[ 94.403840] ACPI: PCI Interrupt 0000:00:02.1[B] -> Link [LUB2] -> GSI 22
(level, low) -> IRQ 22
[ 94.404361] PCI: Setting latency timer of device 0000:00:02.1 to 64
[ 94.404365] ehci_hcd 0000:00:02.1: EHCI Host Controller
[ 94.404613] ehci_hcd 0000:00:02.1: new USB bus registered, assigned bus
number 1
[ 94.404924] ehci_hcd 0000:00:02.1: debug port 1
[ 94.405084] PCI: cache line size of 64 is not supported by device
0000:00:02.1
[ 94.405093] ehci_hcd 0000:00:02.1: irq 22, io mem 0xf9dfec00
[ 94.415579] ehci_hcd 0000:00:02.1: USB 2.0 started, EHCI 1.00, driver 10
Dec 2004
[ 94.415978] usb usb1: configuration #1 chosen from 1 choice
[ 94.416164] hub 1-0:1.0: USB hub found
[ 94.416325] hub 1-0:1.0: 10 ports detected
[ 94.522056] ohci_hcd: 2006 August 04 USB 1.1 'Open' Host Controller (OHCI)
Driver
[ 94.522280] ACPI: PCI Interrupt Link [LUB0] enabled at IRQ 21
[ 94.522446] ACPI: PCI Interrupt 0000:00:02.0[A] -> Link [LUB0] -> GSI 21
(level, low) -> IRQ 21
[ 94.522921] PCI: Setting latency timer of device 0000:00:02.0 to 64
[ 94.522925] ohci_hcd 0000:00:02.0: OHCI Host Controller
[ 94.523165] ohci_hcd 0000:00:02.0: new USB bus registered, assigned bus
number 2
[ 94.523467] ohci_hcd 0000:00:02.0: irq 21, io mem 0xf9dff000
[ 94.580626] usb usb2: configuration #1 chosen from 1 choice
[ 94.580811] hub 2-0:1.0: USB hub found
[ 94.580975] hub 2-0:1.0: 10 ports detected
[ 94.685025] Initializing USB Mass Storage driver...
[ 94.754717] usb 1-3: new high speed USB device using ehci_hcd and address 2
[ 94.889115] usb 1-3: configuration #1 chosen from 1 choice
[ 94.889578] hub 1-3:1.0: USB hub found
[ 94.890054] hub 1-3:1.0: 4 ports detected
[ 95.456567] usb 1-5: new high speed USB device using ehci_hcd and address 4
[ 95.595780] usb 1-5: configuration #1 chosen from 1 choice
[ 95.835711] usb 2-4: new full speed USB device using ohci_hcd and address 2
[ 95.992441] usb 2-4: configuration #1 chosen from 1 choice
[ 95.998394] hub 2-4:1.0: USB hub found
[ 96.001364] hub 2-4:1.0: 4 ports detected
[ 96.342599] usb 2-4.1: new full speed USB device using ohci_hcd and address
3
[ 96.477358] usb 2-4.1: configuration #1 chosen from 1 choice
[ 96.701795] usb 2-4.4: new low speed USB device using ohci_hcd and address
4
[ 96.815598] usb 2-4.4: configuration #1 chosen from 1 choice
[ 96.822535] usbcore: registered new interface driver usb-storage
[ 96.822696] USB Mass Storage support registered.
[ 96.822901] PNP: PS/2 Controller [PNP0303:PS2K,PNP0f03:PS2M] at 0x60,0x64
irq 1,12
[ 96.825753] serio: i8042 KBD port at 0x60,0x64 irq 1
[ 96.825915] serio: i8042 AUX port at 0x60,0x64 irq 12
[ 96.826150] mice: PS/2 mouse device common for all mice
[ 96.832628] input: Logitech USB-PS/2 Optical Mouse as /class/input/input2
[ 96.832827] input: USB HID v1.10 Mouse [Logitech USB-PS/2 Optical Mouse] on
usb-0000:00:02.0-4.4
[ 96.833173] usbcore: registered new interface driver usbhid
[ 96.833333] drivers/hid/usbhid/hid-core.c: v2.6:USB HID core driver
[ 96.833521] TCP cubic registered
[ 96.833686] NET: Registered protocol family 1
[ 96.833847] NET: Registered protocol family 17
[ 96.834019] powernow-k8: Found 1 AMD Athlon(tm) 64 X2 Dual Core Processor
6000+ processors (2 cpu cores) (version 2.00.00)
[ 96.834249] powernow-k8: 0 : fid 0x16 (3000 MHz), vid 0x6
[ 96.834410] powernow-k8: 1 : fid 0x14 (2800 MHz), vid 0x8
[ 96.837818] powernow-k8: 2 : fid 0x12 (2600 MHz), vid 0xa
[ 96.837978] powernow-k8: 3 : fid 0x10 (2400 MHz), vid 0xc
[ 96.838138] powernow-k8: 4 : fid 0xe (2200 MHz), vid 0xe
[ 96.838297] powernow-k8: 5 : fid 0xc (2000 MHz), vid 0x10
[ 96.838457] powernow-k8: 6 : fid 0xa (1800 MHz), vid 0x10
[ 96.838616] powernow-k8: 7 : fid 0x2 (1000 MHz), vid 0x12
[ 96.851725] input: AT Translated Set 2 keyboard as /class/input/input3
[ 96.950416] ReiserFS: sda3: found reiserfs format "3.6" with standard
journal
[ 96.950579] ReiserFS: sda3: using ordered data mode
[ 96.959961] ReiserFS: sda3: journal params: device sda3, size 8192, journal
first block 18, max trans len 1024, max batch 900, max commit age 30, max
trans age 30
[ 96.961243] ReiserFS: sda3: checking transaction log (sda3)
[ 96.995957] ReiserFS: sda3: Using r5 hash to sort names
[ 96.996128] VFS: Mounted root (reiserfs filesystem) readonly.
[ 96.996299] Freeing unused kernel memory: 180k freed
[ 101.654229] ReiserFS: sda3: switching to journaled data mode
[ 102.013470] r8169 Gigabit Ethernet driver 2.2LK loaded
[ 102.013713] ACPI: PCI Interrupt Link [LNEB] enabled at IRQ 19
[ 102.013721] ACPI: PCI Interrupt 0000:01:00.0[A] -> Link [LNEB] -> GSI 19
(level, low) -> IRQ 19
[ 102.013734] PCI: Setting latency timer of device 0000:01:00.0 to 64
[ 102.013905] eth0: RTL8168b/8111b at 0xffffc20000176000, 00:19:66:47:f3:cb,
XID 38000000 IRQ 19
[ 102.427149] reiser4: sda5: found disk format 4.0.0.
[ 107.810143] reiser4: sda6: found disk format 4.0.0.
[ 109.081151] reiser4: hda7: found disk format 4.0.0.
[ 109.782349] Adding 1951888k swap on /dev/sda2. Priority:-1 extents:1
across:1951888k
[ 112.863597] ACPI: PCI Interrupt Link [LNKB] enabled at IRQ 18
[ 112.863606] ACPI: PCI Interrupt 0000:02:09.0[A] -> Link [LNKB] -> GSI 18
(level, low) -> IRQ 18
[ 112.866305] Audigy2 value: Special config.
[ 115.009319] hda: selected mode 0x45
[ 115.033261] w83627ehf: Found W83627EHG chip at 0x290
[ 116.395585] r8169: eth0: link up
[ 116.395725] r8169: eth0: link up
[ 141.354848] usb 2-2: new low speed USB device using ohci_hcd and address 5
[ 141.462517] usb 2-2: configuration #1 chosen from 1 choice
[ 141.481877] input: Logitech USB Receiver as /class/input/input4
[ 141.481969] input: USB HID v1.11 Mouse [Logitech USB Receiver] on
usb-0000:00:02.0-2
[ 141.490496] input: Logitech USB Receiver as /class/input/input5
[ 141.490587] input: USB HID v1.11 Device [Logitech USB Receiver] on
usb-0000:00:02.0-2
[ 143.719023] usb 2-2: USB disconnect, address 5
[ 143.728038] hald-addon-keyb[4055]: segfault at 00000000fffffff8 rip
000000387fa71845 rsp 00007fff9bb8d350 error 4
[ 148.917772] usb 2-2: new low speed USB device using ohci_hcd and address 6
[ 149.025834] usb 2-2: configuration #1 chosen from 1 choice
[ 149.042218] input: Logitech USB Receiver as /class/input/input6
[ 149.042262] input: USB HID v1.11 Mouse [Logitech USB Receiver] on
usb-0000:00:02.0-2
[ 149.053719] input: Logitech USB Receiver as /class/input/input7
[ 149.053857] input: USB HID v1.11 Device [Logitech USB Receiver] on
usb-0000:00:02.0-2
[ 149.626175] usb 2-2: USB disconnect, address 6
[ 149.634782] hald-addon-keyb[4099]: segfault at 00000000fffffff8 rip
000000387fa71845 rsp 00007fff75d9e560 error 4
[ 151.496394] usb 2-1: new low speed USB device using ohci_hcd and address 7
[ 151.604409] usb 2-1: configuration #1 chosen from 1 choice
[ 151.618438] input: Logitech USB Receiver as /class/input/input8
[ 151.618530] input: USB HID v1.11 Mouse [Logitech USB Receiver] on
usb-0000:00:02.0-1
[ 151.627073] input: Logitech USB Receiver as /class/input/input9
[ 151.627153] input: USB HID v1.11 Device [Logitech USB Receiver] on
usb-0000:00:02.0-1
[ 152.301309] usb 2-1: USB disconnect, address 7
[ 152.306039] hald-addon-keyb[4143]: segfault at 00000000fffffff8 rip
000000387fa71845 rsp 00007fffabc6e430 error 4
[ 163.250003] usb 1-3.1: new low speed USB device using ehci_hcd and address
8
[ 163.314229] usb 1-3.1: configuration #1 chosen from 1 choice
[ 163.322789] input: Logitech USB Receiver as /class/input/input10
[ 163.322704] input: USB HID v1.11 Mouse [Logitech USB Receiver] on
usb-0000:00:02.1-3.1
[ 163.326853] input: Logitech USB Receiver as /class/input/input11
[ 163.326877] input: USB HID v1.11 Device [Logitech USB Receiver] on
usb-0000:00:02.1-3.1
[ 165.337646] usb 1-3.1: USB disconnect, address 8
[ 165.339431] hald-addon-keyb[4188]: segfault at 00000000fffffff8 rip
000000387fa71845 rsp 00007fff04610df0 error 4
[ 168.790680] usb 2-2: new low speed USB device using ohci_hcd and address 8
[ 168.898534] usb 2-2: configuration #1 chosen from 1 choice
[ 168.912448] input: Logitech USB Receiver as /class/input/input12
[ 168.912594] input: USB HID v1.11 Mouse [Logitech USB Receiver] on
usb-0000:00:02.0-2
[ 168.921142] input: Logitech USB Receiver as /class/input/input13
[ 168.921234] input: USB HID v1.11 Device [Logitech USB Receiver] on
usb-0000:00:02.0-2
[ 242.510214] usb 2-4.4: USB disconnect, address 4
[ 343.072949] kio_http_cache_[4577] general protection rip:3919ff1fe9
rsp:7fffb5d915d0 error:0
[ 1771.063731] Unable to handle kernel paging request at 0000040000000000 RIP:
[ 1771.063735] [<ffffffff8044256a>] _spin_lock+0x0/0xf
[ 1771.063740] PGD 0
[ 1771.063741] Oops: 0002 [1] SMP
[ 1771.063743] CPU 0
[ 1771.063744] Modules linked in: k8temp w83627ehf hwmon_vid hwmon i2c_core
snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi
snd_seq_midi_emul snd_pcm_oss snd_mixer_oss snd_seq_oss snd_seq_midi_event
snd_seq snd_emu10k1 snd_rawmidi snd_ac97_codec ac97_bus snd_pcm
snd_seq_device snd_timer snd_page_alloc snd_util_mem snd_hwdep snd r8169
[ 1771.063756] Pid: 4418, comm: kdm Not tainted 2.6.23.11reiser4 #1
[ 1771.063758] RIP: 0010:[<ffffffff8044256a>] [<ffffffff8044256a>]
_spin_lock+0x0/0xf
[ 1771.063760] RSP: 0018:ffff81012937de10 EFLAGS: 00010206
[ 1771.063762] RAX: ffff81012bd78870 RBX: 0000040000000000 RCX:
0000000000000000
[ 1771.063764] RDX: 0000000000000000 RSI: ffff81012c549e58 RDI:
0000040000000000
[ 1771.063765] RBP: ffff81012bd78870 R08: 800000012c52c045 R09:
0000000000000005
[ 1771.063767] R10: ffff8100050df9f8 R11: 0000000000000002 R12:
ffff8101280c3760
[ 1771.063768] R13: ffff81012f05fac0 R14: ffff81012c549db0 R15:
ffff81012f05fac0
[ 1771.063770] FS: 00002b438009bb40(0000) GS:ffffffff80533000(0000)
knlGS:0000000000000000
[ 1771.063772] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[ 1771.063773] CR2: 0000040000000000 CR3: 000000012d689000 CR4:
00000000000006e0
[ 1771.063775] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
0000000000000000
[ 1771.063776] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7:
0000000000000400
[ 1771.063778] Process kdm (pid: 4418, threadinfo ffff81012937c000, task
ffff81012fca7860)
[ 1771.063779] Stack: ffffffff8026b084 ffff81012ba17138 ffff81012bd78870
0000000000000000
[ 1771.063782] ffffffff80230d0c 00002b438009bbd0 0000000000000000
ffff81012937df58
[ 1771.063785] 00007fff2aa3d9d0 0000000001200011 0000000000000000
ffff8101280c3760
[ 1771.063787] Call Trace:
[ 1771.063790] [<ffffffff8026b084>] anon_vma_link+0x1a/0x40
[ 1771.063793] [<ffffffff80230d0c>] copy_process+0xb03/0x1301
[ 1771.063798] [<ffffffff80231670>] do_fork+0xb1/0x1fc
[ 1771.063802] [<ffffffff8023aa56>] recalc_sigpending+0xe/0x25
[ 1771.063804] [<ffffffff8020b66e>] system_call+0x7e/0x83
[ 1771.063806] [<ffffffff8020b987>] ptregscall_common+0x67/0xb0
[ 1771.063810]
[ 1771.063811]
[ 1771.063811] Code: f0 ff 0f 79 09 f3 90 83 3f 00 7e f9 eb f2 c3 f0 81 2f 00
00
[ 1771.063816] RIP [<ffffffff8044256a>] _spin_lock+0x0/0xf
[ 1771.063819] RSP <ffff81012937de10>
[ 1771.063820] CR2: 0000040000000000

Glück Auf,
Volker

2007-12-17 15:48:15

by Hugh Dickins

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well

On Mon, 17 Dec 2007, Hemmann, Volker Armin wrote:
>
> I got another crash, now with 2.6.23.11 on logout from KDE (two differences,
> new kernel, 4gb ram instead of 2gb):
>
> [ 1771.063731] Unable to handle kernel paging request at 0000040000000000 RIP:
> also I got some strange message yesterday before increasing ramsize:
> [19546.639528] swap_free: Bad swap offset entry 04000000
> [27999.370777] swap_free: Bad swap offset entry 04000000
> [27999.434282] swap_free: Bad swap offset entry 04000000
> [27999.466035] swap_free: Bad swap offset entry 04000000
> [27999.521132] swap_free: Bad swap offset entry 04000000
> [27999.561621] VM: killing process ld-linux-x86-64
> [27999.561719] swap_free: Bad swap offset entry 04000000

You're seeing a single bit set where it shouldn't be: please give
memtest86+ a good try; if it's not actually your memory that's bad,
then I'd guess it's something like overheating (please correct me,
ye who know better).
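
The single-bit observation can be checked with a short sketch. The helper name `is_single_bit` is made up for illustration, and the two values are copied from the logs quoted in this thread; nothing here comes from kernel code.

```python
def is_single_bit(value):
    """Return True when exactly one bit is set (value is a power of two)."""
    return value > 0 and value & (value - 1) == 0


# Values copied from the logs quoted in this thread.
swap_offset = 0x04000000          # "swap_free: Bad swap offset entry 04000000"
fault_addr = 0x0000040000000000   # CR2 from the 2.6.23.11 oops

for name, value in (("swap offset", swap_offset), ("fault address", fault_addr)):
    # bit_length() - 1 is the zero-based index of the highest set bit.
    print(f"{name}: {value:#x} single_bit={is_single_bit(value)} "
          f"bit={value.bit_length() - 1}")
```

A value that should have been zero but reads back with exactly one bit set is the kind of pattern that tends to point at a flipped bit rather than at a wild pointer.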

Hugh

2007-12-17 20:10:53

by Volker Armin Hemmann

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well

On Monday, 17 December 2007, you wrote:
> On Mon, 17 Dec 2007, Hemmann, Volker Armin wrote:
> > I got another crash, now with 2.6.23.11 on logout from KDE (two
> > differences, new kernel, 4gb ram instead of 2gb):

> > also I got some strange message yesterday before increasing ramsize:
> > [19546.639528] swap_free: Bad swap offset entry 04000000
> > [27999.370777] swap_free: Bad swap offset entry 04000000
> > [27999.434282] swap_free: Bad swap offset entry 04000000
> > [27999.466035] swap_free: Bad swap offset entry 04000000
> > [27999.521132] swap_free: Bad swap offset entry 04000000
> > [27999.561621] VM: killing process ld-linux-x86-64
> > [27999.561719] swap_free: Bad swap offset entry 04000000
>
> You're seeing a single bit set where it shouldn't be: please give
> memtest86+ a good try; if it's not actually your memory that's bad,
> then I'd guess it's something like overheating (please correct me,
> ye who know better).
>
> Hugh

First of all, the 2 GB with which I was seeing that had memtest run on them
for some hours a few weeks ago, without problems. I can compile stuff, like
the latest kde4 rc, without segfaults or problems (except when the oops is
happening), and this mess only started recently. To be more precise:
the swap mess only started with 2.6.23.11. With 2.6.23.9 I get the kio_http...
rip's, but no swap-related messages.

Overheating is very unlikely. I made sure that my computer is very well
cooled. Even under high load I get something like 50°C from lmsensors and
bios - and the errors are completely unrelated to load or temperature.
Without load my cpu idles at ~30°C. Again, lmsensors and bios agree closely
on that.

Glück Auf,
Volker

2007-12-20 02:14:28

by Volker Armin Hemmann

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well

On Monday, 17 December 2007, you wrote:

and another one, this time tainted with the nvidia module:
[ 5194.130985] Unable to handle kernel paging request at 0000030000000000 RIP:
[ 5194.130988] [<ffffffff804449fa>] _spin_lock+0x0/0xf
[ 5194.130993] PGD 0
[ 5194.130994] Oops: 0002 [1] SMP
[ 5194.130996] CPU 1
[ 5194.130997] Modules linked in: rfcomm l2cap hci_usb bluetooth snd_usb_audio
ohci1394 snd_usb_lib ieee1394 aic7xxx i2c_nforce2 nvidia(P) k8temp w83627ehf
hwmon_vid hwmon i2c_core snd_seq_midi snd_emu10k1_synth snd_emux_synth
snd_seq_virmidi snd_seq_midi_emul snd_pcm_oss snd_mixer_oss snd_seq_oss
snd_seq_midi_event snd_seq snd_emu10k1 snd_rawmidi snd_ac97_codec ac97_bus
snd_pcm snd_seq_device snd_timer snd_page_alloc snd_util_mem snd_hwdep snd
r8169
[ 5194.131014] Pid: 22490, comm: sleep Tainted: P 2.6.23.11reiser4 #4
[ 5194.131015] RIP: 0010:[<ffffffff804449fa>] [<ffffffff804449fa>]
_spin_lock+0x0/0xf
[ 5194.131018] RSP: 0018:ffff81009278be70 EFLAGS: 00010206
[ 5194.131020] RAX: 00002ab90bfb5000 RBX: ffff810117d44db0 RCX:
00002ab90bdb5000
[ 5194.131021] RDX: ffff81011519f810 RSI: 000000388aa08fff RDI:
0000030000000000
[ 5194.131023] RBP: 0000030000000000 R08: ffff81012f190ea0 R09:
0000000000000000
[ 5194.131024] R10: 0000000000000008 R11: 0000000000000246 R12:
ffff810117d44db0
[ 5194.131026] R13: 00002ab90bdb0000 R14: 0000000000000000 R15:
0000000000000000
[ 5194.131028] FS: 00002ab90bde3070(0000) GS:ffff81012fc6cec0(0000)
knlGS:00000000f7f756c0
[ 5194.131030] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[ 5194.131031] CR2: 0000030000000000 CR3: 0000000093605000 CR4:
00000000000006e0
[ 5194.131033] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
0000000000000000
[ 5194.131034] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7:
0000000000000400
[ 5194.131036] Process sleep (pid: 22490, threadinfo ffff81009278a000, task
ffff8100960630c0)
[ 5194.131037] Stack: ffffffff8026afbc ffff810117d44db0 ffff810115e2cbb8
ffff810117d44db0
[ 5194.131040] ffffffff80265ec3 ffff81009278bee0 ffff81009278bee0
ffff810115e2c3d8
[ 5194.131043] ffff8100a076cb80 0000000000000002 00007fff9ecf7808
00007fff9ecf7810
[ 5194.131045] Call Trace:
[ 5194.131048] [<ffffffff8026afbc>] anon_vma_unlink+0x1a/0x64
[ 5194.131051] [<ffffffff80265ec3>] free_pgtables+0x64/0xc4
[ 5194.131054] [<ffffffff80267174>] exit_mmap+0x91/0xeb
[ 5194.131057] [<ffffffff80230191>] mmput+0x28/0xa0
[ 5194.131060] [<ffffffff802353db>] do_exit+0x211/0x786
[ 5194.131063] [<ffffffff802359cf>] sys_exit_group+0x0/0xe
[ 5194.131065] [<ffffffff8020b66e>] system_call+0x7e/0x83
[ 5194.131069]
[ 5194.131070]
[ 5194.131070] Code: f0 ff 0f 79 09 f3 90 83 3f 00 7e f9 eb f2 c3 f0 81 2f 00
00
[ 5194.131076] RIP [<ffffffff804449fa>] _spin_lock+0x0/0xf
[ 5194.131078] RSP <ffff81009278be70>
[ 5194.131079] CR2: 0000030000000000
[ 5194.131101] Fixing recursive fault but reboot is needed!

2007-12-20 04:18:19

by Scott

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well


On Thu, 2007-12-20 at 03:13 +0100, Hemmann, Volker Armin wrote:
> On Montag, 17. Dezember 2007, you wrote:
>
> and another one, this time tainted with the nvidia module:
> 5194.130985] Unable to handle kernel paging request at 0000030000000000 RIP:

This really sounds like bad hardware. Either memory or the mobo/riser
card the memory is on. You might try relaxing the memory timings in the
BIOS. Try removing 1/2 of your memory. If it still fails, remove the
other 1/2, put the first 1/2 back and try again.

--
Scott

2007-12-20 05:53:41

by Volker Armin Hemmann

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well

On Thursday, 20 December 2007, you wrote:
> On Thu, 2007-12-20 at 03:13 +0100, Hemmann, Volker Armin wrote:
> > On Montag, 17. Dezember 2007, you wrote:
> >
> > and another one, this time tainted with the nvidia module:
> > 5194.130985] Unable to handle kernel paging request at 0000030000000000
> > RIP:
>
> This really sounds like bad hardware. Either memory or the mobo/riser
> card the memory is on. You might try lowering the memory timings of your
> memory in BIOS. Try removing 1/2 of your memory. If it still fails, remove
> the other 1/2 and put the first 1/2 back and try again.

if this is bad hardware why:

- didn't this show up earlier?

- did a several-hour memtest run a couple of weeks ago not show anything?

- and does stuff like compiling all of kde 3.5.8 or the latest kde4 rc finish
without any problems?

If it were bad hardware, I should see segfaults left and right, right? But
I don't see them. In fact, apart from the oopses the system works fine -
even with the oopses the system works fine, apart from the occasional stuck
ps aux.

And these messages:
[41160.823959] kio_http_cache_[25229] general protection rip:32621f1fe9
rsp:7fff59a3d270 error:0
show up on closing konqueror tabs/Konqueror. There are no surprising exits, no
apps vanishing.

But I will run memtest86+ (or should I use memtest86?).

Glück Auf,
Volker

2007-12-20 14:41:37

by David Newall

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well


>>> On Montag, 17. Dezember 2007, you wrote:
>>>
>>> and another one, this time tainted with the nvidia module:
>>> 5194.130985] Unable to handle kernel paging request at 0000030000000000
>>> RIP:

Numbers like that don't suggest hardware faults. All those zeros: it's
far too round. Sounds very much like software. In fact, it sounds like the
start of a significant hardware region. And lo! there's a closed-source,
possibly buggy nvidia module. Try another; older or newer are equally good.

2007-12-20 15:47:04

by Stefan Richter

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well

Hemmann, Volker Armin wrote:
> [ 5194.131014] Pid: 22490, comm: sleep Tainted: P 2.6.23.11reiser4 #4

The subject line is wrong.
You apparently run Linux, but not Linux 2.6.23.y.
--
Stefan Richter
-=====-=-=== ==-- =-=--
http://arcgraph.de/sr/

2007-12-20 18:12:49

by Volker Armin Hemmann

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well

On Thursday, 20 December 2007, you wrote:
> Hemmann, Volker Armin wrote:
> > [ 5194.131014] Pid: 22490, comm: sleep Tainted: P 2.6.23.11reiser4
> > #4
>
> The subject line is wrong.
> You apparently run Linux, but not Linux 2.6.23.y.

first of all, apart from this oops all other oopses I reported were with a
not-tainted kernel. You might want to read the other mails I have sent.

Also, besides the reiser4 patch there is no other patch added to the
kernel. And since people have successfully reported problems with
heavily distro-patched kernels in the past, it looks a little bit hypocritical
to put my reports aside because of one single patch - don't you think?

Glück Auf,
Volker

2007-12-20 18:37:48

by Volker Armin Hemmann

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well

On Thursday, 20 December 2007, David Newall wrote:
> >>> On Montag, 17. Dezember 2007, you wrote:
> >>>
> >>> and another one, this time tainted with the nvidia module:
> >>> 5194.130985] Unable to handle kernel paging request at 0000030000000000
> >>> RIP:
>
> Numbers like that don't suggest hardware faults. All those zeros: It's
> far too round. Sounds very like software. In fact, it sounds like the
> start of significant hardware region. And lo! there's a closed-source,
> possibly buggy nvidia module. Try another; older or newer are equally
> good.

and this one was without the nvidia module:
http://marc.info/?l=linux-kernel&m=119790371708690&w=2

and the first one I reported, was without nvidia and not-tainted too:
http://marc.info/?l=linux-kernel&m=119776365425514&w=2

I am not a complete idiot. If I have a problem, I try to reproduce without
nvidia first (after a clean shutdown and boot, with the module not even on
harddisk). And I reproduced it without the module. The last oops with the
module was just an example that it does not matter if the module is loaded or
not and to (maybe) give some additional information.

Glück Auf,
Volker

2007-12-20 19:06:17

by Ingo Molnar

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well


* Hemmann, Volker Armin wrote:

> On Donnerstag, 20. Dezember 2007, you wrote:
> > Hemmann, Volker Armin wrote:
> > > [ 5194.131014] Pid: 22490, comm: sleep Tainted: P 2.6.23.11reiser4
> > > #4
> >
> > The subject line is wrong.
> > You apparently run Linux, but not Linux 2.6.23.y.
>
> first of all, apart from this oops all other oopses I reported were
> with a not-tainted kernel. You might want to read the other mails I
> have sent.
>
> Also, besides of the reiser4 patch there is no other patch added to
> the kernel. And since people have had successfully reported problems
> with heavily distro-patched kernels in the past it looks a little bit
> hypocritical to put my reports aside because of one single patch -
> don't you think?

reiser4 isn't just a single random patch, it's a huge patch with lots of
interactions with file and memory management. Would it be hard for you
to reproduce the crash without reiser4? (or is all your stuff on
reiser4?)

Ingo

2007-12-20 19:07:21

by Stefan Richter

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well

Hemmann, Volker Armin wrote:
> On Donnerstag, 20. Dezember 2007, you wrote:
>> The subject line is wrong.
>> You apparently run Linux, but not Linux 2.6.23.y.
>
> first of all, apart from this oops all other oopses I reported were with a
> not-tainted kernel. You might want to read the other mails I have sent.
>
> Also, besides of the reiser4 patch there is no other patch added to the
> kernel. And since people have had successfully reported problems with
> heavily distro-patched kernels in the past it looks a little bit hypocritical
> to put my reports aside because of one single patch - don't you think?

I didn't say anything about putting your report aside.

For successful reports (as in 'leading to a fix'), it is, among other
things, necessary that the issue can be narrowed down enough. Sometimes this is
a quick process; e.g. user X finds a very specific driver bug while
using a patched and old kernel, driver developer Y takes the time to
confirm this bug in a recent mainline kernel because he already had a
good idea where to look and how to recreate the respective conditions,
and fixes the bug. Sometimes it takes much much more work to identify
the circumstances of the bug. It is then necessary that the reporter
knows exactly what he is running, simplifies his system to eliminate as
many potential causes for problems as possible, and always clearly
states under what circumstances the bug happens.

If you already found the bug in an untainted (but patched?) kernel, then
what information does another report against a tainted kernel add? The
tainted kernel has more unknowns than the untainted one. Progress can
only be made if the number of unknowns is successively reduced.

Regarding other people's reports and hypocrisy and whatnot: I myself am
monitoring a few distro bug trackers more or less frequently for bug
reports concerning the kernel subsystem I'm interested in. With varying
success though. In order to make use of a report against a distro kernel,
I need to have a good picture of what stuff is in that kernel. Looking
at distro bug trackers does only work for me because my field of
interest is a driver subsystem which is somewhat decoupled from other
kernel parts; so if there is trouble concerning hardware covered by this
subsystem, it is usually not too hard to figure out whether the problem
is in this subsystem or somewhere else. If it weren't that easy most of
the time, I might for example depend on the reporters to test specific
mainline kernels or specific development kernels. (Though the latter
becomes necessary after all in cases when more targeted debug output is
needed from the reporter, or in order to test proposed fixes without
having to wait for the distributor to build a test package for the
reporter.)
--
Stefan Richter
-=====-=-=== ==-- =-=--
http://arcgraph.de/sr/

2007-12-20 20:29:23

by Volker Armin Hemmann

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well

On Thursday, 20 December 2007, Ingo Molnar wrote:
> * Hemmann, Volker Armin wrote:
> > On Donnerstag, 20. Dezember 2007, you wrote:
> > > Hemmann, Volker Armin wrote:
> > > > [ 5194.131014] Pid: 22490, comm: sleep Tainted: P
> > > > 2.6.23.11reiser4 #4
> > >
> > > The subject line is wrong.
> > > You apparently run Linux, but not Linux 2.6.23.y.
> >
> > first of all, apart from this oops all other oopses I reported were
> > with a not-tainted kernel. You might want to read the other mails I
> > have sent.
> >
> > Also, besides of the reiser4 patch there is no other patch added to
> > the kernel. And since people have had successfully reported problems
> > with heavily distro-patched kernels in the past it looks a little bit
> > hypocritical to put my reports aside because of one single patch -
> > don't you think?
>
> reiser4 isnt just a single random patch, it's a huge patch with lots of
> interactions with file and memory management. Would it be hard for you
> to reproduce the crash without reiser4? (or is all your stuff on
> reiser4?)

/home (and /var, /tmp) is on reiser4 and is my biggest partition. And since it
takes up to 3 days to reproduce this - yes, hard to do without r4.

Glück Auf,
Volker

2007-12-20 21:48:39

by Pekka Enberg

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well

Hi,

On Dec 20, 2007 4:38 PM, David Newall wrote:
> >>> and another one, this time tainted with the nvidia module:
> >>> 5194.130985] Unable to handle kernel paging request at 0000030000000000
> >>> RIP:
>
> Numbers like that don't suggest hardware faults. All those zeros: It's
> far too round. Sounds very like software. In fact, it sounds like the
> start of significant hardware region.

Nah, it's just that vma->anon_vma is probably supposed to be NULL here. And if
you look at all the oopses, they do suggest one particular byte lane
is dodgy (the
corruption is in bits 41-43 and 45).
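
A small sketch of that bit arithmetic, under stated assumptions: the helper `set_bits` is a made-up name, and for the first oops the intended pointer is assumed to be ffff810073e22db0 (the faulting address with one bit cleared, matching the ffff81... pointers in the other registers). The other two addresses are taken as-is, on the reading that the pointer should have been NULL.

```python
def set_bits(value):
    """Return the zero-based positions of all set bits in a 64-bit value."""
    return [i for i in range(64) if (value >> i) & 1]


first_fault = 0xffff910073e22db0
assumed_intended = 0xffff810073e22db0  # ASSUMPTION: not stated in the logs

# XOR isolates the bits that differ from the value that was presumably meant.
flipped_masks = [
    first_fault ^ assumed_intended,  # 2.6.23.9 oops
    0x0000040000000000,              # 2.6.23.11 oops (kdm)
    0x0000030000000000,              # 2.6.23.11 oops (sleep)
]

for mask in flipped_masks:
    bits = set_bits(mask)
    lanes = sorted({bit // 8 for bit in bits})  # byte lane = bit index // 8
    print(f"{mask:#018x}: bits (from 1) {[b + 1 for b in bits]}, "
          f"byte lane(s) {lanes}")
```

Counting bits from one, as the message does, every flipped bit in the three oopses falls inside the same byte of the 64-bit word.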

The whole thing reminds me of another bug where memtest86 didn't find anything
because it's doing cached memory accesses: http://lkml.org/lkml/2007/10/3/259

Pekka

2007-12-20 22:14:36

by Ingo Molnar

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well


* Pekka Enberg wrote:

> Nah, it's just that vma->anon_vma is probably supposed to be NULL
> here. And if you look at all the oopses, they do suggest one
> particular byte lane is dodgy (the corruption is in bits 41-43 and
> 45).
>
> The whole thing reminds me of another bug where memtest86 didn't find
> anything because it's doing cached memory accesses:
> http://lkml.org/lkml/2007/10/3/259

memtest86+ has an uncached test:

const struct tseq tseq[] = {
{1, 5, 3, 0, 0, "[Address test, walking ones] "},
{1, 6, 3, 2, 0, "[Address test, own address] "},
{1, 0, 3, 14, 0, "[Moving inversions, ones & zeros] "},
{1, 1, 2, 80, 0, "[Moving inversions, 8 bit pattern] "},
{1, 10, 60, 300, 0, "[Moving inversions, random pattern] "},
{1, 7, 64, 66, 0, "[Block move, 64 moves] "},
{1, 2, 2, 320, 0, "[Moving inversions, 32 bit pattern] "},
{1, 9, 40, 120, 0, "[Random number sequence] "},
{1, 3, 4, 240, 0, "[Modulo 20, ones & zeros] "},
{1, 8, 1, 2, 0, "[Bit fade test, 90 min, 2 patterns] "},
{0, 4, 3, 2, 0, "[[Moving inversions, 0 & 1, uncached] "},
{0, 0, 0, 0, 0, NULL}
};

find that "Moving inversions, 0 & 1" test and run that one alone,
overnight.
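
For readers unfamiliar with the technique, here is a toy sketch of the moving-inversions idea. It is not memtest86+ code: it runs on a Python list instead of physical RAM, it cannot control caching (which is the whole point of the uncached variant), and `StuckBitMemory` is a hypothetical fault model invented for the demonstration.

```python
def moving_inversions(memory, pattern, width=64):
    """Fill with a pattern, then verify and invert walking up, then down."""
    inverse = ~pattern & ((1 << width) - 1)
    errors = []
    for i in range(len(memory)):
        memory[i] = pattern
    # Upward pass: check the pattern, then write its complement.
    for i in range(len(memory)):
        if memory[i] != pattern:
            errors.append((i, pattern, memory[i]))
        memory[i] = inverse
    # Downward pass: check the complement, then restore the pattern.
    for i in reversed(range(len(memory))):
        if memory[i] != inverse:
            errors.append((i, inverse, memory[i]))
        memory[i] = pattern
    return errors


class StuckBitMemory(list):
    """Hypothetical fault model: one cell always reads back with one bit set."""

    def __init__(self, size, bad_index, bad_bit):
        super().__init__([0] * size)
        self.bad_index = bad_index
        self.bad_mask = 1 << bad_bit

    def __getitem__(self, index):
        value = super().__getitem__(index)
        if index == self.bad_index:
            value |= self.bad_mask
        return value


good = moving_inversions([0] * 16, 0)
bad = moving_inversions(StuckBitMemory(16, bad_index=5, bad_bit=42), 0)
print("good memory errors:", good)
print("faulty memory errors:", [(i, hex(want), hex(got)) for i, want, got in bad])
```

Each error is reported as (cell index, expected value, value read back), so a bit that sticks high shows up on the pass that expects zeros in that position.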

Ingo

2007-12-21 02:10:45

by Volker Armin Hemmann

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well

Ok, so after the holidays I will do the following:

let memtest86+ run several hours.
do a full backup&restore to switch to r3 and build an unpatched kernel.
see if I can reproduce the oops with .21 and .22 (because AFAIR no oops with
21.. but I might be wrong).

Not exactly in that order.

Glück Auf
Volker


ps: please cc me. I am not subscribed to lkml.

2007-12-21 11:19:45

by Ingo Molnar

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well


* Hemmann, Volker Armin wrote:

> Ok, so after the holidays I will do the following:
>
> let memtest86+ run several hours. do a full backup&restore to switch
> to r3 and build an unpatched kernel. see if I can reproduce the oops
> with .21 and .22 (because AFAIR no oops with 21.. but I might be
> wrong).
>
> Not exactly in that order.

yeah, that would help. But generally it's a big PITA to figure out such
bugs where there's no specific 'smoking gun' in the oopses themselves.
What usually happens is that people try to figure out a faster way of
triggering the bug - and then bisection can be done. But the hardware
must be eliminated first as the cause of the bug. Taking out half of the
RAM (I know it's painful ...) can help too in isolating RAM problems.

Ingo

2007-12-29 23:03:24

by Volker Armin Hemmann

Subject: Re: almost daily Kernel oops with 2.6.23.9 - and now 2.6.23.11 as well

Hi,

you guys were right, I was wrong.

It is the hardware.

I increased ram voltage by 0.15V on the 22nd and haven't had any oopses since then.
And I did torture the system.

I am deeply sorry that I wasted your time (but still puzzled that the oopses
started after kernel update - maybe I should buy a new psu... ).

So it is not reiser4 nor the kernel, just the ram needs a little more 'juice'
than the board delivers on 'auto' settings.

Glück Auf
Volker