2002-08-26 12:31:58

by h.grosenick

Subject: System freeze on 2.4.18 / 19 SMP

Hello

I have a reproducible system freeze using SuSE 8.0 with original kernel 2.4.19
(same with SuSE 2.4.18 kernel).

Hardware:

Asus P2B-Dual with 2x PIII 700 MHz (Bios 1013 - current)
896 MB RAM
Promise PDC20267 off board IDE Controller (current bios release)
aic7880 scsi-controller for CD-writers
nvidia graphic card
RTL-8139A based network card

/dev/hda: _NEC DV-5700B DVD-ROM on piix onboard controller, ide0
/dev/hdc: IBM IC35L060AVVA07-0 on PDC20267 ide1
/dev/hdd: IBM IC35L080AVVA07-0 on PDC20267 ide1
/dev/hde: IBM IC35L060AVVA07-0 on PDC20267 ide2

So far I have had a complete system freeze once a month, but now that I have
started the Oracle installer, the freeze happens within a minute during
installation and can be reproduced. The freeze does not always happen at
exactly the same action, though. The system becomes unstable first, so that,
for example, bash gets a "segmentation violation". A few seconds later the
system freezes and I have to power off.

I tried the following:

- run memtest86, everything OK
- stopped most of the other services
- started the Oracle installation from a remote X server, so that there is no
XFree86 running on the SMP machine, only xdm without a local X server
- tried boot parameter "idex=serialize"
- tried ext3 and reiserfs as file system for the installation target partition

=> no changes, the system still freezes.

I haven't done any kernel programming so far, but I would be able to give more
input if someone has an idea what to do.

Holger Grosenick



partitions:
(The oracle installer copies data from hdc11 / reiser to hdd1 / ext3)
-------------------------------------------------------------

major minor #blocks name
33 0 60051600 hde
33 1 30208 hde1
33 2 8192016 hde2
33 3 660744 hde3
33 4 1 hde4
33 5 1570936 hde5
33 6 1032664 hde6
33 7 3584416 hde7
33 8 4608040 hde8
33 9 5120104 hde9
33 10 1024096 hde10
33 11 34228120 hde11
22 0 60051600 hdc
22 1 30208 hdc1
22 2 8192016 hdc2
22 3 660744 hdc3
22 4 1 hdc4
22 5 1570936 hdc5
22 6 1032664 hdc6
22 7 3584416 hdc7
22 8 4608040 hdc8
22 9 5120104 hdc9
22 10 1024096 hdc10
22 11 34228120 hdc11
22 64 80418240 hdd
22 65 80418208 hdd1
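
The major and minor numbers in this listing encode the IDE channel and the drive position. As a rough sketch (the helper name `describe_ide` is made up for illustration), the following decodes them using the standard Linux device numbers for old-style IDE disks: major 3 = ide0, 22 = ide1, 33 = ide2, 34 = ide3, with 64 minors per drive.

```python
# Sketch only: decode old-style IDE (hdX) device numbers as they appear in
# /proc/partitions. The major-to-channel table follows the standard Linux
# device number assignment; describe_ide is a hypothetical helper name.
IDE_MAJORS = {3: 0, 22: 1, 33: 2, 34: 3}


def describe_ide(major, minor):
    """Return (channel, drive name, partition number) for an IDE device."""
    channel = IDE_MAJORS[major]
    # Each drive owns 64 minors: unit 0 is the master, unit 1 the slave.
    unit, partition = divmod(minor, 64)
    letter = chr(ord("a") + 2 * channel + unit)
    return channel, "hd" + letter, partition


if __name__ == "__main__":
    # A few rows taken from the listing above.
    for major, minor in [(33, 0), (22, 0), (22, 64), (22, 65)]:
        channel, drive, part = describe_ide(major, minor)
        role = "master" if minor < 64 else "slave"
        suffix = str(part) if part else ""
        print(f"{major:3d} {minor:3d} -> ide{channel} {role}: {drive}{suffix}")
```

Read this way, the rows with major 22 sit on ide1 (hdc as master, hdd as slave) and the rows with major 33 sit on ide2 (hde as master).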

interrupts
-----------------------------------------------------------

CPU0 CPU1
0: 147718 151530 IO-APIC-edge timer
1: 2646 2683 IO-APIC-edge keyboard
2: 0 0 XT-PIC cascade
4: 26150 26083 IO-APIC-edge serial
5: 1662 1359 IO-APIC-edge SoundBlaster
11: 14748 14638 IO-APIC-level ide1, ide2, usb-uhci
12: 171 165 IO-APIC-level aic7xxx
14: 4 2 IO-APIC-edge ide0
15: 5609 5383 IO-APIC-level eth0
NMI: 0 0
LOC: 299187 299066
ERR: 0
MIS: 0
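
One detail worth pulling out of this table is which interrupt lines are shared by several devices. A minimal sketch, assuming the two-CPU layout shown above (the function name `shared_irqs` and the embedded sample are illustrative, not part of any tool):

```python
# Sketch only: list IRQ lines that are shared by more than one device,
# given text in the format of /proc/interrupts on a 2-CPU machine.
SAMPLE = """\
CPU0 CPU1
0: 147718 151530 IO-APIC-edge timer
11: 14748 14638 IO-APIC-level ide1, ide2, usb-uhci
12: 171 165 IO-APIC-level aic7xxx
14: 4 2 IO-APIC-edge ide0
15: 5609 5383 IO-APIC-level eth0
NMI: 0 0
"""


def shared_irqs(text, ncpus=2):
    """Map IRQ number -> device list for lines naming more than one device."""
    shared = {}
    for line in text.splitlines():
        head, sep, rest = line.partition(":")
        if not sep or not head.strip().isdigit():
            continue  # header line or NMI/LOC/ERR/MIS counters
        fields = rest.split()
        # Layout: one count per CPU, the controller type, then device names.
        devices = " ".join(fields[ncpus + 1:]).split(", ")
        if len(devices) > 1:
            shared[int(head)] = devices
    return shared


if __name__ == "__main__":
    for irq, devices in shared_irqs(SAMPLE).items():
        print(f"IRQ {irq} is shared by: {', '.join(devices)}")
```

In the table above, only IRQ 11 carries more than one device: both Promise channels (ide1, ide2) and usb-uhci.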


lspci -vvv
-----------------------------------------------------------

00:00.0 Host bridge: Intel Corp. 440BX/ZX/DX - 82443BX/ZX/DX Host bridge (rev 03)
	Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B-
	Status: Cap+ 66Mhz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort+ >SERR- <PERR-
	Latency: 64
	Region 0: Memory at e4000000 (32-bit, prefetchable) [size=64M]
	Capabilities: [a0] AGP version 1.0
		Status: RQ=31 SBA+ 64bit- FW- Rate=x1,x2
		Command: RQ=0 SBA- AGP- 64bit- FW- Rate=<none>

00:01.0 PCI bridge: Intel Corp. 440BX/ZX/DX - 82443BX/ZX/DX AGP bridge (rev 03) (prog-if 00 [Normal decode])
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR+ FastB2B-
	Status: Cap- 66Mhz+ UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
	Latency: 64
	Bus: primary=00, secondary=01, subordinate=01, sec-latency=64
	I/O behind bridge: 0000e000-0000dfff
	Memory behind bridge: e0000000-e1dfffff
	Prefetchable memory behind bridge: e1f00000-e3ffffff
	BridgeCtl: Parity- SERR- NoISA- VGA+ MAbort- >Reset- FastB2B+

00:04.0 ISA bridge: Intel Corp. 82371AB/EB/MB PIIX4 ISA (rev 02)
	Control: I/O+ Mem+ BusMaster+ SpecCycle+ MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
	Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
	Latency: 0

00:04.1 IDE interface: Intel Corp. 82371AB/EB/MB PIIX4 IDE (rev 01) (prog-if 80 [Master])
	Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
	Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
	Latency: 0
	Region 4: I/O ports at d800 [size=16]

00:04.2 USB Controller: Intel Corp. 82371AB/EB/MB PIIX4 USB (rev 01) (prog-if 00 [UHCI])
	Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
	Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
	Latency: 64
	Interrupt: pin D routed to IRQ 11
	Region 4: I/O ports at d400 [size=32]

00:04.3 Bridge: Intel Corp. 82371AB/EB/MB PIIX4 ACPI (rev 02)
	Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
	Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
	Interrupt: pin ? routed to IRQ 9

00:09.0 Unknown mass storage controller: Promise Technology, Inc. 20267 (rev 02)
	Subsystem: Promise Technology, Inc. Ultra100
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
	Status: Cap+ 66Mhz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
	Latency: 0
	Interrupt: pin A routed to IRQ 11
	Region 0: I/O ports at d000 [size=8]
	Region 1: I/O ports at b800 [size=4]
	Region 2: I/O ports at b400 [size=8]
	Region 3: I/O ports at b000 [size=4]
	Region 4: I/O ports at a800 [size=64]
	Region 5: Memory at df800000 (32-bit, non-prefetchable) [size=128K]
	Expansion ROM at <unassigned> [disabled] [size=64K]
	Capabilities: [58] Power Management version 1
		Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
		Status: D0 PME-Enable- DSel=0 DScale=0 PME-

00:0a.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8139 (rev 10)
	Subsystem: Realtek Semiconductor Co., Ltd. RT8139
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
	Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
	Latency: 64 (8000ns min, 16000ns max)
	Interrupt: pin A routed to IRQ 15
	Region 0: I/O ports at a400 [size=256]
	Region 1: Memory at df000000 (32-bit, non-prefetchable) [size=256]

00:0b.0 SCSI storage controller: Adaptec AHA-2940U/UW/D / AIC-7881U (rev 01)
	Subsystem: Adaptec AHA-2940UW SCSI Host Adapter
	Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR- FastB2B-
	Status: Cap+ 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
	Latency: 64 (2000ns min, 2000ns max), cache line size 08
	Interrupt: pin A routed to IRQ 12
	Region 0: I/O ports at a000 [disabled] [size=256]
	Region 1: Memory at de800000 (32-bit, non-prefetchable) [size=4K]
	Expansion ROM at <unassigned> [disabled] [size=64K]
	Capabilities: [dc] Power Management version 1
		Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
		Status: D0 PME-Enable- DSel=0 DScale=0 PME-

01:00.0 VGA compatible controller: nVidia Corporation Vanta [NV6] (rev 15) (prog-if 00 [VGA])
	Subsystem: Elsa AG: Unknown device 0c3a
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
	Status: Cap+ 66Mhz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
	Latency: 64 (1250ns min, 250ns max)
	Interrupt: pin A routed to IRQ 10
	Region 0: Memory at e0000000 (32-bit, non-prefetchable) [size=16M]
	Region 1: Memory at e2000000 (32-bit, prefetchable) [size=32M]
	Expansion ROM at e1ff0000 [disabled] [size=64K]
	Capabilities: [60] Power Management version 1
		Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
		Status: D0 PME-Enable- DSel=0 DScale=0 PME-
	Capabilities: [44] AGP version 2.0
		Status: RQ=31 SBA- 64bit- FW- Rate=x1,x2
		Command: RQ=0 SBA- AGP- 64bit- FW- Rate=<none>


module-list, all compiled for the current kernel release,
no high memory support
---------------------------------------------------------------
snd-pcm-oss 35364 0 (autoclean)
snd-mixer-oss 9312 1 (autoclean)
snd-seq-midi 3168 0 (autoclean) (unused)
snd-emu8000-synth 4804 0 (autoclean)
snd-emux-synth 24064 0 (autoclean) [snd-emu8000-synth]
snd-seq-virmidi 2664 0 (autoclean) [snd-emux-synth]
snd-util-mem 1088 0 (autoclean) [snd-emu8000-synth
snd-emux-synth]
snd-seq-oss 24896 0 (autoclean)
snd-seq-midi-event 2744 0 (autoclean) [snd-seq-midi snd-seq-virmidi
snd-seq-oss]
snd-opl3-synth 8732 0
snd-seq-instr 4608 0 [snd-opl3-synth]
snd-seq-midi-emul 4192 0 [snd-emux-synth snd-opl3-synth]
snd-seq 39500 2 [snd-seq-midi snd-emux-synth snd-seq-virmidi
snd-seq-oss snd-seq-midi-event snd-opl3-synth snd-seq-instr
snd-seq-midi-emul]
snd-ainstr-fm 1460 0 [snd-opl3-synth]
snd-sbawe 16096 1 [snd-emu8000-synth]
snd-sb16-dsp 5888 0 [snd-sbawe]
snd-pcm 46816 0 [snd-pcm-oss snd-sb16-dsp]
snd-sb16-csp 15776 0 [snd-sbawe]
snd-sb-common 6248 0 [snd-sbawe snd-sb16-dsp snd-sb16-csp]
snd-opl3-lib 5408 0 [snd-opl3-synth snd-sbawe]
snd-hwdep 3648 0 [snd-sb16-csp snd-opl3-lib]
snd-timer 10496 0 [snd-seq snd-pcm snd-opl3-lib]
snd-mpu401-uart 2864 0 [snd-sbawe snd-sb16-dsp]
snd-rawmidi 12288 0 [snd-seq-midi snd-seq-virmidi
snd-mpu401-uart]
snd-seq-device 3952 0 [snd-seq-midi snd-emu8000-synth
snd-emux-synth snd-seq-oss snd-opl3-synth snd-seq snd-sbawe snd-opl3-lib
snd-rawmidi]
snd 25224 0 [snd-pcm-oss snd-mixer-oss snd-seq-midi
snd-emu8000-synth snd-emux-synth snd-seq-virmidi snd-util-mem snd-seq-oss
snd-seq-midi-event snd-opl3-synth snd-seq-instr snd-seq snd-sbawe
snd-sb16-dsp snd-pcm snd-sb16-csp snd-sb-common snd-opl3-lib snd-hwdep
snd-timer snd-mpu401-uart snd-rawmidi snd-seq-device]
soundcore 3396 10 [snd]
isa-pnp 27580 0 [snd-sbawe]
sr_mod 12888 0 (autoclean)
sg 25156 0 (autoclean)
uhci 24968 0 (unused)
usbcore 55296 1 [uhci]
8139too 13984 1
reiserfs 157504 1 (autoclean)


2002-08-26 12:40:47

by Alan

Subject: Re: System freeze on 2.4.18 / 19 SMP

On Mon, 2002-08-26 at 13:36, Holger Grosenick wrote:

> module-list, all compiled for the current kernel release,
> no high memory support

The 2.4.19 kernel doesn't have any snd-* modules. This appears to be a
dump from something else - ALSA patches?

> ---------------------------------------------------------------
> snd-pcm-oss 35364 0 (autoclean)
> snd-mixer-oss 9312 1 (autoclean)

I don't think ALSA is likely to be the cause however. Does it behave
stably running a non SMP kernel ?

2002-08-26 15:50:55

by h.grosenick

Subject: Re: System freeze on 2.4.18 / 19 SMP


> I don't think ALSA is likely to be the cause however. Does it behave
> stably running a non SMP kernel ?

Yes, ALSA is running too - I'll deactivate it next time.

I just tried to run the installer on a PII 400 MHz with SuSE kernel 2.4.18-4GB
and only 128 MB of physical RAM / 380 MB swap (short test, I will recompile
2.4.19 later):

- the installer itself (a Java app) crashed with a segmentation fault
- afterwards the system is unstable; starting the installer again leads to
kernel messages as follows, and even "umount" gets a segmentation fault.


Aug 26 17:32:31 holger kernel: Unable to handle kernel NULL pointer
dereference at virtual address 00000004
Aug 26 17:32:31 holger kernel: printing eip:
Aug 26 17:32:31 holger kernel: c0145ab6
Aug 26 17:32:31 holger kernel: *pde = 00000000
Aug 26 17:32:31 holger kernel: Oops: 0002
Aug 26 17:32:31 holger kernel: CPU: 0
Aug 26 17:32:31 holger kernel: EIP: 0010:[iput+302/464] Not tainted
Aug 26 17:32:31 holger kernel: EFLAGS: 00010246
Aug 26 17:32:31 holger kernel: eax: 00000000 ebx: c6c407e0 ecx: 00000000
edx: c6c407e8
Aug 26 17:32:31 holger kernel: esi: c13a5c00 edi: c16250c0 ebp: 000001d0
esp: c7fe3f10
Aug 26 17:32:31 holger kernel: ds: 0018 es: 0018 ss: 0018
Aug 26 17:32:31 holger kernel: Process kswapd (pid: 5, stackpage=c7fe3000)
Aug 26 17:32:31 holger kernel: Stack: c7903da0 c6c407e0 000005aa c01436c6
c6c407e0 c7903db8 c7903da0 c01439d4
Aug 26 17:32:31 holger kernel: c7903da0 00000000 c114de40 0000001e
c0143c8b 00000689 c012c586 00000006
Aug 26 17:32:31 holger kernel: 000001d0 00000020 c02a9510 000001d0
c02a9510 c7fe2000 ffffffff 0000c40f
Aug 26 17:32:32 holger kernel: Call Trace: [dentry_iput+70/116]
[prune_dcache+148/280] [shrink_dcache_memory+27/52] [shrink_cache+606/816]
[shrink_caches+93/104]
Aug 26 17:32:32 holger kernel: [try_to_free_pages+92/240]
[kswapd_balance_pgdat+85/164] [kswapd_balance+22/44] [kswapd+155/196]
[kernel_thread+40/56]
Aug 26 17:32:32 holger kernel:
Aug 26 17:32:32 holger kernel: Code: 89 48 04 89 01 a1 64 a3 2a c0 89 50 04 89
43 08 c7 42 04 64

-------------------------------------------------------------------------

The time before, I started the installer and it failed; then I tried to read
an "rpm" from an NFS share and got:


Aug 26 15:24:18 holger kernel: Unable to handle kernel NULL pointer
dereference at virtual address 00000028
Aug 26 15:24:18 holger kernel: printing eip:
Aug 26 15:24:18 holger kernel: c01453dc
Aug 26 15:24:18 holger kernel: *pde = 00000000
Aug 26 15:24:18 holger kernel: Oops: 0000
Aug 26 15:24:18 holger kernel: CPU: 0
Aug 26 15:24:18 holger kernel: EIP: 0010:[find_inode+32/76] Not tainted
Aug 26 15:24:18 holger kernel: EFLAGS: 00010203
Aug 26 15:24:18 holger kernel: eax: c25d1e80 ebx: 00000000 ecx: 0000000d
edx: c1607ec4
Aug 26 15:24:18 holger kernel: esi: 00000000 edi: c1607ec4 ebp: 000042e8
esp: c25d1e2c
Aug 26 15:24:18 holger kernel: ds: 0018 es: 0018 ss: 0018
Aug 26 15:24:18 holger kernel: Process rpm (pid: 1368, stackpage=c25d1000)
Aug 26 15:24:18 holger kernel: Stack: 00000000 c1212cc0 000042e8 c13a4c00
c0145860 c13a4c00 000042e8 c1212cc0
Aug 26 15:24:18 holger kernel: c1607ec4 c25d1e80 00000000 c25d1f04
c25d1ea0 c0803640 c1607f07 c13a4c00
Aug 26 15:24:18 holger kernel: 000042e8 c1607ec4 c25d1e80 00000000
00000001 000042df c1604228 c13a4c00
Aug 26 15:24:18 holger kernel: Call Trace: [iget4+64/272]
[nfsd:__insmod_nfsd_O/lib/modules/2.4.18-4GB/kernel/fs/nfsd/nfsd.+-51020092/96]
[nfsd:__insmod_nfsd_O/lib/modules/2.4.18-4GB/kernel/fs/nfsd/nfsd.+-51020025/96]
[nfsd:__insmod_nfsd_O/lib/modules/2.4.18-4GB/kernel/fs/nfsd/nfsd.+-51020092/96]
[nfsd:__insmod_nfsd_O/lib/modules/2.4.18-4GB/kernel/fs/nfsd/nfsd.+-51035608/96]
Aug 26 15:24:18 holger kernel: [real_lookup+83/196]
[link_path_walk+1270/1908] [path_walk+26/28] [__user_walk+53/80]
[sys_stat64+25/120] [system_call+51/64]
Aug 26 15:24:18 holger kernel:
Aug 26 15:24:18 holger kernel: Code: 39 6e 28 75 ef 8b 44 24 14 39 86 9c 00 00
00 75 e3 85 ff 74

Aug 26 15:25:13 holger kernel: <1>Unable to handle kernel NULL pointer
dereference at virtual address 00000028
Aug 26 15:25:13 holger kernel: printing eip:
Aug 26 15:25:13 holger kernel: c01453dc
Aug 26 15:25:13 holger kernel: *pde = 00000000
Aug 26 15:25:13 holger kernel: Oops: 0000
Aug 26 15:25:13 holger kernel: CPU: 0
Aug 26 15:25:13 holger kernel: EIP: 0010:[find_inode+32/76] Not tainted
Aug 26 15:25:13 holger kernel: EFLAGS: 00010203
Aug 26 15:25:13 holger kernel: eax: c25d1e80 ebx: 00000000 ecx: 0000000d
edx: c1607ec4
Aug 26 15:25:13 holger kernel: esi: 00000000 edi: c1607ec4 ebp: 000062e7
esp: c25d1e2c
Aug 26 15:25:13 holger kernel: ds: 0018 es: 0018 ss: 0018
Aug 26 15:25:13 holger kernel: Process rpm (pid: 1373, stackpage=c25d1000)
Aug 26 15:25:13 holger kernel: Stack: 00000000 c1212cc0 000062e7 c13a4c00
c0145860 c13a4c00 000062e7 c1212cc0
Aug 26 15:25:13 holger kernel: c1607ec4 c25d1e80 00000000 c25d1f04
c25d1ea0 c2c22320 c1607f07 c13a4c00
Aug 26 15:25:13 holger kernel: 000062e7 c1607ec4 c25d1e80 00000000
00000001 000062dd c1604228 c13a4c00
Aug 26 15:25:13 holger kernel: Call Trace: [iget4+64/272]
[nfsd:__insmod_nfsd_O/lib/modules/2.4.18-4GB/kernel/fs/nfsd/nfsd.+-51020092/96]
[nfsd:__insmod_nfsd_O/lib/modules/2.4.18-4GB/kernel/fs/nfsd/nfsd.+-51020025/96]
[nfsd:__insmod_nfsd_O/lib/modules/2.4.18-4GB/kernel/fs/nfsd/nfsd.+-51020092/96]
[nfsd:__insmod_nfsd_O/lib/modules/2.4.18-4GB/kernel/fs/nfsd/nfsd.+-51035608/96]
Aug 26 15:25:13 holger kernel: [real_lookup+83/196]
[link_path_walk+1270/1908] [path_walk+26/28] [__user_walk+53/80]
[sys_lstat64+25/112] [system_call+51/64]
Aug 26 15:25:13 holger kernel:
Aug 26 15:25:49 holger kernel: <1>Unable to handle kernel NULL pointer
dereference at virtual address 00000004
Aug 26 15:25:49 holger kernel: printing eip:
Aug 26 15:25:49 holger kernel: c0144751
Aug 26 15:25:49 holger kernel: *pde = 00000000
Aug 26 15:25:49 holger kernel: Oops: 0002
Aug 26 15:25:49 holger kernel: CPU: 0
Aug 26 15:25:49 holger kernel: EIP: 0010:[__mark_inode_dirty+93/120] Not
tainted
Aug 26 15:25:49 holger kernel: EFLAGS: 00010203
Aug 26 15:25:49 holger kernel: eax: 00000000 ebx: c31137e0 ecx: 00000000
edx: c31137e8
Aug 26 15:25:49 holger kernel: esi: c13a4c00 edi: 00000001 ebp: c31137e0
esp: c5fa1f30
Aug 26 15:25:49 holger kernel: ds: 0018 es: 0018 ss: 0018
Aug 26 15:25:49 holger kernel: Process ldconfig (pid: 1379,
stackpage=c5fa1000)
Aug 26 15:25:49 holger kernel: Stack: c5fa0000 c3148ec0 c5fa1fa4 c0145cce
c31137e0 00000001 c013c410 c31137e0
Aug 26 15:25:49 holger kernel: c77c4000 00000000 c5fa1fa4 00000009
00000001 00000009 00000000 c77c4019
Aug 26 15:25:49 holger kernel: c3148ec0 c77c4009 00000010 7f180bac
c013c596 c013ca15 c5fa0000 c5fa1fa4
Aug 26 15:25:49 holger kernel: Call Trace: [update_atime+78/84]
[link_path_walk+1544/1908] [path_walk+26/28] [__user_walk+53/80]
[sys_stat64+25/120]
Aug 26 15:25:49 holger kernel: [sys_close+67/84] [system_call+51/64]
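
For readers unfamiliar with the "Oops: NNNN" field in the traces above: on i386 it is the page-fault error code, and its low three bits say whether the page was present, whether the access was a write, and whether the CPU was in user mode. The sketch below decodes the two values seen in this thread; the function name `decode_oops` is made up for illustration.

```python
# Sketch only: decode the low three bits of the i386 page-fault error code
# printed as "Oops: NNNN". decode_oops is a hypothetical helper name.
def decode_oops(code):
    """Return a dict describing the fault cause, access type and CPU mode."""
    return {
        # bit 0: 0 = page not present, 1 = protection violation
        "cause": "protection violation" if code & 1 else "page not present",
        # bit 1: 0 = read, 1 = write
        "access": "write" if code & 2 else "read",
        # bit 2: 0 = kernel mode, 1 = user mode
        "mode": "user" if code & 4 else "kernel",
    }


if __name__ == "__main__":
    # The two codes that appear in the log excerpts above.
    for raw in ("0000", "0002"):
        info = decode_oops(int(raw, 16))
        print(f"Oops: {raw} -> {info['mode']}-mode {info['access']}, "
              f"{info['cause']}")
```

Both codes describe kernel-mode accesses to a page that is not mapped (a read for 0000, a write for 0002). The fault addresses 00000004 and 00000028 are small offsets from a NULL pointer, which matches the "NULL pointer dereference" message.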



I hope that I don't destroy my installation with that ...

2002-08-26 18:11:48

by Andre Hedrick

Subject: Re: System freeze on 2.4.18 / 19 SMP

On Mon, 26 Aug 2002, Holger Grosenick wrote:

> Hello
>
> i have a reproducible system freeze using SuSE 8.0 with original kernel 2.4.19
> (same with SuSE 2.4.18 kernel).
>
> Hardware:
>
> Asus P2B-Dual with 2x PIII 700 MHz (Bios 1013 - current)
> 896 MB RAM
> Promise PDC20267 off board IDE Controller (current bios release)
> aic7880 scsi-controller for CD-writers
> nvidia graphic card
> RTL-8139A based network card
>
> /dev/hda: _NEC DV-5700B DVD-ROM on piix onboard controller, ide0
> /dev/hdc: IBM IC35L060AVVA07-0 on PDC20267 ide1
> /dev/hdd: IBM IC35L080AVVA07-0 on PDC20267 ide1
> /dev/hde: IBM IC35L060AVVA07-0 on PDC20267 ide2

OH MY!

The channels got decoupled! This is very bad.

> /dev/hda: _NEC DV-5700B DVD-ROM on piix onboard controller, ide0

> /dev/hde: IBM IC35L060AVVA07-0 on PDC20267 ide1
> /dev/hdf: IBM IC35L080AVVA07-0 on PDC20267 ide1
> /dev/hdg: IBM IC35L060AVVA07-0 on PDC20267 ide2

That is what it should be.

Andre Hedrick
LAD Storage Consulting Group

2002-08-28 05:12:41

by h.grosenick

Subject: Re: System freeze on 2.4.18 / 19 SMP


Now I have changed the CMOS settings and have

/dev/hda ide0 DVD
/dev/hde ide2 60 GB
/dev/hdf ide2 80 GB (only 1 large partition)
/dev/hdg ide3 60 GB

but that didn't change the behaviour. After I mount /dev/hdf the system gets
unstable: a normal "df" receives a "segmentation fault".

Might there be a problem with large disks? My system is two years old and I
have had these problems since I had to replace 2x30 GB with 2x60 GB, because
one of the hard discs crashed.

I mount /dev/hdf only when I want to make a backup, so normally the disc is
switched off (mobile rack) and then the system is quite stable. It might
still freeze, but that can't be reproduced.