2003-08-28 09:00:07

by Nico Schottelius

[permalink] [raw]
Subject: Re: Crazy load average & unkillable processes

Very interesting..
with the test4 I experiene the same/similar problems on my laptop..
all of sudden yesterday several programs died -> Out of Memory.
I ran
Xfree
dhcpcd
opera
several xterms (about 6)
qmail
named

first opera was Out of Memory, then died the whole X system with all
xterms and X beeing Out of Memory.

MemTotal: 385600 kB

which should be more than enough!

Nico


Ross Clarke [Thu, Aug 28, 2003 at 12:41:30AM +0200]:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> Brandon wrote:
>
> |Hi Everyone,
> |
> | I'm having some bothersome problems with a couple servers of
> |mine. I'm hoping some of you have some advice on how to trouble shoot
> |this, because my little brain is running out of ideas.
> |
> |All the servers are running Redhat 7.3, 2.4.20-19smp kernels,
> |apache-1.3.27, and Soft Raid-1.
> |
> |Here is what is happening, all of a sudden the server load average
> |climbs real high. It climbs to 100+ within a few minutes, then
> |constantly grows after that. The last server that had this happen was
> |at 375 avg when I rebooted it, which always needs to be a hard reboot -
> |because the shutdown -r now command doesn't do anything.
> |
> |While this is happening, I can not run commands like 'ps fax', 'pstree',
> |'top', 'killall' etc without them hanging . Most other commands work. I
> |can SSH to the server no problem. If I do a 'ps ax' I can see a list of
> |processes, but it always hangs before displaying them all. I narrowed it
> |down to anything that needs a full process list hangs.
> |
> |I wrote a script that runs 'ls -la /proc/$P', and 'cat /proc/$P/cmdline'
> |on each process in /proc.
> |
> |What I found is the processes that hang ps and whatnot are all owned by
> |apache. The script hangs on the ls -la /proc/$P whenever it hits an
> |apache process. The processes it hangs on can not be killed with kill
> |-9. The number of apache owned processes was at 250, while on a regular
> |server it is only at 20 or so.
> |
> |Running sar -v shows the dentunusd grow huge at about the time of the
> |issues:
> |
> |04:30:00 PM dentunusd file-sz %file-sz inode-sz super-sz %super-sz
> |dquot-sz %dquot-sz rtsig-sz %rtsig-sz
> |05:30:00 PM 38823 25900 12.35 24755 0 0.00
> |0 0.00 7 0.68
> |05:40:00 PM 39757 25854 12.33 25054 0 0.00
> |0 0.00 7 0.68
> |05:50:00 PM 4294967057 23526 11.22 4303 0 0.00
> |0 0.00 18 1.76
> |
> |Also, the number of sockets grows by about 3X:
> |
> |4:30:00 PM totsck tcpsck udpsck rawsck ip-frag
> |04:40:00 PM 136 60 5 0 0
> |04:50:00 PM 112 35 5 0 0
> |05:00:00 PM 121 40 7 0 0
> |05:10:00 PM 126 44 5 0 0
> |05:20:00 PM 115 38 5 0 0
> |05:30:00 PM 119 36 8 0 0
> |05:40:00 PM 120 42 6 0 0
> |05:50:00 PM 526 236 5 0 1
> |06:00:00 PM 531 224 5 0 0
> |06:10:00 PM 535 224 5 0 0
> |
> |
> |That is just about all I have come up so far. If anyone has seen this,
> |or can recommend on what steps I should take next, I could certainly us
> |the advice.
> |
> |Thank you all
> |
> |Brandon Belshaw
> |
> |
> |
> |
> |
> |-
> |To unsubscribe from this list: send the line "unsubscribe linux-admin" in
> |the body of a message to [email protected]
> |More majordomo info at http://vger.kernel.org/majordomo-info.html
> |
>
> I just had the same similiar problem twice with 2.6.0-test4, I also used
> to experience it on 2.4.18. I managed to get ps to list tho, before all
> commands stopped working, and I noticed many of the proccesses went into
> D and Z states. I beleive they were getting stuck in the I/O subsystem,
> my other filesystems were still responding since my XMMS didnt die till
> it hit an mp3 on my main filesystem, which was about 30 minutes after
> the problem started. Any currently open application was still working,
> until I tried to do anything that required I/O, then they died aswell.
> That last happened to me about 12 hours ago, and I had to recover my
> entire /home directory. I couldnt find out what cuased it, the first
> time it was MozillaFirebird that died first, the 2nd time it was vim.
> Also both times I tried hitting the power button to see if I could get
> any form of shutdown where the data would sync, both times the kernel
> OOPS'ed on the apmd event.
>
> Anybody got any ideas?
>
> Regards,
> Ross Clarke
>
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.2.2 (GNU/Linux)
> Comment: Using GnuPG with Mozilla Thunderbird - http://enigmail.mozdev.org
>
> iD8DBQE/TTOa1+7fkD/L8TgRAkmdAJ9ciSYT6tAQGT0Uk+RD7Y8gkbmEIwCffLIT
> z2SGntQl8+1sI1QRVFZtxho=
> =utNU
> -----END PGP SIGNATURE-----
>
>
> -
> To unsubscribe from this list: send the line "unsubscribe linux-admin" in
> the body of a message to [email protected]
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>

--
quote: there are two time a day you should do nothing: before 12 and after 12
(Nico Schottelius after writin' a very senseless email)
cmd: echo God bless America | sed 's/.*\(A.*\)$/Why \1?/'
pgp: new id: 0x8D0E27A4 | ftp.schottelius.org/pub/familiy/nico/pgp-key.new
url: http://nerd-hosting.net - domains for nerds (from a nerd)


Attachments:
(No filename) (5.53 kB)
(No filename) (189.00 B)
Download all attachments

2003-08-28 10:08:54

by Nick Piggin

[permalink] [raw]
Subject: Re: Crazy load average & unkillable processes



Nico Schottelius wrote:

>Very interesting..
>with the test4 I experiene the same/similar problems on my laptop..
>all of sudden yesterday several programs died -> Out of Memory.
>I ran
> Xfree
> dhcpcd
> opera
> several xterms (about 6)
> qmail
> named
>
>first opera was Out of Memory, then died the whole X system with all
>xterms and X beeing Out of Memory.
>
>MemTotal: 385600 kB
>
>which should be more than enough!
>

You might have a process with a memory leak. How much free memory do
you have before everything dies? How much swapping activity is going
on? What do /proc/meminfo and /proc/slabinfo say?


2003-08-29 09:36:57

by Nico Schottelius

[permalink] [raw]
Subject: Re: Crazy load average & unkillable processes

I am attaching /proc/meminfo,slapinfo,uptime from now.
The system is f*** slow..
And I am currently just able to write this, moving windows
in X is more than painful!

Nico

Nick Piggin [Thu, Aug 28, 2003 at 07:33:25PM +1000]:
> Nico Schottelius wrote:
>
> >Very interesting..
> >with the test4 I experiene the same/similar problems on my laptop..
> >all of sudden yesterday several programs died -> Out of Memory.
> >I ran
> > Xfree
> > dhcpcd
> > opera
> > several xterms (about 6)
> > qmail
> > named
> >
> >first opera was Out of Memory, then died the whole X system with all
> >xterms and X beeing Out of Memory.
> >
> >MemTotal: 385600 kB
> >
> >which should be more than enough!
> >
>
> You might have a process with a memory leak. How much free memory do
> you have before everything dies? How much swapping activity is going
> on? What do /proc/meminfo and /proc/slabinfo say?
>
>
> -
> To unsubscribe from this list: send the line "unsubscribe linux-admin" in
> the body of a message to [email protected]
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>

--
quote: there are two time a day you should do nothing: before 12 and after 12
(Nico Schottelius after writin' a very senseless email)
cmd: echo God bless America | sed 's/.*\(A.*\)$/Why \1?/'
pgp: new id: 0x8D0E27A4 | ftp.schottelius.org/pub/familiy/nico/pgp-key.new
url: http://nerd-hosting.net - domains for nerds (from a nerd)


Attachments:
(No filename) (0.00 B)
(No filename) (189.00 B)
Download all attachments

2003-08-29 11:01:55

by Nico Schottelius

[permalink] [raw]
Subject: Re: Crazy load average & unkillable processes

Btw, the only thing I can see now is
nico@flapp:~/archiv $ dmesg
spurious 8259A interrupt: IRQ7.

not more..
and now I have to reboot as typing this mail is very hard as the system is
very slow. with no load! (i am even using the preempt patch..)

Nico

Nico Schottelius [Fri, Aug 29, 2003 at 11:01:29AM +0200]:
> I am attaching /proc/meminfo,slapinfo,uptime from now.
> The system is f*** slow..
> And I am currently just able to write this, moving windows
> in X is more than painful!
>
> Nico
>
> Nick Piggin [Thu, Aug 28, 2003 at 07:33:25PM +1000]:
> > Nico Schottelius wrote:
> >
> > >Very interesting..
> > >with the test4 I experiene the same/similar problems on my laptop..
> > >all of sudden yesterday several programs died -> Out of Memory.
> > >I ran
> > > Xfree
> > > dhcpcd
> > > opera
> > > several xterms (about 6)
> > > qmail
> > > named
> > >
> > >first opera was Out of Memory, then died the whole X system with all
> > >xterms and X beeing Out of Memory.
> > >
> > >MemTotal: 385600 kB
> > >
> > >which should be more than enough!
> > >
> >
> > You might have a process with a memory leak. How much free memory do
> > you have before everything dies? How much swapping activity is going
> > on? What do /proc/meminfo and /proc/slabinfo say?
> >
> >
> > -
> > To unsubscribe from this list: send the line "unsubscribe linux-admin" in
> > the body of a message to [email protected]
> > More majordomo info at http://vger.kernel.org/majordomo-info.html
> >
>
> --
> quote: there are two time a day you should do nothing: before 12 and after 12
> (Nico Schottelius after writin' a very senseless email)
> cmd: echo God bless America | sed 's/.*\(A.*\)$/Why \1?/'
> pgp: new id: 0x8D0E27A4 | ftp.schottelius.org/pub/familiy/nico/pgp-key.new
> url: http://nerd-hosting.net - domains for nerds (from a nerd)

> MemTotal: 385600 kB
> MemFree: 76904 kB
> Buffers: 16016 kB
> Cached: 227236 kB
> SwapCached: 0 kB
> Active: 76576 kB
> Inactive: 217944 kB
> HighTotal: 0 kB
> HighFree: 0 kB
> LowTotal: 385600 kB
> LowFree: 76904 kB
> SwapTotal: 0 kB
> SwapFree: 0 kB
> Dirty: 220 kB
> Writeback: 0 kB
> Mapped: 67192 kB
> Slab: 10628 kB
> Committed_AS: 64664 kB
> PageTables: 692 kB
> VmallocTotal: 647132 kB
> VmallocUsed: 1048 kB
> VmallocChunk: 645956 kB

> slabinfo - version: 2.0
> # name <active_objs> <num_objs> <objsize> <objperslab> <pagesperslab> : tunables <batchcount> <limit> <sharedfactor> : slabdata <active_slabs> <num_slabs> <sharedavail>
> fib6_nodes 5 113 32 113 1 : tunables 120 60 0 : slabdata 1 1 0
> ip6_dst_cache 5 17 224 17 1 : tunables 120 60 0 : slabdata 1 1 0
> ndisc_cache 1 24 160 24 1 : tunables 120 60 0 : slabdata 1 1 0
> raw6_sock 0 0 544 7 1 : tunables 54 27 0 : slabdata 0 0 0
> udp6_sock 1 7 544 7 1 : tunables 54 27 0 : slabdata 1 1 0
> tcp6_sock 11 12 960 4 1 : tunables 54 27 0 : slabdata 3 3 0
> ip_fib_hash 9 202 16 202 1 : tunables 120 60 0 : slabdata 1 1 0
> isofs_inode_cache 0 0 352 11 1 : tunables 54 27 0 : slabdata 0 0 0
> unix_sock 28 44 352 11 1 : tunables 54 27 0 : slabdata 4 4 0
> ip_mrt_cache 0 0 96 40 1 : tunables 120 60 0 : slabdata 0 0 0
> tcp_tw_bucket 1 30 128 30 1 : tunables 120 60 0 : slabdata 1 1 0
> tcp_bind_bucket 18 202 16 202 1 : tunables 120 60 0 : slabdata 1 1 0
> tcp_open_request 0 0 96 40 1 : tunables 120 60 0 : slabdata 0 0 0
> inet_peer_cache 1 59 64 59 1 : tunables 120 60 0 : slabdata 1 1 0
> secpath_cache 0 0 128 30 1 : tunables 120 60 0 : slabdata 0 0 0
> xfrm_dst_cache 0 0 288 13 1 : tunables 54 27 0 : slabdata 0 0 0
> ip_dst_cache 98 104 288 13 1 : tunables 54 27 0 : slabdata 8 8 0
> arp_cache 4 30 128 30 1 : tunables 120 60 0 : slabdata 1 1 0
> raw4_sock 0 0 416 9 1 : tunables 54 27 0 : slabdata 0 0 0
> udp_sock 3 9 416 9 1 : tunables 54 27 0 : slabdata 1 1 0
> tcp_sock 22 36 832 9 2 : tunables 54 27 0 : slabdata 4 4 0
> flow_cache 0 0 96 40 1 : tunables 120 60 0 : slabdata 0 0 0
> xfs_acl 0 0 304 13 1 : tunables 54 27 0 : slabdata 0 0 0
> xfs_chashlist 0 0 20 169 1 : tunables 120 60 0 : slabdata 0 0 0
> xfs_ili 0 0 140 28 1 : tunables 120 60 0 : slabdata 0 0 0
> xfs_ifork 0 0 56 67 1 : tunables 120 60 0 : slabdata 0 0 0
> xfs_efi_item 0 0 260 15 1 : tunables 54 27 0 : slabdata 0 0 0
> xfs_efd_item 0 0 260 15 1 : tunables 54 27 0 : slabdata 0 0 0
> xfs_buf_item 0 0 148 26 1 : tunables 120 60 0 : slabdata 0 0 0
> xfs_dabuf 0 0 16 202 1 : tunables 120 60 0 : slabdata 0 0 0
> xfs_da_state 0 0 336 11 1 : tunables 54 27 0 : slabdata 0 0 0
> xfs_trans 0 0 592 13 2 : tunables 54 27 0 : slabdata 0 0 0
> xfs_inode 0 0 368 10 1 : tunables 54 27 0 : slabdata 0 0 0
> xfs_btree_cur 0 0 132 29 1 : tunables 120 60 0 : slabdata 0 0 0
> xfs_bmap_free_item 0 0 12 253 1 : tunables 120 60 0 : slabdata 0 0 0
> page_buf_t 0 0 224 17 1 : tunables 120 60 0 : slabdata 0 0 0
> linvfs_icache 0 0 352 11 1 : tunables 54 27 0 : slabdata 0 0 0
> reiser_inode_cache 0 0 384 10 1 : tunables 54 27 0 : slabdata 0 0 0
> devfsd_event 0 0 20 169 1 : tunables 120 60 0 : slabdata 0 0 0
> ext2_inode_cache 0 0 480 8 1 : tunables 54 27 0 : slabdata 0 0 0
> ext2_xattr 0 0 48 78 1 : tunables 120 60 0 : slabdata 0 0 0
> journal_handle 8 126 28 126 1 : tunables 120 60 0 : slabdata 1 1 0
> journal_head 45 156 48 78 1 : tunables 120 60 0 : slabdata 2 2 0
> revoke_table 2 253 12 253 1 : tunables 120 60 0 : slabdata 1 1 0
> revoke_record 0 0 16 202 1 : tunables 120 60 0 : slabdata 0 0 0
> ext3_inode_cache 5656 5656 480 8 1 : tunables 54 27 0 : slabdata 707 707 0
> ext3_xattr 0 0 48 78 1 : tunables 120 60 0 : slabdata 0 0 0
> dquot 0 0 128 30 1 : tunables 120 60 0 : slabdata 0 0 0
> eventpoll_pwq 0 0 36 101 1 : tunables 120 60 0 : slabdata 0 0 0
> eventpoll_epi 0 0 96 40 1 : tunables 120 60 0 : slabdata 0 0 0
> kioctx 0 0 160 24 1 : tunables 120 60 0 : slabdata 0 0 0
> kiocb 0 0 160 24 1 : tunables 120 60 0 : slabdata 0 0 0
> dnotify_cache 0 0 20 169 1 : tunables 120 60 0 : slabdata 0 0 0
> file_lock_cache 18 42 92 42 1 : tunables 120 60 0 : slabdata 1 1 0
> fasync_cache 1 202 16 202 1 : tunables 120 60 0 : slabdata 1 1 0
> shmem_inode_cache 4 9 416 9 1 : tunables 54 27 0 : slabdata 1 1 0
> idr_layer_cache 0 0 136 28 1 : tunables 120 60 0 : slabdata 0 0 0
> posix_timers_cache 0 0 80 48 1 : tunables 120 60 0 : slabdata 0 0 0
> uid_cache 5 113 32 113 1 : tunables 120 60 0 : slabdata 1 1 0
> deadline_drq 0 0 52 72 1 : tunables 120 60 0 : slabdata 0 0 0
> as_arq 59 59 64 59 1 : tunables 120 60 0 : slabdata 1 1 0
> blkdev_requests 48 48 160 24 1 : tunables 120 60 0 : slabdata 2 2 0
> biovec-BIO_MAX_PAGES 256 260 3072 5 4 : tunables 24 12 0 : slabdata 52 52 0
> biovec-128 256 260 1536 5 2 : tunables 24 12 0 : slabdata 52 52 0
> biovec-64 260 260 768 5 1 : tunables 54 27 0 : slabdata 52 52 0
> biovec-16 256 260 192 20 1 : tunables 120 60 0 : slabdata 13 13 0
> biovec-4 256 295 64 59 1 : tunables 120 60 0 : slabdata 5 5 0
> biovec-1 308 404 16 202 1 : tunables 120 60 0 : slabdata 2 2 0
> bio 317 354 64 59 1 : tunables 120 60 0 : slabdata 6 6 0
> sock_inode_cache 65 80 384 10 1 : tunables 54 27 0 : slabdata 8 8 0
> skbuff_head_cache 180 180 192 20 1 : tunables 120 60 0 : slabdata 9 9 0
> sock 3 12 320 12 1 : tunables 54 27 0 : slabdata 1 1 0
> proc_inode_cache 63 77 352 11 1 : tunables 54 27 0 : slabdata 7 7 0
> sigqueue 27 27 144 27 1 : tunables 120 60 0 : slabdata 1 1 0
> radix_tree_node 3045 3045 260 15 1 : tunables 54 27 0 : slabdata 203 203 0
> bdev_cache 2 40 96 40 1 : tunables 120 60 0 : slabdata 1 1 0
> mnt_cache 15 59 64 59 1 : tunables 120 60 0 : slabdata 1 1 0
> inode_cache 823 836 352 11 1 : tunables 54 27 0 : slabdata 76 76 0
> dentry_cache 7988 7992 160 24 1 : tunables 120 60 0 : slabdata 333 333 0
> filp 666 750 128 30 1 : tunables 120 60 0 : slabdata 25 25 0
> names_cache 1 1 4096 1 1 : tunables 24 12 0 : slabdata 1 1 0
> buffer_head 4870 4896 52 72 1 : tunables 120 60 0 : slabdata 68 68 0
> mm_struct 50 66 352 11 1 : tunables 54 27 0 : slabdata 6 6 0
> vm_area_struct 1236 1534 64 59 1 : tunables 120 60 0 : slabdata 26 26 0
> fs_cache 51 113 32 113 1 : tunables 120 60 0 : slabdata 1 1 0
> files_cache 49 63 416 9 1 : tunables 54 27 0 : slabdata 7 7 0
> signal_cache 72 118 64 59 1 : tunables 120 60 0 : slabdata 2 2 0
> sighand_cache 56 66 1312 3 1 : tunables 24 12 0 : slabdata 22 22 0
> task_struct 80 80 1536 5 2 : tunables 24 12 0 : slabdata 16 16 0
> pte_chain 7773 8814 32 113 1 : tunables 120 60 0 : slabdata 78 78 0
> pgd 50 50 4096 1 1 : tunables 24 12 0 : slabdata 50 50 0
> size-131072(DMA) 0 0 131072 1 32 : tunables 8 4 0 : slabdata 0 0 0
> size-131072 0 0 131072 1 32 : tunables 8 4 0 : slabdata 0 0 0
> size-65536(DMA) 0 0 65536 1 16 : tunables 8 4 0 : slabdata 0 0 0
> size-65536 0 0 65536 1 16 : tunables 8 4 0 : slabdata 0 0 0
> size-32768(DMA) 0 0 32768 1 8 : tunables 8 4 0 : slabdata 0 0 0
> size-32768 0 0 32768 1 8 : tunables 8 4 0 : slabdata 0 0 0
> size-16384(DMA) 0 0 16384 1 4 : tunables 8 4 0 : slabdata 0 0 0
> size-16384 0 0 16384 1 4 : tunables 8 4 0 : slabdata 0 0 0
> size-8192(DMA) 0 0 8192 1 2 : tunables 8 4 0 : slabdata 0 0 0
> size-8192 71 71 8192 1 2 : tunables 8 4 0 : slabdata 71 71 0
> size-4096(DMA) 0 0 4096 1 1 : tunables 24 12 0 : slabdata 0 0 0
> size-4096 98 98 4096 1 1 : tunables 24 12 0 : slabdata 98 98 0
> size-2048(DMA) 0 0 2048 2 1 : tunables 24 12 0 : slabdata 0 0 0
> size-2048 134 146 2048 2 1 : tunables 24 12 0 : slabdata 73 73 0
> size-1024(DMA) 0 0 1024 4 1 : tunables 54 27 0 : slabdata 0 0 0
> size-1024 72 80 1024 4 1 : tunables 54 27 0 : slabdata 20 20 0
> size-512(DMA) 0 0 512 8 1 : tunables 54 27 0 : slabdata 0 0 0
> size-512 169 184 512 8 1 : tunables 54 27 0 : slabdata 23 23 0
> size-256(DMA) 0 0 256 15 1 : tunables 120 60 0 : slabdata 0 0 0
> size-256 90 90 256 15 1 : tunables 120 60 0 : slabdata 6 6 0
> size-192(DMA) 0 0 192 20 1 : tunables 120 60 0 : slabdata 0 0 0
> size-192 120 120 192 20 1 : tunables 120 60 0 : slabdata 6 6 0
> size-128(DMA) 0 0 128 30 1 : tunables 120 60 0 : slabdata 0 0 0
> size-128 172 180 128 30 1 : tunables 120 60 0 : slabdata 6 6 0
> size-96(DMA) 0 0 96 40 1 : tunables 120 60 0 : slabdata 0 0 0
> size-96 1538 1560 96 40 1 : tunables 120 60 0 : slabdata 39 39 0
> size-64(DMA) 0 0 64 59 1 : tunables 120 60 0 : slabdata 0 0 0
> size-64 7375 7375 64 59 1 : tunables 120 60 0 : slabdata 125 125 0
> size-32(DMA) 0 0 32 113 1 : tunables 120 60 0 : slabdata 0 0 0
> size-32 1967 2147 32 113 1 : tunables 120 60 0 : slabdata 19 19 0
> kmem_cache 132 132 116 33 1 : tunables 120 60 0 : slabdata 4 4 0

> 11:00am up 2:37, 15 users, load average: 0.71, 0.42, 0.33




--
quote: there are two time a day you should do nothing: before 12 and after 12
(Nico Schottelius after writin' a very senseless email)
cmd: echo God bless America | sed 's/.*\(A.*\)$/Why \1?/'
pgp: new id: 0x8D0E27A4 | ftp.schottelius.org/pub/familiy/nico/pgp-key.new
url: http://nerd-hosting.net - domains for nerds (from a nerd)


Attachments:
(No filename) (16.05 kB)
(No filename) (189.00 B)
Download all attachments

2003-08-29 11:17:43

by Nick Piggin

[permalink] [raw]
Subject: Re: Crazy load average & unkillable processes

Looks like you still have quite a lot of free memory left, so its
not that. Maybe you have runaway processes? Look in top. Although
if its only happening with test4, I guess its probably kernel
related. Maybe ACPI? Maybe your video card driver? Try booting with
acpi=off. Post a dmesg too. Thanks.

Nico Schottelius wrote:

>I am attaching /proc/meminfo,slapinfo,uptime from now.
>The system is f*** slow..
>And I am currently just able to write this, moving windows
>in X is more than painful!
>
>Nico
>
>Nick Piggin [Thu, Aug 28, 2003 at 07:33:25PM +1000]:
>
>>Nico Schottelius wrote:
>>
>>
>>>Very interesting..
>>>with the test4 I experiene the same/similar problems on my laptop..
>>>all of sudden yesterday several programs died -> Out of Memory.
>>>I ran
>>> Xfree
>>> dhcpcd
>>> opera
>>> several xterms (about 6)
>>> qmail
>>> named
>>>
>>>first opera was Out of Memory, then died the whole X system with all
>>>xterms and X beeing Out of Memory.
>>>
>>>MemTotal: 385600 kB
>>>
>>>which should be more than enough!
>>>
>>>
>>You might have a process with a memory leak. How much free memory do
>>you have before everything dies? How much swapping activity is going
>>on? What do /proc/meminfo and /proc/slabinfo say?
>>
>>
>>
>

2003-09-01 13:58:43

by Bill Davidsen

[permalink] [raw]
Subject: Re: Crazy load average & unkillable processes

I have never tried running 2.6 without swap, are there tuning values you
need to avoid performance issues. You have adequate memory, have you
played with swappiness?

I'll try no swap on a machine when I get back from the weekend.

--
bill davidsen <[email protected]>
CTO, TMR Associates, Inc
Doing interesting things with little computers since 1979.

2003-09-15 23:37:39

by Nico Schottelius

[permalink] [raw]
Subject: Re: Crazy load average & unkillable processes

Hello!

Once again my system died.
first it gets slow.
then the cpu cooler starts to cool permanently.
then some processes die.
then the system becomes unusable.
i hit SAK, then sysrq+i. then reboot.
attached all available information.

anyone any idea?

Nico


Nick Piggin [Thu, Aug 28, 2003 at 07:33:25PM +1000]:
>
>
> Nico Schottelius wrote:
>
> >Very interesting..
> >with the test4 I experiene the same/similar problems on my laptop..
> >all of sudden yesterday several programs died -> Out of Memory.
> >I ran
> > Xfree
> > dhcpcd
> > opera
> > several xterms (about 6)
> > qmail
> > named
> >
> >first opera was Out of Memory, then died the whole X system with all
> >xterms and X beeing Out of Memory.
> >
> >MemTotal: 385600 kB
> >
> >which should be more than enough!
> >
>
> You might have a process with a memory leak. How much free memory do
> you have before everything dies? How much swapping activity is going
> on? What do /proc/meminfo and /proc/slabinfo say?
>
>
>

--
quote: there are two time a day you should do nothing: before 12 and after 12
(Nico Schottelius after writin' a very senseless email)
cmd: echo God bless America | sed 's/.*\(A.*\)$/Why \1?/'
pgp: new id: 0x8D0E27A4 | ftp.schottelius.org/pub/family/nico/pgp-key.new
url: http://nerd-hosting.net - domains for nerds (from a nerd)


Attachments:
(No filename) (0.00 B)
(No filename) (189.00 B)
Download all attachments