Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S261488AbVC0UHH (ORCPT ); Sun, 27 Mar 2005 15:07:07 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S261540AbVC0UHG (ORCPT ); Sun, 27 Mar 2005 15:07:06 -0500 Received: from e6.ny.us.ibm.com ([32.97.182.146]:5305 "EHLO e6.ny.us.ibm.com") by vger.kernel.org with ESMTP id S261488AbVC0UG4 (ORCPT ); Sun, 27 Mar 2005 15:06:56 -0500 Message-ID: <4247125C.2050802@us.ibm.com> Date: Sun, 27 Mar 2005 12:06:52 -0800 From: Badari Pulavarty User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.7.2) Gecko/20040804 Netscape/7.2 (ax) X-Accept-Language: en-us, en MIME-Version: 1.0 To: cmm@us.ibm.com CC: Andrew Morton , andrea@suse.de, mjbligh@us.ibm.com, Linux Kernel Mailing List , ext2-devel Subject: Re: [Ext2-devel] Re: OOM problems on 2.6.12-rc1 with many fsx tests References: <20050315204413.GF20253@csail.mit.edu> <20050316003134.GY7699@opteron.random> <20050316040435.39533675.akpm@osdl.org> <20050316183701.GB21597@opteron.random> <1111607584.5786.55.camel@localhost.localdomain> <20050325135630.28cd492c.akpm@osdl.org> <1111788665.21169.54.camel@dyn318077bld.beaverton.ibm.com> <1111883038.3633.9.camel@dyn318043bld.beaverton.ibm.com> <1111951352.4313.4.camel@localhost.localdomain> In-Reply-To: <1111951352.4313.4.camel@localhost.localdomain> Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2144 Lines: 58 Mingming Cao wrote: > On Sat, 2005-03-26 at 16:23 -0800, Mingming Cao wrote: > >>On Fri, 2005-03-25 at 14:11 -0800, Badari Pulavarty wrote: >> >>>On Fri, 2005-03-25 at 13:56, Andrew Morton wrote: >>> >>>>Mingming Cao wrote: >>>> >>>>>I run into OOM problem again on 2.6.12-rc1. I run some(20) fsx tests on >>>>>2.6.12-rc1 kernel(and 2.6.11-mm4) on ext3 filesystem, after about 10 >>>>>hours the system hit OOM, and OOM keep killing processes one by one. I >>>>>could reproduce this problem very constantly on a 2 way PIII 700MHZ with >>>>>512MB RAM. Also the problem could be reproduced on running the same test >>>>>on reiser fs. >>>>> >>>>>The fsx command is: >>>>> >>>>>./fsx -c 10 -n -r 4096 -w 4096 /mnt/test/foo1 & >>>> >>>>I was able to reproduce this on ext3. Seven instances of the above leaked >>>>10-15MB over 10 hours. All of it permanently stuck on the LRU. >>>> >>>>I'll continue to poke at it - see what kernel it started with, which >>>>filesystems it affects, whether it happens on UP&&!PREEMPT, etc. Not a >>>>quick process. >>> >>>I reproduced *similar* issue with 2.6.11. The reason I say similar, is >>>there is no OOM kill, but very low free memory and machine doesn't >>>respond at all. (I booted my machine with 256M memory and ran 20 copies >>>of fsx on ext3). >>> >>> >> >>Yes, I re-run the same test on 2.6.11 for 24 hours, like Badari see on >>his machine, my machine did not go to OOM on 2.6.11,still alive, but >>memory is very low(only 5M free). Killed all fsx and umount the ext3 >>filesystem did not bring back much memory. I will going to rerun the >>tests without the mapped read/write to see what happen. >> >> > > > Run fsx tests without mapped IO on 2.6.11 seems fine. Here is > the /proc/meminfo after 18 hours run: Mingming, Reproduce it on 2.6.11 with mapped IO tests. That will tell us when the regression started. Thanks, Badari - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/