Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1758890Ab1DYTQP (ORCPT ); Mon, 25 Apr 2011 15:16:15 -0400 Received: from e1.ny.us.ibm.com ([32.97.182.141]:59810 "EHLO e1.ny.us.ibm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1758203Ab1DYTQN (ORCPT ); Mon, 25 Apr 2011 15:16:13 -0400 Date: Mon, 25 Apr 2011 12:16:07 -0700 From: "Paul E. McKenney" To: Bruno =?iso-8859-1?Q?Pr=E9mont?= Cc: Linus Torvalds , Mike Frysinger , KOSAKI Motohiro , linux-kernel@vger.kernel.org, linux-mm@kvack.org, linux-fsdevel@vger.kernel.org, "Paul E. McKenney" , Pekka Enberg Subject: Re: 2.6.39-rc4+: Kernel leaking memory during FS scanning, regression? Message-ID: <20110425191607.GL2468@linux.vnet.ibm.com> Reply-To: paulmck@linux.vnet.ibm.com References: <20110424235928.71af51e0@neptune.home> <20110425114429.266A.A69D9226@jp.fujitsu.com> <20110425111705.786ef0c5@neptune.home> <20110425180450.1ede0845@neptune.home> <20110425190032.7904c95d@neptune.home> <20110425203606.4e78246c@neptune.home> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20110425203606.4e78246c@neptune.home> User-Agent: Mutt/1.5.20 (2009-06-14) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2837 Lines: 66 On Mon, Apr 25, 2011 at 08:36:06PM +0200, Bruno Pr?mont wrote: > On Mon, 25 April 2011 Linus Torvalds wrote: > > On Mon, Apr 25, 2011 at 10:00 AM, Bruno Pr?mont wrote: > > > > > > I hope tiny-rcu is not that broken... as it would mean driving any > > > PREEMPT_NONE or PREEMPT_VOLUNTARY system out of memory when compiling > > > packages (and probably also just unpacking larger tarballs or running > > > things like du). > > > > I'm sure that TINYRCU can be fixed if it really is the problem. > > > > So I just want to make sure that we know what the root cause of your > > problem is. It's quite possible that it _is_ a real leak of filp or > > something, but before possibly wasting time trying to figure that out, > > let's see if your config is to blame. > > With changed config (PREEMPT=y, TREE_PREEMPT_RCU=y) I haven't reproduced > yet. > > When I was reproducing with TINYRCU things went normally for some time > until suddenly slabs stopped being freed. Hmmm... If the system is responsive during this time, could you please do the following after the slabs stop being freed? ps -eo pid,class,sched,rtprio,stat,state,sgi_p,cpu_time,cmd | grep '\[rcu' Thanx, Paul > > > And with system doing nothing (except monitoring itself) memory usage > > > goes increasing all the time until it starves (well it seems to keep > > > ~20M free, pushing processes it can to swap). Config is just being > > > make oldconfig from working 2.6.38 kernel (answering default for new > > > options) > > > > How sure are you that the system really is idle? Quite frankly, the > > constant growing doesn't really look idle to me. > > Except the SIGSTOPed build there is not much left, collectd running in > background (it polls /proc for process counts, fork rate, memory usage, > ... opening, reading, closing the files -- scanning every 10 seconds), > slabtop on one terminal. > > CPU activity was near-zero with 10%-20% spikes of system use every 10 > minutes and io-wait when all cache had been pushed out. > > > > Attached graph matching numbers of previous mail. (dropping caches was at > > > 17:55, system idle since then) > > > > Nothing at all going on in 'ps' during that time? And what does > > slabinfo say at that point now that kmemleak isn't dominating > > everything else? > > ps definitely does not show anything special, 30 or so userspace processes. > Didn't check ls /proc/*/fd though. Will do at next occurrence. > > > Going to test further with various PREEMPT and RCU selections. Will report > back as I progress (but won't have much time tomorrow). > > Bruno -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/