Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1760576Ab1D1Ptf (ORCPT ); Thu, 28 Apr 2011 11:49:35 -0400 Received: from smtp1.linux-foundation.org ([140.211.169.13]:55145 "EHLO smtp1.linux-foundation.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755488Ab1D1Pte (ORCPT ); Thu, 28 Apr 2011 11:49:34 -0400 MIME-Version: 1.0 In-Reply-To: References: <20110426112756.GF4308@linux.vnet.ibm.com> <20110426183859.6ff6279b@neptune.home> <20110426190918.01660ccf@neptune.home> <20110427081501.5ba28155@pluto.restena.lu> <20110427204139.1b0ea23b@neptune.home> <20110428102609.GJ2135@linux.vnet.ibm.com> <1303997401.7819.5.camel@marge.simson.net> From: Linus Torvalds Date: Thu, 28 Apr 2011 08:48:39 -0700 Message-ID: Subject: Re: 2.6.39-rc4+: Kernel leaking memory during FS scanning, regression? To: sedat.dilek@gmail.com Cc: Mike Galbraith , Thomas Gleixner , "Paul E. McKenney" , =?ISO-8859-1?Q?Bruno_Pr=E9mont?= , Ingo Molnar , Peter Zijlstra , Mike Frysinger , KOSAKI Motohiro , LKML , linux-mm@kvack.org, linux-fsdevel@vger.kernel.org, "Paul E. McKenney" , Pekka Enberg Content-Type: text/plain; charset=ISO-8859-1 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1843 Lines: 47 On Thu, Apr 28, 2011 at 8:28 AM, Sedat Dilek wrote: > > From the very beginning it looked as the system is "stable" due to: Actually, look here, right from the beginning that log is showing total breakage: Thu Apr 28 16:49:51 CEST 2011 .rt_time : 233.923773 Thu Apr 28 16:50:06 CEST 2011 .rt_time : 259.446506 Thu Apr 28 16:50:22 CEST 2011 .rt_time : 273.110840 Thu Apr 28 16:50:37 CEST 2011 .rt_time : 282.713537 Thu Apr 28 16:50:52 CEST 2011 .rt_time : 288.136013 Thu Apr 28 16:51:07 CEST 2011 .rt_time : 293.057088 .. Thu Apr 28 16:58:29 CEST 2011 .rt_time : 888.893877 Thu Apr 28 16:58:44 CEST 2011 .rt_time : 950.005460 iow, rt_time just constantly grows. You have that "sleep 15" between every log entry, so rt_time growing by 10-100 ms every 15 seconds obviously does mean that it's using real CPU time, but it's still well in the "much less than 1% CPU" range. So the rcu thread is clearly doing work, but equally clearly it should NOT be throttled. But since it is constantly growing, at some point it _will_ hit that magical "950ms total time used", and then it gets throttled. For no good reason. It shouldn't have been throttled in the first place, and then the other bug - that it isn't apparently ever unthrottled - just makes it not work at all. So that whole throttling is totally broken. Linus -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/