Date: Thu, 19 May 2011 11:30:59 +0900
From: KAMEZAWA Hiroyuki
To: Andrew Lutomirski
Cc: Minchan Kim, Wu Fengguang, Andi Kleen, "linux-mm@kvack.org", LKML, KOSAKI Motohiro, Mel Gorman, Johannes Weiner, Rik van Riel
Subject: Re: Kernel falls apart under light memory pressure (i.e. linking vmlinux)
Message-Id: <20110519113059.06d0e0d2.kamezawa.hiroyu@jp.fujitsu.com>
References: <20110512054631.GI6008@one.firstfloor.org> <20110514165346.GV6008@one.firstfloor.org> <20110514174333.GW6008@one.firstfloor.org> <20110515152747.GA25905@localhost> <20110517060001.GC24069@localhost>
Organization: FUJITSU Co. LTD.

On Wed, 18 May 2011 22:15:53 -0400
Andrew Lutomirski wrote:

> On Wed, May 18, 2011 at 1:17 AM, Minchan Kim wrote:
> > On Wed, May 18, 2011 at 4:22 AM, Andrew Lutomirski wrote:
> > Andrew, Could you test this patch with !pgdat_balanced patch?
> > I think we shouldn't see OOM message if we have lots of free swap space.
> >
> > == CUT_HERE ==
> > diff --git a/mm/vmscan.c b/mm/vmscan.c
> > index f73b865..cc23f04 100644
> > --- a/mm/vmscan.c
> > +++ b/mm/vmscan.c
> > @@ -1341,10 +1341,6 @@ static inline bool should_reclaim_stall(unsigned long nr_taken,
> >        if (current_is_kswapd())
> >                return false;
> >
> > -       /* Only stall on lumpy reclaim */
> > -       if (sc->reclaim_mode & RECLAIM_MODE_SINGLE)
> > -               return false;
> > -
> >        /* If we have relaimed everything on the isolated list, no stall */
> >        if (nr_freed == nr_taken)
> >                return false;
> >
> >
> >
> > Then, if you don't see any unnecessary OOM but still see the hangup,
> > could you apply this patch based on previous?
>
> With this patch, I started GNOME and Firefox, turned on swap, and ran
> test_mempressure.sh 1500 1400 1. Instant panic (or OOPS and hang or
> something -- didn't get the top part). Picture attached -- it looks
> like memcg might be involved. I'm running F15, so it might even be
> doing something.
>

Hmm, which kernel version are you using?
I think memcg is not guilty because the RIP is in shrink_page_list().
But OK, I'll dig into this. Could you send us your .config?

Thanks,
-Kame

> I won't be able to get netconsole dumps until next week because I'm
> out of town and only have this one computer here.
>
> I haven't tried the other patch.
>
> Also, the !pgdat_balanced fix plus the if (need_resched()) return
> false patch just hung once on 2.6.37-rc9. I don't know what triggered
> it. Maybe yum.
>
> --Andy