Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755716AbZJSCoy (ORCPT ); Sun, 18 Oct 2009 22:44:54 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1755669AbZJSCox (ORCPT ); Sun, 18 Oct 2009 22:44:53 -0400 Received: from Cpsmtpm-eml107.kpnxchange.com ([195.121.3.11]:55195 "EHLO CPSMTPM-EML107.kpnxchange.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753125AbZJSCow (ORCPT ); Sun, 18 Oct 2009 22:44:52 -0400 From: Frans Pop To: Pekka Enberg Subject: Re: [Bug #14141] order 2 page allocation failures in iwlagn Date: Mon, 19 Oct 2009 04:44:52 +0200 User-Agent: KMail/1.9.9 Cc: Mel Gorman , David Rientjes , KOSAKI Motohiro , "Rafael J. Wysocki" , Linux Kernel Mailing List , Kernel Testers List , Reinette Chatre , Bartlomiej Zolnierkiewicz , Karol Lewandowski , Mohamed Abbas , "John W. Linville" , linux-mm@kvack.org, jens.axboe@oracle.com References: <3onW63eFtRF.A.xXH.oMTxKB@chimera> <200910190133.33183.elendil@planet.nl> <1255912562.6824.9.camel@penberg-laptop> In-Reply-To: <1255912562.6824.9.camel@penberg-laptop> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200910190444.55867.elendil@planet.nl> X-OriginalArrivalTime: 19 Oct 2009 02:44:56.0390 (UTC) FILETIME=[2491C660:01CA5066] Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2117 Lines: 46 On Monday 19 October 2009, Pekka Enberg wrote: > On Wednesday 14 October 2009, Frans Pop wrote: > > On Thursday 15 October 2009, Mel Gorman wrote: > > > Outside the range of commits suspected of causing problems was the > > > following. It's extremely low probability > > > > > > Commit 8aa7e84 Fix congestion_wait() sync/async vs read/write > > > confusion This patch alters the call to congestion_wait() in the > > > page allocator. Frankly, I don't get the change but it might worth > > > checking if replacing BLK_RW_ASYNC with WRITE on top of 2.6.31 makes > > > any difference > > > > This is the real culprit. Mel: thanks very much for looking beyond the > > area I identified. Your overview of mm changes was exactly what I > > needed and really helped a lot during my later tests. > > > > This commit definitely causes most of the problems; confirmed by > > reverting it on top of 2.6.31 (also requires reverting 373c0a7e, which > > is a later build fix). > > Mel/Jens, any ideas why commit 8aa7e84 makes us run out of high order > pages? Should we be using BLK_RW_SYNC in mm/page_alloc.c instead of > BLK_RW_ASYNC? I'm starting to think that this commit may not be directly related to high order allocation failures. The fact that I'm seeing SKB allocation failures earlier because of this commit could be just a side effect. It could be that instead the main impact of this commit is on encrypted file system and/or encrypted swap (kcryptd). Besides mm the commit also touches dm-crypt (and nfs/write.c, but as I'm only reading from NFS that's unlikely). Reason for thinking this is that reverting it makes no difference for Karol [1]. It will be interesting to see if it does make a difference for Sven Geggus [2]. /me wonders if we'll ever get to the bottom of this... [1] http://lkml.org/lkml/2009/10/18/138 [2] http://lkml.org/lkml/2009/10/17/113 -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/