Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752811Ab1DTOut (ORCPT ); Wed, 20 Apr 2011 10:50:49 -0400 Received: from smtp109.prem.mail.ac4.yahoo.com ([76.13.13.92]:46439 "HELO smtp109.prem.mail.ac4.yahoo.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with SMTP id S1751837Ab1DTOuq (ORCPT ); Wed, 20 Apr 2011 10:50:46 -0400 X-Yahoo-SMTP: _Dag8S.swBC1p4FJKLCXbs8NQzyse1SYSgnAbY0- X-YMail-OSG: MlEtCNYVM1mI1RXWTW1C_MTKCiOMHLNTBzcBH6c0lZIZNkB uvFpuRUcG6Lkam644E8SqtjK.Y3Kr00CxDajXa1YrxRSw2IfD_49s9fuTm6E M_Hc3TYow_M9MbMN9mSVG4t44yKzDOWiodzmHtagKypomSyK4YzemYBjeJtN 6WjH6XGXQpgjrRdRJFdPScobgfdbfzpvYuTtRhKwA0dkQ1uJcpwrEDNygOwN wk45cGJDr89GTag8seO86dm7Gqe4UWL1pp1dhUhdFlCi7G9GA7iY2539B05j G6CiKkLNjv4tkUISHtYNtfvazCAG5j2jgNZ25b.LStOfZvS1A X-Yahoo-Newman-Property: ymail-3 Date: Wed, 20 Apr 2011 09:50:40 -0500 (CDT) From: Christoph Lameter X-X-Sender: cl@router.home To: James Bottomley cc: Pekka Enberg , Matthew Wilcox , KOSAKI Motohiro , Michal Hocko , Andrew Morton , Hugh Dickins , linux-mm@kvack.org, LKML , linux-parisc@vger.kernel.org, David Rientjes , Ingo Molnar , x86 maintainers , linux-arch@vger.kernel.org, Mel Gorman Subject: Re: [PATCH v3] mm: make expand_downwards symmetrical to expand_upwards In-Reply-To: <1303308938.2587.8.camel@mulgrave.site> Message-ID: References: <20110420102314.4604.A69D9226@jp.fujitsu.com> <20110420161615.462D.A69D9226@jp.fujitsu.com> <20110420112020.GA31296@parisc-linux.org> <1303308938.2587.8.camel@mulgrave.site> User-Agent: Alpine 2.00 (DEB 1167 2008-08-23) MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2255 Lines: 42 On Wed, 20 Apr 2011, James Bottomley wrote: > 1. We can look at what imposing NUMA on the DISCONTIGMEM archs > would do ... the embedded ones are going to be hardest hit, but > if it's not too much extra code, it might be palatable. > 2. The other is that we can audit mm to look at all the node > assumptions in the non-numa case. My suspicion is that > accidentally or otherwise, it mostly works for the normal case, > so there might not be much needed to pull it back to working > properly for DISCONTIGMEM. The older code may work. SLAB f.e. does not call page_to_nid() in the !NUMA case but keeps special metadata structures around in each slab page that records the node used for allocation. The problem is with new code added/revised in the last 5 years or so that uses page_to_nid() and allocates only a single structure for !NUMA. There are also VM_BUG_ONs in the page allocator that should trigger if page_to_nid() returns strange values. I wonder why that never occurred. > 3. Finally we could look at deprecating DISCONTIGMEM in favour of > SPARSEMEM, but we'd still need to fix -stable for that case. > Especially as it will take time to convert all the architectures The fix needed is to mark DISCONTIGMEM without NUMA as broken for now. We need an audit of the core VM before removing that or making it contingent on the configurations of various VM subsystems. > I'm certainly with Matthew: DISCONTIGMEM is supposed to be a lightweight > framework which allows machines with split physical memory ranges to > work. That's a very common case nowadays. Numa is supposed to be a > heavyweight framework to preserve node locality for non-uniform memory > access boxes (which none of the DISCONTIGMEM && !NUMA systems are). Well yes but we have SPARSE for that today. DISCONTIG with multiple per pgdat structures in a !NUMA case is just weird and unexpected for many who have done VM coding in the last years. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/