Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932329Ab2E3IZi (ORCPT ); Wed, 30 May 2012 04:25:38 -0400 Received: from mail-qc0-f174.google.com ([209.85.216.174]:39980 "EHLO mail-qc0-f174.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932290Ab2E3IZ1 (ORCPT ); Wed, 30 May 2012 04:25:27 -0400 Message-ID: <4FC5D973.3080108@gmail.com> Date: Wed, 30 May 2012 04:25:23 -0400 From: KOSAKI Motohiro User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.7; rv:12.0) Gecko/20120428 Thunderbird/12.0.1 MIME-Version: 1.0 To: Peter Zijlstra CC: Rik van Riel , Andrea Arcangeli , linux-kernel@vger.kernel.org, linux-mm@kvack.org, Hillf Danton , Dan Smith , Linus Torvalds , Andrew Morton , Thomas Gleixner , Ingo Molnar , Paul Turner , Suresh Siddha , Mike Galbraith , "Paul E. McKenney" , Lai Jiangshan , Bharata B Rao , Lee Schermerhorn , Johannes Weiner , Srivatsa Vaddagiri , Christoph Lameter , kosaki.motohiro@gmail.com Subject: Re: [PATCH 13/35] autonuma: add page structure fields References: <1337965359-29725-1-git-send-email-aarcange@redhat.com> <1337965359-29725-14-git-send-email-aarcange@redhat.com> <1338297385.26856.74.camel@twins> <4FC4D58A.50800@redhat.com> <1338303251.26856.94.camel@twins> In-Reply-To: <1338303251.26856.94.camel@twins> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1527 Lines: 35 (5/29/12 10:54 AM), Peter Zijlstra wrote: > On Tue, 2012-05-29 at 09:56 -0400, Rik van Riel wrote: >> On 05/29/2012 09:16 AM, Peter Zijlstra wrote: >>> On Fri, 2012-05-25 at 19:02 +0200, Andrea Arcangeli wrote: >> >>> 24 bytes per page.. or ~0.6% of memory gone. This is far too great a >>> price to pay. >>> >>> At LSF/MM Rik already suggested you limit the number of pages that can >>> be migrated concurrently and use this to move the extra list_head out of >>> struct page and into a smaller amount of extra structures, reducing the >>> total overhead. >> >> For THP, we should be able to track this NUMA info on a >> 2MB page granularity. > > Yeah, but that's another x86-only feature, _IF_ we're going to do this > it must be done for all archs that have CONFIG_NUMA, thus we're stuck > with 4k (or other base page size). Even if THP=n, we don't need 4k granularity. All modern malloc implementation have per-thread heap (e.g. glibc call it as arena) and it is usually 1-8MB size. So, if it is larger than 2MB, we can always use per-pmd tracking. iow, memory consumption reduce to 1/512. My suggestion is, track per-pmd (i.e. 2M size) granularity and fix glibc too (current glibc malloc has dynamically arena size adjusting feature and then it often become less than 2M). -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/