Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753990AbZLBMzN (ORCPT ); Wed, 2 Dec 2009 07:55:13 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1753764AbZLBMzM (ORCPT ); Wed, 2 Dec 2009 07:55:12 -0500 Received: from mx1.redhat.com ([209.132.183.28]:10106 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753699AbZLBMzL (ORCPT ); Wed, 2 Dec 2009 07:55:11 -0500 Date: Wed, 2 Dec 2009 13:55:01 +0100 From: Andrea Arcangeli To: Rik van Riel Cc: KOSAKI Motohiro , KAMEZAWA Hiroyuki , Hugh Dickins , Andrew Morton , Izik Eidus , Chris Wright , linux-kernel@vger.kernel.org, linux-mm@kvack.org Subject: Re: [PATCH 2/9] ksm: let shared pages be swappable Message-ID: <20091202125501.GD28697@random.random> References: <20091201181633.5C31.A69D9226@jp.fujitsu.com> <20091201093738.GL30235@random.random> <20091201184535.5C37.A69D9226@jp.fujitsu.com> <20091201095947.GM30235@random.random> <4B15F642.1080308@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <4B15F642.1080308@redhat.com> Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1995 Lines: 40 On Wed, Dec 02, 2009 at 12:08:18AM -0500, Rik van Riel wrote: > The VM needs to touch a few (but only a few) PTEs in > that situation, to make sure that anonymous pages get > moved to the inactive anon list and get to a real chance > at being referenced before we try to evict anonymous > pages. > > Without a small amount of pre-aging, we would end up > essentially doing FIFO replacement of anonymous memory, > which has been known to be disastrous to performance > for over 40 years now. So far the only kernel that hangs in fork is the newer one... In general I cannot care less about FIFO, I care about no CPU waste on 100% of my systems were swap is not needed. All my unmapped cache is 100% garbage collectable, and there is never any reason to flush any tlb and walk the rmap chain. Give me a knob to disable the CPU waste given I know what is going on, on my systems. I am totally ok with slightly slower swap performance and fifo replacement in case I eventually hit swap for a little while, then over time if memory pressure stays high swap behavior will improve regardless of flooding ipis to clear young bit when there are hundred gigabytes of freeaeble cache unmapped and clean. > Having said that - it may be beneficial to keep very heavily > shared pages on the active list, without ever trying to scan > the ptes associated with them. Just mapped pages in general, not heavily... The other thing that is beneficial likely is to stop page_referenced after 64 young bit clear, that is referenced enough, you can enable this under my knob so that it won't screw your algorithm. I don't have 1 terabyte of memory, so you don't have to worry for me, I just want every cycle out of my cpu without having to use O_DIRECT all the time. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/