Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932084Ab1F0HNF (ORCPT ); Mon, 27 Jun 2011 03:13:05 -0400 Received: from mail.betterlinux.com ([199.58.199.50]:45874 "EHLO mail.betterlinux.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1757093Ab1F0HLp (ORCPT ); Mon, 27 Jun 2011 03:11:45 -0400 Date: Mon, 27 Jun 2011 09:11:39 +0200 From: Andrea Righi To: KOSAKI Motohiro Cc: akpm@linux-foundation.org, minchan.kim@gmail.com, riel@redhat.com, peterz@infradead.org, hannes@cmpxchg.org, kamezawa.hiroyu@jp.fujitsu.com, aarcange@redhat.com, hughd@google.com, jamesjer@betterlinux.com, marcus@bluehost.com, matt@bluehost.com, tytso@mit.edu, shaohua.li@intel.com, P@draigBrady.com, linux-mm@kvack.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH v3 0/2] fadvise: support POSIX_FADV_NOREUSE Message-ID: <20110627071139.GC1247@thinkpad> References: <1308923350-7932-1-git-send-email-andrea@betterlinux.com> <4E07F349.2040900@jp.fujitsu.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <4E07F349.2040900@jp.fujitsu.com> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3148 Lines: 70 On Mon, Jun 27, 2011 at 12:04:41PM +0900, KOSAKI Motohiro wrote: > (2011/06/24 22:49), Andrea Righi wrote: > > There were some reported problems in the past about trashing page cache > > when a backup software (i.e., rsync) touches a huge amount of pages (see > > for example [1]). > > > > This problem has been almost fixed by the Minchan Kim's patch [2] and a > > proper use of fadvise() in the backup software. For example this patch > > set [3] has been proposed for inclusion in rsync. > > > > However, there can be still other similar trashing problems: when the > > backup software reads all the source files, some of them may be part of > > the actual working set of the system. When a POSIX_FADV_DONTNEED is > > performed _all_ pages are evicted from pagecache, both the working set > > and the use-once pages touched only by the backup software. > > > > A previous proposal [4] tried to resolve this problem being less > > agressive in invalidating active pages, moving them to the inactive list > > intead of just evict them from the page cache. > > > > However, this approach changed completely the old behavior of > > invalidate_mapping_pages(), that is not only used by fadvise. > > > > The new solution maps POSIX_FADV_NOREUSE to the less-agressive page > > invalidation policy. > > > > With POSIX_FADV_NOREUSE active pages are moved to the tail of the > > inactive list, and pages in the inactive list are just removed from page > > cache. Pages mapped by other processes or unevictable pages are not > > touched at all. > > > > In this way if the backup was the only user of a page, that page will be > > immediately removed from the page cache by calling POSIX_FADV_NOREUSE. > > If the page was also touched by other tasks it'll be moved to the > > inactive list, having another chance of being re-added to the working > > set, or simply reclaimed when memory is needed. > > > > In conclusion, now userspace applications that want to drop some page > > cache pages can choose between the following advices: > > > > POSIX_FADV_DONTNEED = drop page cache if possible > > POSIX_FADV_NOREUSE = reduce page cache eligibility > > Eeek. > > Your POSIX_FADV_NOREUSE is very different from POSIX definition. > POSIX says, > > POSIX_FADV_NOREUSE > Specifies that the application expects to access the specified data once and then > not reuse it thereafter. > > IfI understand correctly, it designed for calling _before_ data access > and to be expected may prevent lru activation. But your NORESE is designed > for calling _after_ data access. Big difference might makes a chance of > portability issue. You're right. NOREUSE is designed to implement drop behind policy. I'll post a new patch that will plug this logic in DONTNEED (like the presious version), but without breaking the old /proc/sys/vm/drop_caches behavior. Thanks, -Andrea -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/