Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752047Ab0GRHuQ (ORCPT ); Sun, 18 Jul 2010 03:50:16 -0400 Received: from courier.cs.helsinki.fi ([128.214.9.1]:34435 "EHLO mail.cs.helsinki.fi" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751075Ab0GRHuM (ORCPT ); Sun, 18 Jul 2010 03:50:12 -0400 Message-ID: <4C42B228.9040401@cs.helsinki.fi> Date: Sun, 18 Jul 2010 10:50:00 +0300 From: Pekka Enberg User-Agent: Thunderbird 2.0.0.24 (Macintosh/20100228) MIME-Version: 1.0 To: Nitin Gupta CC: Hugh Dickins , Andrew Morton , Greg KH , Dan Magenheimer , Rik van Riel , Avi Kivity , Christoph Hellwig , Minchan Kim , Konrad Rzeszutek Wilk , linux-mm , linux-kernel Subject: Re: [PATCH 0/8] zcache: page cache compression support References: <1279283870-18549-1-git-send-email-ngupta@vflare.org> In-Reply-To: <1279283870-18549-1-git-send-email-ngupta@vflare.org> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 4058 Lines: 74 Nitin Gupta wrote: > Frequently accessed filesystem data is stored in memory to reduce access to > (much) slower backing disks. Under memory pressure, these pages are freed and > when needed again, they have to be read from disks again. When combined working > set of all running application exceeds amount of physical RAM, we get extereme > slowdown as reading a page from disk can take time in order of milliseconds. > > Memory compression increases effective memory size and allows more pages to > stay in RAM. Since de/compressing memory pages is several orders of magnitude > faster than disk I/O, this can provide signifant performance gains for many > workloads. Also, with multi-cores becoming common, benefits of reduced disk I/O > should easily outweigh the problem of increased CPU usage. > > It is implemented as a "backend" for cleancache_ops [1] which provides > callbacks for events such as when a page is to be removed from the page cache > and when it is required again. We use them to implement a 'second chance' cache > for these evicted page cache pages by compressing and storing them in memory > itself. > > We only keep pages that compress to PAGE_SIZE/2 or less. Compressed chunks are > stored using xvmalloc memory allocator which is already being used by zram > driver for the same purpose. Zero-filled pages are checked and no memory is > allocated for them. > > A separate "pool" is created for each mount instance for a cleancache-aware > filesystem. Each incoming page is identified with > where inode_no identifies file within the filesystem corresponding to pool_id > and index is offset of the page within this inode. Within a pool, inodes are > maintained in an rb-tree and each of its nodes points to a separate radix-tree > which maintains list of pages within that inode. > > While compression reduces disk I/O, it also reduces the space available for > normal (uncompressed) page cache. This can result in more frequent page cache > reclaim and thus higher CPU overhead. Thus, it's important to maintain good hit > rate for compressed cache or increased CPU overhead can nullify any other > benefits. This requires adaptive (compressed) cache resizing and page > replacement policies that can maintain optimal cache size and quickly reclaim > unused compressed chunks. This work is yet to be done. However, in the current > state, it allows manually resizing cache size using (per-pool) sysfs node > 'memlimit' which in turn frees any excess pages *sigh* randomly. > > Finally, it uses percpu stats and compression buffers to allow better > performance on multi-cores. Still, there are known bottlenecks like a single > xvmalloc mempool per zcache pool and few others. I will work on this when I > start with profiling. > > * Performance numbers: > - Tested using iozone filesystem benchmark > - 4 CPUs, 1G RAM > - Read performance gain: ~2.5X > - Random read performance gain: ~3X > - In general, performance gains for every kind of I/O > > Test details with graphs can be found here: > http://code.google.com/p/compcache/wiki/zcacheIOzone > > If I can get some help with testing, it would be intersting to find its > effect in more real-life workloads. In particular, I'm intersted in finding > out its effect in KVM virtualization case where it can potentially allow > running more number of VMs per-host for a given amount of RAM. With zcache > enabled, VMs can be assigned much smaller amount of memory since host can now > hold bulk of page-cache pages, allowing VMs to maintain similar level of > performance while a greater number of them can be hosted. So why would someone want to use zram if they have transparent page cache compression with zcache? That is, why is this not a replacement for zram? Pekka -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/