Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753780AbaBCXIj (ORCPT ); Mon, 3 Feb 2014 18:08:39 -0500 Received: from mail.linuxfoundation.org ([140.211.169.12]:47176 "EHLO mail.linuxfoundation.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753741AbaBCXIh (ORCPT ); Mon, 3 Feb 2014 18:08:37 -0500 Date: Mon, 3 Feb 2014 15:08:35 -0800 From: Andrew Morton To: Dan Streetman Cc: Seth Jennings , Linux-MM , linux-kernel , Bob Liu , Minchan Kim , Weijie Yang , Shirish Pargaonkar , Mel Gorman Subject: Re: [PATCH v2] mm/zswap: add writethrough option Message-Id: <20140203150835.f55fd427d0ebb0c2943f266b@linux-foundation.org> In-Reply-To: <1390831279-5525-1-git-send-email-ddstreet@ieee.org> References: <1387459407-29342-1-git-send-email-ddstreet@ieee.org> <1390831279-5525-1-git-send-email-ddstreet@ieee.org> X-Mailer: Sylpheed 3.2.0beta5 (GTK+ 2.24.10; x86_64-pc-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, 27 Jan 2014 09:01:19 -0500 Dan Streetman wrote: > Currently, zswap is writeback cache; stored pages are not sent > to swap disk, and when zswap wants to evict old pages it must > first write them back to swap cache/disk manually. This avoids > swap out disk I/O up front, but only moves that disk I/O to > the writeback case (for pages that are evicted), and adds the > overhead of having to uncompress the evicted pages and the > need for an additional free page (to store the uncompressed page). > > This optionally changes zswap to writethrough cache by enabling > frontswap_writethrough() before registering, so that any > successful page store will also be written to swap disk. The > default remains writeback. To enable writethrough, the param > zswap.writethrough=1 must be used at boot. > > Whether writeback or writethrough will provide better performance > depends on many factors including disk I/O speed/throughput, > CPU speed(s), system load, etc. In most cases it is likely > that writeback has better performance than writethrough before > zswap is full, but after zswap fills up writethrough has > better performance than writeback. > > The reason to add this option now is, first to allow any zswap > user to be able to test using writethrough to determine if they > get better performance than using writeback, and second to allow > future updates to zswap, such as the possibility of dynamically > switching between writeback and writethrough. > > ... > > Based on specjbb testing on my laptop, the results for both writeback > and writethrough are better than not using zswap at all, but writeback > does seem to be better than writethrough while zswap isn't full. Once > it fills up, performance for writethrough is essentially close to not > using zswap, while writeback seems to be worse than not using zswap. > However, I think more testing on a wider span of systems and conditions > is needed. Additionally, I'm not sure that specjbb is measuring true > performance under fully loaded cpu conditions, so additional cpu load > might need to be added or specjbb parameters modified (I took the > values from the 4 "warehouses" test run). > > In any case though, I think having writethrough as an option is still > useful. More changes could be made, such as changing from writeback > to writethrough based on the zswap % full. And the patch doesn't > change default behavior - writethrough must be specifically enabled. > > The %-ized numbers I got from specjbb on average, using the default > 20% max_pool_percent and varying the amount of heap used as shown: > > ram | no zswap | writeback | writethrough > 75 93.08 100 96.90 > 87 96.58 95.58 96.72 > 100 92.29 89.73 86.75 > 112 63.80 38.66 19.66 > 125 4.79 29.90 15.75 > 137 4.99 4.50 4.75 > 150 4.28 4.62 5.01 > 162 5.20 2.94 4.66 > 175 5.71 2.11 4.84 Changelog is very useful, thanks for taking the time. It does sound like the feature is of marginal benefit. Is "zswap filled up" an interesting or useful case to optimize? otoh the addition is pretty simple and we can later withdraw the whole thing without breaking anyone's systems. What do people think? -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/