Date: Sun, 11 Feb 2018 09:05:40 +0200
From: Mike Rapoport
To: Sergey Senozhatsky
Cc: Minchan Kim, Andrew Morton, linux-kernel@vger.kernel.org, linux-mm@kvack.org, Sergey Senozhatsky
Subject: Re: [PATCHv2 1/2] zsmalloc: introduce zs_huge_object() function
References: <20180207092919.19696-2-sergey.senozhatsky@gmail.com> <20180210082321.17798-1-sergey.senozhatsky@gmail.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <20180210082321.17798-1-sergey.senozhatsky@gmail.com>
User-Agent: Mutt/1.5.24 (2015-08-30)
Message-Id: <20180211070539.GA13931@rapoport-lnx>
Sender: linux-kernel-owner@vger.kernel.org
Precedence:
 bulk
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

Some more nitpicks :)

On Sat, Feb 10, 2018 at 05:23:21PM +0900, Sergey Senozhatsky wrote:
> Not every object can be share its zspage with other objects, e.g.
> when the object is as big as zspage or nearly as big a zspage.
> For such objects zsmalloc has a so called huge class - every object
> which belongs to huge class consumes the entire zspage (which
> consists of a physical page). On x86_64, PAGE_SHIFT 12 box, the
> first non-huge class size is 3264, so starting down from size 3264,
> objects can share page(-s) and thus minimize memory wastage.
>
> ZRAM, however, has its own statically defined watermark for huge
> objects - "3 * PAGE_SIZE / 4 = 3072", and forcibly stores every
> object larger than this watermark (3072) as a PAGE_SIZE object,
> in other words, to a huge class, while zsmalloc can keep some of
> those objects in non-huge classes. This results in increased
> memory consumption.
>
> zsmalloc knows better if the object is huge or not. Introduce
> zs_huge_object() function which tells if the given object can be
> stored in one of non-huge classes or not. This will let us to drop
> ZRAM's huge object watermark and fully rely on zsmalloc when we
> decide if the object is huge.
>
> Signed-off-by: Sergey Senozhatsky
> ---
>  include/linux/zsmalloc.h |  2 ++
>  mm/zsmalloc.c            | 26 ++++++++++++++++++++++++++
>  2 files changed, 28 insertions(+)
>
> diff --git a/include/linux/zsmalloc.h b/include/linux/zsmalloc.h
> index 57a8e98f2708..9a1baf673cc1 100644
> --- a/include/linux/zsmalloc.h
> +++ b/include/linux/zsmalloc.h
> @@ -47,6 +47,8 @@ void zs_destroy_pool(struct zs_pool *pool);
>  unsigned long zs_malloc(struct zs_pool *pool, size_t size, gfp_t flags);
>  void zs_free(struct zs_pool *pool, unsigned long obj);
>
> +bool zs_huge_object(size_t sz);
> +
>  void *zs_map_object(struct zs_pool *pool, unsigned long handle,
>  			enum zs_mapmode mm);
>  void zs_unmap_object(struct zs_pool *pool, unsigned long handle);
> diff --git a/mm/zsmalloc.c b/mm/zsmalloc.c
> index c3013505c305..922180183ca3 100644
> --- a/mm/zsmalloc.c
> +++ b/mm/zsmalloc.c
> @@ -192,6 +192,7 @@ static struct vfsmount *zsmalloc_mnt;
>   * (see: fix_fullness_group())
>   */
>  static const int fullness_threshold_frac = 4;
> +static size_t zs_huge_class_size;
>
>  struct size_class {
>  	spinlock_t lock;
> @@ -1417,6 +1418,28 @@ void zs_unmap_object(struct zs_pool *pool, unsigned long handle)
>  }
>  EXPORT_SYMBOL_GPL(zs_unmap_object);
>
> +/**
> + * zs_huge_object() - Test if a compressed object's size is too big for normal
> + *                    zspool classes and it shall be stored in a huge class.

I think "it should be stored" is more appropriate

> + * @sz: Size of the compressed object (in bytes).
> + *
> + * The function checks if the object's size falls into huge_class
> + * area. We must take handle size into account and test the actual
> + * size we are going to use, because zs_malloc() unconditionally
> + * adds %ZS_HANDLE_SIZE before it performs %size_class lookup.

                                              ^ &size_class ;-)

> + *
> + * Context: Any context.
> + *
> + * Return:
> + * * true - The object's size is too big, it will be stored in a huge class.
> + * * false - The object will be store in normal zspool classes.
> + */
> +bool zs_huge_object(size_t sz)
> +{
> +	return sz + ZS_HANDLE_SIZE >= zs_huge_class_size;
> +}
> +EXPORT_SYMBOL_GPL(zs_huge_object);
> +
>  static unsigned long obj_malloc(struct size_class *class,
>  				struct zspage *zspage, unsigned long handle)
>  {
> @@ -2404,6 +2427,9 @@ struct zs_pool *zs_create_pool(const char *name)
>  			INIT_LIST_HEAD(&class->fullness_list[fullness]);
>
>  		prev_class = class;
> +		if (pages_per_zspage == 1 && objs_per_zspage == 1
> +				&& !zs_huge_class_size)
> +			zs_huge_class_size = size;
>  	}
>
>  	/* debug only, don't abort if it fails */
> --
> 2.16.1
>

--
Sincerely yours,
Mike.
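
For readers following the thread, here is a rough user-space sketch of the
two checks the commit message compares: ZRAM's static watermark and the
handle-adjusted comparison done by the proposed zs_huge_object(). It is not
kernel code. All sketch_* names and SKETCH_* macros are made up for
illustration, the 4 KiB page size matches the x86_64 example above, the
handle is assumed to be one unsigned long, and the threshold (3264 plus an
assumed 16-byte class step) is a guess at a plausible value, not the one
zs_create_pool() would actually compute.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Assumption: 4 KiB pages, as on the x86_64 PAGE_SHIFT 12 box in the commit message. */
#define SKETCH_PAGE_SIZE 4096u

/* Assumption: the handle is one unsigned long, standing in for ZS_HANDLE_SIZE. */
#define SKETCH_HANDLE_SIZE sizeof(unsigned long)

/*
 * Hypothetical threshold: one assumed 16-byte class step above the 3264-byte
 * class quoted in the commit message. The real value is computed in
 * zs_create_pool() and is not reproduced here.
 */
static const size_t sketch_huge_class_size = 3264 + 16;

/* ZRAM's static watermark from the commit message: 3 * PAGE_SIZE / 4 = 3072. */
static bool sketch_zram_watermark_huge(size_t sz)
{
	return sz > 3 * SKETCH_PAGE_SIZE / 4;
}

/* Same comparison as the patch's zs_huge_object(), using the sketch values. */
static bool sketch_zs_huge_object(size_t sz)
{
	/* Sanity check: a huge-class threshold cannot exceed one page. */
	assert(sketch_huge_class_size <= SKETCH_PAGE_SIZE);
	return sz + SKETCH_HANDLE_SIZE >= sketch_huge_class_size;
}
```

With these assumed values, a 3100-byte object is above ZRAM's 3072 watermark
but below the sketch threshold even after the handle is added, which is the
case the commit message describes as wasting memory.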