Received: by 2002:a05:6358:c692:b0:131:369:b2a3 with SMTP id fe18csp3476749rwb; Sun, 30 Jul 2023 08:24:39 -0700 (PDT) X-Google-Smtp-Source: APBJJlHbq42fcPN8IFxysyquh3AaqZ9ie9eh/OAlIa+eH/EJDwtGex5Ct8D+n4PoQQDlmaKpRedA X-Received: by 2002:a05:6a20:6a28:b0:13c:9317:ad78 with SMTP id p40-20020a056a206a2800b0013c9317ad78mr7727031pzk.46.1690730678918; Sun, 30 Jul 2023 08:24:38 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1690730678; cv=none; d=google.com; s=arc-20160816; b=QD/42gSJMmWPhzPXEph/pexRf2Fv88WrpnHxB+wIWq10tqNkX8b9oSVRo11HUNwREz R4xlSyQmZHxEQxwXbbY4LBivF0fPROtwUwm9un1sMFNgfHLMpRiKIP8Dox26n2Kxakbj 53TgPAG0qk81y/pq5KzEZ+qtjD4uO9MuIGQYr92rhlQkf+tLCt/c/XiVtgcxl4momUiJ kRUMHlb3cVfVdPQXDXmSq/tjEI+zO3jcOMMjk5I3Vm1zNH8/Kv1o8Il170XvurzQVAg1 Ky9UMIUY6WJiKFBgDa/6SmDnsPDc4VtKvQEr22dYFbGwF421zeo8galYO54XMvsFlNth rZ8Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:references :cc:to:from:subject:user-agent:mime-version:date:message-id; bh=CgWuCkb4gT+giSptaE9LErS/XWRzbYB5Y3Zmm/ik25U=; fh=ZwAbavAzXvkSJ948IEP0j5AcjtVRN7xwoNh61MhW0ug=; b=yOa40s0eaWcrAhTXcSfW5aOOS9c0+kdTKnnPEkghA4G/iNzIKO64h9R7JDCAASBk9K vwa0OM6vDZxlpYu5if1wcTzJ+Gbh6XK7EO2HEN30rTZgTDbzmI/DhJG2Es0mwa0B/mQ9 JnQ+2a8WX6VTJs9T67qkc5hOkicuJ7o92cGax1hEKWsVz8i8S+krExWH9myPYATWEVU7 C79yuZyV4hQL8tCrJ3aN/nulZEUKYx8AtqfkMoFFNonMDUkdr9fjjhjPiJYLUMHCHnWK ElzrcZRPzoIk9M8GNtEAt+6J80WsXYlxx1g/U+HkqNzLTFFMnIbWMJqhUXavIzVRJeXG FXGA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id s1-20020a632c01000000b005518971fccbsi5792187pgs.712.2023.07.30.08.24.27; Sun, 30 Jul 2023 08:24:38 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229803AbjG3OGY (ORCPT + 99 others); Sun, 30 Jul 2023 10:06:24 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:39058 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229736AbjG3OGW (ORCPT ); Sun, 30 Jul 2023 10:06:22 -0400 Received: from out30-132.freemail.mail.aliyun.com (out30-132.freemail.mail.aliyun.com [115.124.30.132]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 74D4A189 for ; Sun, 30 Jul 2023 07:06:20 -0700 (PDT) X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R111e4;CH=green;DM=||false|;DS=||;FP=0|-1|-1|-1|0|-1|-1|-1;HT=ay29a033018045192;MF=hsiangkao@linux.alibaba.com;NM=1;PH=DS;RN=3;SR=0;TI=SMTPD_---0VoW0wIC_1690725974; Received: from 192.168.75.41(mailfrom:hsiangkao@linux.alibaba.com fp:SMTPD_---0VoW0wIC_1690725974) by smtp.aliyun-inc.com; Sun, 30 Jul 2023 22:06:15 +0800 Message-ID: Date: Sun, 30 Jul 2023 22:06:14 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:102.0) Gecko/20100101 Thunderbird/102.13.0 Subject: Re: [PATCH v4] erofs: DEFLATE compression support From: Gao Xiang To: Chao Yu , linux-erofs@lists.ozlabs.org Cc: LKML References: <20230713001441.30462-1-hsiangkao@linux.alibaba.com> <20230716091940.40899-1-hsiangkao@linux.alibaba.com> <9570176a-02c2-c500-14c6-b8ca220cfb96@kernel.org> <07c85385-be34-8fdc-876c-dc97639c5643@linux.alibaba.com> In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-10.0 required=5.0 tests=BAYES_00, ENV_AND_HDR_SPF_MATCH,NICE_REPLY_A,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE, SPF_PASS,T_SCC_BODY_TEXT_LINE,UNPARSEABLE_RELAY,USER_IN_DEF_SPF_WL autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2023/7/30 22:05, Gao Xiang wrote: > > > On 2023/7/30 22:01, Chao Yu wrote: >> On 2023/7/30 21:49, Gao Xiang wrote: >>> >>> >>> On 2023/7/30 20:53, Chao Yu wrote: >>>> On 2023/7/16 17:19, Gao Xiang wrote: >>>>> Add DEFLATE compression as the 3rd supported algorithm. >>>>> >>>>> DEFLATE is a popular generic-purpose compression algorithm for quite >>>>> long time (many advanced formats like gzip, zlib, zip, png are all >>>>> based on that) as Apple documentation written "If you require >>>>> interoperability with non-Apple devices, use COMPRESSION_ZLIB. [1]". >>>>> >>>>> Due to its popularity, there are several hardware on-market DEFLATE >>>>> accelerators, such as (s390) DFLTCC, (Intel) IAA/QAT, (HiSilicon) ZIP >>>>> accelerator, etc.  In addition, there are also several high-performence >>>>> IP cores and even open-source FPGA approches available for DEFLATE. >>>>> Therefore, it's useful to support DEFLATE compression in order to find >>>>> a way to utilize these accelerators for asynchronous I/Os and get >>>>> benefits from these later. >>>>> >>>>> Besides, it's a good choice to trade off between compression ratios >>>>> and performance compared to LZ4 and LZMA.  The DEFLATE core format is >>>>> simple as well as easy to understand, therefore the code size of its >>>>> decompressor is small even for the bootloader use cases.  The runtime >>>>> memory consumption is quite limited too (e.g. 32K + ~7K for each zlib >>>>> stream).  As usual, EROFS ourperforms similar approaches too. >>>>> >>>>> Alternatively, DEFLATE could still be used for some specific files >>>>> since EROFS supports multiple compression algorithms in one image. >>>>> >>>>> [1] https://developer.apple.com/documentation/compression/compression_algorithm >>>>> --- >>>>> changes since v3: >>>>>    - fix 'insz' mis-calculation, which leads to failure on 4k pclusters. >>>>> >>>>>    fs/erofs/Kconfig                |  15 ++ >>>>>    fs/erofs/Makefile               |   1 + >>>>>    fs/erofs/compress.h             |   2 + >>>>>    fs/erofs/decompressor.c         |   6 + >>>>>    fs/erofs/decompressor_deflate.c | 250 ++++++++++++++++++++++++++++++++ >>>>>    fs/erofs/erofs_fs.h             |   7 + >>>>>    fs/erofs/internal.h             |  20 +++ >>>>>    fs/erofs/super.c                |  10 ++ >>>>>    fs/erofs/zmap.c                 |   5 +- >>>>>    9 files changed, 314 insertions(+), 2 deletions(-) >>>>>    create mode 100644 fs/erofs/decompressor_deflate.c >>>>> >>>>> diff --git a/fs/erofs/Kconfig b/fs/erofs/Kconfig >>>>> index f259d92c9720..56a99ba8ce22 100644 >>>>> --- a/fs/erofs/Kconfig >>>>> +++ b/fs/erofs/Kconfig >>>>> @@ -99,6 +99,21 @@ config EROFS_FS_ZIP_LZMA >>>>>          If unsure, say N. >>>>> +config EROFS_FS_ZIP_DEFLATE >>>>> +    bool "EROFS DEFLATE compressed data support" >>>>> +    depends on EROFS_FS_ZIP >>>>> +    select ZLIB_INFLATE >>>>> +    help >>>>> +      Saying Y here includes support for reading EROFS file systems >>>>> +      containing DEFLATE compressed data.  It gives better compression >>>>> +      ratios than the default LZ4 format, while it costs more CPU >>>>> +      overhead. >>>>> + >>>>> +      DEFLATE support is an experimental feature for now and so most >>>>> +      file systems will be readable without selecting this option. >>>>> + >>>>> +      If unsure, say N. >>>>> + >>>>>    config EROFS_FS_ONDEMAND >>>>>        bool "EROFS fscache-based on-demand read support" >>>>>        depends on CACHEFILES_ONDEMAND && (EROFS_FS=m && FSCACHE || EROFS_FS=y && FSCACHE=y) >>>>> diff --git a/fs/erofs/Makefile b/fs/erofs/Makefile >>>>> index a3a98fc3e481..994d0b9deddf 100644 >>>>> --- a/fs/erofs/Makefile >>>>> +++ b/fs/erofs/Makefile >>>>> @@ -5,4 +5,5 @@ erofs-objs := super.o inode.o data.o namei.o dir.o utils.o sysfs.o >>>>>    erofs-$(CONFIG_EROFS_FS_XATTR) += xattr.o >>>>>    erofs-$(CONFIG_EROFS_FS_ZIP) += decompressor.o zmap.o zdata.o pcpubuf.o >>>>>    erofs-$(CONFIG_EROFS_FS_ZIP_LZMA) += decompressor_lzma.o >>>>> +erofs-$(CONFIG_EROFS_FS_ZIP_DEFLATE) += decompressor_deflate.o >>>>>    erofs-$(CONFIG_EROFS_FS_ONDEMAND) += fscache.o >>>>> diff --git a/fs/erofs/compress.h b/fs/erofs/compress.h >>>>> index b1b846504027..349c3316ae6b 100644 >>>>> --- a/fs/erofs/compress.h >>>>> +++ b/fs/erofs/compress.h >>>>> @@ -94,4 +94,6 @@ extern const struct z_erofs_decompressor erofs_decompressors[]; >>>>>    /* prototypes for specific algorithms */ >>>>>    int z_erofs_lzma_decompress(struct z_erofs_decompress_req *rq, >>>>>                    struct page **pagepool); >>>>> +int z_erofs_deflate_decompress(struct z_erofs_decompress_req *rq, >>>>> +                   struct page **pagepool); >>>>>    #endif >>>>> diff --git a/fs/erofs/decompressor.c b/fs/erofs/decompressor.c >>>>> index cfad1eac7fd9..332ec5f74002 100644 >>>>> --- a/fs/erofs/decompressor.c >>>>> +++ b/fs/erofs/decompressor.c >>>>> @@ -379,4 +379,10 @@ const struct z_erofs_decompressor erofs_decompressors[] = { >>>>>            .name = "lzma" >>>>>        }, >>>>>    #endif >>>>> +#ifdef CONFIG_EROFS_FS_ZIP_DEFLATE >>>>> +    [Z_EROFS_COMPRESSION_DEFLATE] = { >>>>> +        .decompress = z_erofs_deflate_decompress, >>>>> +        .name = "deflate" >>>>> +    }, >>>>> +#endif >>>>>    }; >>>>> diff --git a/fs/erofs/decompressor_deflate.c b/fs/erofs/decompressor_deflate.c >>>>> new file mode 100644 >>>>> index 000000000000..c34e29b15465 >>>>> --- /dev/null >>>>> +++ b/fs/erofs/decompressor_deflate.c >>>>> @@ -0,0 +1,250 @@ >>>>> +// SPDX-License-Identifier: GPL-2.0-or-later >>>>> +#include >>>>> +#include >>>>> +#include "compress.h" >>>>> + >>>>> +struct z_erofs_deflate { >>>>> +    struct z_erofs_deflate *next; >>>>> +    struct z_stream_s z; >>>>> +    u8 bounce[PAGE_SIZE]; >>>>> +}; >>>>> + >>>>> +static DEFINE_SPINLOCK(z_erofs_deflate_lock); >>>>> +static unsigned int z_erofs_deflate_nstrms, z_erofs_deflate_avail_strms; >>>>> +static struct z_erofs_deflate *z_erofs_deflate_head; >>>>> +static DECLARE_WAIT_QUEUE_HEAD(z_erofs_deflate_wq); >>>>> + >>>>> +module_param_named(deflate_streams, z_erofs_deflate_nstrms, uint, 0444); >>>>> + >>>>> +void z_erofs_deflate_exit(void) >>>>> +{ >>>>> +    /* there should be no running fs instance */ >>>>> +    while (z_erofs_deflate_avail_strms) { >>>>> +        struct z_erofs_deflate *strm; >>>>> + >>>>> +        spin_lock(&z_erofs_deflate_lock); >>>>> +        strm = z_erofs_deflate_head; >>>>> +        if (!strm) { >>>>> +            spin_unlock(&z_erofs_deflate_lock); >>>>> +            DBG_BUGON(1); >>>>> +            return; >>>>> +        } >>>>> +        z_erofs_deflate_head = NULL; >>>>> +        spin_unlock(&z_erofs_deflate_lock); >>>>> + >>>>> +        while (strm) { >>>>> +            struct z_erofs_deflate *n = strm->next; >>>>> + >>>>> +            vfree(strm->z.workspace); >>>>> +            kfree(strm); >>>>> +            --z_erofs_deflate_avail_strms; >>>>> +            strm = n; >>>>> +        } >>>>> +    } >>>>> +} >>>>> + >>>>> +int __init z_erofs_deflate_init(void) >>>>> +{ >>>>> +    /* by default, use # of possible CPUs instead */ >>>>> +    if (!z_erofs_deflate_nstrms) >>>>> +        z_erofs_deflate_nstrms = num_possible_cpus(); >>>>> + >>>>> +    for (; z_erofs_deflate_avail_strms < z_erofs_deflate_nstrms; >>>>> +         ++z_erofs_deflate_avail_strms) { >>>>> +        struct z_erofs_deflate *strm; >>>>> + >>>>> +        strm = kzalloc(sizeof(*strm), GFP_KERNEL); >>>>> +        if (!strm) >>>>> +            goto out_failed; >>>>> + >>>>> +        /* XXX: in-kernel zlib cannot shrink windowbits currently */ >>>>> +        strm->z.workspace = vmalloc(zlib_inflate_workspacesize()); >>>>> +        if (!strm->z.workspace) >>>> >>>> kfree(strm)? >>> >>> z_erofs_deflate_exit() below will handle this. >> >> strm is not in z_erofs_deflate_head list yet? >> >> Thanks >> >>> >>> Thanks, >>> Gao Xiang >>> >>>> >>>> Thanks, >>>> >>>>> +            goto out_failed; >>>>> + >>>>> +        spin_lock(&z_erofs_deflate_lock); >>>>> +        strm->next = z_erofs_deflate_head; >>>>> +        z_erofs_deflate_head = strm; > > z_erofs_deflate_head will be added in each turn here. Or you are right, thanks for pointing out, I will make a version soon. Thanks, Gao Xiang