Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 42514C54EED for ; Wed, 25 Jan 2023 10:05:46 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235487AbjAYKFo (ORCPT ); Wed, 25 Jan 2023 05:05:44 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:48372 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233829AbjAYKFm (ORCPT ); Wed, 25 Jan 2023 05:05:42 -0500 Received: from out30-124.freemail.mail.aliyun.com (out30-124.freemail.mail.aliyun.com [115.124.30.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 1963C12F38; Wed, 25 Jan 2023 02:05:39 -0800 (PST) X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R431e4;CH=green;DM=||false|;DS=||;FP=0|-1|-1|-1|0|-1|-1|-1;HT=ay29a033018046049;MF=hsiangkao@linux.alibaba.com;NM=1;PH=DS;RN=10;SR=0;TI=SMTPD_---0Va9dOlL_1674641132; Received: from 172.20.10.3(mailfrom:hsiangkao@linux.alibaba.com fp:SMTPD_---0Va9dOlL_1674641132) by smtp.aliyun-inc.com; Wed, 25 Jan 2023 18:05:35 +0800 Message-ID: Date: Wed, 25 Jan 2023 18:05:30 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:102.0) Gecko/20100101 Thunderbird/102.6.1 Subject: Re: [PATCH v3 0/6] Composefs: an opportunistically sharing verified image filesystem To: Alexander Larsson , Amir Goldstein Cc: linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, gscrivan@redhat.com, david@fromorbit.com, brauner@kernel.org, viro@zeniv.linux.org.uk, Vivek Goyal , Miklos Szeredi References: <1ea88c8d1e666b85342374ed7c0ddf7d661e0ee1.camel@redhat.com> <5fb32a1297821040edd8c19ce796fc0540101653.camel@redhat.com> From: Gao Xiang In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2023/1/25 17:37, Alexander Larsson wrote: > On Tue, 2023-01-24 at 21:06 +0200, Amir Goldstein wrote: >> On Tue, Jan 24, 2023 at 3:13 PM Alexander Larsson ... >>> >>> They are all strictly worse than squashfs in the above testing. >>> >> >> It's interesting to know why and if an optimized mkfs.erofs >> mkfs.ext4 would have done any improvement. > > Even the non-loopback mounted (direct xfs backed) version performed > worse than the squashfs one. I'm sure a erofs with sparse files would > do better due to a more compact file, but I don't really see how it > would perform significantly different than the squashfs code. Yes, > squashfs lookup is linear in directory length, while erofs is log(n), > but the directories are not so huge that this would dominate the > runtime. > > To get an estimate of this I made a broken version of the erofs image, > where the metacopy files are actually 0 byte size rather than sparse. > This made the erofs file 18M instead, and gained 10% in the cold cache > case. This, while good, is not near enough to matter compared to the > others. > > I don't think the base performance here is really much dependent on the > backing filesystem. An ls -lR workload is just a measurement of the > actual (i.e. non-dcache) performance of the filesystem implementation > of lookup and iterate, and overlayfs just has more work to do here, > especially in terms of the amount of i/o needed. I will form a formal mkfs.erofs version in one or two days since we're cerebrating Lunar New year now. Since you don't have more I/O traces for analysis, I have to do another wild guess. Could you help benchmark your v2 too? I'm not sure if such performance also exists in v2. The reason why I guess as this is that it seems that you read all dir inode pages when doing the first lookup, it can benefit to seq dir access. I'm not sure if EROFS can make a similar number by doing forcing readahead on dirs to read all dir data at once as well. Apart from that I don't see significant difference, at least personally I'd like to know where it could have such huge difference. I don't think that is all because of read-only on-disk format differnce. Thanks, Gao Xiang