Received: by 2002:a05:6a10:1287:0:0:0:0 with SMTP id d7csp3175907pxv; Sun, 25 Jul 2021 19:37:32 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzgOFU+XA3NdT9HlcWB0c8hC6kNFfC3ET9vXWOXx/oHanKx8mRNJIZVzkFwLJr/mnAB4wbu X-Received: by 2002:a5e:a908:: with SMTP id c8mr12943494iod.116.1627267051992; Sun, 25 Jul 2021 19:37:31 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1627267051; cv=none; d=google.com; s=arc-20160816; b=PSV9lr1pY+M2a77jPEn3f8Cp/do0GIHGo//I1KlEBBsKvp41rUq55K1KSke9mJXJ87 KMWliqUCUm/Q6BQZlKZGXrZ5l3X+yS1GC1Cj/wDtWo1AA9FB/xmvkeJMFSsPRx8blWOv N3hBjOSKA4uhDB8yXylTMTC+HU1baRB1REahfhXUHsYx+vZylZw3qPOV23HbxbmYXzRV Rs/1vN+1vfKDToxyzf3HcwPYpaHmlNKPBkiZGa88JhL1umup4XteEo+Gi3+jXZPxM1uF Yud3AG8BxQcv/BqVP4m4F2CsSOl6wnbAU1e4gwdp4SDKVNmyKIeAvUk0UK2xS/k3tyo5 YpNw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:mail-followup-to:message-id:subject:cc:to:from:date; bh=AqV1yTT1DfJD/FWq+t/s0wXQtDSj17FvB22tqHJNUgs=; b=AGRK+J1ek6E25Xf+8U5RRgN4hDo6hfDzFTukAuiZIuuADbYD0q9rjstsHu47hdjmpG 2f4mK7HUFh+ANPOBPCE1rvp5MUCtGnuN5Jgcv4NvcS2JvkavWyKHbW32C2yFHw1ydLUb YRyqmbTqF3boBGSI5B8dVGxkg2lQpXlrrW2XZx7NNfTJAf9JcTbhFi4m8O1w1ObXWWfU rF457xxEk54290PxOvWV2maMe0C35keYVOKxMA6RMdvsgNJ9fjic6/W7CF8WHuoqdc2b Rv3teJUV2j843vu8clFHPlajG0HHqBAaRfQa6Pe2b4vk1QNUgeOhudnVHJ+Kk6e+ZcxJ JctQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id y7si20602006ila.59.2021.07.25.19.37.20; Sun, 25 Jul 2021 19:37:31 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231476AbhGZBzk (ORCPT + 99 others); Sun, 25 Jul 2021 21:55:40 -0400 Received: from out30-56.freemail.mail.aliyun.com ([115.124.30.56]:59531 "EHLO out30-56.freemail.mail.aliyun.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231280AbhGZBzj (ORCPT ); Sun, 25 Jul 2021 21:55:39 -0400 X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R101e4;CH=green;DM=||false|;DS=||;FP=0|-1|-1|-1|0|-1|-1|-1;HT=e01e01424;MF=hsiangkao@linux.alibaba.com;NM=1;PH=DS;RN=9;SR=0;TI=SMTPD_---0UgvfLav_1627266964; Received: from B-P7TQMD6M-0146.local(mailfrom:hsiangkao@linux.alibaba.com fp:SMTPD_---0UgvfLav_1627266964) by smtp.aliyun-inc.com(127.0.0.1); Mon, 26 Jul 2021 10:36:05 +0800 Date: Mon, 26 Jul 2021 10:36:03 +0800 From: Gao Xiang To: Andreas Gruenbacher Cc: Christoph Hellwig , "Darrick J . Wong" , Matthew Wilcox , Huang Jianan , linux-erofs@lists.ozlabs.org, linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, Andreas Gruenbacher Subject: Re: [PATCH v7] iomap: make inline data support more flexible Message-ID: Mail-Followup-To: Andreas Gruenbacher , Christoph Hellwig , "Darrick J . Wong" , Matthew Wilcox , Huang Jianan , linux-erofs@lists.ozlabs.org, linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, Andreas Gruenbacher References: <20210723174131.180813-1-hsiangkao@linux.alibaba.com> <20210725221639.426565-1-agruenba@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <20210725221639.426565-1-agruenba@redhat.com> Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Jul 26, 2021 at 12:16:39AM +0200, Andreas Gruenbacher wrote: > Here's a fixed and cleaned up version that passes fstests on gfs2. > > I see no reason why the combination of tail packing + writing should > cause any issues, so in my opinion, the check that disables that > combination in iomap_write_begin_inline should still be removed. Since there is no such fs for tail-packing write, I just do a wild guess, for example, 1) the tail-end block was not inlined, so iomap_write_end() dirtied the whole page (or buffer) for the page writeback; 2) then it was truncated into a tail-packing inline block so the last extent(page) became INLINE but dirty instead; 3) during the late page writeback for dirty pages, if (WARN_ON_ONCE(wpc->iomap.type == IOMAP_INLINE)) would be triggered in iomap_writepage_map() for such dirty page. As Matthew pointed out before, https://lore.kernel.org/r/YPrms0fWPwEZGNAL@casper.infradead.org/ currently tail-packing inline won't interact with page writeback, but I'm afraid a supported tail-packing write fs needs to reconsider the whole stuff how page, inode writeback works and what the pattern is with the tail-packing. > > It turns out that returning the number of bytes copied from > iomap_read_inline_data is a bit irritating: the function is really used > for filling the page, but that's not always the "progress" we're looking > for. In the iomap_readpage case, we actually need to advance by an > antire page, but in the iomap_file_buffered_write case, we need to > advance by the length parameter of iomap_write_actor or less. So I've > changed that back. > > I've also renamed iomap_inline_buf to iomap_inline_data and I've turned > iomap_inline_data_size_valid into iomap_within_inline_data, which seems > more useful to me. > > Thanks, > Andreas > > -- > > Subject: [PATCH] iomap: Support tail packing > > The existing inline data support only works for cases where the entire > file is stored as inline data. For larger files, EROFS stores the > initial blocks separately and then can pack a small tail adjacent to the > inode. Generalise inline data to allow for tail packing. Tails may not > cross a page boundary in memory. > > We currently have no filesystems that support tail packing and writing, > so that case is currently disabled (see iomap_write_begin_inline). I'm > not aware of any reason why this code path shouldn't work, however. > > Cc: Christoph Hellwig > Cc: Darrick J. Wong > Cc: Matthew Wilcox > Cc: Andreas Gruenbacher > Tested-by: Huang Jianan # erofs > Signed-off-by: Gao Xiang > --- > fs/iomap/buffered-io.c | 34 +++++++++++++++++++++++----------- > fs/iomap/direct-io.c | 11 ++++++----- > include/linux/iomap.h | 22 +++++++++++++++++++++- > 3 files changed, 50 insertions(+), 17 deletions(-) > > diff --git a/fs/iomap/buffered-io.c b/fs/iomap/buffered-io.c > index 87ccb3438bec..334bf98fdd4a 100644 > --- a/fs/iomap/buffered-io.c > +++ b/fs/iomap/buffered-io.c > @@ -205,25 +205,29 @@ struct iomap_readpage_ctx { > struct readahead_control *rac; > }; > > -static void > -iomap_read_inline_data(struct inode *inode, struct page *page, > +static int iomap_read_inline_data(struct inode *inode, struct page *page, > struct iomap *iomap) > { > - size_t size = i_size_read(inode); > + size_t size = i_size_read(inode) - iomap->offset; I wonder why you use i_size / iomap->offset here, and why you completely ignoring iomap->length field returning by fs. Using i_size here instead of iomap->length seems coupling to me in the beginning (even currently in practice there is some limitation.) Thanks, Gao Xiang