Received: by 2002:ab2:710b:0:b0:1ef:a325:1205 with SMTP id z11csp500686lql; Mon, 11 Mar 2024 08:48:40 -0700 (PDT) X-Forwarded-Encrypted: i=3; AJvYcCWAGYk8S/jCkv2INZpkKbXaGN9eaYm7RdCfVhWW4kTXcbmfgCiAR0mhBlBbChRucIX4qJQEXM+4I+TXAWSRGVJ794Re/A6ONdCb1JhZTQ== X-Google-Smtp-Source: AGHT+IG4l13LfZLE6Oa2vI1k1WTCYKc4SWDv3YIZOdnQO1Lr7nnmJFsRpgONfrVEIUDOHvkP6hHR X-Received: by 2002:a17:907:a091:b0:a45:f24a:7df8 with SMTP id hu17-20020a170907a09100b00a45f24a7df8mr4975601ejc.27.1710172120457; Mon, 11 Mar 2024 08:48:40 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1710172120; cv=pass; d=google.com; s=arc-20160816; b=Spv8o0ZzEMs8zNmpybswJxS6sHxfx0yZ9py6K156NuDqbtaaozrYpZKXTKbvFWqDLZ yeZ7ek0xqMdW8YdBy10shWuH07UozpUCgnQzHwS6O2nw4GEh6qawYIHUjgbcl699yqlr YVQs0NduEzvGyPehH678I3CBKNQjk7hpi+zcGNeN8DRe70Sp8VsBOfHavSFcwIJ+QxyW FBud/gfblhgTq/7Ku7lO9epMV+sfCC90i+bBlBFD3ihYQ0IIpBj1rV1WOMd2Qr3zvX0s 7mpmEMY+v7MPjlUWUyVj2sWIjMDHxaEI6awH7zXfIZhSaS6Yq4tvZBDZCVZhp+z9VvLT YH/g== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=in-reply-to:content-disposition:mime-version:list-unsubscribe :list-subscribe:list-id:precedence:references:message-id:subject:cc :to:from:date:dkim-signature; bh=kfc5DoKdWEwZAwnKyRtmW5p3BuDd2UagxMhKEzX7pG0=; fh=iF7TKcfyCKMcEoyn3ycxUxdPqyR/HBZimjCvuok7M6s=; b=A/UJ4XVA+vY5KRCOkTRlddA6LH5XWGTbRyWHatB3b30774DrSWHZf8iTsXew6kdm6m AsiZ69pg9MV4SS3cmkwBwBc6L9qivtQ4i1SL3SBwNy3m584ysld2e0RdhaxL8rFSLroL ZqL8EEFSCJhSjfHB77uFKZn46vAlmveGCQaNZ6L/0jZWGNj4hNBSZ7OFzQUy24SpRHDQ sDNLR13b3/e5RNehxFoao70yJPehs6t2PG9saIgwRVfFgUKYniRzVGt/krB5FK5y8mZl milX6IRlokZoc/PkiJ3ceZ4jZFmMsihcZNSu9CvE62rbl51u4v+wras/qbdDQ3d4yRYr r5Qg==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b="tNh/kVgH"; arc=pass (i=1 dkim=pass dkdomain=kernel.org); spf=pass (google.com: domain of linux-kernel+bounces-99154-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.80.249 as permitted sender) smtp.mailfrom="linux-kernel+bounces-99154-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from am.mirrors.kernel.org (am.mirrors.kernel.org. [147.75.80.249]) by mx.google.com with ESMTPS id i24-20020a170906115800b00a44e1f65c37si2524099eja.230.2024.03.11.08.48.40 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 11 Mar 2024 08:48:40 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel+bounces-99154-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.80.249 as permitted sender) client-ip=147.75.80.249; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b="tNh/kVgH"; arc=pass (i=1 dkim=pass dkdomain=kernel.org); spf=pass (google.com: domain of linux-kernel+bounces-99154-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.80.249 as permitted sender) smtp.mailfrom="linux-kernel+bounces-99154-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by am.mirrors.kernel.org (Postfix) with ESMTPS id 0E6551F225C1 for ; Mon, 11 Mar 2024 15:48:40 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 4FBDD44C8D; Mon, 11 Mar 2024 15:48:31 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b="tNh/kVgH" Received: from smtp.kernel.org (aws-us-west-2-korg-mail-1.web.codeaurora.org [10.30.226.201]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 57A294436E; Mon, 11 Mar 2024 15:48:29 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=10.30.226.201 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1710172110; cv=none; b=ijXn0KIsTAxMJNS1UlYdTKD0Y/1GZoghhl0VPta58MxwG+7SvzwLsyzdmkzkS+9SpayWBTu6oI6TRFOIe4Zr2oZs+OiVd1sv5jQx0KXBLxOTjAUGut/fBqHSMPoM4gnNxWolVt77YrxK+CmaKggNDHQs8dQ7SQGkrjnmJmu92CA= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1710172110; c=relaxed/simple; bh=RUD9l8K0ik+qMLD3iJPUNnrvnQ/fBcf65l1dnsvJdxM=; h=Date:From:To:Cc:Subject:Message-ID:References:MIME-Version: Content-Type:Content-Disposition:In-Reply-To; b=XzpA99U0CMMPuYQB4WV66BlEfrODBM3GGiEOxTLvqEQRriuOffGrWAwWPt++O47m0KayCaO1obhXpR1aThrCTy86k9wuvoxajhfNvDdokd5/VoukGPoNdVuTP/jFYSU9A3uMGpbwv+4ekj1t8MY97L2X4RgXh3EKs6rwHar6wpI= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b=tNh/kVgH; arc=none smtp.client-ip=10.30.226.201 Received: by smtp.kernel.org (Postfix) with ESMTPSA id BD5BAC433C7; Mon, 11 Mar 2024 15:48:29 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1710172109; bh=RUD9l8K0ik+qMLD3iJPUNnrvnQ/fBcf65l1dnsvJdxM=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=tNh/kVgHoGI2BUMioMJwQKAz9a5IxZARg3igTcSTnIZHFAyH8kN//COrjU0tW+NUx 6yMwSbFxCBbX1+t9KNh0+akuqS4Q4e+tNvOwrf8OQdVkpeV/J2MhgqfreMQ3YWvIDI 5iMZQoIiIfOj8kxn501ol9FtrcWQh0n0p2kZwe3ibY43fko6d3Hz+gcQlI8SIwL7NA z+kaPQCg/RoLImbDMgmqHW0VveS1kJgQM4NMfXQ+IgJr2Tjs3cxrZz4ZzdPKIKsPt2 ipto79hw9MqAJAz+dfTmzkTWMHkVgllhSUb+/Y7Uan6E0zuKcalRie3SFTfPT8Z02G /CaAz4/Fp2nVg== Date: Mon, 11 Mar 2024 08:48:29 -0700 From: "Darrick J. Wong" To: Zhang Yi Cc: linux-xfs@vger.kernel.org, linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, hch@infradead.org, brauner@kernel.org, david@fromorbit.com, tytso@mit.edu, jack@suse.cz, yi.zhang@huawei.com, chengzhihao1@huawei.com, yukuai3@huawei.com Subject: Re: [PATCH 3/4] iomap: don't increase i_size if it's not a write operation Message-ID: <20240311154829.GU1927156@frogsfrogsfrogs> References: <20240311122255.2637311-1-yi.zhang@huaweicloud.com> <20240311122255.2637311-4-yi.zhang@huaweicloud.com> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20240311122255.2637311-4-yi.zhang@huaweicloud.com> On Mon, Mar 11, 2024 at 08:22:54PM +0800, Zhang Yi wrote: > From: Zhang Yi > > Increase i_size in iomap_zero_range() and iomap_unshare_iter() is not > needed, the caller should handle it. Especially, when truncate partial > block, we could not increase i_size beyond the new EOF here. It doesn't > affect xfs and gfs2 now because they set the new file size after zero > out, it doesn't matter that a transient increase in i_size, but it will > affect ext4 because it set file size before truncate. > At the same time, > iomap_write_failed() is also not needed for above two cases too, so > factor them out and move them to iomap_write_iter() and > iomap_zero_iter(). This change should be a separate patch with its own justification. Which is, AFAICT, something along the lines of: "Unsharing and zeroing can only happen within EOF, so there is never a need to perform posteof pagecache truncation if write begin fails." > Signed-off-by: Zhang Yi Doesn't this patch fix a bug in ext4? > --- > fs/iomap/buffered-io.c | 59 +++++++++++++++++++++--------------------- > 1 file changed, 30 insertions(+), 29 deletions(-) > > diff --git a/fs/iomap/buffered-io.c b/fs/iomap/buffered-io.c > index 093c4515b22a..19f91324c690 100644 > --- a/fs/iomap/buffered-io.c > +++ b/fs/iomap/buffered-io.c > @@ -786,7 +786,6 @@ static int iomap_write_begin(struct iomap_iter *iter, loff_t pos, > > out_unlock: > __iomap_put_folio(iter, pos, 0, folio); > - iomap_write_failed(iter->inode, pos, len); > > return status; > } > @@ -838,34 +837,13 @@ static size_t iomap_write_end(struct iomap_iter *iter, loff_t pos, size_t len, > size_t copied, struct folio *folio) > { > const struct iomap *srcmap = iomap_iter_srcmap(iter); > - loff_t old_size = iter->inode->i_size; > - size_t ret; > - > - if (srcmap->type == IOMAP_INLINE) { > - ret = iomap_write_end_inline(iter, folio, pos, copied); > - } else if (srcmap->flags & IOMAP_F_BUFFER_HEAD) { > - ret = block_write_end(NULL, iter->inode->i_mapping, pos, len, > - copied, &folio->page, NULL); > - } else { > - ret = __iomap_write_end(iter->inode, pos, len, copied, folio); > - } > > - /* > - * Update the in-memory inode size after copying the data into the page > - * cache. It's up to the file system to write the updated size to disk, > - * preferably after I/O completion so that no stale data is exposed. > - */ > - if (pos + ret > old_size) { > - i_size_write(iter->inode, pos + ret); > - iter->iomap.flags |= IOMAP_F_SIZE_CHANGED; > - } > - __iomap_put_folio(iter, pos, ret, folio); > - > - if (old_size < pos) > - pagecache_isize_extended(iter->inode, old_size, pos); > - if (ret < len) > - iomap_write_failed(iter->inode, pos + ret, len - ret); > - return ret; > + if (srcmap->type == IOMAP_INLINE) > + return iomap_write_end_inline(iter, folio, pos, copied); > + if (srcmap->flags & IOMAP_F_BUFFER_HEAD) > + return block_write_end(NULL, iter->inode->i_mapping, pos, len, > + copied, &folio->page, NULL); > + return __iomap_write_end(iter->inode, pos, len, copied, folio); > } > > static loff_t iomap_write_iter(struct iomap_iter *iter, struct iov_iter *i) > @@ -880,6 +858,7 @@ static loff_t iomap_write_iter(struct iomap_iter *iter, struct iov_iter *i) > > do { > struct folio *folio; > + loff_t old_size; > size_t offset; /* Offset into folio */ > size_t bytes; /* Bytes to write to folio */ > size_t copied; /* Bytes copied from user */ > @@ -912,8 +891,10 @@ static loff_t iomap_write_iter(struct iomap_iter *iter, struct iov_iter *i) > } > > status = iomap_write_begin(iter, pos, bytes, &folio); > - if (unlikely(status)) > + if (unlikely(status)) { > + iomap_write_failed(iter->inode, pos, bytes); > break; > + } > if (iter->iomap.flags & IOMAP_F_STALE) > break; > > @@ -927,6 +908,24 @@ static loff_t iomap_write_iter(struct iomap_iter *iter, struct iov_iter *i) > copied = copy_folio_from_iter_atomic(folio, offset, bytes, i); > status = iomap_write_end(iter, pos, bytes, copied, folio); > > + /* > + * Update the in-memory inode size after copying the data into > + * the page cache. It's up to the file system to write the > + * updated size to disk, preferably after I/O completion so that > + * no stale data is exposed. > + */ > + old_size = iter->inode->i_size; > + if (pos + status > old_size) { > + i_size_write(iter->inode, pos + status); > + iter->iomap.flags |= IOMAP_F_SIZE_CHANGED; > + } > + __iomap_put_folio(iter, pos, status, folio); Why is it necessary to hoist the __iomap_put_folio calls from iomap_write_end into iomap_write_iter, iomap_unshare_iter, and iomap_zero_iter? None of those functions seem to use it, and it makes more sense to me that iomap_write_end releases the folio that iomap_write_begin returned. --D > + > + if (old_size < pos) > + pagecache_isize_extended(iter->inode, old_size, pos); > + if (status < bytes) > + iomap_write_failed(iter->inode, pos + status, > + bytes - status); > if (unlikely(copied != status)) > iov_iter_revert(i, copied - status); > > @@ -1296,6 +1295,7 @@ static loff_t iomap_unshare_iter(struct iomap_iter *iter) > bytes = folio_size(folio) - offset; > > bytes = iomap_write_end(iter, pos, bytes, bytes, folio); > + __iomap_put_folio(iter, pos, bytes, folio); > if (WARN_ON_ONCE(bytes == 0)) > return -EIO; > > @@ -1360,6 +1360,7 @@ static loff_t iomap_zero_iter(struct iomap_iter *iter, bool *did_zero) > folio_mark_accessed(folio); > > bytes = iomap_write_end(iter, pos, bytes, bytes, folio); > + __iomap_put_folio(iter, pos, bytes, folio); > if (WARN_ON_ONCE(bytes == 0)) > return -EIO; > > -- > 2.39.2 > >