Received: by 2002:a05:6359:c8b:b0:c7:702f:21d4 with SMTP id go11csp2852443rwb; Thu, 29 Sep 2022 16:21:58 -0700 (PDT) X-Google-Smtp-Source: AMsMyM5+ppP25tb+lvJLH5zU9vC/OdG15zY/IJSYdl/XglxaYmH/YZzbEhdZEdAcew+kIvBEjG4B X-Received: by 2002:a17:907:a05:b0:77b:b538:6476 with SMTP id bb5-20020a1709070a0500b0077bb5386476mr4671675ejc.324.1664493718426; Thu, 29 Sep 2022 16:21:58 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1664493718; cv=none; d=google.com; s=arc-20160816; b=Qmr7FJQ4vb3CCPloRlYvagKYuthDTxs148wXwvpo4QaUIxKJu7HiI3dO8CBnzWOdy6 p4exCpEacXmzvKUPCKRH+6RFn9xDG5dgDLP2lCRxW/1HlkqjRJGgACwDXDVDM+cPWlNQ bMgjQmaRziA/uYLZBaTFKhR+a7bDH9HwBDhQgh6NuU4v55PddI6RVxdbOek87zpbN7De bO6ssboAi0uDaG1glDriCX7oYO1b49IbrcMAsjP/1ApskXTboZpP3+MbLrHAsaa6sVK5 Vt66/9R74VAi7/gEg9HEPdn9N6fUPhijJBT8KRi9MthTZqzj7tRGGn/7uFuSu/mKOJby t1Mw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id:dkim-signature; bh=5A605yWUvcnokuIFaCTtLTmVysLWrCy3HcBIa0i0ZOY=; b=IdcB8OXrdZDD2NTSgC2aZyZJnmLsJOMRl9mzF9MwdsneUqBx684qMDw/ljIYGkQubO ks16JOD6xd9qw+j8WYsMJU0EOpQ+2jOJH+OHsTtSVmUwroDotss46zAdQYIB4ebOsOTD X7vtr2zJlCd6piE9UoBVXTNWvUWPkGxsGJEq2lbExuWCan0U2ycIFc/So2CfnFQLBXGQ ovWG2DzcTKsjZGCtoLF+OCxDbNUEuezKVJVrU9vPtCl1z9duCmmOn+klJWBvnD+s1IR6 CTLllJqHfx/trdNHovjVKOsSQznVxxvejpLBDetTYzqUJLMpxWQWzRILBikQQgna9RyP pfLA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=qT532cgR; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id ds9-20020a170907724900b0078331a3b123si483635ejc.572.2022.09.29.16.21.33; Thu, 29 Sep 2022 16:21:58 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=qT532cgR; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229574AbiI2WzB (ORCPT + 99 others); Thu, 29 Sep 2022 18:55:01 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:44328 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229488AbiI2Wy5 (ORCPT ); Thu, 29 Sep 2022 18:54:57 -0400 Received: from ams.source.kernel.org (ams.source.kernel.org [IPv6:2604:1380:4601:e00::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 5EB69481D1 for ; Thu, 29 Sep 2022 15:54:54 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ams.source.kernel.org (Postfix) with ESMTPS id 02665B82593 for ; Thu, 29 Sep 2022 22:54:53 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 7F4E4C433D6; Thu, 29 Sep 2022 22:54:50 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1664492091; bh=GMaZ49KgYVt5T3Kt0EaXcT7oIaHBvjN6sngqQcQj7vQ=; h=Date:Subject:To:Cc:References:From:In-Reply-To:From; b=qT532cgRFgI9PsVwVW8nqOeuEa7Yt186wsGyYDAMvyoxYymsfBgaC4jw2jte0ufca 4woKpmFMCxcIEd0XeO27iJQAKtvTNVy303a3/QiRVANxD4OviOzY3R8+hxd1SXdScF 5FvnhgtM1zS76gakBtcEiD6Mag9+6hyaoLs3oBrRnn0bJ9Vye377oisVO5bnuUWY4A DsTVI4u9iRK3zEx1Rg2ios8NbW3BKdQn/kXFvzi5KWHTLSwNN5HcGLO8yruUc//GNM RwoV0i0O0KINcmH2bILxxSCG50bcQReb8+S15TIO8/hSUFwNCzQK8VJ0mWNwzTOiPw +L1iXHpSw3LfA== Message-ID: <4aca0d00-d3b7-975f-6b72-ccd6f07d22e5@kernel.org> Date: Fri, 30 Sep 2022 06:54:48 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.3.1 Subject: Re: [f2fs-dev] [PATCH v2] f2fs: introduce F2FS_IOC_START_ATOMIC_REPLACE Content-Language: en-US To: Daeho Jeong Cc: linux-kernel@vger.kernel.org, linux-f2fs-devel@lists.sourceforge.net, kernel-team@android.com, Daeho Jeong References: <20220920014036.2295853-1-daeho43@gmail.com> From: Chao Yu In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-11.3 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,NICE_REPLY_A, RCVD_IN_DNSWL_HI,SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2022/9/30 0:13, Daeho Jeong wrote: > On Thu, Sep 29, 2022 at 12:36 AM Chao Yu wrote: >> >> On 2022/9/20 9:40, Daeho Jeong wrote: >>> From: Daeho Jeong >>> >>> introduce a new ioctl to replace the whole content of a file atomically, >>> which means it induces truncate and content update at the same time. >>> We can start it with F2FS_IOC_START_ATOMIC_REPLACE and complete it with >>> F2FS_IOC_COMMIT_ATOMIC_WRITE. Or abort it with >>> F2FS_IOC_ABORT_ATOMIC_WRITE. >>> >>> Signed-off-by: Daeho Jeong >>> --- >>> v2: add undefined ioctl number reported by >>> --- >>> fs/f2fs/data.c | 3 +++ >>> fs/f2fs/f2fs.h | 1 + >>> fs/f2fs/file.c | 12 ++++++++++-- >>> fs/f2fs/segment.c | 14 +++++++++++++- >>> include/uapi/linux/f2fs.h | 1 + >>> 5 files changed, 28 insertions(+), 3 deletions(-) >>> >>> diff --git a/fs/f2fs/data.c b/fs/f2fs/data.c >>> index 6cd29a575105..d3d32db3a25d 100644 >>> --- a/fs/f2fs/data.c >>> +++ b/fs/f2fs/data.c >>> @@ -3438,6 +3438,9 @@ static int prepare_atomic_write_begin(struct f2fs_sb_info *sbi, >>> else if (*blk_addr != NULL_ADDR) >>> return 0; >>> >>> + if (is_inode_flag_set(inode, FI_ATOMIC_REPLACE)) >>> + goto reserve_block; >>> + >>> /* Look for the block in the original inode */ >>> err = __find_data_block(inode, index, &ori_blk_addr); >>> if (err) >>> diff --git a/fs/f2fs/f2fs.h b/fs/f2fs/f2fs.h >>> index 539da7f12cfc..2c49da12d6d8 100644 >>> --- a/fs/f2fs/f2fs.h >>> +++ b/fs/f2fs/f2fs.h >>> @@ -764,6 +764,7 @@ enum { >>> FI_COMPRESS_RELEASED, /* compressed blocks were released */ >>> FI_ALIGNED_WRITE, /* enable aligned write */ >>> FI_COW_FILE, /* indicate COW file */ >>> + FI_ATOMIC_REPLACE, /* indicate atomic replace */ >>> FI_MAX, /* max flag, never be used */ >>> }; >>> >>> diff --git a/fs/f2fs/file.c b/fs/f2fs/file.c >>> index 4f9b80c41b1e..4abd9d2a55b3 100644 >>> --- a/fs/f2fs/file.c >>> +++ b/fs/f2fs/file.c >>> @@ -1982,7 +1982,7 @@ static int f2fs_ioc_getversion(struct file *filp, unsigned long arg) >>> return put_user(inode->i_generation, (int __user *)arg); >>> } >>> >>> -static int f2fs_ioc_start_atomic_write(struct file *filp) >>> +static int f2fs_ioc_start_atomic_write(struct file *filp, bool truncate) >>> { >>> struct inode *inode = file_inode(filp); >>> struct user_namespace *mnt_userns = file_mnt_user_ns(filp); >>> @@ -2051,6 +2051,12 @@ static int f2fs_ioc_start_atomic_write(struct file *filp) >>> >>> isize = i_size_read(inode); >>> fi->original_i_size = isize; >>> + if (truncate) { >>> + set_inode_flag(inode, FI_ATOMIC_REPLACE); >>> + truncate_inode_pages_final(inode->i_mapping); >>> + f2fs_i_size_write(inode, 0); >>> + isize = 0; >>> + } >> >> Hi Daeho, >> >> isize should be updated after tagging inode as atomic_write one? >> otherwise f2fs_mark_inode_dirty_sync() may update isize to inode page, >> latter checkpoint may persist that change? IIUC... >> >> Thanks, > > Hi Chao, > > The first patch of this patchset prevents the inode page from being > updated as dirty for atomic file cases. > Is there any other chances for the inode page to be marked as dirty? I mean: Thread A Thread B - f2fs_ioc_start_atomic_write - f2fs_i_size_write(inode, 0) - f2fs_mark_inode_dirty_sync - checkpoint - persist inode with incorrect zero isize - set_inode_flag(inode, FI_ATOMIC_FILE) Am I missing something? Thanks, > > Thanks, > >> >>> f2fs_i_size_write(fi->cow_inode, isize); >>> >>> spin_lock(&sbi->inode_lock[ATOMIC_FILE]); >>> @@ -4080,7 +4086,9 @@ static long __f2fs_ioctl(struct file *filp, unsigned int cmd, unsigned long arg) >>> case FS_IOC_GETVERSION: >>> return f2fs_ioc_getversion(filp, arg); >>> case F2FS_IOC_START_ATOMIC_WRITE: >>> - return f2fs_ioc_start_atomic_write(filp); >>> + return f2fs_ioc_start_atomic_write(filp, false); >>> + case F2FS_IOC_START_ATOMIC_REPLACE: >>> + return f2fs_ioc_start_atomic_write(filp, true); >>> case F2FS_IOC_COMMIT_ATOMIC_WRITE: >>> return f2fs_ioc_commit_atomic_write(filp); >>> case F2FS_IOC_ABORT_ATOMIC_WRITE: >>> diff --git a/fs/f2fs/segment.c b/fs/f2fs/segment.c >>> index 143b7ea0fb8e..c524538a9013 100644 >>> --- a/fs/f2fs/segment.c >>> +++ b/fs/f2fs/segment.c >>> @@ -263,14 +263,26 @@ static void __complete_revoke_list(struct inode *inode, struct list_head *head, >>> bool revoke) >>> { >>> struct revoke_entry *cur, *tmp; >>> + pgoff_t start_index = 0; >>> + bool truncate = is_inode_flag_set(inode, FI_ATOMIC_REPLACE); >>> >>> list_for_each_entry_safe(cur, tmp, head, list) { >>> - if (revoke) >>> + if (revoke) { >>> __replace_atomic_write_block(inode, cur->index, >>> cur->old_addr, NULL, true); >>> + } else if (truncate) { >>> + f2fs_truncate_hole(inode, start_index, cur->index); >>> + start_index = cur->index + 1; >>> + } >>> + >>> list_del(&cur->list); >>> kmem_cache_free(revoke_entry_slab, cur); >>> } >>> + >>> + if (!revoke && truncate) { >>> + f2fs_do_truncate_blocks(inode, start_index * PAGE_SIZE, false); >>> + clear_inode_flag(inode, FI_ATOMIC_REPLACE); >>> + } >>> } >>> >>> static int __f2fs_commit_atomic_write(struct inode *inode) >>> diff --git a/include/uapi/linux/f2fs.h b/include/uapi/linux/f2fs.h >>> index 3121d127d5aa..955d440be104 100644 >>> --- a/include/uapi/linux/f2fs.h >>> +++ b/include/uapi/linux/f2fs.h >>> @@ -42,6 +42,7 @@ >>> struct f2fs_comp_option) >>> #define F2FS_IOC_DECOMPRESS_FILE _IO(F2FS_IOCTL_MAGIC, 23) >>> #define F2FS_IOC_COMPRESS_FILE _IO(F2FS_IOCTL_MAGIC, 24) >>> +#define F2FS_IOC_START_ATOMIC_REPLACE _IO(F2FS_IOCTL_MAGIC, 25) >>> >>> /* >>> * should be same as XFS_IOC_GOINGDOWN.