Received: by 2002:a6b:500f:0:0:0:0:0 with SMTP id e15csp952492iob; Fri, 13 May 2022 17:32:48 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxFc0YSKzSUagF1WTEl/KcwReaJtJnMeVYk6P3+Sl8l/EISLVTVKWNm7hQZGO8ceWhXADKh X-Received: by 2002:a5d:6d04:0:b0:20c:52de:9ce4 with SMTP id e4-20020a5d6d04000000b0020c52de9ce4mr6166626wrq.572.1652488368375; Fri, 13 May 2022 17:32:48 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1652488368; cv=none; d=google.com; s=arc-20160816; b=kDq++9ebdXIyiZtilcGshFmOiDk8HMutGwwfnAkh5FYQp2FsnArMF61cDNZDTmO9Gu 0Yijhwvt2aJSBiqiglKC49i/nlCbjmqED8tnbP7LDI6NMoWnGc6lSvSbO4+cbomJZuHs yeVOvY7tAcHt9qXYnR7CjrC0iYaQMZrUfP6SVhCyoJR+B3Mue/SFF9gkQ2skBUkqwVGZ Gk77t/PZrZAOdwGtjSBAqK4PHwC49wITjCPXEiGhhFCWs2un8Tgrs/orbqaAsEvRkXZt LJoOYaEiYEhmV8l7gNh1+XVhSB2j1GrNPforiYpW0bA6PyUhIkzRQganeel+uzzVFSfo KSFQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id:dkim-signature; bh=t7MVnkthwOMUtxUG7u6NwMn/z03/xnF2WGs+kPzPugU=; b=lc1jKJjdxWK/mFoF2dKVSjHPSa0WYEA2plbZyYhFReMk/vseAOAnbrtnXq6Pw2b7ui 4eN0N+kAoi443Gs4oJTCP/vZFr/W5fCF5mLcimSI0nHGVpC6Ca6tjpelm+H9A9dDfKjB 7yE858cjVCFfB+1k7KKfduskoJSPUMFy4gtf/TeyOxJav4gloI4BNBB816t2UdYLuNAS KJHljWY1DUb81n8LkOoHh/hfD2wyrJCLv9qYshj0VO+K1ZR5ZD2JHpKEk0eBLMyR/otZ cZDMVQ5cr2chM1D01OnKEQTTa+IDJtBoEs8zXzZIhom7EcNh6+qOIDBPkM0OsmwUVdiA NBhQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=IuiJthlP; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [23.128.96.19]) by mx.google.com with ESMTPS id f13-20020a05600c4e8d00b0039457411dafsi4814372wmq.51.2022.05.13.17.32.47 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 13 May 2022 17:32:48 -0700 (PDT) Received-SPF: softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) client-ip=23.128.96.19; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=IuiJthlP; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 3ED1E30695E; Fri, 13 May 2022 16:24:39 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1376695AbiEMDU1 (ORCPT + 99 others); Thu, 12 May 2022 23:20:27 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:35462 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1376701AbiEMDUZ (ORCPT ); Thu, 12 May 2022 23:20:25 -0400 Received: from dfw.source.kernel.org (dfw.source.kernel.org [IPv6:2604:1380:4641:c500::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 1CDE628FE87 for ; Thu, 12 May 2022 20:20:24 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id 7DCB86006F for ; Fri, 13 May 2022 03:20:23 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id BE96BC34114; Fri, 13 May 2022 03:20:21 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1652412022; bh=hdMV5sPfTunSBVCwaFuDJWydnVmvnF5pcYL4ULj4sKM=; h=Date:Subject:To:Cc:References:From:In-Reply-To:From; b=IuiJthlPFvC30uiUWY5IFhJjePaSVjBThf2n0N7YYPToe3kUTyEZs37sjVkX8OLH8 /gVYsRLvPCsHT961rv+kbdkopwtQ28zhMqmeOyCakCoPubmB5UHt6//I+On8zZ0iKA y63aWmAqVxN5LjZ+IhMuNwrYquYRX5R+rW4t8O/HSz7Wg7toVcmYODXm4KavK/R5ta 08cY0H038rLW1zvilm1Xu48oqpOf+PmD3y7Eh1mgg1uH8C1pQEgXiTI0UJRYdKMd2n pD4pL8QQO9v/sNAivlgHCedbMEKV86iq2o2P+KeAXegLV+JftuqM7y+3sbXscQfMS3 akdb8Q1V3A4rw== Message-ID: <59d77fcb-bc82-c1d2-0a71-eef3b0140506@kernel.org> Date: Fri, 13 May 2022 11:20:18 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Thunderbird/91.9.0 Subject: Re: [PATCH] f2fs: separate NOCoW and pinfile semantics Content-Language: en-US To: Jaegeuk Kim Cc: linux-f2fs-devel@lists.sourceforge.net, linux-kernel@vger.kernel.org, Chao Yu References: <20220512082116.2991611-1-chao@kernel.org> From: Chao Yu In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-5.8 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,MAILING_LIST_MULTI, NICE_REPLY_A,RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2022/5/13 5:57, Jaegeuk Kim wrote: > On 05/12, Chao Yu wrote: >> Pinning a file is heavy, because skipping pinned files make GC >> running with heavy load or no effect. >> >> So that this patch proposes to separate nocow and pinfile semantics: >> - NOCoW flag can only be set on regular file. >> - NOCoW file will only trigger IPU at common writeback/flush. >> - NOCow file will do OPU during GC. >> >> This flag can satisfying the demand of: >> 1) avoiding fragment of file's physical block >> 2) userspace doesn't want to pin file's physical address >> >> Signed-off-by: Chao Yu >> --- >> fs/f2fs/data.c | 3 ++- >> fs/f2fs/f2fs.h | 1 + >> fs/f2fs/file.c | 25 ++++++++++++++++++++++++- >> 3 files changed, 27 insertions(+), 2 deletions(-) >> >> diff --git a/fs/f2fs/data.c b/fs/f2fs/data.c >> index 54a7a8ad994d..c8eab78f7d89 100644 >> --- a/fs/f2fs/data.c >> +++ b/fs/f2fs/data.c >> @@ -2495,7 +2495,8 @@ bool f2fs_should_update_inplace(struct inode *inode, struct f2fs_io_info *fio) >> if (is_inode_flag_set(inode, FI_ALIGNED_WRITE)) >> return false; >> >> - if (f2fs_is_pinned_file(inode)) >> + if (f2fs_is_pinned_file(inode) || >> + F2FS_I(inode)->i_flags & F2FS_NOCOW_FL) >> return true; >> >> /* if this is cold file, we should overwrite to avoid fragmentation */ >> diff --git a/fs/f2fs/f2fs.h b/fs/f2fs/f2fs.h >> index 492af5b96de1..e91ece55f5e8 100644 >> --- a/fs/f2fs/f2fs.h >> +++ b/fs/f2fs/f2fs.h >> @@ -2916,6 +2916,7 @@ static inline void f2fs_change_bit(unsigned int nr, char *addr) >> #define F2FS_NOCOMP_FL 0x00000400 /* Don't compress */ >> #define F2FS_INDEX_FL 0x00001000 /* hash-indexed directory */ >> #define F2FS_DIRSYNC_FL 0x00010000 /* dirsync behaviour (directories only) */ >> +#define F2FS_NOCOW_FL 0x00800000 /* Do not cow file */ >> #define F2FS_PROJINHERIT_FL 0x20000000 /* Create with parents projid */ >> #define F2FS_CASEFOLD_FL 0x40000000 /* Casefolded file */ >> >> diff --git a/fs/f2fs/file.c b/fs/f2fs/file.c >> index 09287876dbb7..7f92a3a157f7 100644 >> --- a/fs/f2fs/file.c >> +++ b/fs/f2fs/file.c >> @@ -1851,6 +1851,20 @@ static int f2fs_setflags_common(struct inode *inode, u32 iflags, u32 mask) >> if (IS_NOQUOTA(inode)) >> return -EPERM; >> >> + if ((iflags ^ masked_flags) & F2FS_NOCOW_FL) { >> + int ret; >> + >> + if (!S_ISREG(inode->i_mode)) >> + return -EINVAL; >> + if (f2fs_should_update_outplace(inode, NULL)) >> + return -EINVAL; >> + if (f2fs_is_pinned_file(inode)) >> + return -EINVAL; >> + ret = f2fs_convert_inline_inode(inode); >> + if (ret) >> + return ret; >> + } >> + >> if ((iflags ^ masked_flags) & F2FS_CASEFOLD_FL) { >> if (!f2fs_sb_has_casefold(F2FS_I_SB(inode))) >> return -EOPNOTSUPP; >> @@ -1926,6 +1940,7 @@ static const struct { >> { F2FS_NOCOMP_FL, FS_NOCOMP_FL }, >> { F2FS_INDEX_FL, FS_INDEX_FL }, >> { F2FS_DIRSYNC_FL, FS_DIRSYNC_FL }, >> + { F2FS_NOCOW_FL, FS_NOCOW_FL }, >> { F2FS_PROJINHERIT_FL, FS_PROJINHERIT_FL }, >> { F2FS_CASEFOLD_FL, FS_CASEFOLD_FL }, >> }; >> @@ -1957,7 +1972,8 @@ static const struct { >> FS_NOCOMP_FL | \ >> FS_DIRSYNC_FL | \ >> FS_PROJINHERIT_FL | \ >> - FS_CASEFOLD_FL) >> + FS_CASEFOLD_FL | \ >> + FS_NOCOW_FL) >> >> /* Convert f2fs on-disk i_flags to FS_IOC_{GET,SET}FLAGS flags */ >> static inline u32 f2fs_iflags_to_fsflags(u32 iflags) >> @@ -3081,6 +3097,13 @@ static int f2fs_ioc_set_pin_file(struct file *filp, unsigned long arg) >> >> inode_lock(inode); >> >> + if (F2FS_I(inode)->i_flags & F2FS_NOCOW_FL) { >> + f2fs_info(F2FS_I_SB(inode), "inode (%lu) is already NOCOW one", >> + inode->i_ino); >> + ret = -EINVAL; > > Why rejecting this? We can pin the file to get 2MB-aligned allocation on the > NOCOW file. I try to separate these two flag completely, but, seems it can't, because after commit 5d539245cb18 ("f2fs: export FS_NOCOW_FL flag to user"), these two flags have already been twined closely.... @@ -1651,6 +1651,8 @@ static int f2fs_ioc_getflags(struct file *filp, unsigned long arg) flags |= F2FS_ENCRYPT_FL; if (f2fs_has_inline_data(inode) || f2fs_has_inline_dentry(inode)) flags |= F2FS_INLINE_DATA_FL; + if (is_inode_flag_set(inode, FI_PIN_FILE)) + flags |= F2FS_NOCOW_FL; How about: f2fs_ioc_set_pin_file/f2fs_fileattr_set logic: pinfile nocow set set pinfile | nocow set nocow clear clear pinfile | nocow clear nocow Behaviors: w/ pinfile, w/ nocow: use pinfile semantics w/ pinfile, w/o nocow: use pinfile semantics w/o pinfile, w/ nocow: use nocow semantics w/o pinfile, w/o nocow: no pinfile or nocow semantics Thanks, > >> + goto out; >> + } >> + >> if (!pin) { >> clear_inode_flag(inode, FI_PIN_FILE); >> f2fs_i_gc_failures_write(inode, 0); >> -- >> 2.25.1