Received: by 10.223.176.5 with SMTP id f5csp2307063wra; Mon, 5 Feb 2018 01:40:40 -0800 (PST) X-Google-Smtp-Source: AH8x227oP8f6hChsae5gGSBHDt3M+I/wlckn+pgvyyIoh9sifND9i/bPhSyFlHyThmehEJMP92NC X-Received: by 2002:a17:902:8a89:: with SMTP id p9-v6mr42434584plo.397.1517823640374; Mon, 05 Feb 2018 01:40:40 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1517823640; cv=none; d=google.com; s=arc-20160816; b=WuCEOQAbiUo4YmH0+kpU1UG6KcXveo5bb+IahOKRj0slC6yG3rBmdXjx5zwQKs5crp pJrg6wt2JOL4cXo7S6qHo6NvsQHFI1ppYi5/vbTb95LsRwGh+tH1VOEfWU+N/75rZTnp 0TR6z+YUn55EfSx46a/EaXuBjwcWncR9WN5LKvMRIZqNIDUPvmGo3DyMUrb4MmI/Qa6h GtsAlKJDwuk4hFMLpYt1+jIeCRm0kEB2lev/shm7EJsPaDwNgO+WEk6h7UvOvO7bDVdc noGS89fcUukJTFlP62jSw6SUnvnoi0sG0skQdl7wXPAXfxG6rHhWktYhtwPXmisVRa7H Q7BQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:arc-authentication-results; bh=T1/61Yo0K2PxxD6Qr4TSv54F5kkAR2Znm5YKWHGhKZI=; b=mg7y525/lMDzfxjLw3HS3IouJ/KcIla9qp9Rmm8J4qDBndpeMRhO2RWihA2y4GHnjZ IIX5EKY44MFJP8vr96lU4B+2MdhC50koxm3U9uFtiOBgMZ2eR1T6LB6HALN/5822DwZo aVyF1UzObSYQF7+H5M1nQ3k+W3tscWhxns79DcsYenrye7DiwufsCoys2W5n/KYjQAfK mtQFpEbp/lK/zJ5adSu9E4sPgvxvK8cd9lTY9y5/5J1L4iazBAuqNy6eMg3vMvSuVP1V NA99kRHuM0DF5RhJVeUZZo+SVhYroyl6wV2wd7cKjxftp3clDcMfDXqe8RPYGTCv9zrG Bl4A== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id v10-v6si4487911plo.820.2018.02.05.01.40.25; Mon, 05 Feb 2018 01:40:40 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752734AbeBEJjE (ORCPT + 99 others); Mon, 5 Feb 2018 04:39:04 -0500 Received: from szxga04-in.huawei.com ([45.249.212.190]:5172 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1751705AbeBEJi6 (ORCPT ); Mon, 5 Feb 2018 04:38:58 -0500 Received: from DGGEMS409-HUB.china.huawei.com (unknown [172.30.72.59]) by Forcepoint Email with ESMTP id 76DE928214770; Mon, 5 Feb 2018 17:38:44 +0800 (CST) Received: from [127.0.0.1] (10.111.220.140) by DGGEMS409-HUB.china.huawei.com (10.3.19.209) with Microsoft SMTP Server id 14.3.361.1; Mon, 5 Feb 2018 17:38:39 +0800 Subject: Re: [PATCH 1/2] f2fs: enable to gc page whose inode already atomic commit To: Chao Yu , Chao Yu , , CC: , , , , , References: <1517626068-49739-1-git-send-email-yunlong.song@huawei.com> <312d70f3-b1ae-9ced-44cb-fde83de362ff@huawei.com> From: Yunlong Song Message-ID: <3182ade9-4153-9e47-f8a5-5c87371a3900@huawei.com> Date: Mon, 5 Feb 2018 17:37:40 +0800 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.2.1 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset="utf-8"; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.111.220.140] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org > OK, details as I explained before: > > atomic_commit GC > - file_write_and_wait_range > - move_data_block > - f2fs_submit_page_write > - f2fs_update_data_blkaddr > - set_page_dirty > - fsync_node_pages > > 1. atomic writes data page #1 & update node #1 > 2. GC data page #2 & update node #2 > 3. page #1 & node #1 & #2 can be committed into nand flash before page #2 be > committed. > > After a sudden pow-cut, database transaction will be inconsistent. So I think > there will be better to exclude gc/atomic_write to each other, with a lock > instead of flag checking. > I do not understand why this transaction is inconsistent, is it a problem that page #2 is not committed into nand flash? Since normal gc also has this problem: Suppose that there is db file A, f2fs_gc moves data page #1 of db file A. But if write checkpoint only commit node page #1 and then a sudden power-cut happens. Data page #1 is not committed to nand flash, but node page #1 is committed. Is the db transaction broken and inconsistent? Come back to your example, I think data page 2 of atomic file does not belong to this transaction, so even node page 2 is committed, it is just the same problem as what I have listed above(db file A), and it does not break this transaction. > Thanks, > >>>>> >>>>> So how about just using dio_rwsem[WRITE] during atomic committing to exclude >>>>> GCing data block of atomic opened file? >>>>> >>>>> Thanks, >>>>> >>>>>> >>>>>> Signed-off-by: Yunlong Song >>>>>> --- >>>>>> fs/f2fs/data.c | 5 ++--- >>>>>> fs/f2fs/gc.c | 6 ++++-- >>>>>> 2 files changed, 6 insertions(+), 5 deletions(-) >>>>>> >>>>>> diff --git a/fs/f2fs/data.c b/fs/f2fs/data.c >>>>>> index 7435830..edafcb6 100644 >>>>>> --- a/fs/f2fs/data.c >>>>>> +++ b/fs/f2fs/data.c >>>>>> @@ -1580,14 +1580,13 @@ bool should_update_outplace(struct inode *inode, struct f2fs_io_info *fio) >>>>>> return true; >>>>>> if (S_ISDIR(inode->i_mode)) >>>>>> return true; >>>>>> - if (f2fs_is_atomic_file(inode)) >>>>>> - return true; >>>>>> if (fio) { >>>>>> if (is_cold_data(fio->page)) >>>>>> return true; >>>>>> if (IS_ATOMIC_WRITTEN_PAGE(fio->page)) >>>>>> return true; >>>>>> - } >>>>>> + } else if (f2fs_is_atomic_file(inode)) >>>>>> + return true; >>>>>> return false; >>>>>> } >>>>>> >>>>>> diff --git a/fs/f2fs/gc.c b/fs/f2fs/gc.c >>>>>> index b9d93fd..84ab3ff 100644 >>>>>> --- a/fs/f2fs/gc.c >>>>>> +++ b/fs/f2fs/gc.c >>>>>> @@ -622,7 +622,8 @@ static void move_data_block(struct inode *inode, block_t bidx, >>>>>> if (!check_valid_map(F2FS_I_SB(inode), segno, off)) >>>>>> goto out; >>>>>> >>>>>> - if (f2fs_is_atomic_file(inode)) >>>>>> + if (f2fs_is_atomic_file(inode) && >>>>>> + !f2fs_is_commit_atomic_write(inode)) >>>>>> goto out; >>>>>> >>>>>> if (f2fs_is_pinned_file(inode)) { >>>>>> @@ -729,7 +730,8 @@ static void move_data_page(struct inode *inode, block_t bidx, int gc_type, >>>>>> if (!check_valid_map(F2FS_I_SB(inode), segno, off)) >>>>>> goto out; >>>>>> >>>>>> - if (f2fs_is_atomic_file(inode)) >>>>>> + if (f2fs_is_atomic_file(inode) && >>>>>> + !f2fs_is_commit_atomic_write(inode)) >>>>>> goto out; >>>>>> if (f2fs_is_pinned_file(inode)) { >>>>>> if (gc_type == FG_GC) >>>>>> >>>>> >>>>> . >>>>> >>>> >>> >>> >>> . >>> >> > > > . > -- Thanks, Yunlong Song