Received: by 2002:a05:6a10:9848:0:0:0:0 with SMTP id x8csp934940pxf; Thu, 25 Mar 2021 18:38:19 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxT37xJGxLBSJau4zWpi5BD3Gl6ePRdQhR1hxAZ/XEeXYIvy258pKIQcWrtVCqQVABRy8Oe X-Received: by 2002:a17:906:400b:: with SMTP id v11mr12629207ejj.194.1616722698999; Thu, 25 Mar 2021 18:38:18 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1616722698; cv=none; d=google.com; s=arc-20160816; b=SnntxklYeHoM/fH+Au3mReqmMelSPWAROlVNYkPfDIae3vPtyAUsLR3Mv59+a+REo1 k4AhK91lbxKge97M/yePlMbSVBqe5EtcDePosy2rJVNB5I8ciiIvX+AJMS7+98rsahz/ O5wgBW7vEqKPXBIyRLZ6MjzCHh6GCTEcuqhxTAbYphcDt4hL86CTGaFWFcJj1XwDliXV QxPcybnqEcOW9IjjiuimsWbjtOMOL9UV2fB1yD04gAqoOXIXH/A/MTAHnY+KIx0yj8RF kmoiNmEr3nGajMpE82iJTYI2y+HN50OZVEFc+9Ccc0Nak255bpH8e/aapi+t+FyfspJD 8CNw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject; bh=B3w6YmLdHJB+bOcEGnbCxk471OBaTFAXiNLWZL+8qNc=; b=IbmDRD5vL2nrswXdPAaQfIXPHESYCqZMM/MA9cUqJRfBx7Rq1MXB10sVthN06f9Tt3 hk/blISAztNfY0dPjo1GsAmiNIFtjQPwWhkFc3UmaU40ufKf8Cu6q4XWMp/8ucdTZ7Ds WLN7NJPYu6zYWBhyMWsvHI2oakUwkZNk8RPAUasjVyMG0gTcoE7LEWxWpueMWM5S0B6H 9qgzRiikEZQLYjiNwsZgr3cY56RjGIU368ZbmjCCW3AX9GQwc0VuZmddxtK8ak5tOKLZ YEuY3nbvEF1loHtbunXcQlsl2Ki2fczzyjdZdDD71w2xMGYLBZPrMZd3gRDyaWhtvWb5 +NmQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=huawei.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id g12si5338047ejd.256.2021.03.25.18.37.56; Thu, 25 Mar 2021 18:38:18 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=huawei.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230137AbhCZBep (ORCPT + 99 others); Thu, 25 Mar 2021 21:34:45 -0400 Received: from szxga05-in.huawei.com ([45.249.212.191]:14548 "EHLO szxga05-in.huawei.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229893AbhCZBe0 (ORCPT ); Thu, 25 Mar 2021 21:34:26 -0400 Received: from DGGEMS412-HUB.china.huawei.com (unknown [172.30.72.59]) by szxga05-in.huawei.com (SkyGuard) with ESMTP id 4F64Fg0SCtzPm7n; Fri, 26 Mar 2021 09:31:51 +0800 (CST) Received: from [10.136.110.154] (10.136.110.154) by smtp.huawei.com (10.3.19.212) with Microsoft SMTP Server (TLS) id 14.3.498.0; Fri, 26 Mar 2021 09:34:23 +0800 Subject: Re: [f2fs-dev] [PATCH] Revert "f2fs: give a warning only for readonly partition" To: Jaegeuk Kim CC: , References: <20210323064155.12582-1-yuchao0@huawei.com> <107e671d-68ea-1a74-521e-ab2b6fe36416@huawei.com> <8b0b0782-a667-9edc-5ee9-98ac9f67b7b7@huawei.com> From: Chao Yu Message-ID: <84688aac-75da-1226-df4d-47ac97087c51@huawei.com> Date: Fri, 26 Mar 2021 09:34:22 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset="windows-1252"; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.136.110.154] X-CFilter-Loop: Reflected Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2021/3/26 9:19, Jaegeuk Kim wrote: > On 03/26, Chao Yu wrote: >> On 2021/3/25 9:59, Chao Yu wrote: >>> On 2021/3/25 6:44, Jaegeuk Kim wrote: >>>> On 03/24, Chao Yu wrote: >>>>> On 2021/3/24 12:22, Jaegeuk Kim wrote: >>>>>> On 03/24, Chao Yu wrote: >>>>>>> On 2021/3/24 2:39, Jaegeuk Kim wrote: >>>>>>>> On 03/23, Chao Yu wrote: >>>>>>>>> This reverts commit 938a184265d75ea474f1c6fe1da96a5196163789. >>>>>>>>> >>>>>>>>> Because that commit fails generic/050 testcase which expect failure >>>>>>>>> during mount a recoverable readonly partition. >>>>>>>> >>>>>>>> I think we need to change generic/050, since f2fs can recover this partition, >>>>>>> >>>>>>> Well, not sure we can change that testcase, since it restricts all generic >>>>>>> filesystems behavior. At least, ext4's behavior makes sense to me: >>>>>>> >>>>>>> journal_dev_ro = bdev_read_only(journal->j_dev); >>>>>>> really_read_only = bdev_read_only(sb->s_bdev) | journal_dev_ro; >>>>>>> >>>>>>> if (journal_dev_ro && !sb_rdonly(sb)) { >>>>>>> ext4_msg(sb, KERN_ERR, >>>>>>> "journal device read-only, try mounting with '-o ro'"); >>>>>>> err = -EROFS; >>>>>>> goto err_out; >>>>>>> } >>>>>>> >>>>>>> if (ext4_has_feature_journal_needs_recovery(sb)) { >>>>>>> if (sb_rdonly(sb)) { >>>>>>> ext4_msg(sb, KERN_INFO, "INFO: recovery " >>>>>>> "required on readonly filesystem"); >>>>>>> if (really_read_only) { >>>>>>> ext4_msg(sb, KERN_ERR, "write access " >>>>>>> "unavailable, cannot proceed " >>>>>>> "(try mounting with noload)"); >>>>>>> err = -EROFS; >>>>>>> goto err_out; >>>>>>> } >>>>>>> ext4_msg(sb, KERN_INFO, "write access will " >>>>>>> "be enabled during recovery"); >>>>>>> } >>>>>>> } >>>>>>> >>>>>>>> even though using it as readonly. And, valid checkpoint can allow for user to >>>>>>>> read all the data without problem. >>>>>>> >>>>>>>>> if (f2fs_hw_is_readonly(sbi)) { >>>>>>> >>>>>>> Since device is readonly now, all write to the device will fail, checkpoint can >>>>>>> not persist recovered data, after page cache is expired, user can see stale data. >>>>>> >>>>>> My point is, after mount with ro, there'll be no data write which preserves the >>>>>> current status. So, in the next time, we can recover fsync'ed data later, if >>>>>> user succeeds to mount as rw. Another point is, with the current checkpoint, we >>>>>> should not have any corrupted metadata. So, why not giving a chance to show what >>>>>> data remained to user? I think this can be doable only with CoW filesystems. >>>>> >>>>> I guess we're talking about the different things... >>>>> >>>>> Let me declare two different readonly status: >>>>> >>>>> 1. filesystem readonly: file system is mount with ro mount option, and >>>>> app from userspace can not modify any thing of filesystem, but filesystem >>>>> itself can modify data on device since device may be writable. >>>>> >>>>> 2. device readonly: device is set to readonly status via 'blockdev --setro' >>>>> command, and then filesystem should never issue any write IO to the device. >>>>> >>>>> So, what I mean is, *when device is readonly*, rather than f2fs mountpoint >>>>> is readonly (f2fs_hw_is_readonly() returns true as below code, instead of >>>>> f2fs_readonly() returns true), in this condition, we should not issue any >>>>> write IO to device anyway, because, AFAIK, write IO will fail due to >>>>> bio_check_ro() check. >>>> >>>> In that case, mount(2) will try readonly, no? >>> >>> Yes, if device is readonly, mount (2) can not mount/remount device to rw >>> mountpoint. >> >> Any other concern about this patch? > > Indeed we're talking about different things. :) > > This case is mount(ro) with device(ro) having some data to recover. > My point is why not giving a chance to mount(ro) to show the current data > covered by a valid checkpoint. This doesn't change anything in the disk, Got your idea. IMO, it has potential issue in above condition: >>>>>>> Since device is readonly now, all write to the device will fail, checkpoint can >>>>>>> not persist recovered data, after page cache is expired, user can see stale data. e.g. Recovery writes one inode and then triggers a checkpoint, all writes fail due to device is readonly, once inode cache is reclaimed by vm, user will see old inode when reloading it, or even see corrupted fs if partial meta inode's cache is expired. Thoughts? Thanks, > and in the next time, it allows mount(rw|ro) with device(rw) to recover > the data seamlessly. > >> >> Thanks, >> >>> >>> Thanks, >>> >>>> >>>> # blockdev --setro /dev/vdb >>>> # mount -t f2fs /dev/vdb /mnt/test/ >>>> mount: /mnt/test: WARNING: source write-protected, mounted read-only. >>>> >>>>> >>>>> if (f2fs_hw_is_readonly(sbi)) { >>>>> - if (!is_set_ckpt_flags(sbi, CP_UMOUNT_FLAG)) { >>>>> - err = -EROFS; >>>>> + if (!is_set_ckpt_flags(sbi, CP_UMOUNT_FLAG)) >>>>> f2fs_err(sbi, "Need to recover fsync data, but write access unavailable"); >>>>> - goto free_meta; >>>>> - } >>>>> - f2fs_info(sbi, "write access unavailable, skipping recovery"); >>>>> + else >>>>> + f2fs_info(sbi, "write access unavailable, skipping recovery"); >>>>> goto reset_checkpoint; >>>>> } >>>>> >>>>> For the case of filesystem is readonly and device is writable, it's fine >>>>> to do recovery in order to let user to see fsynced data. >>>>> >>>>> Thanks, >>>>> >>>>>> >>>>>>> >>>>>>> Am I missing something? >>>>>>> >>>>>>> Thanks, >>>>>>> >>>>>>>> >>>>>>>>> >>>>>>>>> Fixes: 938a184265d7 ("f2fs: give a warning only for readonly partition") >>>>>>>>> Signed-off-by: Chao Yu >>>>>>>>> --- >>>>>>>>> fs/f2fs/super.c | 8 +++++--- >>>>>>>>> 1 file changed, 5 insertions(+), 3 deletions(-) >>>>>>>>> >>>>>>>>> diff --git a/fs/f2fs/super.c b/fs/f2fs/super.c >>>>>>>>> index b48281642e98..2b78ee11f093 100644 >>>>>>>>> --- a/fs/f2fs/super.c >>>>>>>>> +++ b/fs/f2fs/super.c >>>>>>>>> @@ -3952,10 +3952,12 @@ static int f2fs_fill_super(struct super_block *sb, void *data, int silent) >>>>>>>>> * previous checkpoint was not done by clean system shutdown. >>>>>>>>> */ >>>>>>>>> if (f2fs_hw_is_readonly(sbi)) { >>>>>>>>> - if (!is_set_ckpt_flags(sbi, CP_UMOUNT_FLAG)) >>>>>>>>> + if (!is_set_ckpt_flags(sbi, CP_UMOUNT_FLAG)) { >>>>>>>>> + err = -EROFS; >>>>>>>>> f2fs_err(sbi, "Need to recover fsync data, but write access unavailable"); >>>>>>>>> - else >>>>>>>>> - f2fs_info(sbi, "write access unavailable, skipping recovery"); >>>>>>>>> + goto free_meta; >>>>>>>>> + } >>>>>>>>> + f2fs_info(sbi, "write access unavailable, skipping recovery"); >>>>>>>>> goto reset_checkpoint; >>>>>>>>> } >>>>>>>>> -- >>>>>>>>> 2.29.2 >>>>>>>> . >>>>>>>> >>>>>> . >>>>>> >>>> . >>>> >>> >>> >>> _______________________________________________ >>> Linux-f2fs-devel mailing list >>> Linux-f2fs-devel@lists.sourceforge.net >>> https://lists.sourceforge.net/lists/listinfo/linux-f2fs-devel >>> . >>> > . >