Received: by 2002:a05:6a10:6744:0:0:0:0 with SMTP id w4csp312132pxu; Wed, 14 Oct 2020 01:49:57 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyiBhRo8PGuAu7gSLxCIbK3196e2cmoprUTi2cn0GZBrGZ4dfjaj4GXPq0qAL62I7VSd0yG X-Received: by 2002:aa7:d7ca:: with SMTP id e10mr3987673eds.269.1602665397344; Wed, 14 Oct 2020 01:49:57 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1602665397; cv=none; d=google.com; s=arc-20160816; b=nB1hwwaL0FRDH7ddqbJLqr+jswNoJC08sQaE/f8bFd2qvHKe5DXzt7XS4qnL6a1V8N JdEX+svii+hMqZNzaKEq/Z1Z8oEL8G8zbALEUIlKBwHFhYvQx10AhPmQHSTpxz2Q0iB6 2uQCrXnEvyrNnnV6BfQlYerKOnb/3ODZ51nNq3rqC1+//oqdaV2G+bSd143habqI0GqS B+7DgDVmCU2a1dT2zHSFc8Z5uRLv5frqrWEyTWDPCov4Y9cCIJqmo4TOEbdyeI+i5t6N 36OVT8EDhRhpdRuJ1asBQX9ah67aUbytnay7H9AcE1nsPbBOTAUCgTRT4+r2p5wgRRW2 1ZeA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :to:subject; bh=FhAlkLrvuQXobxdRkMjlE8LxVwyu5S4n1hBzV5CGjpo=; b=EmmI8EczwSEEnSHlUh/+/VjKnXIPG4AiTe/4gSMVMPOP7NgtihFv9pIJf2dci1sTUa 5bv/h9Mo2W+ZJCfiXoymbbx3EThza2emOK3k3GjkWIE3KFfJyLV+6l1LzgPXbD5xJ3kw Xes2bj64yzPKhIxNOpF1/E2NRqY8QXYER1s6EQRRneHUdsq2d75rM3WUxw3dniA5JZkp uNn67L4eaim6ys1MjUSvRGBCewoEwMEKkCX6nLnVyR9qHTaC7jqnEy8CykjqqZSLbsJC uMvJxlCkWwJPSnEQ8xq5rz3Vg0JvZM4WrxUa//hnUAgOMGeRoln1nggvBnR8Z3slCkrs rMKA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id cc20si1715073edb.341.2020.10.14.01.49.34; Wed, 14 Oct 2020 01:49:57 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730999AbgJNGV3 (ORCPT + 99 others); Wed, 14 Oct 2020 02:21:29 -0400 Received: from szxga05-in.huawei.com ([45.249.212.191]:15212 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1730983AbgJNGV2 (ORCPT ); Wed, 14 Oct 2020 02:21:28 -0400 Received: from DGGEMS411-HUB.china.huawei.com (unknown [172.30.72.58]) by Forcepoint Email with ESMTP id AAFC89273CDCF63776A9; Wed, 14 Oct 2020 14:21:26 +0800 (CST) Received: from [10.136.114.67] (10.136.114.67) by smtp.huawei.com (10.3.19.211) with Microsoft SMTP Server (TLS) id 14.3.487.0; Wed, 14 Oct 2020 14:21:25 +0800 Subject: Re: [f2fs-dev] [PATCH v3] f2fs: handle errors of f2fs_get_meta_page_nofail be failed To: , , , References: <20201002213226.2862930-1-jaegeuk@kernel.org> <20201009043135.GA1973455@google.com> <20201013030725.GA3337731@google.com> From: Chao Yu Message-ID: Date: Wed, 14 Oct 2020 14:21:25 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: <20201013030725.GA3337731@google.com> Content-Type: text/plain; charset="windows-1252"; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.136.114.67] X-CFilter-Loop: Reflected Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2020/10/13 11:07, jaegeuk@kernel.org wrote: > First problem is we hit BUG_ON() in f2fs_get_sum_page given EIO on > f2fs_get_meta_page_nofail(). > > Quick fix was not to give any error with infinite loop, but syzbot caught > a case where it goes to that loop from fuzzed image. In turned out we abused > f2fs_get_meta_page_nofail() like in the below call stack. > > - f2fs_fill_super > - f2fs_build_segment_manager > - build_sit_entries > - get_current_sit_page > > INFO: task syz-executor178:6870 can't die for more than 143 seconds. > task:syz-executor178 state:R > stack:26960 pid: 6870 ppid: 6869 flags:0x00004006 > Call Trace: > > Showing all locks held in the system: > 1 lock held by khungtaskd/1179: > #0: ffffffff8a554da0 (rcu_read_lock){....}-{1:2}, at: debug_show_all_locks+0x53/0x260 kernel/locking/lockdep.c:6242 > 1 lock held by systemd-journal/3920: > 1 lock held by in:imklog/6769: > #0: ffff88809eebc130 (&f->f_pos_lock){+.+.}-{3:3}, at: __fdget_pos+0xe9/0x100 fs/file.c:930 > 1 lock held by syz-executor178/6870: > #0: ffff8880925120e0 (&type->s_umount_key#47/1){+.+.}-{3:3}, at: alloc_super+0x201/0xaf0 fs/super.c:229 > > Actually, we didn't have to use _nofail in this case, since we could return > error to mount(2) already with the error handler. > > As a result, this patch tries to 1) remove _nofail callers as much as possible, > 2) deal with error case in last remaining caller, f2fs_get_sum_page(). > > Reported-by: syzbot+ee250ac8137be41d7b13@syzkaller.appspotmail.com > Signed-off-by: Jaegeuk Kim I prefer to use f2fs_get_meta_page() as much as possible except change_curseg() path, however, it's minor, and the time is closing to merge window, so: Reviewed-by: Chao Yu Thanks, > --- > > Change log from v2: > - avoid _nofail and add error handler > > fs/f2fs/checkpoint.c | 2 +- > fs/f2fs/f2fs.h | 2 +- > fs/f2fs/node.c | 2 +- > fs/f2fs/segment.c | 12 +++++++++--- > 4 files changed, 12 insertions(+), 6 deletions(-) > > diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c > index f18386d30f031..023462e80e58d 100644 > --- a/fs/f2fs/checkpoint.c > +++ b/fs/f2fs/checkpoint.c > @@ -107,7 +107,7 @@ struct page *f2fs_get_meta_page(struct f2fs_sb_info *sbi, pgoff_t index) > return __get_meta_page(sbi, index, true); > } > > -struct page *f2fs_get_meta_page_nofail(struct f2fs_sb_info *sbi, pgoff_t index) > +struct page *f2fs_get_meta_page_retry(struct f2fs_sb_info *sbi, pgoff_t index) > { > struct page *page; > int count = 0; > diff --git a/fs/f2fs/f2fs.h b/fs/f2fs/f2fs.h > index ae46d44bd5211..adda53d20a399 100644 > --- a/fs/f2fs/f2fs.h > +++ b/fs/f2fs/f2fs.h > @@ -3422,7 +3422,7 @@ unsigned int f2fs_usable_blks_in_seg(struct f2fs_sb_info *sbi, > void f2fs_stop_checkpoint(struct f2fs_sb_info *sbi, bool end_io); > struct page *f2fs_grab_meta_page(struct f2fs_sb_info *sbi, pgoff_t index); > struct page *f2fs_get_meta_page(struct f2fs_sb_info *sbi, pgoff_t index); > -struct page *f2fs_get_meta_page_nofail(struct f2fs_sb_info *sbi, pgoff_t index); > +struct page *f2fs_get_meta_page_retry(struct f2fs_sb_info *sbi, pgoff_t index); > struct page *f2fs_get_tmp_page(struct f2fs_sb_info *sbi, pgoff_t index); > bool f2fs_is_valid_blkaddr(struct f2fs_sb_info *sbi, > block_t blkaddr, int type); > diff --git a/fs/f2fs/node.c b/fs/f2fs/node.c > index 93fb34d636eb5..d5d8ce077f295 100644 > --- a/fs/f2fs/node.c > +++ b/fs/f2fs/node.c > @@ -109,7 +109,7 @@ static void clear_node_page_dirty(struct page *page) > > static struct page *get_current_nat_page(struct f2fs_sb_info *sbi, nid_t nid) > { > - return f2fs_get_meta_page_nofail(sbi, current_nat_addr(sbi, nid)); > + return f2fs_get_meta_page(sbi, current_nat_addr(sbi, nid)); > } > > static struct page *get_next_nat_page(struct f2fs_sb_info *sbi, nid_t nid) > diff --git a/fs/f2fs/segment.c b/fs/f2fs/segment.c > index 13ecd2c2c361b..05ab5ae2b5f7f 100644 > --- a/fs/f2fs/segment.c > +++ b/fs/f2fs/segment.c > @@ -2379,7 +2379,9 @@ int f2fs_npages_for_summary_flush(struct f2fs_sb_info *sbi, bool for_ra) > */ > struct page *f2fs_get_sum_page(struct f2fs_sb_info *sbi, unsigned int segno) > { > - return f2fs_get_meta_page_nofail(sbi, GET_SUM_BLOCK(sbi, segno)); > + if (unlikely(f2fs_cp_error(sbi))) > + return ERR_PTR(-EIO); > + return f2fs_get_meta_page_retry(sbi, GET_SUM_BLOCK(sbi, segno)); > } > > void f2fs_update_meta_page(struct f2fs_sb_info *sbi, > @@ -2669,7 +2671,11 @@ static void change_curseg(struct f2fs_sb_info *sbi, int type, bool flush) > __next_free_blkoff(sbi, curseg, 0); > > sum_page = f2fs_get_sum_page(sbi, new_segno); > - f2fs_bug_on(sbi, IS_ERR(sum_page)); > + if (IS_ERR(sum_page)) { > + /* GC won't be able to use stale summary pages by cp_error */ > + memset(curseg->sum_blk, 0, SUM_ENTRY_SIZE); > + return; > + } > sum_node = (struct f2fs_summary_block *)page_address(sum_page); > memcpy(curseg->sum_blk, sum_node, SUM_ENTRY_SIZE); > f2fs_put_page(sum_page, 1); > @@ -3964,7 +3970,7 @@ int f2fs_lookup_journal_in_cursum(struct f2fs_journal *journal, int type, > static struct page *get_current_sit_page(struct f2fs_sb_info *sbi, > unsigned int segno) > { > - return f2fs_get_meta_page_nofail(sbi, current_sit_addr(sbi, segno)); > + return f2fs_get_meta_page(sbi, current_sit_addr(sbi, segno)); > } > > static struct page *get_next_sit_page(struct f2fs_sb_info *sbi, >