Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp317446imm; Thu, 12 Jul 2018 20:28:05 -0700 (PDT) X-Google-Smtp-Source: AAOMgpezPPfJ2pzJXJZBlq9K1N3S/VkVMl3kReVursDGPrpBx6FUqUBe9mtLfwGgU7ZgNF6RXv1K X-Received: by 2002:a17:902:599b:: with SMTP id p27-v6mr4569931pli.191.1531452485831; Thu, 12 Jul 2018 20:28:05 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1531452485; cv=none; d=google.com; s=arc-20160816; b=jtU0MS4xLpoazAyAq3F7VRljKCsqNkQXezYvArbEBszB2perH/DGq42TUbV0UGv4tj NkKkAecbkwIO7T/msj2TzNPWfpSm19s1elgTV2y6jasZ5Qf80M7sAGJAPwgXvNrPVILE wWxVVUkOMoqtYLCG4FZD23G2kng0AkllTVd62thZUlVR6gV6XQlbuzveR2dAAdMvvToQ pYGiNRft+6cvceXQRJvI5ly4PcD5KjBp+XrO5O2CfkhiEGS+8Mn2vhll3Xq149l/0tkG S9D24rh1YsuBZDUqDU3JwerEFdY9PkMBBkgVT2m32eyF60x6JA92390AnVHf4fU3UeoY +1cg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:arc-authentication-results; bh=juiVcSLTbwytKURCGbsd81l9zjJ3iLHhvkC0gBiQhe0=; b=EizpRQ/TDI9TB2OvfUgL62GLfc4F8OexfSrigOKsv21aJ3xMDK+KA8p+yAYc2+95x+ sEtiTg8rIcQL8SOkjT05okWeEG8UmwwWuugzmSjhUFle89UoD5xsbBzfEO9BuiJDIDDz JSRg+ju1o/DIpi0IRaEOAr/nTKmTMOxbdceqdzJnNu1Ock/Q6EcnrmggKQvHQdjibuae F49azZmTtrMI9rdxVxaEDmnG9ewoHqtGqDXmije4Dx9r26MKjym6kgeTgxoZyQ3BTxbT 6m/JaO9M1yMaG62DAPpcyvN5WECkY6PDV3xBKHGMi4Yma/XhpXzFCXf5K18GIkA9Kj6j UD+w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id w26-v6si22634800pgk.372.2018.07.12.20.27.51; Thu, 12 Jul 2018 20:28:05 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2388022AbeGMDjQ (ORCPT + 99 others); Thu, 12 Jul 2018 23:39:16 -0400 Received: from szxga07-in.huawei.com ([45.249.212.35]:55075 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S2387615AbeGMDjQ (ORCPT ); Thu, 12 Jul 2018 23:39:16 -0400 Received: from DGGEMS406-HUB.china.huawei.com (unknown [172.30.72.58]) by Forcepoint Email with ESMTP id C01C978836A61; Fri, 13 Jul 2018 11:26:33 +0800 (CST) Received: from [127.0.0.1] (10.134.22.195) by DGGEMS406-HUB.china.huawei.com (10.3.19.206) with Microsoft SMTP Server id 14.3.382.0; Fri, 13 Jul 2018 11:26:28 +0800 Subject: Re: [PATCH 5/5] f2fs: do not __punch_discard_cmd in lfs mode To: Yunlong Song , , , CC: , , , , , References: <1531408170-45758-1-git-send-email-yunlong.song@huawei.com> <1531408170-45758-6-git-send-email-yunlong.song@huawei.com> From: Chao Yu Message-ID: <7049c66a-867a-aef6-a58e-16ade179461e@huawei.com> Date: Fri, 13 Jul 2018 11:26:27 +0800 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.8.0 MIME-Version: 1.0 In-Reply-To: <1531408170-45758-6-git-send-email-yunlong.song@huawei.com> Content-Type: text/plain; charset="windows-1252" Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.134.22.195] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2018/7/12 23:09, Yunlong Song wrote: > In lfs mode, it is better to submit and wait for discard of the > new_blkaddr's overall section, rather than punch it which makes > more small discards and is not friendly with flash alignment. And > f2fs does not have to wait discard of each new_blkaddr except for the > start_block of each section with this patch. For non-zoned block device, unaligned discard can be allowed; and if synchronous discard is very slow, it will block block allocator here, rather than that, I prefer just punch 4k lba of discard entry for performance. If you don't want to encounter this condition, I suggest issue large size discard more quickly. Thanks, > > Signed-off-by: Yunlong Song > --- > fs/f2fs/segment.c | 76 ++++++++++++++++++++++++++++++++++++++++++++++++++----- > fs/f2fs/segment.h | 7 ++++- > 2 files changed, 75 insertions(+), 8 deletions(-) > > diff --git a/fs/f2fs/segment.c b/fs/f2fs/segment.c > index f6c20e0..bce321a 100644 > --- a/fs/f2fs/segment.c > +++ b/fs/f2fs/segment.c > @@ -893,7 +893,19 @@ static void __remove_discard_cmd(struct f2fs_sb_info *sbi, > static void f2fs_submit_discard_endio(struct bio *bio) > { > struct discard_cmd *dc = (struct discard_cmd *)bio->bi_private; > + struct f2fs_sb_info *sbi = F2FS_SB(dc->bdev->bd_super); > > + if (test_opt(sbi, LFS)) { > + unsigned int segno = GET_SEGNO(sbi, dc->lstart); > + unsigned int secno = GET_SEC_FROM_SEG(sbi, segno); > + int cnt = (dc->len >> sbi->log_blocks_per_seg) / > + sbi->segs_per_sec; > + > + while (cnt--) { > + set_bit(secno, FREE_I(sbi)->discard_secmap); > + secno++; > + } > + } > dc->error = blk_status_to_errno(bio->bi_status); > dc->state = D_DONE; > complete_all(&dc->wait); > @@ -1349,8 +1361,15 @@ static void f2fs_wait_discard_bio(struct f2fs_sb_info *sbi, block_t blkaddr) > dc = (struct discard_cmd *)f2fs_lookup_rb_tree(&dcc->root, > NULL, blkaddr); > if (dc) { > - if (dc->state == D_PREP) { > + if (dc->state == D_PREP && !test_opt(sbi, LFS)) > __punch_discard_cmd(sbi, dc, blkaddr); > + else if (dc->state == D_PREP && test_opt(sbi, LFS)) { > + struct discard_policy dpolicy; > + > + __init_discard_policy(sbi, &dpolicy, DPOLICY_FORCE, 1); > + __submit_discard_cmd(sbi, &dpolicy, dc); > + dc->ref++; > + need_wait = true; > } else { > dc->ref++; > need_wait = true; > @@ -2071,9 +2090,10 @@ static void get_new_segment(struct f2fs_sb_info *sbi, > unsigned int hint = GET_SEC_FROM_SEG(sbi, *newseg); > unsigned int old_zoneno = GET_ZONE_FROM_SEG(sbi, *newseg); > unsigned int left_start = hint; > - bool init = true; > + bool init = true, check_discard = test_opt(sbi, LFS) ? true : false; > int go_left = 0; > int i; > + unsigned long *free_secmap; > > spin_lock(&free_i->segmap_lock); > > @@ -2084,11 +2104,25 @@ static void get_new_segment(struct f2fs_sb_info *sbi, > goto got_it; > } > find_other_zone: > - secno = find_next_zero_bit(free_i->free_secmap, MAIN_SECS(sbi), hint); > + if (check_discard) { > + int entries = f2fs_bitmap_size(MAIN_SECS(sbi)) / sizeof(unsigned long); > + > + free_secmap = free_i->tmp_secmap; > + for (i = 0; i < entries; i++) > + free_secmap[i] = (!(free_i->free_secmap[i] ^ > + free_i->discard_secmap[i])) | free_i->free_secmap[i]; > + } else > + free_secmap = free_i->free_secmap; > + > + secno = find_next_zero_bit(free_secmap, MAIN_SECS(sbi), hint); > if (secno >= MAIN_SECS(sbi)) { > if (dir == ALLOC_RIGHT) { > - secno = find_next_zero_bit(free_i->free_secmap, > + secno = find_next_zero_bit(free_secmap, > MAIN_SECS(sbi), 0); > + if (secno >= MAIN_SECS(sbi) && check_discard) { > + check_discard = false; > + goto find_other_zone; > + } > f2fs_bug_on(sbi, secno >= MAIN_SECS(sbi)); > } else { > go_left = 1; > @@ -2098,13 +2132,17 @@ static void get_new_segment(struct f2fs_sb_info *sbi, > if (go_left == 0) > goto skip_left; > > - while (test_bit(left_start, free_i->free_secmap)) { > + while (test_bit(left_start, free_secmap)) { > if (left_start > 0) { > left_start--; > continue; > } > - left_start = find_next_zero_bit(free_i->free_secmap, > + left_start = find_next_zero_bit(free_secmap, > MAIN_SECS(sbi), 0); > + if (left_start >= MAIN_SECS(sbi) && check_discard) { > + check_discard = false; > + goto find_other_zone; > + } > f2fs_bug_on(sbi, left_start >= MAIN_SECS(sbi)); > break; > } > @@ -2719,7 +2757,18 @@ void f2fs_allocate_data_block(struct f2fs_sb_info *sbi, struct page *page, > > *new_blkaddr = NEXT_FREE_BLKADDR(sbi, curseg); > > - f2fs_wait_discard_bio(sbi, *new_blkaddr); > + if (test_opt(sbi, LFS)) { > + unsigned int start_segno, secno; > + > + secno = GET_SEC_FROM_SEG(sbi, curseg->segno); > + start_segno = secno * sbi->segs_per_sec; > + if (*new_blkaddr == START_BLOCK(sbi, start_segno) && > + !test_bit(secno, FREE_I(sbi)->discard_secmap)) > + f2fs_wait_discard_bio(sbi, *new_blkaddr); > + f2fs_bug_on(sbi, !test_bit(secno, FREE_I(sbi)->discard_secmap)); > + } > + else > + f2fs_wait_discard_bio(sbi, *new_blkaddr); > > /* > * __add_sum_entry should be resided under the curseg_mutex > @@ -3648,10 +3697,21 @@ static int build_free_segmap(struct f2fs_sb_info *sbi) > if (!free_i->free_secmap) > return -ENOMEM; > > + free_i->discard_secmap = f2fs_kvmalloc(sbi, sec_bitmap_size, GFP_KERNEL); > + if (!free_i->discard_secmap) > + return -ENOMEM; > + > + free_i->tmp_secmap = f2fs_kvmalloc(sbi, sec_bitmap_size, GFP_KERNEL); > + if (!free_i->tmp_secmap) > + return -ENOMEM; > + > /* set all segments as dirty temporarily */ > memset(free_i->free_segmap, 0xff, bitmap_size); > memset(free_i->free_secmap, 0xff, sec_bitmap_size); > > + /* set all sections as discarded temporarily */ > + memset(free_i->discard_secmap, 0xff, sec_bitmap_size); > + > /* init free segmap information */ > free_i->start_segno = GET_SEGNO_FROM_SEG0(sbi, MAIN_BLKADDR(sbi)); > free_i->free_segments = 0; > @@ -4047,6 +4107,8 @@ static void destroy_free_segmap(struct f2fs_sb_info *sbi) > SM_I(sbi)->free_info = NULL; > kvfree(free_i->free_segmap); > kvfree(free_i->free_secmap); > + kvfree(free_i->discard_secmap); > + kvfree(free_i->tmp_secmap); > kfree(free_i); > } > > diff --git a/fs/f2fs/segment.h b/fs/f2fs/segment.h > index 5049551..b37a909 100644 > --- a/fs/f2fs/segment.h > +++ b/fs/f2fs/segment.h > @@ -259,6 +259,8 @@ struct free_segmap_info { > spinlock_t segmap_lock; /* free segmap lock */ > unsigned long *free_segmap; /* free segment bitmap */ > unsigned long *free_secmap; /* free section bitmap */ > + unsigned long *discard_secmap; /* discard section bitmap */ > + unsigned long *tmp_secmap; /* bitmap for temporal use */ > }; > > /* Notice: The order of dirty type is same with CURSEG_XXX in f2fs.h */ > @@ -453,8 +455,11 @@ static inline void __set_test_and_free(struct f2fs_sb_info *sbi, > next = find_next_bit(free_i->free_segmap, > start_segno + sbi->segs_per_sec, start_segno); > if (next >= start_segno + sbi->segs_per_sec) { > - if (test_and_clear_bit(secno, free_i->free_secmap)) > + if (test_and_clear_bit(secno, free_i->free_secmap)) { > free_i->free_sections++; > + if (test_opt(sbi, LFS)) > + clear_bit(secno, free_i->discard_secmap); > + } > } > } > skip_free: >