Date: Fri, 29 May 2020 15:34:26 -0700
From: Jaegeuk Kim
To: Chao Yu
Cc: linux-f2fs-devel@lists.sourceforge.net, linux-kernel@vger.kernel.org, chao@kernel.org
Subject: Re: [PATCH] Revert "f2fs: fix quota_sync failure due to f2fs_lock_op"
Message-ID: <20200529223426.GA249109@google.com>
References: <20200529092947.7890-1-yuchao0@huawei.com>
In-Reply-To: <20200529092947.7890-1-yuchao0@huawei.com>

On 05/29, Chao Yu wrote:
> Under heavy fsstress, we may trigger a panic while issuing discard,
> because __check_sit_bitmap() detects that the discard command may erase
> valid data blocks. The root cause is the race described in the stack below:
> since we removed the lock when flushing quota data, quota data writeback
> may race with write_checkpoint(), which causes inconsistency
> between the cached discard entry and the segment bitmap.
>
> - f2fs_write_checkpoint
>  - block_operations
>   - set_sbi_flag(sbi, SBI_QUOTA_SKIP_FLUSH)
>  - f2fs_flush_sit_entries
>   - add_discard_addrs
>    - __set_bit_le(i, (void *)de->discard_map);
>                                 - f2fs_write_data_pages
>                                  - f2fs_write_single_data_page
>                                    : inode is quota one, cp_rwsem won't be locked
>                                   - f2fs_do_write_data_page
>                                    - f2fs_allocate_data_block
>                                     - f2fs_wait_discard_bio
>                                       : discard entry has not been added yet.
>                                     - update_sit_entry
>  - f2fs_clear_prefree_segments
>   - f2fs_issue_discard
>   : add discard entry
>
> This patch fixes this issue by reverting 435cbab95e39 ("f2fs: fix quota_sync
> failure due to f2fs_lock_op").
>
> Fixes: 435cbab95e39 ("f2fs: fix quota_sync failure due to f2fs_lock_op")

The previous patch fixed quota_sync getting EAGAIN all the time.
How about this instead? It seems to work under the fsstress test.

---
 fs/f2fs/segment.c | 11 +++++++++++
 1 file changed, 11 insertions(+)

diff --git a/fs/f2fs/segment.c b/fs/f2fs/segment.c
index ebbadde6cbced..f67cffc38975e 100644
--- a/fs/f2fs/segment.c
+++ b/fs/f2fs/segment.c
@@ -3095,6 +3095,14 @@ void f2fs_allocate_data_block(struct f2fs_sb_info *sbi, struct page *page,
 	struct curseg_info *curseg = CURSEG_I(sbi, type);
 	bool put_pin_sem = false;
 
+	/*
+	 * We need to wait for node_write to avoid block allocation during
+	 * checkpoint. This can only happen to quota writes which can cause
+	 * the below discard race condition.
+	 */
+	if (IS_DATASEG(type))
+		down_write(&sbi->node_write);
+
 	if (type == CURSEG_COLD_DATA) {
 		/* GC during CURSEG_COLD_DATA_PINNED allocation */
 		if (down_read_trylock(&sbi->pin_sem)) {
@@ -3174,6 +3182,9 @@ void f2fs_allocate_data_block(struct f2fs_sb_info *sbi, struct page *page,
 
 	if (put_pin_sem)
 		up_read(&sbi->pin_sem);
+
+	if (IS_DATASEG(type))
+		up_write(&sbi->node_write);
 }
 
 static void update_device_state(struct f2fs_io_info *fio)
-- 
2.27.0.rc0.183.gde8f92d652-goog