From: Jan Kara Subject: Re: [v4.14-rc1 regression] ext4 failed fstests generic/233 quota test Date: Tue, 10 Oct 2017 13:43:23 +0200 Message-ID: <20171010114323.GD3667@quack2.suse.cz> References: <20171008054236.GA10593@eguan.usersys.redhat.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Cc: linux-ext4@vger.kernel.org, linux-fsdevel@vger.kernel.org, Jan Kara , Eric Whitney To: Eryu Guan Return-path: Received: from mx2.suse.de ([195.135.220.15]:45035 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751415AbdJJLnY (ORCPT ); Tue, 10 Oct 2017 07:43:24 -0400 Content-Disposition: inline In-Reply-To: <20171008054236.GA10593@eguan.usersys.redhat.com> Sender: linux-ext4-owner@vger.kernel.org List-ID: Hi Eryu, On Sun 08-10-17 13:42:36, Eryu Guan wrote: > After generic/232 failure has been reported and resolved[1], I still > could see fstests generic/233 failure on ext4 with v4.14-rc3 kernel. > This is not 100% reproduced (block usage needs to exceed soft limit) but > reliably. > > seed = S > Comparing user usage > -Comparing group usage > +4c4 > +< #1001 +- 32064 32000 32000 998 1000 1000 > +--- > +> #1001 +- 32064 32000 32000 7days 998 1000 1000 > > Grace time was not printed by repquota right after the fsstress run when > we exceeded the block soft limit, and only printed after a quotacheck > was run. With v4.13 kernel, block grace time could be printed > immediately after the fsstress run. Well, I'd rather interpret the results as "the grace time didn't get set by the failing kernel, only quotacheck would set it". This configuration with softlimit == hardlimit is a bit ambiguous (as effectively softlimit and grace time are unused) and I might have shortcut setting of grace time in this case somewhere (which would be harmless). But still it warrants closer investigation. I'll have a look. > git bisect pointed the first bad to commit 7b9ca4c61bc2 ("quota: Reduce > contention on dq_data_lock"). And I've confirmed the bisection result by > converting the commit in question and running generic/233 for 20 > iterations without a failure. Thanks for digging into this! Honza -- Jan Kara SUSE Labs, CR