Return-Path: Received: from mail-wm1-f65.google.com ([209.85.128.65]:38783 "EHLO mail-wm1-f65.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725730AbeLBUTs (ORCPT ); Sun, 2 Dec 2018 15:19:48 -0500 MIME-Version: 1.0 References: <065643a0-f9aa-a361-715a-03ca978d9228@roeck-us.net> <20181128041555.GE31885@thunk.org> <2547416.7Vy7A2kRpU@siriux> In-Reply-To: From: Andrey Melnikov Date: Sun, 2 Dec 2018 23:19:30 +0300 Message-ID: Subject: Re: ext4 file system corruption with v4.19.3 / v4.19.4 To: jrf@mailbox.org, "Theodore Ts'o" Cc: linux-kernel@vger.kernel.org, linux-ext4@vger.kernel.org Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Sender: linux-ext4-owner@vger.kernel.org List-ID: =D1=87=D1=82, 29 =D0=BD=D0=BE=D1=8F=D0=B1. 2018 =D0=B3. =D0=B2 01:08, Raine= r Fiebig : > > Am 28.11.18 um 22:13 schrieb Andrey Melnikov: > > =D1=81=D1=80, 28 =D0=BD=D0=BE=D1=8F=D0=B1. 2018 =D0=B3. =D0=B2 18:55, R= ainer Fiebig : > >> > >> Am Mittwoch, 28. November 2018, 13:02:56 schrieb Andrey Jr. Melnikov: > >>> In gmane.comp.file-systems.ext4 Theodore Y. Ts'o wrot= e: > >>>> On Wed, Nov 28, 2018 at 03:16:33AM +0300, Andrey Jr. Melnikov wrote: > >>>>> Corrupted inodes - always directory, not touched at least year or > >>>>> more for writing. Something wrong when updating atime? > >>>> > >>>> We're not sure. The frustrating thing is that it's not reproducing > >>>> for me. I run extensive regression tests, and I'm using 4.19 on my > >>>> development laptop without notcing any problems. If I could reprodu= ce > >>>> it, I could debug it, but since I can't, I need to rely on those who > >>>> are seeing the problem to help pinpoint the problem. > >>> > >>> My workstation hit this bug every time after boot. If you have an ide= a - I > >>> may test it. > >>> > >>>> I'm trying to figure out common factors from those people who are > >>>> reporting problems. > >>>> > >>>> (a) What distribution are you running (it appears that many people > >>>> reporting problems are running Ubuntu, but this may be a sampling > >>>> issue; lots of people run Ubuntu)? (For the record, I'm using Debia= n > >>>> Testing.) > >>> > >>> Debian sid but self-build kernel from ubuntu mainline-ppa. > >> > >> You could try a vanilla 4.19.5 from https://www.kernel.org/ > >> and compile it with your current .config. > > > > mainline-ppa use vanilla kernel. Patches only adds debian specific > > build infrastructure. > > > >> If you still see the errors, at least the Ubuntu-kernel could be ruled= out. > >> > >> In addition, if you still see the errors: > >> > >> - backup your .config in a *different* folder (so that you can later r= e-use > >> it) > >> - do a "make mrproper" (deletes the .config, see above) > >> - do a "make defconfig" > >> - and compile the kernel with that new .config > > > > defconfig is great - for abstract hardware in vacuum. > > > >> If you still have the problem after that, you may want to learn how to= bisect. > >> ;) > > I'm already know how-to bisect. From kernel 2.0 era. Without git ;) > > > > This problem simply non-bisectable, when same kernel corrupt FS on my > > workstation but normally working on other servers. > > And now - FS corrupted again with disabled CONFIG_EXT4_ENCRYPTION. Grea= t. > > OK, - and now we are looking forward to *your* ideas how to solve this. After four days playing games around git bisect - real winner is debian gcc-8.2.0-9. Upgrade it to 8.2.0-10 or use 7.3.0-30 version for same kernel + config - does not exhibit ext4 corruption. I think I hit this https://gcc.gnu.org/bugzilla/show_bug.cgi?id=3D87859 with 8.2.0-9 version.