Received: by 2002:a25:4158:0:0:0:0:0 with SMTP id o85csp3608820yba; Sat, 11 May 2019 15:09:17 -0700 (PDT) X-Google-Smtp-Source: APXvYqzpA77Z86VbO9qii/h4+/0x1Tss46n2+4d9KDoMBhVb31lOZFbscFV457g4XWa7E6QekGNE X-Received: by 2002:a62:5ec4:: with SMTP id s187mr24325321pfb.185.1557612557496; Sat, 11 May 2019 15:09:17 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1557612557; cv=none; d=google.com; s=arc-20160816; b=GBMNk1Wdxx/Wf3FqCz8SdwEytHsdpzwjU3G4KyC5PtMc2JefKWSsqkUUnV1e8VUq3j FOfrA1HruscDKsQ6kdjKbvbGEO6f8cWFKmqcr69+1pkyAwr5VEjTyb/5oGXE/aqR1RjD dGgCp6I4LRNxch+jmg+EEBURjbjqfwUh7X6G/pqPCwzl8rOfRdfbx3QLkgSzAPIZG7Fd 2e+i6FwMwxoCoMPhZz/tB6zLFvRWAC03NG37hYD51yzOcERzB5WPoM+H1x6R3JYEzrgX PcSN4MI31hJLJsVW6n/DCfUScrXMrm4nov4fwdN8EKGqQ4hrTa8MFfXq5eqnYIIFRdGG BnAg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:mail-followup-to :message-id:subject:cc:to:from:date; bh=ShX26V1vVPH5G7KL1UyQ7ZirVOj0KqlzNnUtA0PlPkk=; b=OQP600IbzAGDq/DG1ua9SBquGywy2/umlbmws6yIwpCoA8CyqIzUNTI09zsydqlaaA eRtZy4A7cBF89BlGd9TBeIiX2979mWJGA0emxAb9vxl98bOMa5ch21aN8MyShy8unB6p Y9hONGh3Y2LIf0U5oBNgu4TueXxiGr9ypM3m08oBxQxZLVo5GdyvWZrGfVYi+HJRd/Pe Tmx3LEBYnubCyjYWujaJF65TDyjjLf811M3qAA9vkgkP40ooknKOnFxy9/gdNuBhn71W gD2eIyRcdlwvO7jJKbuUpmmCi6ZyURtnUrP6nwzpx4hCk8hy+BNfAMKPyZ9R2hb8rFQc zmLQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-ext4-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e6si1641568pfe.111.2019.05.11.15.08.44; Sat, 11 May 2019 15:09:17 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-ext4-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-ext4-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726099AbfEKWHK (ORCPT + 99 others); Sat, 11 May 2019 18:07:10 -0400 Received: from outgoing-auth-1.mit.edu ([18.9.28.11]:45539 "EHLO outgoing.mit.edu" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726043AbfEKWHK (ORCPT ); Sat, 11 May 2019 18:07:10 -0400 Received: from callcc.thunk.org (rrcs-67-53-55-100.west.biz.rr.com [67.53.55.100]) (authenticated bits=0) (User authenticated as tytso@ATHENA.MIT.EDU) by outgoing.mit.edu (8.14.7/8.12.4) with ESMTP id x4BM6x67031652 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Sat, 11 May 2019 18:07:02 -0400 Received: by callcc.thunk.org (Postfix, from userid 15806) id 558B3420024; Sat, 11 May 2019 18:06:59 -0400 (EDT) Date: Sat, 11 May 2019 18:06:59 -0400 From: "Theodore Ts'o" To: Richard Weinberger Cc: Arthur Marsh , LKML , linux-ext4@vger.kernel.org Subject: Re: ext3/ext4 filesystem corruption under post 5.1.0 kernels Message-ID: <20190511220659.GB8507@mit.edu> Mail-Followup-To: Theodore Ts'o , Richard Weinberger , Arthur Marsh , LKML , linux-ext4@vger.kernel.org References: <48BA4A6E-5E2A-478E-A96E-A31FA959964C@internode.on.net> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-ext4-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-ext4@vger.kernel.org On Sat, May 11, 2019 at 02:43:16PM +0200, Richard Weinberger wrote: > [CC'in linux-ext4] > > On Sat, May 11, 2019 at 1:47 PM Arthur Marsh > wrote: > > > > > > The filesystem with the kernel source tree is the root file system, ext3, mounted as: > > > > /dev/sdb7 on / type ext3 (rw,relatime,errors=remount-ro) > > > > After the "Compressing objects" stage, the following appears in dmesg: > > > > [ 848.968550] EXT4-fs error (device sdb7): ext4_get_branch:171: inode #8: block 30343695: comm jbd2/sdb7-8: invalid block > > [ 849.077426] Aborting journal on device sdb7-8. > > [ 849.100963] EXT4-fs (sdb7): Remounting filesystem read-only > > [ 849.100976] jbd2_journal_bmap: journal block not found at offset 989 on sdb7-8 This indicates that the extent tree blocks for the journal was found to be corrupt; so the journal couldn't be found. > > # fsck -yv > > fsck from util-linux 2.33.1 > > e2fsck 1.45.0 (6-Mar-2019) > > /dev/sdb7: recovering journal > > /dev/sdb7 contains a file system with errors, check forced. But e2fsck had no problem finding the journal. > > Pass 1: Checking inodes, blocks, and sizes > > Pass 2: Checking directory structure > > Pass 3: Checking directory connectivity > > Pass 4: Checking reference counts > > Pass 5: Checking group summary information > > Free blocks count wrong (4619656, counted=4619444). > > Fix? yes > > > > Free inodes count wrong (15884075, counted=15884058). > > Fix? yes And no other significant problems were found. (Ext4 never updates or relies on the summary number of free blocks and free inodes, since updating it is a scalability bottleneck and these values can be calculated from the per block group free block/inodes count. So the fact that e2fsck needed to update them is not an issue.) So that implies that we got one set of values when we read the journal inode when attempting to mount the file system, and a *different* set of values when e2fsck was run. Which makes means that we need consider the possibility that the problem is below the file system layer (e.g., the block layer, device drivers, etc.). > > /dev/sdb7: ***** FILE SYSTEM WAS MODIFIED ***** > > > > Other times, I have gotten: > > > > "Inodes that were part of a corrupted orphan linked list found." > > "Block bitmap differences:" > > "Free blocks sound wrong for group" > > This variety of issues also implies that the issue may be in the data read by the file system, as opposed to an issue in the file system. Arthur, can you give us the full details of your hardware configuration and your kernel config file? Also, what kernel git commit ID were you testing? - Ted