Received: by 2002:a05:6a10:6744:0:0:0:0 with SMTP id w4csp31942pxu; Mon, 5 Oct 2020 23:04:33 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzQ0x6uU3fOjTQB1QBZICizFZqDQ6phPBVcSLX9wJZipvZjrZ1XnmSAlLYjWLzkz0vVqfE3 X-Received: by 2002:a17:906:ad8a:: with SMTP id la10mr1443075ejb.176.1601964273171; Mon, 05 Oct 2020 23:04:33 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1601964273; cv=none; d=google.com; s=arc-20160816; b=fY56g2HHOzHfhIpvP6GxJrG8bzUzkiTUxVJxzxTCrguy2fmlYJ0Ayi98d9YL3t5vFx Ck8cCxZOHuzZyQ6TTD0Kzj52nCqu1pHhO1pZ8ZgEVFcdLb6uBUquRTcA32dU4dZQDO9M sL/nCQ+YUnWBIF+0x8PW/5Cq3S0HPwluW6qziOdtMSIMDd8vNq4XR8j1dWWDoZ4xR2fs imANl48ADUm/ESvmbdHXCKF7khoQMIn/6QB7pqvU6qBrKqEI1JgQIBSCXcrhCc+TR5oa hisQS32OIo9GOA8tiZrdzF6YFp5GOU8wa0XrcPb1kgK2x/Tyn5DyUMdyZ66rkE5CMbX6 +/8A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date; bh=mWaZVGrF/aQUlBbBIIXcUVHrhCrr3lpdxdBs6Q5ZFPQ=; b=pcCayo3578YvC9Ar4BMczDuCFIhWN3xmDi/7Z6bYugUJrHumSppNqnYzXYSuIAqpaS C+AOO7IqeaXxEaZgXtZyed95qDcaiOROi56pQEgW25pxy7os/xul44vNDet+NYFMEZEs r2nwj9LKADYVG3udTCxg+xfd69wTLx5p8PxLayLK13z4EFx4sx04ICHpiu/E9Q4zogkz Vdn8BMJmYlcW8on9NJBHYvDO2yhqw0LIYAN+itiIvvMrArEccOkI2lzazlo5QNFVFieK 2H8YVdUScDP48gBvPMz2xIJgIGE1V2PtsGheXRlzFfyJ5nWCrVaqqMr9j0r+nBQJrQRW sETg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-ext4-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id e9si1782821edl.154.2020.10.05.23.03.59; Mon, 05 Oct 2020 23:04:33 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-ext4-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-ext4-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726849AbgJFGDf (ORCPT + 99 others); Tue, 6 Oct 2020 02:03:35 -0400 Received: from relay12.mail.gandi.net ([217.70.178.232]:37625 "EHLO relay12.mail.gandi.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726761AbgJFGDf (ORCPT ); Tue, 6 Oct 2020 02:03:35 -0400 X-Greylist: delayed 3620 seconds by postgrey-1.27 at vger.kernel.org; Tue, 06 Oct 2020 02:03:34 EDT Received: from localhost (50-39-163-217.bvtn.or.frontiernet.net [50.39.163.217]) (Authenticated sender: josh@joshtriplett.org) by relay12.mail.gandi.net (Postfix) with ESMTPSA id 2496B200006; Tue, 6 Oct 2020 06:03:29 +0000 (UTC) Date: Mon, 5 Oct 2020 23:03:27 -0700 From: Josh Triplett To: "Theodore Y. Ts'o" Cc: "Darrick J. Wong" , Linus Torvalds , Andreas Dilger , Jan Kara , Linux Kernel Mailing List , linux-ext4@vger.kernel.org Subject: Re: ext4 regression in v5.9-rc2 from e7bfb5c9bb3d on ro fs with overlapped bitmaps Message-ID: <20201006060327.GA9227@localhost> References: <20201005081454.GA493107@localhost> <20201005173639.GA2311765@magnolia> <20201006003216.GB6553@localhost> <20201006025110.GJ49559@magnolia> <20201006031834.GA5797@mit.edu> <20201006050306.GA8098@localhost> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20201006050306.GA8098@localhost> Precedence: bulk List-ID: X-Mailing-List: linux-ext4@vger.kernel.org On Mon, Oct 05, 2020 at 10:03:13PM -0700, Josh Triplett wrote: > On Mon, Oct 05, 2020 at 11:18:34PM -0400, Theodore Y. Ts'o wrote: > > What Josh is proposing I'm pretty sure would also break "e2fsck -E > > unshare_blocks", so that's another reason not to accept this as a > > valid format change. > > The kernel already accepted this as a valid mountable filesystem format, > without a single error or warning of any kind, and has done so stably > for years. > > > As far as I'm concerned, contrib/e2fsdroid is the canonical definition > > of how to create valid file systems with shared_blocks. > > I'm not trying to create a problem here; I'm trying to address a whole > family of problems. I was generally under the impression that mounting > existing root filesystems fell under the scope of the kernel<->userspace > or kernel<->existing-system boundary, as defined by what the kernel > accepts and existing userspace has used successfully, and that upgrading > the kernel should work with existing userspace and systems. If there's > some other rule that applies for filesystems, I'm not aware of that. > (I'm also not trying to suggest that every random corner case of what > the kernel *could* accept needs to be the format definition, but rather, > cases that correspond to existing userspace.) > > It wouldn't be *impossible* to work around this, this time; it may be > possible to adapt the existing userspace to work on the new and old > kernels. My concern is, if a filesystem format accepted by previous > kernels can be rejected by future kernels, what stops a future kernel > from further changing the format definition or its strictness > (co-evolving with one specific userspace) and causing further > regressions? > > I don't *want* to rely on what apparently turned out to be an > undocumented bug in the kernel's validator. That's why I was trying to > fix the issue in what seemed like the right way, by detecting the > situation and turning off the validator. That seemed like it would fully > address the issue. If it would help, I could also supply a tiny filesystem > image for regression testing. > > I'm trying to figure out what solution you'd like to see here, as long > as it isn't "any userspace that isn't e2fsdroid can be broken at will". > I'd be willing to work to adapt the userspace bits I have to work around > the regression, but I'd like to get this on the radar so this doesn't > happen again. To clarify something further: I'm genuinely not looking to push hard on the limits or corners of the kernel/userspace boundary here, nor do I want to create an imposition on development. I'm happy to attempt to be a little more flexible than most userspace. I'm trying to make substantial, non-trivial use of the userspace side of a kernel/userspace boundary, and within reason, I need to rely on the kernel's stability guarantees. I'm relying on the combination of Documentation/filesystems/ext4 and fs/ext4 as the format documentation. The first time I discovered this issue was in doing some "there's about to be a new kernel release" regression testing for 5.9, in which it created a debugging adventure to track down what the problem was. I'd like to find a good way to report and handle this kind of thing going forward, if another issue like this arises. - Josh Triplett