Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp2822146imu; Sun, 23 Dec 2018 08:33:17 -0800 (PST) X-Google-Smtp-Source: ALg8bN6xFx2fkWfGBIeU5Vavj1Kf+TcSi9eRfAwVcu/GAPIiIpdUn9nlicJhTHSwno/gAuvdzQg0 X-Received: by 2002:a63:7044:: with SMTP id a4mr9453961pgn.359.1545582796982; Sun, 23 Dec 2018 08:33:16 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1545582796; cv=none; d=google.com; s=arc-20160816; b=hk7FLMHvfIPphmVI3zINfGJX27C0yRoPl4cqush0LH4aSaN/TmUkTxwvla+f1+Z6SH GENpFJr6qjWW52HEzf7b15jmrH+VxG8V5LAIgDa3WnzpmhLLAGSDQ3TAtVfDjMCRWVig aU/HZOpZtJrelAEKIwvfkI0VijPCL2fa//LiIV8HIf1WJcV5rNmiVRZ2UOY+nkNKfxPx 0kuH6ue79iQmZYF/XNQH/5abhWzbFlJeKp0yhKQn8W1kULU7mDW9hSDvo0SmDcSbfMiN U0DNH+3pqTA5O01X+o9gquBdepLzE2PlBe6AtvRlpu0gkXX5kZLlRM5dNDkval8pAgQL bZfw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:mail-followup-to :message-id:subject:cc:to:from:date; bh=xQpRKNsByWxFraTz6tUDza/Tl+HztU/geSWfKI/v9x4=; b=AL4BXOngPK1UiUwXDFR5uJGN4dGkxQILDcMbJc2gCkKVrul90A9MwuDgf1ME7JtjRL +pRIWYq3cLQrljTkh0nNTjdHCoNwpQbgHdLzZXLaM5HF4ZNzCez5NOKh8/LKFXnpNReQ mFlwvNBmoVOckSWXfjABVK88nkRGEqqknVb+YtgjxNqfW+In5y8jUhAAUXf+7HFfLIqF EBa1h5fUoPVBmmd2G0RdUVCfygnBpBilDrAaFtipIgb/Spf7fGCP/R9QcAZeE5yz7aLW +hC1P/1KCRjSwZzHzk0sQX9jJhp0SlRxhRKQXq95Y7HYeke74+QvU7+jyvN50aAwggOA DYJg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 5si26526915plx.391.2018.12.23.08.33.01; Sun, 23 Dec 2018 08:33:16 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2391081AbeLVVzn (ORCPT + 99 others); Sat, 22 Dec 2018 16:55:43 -0500 Received: from dmz-mailsec-scanner-3.mit.edu ([18.9.25.14]:65486 "EHLO dmz-mailsec-scanner-3.mit.edu" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2390323AbeLVVzm (ORCPT ); Sat, 22 Dec 2018 16:55:42 -0500 X-AuditID: 1209190e-2ffff700000054d4-05-5c1dbb79995e Received: from mailhub-auth-1.mit.edu ( [18.9.21.35]) (using TLS with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by dmz-mailsec-scanner-3.mit.edu (Symantec Messaging Gateway) with SMTP id F5.B9.21716.A7BBD1C5; Fri, 21 Dec 2018 23:20:10 -0500 (EST) Received: from outgoing.mit.edu (OUTGOING-AUTH-1.MIT.EDU [18.9.28.11]) by mailhub-auth-1.mit.edu (8.14.7/8.9.2) with ESMTP id wBM4HF1V021600; Fri, 21 Dec 2018 23:17:34 -0500 Received: from callcc.thunk.org (96-72-102-169-static.hfc.comcastbusiness.net [96.72.102.169] (may be forged)) (authenticated bits=0) (User authenticated as tytso@ATHENA.MIT.EDU) by outgoing.mit.edu (8.14.7/8.12.4) with ESMTP id wBM4HCmQ028383 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Fri, 21 Dec 2018 23:17:13 -0500 Received: by callcc.thunk.org (Postfix, from userid 15806) id 6102A7A460A; Fri, 21 Dec 2018 23:17:12 -0500 (EST) Date: Fri, 21 Dec 2018 23:17:12 -0500 From: "Theodore Y. Ts'o" To: Linus Torvalds Cc: Christoph Hellwig , Dave Chinner , "Darrick J. Wong" , Eric Biggers , linux-fscrypt@vger.kernel.org, linux-fsdevel , linux-ext4@vger.kernel.org, linux-f2fs-devel@lists.sourceforge.net, linux-integrity@vger.kernel.org, Linux List Kernel Mailing , Jaegeuk Kim , Victor Hsieh , Chandan Rajendra Subject: Re: [PATCH v2 01/12] fs-verity: add a documentation file Message-ID: <20181222041712.GC26547@mit.edu> Mail-Followup-To: "Theodore Y. Ts'o" , Linus Torvalds , Christoph Hellwig , Dave Chinner , "Darrick J. Wong" , Eric Biggers , linux-fscrypt@vger.kernel.org, linux-fsdevel , linux-ext4@vger.kernel.org, linux-f2fs-devel@lists.sourceforge.net, linux-integrity@vger.kernel.org, Linux List Kernel Mailing , Jaegeuk Kim , Victor Hsieh , Chandan Rajendra References: <20181219071420.GC2628@infradead.org> <20181219021953.GD31274@dastard> <20181219193005.GB6889@mit.edu> <20181219213552.GO6311@dastard> <20181220220158.GC2360@mit.edu> <20181221070447.GA21687@infradead.org> <20181221154714.GA26547@mit.edu> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) X-Brightmail-Tracker: H4sIAAAAAAAAA01Sa0gUURTmzsxO4+LUbXTxalqwKZW1ZvbgFhUVQUNFGf0qhZzaq7u2O9rM aikRUtnDHmwRlVIm2w+17OEamdpLMUt7EGoRpiW2hgmVtmlg9pjRyv6dc74Xh3M4WvhqCOPs sososuQws0ZGCDBNtWTXRCTGHq2fgI+eqjXgjs+VBnyj4Q3AV24P0/ix20Nh37UCGucXtrO4 2bMK9xYOMvj2nUYGP3rWMA63VJ9j8du8PBp3Hf84DtdX1LPLJohNHiQWeTPEipJo0XvpMCs+ OvudETvrKhixpshPif3dbYy4/95LVvR7J8cbNxsXW4nDnkmU2UuTjLaByiqQ3jVt17OBfjoH NE3JAwEcgvPQ19ojdB4wcgIso1Bx9SdmtCkH6HX+eXa06aPQmf5CSpcIUEEvh920XjMwCpW3 lbB6zcLpqKz7B6PXwTAO+XtaDbqYhq0M6n3cBHQgCC5HF7s+aUYcx8NZ6IFvic7h4Q8G+Q5+ +5P2lELX3p8YEfBwImrM94240jAavfr5YURMw0mo+CenjwPgBtR34blBH5tgJPIXQTcQCv4T F/wnLhgTFwH6EoiwOrMtTsnuUMk2i7pNkmWiWObGOO2uGGLN8IKRg4aOvwVO5qypA5AD5kDe PRieKBikTDXLWQdCOcps4iNPRyQK47emWbNskmrbomQ4iFoHEEebg/lDSzQ6b5WysomS9hea xDHmEH4o+F2CAFMkF9lOSDpR/qLhHGdGfGeVZjpRISlkV7Ld4RqDKS5ANw/UzFOrNQ6vpktO 1Z4yijcBC/fq/q8ztMDIaTIJC+F57RsFqJNsGfI/H/1Zk8Tk0l4Qoq0VxNv1uEDtlf859Woh lBZiXKFvoLqkMSgsB8yvjJSvXt6y3rY69uFQ7s2WmU8c1xcm34rOJXLugh5X66b8vlPHepp3 bpaudniM+wZnDq0VyxY1bjLFx8VXzfDP2nHhuXFleyr7ogEnfTkwPXXdxgU1wf49wyUDu/fi 4YXSjTulbWSwqtYXPsNrcjdePr4+pj8qMyj1dUKp597dFo+ZUW3SnGhaUaXfDRnL54cDAAA= Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Dec 21, 2018 at 11:13:07AM -0800, Linus Torvalds wrote: > > I do agree that your particular model is pretty damn broken in lots of ways. > > Why is it filesystem specific? If the whole point is that the file > itself has its own verification data (which I like), then I don't see > why this is then documented as some filesystem-specific layout model. > That's complete and utter garbage. > > In other words: either the model is that the file *itself* contains > its own merkle tree that validates the file, or it isn't. You can't > have it two ways. No silly "layout changes when you apply the hash" > garbage. That's just crazy talk and invalidates the whole model. Userspace applications which are reading the file aren't going to be expecting Merkle tree. For example, one of the use cases is Android APK files, which are essentially ZIP files. ZIP files can be parsed both from the front-end (streaming), or by looking for the complete directory of all of the files in the ZIP file by starting at the end of the file and moving backwards. If the Merkle tree was visible to userspace programs that are opening and reading the file, it would confuse them mightily. So what we do for ext4 and f2fs is make the Merkle tree invisible; if userspace stats the file, st_size will return size of the original "data" file, and reading beyond the st_size from userspace will behave like reading beyond EOF. From the *file system's* perspective, though, the metadata blocks are part of the file. There's just a difference between the userspace visible EOF and the file system's conception of EOF. I don't consider this a "layout change", and I personally believe this should be just *fine* for all file systems. The XFS developers are convinced that this is horrific, and no one sane should do this. OK, call me insane. But it works, and I think it's elegant and clean. So if *they* want to use some other layout, where the Merkle blocks are stored in some Alternate Data Stream, ala NTFS --- they are *free* to do that. It will require more work, and at that point, it will require a layout change. But it's Dave and Christoph who are insisting on doing that; not me! > And honestly, I still think that it's very odd to add the merge data > to the end, when the filesystem already supports xattrs. It would have > made much more sense to just make one xattr contain the merkle tree > validation data. The problem is that xattrs are designed to be accessed via a set/get interface, are currently limited, IIRC at 32k. The max size of an APK is 300 megabytes; and the Merkle tree for a file that size will be about 2.3 megabytes. That's way too big to store as an xattr; certainly using the existing xattr interfaces. And it's also bigger than most file systems can handle as xattrs today --- because they've been optimzied for relatively small sizes, for things like SELinux labels and ACL structures. > So why is this sold as some unholy mess of "filesystem-specific" and > "generic"? That part just annoys the hell out of me. Why isn't this > sold as an *actual* generic model, where you just say "append the > merkle tree to the file, then enable verity testing of the end result > and validate the top-level hash". That was the original way it was sold, but Cristoph and Dave have NACK'ed it in that form. The common fsverity code which is generic to ext4 and f2fs does treat it that way, with the note that we "lie" to userspace about is the size of the file and where the EOF is. Dave and Cristoph have declaimed strongly that this is this layout choice is horrible, and filesystem specific, and XFS could never do it that way. I don't understand why, but they are the XFS experts. So if they want to do something else, what I've been trying to point out is that they can do that, using the existing interface. > So what's the excuse for doing the crazy odd "let's just support one > single filesystem" model? Android devices use both ext4 and f2fs; it's the manufacturer's choice. So we wanted fs-verity to support both. And we didn't want to duplicate code across ext4 and f2fs; hence trying to put common code in fs/verity. So we aren't supporting one file system out of the gate; we're supporting two. Whether XFS wants to implement fs-verity is purely XFS's choice. XFS has chosen not to support fscrypt, which is currently used by ext4, f2fs, and ubifs, and both fscrypt's and fs-verity's initial use case has been for Android, which is not an area where XFS has proven to be a common choice. So I was not really expecting that they would have any interest in fs-verity. But they seem to have very strong opinions about how they would want to implement it, and it's different from what we have in the current "generic code shared by ext4 and f2fs". I was trying to show that even if they wanted to do things in this different way --- and I don't understand why it's so important to them --- it would be possible to do so. Cheers, - Ted