Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp1945554imm; Sat, 4 Aug 2018 14:49:48 -0700 (PDT) X-Google-Smtp-Source: AAOMgpdYgdU99jSY/bnBZHU21QruB3D7O41xl39YAy9HuAaQ16RpDQP29oPb3Um0cmp+HM7hLyOH X-Received: by 2002:a62:229a:: with SMTP id p26-v6mr10484444pfj.53.1533419388559; Sat, 04 Aug 2018 14:49:48 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1533419388; cv=none; d=google.com; s=arc-20160816; b=MCu3h6mKqbAiZbomffocuJytC5ZMH6w+DFyMRnSVGrpehzdblCZ6aXwQQ6ovt5NrdT 2yj/rwVc7ug2EzAsfsglzmywyBpnbAtPbTeZSaE1JpkcuksmoTEgYcrPleT6fCCJrExT q94dmfdJEs15HS4aU+nqNBQFky+7pjtoTvlfmxKg0AalyK1UY9A5fPCk6wLdhqDc9Hgb KwXXVxu2nI0ZHqzVnSUviWKAzprsGaykSK5j2ib8zUq592BXm3n8odfqoIATGEQttlBL RG5OHRYIsBqyMt/koFghXj6Z15RVnRW3wKQAdRzF+TBF4MUh5UExUVy5kjQl1zl8BzWs 3T2w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:to :from:date:arc-authentication-results; bh=vpXykt0PrpngP2kDkL2Z6sC18F8jcl5miBgH/yRPRDY=; b=ja9faxgXOuwWrowQuKPoVZjAMOPD6MPMihQy11Eu1HfIYjIW3n9on3slL5XWN4/x2c 9NFZLlLAAaEfYRuvyckMC2ARL7f0eIu3DvXN2YiRaghWv24Do3dqsZngVL7LUte5Pwav WeFAZyO9T72N54GjZDf6RKGDuU68diZ2HdXsfbseCb1RTeDf206IBFuGo5qVrdZ/rFlA rVcojF4eMAygMKSOj7fBnqj1U9ITqiG80Ho/THs04PyaGRQYKDHdlW/f78UP67St15PY 9NR9EGvvlwE2jO9tJnpnTTrvXcMGWHyWMH02nDEoopla+naR7acY+ACe5zY7SFEt9HN4 riIw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id w185-v6si8460237pgb.599.2018.08.04.14.49.34; Sat, 04 Aug 2018 14:49:48 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729688AbeHDXuT (ORCPT + 99 others); Sat, 4 Aug 2018 19:50:19 -0400 Received: from mx3-rdu2.redhat.com ([66.187.233.73]:33278 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1728883AbeHDXuS (ORCPT ); Sat, 4 Aug 2018 19:50:18 -0400 Received: from smtp.corp.redhat.com (int-mx06.intmail.prod.int.rdu2.redhat.com [10.11.54.6]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id 85AE6400E978; Sat, 4 Aug 2018 21:48:13 +0000 (UTC) Received: from localhost (unknown [10.18.25.149]) by smtp.corp.redhat.com (Postfix) with ESMTPS id 35408215670D; Sat, 4 Aug 2018 21:48:13 +0000 (UTC) Date: Sat, 4 Aug 2018 17:48:12 -0400 From: Mike Snitzer To: "Theodore Y. Ts'o" , Zdenek Kabelac , Linus Torvalds , Jens Axboe , Sagi Grimberg , Linux Kernel Mailing List , linux-block , dm-devel@redhat.com, Ilya Dryomov , wgh@torlan.ru Subject: Re: LVM snapshot broke between 4.14 and 4.16 Message-ID: <20180804214812.GA11320@redhat.com> References: <20180803185431.GB3258@redhat.com> <20180803193037.GA4581@redhat.com> <20180804052033.GA4461@thunk.org> <102c2d75-f768-a649-52c3-bac6f0ca738d@redhat.com> <20180804162205.GB4461@thunk.org> <20180804181845.GA10499@redhat.com> <20180804193753.GC4461@thunk.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180804193753.GC4461@thunk.org> User-Agent: Mutt/1.5.21 (2010-09-15) X-Scanned-By: MIMEDefang 2.78 on 10.11.54.6 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.7]); Sat, 04 Aug 2018 21:48:13 +0000 (UTC) X-Greylist: inspected by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.7]); Sat, 04 Aug 2018 21:48:13 +0000 (UTC) for IP:'10.11.54.6' DOMAIN:'int-mx06.intmail.prod.int.rdu2.redhat.com' HELO:'smtp.corp.redhat.com' FROM:'msnitzer@redhat.com' RCPT:'' Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sat, Aug 04 2018 at 3:37pm -0400, Theodore Y. Ts'o wrote: > On Sat, Aug 04, 2018 at 02:18:47PM -0400, Mike Snitzer wrote: > > > Fair enough. I don't think I would consider that makes dm-snapshot a > > > "steaming pile". For me, protection against data loss is Job One. > > > > What's your point Ted? Do you have _any_ intention of actually using > > anything DM or is this just a way for you to continue to snipe at it? > > My point is that putting down dm-snapshot by calling it a "steaming > pile" because it can't perform well on workloads that weren't a > requirement when it was first designed is neither accurate nor fair. As a person who has written a fair amount of dm-snapshot code I'm free to have my opinion. It is slow. Period. If it works for you, great. But it isn't adequate for most modern usecases I'm aware of. > And steering users away from it by badmouthing to a technology which > ever so often, requires enterprise support to recover, is something > that *I* at least would classify as "marginal". dm-snapshot is slow, as such I will badmouth it because dm-thinp is a much more capable replacement. I have to maintain both, so I'm free to steer people according to my experience. > Maybe it's just that file system developers have higher standards. I > know that Dave Chinner at LSF/MM commented that using some of the > things he has been developing for XFS subvolume support might be > interesting precisely because it could provide some of the facilities > currently provided by thin provisioning (perhaps not all of them; I'm > not sure how well his virtual block remapping layer would handle > hundreds of snapshots) but with file system tools which have a lot > more seasoning and where people have spent a lot of effort on data > recovery tools. Even new XFS features will have bugs. Just because XFS's fsck is historically robust, oversights and bugs happen when new features are added. And AFAIK future XFS would be looking to leverage DM thinp via its producer/consumer model. But this is going off on a tangent now. > In any case, I do use DM quite a lot. I use LVM2 and dm-snapshot (and > it's been working just *fine* for my use cases). I've wanted to use > dm-thin, but have been put off by it being labeled as experimental and > by some of the comments about how robust its recovery tools are. The Documentation was stale. I personally don't reference it so the need to update it got overlooked. > If there was documentation about how an advanced user/developer could > use low level tools to do manual repair of a thin pool when the > automated procedures didn't work, without having to pay $$$ to some > company for "enterprise support", I'd be a lot more willing to give it > a try. We could certainly improve out documentation for the use of thin_check and thin_repair. I know lvm2 has seen improvements to allow the metdata voulme to be activated in standalone mode (activate the metadata volume without the thin-pool or thin devices ontop) so that the thin_check and thin_repair tools can be used on it. I'd imagine you aren't aware of the lvm2 package's lvmthin manpage? See: man lvmthin It'd likely be one of the documentation locations to see improvements. I'll talk with others about where we can improve docs, the manpages for thin_check and thin_repair are _very_ sparse. Anyway, room for improvement for sure. But demonizing "enterprise support" like you don't provide that to your stake holders is bizarre. I again was candid and forthcoming about what drives/catches the need for thin_check and thin_repair fixes and improvements: it just so happens that "enterprise" deployments make use of DM-thinp and have exposed the need for support more than community users. I'm not saying I, or other DM thinp oriented developers, wouldn't provided the same type of support if a community user like yourself hit a problem. It is just that enterprise users are the prontlines of advanced usage and scale. Deploying hundreds of Gluster servers with every brick layered ontop of DM thinp historically exposed issues. Those issues get fixed and benefit everyone. This discussion, and my need to explain how "enterprise support" drives innovation, is so.. weird. > Sorry, I just care a *lot* about data robustness. You aren't alone. > > Maybe read your email from earlier today before repeating yourself: > > https://lkml.org/lkml/2018/8/4/366 > > Apologies. I'm currently staying at an Assisted Living facility > keeping an eye on my Dad this week, and the internet at the senior > living center has been.... marginal. As a result I've been reading my > e-mail in batches, and so I hadn't seen the e-mail you had posted > earlier before I had sent my reply. Best wishes. I've been dealing with stresses in my personal life myself. Might explain why we've had the awkwardness in this thread. Mike