Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp4254271imu; Fri, 30 Nov 2018 13:46:44 -0800 (PST) X-Google-Smtp-Source: AFSGD/U9RWCY5/u3AKwKA38DqJ6guh5XQqvky9uPPN0mDu3DgEgIySNAYqoFlqrcMbJ8w1bKWduV X-Received: by 2002:a17:902:b787:: with SMTP id e7mr7246262pls.246.1543614404346; Fri, 30 Nov 2018 13:46:44 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1543614404; cv=none; d=google.com; s=arc-20160816; b=DmTjAhDm8VEjOkTPA4EO2E+I16cgBENakINApv/2CwWVm9GF0SeZWgAqwkBSMJrT8T 7IjK8sDtXFK5J1KoRob5lUdt2WIbNQWLftp/417mseT4mcis0sTiVWfyorLqYqSi9aeC BRI+QAD0/0X5JlLlKV+Vje1gnYVaPTaWz/7WcdC+kY+Je0zVLxpNfO/KxafLFxQSYJu3 lBYG2OmzYnNROzOsqWFO+2P1Qo/hTWtPwNZr8Xe6y61nR7ZWFuQhAgx7NxoWDFwXdbs9 3yXcHytAWGanSOKxyyaI4IUAGHCsGB/OxOFhoahAbEPlHdqS2yA+2RRrW/U8QHDjC6DF A50A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=jV87ugRfnXQ16w300Eokjh9Cm/rxrVK6rWqGkN5xdMQ=; b=lrV6+0t8ZHF5rxO5OsjwVy8NXdO0bI+9MN/myA3YLI2d1nMmsCUAbp2p/BIAPEMpRZ NQHJMI81v/DdoGN10n2CHUBRMmra2ydnqx4cShED6et2vqTbISomwq9FptGMKTSFBOxP dglILmvjdaiaxmBSPe76JjLL+sAR4NVG52UbN6v1oswBSP4QMDwrDpF9IGN5lthGl3ee 1htE45oggJ+kztjz+zvDQgBVTsHhUhs09kRiTC7WkAO3YXvw3FHF5bIrT+fr/GdYzL76 fPCt7sU5EHSSkE6S6pLmK83MW0DAoeKcRhenJkVoxhtP1tyrSRawA4j+8I4Lsv0iKsrY Z58w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id p9si7132526pll.63.2018.11.30.13.46.28; Fri, 30 Nov 2018 13:46:44 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726812AbeLAI4d (ORCPT + 99 others); Sat, 1 Dec 2018 03:56:33 -0500 Received: from ipmail03.adl2.internode.on.net ([150.101.137.141]:11550 "EHLO ipmail03.adl2.internode.on.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725749AbeLAI4d (ORCPT ); Sat, 1 Dec 2018 03:56:33 -0500 Received: from ppp59-167-129-252.static.internode.on.net (HELO dastard) ([59.167.129.252]) by ipmail03.adl2.internode.on.net with ESMTP; 01 Dec 2018 08:15:50 +1030 Received: from dave by dastard with local (Exim 4.80) (envelope-from ) id 1gSqbc-0008IG-46; Sat, 01 Dec 2018 08:45:48 +1100 Date: Sat, 1 Dec 2018 08:45:48 +1100 From: Dave Chinner To: Greg KH Cc: Sasha Levin , stable@vger.kernel.org, linux-kernel@vger.kernel.org, Dave Chinner , "Darrick J . Wong" , linux-fsdevel@vger.kernel.org Subject: Re: [PATCH AUTOSEL 4.14 25/35] iomap: sub-block dio needs to zeroout beyond EOF Message-ID: <20181130214548.GO19305@dastard> References: <20181129060110.159878-1-sashal@kernel.org> <20181129060110.159878-25-sashal@kernel.org> <20181129121458.GK19305@dastard> <20181129124756.GA25945@kroah.com> <20181129224019.GM19305@dastard> <20181130082203.GA26830@kroah.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20181130082203.GA26830@kroah.com> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Nov 30, 2018 at 09:22:03AM +0100, Greg KH wrote: > On Fri, Nov 30, 2018 at 09:40:19AM +1100, Dave Chinner wrote: > > On Thu, Nov 29, 2018 at 01:47:56PM +0100, Greg KH wrote: > > > On Thu, Nov 29, 2018 at 11:14:59PM +1100, Dave Chinner wrote: > > > > > > > > Cherry picking only one of the 50-odd patches we've committed into > > > > late 4.19 and 4.20 kernels to fix the problems we've found really > > > > seems like asking for trouble. If you're going to back port random > > > > data corruption fixes, then you need to spend a *lot* of time > > > > validating that it doesn't make things worse than they already > > > > are... > > > > > > Any reason why we can't take the 50-odd patches in their entirety? It > > > sounds like 4.19 isn't fully fixed, but 4.20-rc1 is? If so, what do you > > > recommend we do to make 4.19 working properly? > > > > You coul dpull all the fixes, but then you have a QA problem. > > Basically, we have multiple badly broken syscalls (FICLONERANGE, > > FIDEDUPERANGE and copy_file_range), and even 4.20-rc4 isn't fully > > fixed. > > > > There were ~5 critical dedupe/clone data corruption fixes for XFS > > went into 4.19-rc8. > > Have any of those been tagged for stable? None, because I have no confidence that the stable process will do the necessary QA to validate that such a significant backport is regression and data corruption free. The backport needs to be done as a complete series when we've finished the upstream work because we can't test isolated patches adequately because fsx will fall over due to all the unfixed problems and not exercise the fixes that were backported. Further, we just had a regression reported in one of the commit that the autosel bot has selected for automatic backports. It has been uncovered by overlay which appears to do some unique things with the piece of crap that is do_splice_direct(). And Darrick just commented on #xfs that he's just noticed more bugs with FICLONERANGE and overlay. IOWs, we're still finding broken stuff in this code and we are fixing it as fast as we can - we're still putting out fires. We most certainly don't need the added pressure of having you guys create more spot fires by breaking stable kernels with largely untested partial backports and having users exposed to whacky new data corruption issues. So, no, it isn't tagged for stable kernels because "commit into mainline" != "this should be backported immediately". Backports of these fixes are largely going to be done largely as a function of time and resources, of which we have zero available right now. Doing backports right now is premature and ill-advised because we haven't finished finding and fixing all the bugs and regressions in this code. > > Right now the XFS developers don't have the time or resources > > available to validate stable backports are correct and regression > > fre because we are focussed on ensuring the upstream fixes we've > > already made (and are still writing) are solid and reliable. > > Ok, that's fine, so users of XFS should wait until the 4.20 release > before relying on it? :) Ok, Greg, that's *out of line*. I should throw the CoC at you because I find that comment offensive, condescending, belittling, denegrating and insulting. Your smug and superior "I know what is right for you" attitude is completely inappropriate, and a little smiley face does not make it acceptible. If you think your comment is funny, you've badly misjudged how much effort I've put into this (100-hour weeks for over a month now), how close I'm flying to burn out (again!), and how pissed off I am about this whole scenario. We ended up here because we *trusted* that other people had implemented and tested their APIs and code properly before it got merged. We've been severely burnt, and we've been left to clean up the mess made by other people by ourselves. Instead of thanks, what we get instead is "we know better" attitude and jokes implying our work is crap and we don't care about our users. That's just plain *insulting*. If anyone is looking for a demonstration of everything that is wrong with the Linux kernel development culture, then they don't need to look any further. > I understand your reluctance to want to backport anything, but it really > feels like you are not even allowing for fixes that are "obviously > right" to be backported either, even after they pass testing. Which > isn't ok for your users. It's worse for our users if we introduce regressions into stable kernels, which is exactly what this "obviously right" auto-backport would have done. -Dave. -- Dave Chinner david@fromorbit.com