Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751634AbcCFXdk (ORCPT ); Sun, 6 Mar 2016 18:33:40 -0500 Received: from mail-wm0-f53.google.com ([74.125.82.53]:35983 "EHLO mail-wm0-f53.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750885AbcCFXde (ORCPT ); Sun, 6 Mar 2016 18:33:34 -0500 Date: Mon, 7 Mar 2016 02:33:30 +0300 From: "Kirill A. Shutemov" To: Dave Chinner Cc: Hugh Dickins , Dave Hansen , linux-fsdevel@vger.kernel.org, linux-api@vger.kernel.org, "Kirill A. Shutemov" , Andrea Arcangeli , Andrew Morton , Vlastimil Babka , Christoph Lameter , Naoya Horiguchi , Jerome Marchand , Yang Shi , Sasha Levin , linux-kernel@vger.kernel.org, linux-mm@kvack.org Subject: Re: THP-enabled filesystem vs. FALLOC_FL_PUNCH_HOLE Message-ID: <20160306233330.GA23851@node.shutemov.name> References: <1457023939-98083-1-git-send-email-kirill.shutemov@linux.intel.com> <20160304112603.GA9790@node.shutemov.name> <56D9C882.3040808@intel.com> <20160304230548.GC11282@dastard> <20160304232412.GC12498@node.shutemov.name> <20160305223811.GD11282@dastard> <20160306003034.GA13704@node.shutemov.name> <20160306230336.GE11282@dastard> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20160306230336.GE11282@dastard> User-Agent: Mutt/1.5.23.1 (2014-03-12) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1382 Lines: 30 On Mon, Mar 07, 2016 at 10:03:36AM +1100, Dave Chinner wrote: > On Sun, Mar 06, 2016 at 03:30:34AM +0300, Kirill A. Shutemov wrote: > > On Sun, Mar 06, 2016 at 09:38:11AM +1100, Dave Chinner wrote: > > > And it's not just hole punching that has this problem. Direct IO is > > > going to have the same issue with invalidation of the mapped ranges > > > over the IO being done. XFS already WARNs when page cache > > > invalidation fails with EBUSY in direct IO, because that is > > > indicative of an application with a potential data corruption vector > > > and there's nothing we can do in the kernel code to prevent it. > > > > My current understanding is that for filesystems with persistent storage, > > in order to make THP any useful, we would need to implement writeback > > without splitting the huge page. > > Algorithmically it is no different to filesytem block size < page > size writeback. > > > At the moment, I have no idea how hard it would be.. > > THP support would effectively require us to remove PAGE_CACHE_SIZE > assumptions from all of the filesystem and buffer code. That's a > large chunk of work e.g. fs/buffer.c and any filesystem that uses > bufferheads for tracking filesystem block state through the page > cache. I'll try to learn more about the code before the summit. I guess it's something worth descussion in person. -- Kirill A. Shutemov