Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752089AbbEIV5E (ORCPT ); Sat, 9 May 2015 17:57:04 -0400 Received: from ipmail05.adl6.internode.on.net ([150.101.137.143]:11696 "EHLO ipmail05.adl6.internode.on.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751510AbbEIV47 (ORCPT ); Sat, 9 May 2015 17:56:59 -0400 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: A2BACABIgk5VPPDOLHlcgw+BMoZMrFABAQEBAQEGkXUJh1cCAgEBAoEkORQBAQEBAQEBBgEBAQFBP4QhAQEEOhwjEAgDGAklDwUlAwcaE4grxxIBAQEHAgEfGIV+hSOEGG0HhC0FnR+MaYlvgQSBKBwVgU8sMYEMgToBAQE Date: Sun, 10 May 2015 07:56:33 +1000 From: Dave Chinner To: Rik van Riel Cc: Linus Torvalds , John Stoffel , Ingo Molnar , Dave Hansen , Dan Williams , Linux Kernel Mailing List , Boaz Harrosh , Jan Kara , Mike Snitzer , Neil Brown , Benjamin Herrenschmidt , Heiko Carstens , Chris Mason , Paul Mackerras , "H. Peter Anvin" , Christoph Hellwig , Alasdair Kergon , "linux-nvdimm@lists.01.org" , Mel Gorman , Matthew Wilcox , Ross Zwisler , Martin Schwidefsky , Jens Axboe , "Theodore Ts'o" , "Martin K. Petersen" , Julia Lawall , Tejun Heo , linux-fsdevel , Andrew Morton Subject: Re: [PATCH v2 00/10] evacuate struct page from the block layer, introduce __pfn_t Message-ID: <20150509215633.GN4327@dastard> References: <554BA748.9030804@linux.intel.com> <20150507191107.GB22952@gmail.com> <554CBE17.4070904@redhat.com> <20150508140556.GA2185@gmail.com> <21836.51957.715473.780762@quad.stoffel.home> <554CEB5D.90209@redhat.com> <554D78C4.30607@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <554D78C4.30607@redhat.com> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1421 Lines: 34 On Fri, May 08, 2015 at 11:02:28PM -0400, Rik van Riel wrote: > On 05/08/2015 09:14 PM, Linus Torvalds wrote: > > On Fri, May 8, 2015 at 9:59 AM, Rik van Riel wrote: > >> > >> However, for persistent memory, all of the files will be "in memory". > > > > Yes. However, I doubt you will find a very sane rw filesystem that > > then also makes them contiguous and aligns them at 2MB boundaries. > > > > Anything is possible, I guess, but things like that are *hard*. The > > fragmentation issues etc cause it to a really challenging thing. > > The TLB performance bonus of accessing the large files with > large pages may make it worthwhile to solve that hard problem. FWIW, for DAX ththe filesystem allocation side is already mostly solved - this is just an allocation alignment hint, analogous to RAID stripe alignment. We don't need to reinvent the wheel here. i.e. On XFS, use a 2MB stripe unit for the fs, a 2MB extent size hint for files you want to use large pages on and you'll get 2MB sized and aligned allocations from the filesystem for as long as there are such freespace regions available. Cheers, Dave. -- Dave Chinner david@fromorbit.com -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/