Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753500AbbEHOpg (ORCPT ); Fri, 8 May 2015 10:45:36 -0400 Received: from mailrelay.lanline.com ([216.187.10.16]:41559 "EHLO mailrelay.lanline.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752220AbbEHOpb (ORCPT ); Fri, 8 May 2015 10:45:31 -0400 MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-ID: <21836.51957.715473.780762@quad.stoffel.home> Date: Fri, 8 May 2015 10:40:53 -0400 From: "John Stoffel" To: Ingo Molnar Cc: Rik van Riel , Dave Hansen , Dan Williams , Linus Torvalds , Linux Kernel Mailing List , Boaz Harrosh , Jan Kara , Mike Snitzer , Neil Brown , Benjamin Herrenschmidt , Heiko Carstens , Chris Mason , Paul Mackerras , "H. Peter Anvin" , Christoph Hellwig , Alasdair Kergon , "linux-nvdimm\@lists.01.org" , Mel Gorman , Matthew Wilcox , Ross Zwisler , Martin Schwidefsky , Jens Axboe , "Theodore Ts'o" , "Martin K. Petersen" , Julia Lawall , Tejun Heo , linux-fsdevel , Andrew Morton Subject: Re: [PATCH v2 00/10] evacuate struct page from the block layer, introduce __pfn_t In-Reply-To: <20150508140556.GA2185@gmail.com> References: <20150507173641.GA21781@gmail.com> <554BA748.9030804@linux.intel.com> <20150507191107.GB22952@gmail.com> <554CBE17.4070904@redhat.com> <20150508140556.GA2185@gmail.com> X-Mailer: VM 8.2.0b under 23.4.1 (x86_64-pc-linux-gnu) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2394 Lines: 53 >>>>> "Ingo" == Ingo Molnar writes: Ingo> * Rik van Riel wrote: >> The disadvantage is pretty obvious too: 4kB pages would no longer be >> the fast case, with an indirection. I do not know how much of an >> issue that would be, or whether it even makes sense for 4kB pages to >> continue being the fast case going forward. Ingo> I strongly disagree that 4kB does not matter as much: it is _the_ Ingo> bread and butter of 99% of Linux usecases. 4kB isn't going away Ingo> anytime soon - THP might look nice in benchmarks, but it does not Ingo> matter nearly as much in practice and for filesystems and IO it's Ingo> absolutely crazy to think about 2MB granularity. Ingo> Having said that, I don't think a single jump of indirection is a big Ingo> issue - except for the present case where all the pmem IO space is Ingo> mapped non-cacheable. Write-through caching patches are in the works Ingo> though, and that should make it plenty fast. >> Memory trends point in one direction, file size trends in another. >> >> For persistent memory, we would not need 4kB page struct pages >> unless memory from a particular area was in small files AND those >> files were being actively accessed. [...] Ingo> Average file size on my system's /usr is 12.5K: Ingo> triton:/usr> ( echo -n $(echo $(find . -type f -printf "%s\n") | Ingo> sed 's/ /+/g' | bc); echo -n "/"; find . -type f -printf "%s\n" Ingo> | wc -l; ) | bc 12502 Now go and look at your /home or /data/ or /work areas, where the endusers are actually keeping their day to day work. Photos, mp3, design files, source code, object code littered around, etc. Now I also have 12Tb filesystems with 30+ million files in them, which just *suck* for backup, esp incrementals. I have one monster with 85+ million files (time to get beat on users again ...) which needs to be pruned. So I'm not arguing against you, I'm just saying you need better more representative numbers across more day to day work. Running this exact same command against my home directory gets: 528989 So I'm not arguing one way or another... just providing numbers. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/