Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752130AbbEGRwl (ORCPT ); Thu, 7 May 2015 13:52:41 -0400 Received: from mail-wg0-f46.google.com ([74.125.82.46]:35301 "EHLO mail-wg0-f46.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751736AbbEGRwh (ORCPT ); Thu, 7 May 2015 13:52:37 -0400 Date: Thu, 7 May 2015 19:52:30 +0200 From: Ingo Molnar To: Dan Williams Cc: Linus Torvalds , Linux Kernel Mailing List , Boaz Harrosh , Jan Kara , Mike Snitzer , Neil Brown , Benjamin Herrenschmidt , Dave Hansen , Heiko Carstens , Chris Mason , Paul Mackerras , "H. Peter Anvin" , Christoph Hellwig , Alasdair Kergon , "linux-nvdimm@lists.01.org" , Mel Gorman , Matthew Wilcox , Ross Zwisler , Rik van Riel , Martin Schwidefsky , Jens Axboe , "Theodore Ts'o" , "Martin K. Petersen" , Julia Lawall , Tejun Heo , linux-fsdevel , Andrew Morton Subject: Re: [PATCH v2 00/10] evacuate struct page from the block layer, introduce __pfn_t Message-ID: <20150507175230.GB21781@gmail.com> References: <20150506200219.40425.74411.stgit@dwillia2-desk3.amr.corp.intel.com> <20150507090217.GA4467@gmail.com> <20150507144225.GA20491@gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.23 (2014-03-12) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1439 Lines: 39 * Dan Williams wrote: > > That looks like a layering violation and a mistake to me. If we > > want to do direct (sector_t -> sector_t) IO, with no serialization > > worries, it should have its own (simple) API - which things like > > hierarchical RAID or RDMA APIs could use. > > I'm wrapped around the idea that __pfn_t *is* that simple api for > the tiered storage driver use case. [...] I agree. (see my previous mail) > [...] For RDMA I think we need struct page because I assume that > would be coordinated through a filesystem an truncate() is back in > play. So I don't think RDMA is necessarily special, it's just a weirdly programmed DMA request: - If it is used internally by an exclusively managed complex storage driver, then it can use low level block APIs and pfn_t. - If RDMA is exposed all the way to user-space (do we have such APIs?), allowing users to initiate RDMA IO into user buffers, then (the user visible) buffer needs struct page backing. (which in turn will then at some lower level convert to pfns.) That's true for both regular RAM pages and mmap()-ed persistent RAM pages as well. Thanks, Ingo -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/