Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754702AbYBDR4i (ORCPT ); Mon, 4 Feb 2008 12:56:38 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752741AbYBDR43 (ORCPT ); Mon, 4 Feb 2008 12:56:29 -0500 Received: from mail-relay-01.mailcluster.net ([77.221.130.213]:38979 "EHLO mail-relay-01.mailcluster.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751943AbYBDR42 (ORCPT ); Mon, 4 Feb 2008 12:56:28 -0500 Message-ID: <47A751C5.60600@vlnb.net> Date: Mon, 04 Feb 2008 20:56:21 +0300 From: Vladislav Bolkhovitin User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7.13) Gecko/20060501 Fedora/1.7.13-1.1.fc5 X-Accept-Language: en-us, ru, en MIME-Version: 1.0 To: James Bottomley CC: Bart Van Assche , Linus Torvalds , Andrew Morton , FUJITA Tomonori , linux-scsi@vger.kernel.org, scst-devel@lists.sourceforge.net, linux-kernel@vger.kernel.org Subject: Re: Integration of SCST in the mainstream Linux kernel References: <1201639331.3069.58.camel@localhost.localdomain> <47A05CBD.5050803@vlnb.net> <47A7049A.9000105@vlnb.net> <1202139015.3096.5.camel@localhost.localdomain> <47A73C86.3060604@vlnb.net> <1202144767.3096.38.camel@localhost.localdomain> <47A7488B.4080000@vlnb.net> <1202145901.3096.49.camel@localhost.localdomain> In-Reply-To: <1202145901.3096.49.camel@localhost.localdomain> Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3249 Lines: 81 James Bottomley wrote: > On Mon, 2008-02-04 at 20:16 +0300, Vladislav Bolkhovitin wrote: > >>James Bottomley wrote: >> >>>>>>So, James, what is your opinion on the above? Or the overall SCSI target >>>>>>project simplicity doesn't matter much for you and you think it's fine >>>>>>to duplicate Linux page cache in the user space to keep the in-kernel >>>>>>part of the project as small as possible? >>>>> >>>>> >>>>>The answers were pretty much contained here >>>>> >>>>>http://marc.info/?l=linux-scsi&m=120164008302435 >>>>> >>>>>and here: >>>>> >>>>>http://marc.info/?l=linux-scsi&m=120171067107293 >>>>> >>>>>Weren't they? >>>> >>>>No, sorry, it doesn't look so for me. They are about performance, but >>>>I'm asking about the overall project's architecture, namely about one >>>>part of it: simplicity. Particularly, what do you think about >>>>duplicating Linux page cache in the user space to have zero-copy cached >>>>I/O? Or can you suggest another architectural solution for that problem >>>>in the STGT's approach? >>> >>> >>>Isn't that an advantage of a user space solution? It simply uses the >>>backing store of whatever device supplies the data. That means it takes >>>advantage of the existing mechanisms for caching. >> >>No, please reread this thread, especially this message: >>http://marc.info/?l=linux-kernel&m=120169189504361&w=2. This is one of >>the advantages of the kernel space implementation. The user space >>implementation has to have data copied between the cache and user space >>buffer, but the kernel space one can use pages in the cache directly, >>without extra copy. > > > Well, you've said it thrice (the bellman cried) but that doesn't make it > true. > > The way a user space solution should work is to schedule mmapped I/O > from the backing store and then send this mmapped region off for target > I/O. For reads, the page gather will ensure that the pages are up to > date from the backing store to the cache before sending the I/O out. > For writes, You actually have to do a msync on the region to get the > data secured to the backing store. James, have you checked how fast is mmaped I/O if work size > size of RAM? It's several times slower comparing to buffered I/O. It was many times discussed in LKML and, seems, VM people consider it unavoidable. So, using mmaped IO isn't an option for high performance. Plus, mmaped IO isn't an option for high reliability requirements, since it doesn't provide a practical way to handle I/O errors. > You also have to pull tricks with > the mmap region in the case of writes to prevent useless data being read > in from the backing store. Can you be more exact and specify what kind of tricks should be done for that? > However, none of this involves data copies. > > James > > > - > To unsubscribe from this list: send the line "unsubscribe linux-scsi" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/