Return-Path: Received: from mx1.redhat.com ([209.132.183.28]:29617 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932349Ab1DMRUa (ORCPT ); Wed, 13 Apr 2011 13:20:30 -0400 Date: Wed, 13 Apr 2011 13:20:34 -0400 From: Jeff Layton To: Trond Myklebust Cc: Badari Pulavarty , Chuck Lever , linux-nfs@vger.kernel.org, khoa@us.ibm.com Subject: Re: [RFC][PATCH] Vector read/write support for NFS (DIO) client Message-ID: <20110413132034.459c68bb@corrin.poochiereds.net> In-Reply-To: <1302704533.8571.12.camel@lade.trondhjem.org> References: <1302622335.3877.62.camel@badari-desktop> <0DC51758-AE6C-4DD2-A959-8C8E701FEA4E@oracle.com> <1302624935.3877.66.camel@badari-desktop> <1302630360.3877.72.camel@badari-desktop> <20110413083656.12e54a91@tlielax.poochiereds.net> <4DA5A899.3040202@us.ibm.com> <20110413100228.680ace66@tlielax.poochiereds.net> <1302704533.8571.12.camel@lade.trondhjem.org> Content-Type: text/plain; charset=US-ASCII Sender: linux-nfs-owner@vger.kernel.org List-ID: MIME-Version: 1.0 On Wed, 13 Apr 2011 10:22:13 -0400 Trond Myklebust wrote: > On Wed, 2011-04-13 at 10:02 -0400, Jeff Layton wrote: > > We could put the rpc_rqst's into a slabcache, and give each rpc_xprt a > > mempool with a minimum number of slots. Have them all be allocated with > > GFP_NOWAIT. If it gets a NULL pointer back, then the task can sleep on > > the waitqueue like it does today. Then, the clients can allocate > > rpc_rqst's as they need as long as memory holds out for it. > > > > We have the reserve_xprt stuff to handle congestion control anyway so I > > don't really see the value in the artificial limits that the slot table > > provides. > > > > Maybe I should hack up a patchset for this... > > This issue has come up several times recently. My preference would be to > tie the availability of slots to the TCP window size, and basically say > that if the SOCK_ASYNC_NOSPACE flag is set on the socket, then we hold > off allocating more slots until we get a ->write_space() callback which > clears that flag. > > For the RDMA case, we can continue to use the current system of a fixed > number of preallocated slots. > I take it then that we'd want a similar scheme for UDP as well? I guess I'm just not sure what the slot table is supposed to be for. Possibly naive question, and maybe you or Andy have scoped this out already... Wouldn't it make more sense to allow the code to allocate rpc_rqst's as needed, and manage congestion control in reserve_xprt ? It appears that that at least is what xprt_reserve_xprt_cong is supposed to do. The TCP variant (xprt_reserve_xprt) doesn't do that currently, but we could do it there and that would seem to make for more parity between the TCP and UDP in this sense. We could do that similarly for RDMA too. Simply keep track of how many RPCs are in flight and only allow reserving the xprt when that number hasn't crossed the max number of slots... -- Jeff Layton