Date: Fri, 26 Jul 2013 09:01:38 +0200
From: Dominique Martinet
To: Eric Van Hensbergen
Cc: Latchesar Ionkov, pebolle@tiscali.nl, netdev@vger.kernel.org,
    linux-kernel, andi@etezian.org, rminnich@sandia.gov,
    V9FS Developers, David Miller
Subject: Re: [V9fs-developer] [PATCH] net: trans_rdma: remove unused function
Message-ID: <20130726070138.GA28421@nautica>
In-Reply-To: <20130725190506.GA32375@nautica>

I think I need to stop sending mails before triple-checking things!
So sorry for the multiple mails again.

Dominique Martinet wrote on Thu, Jul 25, 2013:
> [rdma_cancelled]
> There is one problem though - if the server handles the original request
> before getting the flush, the receive buffer will be consumed and we
> won't send a new one, so we'll starve the reception queue.
> I'm afraid I don't have any bright idea there...

This still looks correct to me.

> While we are on reception buffer issues, there is another problem with
> the queue of receive buffers, even without flush, in the following
> scenario:
> - post a buffer for tag 0, on a hanging request
> - post a buffer for tag 1
> - reply for tag 1 will come on buffer from tag 0
> - post another request with tag 1.. the buffer already is in the queue,
> and we don't know we can post the buffer associated with tag 0 back.

It actually looks like the reply buffers are swapped properly: they are
taken out of the req struct into the context on send, then given back to
the appropriate req on reception. So in normal operation there is no
problem with what I described. Sorry for crying wolf.

> I haven't found how to reproduce this perfectly yet, but a dd with
> blocksize 1MB and one with blocksize 10B in parallel brought the
> mountpoint down (and the whole server was completely unavailable for the
> duration of the dd - TCP sessions timed out, I even got IO errors on the
> local disk :D)

I need to run more tests to explain what happens with the two dds, but
it is easily reproducible with debugging enabled, which I guess helps
expose a race somewhere.

Regards,
-- 
Dominique Martinet