Return-Path: Received: from daytona.panasas.com ([67.152.220.89]:11390 "EHLO daytona.panasas.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752100Ab0LPH06 (ORCPT ); Thu, 16 Dec 2010 02:26:58 -0500 Message-ID: <4D09BF3F.3070209@panasas.com> Date: Thu, 16 Dec 2010 09:26:55 +0200 From: Benny Halevy To: Trond Myklebust CC: linux-nfs@vger.kernel.org Subject: Re: [PATCH 1/9] Revert "pnfs-submit: wave2: remove forgotten layoutreturn struct definitions" References: <4D0908F9.4060208@panasas.com> <1292437854-21651-1-git-send-email-bhalevy@panasas.com> <1292437973.3068.15.camel@heimdal.trondhjem.org> <4D090E18.4060205@panasas.com> <1292441468.3068.53.camel@heimdal.trondhjem.org> <1292444651.3068.67.camel@heimdal.trondhjem.org> In-Reply-To: <1292444651.3068.67.camel@heimdal.trondhjem.org> Content-Type: text/plain; charset=UTF-8 Sender: linux-nfs-owner@vger.kernel.org List-ID: MIME-Version: 1.0 On 2010-12-15 22:24, Trond Myklebust wrote: > On Wed, 2010-12-15 at 14:31 -0500, Trond Myklebust wrote: >> On Wed, 2010-12-15 at 20:51 +0200, Benny Halevy wrote: > >>> Eventually, when CB_LAYOUTRECALL is clear to go sending the LAYOUTRETURN >>> or replying with CB_NOMATCHING_LAYOUT (assuming no I/O error to report >>> for pnfs-obj) should be equivalent [note: need errata to clarify the >>> resulting stateid after NOMATCHING_LAYOUT]. >>> Is this the serialization "crap" you're talking about? >>> What makes checking the conditions for returning NFS4ERR_DELAY to >>> CB_LAYOUTRECALL so different from implementing a barrier and doing the >>> returns asynchronously with the CB_LAYOUTRECALL? >> >> "CB_LAYOUTRECALL request processing MUST be processed in "seqid" order >> at all times." (section 12.5.3). >> >> In other words, you cannot just 'do the returns asynchronously': the >> CB_LAYOUTRECALL requests are required by the protocol to be processed in >> order, which means that you must serialise those LAYOUTRETURN calls to >> ensure that they all happen in the order the wretched server expects. > > BTW: one consequence of the way the protocol was written is that you > can't just throw out a LAYOUTRETURN for the entire file if the server > just recalls a segment. Instead, you have to first return the segment, > then send the LAYOUTRETURN for the entire file. > It is true that the protocol requires the return of the exact recalled range but why can't the client do return the whole file before returning the recalled range? > That part of the protocol is just one insane idea after another... > This was done to ensure that the server and client are in-sync after a CB_LAYOUTRECALL. I agree that returning the whole layout thus resetting the layout state achieves the same goal and we should consider allowing it in the next version. Benny