Return-Path: Received: from aserp1040.oracle.com ([141.146.126.69]:41746 "EHLO aserp1040.oracle.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755217AbbICTeW (ORCPT ); Thu, 3 Sep 2015 15:34:22 -0400 Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Mac OS X Mail 7.3 \(1878.6\)) Subject: Re: async CLOSE creates a race with a DELEGRETURN followed by a REMOVE From: Chuck Lever In-Reply-To: Date: Thu, 3 Sep 2015 15:34:16 -0400 Cc: Linux NFS Mailing List Message-Id: References: To: Olga Kornievskaia Sender: linux-nfs-owner@vger.kernel.org List-ID: On Sep 3, 2015, at 3:26 PM, Olga Kornievskaia wrote: > When file is opened with O_DIRECT, sending asynchronous CLOSE creates > a race condition between CLOSE and DELEGRETURN followed REMOVE. Next > operations are sent before the reply for CLOSE comes back which is > according to the spec is not allowed (section 10.4.4). Server can get > a REMOVE before the CLOSE and it causes EACCESS error to be returned > because server thinks there is open state left. > > According to the spec: "...whenever a client chooses to return a > delegation voluntarily. The following items of state need to be dealt > with: > > o If the file associated with the delegation is no longer open and > no previous CLOSE operation has been sent to the server, a CLOSE > operation must be sent to the server." > > So i'm not sure if it will be argued that it doesn't say that a reply > must be received before sending the DELEGRETURN. However, if it's not > mandated then a race as I'm mentioning occurs. > > The following patch introduces making close async: > commit f895c53f8ace3c3e49ebf9def90e63fc6d46d2bf > Author: Chuck Lever > Date: Mon Feb 1 14:17:50 2010 -0500 > > NFS: Make close(2) asynchronous when closing NFS O_DIRECT files > > For NFSv2 and v3: > > O_DIRECT writes are always synchronous, and aren't cached, so nothing > should be flushed when closing an NFS O_DIRECT file descriptor. Thus > there are no write errors to report on close(2). > > In addition, there's no cached data to verify on the next open(2), > so we don't need clean GETATTR results at close time to compare with. > > Thus, there's no need for the nfs_revalidate_inode() call when closing > an NFS O_DIRECT file. This reduces the number of synchronous > on-the-wire requests for a simple open-write-close of an NFS O_DIRECT > file by roughly 20%. > > For NFSv4: > > Call nfs4_do_close() with wait set to zero when closing an NFS > O_DIRECT file. The CLOSE will go on the wire, but the application > won't wait for it to complete. > > I'd like to suggest to revert the use of this patch. DELEGRETURN can be made to wait for the CLOSE reply. Or, maybe the client should return an offered delegation immediately when a file is open O_DIRECT. I don't see much use in keeping the delegation if the application wants GETATTR, READ and WRITE always flushed to the server immediately. -- Chuck Lever