Received: by 2002:a25:683:0:0:0:0:0 with SMTP id 125csp2080313ybg; Fri, 5 Jun 2020 05:14:18 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwcw9EcYJIo7II3iGGGfzZ5vK5xA+Taba+y9Hqg+yznxZ0ph6SxtVbCr3jt/Mhkh3/lfmdu X-Received: by 2002:a05:6402:148d:: with SMTP id e13mr9181534edv.200.1591359258694; Fri, 05 Jun 2020 05:14:18 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1591359258; cv=none; d=google.com; s=arc-20160816; b=FlBaRm+Z6L4k64zylfk9adTwBsdnwmDgz/P3zej72UvCP1K5GNq6dXT7cn4hh13FHs LuYAAWMZyvED+DmygPYb8ibpb6Pnx15ScBkx+CAsg3RWzdFFFDpLs2ozmLVFjExepq/w GIt7o2cyAZxfvhVDXLpnEkmchzPjtMUD5MzS8I7+s9KgxSC34F8RlRx2gfU/I0HUz9v2 2XxZTKiDuVNKrZaJGyGlq68+R9VulU4kIBWSeP7QRKu2x9AOD6ZNXgZbJwjHvFDolf/0 pKlCc36pDvAPcI5BesA4ZC0GpwZxqME1NtjiF8R1cd1R8dJlyJOPkIEgB+2NSTzhRcTu wTMA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=q9LDtbiaQZ+VL+z6Alw8qXxYLBWdTOJCB0WyEsqG+L4=; b=owm8qLET6NLaCTA1CZbS3vfNCOzJ04/VkMuZVLJmWnc3v9HGGQzBGG+rezRzBLMbO7 qHfyPJw5+llUMVxWegWUhhn8KiMxnrqfTXUjXoDxj7Gekp8HhU20HyQnTqAeUpM3vT/U l2gZ2qX1MBShlY1Ufy2amrglwOLiTv0Zkhm1WNPWn6tgl4pPnt855jKzIoYKb+oum6HR Am+Y7U7SBnUhzy2UUncutAqyr26OGIvYswmLjXyBNXYiAxRgTa2TdpMRxqxNByaKsTgG vELLzOs3h99iGjtDRx7AL/Wt8Jh1vmpRbR0r4660OqvilTmUmxYGVUnFlhSTJK9/IyJP WNxA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id u5si3282494ejz.120.2020.06.05.05.13.37; Fri, 05 Jun 2020 05:14:18 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726404AbgFEMNd (ORCPT + 99 others); Fri, 5 Jun 2020 08:13:33 -0400 Received: from p3plsmtpa07-09.prod.phx3.secureserver.net ([173.201.192.238]:50472 "EHLO p3plsmtpa07-09.prod.phx3.secureserver.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726054AbgFEMNd (ORCPT ); Fri, 5 Jun 2020 08:13:33 -0400 Received: from [192.168.0.78] ([24.218.182.144]) by :SMTPAUTH: with ESMTPSA id hB6zjqRZbZQ3PhB70j6vOo; Fri, 05 Jun 2020 05:06:15 -0700 X-CMAE-Analysis: v=2.3 cv=L7RjvNb8 c=1 sm=1 tr=0 a=ugQcCzLIhEHbLaAUV45L0A==:117 a=ugQcCzLIhEHbLaAUV45L0A==:17 a=IkcTkHD0fZMA:10 a=MaduGeJzO1XSokWPyYwA:9 a=P73MxSf4DJ6CZ9bq:21 a=1wd4hXGIlYM9ApGK:21 a=QEXdDO2ut3YA:10 X-SECURESERVER-ACCT: tom@talpey.com Subject: Re: once again problems with interrupted slots To: Olga Kornievskaia , Trond Myklebust Cc: linux-nfs References: From: Tom Talpey Message-ID: <13bed646-39b7-197e-ff90-85f8af10d93c@talpey.com> Date: Fri, 5 Jun 2020 08:06:14 -0400 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:68.0) Gecko/20100101 Thunderbird/68.8.1 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-CMAE-Envelope: MS4wfHt5cTD92nhwIex1okLssT+P2hRvfxnl64OkojOd5tTSBe9sbCVKOahl3H5xl5nQToK1+0v2OpvsIDNxrOpdKiZ8vPYrrY7ylMf6SJSvcJN5uE3nUmRK U8Jrv/Uh6uHlWg+YpmUsUkCT2GzE/JLxzhDVr69mA5hFI+kBcj+dGaN/AKT3v7v2I1HOvNsR0FRON6xdTsUG2fBFYiZx38xQQNShvM8xrxYABZtaRikmP++m mR6ldB4MvhfxBKbbfuRSkg== Sender: linux-nfs-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-nfs@vger.kernel.org On 6/4/2020 5:21 PM, Olga Kornievskaia wrote: > Hi Trond, > > There is a problem with interrupted slots (yet again). > > We send an operation to the server and it gets interrupted by the a signal. > > We used to send a sole SEQUENCE to remove the problem of having real > operation get an out of the cache reply and failing. Now we are not > doing it again (since 3453d5708 NFSv4.1: Avoid false retries when RPC > calls are interrupted"). So the problem is > > We bump the sequence on the next use of the slot, and get SEQ_MISORDERED. Misordered? It sounds like the client isn't managing the sequence number, or perhaps the server never saw the original request, and is being overly strict. > We decrement the number back to the interrupted operation. This gets > us a reply out of the cache. We again fail with REMOTE EIO error. Ew. The client *decrements* the sequence? Tom. > Going back to the commit's message. I don't see the logic that the > server can't tell if this is a new call or the old one. We used to > send a lone SEQUENCE as a way to protect reuse of slot by a normal > operation. An interrupted slot couldn't have been another SEQUENCE. So > I don't see how the server can't tell a difference between SEQUENCE > and any other operations. > >