Received: by 2002:a05:6a10:8c0a:0:0:0:0 with SMTP id go10csp685478pxb; Wed, 24 Feb 2021 12:11:12 -0800 (PST) X-Google-Smtp-Source: ABdhPJx4qAoIbSEEoShTJFqXeMLrmsKJ9CWtId8hSRLhaIX2UQRQD7zIzXvso+B/WH9yJGYGmLDv X-Received: by 2002:aa7:d381:: with SMTP id x1mr1101058edq.227.1614197472548; Wed, 24 Feb 2021 12:11:12 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1614197472; cv=none; d=google.com; s=arc-20160816; b=JqmTWc45+rnv0lNlYsQwMzwXDvT38IPauCdLfyoP/ZRXwZIRxc91RfVBibbhjWqFzc NsVv01Zr+j1YnA9fdVRKBnPG+8hCle8D1CKks9VbeExpTK9Pm3wCxwMrTfu/bldDDr/g RjNBOubXx2MuBc1uSTYLkJuDMIVW6SLjQLyg+r2DaPk0WJz66KYVlMN3cWCDPHSJiMZ8 slzWtgb0KHrTZ741tKL9LSbk3JXtXku0Mz2YRssCy4vU+Gi9pX0W7N69n7a89RbU9B4E n5c5C1FoHAKByEDxd2yIwJXgUNFG8gCInlwSciEFFtaJdjpfZR8tmgzmA3DHYkJP1BAt yy4w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :dkim-signature:dkim-filter; bh=yIgcTKyxFzWkd8wGIb7MZWb3mhrfjklWinB06ykAfRI=; b=v6z52xPQVGAacO7WlmUciA0IkL74oLjN1bBUQjDtuWr9YZ0oC7rp4vpKw1rYNlVx3K lmLc8bvU6HCh7Nb6aNrl0LwfF5tcUicCrk4U0TSfmdOsckHGFqKt5lmkxapySTNfgTEh RxwiYbP4wKta7bn9AR0YdnT/helXEGFs51QYH0r//hfEB+hfvQQiyof2wqwdHezr61C8 qckKuR8FsNoFv6T5hojCles/rdGIS8Fpy1LrJGgBQ9Rv24oewVqfk/qs1WmwyRWgXl2w J/LTDKIWA1qvBQC6R/0s6H4bepdxeip9qpsIWT3bOazxaZ3AB1eqtj2jUFQ+wvMxzZLn aO4g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@fieldses.org header.s=default header.b=vpcJbMNM; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id c11si1904005ejz.595.2021.02.24.12.10.47; Wed, 24 Feb 2021 12:11:12 -0800 (PST) Received-SPF: pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@fieldses.org header.s=default header.b=vpcJbMNM; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235376AbhBXUJu (ORCPT + 99 others); Wed, 24 Feb 2021 15:09:50 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:43898 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S235278AbhBXUJa (ORCPT ); Wed, 24 Feb 2021 15:09:30 -0500 Received: from fieldses.org (fieldses.org [IPv6:2600:3c00:e000:2f7::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 37B93C06174A for ; Wed, 24 Feb 2021 12:08:50 -0800 (PST) Received: by fieldses.org (Postfix, from userid 2815) id 9EB392501; Wed, 24 Feb 2021 15:08:49 -0500 (EST) DKIM-Filter: OpenDKIM Filter v2.11.0 fieldses.org 9EB392501 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=fieldses.org; s=default; t=1614197329; bh=yIgcTKyxFzWkd8wGIb7MZWb3mhrfjklWinB06ykAfRI=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=vpcJbMNMO13VYvClEj4VDGXoGUElWEqjayeaJJKk6r9LI4OHcOyHV1WXkdnP3Tkd0 XLEf3Lz6mD2suSAGuDt7BmAXKNpm3MjgZLfKssqrUClcQIvWnBszJhX9G8wMpo0C84 FPuCHJlsR7SpUcTdlfX0dMGsUu2UH3LcCbsV8vQM= Date: Wed, 24 Feb 2021 15:08:49 -0500 From: Bruce Fields To: Chuck Lever Cc: Timo Rothenpieler , Linux NFS Mailing List , Olga Kornievskaia , Dai Ngo Subject: Re: [PATCH] svcrdma: disable timeouts on rdma backchannel Message-ID: <20210224200849.GE11591@fieldses.org> References: <20210222233619.21568-1-timo@rothenpieler.org> <20210224200231.GD11591@fieldses.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.21 (2010-09-15) Precedence: bulk List-ID: X-Mailing-List: linux-nfs@vger.kernel.org On Wed, Feb 24, 2021 at 08:03:54PM +0000, Chuck Lever wrote: > > > > On Feb 24, 2021, at 3:02 PM, J. Bruce Fields wrote: > > > > On Wed, Feb 24, 2021 at 02:18:18PM +0000, Chuck Lever wrote: > >> > >> > >>> On Feb 22, 2021, at 6:36 PM, Timo Rothenpieler wrote: > >>> > >>> This brings it in line with the regular tcp backchannel, which also has > >>> all those timeouts disabled. > >>> > >>> Prevents the backchannel from timing out, getting some async operations > >>> like server side copying getting stuck indefinitely on the client side. > >>> > >>> Signed-off-by: Timo Rothenpieler > >> > >> Thanks for your patch! I've included it in the for-rc branch at > >> > >> git://git.kernel.org/pub/scm/linux/kernel/git/cel/linux.git > > > > So, I'm sure this patch makes sense. > > > > But I'm also curious why it's not recovering. > > Agreed. This patch is not a substitute for proper callback channel recovery. > > > > What I think should happen: > > > > - clp->cl_cb_state should be set to NFSD4_CB_DOWN. > > I think it's set to FAULT. OK. The result should be similar in that case, but SEQUENCE gets the SEQ4_STATUS_BACKCHANNEL_FAULT flag set instead. --b. > > > > - This should cause the next SEQUENCE reply to have > > SEQ4_STATUS_CB_PATH_DOWN set. > > - That should poke the client to recover. (Maybe by sending a > > BIND_CONN_TO_SESSION call?) > > > > I'd be curious whether any of that's actually happening. > > > > --b. > > > >> > >> > >>> --- > >>> Did the same testing with this applied than before, and could not > >>> observe it getting stuck, same as with the previous patch, which I > >>> removed before testing this one. > >>> > >>> This obviously still does not fix the issue of it being seemingly unable > >>> to reestablish the disconnected backchannel. > >>> An event that disconnects the backchannel but leaves the main connection > >>> intact seems a pretty rare occurance though, outside of this issue. > >>> > >>> net/sunrpc/xprtrdma/svc_rdma_backchannel.c | 6 +++--- > >>> 1 file changed, 3 insertions(+), 3 deletions(-) > >>> > >>> diff --git a/net/sunrpc/xprtrdma/svc_rdma_backchannel.c b/net/sunrpc/xprtrdma/svc_rdma_backchannel.c > >>> index 63f8be974df2..8186ab6f99f1 100644 > >>> --- a/net/sunrpc/xprtrdma/svc_rdma_backchannel.c > >>> +++ b/net/sunrpc/xprtrdma/svc_rdma_backchannel.c > >>> @@ -252,9 +252,9 @@ xprt_setup_rdma_bc(struct xprt_create *args) > >>> xprt->timeout = &xprt_rdma_bc_timeout; > >>> xprt_set_bound(xprt); > >>> xprt_set_connected(xprt); > >>> - xprt->bind_timeout = RPCRDMA_BIND_TO; > >>> - xprt->reestablish_timeout = RPCRDMA_INIT_REEST_TO; > >>> - xprt->idle_timeout = RPCRDMA_IDLE_DISC_TO; > >>> + xprt->bind_timeout = 0; > >>> + xprt->reestablish_timeout = 0; > >>> + xprt->idle_timeout = 0; > >>> > >>> xprt->prot = XPRT_TRANSPORT_BC_RDMA; > >>> xprt->ops = &xprt_rdma_bc_procs; > >>> -- > >>> 2.25.1 > >>> > >> > >> -- > >> Chuck Lever > > -- > Chuck Lever > >