Subject: [PATCH v1 00/11] NFS/RDMA client side connection overhaul
From: Chuck Lever
To: linux-rdma@vger.kernel.org, linux-nfs@vger.kernel.org
Date: Fri, 21 Feb 2020 17:00:06 -0500
Message-ID: <20200221214906.2072.32572.stgit@manet.1015granger.net>

Howdy.

I've had reports (and personal experience) where the Linux NFS/RDMA
client waits for a very long time after a disruption of the network
or NFS server.

There is a disconnect time wait in the Connection Manager which
blocks the RPC/RDMA transport from tearing down a connection for a
few minutes when the remote cannot respond to DREQ messages.

An RPC/RDMA transport has only one slot for connection state, so the
transport is prevented from establishing a fresh connection until
the time wait completes.

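To make the single-slot constraint concrete, here is a toy model in C.
It is only an illustration under stated assumptions, not the xprtrdma
code: every name in it (toy_xprt, toy_conn, toy_xprt_connect and so on)
is hypothetical, and in the real client the time wait is driven by the
Connection Manager rather than by an explicit call.

```c
#include <errno.h>
#include <stddef.h>

/* Hypothetical connection states; not the kernel's definitions. */
enum toy_conn_state { TOY_CONNECTED, TOY_TIME_WAIT };

struct toy_conn {
	enum toy_conn_state state;
};

/* The transport owns exactly one slot for connection state. */
struct toy_xprt {
	struct toy_conn *conn;	/* NULL when the slot is free */
};

/* A reconnect can proceed only when the single slot is free. */
static int toy_xprt_connect(struct toy_xprt *xprt, struct toy_conn *fresh)
{
	if (xprt->conn)
		return -EBUSY;	/* old connection still occupies the slot */
	fresh->state = TOY_CONNECTED;
	xprt->conn = fresh;
	return 0;
}

/* The remote never answers the DREQ, so the connection lingers. */
static void toy_xprt_disconnect(struct toy_xprt *xprt)
{
	if (xprt->conn)
		xprt->conn->state = TOY_TIME_WAIT;
}

/* Only when the time wait expires is the slot released. */
static void toy_time_wait_done(struct toy_xprt *xprt)
{
	xprt->conn = NULL;
}
```

In this model, a call to toy_xprt_connect() made between
toy_xprt_disconnect() and toy_time_wait_done() fails with -EBUSY, which
mirrors the stall described above.
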
This patch series refactors the connection end point data structures
to enable one active and multiple zombie connections. Now, while a
defunct connection is waiting to die, it is separated from the
transport, clearing the way for the immediate creation of a new
connection. Clean-up of the old connection's data structures and
resources then completes in the background.

Well, that's the idea, anyway. Review and comments welcome. Hoping
this can be merged in v5.7.

---

Chuck Lever (11):
      xprtrdma: Invoke rpcrdma_ep_create() in the connect worker
      xprtrdma: Refactor frwr_init_mr()
      xprtrdma: Clean up the post_send path
      xprtrdma: Refactor rpcrdma_ep_connect() and rpcrdma_ep_disconnect()
      xprtrdma: Allocate Protection Domain in rpcrdma_ep_create()
      xprtrdma: Invoke rpcrdma_ia_open in the connect worker
      xprtrdma: Remove rpcrdma_ia::ri_flags
      xprtrdma: Disconnect on flushed completion
      xprtrdma: Merge struct rpcrdma_ia into struct rpcrdma_ep
      xprtrdma: Extract sockaddr from struct rdma_cm_id
      xprtrdma: kmalloc rpcrdma_ep separate from rpcrdma_xprt

 include/trace/events/rpcrdma.h    |   97 ++---
 net/sunrpc/xprtrdma/backchannel.c |    8
 net/sunrpc/xprtrdma/frwr_ops.c    |  152 ++++----
 net/sunrpc/xprtrdma/rpc_rdma.c    |   32 +-
 net/sunrpc/xprtrdma/transport.c   |   72 +---
 net/sunrpc/xprtrdma/verbs.c       |  681 ++++++++++++++-----------------------
 net/sunrpc/xprtrdma/xprt_rdma.h   |   89 ++---
 7 files changed, 445 insertions(+), 686 deletions(-)

--
Chuck Lever