Received: by 2002:a05:6a10:a0d1:0:0:0:0 with SMTP id j17csp2129722pxa; Mon, 3 Aug 2020 08:11:50 -0700 (PDT) X-Google-Smtp-Source: ABdhPJw+zd5GCxmkc+e6E6A6HVfvFZGf3pigu/e5XZKsqxYn7dqFfSsNSvRRmICKjnlGB1UQ6166 X-Received: by 2002:a17:906:c259:: with SMTP id bl25mr16896968ejb.303.1596467509863; Mon, 03 Aug 2020 08:11:49 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1596467509; cv=none; d=google.com; s=arc-20160816; b=BfdyQ6Ulc4nWJ1yyayLWe7vIi24KHtJHO7WkQmM0h02D3ttyVFE+VB0FnVywD2yWKb dvtOjI2A9ZyTRsMJ4ijsqnaDfQRkBCfOrRTvUdMxDFn6NoxZhPJIr909pDwZEjSGyAh+ JdG9d71srBA7Xvua3JPnmTBuaTHcn7mSOXCCwcTQWRBIotKO5lW9w9Zmof4MdfkbmukC iRBvjXd06iOmKlORVK1aOfxDSkJCl5/F/Tx37ECr7jqPvywwEy3eCFqY0Zcu1AOY10Le 3GdNvNh4YBSvYqtpNL3QIHgb963qiCaEK4XIJHKIqJR0QA+NuCE01LqZaTretozIjw2I KX3g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:mime-version:user-agent:date:message-id:subject :from:to; bh=PkzGjiruXI6UTScjW2vmoKKUyveVXzuPMrRY6afO9Cc=; b=yPgRG7MHnJJQt2VV6sMe2fRIdonHcZ/H5phSHQrE2wZT9T/yN5QR3wC3qUcmJWJlT2 LIgHhMjI49zNCTRsFz1FP6isTUwarDUXbQuM/cVXCuEN+UWH5GzuX74Up6ntY2BfhhGM YV4IGLJZNZS+W2LSAdpm1NRnh7C9/FJp7+CHQt9g/uWIziu6/V+5m4nCXTgF/Woakra/ pxmjtwT6SS7/hzX1Fjs8jj5+IZDCZW1E3WxzolLB4i5D3A4wnZjnKiV0cDMF6siDueQq 7zM3KVccic/DipMDHed3gkww8KSyuV911cpn1LocJB0Ab5etF+C3fVa/Oz2t/zJUjPdZ WS/Q== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=rothenpieler.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id i23si3788572edx.50.2020.08.03.08.11.13; Mon, 03 Aug 2020 08:11:49 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=rothenpieler.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726276AbgHCPLM (ORCPT + 99 others); Mon, 3 Aug 2020 11:11:12 -0400 Received: from btbn.de ([5.9.118.179]:53806 "EHLO btbn.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725945AbgHCPLM (ORCPT ); Mon, 3 Aug 2020 11:11:12 -0400 X-Greylist: delayed 318 seconds by postgrey-1.27 at vger.kernel.org; Mon, 03 Aug 2020 11:11:11 EDT Received: from [IPv6:2001:16b8:6478:5500:44fa:818:1e31:9c59] (200116b86478550044fa08181e319c59.dip.versatel-1u1.de [IPv6:2001:16b8:6478:5500:44fa:818:1e31:9c59]) by btbn.de (Postfix) with ESMTPSA id 26B97200435 for ; Mon, 3 Aug 2020 17:05:53 +0200 (CEST) To: linux-nfs@vger.kernel.org From: Timo Rothenpieler Subject: NFS over RDMA issues on Linux 5.4 Message-ID: <8a1087d3-9add-dfe1-da0c-edab74fcca51@rothenpieler.org> Date: Mon, 3 Aug 2020 17:05:52 +0200 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101 Thunderbird/68.11.0 MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-nfs-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-nfs@vger.kernel.org Hello, I have just deployed a new system with Mellanox ConnectX-4 VPI EDR IB cards and wanted to setup NFS over RDMA on it. However, while mounting the FS over RDMA works fine, actually using it results in the following messages absolutely hammering dmesg on both client and server: > https://gist.github.com/BtbN/9582e597b6581f552fa15982b0285b80#file-server-log The spam only stops once I forcibly reboot the client. The filesystem gets nowhere during all this. The retrans counter in nfsstat just keeps going up, nothing actually gets done. This is on Linux 5.4.54, using nfs-utils 2.4.3. The mlx5 driver had enhanced-mode disabled in order to enable IPoIB connected mode with an MTU of 65520. Normal NFS 4.2 over tcp works perfectly fine on this setup, it's only when I mount via rdma that things go wrong. Is this an issue on my end, or did I run into a bug somewhere here? Any pointers, patches and solutions to test are welcome. Thanks, Timo Rothenpieler