Received: by 2002:a05:6a10:a0d1:0:0:0:0 with SMTP id j17csp444803pxa; Tue, 4 Aug 2020 09:11:28 -0700 (PDT) X-Google-Smtp-Source: ABdhPJz++si1IBNBK1LT4PNmVFxVhPZ000GxVNmrgzhEBMGxywz3RF8m/w0U5Zv+G/TKPCAoEblZ X-Received: by 2002:a17:906:528d:: with SMTP id c13mr22919379ejm.61.1596557488701; Tue, 04 Aug 2020 09:11:28 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1596557488; cv=none; d=google.com; s=arc-20160816; b=LGh00dbEv8A1d7CHWD+pDzBIJcpSDrYT8a4GRPqYAKMvVydJZWzU+iExZ8U2ds/12K MvCU4i9gdMi7ue7RifNzIGYw08zY/72pasK7TYkY8cGLM28S+hjl5DFlYD7CAEpN0eUB qVCxkMoBxDsSAY1JT5bDUKAYpx0ugc8bKZzZG7Zgt2rnCy1jKLaRekEoaVWx4lsuQDY8 /bm1lfhhvkWz7zw0CD5616utuVg6UlCjXmPzOZCp5xjgrce9tEnHlRLzonao7r1BWVlp tcz24Ug9pnx+ZSxD882FAXVCpZPDTt95hJFAAaqFLHsZRGjOJ2Qm6dqQQvtG99y131xs oIQA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:references:cc:to:from:subject; bh=cXhafq5AIIcdpi4eQB/AIESBiSzfJEjv1L6g+tXutHM=; b=GpQ9xakzQ7n6ut8VQcEZZJWUAs2ROtYXijq+4DSB1wZ6lj3nUh50wz6q69E2e2KiB1 8zjjt9sG41FWou7LRTJnbeIaCD0Pa+lZcZoiM6FkZyEORHkvl2w7lRFydGoiYbmv3tZB m5TaYa6p8jtlNUgevi05PX7LM2uS6S2daqGNSNZP7hR7XnvhxK3ljnmxALA8XqFniay9 gE7Hpgxfkyqpw3Dw9beSU9DuxZAupRw2STv3Jwh6xEvEzJ82t0WGnJtXOxPQZjcfdRS/ RsWIeMN05QP58yTbLIpSCJN+wlIdhP3t05DH94KVbFOwv6LEedeXTfnJYh/TJDJpG5Uk w4kQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=rothenpieler.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id dg1si2528390edb.503.2020.08.04.09.10.59; Tue, 04 Aug 2020 09:11:28 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=rothenpieler.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729345AbgHDQHw (ORCPT + 99 others); Tue, 4 Aug 2020 12:07:52 -0400 Received: from btbn.de ([5.9.118.179]:40978 "EHLO btbn.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1729022AbgHDQHs (ORCPT ); Tue, 4 Aug 2020 12:07:48 -0400 Received: from [IPv6:2001:16b8:64d7:4500:fc3b:cfd2:151e:7636] (200116b864d74500fc3bcfd2151e7636.dip.versatel-1u1.de [IPv6:2001:16b8:64d7:4500:fc3b:cfd2:151e:7636]) by btbn.de (Postfix) with ESMTPSA id 717434D3BB; Tue, 4 Aug 2020 18:07:37 +0200 (CEST) Subject: Re: NFS over RDMA issues on Linux 5.4 From: Timo Rothenpieler To: Chuck Lever Cc: Leon Romanovsky , Linux NFS Mailing List , linux-rdma References: <8a1087d3-9add-dfe1-da0c-edab74fcca51@rothenpieler.org> <20200804093635.GA4432@unreal> <92a5a932-b843-eed3-555e-7557ccc1f308@rothenpieler.org> <20200804122557.GB4432@unreal> <20200804134642.GC4432@unreal> <45BA86D8-52A3-407E-83BE-27343C0182C5@oracle.com> <7c7418cb-7f7a-5de3-2025-7bde5cd5ac2a@rothenpieler.org> <4751E7F5-AAB1-4602-B926-9BB08E1D213D@oracle.com> <074fb120-1807-3d83-f34e-400e05cbce27@rothenpieler.org> Message-ID: <507cf7b9-4044-79d3-28ed-1bfff2c3e211@rothenpieler.org> Date: Tue, 4 Aug 2020 18:07:37 +0200 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101 Thunderbird/68.11.0 MIME-Version: 1.0 In-Reply-To: <074fb120-1807-3d83-f34e-400e05cbce27@rothenpieler.org> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-nfs-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-nfs@vger.kernel.org On 04.08.2020 17:50, Timo Rothenpieler wrote: > On 04.08.2020 17:46, Chuck Lever wrote: >> >> >>> On Aug 4, 2020, at 11:39 AM, Timo Rothenpieler >>> wrote: >>> >>> On 04.08.2020 17:34, Chuck Lever wrote: >>>> I see a LOC_LEN_ERR on a Receive. Leon, doesn't that mean the server's >>>> Send was too large? >>>> Timo, what filesystem are you sharing on your NFS server? The thing >>>> that >>>> comes to mind is https://bugzilla.kernel.org/show_bug.cgi?id=198053 >>> >>> The filesystem on the server is indeed a zfs-on-linux (version >>> 0.8.4), just as in that bug report. >>> >>> Should I try to apply the proposed fix you posted on that bug report >>> on the client (and server?). >> >> If you are hitting that bug, the server is the problem. The client >> should work fine once the server is fixed. (I'm not happy about >> the client's looping behavior either, but that will go away once >> the server behaves). >> >> I'm not hopeful that the fix applies cleanly to v4.19, but it >> might. Another option would be upgrading your NFS server. > > It's running on 5.4.54 and the patch applies with no fuzz whatsoever: > > I will deploy the patch to both server and client and report back. Reporting success. With the patch from that bug applied, no error spam is happening anymore. Plus, the filesystem actually works and definitely got a whole lot snappier than before. Which is not all that unexpected. Thank you so much for your help analyzing this and for the fix! I hope it can get applied to mainline soon and will reach 5.4 backports eventually. Until then, I will carry it as a local patch for the systems. Thanks again, Timo