Received: by 2002:a05:6a10:c604:0:0:0:0 with SMTP id y4csp3603199pxt; Tue, 10 Aug 2021 07:17:55 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyAmk7//vMyz20DxssB9xG5OsGsZB+lB2sKMo/0WdmacKL6CNZ1+PaFO28k8Zn265BtH8cx X-Received: by 2002:a05:6402:384:: with SMTP id o4mr5314831edv.128.1628605075305; Tue, 10 Aug 2021 07:17:55 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1628605075; cv=none; d=google.com; s=arc-20160816; b=lp40oAOjM4J3RNIhE4ce36qcJv/p+RimR7h104Vucv7d9pCOW1v0l+fB7bkrNFLCwS DLvjYfLb5y9txaJqQpgTU81YkEMMIcbTVF8IXzhbY8cEea2zc96ollVC5bFi1k7U5NKS 3/+kVteVS1Ao96mQftX/qh50WAfUMjIkqTYGiHUj5Gi/C3LbF9yHBm7yzDB6Cysw+Nv6 xDdSVqOmOIu9Wojbw4/iletQmImgfim+rY4EFl+Cjq/tU4naGFEq85YiHOJTEk9dtXgI HVV07q7CFPG8mmvvXOOzpEF+J3UrV6aJTxjO0XXYYcKFBDmHniduWwaMDYAkwJJzfOYY yHtQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:mime-version:user-agent:date :message-id:references:cc:to:from:subject:dkim-signature; bh=r9IjBPEzxcfDJVsDvrgC3HYSbv+lX0Do66rsfrnzleo=; b=wYC6pKY6PSa2ZlYG+8J4ApWnlDv+/9cLAKC6qPNN7XpeyuX2EfKx72uwtAxpeIF0YA 8iZPdCDPsuo1LeiC16RgrNkQMp+pHH8AVYaTG/4iqFhScRbiyJuzjysVTY6UBoDdk45F fgtxzikZ3ODL8nu2KaKEnUOxaIs43Bsp54T5l37LxiaEnPUDp7ZOo21h50SQJmZOZEB3 wwUFTdnNg8NOW5iIKeZYcQO9lKULQsYYqVBWrnEHZj2BRsD+k0ld2DBiD9rafx297rd5 SFgbpABrBgEeM3UvEWqjInGSwlolITeFnpmQDknsJMLq1oO0WXBC8AowZYuHS8lnx5f8 IQVw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@rothenpieler.org header.s=mail header.b=DTPFC3X8; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=rothenpieler.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id q1si20378505ejr.43.2021.08.10.07.17.28; Tue, 10 Aug 2021 07:17:54 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@rothenpieler.org header.s=mail header.b=DTPFC3X8; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=rothenpieler.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S241042AbhHJM7f (ORCPT + 99 others); Tue, 10 Aug 2021 08:59:35 -0400 Received: from btbn.de ([136.243.74.85]:49572 "EHLO btbn.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S239188AbhHJM7e (ORCPT ); Tue, 10 Aug 2021 08:59:34 -0400 X-Greylist: delayed 556 seconds by postgrey-1.27 at vger.kernel.org; Tue, 10 Aug 2021 08:59:33 EDT Received: from [authenticated] by btbn.de (Postfix) with ESMTPSA id 8C9A42ABB31; Tue, 10 Aug 2021 14:49:52 +0200 (CEST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=rothenpieler.org; s=mail; t=1628599792; bh=r9IjBPEzxcfDJVsDvrgC3HYSbv+lX0Do66rsfrnzleo=; h=Subject:From:To:Cc:References:Date:In-Reply-To; b=DTPFC3X8MKvEytTmEenkXSv1B/HglYkjGj6faA1QFM0pIrRhWglpU2nU/dWj8FTMW carPj7SJbvdsxlf9NzlGNrrJ8+plAjdRN7EbpSv6jx6Xh1kxaqYbfUz2hOzIENrrO1 VxPeCOwNHykx4h1etDSmuU83DCYtCmjnlmq5Gb6S7iI1mYOx4wpAezR4Hqi+WD6uYU eb9pdZhArmn1paGEqNAsybVGBP69R5big1Xln6aqNubI2Ra9l0rzCdEk52ABkdwlrx +s05K3yub+ElJ5rbRw8Sh9ezdBz/hWItVGejvN2MqhfSaUmtELEFXhja13aIxKdO+9 noGwchJ6NL5rw== Subject: Re: Spurious instability with NFSoRDMA under moderate load From: Timo Rothenpieler To: Chuck Lever III Cc: Linux NFS Mailing List , linux-rdma References: <4da3b074-a6be-d83f-ccd4-b151557066aa@rothenpieler.org> <72ECF9E1-1F6E-44AF-850C-536BED898DDD@oracle.com> Message-ID: <9355de20-921c-69e0-e5a4-733b64e125e1@rothenpieler.org> Date: Tue, 10 Aug 2021 14:49:52 +0200 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.12.0 MIME-Version: 1.0 In-Reply-To: Content-Type: multipart/signed; protocol="application/pkcs7-signature"; micalg=sha-256; boundary="------------ms020502070309020302060706" Precedence: bulk List-ID: X-Mailing-List: linux-nfs@vger.kernel.org This is a cryptographically signed message in MIME format. --------------ms020502070309020302060706 Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: quoted-printable On 21.06.2021 18:06, Timo Rothenpieler wrote: > On 17.05.2021 19:37, Timo Rothenpieler wrote: >> On 17.05.2021 18:27, Chuck Lever III wrote: >>> Meanwhile, you could try 5.11 or 5.12 on your NFS server to see if >>> the problem persists. >> >> I updated the NFS server to 5.12.4 now and will observe and maybe try = >> to cause some mixed load. >> >=20 > Had this running on 5.12 for a month now, and haven't observed any kind= =20 > of instability so far. > So I'd carefully assume that something in after 5.10, that made it into= =20 > 5.11 or 5.12 fixed or greatly improved the situation. >=20 > Can't really sensibly bisect this sadly, given the extremely long and=20 > uncertain turnaround times. >=20 Ok, so this just happened again for the first time since upgrading to 5.1= 2. Exact same thing, except this time no error cqe was dumped=20 simultaneously (It still appeared in dmesg, but over a week before the=20 issue showed up). So I'll assume it's unrelated to this issue. I had no issues while running 5.12.12 and below. Recently (14 days ago=20 or so) updated to 5.12.19, and now it's happening again. Unfortunately, with how rarely this issue happens, this can either be a=20 regression between those two versions, or it was still present all along = and just never triggered for several months. Makes me wonder if this is somehow related to the problem described in=20 "NFS server regression in kernel 5.13 (tested w/ 5.13.9)". But the=20 pattern of the issue does not look all that similar, given that for me,=20 the hangs never recover, and I have RDMA in the mix. --------------ms020502070309020302060706 Content-Type: application/pkcs7-signature; name="smime.p7s" Content-Transfer-Encoding: base64 Content-Disposition: attachment; filename="smime.p7s" Content-Description: S/MIME Cryptographic Signature MIAGCSqGSIb3DQEHAqCAMIACAQExDzANBglghkgBZQMEAgEFADCABgkqhkiG9w0BBwEAAKCC DVkwggXkMIIDzKADAgECAhAI/yx7V5dPIG8WuMetnzcsMA0GCSqGSIb3DQEBCwUAMIGBMQsw CQYDVQQGEwJJVDEQMA4GA1UECAwHQmVyZ2FtbzEZMBcGA1UEBwwQUG9udGUgU2FuIFBpZXRy bzEXMBUGA1UECgwOQWN0YWxpcyBTLnAuQS4xLDAqBgNVBAMMI0FjdGFsaXMgQ2xpZW50IEF1 dGhlbnRpY2F0aW9uIENBIEczMB4XDTIxMDIxNDE5MTM0N1oXDTIyMDIxNDE5MTM0N1owIDEe MBwGA1UEAwwVdGltb0Byb3RoZW5waWVsZXIub3JnMIIBIjANBgkqhkiG9w0BAQEFAAOCAQ8A MIIBCgKCAQEA0WP2SBuRIpVw5O7QPakKoJjg7B4UNAKTyky1XMsievLNGnR4Nxe6kKU+1oW0 oF5FqMVH9NkT9zhWYJzr5sNwJMKb9t5k8kYC7GXzOM9PxVx3bkLF5bWZrbfelUUwcdiyEYoh d29C+PxiNLHvmayWb3NtxpWiax9A4x7dRhhtqB/0BkPix+ZsIFn8vxpCvIChE2YlQWK3i8UX uBtqm26zBl3BIjj+bpd+7ePVt60vRx/R3LFHtF6kL/gQvgRcm8CFc8Nj3dCUeR2lfG+DzoTY ED6yAi838kRh5JHbqIl/Fo9YRwOYUaq2TFT/fGue87d7duLbckX1aVot+OqE0aeV2QIDAQAB o4IBtjCCAbIwDAYDVR0TAQH/BAIwADAfBgNVHSMEGDAWgBS+l6mqhL+AvxBTfQky+eEuMhvP dzB+BggrBgEFBQcBAQRyMHAwOwYIKwYBBQUHMAKGL2h0dHA6Ly9jYWNlcnQuYWN0YWxpcy5p dC9jZXJ0cy9hY3RhbGlzLWF1dGNsaWczMDEGCCsGAQUFBzABhiVodHRwOi8vb2NzcDA5LmFj dGFsaXMuaXQvVkEvQVVUSENMLUczMCAGA1UdEQQZMBeBFXRpbW9Acm90aGVucGllbGVyLm9y ZzBHBgNVHSAEQDA+MDwGBiuBHwEYATAyMDAGCCsGAQUFBwIBFiRodHRwczovL3d3dy5hY3Rh bGlzLml0L2FyZWEtZG93bmxvYWQwHQYDVR0lBBYwFAYIKwYBBQUHAwIGCCsGAQUFBwMEMEgG A1UdHwRBMD8wPaA7oDmGN2h0dHA6Ly9jcmwwOS5hY3RhbGlzLml0L1JlcG9zaXRvcnkvQVVU SENMLUczL2dldExhc3RDUkwwHQYDVR0OBBYEFK/aNb0BTZd0BqHgSJnmTftGSlabMA4GA1Ud DwEB/wQEAwIFoDANBgkqhkiG9w0BAQsFAAOCAgEAT3W2bBaISi7Utg/WA3U+bBhiouolnROR AB0vW4m3igjMcWx5GrPb8CSWNcq0/+BG+bhj6s+q7D1E9h1HO9CZUCfD7ujXj/VT/h7oMAqX w3Tf6H92bvHmZCvZmb2HKEnAAa4URjeZyNI1uwsMirF/gC5zYX5pm2ydVGxGYusWq8VRZzgc m1a0f3SPtX2dmmqjCzfINsQPs3N7BQo6FO/PfCbCzt22e+9Zm0Lra0Wt2URFTYCKSTjsK2xC SkysTfVIrBZCOb83oTMsgYE9dBmK7Tmob/HzHKs0NUOu4TfEpCgFgoXozMqTLFQac7aW26YK O8ClFDaauyOC71A+kjrth/gkUNEK+Cd3W52hK2FWvxbG/8LQLDMYviZFKxv/LAHU0fb6omva R4dzu9Sagi1z5uI5KHs5SR85lH4Up0dYs+I2xyFb8wZVYa+VuvsJ4W/pL2OaMm0tez+aNprg XURytCSPfAlz3JQdEYIiKPlJrz7O6eL2j7RwxMcKFLQl117mhImjdauIjaaS60w92P7v+F7+ 7INJ8g0PFN2vHVCB9e1g4iSYIgiydDLcbs73Jp1yVp97plWZI9oirxvH1/vI05FUJ3gw9qg2 WfbttAr0AEakAUo3Dv8jB7aQor/5fu8NMOvWjFV7P7GTAgrwil8u6fXa8ae/kWzG/850vgqq GM0wggdtMIIFVaADAgECAhAXED7ePYoctcoGUZPnykNrMA0GCSqGSIb3DQEBCwUAMGsxCzAJ BgNVBAYTAklUMQ4wDAYDVQQHDAVNaWxhbjEjMCEGA1UECgwaQWN0YWxpcyBTLnAuQS4vMDMz NTg1MjA5NjcxJzAlBgNVBAMMHkFjdGFsaXMgQXV0aGVudGljYXRpb24gUm9vdCBDQTAeFw0y MDA3MDYwODQ1NDdaFw0zMDA5MjIxMTIyMDJaMIGBMQswCQYDVQQGEwJJVDEQMA4GA1UECAwH QmVyZ2FtbzEZMBcGA1UEBwwQUG9udGUgU2FuIFBpZXRybzEXMBUGA1UECgwOQWN0YWxpcyBT LnAuQS4xLDAqBgNVBAMMI0FjdGFsaXMgQ2xpZW50IEF1dGhlbnRpY2F0aW9uIENBIEczMIIC IjANBgkqhkiG9w0BAQEFAAOCAg8AMIICCgKCAgEA7eaHlqHBpLbtwkJV9z8PDyJgXxPgpkOI hkmReRwbLxpQD9xGAe72ujqGzFFh78QPgAhxKVqtGHzYeq0VJVCzhnCKRBbVX+JwIhL3ULYh UAZrViUp952qDB6qTL5sGeJS9F69VPSR5k6pFNw7mHDTTt0voWFg2aVkG3khomzVXoieJGOi Q4dH76paCtQbLkt59joAKz2BnwGLQ4wr09nfumJt5AKx2YxHK2XgSPslVZ4z8G00gimsfA7U tjT/wiekY6Z0b7ksLrEcvODncHQe9VSrNRA149SE3AlkWaZM/joVei/GYfj9K5jkiReinR4m qM353FEceLOeBhSTURpMdQ5wsXLi9DSTGBuNv4aw2Dozb/qBlkhGTvwk92mi0jAecE22Sn3A 9UfrU2p1w/uRs+TIteQ0xO0B/J2mY2caqocsS9SsriIGlQ8b0LT0o6Ob07KGtPa5/lIvMmx5 72Dv2v+vDiECByxm1Hdgjp8JtE4mdyYP6GBscJyT71NZw1zXHnFkyCbxReag9qaSR9x4CVVX j1BDmNROCqd5NAfIXUXYTFeZ/jukQigkxXGWhEhfLBC4Ha6pwizz9fq1+wwPKcWaF9P/SZOu BDrG30MiyCZa66G9mEtF5ZLuh4rGfKqxy4Z5Mxecuzt+MZmrSKfKGeXOeED/iuX5Z02M1o7i MS8CAwEAAaOCAfQwggHwMA8GA1UdEwEB/wQFMAMBAf8wHwYDVR0jBBgwFoAUUtiIOsifeGbt ifN7OHCUyQICNtAwQQYIKwYBBQUHAQEENTAzMDEGCCsGAQUFBzABhiVodHRwOi8vb2NzcDA1 LmFjdGFsaXMuaXQvVkEvQVVUSC1ST09UMEUGA1UdIAQ+MDwwOgYEVR0gADAyMDAGCCsGAQUF BwIBFiRodHRwczovL3d3dy5hY3RhbGlzLml0L2FyZWEtZG93bmxvYWQwHQYDVR0lBBYwFAYI KwYBBQUHAwIGCCsGAQUFBwMEMIHjBgNVHR8EgdswgdgwgZaggZOggZCGgY1sZGFwOi8vbGRh cDA1LmFjdGFsaXMuaXQvY24lM2RBY3RhbGlzJTIwQXV0aGVudGljYXRpb24lMjBSb290JTIw Q0EsbyUzZEFjdGFsaXMlMjBTLnAuQS4lMmYwMzM1ODUyMDk2NyxjJTNkSVQ/Y2VydGlmaWNh dGVSZXZvY2F0aW9uTGlzdDtiaW5hcnkwPaA7oDmGN2h0dHA6Ly9jcmwwNS5hY3RhbGlzLml0 L1JlcG9zaXRvcnkvQVVUSC1ST09UL2dldExhc3RDUkwwHQYDVR0OBBYEFL6XqaqEv4C/EFN9 CTL54S4yG893MA4GA1UdDwEB/wQEAwIBBjANBgkqhkiG9w0BAQsFAAOCAgEAJpvnG1kNdLMS A+nnVfeEgIXNQsM7YRxXx6bmEt9IIrFlH1qYKeNw4NV8xtop91Rle168wghmYeCTP10FqfuK MZsleNkI8/b3PBkZLIKOl9p2Dmz2Gc0I3WvcMbAgd/IuBtx998PJX/bBb5dMZuGV2drNmxfz 3ar6ytGYLxedfjKCD55Yv8CQcN6e9sW5OUm9TJ3kjt7Wdvd1hcw5s+7bhlND38rWFJBuzump 5xqm1NSOggOkFSlKnhSz6HUjgwBaid6Ypig9L1/TLrkmtEIpx+wpIj7WTA9JqcMMyLJ0rN6j jpetLSGUDk3NCOpQntSy4a8+0O+SepzS/Tec1cGdSN6Ni2/A7ewQNd1Rbmb2SM2qVBlfN0e6 ZklWo9QYpNZyf0d/d3upsKabE9eNCg1S4eDnp8sJqdlaQQ7hI/UYCAgDtLIm7/J9+/S2zuwE WtJMPcvaYIBczdjwF9uW+8NJ/Zu/JKb98971uua7OsJexPFRBzX7/PnJ2/NXcTdwudShJc/p d9c3IRU7qw+RxRKchIczv3zEuQJMHkSSM8KM8TbOzi/0v0lU6SSyS9bpGdZZxx19Hd8Qs0cv +R6nyt7ohttizwefkYzQ6GzwIwM9gSjH5Bf/r9Kc5/JqqpKKUGicxAGy2zKYEGB0Qo761Mcc IyclBW9mfuNFDbTBeDEyu80xggPzMIID7wIBATCBljCBgTELMAkGA1UEBhMCSVQxEDAOBgNV BAgMB0JlcmdhbW8xGTAXBgNVBAcMEFBvbnRlIFNhbiBQaWV0cm8xFzAVBgNVBAoMDkFjdGFs aXMgUy5wLkEuMSwwKgYDVQQDDCNBY3RhbGlzIENsaWVudCBBdXRoZW50aWNhdGlvbiBDQSBH MwIQCP8se1eXTyBvFrjHrZ83LDANBglghkgBZQMEAgEFAKCCAi0wGAYJKoZIhvcNAQkDMQsG CSqGSIb3DQEHATAcBgkqhkiG9w0BCQUxDxcNMjEwODEwMTI0OTUyWjAvBgkqhkiG9w0BCQQx IgQgeI7oFBxMXYWAQeZBREUlZdAVA8vG+TEV3YHDX/i45ZAwbAYJKoZIhvcNAQkPMV8wXTAL BglghkgBZQMEASowCwYJYIZIAWUDBAECMAoGCCqGSIb3DQMHMA4GCCqGSIb3DQMCAgIAgDAN BggqhkiG9w0DAgIBQDAHBgUrDgMCBzANBggqhkiG9w0DAgIBKDCBpwYJKwYBBAGCNxAEMYGZ MIGWMIGBMQswCQYDVQQGEwJJVDEQMA4GA1UECAwHQmVyZ2FtbzEZMBcGA1UEBwwQUG9udGUg U2FuIFBpZXRybzEXMBUGA1UECgwOQWN0YWxpcyBTLnAuQS4xLDAqBgNVBAMMI0FjdGFsaXMg Q2xpZW50IEF1dGhlbnRpY2F0aW9uIENBIEczAhAI/yx7V5dPIG8WuMetnzcsMIGpBgsqhkiG 9w0BCRACCzGBmaCBljCBgTELMAkGA1UEBhMCSVQxEDAOBgNVBAgMB0JlcmdhbW8xGTAXBgNV BAcMEFBvbnRlIFNhbiBQaWV0cm8xFzAVBgNVBAoMDkFjdGFsaXMgUy5wLkEuMSwwKgYDVQQD DCNBY3RhbGlzIENsaWVudCBBdXRoZW50aWNhdGlvbiBDQSBHMwIQCP8se1eXTyBvFrjHrZ83 LDANBgkqhkiG9w0BAQEFAASCAQBUA53QqEMQqVhEvMI4tLG5oY+IV9jRjsmyEXnq0DY1z+YR VLH/FKCzXvAnQKeZFbBVK5xB1s22YT+zXgYOPXRvSUvYJR0AC8iAdto3wId2cJVFcdS0Ghb4 RPijAbejDJHkkDqa9cqVtTrnrR/8IfgfckM3CG/8zPokUerCkh+ceefvT6bAjrSoKCm6opm0 Kzx/hzOczkTmX93e3k/JUg+G+VVJORHLZHsn0umKRFtcvzJrlwD4POn5wnnbZ/uk4OHNb85d fzSn2YiWQJGsRNXrmx7I4z0UgszHeYovhagB5UvVobujtwB7IDR0p4PYN+PQZXuFEIm+K0N2 OdsS7ca6AAAAAAAA --------------ms020502070309020302060706--