Received: by 2002:a05:6a10:f347:0:0:0:0 with SMTP id d7csp13251577pxu; Sun, 3 Jan 2021 07:38:39 -0800 (PST) X-Google-Smtp-Source: ABdhPJxgKbTudTOvL/5tB1+PWYovfi+O1G/RlnOlVn8nAgv4+zcMekRJKsuI91IAmUN47R85W4xf X-Received: by 2002:a17:906:b1c8:: with SMTP id bv8mr64868690ejb.208.1609688318882; Sun, 03 Jan 2021 07:38:38 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1609688318; cv=none; d=google.com; s=arc-20160816; b=frGgvHhEi/dc3yaOhhWJu/jysE8YWfxlnWxNthk+0h73fGXUWiPXzr93gtMyJGT06+ IQlc0/NTjeqEpwpWMb2bcGa1M3E7XOh38TYXtbEqT5vtVcFlQBB8jZhjTGU5ZQxkMsAT 2po47WpnaDF+S1jJII2Zw+c3rTTtLNSrtqMbJcvZY9PaFmVWfZvlDUpxlRLE2aupXur2 Vxk7/bF/VTg7vW6vVQ4TzWyAAfhoMq7yn7QUAjXKNQnqs1lOsXlAOPucOnqIaOpRZ58Z bWc42FOc6GU0pmHdnxFEEhW2Y4Gi+i/wfmx097/+Z1oXYS+WxDp6oragGDGdDFv7jqtX Nlzg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:mime-version:user-agent:message-id:date:emacs :subject:to:from; bh=9DfKnmJ1RRg6qt84UvY3YQTnkZLefH28QA07uWofl7g=; b=VBZUDgNNZqbyfY3kbjGbQiVYSOre7Jb3mDoNwMx0DvWLSf+LI34X2QDz72BqmRZIcK urXvG7dNq9C0sKqyIxXQYZRWnuzoSLpJvowTxDEUjDgDfLcMQaKgvvIMfjFTP+vfI8BT VQlR++RXgoAJ0tFl1AClY8BVWjD7kOBMPrlUH4n1GSg5fUmkxzvsORjHAgb1eHx23Ty+ u7oQS2QB+XBb9iWV7Yai8GWxxthZ3EdPLjNS7h4M6pLccD/ztOfz3QiTp/hCI4ytBbrw okY1k3SudOgkDxjX6SOxOVHn5W5En8V4vJ8vtHnd3KMx621uGYZ7lR/TTAVLnPxznBKb MgZg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id me14si27453816ejb.75.2021.01.03.07.38.14; Sun, 03 Jan 2021 07:38:38 -0800 (PST) Received-SPF: pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726983AbhACPZt (ORCPT + 99 others); Sun, 3 Jan 2021 10:25:49 -0500 Received: from icebox.esperi.org.uk ([81.187.191.129]:42912 "EHLO mail.esperi.org.uk" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725819AbhACPZs (ORCPT ); Sun, 3 Jan 2021 10:25:48 -0500 X-Greylist: delayed 3435 seconds by postgrey-1.27 at vger.kernel.org; Sun, 03 Jan 2021 10:25:47 EST Received: from loom (nix@sidle.srvr.nix [192.168.14.8]) by mail.esperi.org.uk (8.16.1/8.16.1) with ESMTP id 103ERppE002002 for ; Sun, 3 Jan 2021 14:27:51 GMT From: Nick Alcock To: NFS list Subject: steam-associated reproducible hard NFSv4.2 client hang (5.9, 5.10) Emacs: well, why *shouldn't* you pay property taxes on your editor? Date: Sun, 03 Jan 2021 14:27:51 +0000 Message-ID: <877dourt7c.fsf@esperi.org.uk> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/26.3.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-DCC--Metrics: loom 1481; Body=1 Fuz1=1 Fuz2=1 Precedence: bulk List-ID: X-Mailing-List: linux-nfs@vger.kernel.org So I bought a game or two on Steam a fortnight ago to while away a Christmas spent stuck indoors with no family, decorations, or presents (stuck at my sister's and she got a perfectly-timed case of covid-19). It turned out this was a very bad idea! I keep my Steam directory on an ext4 fs on the local fileserver and access it via NFSv4.2: over 10GbE: it's actually faster than local spinning-rust storage would be (five-spindle RAID-6 and bcache and 128GiB RAM on the server for caching probably have something to do with that!) though likely slower than the NVMe the local machine actually has: but it doesn't have enough local storage, so I keep all the Steam stuff remotely. (I keep my $HOME on the same fileserver, though that's on xfs: I'd use xfs for Steam, too, but there are still a lot of games out there that fail on filesystems with 64-bit inode numbers.) Unfortunately, what I observe now when kicking up Steam is a spasm of downloading I/O for the game I just bought and then a complete client-side hang when it starts installing: just about all I can do is hard-reboot. This of course stops me using Steam for anything which didn't make me very happy :P The server hums onwards as though nothing went wrong, but the client is in a bad way indeed: not only has NFS hung but there are clearly some important locks taken out since even an ls on a non-NFS dir (with no NFS dirs in the PATH) hangs, as does a while :; ps -e; done loop. I can rebuild with lockdep and do a lock dump if that would be useful. (The kernel is not quite stock: it has DTrace built in, but the hangs happen even if the kernel module has never been loaded, and I'm pretty sure the hangs can't be associated with the not-very-substantial built-in DTrace changes. I can try rebuilding without that as well, but I'll have to rip chunks out of my monitoring machinery to do that so I've avoided doing it so far.) I finally got round to doing a bit of debugging today. NFS debugging dumps covering the last fifteen seconds or so before the hang and a couple of minutes afterwards (taken on the client, logged without incident to the server's syslog, as proof that not *everything* is stalled) are a few megabytes long so are at . Everything seems to go wrong after an especially large bunch of NFS commits. Both client (silk, 192.168.16.21) and server (loom, 192.168.16.8) are running 5.10.4 with the same NFS config, though the client has no NFS server exports at the time and the server isn't acting as an NFS client at all: CONFIG_NFS_FS=y # CONFIG_NFS_V2 is not set CONFIG_NFS_V3=y CONFIG_NFS_V3_ACL=y CONFIG_NFS_V4=y # CONFIG_NFS_SWAP is not set CONFIG_NFS_V4_1=y CONFIG_NFS_V4_2=y CONFIG_PNFS_FILE_LAYOUT=y CONFIG_PNFS_BLOCK=y CONFIG_PNFS_FLEXFILE_LAYOUT=m CONFIG_NFS_V4_1_IMPLEMENTATION_ID_DOMAIN="kernel.org" # CONFIG_NFS_V4_1_MIGRATION is not set # CONFIG_ROOT_NFS is not set # CONFIG_NFS_USE_LEGACY_DNS is not set CONFIG_NFS_USE_KERNEL_DNS=y CONFIG_NFS_DEBUG=y CONFIG_NFS_DISABLE_UDP_SUPPORT=y # CONFIG_NFS_V4_2_READ_PLUS is not set CONFIG_NFSD=y CONFIG_NFSD_V2_ACL=y CONFIG_NFSD_V3=y CONFIG_NFSD_V3_ACL=y CONFIG_NFSD_V4=y # CONFIG_NFSD_BLOCKLAYOUT is not set # CONFIG_NFSD_SCSILAYOUT is not set # CONFIG_NFSD_FLEXFILELAYOUT is not set CONFIG_NFSD_V4_2_INTER_SSC=y CONFIG_GRACE_PERIOD=y CONFIG_LOCKD=y CONFIG_LOCKD_V4=y CONFIG_NFS_ACL_SUPPORT=y CONFIG_NFS_COMMON=y CONFIG_SUNRPC=y CONFIG_SUNRPC_GSS=y CONFIG_SUNRPC_BACKCHANNEL=y CONFIG_SUNRPC_DEBUG=y The server (loom) has a large number of exports: many of the filesystems being exported are moved around via bind mounts and the like. The steam datadir is in /pkg/non-free/steam, which is bind-mounted onto /home/nix/.steam on the client: /home/nix itself is bind-mounted from /home/.loom.srvr.nix/steam, and is NFS-exported from the server under that pathname. Relevant exports, from /proc/fs/nfs/exports: / 192.168.16.0/24,silk.wkstn.nix(ro,insecure,no_root_squash,sync,no_wdelay,no_subtree_check,v4root,fsid=0,uuid=0a4a4563:00764033:8c827c0e:989cf534,sec=390003:390004:390005:1) /home/.loom.srvr.nix *.srvr.nix,fold.srvr.nix(rw,root_squash,sync,wdelay,no_subtree_check,uuid=0a4a4563:00764033:8c827c0e:989cf534,sec=1) /pkg/non-free/steam 192.168.16.0/24,silk.wkstn.nix(rw,root_squash,sync,wdelay,uuid=11b94016:cb6e4b64:beb6c134:9ea3d6b3,sec=1) Blocked tasks after the hang suggest that the client is hung waiting indefinitely for the server to reply, and I'd guess the server doesn't think it needs to. (I can do some server-side debugging dumps too if you think the problem might be there.) [ 322.843962] sysrq: Show Blocked State [ 322.844058] task:tee state:D stack: 0 pid: 1176 ppid: 1173 flags:0x00000000 [ 322.844064] Call Trace: [ 322.844074] __schedule+0x3db/0x7ee [ 322.844079] ? io_schedule+0x70/0x6d [ 322.844082] schedule+0x68/0xdc [ 322.844085] io_schedule+0x46/0x6d [ 322.844089] bit_wait_io+0x11/0x52 [ 322.844092] __wait_on_bit+0x31/0x90 [ 322.844096] out_of_line_wait_on_bit+0x7e/0x80 [ 322.844100] ? var_wake_function+0x30/0x27 [ 322.844104] nfs_wait_on_request.part.0+0x31/0x33 [ 322.844107] nfs_page_group_lock_head+0x31/0x78 [ 322.844110] nfs_lock_and_join_requests+0x9d/0x26e [ 322.844114] nfs_updatepage+0x1be/0x985 [ 322.844118] nfs_write_end+0x128/0x52a [ 322.844122] ? iov_iter_copy_from_user_atomic+0xe8/0x3ce [ 322.844127] generic_perform_write+0xed/0x17c [ 322.844131] nfs_file_write+0x14a/0x29f [ 322.844135] new_sync_write+0xfb/0x16b [ 322.844139] vfs_write+0x1ee/0x29a [ 322.844142] ? vfs_read+0x17b/0x198 [ 322.844145] ksys_write+0x61/0xd0 [ 322.844148] __x64_sys_write+0x1a/0x1c [ 322.844151] do_syscall_64+0x32/0x45 [ 322.844154] entry_SYSCALL_64_after_hwframe+0x44/0x0 [ 322.844158] RIP: 0033:0x7f4781f323a3 [ 322.844160] RSP: 002b:00007ffcf8220408 EFLAGS: 00000246 ORIG_RAX: 0000000000000001 [ 322.844164] RAX: ffffffffffffffda RBX: 0000000000000092 RCX: 00007f4781f323a3 [ 322.844166] RDX: 0000000000000092 RSI: 00007ffcf8220530 RDI: 0000000000000003 [ 322.844168] RBP: 00007ffcf8220530 R08: 0000000000000000 R09: 0000000000000001 [ 322.844170] R10: 0000000000400ad2 R11: 0000000000000246 R12: 0000000000000092 [ 322.844172] R13: 0000000002361580 R14: 0000000000000092 R15: 00007f4782201720 [ 322.844202] task:CIPCServer::Thr state:D stack: 0 pid: 1801 ppid: 1540 flags:0x20020000 [ 322.844207] Call Trace: [ 322.844212] __schedule+0x3db/0x7ee [ 322.844219] ? trace_event_raw_event_rpc_xdr_overflow+0x280/0x27e [ 322.844222] schedule+0x68/0xdc [ 322.844227] rpc_wait_bit_killable+0x11/0x52 [ 322.844230] __wait_on_bit+0x31/0x90 [ 322.844235] ? __rpc_atrun+0x20/0x1a [ 322.844238] ? __rpc_atrun+0x20/0x1a [ 322.844242] out_of_line_wait_on_bit+0x7e/0x80 [ 322.844246] ? var_wake_function+0x30/0x27 [ 322.844250] __rpc_execute+0xe5/0x485 [ 322.844254] ? sugov_get_util+0xf0/0xeb [ 322.844258] rpc_execute+0xa5/0xaa [ 322.844263] rpc_run_task+0x148/0x187 [ 322.844267] nfs4_do_call_sync+0x61/0x81 [ 322.844272] _nfs4_proc_getattr+0x10a/0x120 [ 322.844278] nfs4_proc_getattr+0x67/0xee [ 322.844283] __nfs_revalidate_inode+0xb7/0x2df [ 322.844287] nfs_access_get_cached+0x14d/0x1df [ 322.844291] nfs_do_access+0x48/0x28e [ 322.844296] ? step_into+0x2bc/0x6a1 [ 322.844301] nfs_permission+0xa2/0x1b8 [ 322.844305] inode_permission+0x96/0xca [ 322.844308] ? unlazy_walk+0x4c/0x80 [ 322.844312] link_path_walk.part.0+0x2a5/0x320 [ 322.844315] ? path_init+0x2c2/0x3e5 [ 322.844318] path_lookupat+0x3f/0x1b1 [ 322.844322] filename_lookup+0x97/0x17f [ 322.844326] ? kmem_cache_alloc+0x32/0x200 [ 322.844331] user_path_at_empty+0x59/0x8a [ 322.844333] do_faccessat+0x70/0x227 [ 322.844337] __ia32_sys_access+0x1c/0x1e [ 322.844340] __do_fast_syscall_32+0x5f/0x8e [ 322.844342] do_fast_syscall_32+0x3d/0x80 [ 322.844346] entry_SYSCALL_compat_after_hwframe+0x45/0x0 [ 322.844348] RIP: 0023:0xf7ee6549 [ 322.844350] RSP: 002b:00000000eb763b28 EFLAGS: 00200292 ORIG_RAX: 0000000000000021 [ 322.844353] RAX: ffffffffffffffda RBX: 000000005754d9e0 RCX: 0000000000000004 [ 322.844355] RDX: 00000000f7ab3e1c RSI: 000000005754d9e0 RDI: 00000000eb763bc8 [ 322.844357] RBP: 00000000f78a91c0 R08: 0000000000000000 R09: 0000000000000000 [ 322.844359] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000 [ 322.844360] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000 [ 322.844364] task:CFileWriterThre state:D stack: 0 pid: 1810 ppid: 1540 flags:0x20024000 [ 322.844367] Call Trace: [ 322.844371] __schedule+0x3db/0x7ee [ 322.844375] ? trace_event_raw_event_rpc_xdr_overflow+0x280/0x27e [ 322.844379] schedule+0x68/0xdc [ 322.844382] rpc_wait_bit_killable+0x11/0x52 [ 322.844385] __wait_on_bit+0x31/0x90 [ 322.844389] ? call_reserveresult+0xa0/0x9f [ 322.844391] ? call_transmit_status+0x160/0x159 [ 322.844395] out_of_line_wait_on_bit+0x7e/0x80 [ 322.844398] ? var_wake_function+0x30/0x27 [ 322.844401] __rpc_execute+0xe5/0x485 [ 322.844404] ? nfs_check_verifier+0x5b/0xa4 [ 322.844407] rpc_execute+0xa5/0xaa [ 322.844410] rpc_run_task+0x148/0x187 [ 322.844414] nfs4_do_call_sync+0x61/0x81 [ 322.844417] _nfs4_proc_getattr+0x10a/0x120 [ 322.844422] nfs4_proc_getattr+0x67/0xee [ 322.844426] __nfs_revalidate_inode+0xb7/0x2df [ 322.844430] nfs_getattr+0x159/0x374 [ 322.844433] vfs_getattr_nosec+0x96/0xa4 [ 322.844435] vfs_statx+0x7a/0xe5 [ 322.844438] vfs_fstatat+0x1f/0x21 [ 322.844443] __do_compat_sys_ia32_lstat64+0x2b/0x4e [ 322.844448] __ia32_compat_sys_ia32_lstat64+0x14/0x16 [ 322.844450] __do_fast_syscall_32+0x5f/0x8e [ 322.844453] do_fast_syscall_32+0x3d/0x80 [ 322.844457] entry_SYSCALL_compat_after_hwframe+0x45/0x0 [ 322.844459] RIP: 0023:0xf7ee6549 [ 322.844460] RSP: 002b:00000000eb561be8 EFLAGS: 00000292 ORIG_RAX: 00000000000000c4 [ 322.844463] RAX: ffffffffffffffda RBX: 00000000c521bf60 RCX: 00000000eb561e70 [ 322.844465] RDX: 00000000f7ab3e1c RSI: 00000000eb561e70 RDI: 00000000c521bf60 [ 322.844467] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000 [ 322.844468] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000 [ 322.844470] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000 [ 322.844477] task:Steam - DirWatc state:D stack: 0 pid: 1924 ppid: 1540 flags:0x20024000 [ 322.844480] Call Trace: [ 322.844484] __schedule+0x3db/0x7ee [ 322.844488] ? trace_event_raw_event_rpc_xdr_overflow+0x280/0x27e [ 322.844492] schedule+0x68/0xdc [ 322.844495] rpc_wait_bit_killable+0x11/0x52 [ 322.844498] __wait_on_bit+0x31/0x90 [ 322.844501] ? rpc_sleep_on_timeout+0x64/0x9c [ 322.844504] ? rpc_check_timeout+0x1e0/0x1e0 [ 322.844507] ? call_connect_status+0x180/0x17a [ 322.844511] out_of_line_wait_on_bit+0x7e/0x80 [ 322.844513] ? var_wake_function+0x30/0x27 [ 322.844516] __rpc_execute+0xe5/0x485 [ 322.844520] rpc_execute+0xa5/0xaa [ 322.844523] rpc_run_task+0x148/0x187 [ 322.844526] nfs4_do_call_sync+0x61/0x81 [ 322.844530] _nfs4_proc_getattr+0x10a/0x120 [ 322.844534] nfs4_proc_getattr+0x67/0xee [ 322.844538] __nfs_revalidate_inode+0xb7/0x2df [ 322.844542] nfs_lookup_verify_inode+0x67/0x82 [ 322.844545] nfs_do_lookup_revalidate+0x19b/0x29c [ 322.844548] nfs4_do_lookup_revalidate+0x5d/0xc5 [ 322.844552] nfs4_lookup_revalidate+0x6a/0x8d [ 322.844555] lookup_fast+0xb8/0x122 [ 322.844559] path_openat+0xfd/0xfd8 [ 322.844562] ? unlazy_walk+0x4c/0x80 [ 322.844565] ? terminate_walk+0x62/0xed [ 322.844569] ? putname+0x4b/0x53 [ 322.844572] do_filp_open+0x86/0x110 [ 322.844576] ? kmem_cache_alloc+0x32/0x200 [ 322.844579] ? __alloc_fd+0xb2/0x143 [ 322.844583] do_sys_openat2+0x8d/0x13a [ 322.844585] ? do_faccessat+0x108/0x227 [ 322.844588] __ia32_compat_sys_openat+0x48/0x63 [ 322.844591] __do_fast_syscall_32+0x5f/0x8e [ 322.844594] do_fast_syscall_32+0x3d/0x80 [ 322.844597] entry_SYSCALL_compat_after_hwframe+0x45/0x0 [ 322.844600] RIP: 0023:0xf7ee6549 [ 322.844601] RSP: 002b:000000008c8fbcb0 EFLAGS: 00200287 ORIG_RAX: 0000000000000127 [ 322.844604] RAX: ffffffffffffffda RBX: 00000000ffffff9c RCX: 000000008c8fbffc [ 322.844606] RDX: 0000000000098800 RSI: 0000000000000000 RDI: 00000000f7ab3e1c [ 322.844608] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000 [ 322.844609] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000 [ 322.844611] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000 [ 322.844614] task:CGenericAsyncFi state:D stack: 0 pid: 1927 ppid: 1540 flags:0x20020000 [ 322.844617] Call Trace: [ 322.844621] __schedule+0x3db/0x7ee [ 322.844625] schedule+0x68/0xdc [ 322.844628] io_schedule+0x46/0x6d [ 322.844631] wait_on_page_bit_common+0xf8/0x325 [ 322.844634] ? trace_raw_output_file_check_and_advance_wb_err+0x80/0x7d [ 322.844638] wait_on_page_bit+0x3f/0x41 [ 322.844642] wait_on_page_writeback.part.0+0x35/0x80 [ 322.844645] wait_on_page_writeback+0x27/0x29 [ 322.844647] __filemap_fdatawait_range+0x88/0xd9 [ 322.844651] filemap_write_and_wait_range+0x45/0x80 [ 322.844655] nfs_wb_all+0x30/0x13d [ 322.844658] nfs4_file_flush+0x85/0xc0 [ 322.844661] filp_close+0x3e/0x78 [ 322.844664] __close_fd+0x23/0x30 [ 322.844667] __ia32_sys_close+0x22/0x50 [ 322.844670] __do_fast_syscall_32+0x5f/0x8e [ 322.844673] do_fast_syscall_32+0x3d/0x80 [ 322.844676] entry_SYSCALL_compat_after_hwframe+0x45/0x0 [ 322.844678] RIP: 0023:0xf7ee6549 [ 322.844680] RSP: 002b:000000008c5fddb0 EFLAGS: 00200286 ORIG_RAX: 0000000000000006 [ 322.844682] RAX: ffffffffffffffda RBX: 000000000000004d RCX: 0000000000000002 [ 322.844684] RDX: 0000000000000000 RSI: 00000000f7d02e70 RDI: 000000000000004d [ 322.844686] RBP: 000000008c5fe110 R08: 0000000000000000 R09: 0000000000000000 [ 322.844688] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000 [ 322.844689] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000 [ 322.844699] task:ThreadPoolForeg state:D stack: 0 pid: 1768 ppid: 1735 flags:0x00004000 [ 322.844702] Call Trace: [ 322.844705] __schedule+0x3db/0x7ee [ 322.844709] ? trace_event_raw_event_rpc_xdr_overflow+0x280/0x27e [ 322.844712] schedule+0x68/0xdc [ 322.844716] rpc_wait_bit_killable+0x11/0x52 [ 322.844719] __wait_on_bit+0x31/0x90 [ 322.844722] ? rpc_sleep_on_timeout+0x64/0x9c [ 322.844725] ? rpc_check_timeout+0x1e0/0x1e0 [ 322.844728] ? call_connect_status+0x180/0x17a [ 322.844731] out_of_line_wait_on_bit+0x7e/0x80 [ 322.844734] ? var_wake_function+0x30/0x27 [ 322.844737] __rpc_execute+0xe5/0x485 [ 322.844740] ? nfs_check_verifier+0x5b/0xa4 [ 322.844743] rpc_execute+0xa5/0xaa [ 322.844746] rpc_run_task+0x148/0x187 [ 322.844750] nfs4_do_call_sync+0x61/0x81 [ 322.844753] _nfs4_proc_getattr+0x10a/0x120 [ 322.844758] nfs4_proc_getattr+0x67/0xee [ 322.844762] __nfs_revalidate_inode+0xb7/0x2df [ 322.844766] nfs_getattr+0x159/0x374 [ 322.844769] vfs_getattr_nosec+0x96/0xa4 [ 322.844771] vfs_statx+0x7a/0xe5 [ 322.844774] __do_sys_newstat+0x31/0x4c [ 322.844778] __x64_sys_newstat+0x16/0x18 [ 322.844781] do_syscall_64+0x32/0x45 [ 322.844783] entry_SYSCALL_64_after_hwframe+0x44/0x0 [ 322.844786] RIP: 0033:0x7fe70ebf19e6 [ 322.844787] RSP: 002b:00007fe6c15b2068 EFLAGS: 00000246 ORIG_RAX: 0000000000000004 [ 322.844790] RAX: ffffffffffffffda RBX: 00000bd4193e1910 RCX: 00007fe70ebf19e6 [ 322.844792] RDX: 00007fe6c15b2078 RSI: 00007fe6c15b2078 RDI: 00000bd4193e1910 [ 322.844794] RBP: 00007fe6c15b2160 R08: 0000000000000001 R09: 00007ffd971ed090 [ 322.844796] R10: 0000000000056764 R11: 0000000000000246 R12: 00007fe6c15b2078 [ 322.844797] R13: 00000bd4192f6f60 R14: 00007fe6c15b2108 R15: 00007fe6c15b2120 [ 322.844801] task:ThreadPoolForeg state:D stack: 0 pid: 1769 ppid: 1735 flags:0x00000000 [ 322.844804] Call Trace: [ 322.844807] __schedule+0x3db/0x7ee [ 322.844811] ? trace_event_raw_event_rpc_xdr_overflow+0x280/0x27e [ 322.844814] schedule+0x68/0xdc [ 322.844817] rpc_wait_bit_killable+0x11/0x52 [ 322.844820] __wait_on_bit+0x31/0x90 [ 322.844823] ? __rpc_atrun+0x20/0x1a [ 322.844826] ? __rpc_atrun+0x20/0x1a [ 322.844829] out_of_line_wait_on_bit+0x7e/0x80 [ 322.844831] ? var_wake_function+0x30/0x27 [ 322.844834] __rpc_execute+0xe5/0x485 [ 322.844837] ? update_cfs_group+0x9a/0x9e [ 322.844840] rpc_execute+0xa5/0xaa [ 322.844842] rpc_run_task+0x148/0x187 [ 322.844845] nfs4_do_call_sync+0x61/0x81 [ 322.844849] _nfs4_proc_getattr+0x10a/0x120 [ 322.844853] nfs4_proc_getattr+0x67/0xee [ 322.844856] __nfs_revalidate_inode+0xb7/0x2df [ 322.844859] nfs_file_write+0xfc/0x29f [ 322.844862] new_sync_write+0xfb/0x16b [ 322.844866] vfs_write+0x1ee/0x29a [ 322.844868] ? __fget_light+0x31/0x72 [ 322.844871] ksys_write+0x61/0xd0 [ 322.844874] __x64_sys_write+0x1a/0x1c [ 322.844877] do_syscall_64+0x32/0x45 [ 322.844880] entry_SYSCALL_64_after_hwframe+0x44/0x0 [ 322.844881] RIP: 0033:0x7fe71859ab7f [ 322.844883] RSP: 002b:00007fe6bd5b0d30 EFLAGS: 00000293 ORIG_RAX: 0000000000000001 [ 322.844885] RAX: ffffffffffffffda RBX: 00007fe6bd5b0d68 RCX: 00007fe71859ab7f [ 322.844887] RDX: 0000000000000007 RSI: 00007fe6bd5b1090 RDI: 000000000000005b [ 322.844888] RBP: 00007fe6bd5b0de0 R08: 0000000000000000 R09: 00007ffd971ed090 [ 322.844890] R10: 0000000000065fd2 R11: 0000000000000293 R12: 00000bd418d95020 [ 322.844891] R13: 00007fe6bd5b1090 R14: 0000000000000007 R15: 0000000000000000 [ 322.844900] task:ThreadPoolForeg state:D stack: 0 pid: 1807 ppid: 1773 flags:0x00000000 [ 322.844902] Call Trace: [ 322.844904] __schedule+0x3db/0x7ee [ 322.844907] ? trace_event_raw_event_rpc_xdr_overflow+0x280/0x27e [ 322.844908] schedule+0x68/0xdc [ 322.844911] rpc_wait_bit_killable+0x11/0x52 [ 322.844913] __wait_on_bit+0x31/0x90 [ 322.844915] ? __rpc_atrun+0x20/0x1a [ 322.844916] ? __rpc_atrun+0x20/0x1a [ 322.844918] out_of_line_wait_on_bit+0x7e/0x80 [ 322.844920] ? var_wake_function+0x30/0x27 [ 322.844922] __rpc_execute+0xe5/0x485 [ 322.844924] rpc_execute+0xa5/0xaa [ 322.844926] rpc_run_task+0x148/0x187 [ 322.844928] nfs4_do_call_sync+0x61/0x81 [ 322.844930] _nfs4_proc_getattr+0x10a/0x120 [ 322.844933] nfs4_proc_getattr+0x67/0xee [ 322.844936] __nfs_revalidate_inode+0xb7/0x2df [ 322.844938] nfs_access_get_cached+0x14d/0x1df [ 322.844940] nfs_do_access+0x48/0x28e [ 322.844942] ? nfs4_lookup_revalidate+0x38/0x8d [ 322.844944] nfs_permission+0xa2/0x1b8 [ 322.844946] inode_permission+0x96/0xca [ 322.844948] ? unlazy_walk+0x4c/0x80 [ 322.844950] link_path_walk.part.0+0x2a5/0x320 [ 322.844952] ? path_init+0x2c2/0x3e5 [ 322.844954] path_lookupat+0x3f/0x1b1 [ 322.844956] filename_lookup+0x97/0x17f [ 322.844959] ? do_unlk+0x9a/0xca [ 322.844960] ? kmem_cache_alloc+0x32/0x200 [ 322.844963] user_path_at_empty+0x59/0x8a [ 322.844965] vfs_statx+0x64/0xe5 [ 322.844967] __do_sys_newstat+0x31/0x4c [ 322.844968] ? do_fcntl+0x113/0x582 [ 322.844971] ? fput+0x13/0x15 [ 322.844972] ? __x64_sys_fcntl+0x81/0xa8 [ 322.844974] __x64_sys_newstat+0x16/0x18 [ 322.844976] do_syscall_64+0x32/0x45 [ 322.844977] entry_SYSCALL_64_after_hwframe+0x44/0x0 [ 322.844979] RIP: 0033:0x7f0d1fd0c9e6 [ 322.844980] RSP: 002b:00007f0d08e12e18 EFLAGS: 00000246 ORIG_RAX: 0000000000000004 [ 322.844981] RAX: ffffffffffffffda RBX: 00007f0d08e12edc RCX: 00007f0d1fd0c9e6 [ 322.844983] RDX: 00007f0d08e12e28 RSI: 00007f0d08e12e28 RDI: 00001605c2ff1169 [ 322.844984] RBP: 00007f0d08e12ec0 R08: 0000000000000001 R09: 00007fffaddd9090 [ 322.844985] R10: 00001605c3133480 R11: 0000000000000246 R12: 0000000000000000 [ 322.844986] R13: 0000000000000000 R14: 00001605c2ff0f00 R15: 0000000000000000 [ 322.844988] task:ThreadPoolForeg state:D stack: 0 pid: 2031 ppid: 1773 flags:0x00004000 [ 322.844990] Call Trace: [ 322.844992] __schedule+0x3db/0x7ee [ 322.844995] ? trace_event_raw_event_rpc_xdr_overflow+0x280/0x27e [ 322.844997] schedule+0x68/0xdc [ 322.844999] rpc_wait_bit_killable+0x11/0x52 [ 322.845001] __wait_on_bit+0x31/0x90 [ 322.845003] ? rpc_sleep_on_timeout+0x64/0x9c [ 322.845005] ? rpc_check_timeout+0x1e0/0x1e0 [ 322.845006] ? call_connect_status+0x180/0x17a [ 322.845009] out_of_line_wait_on_bit+0x7e/0x80 [ 322.845010] ? var_wake_function+0x30/0x27 [ 322.845012] __rpc_execute+0xe5/0x485 [ 322.845014] rpc_execute+0xa5/0xaa [ 322.845016] rpc_run_task+0x148/0x187 [ 322.845018] nfs4_do_call_sync+0x61/0x81 [ 322.845020] _nfs4_proc_getattr+0x10a/0x120 [ 322.845023] nfs4_proc_getattr+0x67/0xee [ 322.845026] __nfs_revalidate_inode+0xb7/0x2df [ 322.845028] nfs_access_get_cached+0x14d/0x1df [ 322.845030] nfs_do_access+0x48/0x28e [ 322.845032] ? nfs4_lookup_revalidate+0x38/0x8d [ 322.845034] nfs_permission+0xa2/0x1b8 [ 322.845036] inode_permission+0x96/0xca [ 322.845038] ? unlazy_walk+0x4c/0x80 [ 322.845040] link_path_walk.part.0+0x2a5/0x320 [ 322.845042] ? path_init+0x2c2/0x3e5 [ 322.845044] path_lookupat+0x3f/0x1b1 [ 322.845046] filename_lookup+0x97/0x17f [ 322.845049] ? kmem_cache_alloc+0x32/0x200 [ 322.845051] user_path_at_empty+0x59/0x8a [ 322.845054] ? mntput_no_expire+0x4f/0x264 [ 322.845055] vfs_statx+0x64/0xe5 [ 322.845057] ? putname+0x4b/0x53 [ 322.845059] __do_sys_newstat+0x31/0x4c [ 322.845062] ? __x64_sys_futex+0x13a/0x159 [ 322.845064] ? switch_fpu_return+0x58/0xf2 [ 322.845066] __x64_sys_newstat+0x16/0x18 [ 322.845068] do_syscall_64+0x32/0x45 [ 322.845069] entry_SYSCALL_64_after_hwframe+0x44/0x0 [ 322.845071] RIP: 0033:0x7f0d1fd0c9e6 [ 322.845072] RSP: 002b:00007f0d00331108 EFLAGS: 00000246 ORIG_RAX: 0000000000000004 [ 322.845073] RAX: ffffffffffffffda RBX: 00001605c2fdc0a0 RCX: 00007f0d1fd0c9e6 [ 322.845074] RDX: 00007f0d00331118 RSI: 00007f0d00331118 RDI: 00001605c2fdc0a0 [ 322.845076] RBP: 00007f0d00331200 R08: 0000000000000001 R09: 00007fffaddd9090 [ 322.845077] R10: 000000000005771c R11: 0000000000000246 R12: 00007f0d00331118 [ 322.845078] R13: 00001605c3309c40 R14: 00007f0d003311a8 R15: 00007f0d003311c0 [ 322.845087] task:192.168.16.8-ma state:D stack: 0 pid: 2044 ppid: 2 flags:0x00004000 [ 322.845089] Call Trace: [ 322.845091] __schedule+0x3db/0x7ee [ 322.845094] ? trace_event_raw_event_rpc_xdr_overflow+0x280/0x27e [ 322.845095] schedule+0x68/0xdc [ 322.845098] rpc_wait_bit_killable+0x11/0x52 [ 322.845100] __wait_on_bit+0x31/0x90 [ 322.845102] ? __rpc_atrun+0x20/0x1a [ 322.845103] ? __rpc_atrun+0x20/0x1a [ 322.845105] out_of_line_wait_on_bit+0x7e/0x80 [ 322.845107] ? var_wake_function+0x30/0x27 [ 322.845109] __rpc_execute+0xe5/0x485 [ 322.845111] rpc_execute+0xa5/0xaa [ 322.845113] rpc_run_task+0x148/0x187 [ 322.845115] nfs4_do_call_sync+0x61/0x81 [ 322.845117] _nfs4_proc_statfs+0xc1/0xcf [ 322.845120] nfs4_proc_statfs+0x55/0x74 [ 322.845122] nfs_statfs+0x65/0x152 [ 322.845124] ? _nfs4_proc_delegreturn+0x21e/0x2e5 [ 322.845127] statfs_by_dentry+0x55/0x74 [ 322.845129] vfs_statfs+0x24/0x43 [ 322.845132] check_free_space+0x3c/0xe5 [ 322.845135] do_acct_process+0x5d/0x5e1 [ 322.845136] ? nfs_end_delegation_return+0x1cb/0x3aa [ 322.845138] ? complete+0x49/0x57 [ 322.845141] acct_process+0xe5/0x105 [ 322.845143] do_exit+0x781/0xa1b [ 322.845145] ? refcount_dec_and_lock+0x22/0x8c [ 322.845148] __module_put_and_exit+0x1b/0x1b [ 322.845150] nfs4_run_state_manager+0x293/0x933 [ 322.845152] ? nfs4_do_reclaim+0x820/0x81c [ 322.845155] kthread+0x142/0x15f [ 322.845157] ? __kthread_create_worker+0x130/0x122 [ 322.845159] ret_from_fork+0x1f/0x2a [ 369.859385] INFO: task tee:1176 blocked for more than 122 seconds. [ 369.859391] Not tainted 5.10.4-00021-gde017294a539-dirty #3 [ 369.859393] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 369.859395] task:tee state:D stack: 0 pid: 1176 ppid: 1173 flags:0x00000000 [ 369.859399] Call Trace: [ 369.859407] __schedule+0x3db/0x7ee [ 369.859410] ? io_schedule+0x70/0x6d [ 369.859412] schedule+0x68/0xdc [ 369.859415] io_schedule+0x46/0x6d [ 369.859417] bit_wait_io+0x11/0x52 [ 369.859420] __wait_on_bit+0x31/0x90 [ 369.859422] out_of_line_wait_on_bit+0x7e/0x80 [ 369.859426] ? var_wake_function+0x30/0x27 [ 369.859429] nfs_wait_on_request.part.0+0x31/0x33 [ 369.859431] nfs_page_group_lock_head+0x31/0x78 [ 369.859433] nfs_lock_and_join_requests+0x9d/0x26e [ 369.859436] nfs_updatepage+0x1be/0x985 [ 369.859439] nfs_write_end+0x128/0x52a [ 369.859443] ? iov_iter_copy_from_user_atomic+0xe8/0x3ce [ 369.859446] generic_perform_write+0xed/0x17c [ 369.859449] nfs_file_write+0x14a/0x29f [ 369.859452] new_sync_write+0xfb/0x16b [ 369.859455] vfs_write+0x1ee/0x29a [ 369.859457] ? vfs_read+0x17b/0x198 [ 369.859459] ksys_write+0x61/0xd0 [ 369.859462] __x64_sys_write+0x1a/0x1c [ 369.859464] do_syscall_64+0x32/0x45 [ 369.859467] entry_SYSCALL_64_after_hwframe+0x44/0x0 [ 369.859469] RIP: 0033:0x7f4781f323a3 [ 369.859471] RSP: 002b:00007ffcf8220408 EFLAGS: 00000246 ORIG_RAX: 0000000000000001 [ 369.859474] RAX: ffffffffffffffda RBX: 0000000000000092 RCX: 00007f4781f323a3 [ 369.859475] RDX: 0000000000000092 RSI: 00007ffcf8220530 RDI: 0000000000000003 [ 369.859477] RBP: 00007ffcf8220530 R08: 0000000000000000 R09: 0000000000000001 [ 369.859478] R10: 0000000000400ad2 R11: 0000000000000246 R12: 0000000000000092 [ 369.859480] R13: 0000000002361580 R14: 0000000000000092 R15: 00007f4782201720 [ 369.859495] INFO: task CGenericAsyncFi:1927 blocked for more than 122 seconds. [ 369.859497] Not tainted 5.10.4-00021-gde017294a539-dirty #3 [ 369.859498] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 369.859499] task:CGenericAsyncFi state:D stack: 0 pid: 1927 ppid: 1540 flags:0x20020000 [ 369.859501] Call Trace: [ 369.859504] __schedule+0x3db/0x7ee [ 369.859506] schedule+0x68/0xdc [ 369.859508] io_schedule+0x46/0x6d [ 369.859510] wait_on_page_bit_common+0xf8/0x325 [ 369.859512] ? trace_raw_output_file_check_and_advance_wb_err+0x80/0x7d [ 369.859515] wait_on_page_bit+0x3f/0x41 [ 369.859517] wait_on_page_writeback.part.0+0x35/0x80 [ 369.859519] wait_on_page_writeback+0x27/0x29 [ 369.859521] __filemap_fdatawait_range+0x88/0xd9 [ 369.859523] filemap_write_and_wait_range+0x45/0x80 [ 369.859525] nfs_wb_all+0x30/0x13d [ 369.859528] nfs4_file_flush+0x85/0xc0 [ 369.859530] filp_close+0x3e/0x78 [ 369.859532] __close_fd+0x23/0x30 [ 369.859534] __ia32_sys_close+0x22/0x50 [ 369.859536] __do_fast_syscall_32+0x5f/0x8e [ 369.859538] do_fast_syscall_32+0x3d/0x80 [ 369.859540] entry_SYSCALL_compat_after_hwframe+0x45/0x0 [ 369.859541] RIP: 0023:0xf7ee6549 [ 369.859543] RSP: 002b:000000008c5fddb0 EFLAGS: 00200286 ORIG_RAX: 0000000000000006 [ 369.859545] RAX: ffffffffffffffda RBX: 000000000000004d RCX: 0000000000000002 [ 369.859546] RDX: 0000000000000000 RSI: 00000000f7d02e70 RDI: 000000000000004d [ 369.859547] RBP: 000000008c5fe110 R08: 0000000000000000 R09: 0000000000000000 [ 369.859548] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000 [ 369.859549] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000 [ 394.423093] nfs: server loom not responding, still trying -- NULL && (void)