Received: by 2002:a05:6a10:6744:0:0:0:0 with SMTP id w4csp34568pxu; Wed, 14 Oct 2020 19:18:55 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxYE7so2H/6MHGqmEVBIeNkiBxUZpJd34PCmxSia/M6MzyXrmIBu34SqiEwSJhR4S8Zv0ys X-Received: by 2002:a17:906:5052:: with SMTP id e18mr2019199ejk.530.1602728334878; Wed, 14 Oct 2020 19:18:54 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1602728334; cv=none; d=google.com; s=arc-20160816; b=WN4cbprv7IsSCPgkmOu29IAYC1Y3H0JitVB11oX1lotZHhs7Q6LwW08M3byYTZ7X5I l79QrjJnOa/+UpT8m6LSy6qeVIee/IA0d9W0pgVyF5S8VIjJWIoLkcA+Sumpx3GYFzEv XLX1Jpu6cvy7EVt+XRD7CRGnmPgOqf5SIIZZWaTuo+uh+EksPHTMNkEsS0bcLjVbbWJV 7ix4ZpOG6D9Iy6/1MPayz9RWUiVju2YMACXiQ/c/N5S/0RGqh6BLKWHCBOQRrpOlWjz3 jqwEd5Vs7CKYDIM7AtAe7ONxTJJZOcehsFgV2lN4bN0G8Pygec0YqIBKIwUUY5tpk4MF EClA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:organization :message-id:user-agent:references:in-reply-to:subject:cc:to:from :date:mime-version; bh=vFJfKtBmjMJFrUlFC8eOH/7Di+oFtDSffGINC/Cg0d8=; b=DhOsemC1q8f4vApN9IX8mFMmAgGS4eiIgp2628+i8Rq9xlACGstyW2+9pzEwYvLqak bGORTVsJG6Zndw27qIquDJzu+kUHyMHTlee4A8GjYEFNnqbppFII1Rxr7B6GRMi3Sl1n 7epWU//7fWp3b/+WDsYk0PpJCg18E7auhBNrRI/x3p0o92nf1TGgiXdT6WbhipJ2Ne3f Oj16VveoWbYoB2pE4cOXvf5tz7gRrV/ixpbhV+4qRWjFH8XnHQfixjEUkcGjAv4cUUhG wcG5cBrv70wxl8vQ2s0Pj9vsujudR3FjXF3wP/w7NlWLbzj49Of2loUIRCG/GjMiVLqo /kHw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id gj3si1158772ejb.313.2020.10.14.19.18.23; Wed, 14 Oct 2020 19:18:54 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730204AbgJNVNh (ORCPT + 99 others); Wed, 14 Oct 2020 17:13:37 -0400 Received: from mail.talpidae.net ([176.9.32.230]:48739 "EHLO node0.talpidae.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728295AbgJNVNf (ORCPT ); Wed, 14 Oct 2020 17:13:35 -0400 Received: from talpidae.net (localhost [127.0.0.1]) by node0.talpidae.net (mail.talpidae.net) with ESMTP id 6E6B4BC1C4F; Wed, 14 Oct 2020 23:13:32 +0200 (CEST) MIME-Version: 1.0 Date: Wed, 14 Oct 2020 23:13:30 +0200 From: Jonas Zeiger To: David Wysochanski Cc: linux-nfs Subject: Re: Linux 5.9.0: NFS 4.1 with cachefilesd: Assertion failed (100% CPU) In-Reply-To: References: <959e2a4790849c226b0967ecda11f79e@talpidae.net> User-Agent: Roundcube Webmail/1.4.9 Message-ID: <646585853425598fd04612ab63ff4331@talpidae.net> X-Sender: jonas.zeiger@talpidae.net Organization: talpidae.net Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Precedence: bulk List-ID: X-Mailing-List: linux-nfs@vger.kernel.org Hi David, root@host:/usr/src/linux-5.9# eu-addr2line -e ./vmlinux cachefiles_read_or_alloc_pages+0x9e fs/cachefiles/rdwr.c:715 Thank you for looking into it! -Jonas On 2020-10-14 18:43, David Wysochanski wrote: > On Wed, Oct 14, 2020 at 9:13 AM Jonas Zeiger wrote: >> >> Hi all, >> >> I experience failed assertions on an x86_64 KVM virtual machine (VirtIO devices) when accessing files on NFS 4 shares while having cachefilesd (0.10.7) running. >> >> Good kernel: 4.14.49 >> Bad kernels: 5.8.14, 5.9.0 >> >> The machine is rendered unusable (100% CPU) and requires a hard-reset. >> >> This is the console error report captured via serial console: >> >> CacheFiles: >> CacheFiles: Assertion failed >> invalid opcode: 0000 [#1] >> CPU: 0 PID: 4215 Comm: git Not tainted 5.9.0vzlinux #3 >> RIP: 0010:cachefiles_read_or_alloc_pages+0x9e/0x5cf >> Code: ff 0f 0b 49 8b 46 30 48 8b 40 70 48 83 78 20 00 75 1a 48 c7 c7 20 fc e8 81 e8 cf 7a e7 ff 48 c7 c7 30 fc e8 81 e8 c3 7a e7 ff <0f> 0b 49 8b 46 28 ba 0c 00 00 00 c6 44 24 40 00 c6 44 24 41 00 c7 > > > Can you do > > eu-addr2line -e ./vmlinux cachefiles_read_or_alloc_pages+0x9e > > That should give the line # of the assertion. > > >> RSP: 0000:ffffc900015cba98 EFLAGS: 00010292 >> RAX: 000000000000001c RBX: ffffc900015cbc04 RCX: 0000000000000027 >> RDX: 0000000000000001 RSI: 0000000000000001 RDI: ffffffff82039340 >> RBP: ffff88803c3469c0 R08: 0000000000000000 R09: 0000000000000000 >> R10: 000000000001e88c R11: 000000000000003c R12: ffffc900015cbd70 >> R13: ffff88803c3469c0 R14: ffff88802e2d2fd0 R15: ffff88802bf27000 >> FS: 00007feea1027fc0(0000) GS:ffffffff82030000(0000) knlGS:0000000000000000 >> CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 >> CR2: 00007feea1036000 CR3: 000000002bcbd005 CR4: 00000000001706b0 >> Call Trace: >> ? nfs_access_add_cache+0x140/0x1c5 >> ? slab_free_freelist_hook+0x45/0xc4 >> ? slab_pre_alloc_hook.isra.81+0x26/0x37 >> ? fscache_run_op.isra.13+0x57/0x69 >> __fscache_read_or_alloc_pages+0x1a6/0x1f2 >> __nfs_readpages_from_fscache+0x51/0xa9 >> nfs_readpages+0x111/0x133 >> ? get_page_from_freelist+0x734/0x8a1 >> read_pages+0x8c/0x102 >> ? __alloc_pages_nodemask+0xd4/0x122 >> ? page_cache_readahead_unbounded+0xce/0x17d >> page_cache_readahead_unbounded+0xce/0x17d >> filemap_fault+0x1f9/0x3d8 >> __do_fault+0x44/0x63 >> handle_mm_fault+0x70e/0xad3 >> exc_page_fault+0x1f0/0x311 >> ? asm_exc_page_fault+0x5/0x20 >> asm_exc_page_fault+0x1b/0x20 >> RIP: 0033:0x7feea0991bef >> Code: 41 c7 45 00 1d 00 00 00 e9 1e f8 ff ff 41 8b 55 08 85 d2 0f 84 72 07 00 00 83 fb 0f 0f 87 37 14 00 00 85 ed 0f 84 83 f5 ff ff <41> 0f b6 34 24 89 d9 8d 45 ff 49 8d 7c 24 01 48 d3 e6 8d 4b 08 4c >> RSP: 002b:00007fffbb7d5240 EFLAGS: 00010202 >> RAX: 00007feea0991bd2 RBX: 0000000000000000 RCX: 00000000000000d0 >> RDX: 0000000000000001 RSI: 000055d7e1bf9c10 RDI: 00007fffbb7d52a0 >> RBP: 00000000000000d0 R08: 0000000000000000 R09: 0000000000000000 >> R10: 0000000000000000 R11: 00007fffbb7d5390 R12: 00007feea1036000 >> R13: 000055d7e1bf9900 R14: 00007fffbb7d5570 R15: 0000000000000000 >> ---[ end trace cad4b4a2dd601cdd ]--- >> RIP: 0010:cachefiles_read_or_alloc_pages+0x9e/0x5cf >> Code: ff 0f 0b 49 8b 46 30 48 8b 40 70 48 83 78 20 00 75 1a 48 c7 c7 20 fc e8 81 e8 cf 7a e7 ff 48 c7 c7 30 fc e8 81 e8 c3 7a e7 ff <0f> 0b 49 8b 46 28 ba 0c 00 00 00 c6 44 24 40 00 c6 44 24 41 00 c7 >> RSP: 0000:ffffc900015cba98 EFLAGS: 00010292 >> RAX: 000000000000001c RBX: ffffc900015cbc04 RCX: 0000000000000027 >> RDX: 0000000000000001 RSI: 0000000000000001 RDI: ffffffff82039340 >> RBP: ffff88803c3469c0 R08: 0000000000000000 R09: 0000000000000000 >> R10: 000000000001e88c R11: 000000000000003c R12: ffffc900015cbd70 >> R13: ffff88803c3469c0 R14: ffff88802e2d2fd0 R15: ffff88802bf27000 >> FS: 00007feea1027fc0(0000) GS:ffffffff82030000(0000) knlGS:0000000000000000 >> CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 >> CR2: 00007feea1036000 CR3: 000000002bcbd005 CR4: 00000000001706b0 >> Kernel panic - not syncing: Fatal exception >> Kernel Offset: disabled >> ---[ end Kernel panic - not syncing: Fatal exception ]--- >> >> Feel free to ask for further info or testing patches. >> >> Thank you! >> >> Regards, >> Jonas Zeiger >> >> >> Ps: I found this mail https://lkml.org/lkml/2020/3/20/399 describing a similar issue, but it may be unrelated. >>