Received: by 2002:a05:6358:c692:b0:131:369:b2a3 with SMTP id fe18csp817192rwb; Wed, 26 Jul 2023 03:27:29 -0700 (PDT) X-Google-Smtp-Source: APBJJlGx9uAbqaw+aAf7OpiUb9PF7zcrna6ZBBn2yUiMIAv3n4qV5/At8hlan+A2aya1K0152dfq X-Received: by 2002:a17:902:dac4:b0:1bb:cd10:8209 with SMTP id q4-20020a170902dac400b001bbcd108209mr1329212plx.50.1690367249682; Wed, 26 Jul 2023 03:27:29 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1690367249; cv=none; d=google.com; s=arc-20160816; b=lC6tu1c39vFt/osmMycP16jixr8Lik+jC4+bGo2wZAzfpQCg4OAL7EnM1cJqacxiJc VbEsTI0NWAg8p7lkCRLJfhVrPAEqz7O0+B3ccOZpKwH7phsUxN71PJ7vdBBPkKr69GGi HoPiB4draTKlqk1YPoNkbd30TFkjpjcwL4TbDHMC4rsMO4jZWthXje+cg4d3Axy6qGXM EcBm66j1GlxGXUk1367kA1iW4sMC4fgGTVSFBK/lpU5PYFVM7uYwJyZF3MLT5HXiryu2 haaYsmshNAA1YGEUocSSwxo5M/Jl38VlOK/kRu7pH98acghhOb8AjQ1jEhSsRobeHEBT nhUw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:message-id:date:content-id:mime-version:subject :cc:to:from:organization:dkim-signature; bh=QxaSGBKWwKusAWuFQg+kqDTec3ra+7FZ31YmMAcoho4=; fh=qB1+I4hSt4hqDJphAW0HUAwY1jfoKZ0rGCAh7nKd3eg=; b=U55ExOXWcYIW6a6Z8hZTSI2YvxKVdF0PgPIlA2QDqFsjXg7yvaC+hFK1Z8NfXJqR66 qeeQZ0FXzfNhwcgSrwGgk6+Y3WKj5K6iDQeQ1z2cO26bCBVZ90jlbA1zVS5fejuDPouQ YNCypf121jAEore77GqmOazJbt1Hsp1uLE/uKv4qwKUiMoW9+H5WDEy7LtPVNxwwiT9j gTOASu6kii9Olyg7KVg6AAz+HwRMkm6P2sHflwF5jITpZsfg0HEaAqkmt/sI3uplbeH+ n5Afc8APqZk+OC0jT/86KllqrjX3oRIQzX3JOBv4yHuYMo5TVqHUjzMbP9lhYWwPVRbL i6iw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=buMTMNTX; spf=pass (google.com: domain of linux-ext4-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id l5-20020a170903244500b001b53dfb85c4si13601263pls.606.2023.07.26.03.27.12; Wed, 26 Jul 2023 03:27:29 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-ext4-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=buMTMNTX; spf=pass (google.com: domain of linux-ext4-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231163AbjGZKDV (ORCPT + 99 others); Wed, 26 Jul 2023 06:03:21 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:43248 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229519AbjGZKDR (ORCPT ); Wed, 26 Jul 2023 06:03:17 -0400 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 1774BA2 for ; Wed, 26 Jul 2023 03:02:33 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1690365752; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type; bh=QxaSGBKWwKusAWuFQg+kqDTec3ra+7FZ31YmMAcoho4=; b=buMTMNTXTqlHyiQUXe2DYdexJLwZOqrEImFLll3HXoyW8Tfp4tkHMVZ2zOcDKEc7gkijGZ 2AbeZFm/8ZOimrGPzVxl+QJ6l+iTCHkCengLPdywnpW2BcyUSqq5OCapLa60nRGn+uNUCp Hn0uzxUzZKdGPndI+f79+j8L9OrVTdU= Received: from mimecast-mx02.redhat.com (mimecast-mx02.redhat.com [66.187.233.88]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-650-60OqMXRANEOtWuWkknBIdA-1; Wed, 26 Jul 2023 06:02:31 -0400 X-MC-Unique: 60OqMXRANEOtWuWkknBIdA-1 Received: from smtp.corp.redhat.com (int-mx09.intmail.prod.int.rdu2.redhat.com [10.11.54.9]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx02.redhat.com (Postfix) with ESMTPS id 8BCEF185A792; Wed, 26 Jul 2023 10:02:30 +0000 (UTC) Received: from warthog.procyon.org.uk (unknown [10.42.28.158]) by smtp.corp.redhat.com (Postfix) with ESMTP id AEA43492B01; Wed, 26 Jul 2023 10:02:28 +0000 (UTC) Organization: Red Hat UK Ltd. Registered Address: Red Hat UK Ltd, Amberley Place, 107-111 Peascod Street, Windsor, Berkshire, SI4 1TE, United Kingdom. Registered in England and Wales under Company Registration No. 3798903 From: David Howells To: "Theodore Ts'o" , Andrew Morton , Song Liu , Christoph Hellwig , Alasdair Kergon cc: dhowells@redhat.com, linux-ext4@vger.kernel.org, linux-mm@kvack.org, linux-raid@vger.kernel.org, dm-devel@redhat.com, linux-kernel@vger.kernel.org Subject: Processes hung in "D" state in ext4, mm, md and dmcrypt MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-ID: <4918.1690365747.1@warthog.procyon.org.uk> Date: Wed, 26 Jul 2023 11:02:27 +0100 Message-ID: <4919.1690365747@warthog.procyon.org.uk> X-Scanned-By: MIMEDefang 3.1 on 10.11.54.9 X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H4,RCVD_IN_MSPIKE_WL,SPF_HELO_NONE,SPF_NONE, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-ext4@vger.kernel.org Hi, With 6.5-rc2 (6.5.0-0.rc2.20230721gitf7e3a1bafdea.20.fc39.x86_64), I'm seeing a bunch of processes getting stuck in the D state on my desktop after a few hours of reading email and compiling stuff. It's happened every day this week so far and I managed to grab stack traces of the stuck processes this morning (see attached). There are two blockdevs involved below, /dev/md2 and /dev/md3. md3 is a raid1 array with two partitions with an ext4 partition on it. md2 is similar but it's dm-crypted and ext4 is on top of that. David --- 1015 ? D 0:04 [md2_raid1] 1074 ? D 0:00 [jbd2/sda6-8] 1138 ? D 0:00 [jbd2/md3-8] 1167 ? D 0:10 [dmcrypt_write/253:0] 1202 ? D 0:03 [jbd2/dm-0-8] 117547 ? D 5:12 [kworker/u16:8+flush-9:3] 121540 ? D 0:46 [kworker/u16:10+flush-253:0] 125431 pts/2 Dl+ 0:00 emacs .stgit-edit.txt 125469 ? D 0:00 /usr/libexec/nmh/rcvstore +kernel ===1015=== PID TTY STAT TIME COMMAND 1015 ? D 0:04 [md2_raid1] [<0>] md_super_wait+0xa2/0xe0 [<0>] md_bitmap_daemon_work+0x183/0x3b0 [<0>] md_check_recovery+0x42/0x5a0 [<0>] raid1d+0x87/0x16f0 [raid1] [<0>] md_thread+0xab/0x190 [<0>] kthread+0xe5/0x120 [<0>] ret_from_fork+0x31/0x50 [<0>] ret_from_fork_asm+0x1b/0x30 ===1074=== PID TTY STAT TIME COMMAND 1074 ? D 0:00 [jbd2/sda6-8] [<0>] jbd2_journal_commit_transaction+0x11a6/0x1a20 [<0>] kjournald2+0xad/0x280 [<0>] kthread+0xe5/0x120 [<0>] ret_from_fork+0x31/0x50 [<0>] ret_from_fork_asm+0x1b/0x30 ===1138=== PID TTY STAT TIME COMMAND 1138 ? D 0:00 [jbd2/md3-8] [<0>] jbd2_journal_commit_transaction+0x162d/0x1a20 [<0>] kjournald2+0xad/0x280 [<0>] kthread+0xe5/0x120 [<0>] ret_from_fork+0x31/0x50 [<0>] ret_from_fork_asm+0x1b/0x30 ===1167=== PID TTY STAT TIME COMMAND 1167 ? D 0:10 [dmcrypt_write/253:0] [<0>] md_super_wait+0xa2/0xe0 [<0>] md_bitmap_unplug+0xad/0x120 [<0>] flush_bio_list+0xf3/0x100 [raid1] [<0>] raid1_unplug+0x3b/0xb0 [raid1] [<0>] __blk_flush_plug+0xd8/0x160 [<0>] blk_finish_plug+0x29/0x40 [<0>] dmcrypt_write+0x132/0x140 [dm_crypt] [<0>] kthread+0xe5/0x120 [<0>] ret_from_fork+0x31/0x50 [<0>] ret_from_fork_asm+0x1b/0x30 ===1202=== PID TTY STAT TIME COMMAND 1202 ? D 0:03 [jbd2/dm-0-8] [<0>] jbd2_journal_commit_transaction+0x162d/0x1a20 [<0>] kjournald2+0xad/0x280 [<0>] kthread+0xe5/0x120 [<0>] ret_from_fork+0x31/0x50 [<0>] ret_from_fork_asm+0x1b/0x30 ===117547=== PID TTY STAT TIME COMMAND 117547 ? D 5:12 [kworker/u16:8+flush-9:3] [<0>] blk_mq_get_tag+0x11e/0x2b0 [<0>] __blk_mq_alloc_requests+0x1bc/0x350 [<0>] blk_mq_submit_bio+0x2c7/0x680 [<0>] __submit_bio+0x8b/0x170 [<0>] submit_bio_noacct_nocheck+0x159/0x370 [<0>] __block_write_full_folio+0x1e1/0x400 [<0>] writepage_cb+0x1a/0x70 [<0>] write_cache_pages+0x144/0x3b0 [<0>] do_writepages+0x164/0x1e0 [<0>] __writeback_single_inode+0x3d/0x360 [<0>] writeback_sb_inodes+0x1ed/0x4b0 [<0>] __writeback_inodes_wb+0x4c/0xf0 [<0>] wb_writeback+0x298/0x310 [<0>] wb_workfn+0x35b/0x510 [<0>] process_one_work+0x1de/0x3f0 [<0>] worker_thread+0x51/0x390 [<0>] kthread+0xe5/0x120 [<0>] ret_from_fork+0x31/0x50 [<0>] ret_from_fork_asm+0x1b/0x30 ===121540=== PID TTY STAT TIME COMMAND 121540 ? D 0:46 [kworker/u16:10+flush-253:0] [<0>] folio_wait_bit_common+0x13d/0x350 [<0>] mpage_prepare_extent_to_map+0x309/0x4d0 [<0>] ext4_do_writepages+0x25d/0xc90 [<0>] ext4_writepages+0xad/0x180 [<0>] do_writepages+0xcf/0x1e0 [<0>] __writeback_single_inode+0x3d/0x360 [<0>] writeback_sb_inodes+0x1ed/0x4b0 [<0>] __writeback_inodes_wb+0x4c/0xf0 [<0>] wb_writeback+0x298/0x310 [<0>] wb_workfn+0x35b/0x510 [<0>] process_one_work+0x1de/0x3f0 [<0>] worker_thread+0x51/0x390 [<0>] kthread+0xe5/0x120 [<0>] ret_from_fork+0x31/0x50 [<0>] ret_from_fork_asm+0x1b/0x30 ===125431=== PID TTY STAT TIME COMMAND 125431 pts/2 Dl+ 0:00 emacs .stgit-edit.txt [<0>] jbd2_log_wait_commit+0xd8/0x140 [<0>] ext4_sync_file+0x1cc/0x380 [<0>] __x64_sys_fsync+0x3b/0x70 [<0>] do_syscall_64+0x5d/0x90 [<0>] entry_SYSCALL_64_after_hwframe+0x6e/0xd8 ===125469=== PID TTY STAT TIME COMMAND 125469 ? D 0:00 /usr/libexec/nmh/rcvstore +kernel [<0>] folio_wait_bit_common+0x13d/0x350 [<0>] folio_wait_writeback+0x2c/0x90 [<0>] truncate_inode_partial_folio+0x5e/0x1a0 [<0>] truncate_inode_pages_range+0x1da/0x400 [<0>] truncate_pagecache+0x47/0x60 [<0>] ext4_setattr+0x685/0xba0 [<0>] notify_change+0x1e0/0x4a0 [<0>] do_truncate+0x98/0xf0 [<0>] do_sys_ftruncate+0x15c/0x1b0 [<0>] do_syscall_64+0x5d/0x90 [<0>] entry_SYSCALL_64_after_hwframe+0x6e/0xd8